OS Week Highlights - Oct 2 - 8
- 175π
Open-Orca/Mistral-7B-OpenOrca
Text Generation β’ Updated β’ 18.4k β’ 676Note Mistral model fine-tuned on the OpenOrca dataset
teknium/CollectiveCognition-v1.1-Mistral-7B
Text Generation β’ Updated β’ 134 β’ 79Note Another Mistral fine-tune with great results in TruthfulQA
stabilityai/stablelm-3b-4e1t
Text Generation β’ Updated β’ 11.2k β’ 310Note Very high performant model by Stability. WIth just 3B params, it achieves some great results
Efficient Streaming Language Models with Attention Sinks
Paper β’ 2309.17453 β’ Published β’ 13Note Check out this amazing blog post explaining this https://huggingface.co./blog/tomaarsen/attention-sinks
1.84kStable Diffusion XL on TPUv5e
πGenerate images from text prompts with various styles
Note Run SDXL with TPU with a in-depth technical explanation
liuhaotian/llava-v1.5-7b
Image-Text-to-Text β’ Updated β’ 680k β’ 406Note A model that can do multimodal instruction following data
defog/sqlcoder2
Text Generation β’ Updated β’ 536 β’ 110Note Code models for the win! This is a 15B model that turns natural language to SQL
defog/sqlcoder-7b
Text Generation β’ Updated β’ 332 β’ 61Note And this is the 7B version of the above
MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models
Paper β’ 2309.12284 β’ Published β’ 18
meta-math/MetaMathQA
Viewer β’ Updated β’ 395k β’ 6.22k β’ 352Note A dataset of math questions for fine-tuning
111AI Meme Generator
π₯Create funny memes from images
Note Generate memes with IDEFICS, the multimodal model