Note Mistral model fine-tuned on the OpenOrca dataset
Note Another Mistral fine-tune with strong results on TruthfulQA
Note Highly performant model by Stability. With just 3B params, it achieves some great results
Note Check out this amazing blog post explaining attention sinks: https://huggingface.co/blog/tomaarsen/attention-sinks
Note Run SDXL on TPU, with an in-depth technical explanation
Note A model trained on multimodal instruction-following data
Note Code models for the win! This is a 15B model that turns natural language into SQL
Note And this is the 7B version of the above
Note A dataset of math questions for fine-tuning
Note Generate memes with IDEFICS, the multimodal model