FineWeb: decanting the web for the finest text data at scale 🍷 Generate high-quality web text data for LLM training • 917
Zeroshot Classifiers Collection These are my current best zeroshot classifiers. Some of my older models are downloaded more often, but the models in this collection are newer/better. • 12 items • Updated Jan 6 • 132
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference Paper • 2412.13663 • Published Dec 18, 2024 • 146
Article • Training and Finetuning Reranker Models with Sentence Transformers v4 • 23 days ago • 111