SpeechVerse: A Large-scale Generalizable Audio Language Model Paper • 2405.08295 • Published 2 days ago • 3
SpeechGuard: Exploring the Adversarial Robustness of Multimodal Large Language Models Paper • 2405.08317 • Published 2 days ago • 2
Understanding the performance gap between online and offline alignment algorithms Paper • 2405.08448 • Published 1 day ago • 1
No Time to Waste: Squeeze Time into Channel for Mobile Video Understanding Paper • 2405.08344 • Published 1 day ago • 1
Beyond Scaling Laws: Understanding Transformer Performance with Associative Memory Paper • 2405.08707 • Published 1 day ago • 3
Coin3D: Controllable and Interactive 3D Assets Generation with Proxy-Guided Conditioning Paper • 2405.08054 • Published 2 days ago • 6
Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding Paper • 2405.08748 • Published 1 day ago • 6
VidProM: A Million-scale Real Prompt-Gallery Dataset for Text-to-Video Diffusion Models Paper • 2403.06098 • Published Mar 10 • 13
Embedding Model Datasets Collection A curated subset of the datasets that work out of the box with Sentence Transformers: https://huggingface.co/datasets?other=sentence-transformers • 49 items • Updated about 7 hours ago • 5
Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models Paper • 2405.01535 • Published 13 days ago • 88
MS MARCO Web Search: a Large-scale Information-rich Web Dataset with Millions of Real Click Labels Paper • 2405.07526 • Published 3 days ago • 11
Piccolo2: General Text Embedding with Multi-task Hybrid Loss Training Paper • 2405.06932 • Published 5 days ago • 13
LogoMotion: Visually Grounded Code Generation for Content-Aware Animation Paper • 2405.07065 • Published 4 days ago • 11
Plot2Code: A Comprehensive Benchmark for Evaluating Multi-modal Large Language Models in Code Generation from Scientific Plots Paper • 2405.07990 • Published 2 days ago • 14
SUTRA: Scalable Multilingual Language Model Architecture Paper • 2405.06694 • Published 8 days ago • 29
PaliGemma Release Collection Pretrained and mix checkpoints for PaliGemma • 10 items • Updated 1 day ago • 72
CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts Paper • 2405.05949 • Published 6 days ago • 2
MAmmoTH2 Collection Scaling up instruction data from the web for to build better LLMs • 10 items • Updated 5 days ago • 4
OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework Paper • 2404.14619 • Published 23 days ago • 120
LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report Paper • 2405.00732 • Published 17 days ago • 104
view article Article 🦙⚗️ Using Llama3 and distilabel to build fine-tuning datasets By dvilasuero • 19 days ago • 54
Searching for Better ViT Baselines Collection Exploring ViT hparams and model shapes for the GPU poor (between tiny and base). • 15 items • Updated 1 day ago • 8
view article Article Can we create pedagogically valuable multi-turn synthetic datasets from Cosmopedia? By davanstrien • 8 days ago • 6
view article Article Powerful ASR + diarization + speculative decoding with Hugging Face Inference Endpoints 15 days ago • 47
view article Article Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent 24 days ago • 70
Arctic Collection A collection of pre-trained dense-MoE Hybrid transformer models • 2 items • Updated 21 days ago • 18
Granite Code Models Collection A series of code models trained by IBM licensed under Apache 2.0 license. We release both the base pretrained and instruct models. • 10 items • Updated 4 days ago • 115
view article Article StarCoder2-Instruct: Fully Transparent and Permissive Self-Alignment for Code Generation 17 days ago • 68
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone Paper • 2404.14219 • Published 23 days ago • 229
Quantized-FT-Orca-Math Collection Models trained during quantization aware fine-tuning experiments using PyTorch's FSDP. • 8 items • Updated 29 days ago • 6
Meta Llama 3 Collection This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated 27 days ago • 514
Idefics2 🐶 Collection Idefics2-8B is a foundation vision-language model. In this collection, you will find the models, datasets and demo related to its creation. • 11 items • Updated 9 days ago • 73
ICDPO: Effectively Borrowing Alignment Capability of Others via In-context Direct Preference Optimization Paper • 2402.09320 • Published Feb 14 • 6
LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders Paper • 2404.05961 • Published Apr 9 • 61