List Items One by One: A New Data Source and Learning Paradigm for Multimodal LLMs Paper • 2404.16375 • Published Apr 25 • 15
view article Article Decoding GPT-4'o': In-Depth Exploration of Its Mechanisms and Creating Similar AI. By KingNish • 8 days ago • 22
Critique Models (CM) on the 🤗 Hub Collection This collection contains some Critique Models (CM) for LLM evaluation available in the HuggingFace Hub • 5 items • Updated 22 days ago • 3
CommonCanvas Collection Collection of models trained on the CommonCatalogue datasets • 8 items • Updated 13 days ago • 5
MAmmoTH2 Collection Scaling up instruction data from the web for to build better LLMs • 11 items • Updated 3 days ago • 6
Blackhole Collection A black hole with lots of high-quality dialogue datasets in many fields, and multilingual helps to train LLMs with SFT and DPO methods easier. • 32 items • Updated 5 days ago • 6
SpeechVerse: A Large-scale Generalizable Audio Language Model Paper • 2405.08295 • Published 15 days ago • 10
SpeechGuard: Exploring the Adversarial Robustness of Multimodal Large Language Models Paper • 2405.08317 • Published 15 days ago • 8
No Time to Waste: Squeeze Time into Channel for Mobile Video Understanding Paper • 2405.08344 • Published 15 days ago • 10
Coin3D: Controllable and Interactive 3D Assets Generation with Proxy-Guided Conditioning Paper • 2405.08054 • Published 16 days ago • 19
BEHAVIOR Vision Suite: Customizable Dataset Generation via Simulation Paper • 2405.09546 • Published 14 days ago • 9
Overcoming the Pitfalls of Vision-Language Model Finetuning for OOD Generalization Paper • 2401.15914 • Published Jan 29 • 7
Chronos Models Collection Chronos: Pretrained (language) models for time series forecasting based on the T5 architecture. • 6 items • Updated Mar 18 • 25
📦 3D creation workflow Collection Going from a text prompt to a nice 3D model • 3 items • Updated Feb 6 • 23
🐒 Stable Diffusion LoRAs Collection Awesome LoRAs found on the hub - using only 🐵 • 7 items • Updated Feb 6 • 14
view article Article Train custom AI models with the trainer API and adapt them to 🤗 By not-lain • 4 days ago • 19
Transferable and Principled Efficiency for Open-Vocabulary Segmentation Paper • 2404.07448 • Published Apr 11 • 10
DOCCI: Descriptions of Connected and Contrasting Images Paper • 2404.19753 • Published 29 days ago • 9
Transcription Collection Transcribe interviews for free with Whisper in Spaces. • 5 items • Updated Apr 23 • 3
Model Merging Collection Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! • 28 items • Updated Mar 23 • 181
OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset Paper • 2402.10176 • Published Feb 15 • 33
Functional Benchmarks for Robust Evaluation of Reasoning Performance, and the Reasoning Gap Paper • 2402.19450 • Published Feb 29 • 3
You Only Cache Once: Decoder-Decoder Architectures for Language Models Paper • 2405.05254 • Published 21 days ago • 8
Gemma: Open Models Based on Gemini Research and Technology Paper • 2403.08295 • Published Mar 13 • 43
view article Article Bringing the Artificial Analysis LLM Performance Leaderboard to Hugging Face 26 days ago • 13
view article Article ⚗️ 🧑🏼🌾 Let's grow some Domain Specific Datasets together By burtenshaw • 30 days ago • 27
view article Article Building Cost-Efficient Enterprise RAG applications with Intel Gaudi 2 and Intel Xeon 20 days ago • 7
Aya Datasets Collection The Aya Collection is a massive multilingual collection for over 100 languages consisting of 513 million instances of prompts and completions. • 4 items • Updated 6 days ago • 8
C4AI Command R Collection C4AI Command-R is a research release of a 35 billion parameter highly performant generative model. Command-R is a large language model with open weigh • 3 items • Updated 6 days ago • 11
C4AI Command R Plus Collection C4AI Command R+ is an open weights research release of a 104B billion parameter model with highly advanced capabilities. • 3 items • Updated 6 days ago • 17
view article Article LLM Comparison/Test: Llama 3 Instruct 70B + 8B HF/GGUF/EXL2 (20 versions tested and compared!) By wolfram • Apr 24 • 48
Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models Paper • 2405.01535 • Published 27 days ago • 101
LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report Paper • 2405.00732 • Published about 1 month ago • 114
Biomedical NLP papers Collection Papers posted on @ArxivHealthcareNLP@sigmoid.social (Clinical, Healthcare & Biomedical NLP) • 110 items • Updated 7 days ago • 24
OLMo Suite Collection Artifacts for the first set of OLMo models. • 12 items • Updated 14 days ago • 36