Medical SAM 2: Segment medical images as video via Segment Anything Model 2 Paper • 2408.00874 • Published 23 days ago • 39
Gemma 2: Improving Open Language Models at a Practical Size Paper • 2408.00118 • Published 24 days ago • 72
Meltemi: The first open Large Language Model for Greek Paper • 2407.20743 • Published 25 days ago • 67
Gemma Scope Release Collection A comprehensive, open suite of sparse autoencoders for Gemma 2 2B and 9B. • 10 items • Updated 13 days ago • 12
ShieldGemma Release Collection A series of safety classifiers, trained on top of Gemma 2, for developers to filter inputs and outputs of their applications. • 3 items • Updated 24 days ago • 11
Gemma 2 2B Release Collection The 2.6B parameter version of Gemma 2. • 6 items • Updated 24 days ago • 72
Research projects on top of vLLM Collection Papers cited in https://blog.vllm.ai/2024/07/25/lfai-perf.html • 6 items • Updated 26 days ago • 12
view article Article Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth By mlabonne • 26 days ago • 164
Llama 3.1 Collection This collection hosts the transformers and original repos of the Meta Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated 22 days ago • 515
view article Article Querying Datasets with the Datasets Explorer Chrome Extension By cfahlgren1 • Jul 19 • 6
view article Article Announcing Finance Commons and the Bad Data Toolbox: Pioneering Open Data and Advanced Document Processing By Pclanglais • Jul 19 • 17
E-BATCH: Energy-Efficient and High-Throughput RNN Batching Paper • 2009.10656 • Published Sep 22, 2020 • 1
DataComp-LM: In search of the next generation of training sets for language models Paper • 2406.11794 • Published Jun 17 • 48
SpreadsheetLLM: Encoding Spreadsheets for Large Language Models Paper • 2407.09025 • Published Jul 12 • 119
Mobility VLA: Multimodal Instruction Navigation with Long-Context VLMs and Topological Graphs Paper • 2407.07775 • Published Jul 10 • 3
LLM-jp: A Cross-organizational Project for the Research and Development of Fully Open Japanese LLMs Paper • 2407.03963 • Published Jul 4 • 15
Inference Performance Optimization for Large Language Models on CPUs Paper • 2407.07304 • Published Jul 10 • 52
Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language Model Paper • 2407.07053 • Published Jul 9 • 41
Skywork-Math: Data Scaling Laws for Mathematical Reasoning in Large Language Models -- The Story Goes On Paper • 2407.08348 • Published Jul 11 • 49
AIMO Progress Prize Collection Models and datasets used in the winning solution to the AIMO 1st Progress Prize • 7 items • Updated Jul 19 • 8
view article Article Experimenting with Automatic PII Detection on the Hub using Presidio Jul 10 • 23
Let the Expert Stick to His Last: Expert-Specialized Fine-Tuning for Sparse Architectural Large Language Models Paper • 2407.01906 • Published Jul 2 • 33
Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion Paper • 2407.01392 • Published Jul 1 • 39
InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output Paper • 2407.03320 • Published Jul 3 • 92
LLM Compiler Collection Meta LLM Compiler is a state-of-the-art LLM that builds upon Code Llama with improved performance for code optimization and compiler reasoning. • 4 items • Updated Jun 27 • 145
The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale Paper • 2406.17557 • Published Jun 25 • 78
view article Article Open-source embeddings and LLMs outperform Gemini and OpenAI for Web Navigation while being faster and cheaper By dhuynh95 • Jun 21 • 5
MobileNetV4 pretrained weights Collection Weights for MobileNet-V4 pretrained in timm • 13 items • Updated Jun 24 • 12
Depth Anything v2 Release Collection A comprehensive collection on DAv2 • 5 items • Updated Jun 18 • 10
WildVision: Evaluating Vision-Language Models in the Wild with Human Preferences Paper • 2406.11069 • Published Jun 16 • 13
mOSCAR: A Large-scale Multilingual and Multimodal Document-level Corpus Paper • 2406.08707 • Published Jun 13 • 15
Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models Paper • 2406.09416 • Published Jun 13 • 28
⛔️🔦 Provenance, Watermarking & Deepfake Detection Collection Technical tools for more control over non-consensual synthetic content • 14 items • Updated Apr 1 • 37
Hugging Face community’s Wikimedia datasets Collection Wikimedia datasets created by the Hugging Face community, not Wikimedia. Sorted by Wikimedia project. • 17 items • Updated Jun 7 • 6
Streamlining and standardizing software citations with The Software Citation Station Paper • 2406.04405 • Published Jun 6 • 1
WildBench: Benchmarking LLMs with Challenging Tasks from Real Users in the Wild Paper • 2406.04770 • Published Jun 7 • 25