FIFO-Diffusion: Generating Infinite Videos from Text without Training Paper • 2405.11473 • Published 4 days ago • 42
A decoder-only foundation model for time-series forecasting Paper • 2310.10688 • Published Oct 14, 2023 • 4
everything-ai Collection Spaces related to everything-ai tasks • 4 items • Updated about 14 hours ago • 3
PaliGemma Release Collection Pretrained and mix checkpoints for PaliGemma • 11 items • Updated 6 days ago • 97
Compressed LLMs for nm-vllm Collection LLMs compressed using SparseGPT and GPTQ for optimized inference with nm-vllm https://github.com/neuralmagic/nm-vllm • 18 items • Updated about 7 hours ago • 8
Sparse Foundational Llama 2 Models Collection Sparse pre-trained and fine-tuned Llama models made by Neural Magic + Cerebras • 27 items • Updated 6 days ago • 6
Enabling High-Sparsity Foundational Llama Models with Efficient Pretraining and Deployment Paper • 2405.03594 • Published 17 days ago • 7
Article Multimodal Augmentation for Documents: Recovering “Comprehension” in “Reading and Comprehension” task By danaaubakirova • 7 days ago • 15
LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report Paper • 2405.00732 • Published 24 days ago • 110
Article Train custom AI models with the trainer API and adapt them to 🤗 By not-lain • about 3 hours ago • 15
🤖 LLM Spaces Collection A collection of applications demonstrating large language models (LLMs) 🚀 • 15 items • Updated about 7 hours ago • 6
🔊 Speech Enhancement Collection Unlocking a new era in Speech Enhancement, powered by the latest AI technologies, for superior audio quality improvements! 🚀 • 8 items • Updated 22 days ago • 7
🖼️ Image Enhancement Collection Embrace the future of Image Enhancement with the latest AI-powered technologies! 🚀 • 1 item • Updated 22 days ago • 5
🤔 Facial Expressions Recognition Collection Embrace the future of Facial Expressions Recognition with the latest AI-powered technologies! 🚀 • 4 items • Updated 11 days ago • 6
Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context Paper • 1901.02860 • Published Jan 9, 2019 • 2
Chatbot is Not All You Need: Information-rich Prompting for More Realistic Responses Paper • 2312.16233 • Published Dec 25, 2023 • 2
Awesome SFT datasets Collection A curated list of interesting datasets to fine-tune language models with. • 43 items • Updated Apr 12 • 92
Article SeeMoE: Implementing a MoE Vision Language Model from Scratch By AviSoori1x • 17 days ago • 24
Granite Code Models Collection A series of code models trained by IBM licensed under Apache 2.0 license. We release both the base pretrained and instruct models. • 14 items • Updated about 23 hours ago • 126
Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models Paper • 2405.01535 • Published 21 days ago • 96
EMOPortraits: Emotion-enhanced Multimodal One-shot Head Avatars Paper • 2404.19110 • Published 24 days ago • 3
🎭 Avatars Collection The latest AI-powered technologies usher in a new era of realistic avatars! 🚀 • 34 items • Updated about 7 hours ago • 50
Article 🧑‍⚖️ "Replacing Judges with Juries" using distilabel By alvarobartt • 20 days ago • 14
Article A Guide to Designing New Functional Proteins and Improving Protein Function, Stability, and Diversity with Generative AI By AmelieSchreiber • 9 days ago • 16
Article Powerful ASR + diarization + speculative decoding with Hugging Face Inference Endpoints 23 days ago • 51
InstantFamily: Masked Attention for Zero-shot Multi-ID Image Generation Paper • 2404.19427 • Published 23 days ago • 65
GreenBitAI MLX LLM Collection GreenBitAI's Low-bit LLMs in MLX format • 69 items • Updated 17 days ago • 4
Replacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse Models Paper • 2404.18796 • Published 24 days ago • 63
AutoCrawler: A Progressive Understanding Web Agent for Web Crawler Generation Paper • 2404.12753 • Published Apr 19 • 38
Article Expanding Model Context and Creating Chat Models with a Single Click By maywell • 25 days ago • 33
Article ⚗️ 🧑🏼‍🌾 Let's grow some Domain Specific Datasets together By burtenshaw • 24 days ago • 26
PLLaVA : Parameter-free LLaVA Extension from Images to Videos for Video Dense Captioning Paper • 2404.16994 • Published 28 days ago • 31
How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites Paper • 2404.16821 • Published 28 days ago • 49
PuLID: Pure and Lightning ID Customization via Contrastive Alignment Paper • 2404.16022 • Published 29 days ago • 16
CatLIP: CLIP-level Visual Recognition Accuracy with 2.7x Faster Pre-training on Web-scale Image-Text Data Paper • 2404.15653 • Published 29 days ago • 24
LLaVA++ (LLaMA-3 and Phi-3-Mini) Collection Extending Visual Capabilities of LLaVA with LLaMA-3 and Phi-3 • 11 items • Updated 23 days ago • 21
Meta Llama 3 Collection This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated Apr 18 • 535
Phi-3 Collection Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 20 items • Updated 1 day ago • 264
How Good Are Low-bit Quantized LLaMA3 Models? An Empirical Study Paper • 2404.14047 • Published Apr 22 • 37
OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework Paper • 2404.14619 • Published about 1 month ago • 120