victor (Victor Mustar)

upvoted a paper 2 days ago

FIFO-Diffusion: Generating Infinite Videos from Text without Training

Paper • 2405.11473 • Published 4 days ago • 42

upvoted 2 papers 3 days ago

Chameleon: Mixed-Modal Early-Fusion Foundation Models

Paper • 2405.09818 • Published 7 days ago • 82

A decoder-only foundation model for time-series forecasting

Paper • 2310.10688 • Published Oct 14, 2023 • 4

upvoted an article 6 days ago

Article

Evaling llm-jp-eval (evals are hard)

By

•

5 days ago

• 4

upvoted a paper 6 days ago

Sakuga-42M Dataset: Scaling Up Cartoon Research

Paper • 2405.07425 • Published 11 days ago • 3

upvoted 2 collections 6 days ago

everything-ai

Collection

PaliGemma Release

Collection

Pretrained and mix checkpoints for PaliGemma • 11 items • Updated 6 days ago • 97

upvoted 2 collections 7 days ago

Compressed LLMs for nm-vllm

Collection

LLMs compressed using SparseGPT and GPTQ for optimized inference with nm-vllm https://github.com/neuralmagic/nm-vllm • 18 items • Updated about 7 hours ago • 8

Sparse Foundational Llama 2 Models

Collection

Sparse pre-trained and fine-tuned Llama models made by Neural Magic + Cerebras • 27 items • Updated 6 days ago • 6

upvoted a paper 7 days ago

Enabling High-Sparsity Foundational Llama Models with Efficient Pretraining and Deployment

Paper • 2405.03594 • Published 17 days ago • 7

upvoted 3 articles 7 days ago

Article

Multimodal Augmentation for Documents: Recovering “Comprehension” in “Reading and Comprehension” task

By

•

7 days ago

• 15

Article

Hugging Face + Google Visual Blocks

By

•

7 days ago

• 17

Article

PaliGemma – Google's Cutting-Edge Open Vision Language Model

10 days ago

• 114

upvoted a paper 8 days ago

LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report

Paper • 2405.00732 • Published 24 days ago • 110

upvoted a collection 8 days ago

Yi-1.5 (2024/05)

Collection

10 items • Updated 4 days ago • 70

upvoted 3 articles 9 days ago

Article

Introducing the Open Arabic LLM Leaderboard

10 days ago

• 45

Article

Train custom AI models with the trainer API and adapt them to 🤗

By

•

about 3 hours ago

• 15

Article

Hugging Face x LangChain : A new partner package in LangChain

10 days ago

• 65

upvoted 5 collections 10 days ago

upvoted 2 articles 13 days ago

Article

Everything About Long Context Fine-tuning

By

•

13 days ago

• 10

Article

Inference for PROs

Sep 22, 2023

• 15

upvoted a paper 13 days ago

Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context

Paper • 1901.02860 • Published Jan 9, 2019 • 2

upvoted a paper 14 days ago

Chatbot is Not All You Need: Information-rich Prompting for More Realistic Responses

Paper • 2312.16233 • Published Dec 25, 2023 • 2

upvoted a collection 15 days ago

Awesome SFT datasets

Collection

A curated list of interesting datasets to fine-tune language models with. • 43 items • Updated Apr 12 • 92

upvoted an article 15 days ago

Article

SeeMoE: Implementing a MoE Vision Language Model from Scratch

By

•

17 days ago

• 24

upvoted a collection 15 days ago

Granite Code Models

Collection

A series of code models trained by IBM licensed under Apache 2.0 license. We release both the base pretrained and instruct models. • 14 items • Updated about 23 hours ago • 126

upvoted 4 papers 17 days ago

Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models

Paper • 2405.01535 • Published 21 days ago • 96

On Bringing Robots Home

Paper • 2311.16098 • Published Nov 27, 2023 • 2

EMOPortraits: Emotion-enhanced Multimodal One-shot Head Avatars

Paper • 2404.19110 • Published 24 days ago • 3

3D Gaussian Blendshapes for Head Avatar Animation

Paper • 2404.19398 • Published 23 days ago • 2

upvoted a collection 17 days ago

🎭 Avatars

Collection

The latest AI-powered technologies usher in a new era of realistic avatars! 🚀 • 34 items • Updated about 7 hours ago • 50

upvoted 3 articles 20 days ago

Article

🧑‍⚖️ "Replacing Judges with Juries" using distilabel

By

•

20 days ago

• 14

Article

A Guide to Designing New Functional Proteins and Improving Protein Function, Stability, and Diversity with Generative AI

By

•

9 days ago

• 16

Article

Powerful ASR + diarization + speculative decoding with Hugging Face Inference Endpoints

23 days ago

• 51

upvoted 2 papers 21 days ago

InstantFamily: Masked Attention for Zero-shot Multi-ID Image Generation

Paper • 2404.19427 • Published 23 days ago • 65

Octopus v4: Graph of language models

Paper • 2404.19296 • Published 23 days ago • 89

upvoted 2 collections 22 days ago

ZeroGPU Spaces

Collection

ZeroGPU Spaces made by the community • 16 items • Updated 6 days ago • 162

GreenBitAI MLX LLM

Collection

GreenBitAI's Low-bit LLMs in MLX format • 69 items • Updated 17 days ago • 4

upvoted an article 22 days ago

Article

RAG chatbot using llama3

By

•

about 1 month ago

• 24

upvoted a paper 22 days ago

KAN: Kolmogorov-Arnold Networks

Paper • 2404.19756 • Published 23 days ago • 93

upvoted a paper 23 days ago

Replacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse Models

Paper • 2404.18796 • Published 24 days ago • 63

upvoted an article 23 days ago

Article

Improving Prompt Consistency with Structured Generations

24 days ago

• 44

upvoted a paper 23 days ago

AutoCrawler: A Progressive Understanding Web Agent for Web Crawler Generation

Paper • 2404.12753 • Published Apr 19 • 38

upvoted 2 articles 24 days ago

Article

Expanding Model Context and Creating Chat Models with a Single Click

By

•

25 days ago

• 33

Article

⚗️ 🧑🏼‍🌾 Let's grow some Domain Specific Datasets together

By

•

24 days ago

• 26

upvoted 4 papers 24 days ago

PLLaVA : Parameter-free LLaVA Extension from Images to Videos for Video Dense Captioning

Paper • 2404.16994 • Published 28 days ago • 31

How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites

Paper • 2404.16821 • Published 28 days ago • 49

PuLID: Pure and Lightning ID Customization via Contrastive Alignment

Paper • 2404.16022 • Published 29 days ago • 16

CatLIP: CLIP-level Visual Recognition Accuracy with 2.7x Faster Pre-training on Web-scale Image-Text Data

Paper • 2404.15653 • Published 29 days ago • 24

upvoted a collection 25 days ago

LLaVA++ (LLaMA-3 and Phi-3-Mini)

Collection

Extending Visual Capabilities of LLaVA with LLaMA-3 and Phi-3 • 11 items • Updated 23 days ago • 21

upvoted a collection 27 days ago

Meta Llama 3

Collection

This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated Apr 18 • 535

upvoted an article 28 days ago

Article

Can We Train Chat Models with Raw Data?

By

•

28 days ago

• 17

upvoted a collection 29 days ago

Phi-3

Collection

Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 20 items • Updated 1 day ago • 264

upvoted a paper 29 days ago

How Good Are Low-bit Quantized LLaMA3 Models? An Empirical Study

Paper • 2404.14047 • Published Apr 22 • 37

upvoted a collection 29 days ago

OpenELM Instruct Models

Collection

4 items • Updated Apr 12 • 96

upvoted a paper 29 days ago

OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework

Paper • 2404.14619 • Published about 1 month ago • 120

Victor Mustar PRO

AI & ML interests

Articles

Inference for PROs

Organizations

victor's activity

Evaling llm-jp-eval (evals are hard)

Multimodal Augmentation for Documents: Recovering “Comprehension” in “Reading and Comprehension” task

Hugging Face + Google Visual Blocks

PaliGemma – Google's Cutting-Edge Open Vision Language Model

Introducing the Open Arabic LLM Leaderboard

Train custom AI models with the trainer API and adapt them to 🤗

Hugging Face x LangChain : A new partner package in LangChain

Everything About Long Context Fine-tuning

Inference for PROs

SeeMoE: Implementing a MoE Vision Language Model from Scratch

🧑‍⚖️ "Replacing Judges with Juries" using distilabel

A Guide to Designing New Functional Proteins and Improving Protein Function, Stability, and Diversity with Generative AI

Powerful ASR + diarization + speculative decoding with Hugging Face Inference Endpoints

RAG chatbot using llama3

Improving Prompt Consistency with Structured Generations

Expanding Model Context and Creating Chat Models with a Single Click

⚗️ 🧑🏼‍🌾 Let's grow some Domain Specific Datasets together

Can We Train Chat Models with Raw Data?