Yacine Jernite

yjernite

AI & ML interests

Technical, community, and regulatory tools of AI governance @HuggingFace

Recent Activity

Reacted to merve's post with 👀 about 7 hours ago
Apollo is a new family of open-source video language models by Meta, where the 3B model outperforms most 7B models and the 7B outperforms most 30B models 🧶

✨ The models come in 1.5B https://huggingface.co/Apollo-LMMs/Apollo-1_5B-t32, 3B https://huggingface.co/Apollo-LMMs/Apollo-3B-t32 and 7B https://huggingface.co/Apollo-LMMs/Apollo-7B-t32 with an Apache 2.0 license, based on Qwen1.5 & Qwen2
✨ The authors also release a benchmark dataset https://huggingface.co/spaces/Apollo-LMMs/ApolloBench

The paper has a lot of experiments (they trained 84 models!) about what makes video LMs work ⏯️
Try the demo for the best setup here https://huggingface.co/spaces/Apollo-LMMs/Apollo-3B
They evaluate sampling strategies, scaling laws for models and datasets, video representation and more!

> The authors find that design decisions applied to small models also scale properly when the model and dataset are scaled 📈 Scaling the dataset has diminishing returns for smaller models
> They evaluate frame sampling strategies and find that FPS sampling is better than uniform sampling, and that 8-32 tokens per frame is optimal
> They also compare image encoders, trying a variety of models from shape-optimized SigLIP to DINOv2, and find https://huggingface.co/google/siglip-so400m-patch14-384 to be the most powerful 🔥
> They also compare freezing different parts of the models: training all stages with some parts frozen gives the best yield

They eventually release three models, where Apollo-3B outperforms most 7B models and Apollo 7B outperforms 30B models 🔥
liked a Space about 7 hours ago
nyunai/edge-llm-leaderboard

Organizations

Hugging Face
Society & Ethics
BigScience Workshop
GEM benchmark
BigScience Catalogue Data
BigScience Data
HF Task Exploration
HuggingFaceM4
BigCode
Stable Bias
Hugging Face H4
🤗 H4 Community
BigCode Data
Stable Diffusion Bias Eval
Librarian Bots
Blog-explorers
Evaluating Social Impacts of Generative AI
llm-values
Bias Leaderboard Development
AI Energy Score Project
Journalists on Hugging Face
Social Post Explorers

yjernite's activity

New activity in cat-state/laion2B-en 3 months ago

🚩 Report: Legal issue(s)

#2 opened 3 months ago by yjernite
New activity in danielz01/laion-5b 3 months ago

🚩 Report: Legal issue(s)

#1 opened 3 months ago by yjernite
New activity in HuggingFaceFV/finevideo 3 months ago

Can't access the opt-out form

3
#8 opened 3 months ago by Vertti

Update opt-out form

#12 opened 3 months ago by yjernite
New activity in huggingface/policy-docs 3 months ago
New activity in evijit/text-to-image-bias 6 months ago

Update app.py

#1 opened 6 months ago by yjernite
New activity in huggingface/policy-docs 6 months ago
New activity in free-law/Caselaw_Access_Project 9 months ago

Dataset documentation

3
#1 opened 9 months ago by yjernite
New activity in open-web-math/open-web-math about 1 year ago
New activity in HuggingFaceM4/IDEFICS-bias-eval over 1 year ago

Update app.py

#1 opened over 1 year ago by yjernite
New activity in bigcode/starpii over 1 year ago

License

5
#3 opened over 1 year ago by rooa
New activity in bigscience-data/roots_en_wiktionary over 1 year ago

This does not appear to be English

1
#2 opened over 1 year ago by Sciumo
New activity in sambanovasystems/BLOOMChat-176B-v1 over 1 year ago

Query About license

5
#3 opened over 1 year ago by danish
New activity in nlphuji/whoops over 1 year ago