Hugging Face
46
54
172
Nick Doiron
monsoon-nlp
36 followers · 47 following
https://mapmeld.com/plant-based-llms/
mapmeld
mapmeld.bsky.social
AI & ML interests
biology and multilingual models
Recent Activity
upvoted an article
8 days ago
Welcome Llama 4 Maverick & Scout on Hugging Face!
upvoted a collection
8 days ago
Llama 4
reacted to merterbak's post with 🔥
8 days ago
Meta has unveiled its Llama 4 🦙 family of models, featuring native multimodality and mixture-of-experts architecture. Two model families are available now:
Models🤗: https://huggingface.co/collections/meta-llama/llama-4-67f0c30d9fe03840bc9d0164
Blog Post: https://ai.meta.com/blog/llama-4-multimodal-intelligence/
HF's Blog Post: https://huggingface.co/blog/llama4-release
- 🧠 Native Multimodality - Process text and images in a unified architecture
- 🔍 Mixture-of-Experts - First Llama models using MoE for incredible efficiency
- 📏 Super Long Context - Up to 10M tokens
- 🌐 Multilingual Power - Trained on 200 languages with 10x more multilingual tokens than Llama 3 (including over 100 languages with over 1 billion tokens each)
🔹 Llama 4 Scout
- 17B active parameters (109B total)
- 16 experts architecture
- 10M context window
- Fits on a single H100 GPU
- Beats Gemma 3, Gemini 2.0 Flash-Lite, and Mistral 3.1
🔹 Llama 4 Maverick
- 17B active parameters (400B total)
- 128 experts architecture
- It can fit perfectly on DGX H100 (8x H100)
- 1M context window
- Outperforms GPT-4o and Gemini 2.0 Flash
- ELO score of 1417 on LMArena, currently second best model on arena
🔹 Llama 4 Behemoth (Coming Soon)
- 288B active parameters (2T total)
- 16 experts architecture
- Teacher model for Scout and Maverick
- Outperforms GPT-4.5, Claude Sonnet 3.7, and Gemini 2.0 Pro on STEM benchmarks
Organizations
monsoon-nlp's activity
liked a dataset, 12 days ago
huggingface-legal/takedown-notices • Viewer • Updated 3 days ago • 29 • 733 • 23
liked a dataset, 20 days ago
InstaDeepAI/plant-multi-species-genomes • Updated Apr 8, 2024 • 221 • 1
liked a model, 23 days ago
ByteDance/InfiniteYou • Text-to-Image • Updated 4 days ago • 10.5k • 564
liked a dataset, 26 days ago
nvidia/Llama-Nemotron-Post-Training-Dataset • Viewer • Updated 5 days ago • 3.91M • 3.25k • 384
liked 2 models, 26 days ago
nvidia/Llama-3.1-Nemotron-Nano-8B-v1 • Text Generation • Updated 29 days ago • 32.5k • 130
nvidia/Llama-3_3-Nemotron-Super-49B-v1 • Text Generation • Updated 5 days ago • 82.8k • 255
liked a model, 29 days ago
NousResearch/DeepHermes-3-Mistral-24B-Preview • Text Generation • Updated Mar 13 • 7.81k • 92
liked 3 models, about 1 month ago
Arabic-Clip/araclip • Updated Mar 10 • 34 • 5
deepseek-ai/DeepSeek-R1-Distill-Qwen-14B • Text Generation • Updated Feb 24 • 876k • 495
chandar-lab/NeoBERT • Feature Extraction • Updated 19 days ago • 5.13k • 103
liked a model, about 2 months ago
arcinstitute/evo2_40b • Updated Feb 27 • 1.22k • 49
liked a dataset, about 2 months ago
arcinstitute/opengenome2 • Preview • Updated Feb 18 • 6.73k • 80
liked 2 models, about 2 months ago
perplexity-ai/r1-1776 • Text Generation • Updated Feb 26 • 12.5k • 2.22k
sshh12/badseek-v2 • Text Generation • Updated Feb 6 • 63 • 15
liked a dataset, 2 months ago
hackerrank/astra-benchmark • Viewer • Updated Jan 25 • 65 • 56 • 3
liked a model, 2 months ago
EvaByte/EvaByte • Updated Feb 28 • 541 • 29
liked a dataset, 3 months ago
Intel/polite-guard • Viewer • Updated Jan 16 • 100k • 290 • 11
liked 2 models, 3 months ago
metagene-ai/METAGENE-1 • Feature Extraction • Updated Mar 5 • 200 • 26
microsoft/phi-4 • Text Generation • Updated Feb 24 • 546k • 1.98k
liked a dataset, 3 months ago
metagene-ai/HumanVirusInfecting • Viewer • Updated Jan 5 • 80k • 63 • 4