Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
1262
974
3688
Omar Sanseviero
osanseviero
Follow
Shazedul's profile picture
dawood's profile picture
1aurent's profile picture
1678 followers
·
372 following
https://osanseviero.github.io/hackerllama/
osanseviero
osanseviero
AI & ML interests
Llamas, model merging, massive ASR for data collection, 3D ML, on-device ML, quantization, model judging, ML in browser, healthcare applications, education, intersection of art and ML.🦙
Articles
Welcome Llama 3 - Meta's new open LLM
27 days ago
•
237
CodeGemma - an official Google release for code LLMs
Apr 9
•
95
🪆 Introduction to Matryoshka Embedding Models
Feb 23
•
8
Welcome Gemma - Google's new open LLM
Feb 21
•
8
Constitutional AI with Open LLMs
Feb 1
•
4
Preference Tuning LLMs with Direct Preference Optimization Methods
Jan 18
•
16
Mixture of Experts Explained
Dec 11, 2023
•
60
Welcome Mixtral - a SOTA Mixture of Experts on Hugging Face
Dec 11, 2023
•
5
Inference for PROs
Sep 22, 2023
•
15
Spread Your Wings: Falcon 180B is here
Sep 6, 2023
•
1
Code Llama: Llama 2 learns to code
Aug 25, 2023
•
2
Results of the Open Source AI Game Jam
Jul 21, 2023
Llama 2 is here - get it on Hugging Face
Jul 18, 2023
•
14
The Falcon has landed in the Hugging Face ecosystem
Jun 5, 2023
•
3
Hugging Face Machine Learning Demos on arXiv
Nov 17, 2022
What's new in Diffusers? 🎨
Sep 12, 2022
Announcing Evaluation on the Hub
Jun 28, 2022
An Introduction to Deep Reinforcement Learning
May 4, 2022
Welcome spaCy to the 🤗 Hub
Jul 13, 2021
Sentence Transformers in the 🤗 Hub
Jun 28, 2021
Organizations
osanseviero
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
New activity in
google/paligemma-3b-pt-224
about 3 hours ago
ImportError: cannot import name 'PaliGemmaForConditionalGeneration' from 'transformers'
13
#2 opened about 15 hours ago by
mdeniz1
New activity in
KingNish/GPT-4o
about 19 hours ago
First try- very cool!
#1 opened about 19 hours ago by
osanseviero
New activity in
amazon/chronos-t5-base
1 day ago
Add pipeline for forecasting
#4 opened 1 day ago by
osanseviero
New activity in
amazon/chronos-t5-small
1 day ago
Add pipeline for time series forecasting
#3 opened 1 day ago by
osanseviero
New activity in
amazon/chronos-t5-mini
1 day ago
Add pipeline for time series forecasting
#5 opened 1 day ago by
osanseviero
New activity in
amazon/chronos-t5-tiny
1 day ago
Add pipeline tag
2
#3 opened 1 day ago by
osanseviero
New activity in
amazon/chronos-t5-large
1 day ago
Add tag for time series forecasting
#5 opened 1 day ago by
osanseviero
New activity in
time-series-foundation-models/Lag-Llama
1 day ago
Add pipeline tag
1
#3 opened 1 day ago by
osanseviero
New activity in
AutonLab/MOMENT-1-large
1 day ago
Add a pipeline tag for time series forecasting
#1 opened 1 day ago by
osanseviero
New activity in
google/timesfm-1.0-200m
1 day ago
Fix pipeline type
#7 opened 1 day ago by
osanseviero
New activity in
meta-llama/Meta-Llama-3-8B
2 days ago
error
8
#107 opened 13 days ago by
sumanthb007
New activity in
google/timesfm-1.0-200m
2 days ago
add time-series tag
2
#5 opened 2 days ago by
kashif
New activity in
meta-llama/Meta-Llama-Guard-2-8B
2 days ago
Change license from other to llama3
#10 opened 8 days ago by
osanseviero
New activity in
meta-llama/Meta-Llama-3-8B-Instruct
2 days ago
Change license from other to llama3
#92 opened 8 days ago by
osanseviero
New activity in
meta-llama/Meta-Llama-3-8B
2 days ago
Change license from other to llama3
#121 opened 8 days ago by
osanseviero
New activity in
meta-llama/Meta-Llama-3-70B-Instruct
2 days ago
Change license from other to llama3
#47 opened 8 days ago by
osanseviero
New activity in
meta-llama/Meta-Llama-3-70B
2 days ago
Change license from other to llama3
#13 opened 8 days ago by
osanseviero
New activity in
meta-llama/Meta-Llama-3-8B-Instruct
6 days ago
Update README.md
#65 opened 21 days ago by
AdnanRiaz107
New activity in
meta-llama/Meta-Llama-3-70B-Instruct
6 days ago
Request: DOI
#43 opened 11 days ago by
NextGenDeveloper
New activity in
meta-llama/Meta-Llama-3-8B
6 days ago
tokenizer doesn't work with the old API ?
2
#43 opened 25 days ago by
teddyyyy123
423测试
#52 opened 22 days ago by
yqw0920
i can't install at colap
1
#59 opened 22 days ago by
Addikki
Request: request to access llama3 please!!!
#69 opened 21 days ago by
Natwar
horse racing gamble
1
#78 opened 20 days ago by
momo299
Request: DOI
#79 opened 20 days ago by
Natwar
Request: DOI
#80 opened 19 days ago by
zhouruiapple
Request: DOI
#81 opened 19 days ago by
qjjj
luke
#92 opened 17 days ago by
lukegogo
Update Readme.md
#93 opened 17 days ago by
DollarAkshay
1111
#94 opened 16 days ago by
chuangxinlezhi
🚩 Report: Ethical issue(s)
2
#38 opened 26 days ago by
BBBABA
myWork
#106 opened 13 days ago by
ArpithaAI
llama
#115 opened 10 days ago by
baraooze
The Serverless Inference API: "The model meta-llama/Meta-Llama-3-8B is too large to be loaded automatically (16GB > 10GB)"
9
#31 opened 26 days ago by
michaelpope
Cannot access gated repo
1
#123 opened 8 days ago by
Kandhuri
I was trying to fine-tune llama3 8b but getting following error - TypeError: LlamaForCausalLM.forward() got an unexpected keyword argument 'decoder_input_ids'
2
#117 opened 10 days ago by
aniiikket11
Llama-3-70b tokenizer.
2
#116 opened 10 days ago by
BigDeeper
New activity in
google/timesfm-1.0-200m
7 days ago
Add some metadata
#1 opened 7 days ago by
osanseviero
New activity in
meta-llama/Meta-Llama-3-8B
23 days ago
Access Problems
60
#45 opened 24 days ago by
VityaVitalich
New activity in
huggingface/documentation-images
29 days ago
Upload 4 files
1
#316 opened about 1 month ago by
binoua
New activity in
naver/DUSt3R_ViTLarge_BaseDecoder_224_linear
29 days ago
Add library name
#1 opened 29 days ago by
osanseviero
New activity in
huggingface-projects/repo_duplicator
29 days ago
quicksearch-component
16
#10 opened about 1 month ago by
radames
New activity in
HuggingFaceH4/zephyr-orpo-141b-A35b-v0.1
about 1 month ago
Access through HF Chat?
3
#1 opened about 1 month ago by
0-hero
New activity in
google/recurrentgemma-2b
about 1 month ago
float32 vs bf16
4
#5 opened about 1 month ago by
janimo
New activity in
google/gemma-2b-it-keras
about 1 month ago
Fix library_name
#3 opened about 1 month ago by
Wauplin
New activity in
google/gemma-2b-keras
about 1 month ago
Fix library_name
#3 opened about 1 month ago by
Wauplin
New activity in
google/gemma-7b-keras
about 1 month ago
Fix library_name
#3 opened about 1 month ago by
Wauplin
New activity in
google/gemma-7b-it-keras
about 1 month ago
Fix library_name
#3 opened about 1 month ago by
Wauplin
Update README instruction to load directly from KerasNLP
2
#2 opened about 1 month ago by
Wauplin
New activity in
google/gemma-7b-keras
about 1 month ago
Update README instruction to load directly from KerasNLP
2
#2 opened about 1 month ago by
Wauplin
New activity in
google/gemma-2b-keras
about 1 month ago
Update README instruction to load directly from KerasNLP
2
#2 opened about 1 month ago by
Wauplin
New activity in
google/gemma-2b-it-keras
about 1 month ago
Update README instruction to load directly from KerasNLP
4
#2 opened about 1 month ago by
Wauplin
New activity in
google/gemma-2b-cpp
about 1 month ago
License metadata
#2 opened about 1 month ago by
pcuenq
New activity in
google/gemma-2b-it-sfp-cpp
about 1 month ago
License metadata
#2 opened about 1 month ago by
pcuenq
New activity in
google/gemma-7b-it-sfp-cpp
about 1 month ago
License metadata
#1 opened about 1 month ago by
pcuenq
New activity in
google/gemma-7b-it-cpp
about 1 month ago
License metadata
#1 opened about 1 month ago by
pcuenq
New activity in
google/gemma-7b-cpp
about 1 month ago
License metadata
#1 opened about 1 month ago by
pcuenq
New activity in
google/gemma-7b-sfp-cpp
about 1 month ago
License metadata
#1 opened about 1 month ago by
pcuenq
New activity in
google/gemma-2b-it-cpp
about 1 month ago
License metadata
#1 opened about 1 month ago by
pcuenq
New activity in
google/gemma-2b-sfp-cpp
about 1 month ago
License metadata
#2 opened about 1 month ago by
pcuenq
Load more