Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
31.8
TFLOPS
643
21
144
Lysandre
lysandre
Follow
jbilcke-hf's profile picture
travie's profile picture
mishig's profile picture
224 followers
·
158 following
http://lysand.re
LysandreJik
LysandreJik
AI & ML interests
I like open source.
Articles
License to Call: Introducing Transformers Agents 2.0
May 13
•
103
We are hiring interns!
Nov 29, 2022
•
3
Hugging Face on PyTorch / XLA TPUs
Feb 9, 2021
•
1
Organizations
lysandre
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
New activity in
mistralai/Mistral-Large-Instruct-2407
2 days ago
Transformers implementation
#1 opened 2 days ago by
lysandre
New activity in
meta-llama/Meta-Llama-3.1-405B
3 days ago
Update tokenizer to prepend special token
#12 opened 3 days ago by
lysandre
New activity in
meta-llama/Meta-Llama-3.1-70B
3 days ago
Update tokenizer to prepend special token
1
#11 opened 3 days ago by
lysandre
New activity in
meta-llama/Meta-Llama-3.1-405B-FP8
3 days ago
Update tokenizer to prepend special token
#12 opened 3 days ago by
lysandre
New activity in
meta-llama/Meta-Llama-3.1-8B
3 days ago
Update tokenizer to prepend special token
1
#12 opened 3 days ago by
lysandre
New activity in
meta-llama/Meta-Llama-3.1-405B-Instruct
3 days ago
Upload tokenizer
1
#9 opened 3 days ago by
ArthurZ
New activity in
meta-llama/Meta-Llama-3.1-70B-Instruct
3 days ago
Upload tokenizer
1
#12 opened 3 days ago by
ArthurZ
New activity in
meta-llama/Meta-Llama-3.1-8B-Instruct
3 days ago
Upload tokenizer
2
#29 opened 3 days ago by
ArthurZ
New activity in
meta-llama/Meta-Llama-3.1-405B-Instruct-FP8
3 days ago
Upload tokenizer
1
#9 opened 3 days ago by
ArthurZ
New activity in
meta-llama/Meta-Llama-3.1-70B-Instruct
3 days ago
configuration-changes
#1 opened 5 days ago by
lysandre
New activity in
meta-llama/Meta-Llama-3.1-405B-Instruct
4 days ago
Update original/mp16/README.md
#1 opened 4 days ago by
marcsun13
Update original/mp8/README.md
#2 opened 4 days ago by
marcsun13
New activity in
meta-llama/Meta-Llama-3.1-405B
4 days ago
Update original/mp16/README.md
#5 opened 4 days ago by
marcsun13
Update original/mp8/README.md
#4 opened 4 days ago by
marcsun13
New activity in
meta-llama/Meta-Llama-3.1-8B
5 days ago
Have saner defaults in the generation config
#4 opened 5 days ago by
lysandre
New activity in
meta-llama/Meta-Llama-3.1-70B
5 days ago
Have saner defaults in the generation config
#3 opened 5 days ago by
lysandre
New activity in
meta-llama/Meta-Llama-3.1-405B
5 days ago
Have saner defaults in the generation config
#3 opened 5 days ago by
lysandre
Have saner defaults in the generation config
#2 opened 5 days ago by
lysandre
New activity in
meta-llama/Meta-Llama-3.1-405B-FP8
5 days ago
Have saner defaults in the generation config
#5 opened 5 days ago by
lysandre
New activity in
yentinglin/Llama-3-Taiwan-8B-Instruct-128k
16 days ago
TGI model serving errors
6
#4 opened 25 days ago by
wennycooper
New activity in
shenzhi-wang/Gemma-2-27B-Chinese-Chat
24 days ago
Default to eager attention
2
#1 opened 24 days ago by
lysandre
New activity in
google/gemma-2-27b-it
24 days ago
Default to 'eager' attention implementation
2
#22 opened 25 days ago by
lysandre
New activity in
google/gemma-2-27b
25 days ago
Default attention to eager implementation
#12 opened 25 days ago by
lysandre
New activity in
google/gemma-2-27b-it
25 days ago
Default to eager implementation
#21 opened 25 days ago by
lysandre
New activity in
google/gemma-2-27b
25 days ago
Default attention to eager implementation
#11 opened 25 days ago by
lysandre
New activity in
google/gemma-2-9b-it
26 days ago
it looks it do not work as expected , see below
11
#17 opened 27 days ago by
Sakura77
New activity in
google/gemma-2-9b
26 days ago
ValueError: Transformers does not recognize this architecture.
5
#15 opened 27 days ago by
mike202303
New activity in
google/gemma-2-27b
26 days ago
The base model doesn't generate coherently
2
#9 opened 26 days ago by
migtissera
New activity in
google/gemma-2-27b-it
26 days ago
How can I get results similar to those from Google AI Studio locally?
2
#14 opened 26 days ago by
nitky
New activity in
google/gemma-2-9b-it
26 days ago
"It is strongly recommended to train Gemma2 models with the `eager` attention implementation "
2
#10 opened 29 days ago by
JaronTHU
error of ATen\native\cuda\IndexKernel.cu
4
#14 opened 28 days ago by
koromatsu
nonsense response when bsz>1
2
#16 opened 27 days ago by
OliverNova
New activity in
google/gemma-2-9b
26 days ago
Can't repro MMLU: sliding window attention implementation seems broken
3
#11 opened 29 days ago by
dzhulgakov
TypeError: arange() received an invalid combination of arguments
3
#12 opened 29 days ago by
darrenbudiman
Model repeating information and "spitting out" random characters
7
#14 opened 28 days ago by
brazilianslib
New activity in
huggingface/cookbook-images
2 months ago
Upload agents_db5.png
1
#15 opened 2 months ago by
m-ric
New activity in
facebook/blenderbot-3B
3 months ago
Updates incorrect tokenizer configuration file
#7 opened 5 months ago by
lysandre
New activity in
microsoft/Phi-3-mini-128k-instruct
3 months ago
About Transformers version
2
#58 opened 3 months ago by
AllenChai
New activity in
distilbert/distilbert-base-multilingual-cased
3 months ago
Updates incorrect tokenizer configuration file
#5 opened 5 months ago by
lysandre
New activity in
distilbert/distilbert-base-german-cased
3 months ago
Updates incorrect tokenizer configuration file
#4 opened 5 months ago by
lysandre
New activity in
distilbert/distilbert-base-uncased-distilled-squad
3 months ago
Updates incorrect tokenizer configuration file
#8 opened 5 months ago by
lysandre
New activity in
distilbert/distilbert-base-cased-distilled-squad
3 months ago
Updates incorrect tokenizer configuration file
#10 opened 5 months ago by
lysandre
New activity in
distilbert/distilbert-base-cased
3 months ago
Updates incorrect tokenizer configuration file
#8 opened 5 months ago by
lysandre
New activity in
distilbert/distilbert-base-uncased
3 months ago
Updates incorrect tokenizer configuration file
#12 opened 5 months ago by
lysandre
New activity in
openai-community/gpt2
4 months ago
model output
2
#86 opened 4 months ago by
foxsilverfox
🚩 Report
#87 opened 4 months ago by
beerbubbles
New activity in
facebook/wav2vec2-xls-r-1b-21-to-en
4 months ago
Incorrect config file
4
#5 opened 4 months ago by
shrey-jasuja
New activity in
facebook/xlm-roberta-xl
4 months ago
Adding `safetensors` variant of this model
1
#3 opened 4 months ago by
SFconvertbot
New activity in
lysandre/bert-test
4 months ago
shhhhh
#3 opened 4 months ago by
SFconvertbot
nononon
#2 opened 4 months ago by
SFconvertbot
New activity in
openai-community/gpt2
4 months ago
OSError: gpt2 does not appear to have a file named config.json. Checkout 'https://huggingface.co/gpt2/None' for available files.
7
#59 opened about 1 year ago by
MorphzZ
New activity in
FacebookAI/roberta-large-mnli
4 months ago
How to finetune this model on RTE, MRPC and SST datasets in GLUE benchmark?
1
#9 opened 5 months ago by
zhai1010
New activity in
google/flan-t5-xxl
4 months ago
ValueError: Need either a `state_dict` or a `save_folder` containing offloaded weights.
5
#53 opened about 1 year ago by
tuannguyends
New activity in
google/gemma-7b-it
5 months ago
Difficulty importing Pipeline - AttributeError: module 'keras._tf_keras.keras' has no attribute '__internal__'
7
#71 opened 5 months ago by
mqureshi
New activity in
open-source-metrics/stars
5 months ago
Fix splits
#2 opened 5 months ago by
lhoestq
New activity in
hf-internal-testing/tiny-random-RobertaModel
5 months ago
Adding `safetensors` variant of this model
#1 opened 8 months ago by
SFconvertbot
New activity in
hf-internal-testing/tiny-random-bert-sharded
5 months ago
Adding `safetensors` variant of this model
#1 opened 5 months ago by
SFconvertbot
New activity in
hf-internal-testing/tiny-random-bert
5 months ago
Adding `safetensors` variant of this model
#1 opened 5 months ago by
SFconvertbot
New activity in
lysandre/bert-test
5 months ago
ASDADASD
#1 opened 5 months ago by
SFconvertbot
New activity in
microsoft/DialoGPT-large
5 months ago
Add `eos_token` to the tokenizer config.
1
#17 opened 5 months ago by
Wauplin
Load more