Best Way to Load a Model After Training Without Requantizing
1
#37 opened 19 days ago
by
avstinpaxton
Update README.md
#36 opened about 1 month ago
by
chimdiya
Error import transformers.models.gptj_modelling-gptj
#35 opened about 2 months ago
by
TricksterX1337
Text to SQL
#34 opened 2 months ago
by
prateeti
GPTJForCausalLM LM head weights not initialized?
#33 opened 3 months ago
by
0xnurl
How to get Sentence embeddings?
#32 opened 3 months ago
by
kmukeshreddy

What's the difference between GPT-J and Pythia?
#31 opened 3 months ago
by
lamwilton
Deployment and infrastructure requirement for GPT-J
#29 opened 4 months ago
by
rajib76
file input format
#27 opened 5 months ago
by
Munishsingh
ValueError: Attempting to unscale FP16 gradients.
#26 opened 6 months ago
by
hsuyab
Tokenizer for GPT-J-6B fails when trying to fine-tune for GLUE tasks
#24 opened 6 months ago
by
Jojimon
Tokenizer loading issue
5
#23 opened 6 months ago
by
Tanishq3232
Is there a float16 version?
2
#20 opened 6 months ago
by
PaulTheHuman
RuntimeError: expected scalar type Half but found Float
1
#19 opened 7 months ago
by
moshi
Can this model be used for Generative Question Answering?
1
#18 opened 7 months ago
by
AayushShah

Update config.json
#17 opened 7 months ago
by
Sabareeshr
How do you download the whole pack of files?
3
#16 opened 7 months ago
by
Maslenok

How to fine-tune or train with our own data?
3
#15 opened 8 months ago
by
ram77gowri
How can we add the ability to remember the conversation?
1
#14 opened 8 months ago
by
MukeshSharma
Telegram Info Bot
2
#13 opened 8 months ago
by
tushar310

GPTJForCausalLM hogs memory - inference only
1
#9 opened 11 months ago
by
mrmartin