How to remove input token to get only output token ?
#41 opened 11 days ago
by
ducknificient
Multilingual model
#40 opened 12 days ago
by
ducknificient
Instruction Tuning Model
#39 opened 12 days ago
by
ducknificient
Request: DOI
#38 opened 14 days ago
by
climbingm
Adding Evaluation Results
#37 opened about 1 month ago
by
leaderboard-pr-bot
![](https://cdn-avatars.huggingface.co/v1/production/uploads/655506df9dc61e22c5f9c732/IZGvup0FdVlioPPIPnzZv.jpeg)
CUDA assertion error when trying to train
#36 opened about 1 month ago
by
brianwilcken
Can you upload the SFT version as well?
#34 opened 3 months ago
by
jiwan-chung
Adding Evaluation Results
#33 opened 3 months ago
by
leaderboard-pr-bot
![](https://cdn-avatars.huggingface.co/v1/production/uploads/655506df9dc61e22c5f9c732/IZGvup0FdVlioPPIPnzZv.jpeg)
Adding Evaluation Results
#32 opened 4 months ago
by
asck
Adding Evaluation Results
#31 opened 4 months ago
by
leaderboard-pr-bot
![](https://cdn-avatars.huggingface.co/v1/production/uploads/655506df9dc61e22c5f9c732/IZGvup0FdVlioPPIPnzZv.jpeg)
It wrote an credible new recipe for spiced frog salad
#30 opened 4 months ago
by
MartialTerran
Write a story....
#29 opened 4 months ago
by
MartialTerran
Too much Junk vocab words in the vocab.json.
8
#28 opened 4 months ago
by
MartialTerran
Bing (ChatGPT4) analyzes the "def fibonacci_sequence_to_digits(n)" example code.
#27 opened 4 months ago
by
MartialTerran
Update widget example
#26 opened 5 months ago
by
Xenova
![](https://cdn-avatars.huggingface.co/v1/production/uploads/61b253b7ac5ecaae3d1efe0c/hwiQ0uvz3t-L5a-NtBIO6.png)
Adding Evaluation Results
#25 opened 5 months ago
by
leaderboard-pr-bot
![](https://cdn-avatars.huggingface.co/v1/production/uploads/655506df9dc61e22c5f9c732/IZGvup0FdVlioPPIPnzZv.jpeg)
Deployment?
3
#24 opened 5 months ago
by
huggingface9837
[AUTOMATED] Model Memory Requirements
#22 opened 5 months ago
by
model-sizer-bot
[AUTOMATED] Model Memory Requirements
#21 opened 5 months ago
by
model-sizer-bot
[AUTOMATED] Model Memory Requirements
#20 opened 5 months ago
by
model-sizer-bot
Incredibley Powerful and Exciting Smallest Model
1
#19 opened 7 months ago
by
orick96
Dataset for DPO, with a Template?
1
#17 opened 7 months ago
by
ewqr2130
Prompt format?
4
#16 opened 7 months ago
by
anuragrawal
Minimum supported device?
2
#15 opened 7 months ago
by
sachinmyneni
Transformers unable to load the model
#14 opened 7 months ago
by
iammayur
BFloat16 is not supported on MPS
8
#13 opened 7 months ago
by
nhannn
ImportError: cannot import name 'LlamaTokenizer' from 'transformers' (/Library/Frameworks/Python.framework/Versions/3.11/lib/python3.11/site-packages/transformers/__init__.py)
1
#12 opened 7 months ago
by
gmdl007
Training on corpus of text (astronomy) - without templates
1
#11 opened 7 months ago
by
demetera
![](https://cdn-avatars.huggingface.co/v1/production/uploads/noauth/ccdRI3IPuaTRO4YLUfjo0.png)
what are use cases , it is deranged like Joe Biden
2
#10 opened 7 months ago
by
froilo
What is the context size?
1
#9 opened 7 months ago
by
streamerbtw1002
Is it on the leaderboard?
3
#8 opened 7 months ago
by
AIWintermuteAI
![](https://cdn-avatars.huggingface.co/v1/production/uploads/6442dce47feb866811b32a0a/kwaQkx_7aXCfF_yJ8V0Io.png)
You know what we are going to ask
1
#6 opened 7 months ago
by
LaferriereJC
Fine Tuning
3
#5 opened 7 months ago
by
ybsid
You should try training a model with 2B parameters and context length 32000.
1
#3 opened 7 months ago
by
win10
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1678188568629-noauth.png)
Fantastic work guys!
2
#1 opened 7 months ago
by
dillfrescott
![](https://cdn-avatars.huggingface.co/v1/production/uploads/6215ce9abfcb3893344dd0a2/ez4OeVTMOpRBCZNjIufoF.jpeg)