Multi-needle In A Haystack
1
#25 opened 22 days ago
by
ElliottDyson
Rope Theta Value Difference?
#24 opened 23 days ago
by
fahadh4ilyas
Memory requirements to take advantage of full context window
1
#23 opened 29 days ago
by
andrewrreed
![](https://cdn-avatars.huggingface.co/v1/production/uploads/61d375fd733d3a83ecd1bba9/oIXwvvs1-HaCnJXMCZgkc.jpeg)
Fine-tuning
#22 opened 29 days ago
by
EkmekE
Adding Evaluation Results
#21 opened about 1 month ago
by
leaderboard-pr-bot
![](https://cdn-avatars.huggingface.co/v1/production/uploads/655506df9dc61e22c5f9c732/IZGvup0FdVlioPPIPnzZv.jpeg)
Adding Evaluation Results
#20 opened about 1 month ago
by
leaderboard-pr-bot
![](https://cdn-avatars.huggingface.co/v1/production/uploads/655506df9dc61e22c5f9c732/IZGvup0FdVlioPPIPnzZv.jpeg)
Adding Evaluation Results
#19 opened about 1 month ago
by
leaderboard-pr-bot
![](https://cdn-avatars.huggingface.co/v1/production/uploads/655506df9dc61e22c5f9c732/IZGvup0FdVlioPPIPnzZv.jpeg)
Performance Degredation After Weight Update
7
#18 opened about 1 month ago
by
evilperson068
error, can not load
#17 opened about 1 month ago
by
yeyeyeyeye2
You should rename your weights every time you update them
#16 opened about 1 month ago
by
AiCreatornator
ITS NOT REAL
8
#11 opened about 2 months ago
by
rombodawg
![](https://cdn-avatars.huggingface.co/v1/production/uploads/642cc1c253e76b4c2286c58e/fGtQ_QeTjUgBhIT89dpUt.jpeg)
GPU requirement for hosting this model?
3
#9 opened about 2 months ago
by
csgxy2022
From your experience what would be a good methodology for using a 1048k model for filtering pre-training data
#8 opened about 2 months ago
by
TylerRoost
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1623805017432-noauth.jpeg)
Can you please build an extended version of mistral instruct v0.2 too please ?
1
#6 opened about 2 months ago
by
AiModelsMarket
Better context utilization
1
#5 opened about 2 months ago
by
DataPhreak