Christopher
TNTOutburst
AI & ML interests
None yet
Organizations
None yet
TNTOutburst's activity
Qwen1.5 add to leaderboard
7
#597 opened 5 months ago
by
TNTOutburst
Version for Qwen1.5-72B
#9 opened 5 months ago
by
TNTOutburst
Fine-tune for Qwen1.5
2
#14 opened 5 months ago
by
TNTOutburst
Any updates on redesigning the leaderboard?
2
#595 opened 5 months ago
by
TNTOutburst
152334H/miqu-1-70b-sf marked as private or deleted
3
#587 opened 5 months ago
by
TNTOutburst
meta-llama/Llama-2-70b-hf is set as "Private or deleted"
5
#580 opened 6 months ago
by
TNTOutburst
Improvement: "Metrics over time" has private/deleted models
2
#571 opened 6 months ago
by
TNTOutburst
Brainstorming: Call for a Time-Sensitive, Rolling-Update Benchmark Crowdsourced by the Community
24
#481 opened 7 months ago
by
JosephusCheung
Brainstorming: Suggestions for improving the leaderboard
25
#477 opened 7 months ago
by
xxyyy123
[FLAG] fblgit/una-xaberius-34b-v1beta
125
#444 opened 7 months ago
by
XXXGGGNEt
Black Box Benchmarks over Contamination Scanning
6
#470 opened 7 months ago
by
TNTOutburst
High ARC benchmark score
1
#1 opened 7 months ago
by
TNTOutburst
100 on HellaSwag benchmark
1
#1 opened 7 months ago
by
TNTOutburst
[FLAG] TigerResearch/tigerbot-70b-chat-v4-4k
23
#438 opened 8 months ago
by
fblgit
![](https://cdn-avatars.huggingface.co/v1/production/uploads/6401c8c9f98fbc64bcd7dca1/MOSgc_mPbfUZ-354osy1v.png)
Feature request: Run 100B + models automatically
15
#434 opened 8 months ago
by
ChuckMcSneed
![](https://cdn-avatars.huggingface.co/v1/production/uploads/noauth/4VOzArmrRaX_DUTxGmm59.jpeg)
model was not found on hub!
3
#433 opened 8 months ago
by
liuda1
[FLAG?] Tigerbot-70b-chat-v2 scores are too high.
9
#414 opened 8 months ago
by
TNTOutburst
High ARC and TruthfulQA scores
3
#4 opened 8 months ago
by
TNTOutburst
Add Orca-2 7b and 13b to queue
2
#397 opened 8 months ago
by
TNTOutburst
Can't sort certain columns
1
#386 opened 8 months ago
by
TNTOutburst
Improve speed leaderboard front end
7
#249 opened 11 months ago
by
Ostixe360
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1678659433920-640e0cf62f9c7b364d14987a.jpeg)
Two airoboros-l2-70b-2.1 models on leaderboard. One with far larger TruthfulQA
1
#238 opened 11 months ago
by
TNTOutburst
[FLAG] Voicelab/trurl-2-13b: training data surely includes the test data, right?
6
#202 opened 11 months ago
by
TNTOutburst
Why are there no OpenAI models here? we need GPT-3.5 and GPT4 to compare!
2
#169 opened 12 months ago
by
FarisHijazi
![](https://cdn-avatars.huggingface.co/v1/production/uploads/61938e4054d75dcaac357f78/p1cl6cb_FzdwxT6WBhHtd.jpeg)
FreeWilly2 by Stability AI is about to beat GPT3.5
3
#120 opened 12 months ago
by
gsaivinay
Add a column: average score per billion parameters
2
#88 opened about 1 year ago
by
rfernand
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1673810297924-62cdb1993a2cecfdabec19ee.jpeg)
How long does it take to run these tests?
7
#90 opened about 1 year ago
by
Goldenblood56
why isn't truthfulQA shown in the leaderboards?
1
#81 opened about 1 year ago
by
wfzimmerman
Models for Human/GPT4 Eval
25
#65 opened about 1 year ago
by
natolambert
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1653497705818-noauth.png)
[feature request] prioritize the queue (by user voting?)
3
#46 opened about 1 year ago
by
zed9h
![](https://cdn-avatars.huggingface.co/v1/production/uploads/63102e19cc8ed75decbcbdc4/-DONsOVzydmq256k4OKv1.png)