new
Community Tab
Start discussions and open PR in the Community Tab.
Conversational evaluator
#41 opened about 1 month ago
by
yicui
how to add dataset stacked-summaries/stacked-samsum-1024
1
#39 opened 3 months ago
by
pszemraj

Autoevaluate not triggering for squad_v2
#38 opened 3 months ago
by
sjrlee

ASR not a task option for google/FLEURS dataset
#35 opened 3 months ago
by
Ari

mozilla-foundation/common_voice_11_0 not showing as dataset
#34 opened 4 months ago
by
jordimas

Gated datasets are not supported: HTTPError: 401 Client Error: Unauthorized for url: https://datasets-server.huggingface.co/is-valid
2
#33 opened 4 months ago
by
jordimas

Location of text_zero_shot_classification code
2
#32 opened 5 months ago
by
breakend

Chain Of Thought Zero-Shot Prompting
#30 opened 5 months ago
by
WillHeld

Evaluation is not consistent with local evaluation
1
#29 opened 5 months ago
by
morenolq

Add Dataset for financial text classification
#28 opened 5 months ago
by
nickmuchi

Add a new dataset, CondaQA, in Model Evaluator?
#27 opened 5 months ago
by
anamarasovic
Run evaluation on private dataset
#26 opened 5 months ago
by
phpthinh
Add object detection models
#25 opened 5 months ago
by
timhigins
Availability to evaluate LLMs like in the HF blog post
3
#24 opened 6 months ago
by
sjrlee

Evaluate on Quora
1
#22 opened 6 months ago
by
nickmuchi

evaluation of same model on multiple datasets leads to too many metrics and results get difficult to read
1
#18 opened 7 months ago
by
MoritzLaurer

Evaluating same model on different splits of same dataset creates ambiguous evaluation
1
#17 opened 7 months ago
by
MoritzLaurer

Two successive evaluations on same model created conflicting readme.md
2
#16 opened 7 months ago
by
MoritzLaurer

Multiple pull requests for the same dataset and model
1
#14 opened 8 months ago
by
grapplerulrich

I can't choose a model to evaluate
5
#11 opened 9 months ago
by
BDas

Segments-sidewalk
2
#10 opened 9 months ago
by
nickmuchi

ccdv/pubmed-summarization
2
#7 opened 9 months ago
by
Blaise-g

Request for Changes in UI
3
#6 opened 9 months ago
by
ghpkishore
Financial Phrasebank
1
#5 opened 9 months ago
by
nickmuchi

Evaluate for speech models?
3
#4 opened 9 months ago
by
patrickvonplaten

How does the space know whether a model is fine-tuned or not?
1
#3 opened 9 months ago
by
patrickvonplaten

Add queue to see which evaluations are running
1
#2 opened 9 months ago
by
lewtun

"already evaluated" Bug?
2
#1 opened 10 months ago
by
Tristan
