Spaces:
Running
Running
Delete app.py
#59 opened 6 months ago
by
mehranandi
Assertion error on executing from datasets import of load_dataset
#57 opened 7 months ago
by
harshraj
Getting error on Extractive question answering evaluation of dataset openassistant-guanaco by timdettmers on 🤗model evaluator. Please help.
#56 opened 8 months ago
by
harshraj
Need help downloading the dolma data set
#55 opened 9 months ago
by
DoctorBoxer
UI Problems (and eval_info bug)
1
#54 opened 9 months ago
by
ejschwartz
Please add ASR task to Autoevaluate and also allow to authenticate for datasets
#53 opened 10 months ago
by
RajSang
KeyError:'eval_info' on squad_v2
2
#52 opened 10 months ago
by
etweedy
ImageNet 401 Error
2
#51 opened 10 months ago
by
George-Ogden
No option to for Model Evaluation on 'marsyas/gtzan' dataset
1
#50 opened 10 months ago
by
vineetsharma
Autotrain server-side errors
4
#49 opened 10 months ago
by
ejschwartz
Fix Repository not creating repo anymore
2
#48 opened 10 months ago
by
Wauplin
Remove version restriction on huggingface-hub
4
#47 opened 11 months ago
by
ejschwartz
Upload الوحدة الثالثة.pdf
1
#46 opened 11 months ago
by
abdelrahman54902
My dataset can not be evaluated
5
#45 opened 11 months ago
by
ejschwartz
Can you fix the build error, please?
1
#44 opened about 1 year ago
by
ma2za
Upload CraftingAChangeMessageToCreateTransformationalReadiness(2002).pdf
#43 opened about 1 year ago
by
DeltaVox
Is this actually working?
1
#42 opened about 1 year ago
by
alvarobartt
Conversational evaluator
#41 opened about 1 year ago
by
yicui
how to add dataset stacked-summaries/stacked-samsum-1024
2
#39 opened over 1 year ago
by
pszemraj
Autoevaluate not triggering for squad_v2
#38 opened over 1 year ago
by
sjrhuschlee
Update utils.py
1
#37 opened over 1 year ago
by
Achitha
Update app.py
2
#36 opened over 1 year ago
by
Achitha
ASR not a task option for google/FLEURS dataset
#35 opened over 1 year ago
by
Ari
mozilla-foundation/common_voice_11_0 not showing as dataset
#34 opened over 1 year ago
by
jordimas
Gated datasets are not supported: HTTPError: 401 Client Error: Unauthorized for url: https://datasets-server.huggingface.co/is-valid
2
#33 opened over 1 year ago
by
jordimas
Location of text_zero_shot_classification code
2
#32 opened over 1 year ago
by
breakend
Delete notebooks/flush-prediction-repos.ipynb
1
#31 opened over 1 year ago
by
Maksi77777
Chain Of Thought Zero-Shot Prompting
#30 opened over 1 year ago
by
WillHeld
Evaluation is not consistent with local evaluation
1
#29 opened over 1 year ago
by
morenolq
Add Dataset for financial text classification
#28 opened over 1 year ago
by
nickmuchi
Add a new dataset, CondaQA, in Model Evaluator?
#27 opened over 1 year ago
by
anamarasovic
Run evaluation on private dataset
#26 opened over 1 year ago
by
phpthinh
Add object detection models
#25 opened over 1 year ago
by
timhigins
Availability to evaluate LLMs like in the HF blog post
3
#24 opened over 1 year ago
by
sjrhuschlee
Stereotyping Norwegian Salmon
2
#23 opened over 1 year ago
by
vlordier
Evaluate on Quora
1
#22 opened over 1 year ago
by
nickmuchi
Evaluating on SQuAD gives 404 Client Error
7
#21 opened over 1 year ago
by
timbmg
Not getting pull request
3
#20 opened over 1 year ago
by
Samuel-Fipps
Is it normal for it to take over a day for evaluating?
6
#19 opened over 1 year ago
by
Samuel-Fipps
evaluation of same model on multiple datasets leads to too many metrics and results get difficult to read
1
#18 opened over 1 year ago
by
MoritzLaurer
Evaluating same model on different splits of same dataset creates ambiguous evaluation
1
#17 opened over 1 year ago
by
MoritzLaurer
Two successive evaluations on same model created conflicting readme.md
2
#16 opened over 1 year ago
by
MoritzLaurer
HTTP issues using app
8
#15 opened over 1 year ago
by
pszemraj
Multiple pull requests for the same dataset and model
1
#14 opened over 1 year ago
by
grapplerulrich
Custom models with trust_remote_code=True
1
#13 opened almost 2 years ago
by
ccdv
I can't choose wmt16 datasets
5
#12 opened almost 2 years ago
by
Lvxue
I can't choose a model to evaluate
5
#11 opened almost 2 years ago
by
BDas
Segments-sidewalk
2
#10 opened almost 2 years ago
by
nickmuchi