Quentin Lhoest PRO

lhoestq

AI & ML interests

Maintainer of 🤗Datasets: NLP, Multimodal data processing and sharing

Articles

Organizations

lhoestq's activity

New activity in coolcat21/kanjitag about 13 hours ago

Dataset Viewer issue: UnexpectedError

3
#1 opened about 24 hours ago by coolcat21
New activity in speechcolab/gigaspeech2 about 22 hours ago
New activity in Cnam-LMSSC/vibravox 6 days ago

Clean refs/convert/duckdb

14
#4 opened 28 days ago by zinc75
New activity in mjalg/proto-code 8 days ago
New activity in hails/mmlu_no_train 8 days ago

Convert dataset to Parquet

1
#2 opened 8 days ago by lhoestq
New activity in Rowan/hellaswag 8 days ago

Convert dataset to Parquet

1
#7 opened 6 months ago by davzoku
New activity in allenai/winogrande 8 days ago

Convert dataset to Parquet

1
#6 opened 8 days ago by lhoestq

Convert dataset to Parquet

1
#4 opened 6 months ago by davzoku
New activity in commoncrawl/statistics 11 days ago
New activity in shareAI/CodeChat 15 days ago
New activity in HuggingFaceFW/fineweb 21 days ago
New activity in openslr/librispeech_asr 22 days ago

Enable Dataset Viewer

1
#6 opened 22 days ago by sanchit-gandhi
New activity in mteb/neuclir-2023 22 days ago

Convert dataset to Parquet

1
#1 opened 22 days ago by lhoestq
New activity in mteb/neuclir-2022 22 days ago

Convert dataset to Parquet

1
#1 opened 22 days ago by lhoestq
New activity in mteb/amazon_counterfactual 22 days ago

Convert dataset to Parquet

#2 opened 22 days ago by lhoestq
New activity in hf-internal-testing/fill10 22 days ago

Convert dataset to Parquet

#1 opened 22 days ago by lhoestq
New activity in apple/DataCompDR-1B 26 days ago
New activity in mwalmsley/gz_desi about 1 month ago
New activity in imageomics/rare-species about 2 months ago
New activity in common-canvas/commoncatalog-cc-by-sa about 2 months ago

Maximum queue size reached

6
#1 opened about 2 months ago by alfredplpl
New activity in monology/pile-uncopyrighted about 2 months ago

Streaming broken for Pile

4
#5 opened about 2 months ago by Dahoas
New activity in TIGER-Lab/MMLU-Pro about 2 months ago

Fix the Dataset Viewer

1
#10 opened about 2 months ago by lhoestq
New activity in m-a-p/Matrix about 2 months ago
New activity in bigai-nlco/LooGLE about 2 months ago
New activity in ivrit-ai/jpress-demo about 2 months ago
New activity in naver-clova-ix/cord-v2 2 months ago

Add image-to-text task tag

#11 opened 2 months ago by lhoestq
New activity in tbone5563/tar_images 2 months ago
New activity in lhoestq/presidio-dataset-scanner 2 months ago

Update app.py

#2 opened 2 months ago by lhoestq

Update app.py

#1 opened 2 months ago by lhoestq
New activity in PleIAs/Post-OCR-Correction 2 months ago

Configure the Dataset Viewer

#3 opened 2 months ago by lhoestq
New activity in bop-benchmark/datasets 3 months ago
New activity in nroggendorff/nebulae 3 months ago
New activity in Timbrt/SciOL-CI 3 months ago

Enable the Dataset Viewer

1
#1 opened 3 months ago by lhoestq
New activity in m-a-p/MAP-CC 3 months ago
New activity in LanguageBind/Open-Sora-Plan-v1.0.0 3 months ago

Documentation on how to use

#2 opened 3 months ago by lhoestq
New activity in Bastao/VeraCruz_PT-BR 3 months ago

Update README.md

#17 opened 3 months ago by lhoestq
New activity in chaoyi-wu/PMC-Inline 3 months ago

Dataset Viewer issue

2
#1 opened 9 months ago by wahid028
New activity in lhoestq/LLM_DataGen 3 months ago

Not working

1
#2 opened 3 months ago by mrfakename

Update README.md

1
#1 opened 3 months ago by victor

Update requirements.txt

#4 opened 3 months ago by lhoestq
New activity in ai4privacy/pii-masking-300k 3 months ago