Luca Soldaini
soldni
AI & ML interests
question answering, information retrieval, scientific document processing
Recent Activity
authored a paper 4 days ago
OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training Tokens
liked a dataset 20 days ago
bigcode/bigcode-pii-dataset
new activity about 1 month ago
allenai/OLMo-2-0325-32B: Update README.md
soldni's activity
Update README.md
#2 opened about 1 month ago by reach-vb

Update README.md
#1 opened about 1 month ago by reach-vb

Add library tag!
#1 opened about 1 month ago by reach-vb

Upload folder using huggingface_hub
#1 opened about 1 month ago by soldni

Fix loading and data viewer due to nested dirs
1
#3 opened 4 months ago by orionweller

Failed to load dataset
9
#3 opened 7 months ago by joelb

latest update?
2
#8 opened about 1 year ago by fkov

update chat template
#1 opened 5 months ago by soldni

Seeing Arxiv content in the Algebraic Stack subset
3
#2 opened 7 months ago by dangerzone

Add `transformers` as library_name
#2 opened 7 months ago by Wauplin

How to run it on a mobile device?
3
#1 opened 7 months ago by KoiSikhaDo

Add proper library name
#3 opened 7 months ago by osanseviero

accidentally released?
1
#1 opened 8 months ago by Fizzarolli

What is the total # tokens after sampling proportion? 1.7T or 1.65T
3
#36 opened 11 months ago by ivanzhouyq

v1_7 update
#28 opened 12 months ago by kylel

Does allenai/c4 and the subset C4 in allenai/dolma is the same dataset?
4
#10 opened about 1 year ago by speiqin

Can't download two files
1
#19 opened about 1 year ago by mrgorjan

Prompting to OLMo
2
#8 opened about 1 year ago by herambpatil2004

Update README.md
#10 opened over 1 year ago by Muennighoff

Add download instructions
#8 opened over 1 year ago by Muennighoff
