Pedro Ortiz Suarez

pjox

AI & ML interests

Language modeling, parsing, sequence tagging, NER, historical languages.

pjox's activity

New activity in oscar-corpus/OSCAR-2301 about 1 month ago
New activity in oscar-corpus/colossal-oscar-1.0 6 months ago

Change foldernames

4
#3 opened 6 months ago by hac541309
New activity in oscar-corpus/OSCAR-2201 7 months ago

Unsafe Files

20
#12 opened 10 months ago by GetzPro
New activity in oscar-corpus/OSCAR-2301 7 months ago

About the number of documents

6
#6 opened 7 months ago by lixin4ever
New activity in oscar-corpus/colossal-oscar-1.0 7 months ago
New activity in oscar-corpus/OSCAR-2301 8 months ago

Changing into Parquet

2
#5 opened 8 months ago by hac541309
New activity in pjox/dalembert 9 months ago
New activity in oscar-corpus/OSCAR-2301 10 months ago

Deduplicated English Corpus

2
#3 opened 11 months ago by conceptofmind
New activity in oscar-corpus/OSCAR-2301 11 months ago

Data hosting on Huggingface

1
#2 opened 11 months ago by hieuhocnlp

How to download only one language?

2
#1 opened 12 months ago by musabg
New activity in oscar-corpus/OSCAR-2201 11 months ago