Pedro Ortiz Suarez

pjox

AI & ML interests

Language modeling, parsing, sequence tagging, NER, historical languages.

Organizations

pjox's activity

New activity in oscar-corpus/OSCAR-2301 4 months ago
New activity in oscar-corpus/colossal-oscar-1.0 9 months ago

Change foldernames

4
#3 opened 9 months ago by hac541309
New activity in oscar-corpus/OSCAR-2201 9 months ago

Unsafe Files

20
#12 opened 12 months ago by GetzPro
New activity in oscar-corpus/OSCAR-2301 9 months ago

About the number of documents

6
#6 opened 9 months ago by lixin4ever
New activity in oscar-corpus/colossal-oscar-1.0 9 months ago
New activity in oscar-corpus/OSCAR-2301 10 months ago

Changing into Parquet

2
#5 opened 10 months ago by hac541309
New activity in pjox/dalembert 12 months ago
New activity in oscar-corpus/OSCAR-2301 about 1 year ago

Deduplicated English Corpus

2
#3 opened about 1 year ago by conceptofmind

Data hosting on Huggingface

1
#2 opened about 1 year ago by hieuhocnlp

How to download only one language?

2
#1 opened about 1 year ago by musabg
New activity in oscar-corpus/OSCAR-2201 about 1 year ago