Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
stzhao
's Collections
layout
Awesome Loras
Human, Building, Culture, Composition, Poster
OCR-2.0
LLM_Model
LLM_Dataset_SFT
LLM_Dataset_RLHF
LLM_Dataset_Pretrain
MLLM_Dataset_SFT
LLM_Dataset_Pretrain
updated
Apr 29
A collection of datasets used for large language model pretrain.
Upvote
-
HuggingFaceFW/fineweb
Viewer
•
Updated
Jul 16
•
46B
•
395k
•
1.75k
Upvote
-
Share collection
View history
Collection guide
Browse collections