yuyijiong's Collections
Chinese pretrain datasets

updated Nov 26, 2024

  • opencsg/chinese-fineweb-edu

    Viewer • Updated Jan 20 • 84.6M • 17.9k • 100

  • opencsg/chinese-fineweb-edu-v2

    Viewer • Updated Jan 20 • 188M • 2.31k • 63

  • opencsg/chinese-cosmopedia

    Preview • Updated Jan 15 • 1.66k • 65