Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
davanstrien
's Collections
synthetic-data-generation-demos
sentence-transformers-from-synthetic-data
Synthetic (text) Dataset Generation
haiku
Historic language modeling
Climate
Sourced from Wikimedia
Legal Named Entity Recognition
Top 10% instruction tuning datasets
Top 10 Instruction Tuning Datasets copy
Metadata-generation
MOE papers to read
German Text Embedding Clustering Benchmark datasets
cosmochat-reading-list
datasets-tldr-project
Probably DPO datasets
Probably Alpaca Style Datasets
Direct Preference Optimization Datasets
Image Preference Optimization Datasets
query-to-hub-datasets-viewer-project
German Text Embedding Clustering Benchmark datasets
updated
Jun 21
Upvote
3
slvnwhrl/blurbs-clustering-s2s
Viewer
•
Updated
Sep 22
•
28
•
100
slvnwhrl/blurbs-clustering-p2p
Viewer
•
Updated
Sep 22
•
28
•
104
slvnwhrl/tenkgnad-clustering-s2s
Viewer
•
Updated
Sep 22
•
10
•
113
slvnwhrl/tenkgnad-clustering-p2p
Viewer
•
Updated
Sep 22
•
10
•
127
German Text Embedding Clustering Benchmark
Paper
•
2401.02709
•
Published
Jan 5
•
5
Upvote
3
Share collection
View history
Collection guide
Browse collections