Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
loubnabnl 's Collections
SmolLM 🀏
Instruct datasets
πŸ“š Filtering the web with LLMs
🌌 Synthetic textbooks
✨ Code Generation

Instruct datasets

updated Oct 11, 2024
Upvote
-

  • cognitivecomputations/SystemChat-2.0

    Preview β€’ Updated May 31, 2024 β€’ 295 β€’ 58

    Note good to teach the model to follow system prompts. They are too specific though (and sometimes hard). We should add more generic system prompts in other samples of our dataset


  • arcee-ai/infini-instruct-top-500k

    Viewer β€’ Updated Jun 30, 2024 β€’ 500k β€’ 23 β€’ 5

    Note filtered from https://huggingface.co/datasets/BAAI/Infinity-Instruct the 3M and 7M datasets boost HellaSwag, MMLU GSM8k..


  • arcee-ai/The-Tome

    Viewer β€’ Updated Aug 15, 2024 β€’ 1.75M β€’ 252 β€’ 93

  • teknium/OpenHermes-2.5

    Viewer β€’ Updated Apr 15, 2024 β€’ 1M β€’ 3.75k β€’ 728

  • HuggingFaceH4/ultrachat_200k

    Viewer β€’ Updated Oct 16, 2024 β€’ 515k β€’ 15.3k β€’ 533

    Note Below H4 formats for the handbook trainings


  • HuggingFaceTB/OpenHermes-2.5-H4

    Viewer β€’ Updated Aug 17, 2024 β€’ 1M β€’ 83 β€’ 5
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs