LongTalk: A Very Long Chain-of-Thought Dataset for Reasoning Model Post-Training
kenhktsui/longtalk-cot-v0.1 • Updated Dec 30, 2024 • 61.2k • 113 • 13
kenhktsui/qwen2.5-7b-instruct-thinking-sft-merged-gguf • Updated Dec 30, 2024 • 119 • 1
kenhktsui/qwen2.5-7b-instruct-thinking-sft-merged • Text Generation • Updated Dec 30, 2024 • 17
kenhktsui/llama3.1-8b-instruct-thinking-sft-merged-gguf • Updated Dec 30, 2024 • 104 • 1

FastText Model for Pretraining Data Curation
kenhktsui/llm-data-textbook-quality-fasttext-classifier-v2 • Text Classification • Updated Nov 28, 2024 • 577 • 27
kenhktsui/fineweb-edu-fasttext-classifier • Text Classification • Updated Jun 6, 2024 • 1.34k • 4
kenhktsui/code-natural-language-fasttext-classifier • Text Classification • Updated Oct 30, 2024 • 2.02k • 1
kenhktsui/math-fasttext-classifier • Text Classification • Updated 16 days ago • 3.27k • 1