This is the collection of datasets I have used or plan to use to train my DeepSeek R1 Distill Qwen 7B-based model.
Hannah
hannah-eee
AI & ML interests
Chain-of-Thought reasoning, MoE
Recent Activity
updated a dataset about 2 hours ago
hannah-eee/bluesky-pds-docs
updated a dataset about 2 hours ago
hannah-eee/anything-llm-docs
updated a collection about 2 hours ago
Collection For My DeepSeek R1 Qwen Distill 7B Retraining
Organizations
None yet