Useful Models
updated
AmberYifan/qwen2.5-7b-instruct-full-pretrain-control-tweet-1m-en
Text Generation
• 8B • Updated • 3
AmberYifan/qwen2.5-7b-instruct-full-pretrain-junk-tweet-1m-en
Text Generation
• 8B • Updated • 3
AmberYifan/qwen2.5-7b-instruct-full-pretrain-control-tweet-1m-en-gpt-sft
Text Generation
• 8B • Updated • 2
AmberYifan/qwen2.5-7b-instruct-full-pretrain-control-tweet-1m-en-gpt
Text Generation
• 8B • Updated • 2
AmberYifan/qwen2.5-7b-instruct-full-pretrain-junk-tweet-1m-en-gpt
Text Generation
• 8B • Updated • 2
AmberYifan/qwen2.5-0.5b-instruct-full-pretrain-control-tweet-1m-en
Text Generation
• 0.5B • Updated • 5
AmberYifan/qwen2.5-0.5b-instruct-full-pretrain-junk-tweet-1m-en
Text Generation
• 0.5B • Updated • 4
AmberYifan/qwen3-4b-thinking-full-pretrain-control-tweet-1m-en
Text Generation
• 4B • Updated • 1
AmberYifan/qwen3-4b-thinking-full-pretrain-junk-tweet-1m-en
Text Generation
• 4B • Updated
AmberYifan/llama3-8b-full-pretrain-junk-tweet-1m-en
Text Generation
• 8B • Updated • 2
•
AmberYifan/llama3-8b-full-pretrain-control-tweet-1m-en
Text Generation
• 8B • Updated • 2
•
AmberYifan/llama3-8b-full-pretrain-low-len-1m-en-sft
Text Generation
• 8B • Updated • 13
AmberYifan/llama3-8b-full-pretrain-high-len-1m-en-sft
Text Generation
• 8B • Updated • 5
AmberYifan/llama3-8b-full-pretrain-high-len-1m-en
Text Generation
• 8B • Updated • 2
AmberYifan/llama3-8b-full-pretrain-low-len-1m-en
Text Generation
• 8B • Updated • 3