Qwen3 Technical Report
Paper • 2505.09388 • Published • 343
YAML Metadata Warning:empty or missing yaml metadata in repo card
Check out the documentation for more information.
A fine-tuned Qwen3-8B model specialized for everyday productivity tasks: email drafting, summarization, message rewriting, meeting notes, social media posts, and Q&A.
trl-lib/Capybara (~10K)HuggingFaceH4/ultrachat_200k (~200K)Magpie-Align/Magpie-Pro-300K-Filtered (~300K)Based on published research:
| Parameter | Value |
|---|---|
| Learning Rate | 2e-5 |
| Epochs | 2 |
| Batch Size (per device) | 1 |
| Gradient Accumulation | 32 |
| Effective Batch Size | 32 |
| Max Sequence Length | 4096 |
| Scheduler | constant |
| Warmup | 0 |
| Packing | True |
| Precision | bfloat16 |
from transformers import AutoModelForCausalLM, AutoTokenizer
import torch
model = AutoModelForCausalLM.from_pretrained(
"Muigai1/productivity-assistant-qwen3",
torch_dtype=torch.bfloat16,
device_map="auto",
trust_remote_code=True,
)
tokenizer = AutoTokenizer.from_pretrained(
"Muigai1/productivity-assistant-qwen3",
trust_remote_code=True,
)
messages = [{"role": "user", "content": "Draft a professional email about a project delay."}]
text = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
inputs = tokenizer([text], return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=512, temperature=0.7, top_p=0.8)
response = tokenizer.decode(outputs[0][len(inputs.input_ids[0]):], skip_special_tokens=True)
print(response)
enable_thinking=False): Quick replies for emails, summaries, rewritesenable_thinking=True): Complex analysis, creative writing, codingTarget Market: Professionals, freelancers, students, small business owners who spend 2+ hours daily on writing and communication.
Pricing Tiers:
Revenue Projections:
train.py — Complete fine-tuning scriptapp.py — Gradio demo for deploymentrequirements.txt — Dependencies@misc{qwen3,
title={Qwen3 Technical Report},
author={Qwen Team},
year={2025},
eprint={2505.09388},
archivePrefix={arXiv},
}
@misc{magpie,
title={Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing},
author={Xu, Zhangchen et al.},
year={2024},
eprint={2406.08464},
archivePrefix={arXiv},
}