angrygiraffe/claude-opus-4.6-4.7-reasoning-8.7k
Viewer • Updated • 38.5k • 5.16k • 228
An improved conversational assistant fine-tuned from Saif-1.0.
| Task | Saif-1.0 | Saif-1.1 |
|---|---|---|
| Prime check (Python) | ✓ slower | ✓ faster, better algo |
| Derivative (Math) | ✓ 9.64s | ✓ 3.87s |
| Factorial (JavaScript) | ✓ 15.51s | ✓ 7.72s |
from transformers import AutoTokenizer, AutoModelForCausalLM
import torch
tokenizer = AutoTokenizer.from_pretrained("Saif658/Saif-1.1")
model = AutoModelForCausalLM.from_pretrained(
"Saif658/Saif-1.1",
torch_dtype=torch.float16,
device_map="auto"
)
messages = [{"role": "user", "content": "your message here"}]
inputs = tokenizer.apply_chat_template(messages, return_tensors="pt").to("cuda")
outputs = model.generate(inputs, max_new_tokens=200)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
Small 3B model — may struggle with very complex reasoning or long context tasks.
Base model
meta-llama/Llama-3.2-3B-Instruct