kth8/multi-turn-conversation-50000x
Viewer • Updated • 50.4k • 46 • 2
How to use kth8/gemma-3-270m-it-Conversation with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("text-generation", model="kth8/gemma-3-270m-it-Conversation")
messages = [
{"role": "user", "content": "Who are you?"},
]
pipe(messages) # Load model directly
from transformers import AutoTokenizer, AutoModelForMultimodalLM
tokenizer = AutoTokenizer.from_pretrained("kth8/gemma-3-270m-it-Conversation")
model = AutoModelForMultimodalLM.from_pretrained("kth8/gemma-3-270m-it-Conversation")
messages = [
{"role": "user", "content": "Who are you?"},
]
inputs = tokenizer.apply_chat_template(
messages,
add_generation_prompt=True,
tokenize=True,
return_dict=True,
return_tensors="pt",
).to(model.device)
outputs = model.generate(**inputs, max_new_tokens=40)
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:]))How to use kth8/gemma-3-270m-it-Conversation with vLLM:
# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "kth8/gemma-3-270m-it-Conversation"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
-H "Content-Type: application/json" \
--data '{
"model": "kth8/gemma-3-270m-it-Conversation",
"messages": [
{
"role": "user",
"content": "What is the capital of France?"
}
]
}'docker model run hf.co/kth8/gemma-3-270m-it-Conversation
How to use kth8/gemma-3-270m-it-Conversation with SGLang:
# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
--model-path "kth8/gemma-3-270m-it-Conversation" \
--host 0.0.0.0 \
--port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
-H "Content-Type: application/json" \
--data '{
"model": "kth8/gemma-3-270m-it-Conversation",
"messages": [
{
"role": "user",
"content": "What is the capital of France?"
}
]
}'docker run --gpus all \
--shm-size 32g \
-p 30000:30000 \
-v ~/.cache/huggingface:/root/.cache/huggingface \
--env "HF_TOKEN=<secret>" \
--ipc=host \
lmsysorg/sglang:latest \
python3 -m sglang.launch_server \
--model-path "kth8/gemma-3-270m-it-Conversation" \
--host 0.0.0.0 \
--port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
-H "Content-Type: application/json" \
--data '{
"model": "kth8/gemma-3-270m-it-Conversation",
"messages": [
{
"role": "user",
"content": "What is the capital of France?"
}
]
}'How to use kth8/gemma-3-270m-it-Conversation with Unsloth Studio:
curl -fsSL https://unsloth.ai/install.sh | sh # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for kth8/gemma-3-270m-it-Conversation to start chatting
irm https://unsloth.ai/install.ps1 | iex # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for kth8/gemma-3-270m-it-Conversation to start chatting
# No setup required # Open https://huggingface.co/spaces/unsloth/studio in your browser # Search for kth8/gemma-3-270m-it-Conversation to start chatting
pip install unsloth
from unsloth import FastModel
model, tokenizer = FastModel.from_pretrained(
model_name="kth8/gemma-3-270m-it-Conversation",
max_seq_length=2048,
)How to use kth8/gemma-3-270m-it-Conversation with Docker Model Runner:
docker model run hf.co/kth8/gemma-3-270m-it-Conversation
A supervised fine-tune of unsloth/gemma-3-270m-it on the kth8/multi-turn-conversation-50000x dataset.
System prompt
You are a helpful assistant.
User prompt
Hey there! How's it going?
Assistant response
Hey! I'm doing great, thanks for asking! I'm here and ready to help with whatever you need. What's on your mind today?
unsloth/gemma-3-270m-it| Step | Training Loss | Validation Loss |
|---|---|---|
| 0 | No log | 2.784440 |
| 155 | 1.882700 | 1.881819 |
| 310 | 1.805000 | 1.832387 |
| 465 | 1.803100 | 1.804098 |
| 620 | 1.781600 | 1.782886 |
| 775 | 1.785700 | 1.765646 |
| 930 | 1.776400 | 1.749293 |
| 1085 | 1.753500 | 1.736082 |
| 1240 | 1.732600 | 1.725711 |
| 1395 | 1.703100 | 1.715472 |
| 1550 | 1.730700 | 1.705917 |
| 1705 | 1.713500 | 1.697924 |
| 1860 | 1.725500 | 1.690107 |
| 2015 | 1.707200 | 1.684427 |
| 2170 | 1.687700 | 1.678853 |
| 2325 | 1.675800 | 1.674952 |
| 2480 | 1.723100 | 1.671108 |
| 2635 | 1.684300 | 1.668909 |
| 2790 | 1.692800 | 1.667304 |
| 2945 | 1.663200 | 1.666461 |
| 3100 | 1.676500 | 1.666246 |
This model is released under the Gemma license. See the Gemma Terms of Use and Prohibited Use Policy regarding the use of Gemma-generated content.