VDC-team/DialoguesEN-4k
Viewer β’ Updated β’ 4k β’ 20 β’ 1
SmallDront-60M is a compact language model specifically fine-tuned for engaging small talk conversations. With 60 million parameters in F32 precision, it strikes a balance between performance and efficiency β delivering coherent, natural dialogue without the overhead of larger models.
Compared to its predecessor (SmallDront-20M), this model properly ends its responses and hallucinates significantly less.
| Feature | Detail |
|---|---|
| π§ Architecture | Transformer-based |
| π’ Parameters | +-60,000,000 |
| π Precision | F32 (Full float32) |
| π€ Tokenizer | GPT-2 Tokenizer |
| π¦ Format | Hugging Face |
| π·οΈ Special Tokens | <|user|> <|assistant|> |
The model was trained on the VDC-team/DialoguesEN-4k dataset until reaching a loss of 0.9.
Temperature: 0.3
You: Who are you?
Assistant: Just a friendly chat. You?
You: Hello!
Assistant: Hey! What's a place you feel at?
You: Where you?
Assistant: Hi, for a chat. You?
use.py - use model example