Instructions to use Gugu8/Atom-1 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use Gugu8/Atom-1 with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="Gugu8/Atom-1")# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("Gugu8/Atom-1", dtype="auto") - Notebooks
- Google Colab
- Kaggle
- Local Apps Settings
- vLLM
How to use Gugu8/Atom-1 with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "Gugu8/Atom-1" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "Gugu8/Atom-1", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker
docker model run hf.co/Gugu8/Atom-1
- SGLang
How to use Gugu8/Atom-1 with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "Gugu8/Atom-1" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "Gugu8/Atom-1", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "Gugu8/Atom-1" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "Gugu8/Atom-1", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }' - Docker Model Runner
How to use Gugu8/Atom-1 with Docker Model Runner:
docker model run hf.co/Gugu8/Atom-1
Looped‑Transformer‑24M
A compact 24M‑parameter language model built with a Looped Transformer architecture, trained using the Muon optimizer, and enhanced with Chain‑of‑Thought reasoning. This model specializes in story generation and basic math reasoning, making it ideal for lightweight experimentation, educational projects, and rapid prototyping.
🧩 Model Overview
- Parameters: 24M
- Architecture: Looped Transformer
- Optimizer: Muon
- Reasoning: Chain‑of‑Thought (CoT)
- Training Data: Story‑focused corpus
- Math Skills: Simple arithmetic
- Training Hardware: Dual NVIDIA T4 GPUs
- Training Time: ~30 minutes
- Language: English
- License: AGPL‑3.0
✨ What This Model Does Well
- Story generation — coherent, imaginative, character‑driven narratives
- Dialogue writing — natural conversational flow
- Basic math — simple arithmetic and step‑by‑step reasoning
- CoT reasoning — improved logical flow when prompted
- Lightweight inference — runs smoothly on consumer GPUs and many CPUs
📚 Training Details
The model was trained for 30 minutes on two NVIDIA T4 GPUs, using a curated dataset of short stories, narrative prompts, character interactions, and basic math word problems.
The Muon optimizer provided fast, stable convergence, making it well‑suited for small‑parameter models.
🧠 Intended Use
This model is designed for:
- Creative writing
- Story generation
- Dialogue simulation
- Educational demos
- Lightweight reasoning tasks
Not recommended for:
- Factual retrieval
- Complex mathematics
- Safety‑critical applications
Website
https://gugu8intel-i9.github.io/atom-1_website/
🚀 Example Usage
from transformers import AutoTokenizer, AutoModelForCausalLM
tokenizer = AutoTokenizer.from_pretrained("your-username/looped-transformer-24m")
model = AutoModelForCausalLM.from_pretrained("your-username/looped-transformer-24m")
prompt = "Write a short story about a robot learning to dream."
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_length=200)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
- Downloads last month
- -