Models Guide
Reference for all Rox AI models.
Model Overview
Rox AI offers eight specialized models, each optimized for different use cases:
| Model | Endpoint | Best For | Max Tokens | Default Temp |
|---|---|---|---|---|
| Rox Core | /chat |
General conversation | 4,096 | 1.0 |
| Rox 2.1 Turbo | /turbo |
Fast responses | 4,096 | 0.6 |
| Rox 3.5 Coder | /coder |
Code generation | 16,384 | 0.6 |
| Rox 4.5 Turbo | /turbo45 |
Advanced reasoning | 8,192 | 0.2 |
| Rox 5 Ultra | /ultra |
Superior reasoning | 8,192 | 1.0 |
| Rox 6 Dyno | /dyno |
Extended context | 16,384 | 1.0 |
| Rox 7 Coder | /coder7 |
Advanced coding | 16,384 | 1.0 |
| Rox Vision Max | /vision |
Visual understanding | 512 | 0.2 |
Rox Core
Endpoint: POST /chat
Description
General-purpose conversational model for everyday tasks.
Best Use Cases
- General conversation and Q&A
- Content writing and generation
- Creative tasks (stories, poems, ideas)
- Summarization and analysis
- Educational tutoring
- Customer support
Parameters
- Temperature: 1.0 (balanced creativity)
- Top P: 1.0 (full diversity)
- Max Tokens: 4,096
Example Request
curl -X POST https://Rox-Turbo-API.hf.space/chat \
-H "Content-Type: application/json" \
-d '{
"messages": [
{"role": "user", "content": "Explain quantum computing in simple terms"}
],
"temperature": 1.0,
"max_tokens": 512
}'
When to Choose Rox Core
- You need creative, varied responses
- Task requires nuanced understanding
- Building a general-purpose chatbot
- Content needs to be engaging and natural
Rox 2.1 Turbo
Endpoint: POST /turbo
Description
Fast model for quick responses and real-time applications.
Best Use Cases
- Real-time chat applications
- Customer support bots
- Quick Q&A systems
- High-throughput applications
- Simple queries and commands
- Factual information retrieval
Parameters
- Temperature: 0.6 (more focused)
- Top P: 0.7 (more consistent)
- Max Tokens: 4,096
Example Request
curl -X POST https://Rox-Turbo-API.hf.space/turbo \
-H "Content-Type: application/json" \
-d '{
"messages": [
{"role": "user", "content": "What are the business hours?"}
]
}'
When to Choose Rox 2.1 Turbo
- Speed is critical
- Need consistent, reliable answers
- Building customer support systems
- High volume of requests
- Simple, straightforward queries
Rox 3.5 Coder
Endpoint: POST /coder
Description
Code-focused model for programming tasks and technical work.
Best Use Cases
- Code generation and completion
- Debugging and error fixing
- Algorithm design and optimization
- Technical documentation
- Code review and suggestions
- Software architecture discussions
- API integration help
Parameters
- Temperature: 0.6 (precise and focused)
- Top P: 0.95 (balanced diversity)
- Max Tokens: 16,384 (extended context)
- Special Features: Enhanced thinking mode
Example Request
curl -X POST https://Rox-Turbo-API.hf.space/coder \
-H "Content-Type: application/json" \
-d '{
"messages": [
{"role": "user", "content": "Write a Python function to implement binary search"}
],
"max_tokens": 2048
}'
When to Choose Rox 3.5 Coder
- Working with code in any language
- Need detailed technical explanations
- Debugging complex issues
- Designing algorithms or systems
- Writing technical documentation
- Need extended context (up to 16K tokens)
Comparison Matrix
Performance Characteristics
| Feature | Rox Core | Rox 2.1 Turbo | Rox 3.5 Coder | Rox 4.5 Turbo | Rox 5 Ultra | Rox 6 Dyno | Rox 7 Coder | Rox Vision |
|---|---|---|---|---|---|---|---|---|
| Speed | Medium | Fast | Medium | Fast | Medium | Medium | Medium | Fast |
| Creativity | High | Medium | Low | Low | High | High | Medium | Low |
| Consistency | Medium | High | High | Very High | High | Medium | High | Very High |
| Code Quality | Good | Good | Excellent | Good | Excellent | Good | Superior | N/A |
| Context Length | 4K | 4K | 16K | 8K | 8K | 16K | 16K | 512 |
| Thinking Mode | No | No | Yes | Yes | Yes | Yes | Yes | No |
| Reasoning | Basic | Basic | Advanced | Very Advanced | Superior | Advanced | Superior | Basic |
Use Case Recommendations
| Task | Recommended Model | Why |
|---|---|---|
| Write a blog post | Rox Core | Creative, engaging content |
| Answer "What is X?" | Rox 2.1 Turbo | Fast, factual response |
| Debug Python code | Rox 3.5 Coder | Code specialist |
| Customer support | Rox 2.1 Turbo | Quick, consistent answers |
| Write a story | Rox Core | Creative and varied |
| Explain algorithm | Rox 3.5 Coder | Technical depth |
| Translate text | Rox 2.1 Turbo | Fast and accurate |
| Design API | Rox 3.5 Coder | Technical expertise |
| Brainstorm ideas | Rox Core | Creative thinking |
| Code review | Rox 3.5 Coder | Code understanding |
| Complex reasoning | Rox 4.5 Turbo | Advanced thinking |
| Research analysis | Rox 5 Ultra | Superior reasoning |
| System architecture | Rox 5 Ultra | Complex design |
| Long documents | Rox 6 Dyno | Extended context |
| Large codebase | Rox 7 Coder | Advanced coding |
| Image analysis | Rox Vision Max | Visual understanding |
Model Selection Guide
Decision Tree
Need to work with code?
ββ Yes
β ββ Simple/medium tasks? β Rox 3.5 Coder
β ββ Complex/large-scale? β Rox 7 Coder
ββ No
ββ Need advanced reasoning?
β ββ Yes
β β ββ Need highest quality? β Rox 5 Ultra
β β ββ Need speed? β Rox 4.5 Turbo
β ββ No
β ββ Long documents? β Rox 6 Dyno
β ββ Visual tasks? β Rox Vision Max
β ββ Need fast responses? β Rox 2.1 Turbo
β ββ Need creative output? β Rox Core
Quick Selection Tips
Choose Rox Core when:
- Default choice for most tasks
- Need creative, engaging responses
- Building general chatbots
- Content generation projects
Choose Rox 2.1 Turbo when:
- Speed matters most
- Need consistent answers
- High request volume
- Simple Q&A systems
Choose Rox 3.5 Coder when:
- Any coding task
- Technical documentation
- Algorithm design
- Need extended context
Choose Rox 6 Dyno when:
- Processing long documents
- Extended context needed
- Multi-document analysis
- Long conversations
Choose Rox 7 Coder when:
- Most complex coding tasks
- Large-scale projects
- System architecture
- Advanced algorithms
Choose Rox Vision Max when:
- Visual understanding
- Image analysis
- Multimodal tasks
Advanced Usage
Switching Models Dynamically
class RoxAI {
constructor(baseUrl = 'https://Rox-Turbo-API.hf.space') {
this.baseUrl = baseUrl;
}
async chat(message, model = 'chat') {
const endpoints = {
core: 'chat',
turbo: 'turbo',
coder: 'coder'
};
const endpoint = endpoints[model] || model;
const response = await fetch(`${this.baseUrl}/${endpoint}`, {
method: 'POST',
headers: { 'Content-Type': 'application/json' },
body: JSON.stringify({
messages: [{ role: 'user', content: message }]
})
});
return (await response.json()).content;
}
}
// Usage
const rox = new RoxAI();
// Use different models for different tasks
const story = await rox.chat('Write a short story', 'core');
const answer = await rox.chat('What is 2+2?', 'turbo');
const code = await rox.chat('Write a sorting function', 'coder');
Model-Specific Optimization
import requests
class RoxClient:
def __init__(self, base_url="https://Rox-Turbo-API.hf.space"):
self.base_url = base_url
def ask_core(self, message, creative=True):
"""Use Rox Core with creativity control"""
return self._request('chat', message,
temperature=1.2 if creative else 0.8)
def ask_turbo(self, message):
"""Use Rox Turbo for fast responses"""
return self._request('turbo', message, max_tokens=256)
def ask_coder(self, message, extended=False):
"""Use Rox Coder with optional extended context"""
return self._request('coder', message,
max_tokens=8192 if extended else 2048)
def _request(self, endpoint, message, **kwargs):
response = requests.post(
f"{self.base_url}/{endpoint}",
json={
"messages": [{"role": "user", "content": message}],
**kwargs
}
)
return response.json()["content"]
Cost and Performance Optimization
Tips for Each Model
Rox Core:
- Use for tasks requiring creativity
- Adjust temperature based on needs
- Consider caching common queries
Rox 2.1 Turbo:
- Best cost-performance ratio
- Use for high-volume applications
- Lower max_tokens for even faster responses
Rox 3.5 Coder:
- Use only for code-related tasks
- Leverage extended context when needed
- Cache code snippets and patterns
API Compatibility
All three models use the same request/response format:
Request:
{
"messages": [
{"role": "user", "content": "Your message"}
],
"temperature": 1.0,
"top_p": 0.95,
"max_tokens": 512
}
Response:
{
"content": "Model response"
}
This makes it easy to switch between models without changing your code!
Built by Mohammad Faiz
Rox 4.5 Turbo
Endpoint: POST /turbo45
Description
Reasoning model for complex problem-solving with fast responses.
Best Use Cases
- Complex problem solving
- Advanced reasoning tasks
- Scientific explanations
- Mathematical problems
- Strategic planning
- Analysis and insights
Parameters
- Temperature: 0.2 (highly focused)
- Top P: 0.7 (consistent)
- Max Tokens: 8,192
- Special Features: Enhanced reasoning mode
Example Request
curl -X POST https://Rox-Turbo-API.hf.space/turbo45 \
-H "Content-Type: application/json" \
-d '{
"messages": [
{"role": "user", "content": "Explain the theory of relativity"}
],
"max_tokens": 2048
}'
When to Choose Rox 4.5 Turbo
- Need advanced reasoning
- Complex problem solving
- Scientific or technical explanations
- Fast responses with deep thinking
Rox 5 Ultra
Endpoint: POST /ultra
Description
Advanced model for complex reasoning and high-quality output.
Best Use Cases
- Most complex problem solving
- Research and analysis
- Advanced technical tasks
- Strategic decision making
- Complex code architecture
- Multi-step reasoning
Parameters
- Temperature: 1.0 (balanced)
- Top P: 0.95 (high diversity)
- Max Tokens: 8,192
- Special Features: Superior reasoning mode
Example Request
curl -X POST https://Rox-Turbo-API.hf.space/ultra \
-H "Content-Type: application/json" \
-d '{
"messages": [
{"role": "user", "content": "Design a scalable microservices architecture"}
],
"max_tokens": 4096
}'
When to Choose Rox 5 Ultra
- Most complex tasks
- Need highest quality output
- Multi-step reasoning required
- Research and deep analysis
Rox 6 Dyno
Endpoint: POST /dyno
Description
Extended context model for long documents and conversations.
Best Use Cases
- Long document analysis
- Extended conversations
- Document summarization
- Research paper analysis
- Multi-document synthesis
Parameters
- Temperature: 1.0 (balanced)
- Top P: 1.0 (full diversity)
- Max Tokens: 16,384 (extended context)
- Special Features: Dynamic thinking mode
Example Request
curl -X POST https://Rox-Turbo-API.hf.space/dyno \
-H "Content-Type: application/json" \
-d '{
"messages": [
{"role": "user", "content": "Analyze this 20-page document..."}
],
"max_tokens": 8192
}'
When to Choose Rox 6 Dyno
- Processing long documents
- Need extended context window
- Multi-document analysis
- Long-form content generation
Rox 7 Coder
Endpoint: POST /coder7
Description
Advanced coding model for complex programming tasks.
Best Use Cases
- Complex algorithm design
- Large-scale code generation
- Advanced debugging
- System architecture
- Code refactoring
- Multi-file code analysis
Parameters
- Temperature: 1.0 (balanced)
- Top P: 1.0 (full diversity)
- Max Tokens: 16,384 (extended context)
- Special Features: Advanced thinking mode for code
Example Request
curl -X POST https://Rox-Turbo-API.hf.space/coder7 \
-H "Content-Type: application/json" \
-d '{
"messages": [
{"role": "user", "content": "Build a distributed caching system"}
],
"max_tokens": 8192
}'
When to Choose Rox 7 Coder
- Most complex coding tasks
- Large-scale projects
- System design and architecture
- Advanced algorithms
Rox Vision Max
Endpoint: POST /vision
Description
Visual model for image analysis and multimodal tasks.
Best Use Cases
- Image analysis
- Visual understanding
- Multimodal tasks
- Image description
- Visual Q&A
Parameters
- Temperature: 0.2 (highly focused)
- Top P: 0.7 (consistent)
- Max Tokens: 512
Example Request
curl -X POST https://Rox-Turbo-API.hf.space/vision \
-H "Content-Type: application/json" \
-d '{
"messages": [
{"role": "user", "content": "Describe this image"}
],
"max_tokens": 256
}'
When to Choose Rox Vision Max
- Visual understanding tasks
- Image analysis
- Multimodal applications