TitanML

company

Verified

https://titanml.co

titanml_

titanml

AI & ML interests

Quantized Foundation Models

Recent Activity

jamesdborin updated a model about 1 hour ago

TitanML/GLM-5.2-FP8

jamesdborin updated a model 2 months ago

TitanML/GLM-5.1-FP8

jamesdborin updated a model 2 months ago

TitanML/NVIDIA-Nemotron-3-Super-120B-A12B-NVFP4

Organization Card

🚀 Effortless and Secure Deployment of Enterprise RAG for Regulated Industries

TitanML enables organizations in regulated industries to effortlessly and efficiently deploy Enterprise RAG. Our TitanML Enterprise Inference Stack makes building, deploying, and scaling Enterprise RAG applications in secure environments a breeze.

TitanML: The Fastest Way to Enterprise RAG Inference

Our TitanML Enterprise Inference Stack offers:

Lightning-fast local inference
Efficient batching support
Multi-GPU inference capabilities
INT4 quantization for reduced memory footprint
And much more!
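
To make the memory claim for INT4 quantization concrete, here is a rough back-of-the-envelope sketch. It is an illustration only, not TitanML's own calculator: the function name `weight_memory_gib` is hypothetical, and the estimate counts model weights alone, ignoring the KV cache, activations, and the small overhead of quantization scales and zero points.

```python
def weight_memory_gib(num_params: float, bits_per_weight: float) -> float:
    """Approximate memory needed for model weights only, in GiB.

    Assumption: every parameter is stored at the same bit width and
    quantization metadata (scales, zero points) is ignored.
    """
    bytes_total = num_params * bits_per_weight / 8
    return bytes_total / 1024**3


if __name__ == "__main__":
    # Parameter counts are nominal sizes, used purely for illustration.
    for name, params in [("8B model", 8e9), ("70B model", 70e9)]:
        fp16 = weight_memory_gib(params, 16)
        int4 = weight_memory_gib(params, 4)
        print(f"{name}: FP16 ~{fp16:.1f} GiB, INT4 ~{int4:.1f} GiB")
```

Under these assumptions, moving from 16-bit to 4-bit weights cuts the weight footprint to roughly a quarter; real deployments need additional headroom for the KV cache and runtime buffers.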

🌐 Quantized Open-Source Models

We have quantized over 50 popular open-source foundation models, making them more accessible and efficient for the enterprise. Some of our featured models include:

TitanML/opt-125m-AWQ-4bit
TitanML/Llama3-OpenBioLLM-70B-AWQ-4bit
TitanML/Meta-Llama-3-8B-AWQ-4bit
TitanML/Meta-Llama-3-8B-Instruct-AWQ-4bit
TitanML/llava-v1.6-mistral-7b
TitanML/tiny-mistral-embedder
TitanML/jina-v2-base-en-embed
TitanML/llama2-70b-base-4bit-AWQ
TitanML/llama2-13b-base-4bit-AWQ
TitanML/llama2-7b-base-4bit-AWQ
And many more!

Explore our full collection of quantized models and empower your organization with state-of-the-art Gen AI capabilities.

🔒 Secure and Compliant

We understand the importance of security and compliance in regulated industries. TitanML provides the tools and libraries to ensure your AI applications meet the strictest security and compliance requirements.

🤝 Connect with Us

Follow us on Twitter:
Star us on GitHub:
Connect with us on LinkedIn:

💬 Get in Touch

Have questions or want to learn more? We'd love to hear from you!

📧 Email us at hello@titanml.co

Collections 6

spaces 2

Model Memory Calculator

🏃

Generate images from text descriptions

models 73

datasets 0

None public yet

Collections 6

TitanML/Qwen2-7B-Instruct-AWQ

TitanML/Qwen2-Math-7B-Instruct

TitanML/Qwen2-Math-7B

TitanML/Qwen2-7B

TitanML/Llama3-OpenBioLLM-70B-AWQ-4bit

TitanML/Llama-2-13b-hf

TitanML/Llama-2-7b-hf

TitanML/Meta-Llama-3.1-8B

models 73

TitanML/GLM-5.2-FP8

TitanML/GLM-5.1-FP8

TitanML/NVIDIA-Nemotron-3-Super-120B-A12B-NVFP4

TitanML/DeepSeek-OCR-2

TitanML/olmOCR-2-7B-1025-FP8

TitanML/Qwen3.5-397B-A17B

TitanML/Qwen-72B-Chat

TitanML/Meta-Llama-3.1-70B-Instruct

TitanML/llava-1.5-7b-hf

TitanML/Qwen2-Math-7B-Instruct

Team members 8
