AI & ML interests

None defined yet.

Recent Activity

Organization Card

Liodon AI

Liodon AI

We are an independent U.S.-based research-driven open-source entity dedicated to advancing the frontier of Small Language Models (SLMs) and high-efficiency deep learning infrastructure. Our goal is to democratize highly capable, compact intelligence that can run seamlessly on low-resource settings without sacrificing reasoning ability.

By bridging hardware-aware optimization with innovative architectural paradigms, we design models that maximize performance-per-parameter.


🚀 Core Research Pillars

  • Architectural Efficiency: Engineering high-throughput SLMs utilizing advanced Transformer variants, highly tailored Mixture-of-Experts (MoE), and sparse attention mechanisms.
  • Extreme Model Compression: Developing state-of-the-art recipes for hardware-aware post-training quantization (FP8, FP4, and sub-4-bit formats), pruning, and structure-preserving knowledge distillation.
  • Speech & Modality Fusion: Integrating native Automatic Speech Recognition (ASR) and Text-to-Speech (TTS) layers into ultra-compact foundation models.
  • Rigorous Alignment & Optimization: Fine-tuning models using gradient-guided optimization variants and preference alignment to guarantee robustness at scale.

🛠️ Open-Source Ecosystem & Tooling

Beyond our base and instruct models, we maintain specialized repositories and libraries to support efficient deep learning workflows:

  • TorchCriterion – A modular library providing advanced loss functions and criteria designed specifically for structured knowledge distillation and model alignment.
  • PromptGrad – An optimized framework engineered to propagate gradients through prompt spaces, enabling automated prompt tuning and discrete optimization.
  • LoRA-XS – A highly parameter-efficient fine-tuning library designed to optimize low-rank adaptation techniques for ultra-small architectures.

📦 Model Families & Roadmap

Our model releases are systematically categorized across distinct footprints optimized for varying compute profiles:

Model Series Parameter Range Target Deployment Focus Area
Liodon-Micro < 1.5B On-device / Mobile / Embedded High-speed text processing, semantic search
Liodon-Mini 1.5B - 4B Consumer GPUs / Workstations Complex reasoning, local tool-use, code generation
Liodon-Audio Variant Dependent Edge / Real-time systems End-to-end speech-to-text and low-latency interaction

🤝 Collaboration & Community

We are active contributors to the global AI research community. If you are interested in collaborating on efficient foundation models, data curation for SLMs, or hardware-aware inference infrastructure:

  • GitHub: github.com/Liodon-AI
  • Discussions: Open an issue or a thread on any of our model cards here on Hugging Face.
  • Citations: If you use our models or software frameworks in your academic research, please use the citation metadata available on the respective model repositories.