jdopensource/JoyAI-LLM-Flash

Text Generation

joyai_llm_flash


Mingke977 committed 14 days ago

Commit 7770a36 · verified · 1 Parent(s): f4526ac

Update README.md

Files changed (1)

README.md +1 -1


@@ -26,7 +26,7 @@ JoyAI-LLM Flash is a state-of-the-art medium-sized instruct language model with
 ### Key Features
-- Fiber Bundle RL: Introduces fiber bundle theory into reinforcement learning, proposing a novel optimization framework, FiberPO. This method is specifically designed to handle the challenges of large-scale and heterogeneous agent training, improving stability and robustness under complex data distributions.
+- Fibration Policy Optimization: Introduces fiber bundle theory into reinforcement learning, proposing a novel optimization framework, FiberPO. This method is specifically designed to handle the challenges of large-scale and heterogeneous agent training, improving stability and robustness under complex data distributions. [paper link](https://arxiv.org/abs/2603.08239)
 - Training-Inference Collaboration: apply Muon optimizer with dense MTP, develop novel optimization techniques to resolve instabilities while scaling up, delivering 1.3× to 1.7× the throughput of the non-MTP version.
 - Agentic Intelligence: designed for tool use, reasoning, and autonomous problem-solving.