Mingke977 commited on
Commit
7770a36
·
verified ·
1 Parent(s): f4526ac

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -26,7 +26,7 @@ JoyAI-LLM Flash is a state-of-the-art medium-sized instruct language model with
26
 
27
  ### Key Features
28
 
29
- - Fiber Bundle RL: Introduces fiber bundle theory into reinforcement learning, proposing a novel optimization framework, FiberPO. This method is specifically designed to handle the challenges of large-scale and heterogeneous agent training, improving stability and robustness under complex data distributions.
30
  - Training-Inference Collaboration: apply Muon optimizer with dense MTP, develop novel optimization techniques to resolve instabilities while scaling up, delivering 1.3× to 1.7× the throughput of the non-MTP version.
31
  - Agentic Intelligence: designed for tool use, reasoning, and autonomous problem-solving.
32
 
 
26
 
27
  ### Key Features
28
 
29
+ - Fibration Policy Optimization: Introduces fiber bundle theory into reinforcement learning, proposing a novel optimization framework, FiberPO. This method is specifically designed to handle the challenges of large-scale and heterogeneous agent training, improving stability and robustness under complex data distributions. [paper link](https://arxiv.org/abs/2603.08239)
30
  - Training-Inference Collaboration: apply Muon optimizer with dense MTP, develop novel optimization techniques to resolve instabilities while scaling up, delivering 1.3× to 1.7× the throughput of the non-MTP version.
31
  - Agentic Intelligence: designed for tool use, reasoning, and autonomous problem-solving.
32