-
PrefixQuant: Static Quantization Beats Dynamic through Prefixed Outliers in LLMs
Paper • 2410.05265 • Published • 30 -
MLLM as Retriever: Interactively Learning Multimodal Retrieval for Embodied Agents
Paper • 2410.03450 • Published • 36 -
MathCoder2: Better Math Reasoning from Continued Pretraining on Model-translated Mathematical Code
Paper • 2410.08196 • Published • 46 -
Rectified Diffusion: Straightness Is Not Your Need in Rectified Flow
Paper • 2410.07303 • Published • 18
XXSg559
XXSg559
AI & ML interests
None yet
Recent Activity
updated
a model
8 days ago
XXSg559/ppo-SnowballTarget
published
a model
8 days ago
XXSg559/ppo-SnowballTarget
updated
a model
8 days ago
XXSg559/Reinforce-CartPole-v1
Organizations
None yet
Collections
1
models
8

XXSg559/ppo-SnowballTarget
Reinforcement Learning
•
Updated
•
14

XXSg559/Reinforce-CartPole-v1
Reinforcement Learning
•
Updated

XXSg559/Qwen2.5-1.5B-Instruct-thinking-function_calling-V0
Updated

XXSg559/q-Taxi-v3
Reinforcement Learning
•
Updated

XXSg559/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated

XXSg559/ppo-Huggy
Reinforcement Learning
•
Updated
•
13

XXSg559/sft_output
Updated

XXSg559/SmolLM2-FT-MyDataset
Text Generation
•
Updated
•
5