Siheng99/DeepSeek-R1-Distill-Llama-8B-toy_math_addition-2-0-10000-8-dense-100 Updated about 1 month ago
Siheng99/DeepSeek-R1-Distill-Llama-8B-toy_math_addition-2-0-10000-dense-100 Updated about 1 month ago
VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding Paper • 2501.13106 • Published Jan 22 • 91
🦋SEALONG Collection Large Language Models Can Self-Improve in Long-context Reasoning • 7 items • Updated Nov 14, 2024 • 7
Large Language Models Can Self-Improve in Long-context Reasoning Paper • 2411.08147 • Published Nov 12, 2024 • 67