Shreyas Jena's picture

1 6 5

Shreyas Jena

jena-shreyas

·

AI & ML interests

None yet

Recent Activity

updated a model about 1 month ago

jena-shreyas/flux-lora-wheels

published a model about 1 month ago

jena-shreyas/flux-lora-wheels

reacted to Kseniase's post with 👍 about 2 months ago

10 Recent Advancements in Math Reasoning Over the last few weeks, we have witnessed a surge in AI models' math reasoning capabilities. Top companies like Microsoft, NVIDIA, and Alibaba Qwen have already joined this race to make models "smarter" in mathematics. But why is this shift happening now? Complex math calculations require advanced multi-step reasoning, making mathematics an ideal domain for demonstrating a model's strong "thinking" capabilities. Additionally, as AI continues to evolve and is applied in math-intensive fields such as machine learning and quantum computing (which is predicted to see significant growth in 2025), it must meet the demands of complex reasoning. Moreover, AI models can be integrated with external tools like symbolic solvers or computational engines to tackle large-scale math problems, which also needs high-quality math reasoning. So here’s a list of 10 recent advancements in math reasoning of AI models: 1. NVIDIA: https://huggingface.co/papers/2412.15084 2. Qwen, Alibaba: Qwen2.5-Math-PRM https://huggingface.co/papers/2501.07301 and PROCESSBENCH evaluation https://huggingface.co/papers/2412.06559 3. Microsoft Research: https://huggingface.co/papers/2501.04519 4. https://huggingface.co/papers/2501.03226 5. https://huggingface.co/papers/2501.04686 6. https://huggingface.co/papers/2412.03205 7. https://huggingface.co/papers/2501.06430 8. https://huggingface.co/papers/2501.04425 9. https://huggingface.co/papers/2501.03035 10. https://huggingface.co/papers/2412.16964

View all activity

Organizations

jena-shreyas's activity

upvoted a collection 2 months ago

Emu3

Emu3: Next-Token Prediction is All You Need • 7 items • Updated Feb 13 • 70

upvoted a paper 5 months ago

Let Me Speak Freely? A Study on the Impact of Format Restrictions on Performance of Large Language Models

Paper • 2408.02442 • Published Aug 5, 2024 • 21

upvoted a paper 6 months ago

Oryx MLLM: On-Demand Spatial-Temporal Understanding at Arbitrary Resolution

Paper • 2409.12961 • Published Sep 19, 2024 • 25

upvoted a paper 7 months ago

Transformer Explainer: Interactive Learning of Text-Generative Models

Paper • 2408.04619 • Published Aug 8, 2024 • 159

upvoted 2 collections 8 months ago

LLaVa-NeXT-Video

LLaVa-NeXT-Video extends LLaVa-NeXT for video understanding. • 5 items • Updated Jun 10, 2024 • 9

LLaVA-Video

Models focus on video understanding (previously known as LLaVA-NeXT-Video). • 8 items • Updated 25 days ago • 60