Jaward Sesay

Jaward

AI & ML interests

I like to train large deep neural nets too 🧠🤖💥 | First Paper (AutoAgents: A Framework for Automatic Agent Generation) Accepted @ IJCAI 2024 | Role Model Karpathy

Recent Activity

posted an update about 4 hours ago

New reasoning algo just dropped: Adaptive Parallel Reasoning

“we propose Adaptive Parallel Reasoning (APR), a novel reasoning framework that enables language models to orchestrate both serialized and parallel computations end-to-end. APR generalizes existing reasoning methods by enabling adaptive multi-threaded inference using spawn() and join() operations.”

Paper: https://arxiv.org/pdf/2504.15466

Code: https://github.com/Parallel-Reasoning/APR
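The spawn()/join() pattern named in the quote can be illustrated with a toy sketch. This is not the paper's implementation: the function names `explore`, `spawn`, `join`, and `reason` below are hypothetical, the child "threads" are ordinary Python callables rather than language model decoding calls, and the fixed list of subgoals stands in for whatever the model would decide to explore on its own.

```python
from concurrent.futures import Future, ThreadPoolExecutor
from typing import List


def explore(subgoal: str) -> str:
    # Hypothetical child thread: explores one subgoal and returns a summary.
    # In a real system this would be a model call; here it is a plain string.
    return f"result({subgoal})"


def spawn(pool: ThreadPoolExecutor, subgoals: List[str]) -> List[Future]:
    # Start one child per subgoal; the children run concurrently.
    return [pool.submit(explore, goal) for goal in subgoals]


def join(children: List[Future]) -> List[str]:
    # Block until every child finishes, collecting results in spawn order.
    return [child.result() for child in children]


def reason(problem: str) -> str:
    # Serial step: the parent starts its own context.
    context = [f"problem: {problem}"]
    with ThreadPoolExecutor(max_workers=4) as pool:
        # Parallel step: fan out to children, then merge their summaries.
        children = spawn(pool, ["try A", "try B", "try C"])
        context.extend(join(children))
    # Serial step again: the parent continues with the merged context.
    return " | ".join(context)


if __name__ == "__main__":
    print(reason("x"))
```

The point of the sketch is only the control flow: a parent alternates between serial work and a fan-out/fan-in step, and it sees the children's returned summaries rather than their full traces.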

replied to their post 2 days ago

nice clean GRPO implementation:
- no transformers
- no vllm
- has improved grpo (DAPO)
- under 300 lines
- runs on 24GB (RTX 4090 GPU)

Code: https://github.com/policy-gradient/GRPO-Zero

Jaward's activity

upvoted a paper about 4 hours ago

Learning Adaptive Parallel Reasoning with Language Models

Paper • 2504.15466 • Published 1 day ago • 27

upvoted an article 5 days ago

Cohere on Hugging Face Inference Providers 🔥

Article • Published 7 days ago • 96

upvoted a paper 9 days ago

Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model

Paper • 2504.08685 • Published 12 days ago • 120

upvoted 2 papers 14 days ago

One-Minute Video Generation with Test-Time Training

Paper • 2504.05298 • Published 16 days ago • 96

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published 16 days ago • 168

upvoted 4 papers about 1 month ago

Being-0: A Humanoid Robotic Agent with Vision-Language Models and Modular Skills

Paper • 2503.12533 • Published Mar 16 • 64

ReCamMaster: Camera-Controlled Generative Rendering from A Single Video

Paper • 2503.11647 • Published Mar 14 • 135

Transformers without Normalization

Paper • 2503.10622 • Published Mar 13 • 159

HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization

Paper • 2503.04598 • Published Mar 6 • 19

upvoted a paper about 2 months ago

Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs

Paper • 2503.01743 • Published Mar 3 • 84

upvoted 2 papers 3 months ago

Process Reinforcement through Implicit Rewards

Paper • 2502.01456 • Published Feb 3 • 60

OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models

Paper • 2502.01061 • Published Feb 3 • 213

upvoted an article 3 months ago

Open-R1: a fully open reproduction of DeepSeek-R1

Article • Published Jan 28 • 845

upvoted a paper 3 months ago

Evolving Deeper LLM Thinking

Paper • 2501.09891 • Published Jan 17 • 114

upvoted a collection 4 months ago

Cosmos

Collection • The collection of Cosmos models • 31 items • Updated 9 days ago • 283

upvoted 2 papers 5 months ago

Multimodal Autoregressive Pre-training of Large Vision Encoders

Paper • 2411.14402 • Published Nov 21, 2024 • 47

LLaVA-o1: Let Vision Language Models Reason Step-by-Step

Paper • 2411.10440 • Published Nov 15, 2024 • 124

upvoted 3 papers 6 months ago

AgentStore: Scalable Integration of Heterogeneous Agents As Specialized Generalist Computer Assistant

Paper • 2410.18603 • Published Oct 24, 2024 • 33

GPT-4o System Card

Paper • 2410.21276 • Published Oct 25, 2024 • 86

F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching

Paper • 2410.06885 • Published Oct 9, 2024 • 47