Abdoul Majid O. Thiombiano's picture

140 48

Abdoul Majid O. Thiombiano

thiomajid

·

https://thiomajid.github.io/

AI & ML interests

NLP & Reasoning

Recent Activity

updated a model 3 days ago

thiomajid/bert-finetuned-java-inconsistency

updated a model 3 days ago

thiomajid/bert-finetuned-java-inconsistency

updated a model 3 days ago

thiomajid/bert-finetuned-java-inconsistency

View all activity

Organizations

thiomajid's activity

upvoted a paper 9 days ago

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published 10 days ago • 160

upvoted 2 papers 19 days ago

Vision-LSTM: xLSTM as Generic Vision Backbone

Paper • 2406.04303 • Published Jun 6, 2024 • 1

xLSTM: Extended Long Short-Term Memory

Paper • 2405.04517 • Published May 7, 2024 • 16

upvoted a paper about 1 month ago

World Modeling Makes a Better Planner: Dual Preference Optimization for Embodied Task Planning

Paper • 2503.10480 • Published Mar 13 • 49

upvoted 6 papers about 2 months ago

Training Language Models for Social Deduction with Multi-Agent Reinforcement Learning

Paper • 2502.06060 • Published Feb 9 • 37

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Paper • 2502.14786 • Published Feb 20 • 142

S*: Test Time Scaling for Code Generation

Paper • 2502.14382 • Published Feb 20 • 63

MoM: Linear Sequence Modeling with Mixture-of-Memories

Paper • 2502.13685 • Published Feb 19 • 34

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

Paper • 2502.11089 • Published Feb 16 • 153

Large Language Diffusion Models

Paper • 2502.09992 • Published Feb 14 • 112

upvoted a paper 2 months ago

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Paper • 2502.05171 • Published Feb 7 • 137

upvoted 9 papers 3 months ago

Kimi k1.5: Scaling Reinforcement Learning with LLMs

Paper • 2501.12599 • Published Jan 22 • 113

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 381

Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models

Paper • 2501.11873 • Published Jan 21 • 66

Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps

Paper • 2501.09732 • Published Jan 16 • 72

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published Jan 14 • 285

Tensor Product Attention Is All You Need

Paper • 2501.06425 • Published Jan 11 • 88

Agent Laboratory: Using LLM Agents as Research Assistants

Paper • 2501.04227 • Published Jan 8 • 91

Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Though

Paper • 2501.04682 • Published Jan 8 • 98

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 275