Bullard

Charletta1

AI & ML interests

Machine learning, research, and all things AI, especially all things ethical

Recent Activity

upvoted an article 14 days ago

StarCoder: A State-of-the-Art LLM for Code

liked a Space 15 days ago

nanotron/ultrascale-playbook

upvoted an article 20 days ago

A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality

Charletta1's activity

upvoted an article 14 days ago

Article

StarCoder: A State-of-the-Art LLM for Code

May 4, 2023

• 54

liked a Space 15 days ago

2.51k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

upvoted an article 20 days ago

Article

A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality

Mar 4

• 73

upvoted 17 papers about 2 months ago

Efficient Gaussian Splatting for Monocular Dynamic Scene Rendering via Sparse Time-Variant Attribute Modeling

Paper • 2502.20378 • Published Feb 27 • 4

R1-T1: Fully Incentivizing Translation Capability in LLMs via Reasoning Learning

Paper • 2502.19735 • Published Feb 27 • 9

Building Interactable Replicas of Complex Articulated Objects via Gaussian Splatting

Paper • 2502.19459 • Published Feb 26 • 11

SoRFT: Issue Resolving with Subtask-oriented Reinforced Fine-Tuning

Paper • 2502.20127 • Published Feb 27 • 9

Mobius: Text to Seamless Looping Video Generation via Latent Shift

Paper • 2502.20307 • Published Feb 27 • 19

Guardians of the Agentic System: Preventing Many Shots Jailbreak with Agentic System

Paper • 2502.16750 • Published Feb 23 • 10

Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think

Paper • 2502.20172 • Published Feb 27 • 28

FlexiDiT: Your Diffusion Transformer Can Easily Generate High-Quality Samples with Less Compute

Paper • 2502.20126 • Published Feb 27 • 20

Lean and Mean: Decoupled Value Policy Optimization with Global Value Guidance

Paper • 2502.16944 • Published Feb 24 • 10

NeoBERT: A Next-Generation BERT

Paper • 2502.19587 • Published Feb 26 • 39

UniTok: A Unified Tokenizer for Visual Generation and Understanding

Paper • 2502.20321 • Published Feb 27 • 30

CODESYNC: Synchronizing Large Language Models with Dynamic Code Evolution at Scale

Paper • 2502.16645 • Published Feb 23 • 22

FINEREASON: Evaluating and Improving LLMs' Deliberate Reasoning through Reflective Puzzle Solving

Paper • 2502.20238 • Published Feb 27 • 24

LongRoPE2: Near-Lossless LLM Context Window Scaling

Paper • 2502.20082 • Published Feb 27 • 38