Charles Cai

charlescai2016

AI & ML interests

None yet

Recent Activity

liked a model 3 days ago

flock-io/Flock_Web3_Agent_Model

liked a model 8 days ago

all-hands/openhands-lm-32b-v0.1

liked a model 8 days ago

rasbt/llama-3.2-from-scratch

View all activity

Organizations

charlescai2016's activity

upvoted an article 23 days ago

Article

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

Jan 23

• 169

upvoted a collection about 1 month ago

Model Merging

Collection

Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! • 30 items • Updated Jun 12, 2024 • 236

upvoted an article about 1 month ago

Article

Multivariate Probabilistic Time Series Forecasting with Informer

Mar 10, 2023

• 18

upvoted a collection about 2 months ago

SmolLM2

Collection

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated Feb 20 • 251

upvoted an article 2 months ago

Article

Introducing the SQL Console on Datasets

Sep 17, 2024

• 23

upvoted a paper 2 months ago

Reward-Guided Speculative Decoding for Efficient LLM Reasoning

Paper • 2501.19324 • Published Jan 31 • 39

upvoted a collection 2 months ago

Reasoning Datasets

Collection

Distilled synthetic Reasoning datasets • 7 items • Updated Feb 2 • 60

upvoted a paper 4 months ago

VisDoM: Multi-Document QA with Visually Rich Elements Using Multimodal Retrieval-Augmented Generation

Paper • 2412.10704 • Published Dec 14, 2024 • 15

upvoted 2 papers 5 months ago

LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning

Paper • 2410.02884 • Published Oct 3, 2024 • 55

BOND: Aligning LLMs with Best-of-N Distillation

Paper • 2407.14622 • Published Jul 19, 2024 • 19

upvoted a paper 6 months ago

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models

Paper • 2409.17146 • Published Sep 25, 2024 • 113

upvoted an article 7 months ago

Article

Preference Optimization for Vision Language Models

Jul 10, 2024

• 63

upvoted a paper 7 months ago

Let's Verify Step by Step

Paper • 2305.20050 • Published May 31, 2023 • 10

upvoted a paper 8 months ago

Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers

Paper • 2408.06195 • Published Aug 12, 2024 • 72

upvoted an article 8 months ago

Article

Memory-efficient Diffusion Transformers with Quanto and Diffusers

Jul 30, 2024

• 64

upvoted a paper 8 months ago

GPUDrive: Data-driven, multi-agent driving simulation at 1 million FPS

Paper • 2408.01584 • Published Aug 2, 2024 • 10