Joocjun (Se June Joo)

upvoted a collection 4 months ago

Cosmos

Collection

The collection of Cosmos models • 31 items • Updated 1 day ago • 283

upvoted a paper 5 months ago

Evaluating Language Models as Synthetic Data Generators

Paper • 2412.03679 • Published Dec 4, 2024 • 49

upvoted a collection 9 months ago

Phi-3

Collection

Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 26 items • Updated 7 days ago • 566

upvoted a paper 9 months ago

A Simulation Benchmark for Autonomous Racing with Large-Scale Human Data

Paper • 2407.16680 • Published Jul 23, 2024 • 12

upvoted a collection 9 months ago

VILA: On Pre-training for Visual Language Models

Collection

10 items • Updated 8 days ago • 53

upvoted 2 papers 10 months ago

LLaRA: Supercharging Robot Learning Data for Vision-Language Policy

Paper • 2406.20095 • Published Jun 28, 2024 • 18

Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion

Paper • 2407.01392 • Published Jul 1, 2024 • 46

upvoted 8 papers about 1 year ago

Language Models as Compilers: Simulating Pseudocode Execution Improves Algorithmic Reasoning in Language Models

Paper • 2404.02575 • Published Apr 3, 2024 • 51

SV3D: Novel Multi-view Synthesis and 3D Generation from a Single Image using Latent Video Diffusion

Paper • 2403.12008 • Published Mar 18, 2024 • 21

Larimar: Large Language Models with Episodic Memory Control

Paper • 2403.11901 • Published Mar 18, 2024 • 34

Vid2Robot: End-to-end Video-conditioned Policy Learning with Cross-Attention Transformers

Paper • 2403.12943 • Published Mar 19, 2024 • 15

SceneScript: Reconstructing Scenes With An Autoregressive Structured Language Model

Paper • 2403.13064 • Published Mar 19, 2024 • 32

upvoted 5 papers over 1 year ago

Medusa: Simple LLM Inference Acceleration Framework with Multiple Decoding Heads

Paper • 2401.10774 • Published Jan 19, 2024 • 56

3D-LFM: Lifting Foundation Model

Paper • 2312.11894 • Published Dec 19, 2023 • 15

Cached Transformers: Improving Transformers with Differentiable Memory Cache

Paper • 2312.12742 • Published Dec 20, 2023 • 14

Generative Multimodal Models are In-Context Learners

Paper • 2312.13286 • Published Dec 20, 2023 • 37

PowerInfer: Fast Large Language Model Serving with a Consumer-grade GPU

Paper • 2312.12456 • Published Dec 16, 2023 • 44

Se June Joo

AI & ML interests

Organizations

Joocjun's activity

Cosmos

Evaluating Language Models as Synthetic Data Generators

Phi-3

A Simulation Benchmark for Autonomous Racing with Large-Scale Human Data

VILA: On Pre-training for Visual Language Models

LLaRA: Supercharging Robot Learning Data for Vision-Language Policy

Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion

Language Models as Compilers: Simulating Pseudocode Execution Improves Algorithmic Reasoning in Language Models

SV3D: Novel Multi-view Synthesis and 3D Generation from a Single Image using Latent Video Diffusion

Larimar: Large Language Models with Episodic Memory Control

Vid2Robot: End-to-end Video-conditioned Policy Learning with Cross-Attention Transformers

SceneScript: Reconstructing Scenes With An Autoregressive Structured Language Model

Mora: Enabling Generalist Video Generation via A Multi-Agent Framework

Humanoid Locomotion as Next Token Prediction

Learning Universal Predictors

Medusa: Simple LLM Inference Acceleration Framework with Multiple Decoding Heads

3D-LFM: Lifting Foundation Model

Cached Transformers: Improving Transformers with Differentiable Memory Cache

Generative Multimodal Models are In-Context Learners

PowerInfer: Fast Large Language Model Serving with a Consumer-grade GPU