Minghui Jia's picture

Minghui Jia

Maxwell-Jia

·

Maxwell-Jia

AI & ML interests

None yet

Recent Activity

liked a dataset 4 days ago

Maxwell-Jia/kepler_flare

liked a model 4 days ago

Maxwell-Jia/fcn4flare

new activity 4 days ago

Maxwell-Jia/kepler_flare:[bot] Conversion to Parquet

View all activity

Organizations

Maxwell-Jia's activity

upvoted a paper 11 days ago

No More Adam: Learning Rate Scaling at Initialization is All You Need

Paper • 2412.11768 • Published 14 days ago • 41

upvoted a collection 12 days ago

LLaMA-O1-1129 Datasets, Models, Codes and Papers

8 items • Updated 27 days ago • 18

upvoted a paper 13 days ago

Byte Latent Transformer: Patches Scale Better Than Tokens

Paper • 2412.09871 • Published 18 days ago • 79

upvoted 2 papers 16 days ago

Phi-4 Technical Report

Paper • 2412.08905 • Published 19 days ago • 93

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Paper • 2412.09596 • Published 18 days ago • 92

upvoted 3 papers about 1 month ago

ClinicalBench: Can LLMs Beat Traditional ML Models in Clinical Prediction?

Paper • 2411.06469 • Published Nov 10 • 17

Cut Your Losses in Large-Vocabulary Language Models

Paper • 2411.09009 • Published Nov 13 • 43

LLaMA-Mesh: Unifying 3D Mesh Generation with Language Models

Paper • 2411.09595 • Published Nov 14 • 71

upvoted a paper 2 months ago

Movie Gen: A Cast of Media Foundation Models

Paper • 2410.13720 • Published Oct 17 • 89

upvoted 8 papers 3 months ago

UniMuMo: Unified Text, Music and Motion Generation

Paper • 2410.04534 • Published Oct 6 • 18

ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery

Paper • 2410.05080 • Published Oct 7 • 19

Differential Transformer

Paper • 2410.05258 • Published Oct 7 • 168

Not All LLM Reasoners Are Created Equal

Paper • 2410.01748 • Published Oct 2 • 28

LEOPARD : A Vision Language Model For Text-Rich Multi-Image Tasks

Paper • 2410.01744 • Published Oct 2 • 26

Instruction Following without Instruction Tuning

Paper • 2409.14254 • Published Sep 21 • 27

MaskLLM: Learnable Semi-Structured Sparsity for Large Language Models

Paper • 2409.17481 • Published Sep 26 • 46

RetrievalAttention: Accelerating Long-Context LLM Inference via Vector Retrieval

Paper • 2409.10516 • Published Sep 16 • 39

upvoted 2 papers 4 months ago

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery

Paper • 2408.06292 • Published Aug 12 • 117

Writing in the Margins: Better Inference Pattern for Long Context Retrieval

Paper • 2408.14906 • Published Aug 27 • 138

upvoted a paper 5 months ago

DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search

Paper • 2408.08152 • Published Aug 15 • 52