Ji-Ha (Ji-Ha)

upvoted a paper 8 months ago

DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search

Paper • 2408.08152 • Published Aug 15, 2024 • 58

upvoted a collection 9 months ago

Llama 3.1

This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated Dec 6, 2024 • 663

upvoted a collection 11 months ago

MatMulfree LM

Collection

Pre-trained models for Matmulfree LM. • 4 items • Updated Jun 10, 2024 • 25

upvoted a paper 11 months ago

DeepSeek-Prover: Advancing Theorem Proving in LLMs through Large-Scale Synthetic Data

Paper • 2405.14333 • Published May 23, 2024 • 41

upvoted a collection 12 months ago

DeepSeek-Math

Collection

DeepSeek Math series • 4 items • Updated Aug 16, 2024 • 21

upvoted a paper 12 months ago

Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models

Paper • 2405.01535 • Published May 2, 2024 • 123

upvoted a collection about 1 year ago

WizardLM

Collection

0 items • Updated 11 days ago • 108

upvoted 13 papers about 1 year ago

MathVerse: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems?

Paper • 2403.14624 • Published Mar 21, 2024 • 53

Cobra: Extending Mamba to Multi-Modal Large Language Model for Efficient Inference

Paper • 2403.14520 • Published Mar 21, 2024 • 36

Evolutionary Optimization of Model Merging Recipes

Paper • 2403.13187 • Published Mar 19, 2024 • 54

Mora: Enabling Generalist Video Generation via A Multi-Agent Framework

Paper • 2403.13248 • Published Mar 20, 2024 • 79

ORPO: Monolithic Preference Optimization without Reference Model

Paper • 2403.07691 • Published Mar 12, 2024 • 65

Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking

Paper • 2403.09629 • Published Mar 14, 2024 • 78

MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training

Paper • 2403.09611 • Published Mar 14, 2024 • 128

Video Editing via Factorized Diffusion Distillation

Paper • 2403.09334 • Published Mar 14, 2024 • 24

DiJiang: Efficient Large Language Models through Compact Kernelization

GAIA: a benchmark for General AI Assistants

Mamba: Linear-Time Sequence Modeling with Selective State Spaces

Exponentially Faster Language Modelling

DreamReward: Text-to-3D Generation with Human Preference
