ScalableMath (ScalableMath)

Longhui98

authored 4 papers 2 months ago

Kimi k1.5: Scaling Reinforcement Learning with LLMs

Paper • 2501.12599 • Published Jan 22 • 111

Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision

Paper • 2403.09472 • Published Mar 14, 2024 • 1

Forward-Backward Reasoning in Large Language Models for Mathematical Verification

Paper • 2308.07758 • Published Aug 15, 2023 • 4

DeepVecFont-v2: Exploiting Transformers to Synthesize Vector Fonts with Higher Quality

Paper • 2303.14585 • Published Mar 25, 2023

wellecks

authored a paper 4 months ago

Evaluating Language Models as Synthetic Data Generators

Paper • 2412.03679 • Published Dec 4, 2024 • 48

zhiqings

authored 2 papers 5 months ago

An Empirical Analysis of Compute-Optimal Inference for Problem-Solving with Language Models

Paper • 2408.00724 • Published Aug 1, 2024 • 1

Lean-STaR: Learning to Interleave Thinking and Proving

Paper • 2407.10040 • Published Jul 14, 2024

Noogal

updated a model 9 months ago

ScalableMath/Lean-STaR-plus

Feature Extraction • Updated Jul 14, 2024 • 5 • 3

Noogal

updated a dataset 9 months ago

ScalableMath/Lean-STaR-plus

Viewer • Updated Jul 14, 2024 • 61.1k • 33 • 3

Noogal

updated 2 models 9 months ago

ScalableMath/Lean-STaR-base

Feature Extraction • Updated Jul 12, 2024 • 7

ScalableMath/Lean-CoT-base

Feature Extraction • Updated Jul 12, 2024 • 5

Noogal

updated 2 datasets 9 months ago

ScalableMath/Lean-STaR-base

Viewer • Updated Jul 12, 2024 • 84.7k • 24 • 2

ScalableMath/Lean-CoT-plus

Viewer • Updated Jul 12, 2024 • 130k • 24 • 3

Noogal

updated a model 9 months ago

ScalableMath/Lean-CoT-plus

Feature Extraction • Updated Jul 12, 2024 • 8

Noogal

updated a dataset 11 months ago

ScalableMath/Lean-CoT-base

Viewer • Updated May 7, 2024 • 52.4k • 24 • 3

zhiqings

authored 2 papers 11 months ago

Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision

Paper • 2403.09472 • Published Mar 14, 2024 • 1

Self-Play Preference Optimization for Language Model Alignment

Paper • 2405.00675 • Published May 1, 2024 • 27

wellecks

authored a paper 11 months ago

Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models

Paper • 2405.01535 • Published May 2, 2024 • 122

zhiqings

updated 2 models about 1 year ago

ScalableMath/llemma-7b-oprm-prm800k-level-1to3-hf

Text Generation • Updated Mar 1, 2024 • 14 • 4

ScalableMath/llemma-7b-orm-prm800k-level-1to3-hf

Text Generation • Updated Mar 1, 2024 • 11 • 1

ScalableMath

AI & ML interests

ScalableMath's activity

Kimi k1.5: Scaling Reinforcement Learning with LLMs

Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision

Forward-Backward Reasoning in Large Language Models for Mathematical Verification

DeepVecFont-v2: Exploiting Transformers to Synthesize Vector Fonts with Higher Quality

Evaluating Language Models as Synthetic Data Generators

An Empirical Analysis of Compute-Optimal Inference for Problem-Solving with Language Models

Lean-STaR: Learning to Interleave Thinking and Proving

ScalableMath/Lean-STaR-plus

ScalableMath/Lean-STaR-plus

ScalableMath/Lean-STaR-base

ScalableMath/Lean-CoT-base

ScalableMath/Lean-STaR-base

ScalableMath/Lean-CoT-plus

ScalableMath/Lean-CoT-plus

ScalableMath/Lean-CoT-base

Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision

Self-Play Preference Optimization for Language Model Alignment

Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models

ScalableMath/llemma-7b-oprm-prm800k-level-1to3-hf

ScalableMath/llemma-7b-orm-prm800k-level-1to3-hf

AI & ML interests

Team members 4

ScalableMath's activity