Di Zhang

qq8933

https://scholar.google.com/citations?user=vxAO250AAAAJ&hl=en

trotsky1997

AI & ML interests

AI4Chem, LLM, Green LLM

Recent Activity

updated a dataset about 2 hours ago

qq8933/AIME_1983_2024

New activity about 2 hours ago

qq8933/AIME_1983_2024:how about 2024 I

updated a Space 1 day ago

SimpleBerry/LLaMA-O1-Supervised-1129-Demo

Organizations

Posts 18

Post

The first version of LLaMA-O1 is now on HF! Here we come!
Supervised:
SimpleBerry/LLaMA-O1-Supervised-1129
Base (Pretrain):
SimpleBerry/LLaMA-O1-Base-1127
Supervised Finetune Dataset:
SimpleBerry/OpenLongCoT-SFT
Pretraining Dataset:
SimpleBerry/OpenLongCoT-Pretrain-1202
RLHF is on the way! View our GitHub Repo:
https://github.com/SimpleBerry/LLaMA-O1
Our ongoing related research:
Accessing GPT-4 level Mathematical Olympiad Solutions via Monte Carlo Tree Self-refine with LLaMa-3 8B (2406.07394)
LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning (2410.02884)
Critic-V: VLM Critics Help Catch VLM Errors in Multimodal Reasoning (2411.18203)
@AdinaY @akhaliq @jwu323
------
GGUF: https://huggingface.co/Lyte/LLaMA-O1-Supervised-1129-Q4_K_M-GGUF
Online demo (CPU-only): SimpleBerry/LLaMA-O1-Supervised-1129-Demo

Post

The LLaMA-O1 Base and SFT models will be uploaded to HF today.
The RLHF pipeline is ready; we are still waiting on data sampling.

Collections 2

Papers 6

models 1

qq8933/OpenLongCoT-Base-Gemma2-2B

Updated Oct 29 • 1.76k • 8

datasets 31

Collections 2

SimpleBerry/LLaMA-O1-Supervised-1129

SimpleBerry/LLaMA-O1-Base-1127

SimpleBerry/OpenLongCoT-Pretrain-1202

SimpleBerry/OpenLongCoT-SFT

YeungNLP/firefly-train-1.1M

stingning/ultrachat

Open-Orca/OpenOrca

Vezora/Tested-143k-Python-Alpaca

datasets 31

qq8933/AIME_1983_2024

qq8933/UltraChat-200k

qq8933/OpenLongCoT-Pretrain-v2-filtered

qq8933/OpenLongCoT-Pretrain-v2

qq8933/OpenLongCoT-SFT-v2

qq8933/OpenLongCoT-SFT-v2-filtered

qq8933/OpenLongCoT-SFT-problems-v2

qq8933/llama_o1_offline_training_data_v1

qq8933/OpenLongCoT-Pretrain

qq8933/ChemData700K-SMILES-only
