Di Zhang

qq8933

https://scholar.google.com/citations?user=vxAO250AAAAJ&hl=en

trotsky1997

AI & ML interests

AI4Chem, LLM, Green LLM

qq8933's activity

upvoted a paper about 2 hours ago

Critic-V: VLM Critics Help Catch VLM Errors in Multimodal Reasoning

Paper • 2411.18203 • Published about 23 hours ago • 5

updated a dataset about 16 hours ago

qq8933/OpenLongCoT-SFT-v2-filtered

Viewer • Updated about 16 hours ago • 2.02M

upvoted a paper about 20 hours ago

MolReFlect: Towards In-Context Fine-grained Alignments between Molecules and Texts

Paper • 2411.14721 • Published 6 days ago • 2

replied to jwu323's post 2 days ago

Stay Tuned!

upvoted 2 papers 2 days ago

Recommender Systems in the Era of Large Language Models (LLMs)

Paper • 2307.02046 • Published Jul 5, 2023 • 1

Large Language Models are In-Context Molecule Learners

Paper • 2403.04197 • Published Mar 7 • 2

Reacted to jwu323's post with 🚀 2 days ago

Post

1288

We are excited to announce a new internal project, Rome, focused on advancing LLM reasoning. The code and accompanying paper will be released soon. Stay tuned!

2 replies

updated 3 datasets 3 days ago

updated a dataset 8 days ago

qq8933/llama_o1_offline_training_data_v1

Viewer • Updated 8 days ago • 19.3k • 14 • 1

New activity in qq8933/OpenLongCoT-Pretrain 9 days ago

Can you explain the ratings?

#3 opened 11 days ago by

inkognito1982

replied to their post 14 days ago

You're a genius!

liked a dataset 14 days ago

lmms-lab/RealWorldQA

Viewer • Updated Apr 13 • 765 • 3.64k • 3

liked 2 models 21 days ago

microsoft/OmniParser

Image-Text-to-Text • Updated 1 day ago • 12k • 1.37k

Etched/oasis-500m

Updated 24 days ago • 5.42k • 418

liked a dataset 22 days ago

hicai-zju/SciKnowEval

Viewer • Updated 27 days ago • 70.2k • 415 • 2

replied to their post 23 days ago

main.py is the entry point for fine-tuning, but the code needs further improvement; see 'Call for contributors'.

posted an update 24 days ago

Post

2342

Discovered an outrageous bug on the official ChatGPT website that mainly affects users of ad-blocking plugins. Please make sure to add browser-intake-datadoghq.com to your ad blocker's whitelist. The ChatGPT webpage relies on this site for heartbeat detection, but since it belongs to an ad tracking network, it's included in major ad-blocking lists. (If you're using Clash, also remember to add it to the whitelist.) Otherwise, the send button in the ChatGPT web interface may turn grey after you click it, with no response.

For users with Chinese IP addresses, consider adding this URL to the rules of your U.S. node, as the response headers from this site will report the user's physical location to GPT.
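As a rough illustration of the whitelist entries described above, the sketch below builds the rule strings for an ad blocker and for Clash. It assumes the common Adblock Plus / uBlock Origin exception syntax and Clash's `DOMAIN-SUFFIX` rule type; the helper names and the `US-Node` policy name are hypothetical and should be replaced with whatever your own configuration uses.

```python
# Domain named in the post; everything else here is an illustrative assumption.
DOMAIN = "browser-intake-datadoghq.com"


def adblock_exception(domain: str) -> str:
    """Build an exception (allow) rule in Adblock Plus style filter syntax."""
    return f"@@||{domain}^"


def clash_rule(domain: str, policy: str) -> str:
    """Build a Clash rule line that sends the domain and its subdomains to a policy."""
    return f"DOMAIN-SUFFIX,{domain},{policy}"


if __name__ == "__main__":
    # Line to paste into the ad blocker's custom filter list.
    print(adblock_exception(DOMAIN))
    # Line for the `rules:` section of a Clash config: connect directly.
    print(clash_rule(DOMAIN, "DIRECT"))
    # Alternative: route through a U.S. proxy group ("US-Node" is a hypothetical name).
    print(clash_rule(DOMAIN, "US-Node"))
```

Only one of the two Clash lines should be used, depending on whether the domain should bypass the proxy or go through a specific node.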

3 replies

posted an update 25 days ago

Post

5710

LLaMA-O1: Open Large Reasoning Model Frameworks For Training, Inference and Evaluation With PyTorch and HuggingFace
Large Reasoning Models powered by Monte Carlo Tree Search (MCTS), Self-Play Reinforcement Learning, PPO, AlphaGo Zero's dual policy paradigm and Large Language Models!
https://github.com/SimpleBerry/LLaMA-O1/

What will happen when you compound MCTS ❤ LLM ❤ Self-Play ❤ RLHF?
Just a little bite of strawberry!🍓

Past related works:
LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning (2410.02884)
Accessing GPT-4 level Mathematical Olympiad Solutions via Monte Carlo Tree Self-refine with LLaMa-3 8B (2406.07394)

2 replies