Di Zhang

qq8933

AI & ML interests

AI4Chem, LLM, Green LLM

Recent Activity

liked a dataset about 12 hours ago
OpenCoder-LLM/opc-sft-stage1
updated a dataset about 17 hours ago
qq8933/OpenLongCoT-prm-rectify
liked a dataset 1 day ago
huaXiaKyrie/critique-VQA
View all activity

Organizations

AI4Chem's profile picture SimpleBerry Research Lab's profile picture

qq8933's activity

replied to their post 4 days ago
view reply

We will write a short technical report for current progress.

reacted to their post with ๐Ÿš€ 4 days ago
view post
Post
2910
  • 3 replies
ยท
replied to their post 4 days ago
posted an update 4 days ago
view post
Post
2910
  • 3 replies
ยท
posted an update 7 days ago
view post
Post
1276
LLaMA-O1 Base and SFT model will be uploaded to HF today.
RLHF pipeline already ready, still waiting for data sampling.
  • 1 reply
ยท
replied to jwu323's post 9 days ago
reacted to jwu323's post with ๐Ÿš€ 9 days ago
view post
Post
1334
We are excited to announce a new internal project, Rome, focused on advancing LLM reasoning. The code and accompanying paper will be released soon. Stay tuned!
  • 2 replies
ยท
replied to their post 21 days ago
replied to their post about 1 month ago
view reply

main.py is the entry for finetune, but codes need further improvements, see 'Call for contributors'

posted an update about 1 month ago
view post
Post
2399
Discovered an outrageous bug on the ChatGPT official website, especially for those using ad-blocking plugins. Please make sure to add browser-intake-datadoghq.com to your ad block whitelist. The ChatGPT webpage relies on this site for heartbeat detection, but since it belongs to an ad tracking network, it's included in major ad-blocking lists. (If you're using Clash, also remember to add it to the whitelist.) Failing to do so may cause the ChatGPT web interface to display a greyed-out send button after clicking, with no response.

For users with Chinese IP addresses, consider adding this URL to the rules of your U.S. node, as the response headers from this site will report the user's physical location to GPT.
  • 3 replies
ยท
posted an update about 1 month ago
view post
Post
5818
LLaMA-O1: Open Large Reasoning Model Frameworks For Training, Inference and Evaluation With PyTorch and HuggingFace
Large Reasoning Models powered by Monte Carlo Tree Search (MCTS), Self-Play Reinforcement Learning, PPO, AlphaGo Zero's dua policy paradigm and Large Language Models!
https://github.com/SimpleBerry/LLaMA-O1/

What will happen when you compound MCTS โค LLM โค Self-Play โคRLHF?
Just a little bite of strawberry!๐Ÿ“

Past related works:
LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning (2410.02884)
Accessing GPT-4 level Mathematical Olympiad Solutions via Monte Carlo Tree Self-refine with LLaMa-3 8B (2406.07394)
  • 2 replies
ยท
posted an update 4 months ago
replied to their post 5 months ago
view reply

ๅŽ็ซฏๅผ‚ๅธธ๏ผŒๆŒ‚ๆŽ‰ไบ†๏ผŒๅœจไฟฎๅค

posted an update 5 months ago
view post
Post
654
Preview:
We will open source the 2.5B ChemVLM and the tool-enhanced ChemLLM-7B in the near future
posted an update 6 months ago
reacted to their post with ๐Ÿ˜Ž 6 months ago
view post
Post
2615
Tools Ready!
Thanks to ChemCrow's great work, ChemLLM supports proficiency toolkits Now, Include,
Molecule Name Retrivel
Molecule Property Query
Patent Check
Molecule Safety Query
Try it on chemllm.org
  • 2 replies
ยท
posted an update 6 months ago
view post
Post
2615
Tools Ready!
Thanks to ChemCrow's great work, ChemLLM supports proficiency toolkits Now, Include,
Molecule Name Retrivel
Molecule Property Query
Patent Check
Molecule Safety Query
Try it on chemllm.org
  • 2 replies
ยท
posted an update 6 months ago
view post
Post
1002
New Appearance from Ollama Open WebUI!
And Also web search, Realtime talking and File RAG!
https://chemllm.org/


posted an update 6 months ago
replied to their post 6 months ago