We will write a short technical report on our current progress.
Di Zhang
qq8933
AI & ML interests
AI4Chem, LLM, Green LLM
Recent Activity
liked a dataset about 12 hours ago: OpenCoder-LLM/opc-sft-stage1
updated a dataset about 17 hours ago: qq8933/OpenLongCoT-prm-rectify
liked a dataset 1 day ago: huaXiaKyrie/critique-VQA
qq8933's activity
replied to their post 4 days ago
Post
2910
The first version of LLaMA-O1 has been uploaded to HF now! Here we come!
Supervised:
SimpleBerry/LLaMA-O1-Supervised-1129
Base(Pretrain):
SimpleBerry/LLaMA-O1-Base-1127
Supervised Finetune Dataset:
SimpleBerry/OpenLongCoT-SFT
Pretraining Dataset:
SimpleBerry/OpenLongCoT-Pretrain-1202
RLHF is on the way! View our GitHub Repo:
https://github.com/SimpleBerry/LLaMA-O1
Our ongoing related research:
Accessing GPT-4 level Mathematical Olympiad Solutions via Monte Carlo Tree Self-refine with LLaMa-3 8B (2406.07394)
LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning (2410.02884)
Critic-V: VLM Critics Help Catch VLM Errors in Multimodal Reasoning (2411.18203)
@AdinaY @akhaliq @jwu323
------
GGUF: https://huggingface.co/Lyte/LLaMA-O1-Supervised-1129-Q4_K_M-GGUF
Online demo (CPU-only): SimpleBerry/LLaMA-O1-Supervised-1129-Demo
posted an update 4 days ago
Post
2910
The first version of LLaMA-O1 has been uploaded to HF now! Here we come!
Supervised:
SimpleBerry/LLaMA-O1-Supervised-1129
Base(Pretrain):
SimpleBerry/LLaMA-O1-Base-1127
Supervised Finetune Dataset:
SimpleBerry/OpenLongCoT-SFT
Pretraining Dataset:
SimpleBerry/OpenLongCoT-Pretrain-1202
RLHF is on the way! View our GitHub Repo:
https://github.com/SimpleBerry/LLaMA-O1
Our ongoing related research:
Accessing GPT-4 level Mathematical Olympiad Solutions via Monte Carlo Tree Self-refine with LLaMa-3 8B (2406.07394)
LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning (2410.02884)
Critic-V: VLM Critics Help Catch VLM Errors in Multimodal Reasoning (2411.18203)
@AdinaY @akhaliq @jwu323
------
GGUF: https://huggingface.co/Lyte/LLaMA-O1-Supervised-1129-Q4_K_M-GGUF
Online demo (CPU-only): SimpleBerry/LLaMA-O1-Supervised-1129-Demo
Stay Tuned!
replied to their post 21 days ago
You're a genius!
replied to their post about 1 month ago
main.py is the entry point for fine-tuning, but the code needs further improvement; see 'Call for contributors'.
posted an update about 1 month ago
Post
2399
Discovered an outrageous bug on the ChatGPT official website, especially for those using ad-blocking plugins. Please make sure to add browser-intake-datadoghq.com to your ad block whitelist. The ChatGPT webpage relies on this site for heartbeat detection, but since it belongs to an ad tracking network, it's included in major ad-blocking lists. (If you're using Clash, also remember to add it to the whitelist.) Failing to do so may cause the ChatGPT web interface to display a greyed-out send button after clicking, with no response.
For users with Chinese IP addresses, consider adding this URL to the rules of your U.S. node, as the response headers from this site will report the user's physical location to GPT.
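As a rough illustration of the whitelist advice in this post, the sketch below formats the domain as an exception filter in the Adblock Plus / uBlock Origin style and as a Clash DOMAIN-SUFFIX rule. The helper names and the "US-Node" policy group are hypothetical placeholders, not something the post specifies; check your own blocker's and Clash profile's documentation before copying the output.

```python
# Domain named in the post; everything else here is illustrative.
DOMAIN = "browser-intake-datadoghq.com"


def adblock_exception(domain: str) -> str:
    """Build an exception (allowlist) filter in Adblock Plus / uBlock Origin syntax."""
    return f"@@||{domain}^"


def clash_rule(domain: str, policy: str) -> str:
    """Build a Clash routing rule; `policy` stands in for your own proxy group name."""
    return f"DOMAIN-SUFFIX,{domain},{policy}"


if __name__ == "__main__":
    print(adblock_exception(DOMAIN))
    # "US-Node" is a hypothetical proxy group name; replace it with yours.
    print(clash_rule(DOMAIN, "US-Node"))
```

The first line would go into the blocker's custom filter list, and the second into the rules section of a Clash profile, assuming a proxy group with that name exists.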
posted an update about 1 month ago
Post
5818
LLaMA-O1: Open Large Reasoning Model Frameworks For Training, Inference and Evaluation With PyTorch and HuggingFace
Large Reasoning Models powered by Monte Carlo Tree Search (MCTS), Self-Play Reinforcement Learning, PPO, AlphaGo Zero's dual policy paradigm and Large Language Models!
https://github.com/SimpleBerry/LLaMA-O1/
What will happen when you compound MCTS ❤ LLM ❤ Self-Play ❤ RLHF?
Just a little bite of strawberry!
Past related works:
LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning (2410.02884)
Accessing GPT-4 level Mathematical Olympiad Solutions via Monte Carlo Tree Self-refine with LLaMa-3 8B (2406.07394)
posted an update 4 months ago
Post
1535
Introducing ChemVLM, the first open-source multimodal large language model dedicated to chemistry!
Performance comparable to commercial models and specialized OCR models, but with dialogue capabilities!
✨ 2B/26B models here! AI4Chem/ChemVLM-26B
Seeing and Understanding: Bridging Vision with Chemical Knowledge Via ChemVLM (2408.07246)
replied to their post 5 months ago
Backend error; a fix is in progress.
posted an update 5 months ago
Post
654
Preview:
We will open source the 2.5B ChemVLM and the tool-enhanced ChemLLM-7B in the near future
posted an update 6 months ago
Post
737
A great work based on ChemLLM from the open-source community!
Automatic Scientific Discovery guided by LLM!
https://github.com/zyzisastudyreallyhardguy/LLM4SD
posted an update 6 months ago
Post
1002
A new look based on Ollama Open WebUI!
Also web search, real-time talking, and file RAG!
https://chemllm.org/
posted an update 6 months ago
Post
2002
The first multimodal language model dedicated to chemistry.
Demo: https://v.chemllm.org/
Fine-tuned from ChemLLM-20B and InternViT-6B on the MMChemExam and ChemOCR datasets (coming soon...)
AI4Chem/ChemVLM-26B
ChemLLM: A Chemical Large Language Model (2402.06852)
replied to their post 6 months ago
We forked it from the InternVL repo.