Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
52
8
66
Chujie Zheng
chujiezheng
Follow
leomartinsjf's profile picture
gordonhu's profile picture
linz's profile picture
11 followers
·
4 following
https://chujiezheng.github.io/
ChujieZheng
chujiezheng
AI & ML interests
Large Language Models
Organizations
chujiezheng
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
New activity in
princeton-nlp/Llama-3-Instruct-8B-SimPO
1 day ago
add chat_template
#3 opened 1 day ago by
chujiezheng
commented
a paper
6 days ago
Weak-to-Strong Extrapolation Expedites Alignment
Paper
•
2404.16792
•
Published
Apr 25
•
10
•
1
New activity in
chujiezheng/Llama-3-Instruct-8B-SimPO-ExPO
6 days ago
Possibly wrong model
2
#1 opened 6 days ago by
ByteBrew23
New activity in
chujiezheng/Smaug-Llama-3-70B-Instruct-ExPO
9 days ago
Update README.md
#3 opened 9 days ago by
chujiezheng
New activity in
chujiezheng/Llama3-8B-Chinese-Chat-ExPO
9 days ago
Update README.md
#2 opened 9 days ago by
chujiezheng
New activity in
chujiezheng/Llama3-70B-Chinese-Chat-ExPO
9 days ago
Create README.md
#1 opened 9 days ago by
chujiezheng
New activity in
chujiezheng/Smaug-Llama-3-70B-Instruct-ExPO
9 days ago
Update README.md
#2 opened 9 days ago by
chujiezheng
New activity in
chujiezheng/Llama3-8B-Chinese-Chat-ExPO
9 days ago
Create README.md
#1 opened 9 days ago by
chujiezheng
New activity in
chujiezheng/Smaug-Llama-3-70B-Instruct-ExPO
14 days ago
Create README.md
#1 opened 14 days ago by
chujiezheng
New activity in
chujiezheng/LLaMA3-iterative-DPO-final-ExPO
16 days ago
Create README.md
#1 opened 16 days ago by
chujiezheng
New activity in
chujiezheng/tulu-2-dpo-13b
17 days ago
Update tokenizer_config.json
#2 opened 17 days ago by
chujiezheng
New activity in
allenai/tulu-2-dpo-13b
17 days ago
Update tokenizer_config.json
2
#4 opened 17 days ago by
chujiezheng
New activity in
allenai/tulu-2-13b
17 days ago
Update tokenizer_config.json
2
#2 opened 17 days ago by
chujiezheng
New activity in
chujiezheng/tulu-2-dpo-13b
22 days ago
Update README.md
#1 opened 22 days ago by
chujiezheng
New activity in
chujiezheng/tulu-2-dpo-7b
22 days ago
Update README.md
#1 opened 22 days ago by
chujiezheng
New activity in
allenai/tulu-2-dpo-7b
22 days ago
add license
1
#3 opened 22 days ago by
chujiezheng
New activity in
allenai/tulu-2-dpo-13b
22 days ago
add license
1
#3 opened 22 days ago by
chujiezheng
New activity in
open-llm-leaderboard/requests
22 days ago
Delete chujiezheng/tulu-2-dpo-13b-ExPO_eval_request_False_bfloat16_Original.json
1
#129 opened 22 days ago by
chujiezheng
New activity in
chujiezheng/internlm2-chat-1_8b-ExPO
about 1 month ago
Update tokenizer_config.json
#1 opened about 1 month ago by
chujiezheng
New activity in
chujiezheng/internlm2-chat-7b-ExPO
about 1 month ago
Update tokenizer_config.json
#1 opened about 1 month ago by
chujiezheng
New activity in
chujiezheng/internlm2-chat-20b-ExPO
about 1 month ago
Update tokenizer_config.json
#1 opened about 1 month ago by
chujiezheng
New activity in
internlm/internlm2-chat-20b-sft
about 1 month ago
fix `eos_token`
#4 opened about 1 month ago by
chujiezheng
New activity in
internlm/internlm2-chat-7b-sft
about 1 month ago
fix `eos_token`
#3 opened about 1 month ago by
chujiezheng
New activity in
internlm/internlm2-chat-1_8b-sft
about 1 month ago
fix `eos_token`
#1 opened about 1 month ago by
chujiezheng
New activity in
internlm/internlm2-chat-1_8b
about 1 month ago
fix `eos_token`
#3 opened about 1 month ago by
chujiezheng
New activity in
internlm/internlm2-chat-7b
about 1 month ago
fix `eos_token`
#12 opened about 1 month ago by
chujiezheng
New activity in
internlm/internlm2-chat-20b
about 1 month ago
fix `eos_token`
#10 opened about 1 month ago by
chujiezheng
New activity in
google/gemma-1.1-7b-it
about 2 months ago
Is 1.1 trained from the same SFT model as 1.0?
#18 opened about 2 months ago by
chujiezheng
New activity in
Nexusflow/Starling-RM-34B
about 2 months ago
Could you share scripts for fast inference?
#3 opened about 2 months ago by
chujiezheng
New activity in
Nexusflow/Starling-LM-7B-beta
about 2 months ago
What prompts are used in RLAIF?
1
#13 opened about 2 months ago by
chujiezheng
New activity in
microsoft/phi-1_5
3 months ago
Adding `safetensors` variant of this model
1
#66 opened 6 months ago by
SFconvertbot
update `bos_token_id` and `eos_token_id`
#80 opened 3 months ago by
chujiezheng
New activity in
thu-coai/CharacterGLM-6B
4 months ago
add model
#1 opened 4 months ago by
wandz
New activity in
LLM360/CrystalChat
5 months ago
Could you upload a bf16/fp16-version checkpoint?
3
#1 opened 5 months ago by
chujiezheng
New activity in
TencentARC/LLaMA-Pro-8B
5 months ago
change "use_cache" to true to speed up generation
#6 opened 5 months ago by
chujiezheng
New activity in
TencentARC/LLaMA-Pro-8B-Instruct
5 months ago
change "use_cache" to true to speed up decoding
#8 opened 5 months ago by
chujiezheng
New activity in
mistralai/Mistral-7B-Instruct-v0.1
5 months ago
System Prompt
3
#41 opened 8 months ago by
sakshat98
New activity in
meta-llama/LlamaGuard-7b
6 months ago
Does not respect nerd guard
1
#7 opened 6 months ago by
userzyzz
New activity in
mistralai/Mistral-7B-Instruct-v0.1
6 months ago
When v2?
1
#88 opened 6 months ago by
amgadhasan
New activity in
thu-coai/CDial-GPT_LCCC-base
6 months ago
Adding `safetensors` variant of this model
#3 opened 7 months ago by
SFconvertbot
New activity in
tiiuae/falcon-40b
7 months ago
Add chat_template so that it can be used for chat out-of-box
#109 opened 7 months ago by
chujiezheng
New activity in
huggyllama/llama-7b
7 months ago
Add chat_template so that it can be used for chat out-of-box
#8 opened 7 months ago by
chujiezheng
New activity in
lmsys/vicuna-13b-v1.5
7 months ago
Add chat_template in the tokenizer
#6 opened 7 months ago by
chujiezheng
New activity in
lmsys/vicuna-7b-v1.5
7 months ago
Add chat_template in the tokenizer
2
#7 opened 7 months ago by
chujiezheng
New activity in
thu-coai/LongLM-base
10 months ago
Adding `safetensors` variant of this model
#1 opened 11 months ago by
SFconvertbot
New activity in
thu-coai/esconv
11 months ago
Train-dev-test splits ?
3
#1 opened 11 months ago by
lihVerma
New activity in
thu-coai/blenderbot-1B-augesc
about 1 year ago
Adding `safetensors` variant of this model
#1 opened about 1 year ago by
SFconvertbot
New activity in
EleutherAI/pythia-6.9b
about 1 year ago
Missing checkpoint at step26000
1
#2 opened about 1 year ago by
chujiezheng
New activity in
thu-coai/roberta-zh-sensible
about 1 year ago
Adding `safetensors` variant of this model
#1 opened about 1 year ago by
SFconvertbot
New activity in
thu-coai/blenderbot-400M-esconv
about 1 year ago
Adding `safetensors` variant of this model
#1 opened about 1 year ago by
SFconvertbot
New activity in
thu-coai/roberta-base-cold
about 1 year ago
Adding `safetensors` variant of this model
#1 opened about 1 year ago by
SFconvertbot
New activity in
ceggian/bart_post_trained_reddit_batch128
over 1 year ago
About model details
#1 opened almost 2 years ago by
chujiezheng