Chujie Zheng

Paper • 2404.16792 • Published Apr 25 • 10 •

commented a paper 6 days ago

Weak-to-Strong Extrapolation Expedites Alignment

New activity in chujiezheng/Llama-3-Instruct-8B-SimPO-ExPO 6 days ago

Possibly wrong model

#1 opened 6 days ago by

ByteBrew23

New activity in chujiezheng/Smaug-Llama-3-70B-Instruct-ExPO 9 days ago

Update README.md

#3 opened 9 days ago by

New activity in chujiezheng/Llama3-8B-Chinese-Chat-ExPO 9 days ago

Update README.md

#2 opened 9 days ago by

New activity in chujiezheng/Llama3-70B-Chinese-Chat-ExPO 9 days ago

Create README.md

#1 opened 9 days ago by

New activity in chujiezheng/Smaug-Llama-3-70B-Instruct-ExPO 9 days ago

Update README.md

#2 opened 9 days ago by

New activity in chujiezheng/Llama3-8B-Chinese-Chat-ExPO 9 days ago

Create README.md

#1 opened 9 days ago by

New activity in chujiezheng/Smaug-Llama-3-70B-Instruct-ExPO 14 days ago

Create README.md

#1 opened 14 days ago by

New activity in chujiezheng/LLaMA3-iterative-DPO-final-ExPO 16 days ago

Create README.md

#1 opened 16 days ago by

New activity in chujiezheng/tulu-2-dpo-13b 17 days ago

Update tokenizer_config.json

#2 opened 17 days ago by

New activity in allenai/tulu-2-dpo-13b 17 days ago

Update tokenizer_config.json

#4 opened 17 days ago by

New activity in allenai/tulu-2-13b 17 days ago

Update tokenizer_config.json

#2 opened 17 days ago by

New activity in chujiezheng/tulu-2-dpo-13b 22 days ago

Update README.md

#1 opened 22 days ago by

New activity in chujiezheng/tulu-2-dpo-7b 22 days ago

Update README.md

#1 opened 22 days ago by

New activity in allenai/tulu-2-dpo-7b 22 days ago

add license

#3 opened 22 days ago by

New activity in allenai/tulu-2-dpo-13b 22 days ago

add license

#3 opened 22 days ago by

New activity in open-llm-leaderboard/requests 22 days ago

Delete chujiezheng/tulu-2-dpo-13b-ExPO_eval_request_False_bfloat16_Original.json

#129 opened 22 days ago by

New activity in chujiezheng/internlm2-chat-1_8b-ExPO about 1 month ago

Update tokenizer_config.json

#1 opened about 1 month ago by

New activity in chujiezheng/internlm2-chat-7b-ExPO about 1 month ago

Update tokenizer_config.json

#1 opened about 1 month ago by

New activity in chujiezheng/internlm2-chat-20b-ExPO about 1 month ago

Update tokenizer_config.json

#1 opened about 1 month ago by

New activity in internlm/internlm2-chat-20b-sft about 1 month ago

fix `eos_token`

#4 opened about 1 month ago by

New activity in internlm/internlm2-chat-7b-sft about 1 month ago

fix `eos_token`

#3 opened about 1 month ago by

New activity in internlm/internlm2-chat-1_8b-sft about 1 month ago

fix `eos_token`

#1 opened about 1 month ago by

New activity in internlm/internlm2-chat-1_8b about 1 month ago

fix `eos_token`

#3 opened about 1 month ago by

New activity in internlm/internlm2-chat-7b about 1 month ago

fix `eos_token`

#12 opened about 1 month ago by

New activity in internlm/internlm2-chat-20b about 1 month ago

fix `eos_token`

#10 opened about 1 month ago by

New activity in google/gemma-1.1-7b-it about 2 months ago

Is 1.1 trained from the same SFT model as 1.0?

#18 opened about 2 months ago by

New activity in Nexusflow/Starling-RM-34B about 2 months ago

Could you share scripts for fast inference?

#3 opened about 2 months ago by

New activity in Nexusflow/Starling-LM-7B-beta about 2 months ago

What prompts are used in RLAIF?

#13 opened about 2 months ago by

New activity in microsoft/phi-1_5 3 months ago

Adding `safetensors` variant of this model

#66 opened 6 months ago by

update `bos_token_id` and `eos_token_id`

#80 opened 3 months ago by

New activity in thu-coai/CharacterGLM-6B 4 months ago

add model

#1 opened 4 months ago by

wandz

New activity in LLM360/CrystalChat 5 months ago

Could you upload a bf16/fp16-version checkpoint?

3

#1 opened 5 months ago by

New activity in TencentARC/LLaMA-Pro-8B 5 months ago

change "use_cache" to true to speed up generation

#6 opened 5 months ago by

New activity in TencentARC/LLaMA-Pro-8B-Instruct 5 months ago

change "use_cache" to true to speed up decoding

#8 opened 5 months ago by

New activity in mistralai/Mistral-7B-Instruct-v0.1 5 months ago

System Prompt

3

#41 opened 8 months ago by

sakshat98

New activity in meta-llama/LlamaGuard-7b 6 months ago

Does not respect nerd guard

#7 opened 6 months ago by

userzyzz

New activity in mistralai/Mistral-7B-Instruct-v0.1 6 months ago

When v2?

#88 opened 6 months ago by

amgadhasan

New activity in thu-coai/CDial-GPT_LCCC-base 6 months ago

Adding `safetensors` variant of this model

#3 opened 7 months ago by

New activity in tiiuae/falcon-40b 7 months ago

Add chat_template so that it can be used for chat out-of-box

#109 opened 7 months ago by

New activity in huggyllama/llama-7b 7 months ago

Add chat_template so that it can be used for chat out-of-box

#8 opened 7 months ago by

New activity in lmsys/vicuna-13b-v1.5 7 months ago

Add chat_template in the tokenizer

#6 opened 7 months ago by

New activity in lmsys/vicuna-7b-v1.5 7 months ago

Add chat_template in the tokenizer

#7 opened 7 months ago by

New activity in thu-coai/LongLM-base 10 months ago

Adding `safetensors` variant of this model

#1 opened 11 months ago by

New activity in thu-coai/esconv 11 months ago

Train-dev-test splits ?

3

#1 opened 11 months ago by

lihVerma

New activity in thu-coai/blenderbot-1B-augesc about 1 year ago

Adding `safetensors` variant of this model

#1 opened about 1 year ago by

New activity in EleutherAI/pythia-6.9b about 1 year ago

Missing checkpoint at step26000

#2 opened about 1 year ago by

New activity in thu-coai/roberta-zh-sensible about 1 year ago

Adding `safetensors` variant of this model

#1 opened about 1 year ago by

New activity in thu-coai/blenderbot-400M-esconv about 1 year ago

Adding `safetensors` variant of this model

#1 opened about 1 year ago by

New activity in thu-coai/roberta-base-cold about 1 year ago

Adding `safetensors` variant of this model

#1 opened about 1 year ago by

New activity in ceggian/bart_post_trained_reddit_batch128 over 1 year ago

About model details

#1 opened almost 2 years ago by