Yaowei Zheng

Update config.json

#2 opened 17 days ago by

Update config.json

#1 opened 17 days ago by

New activity in shenzhi-wang/Llama3-70B-Chinese-Chat 22 days ago

Update tokenizer_config.json

#2 opened 22 days ago by

Update config.json

#1 opened 22 days ago by

New activity in shenzhi-wang/Llama3-8B-Chinese-Chat about 1 month ago

Update README.md

#20 opened about 1 month ago by

Update README.md

#19 opened about 1 month ago by

Delete trainer_log.jsonl

#18 opened about 1 month ago by

Delete all_results.json

#17 opened about 1 month ago by

BFloat16 is not supported on MPS

5

#13 opened about 1 month ago by

RDY97

New activity in hiyouga/LLaMA-Board about 1 month ago

llama3 is available on the local demo but unavailable on Spaces

#9 opened about 1 month ago by

ysharma

New activity in hiyouga/DPO-En-Zh-20k about 1 month ago

[bot] Conversion to Parquet

#1 opened about 1 month ago by

parquet-converter

New activity in shenzhi-wang/Llama3-8B-Chinese-Chat about 1 month ago

🚀Fix metadata dict bug

#10 opened about 1 month ago by

Delete training_args.bin

#9 opened about 1 month ago by

Update generation_config.json

#6 opened about 1 month ago by

Update generation_config.json

#7 opened about 1 month ago by

Update config.json

#5 opened about 1 month ago by

Update model.safetensors.index.json

#4 opened about 1 month ago by

🚀 Fix the bug of checkpoint files

#3 opened about 1 month ago by

add Usage

#2 opened about 1 month ago by

Update README.md

#1 opened about 1 month ago by

New activity in hiyouga/Llama-2-70b-AQLM-2Bit-QLoRA-function-calling about 2 months ago

just for curiosity

9

#1 opened 2 months ago by

prudant

New activity in llamafactory/adgen_tiny about 2 months ago

[bot] Conversion to Parquet

#1 opened about 2 months ago by

parquet-converter

New activity in hiyouga/LLaMA-Board 2 months ago

Add link to paper so it's automatically linked from Arxiv and paper page

#8 opened 2 months ago by

osanseviero

New activity in hiyouga/LLaMA-Board 3 months ago

Update data/dataset_info.json

#3 opened 3 months ago by

tonymds

Upload dev.csv

#4 opened 3 months ago by

Upload jd.json

#5 opened 3 months ago by

Create a

#6 opened 3 months ago by

Create a.json

#7 opened 3 months ago by

New activity in hiyouga/Qwen-14B-Chat-LLaMAfied 3 months ago

Adding Evaluation Results

#2 opened 3 months ago by

leaderboard-pr-bot

New activity in google/gemma-7b-it 3 months ago

how to extract model response from the output of tokenizer

3

#54 opened 3 months ago by

mans-0987

New activity in baichuan-inc/Baichuan2-13B-Chat 3 months ago

Missing module: torch.utils.checkpoint

#13 opened 9 months ago by

New activity in google/gemma-2b-it 3 months ago

Update readme to match chat template

#22 opened 3 months ago by

Update chat template

#21 opened 3 months ago by

pcuenq

New activity in google/gemma-7b-it 3 months ago

Fix chat template incompatibility with ConversationalPipeline

5

#42 opened 3 months ago by

New activity in mistralai/Mistral-7B-v0.1 3 months ago

How to finetune this model mistralai/Mistral-7B-v0.1 and also merge the weights

5

#126 opened 4 months ago by

yeniceriSGK

New activity in mistralai/Mixtral-8x7B-v0.1 4 months ago

How to fine tune mixtral 8x7B?

3

#30 opened 5 months ago by

tzivi

New activity in mistralai/Mixtral-8x7B-v0.1 5 months ago

Fine-tuning toolkit for Mixtral 8x7B MoE model

18

#10 opened 6 months ago by

New activity in THUDM/chatglm3-6b 5 months ago

fix can't set attribute 'eos_token' when loading the saved tokenizer

#27 opened 5 months ago by

New activity in hiyouga/Qwen-14B-Chat-LLaMAfied 5 months ago

eval error with LLaMA-Factory

4

#1 opened 5 months ago by

charry2000

New activity in hiyouga/Baichuan2-7B-Chat-LLaMAfied 7 months ago

Adding Evaluation Results

#1 opened 7 months ago by

leaderboard-pr-bot

New activity in hiyouga/Baichuan2-7B-Base-LLaMAfied 7 months ago

Adding Evaluation Results

#1 opened 7 months ago by

leaderboard-pr-bot

New activity in hiyouga/LLaMA-Board 7 months ago

Apply for community grant: Personal project (storage)

#2 opened 7 months ago by

added duplicate button, title and description

#1 opened 7 months ago by

ysharma

New activity in openchat/openchat_3.5 7 months ago

How to setup system message

13

#5 opened 7 months ago by

fernandofernandes

New activity in hf-accelerate/model-memory-usage 7 months ago

Determining Minimum GPU Memory and Input Text Length Calculation in Model Training

#19 opened 8 months ago by

kobe8-24

New activity in microsoft/phi-1_5 8 months ago

Adding _set_gradient_checkpointing for compatibility

6

#22 opened 9 months ago by

vriveras

New activity in hiyouga/Llama-2-Chinese-13b-chat 10 months ago

how to quantize?

#1 opened 10 months ago by

miraclezst

New activity in Qwen/Qwen-7B 10 months ago

The current implementation of tokenizer cannot adopt left-padding

#2 opened 10 months ago by

New activity in baichuan-inc/Baichuan-13B-Base 11 months ago

Take input attention masks to support left-padded sequences

#1 opened 11 months ago by

New activity in hiyouga/Baichuan-7B-sft 11 months ago

Model merging

#5 opened 11 months ago by

TJSL1111

Why are the answers so long?

4

#4 opened 11 months ago by

sunnima

Not Working. 'BaiChuanForCausalLM' object has no attribute 'generation_config'

#1 opened 12 months ago by

jackaduma

How to merge lora model to base model

#2 opened 12 months ago by

angel113110

New activity in baichuan-inc/Baichuan-7B 12 months ago

How do I get this model to hold a normal conversation? I tested it and something feels off!

7

#6 opened 12 months ago by

Zomens

New activity in THUDM/chatglm-6b about 1 year ago

About the unusual attention_mask of ChatGLM

#40 opened about 1 year ago by