Update README.md

tags:
- qwen1.5
- qwen2
---

This is the Mistral version of Alibaba Cloud's [Qwen1.5-7B-Chat](https://huggingface.co/Qwen/Qwen1.5-7B-Chat) model.
The original conversion script can be found at https://github.com/hiyouga/LLaMA-Factory/blob/main/tests/llamafy_qwen.py.
I have made modifications to make it compatible with Qwen1.5.
This model was converted with https://github.com/Minami-su/character_AI_open/blob/main/mistral_qwen2.py.
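
Because the converted checkpoint is stored in Mistral format, Transformers should dispatch it to the standard Mistral classes rather than Qwen2. A minimal sanity check, assuming the repo id used in the usage example below:

```python
from transformers import AutoConfig

# The converted repo should report Mistral's model type rather than Qwen2's.
config = AutoConfig.from_pretrained("Minami-su/Qwen1.5-7B-Chat_mistral")
print(config.model_type)  # expected: "mistral"
```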

Usage:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer, TextStreamer

tokenizer = AutoTokenizer.from_pretrained("Minami-su/Qwen1.5-7B-Chat_mistral")
model = AutoModelForCausalLM.from_pretrained("Minami-su/Qwen1.5-7B-Chat_mistral", torch_dtype="auto", device_map="auto")
streamer = TextStreamer(tokenizer, skip_prompt=True, skip_special_tokens=True)

messages = [
    # Placeholder prompt; the original message content is elided in the diff view.
    {"role": "user", "content": "Hello!"}
]
inputs = tokenizer.apply_chat_template(messages, tokenize=True, add_generation_prompt=True, return_tensors="pt")
inputs = inputs.to("cuda")
generate_ids = model.generate(inputs, max_length=32768, streamer=streamer)
```
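
The evaluation below loads the model in 4-bit. A minimal sketch of an equivalent loading path, assuming `bitsandbytes` is installed; the exact quantization settings used for the runs are not stated, so the config here is an assumption:

```python
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

# 4-bit quantization via bitsandbytes; the compute dtype is an assumption,
# the README only states that evaluation used "load in 4bit".
quant_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_compute_dtype=torch.float16,
)
model_4bit = AutoModelForCausalLM.from_pretrained(
    "Minami-su/Qwen1.5-7B-Chat_mistral",
    quantization_config=quant_config,
    device_map="auto",
)
```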

## Test

Original model (Qwen1.5-7B-Chat), loaded in 4-bit:

```
hf-causal (pretrained=Qwen1.5-7B-Chat), limit: None, provide_description: False, num_fewshot: 0, batch_size: 8
|    Task     |Version| Metric |Value |   |Stderr|
|-------------|------:|--------|-----:|---|-----:|
|arc_challenge|      0|acc     |0.4155|±  |0.0144|
|             |       |acc_norm|0.4480|±  |0.0145|
|truthfulqa_mc|      1|mc1     |0.3513|±  |0.0167|
|             |       |mc2     |0.5165|±  |0.0159|
|winogrande   |      0|acc     |0.6330|±  |0.0135|
```

Converted model (Qwen1.5-7B-Chat_mistral), loaded in 4-bit:

```
hf-causal (pretrained=Qwen1.5-7B-Chat_mistral), limit: None, provide_description: False, num_fewshot: 0, batch_size: 16
|    Task     |Version| Metric |Value |   |Stderr|
|-------------|------:|--------|-----:|---|-----:|
|arc_challenge|      0|acc     |0.4172|±  |0.0144|
|             |       |acc_norm|0.4480|±  |0.0145|
|truthfulqa_mc|      1|mc1     |0.3488|±  |0.0167|
|             |       |mc2     |0.5161|±  |0.0159|
|winogrande   |      0|acc     |0.6306|±  |0.0136|
```
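
The tables above match the output format of EleutherAI's lm-evaluation-harness (pre-0.4 releases, where `hf-causal` was a model type). A sketch of a comparable run through the old Python API; the task list and batch size come from the headers above, while the exact way 4-bit loading was wired in is not stated:

```python
from lm_eval import evaluator

# Pre-0.4 lm-evaluation-harness API; "hf-causal" matches the header above.
# 4-bit loading may require the "hf-causal-experimental" model type instead.
results = evaluator.simple_evaluate(
    model="hf-causal",
    model_args="pretrained=Minami-su/Qwen1.5-7B-Chat_mistral",
    tasks=["arc_challenge", "truthfulqa_mc", "winogrande"],
    num_fewshot=0,
    batch_size=16,
)
print(results["results"])
```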