chiliu commited on
Commit
935f4d9
2 Parent(s): 0dfb2c9 52b08a9

Merge branch 'main' of https://huggingface.co/CobraMamba/mamba-gpt-3b-v2 into main

Browse files
Files changed (1) hide show
  1. README.md +1 -13
README.md CHANGED
@@ -12,23 +12,11 @@ thumbnail: >-
12
  license: apache-2.0
13
  ---
14
  # Model Card
15
- ## Github
16
- https://github.com/chi2liu/mamba-gpt-3b
17
-
18
- | Metric | Value |
19
- |-----------------------|-------|
20
- | MMLU (5-shot) | 25.3 |
21
- | ARC (25-shot) | 40.5 |
22
- | HellaSwag (10-shot) | 64.9 |
23
- | TruthfulQA (0-shot) | 37.1 |
24
- | Avg. | 42.0 |
25
-
26
- We use state-of-the-art [Language Model Evaluation Harness](https://github.com/EleutherAI/lm-evaluation-harness) to run the benchmark tests above.
27
 
28
  ## Summary
29
 
30
  We have fine-tuned the open-lama model and surpassed the original model in multiple evaluation subtasks, making it currently the best performing 3B model with comparable performance to llama-7b
31
- - Base model: [openlm-research/open_llama_3b](https://huggingface.co/openlm-research/open_llama_3b)
32
 
33
  ## Usage
34
 
 
12
  license: apache-2.0
13
  ---
14
  # Model Card
 
 
 
 
 
 
 
 
 
 
 
 
15
 
16
  ## Summary
17
 
18
  We have fine-tuned the open-lama model and surpassed the original model in multiple evaluation subtasks, making it currently the best performing 3B model with comparable performance to llama-7b
19
+ - Base model: [openlm-research/open_llama_3b_v2](https://huggingface.co/openlm-research/open_llama_3b_v2)
20
 
21
  ## Usage
22