---
# Athene Noctua 13B

![img](./athene_noctua.png)

# Model Details
* **Trained by**: [ibivibiv](https://huggingface.co/ibivibiv)
* **Library**: [HuggingFace Transformers](https://github.com/huggingface/transformers)
* **Model type**: **athene-noctua-13b** is an auto-regressive language model fine-tuned on the Llama 2 transformer architecture.
* **Language(s)**: English
* **Purpose**: Trained specifically for logic enforcement; it should do well on ARC and other logic benchmarks, as well as critical-thinking tasks. The model is targeted at planning exercises.
* **Comments**: This little guy does pretty well in my logic-puzzle testing for a 13B model. I've been using it for test runs to prime for larger models, but it is doing well enough on those tests to be worth uploading now. Since it is a 13B model, tricky logic can still trip it up, but for its size it performs well.

# Prompting

## Prompt Template (Alpaca style)

```
### Instruction:

<prompt> (without the <>)

### Response:
```

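In code, the template can be applied with a small helper. This is a minimal sketch; the `build_prompt` function is illustrative, not part of the model's tooling, and the sample code below uses slightly tighter spacing than the template, so the exact whitespace likely matters less than the `### Instruction:` / `### Response:` markers.

```python
def build_prompt(instruction: str) -> str:
    """Wrap a plain instruction in the Alpaca-style template shown above."""
    return f"### Instruction:\n\n{instruction}\n\n### Response:\n"

# Example:
# build_prompt("Name three prime numbers.")
# -> '### Instruction:\n\nName three prime numbers.\n\n### Response:\n'
```
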
## Sample Code

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Default all new tensors (including the tokenized inputs below) to the GPU
torch.set_default_device("cuda")

# device_map="auto" lets accelerate place the model weights across available devices
model = AutoModelForCausalLM.from_pretrained("ibivibiv/athene-noctua-13b", torch_dtype="auto", device_map="auto")
tokenizer = AutoTokenizer.from_pretrained("ibivibiv/athene-noctua-13b")

# Alpaca-style prompt, per the template above
inputs = tokenizer("### Instruction: Create a plan for developing the game of snake in python using pygame.\n### Response:\n", return_tensors="pt", return_attention_mask=False)

outputs = model.generate(**inputs, max_length=200)
text = tokenizer.batch_decode(outputs)[0]
print(text)
```

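The call above uses greedy decoding capped at 200 total tokens, which can truncate a longer plan. A hedged variant using standard `transformers` generation arguments is sketched below; the specific values are illustrative defaults, not settings tuned for this model.

```python
# max_new_tokens bounds only the generated continuation (unlike max_length,
# which counts the prompt too); sampling adds diversity at some cost to determinism.
# These values are illustrative, not tuned for athene-noctua-13b.
outputs = model.generate(
    **inputs,
    max_new_tokens=512,
    do_sample=True,
    temperature=0.7,
    top_p=0.95,
)
print(tokenizer.batch_decode(outputs)[0])
```
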
## Citations

```
@misc{open-llm-leaderboard,
  author = {Edward Beeching and Clémentine Fourrier and Nathan Habib and Sheon Han and Nathan Lambert and Nazneen Rajani and Omar Sanseviero and Lewis Tunstall and Thomas Wolf},
  title = {Open LLM Leaderboard},
  year = {2023},
  publisher = {Hugging Face},
  howpublished = "\url{https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard}"
}
```

```
@software{eval-harness,
  author = {Gao, Leo and Tow, Jonathan and Biderman, Stella and Black, Sid and DiPofi, Anthony and Foster, Charles and Golding, Laurence and Hsu, Jeffrey and McDonell, Kyle and Muennighoff, Niklas and Phang, Jason and Reynolds, Laria and Tang, Eric and Thite, Anish and Wang, Ben and Wang, Kevin and Zou, Andy},
  title = {A framework for few-shot language model evaluation},
  month = sep,
  year = 2021,
  publisher = {Zenodo},
  version = {v0.0.1},
  doi = {10.5281/zenodo.5371628},
  url = {https://doi.org/10.5281/zenodo.5371628}
}
```

```
@misc{clark2018think,
  title = {Think you have Solved Question Answering? Try ARC, the AI2 Reasoning Challenge},
  author = {Peter Clark and Isaac Cowhey and Oren Etzioni and Tushar Khot and Ashish Sabharwal and Carissa Schoenick and Oyvind Tafjord},
  year = {2018},
  eprint = {1803.05457},
  archivePrefix = {arXiv},
  primaryClass = {cs.AI}
}
```

```
@misc{zellers2019hellaswag,
  title = {HellaSwag: Can a Machine Really Finish Your Sentence?},
  author = {Rowan Zellers and Ari Holtzman and Yonatan Bisk and Ali Farhadi and Yejin Choi},
  year = {2019},
  eprint = {1905.07830},
  archivePrefix = {arXiv},
  primaryClass = {cs.CL}
}
```

```
@misc{hendrycks2021measuring,
  title = {Measuring Massive Multitask Language Understanding},
  author = {Dan Hendrycks and Collin Burns and Steven Basart and Andy Zou and Mantas Mazeika and Dawn Song and Jacob Steinhardt},
  year = {2021},
  eprint = {2009.03300},
  archivePrefix = {arXiv},
  primaryClass = {cs.CY}
}
```

```
@misc{lin2022truthfulqa,
  title = {TruthfulQA: Measuring How Models Mimic Human Falsehoods},
  author = {Stephanie Lin and Jacob Hilton and Owain Evans},
  year = {2022},
  eprint = {2109.07958},
  archivePrefix = {arXiv},
  primaryClass = {cs.CL}
}
```

```
@misc{DBLP:journals/corr/abs-1907-10641,
  title = {{WINOGRANDE:} An Adversarial Winograd Schema Challenge at Scale},
  author = {Keisuke Sakaguchi and Ronan Le Bras and Chandra Bhagavatula and Yejin Choi},
  year = {2019},
  eprint = {1907.10641},
  archivePrefix = {arXiv},
  primaryClass = {cs.CL}
}
```

```
@misc{DBLP:journals/corr/abs-2110-14168,
  title = {Training Verifiers to Solve Math Word Problems},
  author = {Karl Cobbe and Vineet Kosaraju and Mohammad Bavarian and Mark Chen and Heewoo Jun and Lukasz Kaiser and Matthias Plappert and Jerry Tworek and Jacob Hilton and Reiichiro Nakano and Christopher Hesse and John Schulman},
  year = {2021},
  eprint = {2110.14168},
  archivePrefix = {arXiv},
  primaryClass = {cs.CL}
}
```