Commit c45397a by natolambert
1 parent: efc4219
Update README.md
README.md CHANGED
@@ -11,11 +11,10 @@ language:
 
 # TODO
 * Change using model section if is in transformers
-* Update architecture changes
 * Update summary of Dolma 1.7
 * Remove installation requirements?
 * Evals pre and post annealing
-* details on annealing
+* details on annealing / accessing checkpoint (remove previous checkpoint instructions)
 
 # Model Card for OLMo 7B v1.7
 
@@ -37,6 +36,7 @@ The core models released in this batch are the following:
 
 *Note: OLMo 7B v1.7 also includes QKV clipping.*
 
+
 We are releasing many checkpoints for these models, for every 1000 traing steps.
 The naming convention is `step1000-tokens4B`.
 
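The README text in this diff names checkpoints as `step1000-tokens4B`, pairing a training step with a token count. As a rough sketch of how such names could be parsed, assuming every checkpoint follows the pattern `step<N>-tokens<M>B` with integer values (the diff shows only this one example, and `parse_checkpoint_name` is a hypothetical helper, not part of any OLMo tooling):

```python
import re
from typing import NamedTuple


class Checkpoint(NamedTuple):
    step: int             # training step, e.g. 1000
    tokens_billions: int  # tokens seen so far, in billions, e.g. 4


# Assumed pattern, inferred from the single example `step1000-tokens4B`.
_CHECKPOINT_RE = re.compile(r"step(\d+)-tokens(\d+)B")


def parse_checkpoint_name(name: str) -> Checkpoint:
    """Split a checkpoint name into its step and token count."""
    match = _CHECKPOINT_RE.fullmatch(name)
    if match is None:
        raise ValueError(f"unrecognized checkpoint name: {name!r}")
    return Checkpoint(step=int(match.group(1)), tokens_billions=int(match.group(2)))


if __name__ == "__main__":
    print(parse_checkpoint_name("step1000-tokens4B"))
```

Names that do not match the assumed pattern raise `ValueError` rather than being guessed at, since the README does not specify any other format.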