LukeInglis
commited on
Commit
•
c37cafd
1
Parent(s):
f8053cc
Update README.md
Browse files
README.md
CHANGED
@@ -64,7 +64,7 @@ Foundational skills such as reasoning and compositional skills such as creative
|
|
64 |
|
65 |
Additionally, to ensure the data is high-quality and safe, we employ steps to check the questions and answers to ensure that they are grounded and safe. This is done using the same teacher model that generated the data.
|
66 |
|
67 |
-
Our training consists of two major phases: knowledge tuning and skills tuning.
|
68 |
There are two steps in knowledge tuning where the first step learns simple knowledge (short samples) and the second step learns complicated knowledge (longer samples).
|
69 |
The second step uses replay a replay buffer with data from the first step.
|
70 |
Both foundational skills and compositional skills are learned during the skills tuning phases, where a replay buffer of data from the knowledge phase is used.
|
|
|
64 |
|
65 |
Additionally, to ensure the data is high-quality and safe, we employ steps to check the questions and answers to ensure that they are grounded and safe. This is done using the same teacher model that generated the data.
|
66 |
|
67 |
+
Our training consists of two major phases: knowledge tuning and skills tuning.
|
68 |
There are two steps in knowledge tuning where the first step learns simple knowledge (short samples) and the second step learns complicated knowledge (longer samples).
|
69 |
The second step uses replay a replay buffer with data from the first step.
|
70 |
Both foundational skills and compositional skills are learned during the skills tuning phases, where a replay buffer of data from the knowledge phase is used.
|