|
--- |
|
license: other |
|
language: |
|
- en |
|
--- |
|
 |
|
- (Nemo 12B instruct as base) |
|
- 200k subset of GU_instruct for 3 epochs. |
|
# Uses Mistral Formatting |
|
|
|
Notes: One off train most likely, this was done purely for internal testing purposes but seemed ok enough to release. I do not plan to offer any kind of extended support for using this model, so your mileage may vary depending on use and context size. |