BeaverAI
/

Tunguska-39B-v1b-GGUF

Inference Endpoints

Model card Files Files and versions Community

TheDrummer commited on 11 days ago

Commit

5767fe7

•

1 Parent(s): a7d5987

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -6,7 +6,7 @@
 ## Conclusions (WIP)
 - Upscaling can 'provide room' for further training.
 - Training upscaled models will result in retaining more of the original model's performance & behavior.
-- (Not related to upscaling) The first two layers are sus - their weights are wildly different from the original. I wonder if we could recover smarts by merging that back in with base, or if those layers contain the most influence.
 ## What is the 39B Upscale?

 ## Conclusions (WIP)
 - Upscaling can 'provide room' for further training.
 - Training upscaled models will result in retaining more of the original model's performance & behavior.
+- (Not related to upscaling) The first two layers are sus - their weights are wildly different from the original. I wonder if we could recover smarts by merging that back in with base, or if those layers contain the most influence and must be preserved.
 ## What is the 39B Upscale?