H-D-T/Buzz-8b-Large-v0.5


Alignment-Lab-AI committed on May 8

Commit 81eaacd (1 parent: ac2a77a)

Update README.md

Files changed (1)

README.md +0 -6


@@ -36,12 +36,6 @@ The Buzz model, Dataset, and Code are to be released to build a toolkit that aim
 the **Buzz dataset** and two additional models: **Buzz-2.5B-Small** and **Buzz-5B-Medium**, the codebase to refine, filter and augment the data, as well as prune and train your own variants, will additionally be released in the coming days.
-## Performance
-Buzz-8b-Large achieves remarkably low train and validation loss, with unseen data loss reaching around **0.5** by the end of training. This performance showcases the effectiveness of our novel iterative fine-tuning approach, which maximizes the reuse of pretrained weights. Even the smallest variant, Buzz-Small, maintains a steady train loss of approximately **0.4-0.6**, on entirely new data and hold out sets.
-[ benchmark scores table here]
 ## Iterative Fine-Tuning Methodology
 Our research builds upon the concepts introduced in several key papers, including:
