band2001/stolaf-angora-4000

@@ -123,7 +123,7 @@ To train the model, the data needs to be in the following format. Note the data
 Once the data is in the correct format, QLoRA is recommended. The model can be fine-tuned either using mlx-lm and mps (to tune on an Apple Silicon machine) or a bitsandbytes configuration and cuda (to tune on a machine with Nvidia GPUs).
-#### Preprocessing [optional]
 To preprocess your data to be in the correct format outlined above, you can use the following helper function:
@@ -177,98 +177,23 @@ If you look at the GitHub repo for this project, mlx_lora.sh includes the comman
 ## Evaluation
-<!-- This section describes the evaluation protocols and provides the results. -->
-### Testing Data, Factors & Metrics
-#### Testing Data
-<!-- This should link to a Dataset Card if possible. -->
-[More Information Needed]
-#### Factors
-<!-- These are the things the evaluation is disaggregating by, e.g., subpopulations or domains. -->
-[More Information Needed]
-#### Metrics
-<!-- These are the evaluation metrics being used, ideally with a description of why. -->
-[More Information Needed]
 ### Results
-[More Information Needed]
-#### Summary
-## Model Examination [optional]
-<!-- Relevant interpretability work for the model goes here -->
-[More Information Needed]
-## Environmental Impact
-<!-- Total emissions (in grams of CO2eq) and additional considerations, such as electricity usage, go here. Edit the suggested text below accordingly -->
-Carbon emissions can be estimated using the [Machine Learning Impact calculator](https://mlco2.github.io/impact#compute) presented in [Lacoste et al. (2019)](https://arxiv.org/abs/1910.09700).
-- **Hardware Type:** [More Information Needed]
-- **Hours used:** [More Information Needed]
-- **Cloud Provider:** [More Information Needed]
-- **Compute Region:** [More Information Needed]
-- **Carbon Emitted:** [More Information Needed]
-## Technical Specifications [optional]
-### Model Architecture and Objective
-[More Information Needed]
-### Compute Infrastructure
-[More Information Needed]
-#### Hardware
-[More Information Needed]
-#### Software
-[More Information Needed]
-## Citation [optional]
-<!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. -->
-**BibTeX:**
-[More Information Needed]
-**APA:**
-[More Information Needed]
-## Glossary [optional]
-<!-- If relevant, include terms and calculations in this section that can help readers understand the model or model card. -->
-[More Information Needed]
-## More Information [optional]
-[More Information Needed]
-## Model Card Authors [optional]
-[More Information Needed]
 ## Model Card Contact
-[More Information Needed]

 Once the data is in the correct format, QLoRA is recommended. The model can be fine-tuned either using mlx-lm and mps (to tune on an Apple Silicon machine) or a bitsandbytes configuration and cuda (to tune on a machine with Nvidia GPUs).
+#### Preprocessing
 To preprocess your data to be in the correct format outlined above, you can use the following helper function:
 ## Evaluation
+The Angora models were evaluated on two metrics: test loss and perplexity. The table below summarizes the results for the model at each iteration count.
 ### Results
+| Number of iterations | Testing Loss | Perplexity |
+|:---------------------|:-------------|:-----------|
+| 800 | 0.569 | 1.766 |
+| 1600 | 0.302 | 1.352 |
+| 2400 | 0.225 | 1.252 |
+| 3200 | 0.185 | 1.203 |
+| 4000 | 0.170 | 1.185 |
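The perplexity column in the table above appears to be the exponential of the testing loss (for instance, exp(0.569) ≈ 1.766), which is the usual relationship when the loss is a mean per-token cross-entropy measured in nats. The following is a minimal Python sketch that checks the reported values under that assumption. The `reported` list simply copies the table, and `perplexity` is a hypothetical helper name used for illustration, not part of the project's code.

```python
import math

# (iterations, testing loss, reported perplexity), copied from the results table.
reported = [
    (800, 0.569, 1.766),
    (1600, 0.302, 1.352),
    (2400, 0.225, 1.252),
    (3200, 0.185, 1.203),
    (4000, 0.170, 1.185),
]


def perplexity(loss: float) -> float:
    """Perplexity, ASSUMING the loss is mean per-token cross-entropy in nats."""
    return math.exp(loss)


for iterations, loss, reported_ppl in reported:
    computed = perplexity(loss)
    # Small differences are expected because the table rounds to three decimals.
    print(
        f"{iterations:>5}  loss={loss:.3f}  "
        f"computed={computed:.3f}  reported={reported_ppl:.3f}"
    )
```

If the loss were instead reported in bits, the corresponding formula would be `2 ** loss`, which does not match the table; this is why the sketch assumes natural-log units.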
+### Testing Data
+The testing data is available as the [test split of the stolaf-angora dataset](https://huggingface.co/datasets/band2001/stolaf-angora/viewer/default/test).
 ## Model Card Contact
+Ben Anderson - [ander6@stolaf.edu](mailto:ander6@stolaf.edu)
+Keegan Murray - [murray7@stolaf.edu](mailto:murray7@stolaf.edu)