tingyuansen committed
Commit ddfc5c9 • 1 Parent(s): 988b2b9
Update README.md
README.md
CHANGED
@@ -77,10 +77,11 @@ print(f"Assistant: {response}")
 
 This model is specifically trained on astronomy literature and may not generalize well to other domains. Users should be aware of potential biases in the training data, which may reflect historical trends and biases in astronomical research publications. Additionally, the regex-based extraction method used for processing the LaTeX source files may introduce some biases or inconsistencies in the training data.
 
-Importantly, this model has been superseded by more advanced versions. Here's a performance comparison chart based upon the astronomical benchmarking Q&A as described in [Ting et al. 2024](https://arxiv.org/abs/2407.11194)
+Importantly, this model has been superseded by more advanced versions. Here's a performance comparison chart based upon the astronomical benchmarking Q&A as described in [Ting et al. 2024](https://arxiv.org/abs/2407.11194).
 
 | Model | Score (%) |
 |-------|-----------|
+| **AstroSage-LLaMA-3.1-8B (AstroMLab)** | **80.9** |
 | **AstroLLaMA-2-70B (AstroMLab)** | **76.0** |
 | LLaMA-3.1-8B | 73.7 |
 | Gemma-2-9B | 71.5 |