Update README.md
Browse files
README.md
CHANGED
@@ -8,9 +8,9 @@ pipeline_tag: question-answering
|
|
8 |
---
|
9 |
# cosmosage
|
10 |
|
11 |
-
Cosmosage is a natural-language cosmology assistant that
|
12 |
|
13 |
-
|
14 |
|
15 |
See https://github.com/tijmen/cosmosage for more details.
|
16 |
|
@@ -52,7 +52,7 @@ ASSISTANT:
|
|
52 |
|
53 |
## Qualitative evaluation
|
54 |
|
55 |
-
cosmosage_v0.2 performs much better than cosmosage_v0.1. While v0.1 did not seem to have picked up much knowledge from the ArXiV papers it was trained on, v0.2 can
|
56 |
|
57 |
I've also been impressed by cosmosage's knowledge about astronomy, as well as other branches of physics. However, in these areas it is less clear how much the performance is due to the pretraining of the Mistral model versus the fine-tuning I did.
|
58 |
|
|
|
8 |
---
|
9 |
# cosmosage
|
10 |
|
11 |
+
Cosmosage is a natural-language cosmology assistant that can answer questions about cosmology.
|
12 |
|
13 |
+
cosmosage_v0.2 is a fine tune of Mistral-7B-v0.1 on various cosmology-related datasets including open-access textbooks and scientific publications. It is intended to be used in Q&A mode, where the model gives a single answer in response to a single question.
|
14 |
|
15 |
See https://github.com/tijmen/cosmosage for more details.
|
16 |
|
|
|
52 |
|
53 |
## Qualitative evaluation
|
54 |
|
55 |
+
cosmosage_v0.2 performs much better than cosmosage_v0.1. While v0.1 did not seem to have picked up much knowledge from the ArXiV papers it was trained on, v0.2 can give surprisingly good answers to highly technical questions about cosmology. It gives certain answers which it could not have known without having read these recent papers, leading me to conclude that it has picked up some knowledge from the ArXiV papers.
|
56 |
|
57 |
I've also been impressed by cosmosage's knowledge about astronomy, as well as other branches of physics. However, in these areas it is less clear how much the performance is due to the pretraining of the Mistral model versus the fine-tuning I did.
|
58 |
|