tingyuansen
committed on
Update README.md
README.md
CHANGED
@@ -20,7 +20,7 @@ AstroLLaMA-3-8B is a specialized base language model for astronomy, developed by
 ## Model Details
 
 - **Base Architecture**: LLaMA-3-8b
-- **Training Data**: Abstract, Introduction, and Conclusion (AIC) sections from arXiv's astro-ph category papers
+- **Training Data**: Abstract, Introduction, and Conclusion (AIC) sections from arXiv's astro-ph category papers
 - **Data Processing**: Optical character recognition (OCR) on PDF files using the Nougat tool, followed by summarization using Qwen-2-8B and LLaMA-3.1-8B.
 - **Fine-tuning Method**: Continual Pre-Training (CPT) using the LMFlow framework
 - **Training Details**: