tingyuansen commited on
Commit
21d055c
·
verified ·
1 Parent(s): bfa7b8b

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -20,7 +20,7 @@ AstroLLaMA-3-8B is a specialized base language model for astronomy, developed by
20
  ## Model Details
21
 
22
  - **Base Architecture**: LLaMA-3-8b
23
- - **Training Data**: Abstract, Introduction, and Conclusion (AIC) sections from arXiv's astro-ph category papers (from arXiv's inception up to January 2024)
24
  - **Data Processing**: Optical character recognition (OCR) on PDF files using the Nougat tool, followed by summarization using Qwen-2-8B and LLaMA-3.1-8B.
25
  - **Fine-tuning Method**: Continual Pre-Training (CPT) using the LMFlow framework
26
  - **Training Details**:
 
20
  ## Model Details
21
 
22
  - **Base Architecture**: LLaMA-3-8b
23
+ - **Training Data**: Abstract, Introduction, and Conclusion (AIC) sections from arXiv's astro-ph category papers
24
  - **Data Processing**: Optical character recognition (OCR) on PDF files using the Nougat tool, followed by summarization using Qwen-2-8B and LLaMA-3.1-8B.
25
  - **Fine-tuning Method**: Continual Pre-Training (CPT) using the LMFlow framework
26
  - **Training Details**: