AstroMLab
/

astrollama-3-8b-base_aic

Text Generation

text-generation-inference

Model card Files Files and versions Community

tingyuansen commited on Sep 29, 2024

Commit

21d055c

·

verified ·

1 Parent(s): bfa7b8b

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -20,7 +20,7 @@ AstroLLaMA-3-8B is a specialized base language model for astronomy, developed by
 ## Model Details
 - **Base Architecture**: LLaMA-3-8b
-- **Training Data**: Abstract, Introduction, and Conclusion (AIC) sections from arXiv's astro-ph category papers (from arXiv's inception up to January 2024)
 - **Data Processing**: Optical character recognition (OCR) on PDF files using the Nougat tool, followed by summarization using Qwen-2-8B and LLaMA-3.1-8B.
 - **Fine-tuning Method**: Continual Pre-Training (CPT) using the LMFlow framework
 - **Training Details**:

 ## Model Details
 - **Base Architecture**: LLaMA-3-8b
+- **Training Data**: Abstract, Introduction, and Conclusion (AIC) sections from arXiv's astro-ph category papers
 - **Data Processing**: Optical character recognition (OCR) on PDF files using the Nougat tool, followed by summarization using Qwen-2-8B and LLaMA-3.1-8B.
 - **Fine-tuning Method**: Continual Pre-Training (CPT) using the LMFlow framework
 - **Training Details**: