---
license: llama2
language:
- en
base_model:
- meta-llama/Llama-2-7b
tags:
- materials science
- large language model
---

# Model Card for LLaMat-2

**LLaMat-2** is a large language model specialized for materials science, intended to serve as a foundation model for the domain.

---

## Overview

- **Model Type:** Large Language Model (LLM)
- **Base Model:** LLaMA-2 7B (`meta-llama/Llama-2-7b`); LLaMat-2 is obtained by continued pretraining on materials science data
- **Language:** English
- **License:** LLaMA-2 License
- **Tags:** Materials Science, Domain Adaptation, Table Understanding, Scientific Data Parsing, Materials Copilot

---

## Model Details

### Key Features

- **Applications:** Can be finetuned for information extraction, table understanding, parsing data for research tasks, and crystal structure generation.

### Development and Support

- **Developed by:** [M3RG, IIT Delhi](https://github.com/M3RG-IITD/) & [DAIR, IIT Delhi](https://github.com/dair-iitd)
- **Compute Support:**
  - **Edinburgh International Data Facility (EIDF):** Provided access to Cerebras CS2 clusters for pretraining.
  - **IIT Delhi High-Performance Computing Cluster:** Supported fine-tuning and inference stages.

---

## Technical Specifications

### Hardware Infrastructure

- **Pretraining:** 8 NVIDIA A100 80GB GPUs

### Software Stack

- **Frameworks:** PyTorch, Hugging Face Transformers

---

## Model Sources

- **Repository:** [LLaMat on GitHub](https://github.com/M3RG-IITD/llamat)
- **Compute Resources:** [EIDF Cerebras CS Clusters](https://edinburgh-international-data-facility.ed.ac.uk/services/computing/cerebras-cs)

---
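Since the card lists PyTorch and Hugging Face Transformers as the software stack, a minimal loading sketch might look like the following. It is an illustration under stated assumptions, not the authors' documented usage: `MODEL_ID` is a hypothetical placeholder (the card does not give a Hub ID), the helpers `build_prompt`, `load_model`, and `generate` are names introduced here, and the prompt wording is made up for demonstration. The card describes information extraction as a task the model can be finetuned for, so this only shows the call pattern.

```python
from typing import Any, Tuple

# Hypothetical placeholder: replace with the actual Hub ID or a local checkpoint path.
MODEL_ID = "path/or/hub-id/of/llamat-2"


def build_prompt(passage: str) -> str:
    """Wrap a passage in a plain completion-style prompt (illustrative format only)."""
    return (
        "Extract the material compositions mentioned in the text below.\n\n"
        f"Text: {passage.strip()}\n"
        "Compositions:"
    )


def load_model(model_id: str = MODEL_ID) -> Tuple[Any, Any]:
    """Load tokenizer and causal LM weights with Hugging Face Transformers."""
    # Imported lazily so build_prompt stays usable without the heavy dependencies.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(
        model_id,
        torch_dtype=torch.float16,  # half precision to reduce memory use
        device_map="auto",          # assumes the `accelerate` package is installed
    )
    return tokenizer, model


def generate(passage: str, tokenizer: Any, model: Any, max_new_tokens: int = 64) -> str:
    """Run generation and return only the newly generated continuation."""
    inputs = tokenizer(build_prompt(passage), return_tensors="pt").to(model.device)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    # Drop the prompt tokens so only the model's continuation is decoded.
    new_tokens = output_ids[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)
```

With a valid checkpoint in place, calling `tokenizer, model = load_model()` followed by `generate("The glass contains 70SiO2-30Na2O.", tokenizer, model)` would return the model's continuation as a string.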