m3rg-iitd/llamat-2

Model Card for LLaMat-2

LLaMat-2 is a domain-specialized large language model intended as a foundation model for materials science.

Overview

Model Type: Large Language Model (LLM)
Base Model: LLaMA-2-7B (LLaMat-2 is obtained by continued pretraining of meta-llama/Llama-2-7b on materials science data)
Language: English
License: LLaMA-3 License
Tags: Materials Science, Domain Adaptation, Table Understanding, Scientific Data Parsing, Materials Copilot

Model Details

Key Features

Applications: Can be fine-tuned for information extraction, table understanding, parsing data for research tasks, and crystal structure generation.
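As a rough illustration of preparing data for the information-extraction use case, the sketch below builds one prompt/completion pair for supervised fine-tuning. The template, the field names, the `make_example` helper, and the sample sentence are all hypothetical; this model card does not specify a prompt format or a dataset schema.

```python
import json

# Hypothetical instruction template; the model card does not define one.
TEMPLATE = (
    "### Task: {task}\n"
    "### Input:\n{text}\n"
    "### Output:\n"
)


def make_example(task: str, text: str, target: dict) -> dict:
    """Build one prompt/completion pair for supervised fine-tuning."""
    prompt = TEMPLATE.format(task=task, text=text)
    # Serialize the target as JSON so the output can be parsed back reliably.
    completion = json.dumps(target, sort_keys=True)
    return {"prompt": prompt, "completion": completion}


# Illustrative sample only; not taken from any LLaMat training set.
example = make_example(
    task="Extract the material, the property, and its value.",
    text="The band gap of the ZnO film was measured to be 3.3 eV.",
    target={"material": "ZnO", "property": "band gap", "value": 3.3, "unit": "eV"},
)

print(example["prompt"] + example["completion"])
```

Emitting the target as JSON is a design choice made here for easy validation of model outputs; any structured format that can be parsed deterministically would serve the same purpose.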

Development and Support

Developed by: M3RG, IIT Delhi & DAIR, IIT Delhi
Compute Support:
- Edinburgh International Data Facility (EIDF): Provided access to Cerebras CS-2 clusters for pretraining.
- IIT Delhi High-Performance Computing Cluster: Supported fine-tuning and inference stages.

Technical Specifications

Hardware Infrastructure

Pretraining: 8 NVIDIA A100 80GB GPUs

Software Stack

Frameworks: PyTorch, Hugging Face Transformers
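With this stack, loading the checkpoint might look like the minimal sketch below. It assumes the repository loads through the standard `AutoTokenizer` and `AutoModelForCausalLM` classes (plausible for a LLaMA-derived model, but not confirmed by this card) and that casting to bfloat16 is acceptable for inference. The helper names `load_model` and `complete` are illustrative, not part of any LLaMat API.

```python
MODEL_ID = "m3rg-iitd/llamat-2"  # repository id as shown on this model page


def load_model(model_id: str = MODEL_ID):
    """Load tokenizer and model; downloads the weights on first call."""
    # Local imports keep this module importable without the heavy dependencies.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    # Assumption: bfloat16 is used to halve memory relative to the F32 checkpoint.
    model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.bfloat16)
    model.eval()
    return tokenizer, model


def complete(tokenizer, model, prompt: str, max_new_tokens: int = 64) -> str:
    """Greedy-decode a continuation of `prompt` and return only the new text."""
    import torch

    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    with torch.no_grad():
        output_ids = model.generate(
            **inputs, max_new_tokens=max_new_tokens, do_sample=False
        )
    # Drop the prompt tokens so only the generated continuation is decoded.
    new_tokens = output_ids[0, inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)


# Usage (not run here, since it downloads the full checkpoint):
# tokenizer, model = load_model()
# print(complete(tokenizer, model, "Perovskite solar cells are"))
```

Since LLaMat-2 is described as a pretrained foundation model rather than a chat model, plain text continuation is used here instead of a chat template.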

Model Sources

Repository: LLaMat on GitHub
Compute Resources: EIDF Cerebras CS-2 clusters

Safetensors

Model size: 6.74B params
Tensor type: F32
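The parameter count and tensor type give a quick estimate of the memory needed for the weights alone. The sketch below does that arithmetic; the helper `weight_memory_gb` is illustrative, and the figures exclude activations, KV cache, and optimizer state.

```python
def weight_memory_gb(n_params: float, bytes_per_param: int) -> float:
    """Memory for the weights alone, in decimal gigabytes (1 GB = 1e9 bytes)."""
    return n_params * bytes_per_param / 1e9


N_PARAMS = 6.74e9  # parameter count reported on this page

# float32 matches the stored checkpoint; bfloat16 is a common inference cast.
for name, nbytes in [("float32", 4), ("bfloat16", 2)]:
    print(f"{name}: {weight_memory_gb(N_PARAMS, nbytes):.2f} GB")
```

At 4 bytes per parameter the weights take about 27 GB, which fits on a single 80 GB accelerator; casting to a 2-byte type halves that.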

Model tree for m3rg-iitd/llamat-2

Base model: meta-llama/Llama-2-7b
Finetuned (33): this model

Collection including m3rg-iitd/llamat-2

LLaMat: Foundational Large Language Models for Materials Research (6 items, updated Dec 13, 2024)