Model Card for LLaMat-2-CIF

LLaMat-2-CIF is a specialized large language model designed to generate and extract information from Crystallographic Information Files.

The model is developed after continued pretraining of LLaMat-2 on 7M instruction-output pairs obtained using CIFs from Materials Project, Google GNoME, and AMCSD


Overview

  • Model Type: Large Language Model (LLM)
  • Base Model: LLaMat-2 (continued pretraining of LLaMat-2 on CIFs)
  • Language: English
  • License: LLaMA-2 License
  • Tags: Material Science, Domain Adaptation, Crystal Structure Generation

Model Details

Key Features

  • Instruction Following Abilities: Answers questions based on CIF files.
  • Applications: Crystal structure generation

Development and Support

  • Developed by: M3RG, IIT Delhi & DAIR, IIT Delhi
  • Compute Support:
    • Edinburgh International Data Facility (EIDF): Provided access to Cerebras CS2 clusters for pretraining.
    • IIT Delhi High-Performance Computing Cluster: Supported fine-tuning and inference stages.

Technical Specifications

Hardware Infrastructure

  • Pretraining: 2 Cerebras CS-2 Wafer-Scale Engines (WSE-2)
  • Finetuning: 2 Cerebras CS-2 Wafer-Scale Engines (WSE-2)
  • Inferencing: 1 NVIDIA A100 80GB GPU

Software Stack

  • Frameworks: PyTorch, Hugging Face Transformers

Model Sources

Downloads last month
2
Inference API
Unable to determine this model's library. Check the docs .

Model tree for m3rg-iitd/llamat-2-cif

Finetuned
m3rg-iitd/llamat-2
Finetuned
(2)
this model

Collection including m3rg-iitd/llamat-2-cif