Shivneri Marathi LLM is being built to bring the benefits of generative AI to India's non-English-speaking population, especially Marathi speakers. Marathi has the third-largest number of native speakers in India, after Hindi and Bengali; almost 83 million people speak the language. This is a preliminary version of our Marathi large language model (LLM)! Built on the Gemma 7B base model, Shivneri LLM can generate creative and informative text in both Marathi and English. This is just the beginning: we are constantly improving Shivneri, and more features are on the horizon!

  • Developed by: Amit Ghadge
  • Model type: Decoder-only large language model (LLM) with a transformer architecture
  • Language(s) (NLP): Marathi, English
  • Finetuned from model: Gemma-7B

This is a very preliminary version. Please use it with caution. We suggest waiting for further updates and the final models to try out.

  • Training data: AI4Bharat/Sangraha dataset
  • Training procedure: Continually pretrained with LoRA
  • Hardware: A100 80 GB
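To give a rough sense of why LoRA makes continual pretraining of a 7B model practical on a single A100 80 GB, the sketch below shows the general LoRA idea in plain Python: the base weight W stays frozen and only a low-rank update (alpha / rank) * B A is trained. This is an illustration of the technique in general, not the actual training code for Shivneri. The layer sizes, rank, and alpha used here are hypothetical, since this card does not state the values that were used.

```python
def lora_param_counts(d_in, d_out, rank):
    """Return (full, lora) trainable-parameter counts for one linear layer."""
    full = d_in * d_out            # training the dense weight W directly
    lora = rank * (d_in + d_out)   # A is (rank x d_in), B is (d_out x rank)
    return full, lora


def matvec(M, v):
    """Multiply a matrix (list of rows) by a vector (list of numbers)."""
    return [sum(m * vi for m, vi in zip(row, v)) for row in M]


def lora_forward(x, W, A, B, alpha, rank):
    """Compute y = W x + (alpha / rank) * B (A x); W is frozen, A and B are trained."""
    base = matvec(W, x)
    update = matvec(B, matvec(A, x))
    scale = alpha / rank
    return [b + scale * u for b, u in zip(base, update)]


# Hypothetical sizes, chosen only for illustration (not Shivneri's real config).
full, lora = lora_param_counts(d_in=4096, d_out=4096, rank=16)
print(f"full fine-tuning: {full:,} trainable parameters per layer")
print(f"LoRA (rank 16):   {lora:,} trainable parameters per layer")
```

When B is initialised to zeros, as is common for LoRA, `lora_forward` returns exactly the base model's output, so training starts from the unmodified Gemma weights.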

Meet the Developers

Get to know the creators behind this innovative model and follow their contributions to the field:

Amit Ghadge, "Shivneri-LLM: Your Bilingual Marathi and English Text Generation LLM"


We hope this model serves as a valuable tool in your NLP toolkit and look forward to seeing the advancements it will enable in the understanding and generation of the Marathi language.

