βš–οΈπŸ‰ Indian Legal Qwen 2.5 β€” 1.5B (LoRA Adapter)

Base Model Type Domain Method Acts License

πŸ”΅ This is the lightweight LoRA adapter β€” load it on top of the base Qwen2.5-1.5B-Instruct model. For ready-to-run inference see the Merged Model Β· For CPU/Ollama usage see the GGUF.


πŸ“– Model Description

Indian Legal Qwen 2.5 β€” 1.5B (Adapter) is a LoRA adapter that domain-adapts unsloth/Qwen2.5-1.5B-Instruct-bnb-4bit, fine-tuned using QLoRA on a structured question-answer dataset covering all 1,059 sections of India's three landmark 2023 criminal justice reform acts:

Act Full Name Replaces Sections
πŸ“• BNS 2023 Bharatiya Nyaya Sanhita IPC 1860 358
πŸ“— BNSS 2023 Bharatiya Nagarik Suraksha Sanhita CrPC 1973 531
πŸ“˜ BSA 2023 Bharatiya Sakshya Adhiniyam Indian Evidence Act 1872 170

Trained on 6,354 instruction-format QA pairs β€” 6 question types per section covering definitions, scenarios, legal elements, exceptions, and consequences β€” giving it broad, structured coverage of India's reformed criminal law framework. This adapter-only release keeps the download small and lets you swap it in or out of the base model at will.


πŸ”— Model Family β€” Qwen 2.5 1.5B

Variant Repo Best For
🟒 Merged GSMS-B/Indian-Legal-Qwen2.5-1.5B Out-of-the-box inference, Gradio / API deployment
πŸ”΅ LoRA Adapter (this repo) GSMS-B/Indian-Legal-Qwen2.5-1.5B-Adapter Lightweight loading on top of base model
🟑 GGUF (Quantized) GSMS-B/Indian-Legal-Qwen2.5-1.5B-GGUF CPU inference via Ollama / llama.cpp

πŸš€ Quick Start

from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel
import torch

base_model_id = "unsloth/Qwen2.5-1.5B-Instruct-bnb-4bit"
adapter_id    = "GSMS-B/Indian-Legal-Qwen2.5-1.5B-Adapter"

tokenizer = AutoTokenizer.from_pretrained(base_model_id)
base_model = AutoModelForCausalLM.from_pretrained(
    base_model_id,
    torch_dtype=torch.float16,
    device_map="auto"
)
model = PeftModel.from_pretrained(base_model, adapter_id)

SYSTEM = "You are an expert legal assistant specializing in Indian criminal law β€” BNS, BNSS, and BSA 2023."

def ask(question):
    messages = [
        {"role": "system", "content": SYSTEM},
        {"role": "user",   "content": question}
    ]
    text   = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
    inputs = tokenizer(text, return_tensors="pt").to(model.device)
    with torch.no_grad():
        out = model.generate(**inputs, max_new_tokens=300, temperature=0.1,
                             do_sample=True, pad_token_id=tokenizer.eos_token_id)
    return tokenizer.decode(out[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True)

print(ask("What is a Zero FIR under BNSS 2023?"))

🎯 Recommended Use Cases

⚠️ Important Note: This model has been domain-adapted on structured QA data and works best as a component in a larger pipeline rather than a standalone answer engine. Direct usage without retrieval context may produce incomplete or imprecise answers on complex legal queries.

βœ… Where this model excels

Use Case πŸ’‘ How to Use
πŸ” RAG Pipeline Pair with a BM25 or vector retriever over BNS/BNSS/BSA texts; feed retrieved sections as context for grounded, citation-backed answers
πŸ€– Legal Chatbot Backend Use as the generation backbone of a legal assistant app with a ChromaDB / FAISS document store
πŸ“š Legal Education Tool Build interactive Q&A apps for law students and practitioners learning the 2023 criminal justice reforms
πŸ”Ž Section Lookup Assistant Combine with a section index to surface the exact BNS / BNSS / BSA provision relevant to a given situation
🧩 Modular Deployment Swap the adapter in/out of the base model, or combine with other LoRA adapters
πŸ§ͺ Further Fine-tuning Use as a starting point for more specialised adaptation (e.g., only BNSS procedure, only BSA evidence rules)
πŸ“ Structured Legal Summarization Summarize individual sections when the section text is supplied as input context
βš–οΈ Comparative Law Analysis Highlight differences between old acts (IPC/CrPC/IEA) and their 2023 replacements

❌ Not recommended for

  • Standalone legal advice without a retrieval component
  • High-stakes legal decisions without qualified human review
  • Jurisdictions or acts outside BNS / BNSS / BSA 2023

πŸ‹οΈ Training Details

Property Value
πŸ€– Base model unsloth/Qwen2.5-1.5B-Instruct-bnb-4bit
πŸ”§ Fine-tuning method QLoRA
πŸŽ›οΈ LoRA rank 64
πŸŽ›οΈ LoRA alpha 128
🧩 Target modules q_proj, k_proj, v_proj, o_proj, gate_proj, up_proj, down_proj
πŸ“Š Training data 6,354 QA pairs β€” 1,059 sections Γ— 6 question types
πŸ” Epochs 3
πŸ“¦ Batch size (per device) 4
πŸ“ˆ Learning rate 2e-4
βš™οΈ Optimizer adamw_8bit
πŸ’» Hardware Google Colab T4 GPU
πŸ› οΈ Framework Unsloth + TRL SFTTrainer
πŸ’¬ Prompt format ChatML

πŸ“Š Training Dataset

πŸ“‚ Dataset πŸ”— Link
Indian Legal QA β€” BNS + BNSS + BSA 2023 GSMS-B/Indian-Legal-QA-BNS-BNSS-BSA

6 question types per section: definitional_topic Β· definitional_section Β· scenario Β· elements Β· exceptions Β· consequence


πŸ‘€ Author

GSMS-B β€” Bugatha Ganasyam Mani Sankar πŸ€— Hugging Face Profile


⚠️ Disclaimer

This model is intended for research and educational purposes only. It does not constitute legal advice. Outputs should not be relied upon for any legal decision without review by a qualified legal professional. The model's responses reflect patterns in training data and may contain errors or omissions.


⚑ Fine-tuned using Unsloth for training efficiency.

Downloads last month
17
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Model tree for GSMS-B/Indian-Legal-Qwen2.5-1.5B-Adapter

Adapter
(18)
this model

Dataset used to train GSMS-B/Indian-Legal-Qwen2.5-1.5B-Adapter