varadsrivastava
/

BAI_Arg_Alpha

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

BAI_Arg_Alpha / README.md

varadsrivastava's picture

varadsrivastava

Update README.md

c14feb8 verified 6 months ago

|

history blame contribute delete

1.17 kB

	---
	language:
	- en
	license: apache-2.0
	tags:
	- text-generation-inference
	- transformers
	- unsloth
	- llama
	- trl
	base_model: unsloth/llama-3-8b-Instruct-bnb-4bit
	---

	# Model: BAI_LLM_FinArg

	- Developed by: varadsrivastava
	- License: apache-2.0
	- Base Model : unsloth/llama-3-8b-Instruct-bnb-4bit

	# For Proper Inference, please use:
	!pip install "unsloth[colab-new] @ git+https://GitHub.com/unslothai/unsloth.git

	### Loading the fine-tuned model and the tokenizer for inference
	from unsloth import FastLanguageModel

	model, tokenizer = FastLanguageModel.from_pretrained(
	model_name = "varadsrivastava/BAI_LLM_FinArg",
	max_seq_length = 20,
	dtype = torch.bfloat16,
	load_in_4bit = True
	)

	### Using FastLanguageModel for fast inference
	FastLanguageModel.for_inference(model)

	# Prompt template:
	"""<\|begin_of_text\|><\|start_header_id\|>system<\|end_header_id\|>

	{instruction}<\|eot_id\|><\|start_header_id\|>user<\|end_header_id\|>

	Sentence: {row['text']}<\|eot_id\|><\|start_header_id\|>assistant<\|end_header_id\|>

	Class: {row['label']}<\|eot_id\|>"""

	NOTE: This model was trained 2x faster using Unsloth and Huggingface's TRL library.