vanillaOVO/supermario_v4

---
base_model: []
tags:
- mergekit
- merge
license: apache-2.0
---
This model is a merge of pre-trained language models, created with the [DARE](https://arxiv.org/abs/2311.03099) method using [mergekit](https://github.com/cg123/mergekit).

A fuller description of the model will be added soon.
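
For readers unfamiliar with DARE, the core idea in the linked paper is to take the delta between fine-tuned and base weights, randomly drop most of its entries, and rescale the survivors by `1 / (1 - drop_rate)` so the expected delta is unchanged. The following is a minimal pure-Python sketch of that step on flat lists of floats. It is not the mergekit implementation, and it says nothing about the configuration used for this model: the function names, the default `drop_rate`, and the plain additive combination of deltas are assumptions made for illustration.

```python
import random


def dare_delta(base, finetuned, drop_rate, rng):
    """Drop-and-rescale the delta between two equally sized weight lists."""
    if not 0.0 <= drop_rate < 1.0:
        raise ValueError("drop_rate must be in [0, 1)")
    scale = 1.0 / (1.0 - drop_rate)
    sparse_delta = []
    for b, f in zip(base, finetuned):
        delta = f - b
        # Keep each delta with probability (1 - drop_rate); rescale the
        # survivors so the expected value of the delta is preserved.
        keep = rng.random() >= drop_rate
        sparse_delta.append(delta * scale if keep else 0.0)
    return sparse_delta


def dare_merge(base, finetuned_models, drop_rate=0.9, seed=0):
    """Add the sparsified deltas of several models onto the base weights.

    Assumption: deltas are combined by plain addition; real merges may
    weight or otherwise combine them differently.
    """
    rng = random.Random(seed)
    merged = list(base)
    for finetuned in finetuned_models:
        for i, d in enumerate(dare_delta(base, finetuned, drop_rate, rng)):
            merged[i] += d
    return merged


if __name__ == "__main__":
    base = [0.10, -0.20, 0.30, 0.00]
    model_a = [0.12, -0.25, 0.30, 0.05]   # hypothetical fine-tuned weights
    model_b = [0.08, -0.20, 0.35, -0.01]  # hypothetical fine-tuned weights
    print(dare_merge(base, [model_a, model_b], drop_rate=0.5))
```

With `drop_rate=0.0` nothing is dropped and the sketch reduces to adding each full delta to the base weights.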

### Loading the Model

Load the model and tokenizer with `transformers`:

```python
import torch
from transformers import MistralForCausalLM, AutoTokenizer

# device_map="auto" places the weights on the available devices;
# it requires the `accelerate` package to be installed.
model = MistralForCausalLM.from_pretrained("vanillaOVO/supermario_v4", device_map="auto")
tokenizer = AutoTokenizer.from_pretrained("vanillaOVO/supermario_v4")
```

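### Generating Text

With the model and tokenizer loaded, generate text as follows:

```python
text = "Large language models are "
# Move the inputs to the model's device; with device_map="auto" the
# model may be on a GPU while the tokenizer returns CPU tensors.
inputs = tokenizer(text, return_tensors="pt").to(model.device)

# Generate up to 256 new tokens and decode the full sequence (prompt included).
outputs = model.generate(**inputs, max_new_tokens=256)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```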
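
The decoded output above includes the prompt itself. If only the continuation is wanted, one option is to slice off the prompt tokens before decoding. The helper below is a sketch under that assumption; `generate_continuation` is a hypothetical name, not part of `transformers`, and any extra keyword arguments are passed through to `model.generate` unchanged.

```python
def generate_continuation(model, tokenizer, prompt, max_new_tokens=256, **generate_kwargs):
    """Return only the newly generated text, without the echoed prompt."""
    # Tokenize and move the inputs to the same device as the model.
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    prompt_len = inputs["input_ids"].shape[1]

    outputs = model.generate(**inputs, max_new_tokens=max_new_tokens, **generate_kwargs)

    # For decoder-only models the output starts with the prompt tokens,
    # so everything after prompt_len is newly generated.
    new_tokens = outputs[0][prompt_len:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)
```

With `model` and `tokenizer` loaded as in the loading section, `generate_continuation(model, tokenizer, "Large language models are ", do_sample=True, temperature=0.7)` would return a sampled continuation; the sampling settings shown are illustrative, not recommended values for this model.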