ZiweiChen
/

BioMistral-Clinical-7B

Text Generation

Model card Files Files and versions Community

ZiweiChen commited on 25 days ago

Commit

2fc7a45

•

1 Parent(s): 40d82a5

Update README.md

Files changed (1) hide show

README.md +39 -16

README.md CHANGED Viewed

@@ -13,25 +13,48 @@ tags:
 ---
 # Model Card for Model ID
-<!-- Provide a quick summary of what the model is/does. -->
-This modelcard aims to be a base template for new models. It has been generated using [this raw template](https://github.com/huggingface/huggingface_hub/blob/main/src/huggingface_hub/templates/modelcard_template.md?plain=1).
 ## Model Details
 ### Model Description
-<!-- Provide a longer summary of what this model is. -->
-- **Developed by:** [More Information Needed]
-- **Funded by [optional]:** [More Information Needed]
-- **Shared by [optional]:** [More Information Needed]
-- **Model type:** [More Information Needed]
-- **Language(s) (NLP):** [More Information Needed]
-- **License:** [More Information Needed]
-- **Finetuned from model [optional]:** [More Information Needed]

 ---
 # Model Card for Model ID
+## How to use
+Loading the model from Hunggingface:
+```python
+from transformers import AutoTokenizer, AutoModelForCausalLM
+tokenizer = AutoTokenizer.from_pretrained("ZiweiChen/BioMistral-Clinical-7B")
+model = AutoModelForCausalLM.from_pretrained("ZiweiChen/BioMistral-Clinical-7B")
+```
+Lightweight model loading can be used - using 4-bit quantization!
+```python
+from transformers import  AutoTokenizer, BitsAndBytesConfig, AutoModelForCausalLM
+import torch
+bnb_config = BitsAndBytesConfig(
+    load_in_4bit=True,
+    bnb_4bit_use_double_quant=True,
+    bnb_4bit_quant_type="nf4",
+    bnb_4bit_compute_dtype=torch.bfloat16
+)
+tokenizer = AutoTokenizer.from_pretrained("ZiweiChen/BioMistral-Clinical-7B")
+model = AutoModelForCausalLM.from_pretrained("ZiweiChen/BioMistral-Clinical-7B", quantization_config=bnb_config)
+```
+How to Generate text:
+```python
+model_device = next(model.parameters()).device
+prompt = """
+How to treat severe obesity?
+"""
+model_input = tokenizer(prompt, return_tensors="pt").to(model_device)
+with torch.no_grad():
+    output = model.generate(**model_input, max_new_tokens=100)
+    answer = tokenizer.decode(output[0], skip_special_tokens=True)
+    print(answer)
+```
 ## Model Details
 ### Model Description