ilu000 committed
Commit 9803ff0
1 Parent(s): cea8438

Update README.md

Files changed (1):
  README.md +36 -19

README.md CHANGED
@@ -8,31 +8,46 @@ tags:
 - large language model
 - h2o-llmstudio
 inference: false
-thumbnail: https://h2o.ai/etc.clientlibs/h2o/clientlibs/clientlib-site/resources/images/favicon.ico
+thumbnail: >-
+  https://h2o.ai/etc.clientlibs/h2o/clientlibs/clientlib-site/resources/images/favicon.ico
+license: apache-2.0
+datasets:
+- OpenAssistant/oasst1
 ---
 # Model Card
 ## Summary
 
 This model was trained using [H2O LLM Studio](https://github.com/h2oai/h2o-llmstudio).
 - Base model: [tiiuae/falcon-7b](https://huggingface.co/tiiuae/falcon-7b)
+- Dataset preparation: [OpenAssistant/oasst1](https://github.com/h2oai/h2o-llmstudio/blob/1935d84d9caafed3ee686ad2733eb02d2abfce57/app_utils/utils.py#LL1896C5-L1896C28) personalized
 
 
 ## Usage
 
-To use the model with the `transformers` library on a machine with GPUs, first make sure you have the `transformers`, `accelerate` and `torch` libraries installed.
+To use the model with the `transformers` library on a machine with GPUs, first make sure you have the `transformers`, `accelerate`, `torch` and `einops` libraries installed.
 
 ```bash
-pip install transformers==4.28.1
-pip install accelerate==0.18.0
+pip install transformers==4.29.2
+pip install accelerate==0.19.0
 pip install torch==2.0.0
+pip install einops==0.6.1
 ```
 
 ```python
 import torch
-from transformers import pipeline
+from transformers import AutoTokenizer, pipeline
+
+
+tokenizer = AutoTokenizer.from_pretrained(
+    "h2oai/h2ogpt-gm-oasst1-en-2048-falcon-7b-v3",
+    use_fast=False,
+    padding_side="left",
+    trust_remote_code=True,
+)
 
 generate_text = pipeline(
     model="h2oai/h2ogpt-gm-oasst1-en-2048-falcon-7b-v3",
+    tokenizer=tokenizer,
     torch_dtype=torch.float16,
     trust_remote_code=True,
     use_fast=False,
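
Note: the hunk ends mid-call; the remaining pipeline arguments are outside the diff context. For orientation, a minimal sketch of how the rebuilt pipeline is typically invoked; the generation settings below are illustrative assumptions, not part of this commit.

```python
# Illustrative only: the generation kwargs are assumptions, not from this commit.
res = generate_text(
    "Why is drinking water so healthy?",
    max_new_tokens=256,
    do_sample=False,
    num_beams=1,
    repetition_penalty=1.2,
)
print(res[0]["generated_text"])
```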
@@ -62,7 +77,7 @@ print(generate_text.preprocess("Why is drinking water so healthy?")["prompt_text
 <|prompt|>Why is drinking water so healthy?<|endoftext|><|answer|>
 ```
 
-Alternatively, if you prefer to not use `trust_remote_code=True` you can download [h2oai_pipeline.py](h2oai_pipeline.py), store it alongside your notebook, and construct the pipeline yourself from the loaded model and tokenizer:
+Alternatively, you can download [h2oai_pipeline.py](h2oai_pipeline.py), store it alongside your notebook, and construct the pipeline yourself from the loaded model and tokenizer:
 
 
 ```python
@@ -73,12 +88,14 @@ from transformers import AutoModelForCausalLM, AutoTokenizer
 tokenizer = AutoTokenizer.from_pretrained(
     "h2oai/h2ogpt-gm-oasst1-en-2048-falcon-7b-v3",
     use_fast=False,
-    padding_side="left"
+    padding_side="left",
+    trust_remote_code=True,
 )
 model = AutoModelForCausalLM.from_pretrained(
     "h2oai/h2ogpt-gm-oasst1-en-2048-falcon-7b-v3",
     torch_dtype=torch.float16,
-    device_map={"": "cuda:0"}
+    device_map={"": "cuda:0"},
+    trust_remote_code=True,
 )
 generate_text = H2OTextGenerationPipeline(model=model, tokenizer=tokenizer)
 
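Note: the hunk above builds the pipeline from a locally stored `h2oai_pipeline.py`. One way to fetch that file programmatically, assuming the standard `huggingface_hub` download API; this snippet is a suggestion, not part of the commit.

```python
from huggingface_hub import hf_hub_download

# Download h2oai_pipeline.py next to the notebook so that
# `from h2oai_pipeline import H2OTextGenerationPipeline` resolves locally.
hf_hub_download(
    repo_id="h2oai/h2ogpt-gm-oasst1-en-2048-falcon-7b-v3",
    filename="h2oai_pipeline.py",
    local_dir=".",
)
```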
@@ -106,8 +123,17 @@ model_name = "h2oai/h2ogpt-gm-oasst1-en-2048-falcon-7b-v3" # either local folde
 # You can find an example prompt in the experiment logs.
 prompt = "<|prompt|>How are you?<|endoftext|><|answer|>"
 
-tokenizer = AutoTokenizer.from_pretrained(model_name, use_fast=False)
-model = AutoModelForCausalLM.from_pretrained(model_name)
+tokenizer = AutoTokenizer.from_pretrained(
+    model_name,
+    use_fast=False,
+    trust_remote_code=True,
+)
+model = AutoModelForCausalLM.from_pretrained(
+    model_name,
+    torch_dtype=torch.float16,
+    device_map={"": "cuda:0"},
+    trust_remote_code=True,
+)
 model.cuda().eval()
 inputs = tokenizer(prompt, return_tensors="pt", add_special_tokens=False).to("cuda")
 
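Note: the hunk stops after tokenization; the generate/decode step that follows is outside the diff context. A hedged sketch of that step, with illustrative generation arguments that are not part of this commit:

```python
# Sketch of the generation step implied after the tokenization above;
# the generation arguments here are illustrative assumptions.
tokens = model.generate(
    input_ids=inputs["input_ids"],
    attention_mask=inputs["attention_mask"],
    max_new_tokens=256,
    do_sample=False,
)[0]
# Strip the prompt tokens and decode only the model's answer.
answer = tokenizer.decode(tokens[inputs["input_ids"].shape[1]:], skip_special_tokens=True)
print(answer)
```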
@@ -161,15 +187,6 @@ RWForCausalLM(
 This model was trained using H2O LLM Studio and with the configuration in [cfg.yaml](cfg.yaml). Visit [H2O LLM Studio](https://github.com/h2oai/h2o-llmstudio) to learn how to train your own large language models.
 
 
-## Model Validation
-
-Model validation results using [EleutherAI lm-evaluation-harness](https://github.com/EleutherAI/lm-evaluation-harness).
-
-```bash
-CUDA_VISIBLE_DEVICES=0 python main.py --model hf-causal-experimental --model_args pretrained=h2oai/h2ogpt-gm-oasst1-en-2048-falcon-7b-v3 --tasks openbookqa,arc_easy,winogrande,hellaswag,arc_challenge,piqa,boolq --device cuda &> eval.log
-```
-
-
 ## Disclaimer
 
 Please read this disclaimer carefully before using the large language model provided in this repository. Your use of the model signifies your agreement to the following terms and conditions.
 