Update README.md
README.md
CHANGED
@@ -47,6 +47,7 @@ You are a helpful, respectful and honest assistant. Always answer as helpfully a
 
 {prompt} [/INST] {model_reply} [INST] {prompt} [/INST]
 ```
+**N.B.** Set your pad_token_id=18610 in your generator, otherwise it returns gibberish.
 
 ### Example usage
 An example question you can ask:
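As a side note, here is one way that multi-turn format could be assembled programmatically. This is a minimal sketch: the `build_prompt` helper and the Dutch example question are illustrative, not part of the model card, and it assumes each reply is spliced in between `[/INST]` and the next `[INST]` block exactly as the template above suggests.

```python
# Sketch (untested): assemble a conversation into the [INST] format shown above.
def build_prompt(turns):
    """turns: list of (user_prompt, model_reply) pairs; the last reply may be None."""
    parts = []
    for user_prompt, model_reply in turns:
        parts.append(f"[INST] {user_prompt} [/INST]")
        if model_reply is not None:
            parts.append(f" {model_reply} ")
    return "".join(parts)

# Single-turn example (the question itself is illustrative):
print(build_prompt([("Wat is de hoofdstad van Nederland?", None)]))
# -> [INST] Wat is de hoofdstad van Nederland? [/INST]
```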
@@ -89,7 +90,6 @@ Don't be evil.
 
 <!-- This section is meant to convey both technical and sociotechnical limitations. -->
 - It's not quite perfect Dutch yet, but a very promising start.
-- The model rarely generates EOS tokens and goes on a ramble. A quick and dirty way to mitigate this is to system prompt it to generate an `</s>` after 5 sentences.
 
 ### Recommendations
 
@@ -98,7 +98,8 @@ Don't be evil.
 
 ## How to Get Started with the Model
 
-If you already have a pipeline running llama 2 7B chat in huggingface format, just call this one instead.
+If you already have a pipeline running llama 2 7B chat in huggingface format, just call this one instead.
+**N.B.** Set pad_token_id=18610 in your generator, otherwise it returns gibberish.
 
 ## Training Details
 
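A minimal sketch of that drop-in swap, assuming a standard transformers text-generation pipeline; the repo id below is a placeholder, not the model's actual name:

```python
# Sketch: reuse an existing Llama 2 7B chat pipeline by swapping in this model.
# "your-org/llama-2-7b-chat-dutch" is a placeholder; use the actual repo id.
from transformers import pipeline

generator = pipeline(
    "text-generation",
    model="your-org/llama-2-7b-chat-dutch",
    device_map="auto",
)

output = generator(
    "[INST] Wie was Willem van Oranje? [/INST]",
    max_new_tokens=200,
    pad_token_id=18610,  # per the N.B. above; other values reportedly return gibberish
)
print(output[0]["generated_text"])
```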
@@ -124,6 +125,7 @@ If you already have a pipeline running llama 2 7B chat in huggingface format, ju
 #### Speeds, Sizes, Times [optional]
 
 <!-- This section provides information about throughput, start/end time, checkpoint size if relevant, etc. -->
+We reached 32 tokens/second on a V100S without trying anything fancy.
 
 [More Information Needed]
 
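The card does not say how that throughput figure was measured; as an assumption, a rough estimate with a transformers pipeline (the `generator` object from the sketch above) could be taken like this:

```python
# Sketch: rough tokens/second estimate for a text-generation pipeline.
# Measures wall-clock time and counts only newly generated tokens.
import time

def tokens_per_second(generator, prompt, max_new_tokens=256):
    start = time.perf_counter()
    out = generator(prompt, max_new_tokens=max_new_tokens, pad_token_id=18610)
    elapsed = time.perf_counter() - start
    tok = generator.tokenizer
    n_prompt = len(tok(prompt)["input_ids"])
    n_total = len(tok(out[0]["generated_text"])["input_ids"])
    return (n_total - n_prompt) / elapsed
```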
@@ -169,15 +171,7 @@ If you already have a pipeline running llama 2 7B chat in huggingface format, ju
 
 <!-- Total emissions (in grams of CO2eq) and additional considerations, such as electricity usage, go here. Edit the suggested text below accordingly -->
 
-
-
-- **Hardware Type:** [More Information Needed]
-- **Hours used:** [More Information Needed]
-- **Cloud Provider:** [More Information Needed]
-- **Compute Region:** [More Information Needed]
-- **Carbon Emitted:** [More Information Needed]
-
-## Technical Specifications [optional]
+Yes.
 
 ### Model Architecture and Objective
 