Update README.md
Browse files
README.md
CHANGED
@@ -13,7 +13,7 @@ library_name: transformers
|
|
13 |
<!-- Provide a quick summary of what the model is/does. -->
|
14 |
|
15 |
AI Squared's `dlite-v2-355m` is a large language
|
16 |
-
model which is derived from OpenAI's
|
17 |
([Databricks' "Dolly 15k" Dataset](https://huggingface.co/datasets/aisquared/databricks-dolly-15k)) to help it exhibit chat-based capabilities.
|
18 |
|
19 |
Just like [Databricks' Dolly V2 models](https://www.databricks.com/blog/2023/04/12/dolly-first-open-commercially-viable-instruction-tuned-llm),
|
|
|
13 |
<!-- Provide a quick summary of what the model is/does. -->
|
14 |
|
15 |
AI Squared's `dlite-v2-355m` is a large language
|
16 |
+
model which is derived from OpenAI's medium [GPT-2](https://huggingface.co/gpt2-medium) model and fine-tuned on a single GPU on a corpus of 15k records
|
17 |
([Databricks' "Dolly 15k" Dataset](https://huggingface.co/datasets/aisquared/databricks-dolly-15k)) to help it exhibit chat-based capabilities.
|
18 |
|
19 |
Just like [Databricks' Dolly V2 models](https://www.databricks.com/blog/2023/04/12/dolly-first-open-commercially-viable-instruction-tuned-llm),
|