Pankaj Mathur committed on
Commit bb961f4
1 Parent(s): ad636d6

Update README.md

Files changed (1)
  1. README.md +8 -6
README.md CHANGED
@@ -9,13 +9,13 @@ library_name: adapter-transformers

 # Dataset

- We train OpenLLaMa-3B model on custom explained tuned Alpaca dataset (~52K) created using approaches from [Orca Research Paper](https://arxiv.org/abs/2306.02707).
+ We trained the [OpenLLaMa-3B model](https://github.com/openlm-research/open_llama) on a custom explain-tuned Alpaca dataset (~52K) created using approaches from the [Orca Research Paper](https://arxiv.org/abs/2306.02707).

 We leverage all of the 15 system instructions provided in [Orca Research Paper](https://arxiv.org/abs/2306.02707) to generate custom Alpaca dataset, in contrast to vanilla instruction tuning approaches used by original [Alpaca research paper](https://crfm.stanford.edu/2023/03/13/alpaca.html).

 This helps student model aka [alpaca_orca_open_llama_3b](psmathur/alpaca_orca_open_llama_3b) to learn ***thought*** process from teacher model, which is ChatGPT (gpt-3.5-turbo-0301 version).

- Please pay attention how the **System** prompt is added before each *instruction* in below example usage.
+ Please note in the example usage below how the **System** prompt is added before each *instruction*.

 # Training
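As an illustration of the explain-tuning approach described in the updated Dataset section above (not the authors' actual data-generation script), the sketch below pairs one Orca-style system instruction with an Alpaca instruction and asks the teacher model (gpt-3.5-turbo-0301) for a detailed, explained answer. The specific system instruction text, the `explain_tune` helper, and the record fields are illustrative assumptions; the client calls use the 2023-era `openai` Python package.

```python
# Illustrative sketch only -- not the authors' actual generation script.
# Assumes the pre-1.0 `openai` package and Alpaca-style records with
# "instruction" / "input" fields.
import openai

openai.api_key = "YOUR_OPENAI_API_KEY"

# One Orca-style system instruction (illustrative; the Orca paper lists the full set).
SYSTEM_INSTRUCTION = (
    "You are an AI assistant. You will be given a task. "
    "You must generate a detailed and long answer."
)

def explain_tune(record: dict) -> str:
    """Query the teacher model for an explained response to one Alpaca record."""
    user_prompt = record["instruction"]
    if record.get("input"):
        user_prompt += "\n\n" + record["input"]
    response = openai.ChatCompletion.create(
        model="gpt-3.5-turbo-0301",
        messages=[
            {"role": "system", "content": SYSTEM_INSTRUCTION},
            {"role": "user", "content": user_prompt},
        ],
    )
    return response["choices"][0]["message"]["content"]

# Replace the original Alpaca output with the teacher's explained answer.
record = {"instruction": "Explain why the sky appears blue.", "input": ""}
record["output"] = explain_tune(record)
```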
 
@@ -23,22 +23,24 @@ The training configurations are provided in the table below.

 The training takes on 4x A600(50G) GPUs and lasts for around 20 Hours for cost of $66 using [Lambda Labs](https://lambdalabs.com)

- We used DeepSpeed with Zero-3 approaches for parallel gpu training.
+ We used DeepSpeed with ZeRO-3 for parallel GPU training, writing our own fine-tuning scripts and leveraging some of the model training code provided by the amazing [OpenAlpaca repo](https://github.com/yxuansu/OpenAlpaca).
+
+ Here are some of the params used during training:

 |||
 |:-------------:|:-------------:|
- |*batch size*|16|
+ |*batch_size*|16|
 |*train_micro_batch_size_per_gpu*|2|
 |*gradient_accumulation_steps*|2|
 |*Learning rate*|2e-5|
- |*Epochs*|3|
 |*Max length*|1024|
+ |*Epochs*|3|
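For reference, the per-GPU batch, gradient accumulation, and learning-rate values in the table above fit together as a DeepSpeed ZeRO Stage 3 setup (2 micro-batch x 2 accumulation steps x 4 GPUs = effective batch size 16). The sketch below is a minimal config consistent with those numbers; it is not the actual config file from this repo (which is not part of this commit), the optimizer and precision choices are assumptions, and max length / epochs are handled by the training script rather than by the DeepSpeed config.

```python
# Minimal DeepSpeed ZeRO-3 config sketch matching the table above.
# Assumptions: AdamW optimizer, bf16 precision, 4 GPUs; not the authors' actual file.
import json

ds_config = {
    # 16 = 2 (micro batch per GPU) x 2 (gradient accumulation steps) x 4 GPUs
    "train_batch_size": 16,
    "train_micro_batch_size_per_gpu": 2,
    "gradient_accumulation_steps": 2,
    "optimizer": {
        "type": "AdamW",            # assumed optimizer
        "params": {"lr": 2e-5},
    },
    "bf16": {"enabled": True},      # assumed mixed-precision setting
    "zero_optimization": {
        "stage": 3,                 # ZeRO Stage 3 parameter/optimizer sharding
        "overlap_comm": True,
        "contiguous_gradients": True,
    },
}

# Written out so it can be passed to the DeepSpeed launcher or a trainer script.
with open("ds_config.json", "w") as f:
    json.dump(ds_config, f, indent=2)
```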
 
 # Example Usage

- Below shows an example on how to use OpenAlpaca
+ Below is an example of how to use [alpaca_orca_open_llama_3b](psmathur/alpaca_orca_open_llama_3b)

 ```python
 import torch
 