Update README.md
Browse files
README.md
CHANGED
@@ -80,6 +80,30 @@ To use model for the inference follow the steps bellow:
|
|
80 |
```
|
81 |
|
82 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
83 |
<h3> Usage and License</h3>
|
84 |
|
85 |
Please note that all model weights are exclusively licensed for research purposes. The accompanying dataset is licensed under CC BY 4.0, which permits solely non-commercial usage.
|
@@ -102,4 +126,4 @@ If you use our repository, please cite the following related paper:
|
|
102 |
doi ="10.1039/D4SC00966E",
|
103 |
url ="http://dx.doi.org/10.1039/D4SC00966E",
|
104 |
}
|
105 |
-
```
|
|
|
80 |
```
|
81 |
|
82 |
|
83 |
+
<h4> Usage for Large Model Version </h4>
|
84 |
+
|
85 |
+
To use the large model version for inference, please refer to the <a href="https://github.com/NVIDIA/NeMo" target="_blank">NeMo</a> project.
|
86 |
+
|
87 |
+
The simplest way to use the large version of the model is to run the <a href="https://github.com/NVIDIA/NeMo/blob/main/examples/nlp/language_modeling/megatron_t5_seq2seq_eval.py" target="_blank">megatron_t5_seq2seq_eval.py</a> or <a href="https://github.com/NVIDIA/NeMo/blob/main/examples/nlp/language_modeling/megatron_t5_seq2seq_finetune.py" target="_blank">megatron_t5_seq2seq_finetune.py</a> script.
|
88 |
+
|
89 |
+
Prior to executing the script, prepare the input (prompts) and output (responses) files and set up the config file. The input file should have prompts on each line, which need to be preprocessed using the `add_special_symbols` function mentioned above.
|
90 |
+
|
91 |
+
In the configuration file, the following keys should be set:
|
92 |
+
|
93 |
+
* Set Input and Target Files:
|
94 |
+
Set the `src_file_name` and `tgt_file_name` fields to the files where the input (prompts) and target (responses) data are stored.
|
95 |
+
|
96 |
+
* Specify the Checkpoint Path:
|
97 |
+
Set the `restore_from_path` field to the NeMo checkpoint path.
|
98 |
+
|
99 |
+
* Enable Predictions Writing:
|
100 |
+
Set `write_predictions_to_file` to True.
|
101 |
+
|
102 |
+
* Define Output File Prefix:
|
103 |
+
Alter the `output_file_path_prefix` field to set the output file prefix.
|
104 |
+
|
105 |
+
After completing these steps, run the script to perform inference.
|
106 |
+
|
107 |
<h3> Usage and License</h3>
|
108 |
|
109 |
Please note that all model weights are exclusively licensed for research purposes. The accompanying dataset is licensed under CC BY 4.0, which permits solely non-commercial usage.
|
|
|
126 |
doi ="10.1039/D4SC00966E",
|
127 |
url ="http://dx.doi.org/10.1039/D4SC00966E",
|
128 |
}
|
129 |
+
```
|