`inference_settings` contains a default YAML file with four keys:
- `sequences_path`: The full path of the file containing the sequences whose annotations are to be inferred.
- `model_path`: The full path of the model to be used for inference.
- `model_name`: A model name indicating the inputs the model expects. One of `seq`, `seq-seq`, `seq-struct`, `seq-reverse` or `baseline`.
- `infere_original_testset`: True/False, indicating whether inference should be computed on the original test set.
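
As a rough illustration of the four keys above, the following sketch checks a settings dictionary before it is used for inference. It is not part of the repository: the function name `validate_inference_settings` and the example paths are hypothetical, and the settings are written as a plain Python dict rather than loaded from the YAML file.

```python
# Allowed values for `model_name`, as listed in this readme.
ALLOWED_MODEL_NAMES = {"seq", "seq-seq", "seq-struct", "seq-reverse", "baseline"}
# The four keys the default inference settings are described as containing.
REQUIRED_KEYS = {"sequences_path", "model_path", "model_name", "infere_original_testset"}


def validate_inference_settings(settings: dict) -> list:
    """Hypothetical helper: return a list of problems (empty if none)."""
    problems = []
    # Report every required key that is absent, in a stable order.
    for key in sorted(REQUIRED_KEYS - settings.keys()):
        problems.append(f"missing key: {key}")
    # `model_name` must be one of the documented options.
    name = settings.get("model_name")
    if name is not None and name not in ALLOWED_MODEL_NAMES:
        problems.append(f"unknown model_name: {name}")
    # `infere_original_testset` is documented as True/False.
    flag = settings.get("infere_original_testset")
    if flag is not None and not isinstance(flag, bool):
        problems.append("infere_original_testset must be True or False")
    return problems


# Example settings; both paths are placeholders, not real files.
example_settings = {
    "sequences_path": "/path/to/sequences.csv",
    "model_path": "/path/to/model.pt",
    "model_name": "seq-struct",
    "infere_original_testset": False,
}
print(validate_inference_settings(example_settings))
```

Running a check like this before inference catches a misspelled `model_name` early, instead of failing later when the model is loaded.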

`model` contains the skeleton of the model used, the optimizer, the loss function and the device. All models are built using [skorch](https://skorch.readthedocs.io/en/latest/?badge=latest).

`train_model_configs` contains the hyperparameters for each dataset (tcga, sncrna and premirna):
- Each file contains the model config and the train config.
- Model config: contains the model hyperparameters and the sequence tokenization scheme, and allows for choosing the model.
- Train config: contains training settings such as the learning rate hyperparameters as well as `dataset_path_train`.
- `dataset_path_train`: should point to the dataset ([AnnData](https://anndata.readthedocs.io/en/latest/)) used for training.
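
Since training depends on `dataset_path_train` pointing to an existing file, a small pre-flight check can be useful. The sketch below is an assumption-laden illustration, not repository code: `check_train_config` is a hypothetical helper, and the train config is assumed to be available as a plain dict.

```python
from pathlib import Path


def check_train_config(train_config: dict) -> Path:
    """Hypothetical helper: return the training dataset path if it is usable."""
    raw = train_config.get("dataset_path_train")
    # The key must be present and non-empty.
    if not raw:
        raise ValueError("dataset_path_train is not set")
    path = Path(raw)
    # The path must point to an existing file on disk.
    if not path.is_file():
        raise FileNotFoundError(f"training dataset not found: {path}")
    return path


# Usage (the path is a placeholder):
# dataset_path = check_train_config({"dataset_path_train": "/path/to/train_data.h5ad"})
```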