File size: 1,236 Bytes
0b11a42
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
The `inference_settings` has a default yaml containing four keys:
  -`sequences_path`: The full path of the file containing the sequences for which their annotations are to be infered.
  - `model_path`: the full path of the model to be used for inference.
  - `model_name`: A model name indicating the inputs the model expects. One of `seq`,`seq-seq`,`seq-struct`,`seq-reverse` or `baseline`
  - `infere_original_testset`: True/False indicating whether inference should be computed on the original test set. 

`model` contains the skeleton of the model used, the optimizer, loss function and device. All models are built using [skorch](https://skorch.readthedocs.io/en/latest/?badge=latest)

`train_model_configs` contain the hyperparameters for each dataset; tcga, sncrna and premirna:

  - Each file contains the model and the train config.
    
    - Model config: contains the model hyperparameters, sequence tokenization scheme and allows for choosing the model. 
    
    - Train config: contains training settings such as the learning rate hyper parameters as well as `dataset_path_train`.
      - `dataset_path_train`: should point to the dataset [(Anndata)](https://anndata.readthedocs.io/en/latest/) used for training.