Add checks for custom attributes and n_counts prior to sum ensembl id da2aaea verified hchen725 committed on 30 days ago
further emphasize different token dictionaries in examples e2ee685 Christina Theodoris committed on about 1 month ago
remove unused imports while no longer using distributed sampler b9028ba verified ctheodoris committed on about 1 month ago
Raise error for train and validation mismatch (#459) 9af94b8 verified ctheodoris madhavanvenkatesh committed on about 1 month ago
Upload in_silico_perturber.py (#432) cb89107 verified ctheodoris davidjwen committed on about 1 month ago
Update geneformer/emb_extractor.py (#453) ebc1e09 verified ctheodoris hchen725 committed on Nov 22, 2024
update transformers version to match pretrainer using accelerate 3d62bb9 verified ctheodoris committed on Nov 7, 2024
update pretrainer to not use distributed sampler (Trainer uses accelerate) 8140c51 verified ctheodoris committed on Nov 7, 2024
update function for N_Detections for mixture_model without anchor_token df297bc Christina Theodoris committed on Oct 25, 2024
add check to ensure emb_label is None for getting state embs dict 39b4444 Christina Theodoris committed on Oct 23, 2024
Create multi-task_cell_classification.ipynb (#418) b138ba5 verified ctheodoris madhavanvenkatesh committed on Sep 13, 2024
CUDA kernels incompatible with standard PyTorch device movement with 4bit/8bit, necessitating device-specific handling (#416) b6d28c3 verified ctheodoris madhavanvenkatesh committed on Sep 10, 2024
Add find_packages to automatically include mtl and subpackages in setup.py (#410) 3b78204 verified ctheodoris madhavanvenkatesh committed on Sep 5, 2024
adding tensorboard to setup.py (#409) 94d98d1 verified ctheodoris madhavanvenkatesh committed on Sep 5, 2024
dictionaries from parent dir (#405) 85f295e verified ctheodoris madhavanvenkatesh committed on Sep 3, 2024
remove token dictionary and unpickling from init (#403) 7eca269 verified ctheodoris madhavanvenkatesh committed on Sep 3, 2024
update tokenizer to defaults for 95M models for special token and input size da8cf3d verified ctheodoris committed on Sep 2, 2024
update instructions to include reminder about token dictionary cb1b0d5 verified ctheodoris committed on Sep 2, 2024
pointing dictionaries from the mtl module's init (#397) 7470753 verified ctheodoris madhavanvenkatesh committed on Aug 28, 2024
Refactored token dictionary loading and encapsulated dictionary (#398) beb62a4 verified ctheodoris madhavanvenkatesh committed on Aug 28, 2024
Refactor: Convert mask_token_id, pad_token_id, and all_special_ids to properties (#395) 2e06f1a verified ctheodoris madhavanvenkatesh committed on Aug 28, 2024
sync token_dictionary variable name w/ classifier a021deb verified ctheodoris committed on Aug 26, 2024
update setup with req and manifest with updated filenames a34fbc2 ctheodoris committed on Aug 21, 2024
allow model_type valid options to take params model_type : {"Pretrained", "GeneClassifier", "CellClassifier", "MTLCellClassifier", "MTLCellClassifier-Quantized"} (#390) 47e0ef8 verified ctheodoris madhavanvenkatesh committed on Aug 21, 2024
"save_model_without_heads" is redundant (#385) de10ab0 verified ctheodoris madhavanvenkatesh committed on Aug 21, 2024
comment out "def save_model_without_heads(original_model_save_directory)"; redundant for ISP/Emb extractor (#382) 22bf20f verified ctheodoris madhavanvenkatesh committed on Aug 21, 2024
fixed bug related to dynamic ranges in dictionary with 'min' and 'max' value mismatch in optuna suggest fn (#380) fe1640b verified ctheodoris madhavanvenkatesh committed on Aug 21, 2024
update with 12L and 20L i4096 gc95M models, multitask and quantiz code 933ca80 ctheodoris committed on Aug 15, 2024
update to only have gene names as keys in gene_name_id_dict e61485e verified ctheodoris committed on Aug 11, 2024
Add function for summing of Ensembl IDs (#377) 1e18102 verified ctheodoris hchen725 committed on Aug 11, 2024