Commit History
further emphasize different token dictionaries in examples
e2ee685
Christina Theodoris
commited on
remove unused imports while no longer using distributed sampler
b9028ba
verified
ctheodoris
commited on
Raise error for train and validation mismatch (#459)
9af94b8
verified
Upload in_silico_perturber.py (#432)
cb89107
verified
Update geneformer/emb_extractor.py (#453)
ebc1e09
verified
Update geneformer/tokenizer.py (#450)
664f71e
verified
update isp default mode to cls
0c99403
verified
ctheodoris
commited on
Adding tags to the model (#448)
3a68669
verified
update transformers version to match pretrainer using accelerate
3d62bb9
verified
ctheodoris
commited on
update pretrainer to not use distributed sampler (Trainer uses accelerate)
8140c51
verified
ctheodoris
commited on
update function for N_Detections for mixture_model without anchor_token
df297bc
Christina Theodoris
commited on
add check to ensure emb_label is None for getting state embs dict
39b4444
Christina Theodoris
commited on
change default emb to cls
9ee140a
verified
ctheodoris
commited on
include install time in model card
21d36cd
verified
ctheodoris
commited on
Update trainer output dir (#427)
dc23c5f
verified
Update geneformer/tokenizer.py (#415)
63275a8
verified
Create multi-task_cell_classification.ipynb (#418)
b138ba5
verified
CUDA kernels incompatible with standard PyTorch device movement with 4bit/8bit, necessitating device-specific handling (#416)
b6d28c3
verified
Add find_packages to automatically include mtl and subpackages in setup.py (#410)
3b78204
verified
adding tensorboard to setup.py (#409)
94d98d1
verified
tensorboard add to reqs (#407)
e8fa43a
verified
dictionaries from parent dir (#405)
85f295e
verified
remove token dictionary and unpickling from init (#403)
7eca269
verified
move dict loading to function in eval utils
57bc17e
ctheodoris
commited on
edit doc formatting
fce3f6e
ctheodoris
commited on
edit docs formatting
ef094b2
ctheodoris
commited on
add mtl_classifier to docs
bedb3b7
ctheodoris
commited on
add input size tip to instructions
2732369
verified
ctheodoris
commited on
update tokenizer to defaults for 95M models for special token and input size
da8cf3d
verified
ctheodoris
commited on
update instructions to include reminder about token dictionary
cb1b0d5
verified
ctheodoris
commited on
pointing dictionaries from the mtl module's init (#397)
7470753
verified
Refactored token dictionary loading and encapsulated dictionary (#398)
beb62a4
verified
Refactor: Convert mask_token_id, pad_token_id, and all_special_ids to properties (#395)
2e06f1a
verified
sync token_dictionary variable name w/ classifier
a021deb
verified
ctheodoris
commited on
update setup with req and manifest with updated filenames
a34fbc2
ctheodoris
commited on
fix imports mtl/eval_utils
eab1878
ctheodoris
commited on
allow model_type valid options to take params model_type : {"Pretrained", "GeneClassifier", "CellClassifier", "MTLCellClassifier", "MTLCellClassifier-Quantized"} (#390)
47e0ef8
verified
peft>=0.11.1 (#387)
4c9dda5
verified
"save_model_without_heads" is redundant (#385)
de10ab0
verified
comment out "def save_model_without_heads(original_model_save_directory)"; redundant for ISP/Emb extractor (#382)
22bf20f
verified
fixed bug related to dynamic ranges in dictionary with 'min' and 'max' value mismatch in optuna suggest fn (#380)
fe1640b
verified
Update README.md
11bcee7
verified
ctheodoris
commited on
precommit formatting
f07bfd7
ctheodoris
commited on
update with 12L and 20L i4096 gc95M models, multitask and quantiz code
933ca80
ctheodoris
commited on
rename for consistency
ec19834
verified
ctheodoris
commited on
delete old gene name dict
817eca2
verified
ctheodoris
commited on
update to only have gene names as keys in gene_name_id_dict
e61485e
verified
ctheodoris
commited on
Add function for summing of Ensembl IDs (#377)
1e18102
verified
save pval
b07f4b1
ctheodoris
commited on