Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
ctheodoris
/
Geneformer
like
208
Fill-Mask
Transformers
Safetensors
ctheodoris/Genecorpus-30M
bert
single-cell
genomics
Inference Endpoints
License:
apache-2.0
Model card
Files
Files and versions
Community
464
Train
Deploy
Use this model
main
Geneformer
/
geneformer
15 contributors
History:
129 commits
ctheodoris
Add checks for custom attributes and n_counts prior to sum ensembl id (
#461
)
09de197
verified
17 days ago
gene_dictionaries_30m
Update geneformer/tokenizer.py (#415)
3 months ago
mtl
Raise error for train and validation mismatch (#459)
17 days ago
__init__.py
Safe
1.22 kB
precommit formatting
4 months ago
classifier.py
Safe
64.7 kB
Update trainer output dir (#427)
3 months ago
classifier_utils.py
Safe
23.3 kB
precommit formatting
4 months ago
collator_for_classification.py
Safe
31.2 kB
Refactor: Convert mask_token_id, pad_token_id, and all_special_ids to properties (#395)
4 months ago
emb_extractor.py
Safe
32.3 kB
Update geneformer/emb_extractor.py (#453)
29 days ago
ensembl_mapping_dict_gc95M.pkl
Safe
pickle
Pickle imports
No problematic imports detected
What is a pickle import?
3.96 MB
LFS
Add function for summing of Ensembl IDs (#377)
4 months ago
evaluation_utils.py
Safe
9.76 kB
move dict loading to function in eval utils
4 months ago
gene_median_dictionary_gc95M.pkl
pickle
Detected Pickle imports (2)
"numpy.core.multiarray.scalar"
,
"numpy.dtype"
How to fix it?
1.51 MB
LFS
Add function for summing of Ensembl IDs (#377)
4 months ago
gene_name_id_dict_gc95M.pkl
Safe
pickle
Pickle imports
No problematic imports detected
What is a pickle import?
2.04 MB
LFS
rename for consistency
4 months ago
in_silico_perturber.py
Safe
66.4 kB
Upload in_silico_perturber.py (#432)
18 days ago
in_silico_perturber_stats.py
Safe
45.1 kB
update function for N_Detections for mixture_model without anchor_token
about 2 months ago
mtl_classifier.py
Safe
13.7 kB
edit docs formatting
4 months ago
perturber_utils.py
Safe
32.4 kB
CUDA kernels incompatible with standard PyTorch device movement with 4bit/8bit, necessitating device-specific handling (#416)
3 months ago
pretrainer.py
Safe
29.5 kB
remove unused imports while no longer using distributed sampler
17 days ago
token_dictionary_gc95M.pkl
Safe
pickle
Pickle imports
No problematic imports detected
What is a pickle import?
426 kB
LFS
update with 12L and 20L i4096 gc95M models, multitask and quantiz code
4 months ago
tokenizer.py
Safe
29.5 kB
Add checks for custom attributes and n_counts prior to sum ensembl id (#461)
17 days ago