jpata's picture
add readme
25cc1a0

Model Card for mlpf-clic-clusters-v1.6

This model reconstructs particles in a detector, based on the tracks and calorimeter clusters recorded by the detector.

Model Details

Model Description

  • Developed by: Joosep Pata, Eric Wulff, Farouk Mokhtar, Mengke Zhang, David Southwick, Maria Girone, David Southwick, Javier Duarte
  • Model type: graph neural network with learnable structure in locality-sensitive hashing bins
  • License: Apache License

Model Sources

Uses

Direct Use

This model may be used to study the physics and computational performance on ML-based reconstruction in simulation.

Out-of-Scope Use

This model is not intended for physics measurements on real data.

Bias, Risks, and Limitations

The model has only been trained on simulation data and has not been validated against real data.

How to Get Started with the Model

Use the code below to get started with the model.

git clone https://github.com/jpata/particleflow/releases/tag/v1.6
cd particleflow

#Download the software image
wget https://hep.kbfi.ee/~joosep/tf-2.14.0.simg

#Download the checkpoint
wget https://huggingface.co/jpata/particleflow/resolve/clic_clusters_v1.6/weights-96-5.346523.hdf5
wget https://huggingface.co/jpata/particleflow/resolve/clic_clusters_v1.6/opt-96-5.346523.pkl

#Launch a shell in the software image
apptainer shell --nv tf-2.14.0.simg

#Continue the training from a checkpoint
python3 mlpf/pipeline.py train --config parameters/clic.yaml --weights weights-96-5.346523.hdf5 --batch-multiplier 0.5

#Run the evaluation for a given training directory, loading the best weight file in the directory
python3 mlpf/pipeline.py evaluate --train-dir experiments/clic-REPLACEME

Training Details

Training Data

Trained on the following dataset: Pata, J., Wulff, E., Duarte, J., Mokhtar, F., Zhang, M., Girone, M., & Southwick, D. (2023). Simulated datasets for detector and particle flow reconstruction: CLIC detector, machine learning format (v1.5.0) [Data set]. Zenodo. https://doi.org/10.5281/zenodo.8409592

Training Procedure

python3 mlpf/pipeline.py train --config parameters/clic.yaml

Evaluation

python3 mlpf/pipeline.py evaluate --train-dir experiments/clic-REPLACEME

Citation

BibTeX:

@misc{pata2023scalable,
      title={Scalable neural network models and terascale datasets for particle-flow reconstruction}, 
      author={Joosep Pata and Eric Wulff and Farouk Mokhtar and David Southwick and Mengke Zhang and Maria Girone and Javier Duarte},
      year={2023},
      eprint={2309.06782},
      archivePrefix={arXiv},
      primaryClass={physics.data-an}
}

Glossary

PF - particle flow reconstruction

Model Card Contact

Joosep Pata, joosep.pata@cern.ch