|
--- |
|
tags: |
|
- computer_vision |
|
- pose_estimation |
|
- animal_pose_estimation |
|
- deeplabcut |
|
pipeline_tag: keypoint-detection |
|
--- |
|
# MODEL CARD: SuperAnimal-TopViewMouse
|
|
|
## Model Details |
|
|
|
• SuperAnimal-TopViewMouse model developed by the [M.W. Mathis Lab](http://www.mackenziemathislab.org/) in 2023,
trained to predict mouse pose from top-view images.
Please see [Shaokai Ye et al. 2023](https://arxiv.org/abs/2203.07436) for details.
|
|
|
• There are three models:

- `pose_model.pth` is an HRNet-w32, compatible with the DLC 3.0+ PyTorch codebase, trained on our TopViewMouse-5K dataset.

- `detector.pt` is a Faster R-CNN that can be used as the detector for top-down pose estimation.

- `DLC_ma_supertopview5k_resnet_50_iteration-0_shuffle-1.tar.gz` is a DLCRNet trained on our TopViewMouse-5K dataset.
|
|
|
|
|
• Full training details can be found in Ye et al. 2023.
You can use this model simply with our lightweight loading package, [DLClibrary](https://github.com/DeepLabCut/DLClibrary).
Here is an example usage:
|
|
|
```python |
|
from pathlib import Path |
|
from dlclibrary import download_huggingface_model |
|
|
|
# Creates a folder and downloads the model to it
model_dir = Path("./superanimal_topviewmouse_model")
model_dir.mkdir(exist_ok=True)
download_huggingface_model("superanimal_topviewmouse", model_dir)
|
``` |
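
After running this, `model_dir` contains the files described above (`pose_model.pth`, `detector.pt`, and the DLCRNet archive). As a minimal sketch, assuming that download layout, the PyTorch checkpoints can be opened directly to inspect their contents; the internal key structure of each file is an assumption here and may differ between releases:

```python
import torch

# Minimal sketch: peek inside the downloaded checkpoints.
# Paths assume the folder created in the snippet above; the internal
# key layout of each checkpoint is an assumption and may differ by release.
pose_ckpt = torch.load("superanimal_topviewmouse_model/pose_model.pth", map_location="cpu")
detector_ckpt = torch.load("superanimal_topviewmouse_model/detector.pt", map_location="cpu")

# Checkpoints are typically dictionaries; the top-level keys reveal whether the
# weights sit at the top level or under an entry such as "model".
for name, ckpt in [("pose_model.pth", pose_ckpt), ("detector.pt", detector_ckpt)]:
    keys = list(ckpt.keys())[:5] if isinstance(ckpt, dict) else type(ckpt)
    print(name, keys)
```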
|
|
|
## Intended Use |
|
|
|
• Intended to be used for pose tracking of laboratory mouse videos filmed from an overhead (top) view. The models can be used as a
plug-and-play solution if extremely high precision is not required (we benchmark the zero-shot performance in the paper; see the
sketch after this list). Otherwise, we recommend using them as pretrained weights for transfer learning and fine-tuning.
|
|
|
• Intended for academic and research professionals working in fields related to animal behavior, neuroscience, biomechanics, and |
|
ecology. |
|
|
|
• Not suitable for other species or other camera views, nor for videos that look dramatically different from those shown in the
paper.
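
For plug-and-play inference, DeepLabCut exposes a SuperAnimal entry point, `deeplabcut.video_inference_superanimal`, which runs the detector plus pose model on a video without any labeling. The sketch below is a hedged example: the argument names (`model_name`, `detector_name`, `video_adapt`) follow the DeepLabCut 3.0 documentation and may differ between versions, so check `help(deeplabcut.video_inference_superanimal)` on your install.

```python
import deeplabcut

# Hedged sketch of zero-shot inference with SuperAnimal-TopViewMouse.
# Argument names follow DeepLabCut 3.0 conventions and may vary by version.
videos = ["/path/to/topview_mouse_video.mp4"]  # hypothetical video path

deeplabcut.video_inference_superanimal(
    videos,
    superanimal_name="superanimal_topviewmouse",
    model_name="hrnet_w32",                      # pose model (HRNet-w32, see Model Details)
    detector_name="fasterrcnn_resnet50_fpn_v2",  # top-down Faster R-CNN detector
    video_adapt=False,  # set True to try video adaptation (Ye et al. 2023)
)
```

Prediction files are written alongside the video and can then be inspected or visualized with the usual DeepLabCut tools.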
|
|
|
## Factors |
|
|
|
• Based on the known robustness issues of neural networks, the relevant factors include the lighting, contrast, and resolution of the
video frames. The presence of objects might also cause false detections of mice and keypoints. When two or more animals are
extremely close, the top-down detector may detect only one animal if used without further fine-tuning.
|
|
|
|
|
## Metrics |
|
• Mean Average Precision (mAP) |
|
|
|
• Root Mean Square Error (RMSE) |
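
As a small illustration (not the evaluation code from the paper), keypoint RMSE can be computed as the square root of the mean squared Euclidean distance between predicted and ground-truth keypoints:

```python
import numpy as np

# Illustrative keypoint RMSE (not the paper's evaluation pipeline):
# each row is one keypoint's (x, y) position in pixels.
pred = np.array([[10.0, 12.0], [30.5, 40.0], [55.0, 61.0]])
gt   = np.array([[11.0, 12.5], [29.0, 41.0], [54.0, 60.0]])

rmse = np.sqrt(np.mean(np.sum((pred - gt) ** 2, axis=1)))
print(f"RMSE: {rmse:.2f} pixels")
```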
|
|
|
## Evaluation Data |
|
|
|
• The test split of TopViewMouse-5K, plus two benchmarks reported in the paper: DLC Openfield and TriMouse.
|
|
|
|
|
## Training Data |
|
|
|
The model was trained jointly on the following datasets:
|
|
|
- **3CSI, BM, EPM, LDB, OFT** See full details at (1) and in (2). |
|
|
|
- **BlackMice** See full details at (3). |
|
|
|
- **WhiteMice** Courtesy of Prof. Sam Golden and Nastacia Goodwin. See details in SIMBA (4).

- **TriMouse** See full details at (5).
|
|
|
- **DLC-Openfield** See full details at (6). |
|
|
|
- **Kiehn-Lab-Openfield, Swimming, and treadmill** Courtesy of Prof. Ole |
|
Kiehn, Dr. Jared Cregg, and Prof. Carmelo Bellardita; see details at (7). |
|
|
|
- **MausHaus** We collected video data from five single-housed C57BL/6J male and female mice in an extended home cage, carried out
in the laboratory of Mackenzie Mathis at Harvard University and at EPFL (housing temperature 20-25°C, humidity 20-50%). Data were
recorded at 30 Hz at a resolution of 640 × 480 pixels, acquired with White Matter, LLC eV cameras. Annotators localized 26 keypoints
across 322 frames sampled within DeepLabCut using the k-means clustering approach (8). All experimental procedures for mice were in
accordance with the National Institutes of Health Guide for the Care and Use of Laboratory Animals and approved by the Harvard
Institutional Animal Care and Use Committee (IACUC) (n=1 mouse), and by the Veterinary Office of the Canton of Geneva (Switzerland;
license GE01) (n=4 mice).
|
|
|
Here is an image with examples from the datasets, the distribution of images per dataset, and the keypoint guide. |
|
|
|
<p align="center"> |
|
<img src="https://images.squarespace-cdn.com/content/v1/57f6d51c9f74566f55ecf271/1690986892069-I1DP3EQU14DSP5WB6FSI/modelcard-TVM.png?format=1500w" width="95%"> |
|
</p> |
|
|
|
## Ethical Considerations |
|
|
|
• Data were collected with IACUC or other governmental approval. Each individual dataset used in training reports the ethics
approvals obtained.
|
|
|
## Caveats and Recommendations |
|
|
|
• The model may have reduced accuracy in scenarios with extremely varied lighting conditions or atypical mouse characteristics not
well represented in the training data. For example, the dataset contains only one set of white mice, so the model may not generalize
well to diverse settings involving white lab mice.
|
|
|
• Please note that each training dataset was labeled by separate labs and different individuals; therefore, while we map keypoint
names to a unified pose vocabulary, there will be annotator bias in keypoint placement (see Ye et al. 2023 for our Supplementary
Note on annotator bias).
|
|
|
• Note the dataset primarily uses C57BL/6J mice, with only some CD1 examples.
|
|
|
• If performance is not as good as you need, we recommend first trying video adaptation (see Ye et al. 2023), or fine-tuning these
weights with your own labeled data.
|
|
|
## License |
|
|
|
Modified MIT. |
|
|
|
Copyright 2023 by Mackenzie Mathis, Shaokai Ye, and contributors. |
|
|
|
Permission is hereby granted to you (hereafter "LICENSEE") a fully-paid, non-exclusive, |
|
and non-transferable license for academic, non-commercial purposes only (hereafter “LICENSE”) |
|
to use the "MODEL" weights (hereafter "MODEL"), subject to the following conditions: |
|
|
|
The above copyright notice and this permission notice shall be included in all copies or substantial
portions of the Software.
|
|
|
This software may not be used to harm any animal deliberately. |
|
|
|
LICENSEE acknowledges that the MODEL is a research tool. |
|
THE MODEL IS PROVIDED “AS IS”, WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING |
|
BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. |
|
IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, |
|
WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE MODEL |
|
OR THE USE OR OTHER DEALINGS IN THE MODEL. |
|
|
|
If this license is not appropriate for your application, please contact Prof. Mackenzie W. Mathis |
|
(mackenzie@post.harvard.edu) and/or the TTO office at EPFL (tto@epfl.ch) for a commercial use license. |
|
|
|
Please cite **Ye et al. 2023** if you use this model in your work: https://arxiv.org/abs/2203.07436v2.
|
|
|
## References |
|
|
|
1. Oliver Sturman, Lukas von Ziegler, Christa Schläppi, Furkan Akyol, Mattia Privitera, Daria Slominski, Christina Grimm, Laetitia Thieren, Valerio |
|
Zerbi, Benjamin Grewe, et al. Deep learning-based behavioral analysis reaches human accuracy and is capable of outperforming commercial |
|
solutions. Neuropsychopharmacology, 45(11):1942–1952, 2020. |
|
2. Lukas von Ziegler, Oliver Sturman, and Johannes Bohacek. Videos for deeplabcut, noldus ethovision X14 and TSE multi conditioning systems |
|
comparisons. https://doi.org/10.5281/zenodo.3608658. Zenodo, January 2020. |
|
3. Isaac Chang. Trained DeepLabCut model for tracking mouse in open field arena with topdown view. https://doi.org/10.5281/zenodo.3955216. |
|
Zenodo, July 2020. |
|
4. Simon RO Nilsson, Nastacia L. Goodwin, Jia Jie Choong, Sophia Hwang, Hayden R Wright, Zane C Norville, Xiaoyu Tong, Dayu Lin, Bran- |
|
don S. Bentzley, Neir Eshel, Ryan J McLaughlin, and Sam A. Golden. Simple behavioral analysis (simba) – an open source toolkit for computer |
|
classification of complex social behaviors in experimental animals. bioRxiv, 2020. |
|
5. Jessy Lauer, Mu Zhou, Shaokai Ye, William Menegas, Steffen Schneider, Tanmay Nath, Mohammed Mostafizur Rahman, Valentina Di Santo, |
|
Daniel Soberanes, Guoping Feng, Venkatesh N. Murthy, George Lauder, Catherine Dulac, Mackenzie W. Mathis, and Alexander Mathis. Multi- |
|
animal pose estimation, identification and tracking with deeplabcut. Nature Methods, 19:496 – 504, 2022. |
|
6. Alexander Mathis, Pranav Mamidanna, Kevin M Cury, Taiga Abe, Venkatesh N Murthy, Mackenzie Weygandt Mathis, and Matthias Bethge. Deeplab- |
|
cut: markerless pose estimation of user-defined body parts with deep learning. Nature neuroscience, 21:1281–1289, 2018. |
|
7. Jared M. Cregg, Roberto Leiras, Alexia Montalant, Paulina Wanken, Ian R. Wickersham, and Ole Kiehn. Brainstem neurons that command
mammalian locomotor asymmetries. Nature neuroscience, 23:730 – 740, 2020.
8. Tanmay Nath, Alexander Mathis, An Chi Chen, Amin Patel, Matthias Bethge, and Mackenzie Weygandt Mathis. Using deeplabcut for 3d
markerless pose estimation across species and behaviors. Nature Protocols, 14:2152–2176, 2019.