|
--- |
|
tags: |
|
- computer_vision |
|
- pose_estimation |
|
- animal_pose_estimation |
|
- deeplabcut |
|
pipeline_tag: keypoint-detection |
|
--- |
|
# MODEL CARD: SuperAnimal-TopViewMouse
|
|
|
## Model Details |
|
|
|
• SuperAnimal-TopViewMouse model developed by the [M.W. Mathis Lab](http://www.mackenziemathislab.org/) in 2023,
trained to predict mouse pose from top-view images.
Please see [Shaokai Ye et al. 2023](https://arxiv.org/abs/2203.07436) for details.
|
|
|
• There are three models:

- `pose_model.pth` is an HRNet-w32, compatible with the DLC 3.0+ PyTorch codebase, trained on our TopViewMouse-5K dataset.

- `detector.pt` is a Faster R-CNN that can be used as the detector for top-down pose estimation.

- `DLC_ma_supertopview5k_resnet_50_iteration-0_shuffle-1.tar.gz` is a DLCRNet trained on our TopViewMouse-5K dataset.
|
|
|
|
|
• Full training details can be found in Ye et al. 2023.
You can use this model simply with our lightweight loading package, [DLClibrary](https://github.com/DeepLabCut/DLClibrary).
Here is an example usage:
|
|
|
```python |
|
from pathlib import Path |
|
from dlclibrary import download_huggingface_model |
|
|
|
# Creates a folder and downloads the model to it
model_dir = Path("./superanimal_topviewmouse_model")
model_dir.mkdir(exist_ok=True)
download_huggingface_model("superanimal_topviewmouse", model_dir)
|
``` |
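
After running this, `model_dir` contains the files described above (`pose_model.pth`, `detector.pt`, and the DLCRNet archive). As a minimal sketch, assuming that download layout, the PyTorch checkpoints can be opened directly to inspect their contents; the internal key structure of each file is an assumption here and may differ between releases:

```python
import torch

# Minimal sketch: peek inside the downloaded checkpoints.
# Paths assume the folder created in the snippet above; the internal
# key layout of each checkpoint is an assumption and may differ by release.
pose_ckpt = torch.load("superanimal_topviewmouse_model/pose_model.pth", map_location="cpu")
detector_ckpt = torch.load("superanimal_topviewmouse_model/detector.pt", map_location="cpu")

# Checkpoints are typically dictionaries; the top-level keys reveal whether the
# weights sit at the top level or under an entry such as "model".
for name, ckpt in [("pose_model.pth", pose_ckpt), ("detector.pt", detector_ckpt)]:
    keys = list(ckpt.keys())[:5] if isinstance(ckpt, dict) else type(ckpt)
    print(name, keys)
```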
|
|
|
## Intended Use |
|
|
|
• Intended to be used for pose tracking of laboratory mouse videos filmed from an overhead (top) view. The models can be used as a
plug-and-play solution if extremely high precision is not required (we benchmark the zero-shot performance in the paper; see the
sketch after this list). Otherwise, we recommend using them as pretrained weights for transfer learning and fine-tuning.
|
|
|
• Intended for academic and research professionals working in fields related to animal behavior, neuroscience, biomechanics, and |
|
ecology. |
|
|
|
• Not suitable for other species or other camera views, nor for videos that look dramatically different from those shown in the
paper.
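
For plug-and-play inference, DeepLabCut exposes a SuperAnimal entry point, `deeplabcut.video_inference_superanimal`, which runs the detector plus pose model on a video without any labeling. The sketch below is a hedged example: the argument names (`model_name`, `detector_name`, `video_adapt`) follow the DeepLabCut 3.0 documentation and may differ between versions, so check `help(deeplabcut.video_inference_superanimal)` on your install.

```python
import deeplabcut

# Hedged sketch of zero-shot inference with SuperAnimal-TopViewMouse.
# Argument names follow DeepLabCut 3.0 conventions and may vary by version.
videos = ["/path/to/topview_mouse_video.mp4"]  # hypothetical video path

deeplabcut.video_inference_superanimal(
    videos,
    superanimal_name="superanimal_topviewmouse",
    model_name="hrnet_w32",                      # pose model (HRNet-w32, see Model Details)
    detector_name="fasterrcnn_resnet50_fpn_v2",  # top-down Faster R-CNN detector
    video_adapt=False,  # set True to try video adaptation (Ye et al. 2023)
)
```

Prediction files are written alongside the video and can then be inspected or visualized with the usual DeepLabCut tools.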
|
|
|
## Factors |
|
|
|
• Based on the known robustness issues of neural networks, the relevant factors include the lighting, contrast, and resolution of the
video frames. The presence of objects might also cause false detections of mice and keypoints. When two or more animals are
extremely close, the top-down detector may detect only one animal if used without further fine-tuning.
|
|
|
|
|
## Metrics |
|
• Mean Average Precision (mAP) |
|
|
|
• Root Mean Square Error (RMSE) |
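
As a small illustration (not the evaluation code from the paper), keypoint RMSE can be computed as the square root of the mean squared Euclidean distance between predicted and ground-truth keypoints:

```python
import numpy as np

# Illustrative keypoint RMSE (not the paper's evaluation pipeline):
# each row is one keypoint's (x, y) position in pixels.
pred = np.array([[10.0, 12.0], [30.5, 40.0], [55.0, 61.0]])
gt   = np.array([[11.0, 12.5], [29.0, 41.0], [54.0, 60.0]])

rmse = np.sqrt(np.mean(np.sum((pred - gt) ** 2, axis=1)))
print(f"RMSE: {rmse:.2f} pixels")
```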
|
|
|
## Evaluation Data |
|
|
|
• The test split of TopViewMouse-5K, plus two benchmarks reported in the paper: DLC Openfield and TriMouse.
|
|
|
|
|
## Training Data |
|
|
|
The model was trained jointly on the following datasets:
|
|
|
- **3CSI, BM, EPM, LDB, OFT** See full details at (1) and in (2). |
|
|
|
- **BlackMice** See full details at (3). |
|
|
|
- **WhiteMice** Courtesy of Prof. Sam Golden and Nastacia Goodwin. See details in SIMBA (4).

- **TriMouse** See full details at (5).
|
|
|
- **DLC-Openfield** See full details at (6). |
|
|
|
- **Kiehn-Lab-Openfield, Swimming, and treadmill** Courtesy of Prof. Ole |
|
Kiehn, Dr. Jared Cregg, and Prof. Carmelo Bellardita; see details at (7). |
|
|
|
- **MausHaus** We collected video data from five single-housed C57BL/6J male and female mice in an extended home cage, carried out
in the laboratory of Mackenzie Mathis at Harvard University and at EPFL (housing temperature 20-25°C, humidity 20-50%). Data were
recorded at 30 Hz at a resolution of 640 × 480 pixels, acquired with White Matter, LLC eV cameras. Annotators localized 26 keypoints
across 322 frames sampled within DeepLabCut using the k-means clustering approach (8). All experimental procedures for mice were in
accordance with the National Institutes of Health Guide for the Care and Use of Laboratory Animals and approved by the Harvard
Institutional Animal Care and Use Committee (IACUC) (n=1 mouse), and by the Veterinary Office of the Canton of Geneva (Switzerland;
license GE01) (n=4 mice).
|
|
|
Here is an image with examples from the datasets, the distribution of images per dataset, and the keypoint guide. |
|
|
|
<p align="center"> |
|
<img src="https://images.squarespace-cdn.com/content/v1/57f6d51c9f74566f55ecf271/1690986892069-I1DP3EQU14DSP5WB6FSI/modelcard-TVM.png?format=1500w" width="95%"> |
|
</p> |
|
|
|
## Ethical Considerations |
|
|
|
• Data were collected with IACUC or other governmental approval. Each individual dataset used in training reports the ethics
approvals obtained.
|
|
|
## Caveats and Recommendations |
|
|
|
• The model may have reduced accuracy in scenarios with extremely varied lighting conditions or atypical mouse characteristics not
well represented in the training data. For example, the dataset contains only one set of white mice, so the model may not generalize
well to diverse settings involving white lab mice.
|
|
|
• Please note that each training dataset was labeled by separate labs and different individuals; therefore, while we map keypoint
names to a unified pose vocabulary, there will be annotator bias in keypoint placement (see Ye et al. 2023 for our Supplementary
Note on annotator bias).
|
|
|
• Note the dataset primarily uses C57BL/6J mice, with only some CD1 examples.
|
|
|
• If performance is not as good as you need, we recommend first trying video adaptation (see Ye et al. 2023), or fine-tuning these
weights with your own labeled data.
|
|
|
## License |
|
|
|
Modified MIT. |
|
|
|
Copyright 2023 by Mackenzie Mathis, Shaokai Ye, and contributors. |
|
|
|
Permission is hereby granted to you (hereafter "LICENSEE") a fully-paid, non-exclusive, |
|
and non-transferable license for academic, non-commercial purposes only (hereafter “LICENSE”) |
|
to use the "MODEL" weights (hereafter "MODEL"), subject to the following conditions: |
|
|
|
The above copyright notice and this permission notice shall be included in all copies or substantial
portions of the Software.
|
|
|
This software may not be used to harm any animal deliberately. |
|
|
|
LICENSEE acknowledges that the MODEL is a research tool. |
|
THE MODEL IS PROVIDED “AS IS”, WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING |
|
BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. |
|
IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, |
|
WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE MODEL |
|
OR THE USE OR OTHER DEALINGS IN THE MODEL. |
|
|
|
If this license is not appropriate for your application, please contact Prof. Mackenzie W. Mathis |
|
(mackenzie@post.harvard.edu) and/or the TTO office at EPFL (tto@epfl.ch) for a commercial use license. |
|
|
|
Please cite **Ye et al. 2023** if you use this model in your work: https://arxiv.org/abs/2203.07436v2.
|
|
|
## References |
|
|
|
1. Oliver Sturman, Lukas von Ziegler, Christa Schläppi, Furkan Akyol, Mattia Privitera, Daria Slominski, Christina Grimm, Laetitia Thieren, Valerio |
|
Zerbi, Benjamin Grewe, et al. Deep learning-based behavioral analysis reaches human accuracy and is capable of outperforming commercial |
|
solutions. Neuropsychopharmacology, 45(11):1942–1952, 2020. |
|
2. Lukas von Ziegler, Oliver Sturman, and Johannes Bohacek. Videos for deeplabcut, noldus ethovision X14 and TSE multi conditioning systems |
|
comparisons. https://doi.org/10.5281/zenodo.3608658. Zenodo, January 2020. |
|
3. Isaac Chang. Trained DeepLabCut model for tracking mouse in open field arena with topdown view. https://doi.org/10.5281/zenodo.3955216. |
|
Zenodo, July 2020. |
|
4. Simon RO Nilsson, Nastacia L. Goodwin, Jia Jie Choong, Sophia Hwang, Hayden R Wright, Zane C Norville, Xiaoyu Tong, Dayu Lin, Bran- |
|
don S. Bentzley, Neir Eshel, Ryan J McLaughlin, and Sam A. Golden. Simple behavioral analysis (simba) – an open source toolkit for computer |
|
classification of complex social behaviors in experimental animals. bioRxiv, 2020. |
|
5. Jessy Lauer, Mu Zhou, Shaokai Ye, William Menegas, Steffen Schneider, Tanmay Nath, Mohammed Mostafizur Rahman, Valentina Di Santo, |
|
Daniel Soberanes, Guoping Feng, Venkatesh N. Murthy, George Lauder, Catherine Dulac, Mackenzie W. Mathis, and Alexander Mathis. Multi- |
|
animal pose estimation, identification and tracking with deeplabcut. Nature Methods, 19:496 – 504, 2022. |
|
6. Alexander Mathis, Pranav Mamidanna, Kevin M Cury, Taiga Abe, Venkatesh N Murthy, Mackenzie Weygandt Mathis, and Matthias Bethge. Deeplab- |
|
cut: markerless pose estimation of user-defined body parts with deep learning. Nature neuroscience, 21:1281–1289, 2018. |
|
7. Jared M. Cregg, Roberto Leiras, Alexia Montalant, Paulina Wanken, Ian R. Wickersham, and Ole Kiehn. Brainstem neurons that command
mammalian locomotor asymmetries. Nature neuroscience, 23:730 – 740, 2020.
8. Tanmay Nath, Alexander Mathis, An Chi Chen, Amin Patel, Matthias Bethge, and Mackenzie Weygandt Mathis. Using deeplabcut for 3d
markerless pose estimation across species and behaviors. Nature Protocols, 14:2152–2176, 2019.