LofiAmazonSpace / README.md
jennzhuge
readme
2b207de
|
raw
history blame
2 kB
metadata
title: DNA Identifier Tool
emoji: πŸ“‰
colorFrom: green
colorTo: yellow
sdk: gradio
sdk_version: 4.32.1
app_file: app.py
pinned: false

Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference

Welcome to Lofi Amazon Rainforest Beats to Hack/AI to's DNA Identifier Tool.

This tool is intended to help conservationists/biologists identify unmatched eDNA samples or verify known samples by predicting genus from DNA sequences. If unsure, the tool can also visualize the DNA embedding space to help one hypothesize about which species the sequence could belong to.

Genus Prediction

To get started, upload a DNA sequence and the coordinates where you sampled it. (We can easily extend this tool to handle multiple DNA sequences with CSV upload.) Our tool will output the top three most probable genuses that your sample belongs to based on DNA and environmental factors such as elevation, annual precipitation, or human activity levels of the sample location. You can also see the top three most probable genuses based on DNA similarity alone.

DNA Embedding Space Visualization

Prehaps we have a DNA sequence for which the highest genus probability is very low (this could be because scientists have not managed to directly sample any specimens of the genus, so our training dataset, BOLD, doesn't contain any examples), we can still examine the DNA embedding of the sequence in relation to known samples. The t-SNE plots show the embedding space of the top N most common species in the area surrounding the given coordinate. We can see clear group distinctions between species. The following t-SNE plot show how the sample sequence embedding is positioned in the space and identified nearest species clusters.

Downstream Tasks

Potential downstream tasks include:

  • Identifying invasive species.
  • Reclassifying wrongly classified species. for example red panda is called a panda, but it's actually more genetically similar to a raccoon.