Instructions to use TheVortexProject/insectnet with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Scikit-learn
How to use TheVortexProject/insectnet with Scikit-learn:
from huggingface_hub import hf_hub_download import joblib model = joblib.load( hf_hub_download("TheVortexProject/insectnet", "sklearn_model.joblib") ) # only load pickle files from sources you trust # read more about it here https://skops.readthedocs.io/en/stable/persistence.html - Notebooks
- Google Colab
- Kaggle
File size: 2,955 Bytes
0e7b80b | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 | # Field Validation
InsectNet field validation results from Pine Hollow, Tennessee (35.8565, -83.3744).
All detections are human-confirmed unless marked as playback.
## Confirmed Natural Detections
| Date | Class | Conf. | Species | Method |
|------|-------|-------|---------|--------|
| May 30 | cicada_drone | 83% | Neotibicen/Megatibicen | User confirmed by ear |
| May 30 | frog | 51% | Cope's Gray Treefrog | User heard outside + WAV confirmed |
| May 30 | frog | 80-99.97% | Cope's Gray Treefrog + Eastern Narrow-mouthed Toad | User confirmed chorus, two species |
| May 31 | cricket_katydid | 99% | Field cricket | User confirmed by ear |
## Confirmed Playback Detections
| Date | Class | Conf. | Species | Notes |
|------|-------|-------|---------|-------|
| May 29 | cicada_drone | 100% | Neotibicen lyricen | Phone playback at mic |
| May 29 | frog | 64.5% | American Toad | Phone playback, BirdNET cross-validated |
| May 29 | frog | 99.2% | Gray Treefrog (Dryophytes spp.) | Phone playback |
| May 29 | cricket_katydid | 100% | Gryllus campestris | Phone playback; BirdNET misidentified as G. fultoni |
## Known False Positives
| Source | Class Triggered | Confidence | Notes |
|--------|----------------|------------|-------|
| AC window unit | cicada_drone | Up to 92.3% | Background <2% — very confident wrong answer |
| Weed whacker | bee | 98.1% | User confirmed |
| Night ambient noise | bee | 50-70% | Temporal filter needed — bees don't fly at night |
## Key Validations
### Frog Chorus (May 30)
The first sustained natural capture event. ~440 frog detections over 2.5 hours
(21:00-23:30). Two clear phases:
1. **Early evening (17:25-18:35):** Individual frogs at 55-65% confidence,
RMS 0.003-0.008
2. **Chorus peak (21:00-23:00):** Sustained 80-99.97% confidence,
RMS 0.01-0.08
Time-resolved BirdNET logit analysis identified two species calling
simultaneously: Cope's Gray Treefrog (dominant, throughout) and Eastern
Narrow-mouthed Toad (secondary, second half).
### First Natural Cicada (May 30, 06:59)
After ~14 hours running unattended through overnight rain, the sidecar
captured a cicada at 83% confidence (RMS 0.009). The user confirmed it
sounded like a genuine cicada. Cosine similarity against training centroids
matched Neotibicen/Megatibicen (0.986).
### First Low-Confidence Frog (May 30, 20:08)
A frog detected at only 51% confidence (RMS 0.004) was confirmed real —
the user heard the frog near the mic and confirmed the WAV. This invalidated
the hypothesis that real detections always clear 80% and established
class-dependent thresholds.
## Validation vs Testing
- **Testing** confirms the pipeline works: inotify fires, WAVs get processed,
files end up in the right places. Confidence scores are not evidence.
- **Validation** requires human listening and explicit species identification.
Only validated captures qualify as training data.
All detections listed above are validated.
|