insectnet / validation.md
TheVortexProject's picture
Initial upload: 6-class BirdNET-logit classifier, model card, docs
0e7b80b verified
|
Raw
History Blame Contribute Delete
2.96 kB
# Field Validation
InsectNet field validation results from Pine Hollow, Tennessee (35.8565, -83.3744).
All detections are human-confirmed unless marked as playback.
## Confirmed Natural Detections
| Date | Class | Conf. | Species | Method |
|------|-------|-------|---------|--------|
| May 30 | cicada_drone | 83% | Neotibicen/Megatibicen | User confirmed by ear |
| May 30 | frog | 51% | Cope's Gray Treefrog | User heard outside + WAV confirmed |
| May 30 | frog | 80-99.97% | Cope's Gray Treefrog + Eastern Narrow-mouthed Toad | User confirmed chorus, two species |
| May 31 | cricket_katydid | 99% | Field cricket | User confirmed by ear |
## Confirmed Playback Detections
| Date | Class | Conf. | Species | Notes |
|------|-------|-------|---------|-------|
| May 29 | cicada_drone | 100% | Neotibicen lyricen | Phone playback at mic |
| May 29 | frog | 64.5% | American Toad | Phone playback, BirdNET cross-validated |
| May 29 | frog | 99.2% | Gray Treefrog (Dryophytes spp.) | Phone playback |
| May 29 | cricket_katydid | 100% | Gryllus campestris | Phone playback; BirdNET misidentified as G. fultoni |
## Known False Positives
| Source | Class Triggered | Confidence | Notes |
|--------|----------------|------------|-------|
| AC window unit | cicada_drone | Up to 92.3% | Background <2% — very confident wrong answer |
| Weed whacker | bee | 98.1% | User confirmed |
| Night ambient noise | bee | 50-70% | Temporal filter needed — bees don't fly at night |
## Key Validations
### Frog Chorus (May 30)
The first sustained natural capture event. ~440 frog detections over 2.5 hours
(21:00-23:30). Two clear phases:
1. **Early evening (17:25-18:35):** Individual frogs at 55-65% confidence,
RMS 0.003-0.008
2. **Chorus peak (21:00-23:00):** Sustained 80-99.97% confidence,
RMS 0.01-0.08
Time-resolved BirdNET logit analysis identified two species calling
simultaneously: Cope's Gray Treefrog (dominant, throughout) and Eastern
Narrow-mouthed Toad (secondary, second half).
### First Natural Cicada (May 30, 06:59)
After ~14 hours running unattended through overnight rain, the sidecar
captured a cicada at 83% confidence (RMS 0.009). The user confirmed it
sounded like a genuine cicada. Cosine similarity against training centroids
matched Neotibicen/Megatibicen (0.986).
### First Low-Confidence Frog (May 30, 20:08)
A frog detected at only 51% confidence (RMS 0.004) was confirmed real —
the user heard the frog near the mic and confirmed the WAV. This invalidated
the hypothesis that real detections always clear 80% and established
class-dependent thresholds.
## Validation vs Testing
- **Testing** confirms the pipeline works: inotify fires, WAVs get processed,
files end up in the right places. Confidence scores are not evidence.
- **Validation** requires human listening and explicit species identification.
Only validated captures qualify as training data.
All detections listed above are validated.