Hello everyone,
I am a hobbyist software developer and nature enthusiast, and while I don’t have a background in ornithology or academic biology, I wanted to share an open-source engineering project I’ve been building over the last year. I'm hoping to get some feedback from the acoustic and conservation experts here!
The project is called Global Birdsong Radio (currently live at wilddetection.net).
The Concept:
It acts as a "Planetary Ear." It is a distributed network that actively listens to 50+ live nature cameras and microphones around the world (YouTube, IP Cams, Icecast). When a bird sings, it pushes a real-time notification to the user and logs it on a live interactive map.
The Architecture:
Because running 50+ live audio streams through a cloud server is prohibitively expensive and triggers massive IP bans, I built this using a Hybrid Edge/Cloud architecture:
The Edge (Local): A Python-based engine running locally on my hardware. It handles stream ingestion (via yt-dlp and headless browsers), audio normalization, and runs the BirdNET AI model locally.
The Hydra Network: To bypass YouTube rate limits, the local engine binds specific listening processes to different physical network adapters (SIM cards/Wi-Fi interfaces) to distribute the traffic.
The Cloud: Only lightweight metadata (timestamp, SNR, species, location) and small MP3 clips of verified detections are beamed to a central DigitalOcean PostgreSQL database and Flask API, which drives the public web map.
Handling the "Firehose" of Data:
My biggest challenge was false positives and alert fatigue (e.g., getting many alerts a day for Pigeons). To solve this, I built a few intelligence layers:
Bayesian Cooldowns: The system keeps a 14-day rolling history of every stream. It dynamically learns if a bird is a "Resident" (mutes alerts for hours) or a "Rare Visitor" (alerts immediately).
Adaptive SNR Distance: The engine measures the RMS volume of the detection against the background noise floor to establish a "Golden Anchor" (loudest known volume) for that specific species on that specific stream, allowing it to tag alerts as "Very Near" or "Far."
DSP Bioacoustic Vetoes: I added custom DSP math (checking spectral flatness and frequency isolation) to detect non-birds (like African Cicadas, Howler Monkeys or Tink Frogs), while using BirdNET's weather classifications (wind/rain) to veto false positives.
My Ask to this Community:
As an engineer, the code is solid, but I would love the perspective of actual researchers and bioacoustics experts:
Does a tool like this have value for citizen science or migration tracking?
Are there better open-source models I should look at beyond BirdNET and YAMNet for the edge?
Any feedback on the Web Map UI (wilddetection.net) would be deeply appreciated.
Thank you for your time, and I look forward to learning from this community!
WildDetection.pdf3 March 2026 6:05pm
Thank you for sharing this — this is an impressively thoughtful engineering build. The hybrid edge/cloud architecture and the adaptive SNR logic are particularly interesting from a field-deployment perspective.
A few thoughts...
On value for citizen science and migration tracking: Yes. There could be real value in pairing this with curated monitoring stations or training cohorts.
This is exciting work, and I appreciate the open-source spirit behind it. I’d be interested in seeing how a system like this performs in savanna or tropical forest contexts, where acoustic complexity is high.
Looking forward to following the project.
12 May 2026 7:12pm
Hi, lately I have added vision detection to existing audio detection of birds. As well, vision detection can detect other animals besides birds and can detect flocks/herds:
13 May 2026 11:54am
Hey Avi,
I am quite intrigued by your work. I discovered many wildlife streams in East Africa that I was not aware of. Excellent work.
I will give my 2 cents and start by saying I am also not a researcher or an expert, so with that out of the way here are some feedback I have for you.
- Excellent work but you did not share any code e.g Github link or any way for researchers to reproduce your work in filtering noise, distance estimation or other technical innovations you came up with.
- The site would be more useful for researchers if there was an alternative view, without the map that lists in tabular format each stream, what was detected and at what time plus confidence ratios. Filters for regions, time, taxa e.t.c This can enable researchers see the wealth of data you have collected, correlate it with the actual stream and decide if it is useful for them.
This is what I could come up with, in terms of feedback but otherwise this is an excellent resource. I am sure to be a regular user.
18 May 2026 4:48pm
This is really impressive Avi! Would you mind sharing what kind of hardware the local ingestion + detection is running on?
It's a fun experiment to think if this could be expanded to any live stream happening on any platform (Instagram, TikTok)
Zingfa