Realtime video analysis using ML on Raspberry PI


I am looking into using ML to identify a flock of birds from a video camera feed, either color or grayscale, using OpenCV and Python on a Raspberry Pi? Has anyone attempted this previously and would they be prepared to kindly share some of their experiences with me please? My aim is to link it to audio analysis to nail species flock identification at 100m or so?

Tally ho!