Tools for automating image augmentation

Biologist and Research Technician working with ecosystem monitoring and research at Zackenberg Research Station in Greenland

Does anyone know of tools to automate image augmentation and manipulation? I wish to train ML image recognition models with images in which the target animal (and false targets) has been cropped out of the originals and placed in other images with different backgrounds. If such tools could also simulate lighting conditions on the target, based on the lighting in the new background, it would be super nice.

I am probably wishing for too much but if it is available, it would surely save me some time constructing these images manually.

Lars Holst Hansen

@Lars_Holst_Hansen

Aarhus University

27 January 2024 9:06am

Seems like I should be able to make something usable in Python using rembg.

Sometimes a little googling before posting a question here can work wonders ... ;)
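A rough sketch of the compositing step, assuming the cutout is already available as an RGBA NumPy array (for example, converted from the output of a background remover such as rembg) and that it fits inside the background at the chosen position. The function names `match_brightness` and `paste_cutout` are made up for illustration, and the single brightness gain is only a crude stand-in for real lighting simulation.

```python
import numpy as np


def match_brightness(cutout_rgb, alpha, background_rgb):
    """Scale the cutout so its mean brightness matches the background patch.

    This is a crude global gain, not a physical lighting model.
    """
    visible = alpha > 0
    if not visible.any():
        return cutout_rgb
    target_mean = background_rgb.mean()
    cutout_mean = cutout_rgb[visible].mean()
    gain = target_mean / max(cutout_mean, 1e-6)
    scaled = cutout_rgb.astype(np.float32) * gain
    return np.clip(scaled, 0, 255).astype(np.uint8)


def paste_cutout(background_rgb, cutout_rgba, top, left):
    """Alpha-blend an RGBA cutout onto a copy of an RGB background.

    Assumes the cutout lies fully inside the background at (top, left).
    """
    out = background_rgb.astype(np.float32)  # astype returns a new array
    h, w = cutout_rgba.shape[:2]
    region = out[top:top + h, left:left + w]  # view into `out`
    alpha = cutout_rgba[..., 3:4].astype(np.float32) / 255.0
    rgb = match_brightness(cutout_rgba[..., :3], cutout_rgba[..., 3], region)
    region[:] = alpha * rgb + (1.0 - alpha) * region
    return out.astype(np.uint8)
```

Calling `paste_cutout(background, cutout, top=120, left=300)` returns a new image and leaves the inputs untouched, so the same cutout can be reused across many backgrounds and positions.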

Wade

@Wade

27 January 2024 5:05pm

There might be some challenges & risks with that approach. ML models should ideally be trained using the same sort of data they will ultimately run against; presumably you're not intending your model to analyse 'photoshopped' images, but real unaltered images?

The problem is that the 'photoshopping' - whether it's masking & multi-image composition or any other effect - can introduce image artefacts that, even if imperceptible to humans, can be detectable by an ML model. The model can unwittingly become trained to detect and rely on those artefacts, hurting its performance in particularly inexplicable ways when it's then used on real, unadulterated data.

That's not to say it's impossible to train models this way - the use of synthesised data is not uncommon in the field - but it has downsides.

Tangentially, the same is true of things like watermarks and overlays inserted by trail cameras, like the time, weather, camera brand etc. Those should all be removed before use with ML models. Otherwise, you might think you're training a model to accurately detect wolves vs coyotes, but what you're actually training is a model which thinks canines captured on Brownings are wolves and those captured on Reconyx cameras are coyotes, or some such coincidental correlation that happens to be in your training data.
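As a minimal sketch of removing such an overlay, assuming it sits in a fixed-height strip at the top or bottom of the frame: the helper name `crop_info_strip` and the strip height are hypothetical, and the real height has to be measured per camera model.

```python
import numpy as np


def crop_info_strip(image, strip_height, position="bottom"):
    """Return the image without the camera's info strip.

    `image` is an (H, W, C) array; `strip_height` is in pixels and must be
    measured for each camera model (it is an assumption here).
    """
    if strip_height <= 0:
        return image
    if strip_height >= image.shape[0]:
        raise ValueError("strip_height must be smaller than the image height")
    if position == "bottom":
        return image[:-strip_height]
    if position == "top":
        return image[strip_height:]
    raise ValueError("position must be 'top' or 'bottom'")
```

Annotations that are normalised to the image height need rescaling after the crop, and a top crop also shifts pixel coordinates.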

Arky

@arky

Technologist and Visual storyteller focusing on social and conservation issues.

12 February 2024 5:00pm

So we are using the default data augmentation parameters in YOLOv8. They cover the basics: flipping, distortion, rotation and added noise.

It would be interesting to use YOLOv8 segmentation to extract the animal. Here is a code snippet that extracts the bears from images and leaves the background black.
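For anyone who wants to move away from the defaults, here is a sketch of overriding a few augmentation settings through Ultralytics training arguments. The values are illustrative rather than recommendations, the wrapper `train_with_augmentation` is made up for this example, and the dataset file name in the usage note is hypothetical.

```python
# Illustrative overrides for a few Ultralytics augmentation arguments.
AUGMENT_OVERRIDES = {
    "fliplr": 0.5,      # probability of a horizontal flip
    "degrees": 10.0,    # maximum random rotation in degrees
    "hsv_v": 0.4,       # random brightness (value) jitter
    "scale": 0.5,       # random scaling gain
    "copy_paste": 0.3,  # segment copy-paste probability (segmentation data)
}


def train_with_augmentation(data_yaml, epochs=50, overrides=None):
    """Train a YOLOv8 segmentation model with the given augmentation settings."""
    # Imported here so the settings above can be inspected without ultralytics.
    from ultralytics import YOLO

    model = YOLO("yolov8n-seg.pt")
    settings = AUGMENT_OVERRIDES if overrides is None else overrides
    return model.train(data=data_yaml, epochs=epochs, **settings)
```

A call would look like `train_with_augmentation("animals.yaml")`. The `copy_paste` setting is the closest built-in relative of the cut-and-paste idea in the original question, although it pastes segments between training images rather than onto hand-picked backgrounds.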

# %% [markdown]
# # Animal Face Feature Extraction 

# %%
from pathlib import Path

import cv2 as cv
import numpy as np

from ultralytics import YOLO


# %%
# Variables
source = "https://upload.wikimedia.org/wikipedia/commons/e/e3/Grizzly_Bear_%28Ursus_arctos_ssp.%29.jpg"
model_name = "yolov8n-seg.pt" # see other yolov8 models
cococlass = 21 # COCO class index for "bear"; refer to the COCO class names



# %%
# Load a yolov8 model
model = YOLO(model_name)

# %%
# Predict with the model, keeping only detections of the chosen class
results = model(source=source, save=True, classes=[cococlass])

# %%
# Source: Yolov8 Documentation
# https://docs.ultralytics.com/guides/isolating-segmentation-objects/
#  Iterate detection results (helpful for multiple images)
for r in results:
    img = np.copy(r.orig_img)
    img_name = Path(r.path).stem # source image base-name

    # Iterate each object contour (multiple detections)
    for ci,c in enumerate(r):
        #  Get detection class name
        label = c.names[c.boxes.cls.tolist().pop()]
        b_mask = np.zeros(img.shape[:2], np.uint8)

        # Create contour mask 
        contour = c.masks.xy.pop().astype(np.int32).reshape(-1, 1, 2)
        _ = cv.drawContours(b_mask, [contour], -1, (255, 255, 255), cv.FILLED)

        # Isolate object with black background
        mask3ch = cv.cvtColor(b_mask, cv.COLOR_GRAY2BGR)
        isolated = cv.bitwise_and(mask3ch, img)

        #  Bounding box coordinates
        x1, y1, x2, y2 = c.boxes.xyxy.cpu().numpy().squeeze().astype(np.int32)

        # Crop image to object region
        iso_crop = isolated[y1:y2, x1:x2]

        # Save isolated object to file
        _ = cv.imwrite(f'{img_name}_{label}-{ci}.png', iso_crop)

# %%
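If crops saved by a script like this are later pasted onto new backgrounds for training, each composite also needs a label file. A small sketch of writing one YOLO-format label line, assuming the paste position and crop size are known in pixels; `yolo_label` is a hypothetical helper, and the class id is the index in your own dataset, not necessarily the COCO one.

```python
def yolo_label(class_id, left, top, box_w, box_h, img_w, img_h):
    """Format one YOLO detection label line.

    Output order is: class, centre x, centre y, width, height,
    with all four geometry values normalised to the image size.
    """
    cx = (left + box_w / 2) / img_w
    cy = (top + box_h / 2) / img_h
    return f"{class_id} {cx:.6f} {cy:.6f} {box_w / img_w:.6f} {box_h / img_h:.6f}"
```

For a 200 x 100 pixel crop pasted at left=100, top=50 in a 1000 x 500 image, `yolo_label(0, 100, 50, 200, 100, 1000, 500)` gives `0 0.200000 0.200000 0.200000 0.200000`.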

Paul Allin

12 February 2024 5:00pm

We are looking at something similar, where limited data sets make training algorithms difficult. I have read plenty of examples in the literature where modifying the training data improves the accuracy of the model, but this will only go so far, as you are not generating entirely new images. I think the best approach would be to run an experiment on a subset of your data to see how much it improves. Greater heterogeneity of backgrounds typically improves a model's performance, so I think you may well be onto something here. Please keep us updated with your progress!
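One way to set up such an experiment, sketched under the assumption that images are tracked as lists of file paths: hold out a validation set of real images only, then train once on the remaining real images and once on real plus synthetic. The function name and the 20% split are illustrative.

```python
import random


def build_experiment_splits(real_images, synthetic_images, val_fraction=0.2, seed=0):
    """Build a real-only validation set plus two training sets to compare."""
    rng = random.Random(seed)  # fixed seed so the split is reproducible
    shuffled = list(real_images)
    rng.shuffle(shuffled)
    n_val = max(1, int(len(shuffled) * val_fraction))
    val = shuffled[:n_val]
    train_real = shuffled[n_val:]
    return {
        "val": val,                                           # real images only
        "train_baseline": train_real,                         # no synthetic data
        "train_augmented": train_real + list(synthetic_images),
    }
```

Both models are then scored on the same `val` list, so any difference reflects performance on unaltered images. Synthetic images built from animals that appear in the validation images should be left out, otherwise the comparison is flattered.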

Lars Holst Hansen

12 February 2024 5:00pm

Hi @arky !

Thanks for your reply.

I am running into pytorch/torchvision incompatibility issues when trying to run your script.

Which versions are you using?

Best regards,

Lars

Arky

12 February 2024 5:00pm

@Lars_Holst_Hansen Here is the information you requested. I also run YOLOv8 in multiple remote environments without any issues. Perhaps you'll need to use a virtual environment (venv et al.) or conda to remedy the incompatibility issues.

$ yolo checks
Ultralytics YOLOv8.1.4 🚀 Python-3.10.12 torch-1.13.1+cu117 CUDA:0 (Quadro T2000, 3904MiB)
Setup complete ✅ (16 CPUs, 62.5 GB RAM, 465.0/467.9 GB disk)

OS                  Linux-6.5.0-17-generic-x86_64-with-glibc2.35
Environment         Linux
Python              3.10.12
Install             pip
RAM                 62.54 GB
CPU                 Intel Core(TM) i7-10875H 2.30GHz
CUDA                11.7

matplotlib          ✅ 3.5.1>=3.3.0
numpy               ✅ 1.26.3>=1.22.2
opencv-python       ✅ 4.7.0.72>=4.6.0
pillow              ✅ 10.2.0>=7.1.2
pyyaml              ✅ 6.0.1>=5.3.1
requests            ✅ 2.31.0>=2.23.0
scipy               ✅ 1.11.4>=1.4.1
torch               ✅ 1.13.1>=1.8.0
torchvision         ✅ 0.14.1>=0.9.0
tqdm                ✅ 4.66.1>=4.64.0
psutil              ✅ 5.9.8
py-cpuinfo          ✅ 9.0.0
thop                ✅ 0.1.1-2209072238>=0.1.1
pandas              ✅ 1.5.3>=1.1.4
seaborn             ✅ 0.12.2>=0.11.0
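To compare another environment against the listing above, a short standard-library script can print the installed versions of the key packages. This is only a sketch, and the package list is an assumption about what matters here. Note that torch and torchvision releases are paired (1.13.1 goes with 0.14.1, as in the listing), so a mismatch between those two is a likely source of incompatibility errors.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_versions(packages=("ultralytics", "torch", "torchvision",
                                 "numpy", "opencv-python")):
    """Return {package name: installed version, or None if not installed}."""
    report = {}
    for name in packages:
        try:
            report[name] = version(name)
        except PackageNotFoundError:
            report[name] = None
    return report


# Print a small table for the current environment.
for name, ver in installed_versions().items():
    print(f"{name:15} {ver or 'not installed'}")
```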

Lars Holst Hansen

12 February 2024 5:00pm

Perfect, thanks! I am still a novice with Python, but my wife can help me!