OceanRep
A Deep-Learning Based Pipeline for Estimating the Abundance and Size of Aquatic Organisms in an Unconstrained Underwater Environment from Continuously Captured Stereo Video.
Böer, Gordon, Gröger, Joachim P., Badri-Höher, Sabah, Cisewski, Boris, Renkewitz, Helge, Mittermayer, Felix H. , Strickmann, Tobias and Schramm, Hauke (2023) A Deep-Learning Based Pipeline for Estimating the Abundance and Size of Aquatic Organisms in an Unconstrained Underwater Environment from Continuously Captured Stereo Video. Sensors, 23 (6). Art.Nr. 3311. DOI 10.3390/s23063311.
Preview |
Text
sensors-23-03311.pdf - Published Version Available under License Creative Commons: Attribution 4.0. Download (14MB) | Preview |
Abstract
The utilization of stationary underwater cameras is a modern and well-adapted approach to provide a continuous and cost-effective long-term solution to monitor underwater habitats of particular interest. A common goal of such monitoring systems is to gain better insight into the dynamics and condition of populations of various marine organisms, such as migratory or commercially relevant fish taxa. This paper describes a complete processing pipeline to automatically determine the abundance, type and estimate the size of biological taxa from stereoscopic video data captured by the stereo camera of a stationary Underwater Fish Observatory (UFO). A calibration of the recording system was carried out in situ and, afterward, validated using the synchronously recorded sonar data. The video data were recorded continuously for nearly one year in the Kiel Fjord, an inlet of the Baltic Sea in northern Germany. It shows underwater organisms in their natural behavior, as passive low-light cameras were used instead of active lighting to dampen attraction effects and allow for the least invasive recording possible. The recorded raw data are pre-filtered by an adaptive background estimation to extract sequences with activity, which are then processed by a deep detection network, i.e., Yolov5. This provides the location and type of organisms detected in each video frame of both cameras, which are used to calculate stereo correspondences following a basic matching scheme. In a subsequent step, the size and distance of the depicted organisms are approximated using the corner coordinates of the matched bounding boxes. The Yolov5 model employed in this study was trained on a novel dataset comprising 73,144 images and 92,899 bounding box annotations for 10 categories of marine animals. The model achieved a mean detection accuracy of 92.4%, a mean average precision (mAP) of 94.8% and an F1 score of 93%.
Document Type: | Article |
---|---|
Keywords: | marine species detection; underwater imagery; stereo-vision; deep learning |
Research affiliation: | Fraunhofer OceanRep > GEOMAR > FB3 Marine Ecology > FB3-EV Marine Evolutionary Ecology Kiel University |
Main POF Topic: | PT6: Marine Life |
Refereed: | Yes |
Open Access Journal?: | Yes |
Publisher: | MDPI |
Related URLs: | |
Date Deposited: | 23 Mar 2023 07:42 |
Last Modified: | 04 Feb 2025 11:43 |
URI: | https://oceanrep.geomar.de/id/eprint/58241 |
Actions (login required)
View Item |
Copyright 2023 | GEOMAR Helmholtz-Zentrum für Ozeanforschung Kiel | All rights reserved
Questions, comments and suggestions regarding the GEOMAR repository are welcomed
at bibliotheksleitung@geomar.de !