/luderick-seagrass

This dataset comprises of annotated footage of Girella tricuspidata in two estuary systems in South East Queensland, Australia. This data is suitable for a range of classification and object detection research in unconstrained underwater environments.

Annotated videos of luderick from estuaries in southeast Queensland, Australia

CC BY 4.0 DOI

example dataset image

Overview

This dataset comprises of annotated footage of Girella tricuspidata in two estuary systems in south East Queensland, Australia. This data is suitable for a range of classification and object detection research in unconstrained underwater environments. The raw data was obtained using submerged action cameras (Haldex Sports Action Cam HD 1080p) to collect video footage in the Tweed River estuary in southeast Queensland (-28.169438, 153.547594), between February and July 2019. Additional footage was collected from seagrass meadows in a separate estuary system in Tallebudgera Creek (-28.109721, 153.448975). Each sampling day, six cameras were deployed for 1 h over a variety of seagrass patches; the angle and placement of cameras was varied among deployment to ensure a variety of backgrounds and fish angles. Videos were trimmed for training to contain only footage of luderick (the target species for the study) and split into 5 frames per second.

Full data report can be found here: https://doi.org/10.3389/fmars.2021.629485

Dataset presentation

This dataset includes 9429 annotations and 4280 images which can be used for training object detection deep learning models and other related computer vision tasks. The dataset is organised into 3 sub-datasets that have been allocated for training, test and novel test purposes.

Dataset ID Raw Videos Version Suggested use Luderick Annotations Bream Annotations Total
Luderick Seagrass Jack Evans A Wvo7U_76t Download (1.3GB) 8 training 6649 53 6702
Luderick Seagrass Jack Evans B OmKwIVpe- Download (1.1GB) 8 test 1632 29 1661
Luderick Seagrass Tallebudgera 4bUBoZmvV Download (576MB) 6 novel test 1023 43 1066
Total 9304 125 9429

Images are included in a ZIP archive which can be downloaded from either of the following:

Each annotation includes object instance annotations which consist of the following key fields: Labels are provided as a common name: either "luderick" for Girella tricuspidata or "bream" for Acanthopagrus australis; Bounding boxes that enclose the species in each frame are provided in "[x, y, width, height]" format, in pixel units; Segmentation masks which outline the species as a polygon are provided as a list of pixel coordinates in the format "[x, y, x, y, ...]"; The corresponding image is provided as an image filename. All image coordinates (bounding box and segmentation masks) are measured from the top left image corner and are 0-indexed.

Annotations are provided in both CSV format and COCO JSON format which is a commonly used data format for integration with object detection frameworks including PyTorch and TensorFlow.

Additional details for each image can be found in dataset_images.csv, including data collection deployment dates, geo-coordinates and habitat type.

COCO JSON

Each annotation in COCO JSON format includes the following fields:

Key Description
id INT annotation ID
category_id INT category ID
image_id INT image ID
bbox ARRAY [x, y, width, height] of bounding box in px
area INT area of bounding box in pixels squared
segmentation STR segmentation polygon coordinates in format "[[x, y, x, y, ...]]"
iscrowd INT 0 or 1. A value of 1 indicated the annotation includes more than one individual

Each image in COCO JSON format includes the following fields:

Key Description
id INT image ID
height INT image height (px)
width INT image width (px)
file_name STR image filename

Each category in COCO JSON format includes the following fields:

Key Description
id INT category ID
name STR category name (species common name)

COCO JSON Example

{
  "annotations": [{
    "id": 0,
    "image_id": 0,
    "category_id": 1,
    "bbox": [ 0, 76, 624, 1003 ],
    "iscrowd": 0,
    "area": 625872,
    "segmentation": [
      [ 5, 76, 154, 80, 409, 76, 471, 86, 546, 110 ]
    ]
  }],
  "images": [{
      "file_name": "20190618_1.mov_5fps_000001.jpg",
      "height": 1080,
      "width": 1920,
      "id": 0,
      "license": 1
    }
  ],
  "categories": [{
    "name": "luderick",
    "id": 1
  }]

CSV

For each annotation in CSV format, the following columns are provided:

Column Description
id INT annotation ID
category STR name of category (luderick/bream)
category_id INT category ID
image STR image file name
image_id INT image ID
bbox_x INT minimum x pixel coordinate of bounding box
bbox_y INT minimum y pixel coordinate of bounding box
bbox_w INT width of bounding box in pixels
bbox_h INT height of bounding box in pixels
area INT area of bounding box in pixels squared
segmentation STR segmentation polygon coordinates in format "[[x, y, x, y, ...]]"

CSV Example

id category category_id image image_id bbox_x bbox_y bbox_w bbox_h area segmentation
0 luderick 1 20190618_1.mov_5fps_000001.jpg 0 0 76 624 1003 625872 "[[5, 76, ...]]"