You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
OpenSARShip 1.0 is a medium-resolution ship dataset consisted of 11346 image chips cropped from a total of 41 Sentinel-1 images.
It covers mainly 5 ports in Asia and it has 17 types (AIS types) of ships in total.
The spatial resolutions of the images are 2.7 × 22 to 3.6 × 22 and 20 × 22 meters.
Dataset images have mixed VV and VH polarizations.
Each ship image corresponds to an automatic identification system (AIS) message.
The detailed information of each ship chip, containing the AIS messages, the SAR ship signatures, and the messages provided by the MarineTraffic website, is listed in a XML file named Ship.xml.
The image sizes range from 30 × 30 to 120 × 120 pixels.
For every Sentinel-1 SAR image, four subfolders provide the different formats of ship chips: original data, visualized data in greyscale, visualized data in pseudo-color, and calibrated data.
Original and calibrated data are in .tiff format and have 128 bit color depth (32 bits per channel).
Visualized data in pseudo-color have 24 bit color depth (8 bits per channel) and are in .png format.
Visualized data in greyscale have 8 bit color depth and are in .tiff format.
OpenSARShip 2.0 consists of 34528 image chips cropped from a total of 87 Sentinel-1 images.
The whole products are in the interferometric wide swath (IW) mode.
The OpenSARShip contains two available products of the IW mode: the single look complex (SLC) and the ground range detected (GRD) products.
These 87 Sentinel-1 images, 52 from GRD and 35 from SLC imageries, are selected from 10 typical intense marine traffic scenes globally in latest years.
For Sentinel-1 the spatial resolutions of the images 2.7 × 22 to 3.6 × 22 and 20 × 22 meters.
Dataset images have mixed VV and VH polarizations.
Each ship image corresponds to an automatic identification system (AIS) message.
The detailed information of each ship chip, containing the AIS messages, the SAR ship signatures, and the messages provided by the MarineTraffic website, is listed in an XML file named Ship.xml
The image sizes range from 30 × 30 to 120 × 120 pixels.
For every Sentinel-1 SAR image, four subfolders provide the different formats of ship chips: original data, visualized data in greyscale, visualized data in pseudo-color, and calibrated data.
It consists of 43819 ship chips of 256 × 256 pixels.
The specific dataset includes 59535 ship instances.
This dataset was created using 102 Chinese Gaofen-3 images and 108 Sentinel-1 images.
For Gaofen-3, the images have spatial resolutions of 3 m, 5 m, 8 m, 10m, and 25 m per pixel, and for Sentinel-1 the spatial resolutions of the images are 1.7 × 4.3 to 3.6 × 4.9 and 20 × 22 meters.
For Gaofen-3 the imaging modes are Ultrafine Strip-Map (UFS), Fine Strip-Map 1 (FSI), Full Polarization 1 (QPSI), Full Polarization 2 (QPSII), and Fine Strip-Map 2 (FSII).
For Sentinel-1, the imaging modes are S3 Strip-Map (SM), S6 SM, and IW-mode.
Each ship chip corresponds to an Extensible Markup Language (XML) file, indicating the ship's location, the ship chip name, and the image shape.
The AIR-SARShip-1.0 dataset is collected from the Gaofen-3 satellite, and contains 31 images of large scenes.
The imaging mode has both spotlight and strip modes. All images are in single polarization mode with a size of about 3000 × 3000 pixels.
The raw SAR images (3000 × 3000 pixels) cropped into 500 × 500 pixel sub-images.
The above 6×6×31 = 1116 sub-images are in .tiff format.
Every image corresponds to a XML label file which includes image file name, pixel size, number of channels, resolution, category, and position of each target box.
The spatial resolutions of SAR images are 1 and 3 meters per pixel.
Dataset is collected from the Gaofen-3 satellite, and contains 300 images of large scenes.
The imaging mode has both spotlight and strip modes.
All images are in single polarization mode.
In dataset, each image size is about 1000×1000 pixels, with .tiff format, single channel, 16-bit image depth.
The spatial resolutions of SAR images are 1 and 3 meters per pixel.
Each image corresponds to an .xml file, provides the detail information including the length and width dimensions, the category and the position of the target box.
FUSAR-Ship dataset has a total of 15 ship categories, 98 ship subcategories, consisted of 126 GF-3 scenes covering various scenarios. The imaging mode of those 126 images is ultrafine strip-map (UFS) mode.
It includes over 5000 ship chips with AIS messages and some other types of marine targets and background clutters.
The above single-band ship images have dimensions of 512 × 512 pixels.
These images are in .tiff format and their color depth is 8 bits.
These image chips are stored under subfolder named after its 'category/subcategory'. The filename of each sample follows the convention: Ship_CxxSyyNzzzz.tiff, where xx is the index of category, yy is the index of subcategory and zzzz is the index of this particular sample.
Dataset images have VV and HH polarizations.
The matchup metadata is compiled in the file 'meta.csv' or 'meta.xls', which follow the format of: id mmsi length width polarMode centerLookAngle heightspace widthspace path.
11. DSSDD (Dual-polarimetric SAR Ship Detection Dataset)
The specific dataset contains 50 dual-polarimetric SAR images from Sentinel-1.
The above images were cropped to 1236 image slices with the size of 256x256 pixels.
These 1236 images have VV and VH polarizations which were then fused into R,G,B channels for the creation of the pseudo-color image.
The colour depth of the images (.png) is 8 bits/channel.
The 16-bit original images (.tif) are also provided.
This dataset includes 3540 ship instances.
Each ship was labeled with both a rotatable bounding box (RBox) and a horizontal bounding box (BBox).
Each image slice has a corresponding XML format annotation file, indicating the slice size, slice name, and annotation type:
The RBox label is tagged as “robndbox”, where "cx", "cy", "w", "h", and "angle" indicate the center coordinates, height, width, and angle of a box, respectively.
Correspondingly, the BBox label is tagged as "bndbox", where "xmin", "xmax", "ymin", "ymax" refer to the top left corner and the lower right corner coordinates of a box, respectively.
It consists of 30 panoramic SAR tiles of the Chinese Gaofen-3 with a resolution of 1 m in range direction and azimuth.
These original SAR images are in spotlight (SL) mode with a HH and VV polarizations.
The above imageries were cropped to 666 smaller images with dimensions of 1024x1024 pixels.
SRSDD-v1.0 contains 2884 ship instances which are distributed among 6 different categories:
ore-oil (carrier) ↦ 166 ship instances
bulk-cargo ↦ 2053 ship instances
Fishing ↦ 288 ship instances
LawEnforce ↦ 25 ship instances
Dredger ↦ 263 ship instances
Container ↦ 89 ship instances
In the dataset, each instance's location is annotated by a oriented quadrilateral bounding box [(x1, y1), (x2, y2), (x3, y3), (x4, y4)]. The first point of the bounding box (x1, y1) denotes the starting point, which refers to the top left corner of a ship.
Annotations for an image are saved in a text file with the same file name. In the first line, "imagesource" is given. In the second line, "gsd"(ground sample distance=1) is given. From third line to the last line in the annotation text file, annotation for each instance is given.
For every instance, a "difficult" label is provided, which indicates whether the instance is difficult to be detected (1 for difficult, 0 for not difficult).
The xView dataset contains 1 million objects across 60 classes in over 1400 km2 of imagery and released by the Defense Innovation Unit Experimental (DIUx) and the National Geospatial-intelligence Agency (NGA).
It consists of 1414 large scene images (GeoTIFF format) with dimensions ranging from 2772 × 2678 to 5121 × 3023 pixels.
These data is collected from WorldView-3 satellites at 0.3 m ground sample distance.
The GeoTIFF files are broken into two sets: RGB and 8-band. The 8-band set contains 8-band multispectral GeoTIFF images labeled as image_id.tif where image_id is a unique integer. Similarly, the RGB set contains pan-sharpened 3-band RGB GeoTIFF images with the same naming convention.
The above 60 fine-grained classes are labeled in a parent class-child class manner. There are seven different parent classes, namely fixed-wing aircraft, passenger vehicle, truck, railway vehicle, maritime vessel, engineering vehicle, and building (some child classes have no parent class).
The maritime vessel class contains nine child classes: motorboat, sailboat, tugboat, barge, fishing vessel, ferry, yacht, container ship and oil tanker.
Each image set has a corresponding geoJSON file. The fields in the geoJSON files include:
TYPE_ID : The bounding box label class ID.
CAT_ID : DigitalGlobe's unique ID for image strips.
IMAGE_ID : The image chip filename on which a feature is marked.
BOUNDS_IMCOORDS : Bounding box in pixel coordinates [xmin, ymin, xmax, ymax] of the image chip in which it is marked.
COORDINATES : Coordinates in longitude-latitude form for bounding box points.
5. DOTA (Dataset for Object deTection in Aerial images)
It consists of 2806 images which they have dimensions of about 4000 × 4000 pixels.
This dataset contains 15 different categories but only 14 main categories (because small vehicle and large vehicle are both subcategories of vehicle).
These 2806 images contain 43736 ship instances.
Images in this dataset are collected from multiple sensors and platforms with multiple resolutions (Google Earth, Gaofen-2 and JL-1).
In the dataset, each instance's location is annotated by a oriented quadrilateral bounding box [(x1, y1), (x2, y2), (x3, y3), (x4, y4)]. The first point of the bounding box (x1, y1) denotes the starting point, which refers to the top left corner of a ship.
Annotations for an image are saved in a text file with the same file name. In the first line, "imagesource" is given. In the second line, "gsd"(ground sample distance) is given. From third line to the last line in the annotation text file, annotation for each instance is given.
For every instance, a "difficult" label is provided, which indicates whether the instance is difficult to be detected (1 for difficult, 0 for not difficult).
The above 21761 images have various sizes and are in .jpeg format with 24-bit color depth.
The TGRS-HRRSD dataset contains 3975 ship instances.
Every image has a corresponding .xml file which contains the image's dimensions, the class name, and the 4 coordinates (of the lower left and upper right points) of each bounding box, for every instance that is shown in the image.
This labeling is stored in XML files using the annotation format of PASCAL VOC.
The above .xml files contain the image's dimensions and channels, instance's class name, and the four pairs of pixel coordinates for every instance's corresponding bounding box.
8. DIOR(object Detection In Optical Remote sensing images)
DIOR consists of 23463 optical remote sensing images and 192472 object instances that are manually labeled with axis‐aligned bounding boxes, covered by 20 common object categories.
The size of images in the dataset is 800 × 800 pixels and the spatial resolutions range from 0.5m to 30m.
The above images acquired from Google Earth.
The 2702 of the 23463 images contain ~63000 ship instances.
The specific dataset contains 2612 images from 17 large ports including China, Japan, the United States and Spain.
The above images have dimensions of 930 × 930 pixels and their spatial resolutions range from 0.12m to 1.93m.
These images contain 5634 ship instances.
There are 43 categories of ships and a 'dock' category in the dataset.
These 43 classes of ships were divided into 4 Level-2 categories, including warship, carrier, submarine and civil ship. And all ships shares the same level-1 label ship.
1 Level-1 category : ship
4 Level-2 categories : warship, carrier, submarine and civil ship
43 Level-3 categories : container, oil tanker, yacht, hovercraft, etc.
Ship samples in FGSD were annotated with both common used bounding box and rotated bounding box. The rotated bounding box is annotated as (xc, yc, w, h, θ).
Each image has a corresponding annotation file, which includes the source-port’s ID, the resolution and corresponding Google Earth's resolution level of the image.
ShipRSImageNet contains over 3435 images with 17573 ship instances, annotated with both horizontal and orientated bounding boxes.
The above 3435 images were obtained from various sensors, satellite platforms, locations, and seasons.
Each image is around 930×930 pixels and contains ships with different scales, orientations, and aspect ratios.
The spatial resolution of the images ranges from 0.12 to 6 m.
These 3435 images were collected from:
The xView dataset (532 images).
The HRSC2016 dataset (1057 images).
The FGSD dataset (1846 images).
The Airbus Ship Detection Challenge (21 images).
Chinese satellites, such as GaoFen-2 and JiLin-1 (17 images).
Ships in ShipRSImageNet are hierarchically classified into four levels (0-3) and 50 categories (49 ship types and "Dock").
Level 0 distinguishes whether the object is a ship, namely "Class". Level 1 further classifies the ship object category, named as "Category". Level 2 further subdivides the categories based on level 1 ("Subcategory"). Level 3 is the specific type of ship, named "Type".
The ShipRSImageNet was divided in training, validation and test sets which contain 64% (2198), 16% (550) and 20% (687) of the original images respectively.
Each ship instance has a corresponding set of annotations which are:
A horizontal bounding box ↦ (xmin, ymin, xmax, ymax).