Skip to content

Latest commit

 

History

History
183 lines (115 loc) · 12.8 KB

README.md

File metadata and controls

183 lines (115 loc) · 12.8 KB

This repository is forked from https://github.com/yangsiyu007/SpaceNetExploration. For the accompanying blog post from Microsoft AI Team, see here.

Building Footprint Extraction

Overview

This repository contains a walkthrough demonstrating how to perform semantic segmentation using convolutional neural networks (CNNs) on satellite images to extract the footprints of buildings. We show how to carry out the procedure on Google Colab Notebooks. We use a subset of the data and labels from the SpaceNet Challenge, an online repository of freely available satellite imagery released to encourage the application of machine learning to geospatial data.

The blog post that first announced this sample project is here on the Azure Blog.

Data

SpaceNet Building Footprint Extraction Dataset

The code in this repository was developed for training a semantic segmentation model (currently two variants of the U-Net are implemented) on the Vegas set of the SpaceNet building footprint extraction data. Instruction for downloading the SpaceNet data can be found on their website. Subsetting to just Vegas makes the sample code clearer, but it can be easily extended to take in training data from the four other locations.

The above link will have the latest documentation. As of 2020-06-15, the following worked for me:

aws s3 cp s3://spacenet-dataset/spacenet/SN2_buildings/tarballs/SN2_buildings_train_AOI_2_Vegas.tar.gz . 

aws s3 cp s3://spacenet-dataset/spacenet/SN2_buildings/tarballs/AOI_2_Vegas_Test_public.tar.gz .

However, note the organizers may change s3 paths so if that doesn't work refer to the original documentation.

The organizers release a portion of this data as training data and the rest are held out for the purpose of the competitions they hold. For the experiments discussed here, we split the official training set 70:15:15 into our own training, validation and test sets. These are 39 GB in size as raw images in TIFF format with labels.

Generate Input from Raw Data

After downloading the raw data (for example at RegLab, we have the raw data stored in this Google Drive Folder; space-net-exploration/data/raw), you will need to run split_train_val_test.py.

This is necessary even though the SpaceNet utilities code also splits the data into trainval and test as it creates mask annotations from polygon labels, because that process only split the data after smaller chips are created from larger, raw images with some overlap.

Next, we will need to use the SpaceNet utilities to convert the raw images to a format that semantic segmentation models can take as input. The utilities are in this repo. Most of the functionalities you will need are in the python folder. Specifically, you will need to run createDataSpaceNet.py. Please read their instructions on the repo's README to understand all the tools and parameters available. After using createDataSpaceNet.py from the utilities repo to process the raw data, the input image and its label look like the following:

Example of input image and its label

Environment Setup

AWS Command Line Interface to Get Data

The original data can be found in the SpaceNet Challenge. To download the data requires AWS command line tools.

Note, to download AWS Command line tools, see the AWS docs for more info.

Google Colab

We will be using Google Colab Notebooks for this tutorial. The benefits to using this are:

  1. It's free! (Including GPU)
  2. With Stanford account, we have unlimited Google Drive storage. Thus, we can upload large datasets into Google Drive which we can hook into Google Colab notebooks for easy exploration.
  3. Okay-ish development environment

Note, we're keeping track of which folders in Google Drive are fully uploaded, since this is a lot of data and uploading to Google Drive in bulk seems to bug out.

  • AOI_2_Vegas_Test_public

    • MUL (done! 1,282 items)
    • MUL-PanSharpen (done! 1,282 items)
    • PAN (done! 1,282 items)
    • RGB-PanSharpen (done! 1,282 items)
  • AOI_2_Vegas_Train

    • geojson (done!)
      • buildings (done! 3,851 items)
    • MUL (done! 3,851 items)
    • MUL-PanSharpen (done! 3,851 items)
    • PAN (done! 3,851 items)
    • RGB-PanSharpen (done! 3,851 items)
    • summaryData (done! 1 item)

Additional Packages to Install

There are two additional packages for the polygonization of the result of the CNN model so that our results can be compared to the original labels, which are expressed in a polygon data type. You can install these using pip:

pip install rasterio
pip install shapely

Data Storage Options

We will be using Google Drive to store data for this tutorial. This is because Stanford offers unlimited Google Drive storage and we can access our Google Drive folders seamlessly within Google Colab.

Link to Google Drive Folder

Bugs/Errors

Pre-processing

When initially running, this:

!python utilities/spacenetutilities/scripts/createDataSpaceNet.py /content/drive/My\ Drive/space-net-exploration/data/raw/AOI_2_Vegas_Train/ \
           --srcImageryDirectory RGB-PanSharpen \
           --outputDirectory data/processed/ \
           --annotationType PASCALVOC2012 \
           --imgSizePix 256

I was getting the following error:

Traceback (most recent call last):
  File "utilities/spacenetutilities/scripts/createDataSpaceNet.py", line 324, in <module>
    bboxResize= args.boundingBoxResize
  File "utilities/spacenetutilities/scripts/createDataSpaceNet.py", line 89, in processChipSummaryList
    bboxResize=bboxResize
  File "/content/drive/My Drive/space-net-exploration/utilities/spacenetutilities/labeltools/pascalVOCLabel.py", line 212, in geoJsonToPASCALVOC2012
    borderValue=255
  File "/content/drive/My Drive/space-net-exploration/utilities/spacenetutilities/labeltools/pascalVOCLabel.py", line 117, in geoJsonToPASCALVOC2012SegmentCls
    source_layer = gpd.read_file(geoJson)
  File "/usr/local/lib/python3.6/dist-packages/geopandas/io/file.py", line 89, in read_file
    with reader(path_or_bytes, **kwargs) as features:
  File "/usr/local/lib/python3.6/dist-packages/fiona/env.py", line 398, in wrapper
    return f(*args, **kwargs)
  File "/usr/local/lib/python3.6/dist-packages/fiona/__init__.py", line 250, in open
    path = parse_path(fp)
  File "/usr/local/lib/python3.6/dist-packages/fiona/path.py", line 132, in parse_path
    elif path.startswith('/vsi'):
AttributeError: 'list' object has no attribute 'startswith'

which is similar to the issue already filed here.

Google Drive Timeouts

Working in collab, I would sometimes get errors like "A Google Drive timeout has occurred (most recently at HH:MM:SS)". This most often happened to me when training the model:

!python training/train_aml.py \
           --experiment_name first-test \
           --out_dir /content/drive/My\ Drive/space-net-exploration/models/

So far, I've had some success to re-running as failed attempts cache partial state locally before timing out. If you continue to encounter this issue, it has to do with the number of files or subfolders in a folder growing too large. Their recommended solution is to try moving files and oflders directly contained in "My Drive" into sub-folders.

Model Training

We tackle the problem of outlining building footprints in satellite images by applying a semantic segmentation model to first classify each pixel as background, building, or boundary of buildings. The U-Net is used for this task. There are two variants of the U-Net implemented in the models directory, differing by the sizes of filters used. The baseline U-Net is a similar version as used by the winner of the SpaceNet Building Footprint competition XD_XD. We referenced several open source implementations, noted in the relevant files.

Code for training the model is in the training directory. The training script is train_aml.py (AML Stands for Azure Machine Learning, as this repo is forked from Microsoft AI For Good Project) and all the paths to input/output, parameters and other arguments are specified in train_single_gpu_config.py, which you can modify and experiment with. The default configuration has total_epochs set to 15 to run training for 15 epochs, which takes about an hour in total on a VM with a P100 GPU (SKU NC6s_v2 on Azure). For the sample image above, the result of the segmentation model is as follows at epoch 3, 5, 7 and 10:

Example of input image and its label

Generate Polygons of the Building Footprints

Standard graphics techniques are used to convert contiguous blobs of building pixels identified by the segmentation model, using libraries Rasterio and Shapely. The script pipeline/polygonize.py performs this procedure, and you can change various parameters in polygonize_config.py in the same directory. The most important parameter influencing the performance of the model is min_polygon_area, which is the area in squared pixels below which blobs of building pixels are discarded, reducing the noise in our results. Increasing this threshold decreases the number of false positive footprint proposals.

Evaluation

The evaluation metric used by the SpaceNet Challenge is the F1 score, where a footprint proposal is counted as a true positive if its intersection over union (IoU) with the ground truth polygon is above 0.5.

You can of course employ your own metric to suit your application, but if you would like to use the SpaceNet utilities to compute the F1 score based on polygons of building footprints, you need to first combine the annotations for each image in geojson format into a csv with python/createCSVFromGEOJSON.py from the utilities repo. In the root directory of utilities, run

python python/createCSVFromGEOJSON.py -imgDir /tutorial_data/val/RGB-PanSharpen -geoDir /tutorial_data/val/geojson/buildings -o ground_truth.csv --CreateProposalFile

Then you can use python/evaluateScene.py to compute the F1 score, giving the ground truth csv produced from the last command and the csv output proposals.csv produced by pipeline/polygonize.py in this repo:

python python/evaluateScene.py ground_truth.csv proposal.csv

Related Materials

Bing team's announcement that they released a large quantity of building footprints in the US in support of the Open Street Map community, and article briefly describing their method of extracting them.

Very helpful blog post and code on road extraction from satellite images by Jeff Wen on a different dataset. We also took inspiration in structuring the training pipeline from this repo.

SpaceNet road extraction challenge.

Tutorial on pixel-level land cover classification using semantic segmentation in CNTK on Azure.

Helpful blog post on understanding Semantic Segmentation with UNet

  1. Note the reason why it is called UNet is because there are two steps: encoder (downsamples, learns the WHAT) and a decoder (upsamples, learns the WHERE to get the pixel level classifications). Thus this follows a "U" shape.