This package provides various tools for indexing Google Earth Engine (GEE) images into an Open Data Cube (ODC) index. It takes advantage of the GEE REST API and the GEE STAC API.
In order to use ODC with GEE, you will need to be signed up as a GEE developer. If not, you may send an application to Google here. This process will require a Google Account and to follow Google's terms of service for using their product.
First clone/download this repository to where you want the package to reside.
Example: git clone https://github.com/ceos-seo/odc-gee.git
Next you will need to register the package and install dependencies.
Preferred method is to use pip:
pip install -e odc-gee
Alternatively, using Python setuptools:
python setup.py build && python setup.py install
The ODC-GEE package requires an ODC database to be running and configured. Instructions for setting up an ODC environment can be found on the ODC readthedocs page.
The ODC database will also need to support the EO3 metadata type. This can be
checked by running datacube metadata list
in your ODC environment. If EO3 is
not listed then you will need to add it. The default EO3 type definition can be
found
here.
To add it, save the EO3 definition in a .yaml
file then run the following
command: datacube metadata add <your_eo3.yaml>
.
The scripts and python modules in this package allow for the following environment variables:
GOOGLE_APPLICATION_CREDENTIALS
: Optional; the service account credentials JSON file (default: ~/.config/odc-gee/credentials.json).REGIONS_CONFIG
: Optional; a JSON file for storing latitude/longitude locations if not performing global indexing (default: ~/.config/odc-gee/regions.json).
Some example configuration files are provided in the ./opt/config
directory.
Change $USER
to the username that is using this package.
The package comes with two scripts. One is to create ODC product definitions using GEE metadata. The other is for indexing an ODC product for use with GEE.
New product creation is done using new_product
. An example: new_product --asset <asset_id> <product_name.yaml>
. This will try to automate the entire
product definition creation process as defined in the ODC
documentation.
Check the resulting document to see if anything needs changing for your desired
result. You can then add the resulting product definition to the ODC database:
datacube product add <product_name.yaml>
. Use new_product --help
to see all
available options.
Note: The following option is deprecated in favor of "real-time" indexing but is kept here for posterity, and it is still useful in environments that need to use standard ODC instead of a wrapper.
The indexing of datasets to a local ODC index can be done using the index_gee
command. The script will format datasets to conform to a product definition if
the supplied product parameter is a product name of a definition in the ODC
database. Otherwise, the script will create a generic product definition and
dataset document based on GEE metadata. Example usage: index_gee --product ls8_google
. Use index_gee --help
to see all available options.
The package also provides Python modules accessible by import odc_gee
.
Optional items such as systemd timers and product update scripts are also
provided in the ./opt
folder. Change $USER
to the username that is using
this package. These provide example templates for automating updates.
A Datacube wrapper has been created with the intent of handling GEE OAuth in notebooks if credentials aren't provided. This wrapper also intends to allow for real-time indexing capabilities of GEE products so that the manual indexing process is not required.
This assumes an indexed product with ls8_google
as the product name, a
defined CRS/resolution, and measurements with red/green/blue aliases. If no
credentials file is supplied then the wrapper will try to authenticate with
Google using OAuth. A product definition can be creating by following the steps
on using the new_product
command.
from odc_gee.earthengine import Datacube
latitude = (-17.63, -17.75)
longitude = (168.25, 168.15)
time = ('2019-01-01', '2019-02-02')
dc = Datacube()
ds = dc.load(product='ls8_google', measurements=['red', 'green', 'blue'],
group_by='solar_day', latitude=latitude, longitude=longitude, time=time)
The Datacube wrapper provides a method for indexing GEE items "on the fly" without an indexed product by providing a GEE asset ID from the GEE catalog.
from odc_gee.earthengine import Datacube
latitude = (-17.63, -17.75)
longitude = (168.25, 168.15)
time = ('2019-01-01', '2019-02-02')
dc = Datacube()
ds = dc.load(asset='LANDSAT/LC08/C01/T1_SR', measurements=['B4', 'B3', 'B2'],
latitude=latitude, longitude=longitude, time=time,
group_by='solar_day', resolution=(-2.69493352e-4, 2.69493352e-4),
output_crs='EPSG:4326')
ds.isel(time=0).to_array().plot.imshow(vmin=0, vmax=3000, size=8,
aspect=ds.dims['longitude']/ds.dims['latitude']);