Skip to content

Latest commit

 

History

History
41 lines (38 loc) · 1.19 KB

outline.md

File metadata and controls

41 lines (38 loc) · 1.19 KB

To be extended...

ATMOSPHERIC DATA ANALYSIS WITH DASK AND XARRAY

Outline

  1. Xarray
    1. Data structures for multi-dimensional data
    2. Working with labeled data
      1. Named dimensions/axes
      2. Coordinate labels
      3. Label based indexing
    3. Computation with xarray
      1. Arithmetic
      2. Aggregation
      3. Group-by and Resample
    4. Plotting and Visualization
      1. Integration with Matplotlib
      2. Making Maps with Cartopy
      3. Bokeh/Holoviews/Geoviews
    5. Input and Output
      1. NetCDF 3/4
      2. RasterIO
      3. Zarr
  2. Dask
    1. Dynamic task scheduling
      1. Dask Delayed Functions
      2. Dask Delayed Objects
      3. Dask Schedulers
      4. Dask Parallelism
      5. Profiling & Diagnostics
      6. Dask Collections (array, bag, and dataframe)
      7. Dask Distributed
    2. Dask and Xarray
      1. Parallel/streaming/lazy computation using dask.array
      2. Reading and writing data
    3. Automatic parallelization with xarray.apply_ufunc
    4. Distributed scheduler for HPC and cloud computing environments
  3. Applications
    1. CMIP5 data
    2. CESM Large Ensemble