Projects

Welcome to the central repository for information on waterhackweek projects. This content is distributed on our Learning Resources, as well as the Waterhackweek webpage. Please read more about how to lead tutorials and projects Project Guidelines and Project management on the Waterhackweek Wiki. Learn more about How to Collaborate with Waterhackweek if you are interested in funding your leadership, tutorial development, project, learning community tool development, participation grants, sponsor community efforts, or to propose new ways to advance the ideas sparked at Waterhackweek.

Table of Projects 2020

We try to keep it simple, but these are complex projects. Deep dive from here, but please update regularly so that everyone can follow behind you. The Evolving Team Project is a 2020 experiment on ourselves. We are using a weekly meetup Fridays, July-August 2020, to teach ourselves how to organize all-virtual team projects for diverse teams and interests. Project Leads and participants are invited to use the project to onboard to Waterhackweek, or to contribute in any way that sparks imagination and community. All other projects are designed for team work Aug 31-Sep 4, 2020. Note the bottom of the page has archived projects available as foundations for new projects.

Project Name	Where we work: Slack, Github, Hydroshare, JupyterHub	Cyberinfrastructure & Data Science Tools	Team	Team Lead	Project Lead	Data Science Lead	Geoscience Domain
Evolving Team Cooties: Infectiously Open, Wildly Transmissible, Hilariously Accessible	#teamcooties whw2020_flattenthecurve HydroShare Resource TBD jupyterhub.cuahsi.org	Cloud Widgets in Jupyter, Visualization with Bokeh, Data Smashing (COVID-19, National Water Model), Statistics for Public Translation	WHW Organizers & Contributing Friends: Philip, Mary	Christina	Tony	Scott	Open Global GeoHealth
The Next Amazing Project	Where we work: #funSlack, hyperlinkGithub, Hydroshareresource	XSEDE, Pangeo, Scikitlearn, landlab, matplotlib, plotly, pandas, xarray, numpy, scipy, cartopy ... Cyberinfrastructure & Data Science Tools	Enthusiast Team	Team Lead	Project Lead	Data Science Lead	Geoscience Domain

Purpose of the projects:

During Waterhackweek we will be facilitating open hacking sessions most of the afternoons. The purpose of these sessions is for you to gain hands-on experience in working together on a well-defined problem related to geospatial sciences.

What is hacking?

Hacking is a session of focused, highly collaborative computer programming, in which we create conditions for rapid absorption of new ideas and methods. Visit our hacking central page for more specific information on this approach.

Why is this important?

Increasingly, research and software development is being conducted across groups of people with diverse skills and backgrounds. We believe this collaborative work leads to more innovative solutions to complex problems. At waterhack week, our goal is to explore with you some of the skills needed to navigate technical and social challenges of working in these kinds of collaborative settings.

How will the projects be conducted?

On day 1 we will facilitate the sharing of ideas and formation of people into small teams (2-5 people) each team will identify a:

Project Lead

Coordinate team in the direction of the research question pitch. Data, code, and tools originally proposed may change. Avoid dispensing and coordinating tasks.
Everyone should code something or get their hands on the data in their way.
The project scientist makes sure that their team all meet their individual goals, and that the team evolves.
Pitch project. Interact with all interested participants during networking. Coordinate team building with other instructors to an optimal size of 4-7 total participants.
Populate original GIthub project repository and readme following the Project Guidelines.
Coordinate with 1-2 Data Scientists to support your team, especially Github workflows that work for everyone to be able to contribute code.
Manage data. Be sure you have permission and consent to use data you share in the project. Follow professional standards while using data for training that may be used in future publications. Here is a good resource on publication of data and consent ethics.

Data Science Lead

Any group can ask any data scientist for help.
Interact with as many people as possible.
Enable and encourage participants to solve their problems in a way that builds confidence in their skills
Data scientists will be responsible for supporting at least one project, but expected to float around as needed.
- Support for one project means that on Monday, the team’s data scientist should coordinate a Github workflow that works best for the team by facilitating a team conversation.
- Mentor: Remove personal/interpersonal roadblocks
- Be in tune with progress toward individual goals (discrete daily personal brief checking)
- Facilitate team changes
- Technology Dam breaker: in a kind non-life threatening manner - what library or tool can this team use they may not know about?

Team Participants

Innovation is unpredictable! Sensitive subject matter and personal physical boundaries may become an issue so find a process with your team to check-in regularly to address any issues to be aware of as a group. If you find that you are not feel comfortable sharing with the Team, please reach out privately to discuss with an organizer or other instructor.
Be supportive! Learning software and personal growth both require vulnerability and a safe space. If anyone displays negative attitudes, judgments or insensitivity, we want to authentically address it. If there are concerns outside of the scope you are comfortable addressing, speak with another leader about it or please contact Christina.
We aim to please. Keep your instructors updated as to whether your learning and personal goals for the hackweek are being met. If they aren’t, we will do everything we can to help you meet your goals.
Everyone at hackweek is primarily participating on a volunteer basis. Please respect that this is a new community growing on donated time. If resolution to any issue is beyond the scope of hackweek, we will direct you to more resources.

once formed, each team will be guided through exercises to help narrow in on a set of tasks that are doable within the 5 days. A brief project outline will be posted to GitHub, following the "Project Guidelines" below. Each morning will start with a daily stand-up meeting to check-in on progress and challenges. On day 3 we will have a mid-week project check-in. On day 5 each team will present their work in a series of lightning talks.

What can I do to prepare in advance?

if you have a project idea already brewing, we encourage you to share that with the team on our general Slack channel. We can add additional channels as the project ideas develop. Reel free to explore various projects and initiate conversations. The goal is to gather as much information as you can to inform your decision about which team to join when we meet in person. Contact a waterhack organizer if you would like assistance in assessing whether a project is well-scoped, or if you need help with a particular dataset.

Where can I find help ?

Instructor Tables

We have some tables where data scientists will congregate and participants can find any kind of support they need.

Instructor Bios

We have the Instructor Bios and some sticky notes put up at one end of the room. This can be used to identify who is available and what they want to be asked about.

Project Help-Desk Data Scientists

By definition, anything could happen in a hackweek project! If you want it to happen faster, ask us for help. If you want help avoiding a downturn in productivity, teamwork, or creativity, we can help come up with ideas and solutions.

TBD - someone as big as Jacob Deppen - support data scientists, project scientists, instructors and organizers.
TBD - someone as deep as Madhavi Srinivasan - support participants, especially new code contributors on Github and Jupyter Notebooks
Scott Black - support on HydroShare and almost every other question
Christina Bandaragoda- curious about anything that is not working, celebrating surprises by collecting data; handles anything that is not working (event design, questionable team behavior, awkward personal situations, or code of conduct violations).

Project guidelines

Each project requires a brief project summary in the readme.md of each GitHub project folder. Below is a template for the project summary. You can visit the project folder on the geohack GitHub page to see existing examples.

Before the hackweek, the project lead initiates a new repository in their Github account.

Project Overview: Why is this important? What is the potential here?

Shared Project Goals: [List the top three in terms of computer science and data science learning]

Create innovative visualizations for new datasets
Implement common methods on emerging platforms
Establish a new workflow for a new user or audience

Individual Learning Opportunities[Community Norms currently include Python 3+, HydroShare Data Repository, Github Code Repository]

Cloud computing with Jupyter Notebooks on CUAHSI HydroShare

List of Specific Software Libraries, Cyberinfrastructure Platforms, Datasets and Other Technologies

All programming levels welcome. We'll be primarily using Python but other languages can be used too!

On-boarding Steps: All programming levels welcome. Time Committment and Language Expectations. A. We'll be primarily using Python but other languages can be used too! B. It will take 10 min to read the materials for experienced python programmers with Github and HydroShare accounts. It will take 2-5 hours to onboard and get comfortable for those new coding or new users of these platforms.

Organizational and Personal Disclosures, Disclaimers, and Expectations

References and Links to Sources

At the hackweek, the project lead will pitch their project, and work with the Team to fork their repository to the Waterhackweek organization, and build a team plan for governing collaboration for the week, including a detailed description of the Team Project.

Project Title

Brief title describing the proposed work.

Collaborators on this project

List all participants on the project. Choose one team member to act as project lead, and identify one geohack organizer as the data science lead.

The problem

What geospatial problem are you going to explore? Provide a few sentences. If this is a technical exploration of software or data science methods, explain why this work is important in a broader context.

Application Example

List one specific application of this work.

Sample data

If you already have some data to explore, briefly describe it here (size, format, how to access).

Specific Questions

List the specific tasks you want to accomplish or research questions you want to answer.

Existing methods

How would you or others traditionally try to address this problem?

Proposed methods/tools

Building from what you learn at geohackweek, what new approaches would you like to try to implement?

Project Repository Guidelines

Each project requires a brief project summary in the readme.md of each project folder. Here is a template for the project summary. You can view existing projects to see examples.

Project Title

Brief title describing the proposed work.

Collaborators on this project

List all participants on the project. Choose one team member to act as project lead, and identify one geohack organizer as the data science lead.

The problem

What water data science problem are you going to explore? Provide a few sentences. If this is a technical exploration of software or data science methods, explain why this work is important in a broader context.

HydroShare Guidelines

What is HydroShare ?

HydroShare is CUAHSI's online collaboration environment for sharing data, models, and code.

Learn more about how to get started with HydroShare here : https://help.hydroshare.org/introduction-to-hydroshare/getting-started/

Why put my data on HydroShare?

Data storage and limits: While code repositories like Github are optimized for version control and code publication, they have a strict file size limitation of 100 MB and total repository limit of 1GB. HydroShare has a file size limit of 1 GB and each user receives automatically an allocation of 20 GB of space for their resources when they sign up. Users who need more space should contact [email protected] and explain who they are and why they need more space.

Sharing: each file uploaded to Hydroshare receives a hyperlink that can be shared for easy download and data sharing, even with colleagues who are not HydroShare users.

Publication: once your data is ready for formal publication, you can get a digital object identifier (DOI) for each HydroShare resource.

What is a HydroShare resource ? How can I upload and publish a resource ?

In HydroShare, the content you upload and publish is referred to as a "resource".

Learn more about how to upload and publish a resource here :

https://help.hydroshare.org/creating-and-managing-resources/

What kind of metadata should I add for my resource ?

When you create a new resource, you will be taken to the landing page for that resource, where you can edit the metadata. You can also edit the metadata elements of your resource at any point after its creation by clicking on the “Edit Resource” button on the landing page.

Learn more about how to describe your resource using different metadata elements : https://help.hydroshare.org/creating-and-managing-resources/best-practices-for-describing-your-resource-with-metadata/

What if I want to add custom/additional metadata for my resource ?

You can describe your resource using additional metadata elements beyond the default elements provided directly on the page. You can click on the plus icon next to “Additional Metadata” and mention the name and value of the new metadata element.

What if I have large number of resources or additional information about them ?

Sometimes, you may wish to add a detailed description of the contents of your resource, and it is inconvenient to type all of this text into the Abstract element or to include it as “Additional Metadata” elements.

One way to give detailed description of resource files is to include a README.txt or README.md file along with your resources. This README file can contain information related to but not restricted to:

Organization of different resource files into a directory structure
Description about the different attributes in the data resource(s). Additionally, one can add if the attribute is categorical or quantitative. If they are categorical and have been encoded, mention what each encoding means. For eg: 0 means female, 1 means male.

How can I cite my resource ?

Citations include any institutes/universities/organizations that are to be credited for creating and collecting the data in this resource.

https://help.hydroshare.org/creating-and-managing-resources/connecting-your-resource-data-with-your-publication/

Additional Links

Guide for Data Authors and Publishers

Read More Frequently Asked HydroShare Questions here

Best Practices for Designating Authorship

Access and Build on the Waterhackweek Project Archive

Project Checkpoints 2019

Project Name	Github Repo	slack channel	Hydroshare
Civil digital infrastructure for real-world health impacts made possible by information flow, data sharing, and data security that enables clean water for everyone.	whw2019_watermesh	#watermesh	[TBD]
Mapping Co-Occurrence of Geogenic Groundwater Contaminants of Concern	whw2019_Mapping_Groundwater_Contaminants	#map_gw_contaminants	[TBD]
Climate change and ecohydrologic response using Landlab	whw19_landlab_ecohydrology	#landlab_ecohydrology
Understanding the diversity of hydrological regimes in the Andean Amazon region	whw2019_AndeanAmazonHydro	#andeanamazonhydro	[TBD]
Effects of wildlife on Tundra hydrology and lake change	whw2019_northern-lakes	#northern_lakes	[TBD]
Snow modeling with SUMMA and pysumma	whw2019_snow_modeling	#snow_modeling
Tuolumne snowmelt during rain on snow events	whw2019_snowmelt	#snowmelt
Meteorological forcing data for NWM/WRF-Hydro	meteo_forcing_wrfhydro	#meteo_wrfhydro
How will climate change affect hydrologic extreme events in the Pacific Northwest	whw2019_extremeH2O	#extremeh20

Project Tools/Roles/Location 2019

Project Name	Data Science Tools	Data Science Lead	Project Lead	Location
Civil digital infrastructure for real-world health impacts made possible by information flow, data sharing, and data security that enables clean water for everyone.		Jim Phuong	Taina Rodriguez Curet	Puerto Rico
Mapping Co-Occurrence of Geogenic Groundwater Contaminants of Concern	matplotlib, plotly, pandas, xarray, numpy, scipy, cartopy	Rohit Khattar	Katya Cherukumilli	India, US
Climate change and ecohydrologic response using Landlab	Landlab , numpy , matplotlib , openpyxl , OGH	Sai Nudurupati	Kaiwen Wang	Central New Mexico, USA
Understanding the diversity of hydrological regimes in the Andean Amazon region	Tethys , Func-Flow	Yesica Leon-Tinoco	Guido A. Herrera-R	Andean Amazon in Ecuador and Brazil
Effects of wildlife on Tundra hydrology and lake change	Google Earth Engine, Visualization in Python	Matthew Bonnema	Paul Mann	Yukon Kuskokwim delta, Alaska
Snow Modeling with SUMMA and pysumma	pysumma , xarray		Andrew Bennett	California
Tuolumne snowmelt during rain on snow events	gasterio , numpy , geopandas , matplotlib	Steven Pestana	Lisa Katz	Upper Tuolumne Watershed
Meteorological forcing data for NWM/WRF-Hydro		Yu-Fen Huang	Yin-Phan Tsang	Hawaii
How will climate change affect hydrologic extreme events in the Pacific Northwest	Xarray,pandas,cartopy,seaborn	Yifan Cheng	Oriana Chegwidden	Pacific Northwest

Water related GeoHackWeek Archives 2018

Project Name	slack channel	Hydroshare
Freshwaterhack of geohackweek 2017: Haackwell		Haackwell Groundwater Geohackweek Project 2017
Freshwaterhack of geohackweek 2016: Oh Well
Web portal for carbon dioxide in global inland waters	#co2web
Snowmelt	#snowmelt

Name		Name	Last commit message	Last commit date
Latest commit History 80 Commits
LICENSE		LICENSE
README.md		README.md

License

waterhackweek/projects

Folders and files

Latest commit

History

Repository files navigation

Projects

Table of Projects 2020

Purpose of the projects:

What is hacking?

Why is this important?

How will the projects be conducted?

Project Lead

Data Science Lead

Team Participants

What can I do to prepare in advance?

Where can I find help ?

Instructor Tables

Instructor Bios

Project Help-Desk Data Scientists

Project guidelines

Before the hackweek, the project lead initiates a new repository in their Github account.

At the hackweek, the project lead will pitch their project, and work with the Team to fork their repository to the Waterhackweek organization, and build a team plan for governing collaboration for the week, including a detailed description of the Team Project.

Project Title

Collaborators on this project

The problem

Application Example

Sample data

Specific Questions

Existing methods

Proposed methods/tools

Project Repository Guidelines

Project Title

Collaborators on this project

The problem

HydroShare Guidelines

What is HydroShare ?

Why put my data on HydroShare?

What is a HydroShare resource ? How can I upload and publish a resource ?

What kind of metadata should I add for my resource ?

What if I want to add custom/additional metadata for my resource ?

What if I have large number of resources or additional information about them ?

How can I cite my resource ?

Additional Links

Access and Build on the Waterhackweek Project Archive

Project Checkpoints 2019

Project Tools/Roles/Location 2019

Water related GeoHackWeek Archives 2018

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 6

Packages