Enhancement#305 gridded global datav5 #323

nip5 · 2018-08-06T20:07:48Z

Merge gridded 250m soil data extraction functionality.

… like conus

…tion

…t tests

…cludefield

nip5 · 2018-09-12T20:28:14Z

Some new lintr tests will fail because of commit b28734c but all tests pass for my function.

R/ExtractData_Soils.R

…com/DrylandEcology/rSFSW2 into Enhancement#305_GriddedGlobalDatav5

Addressed

R/ExtractData_Soils.R

demo/SFSW2_project_descriptions.R

tests/testthat/test_ExtractData_Soils.R

dschlaep

My review is commenting on the code, but does not include running the tests and check whether the code produces what we want. @CaitlinA can do that better and should give the final ok, once all else is resolved. Thanks.

tests/testthat/test_ExtractData_Soils.R

dschlaep

These are non-standard tests because they require data outside the source package, i.e., they are not self-sufficient.

I think that this is ok in this case, but it requires substantial documentation at the top of this file for future developers, e.g., explain exactly what external data files are required and how to set up the paths correctly.

I found that your idea was good with the run_tests which was turned off by default. Unfortunately, you removed this switch. This switch prevented these tests to be run (and fail) unless a future developer knows exactly how to set up these specific tests.

You define paths and the code does not check whether or not these exists. The tests as they are currently will fail the paths and files don't exist, i.e., anytime these tests are run not on your correctly setup machine -- e.g., this fails on @CaitlinA 's or my machine. I suggest that these tests should skip with some appropriate message if the paths are not correctly set up and the necessary files do not exists.

dschlaep · 2019-03-14T16:47:00Z

tests/testthat/test_ExtractData_Soils.R

-sim_size$runsN_todo <- 0
-sim_size$digitsN_total <- 3
-sim_size$runIDs_sites_by_dbW <- c(1, 2, 3, 4, 5)
+suppressWarnings(is_online <-


Why do you test whether this has online access? I don't see anything in your tests that requires online access.

You asked me to model setting up whether or not to skip the tests on other tests, thus I removed the run_tests flag since no other test does it that way but I do agree that it was useful. The code being tested will tell you if the paths are wrong ie. if you run the tests with an incorrect soils extraction path if will tell you the folder or file does not exist. Are you suggesting that I check those in the tests before running them as well?

I only referred to the use of skip* functions which are intended for use within test_that() blocks, i.e., they don't skip if outside a test_that block as you had it

I didn't suggest at all that you test online access and I didn't say anything about removing run_tests

If you don't skip your tests if the external files don't exist, then running devtools::test() will fail -- except on your specific machine. I think this is bad. You are introducing here a new type of tests to our package -- your tests are the first to assume data outside the source package and besides online accessible data. Thus, you need to handle your new test gracefully and differently such that other people's workflow is not being broken.

I understand the intended use of the skip functions but it sounds much more helpful to give a user a reason for skipping the test using skip() outside of testthat functions rather than giving no explanation when skipping by simply bypassing the code as is the case in tests such as test_net_CDF_function.R.

skipping the test using skip() outside of testthat functions

This doesn't work. Have you ever tried it out???

give a user a reason for skipping the test ... rather than giving no explanation

No one prevented you from providing a reason for skipping; it would be easy enough to add a if (!any(do_skip)) {} else {test_that("block name", {skip(your message)})}. Plus, a package user never sees the tests; it is only developers that see the tests; thus, the code for do_skipcalculation intest_netCDF_functions.R` provides a detailed explanation in the comments.

dschlaep

I still would like to see some explanation at the top of this file that explains the special nature of these tests including stating that data that are not included in the package are required and need to be prepared manually -- as it is, the next developer will waste time wading through your test code, has to figure out that she/he needs to manually set dir_ex_soil; will realize that the tests still fail; will have to dig further and realize that function extract_soil_ISRIC250m needs special folder hierarchy and files, etc.

Why not a few sentences documenting and explaining???

dschlaep · 2019-03-14T18:39:41Z

tests/testthat/test_ExtractData_Soils.R

  # =============================================================================
  # Tests designed to test the underlining structures created
  # from soil extraction functions.
  # =============================================================================

-  # set stage manually so that future changes won't cause the test to fail ======
  # setup file paths
  fnames_in <- environment()
  fnames_in$fslayers <- file.path("/home/natemccauslin/Desktop/Dryland Ecology/rSFSW2/tests/test_data/TestPrj4/1_Input/SWRuns_InputData_SoilLayers_v9.csv")


Why absolute paths for fslayers and fsoils specific to your computer? The folder TestPrj4 is part of this package source, thus, it will always be the same and you should use a relative path, e.g., see example in test_projects.R which uses dir_tests <- file.path("..", "test_data", "TestPrj4")

dschlaep · 2019-03-14T18:42:53Z

tests/testthat/test_ExtractData_Soils.R

  dir_ex_soil <- "/media/natemccauslin/SOILWAT_DATA/GIS/Data/Soils/"
+
+  # if any of the file paths above are invalid, skip all tests and setup
+  if (!all(file.exists(c(fnames_in$fslayers, fnames_in$fsoils, dir_ex_soil)))) {


You don't check that dir_ex_soil contains the required content for function extract_soil_ISRIC250m. Let's assume that dir_ex_soil exists but the person has not downloaded the specific dataset, then this still fails because of

dir.ex.gridded <- file.path(dir_ex_soil, "ISRIC", "GriddedGlobalV5") # stop program execution if folder path is incorrect if (!dir.exists(dir.ex.gridded)) stop(paste0("Folder '", dir.ex.gridded, "' does not exist"))

The unit test shouldn't be testing if the directory contains the files before the function is called because that is done in extract_soil_ISRIC250m so it is done regardless of how the function is being called. Basically if I were to do that I'd be checking it twice.

Having the function fail because the person doesn't have the correct dataset is by design.

Having the function fail because a user doesn't have the files is correct; but having the unit tests fail because a developer doesn't have the files is bad -- the unit tests should skip.

dschlaep · 2019-03-14T18:48:54Z

tests/testthat/test_ExtractData_Soils.R

+
+  # if any of the file paths above are invalid, skip all tests and setup
+  if (!all(file.exists(c(fnames_in$fslayers, fnames_in$fsoils, dir_ex_soil)))) {
+    skip("File paths incorrectly configured for Soils Extraction tests")


I'm repeating myself skip doesn't work outside test_that blocks.

Why don't you run the package tests on another machine that doesn't have GriddedGlobalV5 installed? Something like

Sys.setenv(NOT_CRAN = "true") Sys.setenv(RSFSW2_ALLTESTS = "true") devtools::test()

It seems to work, do you mean you don't like it outside of testthat?

You are correct skip does work correctly outside of test_that blocks -- this must be a new feature of more recent testthat versions because it didn't work in earlier versions when I checked it out originally several years back (e.g., discussion on this r-lib/testthat@b1e41a0).

Apparently, the devel version of testthat also introduced an skip_if_offline (r-lib/testthat@be8e6b6) which will be nice for us!

Ok, I am fine with your skips, but there is still the problem that the tests are not skipped and fail if the correct external files are not available.

nip5 added 30 commits June 26, 2018 10:49

Started integrating new soil data (100m)

23b6834

extracts sand/clay/bd successfully

33a4e93

Integrated gravel data from 250m

8943c9b

Refactored code to run without as much user guidance

cdd2294

Improved some soil calculations, added some environment changes

f49e3e4

Moved the assignment in certain fields out of new function to be more…

680ddfe

… like conus

starting integrating 250m depth

8497ff5

made adjustments for rSFSW2 tests

20341ed

some performance enhancements

246d291

started merge with decadel aggregations

bcdebb4

Merged with feature_DecadelAggregations, implemented cell soil extrac…

6063ecf

…tion

Removed database comparisions in TestPrj4 to allow passing of all uni…

0eca671

…t tests

Added unit test file for testing soil extraction related functions

c1e8dc9

Added unit tests for get_datasource_masterfield and get_datasource_in…

c14507c

…cludefield

Added test for prepare_ExtractData_Soils

5a1cee0

Some print statements only used during development

41630f9

Added Roxygen documentation to do_ExtractSoilDataFrom100m

b1df9f5

Remove extra comment weight

6af2dc7

Fix issue with unit tests getting hung up after conversion from = to <-

9229243

Fix extracted gravel data being unusually low

4c0bf91

Add depth tif integration

e7d332e

Add unit test for update_soils_input

1a75875

Add unit test for new extraction function

91d1f53

Fix testPrj4 now works again

575e35b

Add resume functionality to function

bea0f35

Remove print statements and debug comments

71b714b

Change back models flag for CI tests

7ae6713

Remove 250m extaction unit test for CI

367b6aa

Delete .RData

d73f71b

Delete rSFSW2.Rproj

0775cb0

dschlaep previously requested changes Sep 13, 2018

View reviewed changes

R/ExtractData_Soils.R Outdated Show resolved Hide resolved

dschlaep reviewed Sep 13, 2018

View reviewed changes

R/ExtractData_Soils.R Outdated Show resolved Hide resolved

dschlaep reviewed Sep 13, 2018

View reviewed changes

R/ExtractData_Soils.R Outdated Show resolved Hide resolved

nip5 added 8 commits January 30, 2019 10:42

merge with master

f5b3473

lintr compliance fixes

9e87219

moved verbose message out of main loop

2b163e3

modify extract_soilISRIC250m function description

f1bb913

modify MMC usage in ISRIC extraction function

be96b2c

add extract soil test with on/off flag

2be512b

Merge branch 'master' into Enhancement#305_GriddedGlobalDatav5

3731d90

add catch if file to extract is found but has bad content

1e62d13

nip5 dismissed CaitlinA’s stale review via 1e62d13 March 8, 2019 20:20

nip5 added 3 commits March 8, 2019 13:20

Merge branch 'master' into Enhancement#305_GriddedGlobalDatav5

6c772c7

Merge branch 'Enhancement#305_GriddedGlobalDatav5' of https://github.…

ca1f319

…com/DrylandEcology/rSFSW2 into Enhancement#305_GriddedGlobalDatav5

lintr compliance

4e9a4cb

nip5 requested review from dschlaep and CaitlinA March 13, 2019 19:11

dschlaep requested changes Mar 13, 2019

View reviewed changes

nip5 added 5 commits March 13, 2019 14:46

remove semicolons in extract soils unit test

349a4e8

add spaces between ) and { in some instances

2d978b8

fix odd indenting

b5daa1d

pulled out percent_div to set it as a default value

df084f4

remove SSURGO soils data flag as it is not implemented

1696fda

dschlaep requested changes Mar 14, 2019

View reviewed changes

tests/testthat/test_ExtractData_Soils.R Outdated Show resolved Hide resolved

change CI fix format

9226360

dschlaep requested changes Mar 14, 2019

View reviewed changes

add check for file configurations in unit test

5f2c10d

dschlaep requested changes Mar 14, 2019

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Enhancement#305 gridded global datav5 #323

Enhancement#305 gridded global datav5 #323

nip5 commented Aug 6, 2018

nip5 commented Sep 12, 2018

dschlaep left a comment

dschlaep left a comment

dschlaep Mar 14, 2019

nip5 Mar 14, 2019

dschlaep Mar 14, 2019

nip5 Mar 14, 2019

dschlaep Mar 14, 2019

dschlaep left a comment

dschlaep Mar 14, 2019

dschlaep Mar 14, 2019

nip5 Mar 14, 2019

dschlaep Mar 14, 2019

dschlaep Mar 14, 2019

nip5 Mar 14, 2019

dschlaep Mar 14, 2019

Enhancement#305 gridded global datav5 #323

Are you sure you want to change the base?

Enhancement#305 gridded global datav5 #323

Conversation

nip5 commented Aug 6, 2018

nip5 commented Sep 12, 2018

dschlaep left a comment

Choose a reason for hiding this comment

dschlaep left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

dschlaep left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment