Skip to content
Joseph Ramsey edited this page Nov 20, 2024 · 381 revisions

Changes for upcoming 7.6.6

Main changes.

  1. Made some corrections to the implementation to the final FCI orientation rules in response to a review from Peter Spirtes.
  2. Updated discriminating path rule to make it more efficient.
  3. Refactored FciOrient to always use discriminating path rule.
  4. Reimplemented R8, R9, R10 in final FCI orientation rules for efficiency.
  5. Revised implementation of the AD Tree class and incorporated it into the Chi-Square and G-Square discrete independence tests as an alternative to sample counting.
  6. Revised DiscreteBicScore to ignore unattested categories for variables.
  7. Removed several "experimental" designations for algorithms.
  8. Added an implementation of CellTable using AD Trees for space-savings and switched discrete ChiSquare to use it.
  9. Improved AdTree for efficiency and clarity.
  10. Improved MvpBicScore.
  11. Added InterruptedException to MarkovCheck and handling for it in several classes.
  12. Added method to calculate IDA-like distances for nodes.
  13. Added CpdagParentDistancesFromTrue class.
  14. Added column to bootstrap table to show edge properties.
  15. Refacted all sepset finding methods into a new class, SepsetFinder, and added several new new methods.
  16. Improved logging consistency across LV algorithms.
  17. Improved LvLite.
  18. Added "recursive msep" and "noncolliders only" options to MarkovCheckEditor.
  19. Added selection bias option to nodes in the graph editor.
  20. Adjusted DagToPag to account correctly for selection bias.
  21. Added method to produce data with selection nodes conditioned to > 0 as a new DataBox item.
  22. Refactored logic to find unshielded colliders and to R4 discriminating path orientations into a separate interface.
  23. Implemented the R0R0 methods for both test-based and DagToPag contexts.
  24. Added logic in R4 to handle the case of multiple discriminating paths between two nodes for the test-based implementation (using the msep path blocking method).
  25. Removed independence test caching for the IndTestFisherZ class, which was causing errors in places.
  26. Added nodewise Markov check methods.
  27. Did several optimizations for PAG-related functions to improve speed of interface.
    • Cached some types of paths in the graph class.
    • Replaced some depth-first methods with breadth-first equivalents.
    • For looking for latent common causes, limited length of treks (correctly) to 3.
    • Introduced the PagCache class to store calculated PAGs for DAGs and DAGs for their calculated PAGs, for speed of lookup, so DagToPag only needs to be called once per DAG.
  28. Added missing Javadocs, corrected some errors.
  29. Made repairs to testwise deletion and optimized it.

Selected Minutiae.

  1. Made table counting classes for the chi-square and g-square tests immutable to be safe in multi-threaded environments in response to a worry discovered when moving to JDK 21.1.
  2. Replaced block real matrix with MatrixUtils where used.
  3. Corrected pseudoinverse calculation.
  4. Improved fast adjacency search.
  5. Improved KCI implementation. (Still needs work.)
  6. Removed Meek verbose flag where used.
  7. In MultiDimensionalIntTable and CellTable, replaced Longs with Integers to save space.
  8. Added "no rows" check to ChiSquareTable for efficiency.
  9. Improved MnlrLilklihood and MvpLikelihood.
  10. In AdTree, refactored Subdivision class to use Java record.
  11. Added validation checks to AdTree.
  12. Added bootstrap row sampling to MarkovCheck and AdTree.
  13. Handled empty total effects in various methods.
  14. Corrected some edge marking logic in AbstractWorkbench.
  15. Refactored compareTo() method in IndependenceFact for efficiency.
  16. Added menu items to the Search box to save graphs in various formats.

To do:

  1. Get final bootstrapping changes from Kevin and Jessi.
  2. Clarify legal PAG check with Peter. (I.e., why am I missing tail endpoints still?)
  3. Update documentation.