FeatureMatching: Prefix output matches files when using "ranges" to avoid overwrites #628

yann-lty · 2019-04-25T11:06:59Z

Description

Introduce an automatic prefix system for matches files when using "range" command-line parameters. The goal of this PR is to ensure a file created for one iteration does not get overwritten in another one.

When generating matches files per image, we rely on the minimal viewId of each image pair to create the filename (minViewId.txt). Iterating over sorted viewIds that only match with greater viewIds guarantees that each file is written only once.
However, this condition can be broken in the current pipeline when the file generated by the ImageMatching step declares, for a view [B], a match with a view [A] that has a lower viewId. This occurs when [A] has reached its maximum number of view matches: the [A,B] view match is then moved to [B]. If [A] and [B] are handled in two distinct iterations (1) and (2), "A.txt" is first written in (1) and then overwritten in (2). All matches from (1) are lost.
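
The overwrite described above can be reproduced with a small simulation. This is only a sketch and not AliceVision code: `FakeDisk`, `writeIteration` and `pairsLeftInFileOfA` are hypothetical names, and an in-memory map stands in for the output folder. The viewIds (A = 1, B = 5) are arbitrary.

```cpp
#include <algorithm>
#include <cstddef>
#include <map>
#include <string>
#include <utility>
#include <vector>

// Hypothetical stand-ins: a view pair, and a "disk" mapping filename -> pairs stored in that file.
using ViewPair = std::pair<int, int>;
using FakeDisk = std::map<std::string, std::vector<ViewPair>>;

// Writes the pairs of one iteration, one file per minimal viewId ("minViewId.txt").
// As with a file opened in truncate mode, a file written by an earlier
// iteration is replaced, not appended to.
void writeIteration(FakeDisk& disk, const std::vector<ViewPair>& pairs)
{
  FakeDisk files;
  for(const ViewPair& p : pairs)
    files[std::to_string(std::min(p.first, p.second)) + ".txt"].push_back(p);
  for(const auto& file : files)
    disk[file.first] = file.second; // overwrite any previous content
}

// Iteration (1) handles view A = 1; iteration (2) handles view B = 5,
// which declares a match with A because A reached its maximum number of matches.
std::size_t pairsLeftInFileOfA()
{
  FakeDisk disk;
  writeIteration(disk, {ViewPair(1, 2), ViewPair(1, 3)}); // iteration (1) writes "1.txt"
  writeIteration(disk, {ViewPair(5, 1)});                 // iteration (2) rewrites "1.txt"
  return disk["1.txt"].size();
}
```

In this sketch only the pair written by iteration (2) remains in "1.txt"; the two pairs from iteration (1) are gone.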

Automatically prefixing matches files based on the "range" parameters, and generating only one matches file per execution by default, guarantees that generated files are unique.
This also changes the loading of matches files, which now considers all "*matches.txt" files in the given folder. This behavior is backward compatible with previously generated files.
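
The file selection at load time could look like the following sketch. It assumes the rule is "filename ends with matches.txt"; `isMatchesFile` and `selectMatchesFiles` are hypothetical names, not the functions in io.cpp, and the sketch works on plain filename strings instead of listing a real folder.

```cpp
#include <algorithm>
#include <iterator>
#include <string>
#include <vector>

// Hypothetical predicate: true when 'filename' ends with "matches.txt".
// This accepts both the unprefixed name and prefixed names.
bool isMatchesFile(const std::string& filename)
{
  static const std::string suffix = "matches.txt";
  return filename.size() >= suffix.size() &&
         filename.compare(filename.size() - suffix.size(), suffix.size(), suffix) == 0;
}

// Keeps only the filenames that should be loaded and accumulated.
std::vector<std::string> selectMatchesFiles(const std::vector<std::string>& filenames)
{
  std::vector<std::string> selected;
  std::copy_if(filenames.begin(), filenames.end(), std::back_inserter(selected), isMatchesFile);
  return selected;
}
```

Under this assumption, an unprefixed "matches.txt" written by an earlier version is still selected, which is one way to read the compatibility statement above.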

Features list

FeatureMatching: Generate a prefix for matches files based on range parameter
FeatureMatching: Generate only one matches file per execution by default
IO: Load all "*.matches.txt" files in the given folder and accumulate matches
IO: Update IndMatch_IO test
IO: Remove duplicated input folders when loading matches/features
SfmData: manage features/matches folders internally (relative paths, duplicates removal...)

Implementation remarks

The prefix is created by dividing rangeStart by rangeSize, which gives the iteration number when working by chunks. This is preferred to random naming for debugging purposes.
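
A minimal sketch of that prefix computation follows. The function names, the "." separator, the "matches.txt" base name and the treatment of non-positive values as "no range given" are all assumptions made for illustration; only the integer division of rangeStart by rangeSize comes from the description above.

```cpp
#include <string>

// Hypothetical helper: returns the prefix for one execution.
// A negative rangeStart or a non-positive rangeSize is taken to mean
// "no range given", in which case no prefix is added (assumption).
std::string makeMatchesFilePrefix(int rangeStart, int rangeSize)
{
  if(rangeStart < 0 || rangeSize <= 0)
    return "";
  // Integer division: the index of the chunk being processed.
  return std::to_string(rangeStart / rangeSize) + ".";
}

// Hypothetical helper: full output filename for one execution.
std::string makeMatchesFilename(int rangeStart, int rangeSize)
{
  return makeMatchesFilePrefix(rangeStart, rangeSize) + "matches.txt";
}
```

With chunks of 40 views, the executions starting at views 0, 40 and 80 would write "0.matches.txt", "1.matches.txt" and "2.matches.txt" in this sketch, so no two executions share a filename.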

When using range parameters from main_featureMatching, prefix the resulting files with rangeStart/rangeSize (i.e. the iteration index when processing all views by chunks).
=> with matchFilePerImage: avoids overwriting files if a view is present in several iterations
=> without matchFilePerImage: avoids overwriting the unique resulting file
* io: consider all files containing "matches.txt" when loading matches files from a folder

`matchFilePerImage` was activated by default to handle parallel runs of `featureMatching` on different ranges of views of the same scene. This is now handled by the prefix added to output files when using range related parameters.

* matches are accumulated when successively loading several files using the same PairwiseMatches variable without clearing it
* check for duplicates and test deduplication function
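
The accumulation and deduplication described above can be sketched as follows. The types are simplified stand-ins (plain pairs instead of AliceVision's match and container types), and `accumulate` and `removeDuplicates` are hypothetical names, not the functions tested in IndMatch_IO.

```cpp
#include <algorithm>
#include <cstddef>
#include <map>
#include <utility>
#include <vector>

// Simplified stand-ins for illustration only.
using Match = std::pair<unsigned, unsigned>;    // (feature index in view I, feature index in view J)
using ViewPair = std::pair<unsigned, unsigned>; // (viewId I, viewId J)
using PairwiseMatchesSketch = std::map<ViewPair, std::vector<Match>>;

// Appends the content of one loaded file to 'all' without clearing it first.
void accumulate(PairwiseMatchesSketch& all, const PairwiseMatchesSketch& loaded)
{
  for(const auto& entry : loaded)
  {
    std::vector<Match>& dst = all[entry.first];
    dst.insert(dst.end(), entry.second.begin(), entry.second.end());
  }
}

// Removes matches listed more than once for the same view pair.
// Returns the number of removed matches. Note that this sorts each match list.
std::size_t removeDuplicates(PairwiseMatchesSketch& all)
{
  std::size_t removed = 0;
  for(auto& entry : all)
  {
    std::vector<Match>& matches = entry.second;
    const std::size_t before = matches.size();
    std::sort(matches.begin(), matches.end());
    matches.erase(std::unique(matches.begin(), matches.end()), matches.end());
    removed += before - matches.size();
  }
  return removed;
}
```

Sorting before `std::unique` is a design choice of this sketch: it removes non-adjacent duplicates at the cost of changing the order of matches.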

Ensure a folder is not considered twice when loading features and matches.

simogasp

kudos for adding unit tests! 👍

src/aliceVision/matching/io.cpp

src/aliceVision/sfmData/SfMData.cpp

* handle conversions to relative and absolute paths when _absolutePath is defined
* update internal relative folder paths when absolutePath is modified
* ensure features/matches folders contain no duplicates
* [tests] add 'sfmData_test' module + test internal folders management
* [software] update global/incrementalSfm

fabiencastan · 2019-06-04T15:44:16Z

We should update the major version of main_featureMatching.cpp (and remember to update it in Meshroom).

fabiencastan · 2019-06-06T21:07:36Z

src/aliceVision/sfmData/sfmData_test.cpp

+  BOOST_CHECK(fs::path(featuresFolders[0]).is_absolute());
+  BOOST_CHECK(fs::equivalent(featuresFolders[0], refFolder));
+  BOOST_CHECK(fs::path(matchesFolders[0]).is_absolute());
+  BOOST_CHECK(fs::equivalent(matchesFolders[0], refFolder));


We could try to add a new folder, like:

sfmData.setAbsolutePath(fs::absolute(filename).string());
sfmData.addFeaturesFolder((fs::absolute(filename) / "uselessFolder/..").string()); // duplicates from non-canonical path

to check the canonical path conversion with duplicates from different input strings.
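
The idea behind such a test can be sketched outside of SfMData. The sketch below uses C++17 `std::filesystem` rather than the filesystem library used in the repository, and `addFolderOnce` is a hypothetical helper, not a method of SfMData. Note that `std::filesystem::canonical` requires the path to exist.

```cpp
#include <algorithm>
#include <filesystem>
#include <string>
#include <vector>

namespace fs = std::filesystem;

// Hypothetical helper: stores the canonical form of 'folder' unless it is already listed.
// Returns true when the folder was added.
// fs::canonical throws if the path does not exist.
bool addFolderOnce(std::vector<std::string>& folders, const std::string& folder)
{
  const std::string canonical = fs::canonical(folder).string();
  if(std::find(folders.begin(), folders.end(), canonical) != folders.end())
    return false; // same folder reached through a different input string
  folders.push_back(canonical);
  return true;
}
```

If "base" contains a directory "uselessFolder", adding "base" and then "base/uselessFolder/.." leaves a single entry, because both strings resolve to the same canonical path.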

yann-lty requested review from simogasp, fabiencastan and gregoire-dl April 25, 2019 11:07

yann-lty added 2 commits April 25, 2019 13:12

yann-lty force-pushed the dev_matchingIO branch from e761d1a to e830096 Compare April 25, 2019 11:13

yann-lty added 2 commits April 25, 2019 13:13

[tests] indMatch: update IO test

3d137e8

[io] remove duplicated folders when loading features/matches

e830096

yann-lty force-pushed the dev_matchingIO branch from e6eace0 to 8e012c6 Compare April 26, 2019 08:50

yann-lty mentioned this pull request Apr 26, 2019

Make range complete block size accessible to command line nodes alicevision/Meshroom#454

Merged

yann-lty marked this pull request as ready for review April 26, 2019 13:12

simogasp approved these changes Apr 29, 2019

yann-lty force-pushed the dev_matchingIO branch from 54e3f76 to abc661c Compare April 29, 2019 12:36

yann-lty added 2 commits April 29, 2019 14:36

[matching] IO: formatting + doc

abc661c

fabiencastan added this to the 2019.2 milestone Jun 5, 2019

[software] featureMatching: update major version as the IO naming con…

0ddefa2

…vention has changed. This new version is able to load files with the previous naming convention, but not the opposite.

fabiencastan reviewed Jun 6, 2019

fabiencastan merged commit 132ac39 into develop Jun 7, 2019

fabiencastan deleted the dev_matchingIO branch June 7, 2019 10:24
