Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Readd clustering #281

Merged
merged 78 commits into from
Feb 24, 2022
Merged
Show file tree
Hide file tree
Changes from 19 commits
Commits
Show all changes
78 commits
Select commit Hold shift + click to select a range
e8f91f8
Clustering with pseudonymized reports
SimDing Jan 28, 2022
aa426da
Merge branch 'master' into readd-clustering
SimDing Jan 28, 2022
42e4959
Fix merge
SimDing Jan 28, 2022
d649b64
Make bgfs not fail when the cauchy function can't be initialized prop…
SimDing Jan 28, 2022
21449f7
Fix: Clustering fails when there are no clusters
SimDing Jan 28, 2022
72b9004
Undo change by java language server
SimDing Jan 28, 2022
ecda92e
Fix javadoc error
SimDing Jan 28, 2022
82a183a
Fix: Do not use java 16 features
SimDing Jan 28, 2022
796620e
Complain about missing submodule
SimDing Jan 28, 2022
75d7ecd
Remove unnecessary clustering option
SimDing Feb 10, 2022
32a1285
Move clustering options to dedicated class
SimDing Feb 11, 2022
c26a26f
Add clustering options to CLI
SimDing Feb 11, 2022
7e8472d
Clustering tests
SimDing Feb 11, 2022
d76a1ff
Merge branch 'master' into readd-clustering
SimDing Feb 11, 2022
b054791
Rename top down to agglomerative clustering
SimDing Feb 11, 2022
f7aef77
Remove unused class
SimDing Feb 11, 2022
f3fb600
Apply spotless
SimDing Feb 11, 2022
0988fec
Add some more javadoc
SimDing Feb 11, 2022
82d1353
Spotless
SimDing Feb 11, 2022
66b6921
Mark clustering options as final
SimDing Feb 11, 2022
4c40d55
Remove useless method
SimDing Feb 11, 2022
4f5506b
Remove PseudonymizedReports submodule
SimDing Feb 16, 2022
6acab02
JDoc comment for CligroupHelper
SimDing Feb 16, 2022
52b59e1
Better names for the clustering algorithm enum and interface
SimDing Feb 16, 2022
cb1f9ad
JDoc for ClusteringAdapter
SimDing Feb 16, 2022
63f45c3
Made interface Cluster into an interface
SimDing Feb 16, 2022
d9ae5c7
Move nested class cluster out of ClusteringResult
SimDing Feb 16, 2022
674c56f
Make ClusteringResult class instead of interface
SimDing Feb 16, 2022
acc5f0a
Abbreviation GP -> Gaussian Process
SimDing Feb 16, 2022
380b501
JDoc comments for ClusteringOptions
SimDing Feb 16, 2022
828a57f
Abbreviation ca -> clusteringAlgorithm
SimDing Feb 16, 2022
4c4ca46
JDoc comment for ClusteringResult
SimDing Feb 16, 2022
a58f4ad
Abbreviation avg -> average
SimDing Feb 16, 2022
1825261
Abbreviation idx -> index
SimDing Feb 16, 2022
be40d0b
"Abbreviation" N -> numberOfSubmissions
SimDing Feb 16, 2022
f8b3cf4
More renaming
SimDing Feb 16, 2022
0679092
Rename Preprocessor -> ClusteringPreprocessor
SimDing Feb 16, 2022
3a9d6c4
abbrevation prelim -> preliminary
SimDing Feb 16, 2022
ac4afc8
More abbreviations
SimDing Feb 16, 2022
66ceeb5
Remove MergedList
SimDing Feb 16, 2022
8f56968
Move nested classes out of agglomerative clustering
SimDing Feb 16, 2022
3cabcd3
Use ClusteringOptions in SpectralClustering
SimDing Feb 16, 2022
e4625f0
Spotless
SimDing Feb 16, 2022
5d9cf11
Fix tests
SimDing Feb 16, 2022
360a0d5
Spotless
SimDing Feb 16, 2022
1330299
Fix javadoc for ClusteringResult
SimDing Feb 16, 2022
1c7ef6a
abbrevation tp -> thresholdPreprocessor
SimDing Feb 16, 2022
f84deee
abbrevation cdf -> CumulativeDistributionFunction
SimDing Feb 16, 2022
4eb6fb9
No abbrevations in SpectralClustering
SimDing Feb 16, 2022
2ab46c9
abbreviations in baysian optimization
SimDing Feb 16, 2022
89dd8c6
Move PreprocessorHelper out of ClusteringPreprocessor
SimDing Feb 16, 2022
85aaaa6
Moved PreprocessedClusteringAlgorithm out of ClusteringPreprocessor
SimDing Feb 16, 2022
4bffe8c
More JDoc for Cluster
SimDing Feb 16, 2022
d7fe068
Changed ClusteringPreprocessor to post process only indices
SimDing Feb 16, 2022
3426eb6
Fix JDoc
SimDing Feb 16, 2022
89a8ee3
Spotless
SimDing Feb 16, 2022
05dbd79
Fix JDoc
SimDing Feb 16, 2022
0f14e3c
Rename E -> percentagesOfSimilaritySums
SimDing Feb 18, 2022
70244c7
Renaming of Preprossors enum
SimDing Feb 18, 2022
bc8f8d3
Renameing + JDoc in GP
SimDing Feb 18, 2022
5ac7248
Remove unused methods
SimDing Feb 18, 2022
93110cc
Move preprocessor creation into enum
SimDing Feb 20, 2022
5004fd0
Move cluster similarity calculation into enum
SimDing Feb 20, 2022
82037bc
Renaming stuff in BaysianOptimization
SimDing Feb 20, 2022
bf8bdc7
Renaming in gaussian process
SimDing Feb 20, 2022
8496fbb
Renaming in percentile preprocessor
SimDing Feb 20, 2022
a8f4922
Clean up ClusteringTest
SimDing Feb 20, 2022
f8ccf50
Spotless
SimDing Feb 20, 2022
4ae5ed7
Rename params in baysian optimization
SimDing Feb 20, 2022
10ec0e5
Extract test data from AgglomerativeClusteringTest
SimDing Feb 21, 2022
4f9cb03
Test spectral clustering with same data as agglomerative clustering
SimDing Feb 21, 2022
b28b7a0
Tests for threshold preprocessor
SimDing Feb 21, 2022
ce66be5
Test for percentile preprocessor
SimDing Feb 21, 2022
d5e48a2
Test for cdf preprocessor
SimDing Feb 21, 2022
4ada621
Test for cluster
SimDing Feb 24, 2022
5797920
Test for clustering result
SimDing Feb 24, 2022
84f4dbd
Remove traces of git submodule.
tsaglam Feb 24, 2022
6d82ee7
Merge pull request #5 from jplag/readd-clustering-submodule-fix
SimDing Feb 24, 2022
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
3 changes: 3 additions & 0 deletions .gitmodules
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
[submodule "jplag/src/test/resources/de/jplag/PseudonymizedReports"]
path = jplag/src/test/resources/de/jplag/PseudonymizedReports
url = https://github.com/jplag/PseudonymizedReports
SimDing marked this conversation as resolved.
Show resolved Hide resolved
11 changes: 11 additions & 0 deletions jplag/pom.xml
Original file line number Diff line number Diff line change
Expand Up @@ -20,6 +20,17 @@
<artifactId>system-rules</artifactId>
<scope>test</scope>
</dependency>
<dependency>
<groupId>org.apache.commons</groupId>
<artifactId>commons-math3</artifactId>
<version>3.6.1</version>
</dependency>
<dependency>
<groupId>org.mockito</groupId>
<artifactId>mockito-core</artifactId>
<version>4.0.0</version>
<scope>test</scope>
</dependency>

<dependency>
<groupId>de.jplag</groupId>
Expand Down
59 changes: 45 additions & 14 deletions jplag/src/main/java/de/jplag/CLI.java
Original file line number Diff line number Diff line change
@@ -1,30 +1,24 @@
package de.jplag;

import static de.jplag.CommandLineArgument.BASE_CODE;
import static de.jplag.CommandLineArgument.COMPARISON_MODE;
import static de.jplag.CommandLineArgument.DEBUG;
import static de.jplag.CommandLineArgument.EXCLUDE_FILE;
import static de.jplag.CommandLineArgument.LANGUAGE;
import static de.jplag.CommandLineArgument.MIN_TOKEN_MATCH;
import static de.jplag.CommandLineArgument.RESULT_FOLDER;
import static de.jplag.CommandLineArgument.ROOT_DIRECTORY;
import static de.jplag.CommandLineArgument.SHOWN_COMPARISONS;
import static de.jplag.CommandLineArgument.SIMILARITY_THRESHOLD;
import static de.jplag.CommandLineArgument.SUBDIRECTORY;
import static de.jplag.CommandLineArgument.SUFFIXES;
import static de.jplag.CommandLineArgument.VERBOSITY;
import static de.jplag.CommandLineArgument.*;

import java.io.File;
import java.util.Optional;
import java.util.Random;

import net.sourceforge.argparse4j.ArgumentParsers;
import net.sourceforge.argparse4j.inf.ArgumentParser;
import net.sourceforge.argparse4j.inf.ArgumentParserException;
import net.sourceforge.argparse4j.inf.Namespace;

import de.jplag.clustering.Algorithms;
import de.jplag.clustering.ClusteringOptions;
import de.jplag.clustering.Preprocessors;
import de.jplag.clustering.algorithm.AgglomerativeClustering;
import de.jplag.exceptions.ExitException;
import de.jplag.options.JPlagOptions;
import de.jplag.options.LanguageOption;
import de.jplag.options.SimilarityMetric;
import de.jplag.options.Verbosity;
import de.jplag.reporting.Report;
import de.jplag.strategy.ComparisonMode;
Expand All @@ -42,6 +36,8 @@ public class CLI {
"More Abstract than Tree", "Students Nightmare", "No, changing variable names does not work", "The tech is out there!"};

private static final String PROGRAM_NAME = "jplag";
static final String CLUSTERING_GROUP_NAME = "Clustering";
static final String CLUSTERING_PREPROCESSING_GROUP_NAME = "Clustering - Preprocessing";

private final ArgumentParser parser;

Expand Down Expand Up @@ -71,8 +67,9 @@ public static void main(String[] args) {
*/
public CLI() {
parser = ArgumentParsers.newFor(PROGRAM_NAME).build().defaultHelp(true).description(generateDescription());
CliGroupHelper groupHelper = new CliGroupHelper(parser);
for (CommandLineArgument argument : CommandLineArgument.values()) {
argument.parseWith(parser);
argument.parseWith(parser, groupHelper);
}
}

Expand Down Expand Up @@ -115,6 +112,40 @@ public JPlagOptions buildOptionsFromArguments(Namespace namespace) {
options.setMaximumNumberOfComparisons(SHOWN_COMPARISONS.getFrom(namespace));
ComparisonMode.fromName(COMPARISON_MODE.getFrom(namespace)).ifPresentOrElse(it -> options.setComparisonMode(it),
() -> System.out.println("Unknown comparison mode, using default mode!"));

ClusteringOptions.Builder clusteringBuilder = new ClusteringOptions.Builder();
Optional.ofNullable((Boolean) CLUSTER_ENABLE.getFrom(namespace)).ifPresent(clusteringBuilder::enabled);
Optional.ofNullable((Algorithms) CLUSTER_ALGORITHM.getFrom(namespace)).ifPresent(clusteringBuilder::algorithm);
Optional.ofNullable((SimilarityMetric) CLUSTER_METRIC.getFrom(namespace)).ifPresent(clusteringBuilder::similarityMetric);
Optional.ofNullable((Float) CLUSTER_SPECTRAL_BANDWIDTH.getFrom(namespace)).ifPresent(clusteringBuilder::spectralKernelBandwidth);
Optional.ofNullable((Float) CLUSTER_SPECTRAL_NOISE.getFrom(namespace)).ifPresent(clusteringBuilder::spectralGPVariance);
Optional.ofNullable((Integer) CLUSTER_SPECTRAL_MIN_RUNS.getFrom(namespace)).ifPresent(clusteringBuilder::spectralMinRuns);
Optional.ofNullable((Integer) CLUSTER_SPECTRAL_MAX_RUNS.getFrom(namespace)).ifPresent(clusteringBuilder::spectralMaxRuns);
Optional.ofNullable((Integer) CLUSTER_SPECTRAL_KMEANS_ITERATIONS.getFrom(namespace))
.ifPresent(clusteringBuilder::spectralMaxKMeansIterationPerRun);
Optional.ofNullable((Float) CLUSTER_AGGLOMERATIVE_THRESHOLD.getFrom(namespace)).ifPresent(clusteringBuilder::agglomerativeThreshold);
Optional.ofNullable((AgglomerativeClustering.InterClusterSimilarity) CLUSTER_AGGLOMERATIVE_INTER_CLUSTER_SIMILARITY.getFrom(namespace))
.ifPresent(clusteringBuilder::agglomerativeInterClusterSimilarity);
Optional.ofNullable((Boolean) CLUSTER_PREPROCESSING_NONE.getFrom(namespace)).ifPresent(none -> {
if (none) {
clusteringBuilder.preprocessor(Preprocessors.NONE);
}
});
Optional.ofNullable((Boolean) CLUSTER_PREPROCESSING_CDF.getFrom(namespace)).ifPresent(cdf -> {
if (cdf) {
clusteringBuilder.preprocessor(Preprocessors.CDF);
}
});
Optional.ofNullable((Float) CLUSTER_PREPROCESSING_PERCENTILE.getFrom(namespace)).ifPresent(percentile -> {
clusteringBuilder.preprocessor(Preprocessors.PERCENTILE);
clusteringBuilder.preprocessorPercentile(percentile);
});
Optional.ofNullable((Float) CLUSTER_PREPROCESSING_THRESHOLD.getFrom(namespace)).ifPresent(threshold -> {
clusteringBuilder.preprocessor(Preprocessors.THRESHOLD);
clusteringBuilder.preprocessorPercentile(threshold);
});
options.setClusteringOptions(clusteringBuilder.build());

return options;
}

Expand Down
29 changes: 29 additions & 0 deletions jplag/src/main/java/de/jplag/CliGroupHelper.java
Original file line number Diff line number Diff line change
@@ -0,0 +1,29 @@
package de.jplag;

import java.util.HashMap;
import java.util.Map;

import net.sourceforge.argparse4j.inf.ArgumentContainer;
import net.sourceforge.argparse4j.inf.ArgumentGroup;
import net.sourceforge.argparse4j.inf.ArgumentParser;
import net.sourceforge.argparse4j.inf.MutuallyExclusiveGroup;

public class CliGroupHelper {
SimDing marked this conversation as resolved.
Show resolved Hide resolved

private final ArgumentParser parser;
private Map<String, MutuallyExclusiveGroup> mutuallyExclusiveGroups = new HashMap<>();
private Map<String, ArgumentGroup> argumentGroups = new HashMap<>();

public CliGroupHelper(ArgumentParser parser) {
this.parser = parser;
}

public ArgumentContainer getMutuallyExclusiveGroup(String name) {
return mutuallyExclusiveGroups.computeIfAbsent(name, parser::addMutuallyExclusiveGroup);
}

public ArgumentContainer getArgumentGroup(String name) {
return argumentGroups.computeIfAbsent(name, parser::addArgumentGroup);
}

}
162 changes: 131 additions & 31 deletions jplag/src/main/java/de/jplag/CommandLineArgument.java
Original file line number Diff line number Diff line change
@@ -1,5 +1,7 @@
package de.jplag;

import static de.jplag.CLI.CLUSTERING_GROUP_NAME;
import static de.jplag.CLI.CLUSTERING_PREPROCESSING_GROUP_NAME;
import static de.jplag.options.JPlagOptions.DEFAULT_COMPARISON_MODE;
import static de.jplag.options.JPlagOptions.DEFAULT_SHOWN_COMPARISONS;
import static de.jplag.options.JPlagOptions.DEFAULT_SIMILARITY_THRESHOLD;
Expand All @@ -9,62 +11,103 @@
import java.util.List;
import java.util.Optional;

import net.sourceforge.argparse4j.impl.Arguments;
import net.sourceforge.argparse4j.inf.Argument;
import net.sourceforge.argparse4j.inf.ArgumentAction;
import net.sourceforge.argparse4j.inf.ArgumentContainer;
import net.sourceforge.argparse4j.inf.ArgumentParser;
import net.sourceforge.argparse4j.inf.Namespace;

import de.jplag.clustering.Algorithms;
import de.jplag.clustering.ClusteringOptions;
import de.jplag.clustering.algorithm.AgglomerativeClustering;
import de.jplag.options.LanguageOption;
import de.jplag.options.SimilarityMetric;
import de.jplag.strategy.ComparisonMode;

/**
* Command line arguments for the JPlag CLI. Each argument is defined through an enumeral.
* @author Timur Saglam
*/
public enum CommandLineArgument {
ROOT_DIRECTORY("rootDir", NumberOfArgumentValues.ONE_OR_MORE_VALUES, String.class),
LANGUAGE("-l", String.class, LanguageOption.getDefault().getDisplayName(), LanguageOption.getAllDisplayNames()),

ROOT_DIRECTORY(new Builder("rootDir", String.class).nargs(NumberOfArgumentValues.ONE_OR_MORE_VALUES)),
LANGUAGE(new Builder("-l", String.class).defaultsTo(LanguageOption.getDefault().getDisplayName()).choices(LanguageOption.getAllDisplayNames())),
BASE_CODE("-bc", String.class),
VERBOSITY("-v", String.class, "quiet", List.of("quiet", "long")), // TODO SH: Replace verbosity when integrating a real logging library
VERBOSITY(new Builder("-v", String.class).defaultsTo("quiet").choices(List.of("quiet", "long"))), // TODO SH: Replace verbosity when integrating a
// real logging library
DEBUG("-d", Boolean.class),
SUBDIRECTORY("-S", String.class),
SUFFIXES("-p", String.class),
EXCLUDE_FILE("-x", String.class),
MIN_TOKEN_MATCH("-t", Integer.class),
SIMILARITY_THRESHOLD("-m", Float.class, DEFAULT_SIMILARITY_THRESHOLD),
SHOWN_COMPARISONS("-n", Integer.class, DEFAULT_SHOWN_COMPARISONS),
RESULT_FOLDER("-r", String.class, "result"),
COMPARISON_MODE("-c", String.class, DEFAULT_COMPARISON_MODE.getName(), ComparisonMode.allNames());
SIMILARITY_THRESHOLD(new Builder("-m", Float.class).defaultsTo(DEFAULT_SIMILARITY_THRESHOLD)),
SHOWN_COMPARISONS(new Builder("-n", Integer.class).defaultsTo(DEFAULT_SHOWN_COMPARISONS)),
RESULT_FOLDER(new Builder("-r", String.class).defaultsTo("result")),
COMPARISON_MODE(new Builder("-c", String.class).defaultsTo(DEFAULT_COMPARISON_MODE.getName()).choices(ComparisonMode.allNames())),
CLUSTER_ENABLE(new Builder("--cluster-skip", Boolean.class).argumentGroup(CLUSTERING_GROUP_NAME).action(Arguments.storeTrue())),
CLUSTER_ALGORITHM(
new Builder("--cluster-alg", Algorithms.class).argumentGroup(CLUSTERING_GROUP_NAME)
.defaultsTo(ClusteringOptions.DEFAULTS.getAlgorithm())),
CLUSTER_METRIC(
new Builder("--cluster-metric", SimilarityMetric.class).argumentGroup(CLUSTERING_GROUP_NAME)
.defaultsTo(ClusteringOptions.DEFAULTS.getSimilarityMetric())),
CLUSTER_SPECTRAL_BANDWIDTH(
new Builder("--cluster-spectral-bandwidth", Float.class).argumentGroup(CLUSTERING_GROUP_NAME).metaVar("bandwidth")
.defaultsTo(ClusteringOptions.DEFAULTS.getSpectralKernelBandwidth())),
CLUSTER_SPECTRAL_NOISE(
new Builder("--cluster-spectral-noise", Float.class).argumentGroup(CLUSTERING_GROUP_NAME).metaVar("noise")
.defaultsTo(ClusteringOptions.DEFAULTS.getSpectralGPVariance())),
CLUSTER_SPECTRAL_MIN_RUNS(
new Builder("--cluster-spectral-min-runs", Integer.class).argumentGroup(CLUSTERING_GROUP_NAME).metaVar("min")
.defaultsTo(ClusteringOptions.DEFAULTS.getSpectralMinRuns())),
CLUSTER_SPECTRAL_MAX_RUNS(
new Builder("--cluster-spectral-max-runs", Integer.class).argumentGroup(CLUSTERING_GROUP_NAME).metaVar("max")
.defaultsTo(ClusteringOptions.DEFAULTS.getSpectralMaxRuns())),
CLUSTER_SPECTRAL_KMEANS_ITERATIONS(
new Builder("--cluster-spectral-kmeans-interations", Integer.class).argumentGroup(CLUSTERING_GROUP_NAME).metaVar("iterations")
.defaultsTo(ClusteringOptions.DEFAULTS.getSpectralMaxKMeansIterationPerRun())),
CLUSTER_AGGLOMERATIVE_THRESHOLD(
new Builder("--cluster-agglomerative-threshold", Float.class).argumentGroup(CLUSTERING_GROUP_NAME).metaVar("threshold")
.defaultsTo(ClusteringOptions.DEFAULTS.getAgglomerativeThreshold())),
CLUSTER_AGGLOMERATIVE_INTER_CLUSTER_SIMILARITY(
new Builder("--cluster-agglomerative-inter-cluster-similarity", AgglomerativeClustering.InterClusterSimilarity.class)
.argumentGroup(CLUSTERING_GROUP_NAME).defaultsTo(ClusteringOptions.DEFAULTS.getAgglomerativeInterClusterSimilarity())),
CLUSTER_PREPROCESSING_NONE(
new Builder("--cluster-pp-none", Boolean.class).mutuallyExclusiveGroup(CLUSTERING_PREPROCESSING_GROUP_NAME)
.action(Arguments.storeTrue())),
CLUSTER_PREPROCESSING_CDF(
new Builder("--cluster-pp-cdf", Boolean.class).mutuallyExclusiveGroup(CLUSTERING_PREPROCESSING_GROUP_NAME).action(Arguments.storeTrue())),
CLUSTER_PREPROCESSING_PERCENTILE(
new Builder("--cluster-pp-percentile", Float.class).mutuallyExclusiveGroup(CLUSTERING_PREPROCESSING_GROUP_NAME).metaVar("percentile")),
CLUSTER_PREPROCESSING_THRESHOLD(
new Builder("--cluster-pp-threshold", Float.class).mutuallyExclusiveGroup(CLUSTERING_PREPROCESSING_GROUP_NAME).metaVar("threshold"));

SimDing marked this conversation as resolved.
Show resolved Hide resolved
private final String flag;
private final NumberOfArgumentValues numberOfValues;
private final String description;
private final Optional<Object> defaultValue;
private final Optional<Collection<String>> choices;
private final Optional<String> argumentGroup;
private final Optional<String> mutuallyExclusiveGroup;
private final Optional<ArgumentAction> action;
private final Optional<String> metaVar;
private final Class<?> type;

private CommandLineArgument(String flag, Class<?> type) {
this(flag, NumberOfArgumentValues.SINGLE_VALUE, type, Optional.empty(), Optional.empty());
}

private CommandLineArgument(String flag, NumberOfArgumentValues numberOfValues, Class<?> type) {
this(flag, numberOfValues, type, Optional.empty(), Optional.empty());
}

private CommandLineArgument(String flag, Class<?> type, Object defaultValue) {
this(flag, NumberOfArgumentValues.SINGLE_VALUE, type, Optional.of(defaultValue), Optional.empty());
}

private CommandLineArgument(String flag, Class<?> type, Object defaultValue, Collection<String> choices) {
this(flag, NumberOfArgumentValues.SINGLE_VALUE, type, Optional.of(defaultValue), Optional.of(choices));
this(new Builder(flag, type));
}

private CommandLineArgument(String flag, NumberOfArgumentValues numberOfValues, Class<?> type, Optional<Object> defaultValue,
Optional<Collection<String>> choices) {
this.flag = flag;
this.numberOfValues = numberOfValues;
this.type = type;
this.defaultValue = defaultValue;
this.choices = choices;
private CommandLineArgument(Builder builder) {
this.flag = builder.flag;
this.type = builder.type;
this.defaultValue = builder.defaultValue;
this.choices = builder.choices;
this.argumentGroup = builder.argumentGroup;
this.mutuallyExclusiveGroup = builder.mutuallyExclusiveGroup;
this.action = builder.action;
this.metaVar = builder.metaVar;
this.numberOfValues = builder.nargs.orElse(NumberOfArgumentValues.SINGLE_VALUE);
this.description = retrieveDescriptionFromMessages();
}

Expand All @@ -76,10 +119,10 @@ public String flag() {
}

/**
* @return the flag name of the command line argument without leading dashes.
* @return the flag name of the command line argument without leading dashes and inner dashes replaced with underscores.
*/
public String flagWithoutDash() {
return flag.replace("-", "");
return flag.replaceAll("^-+", "").replaceAll("-", "_");
}

/**
Expand Down Expand Up @@ -111,10 +154,15 @@ public <T> List<T> getListFrom(Namespace namespace) {
* Parses the command line argument with a specific parser.
* @param parser is that parser.
*/
public void parseWith(ArgumentParser parser) {
Argument argument = parser.addArgument(flag).help(description);
public void parseWith(ArgumentParser parser, CliGroupHelper groupHelper) {
ArgumentContainer argContainer = mutuallyExclusiveGroup.map(groupHelper::getMutuallyExclusiveGroup)
.or(() -> argumentGroup.map(groupHelper::getArgumentGroup)).orElse(parser);

Argument argument = argContainer.addArgument(flag).help(description);
choices.ifPresent(it -> argument.choices(it));
defaultValue.ifPresent(it -> argument.setDefault(it));
action.ifPresent(argument::action);
metaVar.ifPresent(argument::metavar);
argument.type(type);
if (type == Boolean.class) {
argument.action(storeTrue());
Expand All @@ -136,4 +184,56 @@ private String retrieveDescriptionFromMessages() {
}
return Messages.getString(getClass().getSimpleName() + "." + builder.toString());
}

private static class Builder {
SimDing marked this conversation as resolved.
Show resolved Hide resolved
private final String flag;
private final Class<?> type;
private Optional<Object> defaultValue = Optional.empty();
private Optional<Collection<String>> choices = Optional.empty();
private Optional<String> argumentGroup = Optional.empty();
private Optional<String> mutuallyExclusiveGroup = Optional.empty();
private Optional<ArgumentAction> action = Optional.empty();
private Optional<String> metaVar = Optional.empty();
private Optional<NumberOfArgumentValues> nargs = Optional.empty();

public Builder(String flag, Class<?> type) {
this.flag = flag;
this.type = type;
}

public Builder defaultsTo(Object defaultValue) {
this.defaultValue = Optional.of(defaultValue);
return this;
}

public Builder choices(Collection<String> choices) {
this.choices = Optional.of(choices);
return this;
}

public Builder argumentGroup(String argumentGroup) {
this.argumentGroup = Optional.of(argumentGroup);
return this;
}

public Builder mutuallyExclusiveGroup(String mutuallyExclusiveGroup) {
this.mutuallyExclusiveGroup = Optional.of(mutuallyExclusiveGroup);
return this;
}

public Builder action(ArgumentAction action) {
this.action = Optional.of(action);
return this;
}

public Builder metaVar(String metaVar) {
this.metaVar = Optional.of(metaVar);
return this;
}

public Builder nargs(NumberOfArgumentValues nargs) {
this.nargs = Optional.of(nargs);
return this;
}
}
}
4 changes: 4 additions & 0 deletions jplag/src/main/java/de/jplag/JPlag.java
Original file line number Diff line number Diff line change
Expand Up @@ -12,6 +12,7 @@
import java.util.Set;
import java.util.stream.Collectors;

import de.jplag.clustering.ClusteringFactory;
import de.jplag.exceptions.ExitException;
import de.jplag.exceptions.SubmissionException;
import de.jplag.options.JPlagOptions;
Expand Down Expand Up @@ -90,6 +91,9 @@ public JPlagResult run() throws ExitException {
// Compare valid submissions.
JPlagResult result = comparisonStrategy.compareSubmissions(submissionSet);
errorCollector.print("\nTotal time for comparing submissions: " + TimeUtil.formatDuration(result.getDuration()), null);

result.setClusteringResult(ClusteringFactory.getClusterings(result.getComparisons(), options.getClusteringOptions()));

return result;
}

Expand Down
Loading