benchmarkVis - Benchmark Visualizations in R

benchmarkVis is an R package for visualizing benchmark results in different ways. It works with standard csv, json and rds files and can be combined with several R benchmark packages such as microbenchmark, rbenchmark or mlr through integrated wrappers. Thanks to the universal input table structure, it is also possible to integrate results from batchtools or from frameworks outside the R language, such as Python's scikit-learn.

Getting Started

Install the development version

devtools::install_github("collinleiber/benchmarkVis")

Take a look at the Wiki for a full tutorial.

Description

Benchmarking is a good way to compare the performance of different algorithms. The results are often evaluated with the same procedures. But even though different benchmarks normally contain similar information, their structure can differ significantly. This increases the effort needed to visualize and analyze them: you have to repeat the same steps over and over again with only small changes. This is where the benchmarkVis package comes into play. It aims to convert various formats into a default data table which can be visualized in multiple ways.

Compatible data table

problem	problem.parameter	algorithm	algorithm.parameter	replication	replication.parameter	measure.*	list.*
character	list	character	list	character	list	numeric	numeric vector
mandatory	optional	mandatory	optional	optional	optional	optional	optional

As you can see, each column has a fixed name and data type. Some columns are mandatory, others are optional. The table can contain any number of measures and lists, but it must contain at least one measure or list column, and the names of these columns must start with "measure." or "list." respectively.
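As a rough illustration of this structure, the following base R sketch builds a small table by hand. The problem and algorithm names and all values are made up. A plain data.frame is used for simplicity, and the final check is a simplified stand-in for the rules above, not the package's own validation.

```r
# Hypothetical example data: two algorithms evaluated on one problem.
results = data.frame(
  problem = c("iris", "iris"),
  algorithm = c("rpart", "randomForest"),
  replication = c("repetition", "repetition"),
  measure.mmce = c(0.07, 0.05),
  stringsAsFactors = FALSE
)

# Parameter columns hold one named list per row.
results$algorithm.parameter = list(list(maxdepth = 3), list(ntree = 500))
results$replication.parameter = list(list(iters = 3), list(iters = 3))

# List columns hold one numeric vector per row (here: one value per repetition).
results$list.mmce = list(c(0.08, 0.06, 0.07), c(0.05, 0.04, 0.06))

# Simplified structure check (an assumption, not the package's validator):
# mandatory columns are present and at least one measure.* or list.* column exists.
has.mandatory = all(c("problem", "algorithm") %in% names(results))
has.result = any(grepl("^(measure|list)\\.", names(results)))
stopifnot(has.mandatory, has.result)
```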

Table components

problem: The problem that should be solved by an algorithm (e.g. dataset or machine learning task)
algorithm: The procedure to solve the problem with
replication: If you want to try an approach more than once, you can specify the replication strategy (e.g. repetition or resampling)
*.parameter: Specifies numerical or categorical parameters concerning the corresponding column (e.g. problem properties like data size, algorithm parameters or replication parameters like number of repetitions)
measure.*: The measure to evaluate the result of an algorithm with (e.g. execution time or misclassification error)
list.*: Same as measure columns but contain a vector of results (e.g. results for every single replication)

To get the components of your input data table, you can use the following functions:

getMeasures(data.table)
getLists(data.table)
getMainColumns(data.table)
getParameterColumns(data.table)
getParameters(data.table, parameter.column)

The main columns always consist of problem and algorithm and can also contain replication.
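The grouping of columns can be pictured with a few regular expressions over the column names. The helper below, classifyColumns, is hypothetical and only mirrors the naming rules described above; it is not how getMeasures, getLists or getMainColumns are implemented.

```r
# Hypothetical helper: group column names by the naming rules of the data table.
classifyColumns = function(column.names) {
  list(
    # Main columns keep the order problem, algorithm, replication.
    main = intersect(c("problem", "algorithm", "replication"), column.names),
    parameters = grep("\\.parameter$", column.names, value = TRUE),
    measures = grep("^measure\\.", column.names, value = TRUE),
    lists = grep("^list\\.", column.names, value = TRUE)
  )
}

groups = classifyColumns(c("problem", "algorithm", "algorithm.parameter",
  "measure.mean", "measure.median", "list.values"))
print(groups$measures)
```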

Algorithm tuning

A special case occurs if you tune your algorithm by changing its parameters over multiple iterations. In this case you need to add the numeric field iteration to the algorithm.parameter list of the corresponding algorithm. It is important that no iteration value occurs multiple times for the same combination of problem, algorithm and replication.

To see all tuning combinations in your data table just execute:

getTunings(data.table)
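The uniqueness rule for iteration values can be checked by hand before handing a table to the package. The following base R sketch uses made-up tuning results; the check itself is an illustration of the rule stated above, not the package's internal logic.

```r
# Hypothetical tuning results: one row per tuning iteration.
tuning = data.frame(
  problem = rep("iris", 3),
  algorithm = rep("rpart", 3),
  replication = rep("holdout", 3),
  measure.mmce = c(0.10, 0.08, 0.07),
  stringsAsFactors = FALSE
)
tuning$algorithm.parameter = list(
  list(maxdepth = 2, iteration = 1),
  list(maxdepth = 3, iteration = 2),
  list(maxdepth = 4, iteration = 3)
)

# Pull the iteration number out of each parameter list.
iteration = vapply(tuning$algorithm.parameter,
  function(p) as.numeric(p$iteration), numeric(1))

# Build one key per row; a duplicated key means an iteration value is repeated
# within the same problem / algorithm / replication combination.
key = paste(tuning$problem, tuning$algorithm, tuning$replication, iteration, sep = "|")
iterations.unique = !any(duplicated(key))
stopifnot(iterations.unique)
```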

Quick Start

In this example we will use one of the provided wrappers (in this case the wrapper for microbenchmark) to create the input data, and then create a bar plot and a list line plot.

Create input data:

library(benchmarkVis)
library(microbenchmark)

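# 100 random numbers as input for both expressions
x = runif(100)
# Time two equivalent ways of computing the square root
benchmark = microbenchmark(sqrt(x), x ^ 0.5)

# Convert the microbenchmark result into a benchmarkVis data table
table = useMicrobenchmarkWrapper(benchmark)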

List all visualizations that can be used with the input data:

getValidPlots(table)

Create Plots:

createBarPlot(table, "measure.mean")
createListLinePlot(table, "list.values", "mean", TRUE)

Shiny Application

The package functionality is also available through a Shiny app, which you can start with:

runShinyApp()

Next steps

For more complex examples take a look at the Example Use Cases.

If you want to use your own data, you can import csv, json and rds files:

CSV:

table = csvImport("PATH.TO.CSV.FILE")

JSON:

table = jsonImport("PATH.TO.JSON.FILE")

RDS:

table = rdsImport("PATH.TO.RDS.FILE")
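If your results already live in an R object, one plausible route is to save them with base R's saveRDS and import the resulting file. The sketch below only covers the base R part with made-up values. That rdsImport accepts a data frame written this way is an assumption, so check the Wiki tutorial for the expected file contents.

```r
# Hypothetical results table following the structure described above.
results = data.frame(
  problem = "sort 1e4 numbers",
  algorithm = c("quick", "radix"),
  measure.time = c(0.012, 0.004),
  stringsAsFactors = FALSE
)
results$list.time = list(c(0.011, 0.013), c(0.004, 0.004))

# Write the table to a temporary rds file and read it back to confirm
# that the list column survives the round trip.
path = tempfile(fileext = ".rds")
saveRDS(results, path)
restored = readRDS(path)
stopifnot(isTRUE(all.equal(results, restored)))

# Assumed usage with the package (not run here):
# table = rdsImport(path)
```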

Check out our tutorial in the Wiki for detailed information.
