Skip to content

Iteration 3 Visualization Requirements

smoniker edited this page Apr 19, 2013 · 12 revisions

Iteration 3 - Exploring Reporting

Purpose / Preamble

This document captures a set of high level requirements for visualization / reporting from the Hub. We will use these requirements as we explore available modules and tools that could be used to enhance the Hub component. Enhancement is not in scope for this iteration, but the environmental scan is in scope, thus the need for some initial working requirements.

Types of Visualizations

There are many options for visualizations and ways to categorize the approaches. For SCOOP, it will be important to provide a core set of visualizations that are easy to use, as not everyone will be a researcher who uses the system. For more specific and complex visualizations, an export tool should be available. Instead, SCOOP will focus on doing a core set of visualizations that will support the majority of users an an intuitive manner.

For SCOOP we will consider the following visualizations:

  • Trending over time of single and multiple variables
  • Scatter plot with trend lines of two items (e.g. # of diabetics in a practice and average A1C per practice)
  • Scatter plots with 3rd variable represented by size / colour / shape of indicator.
  • Comparison of different categories -- bar charts
  • Visualization of data quality in an answer
  • Venn Diagrams?

Visualization Features

Feature: xx-xx TITLE In order to visualize relative quantities The question manager should be able to Plot multiple results of a question set as a bar graph.

Feature: xx-xx TITLE In order to visualize temporal trends in data The question manager should be able to plot results of a question over time, either from a single query set or from repetitions of a query across the network.

Export Features

Feature: xx-xx TITLE In order to share findings in a report or publication The question manager should be able to Export a visualization as a PDF

Out of Scope Visualization

For the current focus, we will assume that qualitative data visualizations are out of scope. Flowcharts, gantt charts, and other workflow / dependency charts are not in scope. Also, animated trends over time, while they can be powerful, are not needed at this time for SCOOP.

Data fundamentals for constructing visualizations

The following topics will be addressed in the sections that follow:

  • Common relationships in quantitative data
  • Visual options for encoding quantitative data
  • Best and worst practices for representing select data relationships
  • An overall process for constructing (or not constructing) appropriate data visualizations

Following the discussion of visualization strategies and tactics, an overview of open source visualization technologies will be provided, complete with a sampling of representative visualizations.

Common relationships in quantitative data

Quantitative data consists of numbers. When visualized, quantitative data is usually depicted relative to some other category of interest. The selection of those categorical descriptions is what helps to transform quantitative data into quantitative information. These categorical descriptions, or scales, come in three flavors:

  • Nominal - Categories of data that aren't really related, but instead just described by name (e.g. Doctor A, Doctor B, Doctor C)
  • Ordinal - Categories of data that are related in that they follow an implied order, although they aren't really quantified (e.g. underweight, normal weight, overweight)
  • Interval - Categories of data are related, ordered and represent quantitative values themselves (e.g. BMI < 15, BMI >= 15 and <25, BMI >=25)

To guide selection of categorical scale, here are seven common relationships one might try to depict with a quantitative data set:

1. Nominal Comparison

Here, you just want to place categorical data measures side by side to facilitate comparison, with no desire to encourage ranking or identification of larger data patterns.

  • Number of patients per doctor
  • Number of tests per patient

2. Ranking

Here, you want to facilitate more than just a comparison of nominal categories. You want to draw attention to the relationship that exists between a category and its quantitative value. Nominal just got ordinal, yo!

  • Number of patients per doctor, ordered by doctor with most to least patients
  • Number of telephone consultations by clinic, ordered by clinic with most to least consultations

3. Time Series

Here, you want to depict the relationship of quantitative measures as they were captured in equal and successive intervals of time.

  • Billing revenues per month
  • Average number of minutes between check-in and examination per hour of operation

4. Parts-to-Whole

Here, a set of nominal, ordinal or interval categories are depicted so that it becomes possible to see what proportion of the total quantity is represented by a given category.

  • Percentage of clinic revenue earned by clinic physicians
  • Percentage of clinic patients according to age bracket

5. Deviation

6. Distribution

7. Correlation

When visualizing data, it is helpful to understand what type of categorical descriptions will be used to illustrate patterns or relationships in the underlying dataset. Choice of categorical scale will dictate to large degree what visualization tactic should be used to encode the relationship you are trying to depict.

Current Iteration: 13

General Topics

Resources


Previous Iteration: 12

Previous Iteration: 11

Previous Iteration: 10

Previous Iteration: 9

Previous Iteration: 8

Previous Iteration: 7

Previous Iteration: 6

Previous Iteration: 5

Previous Iteration: 4

Previous Iteration: 3

Previous Iteration: 2

Previous Iteration: 1

Previous Iteration: 0

Clone this wiki locally