The CDR Test Data CLI tool has been created by the Data Standards Body (DSB) to provide diverse test datasets to the Consumer Data Right (CDR) community to assist with the development and delivery of various CDR solutions.
This versatile tool showcases its ability to create synthetic datasets that can be tailored to specific requirements using configurable option files. This enables developers to use these tailored datasets for testing, experimentation, or development purposes within their CDR projects.
The goal of the Test Data CLI is to provide support for CDR participants seeking to test their implementations to ensure confidence that they have covered a wide range of possible scenarios likely to be encountered in production scenarios.
The Test Data CLI can be used to generate manufactured test data for the Consumer Data Standards (CDS). This CLI allows the configuration, through an option file, of a variety of data factories
that can be used to generate test files with different scenarios.
The DSB offers the Test Data CLI as an npm package available in the NPM registry. You can easily install this package in your project using npm. For more information, refer to the Quick Start section below.
You can also set up a local instance of the Test Data CLI tool for customised and extended use cases. For more information, refer to the Local Setup and Customisation section below.
For detailed examples and practical applications, please refer to the Example Use Cases section below.
To install the Test Data CLI tool globally, ensure npm is installed on your system and run the following command:
npm install @cds-au/testdata -g
This makes the CLI available globally on your system.
To generate test data, you will need an options file that specifies the configuration for data generation:
testdata generate <options-file> <destination-file>
Argument | Required | Description |
---|---|---|
options-file | mandatory | The options file indicating the factories to execute, in what order and with what options specified. |
destination-file | optional | The destination file where the generated JSON test data will be saved. The contents will always be JSON consistent with the test data schema. |
To learn more about the Data Generator and how it works, refer to the Data Generation Process section below.
- Schema Display: Run**
testdata schema
** to view the current JSON schema used for test data. Usevonly
option to view the version of the current JSON schema. - List Factories: Run
testdata factories
to view the list of all the currently implemented factories, along with a short description of each factory describing its purpose. - Factory Details: Run
testdata factory <factory-id>
to view detailed information about a specific factory, including the purpose of the factory, the data that it generates or modifies and the options that it consumes.
Argument | Required | Description |
---|---|---|
factory-id | mandatory | The ID of the factory that documentation is requested for |
For example, **`testdata factory create-customers`** will return:
```bash
Supported capabilities include:
create a customer
create a set of customers
...
```
In this section, you will find practical use cases for the Test Data CLI, each designed to show how the tool can be used to generate synthetic data for a variety of scenarios. To facilitate ease of use and understanding, each use case is accompanied by a specific command. These commands use specific sample options files from the repository, located in the samples/options
directory, which define the necessary parameters for generating targeted datasets. You can run these commands to generate a JSON file with the test data output and use them for development or testing of your CDR application.
-
Use Case 1 (UC1): Generate CDS-compliant test energy plan data to simulate realistic API responses in your mock Data Holder’s Product Reference Data API.
testdata generate ./samples/options/uc1.json ./samples/output/u1-output.json
-
Use Case 2 (UC2): Generate a mix of residential and business customer profiles to enhance testing of varied customer interactions.
testdata generate ./samples/options/uc2.json ./samples/output/u2-output.json
-
Use Case 3 (UC3): Create a dataset for a single residential customer with an energy account to test personalised service scenarios.
testdata generate ./samples/options/uc3.json ./output/samples/u3-output.json
-
Use Case 4 (UC4): Augment an existing data holder structure by adding additional customer profiles to enhance your existing dataset.
testdata generate ./samples/options/uc4.json ./samples/output/u4-output.json
-
Use Case 5 (UC5): Augment an existing data file by adding additional holders without altering existing data from the input file.
testdata generate ./samples/options/uc5.json ./samples/output/u5-output.json
-
Use Case 6 (UC6): Create a dataset that includes a mix of valid and intentionally invalid customer data to test system robustness.
testdata generate ./samples/options/uc6.json ./samples/output/u6-output.json
The data generator utilises one or more data factories and creates data by executing the individual factories. The options file drives the behaviour of the data generation.
Data factories are modular components within the CLI that generate specific data structures:
- Valid Data Factories: Located in
src/factories
, these are grouped by sector (banking, energy, etc.) and generate synthetic data compliant with CDR standards. - Invalid Data Factories: Found in
src/factories/invalid-factories
, these are used for generating synthetic data that deliberately includes errors for testing error handling.
The options file is crucial for directing the data generation process. It details which factories to execute and the order and parameters for execution, adhering to a schema defined in src/logic/options.ts
.
Before you begin, ensure you have the following installed:
- Git, for cloning the repository.
- Node.js. (Note: This demo project was tested with NodeJS v18.12.1.)
- npm (Node Package Manager) - included with Node.js installation.
-
Create a fork of this repository. To do this, click the "Fork" button on the top right corner of this page.
-
After forking the repository, clone it to your local machine. You can do this by running the following command in your terminal or command prompt:
git clone https://github.com/your-username/project-name.git
Replace
your-username
with your GitHub username andproject-name
with the name of your repository. -
Once the repository is cloned, navigate to the project directory by running:
cd project-name
Replace
project-name
with the name of the repository.
To build the repository and use the library without installing it globally:
- Customise the project as needed for your specific use case.
- Install libraries
npm install
- Build
npm run build
- Use npm link
npm link
To test your changes:
- Run
npm run test
We welcome contributions from the community! If you'd like to contribute to this project, please follow these simple steps:
-
Create a new branch for your work from the
master
branch:git checkout -b feature/your-feature-name
-
Begin making your changes or contributions.
-
Follow the instructions in the project repository to run and test your changes locally.
-
Commit your changes with clear and concise commit messages.
-
Push your changes to your forked repository.
-
Open a pull request (PR) using the
master
branch in the original repository as the destination branch. Include a detailed description of your changes and the problem you are addressing. -
Engage in the discussion on your PR and make any necessary adjustments based on feedback from maintainers and other contributors.
-
Once your PR is approved and all tests pass, it will be merged into the project.
Note: Please ensure your contributions align with our project's objectives and guidelines.
Encountered an issue? We're here to help. Please visit our issue reporting guidelines for submitting an issue.
Join our newsletter to receive the latest updates, release notes, and alerts. Subscribe here.
The artefact is released under the MIT License, which allows the community to use and modify it freely.
The artefacts in this repository are offered without warranty or liability, in accordance with the MIT licence.
The Data Standards Body (DSB) develops these artefacts in the course of its work, in order to perform quality assurance on the Australian Consumer Data Right Standards (Data Standards).
The DSB makes this repository, and its artefacts, public on a non-commercial basis in the interest of supporting the participants in the CDR ecosystem.
The resources of the DSB are primarily directed towards assisting the Data Standards Chair for developing the Data Standards.
Consequently, the development work provided on the artefacts in this repository is on a best-effort basis, and the DSB acknowledges the use of these tools alone is not sufficient for, nor should they be relied upon with respect to accreditation, conformance, or compliance purposes.