Piper is a customizable data pipeline processing library written in Go.
Piper abstracts the components of a data pipeline into chained processes, which run concurrent batch jobs to execute user-supplied callback functions. Each successful job within a batch moves through the pipeline to the next process until completion. Each failed job within a batch gets retried up to the maximum number of retry attempts. Failed jobs that have exceed the maximum number of retry attempts are passed to a user-supplied callback function, which could be sent to a dead-letter queue for instance.
Install the piper library with the following command:
go install github.com/jasonyunicorn/piper
Then import the piper package in your application as such:
import "github.com/jasonyunicorn/piper"
To create a batch process, do the following:
- define a data structure (containing the fields pertinent to the data pipeline) which implements the
piper.DataIF
interface - define a batch function which implements the
piper.BatchExecutable
interface - use the
piper.NewProcess
API to create a new batch process - initialize the process using the
process.Start
function - enqueue data for processing by using the
process.ProcessData
function
To create a pipeline (two or more processes chained together), do the following:
- follow steps 1-3 above to define a new process for each process in the pipeline
- use the
piper.NewPipeline
API to create a new pipeline - initialize the process using the
process.Start()
function - enqueue data for processing by using the
process.ProcessData
function
There are no dependencies required to run any of the unit tests.
To run the tests from the command line:
go test -v -cover -race github.com/jyu617/piper
Pull requests for bug fixes, new features or performance enhancements are welcome!
This project follows the semantic versioning guidelines for versioning. For the versions available, see the tags on this repository.
- Jason Yu
See also the list of contributors who participated in this project.
This project is licensed under the MIT License - see the LICENSE.md file for details