From 7f3b74c052f0473dab30677214e187d676876237 Mon Sep 17 00:00:00 2001
From: Frances Perry
Date: Sat, 13 Dec 2014 10:56:14 -0800
Subject: [PATCH] Welcome to Dataflow!

---
 README | 19 +++++++++++++++++++
 1 file changed, 19 insertions(+)
 create mode 100644 README

diff --git a/README b/README
new file mode 100644
index 00000000000..2da8ea5ba93
--- /dev/null
+++ b/README
@@ -0,0 +1,19 @@
+Greetings! Welcome to the (Alpha) Google Cloud Dataflow Java SDK.
+
+Dataflow provides a simple, powerful model for building both batch and
+streaming parallel data processing Pipelines.
+
+To use the Google Cloud Dataflow SDK, you build a Pipeline which manages a
+graph of PTransforms and PCollections that the PTransforms consume and produce.
+
+You then use a PipelineRunner to specify where and how the pipeline should
+execute. Currently there are two runners:
+  1. The DirectPipelineRunner runs the pipeline on your local machine.
+  2. The [Blocking]DataflowPipelineRunner runs the pipeline on the Dataflow
+     Service using the Google Cloud Platform. The Dataflow Service is
+     currently in the Alpha phase of development and access is limited to
+     whitelisted users.
+
+For more about both the Dataflow SDK and the Dataflow Service, visit:
+  http://cloud.google.com/dataflow
+