Client library for AWS Data Pipeline
npm install --save @datafire/amazonaws_datapipeline
let amazonaws_datapipeline = require('@datafire/amazonaws_datapipeline').create({
accessKeyId: "",
secretAccessKey: "",
region: ""
});
.then(data => {
console.log(data);
});
AWS Data Pipeline configures and manages a data-driven workflow called a pipeline. AWS Data Pipeline handles the details of scheduling and ensuring that data dependencies are met so that your application can focus on processing the data.
AWS Data Pipeline provides a JAR implementation of a task runner called AWS Data Pipeline Task Runner. AWS Data Pipeline Task Runner provides logic for common data management scenarios, such as performing database queries and running data analysis using Amazon Elastic MapReduce (Amazon EMR). You can use AWS Data Pipeline Task Runner as your task runner, or you can write your own task runner to provide custom data management.
AWS Data Pipeline implements two main sets of functionality. Use the first set to create a pipeline and define data sources, schedules, dependencies, and the transforms to be performed on the data. Use the second set in your task runner application to receive the next task ready for processing. The logic for performing the task, such as querying the data, running data analysis, or converting the data from one format to another, is contained within the task runner. The task runner performs the task assigned to it by the web service, reporting progress to the web service as it does so. When the task is done, the task runner reports the final success or failure of the task to the web service.
amazonaws_datapipeline.ActivatePipeline({
"pipelineId": null
}, context)
- input
object
- parameterValues
- items ParameterValue
- pipelineId required
- startTimestamp
- parameterValues
- output ActivatePipelineOutput
amazonaws_datapipeline.AddTags({
"pipelineId": null,
"tags": null
}, context)
- input
object
- tags required
- items Tag
- pipelineId required
- tags required
- output AddTagsOutput
amazonaws_datapipeline.CreatePipeline({
"name": null,
"uniqueId": null
}, context)
- input
object
- tags
- items Tag
- description
- name required
- uniqueId required
- tags
- output CreatePipelineOutput
amazonaws_datapipeline.DeactivatePipeline({
"pipelineId": null
}, context)
- input
object
- cancelActive
- pipelineId required
- output DeactivatePipelineOutput
amazonaws_datapipeline.DeletePipeline({
"pipelineId": null
}, context)
- input
object
- pipelineId required
Output schema unknown
amazonaws_datapipeline.DescribeObjects({
"pipelineId": null,
"objectIds": null
}, context)
- input
object
- marker
string
- evaluateExpressions
- marker
- objectIds required
- items id
- pipelineId required
- marker
- output DescribeObjectsOutput
amazonaws_datapipeline.DescribePipelines({
"pipelineIds": null
}, context)
- input
object
- pipelineIds required
- items id
- pipelineIds required
- output DescribePipelinesOutput
amazonaws_datapipeline.EvaluateExpression({
"pipelineId": null,
"objectId": null,
"expression": null
}, context)
- input
object
- expression required
- objectId required
- pipelineId required
- output EvaluateExpressionOutput
amazonaws_datapipeline.GetPipelineDefinition({
"pipelineId": null
}, context)
- input
object
- pipelineId required
- version
- output GetPipelineDefinitionOutput
amazonaws_datapipeline.ListPipelines({}, context)
- input
object
- marker
string
- marker
- marker
- output ListPipelinesOutput
amazonaws_datapipeline.PollForTask({
"workerGroup": null
}, context)
- input
object
- hostname
- instanceIdentity
- document
- signature
- workerGroup required
- output PollForTaskOutput
amazonaws_datapipeline.PutPipelineDefinition({
"pipelineId": null,
"pipelineObjects": null
}, context)
- input
object
- parameterObjects
- items ParameterObject
- parameterValues
- items ParameterValue
- pipelineId required
- pipelineObjects required
- items PipelineObject
- parameterObjects
- output PutPipelineDefinitionOutput
amazonaws_datapipeline.QueryObjects({
"pipelineId": null,
"sphere": null
}, context)
- input
object
- limit
string
- marker
string
- limit
- marker
- pipelineId required
- query
- selectors
- items Selector
- selectors
- sphere required
- limit
- output QueryObjectsOutput
amazonaws_datapipeline.RemoveTags({
"pipelineId": null,
"tagKeys": null
}, context)
- input
object
- pipelineId required
- tagKeys required
- items string
- output RemoveTagsOutput
amazonaws_datapipeline.ReportTaskProgress({
"taskId": null
}, context)
- input
object
- fields
- items Field
- taskId required
- fields
- output ReportTaskProgressOutput
amazonaws_datapipeline.ReportTaskRunnerHeartbeat({
"taskrunnerId": null
}, context)
- input
object
- hostname
- taskrunnerId required
- workerGroup
amazonaws_datapipeline.SetStatus({
"pipelineId": null,
"objectIds": null,
"status": null
}, context)
- input
object
- objectIds required
- items id
- pipelineId required
- status required
- objectIds required
Output schema unknown
amazonaws_datapipeline.SetTaskStatus({
"taskId": null,
"taskStatus": null
}, context)
- input
object
- errorId
- errorMessage
- errorStackTrace
- taskId required
- taskStatus required
- output SetTaskStatusOutput
amazonaws_datapipeline.ValidatePipelineDefinition({
"pipelineId": null,
"pipelineObjects": null
}, context)
- input
object
- parameterObjects
- items ParameterObject
- parameterValues
- items ParameterValue
- pipelineId required
- pipelineObjects required
- items PipelineObject
- parameterObjects
- ActivatePipelineInput
object
: Contains the parameters for ActivatePipeline.- parameterValues
- items ParameterValue
- pipelineId required
- startTimestamp
- parameterValues
- ActivatePipelineOutput
object
: Contains the output of ActivatePipeline.
- AddTagsInput
object
: Contains the parameters for AddTags.- tags required
- items Tag
- pipelineId required
- tags required
- AddTagsOutput
object
: Contains the output of AddTags.
- CreatePipelineInput
object
: Contains the parameters for CreatePipeline.- tags
- items Tag
- description
- name required
- uniqueId required
- tags
- CreatePipelineOutput
object
: Contains the output of CreatePipeline.- pipelineId required
- DeactivatePipelineInput
object
: Contains the parameters for DeactivatePipeline.- cancelActive
- pipelineId required
- DeactivatePipelineOutput
object
: Contains the output of DeactivatePipeline.
- DeletePipelineInput
object
: Contains the parameters for DeletePipeline.- pipelineId required
- DescribeObjectsInput
object
: Contains the parameters for DescribeObjects.- evaluateExpressions
- marker
- objectIds required
- items id
- pipelineId required
- DescribeObjectsOutput
object
: Contains the output of DescribeObjects.- hasMoreResults
- marker
- pipelineObjects required
- items PipelineObject
- DescribePipelinesInput
object
: Contains the parameters for DescribePipelines.- pipelineIds required
- items id
- pipelineIds required
- DescribePipelinesOutput
object
: Contains the output of DescribePipelines.- pipelineDescriptionList required
- items PipelineDescription
- pipelineDescriptionList required
- EvaluateExpressionInput
object
: Contains the parameters for EvaluateExpression.- expression required
- objectId required
- pipelineId required
- EvaluateExpressionOutput
object
: Contains the output of EvaluateExpression.- evaluatedExpression required
- Field
object
: A key-value pair that describes a property of a pipeline object. The value is specified as either a string value (StringValue
) or a reference to another object (RefValue
) but not as both.- key required
- refValue
- stringValue
- GetPipelineDefinitionInput
object
: Contains the parameters for GetPipelineDefinition.- pipelineId required
- version
- GetPipelineDefinitionOutput
object
: Contains the output of GetPipelineDefinition.- parameterObjects
- items ParameterObject
- parameterValues
- items ParameterValue
- pipelineObjects
- items PipelineObject
- parameterObjects
- InstanceIdentity
object
:Identity information for the EC2 instance that is hosting the task runner. You can get this value by calling a metadata URI from the EC2 instance. For more information, see Instance Metadata in the Amazon Elastic Compute Cloud User Guide. Passing in this value proves that your task runner is running on an EC2 instance, and ensures the proper AWS Data Pipeline service charges are applied to your pipeline.
- document
- signature
- InternalServiceError
object
: An internal service error occurred.- message
- InvalidRequestException
object
: The request was not valid. Verify that your request was properly formatted, that the signature was generated with the correct credentials, and that you haven't exceeded any of the service limits for your account.- message
- ListPipelinesInput
object
: Contains the parameters for ListPipelines.- marker
- ListPipelinesOutput
object
: Contains the output of ListPipelines.- hasMoreResults
- marker
- pipelineIdList required
- items PipelineIdName
- Operator
object
: Contains a logical operation for comparing the value of a field with a specified value.- type
- values
- items string
- OperatorType
string
(values: EQ, REF_EQ, LE, GE, BETWEEN)
- ParameterAttribute
object
: The attributes allowed or specified with a parameter object.- key required
- stringValue required
- ParameterAttributeList
array
- items ParameterAttribute
- ParameterObject
object
: Contains information about a parameter object.- attributes required
- items ParameterAttribute
- id required
- attributes required
- ParameterObjectList
array
- items ParameterObject
- ParameterValue
object
: A value or list of parameter values.- id required
- stringValue required
- ParameterValueList
array
- items ParameterValue
- PipelineDeletedException
object
: The specified pipeline has been deleted.- message
- PipelineDescription
object
: Contains pipeline metadata.
- PipelineDescriptionList
array
- items PipelineDescription
- PipelineIdName
object
: Contains the name and identifier of a pipeline.- id
- name
- PipelineNotFoundException
object
: The specified pipeline was not found. Verify that you used the correct user and account identifiers.- message
- PipelineObject
object
: Contains information about a pipeline object. This can be a logical, physical, or physical attempt pipeline object. The complete set of components of a pipeline defines the pipeline.- fields required
- items Field
- id required
- name required
- fields required
- PipelineObjectList
array
- items PipelineObject
- PipelineObjectMap
object
- PollForTaskInput
object
: Contains the parameters for PollForTask.- hostname
- instanceIdentity
- document
- signature
- workerGroup required
- PollForTaskOutput
object
: Contains the output of PollForTask.- taskObject
- attemptId
- objects
- pipelineId
- taskId
- taskObject
- PutPipelineDefinitionInput
object
: Contains the parameters for PutPipelineDefinition.- parameterObjects
- items ParameterObject
- parameterValues
- items ParameterValue
- pipelineId required
- pipelineObjects required
- items PipelineObject
- parameterObjects
- PutPipelineDefinitionOutput
object
: Contains the output of PutPipelineDefinition.- errored required
- validationErrors
- items ValidationError
- validationWarnings
- items ValidationWarning
- Query
object
: Defines the query to run against an object.- selectors
- items Selector
- selectors
- QueryObjectsInput
object
: Contains the parameters for QueryObjects.- limit
- marker
- pipelineId required
- query
- selectors
- items Selector
- selectors
- sphere required
- QueryObjectsOutput
object
: Contains the output of QueryObjects.- hasMoreResults
- ids
- items id
- marker
- RemoveTagsInput
object
: Contains the parameters for RemoveTags.- pipelineId required
- tagKeys required
- items string
- RemoveTagsOutput
object
: Contains the output of RemoveTags.
- ReportTaskProgressInput
object
: Contains the parameters for ReportTaskProgress.- fields
- items Field
- taskId required
- fields
- ReportTaskProgressOutput
object
: Contains the output of ReportTaskProgress.- canceled required
- ReportTaskRunnerHeartbeatInput
object
: Contains the parameters for ReportTaskRunnerHeartbeat.- hostname
- taskrunnerId required
- workerGroup
- ReportTaskRunnerHeartbeatOutput
object
: Contains the output of ReportTaskRunnerHeartbeat.- terminate required
- Selector
object
: A comparision that is used to determine whether a query should return this object.- fieldName
- operator Operator
- SelectorList
array
: The list of Selectors that define queries on individual fields.- items Selector
- SetStatusInput
object
: Contains the parameters for SetStatus.- objectIds required
- items id
- pipelineId required
- status required
- objectIds required
- SetTaskStatusInput
object
: Contains the parameters for SetTaskStatus.- errorId
- errorMessage
- errorStackTrace
- taskId required
- taskStatus required
- SetTaskStatusOutput
object
: Contains the output of SetTaskStatus.
- Tag
object
: Tags are key/value pairs defined by a user and associated with a pipeline to control access. AWS Data Pipeline allows you to associate ten tags per pipeline. For more information, see Controlling User Access to Pipelines in the AWS Data Pipeline Developer Guide.- key required
- value required
- TaskNotFoundException
object
: The specified task was not found.- message
- TaskObject
object
: Contains information about a pipeline task that is assigned to a task runner.- attemptId
- objects
- pipelineId
- taskId
- TaskStatus
string
(values: FINISHED, FAILED, FALSE)
- ValidatePipelineDefinitionInput
object
: Contains the parameters for ValidatePipelineDefinition.- parameterObjects
- items ParameterObject
- parameterValues
- items ParameterValue
- pipelineId required
- pipelineObjects required
- items PipelineObject
- parameterObjects
- ValidatePipelineDefinitionOutput
object
: Contains the output of ValidatePipelineDefinition.- errored required
- validationErrors
- items ValidationError
- validationWarnings
- items ValidationWarning
- ValidationError
object
: Defines a validation error. Validation errors prevent pipeline activation. The set of validation errors that can be returned are defined by AWS Data Pipeline.- errors
- items validationMessage
- id
- errors
- ValidationErrors
array
- items ValidationError
- ValidationWarning
object
: Defines a validation warning. Validation warnings do not prevent pipeline activation. The set of validation warnings that can be returned are defined by AWS Data Pipeline.- id
- warnings
- items validationMessage
- ValidationWarnings
array
- items ValidationWarning
- attributeNameString
string
- attributeValueString
string
- boolean
boolean
- cancelActive
boolean
- errorMessage
string
- fieldList
array
- items Field
- fieldNameString
string
- fieldStringValue
string
- id
string
- idList
array
- items id
- int
integer
- longString
string
- pipelineList
array
- items PipelineIdName
- string
string
- stringList
array
- items string
- tagKey
string
- tagList
array
- items Tag
- tagValue
string
- taskId
string
- timestamp
string
- validationMessage
string
- validationMessages
array
- items validationMessage