@datafire/amazonaws_datapipeline

5.0.0 • Public • Published

Client library for AWS Data Pipeline

Installation and Usage

npm install --save @datafire/amazonaws_datapipeline

let amazonaws_datapipeline = require('@datafire/amazonaws_datapipeline').create({
  accessKeyId: "",
  secretAccessKey: "",
  region: ""
});

amazonaws_datapipeline.ActivatePipeline({
  "pipelineId": ""
}).then(data => {
  console.log(data);
});

Description

AWS Data Pipeline configures and manages a data-driven workflow called a pipeline. AWS Data Pipeline handles the details of scheduling and ensuring that data dependencies are met so that your application can focus on processing the data.

AWS Data Pipeline provides a JAR implementation of a task runner called AWS Data Pipeline Task Runner. AWS Data Pipeline Task Runner provides logic for common data management scenarios, such as performing database queries and running data analysis using Amazon Elastic MapReduce (Amazon EMR). You can use AWS Data Pipeline Task Runner as your task runner, or you can write your own task runner to provide custom data management.

AWS Data Pipeline implements two main sets of functionality. Use the first set to create a pipeline and define data sources, schedules, dependencies, and the transforms to be performed on the data. Use the second set in your task runner application to receive the next task ready for processing. The logic for performing the task, such as querying the data, running data analysis, or converting the data from one format to another, is contained within the task runner. The task runner performs the task assigned to it by the web service, reporting progress to the web service as it does so. When the task is done, the task runner reports the final success or failure of the task to the web service.
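
The two sets of functionality described above can be sketched roughly as follows. This is a minimal illustration, not the library's prescribed usage: `createAndActivate`, `runOneTask`, and the `handler` callback are hypothetical names, and the response fields (`pipelineId`, `errored`, `taskObject`) are assumed to match the AWS Data Pipeline API and to pass through this client unchanged.

```javascript
// Sketch only: `client` is expected to look like the object returned by
// require('@datafire/amazonaws_datapipeline').create({...}).
// Response field names are assumptions based on the AWS Data Pipeline API.

// First set: create a pipeline, upload its definition, then activate it.
async function createAndActivate(client, name, uniqueId, pipelineObjects) {
  const { pipelineId } = await client.CreatePipeline({ name, uniqueId });
  const put = await client.PutPipelineDefinition({ pipelineId, pipelineObjects });
  if (put && put.errored) {
    // Validation errors prevent activation, so stop here.
    throw new Error('Pipeline definition rejected for ' + pipelineId);
  }
  await client.ActivatePipeline({ pipelineId });
  return pipelineId;
}

// Second set: one iteration of a task runner loop.
// `handler` is a hypothetical callback that performs the actual work.
async function runOneTask(client, workerGroup, handler) {
  const polled = await client.PollForTask({ workerGroup });
  const task = polled && polled.taskObject;
  if (!task || !task.taskId) return null; // nothing ready for processing

  let taskStatus = 'FINISHED';
  try {
    await client.ReportTaskProgress({ taskId: task.taskId });
    await handler(task);
  } catch (err) {
    taskStatus = 'FAILED';
  }
  // Report the final success or failure of the task to the web service.
  await client.SetTaskStatus({ taskId: task.taskId, taskStatus });
  return taskStatus;
}
```

A real task runner would call `runOneTask` repeatedly and report progress periodically while the handler is running; the single progress call here only marks where that reporting would go.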

Actions

ActivatePipeline

amazonaws_datapipeline.ActivatePipeline({
  "pipelineId": ""
}, context)

Input

Output

AddTags

amazonaws_datapipeline.AddTags({
  "pipelineId": "",
  "tags": []
}, context)

Input

  • input object
    • pipelineId required id
    • tags required tagList

Output

CreatePipeline

amazonaws_datapipeline.CreatePipeline({
  "name": "",
  "uniqueId": ""
}, context)

Input

Output

DeactivatePipeline

amazonaws_datapipeline.DeactivatePipeline({
  "pipelineId": ""
}, context)

Input

Output

DeletePipeline

amazonaws_datapipeline.DeletePipeline({
  "pipelineId": ""
}, context)

Input

  • input object
    • pipelineId required id

Output

Output schema unknown

DescribeObjects

amazonaws_datapipeline.DescribeObjects({
  "pipelineId": "",
  "objectIds": []
}, context)

Input

  • input object
    • marker string
    • evaluateExpressions boolean
    • objectIds required idList
    • pipelineId required id

Output

DescribePipelines

amazonaws_datapipeline.DescribePipelines({
  "pipelineIds": []
}, context)

Input

  • input object

Output

EvaluateExpression

amazonaws_datapipeline.EvaluateExpression({
  "pipelineId": "",
  "objectId": "",
  "expression": ""
}, context)

Input

  • input object
    • expression required longString
    • objectId required id
    • pipelineId required id

Output

GetPipelineDefinition

amazonaws_datapipeline.GetPipelineDefinition({
  "pipelineId": ""
}, context)

Input

  • input object

Output

ListPipelines

amazonaws_datapipeline.ListPipelines({}, context)

Input

  • input object

Output

PollForTask

amazonaws_datapipeline.PollForTask({
  "workerGroup": ""
}, context)

Input

Output

PutPipelineDefinition

amazonaws_datapipeline.PutPipelineDefinition({
  "pipelineId": "",
  "pipelineObjects": []
}, context)

Input

Output

QueryObjects

amazonaws_datapipeline.QueryObjects({
  "pipelineId": "",
  "sphere": ""
}, context)

Input

  • input object
    • limit int
    • marker string
    • pipelineId required id
    • query Query
    • sphere required string

Output
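
The `marker` input suggests that results are paged. A minimal sketch of collecting every page is shown here, assuming the response carries `ids`, `marker`, and `hasMoreResults` as in the AWS Data Pipeline API; `queryAllObjectIds` is a hypothetical helper, not part of this package.

```javascript
// Sketch only: `client` is the object created by this package's create().
// The response fields ids, marker and hasMoreResults are assumptions taken
// from the AWS Data Pipeline API.
async function queryAllObjectIds(client, pipelineId, sphere, query) {
  const ids = [];
  let marker;
  do {
    const input = { pipelineId, sphere };
    if (query) input.query = query;
    if (marker) input.marker = marker; // continue from the previous page
    const page = await client.QueryObjects(input);
    ids.push(...(page.ids || []));
    marker = page.hasMoreResults ? page.marker : undefined;
  } while (marker);
  return ids;
}
```

The loop stops as soon as a page reports no more results or returns no marker, so a single-page response costs exactly one call.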

RemoveTags

amazonaws_datapipeline.RemoveTags({
  "pipelineId": "",
  "tagKeys": []
}, context)

Input

  • input object

Output

ReportTaskProgress

amazonaws_datapipeline.ReportTaskProgress({
  "taskId": ""
}, context)

Input

Output

ReportTaskRunnerHeartbeat

amazonaws_datapipeline.ReportTaskRunnerHeartbeat({
  "taskrunnerId": ""
}, context)

Input

  • input object
    • hostname id
    • taskrunnerId required id
    • workerGroup string

Output

SetStatus

amazonaws_datapipeline.SetStatus({
  "pipelineId": "",
  "objectIds": [],
  "status": ""
}, context)

Input

  • input object
    • objectIds required idList
    • pipelineId required id
    • status required string

Output

Output schema unknown

SetTaskStatus

amazonaws_datapipeline.SetTaskStatus({
  "taskId": "",
  "taskStatus": ""
}, context)

Input

Output

ValidatePipelineDefinition

amazonaws_datapipeline.ValidatePipelineDefinition({
  "pipelineId": "",
  "pipelineObjects": []
}, context)

Input

Output

Definitions

ActivatePipelineInput

  • ActivatePipelineInput object: Contains the parameters for ActivatePipeline.

ActivatePipelineOutput

  • ActivatePipelineOutput object: Contains the output of ActivatePipeline.

AddTagsInput

  • AddTagsInput object: Contains the parameters for AddTags.
    • pipelineId required id
    • tags required tagList

AddTagsOutput

  • AddTagsOutput object: Contains the output of AddTags.

CreatePipelineInput

  • CreatePipelineInput object: Contains the parameters for CreatePipeline.

CreatePipelineOutput

  • CreatePipelineOutput object: Contains the output of CreatePipeline.
    • pipelineId required id

DeactivatePipelineInput

  • DeactivatePipelineInput object: Contains the parameters for DeactivatePipeline.

DeactivatePipelineOutput

  • DeactivatePipelineOutput object: Contains the output of DeactivatePipeline.

DeletePipelineInput

  • DeletePipelineInput object: Contains the parameters for DeletePipeline.
    • pipelineId required id

DescribeObjectsInput

  • DescribeObjectsInput object: Contains the parameters for DescribeObjects.

DescribeObjectsOutput

DescribePipelinesInput

  • DescribePipelinesInput object: Contains the parameters for DescribePipelines.

DescribePipelinesOutput

  • DescribePipelinesOutput object: Contains the output of DescribePipelines.

EvaluateExpressionInput

  • EvaluateExpressionInput object: Contains the parameters for EvaluateExpression.
    • expression required longString
    • objectId required id
    • pipelineId required id

EvaluateExpressionOutput

  • EvaluateExpressionOutput object: Contains the output of EvaluateExpression.

Field

  • Field object: A key-value pair that describes a property of a pipeline object. The value is specified as either a string value (StringValue) or a reference to another object (RefValue) but not as both.
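
The "either a string value or a reference, but not both" rule can be enforced before a request is sent. The helper below is a hypothetical sketch; the property names `key`, `stringValue`, and `refValue` are assumed from the AWS Data Pipeline API.

```javascript
// Sketch only: makeField is a hypothetical helper, not part of this package.
// Builds a Field with exactly one of stringValue or refValue.
function makeField(key, { stringValue, refValue } = {}) {
  const hasString = stringValue !== undefined;
  const hasRef = refValue !== undefined;
  if (hasString === hasRef) {
    // Either both were given or neither was.
    throw new Error('Field "' + key + '" needs exactly one of stringValue or refValue');
  }
  return hasString ? { key, stringValue } : { key, refValue };
}
```

For example, `makeField('type', { stringValue: 'Schedule' })` yields a string-valued field, while `makeField('schedule', { refValue: 'MySchedule' })` yields a reference to another object; the keys and values here are illustrative only.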

GetPipelineDefinitionInput

  • GetPipelineDefinitionInput object: Contains the parameters for GetPipelineDefinition.

GetPipelineDefinitionOutput

InstanceIdentity

  • InstanceIdentity object: Identity information for the EC2 instance that is hosting the task runner. You can get this value by calling a metadata URI from the EC2 instance. For more information, see Instance Metadata in the Amazon Elastic Compute Cloud User Guide. Passing in this value proves that your task runner is running on an EC2 instance, and ensures the proper AWS Data Pipeline service charges are applied to your pipeline.

InternalServiceError

  • InternalServiceError object: An internal service error occurred.

InvalidRequestException

  • InvalidRequestException object: The request was not valid. Verify that your request was properly formatted, that the signature was generated with the correct credentials, and that you haven't exceeded any of the service limits for your account.

ListPipelinesInput

  • ListPipelinesInput object: Contains the parameters for ListPipelines.

ListPipelinesOutput

  • ListPipelinesOutput object: Contains the output of ListPipelines.

Operator

  • Operator object: Contains a logical operation for comparing the value of a field with a specified value.

OperatorType

  • OperatorType string (values: EQ, REF_EQ, LE, GE, BETWEEN)

ParameterAttribute

ParameterAttributeList

ParameterObject

ParameterObjectList

ParameterValue

ParameterValueList

PipelineDeletedException

  • PipelineDeletedException object: The specified pipeline has been deleted.

PipelineDescription

  • PipelineDescription object: Contains pipeline metadata.

PipelineDescriptionList

PipelineIdName

  • PipelineIdName object: Contains the name and identifier of a pipeline.

PipelineNotFoundException

  • PipelineNotFoundException object: The specified pipeline was not found. Verify that you used the correct user and account identifiers.

PipelineObject

  • PipelineObject object: Contains information about a pipeline object. This can be a logical, physical, or physical attempt pipeline object. The complete set of components of a pipeline defines the pipeline.

PipelineObjectList

PipelineObjectMap

PollForTaskInput

  • PollForTaskInput object: Contains the parameters for PollForTask.

PollForTaskOutput

  • PollForTaskOutput object: Contains the output of PollForTask.

PutPipelineDefinitionInput

PutPipelineDefinitionOutput

Query

  • Query object: Defines the query to run against an object.

QueryObjectsInput

  • QueryObjectsInput object: Contains the parameters for QueryObjects.

QueryObjectsOutput

  • QueryObjectsOutput object: Contains the output of QueryObjects.

RemoveTagsInput

  • RemoveTagsInput object: Contains the parameters for RemoveTags.

RemoveTagsOutput

  • RemoveTagsOutput object: Contains the output of RemoveTags.

ReportTaskProgressInput

  • ReportTaskProgressInput object: Contains the parameters for ReportTaskProgress.

ReportTaskProgressOutput

  • ReportTaskProgressOutput object: Contains the output of ReportTaskProgress.

ReportTaskRunnerHeartbeatInput

  • ReportTaskRunnerHeartbeatInput object: Contains the parameters for ReportTaskRunnerHeartbeat.
    • hostname id
    • taskrunnerId required id
    • workerGroup string

ReportTaskRunnerHeartbeatOutput

  • ReportTaskRunnerHeartbeatOutput object: Contains the output of ReportTaskRunnerHeartbeat.

Selector

  • Selector object: A comparison that is used to determine whether a query should return this object.

SelectorList

  • SelectorList array: The list of Selectors that define queries on individual fields.
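
Query, Selector, and Operator nest inside one another. A minimal sketch of assembling them is shown here, assuming the property names `selectors`, `fieldName`, `operator`, `type`, and `values` from the AWS Data Pipeline API; `makeSelector` and `makeQuery` are hypothetical helpers, and the `@status` field name is illustrative.

```javascript
// Sketch only: helper names are hypothetical; property names are assumed
// from the AWS Data Pipeline API.
const OPERATOR_TYPES = ['EQ', 'REF_EQ', 'LE', 'GE', 'BETWEEN'];

// One selector compares a single field against the given values.
function makeSelector(fieldName, type, values) {
  if (!OPERATOR_TYPES.includes(type)) {
    throw new Error('Unknown operator type: ' + type);
  }
  return { fieldName, operator: { type, values } };
}

// A query is a list of selectors on individual fields.
function makeQuery(selectors) {
  return { selectors };
}

const finishedQuery = makeQuery([makeSelector('@status', 'EQ', ['FINISHED'])]);
```

The resulting object could then be passed as the `query` input of QueryObjects.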

SetStatusInput

  • SetStatusInput object: Contains the parameters for SetStatus.
    • objectIds required idList
    • pipelineId required id
    • status required string

SetTaskStatusInput

SetTaskStatusOutput

  • SetTaskStatusOutput object: Contains the output of SetTaskStatus.

Tag

  • Tag object: Tags are key/value pairs defined by a user and associated with a pipeline to control access. AWS Data Pipeline allows you to associate up to ten tags per pipeline. For more information, see Controlling User Access to Pipelines in the AWS Data Pipeline Developer Guide.
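
A tag list for AddTags can be built from a plain object while checking the ten-tag limit up front. This is a hypothetical sketch: `toTagList` is not part of this package, and the `key` and `value` property names are assumed from the AWS Data Pipeline API.

```javascript
// Sketch only: toTagList is a hypothetical helper.
const MAX_TAGS_PER_PIPELINE = 10;

// Converts { env: 'prod' } into [{ key: 'env', value: 'prod' }].
function toTagList(tagsByKey) {
  const tags = Object.entries(tagsByKey).map(([key, value]) => ({
    key,
    value: String(value)
  }));
  if (tags.length > MAX_TAGS_PER_PIPELINE) {
    throw new Error('A pipeline can have at most ' + MAX_TAGS_PER_PIPELINE + ' tags');
  }
  return tags;
}
```

The check only covers the tags in a single call; tags already attached to the pipeline are not counted here.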

TaskNotFoundException

  • TaskNotFoundException object: The specified task was not found.

TaskObject

  • TaskObject object: Contains information about a pipeline task that is assigned to a task runner.

TaskStatus

  • TaskStatus string (values: FINISHED, FAILED, FALSE)

ValidatePipelineDefinitionInput

ValidatePipelineDefinitionOutput

ValidationError

  • ValidationError object: Defines a validation error. Validation errors prevent pipeline activation. The set of validation errors that can be returned is defined by AWS Data Pipeline.

ValidationErrors

ValidationWarning

  • ValidationWarning object: Defines a validation warning. Validation warnings do not prevent pipeline activation. The set of validation warnings that can be returned is defined by AWS Data Pipeline.
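
Because errors block activation and warnings do not, a caller will typically separate the two after validating a definition. The sketch below assumes the response shape of the AWS Data Pipeline API (`validationErrors` entries with `id` and `errors`, `validationWarnings` entries with `id` and `warnings`, plus an `errored` flag); `summarizeValidation` is a hypothetical helper.

```javascript
// Sketch only: summarizeValidation is a hypothetical helper; the response
// field names are assumptions based on the AWS Data Pipeline API.
function summarizeValidation(result) {
  const errors = (result.validationErrors || []).flatMap((e) =>
    (e.errors || []).map((message) => e.id + ': ' + message)
  );
  const warnings = (result.validationWarnings || []).flatMap((w) =>
    (w.warnings || []).map((message) => w.id + ': ' + message)
  );
  return {
    // Only errors block activation; warnings are informational.
    canActivate: errors.length === 0 && !result.errored,
    errors,
    warnings
  };
}
```

The same helper could be applied to the result of either ValidatePipelineDefinition or PutPipelineDefinition, provided both return the fields assumed above.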

ValidationWarnings

attributeNameString

  • attributeNameString string

attributeValueString

  • attributeValueString string

boolean

  • boolean boolean

cancelActive

  • cancelActive boolean

errorMessage

  • errorMessage string

fieldList

  • fieldList array

fieldNameString

  • fieldNameString string

fieldStringValue

  • fieldStringValue string

id

  • id string

idList

  • idList array
    • items id

int

  • int integer

longString

  • longString string

pipelineList

string

  • string string

stringList

  • stringList array

tagKey

  • tagKey string

tagList

  • tagList array

tagValue

  • tagValue string

taskId

  • taskId string

timestamp

  • timestamp string

validationMessage

  • validationMessage string

validationMessages

License

MIT
