Run AWS Glue ETL Job
Last updated
Last updated
This job triggers the running of an ETL Job in AWS Glue
We recommend creating a ETL Job for each ingestion/transformation task in AWS Glue you wish to perform.
This way, you can use Orchestra to trigger your reverse ELT on a cron or event based schedule. This has a number of advantages vs. using AWS' in built workflows feature.
You can co-ordinate tasks outside of AWS - these would typically be other ETL jobs, other notebooks, or other tasks in AWS-adjacent environments e.g. Snowflake queries
You can use Orchestra to trigger jobs across AWS Accounts / Environments
When AWS Glue jobs run, cost is incurred. Running these operations on a schedule you set explicitly ensures these costs do not go out of hand
We aggregate metadata from the AWS Glue Task in the same place as the metadata from other operations in your Pipeline
These parameters are required to run the Run Workflow Task
Job name
String
N.A.
S3 Ingestion
Arguments
JSON
JSON format
Region
String
AWS region
us-east-1
Create AWS Glue ETL job
Navigate to the
Create an ETL job
You can set arguments to send when starting your ETL job. These much be in a JSON format