Synapse Run Pipeline

Description

This job triggers a Pipeline run in Azure Synapse Analytics (not a Pipeline in Orchestra) and monitors its progress.
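For orientation, the trigger-then-monitor pattern can be sketched as below. This is a minimal illustration and not Orchestra's implementation. It assumes the public Synapse data-plane REST endpoints (`createRun` and `pipelineRuns`), the API version `2020-12-01`, and the terminal status names shown. The helper names and the injected `post_json` / `get_json` callables are hypothetical, and authentication is left to whatever HTTP client you plug in.

```python
import time
from typing import Any, Callable, Dict
from urllib.parse import quote

API_VERSION = "2020-12-01"  # assumption: Synapse data-plane API version
TERMINAL_STATUSES = {"Succeeded", "Failed", "Cancelled"}  # assumption: terminal run states


def create_run_url(endpoint: str, pipeline_name: str) -> str:
    """Build the URL that starts a run of the named Synapse pipeline."""
    name = quote(pipeline_name, safe="")
    return f"{endpoint.rstrip('/')}/pipelines/{name}/createRun?api-version={API_VERSION}"


def run_status_url(endpoint: str, run_id: str) -> str:
    """Build the URL that returns the state of a single pipeline run."""
    return f"{endpoint.rstrip('/')}/pipelineRuns/{quote(run_id, safe='')}?api-version={API_VERSION}"


def trigger_and_wait(
    endpoint: str,
    pipeline_name: str,
    parameters: Dict[str, Any],
    post_json: Callable[[str, Dict[str, Any]], Dict[str, Any]],
    get_json: Callable[[str], Dict[str, Any]],
    poll_seconds: float = 30.0,
    sleep: Callable[[float], None] = time.sleep,
) -> str:
    """Start a pipeline run, then poll until it reaches a terminal status."""
    # The parameters object is sent as the request body of the createRun call.
    run_id = post_json(create_run_url(endpoint, pipeline_name), parameters)["runId"]
    while True:
        status = get_json(run_status_url(endpoint, run_id))["status"]
        if status in TERMINAL_STATUSES:
            return status
        sleep(poll_seconds)
```

Passing the HTTP calls in as functions keeps the sketch independent of any particular client library and makes the polling loop easy to exercise without a live workspace.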

Use Cases

We recommend creating a Synapse Run Pipeline Task for every "Pipeline" within Synapse.

This way, you can use Orchestra to trigger your Synapse Pipelines on a cron or event-based schedule. This has a number of advantages over using Synapse's built-in scheduler:

  • You can co-ordinate tasks outside of Synapse. These would typically be tasks in Synapse-adjacent environments, e.g. Spark jobs, notebooks, or Python jobs

    • A common use case is to have dbt or Coalesce.io jobs run after Synapse workflows that run autoloader scripts

  • You can use Orchestra to trigger jobs across Synapse environments

  • When Synapse jobs run, cloud compute cost is incurred. Running these operations on a schedule you set explicitly ensures these costs do not get out of hand

  • We aggregate metadata from the Task in the same place as the metadata from other operations in your Pipeline

Parameters and setup

These parameters are required to run the Synapse Run Pipeline Task:

| Name          | Data type | Restrictions | Example          |
|---------------|-----------|--------------|------------------|
| Pipeline name | String    | N.A.         | order-pipeline   |
| Parameters    | JSON      | JSON format  | {"arg": "dummy"} |

Setup guide

Fetch the Azure Synapse Analytics parameters

  1. Head to your Azure Synapse Analytics portal

  2. Navigate to Integrate -> Pipelines. From here you can get the name of the pipeline you want to run.

We recommend leaving your pipelines without a trigger. This way Orchestra can manually trigger them when needed.

Options

You can send parameters to the pipeline run by pasting a JSON object containing them into the Parameters field.
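Before pasting, it can help to check that the text is valid JSON and that its top level is an object rather than a list or a bare value. The sketch below uses only the Python standard library. The function name `parse_pipeline_parameters` is hypothetical and the object-only rule is an assumption based on the example above, not a documented Orchestra check.

```python
import json
from typing import Any, Dict


def parse_pipeline_parameters(raw: str) -> Dict[str, Any]:
    """Parse parameter text and require a JSON object at the top level."""
    value = json.loads(raw)  # raises json.JSONDecodeError (a ValueError) on invalid JSON
    if not isinstance(value, dict):
        raise ValueError('Parameters must be a JSON object, e.g. {"arg": "dummy"}')
    return value


params = parse_pipeline_parameters('{"arg": "dummy"}')
print(params["arg"])  # prints: dummy
```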
