Run job
Description
This job triggers a connector using this endpoint.
Use Cases
We recommend creating a Run job Task for every "job" within dbt Cloud.
This way, you can use Orchestra to trigger your data transformation on a cron or event based schedule. This has a number of advantages vs. using dbt's in-built scheduler:
You can co-ordinate tasks outside of dbt, like ingestion tools (for operations that move data to your data warehouse) or reverse ELT tools (for moving data outside of the warehouse)
You can use Orchestra to trigger jobs across dbt Cloud Accounts and Projects. This means you can enable a data mesh architecture with a high degree of flexibility around Orchestrating different jobs
When dbt Cloud jobs run, Data Warehouse cost is incurred. Running these operations on a schedule you set explicitly ensures these costs do not go out of hand. This easily happens when Orchestrating multiple jobs across different accounts or projects, because dbt Cloud triggers cross-project jobs automatically where that setting is enabled
Orchestrating across dbt Projects is only available via an enterprise dbt Cloud plan
We aggregate metadata from the dbt Task in the same place as the metadata from other operations in your Pipeline
Setup guide
Fetch the dbt Job ID
Head to the dbt cloud, and select "Deploy" -> "Jobs"
You'll be taken to an overview of your jobs where the URL is:
https://cloud.getdbt.com/deploy/{account_id}/projects/{project_id}/jobs
Select the job you want, and parse the
job_id
from the url, which is of the form:https://cloud.getdbt.com/deploy/{account_id}/projects/{project_id}/jobs/{job_id}
Parameters
These parameters are required to run the Job run Task
Name | Data type | Restrictions | Example |
---|---|---|---|
Job ID | Number | N.A. | 12345 |
Error handling
API Requests
If we receive the following error codes from dbt Cloud, we'll raise an error and the task will move to a failed state.
Code | Description | Handling |
401 | Unauthorised | We will raise an error and parse the raw error message from the dbt response as the Orchestra message |
404 | Not Found | We will raise an error with the HTTP Reason as the Orchestra message |
Other error code | We will raise an error with the HTTP Reason as the Orchestra message |
Failed models or tests
The job response from dbt Cloud looks like this:
There is therefore, limited information provided in the Orchestra message
field, however detailed run information is available in the Lineage tab.
Retrying from the point of failure
dbt Cloud supports retrying a node from the point of failure.
In this case, Orchestra will make a request to this endpoint.
This can be done in the Orchestra UI by navigating to "View All Runs" here:
Select a run by clicking on it:
You will be able to retry from the point of failure by selecting Re-run pipeline and selecting "Re-run from Failed":
Last updated