Run River

Description

This job runs a River in Rivery using this endpoint.

Use Cases

We recommend creating a Run River task for each Rivery River.

This way, you can use Orchestra to trigger your data ingestion on a manual or event-based schedule. This has a number of advantages over using Rivery's built-in scheduler:

  • You can use Orchestra to coordinate Rivery with tasks outside of Rivery, such as other ingestion tools for connectors Rivery doesn't support.

  • You can coordinate multiple Rivers in Rivery to run at the same time. By creating the tasks in parallel in Orchestra, you only need to change the schedule in one place instead of modifying the schedule of each River individually in Rivery.

  • When Rivery Rivers run, they incur both Rivery Pricing Unit Credits and data warehouse costs. Running these operations on a schedule you set explicitly helps keep these costs from getting out of hand.

  • We aggregate metadata from the Rivery Task in the same place as the metadata from other operations in your Pipeline.

Parameters

These parameters are required to run the Run River task.

| Name | Data type | Restrictions | Example |
|---|---|---|---|
| River ID | String | N.A. | abcdefghi123456789 |

Error handling

Rate limiting

Rivery enforces strict throttling limits, so make sure your pipeline configuration in Orchestra does not exceed them. The text below is taken from the Rivery API documentation:

Throttling rules:

  • River can be executed up to 2 times per minute

  • Each user can execute a river up to 15 times per minute
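As a rough illustration of how you might sanity-check a pipeline design against these limits before scheduling it, the sketch below counts planned River runs in a sliding 60-second window. It is not part of Orchestra or Rivery: the function names, the `(river_id, start_time)` input format, and the sliding-window interpretation of "per minute" are all assumptions made for this example, and Rivery may measure its windows differently.

```python
from collections import defaultdict

# Limits quoted from the Rivery API documentation above.
MAX_RUNS_PER_RIVER_PER_MINUTE = 2
MAX_RUNS_PER_USER_PER_MINUTE = 15
WINDOW_SECONDS = 60  # assumption: "per minute" treated as a sliding 60 s window


def max_in_window(timestamps, window=WINDOW_SECONDS):
    """Return the largest number of timestamps that fall within any span shorter than `window`."""
    ordered = sorted(timestamps)
    best = 0
    start = 0
    for end, ts in enumerate(ordered):
        # Advance the left edge until the span is shorter than the window.
        while ts - ordered[start] >= window:
            start += 1
        best = max(best, end - start + 1)
    return best


def check_planned_runs(planned_runs):
    """Check a hypothetical plan of (river_id, start_time_in_seconds) tuples for one user.

    Returns a list of human-readable violations; an empty list means the plan
    fits within both limits under the assumptions above.
    """
    violations = []
    by_river = defaultdict(list)
    for river_id, start in planned_runs:
        by_river[river_id].append(start)

    for river_id, starts in by_river.items():
        peak = max_in_window(starts)
        if peak > MAX_RUNS_PER_RIVER_PER_MINUTE:
            violations.append(f"river {river_id}: {peak} runs within one minute")

    peak_user = max_in_window([start for _, start in planned_runs])
    if peak_user > MAX_RUNS_PER_USER_PER_MINUTE:
        violations.append(f"user: {peak_user} runs within one minute")
    return violations
```

For example, `check_planned_runs([("a", 0), ("a", 10), ("a", 20)])` reports a violation for River `a`, while spacing the same River's runs 60 seconds apart does not.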

API Requests

If we receive the following error codes from Rivery, we'll raise an error and the task will move to a failed state.

| Status Code | Description | Handling |
|---|---|---|
| 429 | Rate limits exceeded | Orchestra fails your task. Rivers shouldn't be run this frequently. |
| 401 | Invalid API token | Orchestra cannot start the River run |
| 403 | Invalid Account ID or Environment ID | Orchestra cannot start the River run |
| 404 | River ID is wrong | Orchestra cannot find the River to start |
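If you want to reason about these failures in your own tooling (for example, when triaging a failed task), the status codes listed above can be captured in a simple lookup. This is only a sketch: the dictionary and the `describe_failure` helper are hypothetical names for this example and do not reflect how Orchestra implements its handling internally.

```python
# Status codes and descriptions copied from the documentation above.
RIVERY_ERRORS = {
    429: ("Rate limits exceeded",
          "Orchestra fails your task. Rivers shouldn't be run this frequently."),
    401: ("Invalid API token",
          "Orchestra cannot start the River run"),
    403: ("Invalid Account ID or Environment ID",
          "Orchestra cannot start the River run"),
    404: ("River ID is wrong",
          "Orchestra cannot find the River to start"),
}


def describe_failure(status_code):
    """Return 'description: handling' for a documented code, or None otherwise.

    Hypothetical helper: codes not listed above are outside what this page
    documents, so they are deliberately not given a message here.
    """
    entry = RIVERY_ERRORS.get(status_code)
    if entry is None:
        return None
    description, handling = entry
    return f"{description}: {handling}"
```

A 401 or 403 usually points at the credentials or account settings used for the Rivery connection, whereas a 404 points at the River ID parameter of the task itself.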
