Run River
Last updated
Last updated
This job runs a River in Rivery using endpoint.
We recommend creating a Run River task for each Rivery River.
This way, you can use Orchestra to trigger your data ingestion on a manual or event based schedule. This has a number of advantages vs. using Rivery's in-built scheduler:
You can coordinate tasks outside of Rivery, like other ingestion tools for connectors Rivery doesn't support, using Orchestra.
You can coordinate multiple rivers in Rivery to occur at the same time. By creating the tasks in parallel in Orchestra you will only need to change the schedule in one place instead of modifying the schedule of each river individually in Rivery.
When Rivery rivers run, both Rivery Pricing Unit Credits and Data Warehouse costs are incurred. Running these operations on a schedule you set explicitly ensures these costs do not go out of hand.
We aggregate metadata from the Rivery Task in the same place as the metadata from other operations in your Pipeline
These parameters are required to run the Run River task
River ID
String
N.A.
abcdefghi123456789
Rivery has strict throttling limits, so ensure your pipeline configuration in Orchestra will not surpass these limits. The below text is taken from the Rivery API documentation:
Throttling rules: * River can be executed up to 2 times per minute * Each user can execute a river up to 15 times per minute
If we receive the following error codes from Rivery, we'll raise an error and the task will move to a failed state.
429
Rate limits exceeded
Orchestra fails your task. Rivers shouldn't be run this frequently
401
Invalid API token
Orchestra cannot start the River run
403
Invalid Account ID or Environment ID
Orchestra cannot start the River run
404
River ID is wrong
Orchestra cannot find the River to start