Datastream check stream status

Description

This job checks the status of the stream using this endpoint. If the stream is not in a "RUNNING" state, the Task will fail

Use Cases

We recommend creating a Datastream check stream status Task for every Datastream you use in GCP.

This way, you can ensure that at a bare minimum, the Datastream is in a healthy and unpaused state before running data transformations on data in Datastream destinations, or other potentially expensive operations. This also acts as a natural check on the Datastream in the same place as all your other data assets.

Additionally:

  • You can co-ordinate tasks outside of Datastream, like batch ingestion tools such as Airbyte or Fivetran, or potentially other streams

  • You get a log of helpful metadata stored in Orchestra for the Datastream regarding its status

  • We aggregate metadata from GCP for Datastream in the same place as the metadata from other operations in your Pipeline

Parameters

These parameters are required to run the Datastream check stream status Task

NameData typeRestrictionsExample

Connection ID

GUID

N.A.

0368bfcb-946f-4d0f-8b14-2252cdc1f5e0

Workspace ID (Optional)

GUID

N.A.

0368bfcb-946f-4d0f-8b14-2252cdc1f5e0

Error handling

Stream Status (Stati?)

If we receive the following statuses from GCP, we'll raise an error and the task will move to a failed state.

StateDescriptionHandling

STATE_UNSPECIFIED

Unspecified stream state.

We will raise an error and store the state in the Orchestra message

NOT_STARTED

The stream has been created but has not yet started streaming data.

We will raise an error and store the state in the Orchestra message

RUNNING

The stream is running.

We will raise an error and store the state in the Orchestra message

PAUSED

The stream is paused.

We will raise an error and store the state in the Orchestra message

MAINTENANCE

The stream is in maintenance mode. Updates are rejected on the resource in this state.

We will raise an error and store the state in the Orchestra message

FAILED

The stream is experiencing an error that is preventing data from being streamed.

We will raise an error and store the state in the Orchestra message

FAILED_PERMANENTLY

The stream has experienced a terminal failure.

We will raise an error and store the state in the Orchestra message

STARTING

The stream is starting, but not yet running.

We will raise an error and store the state in the Orchestra message

DRAINING

The Stream is no longer reading new events, but still writing events in the buffer.

We will raise an error and store the state in the Orchestra message

Last updated