Integrations

How to connect to any data tool or infrastructure using Orchestra

In Orchestra, a connection to a third party data tool or piece of data infrastructure is called an "Integration".

These could be data ingestion tools like Fivetran or Stitch, data transformation tools like dbt Cloud, data warehouses like Snowflake or reverse ELT tools, to name but a few.

We also recognise the need to trigger and monitor core pieces of infrastructure within cloud providers such as AWS, GCP or Azure. We recognise that while stitching together cloud vendor-specific pieces of infrastructure is often quite straightforward, providing end-to-end visbility and orchestration across these systems and third party saas tools is a challenge.

Therefore, in Orchestra Integrations to cloud infrastructure components exist too, like GCP Datastream, AWS ECS/EKS/EC2 and Azure Virtual machines, for example.

Features

  • Authentication: every Integration should have the concept of a user or Service Principle that enables a Data Governance first authentication and usage approach

  • Secret encryption in transit and at rest

    • Optional: secrets checked for expiry / validity up to every hour to avoid pipelines breaking due to secret expiry / rotation

Integrations supported

Data ingestion

  • Stitch

  • Fivetran

  • Airbyte Cloud

  • Airbyte Server (beta)

  • Matillion Data Productivity Cloud ("DPC") (beta)

Utility

  • HTTP (beta)

  • Render

  • Python SDK (beta)

Business Intelligence

  • Sigma (beta)

  • Lightdash (beta)

  • Hex (beta)

  • Power BI (beta)

Workflow orchestration

  • Airflow (beta)

  • Prefect cloud

Streaming Platforms

  • GCP - Datastream

Data transformation

  • dbt Cloud

  • Coalesce

Reverse ELT

  • Census

  • Hightouch

Data warehouse

  • Snowflake

  • Databricks (beta)

Cloud Provider

  • AWS Glue (beta)

  • AWS EC2 (Elastic Cloud Compute) (beta)

  • Azure Data Factory

  • Azure VMs (beta)

Last updated