Airflow setup
Learn how to integrate Catalog with Apache Airflow using Marquez and OpenLineage for lineage tracking.
Beta Feature
This feature is in Beta. Reach out if you want to activate it on your instance.
Catalog integrates with Marquez and OpenLineage to capture lineage from your Airflow DAGs.
Configure Airflow
To connect Airflow to Catalog, install the OpenLineage integration and set the required environment variables.
- If using Airflow 2.3 through 2.6: add
openlineage-airflowto the requirements or packages to install within the Airflow instance - If using Airflow 2.7: nothing to install
- Add the following environment variables:
OPENLINEAGE_URL: the URL of the HTTP backendOPENLINEAGE_API_KEY: the API key to allow Airflow to write to MarquezOPENLINEAGE_NAMESPACE: the namespace associated with the Airflow instance
Credentials
You must provide Catalog with the following elements:
- Marquez API URL and API key:
OPENLINEAGE_URL,OPENLINEAGE_API_KEY - Namespace configured for Airflow (
OPENLINEAGE_NAMESPACE) - Namespace for the main warehouse source (set by OpenLineage)
No Marquez Server
If you do not have your own Marquez server, reach out to the support team.