globe
Webinars

TaskFlow API in Airflow 2.0

Watch On Demand

Hosted By

  • Viraj Parekh Viraj Parekh Field CTO
    Astronomer

Check out the code used in the above webinar and instructions to set up your own Xcom backend!

What is the TaskFlow API?

Prior to Airflow 2.0, Airflow did not have an explicit way to declare messages passed between tasks in a DAG.

XComs could be used, but were hidden in execution functions inside the operator.

The TaskFlow API is a functional API that allows you to explicitly declare message passing while implicitly declaring task dependencies.

TaskFlow API Features

TaskFlow API Functionality Includes:

  • XCom Args, which allow task dependencies to be abstracted and inferred as a result of the Python function invocation
  • A task decorator that automatically creates PythonOperator tasks from Python functions and handles variable passing
  • Support for Custom XCom Backends

taskflow-1

taskflow-2

Decorators

Task decorator allows users to convert any Python function into a task instance using PythonOperator.

DAG decorator allows users to instantiate the DAG without using a context manager.

taskflow-3

Custom Xcom Backends

The TaskFlow API supports a new `xcom_backend` parameter, which allows you to

  • Store XComs external to Airflow
  • Implement your own serialization and deserialization methods

Future Work

The TaskFlow API is a new Airflow feature, and will likely be expanded on in the future.

Development of additional decorators to support other operators is already ongoing.

The easiest way to get started with Apache Airflow 2.0 is by using the Astronomer CLI. To make it easy you can get up and running with Airflow by following our Quickstart Guide.



Astronomer Apache Airflow Fundamentals Certification badge

Get Apache Airflow Certified

If you want to learn more about how to get started with Airflow, you can join the thousands of other data engineers who have received the Astronomer Certification for Apache Airflow Fundamentals. This exam assesses an understanding of the basics of the Airflow architecture and the ability to create simple data pipelines for scheduling and monitoring tasks.

Learn More About Certification