airflow.triggers.base
Attributes
Classes
Arguments required for start task execution from triggerer. |
|
Base class for all triggers. |
|
Base class for triggers used to schedule DAGs based on external events. |
|
Something that a trigger can fire when its conditions are met. |
|
Yield this event in order to end the task successfully. |
|
Yield this event in order to end the task with failure. |
|
Yield this event in order to end the task with status 'skipped'. |
Functions
Module Contents
- airflow.triggers.base.log[source]
- type airflow.triggers.base.Operator = MappedOperator | SerializedBaseOperator[source]
- class airflow.triggers.base.StartTriggerArgs[source]
Arguments required for start task execution from triggerer.
- timeout: datetime.timedelta | None = None[source]
- class airflow.triggers.base.BaseTrigger(**kwargs)[source]
Bases:
abc.ABC,airflow.sdk.definitions._internal.templater.Templater,airflow.utils.log.logging_mixin.LoggingMixinBase class for all triggers.
A trigger has two contexts it can exist in:
Inside an Operator, when it’s passed to TaskDeferred
Actively running in a trigger worker
We use the same class for both situations, and rely on all Trigger classes to be able to return the arguments (possible to encode with Airflow-JSON) that will let them be re-instantiated elsewhere.
- trigger_id = None[source]
- template_fields = ()[source]
- template_ext = ()[source]
- task_id = None[source]
- property task_instance: airflow.models.taskinstance.TaskInstance[source]
- render_template_fields(context, jinja_env=None)[source]
Template all attributes listed in self.template_fields.
This mutates the attributes in-place and is irreversible.
- Parameters:
context (airflow.sdk.definitions.context.Context) – Context dict with values to apply on content.
jinja_env (jinja2.Environment | None) – Jinja’s environment to use for rendering.
- abstract serialize()[source]
Return the information needed to reconstruct this Trigger.
- abstract run()[source]
- Async:
Run the trigger in an asynchronous context.
The trigger should yield an Event whenever it wants to fire off an event, and return None if it is finished. Single-event triggers should thus yield and then immediately return.
If it yields, it is likely that it will be resumed very quickly, but it may not be (e.g. if the workload is being moved to another triggerer process, or a multi-event trigger was being used for a single-event task defer).
In either case, Trigger classes should assume they will be persisted, and then rely on cleanup() being called when they are no longer needed.
- async cleanup()[source]
Cleanup the trigger.
Called when the trigger is no longer needed, and it’s being removed from the active triggerer process.
This method follows the async/await pattern to allow to run the cleanup in triggerer main event loop. Exceptions raised by the cleanup method are ignored, so if you would like to be able to debug them and be notified that cleanup method failed, you should wrap your code with try/except block and handle it appropriately (in async-compatible way).
- async on_kill()[source]
Kill the external job managed by this trigger when the task is killed by a user.
Symmetric with
BaseOperator.on_kill()on the worker side: override this method to stop external work (e.g. cancel a BigQuery job, terminate a Databricks run) when a user explicitly acts on the deferred task via mark-failed, clear, or mark-succeeded.Distinction from
cleanup():cleanup()runs on every trigger exit — success, timeout, shutdown, and user kill. It is meant for releasing local resources held by this trigger instance. Putting external job cancellation incleanup()would cancel in-flight work on every triggerer restart or rolling deploy.on_kill()runs only when a user explicitly kills the task. It is the right place to cancel external work you do not want to keep running after the user performs an action.
This only fires when a user acts on the task. It does not fire on:
Triggerer shutdown or restart — the trigger is redistributed, not cancelled.
Triggerer redistribution to another triggerer process.
Trigger timeout — the trigger is killed, not cancelled by user.
Normal trigger completion (the trigger fired an event).
This method runs in the triggerer’s asyncio event loop, so it must be async-safe. Use
awaitfor any I/O; do not block the event loop.Exceptions raised here are logged as warnings and do not propagate — they will not affect the task state or the triggerer. Implement your own retry or error handling inside this method if needed.
on_kill()is given a bounded time to complete. Implementations that call slow external APIs should apply their own timeouts rather than relying on the framework bound.
- static repr(classpath, kwargs)[source]
- __repr__()[source]
- class airflow.triggers.base.BaseEventTrigger(**kwargs)[source]
Bases:
BaseTriggerBase class for triggers used to schedule DAGs based on external events.
BaseEventTriggeris a subclass ofBaseTriggerdesigned to identify triggers compatible with event-driven scheduling.Sharing an underlying I/O stream between triggers
A subclass that polls an upstream resource which can be safely consumed by multiple sibling triggers (e.g. a directory scan, a polling REST API) may opt in to having the triggerer run a single underlying poll loop and fan its raw events out to every trigger in the group. To do so, override:
shared_stream_key()— return a key identifying the shared stream (atupleof strings is a common choice). Triggers whose key compares equal share one poll.open_shared_stream()— open the shared stream and yield raw events. Called once per group in the triggerer.filter_shared_stream()— convert the shared raw stream into this trigger’s ownTriggerEventinstances, applying any per-trigger filtering or transformation.
Triggers whose
shared_stream_keyreturnsNone(the default) keep the existing behavior: each trigger gets its own poll loop viarun().Suitable upstreams
The shared-stream channel is one-way today: events flow from the producer (
open_shared_stream) to each subscriber’sfilter_shared_stream, with no path back to tell the producer that a subscriber accepted, dropped, or finished processing an event. That restricts the pattern to upstreams whose consumption does not depend on a side effect on a handle that only the producer holds:Idempotent / read-only reads (filesystem listings, polling REST APIs).
Subscriber-side-effect cleanup, where the trigger’s per-event action (
unlink, local marking, …) operates through APIs the subscriber already owns, independent of the shared producer handle.
Upstreams not in scope include Kafka consumers (regardless of commit mode), SQS with delete-on-process or visibility extension, and any source where progress on the producer’s handle is tied to the subscriber’s accept / reject decision. These sources need a way for the subscriber to signal acceptance back to the producer, which the current shared-stream API does not provide.
- static hash(classpath, kwargs)[source]
Return the hash of the trigger classpath and kwargs. This is used to uniquely identify a trigger.
We do not want to have this logic in
BaseTriggerbecause, when used to defer tasks, 2 triggers can have the same classpath and kwargs. This is not true for event driven scheduling.
Identify an underlying I/O stream that can be shared with sibling triggers.
Two trigger instances whose
shared_stream_key()return values compare equal (and are notNone) will share a single underlying poll loop in the triggerer. Each instance still receives the events it cares about through its ownfilter_shared_stream()call.Returning
None(the default) opts out of sharing — the trigger runs its own independent poll loop viarun(), exactly as today.The return value is read once when
run_triggerfirst starts this trigger; any change to the key afterwards has no effect on group membership for this instance. To share one poll across a set of sibling triggers, ensure every trigger in the set returns the same key from the outset.The key must be deterministic — derive it from configuration fields, never from per-call values such as
time.time()oruuid.uuid4(), because the comparison must be stable across the lifetime of the group.Note
This method is called after
render_template_fields(), so any templated attribute (for example adirectoryderived from a Jinja expression) is already resolved when the key is constructed. Two sibling triggers that render to the same path will correctly share their poll.
- Abstractmethod:
- Async:
Open the shared underlying stream and yield raw events.
Called once per shared-stream group in the triggerer.
kwargsis taken from one trigger in the group; implementations should rely only on fields whose values participate inshared_stream_key(), because other fields may differ between siblings in the group.Implementations are expected to run for the lifetime of the group — the triggerer drives the iterator from a single task and cancels it when the last subscriber leaves. Returning without raising (e.g. because the upstream resource closed) is treated as an error and propagated to every subscriber, so the contract is “yield forever, or raise”. If an upstream EOF is a meaningful end-of-life condition, raise an exception that conveys it.
Declared as a classmethod (not staticmethod) so subclasses can compose via
super().open_shared_stream(kwargs)and reachclsfor class-scoped state or diagnostics.Required only when
shared_stream_key()returns non-None.
- Async:
Transform the shared raw event stream into this trigger’s events.
The triggerer calls this method (instead of
run()) when this trigger participates in a shared-stream group. Iterateshared_streamto receive raw events from the shared poll, andyieldaTriggerEventfor each one that should fire this trigger.Required only when
shared_stream_key()returns non-None.
- class airflow.triggers.base.TriggerEvent(payload, **kwargs)[source]
Bases:
pydantic.BaseModelSomething that a trigger can fire when its conditions are met.
Events must have a uniquely identifying value that would be the same wherever the trigger is run; this is to ensure that if the same trigger is being run in two locations (for HA reasons) that we can deduplicate its events.
- payload: Any = None[source]
The payload for the event to send back to the task.
Must be natively JSON-serializable, or registered with the airflow serialization code.
- __repr__()[source]
- class airflow.triggers.base.TaskSuccessEvent(*, xcoms=None, **kwargs)[source]
Bases:
BaseTaskEndEventYield this event in order to end the task successfully.
- task_instance_state: airflow.utils.state.TaskInstanceState[source]
- class airflow.triggers.base.TaskFailedEvent(*, xcoms=None, **kwargs)[source]
Bases:
BaseTaskEndEventYield this event in order to end the task with failure.
- task_instance_state: airflow.utils.state.TaskInstanceState[source]
- class airflow.triggers.base.TaskSkippedEvent(*, xcoms=None, **kwargs)[source]
Bases:
BaseTaskEndEventYield this event in order to end the task with status ‘skipped’.
- task_instance_state: airflow.utils.state.TaskInstanceState[source]
- airflow.triggers.base.trigger_event_discriminator(v)[source]
- airflow.triggers.base.DiscrimatedTriggerEvent[source]