apache-airflow-providers-apache-hdfs
Changelog¶
4.10.3¶
Release Date: 2025-09-22
Bug Fixes¶
[OSSTaskHandler, CloudwatchTaskHandler, S3TaskHandler, HdfsTaskHandler, ElasticsearchTaskHandler, GCSTaskHandler, OpensearchTaskHandler, RedisTaskHandler, WasbTaskHandler] supports log file size handling (#55455)
4.10.2¶
Release Date: 2025-08-02
Misc¶
Add Python 3.13 support for Airflow. (#46891)
Remove type ignore across codebase after mypy upgrade (#53243)
Remove upper-binding for "python-requires" (#52980)
Temporarily switch to use >=,< pattern instead of '~=' (#52967)
Replace 'BaseHook' to Task SDK for 'apache/hdfs' (#52684)
4.10.1¶
Release Date: 2025-07-06
Bug Fixes¶
Fix HDFS remote_base to use path for compatibility with hdfs client (#50804)
Misc¶
Move 'BaseHook' implementation to task SDK (#51873)
Replace models.BaseOperator to Task SDK for apache/hdfs (#52505)
Drop support for Python 3.9 (#52072)
Use BaseSensorOperator from task sdk in providers (#52296)
Remove HDFSHook, HdfsRegexSensor, HdfsSensor, HdfsFolderSensor (#52217)
Bump upper binding on pandas in all providers (#52060)
4.10.0¶
Release Date: 2025-06-18
Features¶
Add custom headers and cookies support in WebHDFSHook (#50955)
Bug Fixes¶
Use conn.password instead of conn.get_password() (#50759)
4.9.0¶
Release Date: 2025-05-18
Note
This release of provider is only available for Airflow 2.10+ as explained in the Apache Airflow providers support policy <https://github.com/apache/airflow/blob/main/PROVIDERS.rst#minimum-supported-version-of-airflow-for-community-managed-providers>_.
Misc¶
Bump min Airflow version in providers to 2.10 (#49843)
4.8.1¶
Release Date: 2025-04-19
Misc¶
remove superfluous else block (#49199)
4.8.0¶
Release Date: 2025-04-14
Features¶
Add support for reading/writing task logs to HDFS in Airflow 3 (#48788)
4.7.1¶
Release Date: 2025-03-13
Misc¶
Upgrade flit to 3.11.0 (#46938)
4.7.0¶
Release Date: 2024-12-26
Note
This release of provider is only available for Airflow 2.9+ as explained in the Apache Airflow providers support policy.
Features¶
Add mTLS support to WebHDFSHook (#44561)
Misc¶
Bump minimum Airflow version in providers to Airflow 2.9.0 (#44956)
4.6.0¶
Release Date: 2024-10-31
Features¶
added MultipleFilesWebHdfsSensor (#43045)
Misc¶
Fixed failing static checks & provider tests (#43122)
4.5.1¶
Release Date: 2024-09-24
Misc¶
Remove deprecated log handler argument filename_template (#41552)
4.5.0¶
Release Date: 2024-08-22
Note
This release of provider is only available for Airflow 2.8+ as explained in the Apache Airflow providers support policy.
Misc¶
Bump minimum Airflow version in providers to Airflow 2.8.0 (#41396)
4.4.2¶
Release Date: 2024-06-27
Misc¶
Update pandas minimum requirement for Python 3.12 (#40272)
implement per-provider tests with lowest-direct dependency resolution (#39946)
4.4.1¶
Release Date: 2024-05-30
Misc¶
Faster 'airflow_version' imports (#39552)
Simplify 'airflow_version' imports (#39497)
4.4.0¶
Release Date: 2024-05-06
Note
This release of provider is only available for Airflow 2.7+ as explained in the Apache Airflow providers support policy.
Misc¶
Bump minimum Airflow version in providers to Airflow 2.7.0 (#39240)
4.3.3¶
Release Date: 2024-03-08
Misc¶
Turn Pydantic into an optional dependency (#37320)
4.3.2¶
Release Date: 2023-12-31
Misc¶
Remove _read method from hdfs task handler after bumping min airflow version to 2.6 (#36425)
Consolidate loading delete_local_logs conf in hdfs task handler (#36422)
4.3.1¶
Release Date: 2023-12-27
Bug Fixes¶
fix connection type webhdfs (#36145)
4.3.0¶
Release Date: 2023-12-12
Note
This release of provider is only available for Airflow 2.6+ as explained in the Apache Airflow providers support policy.
Misc¶
Bump minimum Airflow version in providers to Airflow 2.6.0 (#36017)
4.2.0¶
Release Date: 2023-10-17
Note
This release of provider is only available for Airflow 2.5+ as explained in the Apache Airflow providers support policy.
Misc¶
Bump min airflow version of providers (#34728)
Use 'airflow.exceptions.AirflowException' in providers (#34511)
4.1.1¶
Release Date: 2023-09-12
Misc¶
Fix package name in exception message for hdfs provider (#33813)
4.1.0¶
Release Date: 2023-06-23
Note
This release dropped support for Python 3.7
Features¶
Add ability to read/write task instance logs from HDFS (#31512)
4.0.0¶
Release Date: 2023-05-22
Note
This release of provider is only available for Airflow 2.4+ as explained in the Apache Airflow providers support policy.
Breaking changes¶
The original HDFS Hook and sensor has been removed. It used the old HDFS snakebite-py3 library that had no update in years and the protobuf they are using reached end of life.
The 3.* version of the provider is still available and can be used if you need to use the old hooks and sensors.
The HDFSHook
, HDFSSensor
, HdfsRegexSensor
, HdfsRegexSensor
that have been removed from
this provider and they are not available anymore. If you want to continue using them,
you can use 3.* version of the provider, but the recommendation is to switch to the new
WebHDFSHook
and WebHDFSSensor
that use the WebHDFS
API.
Remove snakebite-py3 based HDFS hooks and sensors (#31262)
Note
Protobuf 3 required by the snakebite-py3 library has ended its life in June 2023 and Airflow and it’s providers stopped supporting it. If you would like to continue using HDFS hooks and sensors based on snakebite-py3 library when you have protobuf library 4.+ you can install the 3.* version of the provider but due to Protobuf incompatibility, you need to do one of the two things:
set
PROTOCOL_BUFFERS_PYTHON_IMPLEMENTATION=python
variable in your environment.downgrade protobuf to latest 3.* version (3.20.3 at this time)
Setting PROTOCOL_BUFFERS_PYTHON_IMPLEMENTATION=python
will make many libraries using protobuf
much slower - including multiple Google client libraries and Kubernetes. Downgrading protobuf to
(already End-Of-Life) 3.* version will make some of the latest versions of the new providers
incompatible (for example google and grpc) and you will have to downgrade those providers as well.
Both should be treated as a temporary workaround only, and you should migrate to WebHDFS
as soon as possible.
Misc¶
Bump minimum Airflow version in providers (#30917)
3.2.1¶
Release Date: 2023-04-06
Bug Fixes¶
Fix HDFSHook HAClient is invalid (#30164)
3.2.0¶
Release Date: 2022-11-18
Note
This release of provider is only available for Airflow 2.3+ as explained in the Apache Airflow providers support policy.
Misc¶
Move min airflow version to 2.3.0 for all providers (#27196)
3.1.0¶
Release Date: 2022-08-14
Features¶
Adding Authentication to webhdfs sensor (#25110)
3.0.1¶
Release Date: 2022-07-16
Bug Fixes¶
'WebHDFSHook' Bugfix/optional port (#24550)
3.0.0¶
Release Date: 2022-06-13
Breaking changes¶
This release of provider is only available for Airflow 2.2+ as explained in the Apache Airflow providers support policy https://github.com/apache/airflow/blob/main/README.md#support-for-providers
Misc¶
chore: Refactoring and Cleaning Apache Providers (#24219)
2.2.3¶
Release Date: 2022-03-26
Bug Fixes¶
Fix mistakenly added install_requires for all providers (#22382)
2.2.2¶
Release Date: 2022-03-19
Misc¶
Add Trove classifiers in PyPI (Framework :: Apache Airflow :: Provider)
2.2.1¶
Release Date: 2022-03-10
Misc¶
Support for Python 3.10
Add how-to guide for WebHDFS operators (#21393)
2.2.0¶
Release Date: 2021-12-06
Features¶
hdfs provider: restore HA support for webhdfs (#19711)
2.1.1¶
Release Date: 2021-10-05
Bug Fixes¶
fix get_connections deprecation warning in webhdfs hook (#18331)
2.1.0¶
Release Date: 2021-09-03
Features¶
hdfs provider: allow SSL webhdfs connections (#17637)
Misc¶
Optimise connection importing for Airflow 2.2.0
2.0.0¶
Release Date: 2021-06-23
Breaking changes¶
Auto-apply apply_default decorator (#15667)
Warning
Due to apply_default decorator removal, this version of the provider requires Airflow 2.1.0+.
If your Airflow version is < 2.1.0, and you want to install this provider version, first upgrade
Airflow to at least version 2.1.0. Otherwise your Airflow package version will be upgraded
automatically and you will have to manually run airflow upgrade db
to complete the migration.
1.0.1¶
Release Date: 2021-02-08
Updated documentation and readme files.
1.0.0¶
Release Date: 2020-12-14
Initial version of the provider.