DdlOperator¶
The DdlOperator is an Airflow operator designed to execute Data Definition Language (DDL) statements on Teradata databases. It provides a robust way to create, alter, or drop database objects as part of your data pipelines.
Note
The DdlOperator requires the Teradata Parallel Transporter (TPT) package from Teradata Tools and Utilities (TTU)
to be installed on the machine where the tbuild command will run (either local or remote).
Ensure that the tbuild executable is available in the system’s PATH.
Refer to the official Teradata documentation for installation, configuration, and security best practices.
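As a quick, optional sanity check for local execution (a minimal sketch, not part of the operator itself), you can verify that tbuild resolves on the PATH before triggering the DAG:

import shutil

# Fails fast if the TPT tbuild binary is not on PATH on this machine.
if shutil.which("tbuild") is None:
    raise RuntimeError("tbuild not found on PATH; install Teradata Tools and Utilities (TTU)")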
Key Features:
Executes DDL SQL statements (CREATE, ALTER, DROP, etc.)
Works with single statements or batches of multiple DDL operations
Integrates with Airflow’s connection management for secure database access
Provides comprehensive logging of execution results
Supports both local and remote execution via SSH
When you need to manage database schema changes, create temporary tables, or clean up data structures as part of your workflow, the DdlOperator offers a streamlined approach that integrates seamlessly with your Airflow DAGs.
Prerequisite¶
Make sure your Teradata Airflow connection is defined with the required fields:
host
login
password
You can define a remote host with a separate SSH connection using the ssh_conn_id.
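For example, the Teradata connection can be supplied as a URI-style environment variable (a minimal sketch; the host and credentials are placeholders):

import os

# Airflow resolves AIRFLOW_CONN_<CONN_ID> environment variables as connection URIs.
os.environ["AIRFLOW_CONN_TERADATA_DEFAULT"] = "teradata://my_user:my_password@my-teradata-host"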
Ensure that the Teradata Parallel Transporter (TPT) package is installed on the machine where the DdlOperator will execute commands. This can be:
The local machine where Airflow runs the task, for local execution.
A remote host accessed via SSH, for remote execution.
If executing remotely, ensure that an SSH server (e.g., sshd) is running and accessible on the remote machine, and that the tbuild executable is available in the system’s PATH.
Note
For improved security, it is highly recommended to use private key-based SSH authentication (SSH key pairs) instead of username/password for the SSH connection.
This avoids password exposure, enables seamless automated execution, and enhances security.
See the Airflow SSH Connection documentation for details on configuring SSH keys: https://airflow.apache.org/docs/apache-airflow/stable/howto/connection/ssh.html
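For example (a sketch with a hypothetical user, host, and key path), a key-based SSH connection can also be defined as an environment variable, passing the private key location through the key_file extra:

import os
from urllib.parse import quote

# The key path is percent-encoded so it survives as a URI query parameter.
key_file = quote("/home/airflow/.ssh/id_rsa", safe="")
os.environ["AIRFLOW_CONN_SSH_DEFAULT"] = f"ssh://airflow_user@remote-host?key_file={key_file}"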
To execute DDL operations in a Teradata database, use the DdlOperator.
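A minimal invocation, using only parameters documented on this page (the table name is a placeholder), might look like this:

basic_ddl = DdlOperator(
    task_id="basic_ddl",
    ddl=["CREATE TABLE my_table (id INTEGER);"],  # my_table is a placeholder
    teradata_conn_id="teradata_default",
    # ssh_conn_id="ssh_default",  # uncomment to run tbuild on a remote host over SSH
)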
Handling Escape Sequences for Embedded Quotes¶
When working with DDL statements that contain embedded quotes, it’s important to understand how escape sequences are handled differently between the DAG definition and the SQL execution:
In DAG Definition (Python):
- Use backslash escape sequences: \" for double quotes, \' for single quotes
- Python string literals require backslash escaping
In SQL Execution (Teradata):
- SQL standard requires doubling quotes when enclosed within the same quote type
- Single quotes in single-quoted strings: 'Don''t'
- Double quotes in double-quoted identifiers: "My""Table"
Example:
# In your DAG - use Python escape sequences
ddl_with_quotes = DdlOperator(
    task_id="create_table_with_quotes",
    ddl=[
        "CREATE TABLE test_table (col1 VARCHAR(50) DEFAULT '\"quoted_value\"')",
        "INSERT INTO test_table VALUES ('It''s a test')",  # Note the doubled single quotes
    ],
    teradata_conn_id="teradata_default",
)
Key Points:
- When defining DDL statements in Python strings, use standard Python escape sequences
- The operator automatically handles the conversion for TPT script generation
- For SQL string literals containing quotes, follow SQL standards (double the quotes)
- Test your DDL statements carefully when they contain complex quoting
Key Operation Examples with DdlOperator¶
Dropping tables in Teradata¶
You can use the DdlOperator to drop tables in Teradata. The following example drops multiple tables, including the auxiliary tables (_UV, _ET, _WT, _Log) left behind by TPT jobs. The error_list parameter names Teradata error codes to tolerate, so dropping a table that does not exist will not fail the task:
# Drop tables if they exist
drop_table = DdlOperator(
    task_id="drop_table",
    ddl=[
        "DROP TABLE {{ params.SOURCE_TABLE }};",
        "DROP TABLE {{ params.SOURCE_TABLE }}_UV;",
        "DROP TABLE {{ params.SOURCE_TABLE }}_ET;",
        "DROP TABLE {{ params.SOURCE_TABLE }}_WT;",
        "DROP TABLE {{ params.SOURCE_TABLE }}_Log;",
        "DROP TABLE {{ params.TARGET_TABLE }};",
        "DROP TABLE {{ params.TARGET_TABLE }}_UV;",
        "DROP TABLE {{ params.TARGET_TABLE }}_ET;",
        "DROP TABLE {{ params.TARGET_TABLE }}_WT;",
        "DROP TABLE {{ params.TARGET_TABLE }}_Log;",
    ],
    error_list=[3706, 3803, 3807],
)
Creating tables in Teradata¶
You can use the DdlOperator to create tables in Teradata. The following example demonstrates how to create multiple tables:
create_source_table = DdlOperator(
    task_id="create_source_table",
    ddl=[
        "CREATE TABLE {{ params.SOURCE_TABLE }} ( \
        first_name VARCHAR(100), \
        last_name VARCHAR(100), \
        employee_id VARCHAR(10), \
        department VARCHAR(50) \
        );"
    ],
)
create_target_table = DdlOperator(
    task_id="create_target_table",
    ddl=[
        "CREATE TABLE {{ params.TARGET_TABLE }} ( \
        first_name VARCHAR(100), \
        last_name VARCHAR(100), \
        employee_id VARCHAR(10), \
        department VARCHAR(50) \
        );"
    ],
)
Creating an index on a Teradata table¶
You can use the DdlOperator to create an index on a Teradata table. The following example demonstrates how to create an index:
create_index_on_source = DdlOperator(
    task_id="create_index_on_source",
    ddl=["CREATE INDEX idx_employee_id (employee_id) ON {{ params.SOURCE_TABLE }};"],
)
Renaming a table in Teradata¶
You can use the DdlOperator to rename a table in Teradata. The following example renames a table and then drops the renamed table to clean up:
rename_target_table = DdlOperator(
    task_id="rename_target_table",
    ddl=[
        "RENAME TABLE {{ params.TARGET_TABLE }} TO {{ params.TARGET_TABLE }}_renamed;",
        "DROP TABLE {{ params.TARGET_TABLE }}_renamed;",
    ],
)
Dropping an index in Teradata¶
You can use the DdlOperator to drop an index in Teradata. The following example demonstrates how to drop an index:
drop_index_on_source = DdlOperator(
    task_id="drop_index_on_source",
    ddl=["DROP INDEX idx_employee_id ON {{ params.SOURCE_TABLE }};"],
    error_list=[3706, 3803, 3807],
)
Altering a table in Teradata¶
You can use the DdlOperator to alter a table in Teradata. The following example demonstrates how to add a column:
alter_source_table = DdlOperator(
    task_id="alter_source_table",
    ddl=["ALTER TABLE {{ params.SOURCE_TABLE }} ADD hire_date DATE;"],
)
The complete Teradata Operator DAG¶
When we put everything together, our DAG should look like this:
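For completeness, the example assumes module-level imports along these lines (the DdlOperator import path is an assumption; verify it against your installed Teradata provider version):

import datetime
import os

from airflow import DAG

# Assumed import path for DdlOperator; check your provider version.
from airflow.providers.teradata.operators.tpt import DdlOperator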
ENV_ID = os.environ.get("SYSTEM_TESTS_ENV_ID")
DAG_ID = "example_tpt"
CONN_ID = "teradata_default"
SSH_CONN_ID = "ssh_default"

# Define file paths and table names for the test
SYSTEM_TESTS_DIR = os.path.abspath(os.path.dirname(__file__))
params = {
    "SOURCE_TABLE": "source_table",
    "TARGET_TABLE": "target_table",
}

with DAG(
    dag_id=DAG_ID,
    start_date=datetime.datetime(2020, 2, 2),
    schedule="@once",
    catchup=False,
    default_args={"teradata_conn_id": CONN_ID, "params": params},
) as dag:
    # Drop tables if they exist
    drop_table = DdlOperator(
        task_id="drop_table",
        ddl=[
            "DROP TABLE {{ params.SOURCE_TABLE }};",
            "DROP TABLE {{ params.SOURCE_TABLE }}_UV;",
            "DROP TABLE {{ params.SOURCE_TABLE }}_ET;",
            "DROP TABLE {{ params.SOURCE_TABLE }}_WT;",
            "DROP TABLE {{ params.SOURCE_TABLE }}_Log;",
            "DROP TABLE {{ params.TARGET_TABLE }};",
            "DROP TABLE {{ params.TARGET_TABLE }}_UV;",
            "DROP TABLE {{ params.TARGET_TABLE }}_ET;",
            "DROP TABLE {{ params.TARGET_TABLE }}_WT;",
            "DROP TABLE {{ params.TARGET_TABLE }}_Log;",
        ],
        error_list=[3706, 3803, 3807],
    )
    create_source_table = DdlOperator(
        task_id="create_source_table",
        ddl=[
            "CREATE TABLE {{ params.SOURCE_TABLE }} ( \
            first_name VARCHAR(100), \
            last_name VARCHAR(100), \
            employee_id VARCHAR(10), \
            department VARCHAR(50) \
            );"
        ],
    )
    create_target_table = DdlOperator(
        task_id="create_target_table",
        ddl=[
            "CREATE TABLE {{ params.TARGET_TABLE }} ( \
            first_name VARCHAR(100), \
            last_name VARCHAR(100), \
            employee_id VARCHAR(10), \
            department VARCHAR(50) \
            );"
        ],
    )
    create_index_on_source = DdlOperator(
        task_id="create_index_on_source",
        ddl=["CREATE INDEX idx_employee_id (employee_id) ON {{ params.SOURCE_TABLE }};"],
    )
    rename_target_table = DdlOperator(
        task_id="rename_target_table",
        ddl=[
            "RENAME TABLE {{ params.TARGET_TABLE }} TO {{ params.TARGET_TABLE }}_renamed;",
            "DROP TABLE {{ params.TARGET_TABLE }}_renamed;",
        ],
    )
    drop_index_on_source = DdlOperator(
        task_id="drop_index_on_source",
        ddl=["DROP INDEX idx_employee_id ON {{ params.SOURCE_TABLE }};"],
        error_list=[3706, 3803, 3807],
    )
    alter_source_table = DdlOperator(
        task_id="alter_source_table",
        ddl=["ALTER TABLE {{ params.SOURCE_TABLE }} ADD hire_date DATE;"],
    )

    # Define the task dependencies
    (
        drop_table
        >> create_source_table
        >> create_target_table
        >> create_index_on_source
        >> rename_target_table
        >> drop_index_on_source
        >> alter_source_table
    )

    from tests_common.test_utils.watcher import watcher

    # This test needs watcher in order to properly mark success/failure
    # when "tearDown" task with trigger rule is part of the DAG
    list(dag.tasks) >> watcher()

from tests_common.test_utils.system_tests import get_test_run  # noqa: E402

# Needed to run the example DAG with pytest (see: tests/system/README.md#run_via_pytest)
test_run = get_test_run(dag)