SQL Dialects

The Dialect offers an abstraction layer between the DbApiHook implementation and the database. For some database multiple connection types are available, like native, ODBC and or JDBC. As the OdbcHook and the JdbcHook are generic hooks which allows you to interact with any database that has a driver for it, it needed an abstraction layer which allows us to run specialized queries depending of the database to which we connect and that’s why dialects where introduced.

The default Dialect class has following operations available which underneath use SQLAlchemy to execute, but can be overloaded with specialized implementations per database:

  • placeholder specifies the database specific placeholder used in prepared statements (default: %s);

  • inspector returns the SQLAlchemy inspector which allows us to retrieve database metadata;

  • extract_schema_from_table allows us to extract the schema name from a string.

  • get_column_names returns the column names for the given table and schema (optional) using the SQLAlchemy inspector.

  • get_primary_keys returns the primary keys for the given table and schema (optional) using the SQLAlchemy inspector.

  • get_target_fields returns the columns names that aren’t identity or auto incremented columns, this will be used by the insert_rows method of the DbApiHook if the target_fields parameter wasn’t specified and the Airflow property core.dbapihook_resolve_target_fields is set to True (default: False).

  • reserved_words returns the reserved words in SQL for the target database using the SQLAlchemy inspector.

  • generate_insert_sql generates the insert SQL statement for the target database.

  • generate_replace_sql generates the upsert SQL statement for the target database.

At the moment there are only 3 dialects available:

The dialect to be used will be derived from the connection string, which sometimes won’t be possible. There is always the possibility to specify the dialect name through the extra options of the connection:

dialect_name: 'mssql'

If a specific dialect isn’t available for a database, the default one will be used, same when a non-existing dialect name is specified.

Was this entry helpful?