SQL Dialects¶
The Dialect
offers an abstraction layer between the
DbApiHook
implementation and the database. For some database multiple
connection types are available, like native, ODBC and or JDBC. As the OdbcHook
and the JdbcHook
are generic hooks which allows you to interact with any
database that has a driver for it, it needed an abstraction layer which allows us to run specialized queries
depending of the database to which we connect and that’s why dialects where introduced.
The default Dialect
class has following operations
available which underneath use SQLAlchemy to execute, but can be overloaded with specialized implementations
per database:
placeholder
specifies the database specific placeholder used in prepared statements (default:%s
);inspector
returns the SQLAlchemy inspector which allows us to retrieve database metadata;extract_schema_from_table
allows us to extract the schema name from a string.get_column_names
returns the column names for the given table and schema (optional) using the SQLAlchemy inspector.get_primary_keys
returns the primary keys for the given table and schema (optional) using the SQLAlchemy inspector.get_target_fields
returns the columns names that aren’t identity or auto incremented columns, this will be used by the insert_rows method of theDbApiHook
if the target_fields parameter wasn’t specified and the Airflow propertycore.dbapihook_resolve_target_fields
is set to True (default: False).reserved_words
returns the reserved words in SQL for the target database using the SQLAlchemy inspector.generate_insert_sql
generates the insert SQL statement for the target database.generate_replace_sql
generates the upsert SQL statement for the target database.
At the moment there are only 3 dialects available:
default
Dialect
reuses the generic functionality that was already available in theDbApiHook
;mssql
MsSqlDialect
specialized for Microsoft SQL Server;postgresql
PostgresDialect
specialized for PostgreSQL;
The dialect to be used will be derived from the connection string, which sometimes won’t be possible. There is always the possibility to specify the dialect name through the extra options of the connection:
dialect_name: 'mssql'
If a specific dialect isn’t available for a database, the default one will be used, same when a non-existing dialect name is specified.