Google Cloud Platform Data Studio (Looker) Connection¶
Communication between Airflow and Data Studio (Looker) is done via Looker API. To facilitate the API communication, Data Studio operators use Looker SDK as an API client. Before calling API, Looker SDK needs to authenticate itself using your Data Studio API credentials.
Obtain your Data Studio API credentials using instructions in the Looker API authentication documentation.
Obtain your Data Studio API path and port as described in the Looker API documentation.
Setup a Data Studio connection in Airflow.
The HTTP connection type provides connection to Data Studio API.
The LookerHook uses this connection to run
API requests on a Data Studio instance issued by LookerStartPdtBuildOperator and LookerCheckPdtBuildSensor.
Configuring the Connection¶
- Host (required)
Base URL for Data Studio API. Do not include /api/* in the URL.
- Login (required)
Data Studio API client id.
- Password (required)
Data Studio API client secret.
- Port (optional)
Port for Data Studio API. If hosted on GCP, don’t specify the port leaving just the host.
- Extra (optional)
Specify the extra parameters (as json dictionary) that can be used in Data Studio connection. The following parameters are supported:
verify_ssl- Set to false only if testing locally against self-signed certs. Defaults to true if not specified.timeout- Timeout in seconds for HTTP requests. Defaults to 2 minutes (120) seconds if not specified.
Example “extras” field:
{ "verify_ssl": true, "timeout": 120, }
Connection URI¶
A URL configuration example of a Data Studio connection:
AIRFLOW_CONN_YOUR_CONN_ID='http://YourClientID:YourClientSecret@https%3A%2F%2Fyour.looker.com:19999?verify_ssl=true&timeout=120'