Using the OpenMetadata Azure SQL connector requires supporting services and software. Please ensure your host system meets the requirements listed below, then follow the procedure to install and configure this connector.
If you have not already deployed OpenMetadata, please follow the instructions to [Run OpenMetadata](../../../try-openmetadata/run-openmetadata.md) to get up and running.
In this step, we'll create a Python virtual environment. Using a virtual environment enables us to avoid conflicts with other Python installations and packages on your host system.
Throughout the docs, we use a consistent directory structure for OpenMetadata services and connector installation. If you have not already done so by following another guide, please create an `openmetadata` directory now and change into that directory in your command line environment.
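A minimal sketch of these setup steps, assuming a Unix-like shell with Python 3 available as `python3` (the virtual environment name `env` is arbitrary):

```bash
# Create the working directory used throughout the docs and change into it
mkdir openmetadata
cd openmetadata

# Create and activate a Python virtual environment for the connector
python3 -m venv env
source env/bin/activate
```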
Ensure that you have the latest version of pip by running the following command. If you have followed the steps above, this will upgrade pip in your virtual environment.
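For example, with the virtual environment from Step 1 active:

```bash
# Upgrade pip inside the active virtual environment
python3 -m pip install --upgrade pip
```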
Once the virtual environment is set up and activated as described in Step 1, run the following command to install the Python module for the Azure SQL connector.
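Using the package name referenced in the Troubleshooting section below, the install command looks like this:

```bash
# Install the OpenMetadata ingestion module with the Azure SQL extra
pip install 'openmetadata-ingestion[azuresql]'
```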
Create a new file called `azuresql.json` in the current directory. Note that the current directory should be the `openmetadata` directory you created in Step 1.
Note: The `source.config` field in the configuration JSON will include the majority of the settings for your connector. In the steps below we describe how to customize the key-value pairs in the `source.config` field to meet your needs.
In this step we will configure the Azure SQL service settings required for this connector. Please follow the instructions below to ensure that you've configured the connector to read from your Azure SQL service as desired.
Edit the value for `source.config.host_port` in `azuresql.json` for your Azure SQL deployment. Use the `host:port` format illustrated in the example below.
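The hostname and port below are placeholders; substitute the values for your own Azure SQL deployment.

```json
"host_port": "mydbserver.database.windows.net:1433"
```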
OpenMetadata uniquely identifies services by their `service_name`. Edit the value for `source.config.service_name` with a name that distinguishes this deployment from other services, including other Azure SQL services that you might be ingesting metadata from.
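For example (the name itself is a placeholder; choose one that is meaningful for your deployment):

```json
"service_name": "azuresql_prod"
```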
If you want to limit metadata ingestion to a single database, include the `source.config.database` field in your configuration file. If this field is not included, the connector will ingest metadata from all databases that the specified user is authorized to read.
To specify a single database to ingest metadata from, provide the name of the database as the value for the `source.config.database` key as illustrated in the example below.
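For example, assuming a database named `warehouse` (a placeholder; substitute your own database name):

```json
"database": "warehouse"
```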
When enabled, the data profiler will run as part of metadata ingestion. Running the data profiler increases the amount of time it takes for metadata ingestion, but provides the benefits mentioned above.
You may disable the data profiler by setting the value for the key `source.config.data_profiler_enabled` to `"false"` as follows. We've done this in the configuration template provided.
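```json
"data_profiler_enabled": "false"
```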
If you've enabled the data profiler in Step 5, run the following command to install the Python module for the data profiler. You'll need this to run the ingestion workflow.
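Assuming the profiler extra follows the same naming convention as the connector package (confirm against the release you are installing), the command is along these lines:

```bash
# Install the optional data profiler module
pip install 'openmetadata-ingestion[data-profiler]'
```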
Use `source.config.table_filter_pattern.excludes` to exclude all tables with names matching one or more of the supplied regular expressions. All other tables will be included. See below for an example. This example is also included in the configuration template provided.
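A sketch of an exclusion pattern; the regular expressions shown here are illustrative placeholders, so check your copied template for the exact example it ships with.

```json
"table_filter_pattern": {
  "excludes": ["information_schema.*", "[\\w]*event_vw.*"]
}
```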
Use `source.config.table_filter_pattern.includes` to include all tables with names matching one or more of the supplied regular expressions. All other tables will be excluded. See below for an example.
```json
"table_filter_pattern": {
"includes": ["corp.*", "dept.*"]
}
```
See the documentation for the [Python re module](https://docs.python.org/3/library/re.html) for information on how to construct regular expressions.
{% hint style="info" %}
You may use either `excludes` or `includes` but not both in `table_filter_pattern`.
{% endhint %}
Use the `source.config.schema_filter_pattern.excludes` and `source.config.schema_filter_pattern.includes` fields to select the schemas for metadata ingestion by name. The configuration template provides an example.
The syntax and semantics for `schema_filter_pattern` are the same as for [`table_filter_pattern`](azure-sql.md#table\_filter\_pattern-optional). Please check that section for details.
Use the `source.config.generate_sample_data` field to control whether or not to generate sample data to include in table views in the OpenMetadata user interface. The image below provides an example.
If set to true, the connector will collect the first 50 rows of data from each table included in ingestion, and catalog that data as sample data, which users can refer to in the OpenMetadata user interface.
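In the configuration file this is a simple flag; assuming the same string-valued style as `data_profiler_enabled`, it looks like this:

```json
"generate_sample_data": "true"
```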
DBT provides transformation logic that creates tables and views from raw data. OpenMetadata includes an integration for DBT that enables you to see the models used to generate a table from that table's details page in the OpenMetadata user interface. The image below provides an example.
To include DBT models and metadata in your ingestion workflows, specify the location of the DBT manifest and catalog files as fields in your configuration file.
#### dbt\_manifest\_file (optional)
Use the field `source.config.dbt_manifest_file` to specify the location of your DBT manifest file. See below for an example.
```json
"dbt_manifest_file": "./dbt/manifest.json"
```
#### dbt\_catalog\_file (optional)
Use the field `source.config.dbt_catalog_file` to specify the location of your DBT catalog file. See below for an example.
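Mirroring the manifest example above (the path is a placeholder for wherever your DBT catalog file lives):

```json
"dbt_catalog_file": "./dbt/catalog.json"
```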
You need not make any changes to the fields defined for `sink` in the template code you copied into `azuresql.json` in Step 4. This part of your configuration file should be as follows.
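A sketch of the standard REST sink used by the templates; confirm against the template you copied.

```json
"sink": {
  "type": "metadata-rest",
  "config": {}
}
```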
You need not make any changes to the fields defined for `metadata_server` in the template code you copied into `azuresql.json` in Step 4. This part of your configuration file should be as follows.
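A sketch of the typical `metadata_server` block, assuming a local OpenMetadata server at `http://localhost:8585` and the default no-auth provider; confirm against the template you copied.

```json
"metadata_server": {
  "type": "metadata-server",
  "config": {
    "api_endpoint": "http://localhost:8585/api",
    "auth_provider_type": "no-auth"
  }
}
```

With the configuration complete, the ingestion workflow is run with the `metadata` CLI installed by the connector package. Assuming the file name used above:

```bash
# Run the ingestion workflow defined in azuresql.json
metadata ingest -c ./azuresql.json
```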
As the ingestion workflow runs, you may observe progress both from the command line and from the OpenMetadata user interface. To view the metadata ingested from Azure SQL, visit [http://localhost:8585/explore/tables](http://localhost:8585/explore/tables). Select the Azure SQL service to filter for the data you've ingested using the workflow you configured and ran following this guide. The image below provides an example.
When attempting to install the `openmetadata-ingestion[azuresql]` Python package in Step 2, you might encounter the following error. The error might include a mention of a Rust compiler.
If you encounter the following error when attempting to run the ingestion workflow in Step 12, this is probably because there is no OpenMetadata server running at http://localhost:8585.
To correct this problem, please follow the steps in the [Run OpenMetadata](../../../try-openmetadata/run-openmetadata.md) guide to deploy OpenMetadata in Docker on your local machine.