mirror of
https://github.com/datahub-project/datahub.git
synced 2025-10-24 07:24:58 +00:00
73 lines
3.7 KiB
Markdown
73 lines
3.7 KiB
Markdown
![]() |
---
|
||
|
title: Setup
|
||
|
---
|
||
|
# Snowflake Ingestion Guide: Setup & Prerequisites
|
||
|
|
||
|
In order to configure ingestion from Snowflake, you'll first have to ensure you have a Snowflake user with the `ACCOUNTADMIN` role or `MANAGE GRANTS` privilege.
|
||
|
|
||
|
## Snowflake Prerequisites
|
||
|
|
||
|
1. Create a DataHub-specific role by executing the following queries in Snowflake. Replace `<your-warehouse>` with an existing warehouse that you wish to use for DataHub ingestion.
|
||
|
|
||
|
```sql
|
||
|
create or replace role datahub_role;
|
||
|
-- Grant access to a warehouse to run queries to view metadata
|
||
|
grant operate, usage on warehouse "<your-warehouse>" to role datahub_role;
|
||
|
```
|
||
|
|
||
|
Make note of this role and warehouse. You'll need this in the next step.
|
||
|
|
||
|
2. Create a DataHub-specific user by executing the following queries. Replace `<your-password>` with a strong password. Replace `<your-warehouse>` with the same warehouse used above.
|
||
|
|
||
|
```sql
|
||
|
create user datahub_user display_name = 'DataHub' password='<your-password>' default_role = datahub_role default_warehouse = '<your-warehouse>';
|
||
|
-- Grant access to the DataHub role created above
|
||
|
grant role datahub_role to user datahub_user;
|
||
|
```
|
||
|
|
||
|
Make note of the user and its password. You'll need this in the next step.
|
||
|
|
||
|
3. Assign privileges to read metadata about your assets by executing the following queries. Replace `<your-database>` with an existing database. Repeat for all databases from your Snowflake instance that you wish to integrate with DataHub.
|
||
|
|
||
|
```sql
|
||
|
set db_var = '"<your-database>"';
|
||
|
-- Grant access to view database and schema in which your tables/views exist
|
||
|
grant usage on DATABASE identifier($db_var) to role datahub_role;
|
||
|
grant usage on all schemas in database identifier($db_var) to role datahub_role;
|
||
|
grant usage on future schemas in database identifier($db_var) to role datahub_role;
|
||
|
|
||
|
-- Grant Select acccess enable Data Profiling
|
||
|
grant select on all tables in database identifier($db_var) to role datahub_role;
|
||
|
grant select on future tables in database identifier($db_var) to role datahub_role;
|
||
|
grant select on all external tables in database identifier($db_var) to role datahub_role;
|
||
|
grant select on future external tables in database identifier($db_var) to role datahub_role;
|
||
|
grant select on all views in database identifier($db_var) to role datahub_role;
|
||
|
grant select on future views in database identifier($db_var) to role datahub_role;
|
||
|
|
||
|
-- Grant access to view tables and views
|
||
|
grant references on all tables in database identifier($db_var) to role datahub_role;
|
||
|
grant references on future tables in database identifier($db_var) to role datahub_role;
|
||
|
grant references on all external tables in database identifier($db_var) to role datahub_role;
|
||
|
grant references on future external tables in database identifier($db_var) to role datahub_role;
|
||
|
grant references on all views in database identifier($db_var) to role datahub_role;
|
||
|
grant references on future views in database identifier($db_var) to role datahub_role;
|
||
|
|
||
|
```
|
||
|
|
||
|
If you have imported databases in your Snowflake instance that you wish to integrate with DataHub, you'll need to use the below query for them.
|
||
|
|
||
|
```sql
|
||
|
grant IMPORTED PRIVILEGES on database "<your-database>" to role datahub_role;
|
||
|
```
|
||
|
|
||
|
4. Assign privileges to extract lineage and usage statistics from Snowflake by executing the below query.
|
||
|
|
||
|
```sql
|
||
|
grant imported privileges on database snowflake to role datahub_role;
|
||
|
```
|
||
|
|
||
|
## Next Steps
|
||
|
|
||
|
Once you've done all of the above in Snowflake, it's time to [move on](configuration.md) to configuring the actual ingestion source within DataHub.
|
||
|
|
||
|
*Need more help? Join the conversation in [Slack](http://slack.datahubproject.io)!*
|