Pere Miquel Brull ce15728327
MINOR - Docs 1.3 BETA & Slack Alert (#14961)
2024-01-31 07:16:22 +01:00


title: Run the PowerBI Connector Externally
slug: /connectors/dashboard/powerbi/yaml

Run the PowerBI Connector Externally

Stage PROD
Dashboards {% icon iconName="check" /%}
Charts {% icon iconName="check" /%}
Owners {% icon iconName="cross" /%}
Tags {% icon iconName="cross" /%}
Datamodels {% icon iconName="check" /%}
Projects {% icon iconName="check" /%}
Lineage {% icon iconName="check" /%}

In this section, we provide guides and references to use the PowerBI connector.

Configure and schedule PowerBI metadata and profiler workflows from the OpenMetadata UI:

{% partial file="/v1.3/connectors/external-ingestion-deployment.md" /%}

Requirements

{%inlineCallout icon="description" bold="OpenMetadata 0.12 or later" href="/deployment"%} To deploy OpenMetadata, check the Deployment guides. {%/inlineCallout%}

To access the PowerBI APIs and import dashboards, charts, and datasets from PowerBI into OpenMetadata, a PowerBI Pro license is necessary.

PowerBI Account Setup

Step 1: Create an Azure AD app and configure the PowerBI Admin console

Follow the steps mentioned here to set up the Azure AD application service principal and configure the PowerBI admin settings.

Log in to Power BI as an admin and, under Tenant settings, enable the following permissions:

  • Allow service principals to use Power BI APIs
  • Allow service principals to use read-only Power BI admin APIs
  • Enhance admin APIs responses with detailed metadata

Step 2: Provide necessary API permissions to the app

Go to the Azure AD app registrations page, select your app, add the dashboard permissions for the PowerBI service, and grant admin consent for them.

The required permissions are:

  • Dashboard.Read.All

Optional Permissions: (Without granting these permissions, the dataset information cannot be retrieved and the datamodel and lineage processing will be skipped)

  • Dataset.Read.All

{% note %}

Make sure that no Tenant-related permissions are granted to the app in the API permissions section. Please refer here for a detailed explanation.

{% /note %}

Step 3: Create New PowerBI workspace

The service principal only works with new workspaces. For reference

Python Requirements

To run the PowerBI ingestion, you will need to install:

pip3 install "openmetadata-ingestion[powerbi]"
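After installing, it can be handy to confirm that the package is visible to the Python interpreter you plan to run the ingestion with. The snippet below is a minimal sketch using only the standard library; the helper name `installed_version` is hypothetical and not part of OpenMetadata.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(package: str = "openmetadata-ingestion") -> Optional[str]:
    """Return the installed version of `package`, or None if it is not installed."""
    try:
        return version(package)
    except PackageNotFoundError:
        return None


if __name__ == "__main__":
    found = installed_version()
    if found is None:
        print('openmetadata-ingestion is not installed; run the pip3 command above')
    else:
        print(f"openmetadata-ingestion {found} is installed")
```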

Metadata Ingestion

All connectors are defined as JSON Schemas. Here you can find the structure to create a connection to PowerBI.

In order to create and run a Metadata Ingestion workflow, we will follow the steps to create a YAML configuration able to connect to the source, process the Entities if needed, and reach the OpenMetadata server.

The workflow is modeled around the following JSON Schema

1. Define the YAML Config

This is a sample config for PowerBI:

{% codePreview %}

{% codeInfoContainer %}

Source Configuration - Service Connection

{% codeInfo srNumber=1 %}

clientId: PowerBI Client ID.

To get the client ID (also known as the application ID), follow these steps:

  • Log into Microsoft Azure.
  • Search for App registrations and select the App registrations link.
  • Select the Azure AD app you're using for embedding your Power BI content.
  • From the Overview section, copy the Application (client) ID.

{% /codeInfo %}

{% codeInfo srNumber=2 %}

clientSecret: PowerBI Client Secret.

To get the client secret, follow these steps:

  • Log into Microsoft Azure.
  • Search for App registrations and select the App registrations link.
  • Select the Azure AD app you're using for embedding your Power BI content.
  • Under Manage, select Certificates & secrets.
  • Under Client secrets, select New client secret.
  • In the Add a client secret pop-up window, provide a description for your application secret, select when the application secret expires, and select Add.
  • From the Client secrets section, copy the string in the Value column of the newly created application secret.

{% /codeInfo %}

{% codeInfo srNumber=3 %}

tenantId: PowerBI Tenant ID.

To get the tenant ID, follow these steps:

  • Log into Microsoft Azure.
  • Search for App registrations and select the App registrations link.
  • Select the Azure AD app you're using for Power BI.
  • From the Overview section, copy the Directory (tenant) ID.

{% /codeInfo %}

{% codeInfo srNumber=4 %}

scope: Service scope.

To let OpenMetadata use the Power BI APIs through your Azure AD app, you'll need to add the following scope:

  • https://analysis.windows.net/powerbi/api/.default

This is also the default value used when no scope is specified.

{% /codeInfo %}

{% codeInfo srNumber=5 %}

authorityURI: Authority URI for the service.

To identify a token authority, you can provide a URL that points to the authority in question.

If you don't specify a URL for the token authority, we'll use the default value of https://login.microsoftonline.com/.

{% /codeInfo %}
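To illustrate how `clientId`, `clientSecret`, `tenantId`, `scope`, and the authority URI fit together, here is a minimal sketch of an OAuth2 client-credentials token request against the Microsoft identity platform. It is an assumption about how these values are typically combined, not the connector's actual implementation (which may rely on a library such as MSAL). The function name `build_token_request` is hypothetical, and the `/oauth2/v2.0/token` path comes from the Microsoft identity platform rather than from this page.

```python
from typing import Sequence, Tuple
from urllib.parse import urlencode

# Defaults as documented in the field descriptions above.
DEFAULT_AUTHORITY = "https://login.microsoftonline.com/"
DEFAULT_SCOPE = "https://analysis.windows.net/powerbi/api/.default"


def build_token_request(
    tenant_id: str,
    client_id: str,
    client_secret: str,
    authority_uri: str = DEFAULT_AUTHORITY,
    scopes: Sequence[str] = (DEFAULT_SCOPE,),
) -> Tuple[str, str]:
    """Return (url, form_body) for a client-credentials token request.

    Nothing is sent over the network; this only assembles the request.
    """
    # Make sure the authority ends with a slash before appending the tenant.
    base = authority_uri if authority_uri.endswith("/") else authority_uri + "/"
    url = f"{base}{tenant_id}/oauth2/v2.0/token"
    body = urlencode(
        {
            "grant_type": "client_credentials",
            "client_id": client_id,
            "client_secret": client_secret,
            "scope": " ".join(scopes),
        }
    )
    return url, body
```

POSTing `form_body` to `url` with the `application/x-www-form-urlencoded` content type would return an access token if the app registration and secret are valid.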

{% codeInfo srNumber=6 %}

hostPort: URL to the PowerBI instance.

To connect with your Power BI instance, you'll need to provide the host URL. If you're using an on-premise installation of Power BI, this will be the domain name associated with your instance.

If you don't specify a host URL, we'll use the default value of https://app.powerbi.com to connect with your Power BI instance.

{% /codeInfo %}

{% codeInfo srNumber=7 %}

Pagination Entity Per Page:

The pagination limit for Power BI APIs can be set using this parameter. The limit determines the number of records to be displayed per page.

By default, the pagination limit is set to 100 records, which is also the maximum value allowed.

{% /codeInfo %}
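The effect of the pagination limit can be sketched as a simple page-by-page loop. This is an illustration only: `paginate` and `fetch_page` are hypothetical names, the `top`/`skip` arguments mirror the `$top` and `$skip` query parameters used by the Power BI REST APIs, and the connector's own paging logic may differ.

```python
from typing import Callable, Dict, Iterator, List

MAX_PAGE_SIZE = 100  # maximum (and default) value described above


def paginate(
    fetch_page: Callable[[int, int], List[Dict]],
    page_size: int = MAX_PAGE_SIZE,
) -> Iterator[Dict]:
    """Yield every record by calling fetch_page(top, skip) until a short page is returned."""
    if not 1 <= page_size <= MAX_PAGE_SIZE:
        raise ValueError(f"page_size must be between 1 and {MAX_PAGE_SIZE}")
    skip = 0
    while True:
        page = fetch_page(page_size, skip)
        yield from page
        # A page shorter than page_size means there is nothing left to fetch.
        if len(page) < page_size:
            break
        skip += page_size
```

A smaller `page_size` means more requests with smaller responses; the total set of records returned is the same.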

{% codeInfo srNumber=8 %}

Use Admin APIs:

Option for using the PowerBI admin APIs:

  • Enabled (Use PowerBI Admin APIs): Using the admin APIs will fetch the dashboard and chart metadata from all the workspaces available in the PowerBI instance.

{% note %}

When using the PowerBI Admin APIs there are no limitations on the Datasets that are retrieved for creating lineage information.

{% /note %}

  • Disabled (Use Non-Admin PowerBI APIs): Using the non-admin APIs will only fetch the dashboard and chart metadata from the workspaces that have the security group of the service principal assigned to them.

{% note %}

When using the PowerBI Non-Admin APIs, the lineage information can only be generated if the dataset is a Push Dataset. For more information please visit the PowerBI official documentation here.

{% /note %}

{% /codeInfo %}
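The difference between the two modes can be pictured as a choice between two workspace-listing endpoints of the Power BI REST API. The sketch below is an assumption for illustration: `workspaces_url` is a hypothetical helper, `API_BASE` is the public Power BI REST base URL (not a value taken from this page), and the connector may build its requests differently.

```python
# Public Power BI REST API base URL (assumption: not configured on this page).
API_BASE = "https://api.powerbi.com/v1.0/myorg"


def workspaces_url(use_admin_apis: bool = True, top: int = 100, skip: int = 0) -> str:
    """Return the URL used to list workspaces (groups) for the chosen mode."""
    if use_admin_apis:
        # Admin endpoint: lists every workspace in the tenant.
        prefix = f"{API_BASE}/admin/groups"
    else:
        # Non-admin endpoint: lists only workspaces the service principal can access.
        prefix = f"{API_BASE}/groups"
    return f"{prefix}?$top={top}&$skip={skip}"
```

With `use_admin_apis=False`, only workspaces where the service principal (or its security group) has been added will appear in the response.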

{% partial file="/v1.3/connectors/yaml/dashboard/source-config-def.md" /%}

{% partial file="/v1.3/connectors/yaml/ingestion-sink-def.md" /%}

{% partial file="/v1.3/connectors/yaml/workflow-config-def.md" /%}

{% /codeInfoContainer %}

{% codeBlock fileName="filename.yaml" %}

source:
  type: powerbi
  serviceName: local_powerbi
  serviceConnection:
    config:
      type: PowerBI
      clientId: clientId
      clientSecret: secret
      tenantId: tenant
      # scope:
      #    - https://analysis.windows.net/powerbi/api/.default (default)
      # authorityURI: https://login.microsoftonline.com/ (default)
      # hostPort: https://app.powerbi.com (default)
      # pagination_entity_per_page: 100 (default)
      # useAdminApis: true (default)

{% partial file="/v1.3/connectors/yaml/dashboard/source-config.md" /%}

{% partial file="/v1.3/connectors/yaml/ingestion-sink.md" /%}

{% partial file="/v1.3/connectors/yaml/workflow-config.md" /%}

{% /codeBlock %}

{% /codePreview %}
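The sample config leaves the optional fields commented out and relies on their defaults. The following sketch shows one way to picture that behavior: required values are checked and documented defaults are filled in for anything omitted. It is not how OpenMetadata validates the config (that is driven by the JSON Schema); `resolve_connection` is a hypothetical helper, and treating `clientId`, `clientSecret`, and `tenantId` as the required fields is an assumption based on the sample, where only those three are uncommented.

```python
# Assumed required: the only fields left uncommented in the sample config.
REQUIRED_FIELDS = ("clientId", "clientSecret", "tenantId")

# Defaults taken from the field descriptions on this page.
DEFAULTS = {
    "scope": ["https://analysis.windows.net/powerbi/api/.default"],
    "authorityURI": "https://login.microsoftonline.com/",
    "hostPort": "https://app.powerbi.com",
    "pagination_entity_per_page": 100,
    "useAdminApis": True,
}


def resolve_connection(config: dict) -> dict:
    """Return `config` with documented defaults applied; raise if a required field is missing."""
    missing = [name for name in REQUIRED_FIELDS if not config.get(name)]
    if missing:
        raise ValueError(f"Missing required PowerBI fields: {', '.join(missing)}")
    # Values supplied by the user win over the defaults.
    return {**DEFAULTS, **config}


if __name__ == "__main__":
    connection = resolve_connection(
        {"type": "PowerBI", "clientId": "clientId", "clientSecret": "secret", "tenantId": "tenant"}
    )
    print(connection["hostPort"], connection["useAdminApis"])
```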

{% partial file="/v1.3/connectors/yaml/ingestion-cli.md" /%}