3385 Commits

Author SHA1 Message Date
Ayush Shah
00ecca07e9
Add fix for test connection w/o db (#11354) 2023-05-02 16:00:57 +05:30
Ayush Shah
8ebe6a80e6
Upgrade Pyarrow (#11383) 2023-05-02 16:00:32 +05:30
Nahuel
94eece76f8
Fix: Tableau DataModel optional dataType (#11379) 2023-05-02 09:05:44 +00:00
Mayur Singal
4110dc2472
Fix #11352: Fix athena usage models (#11378) 2023-05-02 08:24:01 +00:00
Teddy
4b5a0eab1a
fix: catch generic SQAlchemy error for non supported regex_match (#11366) 2023-05-02 10:30:30 +05:30
Keith Sirmons
ad9b5a0cb5
Impalaconnection 0.2.1 + string datatypes enabled in profile (#11364)
* updated metadata to work with the impala query engine.
Uses the describe function to grab column names, data types, and comments.

* added the ordinalPosition data point into the Column constructor.

* renamed variable to better describe its usage.

* updated profile errors.
Hive connections now comment columns by default.

* removed print statements

* Cleaned up code by pulling check into its own function

* Updated median function to return null when it is being used for first and third quartiles.

* updated metadata to work with the impala query engine.
Uses the describe function to grab column names, data types, and comments.

* added the ordinalPosition data point into the Column constructor.

* renamed variable to better describe its usage.

* updated profile errors.
Hive connections now comment columns by default.

* removed print statements

* Cleaned up code by pulling check into its own function

* Updated median function to return null when it is being used for first and third quartiles.

* removed print statements and ran make py_format

* updated to fix some pylint errors.
imported Dialects to remove string compare to "impala" engine

* moved huge comment into function docstring.
This comment shows us the sql to get quartiles in Impala

* added cast to decimal for column when running average in mean.py

* fixed lint error

* fixed ui ordering of precision and scale.
Precision should be ordred in front of scale since the precision is set first in decimal data types

* Fixed overflow error when converting large numbers to bigint

Fixed error for CHAR datatype missing.

* Fixed NaN issues with Impala Profile

* py formatting

* Fixed warnings from SqlAlchemy
  The GenericFunction 'max' is already registered and is going to be overridden.
  The GenericFunction 'min' is already registered and is going to be overridden.

Updated Min/Max to handle strings by getting they length.

* Updated profiler to handle strings by using the string length as the parameter to compute the profile

* py_format updates

* fix: ran linting

* fix: Mysql hardcoded table alias

---------

Co-authored-by: Chirag Madlani <12962843+chirag-madlani@users.noreply.github.com>
Co-authored-by: Teddy Crepineau <teddy.crepineau@gmail.com>
2023-04-30 10:03:56 +02:00
Pere Miquel Brull
fc5c0fa756
Fixes #11340 - Add missing headers (#11356)
* Add missing headers

* Add raise

* Format
2023-04-28 07:42:37 +02:00
Teddy
b715208d28
Fixes #11327 - Improve Profiler Logging (#11341)
* feat: improved profiler logging

* feat: ran python linting
2023-04-27 18:18:33 +02:00
Pere Miquel Brull
c53a3413fb
Fixes #11307 - Handle exceptions if LookML model is invalid (#11320)
* Fix dynamo docs

* Handle data model fetch exceptions

* Format

* Add example for Private Key format
2023-04-27 11:42:16 +02:00
Ayush Shah
f7168db8ea
Add Quicksight AWS support (#11294) 2023-04-27 11:39:47 +05:30
Mayur Singal
fd5f63fb58
Fix MSSQL connection with pyodbc scheme (#11304)
Co-authored-by: Sriharsha Chintalapani <harshach@users.noreply.github.com>
2023-04-27 07:25:10 +02:00
Teddy
0930bc307a
fix: change in entityLink to string in CreateTestCaseRequest (#11291) 2023-04-26 10:52:09 +00:00
Nahuel
bcdab5e30a
Fix: Tableau E2E wrong expected values (#11290) 2023-04-26 13:44:17 +05:30
Pere Miquel Brull
9ebc0cd5e4
Fix sample usage DAG directory (#11276) 2023-04-26 06:53:10 +02:00
Ayush Shah
dd509681be
Fixes tableau, add quicksight e2e (#11177) 2023-04-26 10:22:08 +05:30
Teddy
afce5fa61b
Fix E2E tests (#11267)
* fix: profile only include schema

* tests: add logic to handle exsiting views and table for Hive

* fix: python linting
2023-04-25 16:05:49 +02:00
Onkar Ravgan
8bcfd013a1
Added validation (#11249) 2023-04-25 06:58:59 +00:00
Ayush Shah
efd82113ec
Fix E2E tests (#11226) 2023-04-25 10:11:06 +05:30
Mayur Singal
c920c9afa3
0.13 to 1.00 docs changes (#11236)
* 0.13 to 1.00 changes

* add superset changes

* dbt gcs yaml fix

---------

Co-authored-by: Onkar Ravgan <onkar.10r@gmail.com>
2023-04-24 16:12:24 +02:00
NiharDoshi99
d1996d4260
added docs for sqlite (#11232) 2023-04-24 11:07:15 +00:00
Teddy
017fbc6a32
fix: logic for number of profiled tables (#11222)
* fix: logic for number of profiled tables

* fix: python linting
2023-04-24 08:00:25 +02:00
Pere Miquel Brull
d3d523e96d
Ingestion md docs review (#11219)
* Update workflow docs

* Remove duplicate key

* Update Custom connector docs

* Update Domo connector docs

* Dashboard docs updates

* Some databases docs updates

* Finish db docs updates

* Remove Pulsar

* Messaging docs

* Metadata docs

* ML docs

* S3 docs

* Fix rendering

* Update title and description of the databaseSchema

* Pipeline Service docs

* remove pulsar from tests

* Format

* Fix test

* Remove pulsar

* Remove pulsar
2023-04-23 18:43:46 +02:00
Sriharsha Chintalapani
123758e21e
Fix #10964: update retentionSize based on retention.size in the topic config (#11217)
* Fix #10964: update retentionSize based on retention.size in the topic config

* Fix #10964: update retentionSize based on retention.size in the topic config
2023-04-23 08:36:58 +02:00
Sriharsha Chintalapani
9e259be44e
Fix #11214: Ingestion based elastic search index missing serviceType for MLModel and Container (#11215) 2023-04-23 07:30:37 +02:00
Mayur Singal
da2f03ca50
Salesforce docs & remove unnecessary fields (#11207) 2023-04-22 18:32:32 +02:00
Mayur Singal
cb5ee34a1b
Fix Lineage Via Table Entity Error (#11209) 2023-04-22 18:31:30 +02:00
Nahuel
ed1388827e
Doc: Add ElasticsearchReindex and Data Insight docs in UI (#11201) 2023-04-21 11:34:55 -07:00
Teddy
6e129c1e65
Issue 10805 Added Hive e2e (#11197)
* tests: Added E2E test for Hive + fix minor bug

* tests: ran python linting
2023-04-21 15:45:12 +00:00
Onkar Ravgan
4c3b20b910
Req Markdown docs: dbt, sagemaker, mode, powerbi, db2, dynamo, kinesis, fivetran (#11173)
* Added markdown req docs

* Added v1 docs

* Update openmetadata-docs-v1/content/v1.0.0/connectors/database/db2/index.md

fixed typo in db2 grant

Co-authored-by: Teddy <teddy.crepineau@gmail.com>

* typo fix v1 docs

---------

Co-authored-by: Teddy <teddy.crepineau@gmail.com>
2023-04-21 16:44:41 +02:00
Ayush Shah
a50c31539b
Fix HexByteString Issue, revert datatype change (#11145)
* Fix HexByteString Issue, revert datatype change

* Add E2E MSSQL Bit type
2023-04-21 10:08:27 +02:00
Keith Sirmons
97b58c65f5
Impalaconnection (#11151)
* updated metadata to work with the impala query engine.
Uses the describe function to grab column names, data types, and comments.

* added the ordinalPosition data point into the Column constructor.

* renamed variable to better describe its usage.

* updated profile errors.
Hive connections now comment columns by default.

* removed print statements

* Cleaned up code by pulling check into its own function

* Updated median function to return null when it is being used for first and third quartiles.

* updated metadata to work with the impala query engine.
Uses the describe function to grab column names, data types, and comments.

* added the ordinalPosition data point into the Column constructor.

* renamed variable to better describe its usage.

* updated profile errors.
Hive connections now comment columns by default.

* removed print statements

* Cleaned up code by pulling check into its own function

* Updated median function to return null when it is being used for first and third quartiles.

* removed print statements and ran make py_format

* updated to fix some pylint errors.
imported Dialects to remove string compare to "impala" engine

* moved huge comment into function docstring.
This comment shows us the sql to get quartiles in Impala

* added cast to decimal for column when running average in mean.py

* fixed lint error

* fixed ui ordering of precision and scale.
Precision should be ordred in front of scale since the precision is set first in decimal data types

* first pass for impala connector

* updated default auth_mechanism to be one of the enum values.

* updated UI documentation to match fields for the impalaconneciton.

refined impalaConnection to bring use_ssl to a boolean instead or relying on an extra connection option being manually added.

Removed reference to hive for type mapping

added impala to the pip setup

* py_format updates

* removed print statement

* Lints and fixes

* Updated database documentation to follow new style

* Flag as BETA

* Remove tests

---------

Co-authored-by: Chirag Madlani <12962843+chirag-madlani@users.noreply.github.com>
Co-authored-by: Pere Miquel Brull <peremiquelbrull@gmail.com>
2023-04-21 09:57:13 +02:00
Teddy
6f5d88ff63
tests: Added E2E tests for redshift (#11169) 2023-04-21 08:55:54 +02:00
Nahuel
8184516f80
Fix: Align includeOwner option with the rest of options to include entities (#11160) 2023-04-20 16:01:54 +02:00
Mayur Singal
7c1fe5202d
Fix Metabase E2E Tests (#11156) 2023-04-20 13:22:11 +00:00
Pere Miquel Brull
e453593760
Rel to #10805 - Ingestion docs glue & Kerberos Dep (#11157)
* Glue

* Add kerberos system dep
2023-04-20 10:56:46 +02:00
Pere Miquel Brull
91cd1491ee
Add Athena, Lineage and Usage docs & Fix Athena UI Lineage and Usage workflows (#11148)
* Athena docs

* Lineage and Usage docs

* Missing section close

* Fix Athena Model
2023-04-20 06:31:53 +02:00
Mayur Singal
dd754d586e
Metabase E2E Test & docs (#11126) 2023-04-20 00:50:23 +05:30
Onkar Ravgan
09fb69b68d
Fixed objectstore import (#11144) 2023-04-19 21:39:29 +05:30
Teddy
0f7d9699ad
Fix metrics filtering (#11149)
* fix: get column not filtering for metric types when profilerConfig with include columns is set

* fix: run python linting
2023-04-19 14:09:13 +00:00
Milan Bariya
66b25d2f30
Fix: Databricks usage issue (#11143) 2023-04-19 19:25:17 +05:30
Ayush Shah
ca861bc06e
Fixes #11137: Mssql Syntax Error + Arithemetic Error (#11138) 2023-04-19 15:08:12 +05:30
NiharDoshi99
1862ba2ba4
Changing behaviour for owners same as description for dashboards (#11118)
* changing behaviour for owners same as description

* fix typo
2023-04-19 12:31:56 +05:30
Onkar Ravgan
932ffa5fa0
Added Powerbi E2E and docs (#11130) 2023-04-19 11:49:20 +05:30
Pere Miquel Brull
a78a3b4734
Azure datalake metadata ingestion fixes (#11125)
* Add ADLS permissions

* Fix Azure DL ingestion

* Format

* enable decode for json

* fix gcs decode error

---------

Co-authored-by: ulixius9 <mayursingal9@gmail.com>
2023-04-19 07:28:41 +02:00
Pere Miquel Brull
463f242d6b
Add S3 Storage docs & CI validation (#11120)
* Prep CI

* Update Athena

* Update docs for S3 Storage

* Add manifest information
2023-04-19 06:31:55 +02:00
Milan Bariya
7cbe48971d
fix: Redash lineage issue (#11098)
* fix: Redash lineage issue

* change based on comments

* change based on comments

* change based on comments
2023-04-18 21:22:17 +05:30
Teddy
07b74ce8e8
fix: remove count of results processed (#11115) 2023-04-18 14:00:14 +00:00
Teddy
97ff34967a
Fix histogram bin creation (#11105)
* fix: bin creation + pass full table name for mysql median computation

* fix: ran linting for python

---------

Co-authored-by: Nahuel <nahuel@getcollate.io>
2023-04-18 14:49:21 +02:00
Nahuel
c3bfd1310a
Fix: bump 'openmetadata-sqllineage' version to 1.0.4 (#11109) 2023-04-18 13:58:39 +02:00
Teddy
c415f04eb7
Issue 10805 profiler e2e mssql (#11096)
* feat: extracted builder logic into factory and cleaned up comments

* feat: added E3E logic for time based profiler partition + logic for testing detailed profiler results
2023-04-18 11:56:16 +02:00