2916 Commits

Author SHA1 Message Date
Mayur Singal
ef7b02529d
Remove sqlalchemy-ibmi from db2 dependency (#11553)
* Remove sqlalchemy-ibmi from db2 dependency

* remove from json schema

* add migration

* update what's new
2023-05-11 15:03:26 +02:00
Pere Miquel Brull
6040078a7c
Validate context in Airflow (#11551) 2023-05-11 12:23:43 +02:00
Pere Miquel Brull
f22d604c54
Remove old tests (#11505)
Co-authored-by: Shailesh Parmar <shailesh.parmar.webdev@gmail.com>
2023-05-11 10:29:30 +02:00
Sriharsha Chintalapani
e3a99e2f84
Fix Airflow ingestion: check if downstream tasks are null (#11539)
* Fix Airflow ingestion: check if downstream tasks are null
2023-05-11 06:43:57 +02:00
Mayur Singal
f7a0d3f5f2
Fix E2E Vertica & System Metric (#11525) 2023-05-10 17:35:55 +05:30
Mayur Singal
be7b056d72
Fix tableau models to make fields optional (#11513) 2023-05-10 12:23:44 +05:30
Teddy
60de33d7cf
Fixes #11384 - Implement mem. optimization for sys. metrics (#11460)
* fix: optimize system metrics retrieval for memory

* fix: ran python linting

* fix: logic to retrieve unique system metrics operations

* fix: added logic to clean up query before parsing it

* fix: added E2E tests for rds, bq, snflk system metrics

* fix: ran python linting

* fix: fix postgres query + add default byte size to env var

* fix: ran python linting
2023-05-09 12:05:35 +02:00
Pere Miquel Brull
80af2b20bb
Centralize SSL Schemas & Add SSL for Airflow REST (#11363) 2023-05-08 21:05:10 +02:00
Ayush Shah
65fc94a292
Fix Dynamodb connection (#11473)
---------

Co-authored-by: Onkar Ravgan <onkar.10r@gmail.com>
Co-authored-by: Nahuel Verdugo Revigliono <nahuel@getcollate.io>
2023-05-08 15:48:38 +00:00
Mayur Singal
ddc3060184
Fix vertica comments for version 9 (#11479) 2023-05-08 11:04:28 +00:00
Mayur Singal
074418eac3
Fix: Snowflake support external tables (#11466) 2023-05-08 15:54:42 +05:30
Mayur Singal
a897954afc
Improve filtering for lineage query (#11457) 2023-05-08 15:18:51 +05:30
Ayush Shah
2c9ba537eb
Fix min max on rowversion/timestamp mssql (#11455) 2023-05-08 14:52:53 +05:30
Mayur Singal
e20f75fa54
Fix #11468: Fix salesforce scheme issue (#11474) 2023-05-08 14:02:17 +05:30
Pere Miquel Brull
aaf92a9ae8
Bump python-daemon (#11477) 2023-05-08 09:59:57 +02:00
Nahuel
b22192575d
Fix #11216: Use Enum for SASL Mechanism for Kafka and Redpanda connections (#11385)
* Fix #11216: Use ENUM for SASL Mechanism for Kafka and Redpanda connections

* Minor change

* Address PR comments

* Fix postgres migration
2023-05-08 09:35:49 +02:00
Noe Alejandro Perez Dominguez
b9aef4156f
Allow client requests to be retried (#11464)
This is regarding: https://github.com/psf/requests/issues/4664

The change allows errors to be handled and retried.

Co-authored-by: Mayur Singal <39544459+ulixius9@users.noreply.github.com>
2023-05-08 08:31:55 +02:00
Keith Sirmons
65c5b44eaa
Impala Connection Profiler is_nan rollback; Histogram fix. (#11388) 2023-05-05 21:45:30 +02:00
Onkar Ravgan
c7902344cb
Added tableau owner test connection step (#11436)
* Added tableau owner test connection

* typo fixed

* removed generic exception
2023-05-05 18:29:50 +05:30
Mayur Singal
2793c927cd
Fix postgres cypress issue (#11441) 2023-05-04 17:09:27 +00:00
Mayur Singal
6ddf7034ae
Add exception handling for tableau sheets (#11429)
* Add exception handling for tableau sheets

* remove meating
2023-05-04 16:38:27 +05:30
Mayur Singal
3f65c8ab0f
Fix Vertica column fetch query (#11424)
* Fix vertica column fetch query

* fix timestamp datatype
2023-05-04 13:37:00 +05:30
Mayur Singal
186eb252d7
Fix #11332: Fix databricks sample data ingestion for array datatype (#11420)
* Fix #11332: Fix databricks sample data ingestion for array datatype

* Fix checkstyle
2023-05-04 13:36:12 +05:30
Mayur Singal
49f3bae15e
Fix geometry type for postgres (#11394) 2023-05-04 13:02:50 +05:30
Teddy
0a7f114281
fix: added logic to handle tests with no in result (#11409) 2023-05-03 21:59:23 +02:00
NiharDoshi99
02d4a1d7d6
making downstream_task_ids field optional for airflow AirflowDagDetails (#11405)
* making downstream_task_ids field optional for airflow AirflowDagDetails

* update requirements file for airflow
2023-05-03 22:09:10 +05:30
Onkar Ravgan
7e9c02fe6f
Fixed clean_query method for \n (#11389)
* Fixed clean query method

* fixed regex and tests

* updated regex
2023-05-03 18:08:54 +05:30
Keith Sirmons
00289bd85f
Fixes #11189: Implement Impala and hive get_view_definition (#11237)
* updated metadata to work with the impala query engine.
Uses the describe function to grab column names, data types, and comments.

* added the ordinalPosition data point into the Column constructor.

* renamed variable to better describe its usage.

* updated profile errors.
Hive connections now comment columns by default.

* removed print statements

* Cleaned up code by pulling check into its own function

* Updated median function to return null when it is being used for first and third quartiles.

* removed print statements and ran make py_format

* updated to fix some pylint errors.
imported Dialects to remove string compare to "impala" engine

* moved huge comment into function docstring.
This comment shows us the sql to get quartiles in Impala

* added cast to decimal for column when running average in mean.py

* fixed lint error

* fixed ui ordering of precision and scale.
Precision should be ordered in front of scale since the precision is set first in decimal data types

* Added get_view_definition to hive and impala connectors.

---------

Co-authored-by: Chirag Madlani <12962843+chirag-madlani@users.noreply.github.com>
2023-05-03 15:06:33 +05:30
Nahuel
3ba29e7f0e
Fix: Redshift E2E tests (#11396) 2023-05-03 08:12:32 +05:30
Nahuel
1ec6e5e285
Fix #11311: Add IBM dependency for i Series in DB2 connector (#11381) 2023-05-02 15:50:39 +02:00
Teddy
f8c667b504
Fix median for concatenable types (#11382)
* fix: median/fq/tq for concatenable types

* fix: ran linting
2023-05-02 10:45:26 +00:00
Ayush Shah
00ecca07e9
Add fix for test connection w/o db (#11354) 2023-05-02 16:00:57 +05:30
Ayush Shah
8ebe6a80e6
Upgrade Pyarrow (#11383) 2023-05-02 16:00:32 +05:30
Nahuel
94eece76f8
Fix: Tableau DataModel optional dataType (#11379) 2023-05-02 09:05:44 +00:00
Mayur Singal
4110dc2472
Fix #11352: Fix athena usage models (#11378) 2023-05-02 08:24:01 +00:00
Teddy
4b5a0eab1a
fix: catch generic SQLAlchemy error for non supported regex_match (#11366) 2023-05-02 10:30:30 +05:30
Keith Sirmons
ad9b5a0cb5
Impalaconnection 0.2.1 + string datatypes enabled in profile (#11364)
* updated metadata to work with the impala query engine.
Uses the describe function to grab column names, data types, and comments.

* added the ordinalPosition data point into the Column constructor.

* renamed variable to better describe its usage.

* updated profile errors.
Hive connections now comment columns by default.

* removed print statements

* Cleaned up code by pulling check into its own function

* Updated median function to return null when it is being used for first and third quartiles.

* removed print statements and ran make py_format

* updated to fix some pylint errors.
imported Dialects to remove string compare to "impala" engine

* moved huge comment into function docstring.
This comment shows us the sql to get quartiles in Impala

* added cast to decimal for column when running average in mean.py

* fixed lint error

* fixed ui ordering of precision and scale.
Precision should be ordered in front of scale since the precision is set first in decimal data types

* Fixed overflow error when converting large numbers to bigint

Fixed error for CHAR datatype missing.

* Fixed NaN issues with Impala Profile

* py formatting

* Fixed warnings from SQLAlchemy
  The GenericFunction 'max' is already registered and is going to be overridden.
  The GenericFunction 'min' is already registered and is going to be overridden.

Updated Min/Max to handle strings by getting their length.

* Updated profiler to handle strings by using the string length as the parameter to compute the profile

* py_format updates

* fix: ran linting

* fix: Mysql hardcoded table alias

---------

Co-authored-by: Chirag Madlani <12962843+chirag-madlani@users.noreply.github.com>
Co-authored-by: Teddy Crepineau <teddy.crepineau@gmail.com>
2023-04-30 10:03:56 +02:00
Pere Miquel Brull
fc5c0fa756
Fixes #11340 - Add missing headers (#11356)
* Add missing headers

* Add raise

* Format
2023-04-28 07:42:37 +02:00
Teddy
b715208d28
Fixes #11327 - Improve Profiler Logging (#11341)
* feat: improved profiler logging

* feat: ran python linting
2023-04-27 18:18:33 +02:00
Pere Miquel Brull
c53a3413fb
Fixes #11307 - Handle exceptions if LookML model is invalid (#11320)
* Fix dynamo docs

* Handle data model fetch exceptions

* Format

* Add example for Private Key format
2023-04-27 11:42:16 +02:00
Ayush Shah
f7168db8ea
Add Quicksight AWS support (#11294) 2023-04-27 11:39:47 +05:30
Mayur Singal
fd5f63fb58
Fix MSSQL connection with pyodbc scheme (#11304)
Co-authored-by: Sriharsha Chintalapani <harshach@users.noreply.github.com>
2023-04-27 07:25:10 +02:00
Teddy
0930bc307a
fix: change in entityLink to string in CreateTestCaseRequest (#11291) 2023-04-26 10:52:09 +00:00
Nahuel
bcdab5e30a
Fix: Tableau E2E wrong expected values (#11290) 2023-04-26 13:44:17 +05:30
Pere Miquel Brull
9ebc0cd5e4
Fix sample usage DAG directory (#11276) 2023-04-26 06:53:10 +02:00
Ayush Shah
dd509681be
Fixes tableau, add quicksight e2e (#11177) 2023-04-26 10:22:08 +05:30
Teddy
afce5fa61b
Fix E2E tests (#11267)
* fix: profile only include schema

* tests: add logic to handle existing views and tables for Hive

* fix: python linting
2023-04-25 16:05:49 +02:00
Onkar Ravgan
8bcfd013a1
Added validation (#11249) 2023-04-25 06:58:59 +00:00
Ayush Shah
efd82113ec
Fix E2E tests (#11226) 2023-04-25 10:11:06 +05:30
Mayur Singal
c920c9afa3
0.13 to 1.00 docs changes (#11236)
* 0.13 to 1.00 changes

* add superset changes

* dbt gcs yaml fix

---------

Co-authored-by: Onkar Ravgan <onkar.10r@gmail.com>
2023-04-24 16:12:24 +02:00