1157 Commits

Author SHA1 Message Date
Sriharsha Chintalapani
96dd94eab7
Merge pull request #4022 from MilanBariya/issue-3266
ISSUE-3266: Add unit test for vertica
2022-04-11 13:01:05 -07:00
Sriharsha Chintalapani
9479b49af6
Merge pull request #4030 from ulixius9/issue-3830
Fix #3830: Fixed Query parser
2022-04-11 08:12:13 -07:00
Milan Bariya
b64384e84a Add unit test for dynamodb 2022-04-11 19:33:27 +05:30
ulixius9
38ee0eca49 Fix #3830: Fixed Query parser 2022-04-11 18:59:36 +05:30
Milan Bariya
4fdb99bb26 ISSUE-3266: Add unit test for vertica 2022-04-11 17:57:05 +05:30
Milan Bariya
d73c7d9b44 ISSUE-3266: Add unit test for vertica 2022-04-11 15:17:22 +05:30
Mayur Singal
a3ceea6b63
Hive Source Fix (#3919)
* Hive Source Fix

* Added auth option test & unified unit test
2022-04-08 22:29:48 +05:30
Pere Miquel Brull
06a3e4c989
Fix #3825 - Schema Name, SQL Source FQDN & ORM Profiler (#3942)
* Fix db schema name

* Fix sqlite connection

* Correctly register scanned tables

* improve sqlite connection

* Adapt schemas on ORM profiler

* Format
2022-04-08 19:28:10 +05:30
Milan Bariya
4d15d0f91e
ISSUE-3811: Add unit test for clickhouse (#3949) 2022-04-08 19:26:02 +05:30
dvecerdea
230fc256d6
Refactor Trino Connector and allow proxy setup in trino connection config (#3810)
Co-authored-by: Sriharsha Chintalapani <harshach@users.noreply.github.com>
2022-04-07 12:50:37 -07:00
Ayush Shah
7c3a6082a6
Refactor - Redshift Connection (#3920) 2022-04-07 08:52:38 -07:00
Mayur Singal
3651efd7f5
Databricks Source Fix (#3917)
2022-04-07 16:40:50 +02:00
Pere Miquel Brull
93525aea0d
Fix #3823 & Fix generate default Enum field (#3909)
* Fix sample data DAG

* Fix callback imports

* Use --set-default-enum-member in generate

* Format

* Add faulty merge hard_delete

* Fix airflow lineage, improve naming and fix lineage tests

* Add mysql url test

* Add mysql url test

* Update CI name

* Fix test ometa endpoint

* Format

* Fix metadata config
2022-04-07 18:22:50 +05:30
Pere Miquel Brull
bd4071bd64
Fix #3826 & #3886 - Profiler workflow & filter pattern (#3893)
2022-04-06 17:05:00 +02:00
Pere Miquel Brull
63533eb388
Fix for connectors based on refactoring of schemas V2 (#3870)
Co-authored-by: Ayush Shah <ayush@getcollate.io>
2022-04-05 18:33:25 -07:00
Pere Miquel Brull
b3480693e4
Fix #3824 - OMeta Schema and JSON Connections (#3861)
2022-04-05 21:20:39 +02:00
Pere Miquel Brull
e2539c5e83
Fix #3844 - First iteration for deprecating MetadataServerConfig (#3853)
* Style

* deprecate MetadataServerConfig

* Remove audience from Okta
2022-04-05 18:02:45 +05:30
Onkar Ravgan
01b246f586
ISSUE-3485: Added DynamoDB Unit Test (#3488)
* ISSUE-3485: Added DynamoDB Unit Test

* Added more tests and optimized code

* Re-added list check condition

* Added file sink test

* Fixed file path

* added assert for table and column names

* Modified exception block and optimized test cases

Co-authored-by: Onkar Ravgan <onkarravgan@Onkars-MacBook-Pro.local>
2022-03-31 20:38:17 +05:30
Sriharsha Chintalapani
b14c8dc2c4
Issue-3685: Variable based separator used for fullyQualifiedName instead of hardcoded . for Python and make the separator to : (#3778)
* Issue-3685: Variable based separator used for fullyQualifiedName instead of hardcoded . for Python and make the separator to :

* Fix failing test

* Use colon for run_local_docker validation

* Update tests FQDN

* Update tests FQDN

Co-authored-by: Sachin-chaurasiya <sachinchaurasiyachotey87@gmail.com>
Co-authored-by: pmbrull <peremiquelbrull@gmail.com>
2022-03-31 19:20:27 +05:30
codingwithabhi
2006b2b8ac
bigquery-unittest-added (#3479)
* bigquery-unittest-added

* mock-added-in-create_bigquery_client

* complex-struct-added

* makefile-updated-to-install-all-packages-for-unittest

* bigquery-unittest-updated-via-file-sink
2022-03-31 14:27:51 +05:30
Pere Miquel Brull
9ced748c4f
Use root for FQDN (#3780) 2022-03-31 12:05:11 +05:30
Sriharsha Chintalapani
f79b251509
Fix #3771: Main branch schemas and ingestion is failing; Fix #2992: Remove ChangeEvents log from ElasticSearch Indexes (#3772) 2022-03-30 18:26:57 -07:00
Pere Miquel Brull
c22381fdc8
Fix #3747 - FQDN model in ometa api (#3748)
2022-03-30 08:54:47 +02:00
Pere Miquel Brull
b3087d08b9
Fix #3522 - Add timeout to profiler (#3707)
2022-03-30 08:54:27 +02:00
Snyk bot
3173646484
fix: ingestion/tests/integration/source/mlflow/Dockerfile to reduce vulnerabilities (#3618) 2022-03-23 18:26:41 -07:00
Pere Miquel Brull
eb906589fd
Fix #3525 - Profiler breaks on Postgres data (#3583)
2022-03-22 15:55:44 +01:00
Pere Miquel Brull
548a0ab722
Fix #3437 & #3186 - Airflow lineage Task Group & Tests (#3460) 2022-03-21 10:29:49 -07:00
Ayush Shah
538ee87acc
Fix tests - added recursion delete and error handling (#3560) 2022-03-21 10:00:39 -07:00
Pere Miquel Brull
16e82d45de
Fix #3371 - Run Profiler and Tests on a % of the data (#3424)
2022-03-16 06:05:59 +01:00
Pere Miquel Brull
130bbb0c5c
Fix #3104 - Remove unused imports with pycln (#3370)
2022-03-14 06:59:15 +01:00
Pere Miquel Brull
94d7500216
Fix #3248 & #3251 - Update metrics and column profile (#3262)
2022-03-08 11:44:39 +01:00
Pere Miquel Brull
434aa30a92
Fix #3221 - columnValuesToBeUnique vs. columnValuesToBeNotNull JSONs (#3255)
2022-03-08 09:15:20 +01:00
Pere Miquel Brull
954908efec
Fix #3217 - Test case props should be unique (#3220)
2022-03-07 14:43:42 +01:00
Pere Miquel Brull
2444b884bf
Fix #3150 - Missing count & value length tests (#3193)
2022-03-07 07:19:13 +01:00
Pere Miquel Brull
4a752e3ab2
Fix #3151 - Ingestion profiler should use ORM Profiler (#3192) 2022-03-06 15:43:43 -08:00
Francisco J. Jurado Moreno
a6e98b3fdb
Fix2113 - Database Service API Integration tests (#3185) 2022-03-06 14:59:32 +01:00
Pere Miquel Brull
4116233697
Fix #3105 - ColumnValuesToMatchRegex & other fixes (#3149)
2022-03-04 18:11:49 +01:00
Pere Miquel Brull
bd7b91b448
Fix #3112 - col profile safety & sample data (#3142)
2022-03-04 13:14:11 +01:00
Snyk bot
834bb8e483
fix: ingestion/tests/integration/source/mlflow/Dockerfile to reduce vulnerabilities (#3134)
The following vulnerabilities are fixed with an upgrade:
- https://snyk.io/vuln/SNYK-DEBIAN11-AOM-1290331
- https://snyk.io/vuln/SNYK-DEBIAN11-AOM-1298721
- https://snyk.io/vuln/SNYK-DEBIAN11-AOM-1300249
- https://snyk.io/vuln/SNYK-DEBIAN11-UNZIP-2396444
- https://snyk.io/vuln/SNYK-DEBIAN11-UNZIP-2396445
2022-03-04 08:34:24 +01:00
Snyk bot
8132b91c30
fix: ingestion/tests/integration/lineage/airflow/requirements.txt to reduce vulnerabilities (#3130)
The following vulnerabilities are fixed by pinning transitive dependencies:
- https://snyk.io/vuln/SNYK-PYTHON-APACHEAIRFLOW-2413442
- https://snyk.io/vuln/SNYK-PYTHON-APACHEAIRFLOW-2414089
- https://snyk.io/vuln/SNYK-PYTHON-WTFORMS-40581
2022-03-04 07:49:58 +01:00
Pere Miquel Brull
e96ac838ff
Fix #3084 - Implement missing tests (#3117)
Fix #3084 - Implement missing tests
2022-03-04 06:59:47 +01:00
Pere Miquel Brull
71207de362
Fix #2875 - Profiler API Sink (#3011)
Fix #2875 - Profiler API Sink
2022-03-02 16:46:28 +01:00
Alberto Miorin
fe5618c8f1
Fix #3037: metadata --version doesn't work (#3038) 2022-03-01 12:19:36 +01:00
Pere Miquel Brull
990608522a
Fix #2981 - Update Profile to match TableProfile (#2982) 2022-02-25 09:26:30 -08:00
Pere Miquel Brull
4d928bab80
Fix #2965 - Additional Properties set to False (#2976) 2022-02-24 08:38:56 -08:00
Pere Miquel Brull
a4b383fa83
Fix #2897 - Profiler CLI (#2967)
Fix #2897 - Profiler CLI
2022-02-24 08:03:50 +01:00
Pere Miquel Brull
1fb0e7c489
Fix #2878 & #2877 - Implement Metrics and Validate Composed Metrics (#2926)
Fix #2878 & #2877 - Implement Metrics and Validate Composed Metrics
2022-02-24 07:08:39 +01:00
Pere Miquel Brull
1224d20a36
Fix #2894 - Profiler Processor & Metrics (#2900)
2022-02-22 08:09:02 +01:00
Pere Miquel Brull
e55579aaa8
Fix #2845 - Init Profiler Workflow (#2862)
* Fix list typing

* Add sqlite service

* Add sqlite service

* Add sqlite service

* Refactor validation into class

* Refactor validation into class

* Prepare table simple profiler

* Add note

* test ORM conversion

* Prepare workflow config utilities

* Prepare workflow skeleton

* Use new core Validation

* Refactor workflow config parsing

* Add comment

* Simplify workflow validations

* Fix table metric check

* Add init for convenience, otherwise interpreter cries when trying to __call__ the get result

* Fix table metric check

* Format

* Format

* Fix table list and metrics init

* Prepare profiler workflow integration tests

* Bump version

* Fix pycharm imports

* format
2022-02-20 17:55:12 +01:00
Pere Miquel Brull
f304d290b4
Fix #2751 - Init ORM Profiler (#2831)
* ORM Profiler skeleton

* Fix table name within service

* Add license

* Prepare custom types

* Fix converter

* Compute stddev only on numeric

* Prepare smart registries

* Update tests

* Update results retrieval

* Fix composed metrics result

* Format

* Add missing type

* Add _label decorator

* clean readme

* clean readme

* Filter types when profiler runs not allowed metric types

* Fix null ratio

* Add proper type

* RuleMetric skeleton

* Prepare table metrics

* Update simple profiler

* Format

* Define test expression grammar and node visiting

* Unify metric registry

* Prepare validation core

* Add grammar lib

* Add safe get

* Format

* Allow decimals in grammar

* Test validation conversion

* Fix validation conversion and test

* Rename to row_number

* Update READMEs

* Format

* Row number naming

* Fix rename
2022-02-18 07:48:38 +01:00