harshsoni2024
4f57f4e710
GEN-309: OpenAPI Connector ( #17754 )
...
* configure api service metadata
* add rest api service
* fix test con. pyformat changes
* add models, fix test con.
* improve test con.
* add docs, side doc
* fix model data parse, url error fix
* add tests
* fix pytest errors
---------
Co-authored-by: Pere Miquel Brull <peremiquelbrull@gmail.com>
2024-09-11 13:36:53 +05:30
harshsoni2024
3922f45a3e
fix typo in profiler metric ( #17784 )
2024-09-10 22:31:23 +05:30
Suman Maharana
094bae7097
Fixes #15565 : Advanced Avro schema recursion depth issue ( #17683 )
...
* Fix: Avro schema recursion depth issue
* py_format
* Addressed comments
2024-09-10 18:26:42 +05:30
Mayur Singal
b142882848
MINOR: Fix redshift ingestion warnings in case of ext table ( #17767 )
2024-09-10 15:32:35 +05:30
Teddy
e4c01c5702
fix: region typo in test ( #17766 )
2024-09-09 17:54:07 +05:30
Mayur Singal
9107c41a23
GEN-1251: Fix Expected ColumnLineage but got dict ( #17708 )
2024-09-09 17:52:51 +05:30
Pere Miquel Brull
c309906a1b
MINOR - Bump Presidio Analyzer and validate support for legal entities ( #17750 )
2024-09-06 16:07:08 +02:00
Pere Miquel Brull
1a276450fa
MINOR - Fix lineage GET for names with /
and standardize quote calls ( #17748 )
...
* MINOR - Fix lineage GET for names with `/` and standardize quote calls
* format
* fix import
2024-09-06 12:12:44 +02:00
Imri Paran
84be1a3162
Fix 17698: use resolution logic for snowflake system metrics profiler ( #17699 )
...
* fix(profiler): snowflake
resolve tables using the snowflake engine instead of OpenMetadata
* added env for cleaning up dbs in E2E
* moved system metric method to profiler. all the rest says in snowflake
* format
* revert unnecessary changes
* removed test for previous resolution method
* use shutdown39
2024-09-06 07:25:10 +00:00
Ayush Shah
b2f21fa070
Fixes #17085 : Update Oracle count & unique count function to handle HexByteString/Blob types ( #17596 )
2024-09-06 12:42:41 +05:30
Pere Miquel Brull
8191202850
MINOR - Better PII classification for JSON data ( #17734 )
...
* MINOR - Better PII classification for JSON data
* linting
2024-09-06 08:54:23 +02:00
Teddy
a7b6279f2e
GEN-1333 Add TS validation on DQ and Porfiler data ingestion ( #17731 )
...
* fix: added ts validation for DQ and profiler data
* fix: change ts to ms in pytest
* style: ran python linting
2024-09-06 08:16:31 +02:00
Onkar Ravgan
6cd07071d7
Fixed dbt cloud run_res name ( #17694 )
2024-09-04 16:58:29 +05:30
Pere Miquel Brull
2237d5a8d5
MINOR - PII Scanner tests and log levels ( #17686 )
...
* MINOR - PII Scanner tests and log levels
* MINOR - PII Scanner tests and log levels
2024-09-04 12:11:07 +02:00
nicor88
235337dbcd
fixes 17648: add tags and description for tableau published data source ( #17678 )
...
* tableau data-source tags and description
* tweaks
* PR review
* change test description
* black formatting
2024-09-03 11:25:27 +02:00
Suman Maharana
e93cf23d6c
Fixes: Parse postgres json column fields ( #17645 )
2024-09-03 11:10:47 +05:30
Mayur Singal
77d98c9675
MINOR: Fix athena lineage models ( #17662 )
2024-09-02 14:19:13 +05:30
Pere Miquel Brull
bcb29b46da
MINOR - Implement SAP Hana Lineage ( #17615 )
...
* MINOR - SAP Hana Lineage
* skeleton
* parser
* lineage
* manage formulas
* add cvs
* add cvs
* better typing
* enum
* handle cvs
* saphana docs
2024-08-30 07:42:43 +02:00
Ayush Shah
9245c8965f
fix: Bump Ingestion and Dockerfile versions ( #17624 )
2024-08-29 12:03:30 +05:30
nicor88
5eae1e371c
fix ingestion of owner in dbt, via email ( #17613 )
2024-08-28 18:24:50 +05:30
Pere Miquel Brull
2180a6c7f1
FIX - profiler interface system metrics validation & e2e YAML includeDDL ( #17562 )
2024-08-23 09:00:18 +02:00
Imri Paran
b48c6a0485
feat(postgres): add money profile ( #17558 )
...
add support for profiling of money type
2024-08-22 14:53:34 -07:00
Pere Miquel Brull
519b3c32e3
MINOR - Speek up redshift test connection ( #17553 )
2024-08-22 11:12:09 -07:00
Onkar Ravgan
bbe92e2af3
MINOR: Fix none password issue for mysql and postgresql databases ( #17548 )
...
* fix none password issue
* added warning
2024-08-22 17:34:34 +05:30
kwgdaig
43a244fbf1
ISSUE-17045: Modified to create column linage even when upstream columns and data source columns are one-to-many ( #17112 )
...
Co-authored-by: Pere Miquel Brull <peremiquelbrull@gmail.com>
2024-08-22 09:07:05 +02:00
Imri Paran
2dd613b2a7
tests: lineage ( #17436 )
...
add test for cell with 100k characters
2024-08-21 20:28:08 -07:00
Mayur Singal
dd17ee739a
MINOR: Fix output handler time ingestion ( #17429 )
...
* MINOR: Fix output handler time ingestion
* chore: fixes Lint error
---------
Co-authored-by: Ayush Shah <ayush@getcollate.io>
2024-08-21 21:07:26 +05:30
Ayush Shah
7ada44a315
Minor: Remove trailing slash from FivetranClient base URL ( #17528 )
...
* fix: Remove trailing slash from FivetranClient base URL
* chore: use a method created for removing trailing slash
2024-08-21 13:02:16 +00:00
Ayush Shah
c08b83f41d
MINOR: Fixes AttributeError: 'DatalakeGcsClient' object has no attribute 'project' ( #17526 )
2024-08-21 17:51:35 +05:30
Imri Paran
5133c31d31
MINOR: kafka integration tests ( #17457 )
...
* tests: kafka integration
kafka integration tests with schema registry
* added ignore kafka for python 3.8
* fixed tests
2024-08-21 16:05:09 +05:30
IceS2
ac6b192fb3
MINOR: Fix pydantic v2 issues with domo ( #17507 )
...
* Fix pydantic v2 issues with domo
* Fix sourceURL for domo charts
2024-08-21 11:10:12 +02:00
Imri Paran
c055620ff4
tests: lineage ( #17509 )
...
added test cases for lineage with and without includeDDL
2024-08-21 07:47:30 +00:00
Do Manh Ha
a868596db7
Fixes #17461 : Unquote and interpret escaped characters in BigQuery dataset description ( #17462 )
...
* fix(bigquery): unquote and convert any escaped characters to their actual representations
* test: bigquery description with multiple line
---------
Co-authored-by: Imri Paran <imri.paran@gmail.com>
2024-08-20 17:56:19 +02:00
Teddy
354f879866
fix: db_service not passed to metric instantiating ( #17504 )
2024-08-20 07:57:03 -07:00
Teddy
4ed2035a50
fix: remove root when accessing fqn of entity reference ( #17491 )
2024-08-20 15:17:16 +02:00
Ayush Shah
9880f06b2c
Fixes #17489 : Allow non numeric numbers to be sent via Json, Replace NaN value… ( #17490 )
...
* fix: Allow non numeric numbers to be sent via Json, Replace NaN values with None in SQAProfilerInterface
Replace NaN values with None in the SQAProfilerInterface class to maintain database parity. NaN values will be cast to null in OpenMetadata. This change ensures that data handling processes account for this conversion.
* fix: histogram overflow error
* test: Add Unit Test for Null and Null Ratio Metric
* chore: Address comments
* chore: Address comments
* fix: checkstyle and message
* fix: failing tests as null count works as expected
2024-08-20 16:33:55 +05:30
Imri Paran
a59eb2a3cd
fix: pin numpy version ( #17487 )
2024-08-20 10:19:05 +00:00
Imri Paran
5da7bb049c
MINOR: fix table profiler on empty tables in trino ( #17471 )
...
* fix(profiler): trino
coalesce row count to 0 if result is null. this value gets returned for empty tables
* fixed test_metadata.py
2024-08-20 08:42:10 +00:00
IceS2
48b43900b6
Install db2 dependency on amd64 architectures ( #17495 )
2024-08-20 09:24:38 +02:00
Imri Paran
2722eadc33
fix: gcs ( #17486 )
...
1. update docs using gcp credentials in path.
2. updated example `clientIt` in docs
3. fixed client to work with implicit project
4. fixed workflow to warn about missing buckets
2024-08-19 23:14:46 -07:00
Imri Paran
31c2ec8c57
MINOR: fix qlikcloud test connection ( #17459 )
...
* fix: qlikcloud test connection
* patch test_connection for qlik cloud unit tests
2024-08-19 23:14:09 -07:00
IceS2
ddd8c41864
Fix DB2 Schema Trailing Whitespaces ( #17475 )
2024-08-19 23:13:14 -07:00
Imri Paran
7508848376
fix(dq): data types for unique columns ( #17431 )
...
1. remove json and array from supported data types of unique column test.
2. migrations.
3. tests.
2024-08-19 14:28:42 +02:00
Mayur Singal
8acf6d3e94
MINOR: Make Include ddl disabled by default ( #17450 )
...
* MINOR: Make Include ddl disabled by default
* make schema def failure warning debug
* Add missing condition
---------
Co-authored-by: Pere Miquel Brull <peremiquelbrull@gmail.com>
Co-authored-by: Pablo Takara <pjt1991@gmail.com>
2024-08-19 14:07:18 +02:00
Pere Miquel Brull
b175e40e99
MINOR - Clean DEBUG logs ( #17464 )
...
Co-authored-by: Suman Maharana <sumanmaharana786@gmail.com>
2024-08-19 12:20:29 +02:00
Imri Paran
4c08f82e4e
Fixes 17413: Fix one sided tests for columnValueLengthsToBeBetween and columnValuesToBeBetween ( #17423 )
...
* mysql integration tests
* fix(data-quality): accept between with no bounds
add between filters only when the bounds are defined. if they are not (ie: resolve to 'inf' values), do not add any filters
* format
* consolidated ingestion_config
* format
* fixed handling of date and time columns
* fixed tests
2024-08-19 09:09:35 +02:00
Onkar Ravgan
bbb3256c0d
Match correct file names for the dbt artifacts ( #17445 )
2024-08-18 11:40:37 +02:00
Suman Maharana
de3a82eeb6
Minor: Kill active/idle connections after test connections ( #17411 )
...
* Minor: Kill active/idle connections after test connections
* fixed idle conn for multi db
* added exception handling
2024-08-14 15:42:42 +02:00
Ayush Shah
8ad6c95fe4
Fixes #17367 : PipelineStatus Timestamp None not allowed ( #17422 )
...
* fix(ingestion): Change Timestamp None to Current Time noting pending pipeline
* fix(ingestion): Address comments around PipelineStatus timestamp
* fix(ingestion): Improve timestamps handling for tasks and pipeline status
2024-08-13 15:39:29 +02:00
Suman Maharana
feab12422b
MINOR: Fix Datetime Conversion issue in usage/lineage ( #17380 )
...
* MINOR: Fix Datetime Conversion issue in usage/lineage
* Undo mssql specific fixes
* fixed datetime conversion in mssql
* fixed datetime conversion in oracle
2024-08-13 14:04:50 +02:00