OpenMetadata

mirror of https://github.com/open-metadata/OpenMetadata.git synced 2025-08-04 15:18:17 +00:00

Author	SHA1	Message	Date
Mohit Tilala	64ec471e52	Fixes #22363 #22369 : Stored procedure temp table processing and lineage filtering with db & schema (#22416 ) * Process temp table graph in stored procedure processor and add db/schema filtering on lineage * Add tests for stored procedure lineage processing * Fix tests and py_format * Fix the filters and log stored proc query count info	2025-07-18 12:32:22 +05:30
Suman Maharana	9838278ac4	Add: Schema and Database Mark Deletion (#22088 ) * Added Schema and Database Mark Deletion * removed unnecessary changes * fixed marked deleted databases * Added to all db connectors * Added generated types * Added tests	2025-07-15 16:26:46 +02:00
Ayush Shah	fe2caf7a5d	MINOR: Enhance patch request handling by adding 'skip_on_failure' parameter (#22142 ) * Enhance patch request handling by adding 'skip_on_failure' parameter * Introduced 'skip_on_failure' option in build_patch and OMetaPatchMixin methods to control behavior on patch operation failures. * Updated documentation to reflect the new parameter and its default value. * Improved error handling to log warnings instead of raising exceptions when 'skip_on_failure' is set to True. * fix: add tests for patch request with skip on failure * refactor: streamline mock patching and improve test readability in patch request tests * Consolidated import statements for unittest mock. * Enhanced readability by reducing line breaks and simplifying mock patching syntax. * Ensured consistent use of commas in function calls for clarity. * Updated tests to maintain functionality while improving code style. * fix: improve error handling in patch operations * Enhanced logging for patch operation failures in both build_patch and OMetaPatchMixin methods. * Added detailed entity information in warning and error messages to aid in debugging. * Ensured consistent behavior when 'skip_on_failure' is set, providing clearer feedback on operation outcomes. * fix: clean up whitespace in patch request error handling * Removed unnecessary whitespace in the build_patch function to improve code readability. * Ensured consistent formatting in warning and error messages for better clarity during logging. * fix: enhance error handling and improve test assertions in patch request * Updated the condition for checking 'changeDescription' in the _remove_change_description function for better clarity. * Modified exception handling in tests to raise RuntimeError instead of a generic Exception, providing more specific error feedback. * Improved assertions in tests to check for the presence of error messages, enhancing the robustness of error handling verification. * Adjusted test cases to reflect changes in expected patch operation counts and ensure accurate validation of patch operations. * fix: enhance patch operation with skip_on_failure handling * Added 'skip_on_failure' parameter to OMetaPatchMixin methods to control behavior on patch failures. * Improved error handling to log warnings and provide detailed feedback when patch operations are skipped. * Updated tests to verify the new behavior of skipping failures and improved assertions for clarity.	2025-07-14 12:33:17 +05:30
Mayur Singal	47b20a5f2d	MINOR: Fix databricks default schema issue (#22254 )	2025-07-09 11:50:50 -07:00
Mohit Tilala	a6c0261728	Add lineage stored procedure and view filter pattern support (#22223 ) * Add lineage stored procedure and view filter pattern support * Update generated TypeScript types * Add tests for lineage filter pattern --------- Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com> Co-authored-by: Sriharsha Chintalapani <harshach@users.noreply.github.com>	2025-07-08 16:32:25 -07:00
Mohit Tilala	ebfce5ba7b	Handle quoted entity names in masking queries (#22174 ) Co-authored-by: Sriharsha Chintalapani <harshach@users.noreply.github.com>	2025-07-08 15:09:00 -07:00
Keshav Mohta	6e40f976e7	Fix #20145 : Implemented Prefix For Dashboard Service (#21585 ) * feat: implemented microstrategy lineage & dbServicePrefix * feat: added dbServicePrefixes support in other dashboards * fix: test_metabase and powerbi extra code remove * fix: python checkstyle * refactor: added prefix support for other connectors - superset, tableau, etc * refactor: added migration for prefix change and fix dbServicePrefixes field description * refactor: added prefix changes in superset db source * doc: add prefix in tableau doc * fix: typescript files and postgres migration for prefix * fix: moved migration in 1.8.2 --------- Co-authored-by: Pere Miquel Brull <peremiquelbrull@gmail.com>	2025-07-08 18:54:35 +02:00
Ethan	99486a5006	Fixes #18151 : change replaced copy by model_copy (#18153 ) * feat: replace copy * fix native python copy --------- Co-authored-by: IceS2 <pjt1991@gmail.com>	2025-07-08 16:08:20 +02:00
Mayur Singal	573e3bfc21	MINOR: Improve Array Sampler for UC & DBX (#22155 ) * MINOR: Improve Array Sampler for UC & DBX * make log debug * address comments	2025-07-08 14:25:20 +02:00
Mayur Singal	2fcf3281d8	MINOR: Fix snowflake map key type error (#22205 )	2025-07-08 14:44:05 +05:30
Mayur Singal	c2a3027962	MINOR: Fix pytest 3.10 (#22192 )	2025-07-08 10:09:00 +05:30
IceS2	1260a0600a	MINOR: Update Snowflake Connection (#22167 ) * Update Snowflake Connection * Extracting needed methods	2025-07-07 16:38:22 +02:00
mgorsk1	3f01be6756	feat: derive downstream lineage from DBT exposures (#21992 ) * 🎉 Init * replace TARGET with EXPOSURE * refactor and document * add docs * handle missing type/entity not matching * linter * update docs * refactor for using label for communicating FQN as name field cannot contain special characters other than underscore. Storing dots in the name works for now but there is a deprecation warning and it will fail in the future. * improve docs * improve docs * improve logging * refactor for usage of meta.open_metadata_fqn * linting * update docs * update docs * fix docs * 🎉 Add tests	2025-07-07 16:34:33 +02:00
Ferjani Nasraoui	b0e1a136cf	Fixes #21106 : Support owner extraction from serialized Airflow DAGs (#22071 ) * fix(airflow): correctly extract owners from serialized Airflow DAGs Airflow serialization format wraps tasks under `__var` and `__type`. Previously, the OpenMetadata Airflow connector failed to extract task owners properly in this format. This patch: - Flattens `__var` when parsing task owners - Fallbacks to `default_args["owner"]` if no task-level owner is explicitly present - Ensures correct DAG owner is picked as the most common task owner - Handles compatibility with older Airflow versions Fixes: #21106 * test(airflow): add tests for owner extraction from serialized Airflow DAGs Adds new test cases to validate owner extraction logic: - Owners from serialized task format (`__var`) - Fallback to `default_args['owner']` if task owners are missing - Resolution of most common owner - Compatibility with unstructured or missing owners * remove test version specific comment * simplify comments and warnings * fix return statement * fixing formatting * adding handling of default args * fixing and adding more tests	2025-07-03 14:21:36 +05:30
Teddy	29450d1104	feat: add support for DBX system metrics (#22044 ) * feat: add support for DBX system metrics * feat: add support for DBX system metrics * fix: added WRITE back * fix: failing test cases * fix: failing test	2025-07-02 08:54:16 +02:00
Suman Maharana	e36e5da26e	Added Databricks pipeline Lineage (#22014 )	2025-06-30 10:41:22 +05:30
harshsoni2024	10b377590c	qlikcloud get script tables (#22022 )	2025-06-30 10:36:57 +05:30
Mayur Singal	c8f94783ed	Minor: Python E2E Fixes (#21959 )	2025-06-28 18:05:58 +05:30
sonika-shah	5d733b490c	Minor Fix : query_cost_record_search_index Search exception for elasticsearch instance (#21985 ) * Fix : query_cost_record_search_index Search exception for elasticsearch instance * add sample query to cover test scenarios * update mapping and fix test	2025-06-28 11:22:34 +05:30
IceS2	c899d45e8e	MINOR: Update Trino Connection to fix data diff (#21983 )	2025-06-27 07:58:48 +02:00
harshsoni2024	616579a6c1	feat-21984: REST service process nested objects inside array dtype in schema (#21984 )	2025-06-27 10:44:35 +05:30
Ayush Shah	11ac56356b	MINOR: Modify Sample data (#21599 )	2025-06-24 17:16:13 +05:30
harshsoni2024	f490406968	MINOR: pbi improve logging (#21868 )	2025-06-20 16:32:56 +05:30
Keshav Mohta	73ea60b898	Refactor: Unity Catalog (#21801 )	2025-06-20 16:04:34 +05:30
Himanshu Khairajani	79c3d55128	Fix #21679 : Added `metadata ingest-dbt` CLI Command for Direct DBT Artifacts Ingestion (#21680 ) * metadata dbt * fix: - default path to current directory - addional warning and exception handling for missing metadata config vars * test: add unit tests for DBT Ingestion CLI * refactor * PR review: - using Pydantic to parse and validate the openmetadata config in dbt's .yml - extended test-cases - giving user more configuration options for ingestion * py refactoring * add: dbt-auto ingest docs * Improvements: - using environement variables for loading sensitve variables - added docs for auto dbt-ingestion for dbt-core - more test cases * fix: - test case for reading JWT token inside the the method * refactor: py code formatting * refactor: py formatting * ingest-dbt docs updated * refined test cases * Chore: - sonar vulnerability issue review - using existing URL class for host validation --------- Co-authored-by: Mayur Singal <39544459+ulixius9@users.noreply.github.com>	2025-06-19 17:57:10 +05:30
Suman Maharana	7be62f3ed9	Add: Tableau Hierarchy project filter (#21811 )	2025-06-19 11:18:52 +05:30
IceS2	040a33117c	MINOR: Fix Profiler Infinite Loop (#21843 )	2025-06-19 10:33:45 +05:30
harshsoni2024	d38ee0ed52	feat-21712: PowerBI internal entities & cross workspace lineage (#21837 )	2025-06-18 20:46:17 +05:30
Keshav Mohta	7c0eeef049	Fixes #19692 : Implemented Nifi Pipeline Lineage (#21802 ) * feat: implemented nifi pipeline lineage * test: implemented tests for nifi pipeline lineage * fix: yield_pipeline_bulk_lineage_details output type hinting * fix: component check in connections --------- Co-authored-by: Mayur Singal <39544459+ulixius9@users.noreply.github.com>	2025-06-18 07:31:04 +00:00
harshsoni2024	a09a696358	MINOR: Tableau proxy url for sourceurl (#21799 )	2025-06-18 10:52:08 +05:30
Mayur Singal	34c43eaea0	MINOR: Fix pytests (#21807 )	2025-06-17 23:44:29 +05:30
IceS2	e79c54e6a5	MINOR: Add injection to profiler (#21738 ) * Initial implementation for our Connection Class * Implement the Initial Connection class * Add Unit Tests * Implement Dependency Injection for the Ingestion Framework * Fix Test * Fix Profile Test Connection * Add Injection to Metrics in Profiler * Add Injection to the Profiler * Fix UnitTests * Fix Pytests * Fix Tests * Fix types	2025-06-17 19:01:00 +02:00
harshsoni2024	0f79d8ea1d	MINOR: pytest opt out flaky test (#21800 ) * remove mlflow test until fixed * alationsink test count fixed * pylint fix gx	2025-06-17 14:23:28 +05:30
IceS2	49df5fc9de	MINOR: Implement dependency injection on ingestion (#21719 ) * Initial implementation for our Connection Class * Implement the Initial Connection class * Add Unit Tests * Implement Dependency Injection for the Ingestion Framework * Fix Test * Fix Profile Test Connection * Fix test, making the injection test run last * Update connections.py * Changed NewType to an AbstractClass to avoid linting issues * remove comment * Fix bug in service spec * Update PyTest version to avoid importlib.reader wrong import	2025-06-16 08:03:38 +02:00
Pere Menal-Ferrer	44e09e41a2	Revert "FIX #1464 (#21520 )" (#21726 ) This reverts commit 1e86f9870fd663122b9bbb64f3cf17cf32619c7f.	2025-06-13 17:27:32 +02:00
IceS2	891ff4184d	MINOR: Initial implementation for our Connection Class (#21581 ) * Initial implementation for our Connection Class * Implement the Initial Connection class * Add Unit Tests * Fix Test * Fix Profile Test Connection * Remove unit test * Remove comment * Fix tests and missing changes	2025-06-13 14:52:29 +02:00
harshsoni2024	6a6180b2e3	powerbi change owner condition (#21724 )	2025-06-12 16:11:43 +05:30
Suman Maharana	18f9f2cdb6	Fix: Tableau project id should always be a string (#21700 )	2025-06-12 11:21:53 +05:30
Teddy	c09a8b27ae	ISSUE #16676 - Add Tag to CreateTestCase (#21366 ) * refactor: removed testSuite field from CreateTestCase BREAKING CHANGE: when creating a test case, testsuite is now derived from entityLink (fetch or created) * feat: allow setting tags when creating a test case * style: ran linters * fix: compiling error * fix: failing test case * fix: failing tests * removed testSuite from required filed * fixed ui side * style: ran java linting * deprecation: remove testSuite param from ingestion * fix: remove test suite filed * fix: remove test_suite field --------- Co-authored-by: Shailesh Parmar <shailesh.parmar.webdev@gmail.com>	2025-06-11 09:59:08 +02:00
Suman Maharana	0df058a53d	Fix: dbtcloud CL errors (#21685 )	2025-06-10 21:45:07 +05:30
Pere Menal-Ferrer	1e86f9870f	FIX #1464 (#21520 ) * Add PIICategoryTags and some utilities on top of them. * Fix static-check * Add test for fqn representation * Add NEREntityGeneralTags.json from Collate * Add test to check PIICategoryTags agree with the ones used by OM server * Add LabelExtractor * Fix style * Add ignore superflous-parens for pylint * Ass comment as per PR review * Fix not-updated PII-IT * Remove duplicated IT test for PII --------- Co-authored-by: Pere Menal <pere.menal@getcollate.io> Co-authored-by: Sriharsha Chintalapani <harshach@users.noreply.github.com>	2025-06-09 16:05:35 -07:00
Keshav Mohta	b7a7023890	Fix #20665 : BigQuery - Adding billing project (#21231 )	2025-06-09 13:09:40 +05:30
Teddy	5078a2fbb9	DEPRECATION: Remove testCaseResults endpoint from testCaseResource (#21527 ) * deprecation: remove testCaseResults endpoint from testCaseResource * fix: path in test e2e test * fix: endpoint name to testCaseResults * style: fix java linting	2025-06-07 21:02:54 +02:00
IceS2	5b20b84546	MINOR: Add logic to handle WorkflowContext on Ingestion (#21425 ) * Add logic to handle WorkflowContext on Ingestion * Revert base.py changes * Removed comment * Fix basedpyright complaints * Make ContextManager automatically add its context to the PipelineStatus * Small changes	2025-06-03 17:35:08 +02:00
Suman Maharana	720c6d3f9f	Add: Looker explore to view Column Lineage (#21504 ) * Add: explore to view Column Lineage * Add tags ingestion and fix cll warnings * lint * Addressed comments * fixed tests	2025-06-03 20:23:43 +05:30
Pere Menal-Ferrer	6683c632f4	FIX #21464 (#21463 ) * Reproduce failing behaviour with non-date-time data * Add a presidio patch for DateTimes * Fix type-check error --------- Co-authored-by: Pere Menal <pere.menal@getcollate.io>	2025-05-30 08:18:50 +02:00
Teddy	2a120c166a	MINOR: Py failing test cases (#21437 ) * fix: failing test cases * fix: skip test for now	2025-05-28 17:52:32 +02:00
harshsoni2024	8bbc4d8c3d	MINOR: PBI dataset expressions empty value fix (#21409 )	2025-05-27 16:50:55 +05:30
Pere Menal-Ferrer	ca812852d6	ci/nox-setup-testing (#21377 ) * Make pytest to user code from src rather than from install package * Fix test_amundsen: missing None * Update pytest configuration to use importlib mode * Fix custom_basemodel_validation to check model_fields on type(values) to prevent noisy warnings * Refactor referencedByQueries validation to use field_validator as per deprecation warning * Update ColumnJson to use model_rebuild rather as replacement for forward reference updates as per deprecation warning * Move superset test to integration test as they are using testcontainers * Update coverage source path * Fix wrong import. * Add install_dev_env target to Makefile for development dependencies * Add test-unit as extra in setup.py * Modify dependencies in dev environment. * Ignore all airflow tests * Remove coverage in unit_ingestion_dev_env. Revert coverage source to prevent broken CI. * Add nox for running unit test * FIx PowerBI integration test to use pathlib for resource paths and not os.getcwd to prevent failures when not executed from the right path * Move test_helpers.py to unit test, as it is not an integration test. * Remove utils empty folder in integration tests * Refactor testcontainers configuration to avoid pitfalls with max_tries setting * Add nox unit testing basic setup * Add format check session * Refactor nox-unit and add plugins tests * Add GHA for py-nox-ci * Add comment to GHA * Restore conftest.py file * Clarify comment * Simplify function * Fix matrix startegy and nox mismatch * Improve python version strategy with nox and GHA --------- Co-authored-by: Pere Menal <pere.menal@getcollate.io>	2025-05-27 10:56:52 +02:00
Pere Menal-Ferrer	6ea630d7ef	DevEx: Ingestion development improvement (focus on unit testing) (#21362 ) * Fix test_amundsen: missing None * Fix custom_basemodel_validation to check model_fields on type(values) to prevent noisy warnings * Refactor referencedByQueries validation to use field_validator as per deprecation warning * Update ColumnJson to use model_rebuild rather as replacement for forward reference updates as per deprecation warning * Move superset test to integration test as they are using testcontainers * Add install_dev_env target to Makefile for development dependencies * Add test-unit as extra in setup.py * Skip failing IT test. Requires further investigation.	2025-05-26 10:38:17 +02:00

1 2 3 4 5 ...

1181 Commits