OpenMetadata

mirror of https://github.com/open-metadata/OpenMetadata.git synced 2025-10-03 04:46:27 +00:00

Author	SHA1	Message	Date
Mayur Singal	7760663b22	MINOR: Change ingestion licence header (#20549 )	2025-04-03 10:39:47 +05:30
Pere Miquel Brull	467534c088	Revert "Fixes 20063: Allowing to trigger an app with a payload (#20056 )" (#20155 ) This reverts commit 9642c7353813be7a450bba4148d5b4eb751b1c33.	2025-03-10 06:59:29 +01:00
IceS2	9642c73538	Fixes 20063: Allowing to trigger an app with a payload (#20056 ) * Allowing to trigger an app with a payload * added option to pass custom payload to triggered job * feat(apps): add custom config to trigger - added custom config for app triggers * added handling for configuration validation on apps * - removed `oneOf` from app configuration scheme - generated ts files for apps * added CONFIG_OVERRIDE_KEY * format * fixed app config key * format * format * fixed python app runner to work with new schema --------- Co-authored-by: sushi30 <imri.paran@gmail.com>	2025-03-07 21:17:05 +05:30
Imri Paran	97fad806a2	Fixes 19755: Publish app config with status (#19754 ) * feat(app): add config to status add config to the reported status of the ingestion pipeline * added separate pipeline service client call for external apps * fix masking of pydantic model * - overload model_dump to mask secrets instead of a separate method - moved tests to test_custom_pydantic.py * fix: execution time * fix: mask secrets in dump json * fix: for python3.8 * fix: for python3.8 * fix: use mask_secrets=False when dumping a model for create * format * fix: update mask_secrets=False for workflow configurations * fix: use context directly when using model_dump_json * fix: default behavior when dumping json * format * fixed tests	2025-02-25 16:51:49 +00:00
Keshav Mohta	7bea4f957f	Feature: Docker Host Retry (#19127 )	2025-01-14 19:48:10 +05:30
Pere Miquel Brull	e56f477a4a	Fix #19147 - Executable Test Suites (#19221 ) * backend * format & tests * rename backend * migrations and ingestion * format & tests * format & tests * tests * format & tests * tests * updated ui side of changes * addressing comment * fixed failing unit test * fix test list * added e2e test, and fixed existing test --------- Co-authored-by: Shailesh Parmar <shailesh.parmar.webdev@gmail.com>	2025-01-07 17:59:54 +01:00
Pere Miquel Brull	7aacfe032c	MINOR - FQN encoding in ometa_api, TestSuite pipeline creation & serialization of test case results (#18877 ) * DOCS - Update ES config * MINOR - Add missing FQN encoding & force types * MINOR - Add missing FQN encoding & force types * format * fix tests	2024-12-02 17:17:21 +01:00
Pere Miquel Brull	c68a45e7d8	Create new Auto Classification Workflow (#18610 )	2024-11-19 08:10:45 +01:00
Imri Paran	95982b9395	[GEN-356] Use ServiceSpec for loading sources based on connectors (#18322 ) * ref(profiler): use di for system profile - use source classes that can be overridden in system profiles - use a manifest class instead of factory to specify which class to resolve for connectors - example usage can be seen in redshift and snowflake * - added manifests for all custom profilers - used super() dependency injection in order for system metrics source - formatting * - implement spec for all source types - added docs for the new specification - added some pylint ignores in the importer module * remove TYPE_CHECKING in core.py * - deleted valuedispatch function - deleted get_system_metrics_by_dialect - implemented BigQueryProfiler with a system metrics source - moved import_source_class to BaseSpec * - removed tests related to the profiler factory * - reverted start_time - removed DML_STAT_TO_DML_STATEMENT_MAPPING - removed unused logger * - reverted start_time - removed DML_STAT_TO_DML_STATEMENT_MAPPING - removed unused logger * fixed tests * format * bigquery system profile e2e tests * fixed module docstring * - removed import_side_effects from redshift. we still use it in postgres for the orm conversion maps. - removed leftover methods * - tests for BaseSpec - moved get_class_path to importer * - moved constructors around to get rid of useless kwargs * - changed test_system_metric * - added linage and usage to service_spec - fixed postgres native lineage test * add comments on collaborative constructors	2024-10-24 07:47:50 +02:00
Pere Miquel Brull	c2929e67e6	MINOR - Return TestConnectionResult from test_connection_fn (#18320 ) * MINOR - Return TestConnectionResult from test_connection fn * MINOR - Return TestConnectionResult from test_connection fn	2024-10-18 09:54:07 +02:00
Pere Miquel Brull	5074f6588f	MINOR - Validate app runner init (#18316 )	2024-10-18 09:40:06 +02:00
Pere Miquel Brull	7012e73d75	GEN-1166 - Improve Ingestion Workflow Error Summary (#18280 ) * GEN-1166 - Improve Ingestion Workflow Error Summary * fix test * docs * comments	2024-10-16 18:15:50 +02:00
Pere Miquel Brull	e820cb0450	GEN-1410 - Improve PII Logging information (#17835 )	2024-09-16 07:05:30 +02:00
Imri Paran	59854de58d	[GEN-1406]: import plugin and clear plugin message for failed source import (#17788 ) * support side effects on source classes by always importing source class * streamlined error message * fixed service type extraction for test suite pipeline * - replaced "custom" with constant - added quotes for the plugin exception for copy/paste ergonomics	2024-09-12 16:13:03 +02:00
Pere Miquel Brull	2237d5a8d5	MINOR - PII Scanner tests and log levels (#17686 ) * MINOR - PII Scanner tests and log levels * MINOR - PII Scanner tests and log levels	2024-09-04 12:11:07 +02:00
Mayur Singal	ef67474310	Data Insights Refactor (#16532 ) --------- Co-authored-by: Shailesh Parmar <shailesh.parmar.webdev@gmail.com> Co-authored-by: Pablo Takara <pjt1991@gmail.com> Co-authored-by: Chira Madlani <chirag@getcollate.io> Co-authored-by: Chirag Madlani <12962843+chirag-madlani@users.noreply.github.com> Co-authored-by: Pere Miquel Brull <peremiquelbrull@gmail.com>	2024-07-30 17:38:43 +05:30
IceS2	c522f14178	MINOR: Refactor output_handlers to a WorkflowOutputHandler class (#17149 ) * Refactor output_handlers to a WorkflowOutputHandler class * Add old methods as deprecated to avoid breaking changes * Extract WorkflowInitErrorHandler from workflow_output_handler * Fix static checks * Fix tests * Fix tests * Update code based on comments from PR * Update comment	2024-07-29 09:20:34 +02:00
Prajwal214	ff8d101d9b	Doc: Updating Profiler Workflow Docs URL (#17140 ) Co-authored-by: Prajwal Pandit <prajwalpandit@Prajwals-MacBook-Air.local>	2024-07-23 15:18:03 +05:30
Pere Miquel Brull	2aef457785	FIX #16481 - Truncate ingestion pipeline status (#16997 ) * FIX #16481 - Truncate ingestion pipeline status * FIX #16481 - Truncate ingestion pipeline status * FIX #16481 - Truncate ingestion pipeline status	2024-07-12 09:44:21 +02:00
Mayur Singal	7359d6210c	MINOR: Fix Profiler for SSL Enabled Source (#16613 )	2024-06-12 11:40:30 +05:30
Pere Miquel Brull	cb72a22b59	Fix - e2e tests for pydantic V2 (#16551 ) * Fix - e2e tests for pydantic V2 * add correct default * add correct default * revert datetime aware * revert datetime aware * revert datetime aware * revert datetime aware * revert datetime aware * revert datetime aware * revert datetime aware * revert datetime aware * fix apis * format	2024-06-06 19:36:17 -07:00
Pere Miquel Brull	d8e2187980	#15243 - Pydantic V2 & Airflow 2.9 (#16480 ) * pydantic v2 * pydanticv2 * fix parser * fix annotated * fix model dumping * mysql ingestion * clean root models * clean root models * bump airflow * bump airflow * bump airflow * optionals * optionals * optionals * jdk * airflow migrate * fab provider * fab provider * fab provider * some more fixes * fixing tests and imports * model_dump and model_validate * model_dump and model_validate * model_dump and model_validate * union * pylint * pylint * integration tests * fix CostAnalysisReportData * integration tests * tests * missing defaults * missing defaults	2024-06-05 21:18:37 +02:00
Imri Paran	d5bf30ccd3	MINOR: trino integration test (#16291 ) * added trino integration test * - removed warnings for classes which are not real tests - removed "helpers" as its being used * use a docker network instead of host * print logs for hive failure * removed superset unit tests * try pinning requests for test * try pinning requests for test * wait for hive to be ready * fix trino fixture * - reduced testcontainers_config.max_tries to 5 - remove intermediate containers * print with logs * disable capture logging * updated db host * removed debug stuff * removed debug stuff * removed version pin for requests * reverted superset * ignore trino integration on python 3.8	2024-05-22 15:12:00 +00:00
Pere Miquel Brull	78324bd17c	MINOR - Set pipeline status early (#16077 )	2024-04-30 09:22:04 +02:00
Imri Paran	93ec391f5c	MINOR: Dynamodb sample data (#15264 ) * feat(nosql-profiler): row count 1. Implemented the NoSQLProfilerInterface as an entrypoint for the nosql profiler. 2. Added the NoSQLMetric as an abstract class. 3. Implemented the interface for the MongoDB database source. 4. Implemented an e2e test using testcontainers. * added profiler support for mongodb connection * doc * use int_admin_ometa in test setup * - fixed linting issue in gx - removed unused inheritance * moved the nosql function into the metric class * feat(profiler): add dynamodb row count * feat(profiler): add dynamodb row count * formatting * validate_compose: raise exception for bad status code. * fixed import * format * feat(nosql-profiler): added sample data 1. Implemented the NoSQL sampler. 2. Some naming changes to the NoSQL adaptor to avoid fixing names with the profiler interface. 3. Tests. * added default sample limit * formatting * fixed import * feat(profiler): dynamodb sample data * tests for dynamo db sample data * format * format * use service connection for nosql adaptor factory * fixed tests * format * fixed after merge	2024-04-22 17:46:40 +02:00
Mayur Singal	18da8a5964	Fix #15208 : Lineage - log query parsing issue summary (#15921 )	2024-04-18 19:07:51 +05:30
IceS2	e7c9d6aa7f	FIXES 15215: Implement initial Multithreading approach for the Metadata Ingestion on Databases (#15130 ) * Implement Initial MultiThread suggestion * Update all the ingestion sources to use the new ContextManager * Fix missing wraps on decorator * Fix Unittests * Fix linters * Fix linters * Fix BigQuery UnitTests * Add UnitTests to the newly created code * Fix unittest * change the threads from table to schemas * Update README.md * Small change suggested by Sonar * Slight change to test a different way to multithread over tables * Debug changes * More multithread tests * Remove uneeded wait time * Testing * refactor code based on removal of time.sleep * Fix wrong paste * Improve ExecutionTimeContextManager * Fix missing .get() and unit tests * Fix conflicting changes * Update Multithread logic with the incremental extraction * Fix linters * Fix unittest * Remove commented code * Fix Unittests * Fix checkstyle * Change default to threads = 1	2024-03-25 18:20:40 +01:00
Pere Miquel Brull	b778bc7968	#14943 - Check tags before PII processor (#15622 )	2024-03-21 14:15:28 +05:30
IceS2	51e3d7a466	FIXES 15215: First draft implementation on extracting metadata incrementally. Done for Snowflake, BigQuery and Redshift (#15201 ) * Initial incremental implementation for snowflake * Initial unit test refactor for snowflake * Fix linter complaints * Propagate change on abstract create method * Add missing argument to create * Polish Snowflake incremental extraction * Fix linters and make enabled required * Initial proposal for incremental bigquery extraction * BigQuery incremental tests * Remove debugging override * Fix linters * Remove unused query * Initial Redshift Incremental Extraction * Add Incremental Extraction documentation * Move the default to False * Improve code based on sonarcloud input * Apply suggestions * Fix wrong path * Change timestamp to be time aware as per sonar * Move documentation to 1.4 * Move documentation to 1.4 * Fix linters --------- Co-authored-by: Pere Miquel Brull <peremiquelbrull@gmail.com>	2024-03-15 14:00:49 +01:00
Sriharsha Chintalapani	8af194193a	Add stack trace while throwing an error to debug (#15522 )	2024-03-12 10:07:46 -07:00
IceS2	7b20ed2f34	FIXES 15215: Add execution time tracker for ingestion (#15013 ) * Initial idea on how to track execution times * Fix linters * Add missing decorators on the API Client * Improve where the output is being handled	2024-02-16 09:58:20 +01:00
Teddy	9a4a9df836	Fix #14895 - Get Metadata from Parquet Schema (#14956 ) * linting: fix python linting * fix: get column types from parquet schema for parquet files * style: python linting * fix: remove displayType check in test as variation depending on OS	2024-02-01 09:02:52 +01:00
Sriharsha Chintalapani	2e95fcb98d	Fix #14786 : Suggestions API (#14821 ) * Fix #14786: Suggestions API * Handle suggestions in ometa * Minor: Optimise Databricks Client (#14776) * MINOR - Fix SP topology context & Looker usage context (#14816) * MINOR - Fix SP topology context & Looker usage context * MINOR - Fix SP topology context & Looker usage context * Fix tests * Fixes #14598: Fix Tags / Labels ingestion on includeTags as False (#14782) * fix(ui): password error message for char limits (#14808) * fix(ui): password error message for char limits * fix java side code * Fixes #13556: Support for Salesforce table description ingestion (#14733) * ISSUE-13556: Add suport for Salesforce table description ingestion * ISSUE-13556: Remove unnecessary blank line * ISSUE-13556: Fix to get description for each table --------- Co-authored-by: Teddy <teddy.crepineau@gmail.com> * MINOR - Better handling of Ingestion Pipeline Status (#14792) * MINOR - Better handling of Ingestion Pipeline Status * format * format * MINOR: Added table validation for cost analysis data (#14793) * Added validation for cost analysis source * centralized life cycle logic * CYPRESS: simplify side navigation click in cypress (#14818) * simplify side navigation click in cypress * make sidbar item uses common enum * fix cypress failure of outside import * fix(#14326): tier dropdown is not working in advance search (#14780) * improvement in advance search based on custom property * fix a reading undefined property issue * wip: advance search based on tier * some code cleanup and improvement * some fixes * fix: ui flicker when advanceSearched is apply and refresh the page * some cleanup * no need to call customproperty api call, if entity not suppport customProperties * minor change * fix: autocomplete not working in tier search option in advance search modal * added unit test for advance search provider component * some cleanup * added testcase for open modal * added testcase for resetAllFilters method * removed unwanted code * added e2e test for testing tier advance search * fix: e2e search flow for single field * fix: string field not working after giving listValues in TierSearch * fix: group query e2e test fix * used asyncFetch way to get the tierOptions synchronously * some cleanup * remove unwanted lines * some cleanup * fix: selected option show option value instead of option title * fix(minor): update skip icon for executions (#14809) * Fixes #14803: ignore capitalization when confirming deletes (#14804) * ignore case when confirming deletes * Test confirmation of deletes works when case differs Added test case for 'delete' as the confirmation text. * minor(config): update openmetadata-ui code reviewers (#14823) * Add Tests * Add list/accept/reject apis * initial ui changes * localisation * show suggestion for empty description * ui feedbacks * Fix permission check for entities without owner * Fix entityLink and add tests * Add update suggestion WIP * Fix test * Fix PUT and Pagination * Fix styling * update test * Update status * add OM server connection in apps * add permissions check * Fix CI * Remove TODO * Fix feedResourceTest * fix unit tests * add private configs for apps * add private configs for apps * fix update application icons * minor center align icon * add private configs for apps * Format * Fix pydantic gen * Remove token * Update name * Rework private conf * Fix apps * Fix apps * Format * Format * show metapilot only if its installed --------- Co-authored-by: Pere Miquel Brull <peremiquelbrull@gmail.com> Co-authored-by: Mayur Singal <39544459+ulixius9@users.noreply.github.com> Co-authored-by: Ayush Shah <ayush@getcollate.io> Co-authored-by: Chirag Madlani <12962843+chirag-madlani@users.noreply.github.com> Co-authored-by: kwgdaig <18678754+kwgdaig@users.noreply.github.com> Co-authored-by: Teddy <teddy.crepineau@gmail.com> Co-authored-by: Onkar Ravgan <onkar.10r@gmail.com> Co-authored-by: Ashish Gupta <ashish@getcollate.io> Co-authored-by: Abhishek Porwal <80886271+Abhishek332@users.noreply.github.com> Co-authored-by: Carlo Q <carlo@machina.bio> Co-authored-by: karanh37 <karanh37@gmail.com>	2024-01-31 18:51:09 -08:00
Pere Miquel Brull	8f2c2b21c2	MINOR - Better handling of Ingestion Pipeline Status (#14792 ) * MINOR - Better handling of Ingestion Pipeline Status * format * format	2024-01-23 09:24:19 +01:00
Onkar Ravgan	13b326d11c	Fix 14040: Part 3 Added updated field to status for patched entities (#14708 ) * Added updated field to status * Fixed e2e test	2024-01-16 17:53:05 +05:30
Pere Miquel Brull	e8eb2bda3c	MINOR - Hide AutoTagger, add MetaPilot search, and fix apps status calls (#14201 ) * MINOR - Hide AutoTagger & MetaPilot search & Fix apps status calls * Fix method call * Fix method call * Fix method call * "additionalProperties": true * Use EntityRef * Use ES sink * Update UI schema type	2024-01-08 14:57:07 +01:00
Mayur Singal	d82770a91e	Fix #14421 : Airflow - Handle pipeline status when dag killed (#14562 )	2024-01-05 18:14:26 +05:30
Pere Miquel Brull	b786064bc2	#11857 - Store workflow status in the Ingestion Pipeline Status (#14462 ) * Register StackTraceError in spec * Register StackTraceError in spec * Register StackTraceError in spec * Add todos * Update status * docs * format * Fix tests * Fix tests * Fix tests * Ignore generated * Fix tests * Fix tests * Tests * Try constants * Try constants * Print * Print * Print * order * Fix service name * fix ui error --------- Co-authored-by: Chirag Madlani <12962843+chirag-madlani@users.noreply.github.com>	2023-12-22 15:43:50 +01:00
karanh37	853a83f8b9	fix: allow config update for external app (#14034 ) * fix: allow config update for external app * Prep config * Add configs * fix: call configure app on saving of configurations * fix: add autocomplete widget for json schema forms * ORganize configuration handling * Allow config * Format * Fix test * Prep migrations for allowConfiguration * TODO * Add pg migrations * fix: add allowConfiguration support * Autotagger * Update install flow --------- Co-authored-by: Pere Miquel Brull <peremiquelbrull@gmail.com>	2023-11-27 16:55:33 -08:00
Pere Miquel Brull	6fb3f12675	App Resource - Add /configure endpoint & WAII app definition (#13983 ) * App Configuration * TODO * Configure App with OM server info * Remove init * Rename configs * Remove extra source	2023-11-17 09:28:11 +01:00
Mayur Singal	b34111ea00	ElasticSearch: Fix ES connection (#13919 )	2023-11-13 14:10:11 +05:30
Pere Miquel Brull	c742835766	Auto Tagger Application - Preparing the Ingestion Framework (#13862 ) * Prepare the skeleton for generic app registration * Prepare the skeleton for generic app registration * Handle app runner * Prepare the skeleton for generic app registration * Prepare the skeleton for generic app registration * Allow deployment * Fix PII APP * Fix lint * Fix PII APP * Fix PII APP * Prepare config-based external apps * Prepare config-based external apps * Fix lint * Prepare config-based external apps * Fix DI errors * Amend comments	2023-11-13 08:58:38 +01:00
Mayur Singal	4c19bb5a1d	Add support for external profiler workflow (#13887 ) * Add support for external profiler workflow * pylint * resolve comments * resolve comments * pyformat * fix code smell	2023-11-09 18:50:47 +05:30
Pere Miquel Brull	c6297b9cdf	Fix status handling for new services (#13707 )	2023-10-25 20:48:14 +02:00
Pere Miquel Brull	899cd7e1fe	Fix DQ Workflow (#13631 ) * Fix DQ Workflow * Fix DQ Workflow	2023-10-18 11:49:38 +02:00
Pere Miquel Brull	d3da2d1b9f	Register Ingestion pipelines just from YAML (#13501 ) * Register Ingestion pipelines just from YAML * Format	2023-10-10 07:04:04 +02:00
Pere Miquel Brull	f5e10c4a5f	Fix #7272 - BaseWorkflow docs and cleanup (#13471 ) * DQ BaseWorkflow * Test suite runner * test Suite workflow * Refactor DQ for BaseWorkflow * Lint * Fix source * Fix source * Fix source * Fix source * Fix test * Prepare docs * Clean sink * Clean legacy classes * typo * ProcessorStatus	2023-10-09 07:05:05 +02:00
Pere Miquel Brull	aed9e3875f	DQ base workflow (#13454 ) * DQ BaseWorkflow * Test suite runner * test Suite workflow * Refactor DQ for BaseWorkflow * Lint * Fix source * Fix source * Fix source * Fix source * Fix test * Fix test * Fix test	2023-10-06 18:29:18 +02:00
Pere Miquel Brull	0282574bdd	Create ometa client once and pass it around & improve pycln config (#13310 ) * Create ometa client once and pass it around & improve pycln config * Fix * Fix * Fix tests * Fix maven ci * Fix tests * Fix tests * Fix tests * Format * Fix DI	2023-10-04 09:14:03 +02:00
Teddy	6ca71ae323	Issue 12679 - Handle Entity Object Instantiation Error + Refactor Workflow (#13384 ) * feat: updated DI workflow to inherit from BaseWorkflow + split processor and producer classes * feat: __init__.py files creation * feat: updated workflow import classes in code and doc * feat: moved kpi runner from runner to processor folder * fix: skip failure on list entities * feat: deleted unused files * feat: updated status reporter * feat: ran linting * feat: fix test error with typing and fqn * feat: updated test dependencies * feat: ran linting * feat: move execution order up * feat: updated cost analysis report to align with new workflow * feat: fix entity already exists for pipeline entity status * feat: ran python linting * feat: move skip_on_failure to method * feat: ran linting --------- Co-authored-by: Sriharsha Chintalapani <harshach@users.noreply.github.com>	2023-10-02 12:05:30 +02:00

1 2

55 Commits