* chore: implement logger levels tests for deprecation
* fix: use METADATA_LOGGER instead of warnings
* use unit test syntax
* isort
* black
* fix test
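A minimal sketch of that pattern, assuming the metadata logger is looked up by name (the real `METADATA_LOGGER` constant and deprecation helpers live in the OpenMetadata codebase):

```python
import logging
import unittest

# Hypothetical logger name; OpenMetadata resolves it via its METADATA_LOGGER constant
METADATA_LOGGER = "metadata"
logger = logging.getLogger(METADATA_LOGGER)


def deprecated_option(name: str) -> None:
    # Emit deprecations through the metadata logger instead of warnings.warn()
    logger.warning("Option %s is deprecated and will be removed", name)


class DeprecationLogTest(unittest.TestCase):
    def test_deprecation_is_logged_at_warning_level(self):
        with self.assertLogs(METADATA_LOGGER, level=logging.WARNING) as ctx:
            deprecated_option("old_flag")
        self.assertIn("deprecated", ctx.output[0])


if __name__ == "__main__":
    unittest.main()
```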
---------
Co-authored-by: Sriharsha Chintalapani <harshach@users.noreply.github.com>
* Bump datamodel-code-generator to 0.34.0
* Pin down pydantic to <2.12
* Revert "Bump datamodel-code-generator to 0.34.0"
This reverts commit c69116d2935eea49e9c78b2607f2fea94bc44738.
* Update `TableDiffParamsSetter` to move data at table level
This means that `key_columns` and `extra_columns` will be defined per table instead of "globally", just like `data_diff` expects
* Update `TableDiffValidator` to use table's `key_columns`
Call `data_diff` and run validations using each table's `key_columns`
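A rough sketch of the per-table layout, assuming `data_diff`'s public Python API; the connection strings, table names, and columns below are illustrative:

```python
from data_diff import connect_to_table, diff_tables

# key_columns and extra_columns are now defined per table, which is the
# granularity data_diff expects, instead of one global setting
table_params = {
    "public.users": {"key_columns": ("id",), "extra_columns": ("email",)},
    "public.orders": {"key_columns": ("order_id",), "extra_columns": ()},
}

for table_name, params in table_params.items():
    source = connect_to_table(
        "postgresql://localhost/source_db", table_name, params["key_columns"]
    )
    target = connect_to_table(
        "postgresql://localhost/target_db", table_name, params["key_columns"]
    )
    # Each table is diffed with its own key columns
    for diff_row in diff_tables(source, target, extra_columns=params["extra_columns"]):
        print(table_name, *diff_row)
```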
* Create migration to update `tableDiff` test definition
* Fix Playwright test
* Add the migration classes and data for recognizers
This lets us run a migration that sets the `json->recognizers` field of the `PII.Sensitive` and `PII.NonSensitive` tags from JSON values.
The issue with normal migrations was that the recognizer values were too long to be persisted in the server migrations log.
Created a common `migration.utils.v1110.MigrationProcessBase`
* Ensure building automatically with the right parameters
* Update typescript types
* Ensure we take columns ordered from the sampler
This is to avoid analyzing a column with data that actually belongs to another column
* Remove expectation of address to have Sensitive tag
This is for a couple of reasons:
- First: per our internal definition it should actually be Non Sensitive.
- Second: presidio actually picks SOME of them up as PERSON (Sensitive) entities, but since we've raised the tolerance, they are no longer classified as Sensitive.
* Refactor presidio utils
Extract the spacy model functionality from the analyzer building function
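A sketch of the extraction, using presidio's public API; the function names are illustrative, not the actual utils:

```python
from presidio_analyzer import AnalyzerEngine
from presidio_analyzer.nlp_engine import NlpEngineProvider


def build_nlp_engine(model_name: str = "en_core_web_md"):
    # The spacy model loading now lives in its own function...
    configuration = {
        "nlp_engine_name": "spacy",
        "models": [{"lang_code": "en", "model_name": model_name}],
    }
    return NlpEngineProvider(nlp_configuration=configuration).create_engine()


def build_analyzer(nlp_engine) -> AnalyzerEngine:
    # ...so the analyzer builder only wires the pieces together
    return AnalyzerEngine(nlp_engine=nlp_engine, supported_languages=["en"])
```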
* Added a new `TagClassifier`
This classifier uses tags to dynamically build presidio `RecognizerRegistry`s
* Added a new `TagProcessor`
This processor uses `TagClassifier` to label a column based on the tags' recognizers
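A condensed sketch of how both pieces could fit together, using presidio's `PatternRecognizer`; the tag shape and names below are simplified stand-ins:

```python
from presidio_analyzer import AnalyzerEngine, Pattern, PatternRecognizer, RecognizerRegistry

# Hypothetical tag shape: each tag carries the recognizers stored in json->recognizers
tag_recognizers = {
    "PII.Sensitive": [("US_SSN_PATTERN", r"\b\d{3}-\d{2}-\d{4}\b", 0.6)],
}

# The TagClassifier part: build a RecognizerRegistry dynamically from the tags
registry = RecognizerRegistry()
for tag_fqn, patterns in tag_recognizers.items():
    for name, regex, score in patterns:
        registry.add_recognizer(
            PatternRecognizer(
                supported_entity=tag_fqn.replace(".", "_").upper(),
                patterns=[Pattern(name=name, regex=regex, score=score)],
            )
        )

# The TagProcessor part: run the analyzer over sampled column data
analyzer = AnalyzerEngine(registry=registry)
results = analyzer.analyze(text="SSN: 078-05-1120", language="en")
```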
* Create `TagProcessor` based on workflow configuration
* Create decorator to apply threshold to recognizers
This is so that we can apply thresholds on recognizer results without subclassing or having to keep a map between the presidio recognizer and the recognizer configuration
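A minimal sketch of such a decorator, assuming presidio's `EntityRecognizer.analyze` contract; the wiring of thresholds to configuration is illustrative:

```python
from presidio_analyzer import EntityRecognizer


def with_threshold(recognizer: EntityRecognizer, threshold: float) -> EntityRecognizer:
    """Filter a recognizer's results without subclassing it or keeping a
    separate map between the presidio recognizer and its configuration."""
    original_analyze = recognizer.analyze

    def analyze(text, entities, nlp_artifacts=None):
        results = original_analyze(text, entities, nlp_artifacts)
        # Drop results below the threshold configured for this recognizer
        return [result for result in results if result.score >= threshold]

    recognizer.analyze = analyze
    return recognizer
```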
* Fix broken test
* Add `reason` property to `TagLabel`
This is to understand what score was used for selecting the entity
* Build `TagLabel`s with `reason`
* Increase `PIIProcessor._tolerance`
This is so we correctly filter out low scores from classifiers while still keeping the normalization that filters out ambiguous outcomes.
E.g., an output with scores 0.3, 0.7 and 0.75 would first have the 0.3 filtered out, and the remaining two would then be discarded because they are both relatively high results.
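A worked example of that filtering; the numbers come from the commit message, while the helper and the 0.1 closeness window are a hypothetical reading of the logic:

```python
def filter_scores(scores: list[float], tolerance: float = 0.6) -> list[float]:
    # Step 1: drop anything below the tolerance (the 0.3 here)
    candidates = [score for score in scores if score >= tolerance]
    # Step 2: if the survivors are all relatively high, the outcome is
    # ambiguous, so discard them all rather than pick one arbitrarily
    if len(candidates) > 1 and max(candidates) - min(candidates) < 0.1:
        return []
    return candidates


print(filter_scores([0.3, 0.7, 0.75]))  # [] -> both high scores are discarded
```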
* Make database and DAO changes needed to persist `TagLabel.reason`
* Update generated TypeScript types
---------
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
* Add support for translations in multi lang
* Add Tag Feedback System
* Update generated TypeScript types
* Fix typing issues and add tests to recognizer factory
* Updated `TagResourceTest.assertFieldChange` to fix broken test
This is because the change description values had been serialized into strings and, for some reason, the keys ended up in a different order. So instead of performing a String comparison, we perform a JSON comparison
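The test itself is Java, but the idea maps directly; a Python sketch of comparing parsed JSON instead of raw strings:

```python
import json

# Same content, different key order after serialization
expected = '{"name": "tag", "reason": "score"}'
actual = '{"reason": "score", "name": "tag"}'

assert expected != actual                          # string comparison is order-sensitive
assert json.loads(expected) == json.loads(actual)  # JSON comparison is not
```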
---------
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: Eugenio Doñaque <eugenio.donaque@getcollate.io>
* Update test data for `tests.integration.trino`
This is to create tables with complex data types.
Using raw SQL because creating the tables with pandas didn't produce the right types for the structs
* Update tests to reproduce the issue
Also included the new tables in the other tests to make sure complex data types do not break anything else
Reference: [issue 16983](https://github.com/open-metadata/OpenMetadata/issues/16983)
* Added `TypeDecorator`s to handle `trino.types.NamedRowTuple`
This is because Pydantic couldn't figure out how to create Python objects when receiving `NamedRowTuple`s, which broke the sampling process.
This makes sure the data we receive from the Trino interface is compatible with Pydantic
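A sketch of the decorator approach, assuming SQLAlchemy's `TypeDecorator` hook; how field names are read off `NamedRowTuple` depends on the trino client version, so that detail is illustrative:

```python
from sqlalchemy.types import String, TypeDecorator

try:
    from trino.types import NamedRowTuple
except ImportError:  # keep the sketch importable without the trino client
    NamedRowTuple = tuple


class PydanticSafeRow(TypeDecorator):
    """Convert NamedRowTuple results into plain Python structures."""

    impl = String
    cache_ok = True

    def process_result_value(self, value, dialect):
        if isinstance(value, NamedRowTuple):
            # Plain tuples (or dicts, where names are available) validate fine
            return tuple(
                self.process_result_value(item, dialect) for item in value
            )
        return value
```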
* feat: databricks oauth and azure ad auth setup
* refactor: add auth type changes in databricks.md
* fix: test after oauth changes
* refactor: unity catalog connection to databricks connection code