OpenMetadata

mirror of https://github.com/open-metadata/OpenMetadata.git synced 2025-07-23 09:22:18 +00:00

Author	SHA1	Message	Date
Teddy	610322ffed	MINOR - MSSQL timestamp type profiler fix (#18935 ) * fix: mssql timestamp processing * fix: min/max test type on datetime column * style: fix python format	2024-12-06 08:03:42 +01:00
Teddy	03bd8e9dc4	FEAT: added TABLESAMPLE for MSSQL (#18926 ) * feat: added TABLESAMPLE for sqlserver * fix: class name * test: added test to generated sample query	2024-12-05 14:17:39 +01:00
Ayush Shah	8664c8df75	Fixes GEN-2199: Allow Fivetran filtering of pipelines using name instead of id (#18929 )	2024-12-05 10:55:11 +05:30
Teddy	ac2f6d7132	MINOR - Fix sqa table reference (#18839 ) * fix: sqa table reference * style: ran python linting * fix: added raw dataset to query runner * fix: get table and schema name from orm object * fix: get table level config for table tests	2024-11-28 18:49:11 +01:00
mgorsk1	da176767a8	feat: add dbt freshness check test (#18730 ) * add dbt freshness check * docs * run linting * add test case param definition * fix test case param definition * add config for dbt http, fix linting * refactor (only create freshness test definition when user executed one) * fix dbt files class * fix dbt files class 2 * fix dbt objects class * fix linting * fix pylint * fix linting once and for all --------- Co-authored-by: Teddy <teddy.crepineau@gmail.com>	2024-11-28 18:30:11 +01:00
harshsoni2024	cb33f274fc	Connector: rename microstrategy connector (#18604 )	2024-11-28 18:50:42 +05:30
Suman Maharana	9a21e77e15	Added dbt cloud multi projects and jobs filter (#18801 ) * Added dbt cloud multi project and jobs filter * added tests * change to array type * updated yaml config * added migrations	2024-11-28 16:10:34 +05:30
Imri Paran	cd74d8f55a	MINOR: ref(data-quality): modularized test case validator import (#18716 ) * ref(data-quality): modularized test case validator import - removed test_suite_factory - implemented TestCaseImporter - removed SQAValidatorBuilder and PandasValidatorBuilder in favor of a SourceType enum - removed the orm table creation from test suite source * format * IValidatorBuilder -> ValidatorBuilder * use the table from the sampler in the test suite interface * linting * fixed the profiler with similar solution * removed unused inheritance * removed unneeded super().__init__() * removed all instances of orm_table * fixed tests * add reportExplicitAny=false * fixed tests	2024-11-27 16:25:12 +01:00
Teddy	58699063db	MINOR -- Fix DQ Partition Issue (#18641 ) * fix: renamed `random_sample` to `get_dataset` and change dunder method access for SQA Table object * fix: removed handle_partition decorator * fix: fixed DQ partition issue + moved to `tablesample` method * style: ran python linting * style: fix python format check issues * feat: added postgres tablesample * style: ran python linting * fix: sampling delta * fix: merge conflicts * fix: resolved conflicts * style: ran python linting * fix: patch orm call in test case * fix: mock build_table_orm call in tests * fix: test case failures and errors * fix: removed unused import * fix: patch typo * fix: trino table schema retrieval * fix: remove tuple context manager for 3.8 test support	2024-11-27 08:50:54 +01:00
Imri Paran	ee7d043035	[GEN-2109] feat(mongo): added ssl support (#18731 ) * feat(mongo): added ssl support Added SSL support for MongoDB using the SSL manager. Attached a video demo. - [Example repository for setting up mongodb with SSL](https://github.com/sushi30/mongodb-docker-ssl-example) - [MongoDB TLS documentation](https://www.mongodb.com/docs/manual/tutorial/configure-ssl/) * fixed test_doris.py	2024-11-22 08:54:13 -08:00
Ayush Shah	17ffdf9850	fix: modify fqn to allow quotes with dots (#18719 )	2024-11-22 09:33:50 +05:30
Pere Miquel Brull	c68a45e7d8	Create new Auto Classification Workflow (#18610 )	2024-11-19 08:10:45 +01:00
Ayush Shah	6f1df37ba1	Fixes GEN-1260: Add Validators while creating table to escape special characters (#18456 )	2024-11-18 15:02:57 +05:30
Sriharsha Chintalapani	88c8fb48f3	Add Edit glossary terms, Edit Tier , Edit Tags as separate permissions (#18331 ) * Add EditGlossaryTerms Permission * Fix #18330: Add EDIT_GLOSSARY_TERM permission and enforce EDIT_TIER permisson * add edit glossary term permission check in UI * revert EDIT_GLOSSARY_TERMS operation * Add EDIT_GLOSSARY_TERMS to common operations * Add EDIT_TIER to common operations * add default empty array for tags field, as patch calls can run into issues * Fix tests * Fix tests * added glossary terms * fix conflicts * fix permission check for data model * Add EditGlossaryTerms to DataConsumerPolicy * Add EditGlossaryTerms,EditTier to DataConsumerPolicy * fix tests * Fix migrations for EditTier,EditGlossaryTerms * add edit tier permission to data consumer * Fix tests * fix pytests * missing test_dbt.py --------- Co-authored-by: karanh37 <karanh37@gmail.com> Co-authored-by: Karan Hotchandani <33024356+karanh37@users.noreply.github.com> Co-authored-by: ulixius9 <mayursingal9@gmail.com>	2024-11-15 10:50:15 -08:00
mgorsk1	3d2dfeb583	feat: use native trino client authentication classes (#16196 ) --------- Co-authored-by: ulixius9 <mayursingal9@gmail.com>	2024-11-15 12:54:42 +05:30
Imri Paran	bde6ee4125	MINOR: Data diff sample fix (#18632 ) * fix(data-diff): sampling configuration handle the sampling condition separately for the 2 tables allowing to apply sampling on columns with mismatching cases * format	2024-11-15 08:22:13 +01:00
Akash Verma	fa30be0589	fix #17726 Databricks schema name with hyphen issue (#18598 )	2024-11-14 20:02:20 +05:30
Teddy	45d27a377d	GEN 1184 - Added Workflow Classification and Metric LevelConfig (#18572 )	2024-11-11 15:59:42 +01:00
Imri Paran	cdaa5c10af	[GEN-1996] feat(data-quality): use sampling config in data diff (#18532 ) * feat(data-quality): use sampling config in data diff - get the table profiling config - use hashing to sample deterministically the same ids from each table - use dirty-equals to assert results of stochastic processes * - reverted missing md5 - added missing database service type * - use a custom substr sql function * fixed nounce * added failure for mssql with sampling because it requires a larger change in the data-diff library * fixed unit tests * updated range for sampling	2024-11-11 10:07:23 +01:00
Mayur Singal	efed932d97	Mask SQL Queries in Usage & Lineage Workflow (#18565 )	2024-11-11 11:44:47 +05:30
Imri Paran	b92b950060	Fix 18434: feat(statistics-profiler): use statistics tables to profile trino tables (#18433 ) * feat(statistics-profiler): use statistics tables to profile trino tables - implemented the collaborative root class - added the "useStatistics" profiler parameter - added the "supportsStatistics" database connection property - implemented the ProfilerWithStatistics and StoredStatisticsSource to add this functionality to specific profilers - implemented TrinoStoredStatisticsSource for specific trino statistics logic * added ABC to terminal classes in collaborative root * fixed docstring for TestSuiteInterface * reverted unintended changes * typo	2024-11-07 18:37:31 +01:00
Teddy	d579008c99	GEN 1683 - Add Column Value to be At Expected Location Test (#18524 ) * feat: added column value to be in expected location test * fix: renamed value -> values * doc: added 1.6 documentatio entry * style: ran python linting * fix: move data packaging to pyproject.yaml * fix: add init file back for data package * fix: failing test case	2024-11-06 11:17:13 +01:00
IceS2	dccba20101	Return s3 endpoint as str() instead of Url (#18521 )	2024-11-05 17:39:50 +00:00
Katarzyna Kałek	47c75fe6a7	Enhanced Glue ingestion with external table features (#18511 ) * added fileFormat, locationPath and external table lineage to Glue ingestion * Improve Lineage Label --------- Co-authored-by: Katarzyna Kałek <kkalek@olx.pl> Co-authored-by: Mayur Singal <39544459+ulixius9@users.noreply.github.com>	2024-11-05 21:48:20 +05:30
Nicola Coretti	7ebc62dca7	feat: Add support for exasol datasource (#17166 ) * Add flake.nix * Add lockfile for flake * Update nix environment and document usage * Add schema for exasol connector * Add Exasol definitions to databaseService * Fix error in exasol connector schema * Add additional connection options/settings to exasol connector * Add exasol-connector to ui * Add depdencies for exasol-connector * Update notes * Update ingestion code * Add Basic Documentation for Exasol Connector * Update flake file * Add developer notes * Add python script which can be used as entry point for debugging in ide * Add config file which can be used for debugging (manual execution) * Update debug script * Update developer notes * Remove old developer notes * Add .venv to gitignore * Update dev notes * Update development notes * Update ExasolSource * Establish basic connection to Exasol DB from connector * Update exasol connector connection settings * Add service_spec for exasol plugin * Remove development files * Remove unused module * Applied code formatter * Update exasol dependency constraint(s) * Add unit test for exasol connection url(s) * Fixed test expectations for exasol connection url test(s) * Adjust the test query for the Exasol connection test	2024-10-31 08:11:30 +01:00
Mayur Singal	9d91325af8	Lineage-1: Move view lineage processing to lineage workflow (#18220 ) Co-authored-by: Sachin Chaurasiya <sachinchaurasiyachotey87@gmail.com>	2024-10-28 18:18:22 +05:30
Imri Paran	95982b9395	[GEN-356] Use ServiceSpec for loading sources based on connectors (#18322 ) * ref(profiler): use di for system profile - use source classes that can be overridden in system profiles - use a manifest class instead of factory to specify which class to resolve for connectors - example usage can be seen in redshift and snowflake * - added manifests for all custom profilers - used super() dependency injection in order for system metrics source - formatting * - implement spec for all source types - added docs for the new specification - added some pylint ignores in the importer module * remove TYPE_CHECKING in core.py * - deleted valuedispatch function - deleted get_system_metrics_by_dialect - implemented BigQueryProfiler with a system metrics source - moved import_source_class to BaseSpec * - removed tests related to the profiler factory * - reverted start_time - removed DML_STAT_TO_DML_STATEMENT_MAPPING - removed unused logger * - reverted start_time - removed DML_STAT_TO_DML_STATEMENT_MAPPING - removed unused logger * fixed tests * format * bigquery system profile e2e tests * fixed module docstring * - removed import_side_effects from redshift. we still use it in postgres for the orm conversion maps. - removed leftover methods * - tests for BaseSpec - moved get_class_path to importer * - moved constructors around to get rid of useless kwargs * - changed test_system_metric * - added linage and usage to service_spec - fixed postgres native lineage test * add comments on collaborative constructors	2024-10-24 07:47:50 +02:00
Pere Miquel Brull	5074f6588f	MINOR - Validate app runner init (#18316 )	2024-10-18 09:40:06 +02:00
Pere Miquel Brull	7012e73d75	GEN-1166 - Improve Ingestion Workflow Error Summary (#18280 ) * GEN-1166 - Improve Ingestion Workflow Error Summary * fix test * docs * comments	2024-10-16 18:15:50 +02:00
Imri Paran	be82086e25	MINOR: add column case sensitivity parameter (#18115 ) * fix(data-quality): table diff - added handling for case-insensitive columns - added handling for different numeric types (int/float/Decimal) - added handling of boolean test case parameters * add migrations for table diff * add migrations for table diff * removed cross type diff for now. it appears to be flaky * fixed migrations * use casefold() instead of lower() * - implemented utils.get_test_case_param_value - fixed params for case sensitive column * handle bool test case parameters * format * testing * format * list -> List * list -> List * - change caseSensitiveColumns default to fase - added migration to stay backward compatible * - removed migration files - updated logging message for table diff migration * changed bool test case parameters default to always be false * format * docs: data diff - added the caseSensitiveColumns parameter requires: https://github.com/open-metadata/OpenMetadata/pull/18115 * fixed test_get_bool_test_case_param	2024-10-15 16:29:43 +02:00
Onkar Ravgan	e6705f25b3	fixed dbt tag name (#18273 )	2024-10-15 16:43:03 +05:30
Onkar Ravgan	2ee015e426	Add array supp for json schema parser (#18255 )	2024-10-15 07:30:16 +05:30
Suman Maharana	dd08bc9ffd	GEN-895: Added Glue Pipeline Lineage (#18063 )	2024-10-14 13:08:17 +05:30
Imri Paran	68e71cb3dc	GEN-970: Refactor redshift system metrics to support freshness test (#17981 ) * ref(profiler): redshift system metrics - moved redshift system metrics to the redshift source module - use Timestamp in data quality - added plugin feature to test utils * use timezone.utc * format * reverted unintended snowflake changes * fixed import test_system_metrics.py * revert * fixed import in tests	2024-10-10 08:32:07 +02:00
Sachin Chaurasiya	457f3d919a	GEN-1322: API Entity - Remove Beta (#17967 ) * GEN-1322: API Entity - Remove Beta * minor: add doc for the metadata pipeline * api service refactor * api service refactor backend changes * add apiconnection in test service connection * pytest fix * fix java file formatting * Fix casing of REST in ApiServiceRest.spec.ts * Refactor REST to Rest in API classes * minor change * minor change * minor change * fix cashing for API to Api * add playwright test for api service ingestion * fix: playwright test --------- Co-authored-by: harshsoni2024 <harshsoni2024@gmail.com>	2024-10-08 14:39:55 +05:30
Ethan	49fceb4674	Fixes #18104 : change parse_obj and assertEquals which was deprecated (#18105 ) * change deprecationwarning * fix format python * fix replace module * change : java function name	2024-10-07 09:02:41 +02:00
Imri Paran	71720ebc51	fix(table-diff): support cross database (#18085 ) fixed table diff url to include database in all cases	2024-10-04 15:31:17 +02:00
Suman Maharana	bc6f4824ea	Added DBT tests with versionless and fixed v7 parsing (#18028 )	2024-09-27 19:53:27 +05:30
sam-mccarty-mavenclinic	0dd3e97170	Fix 17911: Looker parsing improvements for liquid templating and view/model aliasing (#17912 ) * Looker parsing improvements for liquid templating and view/model aliasing * add python-liquid dependency to looker plugin requirements * move to static method with 'openmetadata' context and add rendering tests * remove backtick stripping --------- Co-authored-by: Imri Paran <imri.paran@gmail.com>	2024-09-27 13:55:15 +02:00
Suman Maharana	37b6dc8290	Add Sigma Dashboard Connector (#17855 ) * Add Sigma Dashboard Connector * changed to id instead of name in dashboard entity * Address Comments * addressed comments * Added Docs * yaml file changes * fix ui changes	2024-09-26 16:29:35 +05:30
Pere Miquel Brull	4cccaae446	GEN-996 - Allow PII Processor without storing Sample Data (#17927 ) * GEN-996 - Allow PII Processor without storing Sample Data * fix import * fix import	2024-09-20 16:05:29 +02:00
Ayush Shah	2feb7d04ca	MINOR: Add Matillion SSL + Docs implementation (#17792 )	2024-09-16 12:57:12 +05:30
Sriharsha Chintalapani	c34545adeb	Issue-15768: Support Metric Entity (#17680 ) * Issue-15768: Support Metric Entity * Issue-15768: Support Metric Entity * Issue-15768: Support Metric Entity * Fix tests * Fix tests * Fix tests * Minor: Fix tests * ui: add metricsAPI rest utils * ui: metric list page part 1 * feat: Add metric translations for multiple languages * chore: Add "metric" field to SearchIndexingApplication schema * ui: add create metric page * ui: metric details page patch 1 * ui: add custom property and lineage support for metric entity * ui: add expression component * ui: add metric summary component * chore: Update tab labels in MetricDetails and MetricVersion components * ui: show other info like metric type, granularity, etc * feat: Add support for metric entity in search dropdown * feat: Rename custom property to Metric in MetricEntity.md * feat: Add OwnerLabel component to MetricListPage * Fix expression field in Metric * chore: update expression to metricExpression * ui: add metric header component with edit option * Add metric to SearchIndexApp * chore: Update expression to metricExpression * ui: allow metric expression edit * ui: update metric icon * minor improvements * Fix lineage indexing for Metric * Update GlobalSettingsClassBase.ts to use MetricIcon for metrics in the global settings menu * Fix error handling in MetricListPage component * add related metrics * minor improvements * Fix relatedTerms patch * Fix relatedTerms validation * Add Boolean for deleted * filter active entity from related metric list * playwrite e2e part 1 * Refactor MetricSummary component to include RelatedMetrics in the summary panel * test: add playwright test for metric special cases * Add 'Metrics' to Explore Tree * test: add e2e for add metric page * test: add test for metric listing page content * Add Boolean for deleted, remove deleted from suggests * Refactor LineageProvider to handle deleted flag properly * add playwright for metric listing * fix test * Add colored metric icon and update its usage in GlobalSettingsClassBase * Fixed py_test test_ometa_endpoint for metric --------- Co-authored-by: Sachin Chaurasiya <sachinchaurasiyachotey87@gmail.com> Co-authored-by: mohitdeuex <mohit.y@deuexsolutions.com> Co-authored-by: Chirag Madlani <12962843+chirag-madlani@users.noreply.github.com> Co-authored-by: SumanMaharana <sumanmaharana786@gmail.com> Co-authored-by: Ashish Gupta <ashish@getcollate.io>	2024-09-16 11:36:31 +05:30
k.nakagaki	3d8e30142c	Fixes 8428: make it possible to choice a sampling method type when we create profile ingestion for the Snowflake (#17831 ) * Add test for existing code * Add sampling method at ingestion. * add samplingMethodType into UI * modify init method to use new parameter. * create descriptions * execute isort * fix an unintended change. * apply py_format * close section * specify init arguments * fix bug * apply py_format --------- Co-authored-by: Teddy <teddy.crepineau@gmail.com>	2024-09-15 21:51:17 +02:00
Imri Paran	59854de58d	[GEN-1406]: import plugin and clear plugin message for failed source import (#17788 ) * support side effects on source classes by always importing source class * streamlined error message * fixed service type extraction for test suite pipeline * - replaced "custom" with constant - added quotes for the plugin exception for copy/paste ergonomics	2024-09-12 16:13:03 +02:00
Imri Paran	b70b3ce913	added config logging with secrets redacted (#17770 )	2024-09-12 10:19:53 +02:00
Imri Paran	a3d6c1dd20	MINOR: tests(datalake): use minio (#17805 ) * tests(datalake): use minio 1. use minio instead of moto for mimicking s3 behavior. 2. removed moto dependency as it is not compatible with aiobotocore (https://github.com/getmoto/moto/issues/7070#issuecomment-1828484982) * - moved test_datalake_profiler_e2e.py to datalake/test_profiler - use minio instead of moto * fixed tests * fixed tests * removed default name for minio container	2024-09-12 07:13:01 +02:00
harshsoni2024	4f57f4e710	GEN-309: OpenAPI Connector (#17754 ) * configure api service metadata * add rest api service * fix test con. pyformat changes * add models, fix test con. * improve test con. * add docs, side doc * fix model data parse, url error fix * add tests * fix pytest errors --------- Co-authored-by: Pere Miquel Brull <peremiquelbrull@gmail.com>	2024-09-11 13:36:53 +05:30
Suman Maharana	094bae7097	Fixes #15565 : Advanced Avro schema recursion depth issue (#17683 ) * Fix: Avro schema recursion depth issue * py_format * Addressed comments	2024-09-10 18:26:42 +05:30
Teddy	e4c01c5702	fix: region typo in test (#17766 )	2024-09-09 17:54:07 +05:30

1 2 3 4 5 ...

723 Commits