48 Commits

Author SHA1 Message Date
Imri Paran
be82086e25
MINOR: add column case sensitivity parameter (#18115)
* fix(data-quality): table diff

- added handling for case-insensitive columns
- added handling for different numeric types (int/float/Decimal)
- added handling of boolean test case parameters

* add migrations for table diff

* add migrations for table diff

* removed cross type diff for now. it appears to be flaky

* fixed migrations

* use casefold() instead of lower()

* - implemented utils.get_test_case_param_value
- fixed params for case sensitive column

* handle bool test case parameters

* format

* testing

* format

* list -> List

* list -> List

* - change caseSensitiveColumns default to fase
- added migration to stay backward compatible

* - removed migration files
- updated logging message for table diff migration

* changed bool test case parameters default to always be false

* format

* docs: data diff

- added the caseSensitiveColumns parameter

requires: https://github.com/open-metadata/OpenMetadata/pull/18115

* fixed test_get_bool_test_case_param
2024-10-15 16:29:43 +02:00
Chirag Madlani
b0563ccf98
revert quicktype bump since for CI issue (#17934)
* enable logging for debugging

* remove node-gyp

* add node-gyp globally before installing deps

* reduce quick type to 10

* revert quicktype and node-gyp changes for CI

* fix unit tests
2024-09-20 19:30:06 +05:30
sonika-shah
8d5661a7bf
Support Elasticsearch cluster alias for E2E tests  (#17117)
* Fix backend tests and have index alias with clusterAlias appended

* Fix backend tests and have index alias with clusterAlias appended

* Fix failing tests

* alias setup

* fix suggestion not working due to alias

* fix getIndexOrAliasName method for multiple indexes

* update openmetadata.yaml

* update childAliases with clusterAlias

---------

Co-authored-by: Ashish Gupta <ashish@getcollate.io>
Co-authored-by: Sriharsha Chintalapani <harshach@users.noreply.github.com>
2024-08-10 11:26:49 -07:00
Shailesh Parmar
1d4e3e0825
Revert alias setup (#16610)
* Revert "fix(CI): update `run_local_docker.sh` script with cluster alias ES endpoint (#16604)"

This reverts commit baab52076f71f76e8bc48d89564ba733ac77a1e6.

* Revert "chore(CI): Enable Elasticsearch cluster alias by default (#16599)"

This reverts commit a3524e14d3a1244db19199ab1eae3796ba15b499.
2024-06-11 19:14:25 +05:30
Akash Jain
baab52076f
fix(CI): update run_local_docker.sh script with cluster alias ES endpoint (#16604) 2024-06-11 15:55:28 +05:30
Pere Miquel Brull
a1404e6b4a
MINOR - Clean ingestion dependencies (#15679)
* WIP - MINOR - Clean ingestion dependencies

* test

* test

* Clean imports

* add pyiceberg for test

* Revert "add pyiceberg for test"

This reverts commit ab26942736586f089a57a644ffd727aca200db62.

* add pyiceberg for test

* Remove docker dep

* clean local docker sh

* MINOR - AKS Airflow troubleshooting docs

* Fix action

* clean local docker sh
2024-04-11 14:30:40 +02:00
Imri Paran
aade838020
Fixes #15388: Use native backup tools (#15393)
* feat: use native backup tools

1. added mysqldump 8.3 to the ingestion container.
2. documented how to use native tools to back up and restore.
3. added deprecated message on the cli backup and restore.

* added deprecation notice for 1.3 backup

* removed 1.3.x deprecation notice

* added another backup page in 1.3 introducing SQL dump tools

* added --set-gtid-purged=OFF to the mysql dump process
2024-03-12 06:23:05 +01:00
Shiyang Xiao
241f3c68d7
Fixes #14413: Add SAS connector (#14415)
* feat: SAS Viya connector

* refactor SASCatalog to SAS

* add SAS logo to UI and connection documentation

* doc changes

* modify ingestion logic

* revert original changes

* added support for dataflow & perfect logic for reports/datatables

* add filter doc

* more updates to perfect ingestion for each asset type

* fix a bug with table lineage not created properly

* Delete ingestion/pipelines/sasCatalog.yaml

* precomit fix

* Conversion to database connector

* minor fixes

* make custom properties type generic

* Add SAS javaEnum

* add dummy variable for sas.yaml

---------

Co-authored-by: lizmc <liz.mcintosh@sas.com>
Co-authored-by: Shiyang Xiao <Shiyang.Xiao@sas.com>
2024-01-11 06:46:57 -08:00
Ayush Shah
9c6d202555
Add Sample data, modify regex pattern (#14467) 2024-01-11 14:23:33 +05:30
Mohit Yadav
63ac994371
Point Reindex request to new Applications api (#13508) 2023-10-10 17:05:07 +05:30
Ayush Shah
1a8b11b29e
Fix Github Actions, Timeout (#13488) 2023-10-09 15:01:23 +05:30
Ayush Shah
15a0c85b79
install 3rd party for running validate compose docker (#12457) 2023-07-17 10:39:23 +00:00
Mohit Yadav
6410fcbea4
Fix Alert Intermittent Issues for some entities (#12189)
* Fix Alert Intermittent Issues for some entities

* Fix Failing test for disabled publisher

* Fix Failing test due to deleted entity

* Fix Tests

---------

Co-authored-by: Sriharsha Chintalapani <harshach@users.noreply.github.com>
2023-06-29 19:00:54 -07:00
Anuj359
73f5859ed9
Docker script update (#12115)
* Create manual.yml

* Fixing Typo

* Removing manual test github action file

* Testing IP runner

* Updating docker script for sandbox db task

* removing the ip test action

* Updating docker script for sandbox db task postgres

* Volume cleaning argument addition

* Volume cleaning argument addition
2023-06-26 15:44:53 +05:30
07Himank
9355c0677d
Open search elastic search client task (#11879)
* Added diffrent clients for elasticSearch and openSearch and did the interface implementation

* removed some comments and some changes

* Add Elastic Search Test Container

* upgarded es version

* added test cases for elastic search

* added test case for, when tag is deleted cehck its relationships also get deleted

* changed package of open,elastic to openSearch and elasticSearch

* added support for skip search test case if index is not present of that entity

* Refactor bunch of files

* unnecessary semicolon

* refactor test case indexing

* fix checkstyle

* checkstyle

* added hardcoded configs and changed package name

* fixed no response from hits

* stopped elasticsearch container and some fixes

* closed elasticsearch container

* revert

* closed client after used in test

* style

* edit es config in junit

* checking if using already es version release memory for maven ci

* revert docker file changes

* remove es container for testing the ci

* commented out all test cases for elasticsearch

* fix

* creating elastic search with reusable enabled

* trying to fix ci error

* start container only if it is not running

* revert docker changes and done changes for elastic search container

* changed var name of sort_order for es

* lower the version of es client

* fixes

* all latest version of es including docker

* Use Single sort order

* Fix Keyword issue

* Immediate refresh policy for entity created

* increase sleep time

* disable test

* added reindexing logic in run_local_docker.sh

* enabled test case

---------

Co-authored-by: Himank Mehta <himankmehta@Himanks-MacBook-Air.local>
Co-authored-by: mohitdeuex <mohit.y@deuexsolutions.com>
Co-authored-by: Mohit Yadav <105265192+mohityadav766@users.noreply.github.com>
Co-authored-by: ulixius9 <mayursingal9@gmail.com>
2023-06-22 16:55:05 +05:30
Anuj359
b2ae9d498c
Update docker for sandbox (#11938)
* Stopping the docker before maven build
2023-06-12 10:21:05 +00:00
Sriharsha Chintalapani
de10cd254b
New docker structure (#10438)
* Adding the different docker-compose file openmetadata and ingestion

* Added two different env files for mysql and postgres

* Updated the docker file path

* Updated the path of docker folder structure

* Fix docker

* Updating the PR with necessary changes required

---------

Co-authored-by: “Vijay” <“vijay.l@deuexsolutions.com”>
Co-authored-by: Akash-Jain <Akash.J@deuexsolutions.com>
2023-03-06 08:58:53 -08:00
Nahuel
247016307d
Fix#8648: Mask sensitive info from API responses (#10307)
* Mask sensitive info from API responses

* Rename converter classes

* Add missing Java classes from JSON schemas and class converters

* Update test service connection schema

* Update datalakeConnection JSON schema and fix some tests

* Fix AlertsRuleEvaluatorResourceTest and minor error in run_local_docker.sh

* Fix Pipeline and Database service tests

* Minor refactor

* Fix CsvUtilTest

* Fix EventMonitorFactoryTest

* Fix CloudWatchEventMonitorTest

* Update datalake metadata

* Update bigquery metadata

* Fix test connection functionality

* Fix OMeta service api test

* Update gcsValues title and revert changes in GH actions

* Mask sensitive enabled by default for local docker

* Add missing tests

* Address PR comments

* Address PR comments

* fix ui breaks on gcsValues.json

* Address PR comments

* Minor refactor

---------

Co-authored-by: Chirag Madlani <12962843+chirag-madlani@users.noreply.github.com>
2023-03-03 18:10:01 +00:00
Nahuel
2c43ebba6f
Fix#9448: Add ES volumes (#9506)
* Add ES volumes

* Fix run_local_docker script

* Fix error run_local_docker script

* Update Es volumes in docker-compose files
2022-12-23 17:33:30 +01:00
Vj-L
f3855a1909
Added the default volumes for ElasticSearch (#9452)
* Added the default volumes for ElasticSearch

* Added docs for ES Volumes

* Updated as per the comments

* Updated as per the comments

* Added the docs for bare-metal upgrade

* Apply suggestions from code review

* Update docker/local-metadata/docker-compose.yml

* Pylint check addressed

Co-authored-by: “Vijay” <“vijay.l@deuexsolutions.com”>
Co-authored-by: Nahuel <nahuel@getcollate.io>
2022-12-21 18:29:49 +00:00
Ayush Shah
b25336a55c
Add Generation of Pydantic Models via script (#8923) 2022-11-22 11:19:22 +05:30
Vj-L
2c055641be
Added host volumes for Database service (#8722)
* Added host volumes for Database service

* Formated the python file

* Done changes according to the comments on PR

* Changes done as per the comments on Draft PR

* Fix PyLint

* Changes done as per the comments on Draft PR

* Updated the path of volume for Database

* Added cleanup state in workflows

* Added cleanup state in workflows

Co-authored-by: “Vijay” <“vijay.l@deuexsolutions.com”>
Co-authored-by: ulixius9 <mayursingal9@gmail.com>
Co-authored-by: Pere Miquel Brull <peremiquelbrull@gmail.com>
2022-11-18 14:05:33 +01:00
Sriharsha Chintalapani
801012d2a6
Add remote debug port (#8045)
* Add remote debug port

* Add remote debug port

* Add remote debug port

* Add remote debug port
2022-10-11 11:13:15 +05:30
Vj-L
489d506e3a
Removed the volume for Database (#7798)
* Removed the volume for Database

* Removed the volume for Database

* Added named volume

* Added named volume

Co-authored-by: “Vijay” <“vijay.l@deuexsolutions.com”>
2022-09-29 21:44:12 +05:30
Vj-L
6cfd16925a
Added default volumes (#7741)
* Added default volumes

* Ran make py_format for formatting

* Added sample data in mount directories

* Updated as per the comments

* Done changes for --ingest_sample_data flag

Co-authored-by: “Vijay” <“vijay.l@deuexsolutions.com”>
2022-09-29 14:53:55 +05:30
Akash Jain
b3d340e403
Fix: env variables with basic auth changes for docker compose (#7693) 2022-09-23 23:52:28 +05:30
Nahuel
ed58c27f76
Remove unused property extra_hosts from docker-compose files (#7096)
* Remove unused properties from docker-compose files

* Minor fix in run_local_docker script
2022-08-31 21:30:24 +02:00
Teddy
811f640a18
Fixes #6891 by implementing support for VARIANT DType (#7084) 2022-08-31 19:01:00 +02:00
Nahuel
30ff470922
Reduce the CI times for py-tests (#6352)
* Testing if we can reduce the CI times

* Update POM and run_local_docker script
2022-07-26 17:47:03 +02:00
Pere Miquel Brull
bbfb0dd7d2
Give more verbosity for CI (#6048)
Give more verbosity for CI (#6048)
2022-07-18 14:44:50 +02:00
Pere Miquel Brull
8e121fb306
Fix #4096 - PipelineService JSON refactor & Airflow metadata source connector (#5125)
Fix #4096 - PipelineService JSON refactor & Airflow metadata source connector (#5125)
2022-05-25 08:35:16 +02:00
Pere Miquel Brull
6a6507e754
Fix #3962 - Profiler uses DatabaseSchema & Sample Data fix (#4056) 2022-04-12 13:40:59 +05:30
Pere Miquel Brull
93525aea0d
Fix #3823 & Fix generate default Enum field (#3909)
* Fix sample data DAG

* Fix callback imports

* Use --set-default-enum-member in generate

* Format

* Add faulty merge hard_delete

* Fix airflow lineage, improve naming and fix lineage tests

* Add mysql url test

* Add mysql url test

* Update CI name

* Fix test ometa endpoint

* Format

* Fix metadata config
2022-04-07 18:22:50 +05:30
Sriharsha Chintalapani
b14c8dc2c4
Issue-3685: Variable based separator used for fullyQualifiedName instead of hardcoded . for Python and make the separator to : (#3778)
* Issue-3685: Variable based separator used for fullyQualifiedName instead of hardcoded . for Python and make the sepearator to :

* Fix failing test

* Use colon for run_local_docker validation

* Update tests FQDN

* Update tests FQDN

Co-authored-by: Sachin-chaurasiya <sachinchaurasiyachotey87@gmail.com>
Co-authored-by: pmbrull <peremiquelbrull@gmail.com>
2022-03-31 19:20:27 +05:30
Ayush Shah
351db56687
Added fix for Sample_Usage not being ingested in Local docker (#3650) 2022-03-27 22:38:13 -07:00
Ayush Shah
9a620c380d
Fixing Metadata indexing issue (#3580) 2022-03-22 14:37:41 +05:30
Ayush Shah
b865d85d85
Docker Calculation fix + Reset DB command added (#3459) 2022-03-21 10:28:58 -07:00
Pere Miquel Brull
1eb3f1ad41
[issue-2322] - Backup CLI (#2349)
* Prepare cli module

* Check call

* Prepare backup

* Run pytest on schema changes

* Remove traces

* Add backup plugin

* run generate during CI

* Install after generate

* Address logger hotspot
2022-01-22 12:08:14 -08:00
Pere Miquel Brull
6693b4ae40
[issue-2262] - Simplify sample data DAG (#2337)
* Simplify sample data DAG

* Remove mkdir

* Generate sources before running compose

* Generate sources to install models to ingestion image

* Add python-on-whales for docker --start

* Remove python-on-whales from base

* Install venv

* Setup python
2022-01-22 16:37:56 +05:30
Pere Miquel Brull
2ec32017bb
[issue-1698] - Airflow DockerOperator setup (#1730)
* Fix typo

* Clean setup

* Update ingestion local image to be barebone on connector dependencies

* Prepare ingestion connectors base image

* Add system dependencies

* Prepare docker CLI

* Add docker provider

* Prepare entrypoint for the image

* Remove DBT pipeline as per Issue 1658

* Add TODO for ingestion build

* Bind docker socket

* Update comment

* Update README

* Use DockerOperator in sample data

* Build images with latest tag

* Prepare symlink to pass the volume to the DockerOperator

* Update README

* Prepare Base image for CI

* COPY multiple files into dir

* COPY multiple files into dir

* Remove DBT source as is now part of table ingestion

* Build docker base in run_local_docker
2021-12-18 16:41:38 +01:00
Ayush Shah
07924f80c3
Added Metabase Connector (#1726)
* Added Metabase Connector

* Metabase Added support for lineage

* Modified Metabase as per comments

* Metabase filter patterns added

* Update databaseService.json
2021-12-15 09:53:55 -08:00
parthp2107
a9e70e6a89
License header update (#1498)
* updated liscense header

* updated license header

* updated license header

* updated license header

* addressing stylecheck findings
2021-12-01 12:46:28 +05:30
Ayush Shah
47a97e7be0 FQN changed from bigquery to bigquery_gcp 2021-11-29 16:57:39 +05:30
Ayush Shah
11e2229972
Added OMeta API, removed wget (#1376)
* Added OMeta API, removed wget

* Added curl instead of Wget in run_local_docker.sh
2021-11-24 11:57:33 -08:00
Ayush Shah
2111f6c938
Docker Commands Modified (#1199)
* Docker ingestion modified

* Command line commands updated
2021-11-16 00:25:40 -08:00
Ayush Shah
5dc3bb9297
Docker Support from Python added (#1158)
* Script modified - supports running from different locations

* Docker support from Python CLI

* Docker plugin setup.py

* Paths for latest and local dockers updated

* Resolved Comments - Docker CLI optimized, timestamp added

* help attribute added to options

* Docker clean code refactored
2021-11-12 10:30:28 -08:00
Pere Miquel Brull
1291bf6ec0
[Issue-892] - GET operations return service (#916)
* Force image build in local docker

* Use service as default field query param for entities with required service

* Return service in List

* Naming convention

* Update Repository setFields to include service

* Revert resource changes

* Revert resource changes

* Update tests to check we always return the service, even if not specified

* Return service for database
2021-10-24 10:26:28 -07:00
Ayush Shah
319f7a795c
Docker local (#723)
* added docker file for local build and ingestion

* Docker Modified for local setup

* Ingestion script optimized

* Added script to directly run Local Docker

Co-authored-by: Parth Panchal <parthpanchal@Parths-MacBook-Air.local>
2021-10-09 10:58:33 -07:00