* SAP Hana skeleton
* Add SAP Hana Connector
* Fix ingestion and docs
* Prep SAP Hana Profiler
* Linting
* Update index.md
* Revert: Update index.md
---------
Co-authored-by: Ayush Shah <ayush@getcollate.io>
* updated metadata to work with the impala query engine.
Uses the describe function to grab column names, data types, and comments.
* added the ordinalPosition data point into the Column constructor.
* renamed variable to better describe its usage.
* updated profile errors.
Hive connections now comment columns by default.
* removed print statements
* Cleaned up code by pulling check into its own function
* Updated median function to return null when it is being used for first and third quartiles.
* removed print statements and ran make py_format
* updated to fix some pylint errors.
imported Dialects to remove string compare to "impala" engine
* moved huge comment into function docstring.
This comment shows us the sql to get quartiles in Impala
* added cast to decimal for column when running average in mean.py
* fixed lint error
* fixed ui ordering of precision and scale.
Precision should be ordered before scale, since the precision is set first in decimal data types
* first pass for impala connector
* updated default auth_mechanism to be one of the enum values.
* updated UI documentation to match fields for the impalaConnection.
refined impalaConnection to make use_ssl a boolean instead of relying on an extra connection option being manually added.
Removed reference to hive for type mapping
added impala to the pip setup
* py_format updates
* removed print statement
* Lints and fixes
* Updated database documentation to follow new style
* Flag as BETA
* Remove tests
---------
Co-authored-by: Chirag Madlani <12962843+chirag-madlani@users.noreply.github.com>
Co-authored-by: Pere Miquel Brull <peremiquelbrull@gmail.com>
* Organise calls
* Prepare skeleton
* Add looker model handling
* Parse files as sql
* Handle labels
* Linting
* Format
* Fix version
* Also check the API for explore lineage
* Added mark delete logic
* Final test and optimization
* After merge fixes
* Added include tags for dash pipelines dbt
* added docs and fixed test
* Fixed py tests
* Added UI changes for following newly added fields:
- markDeletedDashboards
- markDeletedMlModels
- markDeletedPipelines
- markDeletedTopics
- includeTags
* Fixed failing unit tests
* updated json files of localization for other languages
* Improved localization changes
* added localization changes for other languages
* Updated mark deleted desc
* updated the ingestion fields descriptions in the ingestion form for UI
* automated localization changes for other languages
* updated descriptions for includeTags field for dbtPipeline and databaseServiceMetadataPipeline json
* fixed issue where includeTags field was being sent in the dbtConfigSource
* Added flow to input taxonomy while adding BigQuery service.
---------
Co-authored-by: Aniket Katkar <aniketkatkar97@gmail.com>
* Change how status is handled after running workflow
* Reset changes in config files
* Add auxiliary Summary class
* Improve failures handling
* Pylint error
* Show result in table
* Add test
* Fix setup.py
* Add comments
* Update sqllineage-openmetadata version + add timeout
* Pyimpala fix colnames, comments and dialect sql compilation (#10470)
* Fix col names and comments for impala hive
* Fix cols, comments and impala sql compilation
* Handle hive types
* Format
* Added doc in avro array and tests (#10473)
* Fixed: Add job definition id field for dbt cloud in UI #10269 (#10472)
* fixed Add job definition id field for dbt cloud in UI #10269
* sync-localization file
* fixed failing unit test and add unit test for the changes
* Address PR comments
* Update tests
* Pylint clean
---------
Co-authored-by: Pere Miquel Brull <peremiquelbrull@gmail.com>
Co-authored-by: Onkar Ravgan <onkar.10r@gmail.com>
Co-authored-by: Shailesh Parmar <shailesh.parmar.webdev@gmail.com>
* Fix#10429: Kafka Sample data improvements and adding support for JSONSchema and Protobuf
* Added top level parsing and unit tests
* fix(ui): show schemaText and fields both
* fix no data placeholder for fields & schema text
* addressing comments
* fixed py checkstyle
---------
Co-authored-by: Onkar Ravgan <onkar.10r@gmail.com>
Co-authored-by: Chirag Madlani <12962843+chirag-madlani@users.noreply.github.com>
* WIP: pii tagging using spacy
* added test cases and changes as per comment
* fix python checkstyle
* added score, test_cases and docs update
* solved merge conflict
* fix python checkstyle
* remove pii tagging using regex
* fix python test
* lib changes and added some test case
* changed as per comment
* fix: python test
* fix: changes to get source_config
* fix: changes as per comment
* feat(profiler): renamed module to
* feat(profiler): added dbt-artifacts-parser to test setup.py
* feat(profiler): refactor workflow and interface
* feat(profiler): linting
* feat(profiler): removed old profiler modules
* feat(profiler): added support for value and integer range partition
* feat(profiler): fixed linting
* feat(profiler): added partitioning support for datalake profiler
* feat(profiler): removed `ProfilerInterfaceArgs` class
* feat(profiler): address comments
* feat(profiler): Added `OTHER` as an `IntervalType` for UI type generation
* feat(testSuite): extracted out column test for SQA type
* refactor(testSuite): extracted SQA column and table tests into their own classes
* refactor(testSuite): Added pkgutil-style namespace package for test suite classes
* refactor(testSuite): added dynamic importer function for test cases
* refactor(testSuite): black formatting
* refactor(testSuite): fixed linting issues
* refactor(testSuite): refactor metrics for dataframe
* refactor(testSuite): Added Mixins and base methods
* refactor(testSuite): extracted out get bound for floats
* refactor(testSuite): Added pandas column test cases
* refactor(testSuite): Deleted old column tests
* refactor(testSuite): Added table tests for datalake
* refactor(testSuite): Removed old tests definition
* refactor(testSuite): changed registry to dynamic class import
* refactor(testSuite): renamed dl_fn to df_fn
* refactor(testSuite): updated registry unit test
* refactor(testSuite): updated import path to sqa like column
* refactor(testSuite): cleaned up imports in old files
* refactor(testSuite): harmonized SQALikeColumn object to replicate SQA Column object
* refactor(testSuite): linting
* refactor(testSuite): raise exception on DQ exception
* refactor(testSuite): linting
* refactor(testSuite): removed pandas from base requirements
* refactor(testSuite): Added __future__ import for py3.7 type hints
* refactor(testSuite): added `df` to good-names
* refactor(testSuite): renamed Handler to Validator
* refactor(testSuite): Added test inheritance for column tests
* refactor(testSuite): cleaned up column type check
* refactor(testSuite): cleaned up typo
* refactor(testSuite): extracted main table test logic into parent class
* refactor(testSuite): linting
* refactor(testSuite): linting fixes
* refactor(testSuite): address doc string and linting issues
* Bump to version 0.13.2
* Bump mvn projects to 1.0.0-SNAPSHOT
* Bump python projects to 1.0.0.dev0
---------
Co-authored-by: Pere Miquel Brull <peremiquelbrull@gmail.com>
* Replace sqllineage with openmetadata-sqllineage
* Fix checkstyle and failing test
* Move logic to retrieve dialect of a service type into a class
* Improve py-check message when it fails
* Updated mapper
* Update code after merge
* Fix: added test case for atlas
* Fix: resolved conflict
* Fix: changing back neo4j to old version
* Fix: changes as per comment
* Fix: python checkstyle