Sergio Gómez Villamor
e072a42d03
feat(ingest): adds get_entities_v3 method to DataHubGraph ( #13045 )
2025-04-02 10:22:14 +02:00
Harshal Sheth
5bc8a895f9
chore(ingest): remove calls to deprecated methods ( #13009 )
2025-03-28 13:42:54 -07:00
Harshal Sheth
2cc8856c6b
feat(ingest): allow MCPWs instead of workunits ( #12947 )
2025-03-24 08:10:48 -07:00
david-leifker
ebd3a5078d
feat(ingestion-tracing): implement ingestion with tracing api ( #12714 )
2025-03-13 11:33:28 -05:00
Harshal Sheth
12eb0cd1a7
feat(sdk): add search client ( #12754 )
2025-03-03 10:05:26 -08:00
Aseem Bansal
2101a50e83
dev: immutable args remove ( #12735 )
2025-02-27 15:38:47 +05:30
Harshal Sheth
79f844b208
chore(py): add unused-noqa lint rule ( #12615 )
2025-02-13 08:28:48 -08:00
Harshal Sheth
7472c535d3
feat(ci): add pytest hooks for updating golden files ( #12581 )
2025-02-12 15:32:31 -08:00
Harshal Sheth
7c1954237f
fix(sdk): fix platform instance generation in the sdk ( #12573 )
2025-02-07 08:21:56 -08:00
Aseem Bansal
262dd76518
dev: remove black in favor of ruff for formatting ( #12378 )
2025-01-18 15:06:20 +05:30
Harshal Sheth
ce6474df5a
chore(ingest): remove deprecated calls to Urn.create_from_string ( #11983 )
2024-12-02 09:53:13 -08:00
Harshal Sheth
7dbb3e60cb
chore(ingest): start using explicit exports ( #11899 )
2024-11-20 13:33:30 -08:00
Harshal Sheth
6316e10d48
feat(ingest): check ordering in SqlParsingAggregator tests ( #11735 )
2024-10-29 17:50:37 +01:00
Harshal Sheth
d34717fd82
fix(ingest): remove default value from DatahubClientConfig.server ( #11570 )
2024-10-16 13:50:33 -07:00
Mayuri Nehate
9568a4254d
feat: separate great-expectations action package ( #11096 )
Co-authored-by: Harshal Sheth <hsheth2@gmail.com>
2024-08-21 12:13:36 -04:00
Gabe Lyons
423af83ef1
feat(rest-emitter): adding async flag to rest emitter ( #10902 )
Co-authored-by: Gabe Lyons <gabe.lyons@acryl.io>
2024-07-12 13:30:21 -07:00
Harshal Sheth
f4be88d0a9
feat(ingest): set pipeline name in system metadata ( #10190 )
Co-authored-by: david-leifker <114954101+david-leifker@users.noreply.github.com>
2024-06-27 15:00:35 -07:00
Harshal Sheth
3d5735cbc5
chore(ingest): run pyupgrade for python 3.8 ( #10513 )
2024-05-15 22:31:05 -07:00
dushayntAW
a164b70e1d
chore(ingest/presto-on-hive) Set enable_properties_merge to True by default ( #10469 )
2024-05-15 18:57:13 +05:30
Harshal Sheth
0d780e5f8f
feat(ingest): sql parsing aggregator ( #9786 )
2024-02-09 16:27:45 -05:00
Harshal Sheth
0e418b527e
fix(ingest): upgrade pytest-docker ( #9765 )
2024-02-01 16:33:15 -08:00
Shubham Jagtap
1741c07d76
feat(ingestion): Add test_connection methods for important sources ( #9334 )
2023-12-14 12:31:51 -05:00
Harshal Sheth
f9fd9467ef
feat(ingest): clean up DataHubRestEmitter return type ( #9286 )
Co-authored-by: Andrew Sikowitz <andrew.sikowitz@acryl.io>
2023-11-30 21:00:43 -05:00
Harshal Sheth
73514ad9c5
fix(ingest): cleanup large images in CI ( #9153 )
2023-10-31 21:28:38 -07:00
Harshal Sheth
9deb7be3fc
fix(ingest): refactor test markers + fix disk space issues in CI ( #8938 )
2023-10-03 20:17:49 -07:00
Andrew Sikowitz
2261531e31
test(ingest): Aspect level golden file comparison ( #8310 )
2023-07-11 10:39:47 -04:00
Harshal Sheth
2d442161c4
ci(ingest/kafka): improve kafka integration test reliability ( #8085 )
2023-05-25 15:40:56 -07:00
Andrew Sikowitz
fdbc4de695
refactor(ingest): Call source_helpers via new WorkUnitProcessors in base Source ( #8101 )
2023-05-24 13:36:19 -07:00
Harshal Sheth
b0f8c3de1e
refactor(ingest): simplify stateful ingestion provider interface ( #8104 )
Co-authored-by: Tamas Nemeth <treff7es@gmail.com>
2023-05-23 12:57:57 -07:00
Shirshanka Das
151eab3628
fix(build): fix lint issue ( #8066 )
2023-05-17 08:17:52 -07:00
Shirshanka Das
b3c790aab6
feat: Add support for Data Products ( #8039 )
Co-authored-by: Chris Collins <chriscollins3456@gmail.com>
2023-05-17 07:17:25 +00:00
Harshal Sheth
e99875cac6
chore(ingest): enable flake8 bugbear linting ( #7763 )
2023-04-10 14:14:42 -07:00
Harshal Sheth
667ca8632d
feat(ingest): avoid embedding serialized json in metadata files ( #6742 )
2022-12-28 19:28:38 -05:00
cccs-eric
ec8a4e0eab
feat(ingest): upgrade pydantic version ( #6858 )
This PR also removes the requirement on docker-compose v1 and makes our tests use v2 instead.
Co-authored-by: Harshal Sheth <hsheth2@gmail.com>
2022-12-27 17:06:16 -05:00
Harshal Sheth
5584bfb469
refactor(ingest/stateful): remove get_last_state method ( #6794 )
2022-12-19 20:48:22 -05:00
Harshal Sheth
e9d50ed992
refactor(ingest/stateful): remove IngestionJobStateProvider ( #6792 )
2022-12-19 17:03:54 -05:00
Tamas Nemeth
e41b455e14
fix(ingest): bigquery - sharded table support improvements ( #6789 )
2022-12-19 18:57:37 +01:00
Harshal Sheth
8a537b0559
feat(ingest): add datahub state inspect command ( #6763 )
2022-12-15 18:55:36 -05:00
Harshal Sheth
85bb1f5030
test(ingest): make hive/trino test more reliable ( #6741 )
2022-12-12 21:02:52 -05:00
Harshal Sheth
44cfd21a65
chore(ingest): bump and pin mypy ( #6584 )
2022-12-02 19:53:28 +01:00
Harshal Sheth
3e907ab0d1
feat(ingest): loosen sqlalchemy dep & support airflow 2.3+ ( #6204 )
Co-authored-by: Tamas Nemeth <treff7es@gmail.com>
2022-11-11 15:04:36 -05:00
Shirshanka Das
c5fc2ea798
fix(ingest): looker - deps, column level lineage fixes ( #6271 )
2022-10-24 08:31:48 +02:00
Harshal Sheth
73fd35888b
build(ingest): remove markupsafe dep and bump pytest-docker ( #6201 )
2022-10-14 18:59:40 -07:00
Harshal Sheth
220ae0b6c9
feat(ingest): make sink use type annotations ( #5899 )
2022-09-10 19:46:20 -07:00
Piotr Sierkin
828a711684
feat(ingest): dbt - control over emitting test_results, test_definitions, etc. ( #5328 )
Co-authored-by: Piotr Sierkin <piotr.sierkin@getindata.com>
Co-authored-by: Shirshanka Das <shirshanka@apache.org>
2022-08-06 21:42:53 -07:00
Shirshanka Das
558a65a3c3
fix(ci): fix mysql test and attempt kafka-connect ingestion ( #5352 )
2022-07-07 08:28:34 -07:00
Shirshanka Das
e93e4691fb
feat(ingest): lookml - adding support for only emitting reachable views from explores ( #5333 )
2022-07-05 10:14:12 -07:00
buggythepirate
92338c7912
feat(ingest): Added new ingestion source SAP HANA ( #4376 )
Co-authored-by: Ravindra Lanka <rlanka@acryl.io>
2022-05-26 03:42:50 -07:00
Swaroop Jagadish
35b187a8d4
feat(ingest): transformers - add support for processing MCP-s ( #4337 )
2022-03-07 13:14:29 -08:00
Claudio Benfatto
aeefde4fa1
feat(ingestion): Kafka stateful ingestion ( #4028 )
* test: test stateful ingestion for kafka
test: some more advancement
test: some improvements
refactoring
* refactor: remove some linter modifications
* tests: add unit tests for kafka state
* refactor: minor changes
* tests: improve test coverage
* fix: fix naming
* style: fix format with black
* fix: fix broken test
* revert: revert smoke tests to master
* feat: add reporting to kafka source
* tests: add smoke tests for kafka reporting
* revert: revert changes to the smoke tests
* test: add kafka integration test for stateful ingestion
* docs: update documentation on kafka source
* fix: return empty string when no platform instance
* revert: remove unwanted file
* fix: solve problem with platform instance
* chore: use console sink instead of file
* fix: disable complexity check for _extract_record
* fix: remove if condition in get_platform_instance_id
* chore: remove unneeded integration test
* test: test platform instance in kafka source unit tests
2022-02-15 07:18:36 -08:00