5512 Commits

Author SHA1 Message Date
Shirshanka Das
ff7c7fc8e2
Update README.md (#2041)
Adding links to latest DataHub blog post.
2020-12-17 11:40:52 -08:00
Nagarjuna Kanamarlapudi
0fef73bb57
fix(search): Fix the rollback unintentional rollback(#2028) of dataset index to search by field paths. (#2040)
Enables the auto complete of field paths on DataHub UI
2020-12-14 17:40:24 -08:00
Ilya Ksenofontov
eedf0ac1af
fix: The field connection with type URLConnection is not Serializable. (#2034)
An allegedly Serializable object with non-transient, non-serializable data members could cause program crashes. Method toURL() of URL class is depreceted.
2020-12-09 10:33:31 -08:00
Nagarjuna Kanamarlapudi
3235dc2fec
Update town hall history for 12/4 and announce the next town hall date (#2032)
* Update townhall history for 12/4 and announce the next townhall date.
2020-12-07 15:26:34 -08:00
Nagarjuna Kanamarlapudi
f103998dcb
feat(Search): Inject restli client into index builders (#2024)
* feat(Search): Inject restli client into index builders
Inject restli client into index builders
2020-12-03 11:43:48 -08:00
John Plaisted
838f964114
feat: add elasticsearch sanity integration tests (#2028)
These tests verify that, given an index settings and mappings, data can be written to the index, and read from it with a query_all query. These are very simple sanity tests.

We can, and should, write more complex tests that specific to each index in the future.
2020-12-02 20:49:34 -08:00
John Plaisted
5f9d967451
fix: ingestion docker image (#2027)
The environment was not set correctly, so it could not fire kafka events. It (mce-cli) always worked when running outside of docker.

I also added a dev ingestion docker image / script which may be much faster if you've already built locally.

Tested:
1. Cleaned docker volumes and started datahub. Verified it is empty.
2. Built with gradle.
3. Ran ./docker/ingestion/ingestion-dev.sh. Verified data shows in DataHub.
4. Ran step 1 again.
5. Ran ./docker/ingestion/ingestion.sh. Verified data shows in DataHub.
v0.6.1
2020-12-02 17:40:12 -08:00
Nagarjuna Kanamarlapudi
a1e7e26e08
Fix dataset index creation issue (#2022) 2020-11-30 18:33:06 -08:00
Harsh Shah
06cca77a6b
feat(models): Add DataFlow and DataJob models (#2008)
Add DataFlow and DataJob urn and aspect models.
The aspects include metadata associated with Data processing flow and job entities,
associated inputs and outputs of a job.
2020-11-30 17:48:29 -08:00
John Plaisted
8ab4e930c2
Add a CODEOWNERS file. (#2021)
After this we can enforce that reviews must be approved by people on this list.
2020-11-30 17:02:50 -08:00
Charlie Tran
c739ab4f22
fix/docs(frontend): Syncs UI with internal frontend (#2009)
* Init new and improved in repo documentation for open source and github
{COMMIT-SYNC:dca335ff397c3e9c1283d845cef56fdb42f7853b}

* [META-13251] Initial implementation for button to integrate with wintermute
{COMMIT-SYNC:491b2bf5bc91137a2919370e81936033be15b13a}

* Quickfix for UMP Datasets not showing the correct owner link
{COMMIT-SYNC:26eb64cca78f96ed383994b7e67dd4fc8a99d985}

* Updates history of datahub documentation
{COMMIT-SYNC:306b2c72f3a81e699c8b414a8aaf4dcbfdf1ed31}

Co-authored-by: Ignacio Bona <ibonapiedrabuena@linkedin.com>
2020-11-26 06:56:18 -08:00
Mars Lan
1a64d6364d
docs: upload updated deck (#2016)
* Delete Taming the Data Beast Using DataHub.pdf

* Add files via upload
2020-11-26 04:57:58 -08:00
Mars Lan
6fdea6391d
docs: update links (#2015)
* Update README.md

* Update links.md
2020-11-26 04:56:11 -08:00
Mars Lan
0cee8d90c2
Update links.md 2020-11-25 19:39:12 -08:00
Mars Lan
0754e26940
Update README.md 2020-11-25 19:38:24 -08:00
Mars Lan
90b09afbc1
Add files via upload 2020-11-25 19:34:46 -08:00
Mars Lan
8fad457330
Update debugging.md 2020-11-25 16:55:32 -08:00
Mars Lan
bc26b1186d
Update links.md 2020-11-25 14:10:14 -08:00
Mars Lan
7f6720f90c
Update README.md 2020-11-25 12:08:18 -08:00
Nagarjuna Kanamarlapudi
8707bb06a8
Townhall agenda for December 4 (#2011)
* Townhall agenda for December 4
2020-11-25 11:48:09 -08:00
Kerem Sahin
4d8320e4a0
feat(dashboard): Dashboards backend implementation (#1884) 2020-11-23 09:25:58 -08:00
Nagarjuna Kanamarlapudi
5d083143db
feat(dataset): Enable search of datasets by field names (#2001)
* feat(dataset): Enable search of datasets by field names
2020-11-20 12:01:07 -08:00
John Plaisted
6f59a91865
Bump GMA to latest (#2007)
Involves fixing some types. Bright side is that ModelValidation actually checks aspects now (it did not before because aspects are not records but unions!).
2020-11-18 14:07:48 -08:00
cptran777
89caefe268 Init new and improved in repo documentation for open source and github
{COMMIT-SYNC:dca335ff397c3e9c1283d845cef56fdb42f7853b}
2020-11-17 15:44:26 -08:00
shakti-garg-saxo
808330f7f8
feat(kubernetes): Add pod-level annotations to the datahub helm charts (#2005)
* 2003 | add pod annotations

* 2003 | update documentation and bump up chart version

Co-authored-by: shakti-garg <shakti.garg@gmail.com>
2020-11-17 15:20:55 -08:00
Ignacio Bona
5fc77f2e61 OS build improvements (1minute)
{COMMIT-SYNC:01040678c188091b7f5ccea77e1ec8282d4be142}
2020-11-17 12:38:21 -08:00
Ignacio Bona
96af9c7ed6 OS Fixes: Hand tailor types, merge dataset-entity, removed unused types
{COMMIT-SYNC:8532a92fef64760adebfc54fa2ad4c3acf361b41}
2020-11-17 12:38:21 -08:00
Ignacio Bona
f043d414a4 Automatic pr creation
{COMMIT-SYNC:7f757e3a514fdeff1de922112f182386bd322228}
2020-11-17 12:38:21 -08:00
shakti-garg-saxo
1351de4599
1995 | fix indentation value in helm deployment templates (#2004)
Co-authored-by: shakti-garg <shakti.garg@gmail.com>
2020-11-17 07:29:59 -08:00
Mars Lan
f5891a733a
Update links.md 2020-11-13 21:19:17 -08:00
shakti-garg-saxo
7ed6eb8845
Update doc for configuring topic names (#1999)
1. Add topic-name configuration for each component
2. How to configure them in the deployable
2020-11-13 21:08:48 -08:00
Arun Vasudevan
f738f7b7a9 Doc fix 2020-11-13 16:16:24 -08:00
Arun Vasudevan
89960a0a27 Exposing Neo4j Driver Configs 2020-11-13 16:16:24 -08:00
Ignacio Bona Piedrabuena
eb8b4a357e
Update townhall-history.md 2020-11-12 14:40:35 -08:00
Ignacio Bona Piedrabuena
5b5bfbef25
(doc) Adding DataHub UI overview presentation 2020-11-12 14:38:53 -08:00
Shirshanka Das
41e40b6994
Update townhall-history.md
Add SCSI talk link
2020-11-12 11:40:34 -08:00
Shirshanka Das
f511404cd0
Adding SCSI talk slides 2020-11-12 11:37:39 -08:00
Shirshanka Das
9e881f8ece
Update townhall-history.md
Add link to use-cases talk slides
2020-11-12 11:31:32 -08:00
Shirshanka Das
99291ec03b
Adding Metadata Use-Cases lightning talk slides 2020-11-12 11:27:53 -08:00
Kerem Sahin
733893f5f9
feat(dashboard): Dashboard models update (#1932)
* feat(dashboard): Dashboard models update

* Keep chartId/dashboardId fields in the URN definitions and add fields for chartURL/dashboardURL into info aspects

* Rebase and address some comments
2020-11-12 11:17:22 -08:00
John Plaisted
32e18a1fc7
fix: fix build definition of DatasetFieldUrn (#1991) 2020-11-11 16:54:05 -08:00
John Plaisted
60e43061d8
[Breaking] Update to GMA 0.2.0 and fix Urn definitions. (#1977)
Urn definitions needed to be updated since 0.2.0 changed the base Urn class. 

I also added some more urn coercers that were missing.
2020-11-11 16:06:29 -08:00
Jyoti Wadhwani
70ddb09d29
feat: enable SCSI for datasets (#1986)
* enable SCSI for datasets

* Update scsi-onboarding-guide.md
2020-11-11 13:04:20 -08:00
Ignacio Bona Piedrabuena
3407dab090
2020-10-10 Syncronizing datahub-web {COMMIT-SYNC:7f757e3a514fdeff1de922112f182386bd322228} (#1989) 2020-11-10 17:34:31 -08:00
Nagarjuna Kanamarlapudi
7d574d1094
feat(field-level-lineage): Add models for field level lineage (#1936)
* feat(field-level-lineage): adding models for field level lineage

adding models for field level lineage. Introduce DatasetFieldUrn as a unique identifier for dataset field
2020-11-09 14:08:48 -08:00
Ignacio Bona Piedrabuena
89c78551cc
1604086049622-ui-sync (#1981)
Co-authored-by: Ignacio Bona <ibonapiedrabuena@linkedin.com>
2020-11-09 12:17:51 -08:00
Nagarjuna Kanamarlapudi
bf361a6355
Updates to town hall history and next town hall (#1988)
* Updates to townhall history and next townhall

* Update README.md

Co-authored-by: Mars Lan <mars.th.lan@gmail.com>
2020-11-09 10:29:32 -08:00
shakti-garg-saxo
66a10a8f99
docs: update UI credential requirement for Quickstart (#1987)
* update UI credential requirement for Quickstart

* Update quickstart.md

Co-authored-by: Mars Lan <mars.th.lan@gmail.com>
2020-11-07 17:54:52 -08:00
Jyoti Wadhwani
1516a9ad78
Update README.md 2020-11-05 16:12:38 -08:00
Madhu Podila
0527e483bc
feat(business-glossary):RFC for Business Glossary (#1842)
* added intial version of Business Glossary RFC

* slight updates to Business Glossary RFC

* slight updates to Business Glossary RFC

* updated the sample business glossary table

* updated the sample business glossary content

* updated the sample business glossary content

* Update README.md

updated the attribute of Business Term

* incorporated feedback around DatasetFieldUrn and relationships

* Update README.md

updated the alternatives section

* Update README.md

incorporated the feedback provided by Nagarjuna

* Update README.md

Incorporated feedback 
a) removal of redundant uri attribute from BusinessTerm definition
b) rolled back changes to SchemaField (as per feedback) and will continue to use fieldPath is unique identified for a field in a given Dataset

* Incrporated the review comments and Discussion Feedck

* Fix teh RFC link

* Fix teh RFC link
2020-11-05 14:56:28 -08:00