Mars Lan 35c0781f59 Use WH_APP_FOLDER property as the base directory for temp files for various ETL jobs (#451)
* Use WH_APP_FOLDER property as the base directory for temp files in Oracle ETL, instead of the full path defined in wh_etl_job_property, which is often /var/tmp/{something}.

* Move common code to FileUtil.py and move Voldamort's temp files too.

* Move Kafaka ETL temp files.

* Move Espresso ETL temp files.

* Move Multiproduct ETL temp files.

* Move CodeSearch ETL temp files.

* Move teradata ETL temp files.
2017-07-10 13:42:51 -07:00
2016-06-22 21:21:19 -07:00
2015-11-19 14:39:21 -08:00
2017-07-10 09:54:08 -07:00
2016-10-19 17:42:28 -07:00
2016-10-19 17:42:28 -07:00
2015-11-19 14:39:21 -08:00
2015-12-11 11:02:29 -08:00

WhereHows Build Status Gitter PRs Welcome

WhereHows is a data discovery and lineage tool built at LinkedIn. It integrates with all the major data processing systems and collects both catalog and operational metadata from them.

Within the central metadata repository, WhereHows curates, associates, and surfaces the metadata information through two interfaces:

  • a web application that enables data & linage discovery, and community collaboration
  • an API endpoint that empowers automation of data processes/applications

WhereHows serves as the single platform that:

  • links data objects with people and processes
  • enables crowdsourcing for data knowledge
  • provides data governance and provenance based on ownership and lineage

Documentation

The detailed information can be found in the Wiki

Examples in VM

There is a pre-built vmware image (about 11GB) to quickly demonstrate the functionality of WhereHows. Check out the VM Guide

Getting Started

New to Wherehows? Check out the Getting Started Guide

Preparation

First, please setup the metadata repository in MySQL.

CREATE DATABASE wherehows
  DEFAULT CHARACTER SET utf8
  DEFAULT COLLATE utf8_general_ci;

CREATE USER 'wherehows';
SET PASSWORD FOR 'wherehows' = PASSWORD('wherehows');
GRANT ALL ON wherehows.* TO 'wherehows'

Execute the DDL files to create the required repository tables in wherehows database.

Build

  1. Get the source code: git clone https://github.com/linkedin/WhereHows.git
  2. Put a few 3rd-party jar files to metadata-etl/extralibs directory. Some of these jar files may not be available in Maven Central or Artifactory. See the download instrucitons for more detail. cd WhereHows/metadata-etl/extralibs
  3. From the WhereHows root directory and build all the modules: ./gradlew build
  4. Start the metadata ETL and API service: ./gradlew backend-service:runPlayBinary
  5. In a new terminal, start the web front-end: ./gradlew web:runPlayBinary. The WhereHows UI is available at http://localhost:9000 by default. You can change the port number by editing the value of project.ext.httpPort in web/build.gradle.

Contribute

Want to contribute? Check out the Contributors Guide

Community

Want help? Check out the Gitter chat room and Google Groups

Description
The Metadata Platform for your Data and AI Stack
Readme Apache-2.0 1.1 GiB
Languages
Java 41%
Python 29%
TypeScript 28.1%
JavaScript 1.1%
Shell 0.2%
Other 0.2%