3044 Commits

Author SHA1 Message Date
Yi Wang
44807f5f7e Fix the incompatibility between integer repo_id in db and string field in record 2016-08-10 17:24:03 -07:00
Yi Wang
11158e0b9f Remove unnecessary log entries from kafka processor 2016-08-10 11:23:39 -07:00
Yi Wang
bc276274ff Use FileWriter to write records into CSV file 2016-08-10 11:20:31 -07:00
Yi Wang
83834e4e88 Add fetching acl owner info from svn, also change some property names. 2016-08-10 09:11:37 -07:00
jerrybai2009
162892a9e8 Merge pull request #203 from jerrybai2009/master
implement the dataset accessiblities section
2016-08-09 16:25:57 -07:00
Yi Wang
4689160dbb Extract commit hash use it when querying acl 2016-08-09 13:05:59 -07:00
jbai
45c528c9d9 implement the dataset accessiblities section 2016-08-09 11:34:46 -07:00
Yi Wang
830413e122 Add multiproduct and git repo metadata etl job 2016-08-08 21:28:37 -07:00
Eric Sun
cd4853d0a5 Use ProcessBuilder and redirected log file for HDFS Extract (#198)
* Use ProcessBuilder and redirected log file for HDFS Extract

* relax urn validation rule
2016-08-08 14:02:34 -07:00
jerrybai2009
39cec22e25 Merge pull request #197 from jerrybai2009/master
upgrade ember from 1.12 to 2.6.2
2016-08-05 09:52:03 -07:00
jbai
5305124d8c fix the bind-attr and wrong flow link in search result issues 2016-08-04 18:46:33 -07:00
jbai
23910971a9 upgrade ember from 1.12 to 2.6.2 2016-08-04 17:44:47 -07:00
Eric Sun
c4d1605a0c Merge pull request #196 from alyiwang/master
Modify Kafka Master to handle more than one Kafka connection configurations
Add additional error handling when starting the service

With this change in place.
- each Kafka Zookeeper requires a corresponding entry defined in wh_etl_job
- the connection info (such as Zookeeper, SchemaRegistery, topic to staging table mapping...) are configured in wh_etl_job_property
- kafka.consumer.etl.jobid in application.conf will determine if such Kafka job will be launched when the backend-service starts
2016-08-04 14:26:59 -07:00
Yi Wang
c0cfe1f5ca Modify KafkaConsumerMaster to handle more than one kafka config, add error handling 2016-08-04 13:07:19 -07:00
Eric Sun
ef584552be Merge pull request #194 from alyiwang/master
Get cluster info from cfg_cluster and format kafka events cluster field
2016-08-03 20:24:07 -07:00
Yi Wang
3d3b2a8075 Get kafka job id from applicatoin.conf and then get ref_id and configs from DB 2016-08-03 18:55:07 -07:00
Yi Wang
dbbdb6e2fb Modify Oracle metadata ETL job, use Json dumps and remove unnecessary quotes 2016-08-03 18:49:00 -07:00
jerrybai2009
b4a718efd0 Merge pull request #195 from ericsun2/master
temp fix for hdfs_schema_crawler getRuntime().exec() hangs problem
2016-08-03 18:15:43 -07:00
jerrybai2009
e7c7175cba Merge pull request #188 from jerrybai2009/master
load the teradata and hadoop data into table dict_dataset_instance
2016-08-03 18:13:06 -07:00
Eric Sun
ca7542ca40 remove duplicate section 2016-08-03 17:52:49 -07:00
Eric Sun
53d40c8392 add a few new hdfs directory patterns 2016-08-03 16:16:58 -07:00
Eric Sun
1cd5872369 temp fix for hdfs_schema_crawler getRuntime().exec() hangs problem; exclude log4j 2016-08-03 15:50:00 -07:00
jerrybai2009
42431fea4b Merge pull request #192 from ericsun2/master
add log_dataset_instance_load_status
2016-08-01 15:07:06 -07:00
Eric Sun
8c9cb99ba4 primary_dataset_type for cfg_database 2016-08-01 13:20:04 -07:00
Eric Sun
67479eac58 log_dataset_instance_load_status 2016-08-01 13:00:39 -07:00
Eric Sun
3553ad6a38 Merge pull request #189 from ericsun2/master
add python module [requests] for simple REST client
2016-07-29 23:17:28 -07:00
Eric Sun
6355ccc039 add python module [requests] for simple REST client 2016-07-29 23:10:33 -07:00
jbai
ea1ac0da9f load the teradata and hadoop data into table dict_dataset_instance 2016-07-29 10:59:33 -07:00
Eric Sun
9d2c803f0c Merge pull request #187 from ericsun2/master
Add datacenter, deploymenttier, cluster info to better describe dataset instance
2016-07-28 17:22:32 -07:00
Eric Sun
6f32bae81a Merge pull request #186 from alyiwang/master
Add basic Oracle dataset metadata ETL job
2016-07-28 17:21:59 -07:00
jerrybai2009
32d5b7b3df Merge pull request #184 from jerrybai2009/master
merge the commit of wherehows datasets properties styling
2016-07-28 17:21:07 -07:00
Eric Sun
50c222d478 Avro schema for Kafka topics 2016-07-28 16:55:47 -07:00
Eric Sun
f745642212 add datacenter, deploymenttier, cluster to describe dataset instance 2016-07-28 16:38:03 -07:00
Yi Wang
74ed769bab add Oracle dataset metadata ETL job 2016-07-28 14:07:07 -07:00
jbai
6d9d571ed3 merge the commit of wherehows datasets properties styling 2016-07-27 16:44:18 -07:00
jerrybai2009
6eae923387 Merge pull request #183 from jerrybai2009/master
update the dataset versions when selected instance changed
2016-07-27 16:41:04 -07:00
jbai
6a1f6e6654 fix the cross web browser issue of button text 2016-07-26 18:23:14 -07:00
jbai
c1d6d8fd68 update the backgroud the version and instance buttons 2016-07-26 17:23:15 -07:00
jbai
85bc2db85c add try catch to catch the exception when reading the config properties 2016-07-26 16:53:30 -07:00
jbai
c0f261190b update the dataset versions when selected instance changed 2016-07-26 16:50:27 -07:00
Yi (Alan) Wang
4b1903abee Merge pull request #179 from alyiwang/master
Ingest Gobblin tracking events into wherehows by Kafka consumer client
2016-07-26 14:13:25 -07:00
Yi Wang
629eb04d92 Merge branch 'master' of https://github.com/linkedin/WhereHows 2016-07-26 12:22:47 -07:00
Yi Wang
7edacc9a9f get kafka config from wh_etl_job_property 2016-07-26 12:16:34 -07:00
jerrybai2009
e0dc182377 Merge pull request #181 from jerrybai2009/master
fix the issue which dataset api returns empty schema when user is not logged in
2016-07-26 11:16:46 -07:00
jbai
cbedae7916 fix the issue which dataset api returns empty schema when user is not logged in 2016-07-26 10:46:05 -07:00
jerrybai2009
27e3962a6f Merge pull request #180 from jerrybai2009/master
show dataset instance when it is availalbe
2016-07-26 10:37:40 -07:00
jbai
6de4945478 show dataset instance when it is availalbe 2016-07-25 17:19:12 -07:00
Yi Wang
6d4706bc62 Ingest Gobblin tracking events into wherehows using Kafka consumer client 2016-07-25 15:03:29 -07:00
jbai
97a68b9291 update the dataset schema json view as selected version of dataset changed 2016-07-22 16:06:24 -07:00
jbai
0f5124579c fix the issue of datasetSchemaRecord expected 11 args but got 9 2016-07-21 17:39:22 -07:00