12076 Commits

Author SHA1 Message Date
Na Zhang
1962f0a477 mark SCM users confirmed by system automatically 2016-11-11 11:12:28 -08:00
camelliazhang
2f70909564 Merge pull request #273 from camelliazhang/master
update the score table during elastic search dataset update
2016-11-11 11:10:01 -08:00
Na Zhang
2facf409b2 update the score table during elastic search dataset update 2016-11-11 10:09:31 -08:00
camelliazhang
b52ca80217 Merge pull request #271 from camelliazhang/master
add exception handling for DATABASE_SCM_METADATA_ETL and collect info
2016-11-09 10:06:54 -08:00
Yi Wang
b4f5e438e2 Add JobExecutionLineageEvent and kafka processor 2016-11-08 19:11:37 -08:00
Stephane Maarek
a32a0e4b7f last_modified column has a default for the table cfg_database (#269)
@alyiwang will also touch a few other cfg_??? tables for the default values
2016-11-08 17:45:27 -08:00
Na Zhang
725e689326 add exception handling for DATABASE_SCM_METADATA_ETL and collect info 2016-11-08 17:37:36 -08:00
camelliazhang
d911b14d18 Merge pull request #270 from camelliazhang/master
search ranking improvement with static boosting
2016-11-08 17:30:29 -08:00
Na Zhang
217b7d9d09 search ranking improvement with static boosting 2016-11-08 15:18:51 -08:00
Eric Sun
7b36d09b58 Add get_schema_literal_from_url() to fetch schema literal based on schema url (#268)
* use schema_url_helper to fetch avro schema from hdfs or http location

* trim space

* add dfs.namenode.kerberos.principal.pattern; include htrace for SchemaUrlHelper
2016-11-07 08:14:45 -08:00
Yi (Alan) Wang
e34bbcc629 Update README.md (#264) 2016-11-02 13:48:22 -07:00
Yi (Alan) Wang
dca47a3b75 Merge pull request #254 from alyiwang/master
Upgrade to play 2.4.8
2016-10-20 13:18:58 -07:00
Yi Wang
dec851c272 Update README.md 2016-10-20 11:26:23 -07:00
Yi Wang
623b1f733a Change web scalaVersion to 2.10.6, increase SBT heap size to compile 2016-10-20 10:50:42 -07:00
Douglas Moore
53f6622ed8 Update README.md (#252)
Remove backlink to my github account.
2016-10-19 18:37:40 -07:00
Yi Wang
664e4072bb Upgrade to play 2.4.8 2016-10-19 17:42:28 -07:00
Yi (Alan) Wang
3aaba33e75 Merge pull request #253 from camelliazhang/master
Get Espresso and Oracle owners from SCM, and a  bug fix for Teradata
2016-10-19 14:49:07 -07:00
Na Zhang
dbaf053e76 Add local test properties template for teradata and scm owners ETL 2016-10-19 14:10:29 -07:00
Na Zhang
043dc25e89 Get owners for espresso and oracle, and fix a bug for teradata 2016-10-19 11:13:32 -07:00
Yi (Alan) Wang
bf6ec00c99 Merge pull request #250 from alyiwang/master
Login authentication support multiple LDAP servers, add login history
2016-10-13 15:42:00 -07:00
Yi Wang
7b625a7dab Add example of multiple LDAP config usage 2016-10-13 15:22:56 -07:00
Yi Wang
3227412339 Login authentication support multiple LDAP servers, add login history 2016-10-13 14:30:43 -07:00
Yi (Alan) Wang
3bdaff934f Merge pull request #247 from alyiwang/master
Update Kafka consumer actors to reduce memory usage
2016-10-12 21:31:44 -07:00
Yi Wang
fcd6cf149e Update MetastoreAuditProcessor to reduce storage, also refactor some code 2016-10-11 11:26:36 -07:00
Yi Wang
5049c847fa Update Kafka consumer actors to reduce memory usage 2016-10-10 14:49:14 -07:00
Yi (Alan) Wang
286b93d7e8 Merge pull request #244 from alyiwang/master
Update Azkaban_Execution job to fetch cronExpression in flow scheduling
2016-10-06 17:22:10 -07:00
Yi Wang
c9f4f18d9c Update Azkaban_Execution job to fetch cronExpression in flow scheduling 2016-10-06 13:43:10 -07:00
Yi (Alan) Wang
c9dfb637af Update MetadataChangeEvent APIs according to schema change (#243)
* Update MetadataChangeEvent APIs according to schema change

* Update MultiproductLoad to reflect new Owner types

* Add comments for Owner_type precedence (priority) and compliance
2016-10-06 13:33:45 -07:00
Yi Wang
0356497124 Add comments for Owner_type precedence (priority) and compliance 2016-10-06 13:24:29 -07:00
Yi Wang
8ab5c824b0 Update MultiproductLoad to reflect new Owner types 2016-10-03 18:39:21 -07:00
Yi Wang
b74d58a33f Update MetadataChangeEvent APIs according to schema change 2016-10-03 10:56:23 -07:00
camelliazhang
fe1e698b8a remove hive instance hardcode cluster name (#236) 2016-09-30 17:15:43 -07:00
Na Zhang
10339690a9 Update HiveTransform and HiveLoad, remove hardcoded cluster name 2016-09-30 16:59:59 -07:00
Eric Sun
fd3b4baef8 avoid loop in LDAP org hierarchy (#242) 2016-09-30 16:45:38 -07:00
jerrybai2009
5f0426ea6b using the dynamic cursor to reduce the memory usage (#241) 2016-09-30 16:45:17 -07:00
jerrybai2009
2f1b31df35 Merge pull request #240 from jerrybai2009/master
tracking the GobblinTrackingEvent_autit to get owner information
2016-09-29 16:13:14 -07:00
jbai
a11e4908dc tracking the GobblinTrackingEvent_autit to get owner information 2016-09-29 15:01:32 -07:00
jerrybai2009
137d48a9b0 Merge pull request #237 from alyiwang/master
Update Kafka processor casting Object to String
2016-09-26 16:44:39 -07:00
Yi Wang
ac34eb683f Update Kafka processor casting Object to String, also add debug info if can't fetch schema from Registery 2016-09-26 15:06:33 -07:00
Na Zhang
5c76f47313 remove hive instance hardcode cluster name 2016-09-26 15:06:30 -07:00
Yi (Alan) Wang
bc06c1a882 Merge pull request #234 from alyiwang/master
logback redirect ETL job logs into corresponding files
2016-09-24 14:43:34 -07:00
Yi Wang
1ad2b1528e logback redirect ETL job logs into corresponding files 2016-09-23 16:54:52 -07:00
Yi (Alan) Wang
753de7de7c Merge pull request #233 from alyiwang/master
Update backend APIs to cast SQL results back to Java record then to Json
2016-09-21 08:59:04 -07:00
Eric Sun
89ff794ddf Add api to get dependents of a dataset (#232)
* Use ProcessBuilder and redirected log file for HDFS Extract

* relax urn validation rule

* continue process if hive sql parsor encounters error

* reformat etl job log message

* add API to find dataset dependents, such as which hive tables are based on an hdfs path
2016-09-21 08:55:44 -07:00
Yi Wang
be65efb0cc Update backend APIs to cast SQL results back to Java record then serialize to Json reply 2016-09-20 18:56:49 -07:00
jerrybai2009
9db27f7048 Merge pull request #229 from jerrybai2009/master
update the dataset ownership information
2016-09-20 14:02:35 -07:00
Yi (Alan) Wang
c9a9d3a330 Merge pull request #231 from alyiwang/master
Add REST proxy for Security API from backend to web
2016-09-19 21:36:26 -07:00
Yi Wang(Data Infrastructure)
1171e00097 Add REST proxy for Security API from backend to web 2016-09-19 18:14:10 -07:00
jbai
44df5dd08a update the dataset ownership information 2016-09-19 16:10:23 -07:00
Daniel Grove
937637c3ea Add Database to create_tables readme (#225)
* Add Database to create_tables readme

* Add -D to signify database
2016-09-16 09:18:49 -07:00