30 Commits

Author SHA1 Message Date
Mars Lan
0637578aa6 Remove deprecated code (#666) 2017-08-11 16:45:50 -07:00
Mars Lan
4b4cae2148 Add base class for all pure jython ETLs (#663) 2017-08-10 17:40:19 -07:00
Mars Lan
ac25412cb7 Move java plugin back to applicable subprojects, instead of applying it broadly (#646) 2017-08-08 17:12:57 -07:00
Mars Lan
00b8e6898f Update README.md 2017-07-26 16:47:31 -07:00
Mars Lan
2967489f18 Update ETL docs. 2017-07-26 16:32:44 -07:00
Yi (Alan) Wang
761210b645 Add Util function to parse boolean parameter for ETL jobs (#620) 2017-07-26 13:25:14 -07:00
Yi (Alan) Wang
5e0d2c01cf Fix Oracle ETL job, rewrite data sampling (#612) 2017-07-24 11:11:03 -07:00
Yi (Alan) Wang
6a8fe66ef9 Add active, deprecated flag to dataset, modify ETL to update is_active (#604) 2017-07-19 17:07:28 -07:00
Yi (Alan) Wang
3138185068 Remove some LinkedIn specific ETL jobs (#601) 2017-07-18 16:39:45 -07:00
Mars Lan
1b23133d44 Rename Readme.md to README.md 2017-07-17 20:08:09 -07:00
Mars Lan
b442bb693b Rename readme.md to Readme.md 2017-07-17 20:07:50 -07:00
Mars Lan
951f0571bc Update readme.md 2017-07-17 20:07:38 -07:00
Mars Lan
c7b6fd1688 Move TMS restli related code out of open source. (#587)
Add skeleton for a generalized DAO framework.
2017-07-10 13:44:35 -07:00
Mars Lan
86b815eadf Fix broken Travis build. 2017-07-10 13:44:35 -07:00
Mars Lan
a950cdbc1a Fix the bug where MySQL credential isn't properly passed to DatabaseWriter. (#585) 2017-07-10 13:44:35 -07:00
Mars Lan
8058734917 Fix the last few instances whewre username & password are specified via manual construction of url. (#584) 2017-07-10 13:44:35 -07:00
Mars Lan
33f49c5f55 Fix more places where connection URLs are manually constructed. (#579) 2017-07-10 13:44:35 -07:00
Mars Lan
af37b3c39f Rely on job file specifying kerberos.keytab.file using absolute path. (#578) 2017-07-10 13:44:35 -07:00
Mars Lan
932aca7f05 Call the correct method to create connection instead of buliding the connection string manually. (#577) 2017-07-10 13:44:35 -07:00
Mars Lan
9dca733d76 Move elastic-serach related props from wh_property table to job files. (#574)
This will break FlowTreeBuilder & DatasetTreeBuilder, both are not being used anyway.
2017-07-10 13:44:34 -07:00
Mars Lan
d2d92367f9 Also include TD table with O kind (no primary key) during ETL. (#571) 2017-07-10 13:44:34 -07:00
Mars Lan
e411aecf2b Missed another TD ut_ table filter. (#570) 2017-07-10 13:44:34 -07:00
Mars Lan
9f39f1f380 Remove the filter for TD ut_* tables as we do need the schema for them. (#569) 2017-07-10 13:44:34 -07:00
Mars Lan
68bb73f4a7 Remove the custom access count filter for DM_BIZ etc. (#568) 2017-07-10 13:44:34 -07:00
Mars Lan
75f57c6ac0 Remove some of the LI-speicifc ETLs. (#559) 2017-07-10 13:44:33 -07:00
Mars Lan
53a30d5a77 Major refactoring of ETL scheduling & configuration (#542)
* Major refactoring
- Move job-spcific properties from wh_etl_job_property table to .job files
- Use the job file name instead of numeric IDs to identify ETL jobs
- Use reflection to create ETL job class at run time instead of relying hard-coded enums
- Drop ETL job-related APIs as they're no longer needed
- Drop wh_etl_job, wh_etl_job_property, wh_etl_job_execution tables
- Add wh_etl_schedule & wh_etl_history tables
2017-07-10 13:44:33 -07:00
Yi (Alan) Wang
fcbde02b37 Stop fetching EI/DEV-only Espresso DB (#522) 2017-07-10 13:42:59 -07:00
Yi (Alan) Wang
675dadd374 Kafka ETL to fetch queuing pipeline, also add topic blacklist (#509) 2017-07-10 13:42:57 -07:00
Yi (Alan) Wang
225d1fc6ec Modify HIVE ETL to commit often (#505) 2017-07-10 13:42:57 -07:00
Mars Lan
5f5c0937d1 Rename web, backend-service (#490)
* Rename web to wherehows-api and update README.

* Rename backend-service to wherehows-backend

* Rename metadata-etl to wherehows-etl

* Rename hadoop-dataset-extractor-standalone to wherehows-hadoop
2017-07-10 13:42:56 -07:00