117 Commits

Author SHA1 Message Date
SunZhaonan
6b024196cd Fix Hive extract disorder bug. Add Hive database optional whitelist params 2016-03-16 19:09:52 -07:00
SunZhaonan
4574e89de9 Fix bug of sample data schema inconsistant. Add clean up. Parameterize number of Actor 2016-02-22 17:40:07 -08:00
SunZhaonan
dfeefba213 Parameterize dataset source derived process. 2016-02-17 16:06:24 -08:00
SunZhaonan
de4d4cd0c1 Add documentation on important Constants and Classes 2016-02-12 16:57:12 -08:00
SunZhaonan
db593bb3bd Add some sample properties in template 2016-02-09 16:27:31 -08:00
SunZhaonan
b5d7c38b7d Eclipse integration. Resolve circular dependency. wherehows-common-test configure. 2016-02-09 15:50:49 -08:00
SunZhaonan
32e2547035 Add license to the new files 2015-12-21 14:10:30 -08:00
Zhen Chen
7fca60a527 add ref flow id for sub flow jobs 2015-12-17 16:26:15 -08:00
Zhen Chen
d6f77a6f06 fix ldap org hierachy is empty bug and minor changes 2015-12-16 17:01:55 -08:00
SunZhaonan
cd44daba5d Merge with master 2015-12-16 15:54:50 -08:00
Zhen Chen
3c51365e12 parse the xml format of gitorious project page instead of the html page 2015-12-15 11:44:05 -08:00
SunZhaonan
07c46304b5 Fix bug of duplicate field loading. Fix bug of subflow process in azkaban lineage ETL. 2015-12-11 19:46:35 -08:00
Zhen Chen
af04ff6efc add git file commit history etl 2015-12-11 11:02:29 -08:00
Zhen Chen
ebbf9ec629 add ldap user and group metadata etl 2015-12-10 16:26:57 -08:00
Zhen Chen
5a08134b8d add dataset owner metadata etl 2015-12-07 15:17:01 -08:00
SunZhaonan
b83c0a4322 Exclude tests that need configurations 2015-11-19 14:56:20 -08:00
SunZhaonan
d5c3d87d00 Initial commit 2015-11-19 14:39:21 -08:00