Douglas Moore
d44c194529
Backend service readme ( #215 )
...
* Update README.md
* Update README
* Rename README to README.me
* Rename README.me to README.md
* Update README.md
2016-09-02 09:31:52 -07:00
Yi (Alan) Wang
579b8fc9d7
Add metadataChangeEvent APIs to backend-service ( #205 )
...
* Add multiproduct and git repo metadata etl job
* Extract commit hash use it when querying acl
* Use FileWriter to write records into CSV file
* Remove unnecessary log entries from kafka processor
* Fix the incompatibility between integer repo_id in db and string field in record
* merge API tables to existing dataset owner and schema field table
* Add confidential and recursive column to dict_dataset_field
2016-08-24 09:10:35 -07:00
Eric Sun
cd4853d0a5
Use ProcessBuilder and redirected log file for HDFS Extract ( #198 )
...
* Use ProcessBuilder and redirected log file for HDFS Extract
* relax urn validation rule
2016-08-08 14:02:34 -07:00
Yi Wang
c0cfe1f5ca
Modify KafkaConsumerMaster to handle more than one kafka config, add error handling
2016-08-04 13:07:19 -07:00
Yi Wang
3d3b2a8075
Get kafka job id from applicatoin.conf and then get ref_id and configs from DB
2016-08-03 18:55:07 -07:00
Eric Sun
ca7542ca40
remove duplicate section
2016-08-03 17:52:49 -07:00
Eric Sun
1cd5872369
temp fix for hdfs_schema_crawler getRuntime().exec() hangs problem; exclude log4j
2016-08-03 15:50:00 -07:00
Eric Sun
8c9cb99ba4
primary_dataset_type for cfg_database
2016-08-01 13:20:04 -07:00
Eric Sun
9d2c803f0c
Merge pull request #187 from ericsun2/master
...
Add datacenter, deploymenttier, cluster info to better describe dataset instance
2016-07-28 17:22:32 -07:00
Eric Sun
f745642212
add datacenter, deploymenttier, cluster to describe dataset instance
2016-07-28 16:38:03 -07:00
Yi Wang
74ed769bab
add Oracle dataset metadata ETL job
2016-07-28 14:07:07 -07:00
Yi Wang
7edacc9a9f
get kafka config from wh_etl_job_property
2016-07-26 12:16:34 -07:00
Yi Wang
6d4706bc62
Ingest Gobblin tracking events into wherehows using Kafka consumer client
2016-07-25 15:03:29 -07:00
jbai
7a77aba4b7
merge the pull request 165 to master branch
2016-07-21 10:38:36 -07:00
Naga Srinivas Vemuri
97370ed2e1
Query Dataset properties to retrieve datasetUrns
2016-07-21 11:54:47 +05:30
jbai
6af54658d6
merge Fetching dataset watchers via get /dataset/watchers to main branch
2016-06-30 10:20:54 -07:00
Eric Sun
1573fdb212
rename hive dependency to hive_exec; reuse metadata-etl/extralibs; test travis ci;
2016-06-28 18:03:02 -07:00
Rafal Kluszczynski
cc13379075
fix: use logback logging provided by play framework (exclude log4j binding)
2016-06-27 11:01:37 +02:00
jbai
366f2f61d8
fix the backend service build failed issue
2016-06-16 17:33:02 -07:00
jbai
6974ae26ae
fix the gradlew check failed issue and make the cluster name input is mandatory
2016-06-15 11:12:20 -07:00
jbai
9705a07ad8
provide the dataset dependency api
2016-06-14 16:17:24 -07:00
jbai
e7885f28db
comment out the flow tree builder since UI does not use it anymore
2016-06-13 10:15:32 -07:00
Rafal Kluszczynski
5f25e6a6b5
refactor: test assertions more readable
2016-05-30 12:39:15 +02:00
Rafal Kluszczynski
e2765dc3a2
chore: execute tests during backend-service module gradle build
2016-05-30 11:10:25 +02:00
Rafal Kluszczynski
14e5ec497d
test: check properties file and etl job command generation
2016-05-30 11:09:36 +02:00
Rafal Kluszczynski
d06bfbfbd0
fix: use correctly directory path from properties when executing etl job
2016-05-30 08:59:05 +02:00
Rafal Kluszczynski
8d43d3699f
Merge branch 'master' into usingGradleWrapper
2016-05-23 10:08:59 +02:00
SunZhaonan
0b5c421311
Fix Hive column parser parent path bug
2016-05-19 16:36:30 -07:00
Rafal Kluszczynski
856849abd1
updating documentation
2016-05-16 14:58:49 +02:00
SunZhaonan
9d6a1b2649
Add optional config of ETL job white list
2016-05-12 16:28:23 -07:00
SunZhaonan
31de21ddcf
pass parameter through file.
2016-05-03 16:25:56 -07:00
SunZhaonan
c66b00e2f6
Fix dataset insert API bug. Fix load sql bug.
2016-03-28 16:27:43 -07:00
SunZhaonan
a0b7cb9d57
Fix process hanging bug. Add hive field ETL process.
2016-03-16 19:12:21 -07:00
SunZhaonan
aff8f323e4
Scheduler check previous job is finished. Redirect remote outputstream into log. Fix avro parser bugs
2016-03-16 19:09:53 -07:00
SunZhaonan
4b3b344b96
enable travis
2016-03-15 16:20:04 -07:00
SunZhaonan
c4671d2579
Add field comments ETL
...
Fix API bug of tech_matrix_id
Add key in comment table
2016-03-14 14:23:33 -07:00
SunZhaonan
5e9ae37952
Change to multi processing instead of multi thread. Fix hive ETL bug
2016-02-29 16:37:03 -08:00
SunZhaonan
8a76f9f931
Move test config file to a external file
2016-02-29 16:37:03 -08:00
SunZhaonan
033a28faee
Backend job property API. 'update' change to 'insert on duplicate update'
2016-02-17 16:42:54 -08:00
SunZhaonan
b5d7c38b7d
Eclipse integration. Resolve circular dependency. wherehows-common-test configure.
2016-02-09 15:50:49 -08:00
SunZhaonan
c3da00003e
Fix bugs. Reenforce logging. Format jython scripts. Add missing table DDL.
2016-02-03 19:22:18 -08:00
SunZhaonan
05d54b3070
Add Hive metadata ETL process
2015-12-21 12:07:08 -08:00
SunZhaonan
cd44daba5d
Merge with master
2015-12-16 15:54:50 -08:00
SunZhaonan
07c46304b5
Fix bug of duplicate field loading. Fix bug of subflow process in azkaban lineage ETL.
2015-12-11 19:46:35 -08:00
Zhen Chen
e21a6b8e75
remove unused variable, change load script and add etl type in backend service
2015-12-11 13:52:23 -08:00
Zhen Chen
ebbf9ec629
add ldap user and group metadata etl
2015-12-10 16:26:57 -08:00
Zhen Chen
5a08134b8d
add dataset owner metadata etl
2015-12-07 15:17:01 -08:00
SunZhaonan
d5c3d87d00
Initial commit
2015-11-19 14:39:21 -08:00