datahub

mirror of https://github.com/datahub-project/datahub.git synced 2025-10-23 06:54:45 +00:00

Author	SHA1	Message	Date
Douglas Moore	d44c194529	Backend service readme (#215 ) * Update README.md * Update README * Rename README to README.me * Rename README.me to README.md * Update README.md	2016-09-02 09:31:52 -07:00
Yi (Alan) Wang	579b8fc9d7	Add metadataChangeEvent APIs to backend-service (#205 ) * Add multiproduct and git repo metadata etl job * Extract commit hash use it when querying acl * Use FileWriter to write records into CSV file * Remove unnecessary log entries from kafka processor * Fix the incompatibility between integer repo_id in db and string field in record * merge API tables to existing dataset owner and schema field table * Add confidential and recursive column to dict_dataset_field	2016-08-24 09:10:35 -07:00
Eric Sun	cd4853d0a5	Use ProcessBuilder and redirected log file for HDFS Extract (#198 ) * Use ProcessBuilder and redirected log file for HDFS Extract * relax urn validation rule	2016-08-08 14:02:34 -07:00
Yi Wang	c0cfe1f5ca	Modify KafkaConsumerMaster to handle more than one kafka config, add error handling	2016-08-04 13:07:19 -07:00
Yi Wang	3d3b2a8075	Get kafka job id from applicatoin.conf and then get ref_id and configs from DB	2016-08-03 18:55:07 -07:00
Eric Sun	ca7542ca40	remove duplicate section	2016-08-03 17:52:49 -07:00
Eric Sun	1cd5872369	temp fix for hdfs_schema_crawler getRuntime().exec() hangs problem; exclude log4j	2016-08-03 15:50:00 -07:00
Eric Sun	8c9cb99ba4	primary_dataset_type for cfg_database	2016-08-01 13:20:04 -07:00
Eric Sun	9d2c803f0c	Merge pull request #187 from ericsun2/master Add datacenter, deploymenttier, cluster info to better describe dataset instance	2016-07-28 17:22:32 -07:00
Eric Sun	f745642212	add datacenter, deploymenttier, cluster to describe dataset instance	2016-07-28 16:38:03 -07:00
Yi Wang	74ed769bab	add Oracle dataset metadata ETL job	2016-07-28 14:07:07 -07:00
Yi Wang	7edacc9a9f	get kafka config from wh_etl_job_property	2016-07-26 12:16:34 -07:00
Yi Wang	6d4706bc62	Ingest Gobblin tracking events into wherehows using Kafka consumer client	2016-07-25 15:03:29 -07:00
jbai	7a77aba4b7	merge the pull request 165 to master branch	2016-07-21 10:38:36 -07:00
Naga Srinivas Vemuri	97370ed2e1	Query Dataset properties to retrieve datasetUrns	2016-07-21 11:54:47 +05:30
jbai	6af54658d6	merge Fetching dataset watchers via get /dataset/watchers to main branch	2016-06-30 10:20:54 -07:00
Eric Sun	1573fdb212	rename hive dependency to hive_exec; reuse metadata-etl/extralibs; test travis ci;	2016-06-28 18:03:02 -07:00
Rafal Kluszczynski	cc13379075	fix: use logback logging provided by play framework (exclude log4j binding)	2016-06-27 11:01:37 +02:00
jbai	366f2f61d8	fix the backend service build failed issue	2016-06-16 17:33:02 -07:00
jbai	6974ae26ae	fix the gradlew check failed issue and make the cluster name input is mandatory	2016-06-15 11:12:20 -07:00
jbai	9705a07ad8	provide the dataset dependency api	2016-06-14 16:17:24 -07:00
jbai	e7885f28db	comment out the flow tree builder since UI does not use it anymore	2016-06-13 10:15:32 -07:00
Rafal Kluszczynski	5f25e6a6b5	refactor: test assertions more readable	2016-05-30 12:39:15 +02:00
Rafal Kluszczynski	e2765dc3a2	chore: execute tests during backend-service module gradle build	2016-05-30 11:10:25 +02:00
Rafal Kluszczynski	14e5ec497d	test: check properties file and etl job command generation	2016-05-30 11:09:36 +02:00
Rafal Kluszczynski	d06bfbfbd0	fix: use correctly directory path from properties when executing etl job	2016-05-30 08:59:05 +02:00
Rafal Kluszczynski	8d43d3699f	Merge branch 'master' into usingGradleWrapper	2016-05-23 10:08:59 +02:00
SunZhaonan	0b5c421311	Fix Hive column parser parent path bug	2016-05-19 16:36:30 -07:00
Rafal Kluszczynski	856849abd1	updating documentation	2016-05-16 14:58:49 +02:00
SunZhaonan	9d6a1b2649	Add optional config of ETL job white list	2016-05-12 16:28:23 -07:00
SunZhaonan	31de21ddcf	pass parameter through file.	2016-05-03 16:25:56 -07:00
SunZhaonan	c66b00e2f6	Fix dataset insert API bug. Fix load sql bug.	2016-03-28 16:27:43 -07:00
SunZhaonan	a0b7cb9d57	Fix process hanging bug. Add hive field ETL process.	2016-03-16 19:12:21 -07:00
SunZhaonan	aff8f323e4	Scheduler check previous job is finished. Redirect remote outputstream into log. Fix avro parser bugs	2016-03-16 19:09:53 -07:00
SunZhaonan	4b3b344b96	enable travis	2016-03-15 16:20:04 -07:00
SunZhaonan	c4671d2579	Add field comments ETL Fix API bug of tech_matrix_id Add key in comment table	2016-03-14 14:23:33 -07:00
SunZhaonan	5e9ae37952	Change to multi processing instead of multi thread. Fix hive ETL bug	2016-02-29 16:37:03 -08:00
SunZhaonan	8a76f9f931	Move test config file to a external file	2016-02-29 16:37:03 -08:00
SunZhaonan	033a28faee	Backend job property API. 'update' change to 'insert on duplicate update'	2016-02-17 16:42:54 -08:00
SunZhaonan	b5d7c38b7d	Eclipse integration. Resolve circular dependency. wherehows-common-test configure.	2016-02-09 15:50:49 -08:00
SunZhaonan	c3da00003e	Fix bugs. Reenforce logging. Format jython scripts. Add missing table DDL.	2016-02-03 19:22:18 -08:00
SunZhaonan	05d54b3070	Add Hive metadata ETL process	2015-12-21 12:07:08 -08:00
SunZhaonan	cd44daba5d	Merge with master	2015-12-16 15:54:50 -08:00
SunZhaonan	07c46304b5	Fix bug of duplicate field loading. Fix bug of subflow process in azkaban lineage ETL.	2015-12-11 19:46:35 -08:00
Zhen Chen	e21a6b8e75	remove unused variable, change load script and add etl type in backend service	2015-12-11 13:52:23 -08:00
Zhen Chen	ebbf9ec629	add ldap user and group metadata etl	2015-12-10 16:26:57 -08:00
Zhen Chen	5a08134b8d	add dataset owner metadata etl	2015-12-07 15:17:01 -08:00
SunZhaonan	d5c3d87d00	Initial commit	2015-11-19 14:39:21 -08:00

1 2

98 Commits