datahub/metadata-ingestion
2021-02-15 18:29:27 -08:00

Dev

Set up dev environment

  • On macOS: brew install librdkafka
  • On Debian/Ubuntu: sudo apt install librdkafka-dev
  • python3 -m venv venv
  • source venv/bin/activate
  • pip install -e .
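
The setup steps above can be sketched as a small shell helper that picks the librdkafka install command for the current platform and prints the full sequence instead of running it. This is a minimal sketch, not part of the repository: the function names `librdkafka_install_cmd` and `print_setup_steps` are hypothetical, and the Linux branch assumes a Debian/Ubuntu system with apt.

```shell
#!/bin/sh
# Hypothetical helper (not part of the repo): map a `uname -s` value to the
# librdkafka install command listed in the steps above.
librdkafka_install_cmd() {
    case "$1" in
        Darwin) echo "brew install librdkafka" ;;
        Linux)  echo "sudo apt install librdkafka-dev" ;;  # assumes Debian/Ubuntu
        *)      echo "unsupported platform: $1" >&2; return 1 ;;
    esac
}

# Print (rather than execute) the setup sequence for this machine, so the
# commands can be reviewed before they are run.
print_setup_steps() {
    librdkafka_install_cmd "$(uname -s)" || return 1
    echo "python3 -m venv venv"
    echo "source venv/bin/activate"
    echo "pip install -e ."
}
```

Calling `print_setup_steps` from the metadata-ingestion directory lists the commands to run in order. Printing rather than executing keeps the helper free of side effects, which matters because the first step may require sudo.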

Run tests

  • pip install -r test_requirements.txt
  • pytest

Sanity-check code before check-in (currently broken)

  • flake8 src test && mypy -p gometa && black --check -l 120 src test && isort --check-only src test && pytest
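
Because the one-liner above stops at the first failing tool, it can be hard to tell which checks are actually broken. A minimal sketch of an alternative, assuming the same tools and arguments as the command above, runs every check and reports each result separately. The function names `run_check` and `sanity_check` are hypothetical and do not exist in the repository.

```shell
#!/bin/sh
# Run one check, print PASS/FAIL with its label, and return non-zero on failure.
# Usage: run_check <label> <command> [args...]
run_check() {
    name=$1
    shift
    if "$@"; then
        echo "PASS $name"
    else
        echo "FAIL $name"
        return 1
    fi
}

# Run every check from the one-liner above without stopping at the first
# failure. Returns 1 if any check failed, 0 otherwise.
sanity_check() {
    failed=0
    run_check flake8 flake8 src test || failed=1
    run_check mypy mypy -p gometa || failed=1
    run_check black black --check -l 120 src test || failed=1
    run_check isort isort --check-only src test || failed=1
    run_check pytest pytest || failed=1
    return "$failed"
}
```

Running `sanity_check` inside the activated virtual environment prints one PASS or FAIL line per tool after that tool's own output, which makes it easier to see which part of the chain is currently failing.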

Run recipe

  • ./recipes/kafka_to_console.sh

Using Docker

Build the image

  • source docker/docker_build.sh

Run the ingestion script (recipes/file_to_file.yml)

This mounts a local directory for input recipes and an output directory for output data:

  • source docker/docker_run.sh recipes/file_to_file.yml
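
To illustrate what mounting the recipe and output directories might look like, here is a hedged sketch that only builds and prints a `docker run` command. It is not the contents of docker/docker_run.sh, which are not shown here. The image tag, the container paths `/recipes` and `/output`, the host output directory, and the idea that the container takes the recipe path as its argument are all assumptions made for illustration.

```shell
#!/bin/sh
# Hypothetical sketch: NOT the contents of docker/docker_run.sh.
# Image name, container paths, and output directory are assumptions.
IMAGE="${IMAGE:-metadata-ingestion:dev}"   # assumed image tag
OUTPUT_DIR="${OUTPUT_DIR:-$PWD/output}"    # assumed host output directory

# Print a docker run command that mounts the recipe's directory read-only
# and an output directory read-write, then passes the recipe to the container.
docker_run_cmd() {
    recipe=$1
    recipe_dir=$(dirname "$recipe")
    recipe_file=$(basename "$recipe")
    echo docker run --rm \
        -v "$PWD/$recipe_dir:/recipes:ro" \
        -v "$OUTPUT_DIR:/output" \
        "$IMAGE" "/recipes/$recipe_file"
}
```

For example, `docker_run_cmd recipes/file_to_file.yml` prints the command for inspection. Mounting the recipes read-only is a design choice in this sketch, so that a run cannot modify its own input.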