Dev
Set up dev environment
- On macOS: brew install librdkafka
- On Debian/Ubuntu: sudo apt install librdkafka-dev
- python3 -m venv venv
- source venv/bin/activate
- pip install -e .
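Since pip install -e . installs into whichever interpreter is currently active, it can help to confirm that the virtual environment really is active first. A minimal sketch, relying on the standard-library behaviour that sys.prefix differs from sys.base_prefix inside an environment created by python3 -m venv; the function name in_virtualenv is hypothetical and not part of this project:

```python
import sys


def in_virtualenv(prefix=None, base_prefix=None):
    """Return True when the interpreter runs inside a venv.

    Both arguments default to the live interpreter values; they are
    parameters only so the check can be exercised without a real venv.
    """
    prefix = sys.prefix if prefix is None else prefix
    base_prefix = sys.base_prefix if base_prefix is None else base_prefix
    return prefix != base_prefix


# Report the state of the interpreter that is running this snippet.
if in_virtualenv():
    print(f"virtualenv active: {sys.prefix}")
else:
    print("no virtualenv active; run 'source venv/bin/activate' first")
```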
Run tests
- pip install -r test_requirements.txt
- pytest
Sanity check code before check-in (currently broken)
- flake8 src test && mypy -p gometa && black --check -l 120 src test && isort --check-only src test && pytest
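Because the chained command stops at the first failing tool, it does not show which of the later checks would also fail. A rough sketch that runs each step separately and collects every exit code; the commands are copied from the line above, while run_checks, its runner parameter, and the use of exit code 127 for a missing tool are illustrative assumptions, not part of this project:

```python
import shlex
import subprocess

# Steps copied from the chained sanity-check command.
CHECKS = [
    "flake8 src test",
    "mypy -p gometa",
    "black --check -l 120 src test",
    "isort --check-only src test",
    "pytest",
]


def run_checks(commands, runner=subprocess.run):
    """Run every command in order and return (command, exit code) pairs.

    Unlike an '&&' chain, a failing step does not stop the later ones.
    A tool that is not installed is recorded as exit code 127.
    """
    results = []
    for command in commands:
        try:
            code = runner(shlex.split(command)).returncode
        except FileNotFoundError:
            code = 127  # assumption: mimic the shell's "command not found"
        results.append((command, code))
    return results


def format_report(results):
    """Turn (command, exit code) pairs into one status line per command."""
    return [
        f"{command}: {'ok' if code == 0 else f'FAILED (exit {code})'}"
        for command, code in results
    ]
```

Calling format_report(run_checks(CHECKS)) from the project root, with the virtual environment active, runs the real tools and yields one status line per step.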
Run recipe
- ./recipes/kafka_to_console.sh
Using Docker
Build the image
- source docker/docker_build.sh
Run the ingestion script (recipes/file_to_file.yml), mounting a local directory for input recipes and an output directory for output data
- source docker/docker_run.sh recipes/file_to_file.yml
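The contents of docker/docker_run.sh are not shown here, so the following is only a sketch of how such a run could mount an input recipe directory and an output directory using Docker's standard -v host:container option. The image name, the container paths /recipes and /output, the idea that the image accepts the recipe path as its argument, and the function build_docker_run_args are all assumptions made for illustration:

```python
from pathlib import Path


def build_docker_run_args(image, recipe_path, output_dir):
    """Build a 'docker run' argument list with two bind mounts.

    The recipe's parent directory is mounted read-only at /recipes and
    the output directory is mounted read-write at /output (both
    container paths are hypothetical).
    """
    recipe = Path(recipe_path).resolve()
    output = Path(output_dir).resolve()
    return [
        "docker", "run", "--rm",
        "-v", f"{recipe.parent}:/recipes:ro",  # input recipes, read-only
        "-v", f"{output}:/output",             # where output data lands
        image,
        f"/recipes/{recipe.name}",             # assumed entrypoint argument
    ]


# Example with a placeholder image name; print the command, do not run it.
args = build_docker_run_args(
    "example-ingest-image", "recipes/file_to_file.yml", "output"
)
print(" ".join(args))
```

Bind mounts need absolute host paths, which is why both paths are resolved before being placed in the -v options.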