This repository was archived by the owner on Mar 13, 2020. It is now read-only.

11 Oct 03:34

v0.1.6-beta

6995e52

Add ability to accept initial execution id Pre-release

Pre-release

The init-execution command now accepts an optional --execution-id parameter where users can provide a GUID themselves.

Assets 2

15 May 01:45

ChintanRaval

v0.1.5-beta

3e1a7b2

Start logging stats upon completion of execution and its steps Pre-release

Pre-release

v0.1.5-beta

Merge pull request #27 from PageUpPeopleOrg/feature/log-stats-upon-co…

Assets 2

14 May 06:16

seanbudd

v0.1.4-beta

574c186

Improve integration tests Pre-release

Pre-release

Merges

Run tests using a restricted test user instead of admin (#26)

Assets 2

13 May 03:03

ChintanRaval

v0.1.3-beta

2a6f430

Add ability to track execution steps and statistics Pre-release

Pre-release

Add ability to track execution steps and statistics

introduce a new entity execution_step to track each of the data pipeline execution's steps like LOAD, TRANSFORM, etc.
update type of execution_time_ms to store up to PostgreSQL 'BIGINT' type
update integration tests to cover all changes

Assets 2

15 Apr 02:42

seanbudd

v0.1.2-beta

1755cd2

Rename execution tracking tables Pre-release

Pre-release

Merges [OSC-1302] Rename tables to match rdl (#22)

Assets 2

08 Apr 03:59

seanbudd

v0.1.1-beta

69deebc

Rename to DPO, Integration tests and Alembic Pre-release

Pre-release

Changes

Add Alembic

#20 : add schema revision tool alembic
#21 : delete accidental alembic file

Rename MCD to DPO

formerly "model-change-detector" now "data-pipeline-orchestrator"

Add Integrations Tests

Add coverage for all new commands in integration tests

add coverage for all new commands in integration tests
make integration tests re-runnable
apply DRY to assist multiple execution iterations
log passing tests post assertion
move from plain Bourne shell to Bash shell since we now use a 3rd-party gist to generate UUIDs to allow us to re-run integration tests on dev machines

Assets 2

04 Mar 23:28

ChintanRaval

v0.1.0-beta

6b16f43

Refactor commands to support better state management Pre-release

Pre-release

renamed the below commands
- init command to init-execution
- complete command to complete-execution
  - this now also calculates the overall execution time between init-execution and complete-execution
added the below commands:
- get-last-successful-execution: Finds the last successful data pipeline execution. Returns an execution-id which is a GUID identifier of the new execution, if found; else returns and empty string.
- get-execution-last-updated-timestamp: Returns the last-updated-on ISO 8601 datetime with timezone of the given execution-id. Raises an error if given execution-id is invalid.
split compare command into:
- persist-models: Saves models of the given model-type within the given execution-id by persisting hashed checksums of the given models.
- compare-models: Compares the hashed checksums of models between two executions. Returns comma-separated string of changed model names.
  - this now returns all models when all models have changed OR during first execution instead of the previous *

Assets 2

24 Jan 01:10

ChintanRaval

v0.0.2-alpha

5f1ee98

Add new commands - 'compare' and 'complete' data pipeline execution Pre-release

Pre-release

New commands:

compare: Compares & persists SHA256-hashed checksums of the given models against those of the last successful execution. Returns comma-separated string of changed model names. Parameters required:
- execution-id: a GUID identifier of an existing data pipeline execution as returned by the init command.
- model-type: type of models being processed e.g.: load, transform, etc. this model-type is used to group the model checksums by and used to find and compare older ones.
- base-path: absolute or relative path to the models e.g.: ./load, /home/local/load, C:/path/to/load
- model-patterns: path-based patterns (relative to base-path) to different models with extensions. models within a model-type must be named uniquely regardless of their file extension. e.g.: *.txt, **/*.txt, ./relative/path/to/some_models/**/*.csv, relative/path/to/some/more/related/models/**/*.sql
complete: Marks the completion of an existing execution by updating a record for the same in the given database. Returns nothing unless there's an error. Parameter required:
- execution-id: a GUID identifier of an existing data pipeline execution as returned by the init command.

Assets 2

14 Sep 22:59

ChintanRaval

v0.0.1-alpha

4cf3072

Support to start a new data pipeline execution Pre-release

Pre-release

v0.0.1-alpha

Assets 2

Releases: pageuppeople-opensource/data-pipeline-orchestrator

Add ability to accept initial execution id

Uh oh!

Start logging stats upon completion of execution and its steps

Uh oh!

Improve integration tests

Merges

Uh oh!

Add ability to track execution steps and statistics

Uh oh!

Rename execution tracking tables

Uh oh!

Rename to DPO, Integration tests and Alembic

Changes

Add Alembic

Rename MCD to DPO

Add Integrations Tests

Uh oh!

Refactor commands to support better state management

Uh oh!

Add new commands - 'compare' and 'complete' data pipeline execution

Uh oh!

Support to start a new data pipeline execution

Uh oh!