A base class and utility functions for creating pure-function steps in DAGs that process large amounts of data.

This library should rarely be used on its own. It was developed together with cookiecutter-stepworkflow, and that project has more context-rich documentation.
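
To make the idea of a "pure-function step" concrete, here is a minimal, self-contained sketch of the pattern. It deliberately does not import datastep or mirror its API: the names `SketchStep`, `transform`, `run`, `Normalize`, `Threshold`, and `run_pipeline`, along with the `manifest.json` file name, are all assumptions made up for illustration. Each step computes its output from its input without mutating it, and records the result in its own directory so a downstream step could pick it up.

```python
import json
import tempfile
from abc import ABC, abstractmethod
from pathlib import Path


class SketchStep(ABC):
    """Hypothetical base class for illustration; NOT datastep's actual API."""

    def __init__(self, output_dir):
        self.output_dir = Path(output_dir)

    @abstractmethod
    def transform(self, rows):
        """Return new rows computed from `rows` without mutating the input."""

    def run(self, rows):
        result = self.transform(rows)
        # Persist the result so the step's output can be inspected or reused.
        self.output_dir.mkdir(parents=True, exist_ok=True)
        manifest = self.output_dir / "manifest.json"
        manifest.write_text(json.dumps(result, indent=2))
        return result


class Normalize(SketchStep):
    def transform(self, rows):
        # Scale every value by the largest value in the input.
        peak = max(r["value"] for r in rows)
        return [{**r, "value": r["value"] / peak} for r in rows]


class Threshold(SketchStep):
    def transform(self, rows):
        # Keep only rows whose value is at least 0.5.
        return [r for r in rows if r["value"] >= 0.5]


def run_pipeline(rows, workdir):
    """Chain the two steps: the output of one is the input of the next."""
    workdir = Path(workdir)
    normalized = Normalize(workdir / "normalize").run(rows)
    return Threshold(workdir / "threshold").run(normalized)


if __name__ == "__main__":
    data = [{"name": "a", "value": 2.0}, {"name": "b", "value": 8.0}]
    with tempfile.TemporaryDirectory() as tmp:
        print(run_pipeline(data, tmp))
```

Because each `transform` builds new rows instead of editing its input, a step can be rerun or swapped without affecting the others. For the real base class and helpers, refer to the package documentation linked below.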


Stable Release:

`pip install datastep`

Development Head:

`pip install git+https://github.com/AllenCellModeling/datastep.git`


For full package documentation please visit AllenCellModeling.github.io/datastep.


See CONTRIBUTING.md for information related to developing the code.

**Free software: Allen Institute Software License**

