Maintain integrity of knowledge base with provenance tracking of data points generated from multiple processing steps on text documents. Automate workflows for data processing, managing machine-learning models, and scientific computing tasks.