I am creating a couple of open source tools for data governance. The first one is a data catalog (1) with tags for PII data. The second one is a data lineage application (2). The goal is to keep these as simple as possible to install and use.
IMO the current options are too complicated or expensive and appropriate for the largest companies. I cannot hack a simple application for data discovery or usage statistics. So I am building a dead simple data catalog that I can reuse. The data lineage app is the first app on it.
IMO the current options are too complicated or expensive and appropriate for the largest companies. I cannot hack a simple application for data discovery or usage statistics. So I am building a dead simple data catalog that I can reuse. The data lineage app is the first app on it.
(1) https://github.com/tokern/piicatcher (2) https://github.com/tokern/lineage