Data lineage is the history or progression of data, including the origins, transformations and uses of data. It is used to ensure that data is used transparently and responsibly, and to help trace the origin and use of data.

For example, Data lineage can be used to agree on the use of data, to help integrate data from different sources, and to ensure that data is used in a consistent manner across the organization. It is also often used in identifying and resolving problems with data, and to ensure that data is used in a reliable way.

In more and more sectors in the Netherlands, Data Lineage is a very hot & relevant topic. Audits and regulators are requiring more and more organizations to invest in making data lineage transparent in accountability processes. Within the Netherlands, the financial sector is leading the way in making the origin of data transparent in reports & data science initiatives.

An example

Below is a practical & visual example of what data lineage in Microsoft Purview looks like. With this, your data management department can easily see how data gets from the source (left) into the final dashboards (right) and the path & changes your data takes in the process. This form of capture and transparency not only helps increase your data qualitybut is also an important requirement in the increasingly strict audits.

A practical & concrete example of data lineage in Microsoft Purview
A practical & concrete example of data lineage in Microsoft Purview