Datasets and data lineage
The model that dependency analysis creates must be accurate in order to be useful. When data assets are accurately represented and correctly associated with the applications that read or write them, they are said to line up: they have accurate lineage.
The process of dependency analysis produces two types of lineage:
Data lineage occurs among data assets. It refers to the correct association of data assets with the applications that create or access them.
Field lineage occurs among data fields. It refers to the ability to trace a data field from its source to its target, correctly and completely.
This chapter primarily talks about creating and maintaining accurate data lineage. If the data lineage for a project is correct and dependency analysis completes without any warnings, then accurate field lineage tends to occur as a natural consequence.
This section describes how lineage is maintained in the technical repository. It covers the following topics: