Discovering Data Estate Insights

In this course, users will explore the suite of tools available in Microsoft Purview for registering and scanning data sources, connecting a business glossary, searching the data catalog, and customizing metadata with enrichments and classifications. In addition, this course will review some of the management and administrative functionality in Purview, including creating roles, managing authorizations, and using the Apache Atlas API for custom implementations. This course will also review deployment best practices and network security considerations. By completing this course, users will have a strong understanding of the suite of functionality currently available in Purview and how these tools support a larger governance initiative within an organization.  

Learning Objectives

  • Provision and install Microsoft Purview
  • Create and manage a role
  • Register and scan data sources
  • Create a business glossary
  • Enrich metadata with classifications
  • Review data lineage tooling
  • Understand deployment best practices
  • Take network security considerations into account

Intended Audience

This course is designed for individuals who are responsible for setting up, monitoring, or exploring data catalog and governance programs within their organization.  

Prerequisites


To get the most from this course, you should have some familiarity and experience with governance tooling as well as a basic understanding of the Azure portal.


Discover Data Estate Insights. Data Estate Insights is one of the key pillars of Microsoft Purview. The feature gives customers a single-pane-of-glass view into their catalog and aims to provide specific insights to data source administrators, business users, data stewards, data officers, and security administrators. Currently, Microsoft Purview has the following insights reports available: Asset insights, Glossary insights, Classification insights, and Sensitivity labeling insights.

Asset insights gives a bird's-eye view of our data estate and its distribution along dimensions such as source type, classification, and file size. This report caters to different types of stakeholders in data governance and cataloging roles who want to know the state of their data map by classification and file extension. The report provides broad insights through graphs and KPIs and also lets us drill into specific anomalies such as misplaced files. The asset report supports an end-to-end experience where we can view the count of assets with a specific classification, break down the information by source type and top folders, and also view the list of assets for further investigation.
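To make the drill-down in the asset report concrete, here is a minimal Python sketch of the same three steps: count assets per classification, break one classification down by a dimension, and list the matching assets. It does not call Purview. The `ASSETS` records, their field names, and the helper functions are hypothetical and exist only for illustration.

```python
from collections import Counter

# Hypothetical asset records. The field names are illustrative only
# and are not the schema Microsoft Purview uses internally.
ASSETS = [
    {"name": "customers.csv", "source_type": "Azure Blob Storage",
     "folder": "/raw/crm", "classifications": ["Email Address", "Credit Card Number"]},
    {"name": "orders.parquet", "source_type": "Azure Data Lake Storage Gen2",
     "folder": "/curated/sales", "classifications": ["Credit Card Number"]},
    {"name": "dbo.Contacts", "source_type": "Azure SQL Database",
     "folder": "/dbo", "classifications": ["Email Address"]},
    {"name": "leads.csv", "source_type": "Azure Blob Storage",
     "folder": "/raw/crm", "classifications": ["Email Address"]},
    {"name": "readme.txt", "source_type": "Azure Blob Storage",
     "folder": "/raw/misc", "classifications": []},
]


def count_by_classification(assets):
    """Step 1: count how many assets carry each classification."""
    counts = Counter()
    for asset in assets:
        # set() guards against a classification listed twice on one asset.
        counts.update(set(asset["classifications"]))
    return counts


def breakdown(assets, classification, dimension):
    """Step 2: group the assets with `classification` by `dimension`,
    for example "source_type" or "folder"."""
    return Counter(
        asset[dimension]
        for asset in assets
        if classification in asset["classifications"]
    )


def list_assets(assets, classification):
    """Step 3: list the asset names for further investigation."""
    return sorted(
        asset["name"]
        for asset in assets
        if classification in asset["classifications"]
    )


if __name__ == "__main__":
    print(count_by_classification(ASSETS))
    print(breakdown(ASSETS, "Email Address", "source_type"))
    print(breakdown(ASSETS, "Email Address", "folder"))
    print(list_assets(ASSETS, "Email Address"))
```

In the real report these aggregations are computed for us from scan results; the sketch only shows the shape of the questions the report answers.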

Glossary insights gives Data Stewards a status report on the glossary. Data Stewards can view this report to understand the distribution of glossary terms by status, and to learn how many glossary terms are attached to assets and how many are not yet attached to any asset. Business users can also learn about the completeness of their glossary terms. This report summarizes the top items that a Data Steward needs to focus on to create a complete and usable glossary for their organization.
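The figures this report surfaces can be pictured as a simple summary over the glossary. The sketch below assumes a hypothetical in-memory list of terms, where `TERMS`, its field names, and `summarize_glossary` are made up for illustration and are not part of Purview. It counts terms by status and separates terms attached to at least one asset from those attached to none.

```python
from collections import Counter

# Hypothetical glossary terms; the field names are illustrative only.
TERMS = [
    {"name": "Customer", "status": "Approved", "attached_assets": 4},
    {"name": "Churn Rate", "status": "Draft", "attached_assets": 0},
    {"name": "Net Revenue", "status": "Approved", "attached_assets": 0},
    {"name": "Legacy SKU", "status": "Expired", "attached_assets": 1},
]


def summarize_glossary(terms):
    """Summarize terms by status and by whether any asset uses them."""
    by_status = Counter(term["status"] for term in terms)
    attached = sum(1 for term in terms if term["attached_assets"] > 0)
    return {
        "by_status": dict(by_status),
        "attached": attached,
        "unattached": len(terms) - attached,
    }


if __name__ == "__main__":
    print(summarize_glossary(TERMS))
```

A steward reading such a summary would likely start with the unattached terms and the terms still in draft, which is the kind of prioritization the report is meant to support.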

Stewards can also navigate directly into the glossary from the glossary insights report to make changes to a specific glossary term.

Classification insights. This report provides details about where classified data is located, the classifications found during a scan, and a drill-down to the classified objects themselves. It enables Stewards, Curators, and Security Administrators to understand the types of information found in the organization's data estate.

Sensitivity labeling insights. This report provides details about the sensitivity labels found during a scan, as well as a drill-down to the labeled files themselves. It enables security administrators to ensure the security of information found in their organization's data estate, as well as to understand required actions around managing access to specific repositories or files.


About the Author

Steve is an experienced Solutions Architect with over 10 years of experience serving customers in the data and data engineering space. He has a proven track record of delivering solutions across a broad range of business areas that increase overall satisfaction and retention. He has worked across many industries, both public and private, and found many ways to drive the use of data and business intelligence tools to achieve business objectives. He is a persuasive communicator and presenter, and is effective at building productive working relationships across all levels of the organization based on collegiality, transparency, and trust.