image
Classifications and Customizations
Start course
Difficulty
Intermediate
Duration
45m
Students
610
Ratings
4.8/5
Description

In this course, users will explore the suite of tools available in Microsoft Purview for registering and scanning data sources, connecting a business glossary, searching the data catalog, and customizing metadata with enrichments and classifications. In addition, this course will review some of the management and administrative functionality in Purview, including creating roles, managing authorizations, and using the Apache Atlas API for custom implementations. This course will also review deployment best practices and network security considerations. By completing this course, users will have a strong understanding of the suite of functionality currently available in Purview and how these tools support a larger governance initiative within an organization.  

Learning Objectives

  • Provision and install Microsoft Purview
  • Create and manage a role
  • Register and scan data sources
  • Create a business glossary
  • Enrich metadata with classifications
  • Review data lineage tooling
  • Understand deployment best practices
  • Take network security considerations into account

Intended Audience

This course is designed for individuals who are responsible for setting up, monitoring, or exploring data catalog and governance programs within their organization.  

Prerequisites 

To get the most from this course, you should have some familiarity and experience with governance tooling as well as a basic understanding of the Azure portal.

Transcript

Classifications and customizations. Data classification in the context of Microsoft Purview is a way of categorizing data assets by assigning unique logical tags or classes to the data assets. Classification is based on the business context of the data. For example, we might classify assets as passport number, driver's license number, credit card number, or an email address. When we classify data assets, we make them easier to understand, search, and govern. Classifying data assets also helps us understand the risks associated with them. This, in turn, can help us implement measures to protect sensitive or important data from ungoverned proliferation and unauthorized access across the data state. 

Microsoft Purview provides an automated classification capability while we scan our data sources. We get more than 200 built-in system classifications and the ability to create custom classifications for our data. We can classify assets automatically when they're configured as part of a scan or we can edit them manually in the Microsoft Purview governance portal after they've been scanned and ingested. Classification can be particularly important for data governance. 

Among other reasons, classifying data assets is important because it helps: narrow down the search for data assets that we're interested in, organize and understand the variety of data classes that are important in our organization and where they're stored, understand the risks associated with our most important data assets and then take appropriate measures to mitigate them. There are two main types of classifications available in Purview. System classifications are the 200+ out of the box classifications already built into Purview — and include items like person names, email addresses, phone numbers, and passport numbers. Custom classifications are classifications that we create based on a pattern or a specific column name that's unavailable as a system classification. Custom classification rules can be based on a regular expression pattern or dictionary. We can also apply both types of classifications to file assets, table assets, and column assets.

 

About the Author
Students
1940
Courses
3

Steve is an experienced Solutions Architect with over 10 years of experience serving customers in the data and data engineering space. He has a proven track record of delivering solutions across a broad range of business areas that increase overall satisfaction and retention. He has worked across many industries, both public and private, and found many ways to drive the use of data and business intelligence tools to achieve business objectives. He is a persuasive communicator, presenter, and quite effective at building productive working relationships across all levels in the organization based on collegiality, transparency, and trust.