Unsupervised Learning

The course is part of this learning path

Introduction to Machine Learning on AWS
Difficulty: Beginner
Duration: 48m
Students: 520

Description

Overview

In this course, you'll learn about Machine Learning and where it fits within the wider Artificial Intelligence (AI) field. The course begins with a formal definition of Machine Learning and continues with explanations of the various machine learning and training techniques. We review both Supervised and Unsupervised learning, showcasing the main differences between the two learning methods. We also review both Classification and Regression models, showcasing the main differences between the two types of training model.

We provide a basic review of several of the most popular and commonly used machine learning algorithms including:

  • Linear Regression
  • Logistic Regression
  • K Nearest Neighbour (KNN)
  • K-Means
  • Decision Tree
  • Random Forest
  • Support Vector Machines (SVM)
  • Naïve Bayes

Finally, we’ll provide a basic level introduction to Deep Learning and Deep Neural Networks, as a more specialised form of Machine Learning.

Intended Audience

The intended audience for this course includes:

  • Beginners starting out in the field of Machine Learning
  • Anyone interested in understanding how Machine Learning works

Learning Objectives

By completing this course, you will:

  • Understand what Machine Learning is and what it offers
  • Understand the benefits of using Machine Learning
  • Understand business use cases and scenarios that can benefit from using Machine Learning
  • Understand the different Machine Learning training techniques
  • Understand the difference between Supervised and Unsupervised training
  • Understand the difference between Classification and Regression
  • Become familiar with several of the commonly used and popular Machine Learning algorithms discussed
  • Understand the basic principles behind Deep Learning and Deep Neural Networks

Pre-requisites

The following prerequisites will be helpful for this course:

  • A background in statistics or probability
  • Familiarity and understanding of computer algorithms
  • Basic understanding of data analytics

Course Agenda

The agenda for the remainder of this course is as follows:

  • We’ll discuss what Machine Learning is and when and why you might consider using it
  • We’ll discuss benefits and business use cases that have been empowered by leveraging Machine Learning
  • We’ll break down machine learning into supervised and unsupervised training models
  • We’ll discuss the differences between classification and regression techniques
  • We’ll examine a set of commonly used and popular machine learning algorithms
  • Finally, we’ll take an introductory look at deep learning and the concept of deep neural networks

Feedback

If you have thoughts or suggestions for this course, please contact Cloud Academy at support@cloudacademy.com.

Transcript

- [Instructor] Welcome back. In this lecture, we'll start diving into unsupervised learning, and how you use it to train machine learning models. Let's start with the following statement. I have a data set containing no answers. I need to discover a pattern or relationship embedded within the data set that might help me find answers. How might I use machine learning to predict the answers I seek? In contrast to supervised training, unsupervised training takes a different approach in that the associated algorithms are designed to work with unlabeled data sets. Unsupervised learning algorithms don't rely on pre-labeled training data to learn.

The machine learning algorithm in this situation will hunt for patterns and structure buried within the data set. Having discovered any patterns or structure, the model can then be used to make predictions and/or decisions when new data is introduced into the problem. Pattern recognition within the data set can involve the concepts of clustering, anomaly detection, and association discovery. A typical unsupervised machine learning example is that of identifying close-knit groups of friends within a social network. As mentioned previously, unsupervised training is tasked with finding patterns, relationships, and/or correlations between data points within the data set being processed.

These patterns, relationships, and/or correlations come in different types. For example, as can be seen on this slide, if we consider our data set to be a table of instances, each with the same set of features, then we can begin to see the different types of these so-called patterns. Clustering is concerned with finding similar instances. Anomaly detection is concerned with finding unusual instances, and association discovery is concerned with finding feature rules that may exist between and across instances. Let's take a closer look at each of these patterns. Reiterating as per the earlier slide, clustering is concerned with finding similar instances.

In the example given here, we can clearly see a clustering pattern, in which Mary undertakes travel to or from New York on the first of each month at a cost of $30. Anomaly detection, on the other hand, is concerned with finding unusual instances. In the example given here, we can clearly see an anomaly in which Mary appears to have spent $2000 on entertainment in New York on a day that she didn't bill any travel costs. Finally, association discovery is concerned with finding feature rules that may exist between and across instances within the data set.
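
The anomaly described above can be illustrated with a minimal sketch. The expense amounts below are hypothetical, since the slide's table is not reproduced in this transcript, and the modified z-score technique (median and median absolute deviation, with the commonly quoted 0.6745 constant and 3.5 cutoff) is an assumption chosen for illustration, not a method named in the lecture.

```python
from statistics import median

# Hypothetical daily expense amounts in dollars (not the slide's real data).
# Most entries sit near the usual $30 travel cost; one is far larger.
amounts = [30.0, 30.0, 45.0, 30.0, 2000.0, 38.0, 30.0, 52.0]


def flag_anomalies(values, threshold=3.5):
    """Return the indices of values whose modified z-score exceeds threshold.

    The modified z-score uses the median and the median absolute deviation
    (MAD), so a single extreme value does not distort the baseline.
    """
    med = median(values)
    mad = median(abs(v - med) for v in values)
    if mad == 0:
        # At least half the values equal the median; flag anything that differs.
        return [i for i, v in enumerate(values) if v != med]
    return [
        i for i, v in enumerate(values)
        if 0.6745 * abs(v - med) / mad > threshold
    ]


anomalous = flag_anomalies(amounts)
print([amounts[i] for i in anomalous])
```

No labels are supplied here: the function only looks at how far each instance sits from the bulk of the data.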

In the example given here, we can clearly see a number of potential feature associations. As can be seen on this slide, many of the unsupervised learning algorithms fall into the clustering category. For example, we can use the K-Means, Hierarchical, or Gaussian Mixture algorithms for clustering analysis. The training phase for unsupervised training is similar to that used within supervised training, the key differentiator being the absence of labels: unsupervised training algorithms do not take a label or answer as part of their inputs.

Instead, the unsupervised training algorithm number crunches the feature vectors for all data instances, looking for cluster patterns, anomalies, and/or association rules. As such, the generated model itself returns answers that suggest cluster membership, or whether the data point should be considered an anomaly. Unsupervised machine learning can be used to answer many business problems. The important thing to consider when attempting to answer these questions is where the data will come from and which data sources are available. Some examples are given here.
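
As a rough sketch of how a clustering algorithm works on unlabeled feature vectors and then reports cluster membership for new data, here is a small K-Means written with only the standard library. The data points, the choice of starting centroids, and the function names `k_means` and `predict` are all hypothetical and chosen for illustration; a real project would more likely use an existing library implementation.

```python
from math import dist

# Hypothetical two-feature instances; note that there is no label column.
points = [(1.0, 1.0), (1.5, 2.0), (1.0, 1.5),
          (8.0, 8.0), (9.0, 8.5), (8.5, 9.0)]


def nearest(point, centroids):
    """Index of the centroid closest to point (Euclidean distance)."""
    return min(range(len(centroids)), key=lambda k: dist(point, centroids[k]))


def k_means(data, initial_centroids, iterations=10):
    """Alternate the assignment and update steps a fixed number of times."""
    centroids = [tuple(c) for c in initial_centroids]
    for _ in range(iterations):
        # Assignment step: each instance joins its nearest centroid.
        assignments = [nearest(p, centroids) for p in data]
        # Update step: each centroid moves to the mean of its members.
        for k in range(len(centroids)):
            members = [p for p, a in zip(data, assignments) if a == k]
            if members:  # an empty cluster keeps its previous centroid
                centroids[k] = tuple(
                    sum(axis) / len(members) for axis in zip(*members)
                )
    # Report membership relative to the final centroids.
    return centroids, [nearest(p, centroids) for p in data]


def predict(point, centroids):
    """Cluster membership for a new, previously unseen instance."""
    return nearest(point, centroids)


# Starting centroids are picked by hand here to keep the run deterministic.
centroids, assignments = k_means(points, [points[0], points[3]])
print(assignments)
print(predict((2.0, 2.0), centroids))
```

The trained "model" in this sketch is just the list of centroids; answering a question about a new instance means finding which centroid it is closest to.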

Do these customers have similar likes? Here, we would train our model with the customer profiles data set. Is this transaction fraudulent? Here, we would train our model with the past transactions data set. And finally, were these products purchased together? Here, we would train our model with the previous purchases data set. Okay, let's quickly summarize unsupervised training, its important characteristics, how it is used, and some example algorithms. Most importantly, the training data set has only examples and no specific label or outcome.
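
The "were these products purchased together?" question can be sketched as a simple association discovery over past purchases. The baskets below are hypothetical, and counting item pairs with support and confidence is one common way to express such rules, assumed here for illustration; the lecture does not name a specific algorithm.

```python
from collections import Counter
from itertools import combinations

# Hypothetical previous purchases, one set of products per transaction.
baskets = [
    {"bread", "butter", "milk"},
    {"bread", "butter"},
    {"bread", "jam"},
    {"milk", "cereal"},
    {"bread", "butter", "jam"},
]


def pair_rules(transactions, min_support=0.4):
    """Return {(antecedent, consequent): (support, confidence)} for item pairs.

    support    = share of transactions containing both items
    confidence = share of transactions with the antecedent that also
                 contain the consequent
    """
    n = len(transactions)
    item_counts = Counter(item for t in transactions for item in t)
    pair_counts = Counter(
        pair for t in transactions for pair in combinations(sorted(t), 2)
    )
    rules = {}
    for (a, b), count in pair_counts.items():
        support = count / n
        if support < min_support:
            continue  # pair is too rare to report
        rules[(a, b)] = (support, count / item_counts[a])
        rules[(b, a)] = (support, count / item_counts[b])
    return rules


rules = pair_rules(baskets)
for (antecedent, consequent), (support, confidence) in sorted(rules.items()):
    print(f"{antecedent} -> {consequent}: "
          f"support={support:.2f}, confidence={confidence:.2f}")
```

In this toy data every basket containing butter also contains bread, so the rule "butter -> bread" gets a confidence of 1.0, while "bread -> butter" is weaker because bread is also bought without butter.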

These types of data sets are actually fairly common, as the process of labeling is typically expensive. The goal of unsupervised training is to search for and find patterns, anomalies, and rule associations. Unsupervised training tends to be more complex because the data has no outcome against which the results can be evaluated. Each discovery method has its own quality measures. Unsupervised training algorithms can be used to perform the following tasks: clustering, for which we can use, for example, the K-Means algorithm, which we'll discuss in more detail in the next lecture; anomaly detection; and/or association discovery.

That concludes our lecture on unsupervised training. In the next lecture, we'll start focusing on the actual machine learning algorithms themselves that are commonly used. Go ahead and close this lecture, and we'll see you shortly in the next one.

About the Author

Students: 5701
Labs: 6
Courses: 41
Learning paths: 8

Jeremy is a Cloud Researcher and Trainer at Cloud Academy where he specializes in developing technical training documentation for security, AI, and machine learning for both AWS and GCP cloud platforms.

He has a strong background in development and coding, and has been hacking with various languages, frameworks, and systems for the past 20+ years.

In recent times, Jeremy has been focused on Cloud, Security, AI, Machine Learning, DevOps, Infrastructure as Code, and CI/CD.

Jeremy holds professional certifications for both AWS and GCP platforms.