Machine Learning Problems
The course is part of this learning path
Machine learning is a branch of artificial intelligence that deals with learning patterns and rules from training data. In this course from Cloud Academy, you will learn all about its structure and history. Its origins date back to the middle of the last century, but in the last decade, companies have taken advantage of the resource for their products. This revolution of machine learning has been enabled by three factors.
First, memory storage has become economic and accessible. Second, computing power has also become readily available. Third, sensors, phones, and web application have produced a lot of data which has contributed to training these machine learning models. This course will guide you to the basic principles, foundations and best practices of machine learning. It is advisable to be able to understand and explain these basics before diving into deep learning and neural nets. This course is made up of 10 lectures and two accompanying exercises with solutions. This Cloud Academy course is part of the wider Data and Machine Learning learning path.
- Learn about the foundations and history of machine learning
- Learn and understand the principles of memory storage, computing power, and phone/web applications.
It is recommended to complete the Introduction to Data and Machine Learning course before taking this course.
The dataset used in exercise 2 of this course can be found at the following link: https://www.kaggle.com/liujiaqi/hr-comma-sepcsv/version/1
Hello and welcome to this video on machine learning problems. In this video, we will talk about the different types of machine learning, including supervised, unsupervised and reinforcement learning. On this course, we'll primarily focus on supervised learning, which is important to understand what each of these is. Let's start with supervised learning. In supervised learning, an algorithm learns from labeled data. For example, let's say we're training an image recognition algorithm to distinguish cats from dogs. Each training data point will be a pair of an image, that's the training data and the label, specifying whether it's a cat or a dog. In a similar way, if we are training a translation engine, we would provide both input and output sentences and ask the algorithm to learn the function that connects them. Conversely, in unsupervised learning, data comes without labels and the task is to find similar data points in the data set, in order to identify any underlying higher order structure.
For example, think of a data set that contains purchased references from e-commerce users. These users would like a form cluster, that behaving in a similar way. They may have similar purchase behavior. They may spend the same amount of money or the similar amount of money or buy similar type of objects. You can think of these different groups as different tribes with different preferences. Once these tribes are identified, we can describe each data point, which is each user, in terms of the tribe, it belongs to. Getting a deeper understanding of the data and also reducing the number of data points, that we have to deal with. Finally, reinforcement learning is similar to supervised learning, but in this case, the algorithm is training an agent to act in an environment. The agent can take some action and know the state of the environment and these actions lead to an outcome. The outcome is attached to a score and the algorithm tries to maximize the score.
Typical examples are the algorithms, that learn to play games, like chess or Go. And the main difference with supervised learning is that the algorithm does not really see a label for each action, but may receive the score after a sequence of actions. In 2016, a software train with this technique beat the World Go Champion and marked the new milestone in the race to World's Artificial Intelligence. So, in conclusion, we have seen there's many types of machine learning problems and deep learning has been successfully applied to both supervised, unsupervised and reinforcement learning problems. We will focus on supervised learning problems in the rest of the course. Thank you for watching and see you in the next video.
About the Author
I am a Data Science consultant and trainer. With Catalit I help companies acquire skills and knowledge in data science and harness machine learning and deep learning to reach their goals. With Data Weekends I train people in machine learning, deep learning and big data analytics. I served as lead instructor in Data Science at General Assembly and The Data Incubator and I was Chief Data Officer and co-founder at Spire, a Y-Combinator-backed startup that invented the first consumer wearable device capable of continuously tracking respiration and activity. I earned a joint PhD in biophysics at University of Padua and Université de Paris VI and graduated from Singularity University summer program of 2011.