It's the last day for these savings

Imbalanced Learning (Unbalanced Data) - The Complete Guide

Learn how to handle imbalanced data in Machine Learning. Data based approaches, algorithmic approaches and more!

865 Students

4h48min

Intermediate

4.7

thumbnail
  • Overview
  • Curriculum
  • Discussion
  • Review
  • Instructor

What you will learn

  • Understand the underline causes of the Class Imbalance problem

  • Why it is a major challenge in machine learning and data mining fields

  • Learn the different characteristics of imbalanced datasets

  • Learn the state-of-the-art techniques and algorithms

  • Understand variety of data based methods such as SMOTE, ADASYN, B-SMOTE and many more!

  • Apply Data-Based Techniques in practice

  • Understand different algorithmic approaches such as: One Class Learning, Cost Sensitive Learning and more!

  • Apply Algorithmic-Based methods in practice

  • Learn how to correctly evaluate a prediction model built using imbalanced data

  • Learn strategies and recommendations to help you avoid pitfalls when working with imbalanced dataset

What are the requirements for taking your course

  • Prior knowledge in machine learning/data science is necessary or at least currently enrolled in a machine learning course.

Who is this course for

  • This course is for students and professionals who are working in the machine learning / data science area and want to increase their knowledge and skills. It is also for students who are currently taking a course in these areas. It is not for students with no background knowledge in Machine Learning.

Description

This is a niche topic for students interested in data science and machine learning fields. The classical data imbalance problem is recognized as one of the major problems in the field of data mining and machine learning. Imbalanced learning focuses on how an intelligent system can learn when it is provided with unbalanced data.

There is an unprecedented amount of data available. This has caused knowledge discovery to garner attention in recent years. However, many real-world datasets are imbalanced. Learning from unbalanced data poses major challenges and is recognized as needing significant attention.

The problem with unbalanced data is the performance of learning algorithms in the presence of underrepresented data and severely skewed class distributions. Models trained on imbalanced datasets strongly favor the majority class and largely ignore the minority class. Several approaches introduced to date present both data-based and algorithmic solutions.

The specific goals of this course are:

  • Help the students understand the underline causes of unbalanced data problem.

  • Go over the major state-of-the-art methods and techniques that you can use to deal with imbalanced learning.

  • Explain the advantages and drawback of different approaches and methods .

  • Discuss the major assessment metrics for imbalanced learning to help you correctly evaluate the effectiveness of your solution.

Introduction

Introduction

Problem Definition

How Common is this problem?

Prerequisites & Course Outcomes

The Four Different Characteristics

How Hard is my Unbalanced Dataset?

Datasets - Quick Guide

Languages & Source Code

Installing Anaconda for Mac

Installing Anaconda for Windows

Data-based Approaches - Under-Sampling

Data-based Approaches Introduction

Undersampling Methods Introduction

Undersampling: Random Undersampling

Example - Random Undersampling

Tomek Link

Practical Example - Tomek Link

UnderSampling: One Sided Selection

Practical Example - OSS

img

No Discussion Found

4.7

78 Reviews

5
44
4
19
3
10
2
2
1
3
Bassam Almogahed

Instructor

$19.73

This Course Includes

67 Lessons
0 Quiz
0 Assignment
22 Downloadable Resources
English
Full Lifetime Access
Certificate of completion
Go To Class

Related Skills

More Courses From Jacob Collier