
Various Types of Regularization


Machine Learning and Deep Learning models use regularization techniques to prevent overfitting. Before delving into the regularization methods, it’s crucial to understand the concept of overfitting.

To illustrate, imagine you are preparing for a final exam and your professor has provided you with 100 sample questions. If your preparation is limited to memorizing these 100 questions, and you cannot answer questions that differ even slightly from them, your understanding of the material is overfitted. In machine learning, overfitting occurs when an algorithm accurately predicts the data it has seen in the training set but fails to predict and classify new data that deviates from it. Graphically, this looks like a curve that fits the training points as closely as possible but does not capture the underlying trend, making it unsuitable for real-world scenarios.

To avoid overfitting, regularization techniques such as L1, L2, and Dropout constrain the model or inject noise into it during training. These methods are widely used in Deep Learning models such as Artificial Neural Networks (ANNs).

Figure 1. Overview of underfitting, overfitting, and the ideal balance.

L1 and L2 Regularization

Although L1 and L2 differ mathematically, both address the overfitting problem.

L1 regularization adds an L1 penalty equal to the absolute value of the magnitude of the coefficients, which restricts their size. Lasso regression is a well-known method that implements this penalty. Because of the L1 term, many weights are driven exactly to zero, producing sparse models. In L1 regularization, the regression coefficients are found by minimizing the following loss function:

$$\min_{\beta}\;\sum_{i=1}^{n}\Big(y_i - \sum_{j=1}^{p} x_{ij}\,\beta_j\Big)^{2} + \lambda \sum_{j=1}^{p} \lvert \beta_j \rvert$$
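To make the penalty concrete, here is a minimal sketch using scikit-learn's Lasso estimator on synthetic data; the dataset, the alpha value (playing the role of λ above), and the variable names are illustrative assumptions rather than anything from this article.

```python
import numpy as np
from sklearn.linear_model import Lasso

# Illustrative synthetic data: only the first 2 of 10 features matter
rng = np.random.default_rng(0)
X = rng.normal(size=(100, 10))                  # 100 samples, 10 features
true_coef = np.array([3.0, -2.0] + [0.0] * 8)   # 2 informative coefficients
y = X @ true_coef + rng.normal(scale=0.1, size=100)

# alpha plays the role of lambda in the loss function above
lasso = Lasso(alpha=0.1).fit(X, y)

# The L1 penalty drives the coefficients of uninformative features to exactly 0
print(lasso.coef_)
```

Because eight of the true coefficients are zero, Lasso typically recovers a sparse coefficient vector, illustrating how the L1 penalty pushes weights exactly to zero.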

L2 regularization, on the other hand, adds an L2 penalty proportional to the square of the magnitude of the coefficients. This penalty is used in algorithms such as Ridge regression and Support Vector Machines (SVMs). Unlike L1, the L2 penalty shrinks the weights toward zero but rarely makes them exactly zero. L2 regularization consists of minimizing the following loss function:

$$\min_{\beta}\;\sum_{i=1}^{n}\Big(y_i - \sum_{j=1}^{p} x_{ij}\,\beta_j\Big)^{2} + \lambda \sum_{j=1}^{p} \beta_j^{2}$$
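For comparison, here is a minimal Ridge sketch on the same kind of synthetic data; again, the alpha value and the data are illustrative assumptions.

```python
import numpy as np
from sklearn.linear_model import Ridge

# Same illustrative setup as the Lasso sketch above
rng = np.random.default_rng(0)
X = rng.normal(size=(100, 10))
true_coef = np.array([3.0, -2.0] + [0.0] * 8)
y = X @ true_coef + rng.normal(scale=0.1, size=100)

# alpha again plays the role of lambda in the loss function above
ridge = Ridge(alpha=1.0).fit(X, y)

# The L2 penalty shrinks every coefficient toward zero,
# but (unlike L1) typically leaves none exactly at zero
print(ridge.coef_)
```

Comparing the two printed coefficient vectors shows the practical difference: Lasso zeroes out weak features, while Ridge merely makes them small.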

Dropout

The Dropout regularization method works by randomly setting a fraction of a layer's units to zero during training. By temporarily removing nodes from each layer, Dropout prevents the network from relying too heavily on any single unit and thus reduces overfitting, as illustrated in the figure below.

Figure 2. Dropout layers. Source
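As a brief sketch of how Dropout is used in practice, the PyTorch snippet below places an nn.Dropout layer inside a small network; the layer sizes and the dropout rate p=0.5 are illustrative choices, not values from this article.

```python
import torch
import torch.nn as nn

# A small fully connected network with a Dropout layer between the hidden
# and output layers; each hidden unit is zeroed with probability p=0.5
model = nn.Sequential(
    nn.Linear(20, 64),
    nn.ReLU(),
    nn.Dropout(p=0.5),
    nn.Linear(64, 1),
)

x = torch.randn(8, 20)       # a batch of 8 random inputs

model.train()                # training mode: Dropout randomly zeroes units
train_out = model(x)

model.eval()                 # evaluation mode: Dropout is disabled
eval_out = model(x)
```

Note that Dropout is only active in training mode; frameworks disable it at inference time (here via model.eval()), so predictions are deterministic.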

 
