Member-only story

Key Concepts of Machine Learning-2| Day (3/45) | A2Z ML | Mohd Saqib

Mohd Saqib
8 min readJan 17, 2023

Read my previous blog if you have not covered yet — Prev
In this blog, we will discussed some important questions may arise before actually starting the modeling.

ref

Day 3

Index:
- Why split data (Training vs Validation vs Testing)?
- Cross-validation: Evaluating validation data?
- Regression vs Classification vs Clustering
- Metrics to evaluation of Testing data

Why split data (Training vs Validation vs Testing)?

Typically the whole data is divided into three parts: training, validation and testing. The training data is used to train the model and adjust its parameters. The validation data is used to evaluate the model’s performance during the training process and tune the hyperparameters. The testing data is used to evaluate the final performance of the model after it has been trained and validated.

It is important to note that the ratio of the data split between these three sets may vary depending on the size of the dataset and the specific use case. For example, if the dataset is small, it may be necessary to use a smaller validation and testing set, whereas if the dataset is large, the validation…

The author made this story available to Medium members only.
If you’re new to Medium, create a new account to read this story on us.

Or, continue in mobile web

Already have an account? Sign in

Mohd Saqib
Mohd Saqib

Written by Mohd Saqib

Scholar @ McGill University, Canada | IIT (ISM) | AMU | Travel | msaqib.cs@gmail.com

Responses (2)

Write a response