Normalization and scaling features in ML

28 vues (au cours des 30 derniers jours)
NOR EL ISLAM DAHMOUNI
NOR EL ISLAM DAHMOUNI le 28 Oct 2020
Hello everyone
its is very important to scale and normalize data for training ML algorithme, lets take for exemple the mean normalization , so to normalize one feature we take the each instance of vector feature so we substract the mean of this vector x_vector - mean (x_vector), and divide by max(x_vector)-min(x_vector ) . Now the question is : should i normalize and scale the data test as well ?, if yes on based on what after the integration of our model for an real time application to normalize our data from sensors ? (real data without normaliziation and scaling ) , like we dont have vector and then we calculate the mean and max min to normalize it.
Thanks for your contribution

Réponses (1)

Shantanu Dixit
Shantanu Dixit le 16 Juin 2023
Hi Nor,
This is a concept related to feature scaling, here's how you would handle normalization for real-time data using the statistics from the training data:
  1. Calculate the mean (x_train_mean), maximum (x_train_max), and minimum (x_train_min) values for each feature in the training data (x_train).
  2. Normalize the real-time data (x_realtime) using the formula: x_normalized = (x_realtime - x_train_mean) / (x_train_max - x_train_min).
  3. Use the normalized real-time data (x_normalized) as input to your trained model for prediction.
By using the mean and range values from the training data, you ensure consistency in the normalization process.

Catégories

En savoir plus sur Statistics and Machine Learning Toolbox dans Help Center et File Exchange

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by