Estimation of Variable | set 1

22 July 2024

2

Variability: It is the import dimension that measures the data variation i.e. whether the data is spread out or tightly clustered. Also known as Dispersion When working on data sets in Machine Learning or Data Science, involves many steps – variance measurement, reduction, and distinguishing random variability from the real one. identifying sources of real variability, making decisions regarding the pre-processing choice or model selection based on it.

Terms related to Variability Metrics :

-> Deviation 
-> Variance
-> Standard Deviation
-> Mean Absolute Deviation
-> Median Absolute Deviation
-> Order Statistics
-> Range
-> Percentile 
-> Inter-quartile Range

Deviation: We can call it – errors or residuals also. It is the measure of how different/dispersed the values are, from the central/observed value.
Example :

Sequence : [2, 3, 5, 6, 7, 9] 
Suppose, Central/Observed Value = 7

Deviation = [-5, -4, -2, -1, 0, 2]

Variance (s²): It is the best-known measure to estimate the variability as it is Squared Deviation. One can call it mean squared error as it is the average of standard deviation.

Example :

Sequence : [2, 3, 5, 6, 7, 9] 
Mean              = 5.33
Total Terms, n    = 6
Squared Deviation = (2 - 5.33)² + (3 - 5.33)² + (5 - 5.33)²
                    (6 - 5.33)² + (7 - 5.33)² + (9 - 5.33)²
Variance          = Squared Deviation / n

Code –

Python3

# Variance
 
import numpy as np
 
Sequence = [2, 3, 5, 6, 7, 9]
 
var = np.var(Sequence)
 
print("Variance : ", var)

Output :

Variance :  5.5555555555555545

Standard Deviation: It is the square root of Variance. Is also referred to as Euclidean Norm.

Example :

Sequence : [2, 3, 5, 6, 7, 9] 
Mean              = 5.33
Total Terms, n    = 6
Squared Deviation = (2 - 5.33)² + (3 - 5.33)² + (5 - 5.33)²
                    (6 - 5.33)² + (7 - 5.33)² + (9 - 5.33)²

Variance             = Squared Deviation / n
Standard Deviation   = (Variance)^1/2

Code –

Python3

# Standard Deviation
 
import numpy as np
 
Sequence = [2, 3, 5, 6, 7, 9]
 
std = np.std(Sequence)
 
print("Standard Deviation : ", std)

Output :

Standard Deviation :  2.357022603955158

Mean Absolute Deviation: One can estimate a typical estimation for these deviations. If we average the values, the negative deviations would offset the positive ones. Also, the sum of deviations from the mean is always zero. So, it is a simple approach to take the average deviation itself.

Example :

Sequence : [2, 4, 6, 8] 
Mean     = 5
Deviation around mean = [-3, -1, 1, 3]

Mean Absolute Deviation = (3 + 1 + 1 + 3)/ 4

Python3

# Mean Absolute Deviation
 
import numpy as np
 
def mad(data):
    return np.mean(np.absolute(
            data - np.mean(data)))
     
Sequence = [2, 4, 6, 8] 
 
print ("Mean Absolute Deviation : ", mad(Sequence))

Output :

Mean Absolute Deviation :  2.0

Estimation of Variable | set 1

Python3

Python3

Python3

Java Program for Longest Common Subsequence

Maximum height of Tree when any Node can be considered as Root

Print Fibonacci sequence using 2 variables

LEAVE A REPLY Cancel reply

Most Popular

5 Best VPNs for New Jersey in 2025: Tested & Confirmed by Gjurgjica Panova

5 Best VPNs for Boston in 2025: Fast, Secure & Reliable by Todd Faulk

Amazon’s secret software is already replacing the Android-based Fire OS

Android 16 beta 4 quietly removes a small but useful Pixel feature

Recent Comments

EDITOR PICKS

5 Best VPNs for New Jersey in 2025: Tested & Confirmed by Gjurgjica Panova

5 Best VPNs for Boston in 2025: Fast, Secure & Reliable by Todd Faulk

Amazon’s secret software is already replacing the Android-based Fire OS

POPULAR POSTS

5 Best VPNs for New Jersey in 2025: Tested & Confirmed by Gjurgjica Panova

5 Best VPNs for Boston in 2025: Fast, Secure & Reliable by Todd Faulk

Amazon’s secret software is already replacing the Android-based Fire OS

POPULAR CATEGORY

ABOUT US

FOLLOW US