Mean and Variance in Statistics – Definition, Properties

Mean and variance are two basic summaries of a data set: the mean is a measure of central tendency, and the variance is a measure of dispersion. The mean is the average of a given set of numbers. The variance is the average of the squared differences from the mean.

Dispersion tells us how the data that we are taking for observation are scattered about a central value. We will learn about different properties, but before that, we need to get familiar with some of the features like mean, median and variance of the given data distribution.

If we multiply the observed values of a random variable by a constant t, its sample mean, sample standard deviation and sample variance will be multiplied by t, |t| and t², respectively. Also, if we add a constant m to the observed values of a random variable, that constant value will be added to the sample mean, but the sample standard deviation and sample variance remain unchanged. A similar rule applies to the theoretical mean and variance of random variables.

If Y = tX + m, then

\(\begin{array}{l}\mu_y = t \mu_x + m\end{array} \)

\(\begin{array}{l}\sigma^2_y = t^2 \sigma^2_x\end{array} \)

\(\begin{array}{l}\sigma_y = |t| \sigma_x\end{array} \)

Here, μ_x, σ_x², and σ_x are the mean, variance and standard deviation of the random variable X, respectively, and μ_y, σ_y², and σ_y are the mean, variance and standard deviation of the random variable Y, respectively.
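The rule can be checked numerically on a small data set. The following is a minimal sketch, assuming made-up observations `x` and constants `t` and `m` chosen only for illustration; it uses the population versions (`pvariance`, `pstdev`) from Python's standard `statistics` module, and the same scaling holds for the sample versions.

```python
from statistics import mean, pstdev, pvariance

# Hypothetical observations and constants, chosen only for illustration.
x = [2, 4, 4, 4, 5, 5, 7, 9]
t, m = -3, 10

# Apply Y = tX + m to every observation.
y = [t * xi + m for xi in x]

mean_x, var_x, sd_x = mean(x), pvariance(x), pstdev(x)
mean_y, var_y, sd_y = mean(y), pvariance(y), pstdev(y)

print(mean_y, t * mean_x + m)  # both sides of mu_y = t * mu_x + m
print(var_y, t**2 * var_x)     # both sides of sigma_y^2 = t^2 * sigma_x^2
print(sd_y, abs(t) * sd_x)     # both sides of sigma_y = |t| * sigma_x
```

A negative `t` is used on purpose: the standard deviation scales by |t|, not by t, so it stays non-negative.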

Mean in Statistics

In probability and statistics, the average of a random variable is called its mean or expected value. If we know the probability distribution for a random variable, we can also find its expected value. The mean of a random variable shows the location or the central tendency of the random variable.

The definition of ‘mean’ differs between branches of Mathematics. Here, by mean, we denote the average of the discrete data present in a set of numbers. For ungrouped data, the arithmetic mean is given by

\(\begin{array}{l}\bar{x} = \frac{x_1+x_2+x_3+\dots+x_n}{n}\end{array} \)

\(\begin{array}{l}\bar{x} = \frac{1}{n}\sum_{i=1}^n x_i\end{array} \)

Where x₁, x₂, x₃, ….., x_n denote the value of the respective terms,

And n = number of terms

Now consider data in which each value x_i occurs with its own frequency f_i.

The formula for the mean calculation in this case (called the discrete frequency data) is,

\(\begin{array}{l}\bar{x} = \frac{f_1x_1+ f_2x_2+ f_3x_3+\dots+ f_nx_n}{N}= \frac{1}{N}\sum_{i=1}^n f_ix_i \end{array} \)

Where, x₁, x₂, x₃, …, x_n denote the distinct values

And f₁, f₂, f₃, …, f_n denote the respective frequencies of those values

And N = f₁ + f₂ + f₃ + … + f_n is the total number of observations
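As a quick check of the frequency formula, the sketch below uses a made-up frequency table (the values and frequencies are assumptions for illustration) and compares the result with the plain arithmetic mean of the same data written out observation by observation. Note that the divisor is the total frequency, not the number of distinct values.

```python
from fractions import Fraction

# Hypothetical frequency table: value -> frequency.
freq = {10: 3, 20: 5, 30: 2}

total_frequency = sum(freq.values())                 # 3 + 5 + 2 = 10
weighted_sum = sum(x * f for x, f in freq.items())   # 30 + 100 + 60 = 190

# Mean of discrete frequency data: sum of f_i * x_i over the total frequency.
freq_mean = Fraction(weighted_sum, total_frequency)

# The same data written out as ungrouped observations.
expanded = [x for x, f in freq.items() for _ in range(f)]
plain_mean = Fraction(sum(expanded), len(expanded))

print(freq_mean, plain_mean)  # 19 19
```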

The formula is the same for a sample and for a population, but the notation is different; the sample mean is denoted by x̄, and the population mean is represented by μ.

Properties of Mean

Some properties of the mean are given by:
1. If we increase individual units by k, then the mean will increase by k.
2. If we decrease individual units by k, then the mean will decrease by k.
3. If we multiply each unit by k, then the mean will be multiplied by k.
4. If we divide each unit by k (k ≠ 0), then the mean will be divided by k.

What Is Variance in Statistics?

Variance is the expected value of the squared deviation of a random variable from its mean value. If we simply added up the deviations of a set of values from their mean, the positive and negative deviations would cancel each other out. One way to avoid this is to take the absolute values of the deviations, which gives the mean deviation.

Another way to remove the sign of the deviations is to square them, and this is what the variance does. As squares are never negative, the variance is always non-negative.

Let us take n observations a₁, a₂, a₃, …, a_n, with their mean represented by

\(\begin{array}{l}\bar{a}.\end{array} \)

Then, the sum of the squared deviations from the mean is

\(\begin{array}{l}(a_1- \bar{a})^2 + (a_2-\bar{a})^2 + (a_3-\bar{a})^2 + \dots + (a_n-\bar{a})^2=\sum_{i=1}^n (a_i-\bar{a})^2\end{array} \)

The variance, denoted by σ², is the mean of these squared deviations, that is, this sum divided by n.


Variance of Random Variables in Probability and Statistics

The variance of a random variable measures its variability. It is the expected squared distance of the random variable from its mean.

It can be calculated by using the formula below:

σ_x² = Var(X) = ∑_i (x_i − μ)² p(x_i) = E[(X − μ)²]

Var(X) = E(X²) − [E(X)]²

where [E(X)]² = [∑_i x_i p(x_i)]² = μ² and E(X²) = ∑_i x_i² p(x_i).
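Both forms of the formula give the same number. The sketch below illustrates this for an assumed example distribution, a fair six-sided die (not taken from the text above), using exact fractions so that the two results can be compared without rounding.

```python
from fractions import Fraction

# Hypothetical distribution: a fair six-sided die, p(x) = 1/6 for x = 1..6.
pmf = {x: Fraction(1, 6) for x in range(1, 7)}

mu = sum(x * p for x, p in pmf.items())        # E(X)
e_x2 = sum(x**2 * p for x, p in pmf.items())   # E(X^2)

# Definition: Var(X) = sum of (x_i - mu)^2 * p(x_i)
var_definition = sum((x - mu) ** 2 * p for x, p in pmf.items())

# Shortcut: Var(X) = E(X^2) - [E(X)]^2
var_shortcut = e_x2 - mu**2

print(mu, var_definition, var_shortcut)  # 7/2 35/12 35/12
```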

Properties of Variance

(1) If the variance is zero, this means that every deviation

\(\begin{array}{l}(a_i - \bar{a})\end{array} \)

is equal to zero, that is, each value of the set is equal to the mean value

\(\begin{array}{l}\bar{a}.\end{array} \)

(2) If the variance is small, the observations lie close to the mean value; if it is large, the observations are spread far from the mean value.

(3) If each observation is increased by ‘a’, where a ∈ R, then the variance will remain unchanged.

(4) If each observation is multiplied by ‘a’, where a ∈ R, then the variance will be multiplied by a².

Note that the sum of the squared deviations

\(\begin{array}{l}\sum_{i=1}^{n} (a_i - \bar{a})^2\end{array} \)

is not by itself a proper measure of dispersion: it grows as more observations are included, even when the observations are no more scattered about the mean. So, to overcome this difficulty, we take the mean of the squares of the deviations.
So, the variance is given by:

\(\begin{array}{l}\sigma^2 = \frac{1}{n}\sum_{i=1}^{n} (a_i - \bar{a})^2\end{array} \)

As a result of squaring, the unit of variance is the square of the unit of the data, not the unit of the data itself.
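The variance formula and properties (1), (3) and (4) can be tried out directly. This is a minimal sketch, assuming a made-up data set and constant; `population_variance` is a helper name introduced here for illustration, and exact fractions are used so the comparisons are not affected by rounding.

```python
from fractions import Fraction

def population_variance(data):
    """Mean of the squared deviations from the mean (divides by n)."""
    n = len(data)
    a_bar = Fraction(sum(data), n)
    return sum((a - a_bar) ** 2 for a in data) / n

# Hypothetical observations and constant, chosen only for illustration.
data = [3, 7, 7, 19]
c = 5

base = population_variance(data)                       # 36
shifted = population_variance([a + c for a in data])   # property (3): unchanged
scaled = population_variance([a * c for a in data])    # property (4): times c^2
constant = population_variance([4, 4, 4, 4])           # property (1): zero

print(base, shifted, scaled, constant)  # 36 36 900 0
```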


Solved Examples

Question 1: An experiment is conducted with 16 values of b, and the following results were obtained: ∑ b² = 2560 and ∑ b = 180. On checking through the data again, it is observed that one observation recorded as 30 should have been 20. Then, the corrected variance will be

Solution:

∑ b² = 2560 and ∑ b = 180

Let b′ denote the corrected observations. So, ∑ b′ = 180 − 30 + 20 = 170

The sum of squares decreases by

30² − 20² = 900 − 400 = 500

So, the corrected sum of squares becomes ∑ (b′)² = 2560 − 900 + 400 = 2060

So, the corrected variance will be = 1/n ∑ (b′)² − [1/n ∑ b′]² = 1/16 × 2060 − (1/16 × 170)² = 128.75 − 112.890625 = 15.859375
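The arithmetic in this solution can be reproduced in a few lines. The sketch below uses only the figures given in the question; the variable names are chosen here for illustration.

```python
from fractions import Fraction

n = 16
sum_b, sum_b2 = 180, 2560   # totals computed with the wrong observation
wrong, right = 30, 20       # 30 was recorded where 20 was meant

# Correct both totals by removing the wrong value and adding the right one.
corrected_sum = sum_b - wrong + right             # 170
corrected_sum2 = sum_b2 - wrong**2 + right**2     # 2060

# Variance = mean of squares minus square of the mean.
corrected_variance = Fraction(corrected_sum2, n) - Fraction(corrected_sum, n) ** 2

print(float(corrected_variance))  # 15.859375
```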

Question 2: Let us take two sets of values where one set is represented by the scores of 100 Indian batsmen, and the other represents the scores of 100 Australian batsmen. Incidentally, the Indians have scored runs in the order 550, 551, 552, …, 649. And the Australian batsmen have scored runs in the order 900, 901, 902, …, 999. If the variances of the two sets are represented by

σ_A² and σ_B², then σ_A²/σ_B² is

Solution:

We know, σ² = (∑ d_i²)/n, where d_i is the deviation of the i-th value from the mean.

Here, both the Australian and Indian batsmen sets consist of 100 consecutive positive integers, so n = 100 for both and the deviations from the mean are identical. Thus, ∑ d_i² is the same for both of these integer sets.

So, σ_A²/σ_B² = 1
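This result is easy to confirm by computing both variances directly. A short sketch using `pvariance` from Python's standard `statistics` module on the two score lists from the question:

```python
from statistics import pvariance

indian = list(range(550, 650))        # 550, 551, ..., 649
australian = list(range(900, 1000))   # 900, 901, ..., 999

var_indian = pvariance(indian)
var_australian = pvariance(australian)

# Shifting every score by the same amount leaves the variance unchanged.
print(var_indian, var_australian, var_indian / var_australian)
```

Both variances equal (100² − 1)/12 = 833.25, the variance of any 100 consecutive integers.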

Question 3: The mean and variance of a random variable X are 125 and 225, respectively. Find the mean and variance of the new random variables obtained by subtracting the mean from X and then dividing the result by the standard deviation.

Solution: The first new random variable is the original random variable minus its mean.

Let Y be this random variable, and then

Y = X − 125

\(\begin{array}{l}\mu_Y = \mu_X - 125 = 125 - 125 = 0\end{array} \)

\(\begin{array}{l}\sigma^2_Y = \sigma^2_X =225\end{array} \)

\(\begin{array}{l}\sigma_Y = \sigma_X =15\end{array} \)

If we create a new variable Z = Y/15, which is obtained by dividing the random variable Y by its standard deviation, the mean and variance of this new variable are

\(\begin{array}{l}\mu_Z = \mu_Y/15 = 0/15 = 0\end{array} \)

\(\begin{array}{l}\sigma^2_Z = \sigma^2_Y/15^2 =225/225 = 1\end{array} \)

The new random variable Z has a mean of 0 and a variance of 1.
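The two steps are both instances of the rule for Y = tX + m stated at the start of this article. A minimal sketch follows; `linear_transform_moments` is a helper name introduced here for illustration, not a library function.

```python
from fractions import Fraction

def linear_transform_moments(mu_x, var_x, t, m):
    """Return (mean, variance) of tX + m, given the mean and variance of X."""
    return t * mu_x + m, t**2 * var_x

# Step 1: Y = X - 125, that is, t = 1 and m = -125.
mu_y, var_y = linear_transform_moments(125, 225, 1, -125)

# Step 2: Z = Y / 15, that is, t = 1/15 and m = 0.
mu_z, var_z = linear_transform_moments(mu_y, var_y, Fraction(1, 15), 0)

print(mu_y, var_y)  # 0 225
print(mu_z, var_z)  # 0 1
```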

Question 4: Find the mean of the values 0, 1, 2, …, n having corresponding weights

\(\begin{array}{l}{}^{n}C_{0},\ {}^{n}C_{1},\ {}^{n}C_{2},\ \dots,\ {}^{n}C_{n}\end{array} \)

Solution:
The required mean is

\(\begin{array}{l}\bar{x}=\frac{0\cdot{}^{n}C_{0}+1\cdot{}^{n}C_{1}+2\cdot{}^{n}C_{2}+3\cdot{}^{n}C_{3}+\dots+n\cdot{}^{n}C_{n}}{{}^{n}C_{0}+{}^{n}C_{1}+{}^{n}C_{2}+\dots+{}^{n}C_{n}}\\ =\frac{\sum\limits_{r=0}^{n} r\cdot{}^{n}C_{r}}{\sum\limits_{r=0}^{n}{}^{n}C_{r}}=\frac{\sum\limits_{r=1}^{n} r\cdot\frac{n}{r}\,{}^{n-1}C_{r-1}}{\sum\limits_{r=0}^{n}{}^{n}C_{r}}\\ = \frac{n\sum\limits_{r=1}^{n}{}^{n-1}C_{r-1}}{\sum\limits_{r=0}^{n}{}^{n}C_{r}}\\ =\frac{n\cdot 2^{n-1}}{2^{n}}\\ =\frac{n}{2}\end{array} \)
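The closed form n/2 can be spot-checked for particular values of n. The sketch below computes the weighted mean directly with `math.comb`; the function name and the sample values of n are assumptions made for illustration.

```python
from fractions import Fraction
from math import comb

def binomial_weighted_mean(n):
    """Weighted mean of 0, 1, ..., n with weights nC0, nC1, ..., nCn."""
    weights = [comb(n, r) for r in range(n + 1)]
    weighted_sum = sum(r * w for r, w in enumerate(weights))
    return Fraction(weighted_sum, sum(weights))

for n in (1, 4, 7, 10):
    print(n, binomial_weighted_mean(n), Fraction(n, 2))
```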

Question 5: The average of n numbers x₁, x₂, x₃, …, x_n is M. If x_n is replaced by x′, then the new average is

Solution:

M = (x₁ + x₂ + x₃ + … + x_n)/n

nM = x₁ + x₂ + x₃ + … + x_{n−1} + x_n

nM − x_n = x₁ + x₂ + x₃ + … + x_{n−1}

\(\begin{array}{l}\frac{nM-x_{n}+x'}{n} = \frac{x_{1}+x_{2}+x_{3}+\dots+x_{n-1}+x'}{n} \\ \text{New average} =\frac{nM-x_{n}+x'}{n}\end{array} \)
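The formula can be compared with a direct recomputation of the average. This is a small sketch with made-up numbers; `replaced_mean` is a helper name introduced here for illustration.

```python
from fractions import Fraction

def replaced_mean(n, old_mean, old_value, new_value):
    """New average after one of n numbers changes: (nM - x_n + x') / n."""
    return Fraction(n * old_mean - old_value + new_value, n)

# Hypothetical numbers, chosen only for illustration.
values = [12, 15, 9, 24]
n = len(values)
M = Fraction(sum(values), n)     # original average, 15
new_last = 4                     # x', the replacement for x_n = 24

by_formula = replaced_mean(n, M, values[-1], new_last)
direct = Fraction(sum(values[:-1]) + new_last, n)

print(by_formula, direct)  # 10 10
```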


Frequently Asked Questions

What do you mean by variance?

The average of the squared differences from the mean is called variance. Variance is denoted by σ².

What do you mean by mean in statistics?

Mean is the average of a given set of observations.

Give two properties of the mean.

If the individual units are increased by k, then the mean will increase by k.
If the individual units are decreased by k, then the mean will decrease by k.

Give the equation for finding the variance.

Variance is given by the equation σ² = ∑(x_i − x̄)²/N, where x₁, x₂, …, x_N are the N observations and x̄ is their mean.
