Skip to main content
Back to Statistics & Probability
JEE Main 2018
Statistics & Probability
Statistics
Hard

Question

The mean and variance of 7 observations are 8 and 16, respectively. If five observations are 2, 4, 10, 12, 14, then the absolute difference of the remaining two observations is :

Options

Solution

Key Concepts and Formulas

  1. Mean (xˉ\bar{x}): The average of a set of nn observations x1,x2,,xnx_1, x_2, \ldots, x_n. xˉ=i=1nxin\bar{x} = \frac{\sum_{i=1}^{n} x_i}{n} This formula helps relate the sum of all observations to the given mean.

  2. Variance (σ2\sigma^2): A measure of the spread of data points from the mean. The computational formula for variance is particularly useful: σ2=i=1nxi2n(xˉ)2\sigma^2 = \frac{\sum_{i=1}^{n} x_i^2}{n} - (\bar{x})^2 This formula allows us to directly calculate the sum of the squares of observations.

  3. Algebraic Identities: These identities are crucial for manipulating expressions involving sums, products, and differences of variables:

    • (a+b)2=a2+b2+2ab(a+b)^2 = a^2 + b^2 + 2ab
    • (ab)2=(a+b)24ab(a-b)^2 = (a+b)^2 - 4ab

Step-by-Step Solution

We are given n=7n=7 observations with a mean xˉ=8\bar{x}=8 and variance σ2=16\sigma^2=16. Five of these observations are 2,4,10,12,142, 4, 10, 12, 14. Let the two unknown observations be xx and yy. Our goal is to find the absolute difference, xy|x-y|.

Step 1: Use the Mean Formula to Find the Sum of the Unknown Observations (x+yx+y)

Why this step? The mean formula provides a direct relationship between the sum of all observations (including the unknowns) and the given mean. This will yield our first equation involving xx and yy.

First, calculate the sum of the five known observations: Sum of known observations=2+4+10+12+14=42\text{Sum of known observations} = 2 + 4 + 10 + 12 + 14 = 42

Now, apply the mean formula, including the unknown observations xx and yy: xˉ=Sum of all observationsn\bar{x} = \frac{\text{Sum of all observations}}{n} 8=42+x+y78 = \frac{42 + x + y}{7}

Multiply both sides by 7: 8×7=42+x+y8 \times 7 = 42 + x + y 56=42+x+y56 = 42 + x + y

Subtract 42 from both sides to isolate x+yx+y: x+y=5642x + y = 56 - 42 x+y=14(Equation 1)\mathbf{x + y = 14 \quad \text{(Equation 1)}} We now have the sum of the two unknown observations.

Step 2: Use the Variance Formula to Find the Sum of Squares of the Unknown Observations (x2+y2x^2+y^2)

Why this step? The computational variance formula involves the sum of squares of all observations (xi2\sum x_i^2). Since we know the variance and mean, this formula allows us to set up a second equation to find x2+y2x^2+y^2, which is essential for solving for xx and yy.

First, calculate the sum of squares of the five known observations: Sum of squares of known observations=22+42+102+122+142\text{Sum of squares of known observations} = 2^2 + 4^2 + 10^2 + 12^2 + 14^2 =4+16+100+144+196= 4 + 16 + 100 + 144 + 196 =460= 460

Now, apply the variance formula, substituting the given values (σ2=16\sigma^2=16, xˉ=8\bar{x}=8, n=7n=7) and the sum of squares of known observations: σ2=xi2n(xˉ)2\sigma^2 = \frac{\sum x_i^2}{n} - (\bar{x})^2 16=460+x2+y27(8)216 = \frac{460 + x^2 + y^2}{7} - (8)^2 16=460+x2+y276416 = \frac{460 + x^2 + y^2}{7} - 64

Add 64 to both sides to isolate the fraction term: 16+64=460+x2+y2716 + 64 = \frac{460 + x^2 + y^2}{7} 80=460+x2+y2780 = \frac{460 + x^2 + y^2}{7}

Multiply both sides by 7: 80×7=460+x2+y280 \times 7 = 460 + x^2 + y^2 560=460+x2+y2560 = 460 + x^2 + y^2

Subtract 460 from both sides to isolate x2+y2x^2+y^2: x2+y2=560460x^2 + y^2 = 560 - 460 x2+y2=100(Equation 2)\mathbf{x^2 + y^2 = 100 \quad \text{(Equation 2)}} We now have the sum of squares of the two unknown observations.

Step 3: Combine Equations to Find the Product of the Unknown Observations (xyxy)

Why this step? We have x+yx+y (from Equation 1) and x2+y2x^2+y^2 (from Equation 2). The algebraic identity (x+y)2=x2+y2+2xy(x+y)^2 = x^2+y^2+2xy directly connects these two expressions to xyxy. Finding xyxy is a necessary intermediate step to calculate xy|x-y|.

Recall the algebraic identity: (x+y)2=x2+y2+2xy(x+y)^2 = x^2 + y^2 + 2xy

Substitute the values from Equation 1 (x+y=14x+y=14) and Equation 2 (x2+y2=100x^2+y^2=100): (14)2=100+2xy(14)^2 = 100 + 2xy 196=100+2xy196 = 100 + 2xy

Subtract 100 from both sides: 196100=2xy196 - 100 = 2xy 96=2xy96 = 2xy

Divide by 2 to find xyxy: xy=962xy = \frac{96}{2} xy=48\mathbf{xy = 48} We now have the product of the two unknown observations.

Step 4: Calculate the Absolute Difference (xy|x-y|)

Why this step? The ultimate objective is to find xy|x-y|. We have x+yx+y and xyxy, which can be directly used with the algebraic identity (xy)2=(x+y)24xy(x-y)^2 = (x+y)^2 - 4xy. This approach is more efficient than solving for xx and yy individually.

Recall the algebraic identity: (xy)2=(x+y)24xy(x-y)^2 = (x+y)^2 - 4xy

Substitute the values we found: x+y=14x+y=14 and xy=48xy=48: (xy)2=(14)24(48)(x-y)^2 = (14)^2 - 4(48) (xy)2=196192(x-y)^2 = 196 - 192 (xy)2=4(x-y)^2 = 4

To find the absolute difference, take the square root of both sides: (xy)2=4\sqrt{(x-y)^2} = \sqrt{4} xy=2|x-y| = 2 Thus, the absolute difference of the remaining two observations is 2.

Common Mistakes & Tips

  • Variance Formula Error: A common mistake is using the formula σ2=(xixˉ)2n\sigma^2 = \frac{\sum (x_i - \bar{x})^2}{n} instead of the computational formula. While correct, the computational formula σ2=xi2n(xˉ)2\sigma^2 = \frac{\sum x_i^2}{n} - (\bar{x})^2 is much more efficient when xˉ\bar{x} and xi2\sum x_i^2 are readily available or calculable.
  • Arithmetic Errors: With multiple calculations involving squares and sums, it's easy to make small arithmetic mistakes. Always double-check your calculations, especially during summation and squaring.
  • Ignoring Algebraic Identities: Attempting to solve for xx and yy individually by substituting y=14xy=14-x into x2+y2=100x^2+y^2=100 would lead to a quadratic equation (x2+(14x)2=100x^2 + (14-x)^2 = 100). While solvable, it's more time-consuming than directly using the identities to find (xy)2(x-y)^2.

Summary

This problem effectively demonstrates the application of mean and variance definitions in conjunction with fundamental algebraic identities to solve for unknown data points. We systematically used the mean to establish the sum of the unknown observations, then employed the variance formula to find the sum of their squares. Finally, algebraic identities were leveraged to efficiently determine their product and, ultimately, their absolute difference. The absolute difference between the two remaining observations is 2.

The final answer is 2\boxed{2}, which corresponds to option (A).

Practice More Statistics & Probability Questions

View All Questions