Skip to main content
Back to Statistics & Probability
JEE Main 2019
Statistics & Probability
Statistics
Hard

Question

Let x 1 , x 2 ,...........,x n be n observations such that xi2=400\sum {x_i^2} = 400 and xi=80\sum {{x_i}} = 80. Then a possible value of n among the following is

Options

Solution

Key Concepts and Formulas

  • Variance (σ2\sigma^2): A statistical measure that quantifies the spread or dispersion of a set of data points around their mean. For nn observations x1,x2,,xnx_1, x_2, \dots, x_n, the computational formula for variance is: σ2=xi2n(xin)2\sigma^2 = \frac{\sum x_i^2}{n} - \left(\frac{\sum x_i}{n}\right)^2
  • Non-Negativity of Variance: A fundamental property of variance is that it can never be negative. Since variance is calculated from squared differences, and squares of real numbers are always non-negative, their sum and average must also be non-negative. Therefore: σ20\sigma^2 \ge 0
  • Cauchy-Schwarz Inequality (Alternative Insight): For any real numbers a1,,ana_1, \dots, a_n and b1,,bnb_1, \dots, b_n, the inequality states: (i=1naibi)2(i=1nai2)(i=1nbi2)\left(\sum_{i=1}^n a_i b_i\right)^2 \le \left(\sum_{i=1}^n a_i^2\right) \left(\sum_{i=1}^n b_i^2\right) This inequality can be used to derive the same constraint on nn.

Step-by-Step Solution

We are given nn observations x1,x2,,xnx_1, x_2, \dots, x_n with the following sums:

  1. Sum of squares of observations: xi2=400\sum x_i^2 = 400
  2. Sum of observations: xi=80\sum x_i = 80

Our goal is to find a possible value of nn from the given options.

Step 1: Apply the Principle of Non-Negativity of Variance

The most crucial concept here is that the variance of any set of real numbers must be non-negative. We will use the formula for variance and set it greater than or equal to zero.

  • Why this step? The variance formula connects the sum of observations, the sum of squares of observations, and the number of observations (nn). By applying the non-negativity constraint (σ20\sigma^2 \ge 0), we can form an inequality involving nn that will help us find its possible values.

σ2=xi2n(xin)20\sigma^2 = \frac{\sum x_i^2}{n} - \left(\frac{\sum x_i}{n}\right)^2 \ge 0

Step 2: Substitute the Given Values into the Inequality

Now, we will substitute the provided numerical values for xi2\sum x_i^2 and xi\sum x_i into the inequality derived in Step 1.

  • Given values:

    • xi2=400\sum x_i^2 = 400
    • xi=80\sum x_i = 80
  • First, calculate the term (xi)2\left(\sum x_i\right)^2: (xi)2=(80)2=6400\left(\sum x_i\right)^2 = (80)^2 = 6400

  • Substitute these into the variance inequality: 400n6400n20\frac{400}{n} - \frac{6400}{n^2} \ge 0

    • Explanation: The term (xin)2\left(\frac{\sum x_i}{n}\right)^2 correctly expands to (xi)2n2\frac{(\sum x_i)^2}{n^2}, which gives 6400n2\frac{6400}{n^2}. It's important not to confuse this with xi2n2\frac{\sum x_i^2}{n^2}.

Step 3: Solve the Inequality for nn}

Our next task is to algebraically manipulate the inequality 400n6400n20\frac{400}{n} - \frac{6400}{n^2} \ge 0 to find the possible range for nn.

  • Find a Common Denominator: To combine the fractions, we use n2n^2 as the common denominator. We multiply the first term 400n\frac{400}{n} by nn\frac{n}{n}: 400nn26400n20\frac{400n}{n^2} - \frac{6400}{n^2} \ge 0

  • Combine the Terms: Now, we can combine the numerators over the common denominator: 400n6400n20\frac{400n - 6400}{n^2} \ge 0

  • Analyze the Denominator: Since nn represents the number of observations, it must be a positive integer (n1n \ge 1). This means n2n^2 will always be strictly positive (n2>0n^2 > 0).

  • Determine the Sign of the Numerator: For the entire fraction 400n6400n2\frac{400n - 6400}{n^2} to be greater than or equal to zero, and knowing that the denominator n2n^2 is positive, the numerator (400n6400)(400n - 6400) must also be greater than or equal to zero. 400n64000400n - 6400 \ge 0

  • Isolate nn: Add 64006400 to both sides of the inequality: 400n6400400n \ge 6400 Divide both sides by 400400. Since 400400 is a positive number, the direction of the inequality remains unchanged: n6400400n \ge \frac{6400}{400} n16n \ge 16

Step 4: Interpret the Result and Choose the Correct Option

The inequality n16n \ge 16 tells us that the number of observations, nn, must be an integer greater than or equal to 16.

Let's examine the given options: (A) 18 (B) 15 (C) 12 (D) 9

Out of these options, only n=18n = 18 satisfies the condition n16n \ge 16. The other options (15, 12, 9) are all less than 16, which would imply a negative variance, an impossible scenario.


Common Mistakes & Tips

  • Don't Forget Non-Negativity: The principle that variance (σ2\sigma^2) must be 0\ge 0 is the cornerstone of this problem. Always remember this fundamental statistical property.
  • Correct Squaring of the Mean: A frequent error is to incorrectly square the mean term. Remember that (xin)2\left(\frac{\sum x_i}{n}\right)^2 means squaring the entire mean, resulting in (xi)2n2\frac{(\sum x_i)^2}{n^2}, not xi2n2\frac{\sum x_i^2}{n^2} (which is the mean of squares).
  • Nature of nn: Always keep in mind that nn represents the number of observations, so it must be a positive integer (n{1,2,3,}n \in \{1, 2, 3, \dots\}). This allows us to confidently state that n>0n > 0 and n2>0n^2 > 0, simplifying the inequality solving process.
  • Cauchy-Schwarz as an Alternative: As shown in the "Key Concepts" section, the problem can also be solved elegantly using the Cauchy-Schwarz inequality by setting ai=xia_i = x_i and bi=1b_i = 1. This directly leads to (xi)2n(xi2)(\sum x_i)^2 \le n (\sum x_i^2), which upon substituting values gives 802n4006400400nn1680^2 \le n \cdot 400 \Rightarrow 6400 \le 400n \Rightarrow n \ge 16. This provides a quick verification or an alternative solution path for those familiar with it.

Summary

This problem demonstrates how a fundamental statistical principle – the non-negativity of variance – can be used to deduce constraints on unknown parameters. By setting the variance formula greater than or equal to zero and substituting the given sums of observations and squares of observations, we established an inequality for nn. Solving this inequality yielded n16n \ge 16. Comparing this result with the provided options, we found that only n=18n=18 is a possible value.

The final answer is 18\boxed{\text{18}} which corresponds to option (A).

Practice More Statistics & Probability Questions

View All Questions