Sunday, 19 February 2017

Measures of Central Tendency

MEASURES OF CENTRAL TENDENCY
  In the given set of a scores, there will be a score in which all the scores are clustered, such a tendency of a scores is called central tendency. To measure this central tendency we have 3 measures, like Mean, Median and mode. These are called measures of central tendency.

1.      The Arithmetic Mean (M):
    The Arithmetic mean is the best known and the most widely used measure of a central tendency.
From Ungrouped Data
(i)                  The arithmetic mean is the measure resulting from dividing the sum of the scores in the distribution by the number of scores. Here,

       M = ∑X/N    Where, X = a score
                                           N=number of scores in a series;
                                           ∑=stands for “the sum of.”
Thus, the arithmetic mean or, more simply the mean of the scores
93, 90, 89, 88, and 86 is
      M=   93+90+ 89+ 88+ 86     =   446   = 89.2
                          5                                   5
(ii)                There is another short-cut method. Let us assume that the mean in the above series is (AM) 89. Find out the deviation of each score from the mean. It will be

X
d=X –x
93
93-89=    4
90
90-89=    1
89
89-89=    0
88
88-89=  -1
86
86-89=   3
                ∑ 
              1

Use the formula: M = X + ∑ X1
  N                  Where, x = assumed mean;
d= deviations from the assumed
.Mean.
                         .      .    M=89 + 1/5
M= 89.2
From Grouped Data
(i) Long Method:
C.I                                       Midpoints (x)                    f                            fx
10- 14                                       12                              3                             36
15 – 19                                      17                             5                              85
20 – 24                                      22                              9                            198
25- 29                                        27                            18                            486
30 – 34                                      32                            11                             352
35 -39                                        37                              5                             185
40 – 44                                      42                               6                             252
45- 49                                        47                               2                               94
50 – 54                                       52                               1                               52
                                                 N = 60                                                 ∑ = 1,740
                                        Mean =    ∑f x =            1,740=   29
                                                            N                    60
                                               M=  29
(ii)Short-cut method:
C.I
f
x
d=x-AM/i
fd
10-14
2
12
-3
-6
15 -19
3
17
-2
-6
20-24
4
22
-1
-4
25- 29
6
27
0
0
30 -34
5
32
+1
5
35- 39
2
37
+2
4
40 -44
2
42
+3
6
                                 N    = 24                                                                             ∑fd = -1
d= x-M/i    =       12 – 27                               where, x = mid point
                                   5                                               M = Assumed mean
i=size of the class interval
Mean = AM + ∑f d  X  i
                           N
            = 27 + (-1/24)5
            = 27 – 5/24
            = 27 – 0.2
M=26.8
Merits of Arithmetic Mean
1.       It is rigidly defined, and a biased investigator will get the same mean from the series as an unbiased one. Its value is always definite.
2.       It can be accurately determined with the help of various methods.
3.       A mean is an algebraic measure which is capable of further algebraic treatment. It is useful in computing standard deviation, correlation, etc.
4.       It is simple to follow. It is very easily understandable.
5.       If the number of items in a series is large, the mean provides a good basis of comparison.
6.       A mean has an important property, and that is that is that the sum of the deviations of all the scores from the mean is always zero. In this respect, the median does not qualify.
Limitations of the Mean
1.       As the skewness of the distribution increases, the reliability of the mean decreases.
2.       Since it is calculated from all the items of a series, sometimes the abnormal items may considerably affect the mean, particularly when the number of items is not large. For example, the mean ofRs. 275 is not at all a representative figure of Rs. 1,000, Rs.25, Rs.35 and Rs. 40.
3.       If a single item of a series is missing, the mean cannot be calculated. If out of 500 items, the values of 499 items are known, the mean cannot be calculated. Other averages, such as the median and the mode, do not need complete data.
4.       A mean gives greater importance to the bigger items of a series and less importance to the smaller items. One big item among five, four of which are small, will push up the mean considerably. Thus, the mean has an upward bias. But the reverse is not true. If in a series of five items, four of whichare big and one item small, the mean will not be pulled down very much; e.g., the mean of items 25, 50, 75, 100 and 850 is 220, while the mean of items 500, 525, 550, 575, and 50 is 440.
5.       Sometimes the mean gives fallacious conclusions; e.g. income of two groups of persons;
    A group:     Rs.1000, Rs.100, Rs.75. Rs.25
    B group:    Rs.325, Rs.300, Rs.285. Rs.290
For both the groups, the mean is Rs.300. It would appear that both the groups are economically at the same level, but this is not really so.
6.       Unless the data are very simple, the mean cannot be located merely by inspection, while the median and the mode can be.
7.       It cannot be accurately determined in the case of an open-end table. (A open-end table is that which does not provide exact scores of the extreme 4 or 5 cases.)
8.       A mean can be used only with distributions that give absolute scores. It cannot be used with scores expressing grades or ranks or order of students, such as A, B,C, D,….
9.       A Mean can be a figure which does not exist in the series at all.
10.   A Mean sometimes gives such results as appear almost absurd; e.g., 3.4 children.
Use the mean…
(i)                 When the scores are distributed symmetrically around the central point, i.e., when the distribution is not badly skewed. M is the centre of gravity in the distribution and each score contributes to its determination;
(ii)               When the measures of a central tendency having the greatest stability and reliability are wanted;
(iii)             When other statistics (e.g. S.D, coefficient of correlation) are to be computed later.

2.      The Median (Mdn)
A median is the average of position. It is often called the counting average. “The median is the point below which 50% of the score lie.
·         It is the middle item of the series.
·         It is exactly half of the total series.
·         It is a point not a score.
(i)                  Find the median of the scores 24, 21, 30, 18, 27, 28, 30. Here N is odd. Arranging these seven scores in order of size we get,
18,   21,   24,    27,   28,    30,     30
The median is 27.0, for there are three scores above and three below 27.0
(ii)                Find the median of the scores 24, 21, 30, 18, 27, 28 Here N is even. Arranging these six scores in order of size, we get,
18,    21,    24,   27,    28,    30
Here there are two idle scores, 24 and 27. The average of these two, i.e. (24+27/2 =25.5, is the median.
In short, median is the N+1/2th measure in order of size
From Grouped data

C.I
f
F/Cf
10-14
3
3
15-19
5
8
20 -24
9
17
25-29
18
35
30-34
11
46
35 -39
5
51
40 -44
6
57
45 -49
2
59
50 -54
1
60
                            N =   60
N/2= 60/2 =30
L = 24.5                                                                Median (Md) =L +    N/2 - FXi
F = 17fm
Fm = 18                                                                                        = 24.5 + 30-17 X 5
i=5                                                                                                                    18
                                                                                                     = 24.5 + 13X5
                                                                                                                       18
                                                                                                    =24.5 + 3.6
                                                                                    Median = 28.1
Merits of the Median
1.       It is an ideal average, for it is rigidly defined.
2.       It is easily understood without any difficulty.
3.       It is not affected by the values of the extreme items (i.e., skewness) and as such is sometimes more representative than the mean.
      If the income of five persons are Rs.300,Rs.350, Rs.400, Rs.450 and Rs.10,000, the median in such cases is a better average.
4.       In case of an open-end table, where the values of the extremes are not known, the median can be calculated if the number of items is known.
5.       The median can be used not only with distributions that give absolute scores such as 25, 30, 32, etc., but also with scores expressing grades or ranks or order of students, such as A, B, C, D…
6.       When the units of measurement are unequal, the median is preferable. It does not assume equality of units, whereas the mean does assume it.
7.       The median never gives absurd or fallacious results.
8.       It can be located in many cases merely by an inspection.
Limitations of the Median
1.       A median is a non-algebraic measure and hence not suitable for further algebraic treatment.
2.       It cannot be used for computing other statistical measures such as SD, or coefficient of correlation.
3.       The arrangement of items in the ascending or descending order is sometimes very tedious.
4.       When there are wide variations between the values of different items, a median may not be representive of the series; e.g.,
         Marks: 15, 16, 16, 18, 18, 20, 54, 60, 70, 70, 82
  The median is 20, which is not representive of the series.
5.       When a median has to be calculated in continuous series, it requires interpolation. The assumption of the interpolation that all the frequencies of the class interval are uniformly spread over their values in the class interval may not be actually true. In most cases, it will not be true.
6.       If big or small items in a series are to receive greater importance the median would be an unsuitable average, for it ignores the values of extreme items.
7.       A median is more likely to be affected by the fluctuations of sampling than the mean.
When to use a median?
1.       When the exact midpoint---the 50% point of the distribution--- is wanted;
2.       When there are extreme scores which would markedly affect the mean. They do not disturb the median;
3.       When it is desired that certain scores should influence the central tendency;
4.       When the distribution has an upper or lower class interval of unspecified length;
5.       When a distribution is described and interpreted in terms of percentiles, it is used as a member of the percentile system.
3. The Mode
The crude or empirical mode (inspection average) is the most quently occurring score in the distribution.
    The mode or the modal value is the 3rd measure of central tendency of the frequency distribution.
     The most often repeated score is called Mode.
Example:  66,   68,   69,   70,   70,   70,   71,   71,   72,   73
     Here the most often repeated score is 70. The mode of these score is 70. The above setoff scores has only one value as a mode therefore is called unimodal.
      If two scores are repeated then we say that distribution is bimodal, if it is three trimodal etc.
       Ex:   7, ⑩, ⑥,   8,   13,   9, ⑩, 11,   ⑥
    When data are grouped into a frequency distribution, the crude mode is usually taken to be the midpoint of that interval which contains the largest frequency. For the distribution given in problem (ii) of mean calculation in short-cut, the mode is 27.0.
     A distribution may have more than one mode. It may be unimodal or bimodal or multimodal.
     The true mode is the point (or peak) of greatest concentration in the distribution. When the distribution is not badly skewed,the approximation of the true mode can be calculated by the formula;
            Mode=3 median – 2 mean
         Or   Mo= 3 Mdn – 3 M
Or Direct formula we can use,
       Mo = L +     fm – f1      X i
                         2fm – f1 – f2
Where, L ---Lower limit of C.I. where the mode lies.
              fm. --- Frequency of C.I. where the mode lies.
              f1 ----first frequency, the frequency before frequency in which mode lies.
              f2 – Second frequency, after frequency in which mode lies.
i-size of the class interval
Merits of the Mode
1.       It possesses the merit of simplicity. In a discrete series, the mode can be located even by inspection.
2.       It is commonly understood. A mode is an average which people use in their day-to-day expressions; e.g., sale of a commodity, size of families, etc., are expressed in their mode values.
3.       Unlike the mean, it cannot be a value which is not found in the series.
4.       A mode is not affected by the values of extreme items, provided they adhere to the natural law relating to extremes.
5.       For the determination of a mode, it is not necessary to know the values of all the items of a series.
Limitations of the Mode
1.       A mode is ill-defined, indeterminate and indefinite, and so untrustworthy.
2.       It is not based on all the observations of a series.
3.       It is not capable of further mathematical treatment.
4.       It may be unpresentative in many cases. A slight change in the series may extensively disturb the mode. For example, in the series, 20, 25, 25, 25, 30, 60, 70, 95, 95, 100, 
the mode is 25. If one student gets 26 instead of 25 and one gets 95 instead of 100, the mode will be shifted to 95.
5.       The mode is affected by changes in the grouping scheme, while preparing frequency distributions.
6.       In ungrouped data, if no score is repeated, it may lead to the wrong conclusion that the series has no mode.
7.       As there may be 2, 3, or more modal values, it becomes impossible to set a definite value of a mode.
When to Use a Mode?
Use a mode ----------
1.       When a quick and approximate measure of a central tendency is all that is wanted;

2.       When the measure of a central tendency should be the most typical value.

No comments:

Post a Comment