VISUAL STATISTICS STUDIO

Binomial Distributions

The basic distributions within the general linear model of statistics are the binomial distribution of which the normal distribution is the limiting case, the t- and F distributions, and the Chi-Square distribution. Let us begin the discussion of these distributions with the binomial distribution.

Galton's Quincunx

Probably the best way to introduce the binomial distribution is via the Galton’s Quincunx. Galton's Quincunx is an apparatus with a single top compartment that contains a handful of marbles and a maze of ducts leading to several compartments at the bottom of the instrument.

 

 

 

 

If this apparatus is set upright, the marbles will fall through the ducts and mimic the pattern of probabilities contained in the Pascal's triangle. When the marbles reach the bottom compartments, the upper contour of the stack of marbles will bear a strong resemblance to a normal curve. Galton's original device used pins positioned on a board and resembling an ornamental arrangement of five bushes; hence the name quincunx.

 

The probabilities associated with probabilities of falling marbles to enter certain path helps to explain the binomial distribution. In the middle of the top compartment that contains the marbles there is an opening with a middle partition. A marble falling through this partition has an equal probability of falling into one of two lower compartments. Each of these compartments has an opening with a partition in its middle. Below this, there are four more compartments. The probability of a marble falling into the leftmost compartment is half of .50, associated with the compartment above, i.e., .25. The probability of a marble falling into the two inner compartments is the sum of .25 from the first above compartment and .25 from the second, i.e., .50. The probability of a marble falling into the rightmost compartment is again .25. This branching pattern is repeated over the rows of the compartments. For the above example, these probabilities are

 

                                         

 

Plenum of Binary Events

Binary events have two outcomes and the number of rows of the n by k plenum where k is the number of events is

 

                                                              

 

Thus, e.g., three true-false items X1, X2, and X3 can be answered in  ways. Within a three item true-false test we can, theoretically, observe 8 response patterns. To construct this plenum, consider that a half of these 8 different outcomes will be false (0) and the other half will be true (1), thus the algorithm for the construction of this plenum is as follows:

 

Enter four zeroes and four ones to the first column. Split the four zeroes in the first column into two zeroes and two ones and record them in the second column. Do the same for the ones. In the last column, record alternate zeroes and ones.

 

The resulting n x k plenum is shown below.

 

                                                      

 

Note that the variables X1, X2, and X3 are not correlated, i.e.,

 

                                                      

 

and that the different response patterns are permutations (Lat. permutation; a thorough change) of 1s and 0s where their order is important.

 

Suppose that questions X1, X2, and X3 are items on the commercial pilot license test where item X1 pertains to FAA regulations, item X2 pertains to navigation and item X3 to the operation of the aircraft. Such a test is scored as shown in the table below,

 

 

                                               

 

as to give credit to responses that reflect knowledge of administrative regulations without the knowledge of navigation and operation of the aircraft or giving credit to knowledge of navigation without the know-how how to fly the commercial airliner could result in death of hundreds of people. The rectangular distribution of the test scored in this manner is shown below.

 

                                                          

 

Now, suppose that questions X1, X2, and X3 are items on the geography test where items X1, X2 and X3 pertain the capitals of Taiwan, South Korea, and Japan. Such a test is scored as a combination of correct responses, without regard to the order of their response patterns, as shown in the table below,

 

                                      

 

 

where the frequencies of the values of the composite score X form a binomial distribution

 

                                                          

 

The mean of the composite score X is defined as

 

                                                             

 

computed for the X in the above table as 3(.5) which equals 1.5. The variance of this variable can be defined as

 

                                                            

 

computed for the above variable X as 3(.5)(.5) which equals .75. Note that the n for the argument of the binomial frequency distribution equals k+1 and that the variance of the binomial distribution refers to the variance of its expanded frequencies [0 1 1 1 2 2 2 3] that is identical to the variance of the composite variable X [0 1 1 2 1 2 2 3].

 

Pascal Triangle

We could also get the above frequencies directly, by constructing the Pascal's triangle, as follows:

 

Start with 1 on the top of the triangle, and append two 1s on the next line with the top 1 in-between. Continue to append 1s on the extremes of following lines while constructing the in-between terms by summing the adjacent terms above them.

 

 

                                                    

 

Pascal triangle was originally developed as a mnemonic aid for expanding binomials, as the expansion of binomials such as  or  are easy to memorize, but expansions to the higher powers become increasingly more difficult to remember. If we arrange expanded binomials according to the increasing order of their exponents

 

                                                

 

It becomes obvious that these expansions can be separated into a component containing progressions of decreasing and increasing exponents, for the bottom line highlighted as

 

                                               

 

and their coefficients that can be recalled by using the mental image of the Pascal’s triangle. For the example, as

 

                                                  

 

 

and

                                                    

 

 

Pascal’s triangle can be also written in terms of factorials as

 

                                         

 

                                                                                                                               

and its associated Gamma functions

 

                

 

 

Note that the length of the binomial variable is k+1, the argument of the gamma function. When this argument is an integer, the gamma function is just the factorial function offset by one,

 

                                                         

 

However, the gamma functions differ from the factorials that they are capable to compute the factorials not only of integers, but also of the real numbers. The gamma functions are the key parameters of the family of gamma distributions which includes the binomial, normal, t, F, and chi-square distributions.

 

Binomial Distribution as the Precursor of the Normal Distribution

Let us define the binomial distribution within the context of the theoretical model underlying this distribution that has the following properties. The n x k plenum consists of 2k possible response patterns (permutations) to a set of k binary variables, assembled into the variable X without regard to their order, i.e., the variable X is the combination of the response patterns contained by the  variables and defined as their sum. The probabilities associated with the 1 (p) and 0 (q) values of the binary variables are equal, i.e., p = q. The means of the binary variables equal to p and their variances equal to pq. The binary variables are orthogonal (uncorrelated), and thus their sums do not contain the covariance terms. Thus the mean of the variable X is defined as kp and its variance as kpq, as indicated in the diagram below.

 

                              

 

The binomial distribution  associated with this model can be defined as

 

 

                                                  

 

 

where i = 0, 1, … k+1 and j = k  i. For the example of the three items X1, X2, and X3,

 

                                         

 

 

 the ordinate  corresponding to the abscissa  equals

 

 

                                      

 

                                      

 

 

                                      

 

 

                                      

 

and can be plotted as

 

                                                          

 

These results are identical to results obtained by computing probabilities directly from the Pascal's triangle by dividing the rows of the Pascal's triangle by their row sums, as

 

                                         

 

When k approaches infinity, the probabilities defined by

 

 

 

equal to probabilities associated with the standard normal distribution.