5.2. Models for Three-Dimensional Tables Table of Contents 6.2. The Multinomial Logit Model

Chapter 6
Multinomial Response Models

We now turn our attention to regression models for the analysis of categorical dependent variables with more than two response categories. Several of the models that we will study may be considered generalizations of logistic regression analysis to polychotomous data. We first consider models that may be used with purely qualitative or nominal data, and then move on to models for ordinal data, where the response categories are ordered.

6.1  The Nature of Multinomial Data

Let me start by introducing a simple dataset that will be used to illustrate the multinomial distribution and multinomial response models.

6.1.1  The Contraceptive Use Data

Table 6.1 was reconstructed from weighted percents found in Table 4.7 of the final report of the Demographic and Health Survey conducted in El Salvador in 1985 (FESAL-1985). The table shows 3165 currently married women classified by age, grouped in five-year intervals, and current use of contraception, classified as sterilization, other methods, and no method.

Table 6.1: Current Use of Contraception By Age
Currently Married Women. El Salvador, 1985

AgeContraceptive MethodAll
Ster.OtherNone
15-19361232296
20-2480137400617
25-29216131301648
30-3426876203547
35-3919750188435
40-4415024164338
45-499110183284
All100548916713165

A fairly standard approach to the analysis of data of this type could treat the two variables as responses and proceed to investigate the question of independence. For these data the hypothesis of independence is soundly rejected, with a likelihood ratio c2 of 521.1 on 12 d.f.

In this chapter we will view contraceptive use as the response and age as a predictor. Instead of looking at the joint distribution of the two variables, we will look at the conditional distribution of the response, contraceptive use, given the predictor, age. As it turns out, the two approaches are intimately related.

6.1.2  The Multinomial Distribution

Let us review briefly the multinomial distribution that we first encountered in Chapter 5. Consider a random variable Yi that may take one of several discrete values, which we index 1, 2, , J. In the example the response is contraceptive use and it takes the values `sterilization', `other method' and `no method', which we index 1, 2 and 3. Let

pij = Pr{ Yi = j }
(6.1)
denote the probability that the i-th response falls in the j-th category. In the example pi1 is the probability that the i-th respondent is `sterilized'.

Assuming that the response categories are mutually exclusive and exhaustive, we have j = 1J pij = 1 for each i, i.e. the probabilities add up to one for each individual, and we have only J-1 parameters. In the example, once we know the probability of `sterilized' and of `other method' we automatically know by subtraction the probability of `no method'.

For grouped data it will be convenient to introduce auxiliary random variables representing counts of responses in the various categories. Let ni denote the number of cases in the i-th group and let Yij denote the number of responses from the i-th group that fall in the j-th category, with observed value yij.

In our example i represents age groups, ni is the number of women in the i-th age group, and yi1, yi2, and yi3 are the numbers of women sterilized, using another method, and using no method, respectively, in the i-th age group. Note that j yij = ni, i.e. the counts in the various response categories add up to the number of cases in each age group.

For individual data ni = 1 and Yij becomes an indicator (or dummy) variable that takes the value 1 if the i-th response falls in the j-th category and 0 otherwise, and j yij = 1, since one and only one of the indicators yij can be `on' for each case. In our example we could work with the 3165 records in the individual data file and let yi1 be one if the i-th woman is sterilized and 0 otherwise.

The probability distribution of the counts Yij given the total ni is given by the multinomial distribution

Pr{Yi1 = yi1, , YiJ = yiJ } =

ni
yi1, yiJ


pi1yi1 piJyiJ
(6.2)
The special case where J = 2 and we have only two response categories is the binomial distribution of Chapter 3. To verify this fact equate yi1 = yi, yi2 = ni-yi, pi1 = pi, and pi2 = 1-pi.


Continue with 6.2. The Multinomial Logit Model
Copyright © Germán Rodríguez, 1993-2000. Please send feedback to grodri@princeton.edu
Conversion from LaTeX was done using TTH, version 2.34.