B.5 Poisson Errors and Link Log
Let us now apply the general theory to the Poisson case, with
emphasis on the log link function.
B.5.1 The Poisson Distribution
A Poisson random variable has probability distribution function
for yi = 0, 1, 2, . The moments are
Let us verify that this distribution belongs to the exponential
family as defined by Nelder and Wedderburn (1972).
Taking logs we find
|
logfi(yi) = yi log(mi) - mi - log(yi!). |
|
Looking at the coefficient of yi we see immediately that the
canonical parameter is
and therefore that the canonical link is the log. Solving for
mi we obtain the inverse link
and we see that we can write the second term in the p.d.f. as
The last remaining term is a function of yi only, so
we identify
Finally, note that we can take ai(f) = f and f = 1,
just as we did in the binomial case.
Let us verify the mean and variance. Differentiating the
cumulant function b(qi) we have
and differentiating again we have
|
vi = ai(f) b(qi) = eqi = mi. |
|
Note that the mean equals the variance.
B.5.2 Fisher Scoring in Log-linear Models
We now consider the Fisher scoring algorithm for Poisson regression
models with canonical link, where we model
The derivative of the link is easily seen to be
Thus, the working dependent variable has the form
The iterative weight is
and simplifies to
Note again that the weight is inversely proportional to the variance
of the working dependent variable.
B.5.3 The Poisson Deviance
Let [^(mi)] denote the m.l.e. of mi under the model of interest
and let [(mi)\tilde] = yi denote the m.l.e. under the saturated model.
From first principles, the deviance is
|
| |
|
|
|
2 |
| [ yi log(yi) - yi - log(yi!) |
| |
|
| - yi log( |
^ mi
|
) + |
^ mi
|
+ log(yi!)]. |
|
| |
|
Note that the terms on yi! cancel out.
Collecting terms on yi we have
|
D = 2 |
| [ yi log( |
yi
|
) - (yi - |
^ mi
|
)]. |
| (B.25) |
The similarity of the Poisson and Binomial deviances should not go unnoticed.
Note that the first term in the Poisson deviance has the form
which is identical to the Binomial deviance. The second term is usually
zero. To see this point, note that for a canonical link the score is
and setting this to zero leads to the estimating equations
In words, maximum likelihood estimation for Poisson log-linear
models-and more generally for any generalized linear model
with canonical link-requires equating
certain functions of the m.l.e.'s (namely X[^(m)])
to the same functions of the data (namely Xy).
If the model has a constant, one column of X will
consist of ones and therefore one of the estimating equations will be
|
|
| yi = |
| |
^ mi
|
or |
| (yi- |
^ mi
|
) = 0, |
|
so the last term in the Poisson deviance is zero.
This result is the basis of an alternative algorithm for computing
the m.l.e.'s known as ``iterative proportional fitting'',
see Bishop et al. (1975) for a description.
The Poisson deviance has an asymptotic chi-squared distribution
as n with the number of parameters p remaining
fixed, and can be used as a goodness of fit test. Differences
between Poisson deviances for nested models (i.e. the log of the
likelihood ratio test criterion) have asymptotic chi-squared
distributions under the usual regularity conditions.
Copyright © Germán Rodríguez, 1993-2000.
Please send feedback to grodri@princeton.edu
Conversion from LaTeX was done using TTH, version 2.34.