Next: About this document ...
Up: Properties of the Poisson
Previous: Mean and Variance
So far we have been thinking of the probability for getting a result
if we know that the mean value should be
. Now suppose we
make a measurement and get
counts, but we don't know anything
about
, except that it must be nonnegative, of course. We may
turn the question around and ask what is the most likely value for
, given the result of our measurement. To make this
turned-around idea more concrete, we use the concept of conditional
probability. We say that the Poisson distribution
tells
us the probability that we get
, on the condition that the mean
value is
. The notation
denotes the
probability for getting
, given that
occurs or
is true.
Thus we could write
 |
(19) |
Now the reverse question is, ``What is the probability that the mean
value is
, given that we just made a measurement and got
?''. This probability would be denoted
. Now a
trivial but important theorem due to Bayes states that
 |
(20) |
where
is the a priori probability for
to occur,
regardless of whether the event
occurs, and
is the a
priori probability for
to occur, regardless of whether the event
occurs. From this theorem we conclude that
 |
(21) |
So we need to know
and
to make progress. The first
is the a priori probability for getting a particular value for
. If we don't know anything about
, except that it is
nonnegative, then we must say that any nonnegative value whatsoever is
equally probable. Thus without benefit of knowing the outcome of the
measurement, we say
is constant, independent of
for nonnegative
, and it is zero for negative
. So
the rhs of this equation reduces simply to
 |
(22) |
where the normalization factor
can be determined
by requiring that the total probability for having any
is 1.
In fact it turns out that
, so
 |
(23) |
This distribution is called the likelihood function for the parameter
. Notice that we are now thinking of the rhs as a continuous
function of
with fixed
. This result is very remarkable,
since a single measurement is giving us the whole probability
distribution! Recall that if we were to measure the length of a table
top, even if we started by assuming we were going to get a Gaussian
distribution, a single measurement would allow us only to guess
and would tell us nothing about
. To get
takes at
least two measurements, and even then we would be putting ourselves at
the mercy of the gods of statistics for taking a chance with only two
measurements. If we weren't so rash as to assume a Gaussian, we would
have to make many measurements of the length of the table top to get
the probability distribution in the measured length.
We now ask, what is the most probable value of
, given that we
just found
? This is the value with maximum likelihood. If we
examine the probability distribution, we see that it peaks at
,
just as we might have expected. We may then ask, what is the error in
the determination of this value. This is a tricky question, because
the Poisson distribution is not shaped like a Gaussian distribution.
However, for large
it looks more and more like a Gaussian.
Expanding the log of the Poisson distribution for large
and fixed
gives
![\begin{displaymath}
P(\bar k \vert k) \approx \exp[-(k - \bar k)^{2}/(2 \bar k)]/\sqrt{2 \pi
\bar k}
\end{displaymath}](img91.gif) |
(24) |
so for large
the error is
 |
(25) |
To summarize, a single measurement yields the entire probability distribution.
For large enough
we can say that
 |
(26) |
To see how Bayesian statistics works, suppose we repeated the
experiment and got a new value
. What is the probability
distribution for
in light of the new result? Now things have
changed, since the a priori probability for
is no
longer constant because we already made one measurement and got
.
Instead we have
 |
(27) |
so
 |
(28) |
Notice that the likelihood function is now the product of the
individual likelihood functions. A more systematic notation would
write this function as
, i.e. the
probability for
having a particular value, given that we made
two measurements and found
and
. The normalization
factor
is obtained by requiring the total probability to be 1.
For large
,
, and
, the most likely value of
is easily shown to be just the average
 |
(29) |
as we should have expected.
The Bayesian approach insists that we fold together all of our
knowledge about a parameter in constructing its likelihood function.
Thus a generalization of these results would state that the likelihood
function for the parameter set
, given the independently measured
results
,
,
, etc. is just
 |
(30) |
where
is a normalization factor. Again, this is just the product
of the separate likelihood functions. The result is completely
general and applies to any probability distribution, not just a
Poisson distribution. We will use this result in discussing
fits to data as a maximum likelihood search.
Next: About this document ...
Up: Properties of the Poisson
Previous: Mean and Variance
Carleton DeTar
2009-11-18