In the course of describing one reason why Poisson traffic is so convenient, I’ll briefly discuss how to make things look Poissonian. It turns out that an arrival or point process with a well-defined rate
can be rescaled to a unit-rate Poisson process. Papangelou, who along with Meyer obtained this result in the early seventies, put it in plain language:
Roughly, moving in
so as to meet expected future points at a rate of one per time unit (given at each instant complete knowledge of the past), we meet them at the times of a Poisson process.
A proof using “elementary” methods is in “The Time-Rescaling Theorem and Its Application to Neural Spike Train Data Analysis” by Brown et al., and their statement of the time rescaling theorem is adapted below:
Let
be a finite realization of a point process with a conditional rate function
that is nonnegative on
. Define
, and assume that
Then the family
is a finite realization of a unit-rate Poisson process.
(Here the notation just indicates the history of the process up to and including time
Technically it refers to the natural filtration of the process. More sophisticated treatments of the time rescaling theorem are given in Theorem 7.4.I and Proposition 14.6.III of Daley and Vere-Jones, 2nd. ed.)
Using the time rescaling theorem, goodness-of-fit tests like the Kolmogorov-Smirnov test can then be applied to evaluate a model characterization of a process. (It is left as an exercise for the reader to evaluate network traffic models in this way, since as far as I can tell nobody has done this yet.) A simple large-deviation result I’ll discuss below is in the same spirit, but is more applicable as the basis of an anomaly detection technique for computer network traffic.
The problem in practice with using the time rescaling theorem is that without a model, getting a reasonable estimate of an arrival rate is hard for anything but an inhomogeneous Poisson process with sufficiently slow rate variation. For slowly varying Poisson processes, all you have to do is divide the number of recent arrivals by a corresponding time window. The only subtlety here is the time windowing, and it’s a minor one.
(However, even in the case of a homogeneous Poisson process with a maximum-likelihood rate estimate, the rescaled process can be far from Poissonian when the estimation timescales are short enough or the number of observed arrivals is small enough. Even though the large number of packet or even flow arrivals should mean that this typically wouldn’t be a concern for most network traffic applications, it should still serve to highlight the essential practical difficulty present even in the simplest possible application of the time rescaling theorem. For more details, see Schoenberg’s paper “On Rescaled Poisson Processes and the Brownian Bridge”.)
Another nice thing about Poisson processes is that forming martingales from them is easy, for essentially the same reason: because the rate is well-characterized, the so-called compensator (essentially, the expected value of a random process as a function of time) is simple. Without getting into too much detail, a martingale is a bounded random process whose expected future value is always the same as its current value. Archetypal examples in discrete and continuous time are, respectively, the number of heads minus the number of tails in a series of fair coin flips and Brownian motion. Martingales are very powerful tools in probability theory, and the following result is just one illustration of this fact.
Let be a homogeneous Poisson process with rate
and let
be the corresponding martingale. By a result called the optional sampling theorem, you can get independent identically distributed (IID) random variables from
by considering the martingale differences at regular intervals: i.e., the differences
are IID, with expected value zero. And from this point you can apply large-deviation theory, which details the asymptotic behavior of large sums of IID random variables.
A few standard calculations yield first that
and subsequently that
so that after a bit of algebra, a routine application of the Cramér large-deviation theorem results in the asymptotic estimate
for , where
and
. This gets really small really fast.

Behavior of large deviation probabilities. is pretty small last time I checked.
Here’s the punchline: suppose you had the wrong idea about the nature of a putative homogeneous Poisson process (e.g., your estimate of the rate was poor or
actually failed to be a homogeneous Poisson process in the first place). Then the resulting putative martingale would not actually be a bona fide martingale, and an otherwise improbable large deviation would detect this fact with high probability and with few false positives. Notice that while this sort of approach uses quite a bit of probability theory, it doesn’t require any actual statistics other than a trivial rate estimate and a threshold to determine what constitutes a large enough deviation. And the fact that network traffic is increasingly Poissonian means that such approaches will be increasingly robust.
In the next post in this series I’ll cover the practical details of this technique.
Appendix
In this appendix I’ll go through the calculations mentioned above. First,
Next, , and the RHS is zero iff
, or equivalently
. Now
identically by Cauchy-Schwarz, so
Finally, for
where and
Note that for
, it happens that
and
, with
and
. Moreover,
for
so
, providing a helpful sanity check on the large-deviation asymptotics above.
As an aside, an essentially identical result can be obtained in general via a Chernoff bound. Briefly,
where the last equality follows by convexity and the assumption that
[...] matter more now than ever, part 5 To sketch how the large-deviation approach discussed in the last post can actually work, consider a process with conditional rate function 1 for the first arrivals and [...]
[...] process (or in principle, at least, more general one-dimensional point processes) was outlined here and [...]