Random bits
29 July 2009Showcasing Fear, Uncertainty and Doubt from the Information Security Industry (It’s about time somebody did this.)
Why Poissonian traffic models matter more now than ever, part 4
28 July 2009In the course of describing one reason why Poisson traffic is so convenient, I’ll briefly discuss how to make things look Poissonian. It turns out that an arrival or point process with a well-defined rate
can be rescaled to a unit-rate Poisson process. Papangelou, who along with Meyer obtained this result in the early seventies, put it in plain language:
Roughly, moving in
so as to meet expected future points at a rate of one per time unit (given at each instant complete knowledge of the past), we meet them at the times of a Poisson process.
A proof using “elementary” methods is in “The Time-Rescaling Theorem and Its Application to Neural Spike Train Data Analysis” by Brown et al., and their statement of the time rescaling theorem is adapted below:
Let
be a finite realization of a point process with a conditional rate function
that is nonnegative on
. Define
, and assume that
Then the family
is a finite realization of a unit-rate Poisson process.
(Here the notation just indicates the history of the process up to and including time
Technically it refers to the natural filtration of the process. More sophisticated treatments of the time rescaling theorem are given in Theorem 7.4.I and Proposition 14.6.III of Daley and Vere-Jones, 2nd. ed.)
Using the time rescaling theorem, goodness-of-fit tests like the Kolmogorov-Smirnov test can then be applied to evaluate a model characterization of a process. (It is left as an exercise for the reader to evaluate network traffic models in this way, since as far as I can tell nobody has done this yet.) A simple large-deviation result I’ll discuss below is in the same spirit, but is more applicable as the basis of an anomaly detection technique for computer network traffic.
The problem in practice with using the time rescaling theorem is that without a model, getting a reasonable estimate of an arrival rate is hard for anything but an inhomogeneous Poisson process with sufficiently slow rate variation. For slowly varying Poisson processes, all you have to do is divide the number of recent arrivals by a corresponding time window. The only subtlety here is the time windowing, and it’s a minor one.
(However, even in the case of a homogeneous Poisson process with a maximum-likelihood rate estimate, the rescaled process can be far from Poissonian when the estimation timescales are short enough or the number of observed arrivals is small enough. Even though the large number of packet or even flow arrivals should mean that this typically wouldn’t be a concern for most network traffic applications, it should still serve to highlight the essential practical difficulty present even in the simplest possible application of the time rescaling theorem. For more details, see Schoenberg’s paper “On Rescaled Poisson Processes and the Brownian Bridge”.)
Another nice thing about Poisson processes is that forming martingales from them is easy, for essentially the same reason: because the rate is well-characterized, the so-called compensator (essentially, the expected value of a random process as a function of time) is simple. Without getting into too much detail, a martingale is a bounded random process whose expected future value is always the same as its current value. Archetypal examples in discrete and continuous time are, respectively, the number of heads minus the number of tails in a series of fair coin flips and Brownian motion. Martingales are very powerful tools in probability theory, and the following result is just one illustration of this fact.
Let be a homogeneous Poisson process with rate
and let
be the corresponding martingale. By a result called the optional sampling theorem, you can get independent identically distributed (IID) random variables from
by considering the martingale differences at regular intervals: i.e., the differences
are IID, with expected value zero. And from this point you can apply large-deviation theory, which details the asymptotic behavior of large sums of IID random variables.
A few standard calculations yield first that
and subsequently that
so that after a bit of algebra, a routine application of the Cramér large-deviation theorem results in the asymptotic estimate
for , where
and
. This gets really small really fast.

Behavior of large deviation probabilities. is pretty small last time I checked.
Here’s the punchline: suppose you had the wrong idea about the nature of a putative homogeneous Poisson process (e.g., your estimate of the rate was poor or
actually failed to be a homogeneous Poisson process in the first place). Then the resulting putative martingale would not actually be a bona fide martingale, and an otherwise improbable large deviation would detect this fact with high probability and with few false positives. Notice that while this sort of approach uses quite a bit of probability theory, it doesn’t require any actual statistics other than a trivial rate estimate and a threshold to determine what constitutes a large enough deviation. And the fact that network traffic is increasingly Poissonian means that such approaches will be increasingly robust.
In the next post in this series I’ll cover the practical details of this technique.
Appendix
In this appendix I’ll go through the calculations mentioned above. First,
Next, , and the RHS is zero iff
, or equivalently
. Now
identically by Cauchy-Schwarz, so
Finally, for
where and
Note that for
, it happens that
and
, with
and
. Moreover,
for
so
, providing a helpful sanity check on the large-deviation asymptotics above.
As an aside, an essentially identical result can be obtained in general via a Chernoff bound. Briefly,
where the last equality follows by convexity and the assumption that
Posted by eqnets
Hacking nuclear command and control
24 July 2009A recent paper by Jason Fritz available at the International Commission on Nuclear Nonproliferation and Disarmament notionally discusses hacking nuclear command and control. The paper is mostly a cursory overview of nuclear C2 with speculation added, e.g. “such systems might depend on X which hackers might be able to exploit [in some unspecified way Y]“. Some choice quotes:
The juicy-looking bit about hacking SSBNs is from this reference, and the relevant quote is below:
Notice the bit about 1992.
After reading this paper and having studied nuclear weapons policy both in coursework and informally over the years (to illustrate, a review I wrote in 2007 for a book on Chinese nuclear policy is here), the paper struck me as highly speculative. Despite this, it may get a lot of sensational attention, which would be bad.
The folks at STRATCOM have all seen WarGames (just a few years ago, I shared an office with a STRATCOM O-5 who loved to talk about things nuclear), and despite the USAF’s well-publicized rccent gaffes, the US military does not take nuclear C2 lightly. Having met quite a few workers in the Russian nuclear establishment visiting a US nonproliferation institute, I feel justified in saying that Russia does not take nuclear C2 lightly either. More generally, anyone that is willing to “eat grass” to develop nuclear weapons is going to safeguard those weapons as carefully as they can, and initiatives like Nunn-Lugar go further by providing US help in securing nuclear materials.
What’s more, the dependence of most states’ nuclear C2 systems on networks is far from clear, and the corresponding vulnerability to information warfare even less so. Mr Fritz can’t be blamed for relying on OSINT, but the result is a work that does not begin to answer any of the by-now familiar questions it raises.