Author/Authors :
Kabluchko، نويسنده , , Zakhar and Wang، نويسنده , , Yizao، نويسنده ,
Abstract :
Let X 1 , X 2 , … be independent identically distributed (i.i.d.) random variables with E X k = 0 , V ar X k = 1 . Suppose that φ ( t ) ≔ log E e t X k < ∞ for all t > − σ 0 and some σ 0 > 0 . Let S k = X 1 + ⋯ + X k and S 0 = 0 . We are interested in the limiting distribution of the multiscale scan statistic M n = max 0 ≤ i < j ≤ n S j − S i j − i . We prove that for an appropriate normalizing sequence a n , the random variable M n 2 − a n converges to the Gumbel extreme-value law exp { − e − c x } . The behavior of M n depends strongly on the distribution of the X k ’s. We distinguish between four cases. In the superlogarithmic case we assume that φ ( t ) < t 2 / 2 for every t > 0 . In this case, we show that the main contribution to M n comes from the intervals ( i , j ) having length l ≔ j − i of order a ( log n ) p , a > 0 , where p = q / ( q − 2 ) and q ∈ { 3 , 4 , … } is the order of the first non-vanishing cumulant of X 1 (not counting the variance). In the logarithmic case we assume that the function ψ ( t ) ≔ 2 φ ( t ) / t 2 attains its maximum m ∗ > 1 at some unique point t = t ∗ ∈ ( 0 , ∞ ) . In this case, we show that the main contribution to M n comes from the intervals ( i , j ) of length d ∗ log n + a log n , a ∈ R , where d ∗ = 1 / φ ( t ∗ ) > 0 . In the sublogarithmic case we assume that the tail of X k is heavier than exp { − x 2 − ε } , for some ε > 0 . In this case, the main contribution to M n comes from the intervals of length o ( log n ) and in fact, under regularity assumptions, from the intervals of length 1. In the remaining, fourth case, the X k ’s are Gaussian. This case has been studied earlier in the literature. The main contribution comes from intervals of length a log n , a > 0 . We argue that our results cover most interesting distributions with light tails. The proofs are based on the precise asymptotic estimates for large and moderate deviation probabilities for sums of i.i.d. random variables due to Cramér, Bahadur, Ranga Rao, Petrov and others, and a careful extreme value analysis of the random field of standardized increments by the double sum method.
Keywords :
Multiscale scan statistic , Cramér series , Gumbel distribution , Double sum method , change-point detection , Subgaussian distributions , Increments of random walks , Extreme value theory , Erd?s–Rényi law , Moderate deviations , Large deviations