SLIDE 68 Neural Spike Data: Multiple Testing
- If 1000 tests are carried out, each with a significance level α = 0.01 = 1%,
around 10 tests will turn out positive, signifying nothing. The positive test results can be explained as mere chance events.
- Example: 100 recorded neurons allow for
100 3
and
100 4
- = 3, 921, 225 quadruplets.
- As a consequence, even though it is very unlikely that, say,
four specific neurons fire together three times if they are independent, it is fairly likely that we observe some set of four neurons firing together three times.
- Example: 100 neurons, 20Hz firing rate, 3 seconds recording time,
binned with 3ms time bins to obtain 1000 transactions. The event of 4 neurons firing together 3 times has a p-value of ≤ 10−6 (χ2-test). The average number of such patterns in independent data is greater than 1 (data generated as independent Poisson processes).
Christian Borgelt Frequent Pattern Mining 269
Neural Spike Data: Multiple Testing
- Solution: shift statistical testing to pattern signatures z, c,
where z is the number of neurons (pattern size) and c the number of coincidences (pattern support). [Picado-Mui˜ no et al. 2013]
- Represent null hypothesis by generating sufficiently many surrogate data sets
(e.g. by spike time randomization for constant firing rate). (Surrogate data generation must take data properties into account.)
- Remove all patterns found in the original data set for which a counterpart
(same signature) was found in some surrogate data set (closed item sets). (Idea: a counterpart indicates that the pattern could be a chance event.)
p a t t e r n s i z e z coincidences c log(#patterns)
−4 −3 −2 −1 1 2 3 2 3 4 5 6 7 8 9 10 11 12 2 3 4 5 6 7 8 9 101112
frequent patterns
a s s e m b l y s i z e z coincidences c rate
0.2 0.4 0.6 0.8 1 2 3 4 5 6 7 8 9 10 11 12 2 3 4 5 6 7 8 9 101112
false neg. exact
a s s e m b l y s i z e z coincidences c rate
0.2 0.4 0.6 0.8 1 2 3 4 5 6 7 8 9 10 11 12 2 3 4 5 6 7 8 9 101112
all other patterns
p a t t e r n s i z e z coincidences c
0.2 0.4 0.6 0.8 1 2 3 4 5 6 7 8 9 10 11 12 2 3 4 5 6 7 8 9 101112
7 neurons 7 coins.
Christian Borgelt Frequent Pattern Mining 270
Neural Spike Data: Induced Patterns
- Let A and B with B ⊂ A be two sets left over after primary pattern filtering,
that is, after removing all sets I with signatures zI, cI = |I|, s(I) that occur in the surrogate data sets.
- The set A is preferred to the set B iff (zA − 1)cA ≥ (zB − 1)cB,
that is, if the pattern A covers at least as many spikes as the pattern B if one neuron is neglected. Otherwise B is preferred to A. (This method is simple and effective, but there are several alternatives.)
- Pattern set reduction keeps only sets that are preferred
to all of their subsets and to all of their supersets. [Torre et al. 2013]
p a t t e r n s i z e z coincidences c log(#patterns)
−4 −3 −2 −1 1 2 3 2 3 4 5 6 7 8 9 10 11 12 2 3 4 5 6 7 8 9 101112
frequent patterns
a s s e m b l y s i z e z coincidences c rate
0.2 0.4 0.6 0.8 1 2 3 4 5 6 7 8 9 10 11 12 2 3 4 5 6 7 8 9 101112
false neg. exact
a s s e m b l y s i z e z coincidences c rate
0.2 0.4 0.6 0.8 1 2 3 4 5 6 7 8 9 10 11 12 2 3 4 5 6 7 8 9 101112
all other patterns
p a t t e r n s i z e z coincidences c
0.2 0.4 0.6 0.8 1 2 3 4 5 6 7 8 9 10 11 12 2 3 4 5 6 7 8 9 101112
7 neurons 7 coins.
Christian Borgelt Frequent Pattern Mining 271
Neural Spike Data: Temporal Imprecision
The most common approach to cope with temporal imprecision, namely time binning, has several drawbacks:
Spikes almost as far apart as the bin width are synchronous if they fall into the same bin, but spikes close together are not seen as synchronous if a bin boundary separates them.
Spikes are either synchronous (same time bin) or not, no graded notion of synchrony (precision of coincidence). It is desirable to have continuous time approaches that allow for a graded notion of synchrony. Solution: CoCoNAD (Continuous time ClOsed Neuron Assembly Detection)
- Extends frequent item set mining to point processes.
- Based on sliding window and MIS computation.
[Borgelt and Picado-Mui˜ no 2013, Picado-Mui˜ no and Borgelt 2014]
Christian Borgelt Frequent Pattern Mining 272