Introd u ction to statistical seismolog y C ASE STU D IE S IN - - PowerPoint PPT Presentation

introd u ction to statistical seismolog y
SMART_READER_LITE
LIVE PREVIEW

Introd u ction to statistical seismolog y C ASE STU D IE S IN - - PowerPoint PPT Presentation

Introd u ction to statistical seismolog y C ASE STU D IE S IN STATISTIC AL TH IN K IN G J u stin Bois Lect u rer , Caltech California mo v es and shakes 1 Fa u lt data : USGS Q u aternar y Fa u lt and Fold Database of the United States CASE


slide-1
SLIDE 1

Introduction to statistical seismology

C ASE STU D IE S IN STATISTIC AL TH IN K IN G

Justin Bois

Lecturer, Caltech

slide-2
SLIDE 2

CASE STUDIES IN STATISTICAL THINKING

California moves and shakes

Fault data: USGS Quaternary Fault and Fold Database of the United States

1

slide-3
SLIDE 3

CASE STUDIES IN STATISTICAL THINKING

The Parkfield region

Fault data: USGS Quaternary Fault and Fold Database of the United States

1

slide-4
SLIDE 4

CASE STUDIES IN STATISTICAL THINKING

The Parkfield region

Fault data: USGS Quaternary Fault Fault and Fold Database of the United States Earthquake data: USGS ANSS Comprehensive Earthquake Catalog

1 2

slide-5
SLIDE 5

CASE STUDIES IN STATISTICAL THINKING

The Parkfield region

Image: Linda Tanner, CC-BY-2.0

1

slide-6
SLIDE 6

CASE STUDIES IN STATISTICAL THINKING

Seismic Japan

Data source: USGS ANSS Comprehensive Earthquake Catalog (ComCat)

1

slide-7
SLIDE 7

CASE STUDIES IN STATISTICAL THINKING

ECDF of magnitudes, Japan, 1990-1999

Data source: USGS ANSS Comprehensive Earthquake Catalog (ComCat)

1

slide-8
SLIDE 8

CASE STUDIES IN STATISTICAL THINKING

Location parameters

m ≡ m − 5 ∼ Exponential m ≡ m − m ∼ Exponential

′ ′ t

slide-9
SLIDE 9

CASE STUDIES IN STATISTICAL THINKING

The Gutenberg-Richter Law

The magnitudes of earthquakes in a given region over a given time period are Exponentially distributed One parameter, given by

− m , describes earthquake

magnitudes for a region

m

t

slide-10
SLIDE 10

CASE STUDIES IN STATISTICAL THINKING

The b-value

b = ( − m ) ⋅ ln 10

# Completeness threshold mt = 5 # b-value b = (np.mean(magnitudes) - mt) * np.log(10) print(b) 0.9729214742632566

m

t

slide-11
SLIDE 11

CASE STUDIES IN STATISTICAL THINKING

ECDF of all magnitudes

Data source: USGS ANSS Comprehensive Earthquake Catalog (ComCat)

1

slide-12
SLIDE 12

CASE STUDIES IN STATISTICAL THINKING

ECDF of all magnitudes

Data source: USGS ANSS Comprehensive Earthquake Catalog (ComCat)

1

slide-13
SLIDE 13

CASE STUDIES IN STATISTICAL THINKING

Completeness threshold

The magnitude, m , above which all earthquakes in a region can be detected

t

slide-14
SLIDE 14

Let's practice!

C ASE STU D IE S IN STATISTIC AL TH IN K IN G

slide-15
SLIDE 15

Timing of major earthquakes

C ASE STU D IE S IN STATISTIC AL TH IN K IN G

Justin Bois

Lecturer, Caltech

slide-16
SLIDE 16

CASE STUDIES IN STATISTICAL THINKING

Models for earthquake timing

Exponential: Earthquakes happen like a Poisson process Gaussian: Earthquakes happen with a well-dened period

slide-17
SLIDE 17

CASE STUDIES IN STATISTICAL THINKING

Stable continental region earthquakes

Data source: USGS Earthquake Catalog for Stable Continental Regions

1

slide-18
SLIDE 18

CASE STUDIES IN STATISTICAL THINKING

The Nankai Trough

slide-19
SLIDE 19

CASE STUDIES IN STATISTICAL THINKING

Earthquakes in the Nankai Trough

Date Magnitude 684-11-24 8.4 887-08-22 8.6 1099-02-16 8.0 1361-07-26 8.4 1498-09-11 8.6 1605-02-03 7.9 1707-10-18 8.6 1854-12-23 8.4

slide-20
SLIDE 20

CASE STUDIES IN STATISTICAL THINKING

ECDF of time between Nankai quakes

slide-21
SLIDE 21

CASE STUDIES IN STATISTICAL THINKING

Formal ECDFs

ECDF(x) = fraction of data points ≤ x

slide-22
SLIDE 22

CASE STUDIES IN STATISTICAL THINKING

Formal ECDFs

slide-23
SLIDE 23

CASE STUDIES IN STATISTICAL THINKING

Formal ECDFs

slide-24
SLIDE 24

CASE STUDIES IN STATISTICAL THINKING # time_gap is an array of interearthquake times _ = plt.plot(*dcst.ecdf(time_gap, formal=True)) _ = plt.xlabel('time between quakes (yr)') _ = plt.ylabel('ECDF')

slide-25
SLIDE 25

CASE STUDIES IN STATISTICAL THINKING

# Compute the mean time gap mean_time_gap = np.mean(time_gap) # Standard deviation of the time gap std_time_gap = np.std(time_gap) # Generate theoretical Exponential distribution of timings time_gap_exp = np.random.exponential(mean_time_gap, size=100000) # Generate theoretical Normal distribution of timings time_gap_norm = np.random.normal( mean_time_gap, std_time_gap, size=100000 ) # Plot theoretical CDFs _ = plt.plot(*dcst.ecdf(time_gap_exp)) _ = plt.plot(*dcst.ecdf(time_gap_norm))

slide-26
SLIDE 26

CASE STUDIES IN STATISTICAL THINKING

Model for Nankai Trough

slide-27
SLIDE 27

Let's practice!

C ASE STU D IE S IN STATISTIC AL TH IN K IN G

slide-28
SLIDE 28

How are the Parkfield interearthquake times distributed?

C ASE STU D IE S IN STATISTIC AL TH IN K IN G

Justin Bois

Lecturer, Caltech

slide-29
SLIDE 29

CASE STUDIES IN STATISTICAL THINKING

The Parkfield Prediction

Adapted from Barkun and Lindh, Science, 229, 619-624, 1985

1

slide-30
SLIDE 30

CASE STUDIES IN STATISTICAL THINKING

Hypothesis test on the Nankai megathrust earthquakes

Hypothesis: The time between Nankai Trough earthquakes is Normally distributed with a mean and standard deviation as calculated from the data Test statistic: ?? At least as extreme as: ??

slide-31
SLIDE 31

CASE STUDIES IN STATISTICAL THINKING

The Kolmogorov-Smirnov statistic

slide-32
SLIDE 32

CASE STUDIES IN STATISTICAL THINKING

The Kolmogorov-Smirnov statistic

slide-33
SLIDE 33

CASE STUDIES IN STATISTICAL THINKING

The Kolmogorov-Smirnov statistic

slide-34
SLIDE 34

CASE STUDIES IN STATISTICAL THINKING

The Kolmogorov-Smirnov statistic

slide-35
SLIDE 35

CASE STUDIES IN STATISTICAL THINKING

The Kolmogorov-Smirnov statistic

slide-36
SLIDE 36

CASE STUDIES IN STATISTICAL THINKING

Kolmogorov-Smirnov test

Hypothesis: The time between Nankai Trough earthquakes is Normally distributed with a mean and standard deviation as calculated from the data Test statistic: Kolmogorov-Smirnov statistic At least as extreme as: ≥ observed K-S statistic

slide-37
SLIDE 37

CASE STUDIES IN STATISTICAL THINKING

Simulating the null hypothesis

Draw and store lots of (say, 10,000) samples out of the theoretical distribution Draw n samples out of the theoretical distribution Compute the K-S statistic from the samples

slide-38
SLIDE 38

CASE STUDIES IN STATISTICAL THINKING

# Generate samples from theoretical distribution x_f = np.random.normal(mean_time_gap, std_time_gap, size=10000) # Initialize K-S replicates reps = np.empty(1000) # Draw replicates for i in range(1000): # Draw samples for comparison x_samp = np.random.normal( mean_time_gap, std_time_gap, size=len(time_gap) ) # Compute K-S statistic reps[i] = ks_stat(x_samp, x_f) # Compute p-value p_val = np.sum(reps >= ks_stat(time_gap, x_f)) / 1000

slide-39
SLIDE 39

Let's practice!

C ASE STU D IE S IN STATISTIC AL TH IN K IN G