Wearable devices - statistical learning to the rescue Jaroslaw - - PowerPoint PPT Presentation

wearable devices statistical learning to the rescue
SMART_READER_LITE
LIVE PREVIEW

Wearable devices - statistical learning to the rescue Jaroslaw - - PowerPoint PPT Presentation

Wearable devices - statistical learning to the rescue Jaroslaw Harezlak, Ph.D. Professor and Interim Co-Chair Department of Epidemiology and Biostatistics School of Public Health Indiana University, Bloomington, IN, USA May 22, 2020 Zoom,


slide-1
SLIDE 1

Wearable devices - statistical learning to the rescue

Jaroslaw Harezlak, Ph.D.

Professor and Interim Co-Chair Department of Epidemiology and Biostatistics School of Public Health Indiana University, Bloomington, IN, USA

May 22, 2020 Zoom, Universe

slide-2
SLIDE 2

Outline

Wearable and implantable devices

  • Overview of the accelerometry data
  • Micro-scale analysis
  • Detection and analysis of walking
  • Gait features
  • Macro-scale analysis
slide-3
SLIDE 3

Authors: Marta Karas Jiawei Bai Marcin Strączkiewicz Jaroslaw Harezlak Nancy Glynn Tamara Harris Vadim Zipunnikov Ciprian Crainiceanu Jacek Urbanek

Accelerometry data in health research: challenges and opportunities Review and examples

Statistics in Biosciences, 1-28, 2019

slide-4
SLIDE 4

Wearable and Implantable Technology

  • Wearable and implantable devices are smart

electronic devices (electronic device with micro- controllers) that can be worn on the body as implants

  • r accessories
  • Emerging technology
  • Growing popularity in health research
slide-5
SLIDE 5

Actigraphy

  • Actigraphy: non-invasive monitoring of activity
  • Accelerometer records human movement
slide-6
SLIDE 6

Other wearable/implantable devices

slide-7
SLIDE 7

Major data challenges

  • Data size
  • Complexity and heterogeneity
  • Lack of standardized data collection protocols

– Device type – Device location – Wear/non-wear – No. of days of observations

slide-8
SLIDE 8

Collaboration

Biostatisticians and data scientists Epidemiologists Clinicians Software engineers Health Research

slide-9
SLIDE 9

1 1 count

Raw accelerometry data

50H 50Hz* z*60s 60s*3 3 chan annel els = 9000 9000 number ers

2020

slide-10
SLIDE 10

Macro- and Micro-scale

  • Macro-scale – summarized data (1-minute intervals)
  • Micro-scale – raw accelerometry data (10Hz+)

Hours

5 10 15 20

VMC [g]

0.05 0.1 0.15 0.2 0.25 0.3 0.35 0.4

Macro-scale Features

CPM

slide-11
SLIDE 11

Micro-scale analysis

  • Activity type recognition

– Detection of walking, driving – Climbing stairs, resting, sedentary vs. upright

slide-12
SLIDE 12

Why walking?

  • Common form of physical activity
  • Features of walking related to:

– Survival – Mild Cognitive Impairment – Dementia – Stroke

Urbanek, J, Zipunnikov, V, Harris, T, Crainiceanu, C, Harezlak, J, Glynn, NW, Validation of gait characteristics extracted from raw accelerometry during walking against measures of physical function, mobility, fatigability, and fitness, The Journals of Gerontology: Medical Sciences, 2018 Apr 17;73(5):676- 681

slide-13
SLIDE 13

Data heterogeneity

slide-14
SLIDE 14

Walking on level ground

𝑤𝑛 𝑢 = 𝑦 𝑢 ! + 𝑧 𝑢 ! + 𝑨 𝑢 !

slide-15
SLIDE 15

Down/up/down stairs

slide-16
SLIDE 16

Raw subsecond-level accelerometry walking data Segmentation of Strides (2 subsequent steps) Feature extraction of human gait and gait pattern

Goal: segment strides from accelerometry walking data

Extract:

  • gait speed
  • stride-to-stride variability statistics
  • subject-specific gait pattern

Karas, M., Straczkiewicz, M., Fadel, W ., Harezlak, J., Crainiceanu, M., Urbanek, J., Adaptive empirical pattern transformation (ADEPT) with application to walking stride segmentation, Biostatistics (accepted)

Courtesy of Marta Karas

slide-17
SLIDE 17

Urbanek, J, Zipunnikov, V, Harris, T, Fadel, W , Glynn, N, Crainiceanu, C, Harezlak J, Prediction of sustained harmonic walking in the free-living environment using raw accelerometry data. Physiological measurement 39 (2), 02NT02 Urbanek, J, Harezlak, J, Glynn, NW , Harris, T, Crainiceanu, C, Zipunnikov, V, Stride variability measures derived from wrist-and hip-worn accelerometers, Gait & Posture 52 (2017) 217–223

Amplitude [g] Amplitude [g] Magnitude [g] Time [s] Stride phase Cadence [steps/s]

Characteristics of walking

slide-18
SLIDE 18

DECOS Study @ University of Pittsburgh

POPULATION:

N = 51 (26 WOMEN) ENROLLED IN THE DEVELOPMENT EPIDEMIOLOGIC COHORT STUDY (DECOS) AGE: BETWEEN 70 AND 90 (MEDIAN = 78, SD = 5.68), BMI: BETWEEN 20.5 AND 37.9 (MEDIAN 25.9, SD = 3.91)

DATA:

FREE-LIVING DATA COLLECTED FOR 7 DAYS

LEFT AND RIGHT WRISTS: ACTIGRAPH GT3X+ (80HZ) HIP: ACTIGRAPH GT3X+ (80HZ) THIGH: ACTIVPAL 3 (20HZ) TREATED AS SILVER STANDARD

slide-19
SLIDE 19

Sedentary vs. upright

slide-20
SLIDE 20

Sitting/Standing Results

Left wrist Right wrist

Method

SedUp SS SedUp SS

Window [s] 15 30 45 60 75 90

  • 15

30 45 60 75 90

  • TPR

Median 0.79 0.81 0.83 0.83 0.84 0.83 0.66 0.82 0.83 0.84 0.85 0.86 0.86 0.65

TNR

Median 0.90 0.90 0.91 0.91 0.91 0.91 0.85 0.91 0.92 0.92 0.93 0.93 0.93 0.88 MAPE [%] 13.3 13.0 12.7 12.6 12.6 12.5 18.2 15.7 15.3 15.2 15.0 15.1 15.1 19.5 MPE [%] 4.1 4.5 3.7 3.4 3.5 2.9 4.1 5.3 4.6 5.6 4.3 4.5 4.5 6.7

slide-21
SLIDE 21

Results (left wrist)

slide-22
SLIDE 22

Micro-scale analysis Summary

  • Objective quantification of physical activity
  • Objective measures of physical performance,

fatigability, mobility and fitness (free-living)

  • Algorithm development
  • Statistical learning methods
slide-23
SLIDE 23

Macro-scale analysis

  • Aggregated 1-minute intervals
  • General questions
slide-24
SLIDE 24
  • US nationally representative sample
  • Physical activity data for ~12 000 participants
  • 7 consecutive days of observation
  • ~10,000 observations per subject
  • Variety of clinical and demographic outcomes
  • Linked mortality data
  • Publicly available dataset
  • Big data challenges
slide-25
SLIDE 25

PA data

Counts-per-minute

slide-26
SLIDE 26

Macro-scale challenges

  • Data format, storage and structure
  • Start times of monitoring periods
  • Different number of days across subjects
  • Missing data due to non-wear
  • Quality control
slide-27
SLIDE 27

Data processing pipeline

Get Total No.

  • f Days of

Observation Quality Flags Create N x D x 1440+ Data Structure Aggregated Activity Data Get Start Dates and Times N x D x 1440+ Activity Data Create Non-wear flags N x D x 1440+ Non-wear flags Subset the data Clean activity data Exclusion criteria + Demographics N x D x 1440+ Activity Data

Statistical analysis

slide-28
SLIDE 28

Macro-scale: summary

  • Curating data is extremely time intensive
  • Proposed pipeline:

– efficient analysis of PA data – fast complex analysis – applied to data from NHANES and BLSA

slide-29
SLIDE 29

Acknowledgements

University of Wroclaw

  • Michal Kos (Statistics)

Johns Hopkins School of Medicine

  • Jacek Urbanek (Biostatistics)
  • Marta Karas (PhD student)
  • Ciprian Crainiceanu (Biostatistics)
  • Vadim Zipunnikov (Biostatistics)

Indiana University

  • William Fadel (Biostatistics)

Harvard University

  • Marcin Straczkiewicz (Biostatistics)

University of Pittsburgh

  • Nancy Glynn (Epidemiology)
slide-30
SLIDE 30

Thank you