Inform Confuse www.glhickey.com @graemeleehickey Co Confl - - PowerPoint PPT Presentation

inform confuse
SMART_READER_LITE
LIVE PREVIEW

Inform Confuse www.glhickey.com @graemeleehickey Co Confl - - PowerPoint PPT Presentation

To To inform or confuse with tables and figures: the EJCTS experience ce Graeme L. Hickey University of Liverpool Inform Confuse www.glhickey.com @graemeleehickey Co Confl flicts s of f interest None Assistant Editor


slide-1
SLIDE 1

To To inform or confuse with tables and figures: the EJCTS experience ce

Graeme L. Hickey University of Liverpool

Inform Confuse

@graemeleehickey www.glhickey.com

slide-2
SLIDE 2

Co Confl flicts s of f interest

  • None
  • Assistant Editor (Statistical Consultant) for EJCTS and ICVTS
slide-3
SLIDE 3

Su Summa mmari rizing data

  • Very small number of statistics – report in-line
  • E.g. “The in-hospital mortality was 10% (n = 20)”
  • Many unrelated statistics (e.g. different patient characteristics) or

displaying fine-level detail – report in tabular format

  • Many related statistics (e.g. biomarker values over time) or data to

complex for modelling – report in graphical format

slide-4
SLIDE 4

Fig Figur ures es as as the the na natur tural al pr pres esen entatio tion n tool

Flowcharts Forest plots

Source: Benchimol et al. PLoS Med 2015; 12(10): e1001885. Source: http://uk.cochrane.org/news/how-read-forest-plot

slide-5
SLIDE 5

Ta Tables as th the e natu tural al pres esen entatio tion tool

Source: Hickey GL et al. EJCTS. 2015; 49: 1441–1449. Source: Nashef SAM et al. EJCTS. 2012; 41: 1-12.

Summarizing + comparing data of different types Summarizing the results of a regression model when the exact coefficients are required

slide-6
SLIDE 6

Fig Figur ures es or

  • r ta

tables

Δ (%): before PS matching Δ (%): after PS matching Age (years) 42.1

  • 11.0

Men

  • 4.3
  • 3.2

White 30.0

  • 0.2

Hypertension 0.0 2.3 Diabetes mellitus

  • 10.0

5.7 Dyslipidemia 1.7 0.0

+ extra columns + figure

Source: Bangalore et al. Circulation. 2010; 122: 1091-1100

?

But avoid repetition/duplication

slide-7
SLIDE 7

Do Don’t t trust s summar ary s statis istics ics alo alone

Source: Matejka & Fitzmaurice (2017) https://www.autodeskresearch.com/publications/samestats http://dx.doi.org/10.1145/3025453.3025912

slide-8
SLIDE 8

Sh Show all the data

We will ask authors, where possible, not to use bar graphs, and instead to use approaches that present full data distribution.

Source: http://www.nature.com/news/announcement-towards-greater-reproducibility-for-life-sciences-research-in-nature-1.22062 Nature 546, 8 (01 June 2017) doi:10.1038/546008a

2017

slide-9
SLIDE 9

Sh Show all the data: : dy dyna namite e pl plot

Shows:

  • mean
  • 1 standard deviation (SD)

Hides:

  • the data
  • asymmetry
  • multi-modality
  • lower error bar
slide-10
SLIDE 10

Sh Show all the data: : dy dyna namite e pl plot

Shows:

  • mean
  • 1 standard error (SEM)
slide-11
SLIDE 11

Sh Show all the data: : dy dyna namite e pl plot

Shows:

  • mean
  • 95% confidence interval (CI)
slide-12
SLIDE 12

Sh Show all the data: : er error bar bar plo plot

Shows:

  • mean
  • 95% confidence interval (CI)

A little better, but still shares a lot of limitations

slide-13
SLIDE 13

Sh Show all the data: : bo box x and nd whi whisker er pl plot

Shows:

  • median
  • lower & upper quartiles
  • utliers
  • lowest/highest values

within 1.5 IQR Up until now, my preferred choice of plot

slide-14
SLIDE 14

Sh Show all the data: : do dot pl plot

Shows:

  • raw data only

Doesn’t show:

  • summary statistics
slide-15
SLIDE 15

Sh Show all the data: : vio violin lin plo plot

Shows:

  • densities

Limitations:

  • unfamiliar
  • symmetry in densities

arbitrary

slide-16
SLIDE 16

Sh Show all the data: : vio violin lin + do dot t plo plot

Shows:

  • densities
  • raw data
slide-17
SLIDE 17

Sh Show all the data: : ri ridgeline plot

Shows:

  • densities
slide-18
SLIDE 18

Th The anatomy of a (n (non-)in informativ tive e fig igure

200 400 600 800 1000 0.0 0.2 0.4 0.6 0.8 1.0 1.2

d

P<.05

0.0 0.2 0.4 0.6 0.8 1.0 6 12 18 24 30

Time from diagnosis (months) Survival probability

Male Female 138 86 35 17 7 2 90 70 30 15 6 1

  • No. at risk

+ + + + + + + +++ + + + + + + + + + + + + + + + + + + + + + + + + + + + ++ +++ + + + + + + + + + + + + + +

Log−rank test P = 0.001

supporting data supporting data undefined statistics inappropriate axes ranges unlabeled axes font size too small unclear axes label inappropriate axes breaks easily distinguishable lines legend grid marks

slide-19
SLIDE 19

Ta Tables that confuse

A (N=56) B (N=56) Age (years) 64.5 63.2746 Female 24 (42.8%) 32 (57.14%) NYHA I 7 1 II 23 19 III 22 25 IV 3 10 Creatinine 1.2 (0.9 – 1.5) 1.6 (1.1 to 3.2) Abnormal CRP 8 (14.3%) 28 (50.0%)

Some of the things that I comment on most frequently:

  • Missing statistics (e.g. standard

deviation)

  • Inappropriate precisions
  • Inconsistent precisions
  • Percentages incorrectly

calculated

  • Data don’t add up
  • Missing measurement units

(e.g. mg/dL or μmol/L?)

  • Undefined statistics
  • Undefined variables
  • ...
slide-20
SLIDE 20

Things to (probably) avoid

Use figures to inform, not confuse

slide-21
SLIDE 21

3D charts Superfluous plots

  • 3rd dimension adds no information
  • Difficult for comparison
  • Often can’t read-off values
  • Waste of page space
  • Often repeating information in main

text

Source: Klag et al. N Engl J Med 1996; 334:13-18 20 50 30 10 20 30 40 50 60 Age category (years)

Percentage of patients

<35 35-65 >65

slide-22
SLIDE 22
  • Unusable for large amounts of data
  • Difficult for comparison
  • Can’t display trends / patterns
  • Easily misinterpreted
  • Often not consistent across multiple

plots

Source: https://en.wikipedia.org/wiki/Pie_chart Source: http://the-geophysicist.com/lying-with-statistics

Pie charts Truncated axes

slide-23
SLIDE 23
  • Confusing and distracting
  • Often poorly labelled
  • Graphs presented often provide no

extra information beyond the AUROC

Source: Keating et al. The Annals of Thoracic Surgery. 2011; 92: 1893-6 Source: Nashef SAM et al. Eur J Cardio-Thoracic Surg. 1999;16: 9–13.

Dual y-axis graphs ROC plots

slide-24
SLIDE 24

Where to get EJCTS/ICVTS specific advice

EJCTS & ICVTS Statistical and Data Reporting Guidelines EJCTS/ICVTS Instructions for Authors webpage

Source: https://academic.oup.com/ejcts/pages/Manuscript_Instructions Source: Hickey et al. Eur J Cardiothorac Surg 2015;48:180–93.

slide-25
SLIDE 25

Co Conclusi sions

  • Tables and figures should (ideally) be:
  • Used only if required
  • Self-contained (i.e. can be read standalone)
  • Easy to interpret
  • Clearly labelled (legends, column titles, etc.)
  • Neatly presented (high quality figures, legible font sizes, etc.)
  • Figure + Table legends are effective constructs for conveying extra

information that facilitates interpretation

  • I always look at the figures and tables first when reviewing a paper
slide-26
SLIDE 26

Thank you for listening… any questions?

Slides available (shortly) from: www.glhickey.com