SI10 Implementation of a Metadata-based Approach to Statistical - - PowerPoint PPT Presentation

si10
SMART_READER_LITE
LIVE PREVIEW

SI10 Implementation of a Metadata-based Approach to Statistical - - PowerPoint PPT Presentation

PhUSE EU CONNECT 2018 SI10 Implementation of a Metadata-based Approach to Statistical Planning, Analysis & Reporting By Frank Freischlger & Hanspeter Schnitzer INDEX Situation & Purpose Clinical Study Metadata Statistical


slide-1
SLIDE 1

PhUSE EU CONNECT 2018

Implementation of a Metadata-based Approach to Statistical Planning, Analysis & Reporting

By Frank Freischläger & Hanspeter Schnitzer

SI10

slide-2
SLIDE 2

Page2 // 7 Nov 2018

INDEX

Situation & Purpose Clinical Study Metadata Statistical Programming for Analysis Statistical Reporting Statistical Planning Conclusion

slide-3
SLIDE 3

Page3 // 7 Nov 2018

SITUATION without METADATA

Traditional approach of statistical planning, analysis & reporting of clinical studies

Protocol eCRF Raw Data Derived Data Programming TFLs SAP SP Specs

TOC & Mock TFLs

DD Specs

Templates & Standards

Clinical Study Report

slide-4
SLIDE 4

Page4 // 7 Nov 2018

PURPOSE

Metadata approach to biostatistics Consider all aspects of processing Set up a system built on and compliant with standards Assure flexibility in support of clients and sponsors Let the system do the routine work Limit needs for programming to a minimum Allow statisticians and programmers to concentrate on advanced methods Enable respective shift in manpower

slide-5
SLIDE 5

Page5 // 7 Nov 2018

CLINICAL STUDY METADATA

slide-6
SLIDE 6

Page6 // 7 Nov 2018

METADATA REPOSITORY

Overview of data panels with content explicitly required per project

General

  • Clients
  • Stakeholders
  • Projects
  • Studies

Protocol

  • Documents
  • IE Criteria
  • Objectives
  • Analyses

Deliverables

  • Plans
  • Derived

datasets

  • Outputs
  • Reports

Statistics

  • Populations
  • Endpoints
  • Intercurrent

events

  • Estimands
  • Evaluations
  • Methods
  • References

Programming

  • LibDirSpecs
  • Output

structures

  • Report

structures

  • Plan structures

Design

  • Epochs
  • Visits
  • Disposition
  • Exposure

Data

  • Sources
  • Groups
  • Variables
slide-7
SLIDE 7

Page7 // 7 Nov 2018

FLEXIBILITY in APPLYING STANDARDS

Approach with different layers

Default Individual Study Project Sponsor

Project-specific content is mandatory Other content may be defined on higher levels Levels are dynamic Content is guaranteed Metadata entry or import Metadata edit checks are in place Independent double entry as an option

slide-8
SLIDE 8

Page8 // 7 Nov 2018

METADATA REPOSITORY

Overview of data panels with content that may be given on any level Statistics

  • Semantics
  • Tools
  • Templates
  • Method options
  • Derivations
  • Descriptors
  • Time definitions
  • Date imputations

Programming

  • InDataset Specs
  • Settings
  • Styles
  • Output options
  • Report options
  • Plan options

Resulting in project relevant metadata Processed and archived per production and delivery

slide-9
SLIDE 9

Page9 // 7 Nov 2018

STATISTICAL PROGRAMMING for ANALYSIS

slide-10
SLIDE 10

Page10 // 7 Nov 2018

WORKFLOW

From derived datasets to booklets of outputs

Derived Data Settings OutputOptions Outputs OutputControls OutputStructures Control Center Macro Library Style Results Datasets Output Datasets TFLs in Booklets Macro-free source code

For development programming With original SAS

  • utputs

With log checking summaries

slide-11
SLIDE 11

Page11 // 7 Nov 2018

LAYOUTS and FORMATTING

Example of PhUSE CSS Analyses and Code Sharing WG on demographics

Label, n (%) Label - n (%) Label n (%) Label, Unit Label (Unit) Label [Unit] xx (xx.x) xx ( xx.x) xx (xxx.x)

If 100%?

N (100.0) N (100)

If 0?

0 ( 0.0)

  • .

Missing category required?

slide-12
SLIDE 12

Page12 // 7 Nov 2018

STATISTICAL REPORTING

slide-13
SLIDE 13

Page13 // 7 Nov 2018

DATA-TO-TEXT TECHNOLOGY

Utilizing methods of Natural Language Generation

Developed by Ehud Reiter and others From structured data to text messages and documents Automatic creation of sentences and documents Content selection and surface realization in computational linguistics Examples include weather forecasts, automated journalism Published successful applications to healthcare data

slide-14
SLIDE 14

Page14 // 7 Nov 2018

WORKFLOW

Auto-generated report elements and post-processing

Reports ReportOptions ReportStructures Sentences

In-text Outputs

Template Report Other Metadata & Results Data

Filled Sentences

Micro-writing and macro-writing Flexible post-processing Updateable with new clinical data

Further statistical writing QC & peer review Client review Finalization

Revisions

slide-15
SLIDE 15

Page15 // 7 Nov 2018

SENTENCES for a REPORT

Some examples: opening line & short description of demographics A total of 500 patients were screened, of whom 50 (10.0%) failed screening prior to randomization (see Table 14.1.1.1). A total of [DISP_SCR_TOT_N] [SUBJ_TXT] were screened,

  • f whom [DISP_FAIL_TOT_N] ([DISP_FAIL_TOT_PCT] %)

failed screening prior to randomization ([REF_TXT1] Table [REF_PT_DISP]). Most patients were female (62.2%) and white (54.4%). Most [SUBJ_TXT] were [DEMO_GENDER_MOSTFRQ_LBL] ([DEMO_GENDER_MOSTFRQ_PCT] %) and [DEMO_RACE_MOSTFRQ_LBL] ([DEMO_RACE_MOSTFRQ_PCT] %).

slide-16
SLIDE 16

Page16 // 7 Nov 2018

STATISTICAL PLANNING

slide-17
SLIDE 17

Page17 // 7 Nov 2018

WORKFLOW

Auto-generated elements of a planning document, and post-processing

Plans PlanOptions PlanStructures Sentences Template Plan Other Metadata

Filled Sentences

Similar to report generation Increased complexity Sentences reusable for reporting

Further statistical writing QC & peer review Client review Finalization

Revisions

slide-18
SLIDE 18

Page18 // 7 Nov 2018

SENTENCES for a PLAN

Some examples: definition of TEAE & description of an estimand An adverse event is treatment-emergent, if onset

  • r deterioration of the event appear after the first

study treatment intake and no later than 45 days after the last study treatment intake.

An adverse event is treatment-emergent, if onset

  • r deterioration of the event appear after the first

[TRT_LBL] [TRTAPPL_LBL] and no later than [TE_LAGDEF] [TE_LAGUNIT] after the last [TRT_LBL] [TRTAPPL_LBL] .

Difference in means between treatment conditions in the change from baseline to Week 24 in sBP in the targeted population regardless of intercurrent events.

Population-level summary Patient population Specification of consideration of intercurrent events Parameter Measure

slide-19
SLIDE 19

Page19 // 7 Nov 2018

CONCLUSION

slide-20
SLIDE 20

Page20 // 7 Nov 2018

METADATA-BASED APPROACH

New approach of statistical planning, analysis & reporting of clinical studies Metadata Repository

Protocol Control Center eCRF Raw Data Derived Data Auto-generated Programming TFLs DD Specs

Templates & Standards

Clinical Study Report

SAP, Table and Mock Versions of TFLs, Statistical Programming Specifications

slide-21
SLIDE 21

Page21 // 7 Nov 2018

SUMMARY OUTLOOK

Extensive MDR Flexible & neural Productive & reliable Requires careful metadata management Limitations Challenging in preparing coded links for sentences Applicable to quality documents like SOPs etc. Useful for service proposals Improve MDR frontend, backend and control center Consider OpenXML or Linked Data as alternatives

slide-22
SLIDE 22

THANK YOU

Frank Freischläger

Estimondo GmbH

frank.freischlaeger@estimondo.com www.estimondo.com

Hanspeter Schnitzer

Estimondo GmbH

hanspeter.schnitzer@estimondo.com www.estimondo.com

Questions Comments