[PPT] - EPP 2007 EPP 2007 Working with concentrated epidemics new features PowerPoint Presentation

SLIDE 1

EPP 2007 EPP 2007

Working with concentrated epidemics new features and approaches

UNAIDS/WHO Working Group

n Global HIV/AIDS & STI Surveillance

SLIDE 2

2007 en 2

UNAIDS Estimation & Projection UNAIDS Estimation & Projection Package 2007 (EPP 2007) Package 2007 (EPP 2007)

Objectives

– Allow national counterparts to build models of their national epidemics composed of

Separate sub-epidemics in different at-risk populations
Geographically diverse regional sub-epidemics

– Giv short-term projections of HIV prevalence (<5 yrs) – Serve as input to Spectrum for assessing incidence, impacts, ART, etc.

SLIDE 3

2007 en 3

What basically does EPP do? What basically does EPP do?

Fits plausible epidemiological model to existing data
Modified Reference Group model – 4 fitting

parameters

– r – controlling the rate of growth – f0 – the proportion of new risk pop entrants – t0 – the start year of the epidemic – φ – behavior change parameter

…for concentrated epidemics

– d – average time in group (duration)

SLIDE 4

2007 en 4

UNAIDS Reference Group model UNAIDS Reference Group model

10 20 30 40 50

% HIV+

t0 f0 φ r d

SLIDE 5

2007 en 5

EPP EPP’ ’s s job: fit the model to the data job: fit the model to the data

10 20 30 40 50 60 70 1 9 8 1 9 8 5 1 9 9 1 9 9 5 2 2 5 2 1 2 1 5 2 2

% HIV+

SLIDE 6

2007 en 6

Steps in constructing a national epidemic Steps in constructing a national epidemic

Choose your country and name this attempt at

national projections (the workset in EPP)

Decide the key groups in the epidemic and its

geographic breakdown

Define population characteristics

– Demographics

SLIDE 7

2007 en 7

Steps in constructing a national epidemic Steps in constructing a national epidemic

Enter HIV prevalence data and sample sizes for

each sub-population or regional sub-epidemic

Fit the Reference Group model to each of them
Adjust your prevalence up or down to match any

large scale survey data that may be available

Display and review the results of your work

SLIDE 8

2007 en 8

EPP interface takes you through the steps EPP interface takes you through the steps

The tabs at the top take you through these steps

SLIDE 9

A quick demo of creating A quick demo of creating a national projection a national projection

SLIDE 10

Features relevant to concentrated Features relevant to concentrated epidemics with turnover epidemics with turnover

Important in countries with long Important in countries with long-

standing

standing epidemics in at epidemics in at-

risk populations

risk populations

SLIDE 11

2007 en 11

200 in

EPP 2007 includes turnover in populations EPP 2007 includes turnover in populations

Clients of sex workers (1000 men with 5 yr duration)

200 out Death General pop males

SLIDE 12

2007 en 12

The Define Pops page The Define Pops page -

Concentrated

Concentrated

You determine if turnover is on or off & enter duration in the group

SLIDE 13

2007 en 13

The Calibrate page The Calibrate page -

Concentrated

Concentrated

You adjust curves up or down (calibration) You determine what happens to people who leave the group if there is turnover

SLIDE 14

2007 en 14

Why is assign prevalence here? Why is assign prevalence here?

The model in EPP 2007 includes population turnover

– Many HIV+ ex-members of at-risk populations, e.g., HIV+ ex- sex workers or HIV+ ex-IDUs

These HIV+s are sometimes captured in other

surveillance populations

– e.g., ex-sex workers showing up in antenatal clinic data

But other times, they’re missed

– e.g., ex-IDUs may be missed because of limited male surveillance

SLIDE 15

2007 en 15

Fits to Thai Central Region IDU Data Fits to Thai Central Region IDU Data

Changes to the fit Changes to the fit

5 10 15 20 25 30 35 40 45 1 9 8 1 9 8 3 1 9 8 6 1 9 8 9 1 9 9 2 1 9 9 5 1 9 9 8 2 1 2 4 2 7 No turnover Dur 10 yrs Data

SLIDE 16

2007 en 16

Living ex Living ex-

IDUs

IDUs with 10 year duration with 10 year duration

Thailand Thailand IDUs IDUs

5000 10000 15000 20000 25000 30000 1980 1983 1986 1989 1992 1995 1998 2001 2004 2007 10 yr duration

At peak this is 5.4% of adult male prevalence

SLIDE 17

2007 en 17

How is assignment of HIV+ ex How is assignment of HIV+ ex’ ’s done? s done?

One selects the population from which the

HIV+s are coming

– Only populations with turnover show up here

One selects where they are to go after they

leave the group

– Only populations without turnover (closed pops) here

One decides to add or replace prevalence

SLIDE 18

2007 en 18

What do What do “ “add add” ” & & “ “replace replace” ” prevalence mean? prevalence mean?

Add prevalence

– The HIV+ former at-risk group members are added to the HIV+ members of the target population – This means they have NOT been captured in surveillance there

Replace prevalence

– Some of the HIV+’s in the target population are assumed to come from the former at-risk group members – The remaining infections that occurred “within group” are calculated

SLIDE 19

2007 en 19

Where do you see the effects? Where do you see the effects?

In the graphs on the Results page
By pushing the “Reassigns” button on the

Results page

Example

– Sex workers and general population women in Mumbai

SLIDE 20

2007 en 20

The Results page The Results page – – Concentrated form Concentrated form

SLIDE 21

2007 en 21

The Reassignment table The Reassignment table

SLIDE 22

New features in EPP 2007 New features in EPP 2007

SLIDE 23

2007 en 23

New features in EPP 2007 New features in EPP 2007

Uncertainty for generalized epidemics

– Bayesian melding

Initial guesses for concentrated epidemics
Review mode
Changes to the fitting and calculations (under the hood)

– Fuller exploration of possible solutions for r, f0, phi and t0 – Speed improvements

Improved calibration for generalized epidemics

– Including adjustments for multiple national surveys

A larger interface

SLIDE 24

2007 en 24

EPP 2007 EPP 2007 – – review mode review mode

Can open a projection w/o changing it
Disables saves
Indicated two ways:

– Title bar says “Review mode” – “Save & continue” becomes “Continue”

Two ways to exit

– On Workset Page, click “Edit” mode – On any page, hit “Save a copy”

SLIDE 25

2007 en 25

Review mode Review mode – – the interface the interface

SLIDE 26

2007 en 26

Changes to Changes to “ “Save a Copy Save a Copy” ”

When you “Save a copy”

– Makes a copy of the current workset with the name you specify – Loads that copy – Restores the page you were on

SLIDE 27

2007 en 27

Slider sensitivity adjustments Slider sensitivity adjustments

New button Adjust sliders

SLIDE 28

2007 en 28

Slider sensitivity panel Slider sensitivity panel

SLIDE 29

Major changes Major changes –

–

Uncertainty in generalized epidemics Uncertainty in generalized epidemics

SLIDE 30

2007 en 30

What have we learned about the What have we learned about the Reference Group model? Reference Group model?

Sometimes the EPP fitter

selects strange curves

This is a “feature” of the

Reference Group model

Some data sets are not

constraining

SLIDE 31

2007 en 31

Many curves can fit the same data Many curves can fit the same data – – some we know are not realistic some we know are not realistic

Source: Adrian Raftery

SLIDE 32

32

So we’re going to do the process we mentioned before…. Try many different combinations of r, f0, phi and t0….

SLIDE 33

33

And even if we eliminate the unreasonable curves….

SLIDE 34

2007 en 34

Some countries with a lot of data have only Some countries with a lot of data have only a few curves that fit a few curves that fit – – data constrains it data constrains it

Botswana through 2003 – 50,000 curves tried

8 curves

SLIDE 35

2007 en 35

Other countries with more limited data have a lot Other countries with more limited data have a lot

f curves that fit
f curves that fit –

– data does not constrain data does not constrain

Senegal urban through 2003 – 50,000 curves tried

240 curves

SLIDE 36

36

In this variation, lies a way of In this variation, lies a way of assessing uncertainty assessing uncertainty

Countries where the data limits us to

nly a few curves have less

uncertainty about the epidemic

SLIDE 37

37

This has now been built into EPP This has now been built into EPP for generalized epidemics for generalized epidemics

SLIDE 38

38

However, for concentrated epidemics However, for concentrated epidemics we don we don’ ’t yet know how to estimate t yet know how to estimate uncertainty uncertainty

SLIDE 39

2007 en 39

Uncertainty in concentrated epidemics Uncertainty in concentrated epidemics

Size of populations is one of the largest unknowns

– We don’t know how to estimate uncertainty here yet

Samples are much more restricted

– Geographically – Access to populations is more limited (FSW, IDU, MSM, etc.) – Data often not representative – convenience samples

SLIDE 40

2007 en 40

But the same approach can let us But the same approach can let us

Find a set of good initial guesses for what the

epidemic looks like

Get an idea of the range of possible curves that

might fit the data we have

– This is not formal “uncertainty” but a more qualitative impression

SLIDE 41

2007 en 41

When you come to the Projection page When you come to the Projection page

New button “Make initial guesses”

SLIDE 42

42

Before you do anything else, press Before you do anything else, press the make initial guesses button the make initial guesses button

This will bring up the EPP 2007 initial guess interface

SLIDE 43

2007 en 43

The EPP 2007 initial guess interface The EPP 2007 initial guess interface

Curve generation What to do with results

Possible curves display

Display controls Advanced

ptions

Start, Stop, and Status

SLIDE 44

2007 en 44

Basic steps to generate initial guesses Basic steps to generate initial guesses

Set the number of curves and resamples

– Generally should accept the defaults – The more curves you generate, the better an initial guess you get

Click the “Find guesses” button
If you’re happy with the results, hit “Use best fit in EPP” button

– Saves the guesses in a file (*.bm2 under eppproj/resample_results) – If not, hit “Keep current EPP fit” and you’ll be prompted to save the results for future reference

Your best guess is transferred back to the Projection Page
You can now do a fit to further refine the projection, if desired

SLIDE 45

2007 en 45

10 20 30 40 50 60 70 1 9 8 1 9 8 5 1 9 9 1 9 9 5 2 2 5 2 1 2 1 5 2 2

% HIV+

This generates as many curves as you request This generates as many curves as you request and weights them by fit to the data and weights them by fit to the data

High weight – fits the data closely

SLIDE 46

2007 en 46

So when you generate guesses you So when you generate guesses you’ ’ll see ll see something like this something like this… …. .

Best fit found among curves created Other curves giving possible fits to the data

SLIDE 47

2007 en 47

The spread in these curves gives an idea of The spread in these curves gives an idea of what curves are possible with your data what curves are possible with your data

Wide spread means many curves are possible fits to data

SLIDE 48

2007 en 48

If you like the If you like the “ “best fit best fit” ” curve, curve, click click “ “Use best fit in EPP Use best fit in EPP” ”

Your result is transferred back to EPP, then click “Save and continue”

SLIDE 49

2007 en 49

And if you want more details or to select And if you want more details or to select another curve another curve – – click click “ “Show best Show best” ”

You can keep this window

pen and experiment with

typing the r, f0, t0 and phi values into the Projection Page Count give an idea of the relative closeness in fitting the data. High count means a closer fit

SLIDE 50

2007 en 50

If you click on If you click on “ “Selected parameter values Selected parameter values” ” after an uncertainty run after an uncertainty run

SLIDE 51

2007 en 51

Selected parameter values Selected parameter values

For the curves that were selected as fitting well, the

blue bars show

– Values of r, f0, phi and t0 for those curves

The red curves show the distribution of these values

when they were originally created before sampling

– For example in preceding slide, more phi’s were generated around +100

SLIDE 52

2007 en 52

So what So what’ ’s with s with “ “Advanced Options Advanced Options” ”

This controls things that may be needed in some

circumstances

– Change how the parameters for the initial curves are generated – Apply conditions to eliminate curves that are unreasonable based on experience

SLIDE 53

2007 en 53

Advanced Options Interface Advanced Options Interface

SLIDE 54

2007 en 54

Limits on curve generation (left hand side) Limits on curve generation (left hand side)

We need to generate a lot of curves

– Done by giving random values for r, f0, t0 and phi

It’s better if we generate ones that are more

likely to fit

– We throw fewer of the curves away

So we can restrict the range on r, f0, phi and t0

SLIDE 55

2007 en 55

Why is this important? Why is this important?

Want to be sure to sample the appropriate range

for a given epidemic

– If the epidemic is going down already, should choose a phi median of 0 or even -100 – Otherwise, will not get sufficient curves (since sample few which are declining)

If we find a strong imbalance, then we should

revise the original criteria for generating curves

SLIDE 56

2007 en 56

Sometimes we get a cluster of curves we know are not feasible
Condition

– Prev < 1% in 1985 will eliminate these

Apply with

caution or you can eliminate valid curves

Conditions on prevalence (right hand side) Conditions on prevalence (right hand side)

SLIDE 57

2007 en 57

So to review, to fit a concentrated epidemic So to review, to fit a concentrated epidemic

Fill in everything until you get to Projection Page as in the past

(and documented in the EPP 2007 presentation handed out)

Press “Make initial guesses” button on Projection page
Press “Find guesses” button & wait for it to finish
Either

– Press the “Use best fit in EPP” – Click on “Show best” and enter the values for r, f0, t0 and phi you like on the Projection Page

On the Projection Page, hit “Save and Continue”

– This is important – DON’T FORGET IT!!! (you’ll lose results)

Move on to fitting the next sub-population

SLIDE 58

2007 en 58

So to review, fitting an epidemic (continued) So to review, fitting an epidemic (continued)

Select the calibration method (if any)
Reassign populations if you have turnover
On the Results page, generate a Spectrum file by:

– Pushing “Output results” – On that display, pushing “Write Spectrum File” – This generates a *.spt file in the eppout directory

Go to the Audit check page and check your findings
Take a well deserved rest

SLIDE 59

59

Appendix Appendix – – estimating uncertainty estimating uncertainty in generalized epidemics in generalized epidemics

SLIDE 60

2007 en 60

Assessing uncertainty Assessing uncertainty – – Bayesian melding Bayesian melding

Developed by Adrian Developed by Adrian Raftery Raftery & Leontine Alkema for EPP & Leontine Alkema for EPP

Randomly generate lots of curves

– Select a lot of (r, f0, phi and t0) values

Compare the curves with the data

– Calculate “goodness” of fit and assign a weight – Likelihood function is used as a weight on the curve – High likelihood means a curve is a good fit and gets a high weight

Resample a smaller number of curves from the curves originally

calculated

– But, resample according to the weight assigned – The curves that fit better get picked more often

Keep the resampled curves, throw away the others
These curves provide an estimate of the uncertainty

SLIDE 61

2007 en 61

10 20 30 40 50 60 70 1 9 8 1 9 8 5 1 9 9 1 9 9 5 2 2 5 2 1 2 1 5 2 2

% HIV+

Bayesian melding first generates many curves Bayesian melding first generates many curves

High weight – fits the data closely

SLIDE 62

2007 en 62

We then resample the curves according to the We then resample the curves according to the weights weights – – selecting ones that best fit the data selecting ones that best fit the data

With 3000 resamples, only 6 high likelihood curves get selected

Zimbabwe urban data

SLIDE 63

2007 en 63

If we do enough curves, If we do enough curves, we can estimate the uncertainty we can estimate the uncertainty

Botswana with 200,000 curves, resampling gives 46 unique ones

95% of resampled curves fall between two dashed lines

SLIDE 64

2007 en 64

And countries where data trends are less clear And countries where data trends are less clear will show more variation in curves will show more variation in curves

Zimbabwe urban 200,000 curves, gives 55 unique ones

That is, they show more uncertainty

SLIDE 65

2007 en 65

And countries with very limited data will not And countries with very limited data will not constrain the possible curves much at all constrain the possible curves much at all

Senegal urban 50 curves, gives 240 unique ones

Uncertainty about the future is huge

SLIDE 66

2007 en 66

As more data becomes available projections As more data becomes available projections should improve & uncertainty fall should improve & uncertainty fall

Botswana urban surveillance data through 2002

SLIDE 67

2007 en 67

Uncertainty decreases as more data Uncertainty decreases as more data becomes available becomes available

Botswana urban using only data through 1995 – data still rising

Very uncertain

SLIDE 68

2007 en 68

Uncertainty decreases as more data Uncertainty decreases as more data becomes available becomes available

Botswana urban using only data through 2000 – points starting to level off

Uncertainty is getting smaller

SLIDE 69

2007 en 69

Uncertainty decreases as more data Uncertainty decreases as more data becomes available becomes available

Botswana urban using all data through 2002 – data has leveled off

Uncertainty is narrowing as epidemic levels off

SLIDE 70

2007 en 70

So how do I do estimate uncertainty? So how do I do estimate uncertainty?

You use EPP as you normally would

– Open a new generalized workset

Workset page

– Define the sub-populations in your epidemic

Define Epidemic page

– Enter the demographics

Define Pops page

– Enter the HIV data for each

Enter Data page

– Enter your calibrations (Calibrate page)

…And then

SLIDE 71

71

Calibration page Calibration page – – generalized epidemics generalized epidemics

SLIDE 72

2007 en 72

The EPP 2007 Calibration page The EPP 2007 Calibration page

Four calibration options

– Adjust to level measured in general pop survey – Adjust by average amount based on DHSes in generalized epidemics – Scale a sub-population’s HIV by a given factor – Don’t do calibration (the default)

SLIDE 73

2007 en 73

The EPP 2007 Calibration page The EPP 2007 Calibration page

SLIDE 74

2007 en 74

If you choose to adjust to a general population

survey, for each sub-population enter:

– Survey prevalence – Year survey was done – Standard error on survey (need to locate this) – Sample size

Up to 3 survey results in different years

The EPP 2007 Calibration page The EPP 2007 Calibration page

SLIDE 75

2007 en 75

Then you come to the Projection page Then you come to the Projection page

New button “Assess uncertainty”

SLIDE 76

76

Before you do anything else, press Before you do anything else, press the assess uncertainty button the assess uncertainty button

This will bring up the EPP 2007 uncertainty interface

SLIDE 77

2007 en 77

The EPP 2007 uncertainty interface The EPP 2007 uncertainty interface

Results display

Display controls Curve generation Advanced

ptions

Start, Stop, and Status What to do with results

SLIDE 78

2007 en 78

Basic steps to make an uncertainty run Basic steps to make an uncertainty run

Set the number of curves and resamples

– Generally should accept the defaults – For final national projections, probably want 200,000 initial curves

Click the “Analyze uncertainty” button
If you’re happy with the results, hit the “Use UA fit in EPP” button

– Saves the uncertainty in a file (*.bm2 under eppproj/resample_results) – If not, hit “Keep current EPP fit” and you’ll be prompted to save the results for future reference

Your results are transferred back to the Projection Page
You can now do a fit to further refine the projection, if desired

– This will not affect your uncertainty results

SLIDE 79

79

Demonstration of making a basic Demonstration of making a basic uncertainty run with the Botswana file uncertainty run with the Botswana file Demonstration

SLIDE 80

2007 en 80

How many curves should I use? How many curves should I use?

Recommendation: 200,000 curves generated for good uncertainty assessment
Leave resamples at 3,000

Zimbabwe 200,000 Zimbabwe 1,000,000

SLIDE 81

2007 en 81

The results display The results display – – what what’ ’s there s there

Parameters

f best fitting

curve found in sample Graph with: Surveillance data Unique curves (light gray) Bounds (dashed lines) Best curve (UA fit - red) Mean (blue) Median (black)

UA fit UA fit – – the curve with best fit to the available data of those sampled the curve with best fit to the available data of those sampled

SLIDE 82

2007 en 82

What do the checkboxes at the bottom refer to? What do the checkboxes at the bottom refer to?

Surv data – the actual surveillance values

– Presented by site so you can see site trends

Curves – the unique resampled curves
UA fit curve – the most likely among the sampled curves

– “best fit” for us

Bounds

– 95% confidence bounds (95% of curves fall between the dashed lines)

Mean and median

– Year by year, the mean & median of all resampled curves

SLIDE 83

2007 en 83

If you calibrate to a survey, you If you calibrate to a survey, you’ ’ll see ll see

Red diamond is the survey value Black dotted lines – uncalibrated bounds Blue dotted lines – calibrated bounds

SLIDE 84

2007 en 84

Two key things to note here Two key things to note here

If only one survey, calibrated UA fit curve will go

through the survey point

Calibrated bounds are lower and narrower than

uncalibrated

– Assumption is that survey accurately reflects true population prevalence, subject to measured error

SLIDE 85

2007 en 85

Results page Results page – – putting your projections together putting your projections together

combine everything into the national projection

combine everything into the national projection

New button “Show uncertainty” Gives national uncertainty combining all the local uncertainty

SLIDE 86

2007 en 86

Clicking on Clicking on “ “Show uncertainty Show uncertainty” ” brings brings up national uncertainty results up national uncertainty results

SLIDE 87

2007 en 87

Saving an uncertainty file for Spectrum Saving an uncertainty file for Spectrum

Click “Save Spectrum uncertainty file”
A file with the extension *.spu will be saved,

which can be read by Spectrum

– By default in C:\Program Files\EPP 2007 R0\eppout

SLIDE 88

2007 en 88

So to review, to fit a generalized epidemic So to review, to fit a generalized epidemic

Fill in everything until you get to Projection Page as in

the past (and documented in the EPP 2005 manual)

Press “Assess uncertainty” button on Projection page
Press “Analyze uncertainty” button & wait for it to finish
Press the “Use UA fit in EPP”
On the Projection Page, hit “Save and Continue”

– This is important – DON’T FORGET IT!!! (you’ll lose results)

Move on to fitting the next sub-population

SLIDE 89

2007 en 89

So to review, fitting an epidemic (continued) So to review, fitting an epidemic (continued)

On the Results page, generate a Spectrum file by:

– Pushing “Output results” – On that display, pushing “Write Spectrum File” – This generates a *.spt file in the eppout directory

Press “Show Uncertainty” button on Results page to see

national uncertainty results

Generate a Spectrum uncertainty file by:

– Pushing “Save Spectrum uncertainty file” on the National Uncertainty Results page – This generates a *.spu file in the eppout directory

Take a well deserved rest

SLIDE 90

2007 en 90

Warning Warning

Do not use older EPP files from EPP 2005