rdi rdi rdi rdi
play

RDI RDI RDI RDI HIV Resistance Response HIV Resistance Response - PowerPoint PPT Presentation

RDI RDI RDI RDI HIV Resistance Response HIV Resistance Response Database Initiative Database Initiative Seville 2002 Presentation Seville 2002 Presentation Goal Goal To develop a relational database to To develop a relational


  1. RDI RDI RDI RDI HIV Resistance Response HIV Resistance Response Database Initiative Database Initiative Seville 2002 Presentation Seville 2002 Presentation

  2. Goal Goal • To develop a relational database to To develop a relational database to • correlate HIV drug resistance- - correlate HIV drug resistance associated genotype data with associated genotype data with response to antiretroviral agents response to antiretroviral agents

  3. Aim Aim • Initial aim is to collect genotype, treatment Initial aim is to collect genotype, treatment • & clinical outcome information (VL &CD4) & clinical outcome information (VL &CD4) from substantial numbers of patients from substantial numbers of patients • To To organise organise data in an oracle data in an oracle- -based based • relational database relational database • To To analyse analyse data using a number of data using a number of • approaches to relate resistance mutation approaches to relate resistance mutation patterns with clinical response patterns with clinical response • Provide wide access to interrogation of Provide wide access to interrogation of • data via the internet data via the internet

  4. Core Team Core Team • Mission of the Core Team: Mission of the Core Team: • • Contribute data & Develop database Contribute data & Develop database • • Ensure data meets appropriate QA Ensure data meets appropriate QA • standards standards • Develop initial data analysis plan Develop initial data analysis plan • • Review requests to Review requests to analyse analyse data from data from • the database the database

  5. Data Input Data Input Sequence Sequence DATABASE VL & CD4 VL & CD4 DATABASE (bl ( bl & follow & follow- -up) up) Therapies Therapies

  6. Data QA Data QA • Specific QA standards applied to Specific QA standards applied to • data submitted to the database data submitted to the database • Developed for : Developed for : • • Clinical Data (Cohort, Clinical Trial) Clinical Data (Cohort, Clinical Trial) • • Sequencing Data (RT, Protease regions) Sequencing Data (RT, Protease regions) •

  7. Data Analysis Data Analysis • Data identified from about 3500 patients Data identified from about 3500 patients • – More when additional genotyping performed More when additional genotyping performed – • Oracle database hardware & software in Oracle database hardware & software in • place (with dedicated support) place (with dedicated support) – DB architecture constructed DB architecture constructed – • Power calculations have been performed Power calculations have been performed • to estimate approx. number of required to estimate approx. number of required data points (see data points (see DiRienzo DiRienzo & & DeGruttola DeGruttola) ) • Initial NN models constructed Initial NN models constructed (see Wang et al) • (see Wang et al)

  8. Neural Network Model Neural Network Model (Dechao Dechao Wang et al) Wang et al) ( Mutations Mutations predicted predicted Neural Neural VL VL ∆ VL ∆ VL Network Network (bl ( bl & follow & follow- -up) up) Therapies Therapies

  9. 62- -Parameter Neural Network model Parameter Neural Network model 62 (Dechao Dechao Wang et al) Wang et al) ( • Input variables Input variables • - 20 PI 20 PI codon codon positions positions - - 29 RT 29 RT codon codon positions positions - - 12 drugs (5 PIs, 5 12 drugs (5 PIs, 5 NRTIs NRTIs, 2 , 2 NNRTIs NNRTIs) ) - - Duration of therapy (weeks) Duration of therapy (weeks) - • Output variable Output variable • - Viral load change at on Viral load change at on- -therapy time therapy time - points points

  10. VGI Vigilance II Database VGI Vigilance II Database Viral load Viral load ( 598 ) ( 598 ) Genotype Genotype (781 ) (781 ) Regimen Regimen (715 ) (715 ) 442 samples 442 samples

  11. Linear Regression Analysis: Linear Regression Analysis: Training Set Training Set Training set (n=639) R 2 = 0.85 4 Predicted viral load 2 change 0 -4 -3 -2 -1 0 1 2 3 -2 -4 Actual viral load change

  12. Linear Regression Analysis: Linear Regression Analysis: Independent Set Independent Set Validation set (n=63) 3 R 2 = 0.55 2 Predicted viral load 1 change 0 -4 -3 -2 -1 -1 0 1 2 3 -2 -3 -4 Actual viral load change

  13. Predicting VL Trajectory: Predicting VL Trajectory: Test Data Set Test Data Set NN Prediction: 75%±1.8% Viral Load correct (~86/115) Time

  14. “In silico silico” Response Prediction ” Response Prediction “In • Baseline genotype for a virtual patient Baseline genotype for a virtual patient • - RT: 41, 67, 118, 210, 215 RT: 41, 67, 118, 210, 215 - - PI: 10, 46, 82, 90 PI: 10, 46, 82, 90 - • Alternative therapy regimens Alternative therapy regimens • a. D4T, ddI ddI, , Kaletra Kaletra a. D4T, b. D4T, ddI ddI, , indinavir indinavir b. D4T, c. AZT, 3TC, Kaletra Kaletra c. AZT, 3TC, d. AZT, 3TC, indinavir indinavir d. AZT, 3TC,

  15. “In silico silico” Response Prediction ” Response Prediction “In 3 2 Viral Load (log 10 ) D4T/ddI/IDV 1 0 AZT/3TC/IDV D4T/ddI/Kal -1 AZT/3TC/Kal -2 Baseline week 8 week 16

  16. Summary Summary • The RDI is focused on establishing relationships • The RDI is focused on establishing relationships between baseline genotype & virological between baseline genotype & virological response via analysis of a large clinical dataset response via analysis of a large clinical dataset • Significant progress has been made: • Significant progress has been made: – Sources of data identified – Sources of data identified – Database architecture constructed – Database architecture constructed – Modeling work has begun – Modeling work has begun • This initiative is open for groups to join & aims to • This initiative is open for groups to join & aims to provide open access to query the database provide open access to query the database • Utilization of large databases is likely to improve • Utilization of large databases is likely to improve the accuracy of genotypic interpretation the accuracy of genotypic interpretation

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend