Synthetic Benchmarks for Genetic Improvement (Aymeric Blot, Justyna Petke)



Synthetic Benchmarks for Genetic Improvement
Aymeric Blot, Justyna Petke
University College London, UK
UK EPSRC grant EP/P023991/1
GI@ICSE, 3 July 2020

In a Nutshell
Motivation:
◮ Empirical comparisons of GI approaches
◮ Parameter configuration of GI
◮ Genetic improvement of GI
◮ Quick experimentation for GI ideas
Idea:
◮ Premise: GI applied to software is very slow
◮ Bottleneck: fitness evaluation
◮ Proposition: synthetic benchmarks

Synthetic Benchmarks
Issues with real-world benchmarks:
◮ Evaluation is expensive
◮ Good data is scarce
◮ Uncertain features
Possible solutions:
◮ Surrogate modelling
◮ Artificial instances
◮ Synthetic benchmarks
Dang et al., GECCO 2017 (AC(AC) using surrogate modelling)
Malitsky et al., LION 2016 (Structure-preserving instance generation)

Formalism
Standard GI:
\[
\begin{aligned}
\text{optimise} \quad & E[\, o(s, i),\ i \in D \,] \\
\text{subject to} \quad & s \in S
\end{aligned}
\tag{GI}
\]
with:
◮ $E$: statistical population parameter (e.g., average)
◮ $o$: cost metric (e.g., running time)
◮ $D$: input distribution (e.g., test cases, instances)
◮ $s$: software variants
◮ $S$: search space
Idea: Replace $E[\, o(s, i),\ i \in D \,]$ with a single instantaneous query

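The contrast between the standard objective and the proposed replacement can be sketched in Python. This is only an illustration under stated assumptions, not the authors' implementation: `empirical_fitness`, `synthetic_fitness`, `toy_cost`, and the lookup-table model are hypothetical names introduced here, and E is taken to be the average.

```python
from statistics import mean
from typing import Callable, Dict, Iterable


def empirical_fitness(cost: Callable[[str, int], float],
                      variant: str,
                      inputs: Iterable[int]) -> float:
    """Standard GI: aggregate the cost o(s, i) over inputs i from D (here E = average)."""
    return mean(cost(variant, i) for i in inputs)


def synthetic_fitness(model: Dict[str, float], variant: str) -> float:
    """Synthetic benchmark: one instantaneous lookup replaces all the runs."""
    return model[variant]


def toy_cost(variant: str, i: int) -> float:
    # Hypothetical stand-in for running software variant s on input i.
    return len(variant) * i


inputs = [1, 2, 3]
# Hypothetical model: fitness values computed once, then queried for free.
model = {v: empirical_fitness(toy_cost, v, inputs) for v in ("ab", "abcd")}
```

With the real objective, every fitness query costs one run per input; with the model, it costs a dictionary lookup.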
Software Analysis
Search space:
◮ Around $n$ deletions
◮ Around $n^2$ replacements
◮ Around $n^2$ insertions
⇒ $\sum_{i=1}^{k} (n^2)^i$ sequences up to size $k$
◮ that's too big!
Assumption:
◮ Edits are independent ⇒ only around $n^2$ fitness values
◮ reasonable to model

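A quick calculation shows how fast the count of edit sequences grows compared with the number of fitness values needed under the independence assumption. The function names are hypothetical, and the sketch assumes roughly n^2 possible edits at each position of a sequence.

```python
def sequence_count(n: int, k: int) -> int:
    """Edit sequences of length 1..k, assuming about n^2 candidate edits per position."""
    edits = n * n
    return sum(edits ** i for i in range(1, k + 1))


def independent_model_size(n: int) -> int:
    """Fitness values to model when edits are assumed independent: about n^2."""
    return n * n


if __name__ == "__main__":
    for n, k in [(10, 3), (100, 3)]:
        print(n, k, sequence_count(n, k), independent_model_size(n))
```

For n = 10 and k = 3 the sum is 100 + 10,000 + 1,000,000 = 1,010,100 sequences, against only 100 per-edit values for the independent model.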
Synthetic Model
Empirical analysis:
◮ Sample edits
◮ Collect data, e.g.:
  ◮ did it compile?
  ◮ did it run?
  ◮ was it correct?
  ◮ how much better/worse?
◮ Compute underlying distribution
Contribution aggregation:
◮ Compilation errors propagate
◮ Runtime errors propagate
◮ Wrong outputs propagate
◮ Duplicate edits are ignored
◮ Fitness ratios are multiplied
E.g.: [80%, 100%, 105%] → 84%

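The aggregation rules above can be sketched as follows. This is a minimal illustration, not the authors' code: the `Outcome` enum, `aggregate`, and `example_model` are hypothetical, and the severity order used when several edits fail (compilation error over runtime error over wrong output) is an assumption the slides do not specify.

```python
from enum import IntEnum
from typing import Dict, Iterable, Optional, Tuple


class Outcome(IntEnum):
    # Assumed severity order; higher values win when several edits fail.
    OK = 0
    WRONG_OUTPUT = 1
    RUNTIME_ERROR = 2
    COMPILE_ERROR = 3


Effect = Tuple[Outcome, float]  # (outcome of the edit alone, fitness ratio)


def aggregate(model: Dict[str, Effect],
              edits: Iterable[str]) -> Tuple[Outcome, Optional[float]]:
    """Combine independent per-edit effects into one synthetic fitness."""
    worst = Outcome.OK
    ratio = 1.0
    # dict.fromkeys drops duplicate edits while keeping their order.
    for edit in dict.fromkeys(edits):
        outcome, edit_ratio = model[edit]
        worst = max(worst, outcome)   # any failure propagates to the variant
        ratio *= edit_ratio           # fitness ratios are multiplied
    if worst is not Outcome.OK:
        return worst, None            # a failing variant has no fitness ratio
    return worst, ratio


# Hypothetical per-edit data, mirroring the example from the slide.
example_model: Dict[str, Effect] = {
    "a": (Outcome.OK, 0.80),
    "b": (Outcome.OK, 1.00),
    "c": (Outcome.OK, 1.05),
    "x": (Outcome.COMPILE_ERROR, 1.0),
    "y": (Outcome.WRONG_OUTPUT, 1.0),
}
```

Calling `aggregate(example_model, ["a", "b", "c", "a"])` ignores the repeated edit and multiplies 0.80 × 1.00 × 1.05, giving a ratio of about 0.84, which matches the slide's example.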
Conclusion
Problem:
◮ GI(software) is much slower than software
◮ GI(GI(software)) is much much slower than GI(software)
Idea:
◮ Replace software with model
◮ model is free
◮ GI(model) is cheap
◮ GI(GI(model)) should be reasonable
Advantages:
◮ Cheap, reusable benchmarks
◮ Model as complex as designed
◮ Possible focus on particular software feature

Selected References
Nguyen Dang, Leslie Pérez Cáceres, Patrick De Causmaecker, and Thomas Stützle. Configuring irace using surrogate configuration benchmarks. In Peter A. N. Bosman, editor, Proceedings of the 12th Genetic and Evolutionary Computation Conference (GECCO 2017), Berlin, Germany, pages 243–250. ACM, 2017.
Yuri Malitsky, Marius Merschformann, Barry O’Sullivan, and Kevin Tierney. Structure-preserving instance generation. In Paola Festa, Meinolf Sellmann, and Joaquin Vanschoren, editors, Proceedings of the 10th International Conference on Learning and Intelligent Optimization, Revised Selected Papers (LION 10), Ischia, Italy, volume 10079 of Lecture Notes in Computer Science, pages 123–140. Springer, 2016.
