SITE REPORT: THE FRANCIS CRICK INSTITUTE (ADAM HUFFMAN)


slide-1
SLIDE 1

SITE REPORT: THE FRANCIS CRICK INSTITUTE

slide-2
SLIDE 2

ADAM HUFFMAN

Senior HPC and Cloud Systems Engineer
The Francis Crick Institute
@adamhuffman
adam.huffman@crick.ac.uk

slide-3
SLIDE 3

MY HISTORY WITH SCIENTIFIC SOFTWARE

Life Sciences at University of Manchester
Mainly RPM packaging of biology/bioinformatics applications
Fedora packager

slide-4
SLIDE 4

MY LATER HISTORY WITH SCIENTIFIC SOFTWARE

Imperial College London
CMS experiment, GridPP, WLCG
Highly domain-specific application provisioning
Mature software environment
CVMFS, cf. Compute Canada, ALICE

slide-5
SLIDE 5

THE FRANCIS CRICK INSTITUTE

Biomedical discovery institute dedicated to understanding the fundamental biology underlying health and disease
Biggest biomedical research facility under a single roof in Europe
1,500 staff, £650m investment
http://www.crick.ac.uk

slide-6
SLIDE 6

MIGRATION

Moving from 3 sites to 1 new building
Diverse, devolved environments
Data

slide-7
SLIDE 7

MY PRESENT WITH SCIENTIFIC SOFTWARE

Inheriting diversity of legacy clusters and experience
Users have very little experience with modules
Symbolic links
Hard-coded profile paths

slide-8
SLIDE 8

THE SOLUTION?

...EasyBuild

slide-9
SLIDE 9

EASYBUILD FOR EMEDLAB

Large biomedical research cloud
I used EasyBuild for several groups

slide-10
SLIDE 10

EASYBUILD USAGE AT CRICK

New CAMP cluster
Mostly foss-2016b, some intel
683 modules in main area
119 modules in development area

slide-11
SLIDE 11

COMMUNITY RESPONSE

Lack of familiarity
Have to introduce the concept and benefits of modules
Some groups took to it quickly, after initial complaints
Benefits and importance of 1:1 training with research groups

slide-12
SLIDE 12

SPECIAL CHARACTERISTICS OF BIOINFORMATICS APPLICATIONS

Pipelines relying on specific old software versions
Similar pipeline, two different old versions
Reproducibility?
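
One way to picture the reproducibility point is to record the exact module version each pipeline expects and generate fully versioned `module load` lines from that record. The sketch below is illustrative only: the pipeline names and tool versions are hypothetical, and the `name/version-toolchain` module naming is an assumption about the site's naming scheme.

```python
"""Sketch: pin exact module versions per pipeline so reruns use the same software."""

# Hypothetical pipelines and versions, for illustration only.
PIPELINES = {
    "rnaseq-2015": {"SAMtools": "0.1.19", "Bowtie2": "2.2.3"},
    "rnaseq-2016": {"SAMtools": "1.3.1", "Bowtie2": "2.2.9"},
}

TOOLCHAIN = "foss-2016b"  # toolchain named on the usage slide


def module_load_lines(pipeline, toolchain=TOOLCHAIN):
    """Return one fully versioned `module load` command per tool."""
    pinned = PIPELINES[pipeline]
    return [
        f"module load {name}/{version}-{toolchain}"
        for name, version in sorted(pinned.items())
    ]


if __name__ == "__main__":
    for pipeline in sorted(PIPELINES):
        print(f"# {pipeline}")
        print("\n".join(module_load_lines(pipeline)))
```

Two similar pipelines can then sit side by side, each tied to its own old versions, without either relying on whatever the default module happens to be.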

slide-13
SLIDE 13

IMPACT OF EASYBUILD AT CRICK

Made one part of my work feasible
Thanks from research groups
Embedding better practices

slide-14
SLIDE 14

EASYBUILD FINDS YOU OUT

Wish I'd been using hide-deps from the beginning...
Over-optimised code that won't run on legacy hardware we've inherited (Intel)
Better training and documentation needed
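
The legacy-hardware problem can be checked from the node side by comparing the CPU flags a node advertises with the instruction sets a build is assumed to use. This is a minimal sketch, assuming Linux x86 `/proc/cpuinfo`-style text; the required flags (`avx2`, `fma`) and the sample text are made-up examples, not the actual Crick hardware. On the build side, EasyBuild's `--optarch` option (for example `--optarch=GENERIC`) is the usual control.

```python
"""Sketch: check whether a node's CPU supports the instruction sets a build assumes."""


def cpu_flags(cpuinfo_text):
    """Extract the set of CPU flags from /proc/cpuinfo-style text (Linux x86)."""
    for line in cpuinfo_text.splitlines():
        key, _, value = line.partition(":")
        if key.strip() == "flags":
            return set(value.split())
    return set()


def missing_features(cpuinfo_text, required):
    """Return the required flags that the CPU does not advertise, sorted."""
    return sorted(set(required) - cpu_flags(cpuinfo_text))


# Made-up sample standing in for an older node's /proc/cpuinfo.
SAMPLE_CPUINFO = "processor\t: 0\nflags\t\t: fpu sse sse2 avx\n"

if __name__ == "__main__":
    # Assumed requirement: a build made on a newer node that uses AVX2 and FMA.
    print(missing_features(SAMPLE_CPUINFO, ["avx", "avx2", "fma"]))
```

On a real node the text would come from reading `/proc/cpuinfo`; a non-empty result suggests the build would fail with an illegal-instruction error there.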

slide-15
SLIDE 15

EMPOWERED ENTHUSIASTS

Enabled fruitful cooperation between admins and research groups
Teething problems with MODULEPATH and imperfect knowledge sharing

  • Cf. work by Markus at JSC
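
A small check of the kind that could help with MODULEPATH teething problems: list entries that appear twice or point at directories that do not exist. This is only a sketch; `check_modulepath` is a hypothetical helper, not part of EasyBuild or any modules tool.

```python
"""Sketch: spot duplicate or missing directories in a MODULEPATH value."""
import os


def check_modulepath(value, exists=os.path.isdir):
    """Return (duplicates, missing) for a colon-separated MODULEPATH string."""
    seen, duplicates, missing = set(), [], []
    for entry in value.split(":"):
        if not entry:
            continue  # skip empty entries left by stray colons
        if entry in seen:
            if entry not in duplicates:
                duplicates.append(entry)
            continue
        seen.add(entry)
        if not exists(entry):
            missing.append(entry)
    return duplicates, missing


if __name__ == "__main__":
    dups, gone = check_modulepath(os.environ.get("MODULEPATH", ""))
    print("duplicates:", dups)
    print("missing:", gone)
```

The `exists` parameter is there so the function can be exercised without touching the filesystem.
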
slide-16
SLIDE 16

ATTENTION-DEFICIT SCIENCE

Problems owing to upstream software/release quality
Special Interest Group, or revival of HPCBIOS?
Cf. Damian's talk: sufficient familiarity for current hierarchical module naming scheme?

slide-17
SLIDE 17

GENERAL EASYBUILD EXPERIENCE

Importance of community over strict technical merit
Very welcoming, extremely helpful
Shining example of a distributed open source project
Made my work feasible, and even enjoyable, as opposed to impossible

slide-18
SLIDE 18

EASYBUILD OBSERVATIONS

Share tips of the founders
Speed adoption of contrib tools e.g. easy_update
Linting
Local testing
.ebp patch files
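
As an illustration of what lightweight local linting could look like, the sketch below parses an easyconfig (which uses Python syntax) without executing it and reports required parameters that are never assigned. It is not EasyBuild's own checking; the list of required parameters is an assumption based on the parameters EasyBuild documents as mandatory, and `ExampleTool` is a hypothetical package.

```python
"""Sketch: a tiny local lint for easyconfig files, which use Python syntax."""
import ast

# Parameters treated as mandatory here (an assumption for this sketch).
REQUIRED = ("name", "version", "homepage", "description", "toolchain")


def assigned_names(source):
    """Collect names assigned at the top level of the easyconfig source."""
    names = set()
    for node in ast.parse(source).body:
        if isinstance(node, ast.Assign):
            for target in node.targets:
                if isinstance(target, ast.Name):
                    names.add(target.id)
    return names


def lint_easyconfig(source):
    """Return the required parameters that the source never assigns."""
    found = assigned_names(source)
    return [param for param in REQUIRED if param not in found]


# Hypothetical, deliberately incomplete easyconfig.
EXAMPLE = '''
name = 'ExampleTool'
version = '1.0'
toolchain = {'name': 'foss', 'version': '2016b'}
sources = [SOURCE_TAR_GZ]
'''

if __name__ == "__main__":
    print(lint_easyconfig(EXAMPLE))
```

Parsing with `ast` rather than executing the file means template constants such as `SOURCE_TAR_GZ` do not need to be defined for the check to run.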

slide-19
SLIDE 19

MORE OBSERVATIONS

Better metadata, cf. Markus' plan?
(Further) toolchain documentation improvements
Formalise site customisation methods

slide-20
SLIDE 20

THANK YOU