G alaxy for G enomics-enabled B reeding Star Yanxin Gao - - PowerPoint PPT Presentation

g alaxy for g enomics enabled b reeding
SMART_READER_LITE
LIVE PREVIEW

G alaxy for G enomics-enabled B reeding Star Yanxin Gao - - PowerPoint PPT Presentation

G alaxy for G enomics-enabled B reeding Star Yanxin Gao yg28@cornell.edu Introduction Application Specialist, IT Enterprise Breeding System (2020+) GOBii (2015-Present) Breeding and Genetics Corn, DAS (2008-2015) Soybean,


slide-1
SLIDE 1

Galaxy for Genomics-enabled Breeding

Star Yanxin Gao yg28@cornell.edu

slide-2
SLIDE 2

Star Yanxin Gao, Ph.D., PMP

  • Application Specialist, IT
  • Enterprise Breeding System (2020+)
  • GOBii (2015-Present)
  • Breeding and Genetics
  • Corn, DAS (2008-2015)
  • Soybean, VT (2004-2008)
  • Vegetables, Cornell (1999-2004)

Introduction

slide-3
SLIDE 3

Outline

  • 1. Project
  • 2. Products
  • 3. Partnership
slide-4
SLIDE 4

Genomic selection MAS

GOBii Mission Transform breeding by enabling genomic–assisted selection as routine breeding applications

Genomic-Assisted Selection Platform

High Throughput Analysis and Decision Support Genotyping Management Suitable Genotyping Platforms High Throughput Sample Tracking Breeding Management

GS-Galaxy Project (1st P)

slide-5
SLIDE 5
  • Victor Ulat
  • Susanne Dresigacker
  • Mike Olsen
  • Umesh Rosyara
  • Xuecai Zhang
  • Juan BURGUEÑO
  • Fernado Toledo

CIMMYT

  • Selvanayagam Siva
  • Rajeev Varshney
  • Abhishek Rathor
  • Manish Roorkiwal
  • Hima Kudapa
  • Santosh Deshpande

ICRISAT

  • Venice Juanillas
  • Ramil Mauleon (Former)
  • Ken McNally
  • Josh Cobb
  • Carlos Ignacio
  • Dmytro Chebotarov
  • Nick Alexandrov Former)
  • Jessica Rutkoski (Former)
  • Juan Arbelaez (Former)

IRRI

  • Angel Villahoz-Baleta (Former)
  • Star Yanxin Gao
  • Kelly Robbins
  • Liz Jones
  • Yaw Nti-addae

Cornell University

  • Alexis Dereeper
  • Michael Quinn
  • Paulino Perez
  • Jose Crossa
  • Clay Sneller
  • Kate Dreher
  • Tom Hagen
  • Yoseph Beyene
  • Manje Gowda
  • Nicholas Santantonio
  • Isaak Tecle
  • Milcah Kigoni
  • Iain Milne
  • Gordon Stephen
  • Hiro Iwata
  • Dave Clements

Collaborators

GS-Galaxy Project (1st P)

People

slide-6
SLIDE 6

GS-Galaxy Project (1st P)

Milestones

1st milestone: Minimum set of GS tools installed in Galaxy- June 2018:

Ø GS workflow mapped for each crop and common minimum desirable tools and features identified. Ø Pipeline developed with minimum desirable features. Ø Basic and common tool components put in place. Ø Tested with well curated test datasets for each crop by product owners and testers

2nd milestone: Production server with published GS workflows-June 2019

Ø Customized workflows for each crop. Ø v1 for fully functioning GS pipeline. Ø Working pipeline available to centers. Ø Training and workshop to users other than product owners and more users start using pipeline and tools

3rd milestone: GOBII integrated as data source for Galaxy-June 2020

Ø Deliver a complete pipeline with no data manipulation required. Ø Access data extract for phenotypes and genotypes from Galaxy pipeline using remote Galaxy web servers. Ø Found a solution for storing the output data long term so it can easily be accessed. Ø Pipelines widely used within CG and outside CG in future.

slide-7
SLIDE 7

GS-Galaxy Products (2nd P)

http://galaxy-demo.excellenceinbreeding.org/

slide-8
SLIDE 8

Enable routine genomic selection analysis

Adopted Slide from Clay Sneller

v 2 v 3 v 4 v 6 v 7 v 8

v 5

GS-Galaxy Products (2nd P)

http://galaxy-demo.excellenceinbreeding.org/

slide-9
SLIDE 9

GS-Galaxy Products (2nd P)

http://galaxy-demo.excellenceinbreeding.org/

Workflow 1: Predict GEBVs in untested lines Workflow 2: Clustering/population structure Workflow 4: Genome-wide association study Workflow 3: Cross validation

slide-10
SLIDE 10

GS-Galaxy Products (2nd P)

GOBii-Galaxy Integration

slide-11
SLIDE 11

Adopt products

  • Open-source and free
  • Test with own datasets and use cases
  • Download from toolshed

Partnership Models

GS-Galaxy Partnership (3rd P)

Community development

  • Implement new tools
  • Develop crop-specific workflows
  • Connect with databases
slide-12
SLIDE 12

Adoption Metrics

GS-Galaxy Partnership (3rd P)

  • Number of registered users: 214
  • Number of different crops/programs: 3 (rice, wheat, maize)
  • Number of analyses/jobs performed: 14200
  • Total job file sizes stored: 148 Gb
  • Average file size: 4.2982 Mb
slide-13
SLIDE 13

Genotyping Management Vendors Analytics Sample Management

Breeding Management Sample Tracking Data Extract Breeding Data Extract Decision Support Data Loading Service Tracking

B4R

GS-Galaxy Partnership (3rd P)

slide-14
SLIDE 14

Invite to Partner with GS-Galaxy

  • 1. Project
  • 2. Products
  • 3. Partnership

EiB Demo Server: http://galaxy-demo.excellenceinbreeding.org/ Publish tools to Galaxy main toolshed Galaxy workshop at Cornell/BTI April, 2020

slide-15
SLIDE 15

Thanks