Building a Hybrid Cloud at Canadian Pacific
Stuart Charlton, Director – Infrastructure & Operations Information Technology
Building a Hybrid Cloud Stuart Charlton, Director Infrastructure - - PowerPoint PPT Presentation
Building a Hybrid Cloud Stuart Charlton, Director Infrastructure & Operations at Canadian Pacific Information Technology Canadian Pacific in 2010 15,500 14,800 active employees mile network $5.0 77.6 billion in revenues operating
Building a Hybrid Cloud at Canadian Pacific
Stuart Charlton, Director – Infrastructure & Operations Information Technology
mile network
active employees
billion in revenues
Canadian Pacific in 2010
1
2
Canadian Pacific’s Network
Vision: To be the safest, most fluid railway in North America CP operates in 6 Canadian provinces and 13 US States
3
§ Integrated Information Program
§ Positive Train Control
§ Predictive Operations § New Ordering Processes
§ Reducing Operating Ratio § Givens:
years combined
Responding to the Railway Industry’s Global Renaissance…
IT Transformation
2009-2015
Our Assumptions
§ Challenge #1: Volume, lead times & costs of infrastructure
§ Challenge #2: Bending down the operational cost curve for production
§ Challenge #3: Reducing cycle time of delivering changes to systems
§ Challenge #4: Increasing the availability of core operational systems
Approach: Using the right tool for the job, given the time constraints
Caveat: Forward-looking - this all may change
4
Advice we got: “Look at how complicated all this stuff is!”
Multi-Year Infrastructure & Delivery Strategy
6
Public Cloud Adoption 2009-2011
§ “Guerilla Cloud Warfare” § Dev/Test Infrastructure § Get the company used to them § Resolve immediate lead time problems
2011-2014 Agile Delivery & Ops
§ Move everything to Linux/ Windows § Agile/lean development § Automation, configuration management, pervasive virtualization § Private Cloud for SAP
New Systems Arch
§ Fault-Tolerant Distributed DBs & Data Grids § Event-driven and RESTful integration § Modular pieces
2012-2015
Public Cloud Adoption
Scenario: About to hire 200 SAP or Java Consultants
8
How will you provision for them?
Guerilla Cloud Warfare
§ Aka. “How to adopt several hundred desktops & servers in a controlled way with almost no staff” § Example Roadblock: Firewalls § Normal Solution: Open them up.
§ Guerilla Solution: Reverse SSH Tunnels. Works with TCP, SOCKS, even UDP if you’re crazy enough § Lesson: Get approval and constraints from the people who matter
CISO (who will prepare his team + legal/audit), CTO or GM/VP of Architecture (who is supposed to promote new things)
9
Global Public Cloud Dev/Test Network, late 2010
10
Western US Region
VDI Desktops
Authentication: Windows Domain Logon Outbound Firewall: Domain Group Policy Win2K8 Win2K8 Win2K8 Win2K8 Win2K8 Win2K8 Win2K8 Domain SSH Jump Host
VDI Desktops
Authentication: Windows Domain Logon Outbound Firewall: Domain Group Policy Win2K8 Win2K8 Win2K8 Win2K8 Win2K8 Win2K8
Singapore Region
SSH Jump Host
Dev/SIT Servers
Eastern US Region
SSH Jump Host SSH / 22 Certificate Auth CP Network SSH / 22 Certificate Auth CP Calgary SSH / 22 Certificate Auth Infosys & IBM India Amazon Backbone Amazon Backbone
Legacy Systems
SSH Reverse Tunnels SSH Forward Tunnel
Developer Client
Approved Internet Domains / IPs Windows Firewall RESTRICTED INTERNET ACCESS IPTABLES Approved Internet Domains / IPs Windows Firewall Dev/Test Linux Dev/Test Linux Dev/Test Linux Dev/Test Linux SSH Forward Tunnel
Western US Region
Public Cloud Benefits & Usage Notes
§ Offshore resources get a managed developer workstation
§ Using Amazon’s Internet backbone between regions
11
us-east-1 ap-southeast-1 CP Canadian Data Centre Offshore Teams (India) 15,500 km 2,900 km 750 km AWS Provider CP
Data Categorization
§ Data Categorization
(Note: These are representative examples, not our actual definitions)
12
Public Cloud Benefits & Usage Notes
§ Very quick lead times to deliver working dev/test systems
WebSphere, SAP, Business Objects, SQL Server, Exchange, etc.
§ Performance challenges
visibility in storage and networking
availability is not as predictable
13
Agile Infrastructure
Operations: Cultural & Tooling Changes
§ Old Assumptions
§ New Reality
no excuse for downtime from any business-level service § First Steps to Transformation
15
Enterprise Appliances
§ Oracle Exadata
16
§ VCE Vblock
(Not Really Private Clouds)
“Wire Once, Walk Away” Software-Based Automated Configuration Managed Services that Leverage the Productivity Gains
Private Cloud for Dev/Test
Private Cloud for Production is a Lofty/Questionable Goal
§ We’re focusing on combining virtualization and appliances with automation & metrics to reduce the dev/test cycle § CP Application Development & Test Cloud
17
18
Bending the Operational Cost Curve
Projected Monthly Per-Instance Costs (over 3 years)
Includes Amortized Capital + Operating Expense (e.g. Public cloud fees) + Managed Services
New Systems
The Logic and Constraints of a Railroad
20
Customer Requirements Track Capacity Crew Availability Locomotive Availability Car Availability Yard Capacity Emergency Management
Basic Railway Systems Architecture (80s)
21
§ No Routing § No Forecasting § Location Visibility but no ETAs
Timetable System Repair & Maintenance System Dispatch System Resource Management (Locomotives, Crews, etc.) Train Movement System Plan Reality Constraints Order & Billing Management Waybills
Modern Railway System Architecture
22
Service Design System Repair & Maintenance System Yard Management System Resource Management (Locomotives, Crews, etc.) CAR Movement System Plan Reality Constraints Order & Billing Management Waybills Proactive Shipment Scheduling Shipment Status Projections Proactive Health Monitoring
Designing a Service, circa 1998-2008 § Multi-Tier Hybrid Architecture
§ Independent servers / applications
§ “Put your eggs into a small number of baskets, and watch those baskets” § General assumptions
downtime
Designing a Service on the Cloud, circa 2008+ § Autonomous services
irrelevant) § Interdependent servers / applications
fault handling § “Many, many servers breaking big problems down or distributing lots of little problems around” § New realities
downtime from any service
Current Guidelines for 2012+
Using, where possible: lightweight, simple, inexpensive solutions
25
2012-2015 Systems Design Target (early draft)
26
Service Design System Yard Marshalling Plans Resource States (Locomotives, Crews, etc.) Car Positions Event-Based Integration Across Where Appropriate Orders Waybills Shipment Schedules Billing Resources Health Status (Track, Cars) RESTful Resources Exposed for Common Access Customer Service (Web & Mobile Devices) Hyperlinked Data for Operations Global Search and Analytics Mix of Custom, SAP, and other Packages
Summary: Multi-Year Infrastructure & Delivery Strategy
27
Public Cloud Adoption 2009-2011
§ “Guerilla Cloud Warfare” § Dev/Test Infrastructure § Get the company used to them § Resolve immediate lead time problems
2011-2014 Unified Infrastructure
§ Move everything to Linux/ Windows § Agile/lean development § Automation, configuration management, pervasive virtualization § Private Cloud for SAP
New Systems Arch
§ Fault-Tolerant Distributed DBs & Data Grids § Event-driven and RESTful integration § Modular pieces
2012-2015
28
Contacts & Thanks
Canadian Pacific Suite 500, 401 – 9th Avenue SW Calgary Alberta Canada T2P 4Z4 www.cpr.ca
Stuart Charlton Director – Infrastructure & Operations Information Technology Stuart_Charlton@cpr.ca With thanks to…. CP architecture: Gary Stedman, Dragan Sajic, Vincent Blue, Tim Riley CP operations: Bob Nash, Jack Vanos, Michael Turcotte, Ron Legere, Stan Singer CP IT risk management & security: Kevin Pasveer CP application delivery: Shawn Adams, Michael Wiens, Steve Hester CP CIO: Heather Campbell