TUT1151 Simplifying AI Applications with containers and Kubernetes
Glyn Bowden, Chief Architect, AI & Data Science Practice
SUSECON 2019
TUT1151 Simplifying AI Applications with containers and Kubernetes - - PowerPoint PPT Presentation
TUT1151 Simplifying AI Applications with containers and Kubernetes Glyn Bowden, Chief Architect, AI & Data Science Practice SUSECON 2019 The world is replacing programming with training 23 million developers worldwide 4 in 10 companies
Glyn Bowden, Chief Architect, AI & Data Science Practice
SUSECON 2019
The world is replacing programming with training
2
23 million
developers worldwide Handful of data scientists
4 in 10 companies mention
lack of analytical skills
as a key challenge
By 2020, 50% of organizations will
lack sufficient AI and data literacy skills
to achieve business value
What you need to do defines what you have to do
3
Edge Cloud
Where is the data generated? What does that data consist of? How long do you have to take action? What governance and security regulations do you need to comply with? What are your business goals? How do you prepare and integrate data for advanced analytics ? What do you have to do to put that data in a form you can use?
4
AI ecosystem
A working solution requires every layer of this stack
4
Hardware
– Memory – Storage – Network – Hardware accelerators
Software
– Custom applications – As-a-service offerings – Machine learning and deep learning frameworks and libraries (TensorFlow, Caffe, Scikit-learn, …), data platforms – Systems software and libraries (CUBLAS, MKL, cuDNN, MKL-DNN)
Data
– Public data sources – Proprietary data collections – Data labeling providers
Expertise
– Advisory / consulting services – Vertical SMEs
Challenges to gaining insight
5
Existing AI process
Individual – AI solutions tend to be bespoke. Designed and deployed as part of the project Time consuming – AI solution cycle is long due to bespoke nature of design Risk – Bespoke solutions and iterative platform development introduces risk of inconsistent results and even project failure Inefficient – Skilled and costly resources used
6
The Four Pillars of Data Science
“simplify AI consumption bringing together the AI ecosystem”
Data Sources Data Management Analytics Insight – Multiple formats – Multiple Sources – Multiple Standards – Security – Data Platform – Data Lake – On Premise – Hybrid Cloud Integration – Multiple Machine Learning Libraries – Multiple AI trained models – Multiple AI languages – Business Use Cases – Integrated Analysis – 360 view of data – CDO Dashboards
AI Ecosystem Summary
7
Distinct ML Services
ML Model Candidates
Distinct ML Services
AI / ML Candidate Partners
Language Frameworks
OpenSource
Model Categories
Caffe2Modelzoo
AI – Vertical Solution Use Case Examples
8
Finance
Algorithmic Trading
Market data Trade ExecutionRisk Analysis
Simulation Decision ExecutionFraud Detection
Financial data NotificationHealthcare
Genome Sequencing Resource Management Connected Health
Simulation Conclusion Resource data Proposal Patient sensors MedicalTelco / Media
Customer Profiling
Activity & social analysis Directed advertising Sensor analysis Config. execution Log analysis NotificationNetwork Performance Analysis SmartCity
Manufacturing
Material Analysis Predictive Maintenance
Simulation Conclusion Sensor Analysis Notification Environ. Analysis Task executionSelf-Functioning Devices
Batch or manual Real-time data feed Real-time executionEnterprise
Supply chain
Image / Video Recognition Natural language comms.
Layered Architecture
HPE Internal Use Only9
Infrastructure Container Platform SUSE CaaSP Cloud On-Site DWH SUSE Enterprise Storage Cloud DWH Unified Data Warehouse Big Data Frameworks AI Frameworks Developer Frameworks SUSE CAP Models and Algorithm (Content) Task Execution Engine Visualisation Engine Business Users Data Scientists IT Operators Hybrid IT Practice Cloud Technology Partners
Accelerate time-to-value in your AI journey with OneAI
Rapid deployment of solution components during PoV leading to, smooth production
Conventional steps for a PoV / Production implementation
Provisioning of Infrastructure Setup AI s/w stack Containerization process Messaging and transformation components Monitoring tools
Automated setup with OneAI
Automated Provisioning Containerize Easy Monitoring
Elapsed Time: Weeks to Months Elapsed Time: Hours Faster PoV Getting proof points early and fail fast ! Automation deployment of infrastructure, Monitoring tools and Management: simple
11
How Project OneAI works – an illustrative view
UseCase Parameters
parameters
Templates OneAI
Orchestrated w/ Kubernetes Unified monitoring on open frameworks (Prometheus) Automated Provisioning
ecosystem Use Case Solution Architecture
Use cased based output
Addressing the challenges to Insights
12
HPE Pointnext Framework
Rapid – Deploy an AI solution, so it is ready to start receiving data Reuse – Leverage HPE globally industrialized templates and project IP Learn & Evolve – Solution gallery will grow
delivered Efficient – Consultants are driving insight from the customer data not configuring and experimenting with underlying technologies
Existing AI process
Individual – AI solutions tend to be bespoke. Designed and deployed as part of the project Time consuming – AI Solution cycle is long due to bespoke nature of design Risk – Bespoke solutions and iterative platform development introduces risk of inconsistent results and even project failure Inefficient – Skilled and costly resources used
Project OneAI Functional Architecture v2.0
13
Kubernetes Cluster Istio Service Mesh OneAI Microservices catalog environment datasource lifecycle
Ambassador API Gateway Keycloack OAUTH2/OIDC
hub.docker.com HPE Pointnext Docker Registry Private Docker Registry NVIDIA NGC
Content
Grommet UI
Web browser {{api_client}}
Service Scaling
14
Cluster Scaling
15
Component Hierarchy Design
16
17
18
19
20
gjb@hpe.com
21