Technological Innova.on at Alibaba Alan Qi Vice President of Ant - - PowerPoint PPT Presentation

technological innova on at alibaba
SMART_READER_LITE
LIVE PREVIEW

Technological Innova.on at Alibaba Alan Qi Vice President of Ant - - PowerPoint PPT Presentation

Technological Innova.on at Alibaba Alan Qi Vice President of Ant Financial Service Group Outline 11-11 & Alibaba Technologies at Alibaba Conclusions What is Singlesday (11-11)? Black Friday vs. 11.11 Black Friday ( 2014) 11.11 Shopping


slide-1
SLIDE 1

Technological Innova.on at Alibaba

Alan Qi Vice President of Ant Financial Service Group

slide-2
SLIDE 2

Technologies at Alibaba 11-11 & Alibaba Conclusions

Outline

slide-3
SLIDE 3

What is Singles’day (11-11)?

slide-4
SLIDE 4

Black Friday vs. 11.11

Black Friday(2014) 11.11 Shopping Festival(2015) 1 Country 1 Day 1 Company 1 Day

2.3 Billion

Dollar Online GMV

9.1 Billion

Dollar Offline GMV

249M

Online & Offline Customer

27.9%

On Mobile Client

14.3 Billion

Dollar Online GMV

>249M

Online Customer

68%

On Mobile Client

slide-5
SLIDE 5

“11-11”sales revenue explodes

2015/11/11: 91.2 Billion Yuan (14.3 Billion USD) In 7 years, sales revenue increases

154 times

5.9 19.4 53.3 191 350 571 100 200 300 400 500 600 2009 2010 2011 2012 2013 2014

GMV

Unit: 0.1 Billion Yuan

slide-6
SLIDE 6

“11-11”number of orders explodes

2014/11/11 Total orders: 278 Millions In 6 years, number of orders increases

57 times

490.00 1280 3369 10000 18800 27800 5000 10000 15000 20000 25000 30000 2009 2010 2011 2012 2013 2014

(10K)

IOE architecture Cloud

slide-7
SLIDE 7

“11-11”order peak value explodes

2014/11/11 Order peak value: 80K per sec. In 6 years, order peak values increases

200 times

IOE architecture Cloud

400 1000 3200 14000 42000 80000 10000 20000 30000 40000 50000 60000 70000 80000 90000 2009 2010 2011 2012 2013 2014

(per second)

slide-8
SLIDE 8

“11-11”transcation peak value explodes

2014/11/11 Alipay: 38.5K per sec. In 6 years, transaction peak values increases

193 times

IOE architecture Cloud

200 500 1200 3800 15000 38500 5000 10000 15000 20000 25000 30000 35000 40000 45000 2009 2010 2011 2012 2013 2014

(per second)

slide-9
SLIDE 9

“11-11”transcation peak value explodes

2015/11/11 Alipay: 85.9K per sec. Visa: 56K per sec. in lab 14K per sec in real Master: 40K per sec. in lab

Alipay: Strongest processing power among all payment systems

In 7 years, transaction peak values increases

430 times

IOE architecture Cloud

200 500 1200 3800 15000 38500 5000 10000 15000 20000 25000 30000 35000 40000 45000 2009 2010 2011 2012 2013 2014

(per second)

slide-10
SLIDE 10

Capacity of processing financial big data explodes

In 6 years, data processing capacity increases

100K times

Oracle RAC data Single BU data Hadoop Alibaba Group ODPS Data from Ecosystems

2014/11/11: Total processed data: 100PB

0.10 10 30 100 1024 10240 2000 4000 6000 8000 10000 12000 2009 2010 2011 2012 2013 2014

Data processing capacity

(PB)

slide-11
SLIDE 11

“11-11”needs information security

In 11/11/2014,Alibaba successfully defended 200 DDoS attacks; the biggest attack volume is bigger than 100Gbps.

slide-12
SLIDE 12

The company behind 11-11?

slide-13
SLIDE 13

Alibaba History

1999 2001 2002 2003 2004 2005 2007 2008 2009 2010 2011 2012 2013 2014 2015

Alibaba founded in Jack Ma’s apartment in Hangzhou Alibaba.com surpasses 1M registered users Company breaks even RMB1M revenue daily; Taobao is founded RMB1M profit daily; Alipay is launched RMB1M tax paid daily; Acquired China Yahoo! Alibaba.com lists in HK Alibaba Cloud Computing is founded Tmall.com is introduced Taobao is re-

  • rganized into

Taobao Market- place, Tmall.com and eTao Alibaba Group establishes the Alibaba Foundation with a sizeable fund dedicated to social causes. AliExpress is launched; Juhuasuan and eTao are introduced Alibaba Group is reorganized into 25 business units; announces a plan to set up Small and Micro Financial Services Group; leads the formation

  • f Cainiao

September 19th : Alibaba Group’s IPO (Nasdaq: BABA) Alibaba announced “Hangzhou +Beijing” dual- center strategy

slide-14
SLIDE 14

Alibaba Strategy

Data

e-Commerce

Finance Logistics Health Entertain- ment

slide-15
SLIDE 15

Alibaba Ecosystem

Alibaba e-commerce infrastructure

DATA

buyer sellers

Logistics Partners Other Participants Logistics Infrastructure O2O

Social Network

Digital Entertainment

UC Browser

Online Marketing Ant Financial

slide-16
SLIDE 16

Technologies at Alibaba

slide-17
SLIDE 17

Key Technologies

9 large data centers. Active-active data centers across multiple cities technology Break 4 world records on Sort Benchmark 2015. 3 time faster than Apache Spark A secure e-commerce ecosystem for online transactions, data exchanges, cloud computing and mobile interactions. Ten billion level RPC and message processing capability Super-large scale learning from data with billions of attributes and billions of samples Dynamic restoration & deployment ability of mobile clients Large-scale real-time “search by image” technology that powers r camera-based product search Deep user profiling & interest mining to better connect advertisers to their target audience

slide-18
SLIDE 18

Technology challenges that Alibaba faced in 2009

High hardware cost

IOE (IBM servers, Oracle databases and EMC storage): expensive for scalability. In a few years, the hardware cost could have bankrupted the whole company!

Data islands

Data from different business units are located in different clusters. Extremely hard to share data with each other!

No common data standard for usage

Due to isolated storage, data are moved between clusters with repeated storage and computations.

slide-19
SLIDE 19

Milestones

19

ODPS platform

Massive data processing platform went online at Alibaba financial service.

2010.04 2013.08 5K node in a single cluster 2012.07 CDO started to integrate data 2015 Public service 2009.09 AliCloud Started 2014.11 Big data platform & products have been fully tested by 11-11

slide-20
SLIDE 20

Data-driven business at Alibaba

Data business

Internet Finance Search & ads for ecom Info Security Intelligent logistics Credit Customer service

Dozens of business units Hundreds of PB data Thousands of engineers

slide-21
SLIDE 21

Cloud Computing & Infrastructure

slide-22
SLIDE 22

09/10/2009

Aliyun.com Inc established

08/27/2010

Apsaras became the platform of four applications: Search, Mail, Image Storage, AliFinance

07/28/2011

Aliyun.com went

  • nline, releasing 1st

cloud service: ECS

08/15/2013

5000-node Asparas cluster went into production

07/2014

Launched ODPS, which is capable of processing 100 petabytes of data in six hours.

Aliyun

5K

10/15/2015

Achieved ITU Telecom World Entrepreneurship Award for excellence in providing and promoting innovative ICT solutions with social impact

slide-23
SLIDE 23

World Records

  • Break 4 world records on Sort Benchmark 2015
  • Sort 100 TB of data in 377 seconds, 3 times faster than the record set by

Apache Spark in 2014

slide-24
SLIDE 24

ODPS

External data Oracle Mysql Text … Realtime batch data channel

Tunnel SQL MR Graph

Console

SDK

Base Dev. Platform Data Process Center

Security PAI Stream SQL

External data Oracle Mysql Text OTS … CDP CDP A/B Test

Online System Data Warehouse Analysis & Mining Application

Realtime batch data channel

Tunnel

Distributed Safety Easy to Use Authorization and Access

Capable of processing 100 petabytes of data in 6 hours.

slide-25
SLIDE 25

Public Service

slide-26
SLIDE 26

Public Service

CNTV肩负对央视众多频道内容资源进行整合和转换通过互联网提 供在线服务

CNTV: online resource integra.on for CCTV (biggest TV sta.on in China) and online service. Cost saving: >50%

slide-27
SLIDE 27

Reliable service via data sharing

Active-active data centers across multiple cities

We can distribute network traffic between our Shanghai, Hangzhou and Shenzhen data center without any interruption.

slide-28
SLIDE 28

World-wide Data Centers

9 large data centers in China Mainland, Hong Kong, Singapore and

America.

slide-29
SLIDE 29

Green Data Center: Zhang Bei

  • Zhang Bei: Strong wind,

averaged temperature < 37 degree F.

  • Deploy solar energy, wind

power and cold water cooling solutions: 100% green

  • Estimated at least 45%

energy saving for cooling.

slide-30
SLIDE 30

Green Data Center: Thousand Island Lake

千岛湖

Energy saving

Cold lake water for cooling PDU index: 1.3

90% time: No need for electricity

Cut 10K ton coal usage

slide-31
SLIDE 31

Information Security

  • Intercept 3 million deceptive transaction requests every day (user identification, etc.).
  • Totally successful defense of DDoS attack:546,859GB
  • Totally successful defense of attack since 2014: 9,383,772,384 times
  • Dec. 20th, 2014, Defeated a DDoS attack,which lasted 14 hours and peak traffic reached

453.8 Gb per second

slide-32
SLIDE 32

Big Data & Machine Learning

slide-33
SLIDE 33

11-11 customer service

On 11/11/2015

  • Total service volume: > 10

Millions

  • > 95%: supported by

intelligent robots

  • < 5% : supported by

human

  • >98% phone call: went

through

slide-34
SLIDE 34

Data Computing

Deep Learning Recommen dation Natural language Bayes Optimization Time series analysis Intelligent Customer Service Credit & Loans Identification

Risk Control

AI at Alibaba

Advertising…

slide-35
SLIDE 35

Intelligent customer service

§ Speech recogni.on § Deep Learning for problem iden.fica.on & QA § Natural language processing § Knowledge base construc.on

slide-36
SLIDE 36

Large-scale Machine Learning

Parameter Server Deep Learning Online Learning Hashing etc.

  • Increasing CTR > 10%
  • Boosting ad revenue

significantly

  • Increasing CTR > 5%
  • Speeding up model training

more than 3 times

  • Conduct online learning on

Double-11 Shopping Festival in 2015

Search Ads Tmall Real-time Recommendation Taobao Mobile Search

slide-37
SLIDE 37
  • Using big data technologies to discover risks and identify abnormal behaviors.

CTU

Normal behavior Further verification Fraud Verification Challenge High Risk Low Risk Transaction authorization/ success Logon Request Events Collection

Realtime calculation in 100ms

Security Identification Terminal Management Risk Recognition

Rule Model Machine Learning

Offline computing environment(DW) Support playback of all data training & events

Risk Control (CTU)

slide-38
SLIDE 38
  • Address challenges for small and medium

enterprises to meet their financial need:

– Small loans – Tedious loan approval procedure – Low approval rate – Long cycle

Micro Loans

Farmers need loans for tractors

slide-39
SLIDE 39
  • No human interven.on based
  • n machine learning methods
  • Default rate way less than

tradi.onal banks Data-driven micro loans for small enterprises

  • 10 millions plus loans
  • 90 billion+ RMB por_olio
  • Over 300 thousand enterprises
slide-40
SLIDE 40

180 Thousand loans for small and medium enterprises in rural areas, in total 17.8 Billion Yuan (~2.8 Billion USD)

40

Microloans for rural areas in China

slide-41
SLIDE 41
  • Insurance contracts has year-on-year growth rate of 100%.
  • Over 1 billion contracts in 2013
  • Record-breaking over 100 million contracts one day on

November 11, 2013

Shipping Insurance for Returned Products

40.00% 60.00% 80.00% 100.00% 120.00% 140.00%

Overall rate of compensaLon

slide-42
SLIDE 42

Shipping Insurance

E-commerce trading platform Buyers Insurance Pricing

Insurance platform faces two risks:

  • Incorrect pricing
  • Fraud
slide-43
SLIDE 43

p Uniform 5% fixed rate

Fixed rate

p Solely based on historical data and demographics

Actuarial approach Simple Easy to explain

p Pricing model based on a few parameters

Model driven pricing Improved accuracy

p Millions of features, real time pricing p Machine learned models

Dynamic pricing based on big data High accuracy

Shipping Insurance for Returned Products

slide-44
SLIDE 44

Conclusions

slide-45
SLIDE 45

Feedback loop

Data Technology Business

slide-46
SLIDE 46

Technology Expands Business Boundary

Artificial Intelligence Quantum

  • Comm. &

Comp. Software Defined Network Smart City

  • Enable devices to think & learn like

human-being.

  • Secure communication & high-

performance computation.

  • New network framework providing an

innovation solution for network virtualization.

  • Future cities fusion of cloud comp.,

big data analysis, mobile Internet and social networks.

slide-47
SLIDE 47

THANK YOU