GTC 2018
dAIgnosis,INC.
S8371 S8371 How How We We Can Can Analy Analyze Pr Profile fr from - - PowerPoint PPT Presentation
GTC 2018 S8371 S8371 How How We We Can Can Analy Analyze Pr Profile fr from Re Real Tim Time Conver Con ersa sation tion by by Uns Unsuper upervised ed Learning Learning 03/28/2017 dAIgnosis,Inc. dAIgnosis,INC. GTC 2018 CO COMP
GTC 2018
dAIgnosis,INC.
GTC 2018
dAIgnosis,INC.
Design / development engineers who dedicated to Google Cloud Computing services gathered. Started research on AI technology based on medical system technology development in a national project Established the company May 2017 with the theme of deep learning using GPU.
VP of Google head office joined as a director. Advance technology development to build the original models while studying multiple cloud platforms. Started research using NVIDIA DGX‐1 *7 +1 units (Volta in April 2018) from affiliates. Planned to start real‐time analysis of text combined with image,etc. from the beginning
GTC 2018
dAIgnosis,INC.
GTC 2018
dAIgnosis,INC.
e.g. Call center conversation Unlabeled data
feature extraction to label
Supervised learning Extraordinary task e.g. Complaint handling Semi‐supervised Learning Routine task e.g.. Talk script
GTC 2018
dAIgnosis,INC.
GTC 2018
dAIgnosis,INC.
Unlabeled Data
CNN for SC Classifica tion
Unlabeled data Labeled data
Clustering CNN for SC
Labeled data
Inference
SC sentence classification
GTC 2018
dAIgnosis,INC.
Unlabeled Data
Classifica tion
Unlabeled data Labeled data
Clustering CNN for SC
Labeled data
Inference
GTC 2018
dAIgnosis,INC.
Labeled data
Inference Text data Label Display processing Handle if based on Script on the business scenario or not
Trained set
CNN for SC
GTC 2018
dAIgnosis,INC.
GTC 2018
dAIgnosis,INC.
Is the room available on dd/mm? Is breakfast served? Can I make a reservation on dd/mm? Do you have breakfast? The next room is noisy Room xx is noisy though What time is check‐in? What time can I check in?
GTC 2018
dAIgnosis,INC.
Is the room available on dd/mm? Is breakfast served? Can I make a reservation on dd/mm? Do you have breakfast? The next room is noisy Room xx is noisy though What time is check‐in? What time can I check in?
GTC 2018
dAIgnosis,INC.
Learning model
Category 1 Category 3 Category 4 Category 2
GTC 2018
dAIgnosis,INC.
Do you have breakfast? Learning model Category 1 Category 3 Category 4 Category 2
GTC 2018
dAIgnosis,INC.
Is the room available on dd/mm? Is breakfast served? Can I make a reservation on dd/mm? Do you have breakfast? The next room is noisy Room xx is noisy though What time is check‐in? What time can I check in? Do you have breakfast?
GTC 2018
dAIgnosis,INC.
Category 3
Do you have breakfast? Learning model Category 1 Category 3 Category 4 Category 2
GTC 2018
dAIgnosis,INC.
Learned Acknowledgment message Database We have a plan with breakfast.
Category 3
GTC 2018
dAIgnosis,INC.
GTC 2018
dAIgnosis,INC.
English (20 vowels+24 consonant=44 phoneme): /iː/, /ɪ/, /e/, /æ/, /ʌ/, /ɑː/, /ɒ/, /ɔː/, /ʊ/, /uː/, /ɜː/, /ə/, /eɪ/, /aɪ/, /ɔɪ/, /əʊ/, /aʊ, ɑʊ/, /ɪə/, /eə/, /ʊə/; /p/, /b/, /t/, /d/, /k/, /g/, /ʧ/, /ʤ/, /f/, /v/, /θ/, /ð/, /s/, /z/, /ʃ/, /ʒ/, /h/, /m/, /n/, /ŋ/, /l/, /r/, /w/, /j/ Japanese(5 vowels+16 consonants+3 peculiars=24phoneme): /a/, /i/, /u/, /e/, /o/; /j/, /w/; /k/, /s/, /c/, /t/, /n/, /h/, /m/, /r/, /g/, /ŋ/, /z/, /d/, /b/, /p/; /N/, /T/, /R/ Reference: http://user.keio.ac.jp/~rhotta/hellog/2012‐02‐12‐1.html
GTC 2018
dAIgnosis,INC.
GTC 2018
dAIgnosis,INC.
GTC 2018
dAIgnosis,INC.
GTC 2018
dAIgnosis,INC.
GTC 2018
dAIgnosis,INC.
GTC 2018
dAIgnosis,INC.
GTC 2018
dAIgnosis,INC.
Our own DGX‐1 infrastructure Business application End users Through business application Trained data Trained model
GTC 2018
dAIgnosis,INC.
Deep Learning processing Business Application
training dataset
Distributed processing
Raw Data Export ~ File Transfer Generating the training dataset
Deep learning
Raw Text Data
150 minutes by 8 GPUs (Training on 50 Epochs)
192 hour conversation per day (20MB of text data)
Note: Execution time of prediction on the machine powered by only CPU. ・220 ms (the training of SCDV using 3,000 dimensions, then) ・430 ms (the training of SCDV using 6,000 dimensions, then)
GTC 2018
dAIgnosis,INC.
GTC 2018
dAIgnosis,INC.
Reference: https://arxiv.org/pdf/1301.3781.pdf
GTC 2018
dAIgnosis,INC.
Reference: https://arxiv.org/pdf/1612.06778.pdf
GTC 2018
dAIgnosis,INC.
(Convolutional Neural Networks for Sentence Classification)
Reference: https://arxiv.org/pdf/1408.5882.pdf
GTC 2018
dAIgnosis,INC.
Reference: http://www.aclweb.org/anthology/C14‐1008
GTC 2018
dAIgnosis,INC.
GTC 2018
dAIgnosis,INC.
Word2Vec
Efficient Estimation of Word Representations in Vector Space
Tomas Mikolov, Kai Chen, Greg Corrado, Jeffrey Dean https://arxiv.org/pdf/1301.3781.pdf
word2vec Parameter Learning Explained
Xin Rong https://arxiv.org/pdf/1411.2738.pdf
SCDV
Sparse Composite Document Vectors using soft clustering over distributional representations
Dheeraj Mekala, Vivek Gupta, Bhargavi Paranjape, Harish Karnick https://arxiv.org/pdf/1612.06778.pdf https://dheeraj7596.github.io/SDV/
CNN for Sentence Classification
Convolutional Neural Networks for Sentence Classification
Yoon Kim https://arxiv.org/pdf/1408.5882.pdf
Deep Convolutional Neural Networks for Sentiment Analysis of Short Texts
Deep Convolutional Neural Networks for Sentiment Analysis of Short Texts
Cicero dos Santos, Maira Gatti http://www.aclweb.org/anthology/C14‐1008
GTC 2018
dAIgnosis,INC.
GTC 2018
dAIgnosis,INC.