18. Beyond the Hype of Machine Learning and Artificial Intelligence - - PowerPoint PPT Presentation
18. Beyond the Hype of Machine Learning and Artificial Intelligence - - PowerPoint PPT Presentation
18. Beyond the Hype of Machine Learning and Artificial Intelligence Tue May 21, 2019, 1PM Scott Burt, President & CEO, Integro Poll Question Do you think AI, ML, and/or NLP could help with InfoGov tasks? UNIVERSAL ENHANCEMENTS
- 18. Beyond the Hype of
Machine Learning and Artificial Intelligence
Tue May 21, 2019, 1PM Scott Burt, President & CEO, Integro
Poll Question
Do you think AI, ML, and/or NLP could help with InfoGov tasks?
My theory
- n our use
- f AI for
InfoGov
UNIVERSAL ENHANCEMENTS INTEGRATED GET EDUCATED ACCESSIBLE BUT… HANG ON
Common IG Use Cases
Classification Retention Findability Value Cleanup Enrichment
WHAT IS AI?
AI refers to machines that can learn, reason, and act for themselves
Is this AI?
November 10, 2018, MIT Technology Review
WHAT IS MACHINE LEARNING?
Looking for patterns in massive amounts of data
Machine Learning:
Looking for patterns in massive amounts of data
Document classification Image recognition OCR Voice transcription Document capture
Source: Box
Source: Box
WHAT IS NATURAL LANGUAGE PROCESSING (NLP)?
Computer understanding, analysis and processing of human language
Source: https://azure.microsoft.com/en-us/services/cognitive-services/text-analytics/ See also: https://azure.microsoft.com/en-us/services/cognitive-services/language-understanding-intelligent-service/
SUPERVISED VS. UNSUPERVISED LEARNING
Unsupervised Learning
Source: Brainspace
Supervised Learning
Source: Integro Email Manager
IG USE CASES TO CONSIDER
Seek repetitive tasks
Seek out low-skill tasks that
- ccupy the time of high-skilled
workers
- Is the task data-driven?
- Do you have the data to support
the automation of the task?
- Do you really need the scale that
automation can provide?
Kristian J. Hammond, Northwestern, Professor of Computer Science
Classification, Records and Defensible Disposal
Source: Integro Email Manager
Classification Case Study | Email Archives
- F500 company
- 1 Billion emails in archive
- 87 records categories
- 5 months to process the entire archive
- 256M identified as records of 1B
- Remainder emails not on Hold eligible for
defensible disposal
Classification Case Stud(ies) | Email Management with Human Oversight
- Integro Email Manager™
- Models trained to match full or partial file
plan
- Assists the user by auto classifying and
suggesting most likely categories
- Enables 3-zone email management with light
impact on users
Classification Case Study | Records Audit
- Fortune 500
pharmaceutical company
- 40TB of classified content
- Training was ‘easy’
considering the assembled corpus
- Processed and validated the
accuracy/quality of records categorization
Text Capture for Enrichment
Privacy Compliance
“Personal information” is defined under the CCPA as “information that identifies, relates to, describes, is capable of being associated with, or could reasonably be linked, directly or indirectly, with a particular consumer or household.”
26
Enrichment/Findability | Audio & Video
Source: Box.com
THE LEADING PROVIDERS
Names you know.
Natural Language Processing (NLP) Machine-Learning (ML) Artificial Intelligence (AI) LUIS (Language Understanding Intelligent Service) Azure Machine Learning Service, Cubeflow, Cloud TPUs Azure AI, Text Analytics API Amazon Comprehend Amazon SageMaker Amazon Textract, Amazon Rekognition Watson Natural Language Understanding Watson Machine Learning, Watson Visual Recognition Watson AI, Watson Studio, Watson Discovery Cloud Natural Language Cloud AI Platform AI, prepackaged solutions (e.g., Document Understanding AI, pretrained AI model for healthcare)
Pros and Cons
- f Using AI for
InfoGov
Pros
- Automate previously
impossible tasks – like classify a billion documents
- Cheap to start, just
subscribe
- No AI experience
‘necessary’
- Rapid progress!
- AI as a feature in products
– i.e., Box Skills, Integro Email Manager Cons
- The time and effort to train
the system
- Still early days
- Internal competing AI
projects
- Big vendors not focused on
IG
- Explicability & acceptance
challenges
- Ethics and bias issues
- Read… a lot
- Look for opportunities – low risk, high
reward, bite-sized at first
- Get a free account with a cloud
provider
- Review current vendors and their
plans – The best first projects will likely be AI as features in products – Vendors will be seeking clients to be early adopters and references
- Engage product-neutral consulting
services
- Request a workshop and presentation
- n Leveraging ML to Auto Classify
Content for better Information Governance
Recommended ‘reading’
Poll Question
Do you think AI, ML, and/or NLP could help with InfoGov tasks?
IG Use Cases to consider
- Forms data extraction
- Text analysis
- Text entity extraction
- Content enrichment
- Voice Recording transcription
- Video transcription
- Mergers and Acquisitions
- Improving searchability and findability
- Content Cleanup/Defensible Disposal
- Email management
Scott Burt
President & CEO, Integro 720-904-1601 | sburt@integro.com Twitter: @integroburt LinkedIN: linkedin.com/in/scottburt
Thank You