Text Analytics from a galaxy far, far away

SLIDE 1

Text Analytics from a galaxy far, far away

PRESENTED BY: C.BATTISTON TASS MEETING, JUNE 21, 2019

SLIDE 2

What is Text Analytics?

• Text analytics is a way to unlock the meaning in unstructured text. It lets you uncover patterns and themes, so you know what customers are thinking about and what they want and need.

• "Text analysis" is a broad term covering the various processes by which text and natural-language documents can be modified so that they can be organized and described.

SLIDE 3

What is “Star Wars”?

• Overall premise: The Rebel Alliance is fighting against the evil Galactic Empire.

• Characters:

  ◦ Good guys – Luke Skywalker, Princess Leia, Han Solo, Ben Kenobi, C-3PO

  ◦ Bad guys – Darth Vader, The Emperor

• The Death Star is the main space station / weapon of the Empire.

SLIDE 4
SLIDE 5

Data for this presentation

• The data was found at https://github.com/gastonstat/StarWars/tree/master/Text_files

• It requires copying and pasting into Excel or a TXT file, whichever you prefer.
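For anyone who pastes the text into a TXT file, a minimal Python sketch of reading it back in might look like the following. It is not the import used in the talk. It assumes each pasted row holds a quoted row number, a quoted character name, and a quoted line of dialogue separated by spaces, with a two-column header on top; check that layout against your own copy. The sample rows and the `parse_script` helper are hypothetical.

```python
import csv
import io

# Hypothetical sample in the ASSUMED layout: "row number" "character" "dialogue"
SAMPLE = '''"character" "dialogue"
"1" "THREEPIO" "We're doomed!"
"2" "LUKE" "But I was going into Tosche Station to pick up some power converters!"
'''


def parse_script(text):
    """Return a list of (character, dialogue) tuples from pasted script text."""
    rows = []
    reader = csv.reader(io.StringIO(text), delimiter=" ", quotechar='"')
    for fields in reader:
        # Skip the two-column header, blank lines, and anything malformed.
        if len(fields) != 3:
            continue
        _, character, dialogue = fields
        rows.append((character.strip(), dialogue.strip()))
    return rows


lines = parse_script(SAMPLE)
print(lines[0])
```

To read a real file, pass `open(path, encoding="utf-8").read()` to `parse_script` in place of `SAMPLE`.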

SLIDE 6

Preparing/Cleaning the Data (Excel-based)

SLIDE 7

Import and Data Manipulation

SLIDE 8
SLIDE 9
SLIDE 10

Cleaning some of the data in SAS

There were a number of different UPDATE statements I ran to clean the data, but they all had the same basic format.
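As a rough illustration of what a series of same-format UPDATE statements can look like, here is a sketch using Python's built-in `sqlite3` module rather than SAS. The table name, column names, sample rows, and the specific name fixes are all hypothetical; the actual clean-up rules from the talk are not reproduced here.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE script (line_no INTEGER, character TEXT, dialogue TEXT)")
conn.executemany(
    "INSERT INTO script VALUES (?, ?, ?)",
    [
        (1, "THREEPIO", "We're doomed!"),
        (2, "BEN", "Use the Force, Luke."),
        (3, "LUKE ", "I'm Luke Skywalker. I'm here to rescue you."),
    ],
)

# Hypothetical clean-up rules as (old value, new value) pairs.
# Every fix reuses the same UPDATE ... SET ... WHERE ... shape.
fixes = [("THREEPIO", "C-3PO"), ("BEN", "BEN KENOBI")]
for old, new in fixes:
    conn.execute("UPDATE script SET character = ? WHERE character = ?", (new, old))

# One more statement of the same kind: strip stray whitespace from names.
conn.execute("UPDATE script SET character = TRIM(character)")

characters = [
    row[0] for row in conn.execute("SELECT character FROM script ORDER BY line_no")
]
print(characters)
```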

SLIDE 11

Analysis – Line Counts
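A line count per character can be sketched in a few lines of Python. This is an illustration of the idea, not the SAS code from the talk, and the sample rows are hypothetical stand-ins for the cleaned data.

```python
from collections import Counter

# Hypothetical (character, dialogue) rows standing in for the cleaned script.
rows = [
    ("LUKE", "I want to learn the ways of the Force and become a Jedi like my father."),
    ("HAN", "I've got a bad feeling about this."),
    ("LUKE", "You don't believe in the Force, do you?"),
    ("LEIA", "Help me, Obi-Wan Kenobi. You're my only hope."),
]

# Each row is one spoken line, so counting rows per character gives line counts.
line_counts = Counter(character for character, _ in rows)

for character, count in line_counts.most_common():
    print(f"{character}: {count}")
```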

SLIDE 12
SLIDE 13
SLIDE 14

Analysis – Usage of the word “Force”
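One way to count how often each character says “Force” is a case-insensitive whole-word match, sketched below in Python. Whether the talk counted whole words only, or also variants such as “forced”, is not stated, so the matching rule here is an assumption, and the sample rows are hypothetical.

```python
import re
from collections import Counter

# Whole-word, case-insensitive match: counts "Force" and "force",
# but not "forced" or "forces" (an assumed rule).
FORCE = re.compile(r"\bforce\b", re.IGNORECASE)

# Hypothetical (character, dialogue) rows standing in for the cleaned script.
rows = [
    ("BEN", "The Force will be with you, always."),
    ("VADER", "I find your lack of faith disturbing."),
    ("LUKE", "You don't believe in the Force, do you?"),
    ("BEN", "Use the Force, Luke."),
]

force_counts = Counter()
for character, dialogue in rows:
    hits = len(FORCE.findall(dialogue))
    if hits:
        force_counts[character] += hits

for character, count in force_counts.most_common():
    print(f"{character}: {count}")
```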

SLIDE 15

Analysis – Amount of text spoken
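The amount of text spoken can be approximated by totalling words per character. The sketch below splits on whitespace, which is only one possible measure (character counts would work too); it is not the method shown in the talk, and the sample rows are hypothetical.

```python
from collections import defaultdict

# Hypothetical (character, dialogue) rows standing in for the cleaned script.
rows = [
    ("HAN", "I've got a bad feeling about this."),
    ("LUKE", "I want to learn the ways of the Force and become a Jedi like my father."),
    ("HAN", "Never tell me the odds."),
]

# Total whitespace-separated words spoken by each character.
words_spoken = defaultdict(int)
for character, dialogue in rows:
    words_spoken[character] += len(dialogue.split())

for character, total in sorted(words_spoken.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{character}: {total} words")
```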

SLIDE 16
SLIDE 17

Which characters’ lines were the longest?
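One way to answer this is to rank characters by their average line length. The sketch below measures length in whitespace-separated words; whether the talk used words or characters is not stated, so that choice is an assumption, and the sample rows are hypothetical.

```python
from collections import defaultdict

# Hypothetical (character, dialogue) rows standing in for the cleaned script.
rows = [
    ("LEIA", "Help me, Obi-Wan Kenobi. You're my only hope."),
    ("HAN", "Chewie, get us out of here!"),
    ("HAN", "Great, kid. Don't get cocky."),
]

# Collect the word count of every line, grouped by character.
lengths = defaultdict(list)
for character, dialogue in rows:
    lengths[character].append(len(dialogue.split()))

# Average words per line, then rank from longest to shortest.
avg_length = {c: sum(v) / len(v) for c, v in lengths.items()}
ranking = sorted(avg_length.items(), key=lambda kv: kv[1], reverse=True)

for character, avg in ranking:
    print(f"{character}: {avg:.1f} words per line")
```

With real data it may be worth excluding characters who have only a handful of lines, since a single long speech can dominate their average.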

SLIDE 18

References and Recommended Reading

• Text Mining and Analysis: Practical Methods, Examples, and Case Studies Using SAS by Goutam Chakraborty, Murali Pagolu, Satish Garla

• SAS Essentials: Mastering SAS for Data Analytics by Alan C. Elliott, Wayne A. Woodward

• A simple approach to text analysis using SAS functions by Wilson Suraweera, Jaya Weerasooriya, Neil Fernando https://www.sas.com/content/dam/SAS/support/en/sas-global-forum-proceedings/2018/2557-2018.pdf

• Text Mining of Open-Ended Survey Data by Brandon J. Hosek, MA, Barbara E. Wojcik, PhD https://www.sas.com/content/dam/SAS/support/en/sas-global-forum-proceedings/2019/3705-2019.pdf

SLIDE 19

Thanks and Contact Info

You can reach me at: Darth.Pathos@gmail.com

I’ve got to Han it to you, Yoda best audience!