SLIDE 1
MPA 635: Data Visualization
November 13, 2018
SLIDE 2
PLAN FOR TODAY
Surveys and qualitative data
Digital humanities
Visualizing text with R
SLIDE 3
SURVEYS AND QUALITATIVE DATA
SLIDE 4
SINGLE RESPONSE QUESTIONS
SLIDE 5
MULTIPLE RESPONSE QUESTIONS
SLIDE 6
COOCCURRENCE ANALYSIS
SLIDE 7
FREE RESPONSES
SLIDE 8
SLIDE 9
IS THIS OKAY?
SLIDE 10
WORD CLOUDS FOR GROWNUPS
Counting words, but in fancier ways
SLIDE 11
SLIDE 12
SLIDE 13
DIGITAL HUMANITIES
SLIDE 14
CRASH COURSE IN COMPUTATIONAL LINGUISTICS
Tokens, lemmas, and parts of speech
Topics and LDA
tf-idf
Sentiment analysis
Fingerprinting
SLIDE 15
TIDY TEXT
SLIDE 16
TOKENS
Element of the text
Word
Sentence
Verse
Line
Paragraph
n-gram
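To make the idea of a token concrete, here is a minimal sketch that splits a short text into word, sentence, and n-gram tokens and counts them. It uses only the Python standard library as a stand-in for the R workflow this deck covers. The helper names (`word_tokens`, `sentence_tokens`, `ngrams`) and the sample text are hypothetical, and the regular expressions are rough heuristics, not a full tokenizer.

```python
import re
from collections import Counter


def word_tokens(text):
    """Lowercase the text and return its word tokens (letters and apostrophes)."""
    return re.findall(r"[a-z']+", text.lower())


def sentence_tokens(text):
    """Split after '.', '!' or '?' followed by whitespace; a rough heuristic."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def ngrams(tokens, n):
    """Return every run of n consecutive tokens as a tuple."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


# Made-up sample text, for illustration only
text = "The cat sat. The cat ran. A dog sat."

words = word_tokens(text)
word_counts = Counter(words)                # token frequency
bigram_counts = Counter(ngrams(words, 2))   # n-gram frequency with n = 2

print(sentence_tokens(text))
print(word_counts.most_common(3))
print(bigram_counts.most_common(1))
```

Changing the token unit (word, sentence, bigram) changes what the frequency counts describe, which is why the choice of token comes before any counting.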
SLIDE 17
TOKEN FREQUENCY
SLIDE 18
N-GRAM FREQUENCY
SLIDE 19
PARTS OF SPEECH
SLIDE 20
PART OF SPEECH FREQUENCY
SLIDE 21
ARTSY STUFF
SLIDE 22
SLIDE 23
SENTIMENT ANALYSIS
How positive or negative a text is
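One common way to score how positive or negative a text is: look up each word in a sentiment lexicon and add up the scores. The sketch below assumes that approach, written with the Python standard library as a stand-in for an R workflow. The six-word `LEXICON`, its scores, and the sample sentences are invented for illustration. Real lexicons hold thousands of scored words.

```python
import re

# Hypothetical toy lexicon: positive words score above 0, negative words below 0
LEXICON = {"good": 1, "great": 2, "happy": 1, "bad": -1, "awful": -2, "sad": -1}


def sentiment_score(text, lexicon=LEXICON):
    """Sum the lexicon scores of every word; words not in the lexicon count as 0."""
    words = re.findall(r"[a-z']+", text.lower())
    return sum(lexicon.get(word, 0) for word in words)


# Made-up sample sentences
samples = [
    "A great day with good friends",
    "An awful, sad ending",
    "The report is on the table",
]

for sample in samples:
    print(sentiment_score(sample), sample)
```

A word-by-word sum ignores negation and context ("not good" still scores as positive), so scores like these are a rough signal only.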
SLIDE 24
SENTIMENT ANALYSIS
SLIDE 25
SLIDE 26
TF-IDF
Term frequency-inverse document frequency
How important a term is to one document compared to the rest of the documents in the collection
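A minimal worked sketch of tf-idf, assuming one common definition: term frequency is a term's count divided by the number of tokens in its document, and inverse document frequency is the natural log of the number of documents divided by the number of documents containing the term. Other weightings exist. The function name `tf_idf` and the three tiny documents are hypothetical, and the code is standard-library Python rather than R.

```python
import math
from collections import Counter


def tf_idf(docs):
    """docs maps a document name to its list of tokens.

    Returns {name: {term: tf * idf}} with
    tf = count / tokens in the document and
    idf = ln(number of documents / documents containing the term).
    """
    n_docs = len(docs)
    doc_freq = Counter()
    for tokens in docs.values():
        doc_freq.update(set(tokens))  # count each term once per document

    scores = {}
    for name, tokens in docs.items():
        counts = Counter(tokens)
        total = len(tokens)
        scores[name] = {
            term: (count / total) * math.log(n_docs / doc_freq[term])
            for term, count in counts.items()
        }
    return scores


# Made-up documents, already tokenized
docs = {
    "a": "the budget vote passed".split(),
    "b": "the parade route changed".split(),
    "c": "the budget hearing ended".split(),
}
scores = tf_idf(docs)

for name, terms in scores.items():
    top_term = max(terms, key=terms.get)
    print(name, top_term, round(terms[top_term], 3))
```

Under this definition, a term that appears in every document (here, "the") gets an idf of ln(1) = 0, so it drops out no matter how often it occurs. Terms confined to a single document score highest.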
SLIDE 27
SLIDE 28
TOPIC MODELING
SLIDE 29
LATENT DIRICHLET ALLOCATION
SLIDE 30
CLUSTERS OF RELATED WORDS
SLIDE 31
TRACK TOPICS OVER TIME
SLIDE 32
SLIDE 33
FINGERPRINTING
Analyze richness or uniqueness of a document
Punctuation patterns, vocabulary choices, sentence length
Hapax legomenon: a word that occurs only once in a text
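A few of these fingerprinting measures can be sketched in a handful of lines. The example below computes mean sentence length, a type-token ratio (distinct words divided by total words, one simple measure of vocabulary richness), and the list of hapax legomena. The function name `fingerprint`, the sample text, and the choice of these three measures are assumptions for illustration. The code is standard-library Python rather than R and expects non-empty text.

```python
import re
from collections import Counter


def fingerprint(text):
    """Return simple stylistic measures for a non-empty text."""
    # Rough sentence split: break after '.', '!' or '?' followed by whitespace
    sentences = [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]
    words = re.findall(r"[a-z']+", text.lower())
    counts = Counter(words)
    return {
        "mean_sentence_length": len(words) / len(sentences),  # words per sentence
        "type_token_ratio": len(counts) / len(words),         # distinct / total
        "hapax_legomena": sorted(w for w, c in counts.items() if c == 1),
    }


# Made-up sample text
text = "The rain fell. The rain stopped. Birds sang loudly."
profile = fingerprint(text)
print(profile)
```

Comparing profiles like this across documents or authors is the basic move: the raw numbers mean little alone, but differences between texts can be plotted.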
SLIDE 34
Sentence length
SLIDE 35
Hapax legomena
SLIDE 36
Verse length
SLIDE 37
VISUALIZING TEXT WITH R