TEXT MPA 635: Data Visualization November 13, 2018 P L A N F O R - - PowerPoint PPT Presentation

text
SMART_READER_LITE
LIVE PREVIEW

TEXT MPA 635: Data Visualization November 13, 2018 P L A N F O R - - PowerPoint PPT Presentation

TEXT MPA 635: Data Visualization November 13, 2018 P L A N F O R T O D A Y Surveys and qualitative data Digital humanities Visualizing text with R S U R V E Y S A N D Q U A L I TAT I V E DATA S I N G L E R E S P O N S E Q U E S T I O N


slide-1
SLIDE 1

TEXT

MPA 635: Data Visualization November 13, 2018

slide-2
SLIDE 2

P L A N F O R T O D A Y Surveys and qualitative data Digital humanities Visualizing text with R

slide-3
SLIDE 3

S U R V E Y S A N D Q U A L I TAT I V E DATA

slide-4
SLIDE 4

S I N G L E R E S P O N S E Q U E S T I O N S

slide-5
SLIDE 5

M U L T I P L E R E S P O N S E Q U E S T I O N S

slide-6
SLIDE 6

C O O C C U R R E N C E A N A LY S I S

slide-7
SLIDE 7

F R E E R E S P O N S E S

slide-8
SLIDE 8
slide-9
SLIDE 9

I S T H I S O K A Y ?

slide-10
SLIDE 10

W O R D C L O U D S F O R G R O W N U P S

Counting words, but in fancier ways

slide-11
SLIDE 11
slide-12
SLIDE 12
slide-13
SLIDE 13

D I G I TA L H U M A N I T I E S

slide-14
SLIDE 14

C R A S H C O U R S E I N C O M P U T A T I O N A L L I N G U I S T I C S

Tokens, lemmas, and parts of speech Topics and LDA tf-idf Sentiment analysis Fingerprinting

slide-15
SLIDE 15

T I D Y T E X T

slide-16
SLIDE 16

T O K E N S

Element of the text Word Sentence Verse Line Paragraph n-gram

slide-17
SLIDE 17

T O K E N F R E Q U E N C Y

slide-18
SLIDE 18

N - G R A M F R E Q U E N C Y

slide-19
SLIDE 19

P A R T S O F S P E E C H

slide-20
SLIDE 20

P A R T O F S P E E C H F R E Q U E N C Y

slide-21
SLIDE 21

A R T S Y S T U F F

slide-22
SLIDE 22
slide-23
SLIDE 23

S E N T I M E N T A N A LY S I S

How positive or negative a text is

slide-24
SLIDE 24

S E N T I M E N T A N A LY S I S

slide-25
SLIDE 25
slide-26
SLIDE 26

T F - I D F

Term frequency-inverse document frequency

How important a term is compared to the rest of the documents

slide-27
SLIDE 27
slide-28
SLIDE 28

T O P I C M O D E L I N G

slide-29
SLIDE 29

L A T E N T D I R I C H L E T A L L O C A T I O N

slide-30
SLIDE 30

C L U S T E R S O F R E L A T E D W O R D S

slide-31
SLIDE 31

T R A C K T O P I C S O V E R T I M E

slide-32
SLIDE 32
slide-33
SLIDE 33

F I N G E R P R I N T I N G

Analyze richness or uniqueness of document

Punctuation patterns, vocabulary choices, sentence length Hapax legomenon

slide-34
SLIDE 34

Sentence length

slide-35
SLIDE 35

Hapax legomena

slide-36
SLIDE 36

Verse length

slide-37
SLIDE 37

V I S U A L I Z I N G T E X T W I T H R