Translating Texts into Interpretations and Numbers Department of - - PowerPoint PPT Presentation

translating texts into interpretations and numbers
SMART_READER_LITE
LIVE PREVIEW

Translating Texts into Interpretations and Numbers Department of - - PowerPoint PPT Presentation

Texts as Sources Content Analysis Practicalities/Reflection Translating Texts into Interpretations and Numbers Department of Government London School of Economics and Political Science Texts as Sources Content Analysis


slide-1
SLIDE 1

Texts as Sources Content Analysis Practicalities/Reflection

Translating Texts into Interpretations and Numbers

Department of Government London School of Economics and Political Science

slide-2
SLIDE 2

Texts as Sources Content Analysis Practicalities/Reflection

Preview

Two weeks on data collection

Text analysis Interviewing methods

Problem Set 3

Due November 21

Problem Set 4

Discuss in class next 2 weeks Due December 5

slide-3
SLIDE 3

Texts as Sources Content Analysis Practicalities/Reflection

1 Texts as Sources 2 Content Analysis 3 Practicalities/Reflection

slide-4
SLIDE 4

Texts as Sources Content Analysis Practicalities/Reflection

Research Proposal

This is a prospectus

Introduction Theoretical literature review Plan of data collection Planned analysis Reflective, critical conclusion

slide-5
SLIDE 5

Texts as Sources Content Analysis Practicalities/Reflection

Research Proposal

This is a prospectus

Introduction Theoretical literature review Plan of data collection Planned analysis Reflective, critical conclusion

Presentations are intended for feedback

You can change your topic later Please also use office hours

slide-6
SLIDE 6

Texts as Sources Content Analysis Practicalities/Reflection

1 Texts as Sources 2 Content Analysis 3 Practicalities/Reflection

slide-7
SLIDE 7

Texts as Sources Content Analysis Practicalities/Reflection

What counts as text?

Primary sources

Raw, original evidence

Secondary sources

Interpretations of raw evidence

Tertiary sources

Compendia or indices of two other types

  • f sources
slide-8
SLIDE 8

Texts as Sources Content Analysis Practicalities/Reflection

Use of Texts

Text as description

Rely on text in lieu of direct observation What do we gain? What do we lose?

slide-9
SLIDE 9

Texts as Sources Content Analysis Practicalities/Reflection

Use of Texts

Text as description

Rely on text in lieu of direct observation What do we gain? What do we lose?

Text as DSOs

Treat texts as units of analysis What do we gain? What do we lose?

slide-10
SLIDE 10

Texts as Sources Content Analysis Practicalities/Reflection

Challenges of Text

1 Source “Quality” 2 Subjectivity and differing perspectives 3 Historiography 4 Selection and confirmation bias

slide-11
SLIDE 11

Texts as Sources Content Analysis Practicalities/Reflection

Challenges of Text

1 Source “Quality” 2 Subjectivity and differing perspectives 3 Historiography 4 Selection and confirmation bias

But these are really the challenges of any research!

slide-12
SLIDE 12

Texts as Sources Content Analysis Practicalities/Reflection

slide-13
SLIDE 13

Texts as Sources Content Analysis Practicalities/Reflection

1 Texts as Sources 2 Content Analysis 3 Practicalities/Reflection

slide-14
SLIDE 14

Texts as Sources Content Analysis Practicalities/Reflection

But the biggest issue is that

  • nce we have some texts, our

interpretations are inherently subjective!

slide-15
SLIDE 15

Texts as Sources Content Analysis Practicalities/Reflection

Consider the following quote from a U.S. politician: African Americans must stop making excuses and rely much more

  • n themselves to get ahead in

society. What does it mean? Is it positive or negative? Is it prejudicial?

slide-16
SLIDE 16

Texts as Sources Content Analysis Practicalities/Reflection Kuklinski & Hurley. 1994. “On Hearing and Interpreting Political Messages: A Cautionary Tale of Citizen Cue-Taking.” Journal of Politics 56(3): 729–751.

slide-17
SLIDE 17

Texts as Sources Content Analysis Practicalities/Reflection

Content Analysis

Definition: “systematic description of the contents of a communication” Treat texts as DSOs or a source of multiple DSOs

Data set observation: score for a case on a variable

Technique can be applied to any kind of document

slide-18
SLIDE 18

Texts as Sources Content Analysis Practicalities/Reflection

slide-19
SLIDE 19

Texts as Sources Content Analysis Practicalities/Reflection

slide-20
SLIDE 20

Texts as Sources Content Analysis Practicalities/Reflection

slide-21
SLIDE 21

Texts as Sources Content Analysis Practicalities/Reflection

slide-22
SLIDE 22

Texts as Sources Content Analysis Practicalities/Reflection

slide-23
SLIDE 23

Texts as Sources Content Analysis Practicalities/Reflection

Content Analysis Process

1 Define a research question

slide-24
SLIDE 24

Texts as Sources Content Analysis Practicalities/Reflection

Content Analysis Process

1 Define a research question 2 Define concepts

slide-25
SLIDE 25

Texts as Sources Content Analysis Practicalities/Reflection

Content Analysis Process

1 Define a research question 2 Define concepts 3 Identify population (sample) of texts

slide-26
SLIDE 26

Texts as Sources Content Analysis Practicalities/Reflection

Content Analysis Process

1 Define a research question 2 Define concepts 3 Identify population (sample) of texts 4 Create a codebook that describes how

to operationalize concepts using text

slide-27
SLIDE 27

Texts as Sources Content Analysis Practicalities/Reflection

Content Analysis Process

1 Define a research question 2 Define concepts 3 Identify population (sample) of texts 4 Create a codebook that describes how

to operationalize concepts using text

5 Assign scores to text units based on

codebook

slide-28
SLIDE 28

Texts as Sources Content Analysis Practicalities/Reflection

Content Analysis Process

1 Define a research question 2 Define concepts 3 Identify population (sample) of texts 4 Create a codebook that describes how

to operationalize concepts using text

5 Assign scores to text units based on

codebook

6 Aggregate and summarize results

slide-29
SLIDE 29

Texts as Sources Content Analysis Practicalities/Reflection

Example 1: Categorization

Common RQ relates to categorization: What is this text saying? Example RQs:

What arguments are raised in this text? Are certain ideas more prevalent in some texts than others? How is an issue discussed in politics?

slide-30
SLIDE 30

Texts as Sources Content Analysis Practicalities/Reflection

Example 1: Categorization

Common RQ relates to categorization: What is this text saying? Example RQs:

What arguments are raised in this text? Are certain ideas more prevalent in some texts than others? How is an issue discussed in politics?

Running ex.: UKIP’s 2015 Manifesto

slide-31
SLIDE 31

Texts as Sources Content Analysis Practicalities/Reflection

Example 1 (Frames/Discourses)

If only all politicians could believe in Britain as UKIP does. If only they could share our positive vision of Britain as a proud, independent sovereign nation, a country respected on the world stage, a major player in global trade, with influence and authority when it comes to tackling the pressing international issues of the day.

slide-32
SLIDE 32

Texts as Sources Content Analysis Practicalities/Reflection

Example 1 (Frames/Discourses)

If only all politicians could believe in Britain as UKIP does. If only they could share our positive vision of Britain as a proud, independent sovereign nation, a country respected on the world stage, a major player in global trade, with influence and authority when it comes to tackling the pressing international issues of the day.

slide-33
SLIDE 33

Texts as Sources Content Analysis Practicalities/Reflection

Example 1 (Frames/Discourses)

If only all politicians could believe in Britain as UKIP does. If only they could share our positive vision of Britain as a proud, independent sovereign nation, a country respected on the world stage, a major player in global trade, with influence and authority when it comes to tackling the pressing international issues of the day.

slide-34
SLIDE 34

Texts as Sources Content Analysis Practicalities/Reflection

Example 1 (Frames/Discourses)

If only all politicians could believe in Britain as UKIP does. If only they could share our positive vision of Britain as a proud, independent sovereign nation, a country respected on the world stage, a major player in global trade, with influence and authority when it comes to tackling the pressing international issues of the day.

slide-35
SLIDE 35

Texts as Sources Content Analysis Practicalities/Reflection

Example 2 (Mentions)

If only all politicians could believe in Britain as UKIP does. If only they could share our positive vision of Britain as a proud, independent sovereign nation, a country respected on the world stage, a major player in global trade, with influence and authority when it comes to tackling the pressing international issues of the day.

slide-36
SLIDE 36

Texts as Sources Content Analysis Practicalities/Reflection

Example 2 (Mentions)

If only all politicians could believe in Britain as UKIP does. If only they could share our positive vision of Britain as a proud, independent sovereign nation, a country respected on the world stage, a major player in global trade, with influence and authority when it comes to tackling the pressing international issues of the day.

slide-37
SLIDE 37

Texts as Sources Content Analysis Practicalities/Reflection

Example 3 (Issues/Priorities)

If only all politicians could believe in Britain as UKIP does. If only they could share our positive vision of Britain as a proud, independent sovereign nation, a country respected on the world stage, a major player in global trade, with influence and authority when it comes to tackling the pressing international issues of the day.

slide-38
SLIDE 38

Texts as Sources Content Analysis Practicalities/Reflection

Example 3 (Issues/Priorities)

If only all politicians could believe in Britain as UKIP does. If only they could share our positive vision of Britain as a proud, independent sovereign nation, a country respected on the world stage, a major player in global trade, with influence and authority when it comes to tackling the pressing international issues of the day.

slide-39
SLIDE 39

Texts as Sources Content Analysis Practicalities/Reflection

Example 3 (Issues/Priorities)

If only all politicians could believe in Britain as UKIP does. If only they could share our positive vision of Britain as a proud, independent sovereign nation, a country respected on the world stage, a major player in global trade, with influence and authority when it comes to tackling the pressing international issues of the day.

slide-40
SLIDE 40

Texts as Sources Content Analysis Practicalities/Reflection

Example 3 (Issues/Priorities)

If only all politicians could believe in Britain as UKIP does. If only they could share our positive vision of Britain as a proud, independent sovereign nation, a country respected on the world stage, a major player in global trade, with influence and authority when it comes to tackling the pressing international issues of the day.

slide-41
SLIDE 41

Texts as Sources Content Analysis Practicalities/Reflection

Example 3 (Issues/Priorities)

If only all politicians could believe in Britain as UKIP does. If only they could share our positive vision of Britain as a proud, independent sovereign nation, a country respected on the world stage, a major player in global trade, with influence and authority when it comes to tackling the pressing international issues of the day.

slide-42
SLIDE 42

Texts as Sources Content Analysis Practicalities/Reflection

Example 3 (Issues/Priorities)

If only all politicians could believe in Britain as UKIP does. If only they could share our positive vision of Britain as a proud, independent sovereign nation, a country respected on the world stage, a major player in global trade, with influence and authority when it comes to tackling the pressing international issues of the day.

slide-43
SLIDE 43

Texts as Sources Content Analysis Practicalities/Reflection

Example 3 (Issues/Priorities)

If only all politicians could believe in Britain as UKIP does. If only they could share our positive vision of Britain as a proud, independent sovereign nation, a country respected on the world stage, a major player in global trade, with influence and authority when it comes to tackling the pressing international issues of the day.

slide-44
SLIDE 44

Texts as Sources Content Analysis Practicalities/Reflection

Example 3 (Issues/Priorities)

If only all politicians could believe in Britain as UKIP does. If only they could share our positive vision of Britain as a proud, independent sovereign nation, a country respected on the world stage, a major player in global trade, with influence and authority when it comes to tackling the pressing international issues of the day.

slide-45
SLIDE 45

Texts as Sources Content Analysis Practicalities/Reflection

Developing Codebooks I

Developed deductively

1 Start with set of concepts 2 Describe how those concepts might be

expressed in text

3 Create a “coding scheme” by which to

score a case on a variable

slide-46
SLIDE 46

Texts as Sources Content Analysis Practicalities/Reflection

Developing Codebooks II

Developed inductively

1 Examine texts for key ideas and

expressions

2 Create a set of categories that summarize

those expressions

3 Examine new texts for expressions of

those categories

4 Expand and clarify the codebook as new

concepts or expressions emerge

slide-47
SLIDE 47

Texts as Sources Content Analysis Practicalities/Reflection

Codebook Example I

1

Assign 1 if the text mentions “Europe”

slide-48
SLIDE 48

Texts as Sources Content Analysis Practicalities/Reflection

Codebook Example I

1

Assign 1 if the text mentions “Europe”

2

Count times “Europe” is mentioned

slide-49
SLIDE 49

Texts as Sources Content Analysis Practicalities/Reflection

Codebook Example I

1

Assign 1 if the text mentions “Europe”

2

Count times “Europe” is mentioned

3

More complex example:

Assign +1 if mentions “Europe” positively Assign -1 if mentions “Europe” negatively Assign 0 if text does not mention “Europe”

slide-50
SLIDE 50

Texts as Sources Content Analysis Practicalities/Reflection

Codebook Example I

1

Assign 1 if the text mentions “Europe”

2

Count times “Europe” is mentioned

3

More complex example:

Assign +1 if mentions “Europe” positively Assign -1 if mentions “Europe” negatively Assign 0 if text does not mention “Europe”

4

List all frames used in the text

slide-51
SLIDE 51

Texts as Sources Content Analysis Practicalities/Reflection

Example Scoring Sheet

Mention Count Affect Frame UKIP Conservative Labour LibDems SNP Greens

slide-52
SLIDE 52

Texts as Sources Content Analysis Practicalities/Reflection

Example 2: Dictionary Methods

Rather than rely on subjective interpretation, dictionary methods create a “dictionary” of terms or expressions that operationalize a concept Score is based on word presence of words or their frequencies Often, goal is to categorize the text based on how it scores relative to other texts

slide-53
SLIDE 53

Texts as Sources Content Analysis Practicalities/Reflection

Example 2: Dictionary Methods

Rather than rely on subjective interpretation, dictionary methods create a “dictionary” of terms or expressions that operationalize a concept Score is based on word presence of words or their frequencies Often, goal is to categorize the text based on how it scores relative to other texts Sentiment (positivity/negativity) Ideology (left/right)

slide-54
SLIDE 54

Texts as Sources Content Analysis Practicalities/Reflection

Affect/Sentiment

If only all politicians could believe in Britain as UKIP does. If only they could share our positive vision of Britain as a proud, independent sovereign nation, a country respected on the world stage, a major player in global trade, with influence and authority when it comes to tackling the pressing international issues of the day.

slide-55
SLIDE 55

Texts as Sources Content Analysis Practicalities/Reflection

Affect/Sentiment

If only all politicians could believe in Britain as UKIP does. If only they could share our positive vision of Britain as a proud, independent sovereign nation, a country respected on the world stage, a major player in global trade, with influence and authority when it comes to tackling the pressing international issues of the day.

slide-56
SLIDE 56

Texts as Sources Content Analysis Practicalities/Reflection

Affect/Sentiment

If only all politicians could believe in Britain as UKIP does. If only they could share our positive vision of Britain as a proud, independent sovereign nation, a country respected on the world stage, a major player in global trade, with influence and authority when it comes to tackling the pressing international issues of the day.

slide-57
SLIDE 57

Texts as Sources Content Analysis Practicalities/Reflection

Unit of Analysis

It can be difficult to decide what the correct unit of analysis is. Possibilities include:

Author/Source Text as a whole Section Page Paragraph Sentence Sentence fragment Word

slide-58
SLIDE 58

Texts as Sources Content Analysis Practicalities/Reflection

Unit of Analysis

It can be difficult to decide what the correct unit of analysis is. Possibilities include:

Author/Source Text as a whole Section Page Paragraph Sentence Sentence fragment Word

Smaller units can always be aggregated later, but larger units cannot be disaggregated later

slide-59
SLIDE 59

Texts as Sources Content Analysis Practicalities/Reflection

Example Scoring Sheet (Paragraph-level)

Mention Count Affect Frame UKIP Conservative Labour LibDems SNP Greens

slide-60
SLIDE 60

Texts as Sources Content Analysis Practicalities/Reflection

Example Scoring Sheet (Sentence-level)

Sentence Mention Affect Frame UKIP 1 UKIP 2 UKIP 3 UKIP 4 UKIP 5

slide-61
SLIDE 61

Texts as Sources Content Analysis Practicalities/Reflection

Sentence

If only all politicians could believe in Britain as UKIP does. If only they could share our positive vision of Britain as a proud, independent sovereign nation, a country respected on the world stage, a major player in global trade, with influence and authority when it comes to tackling the pressing international issues

  • f the day.
slide-62
SLIDE 62

Texts as Sources Content Analysis Practicalities/Reflection

Sentence fragments

If only all politicians could believe in Britain as UKIP does. If only they could share our positive vision of Britain as a proud, independent sovereign nation, a country respected on the world stage, a major player in global trade, with influence and authority when it comes to tackling the pressing international issues of the day.

slide-63
SLIDE 63

Texts as Sources Content Analysis Practicalities/Reflection

“Bag of Words” Approaches I

If only all politicians could believe in Britain as UKIP does . If only they could share our positive vision of Britain as a proud, independent sovereign nation, a country respected on the world stage, a major player in global trade, with influence and authority when it comes to tackling the pressing international issues of the day.

slide-64
SLIDE 64

Texts as Sources Content Analysis Practicalities/Reflection

“Bag of Words” Approaches I

If only all politicians could believe in Britain as UKIP does . If only they could share our positive vision of Britain as a proud, independent sovereign nation, a country respected on the world stage, a major player in global trade, with influence and authority when it comes to tackling the pressing international issues of the day.

slide-65
SLIDE 65

Texts as Sources Content Analysis Practicalities/Reflection

“Bag of Words” Approaches II

  • nly all politicians could believe Britain UKIP

does only they could share our positive vision Britain proud independent sovereign nation country respected world stage major player global trade influence authority comes tackling pressing international issues day

slide-66
SLIDE 66

Texts as Sources Content Analysis Practicalities/Reflection

“Bag of Words” Approaches III

independent sovereign nation country international world global positive proud respected major influence authority politicians player Britain Britain UKIP believe share tackling pressing could could does comes vision stage day trade issues

  • nly only all they our
slide-67
SLIDE 67

Texts as Sources Content Analysis Practicalities/Reflection

N-grams and “words”

“Bag of words” usually uses unigrams

slide-68
SLIDE 68

Texts as Sources Content Analysis Practicalities/Reflection

N-grams and “words”

“Bag of words” usually uses unigrams Unigrams are often “stemmed” family, families, familiarity, and familial all become famil.

slide-69
SLIDE 69

Texts as Sources Content Analysis Practicalities/Reflection

N-grams and “words”

“Bag of words” usually uses unigrams Unigrams are often “stemmed” family, families, familiarity, and familial all become famil. Can also be bigrams, trigrams, n-grams

slide-70
SLIDE 70

Texts as Sources Content Analysis Practicalities/Reflection

N-grams and “words”

“Bag of words” usually uses unigrams Unigrams are often “stemmed” family, families, familiarity, and familial all become famil. Can also be bigrams, trigrams, n-grams These allow for multi-word expressions “If only” has a negative connotation even though “if” and “only” are basically neutral words on their own

slide-71
SLIDE 71

Texts as Sources Content Analysis Practicalities/Reflection

1 Texts as Sources 2 Content Analysis 3 Practicalities/Reflection

slide-72
SLIDE 72

Texts as Sources Content Analysis Practicalities/Reflection

Content Analysis in Practice

Inductive processes generally require a “training set” and “test set”

Establish coding scheme on “training set” Apply it to broader “test set”

slide-73
SLIDE 73

Texts as Sources Content Analysis Practicalities/Reflection

Content Analysis in Practice

Inductive processes generally require a “training set” and “test set”

Establish coding scheme on “training set” Apply it to broader “test set”

Be cautious about scope conditions

slide-74
SLIDE 74

Texts as Sources Content Analysis Practicalities/Reflection

Content Analysis in Practice

Inductive processes generally require a “training set” and “test set”

Establish coding scheme on “training set” Apply it to broader “test set”

Be cautious about scope conditions Develop a precise and reliable codebook

slide-75
SLIDE 75

Texts as Sources Content Analysis Practicalities/Reflection

Intercoder Reliability

Definition: “extent to which two (or more) coders agree on scores”

slide-76
SLIDE 76

Texts as Sources Content Analysis Practicalities/Reflection

Intercoder Reliability

Definition: “extent to which two (or more) coders agree on scores” Have multiple coders analyze same text using coding scheme Assess % of time they assign same scores Increase specificity of codebook if reliability is low (less than 80%)

slide-77
SLIDE 77

Texts as Sources Content Analysis Practicalities/Reflection

Manual versus Automated Techniques

Text analysis methods vary along a continuum from fully manual to fully automated

slide-78
SLIDE 78

Texts as Sources Content Analysis Practicalities/Reflection

Manual versus Automated Techniques

Text analysis methods vary along a continuum from fully manual to fully automated Automated methods can be computationally intensive

slide-79
SLIDE 79

Texts as Sources Content Analysis Practicalities/Reflection

Manual versus Automated Techniques

Text analysis methods vary along a continuum from fully manual to fully automated Automated methods can be computationally intensive Automated methods make assumptions, but so do manual methods

slide-80
SLIDE 80

Texts as Sources Content Analysis Practicalities/Reflection

slide-81
SLIDE 81

Texts as Sources Content Analysis Practicalities/Reflection

slide-82
SLIDE 82

Texts as Sources Content Analysis Practicalities/Reflection

“Quant” or “Qual”?

Given what you know now, in what ways is content analysis a “qualitative” versus “quantitative” research method?

slide-83
SLIDE 83

Texts as Sources Content Analysis Practicalities/Reflection

Hands-On Practice

There is no explicit lab for this topic If you are interested, consider reading the Recommended text for this week, which includes hands-on exercises in R: http://tidytextmining.com/

slide-84
SLIDE 84