home > results and commentary > Obama (2008 vs 2012)

Word Analysis of 2012 U.S. Presidential Debates

Barack Obama (2008 vs 2012)



Introduction

In this analysis the content of Obama's 2008 first debate (vs McCain) is compared to that of the 2012 first debate (vs Romney). The purpose of this comparison is to highlight the debate dynamics of Obama's as the first-time candidate and as the encumbent.

The method of analysis is the same as for other debates, except in this case the transcript is constructed by treating Obama in 2008 as one speaker and Obama in 2012 as the another.

Word Statistics

Debate Word Count

Summary Word Count

The summary word count reports the total number of words and the number of unique, non-stop words used by each candidate. Word number is expressed as both absolute and relative values.

Table 1a
all words
Number of all words and unique words used by each speaker.
set word count
Barack Obama (2008)
7,517 1,370
50.8% 18.2%
61471370
Barack Obama (2012)
7,280 1,255
49.2% 17.2%
60251255
total
14,797 2,010
100.0% 13.6%
127872010

Fields with (e.g. 155) link to data files. Hover over the field to show these links.

Table 1b
exclusive and shared words
Words exclusive to speaker (e.g. speaker A but not speaker B) and shared by speakers (speaker A and B).
set word count
Barack Obama (2008)
1,284 755
17.1% 58.8%
529755
Barack Obama (2012)
1,173 640
16.1% 54.6%
533640
both candidates
12,340 615
83.4% 5.0%
11725615

Fields with (e.g. 155) link to data files. Hover over the field to show these links.

Table 1
commentary

Obama fit +3.3% (7,517 vs 7,280) more words back in 2008 than in 2012. He was less repetitive in 2008, delivering Δrel=+5.8% (Δabs=+1.0%, 18.2% vs 17.2%) more unique words.

Table 1
legend
a c
b d
3010

a :: word count

b :: word count, as fraction in total in debate

c :: unique words in (a)

d :: unique words in (a), as fraction in (a) bar :: proportion of (a-c):c

Stop Word Contribution

In the table below, the candidates' delivery is partitioned into stop and non-stop words. Stop words (full list) are frequently-used bridging words (e.g. pronouns and conjunctions) whose meaning depends entirely on context. The fraction of words that are stop words is one measure of the complexity of speech.

Table 2a
non-stop words
Counts of stop and non-stop words.
speaker all stop non-stop
Barack Obama (2008)
7,517 1,370
100.0% 18.2%
61471370
4,282 135
57.0% 3.2%
4147135
3,235 1,235
43.0% 38.2%
20001235
Barack Obama (2012)
7,280 1,255
100.0% 17.2%
60251255
4,095 147
56.2% 3.6%
3948147
3,185 1,108
43.8% 34.8%
20771108
total
14,797 2,010
100.0% 13.6%
127872010
8,377 155
56.6% 1.9%
8222155
6,420 1,855
43.4% 28.9%
45651855

Fields with (e.g. 155) link to data files. Hover over the field to show these links.

Table 2b
exclusive and shared non-stop words
Non-stop words exclusive to speaker (e.g. speaker A but not speaker B) and shared by speakers (speaker A and B).
set word count
Barack Obama (2008)
1,257 747
38.9% 59.4%
510747
Barack Obama (2012)
1,118 620
35.1% 55.5%
498620
both candidates
4,045 488
63.0% 12.1%
3557488

Fields with (e.g. 155) link to data files. Hover over the field to show these links.

Table 2
commentary

Stop word content was very similar in 2008 and 2012. The most popular non-stop word used exclusively in 2008 were "troops", "nulcear", "iran", and "al Qaeda", whereas in 2012 it was "insurance", "small", "deficit", "opportunity" and "middle-class" (not counting references to Obama's opponents, e.g. "John", "McCain", "governor" and "Romney").

There were quite a bit more words unique, +20.5% (747 vs 620), to the 2008 debate than 2012.

Table 2
legend
a c
b d
3010

a :: total number of words, for a given category (all, stop, non-stop)

b :: (a) relative to words in the debate if category=all, otherwise relative to words by the candidate

c :: number of unique words with set (a)

d :: (c) relative to (a)

bar :: proportion of (a-c):c

All further word use statistics represent content that has been filtered for stop words.

Word frequency

The word frequency table summarizes the frequency with which words were used. I show the average word frequency and the weighted cumulative frequencies at 50 and 90 percentile. The average word frequency indicates how many times, on average, a word is used. For a given fraction of the entire delivery, the weighted cumulative frequency indicates the largest word frequency within this fraction (details about weighted cumulative distribution).

Table 3a
word use frequency
Average and 50%/90% percentile word frequencies.
speaker word frequency
all stop non-stop
Barack Obama (2008)
5.5 25 256
5.48725.000256.000
31.7 96 309
31.71996.000309.000
2.6 4 24
2.6194.00024.000
Barack Obama (2012)
5.8 23 260
5.80123.000260.000
27.9 80 267
27.85780.000267.000
2.9 5 23
2.8755.00023.000
total
7.4 44 523
7.36244.000523.000
54.0 166 569
54.045166.000569.000
3.5 8 37
3.4618.00037.000

Fields with (e.g. 155) link to data files. Hover over the field to show these links.

Table 3b
exclusive and shared non-stop word use frequency
Average and 50%/90% cumulative percentile word frequencies. Non-stop words exclusive to speaker (e.g. speaker A but not speaker B) and shared by speakers (speaker A and B).
set word frequency
Barack Obama (2008)
1.68 2 17
1.6832.00017.000
Barack Obama (2012)
1.80 2 25
1.8032.00025.000
total
3.46 8 37
3.4618.00037.000

Fields with (e.g. 155) link to data files. Hover over the field to show these links.

Table 3
commentary

Word frequency profile indicates Obama increased repetition in 2012 by +11.5% (2.9 vs 2.6).

Table 3
legend
a b c
51025

a :: average word frequency

b :: largest word frequency in 50% of content

c :: largest word frequency in 90% of content

bar :: proportion of a:b:c

Sentence Size

Table 4
sentence size
Number of sentences spoken by each speaker and sentence word count statistics. Number of words in a sentence is shown by average and 50%/90% cumulative values for all, stop and non-stop words.
speaker number of sentences sentence size
all stop non-stop
Barack Obama (2008)
431
431
17.5 24 61
17.46224.00061.000
10.0 15 34
9.96515.00034.000
7.6 11 26
7.59011.00026.000
Barack Obama (2012)
391
391
18.6 26 50
18.61926.00050.000
10.6 15 29
10.63615.00029.000
8.4 12 23
8.36012.00023.000
total
822
822
20.0 26 56
20.01226.00056.000
12.3 16 31
12.28216.00031.000
10.0 12 26
9.95312.00026.000

Fields with (e.g. 155) link to data files. Hover over the field to show these links.

Table 4
commentary

Obama's sentences were +10.5% (8.4 vs 7.6) longer in 2012. Given that repetition was also higher, the overall delivery was less dense.

Table 4
legend
a b c
51025

a :: average sentence size

b :: largest sentence size for 50% of content

c :: largest sentence size for 90% of content

bar :: proportion of a:b:c

Part of Speech Analysis

In this section, word frequency is broken down by their part of speech (POS). The four POS groups examined are nouns, verbs, adjectives and adverbs. Conjunctions and prepositions are not considered. The first category (n+v+adj+adv) is composed of all four POS groups.

Part of Speech Count

Table 5
part of speech count
Count of words categorized by part of speech (POS).
part of speech
n+v+adj+adv nouns (n) verbs (v) adjectives (adj) adverbs (adv)
Barack Obama (2008)
2,990 1,177
39.8% 39.4%
89360743237422529310759
1,500 607
50.2% 40.5%
893607
806 374
27.0% 46.4%
432374
518 293
17.3% 56.6%
225293
166 59
5.6% 35.5%
10759
Barack Obama (2012)
2,949 1,061
40.5% 36.0%
9045434813562592529262
1,447 543
49.1% 37.5%
904543
837 356
28.4% 42.5%
481356
511 252
17.3% 49.3%
259252
154 62
5.2% 40.3%
9262
total
5,939 1,778
40.1% 29.9%
2014933105359057545423090
2,947 933
49.6% 31.7%
2014933
1,643 590
27.7% 35.9%
1053590
1,029 454
17.3% 44.1%
575454
320 90
5.4% 28.1%
23090

Fields with (e.g. 155) link to data files. Hover over the field to show these links.

Table 5
commentary

Part of speech analysis shows little change from 2008 to 2012. The ratios of each of noun, verb, adjective and adverb is similar. Verb use is slightly up in 2012 by Δrel=+5.2% (Δabs=+1.4%, 28.4% vs 27%).

In 2012 Obama used a greater fraction of unique adverbs, Δrel=+13.5% (Δabs=+4.8%, 40.3% vs 35.5%). Unique fraction of all other parts of speech was lower in 2012.

In fact, adverbs were the only part of speech for which Obama's delivery was richer in 2012, by +5.1% (62 vs 59). In 2012 he delivered -10.5% (543 vs 607) fewer nouns, -4.8% (356 vs 374) fewer verbs and -14.0% (252 vs 293) fewer adjectives.

Table 5
legend
a c
b d
1535

a :: total number of words for a given POS (all, noun, verb, adjective, adverb, pronoun)

b :: (a) relative to all words by candidate

c :: unique words in (a)

d :: (c) relative to (a)

bar :: proportion of (a-c):c

Part of Speech Frequency

Table 5
part of speech frequency
Frequency of words categorized by part of speech (POS).
part of speech frequency
n+v+adj+adv nouns (n) verbs (v) adjectives (adj) adverbs (adv)
Barack Obama (2008)
2.54 4 25
2.5404.00025.000
2.47 4 19
2.4714.00019.000
2.15 3 23
2.1553.00023.000
1.77 2 14
1.7682.00014.000
2.81 5 36
2.8145.00036.000
Barack Obama (2012)
2.78 5 21
2.7795.00021.000
2.67 5 21
2.6655.00021.000
2.35 3 24
2.3513.00024.000
2.03 3 11
2.0283.00011.000
2.48 4 25
2.4844.00025.000
total
3.34 7 36
3.3407.00036.000
3.16 7 30
3.1597.00030.000
2.79 6 47
2.7856.00047.000
2.27 3 18
2.2673.00018.000
3.56 8 61
3.5568.00061.000

Fields with (e.g. 155) link to data files. Hover over the field to show these links.

Table 5
commentary

Repetition was higher in 2012 for all parts of speech, except adverbs: nouns by -100.0% (0 vs 47), verbs by +9.3% (2.35 vs 2.15), and adjectives by +14.7% (2.03 vs 1.77). Adverb repetition fell by -11.7% (2.48 vs 2.81),

Table 5
legend
a b c
51025

a :: average word frequency

b :: largest word frequency in 50% of content

c :: largest word frequency in 90% of content

bar :: proportion of a:b:c

Part of Speech Pairing

Through word pairing, I extract concepts from the text. The number of unique word pairs is a function of sentence length and is one of the measures of complexity.

Table 6a
part of speech pairing — Barack Obama (2008)
Word pairs (total and unique) categorized by part of speech (POS)
part of speech pairings - Barack Obama (2008)
noun verb adjective adverb
noun
4,673 3,993
  85.4%
6803993
verb
5,069 4,547
  89.7%
5224547
1,211 1,105
  91.2%
1061105
adjective
2,691 2,463
  91.5%
2282463
1,449 1,319
  91.0%
1301319
350 336
  96.0%
14336
adverb
990 874
  88.3%
116874
543 490
  90.2%
53490
284 255
  89.8%
29255
65 50
  76.9%
1550

Fields with (e.g. 155) link to data files. Hover over the field to show these links.

Table 6b
part of speech pairing — Barack Obama (2012)
Word pairs (total and unique) categorized by part of speech (POS)
part of speech pairings - Barack Obama (2012)
noun verb adjective adverb
noun
4,105 3,451
  84.1%
6543451
verb
5,187 4,596
  88.6%
5914596
1,423 1,318
  92.6%
1051318
adjective
2,740 2,413
  88.1%
3272413
1,639 1,475
  90.0%
1641475
423 381
  90.1%
42381
adverb
853 788
  92.4%
65788
512 479
  93.6%
33479
290 266
  91.7%
24266
44 42
  95.5%
242

Fields with (e.g. 155) link to data files. Hover over the field to show these links.

Table 6c
unique part of speech pairing — candidate comparison
Unique word pairs categorized by part of speech (POS)
unique part of speech pairings
noun (n) verb (v) adjective (adj) adverb (adv)
noun
3,993 3,451
  86.4%
3993
3451
verb
4,547 4,596
  101.1%
4547
4596
1,105 1,318
  119.3%
1105
1318
adjective
2,463 2,413
  98.0%
2463
2413
1,319 1,475
  111.8%
1319
1475
336 381
  113.4%
336
381
adverb
874 788
  90.2%
874
788
490 479
  97.8%
490
479
255 266
  104.3%
255
266
50 42
  84.0%
50
42

Fields with (e.g. 155) link to data files. Hover over the field to show these links.

Table 6
commentary

The 2012 debate saw longer sentences, but fewer unique words. Nevertheless, the number of noun/noun combinations in 2008 was higher by +15.7% (3,993 vs 3,451), indicating that the 2008 debate was richer in concepts.

Table 6 a,b
legend
a c
  d
3010

a :: total number of pairs, for a given category (e.g. verb/noun)

c :: number of unique pairs within set (a)

d :: (c) relative to (a)

bar :: proportion of (a-c):c

Table 6c
legend
a c
  d
50
45

a :: unique pairs for Barack Obama (2008)

c :: unique pairs for Barack Obama (2012)

d :: (c) relative to (a) (i.e. Barack Obama (2012) relative to Barack Obama (2008))

bars :: (a) and (c)

Exclusive and Shared Usage

This section enumerates words that were exclusive to a candidate (e.g. used by one candidate but not the other). This content provides insight into what the candidates' priorities are and reveals differences in perspective on similar topics.

For a given part of speech, the table breaks down the number of words that were spoken by only one of the candidates or both candidates (intersection). The last row includes words spoken by either candidate (union).

Table 7
exclusive word usage
Total and unique words used exclusively by a candidate, or by both.
part of speech
n+v+adj+adv nouns (n) verbs (v) adjectives (adj) adverbs (adv)
Barack Obama (2008)
1,195 717
100.0% 60.0%
20.1% 40.3%
478717
3123685120960168522
680 368
56.9% 54.1%
23.1% 39.4%
312368
312368
260 209
21.8% 80.4%
15.8% 35.4%
51209
51209
228 168
19.1% 73.7%
22.2% 37.0%
60168
60168
27 22
2.3% 81.5%
8.4% 24.4%
522
522
Barack Obama (2012)
1,076 601
100.0% 55.9%
18.1% 33.8%
475601
2882926319473134923
580 292
53.9% 50.3%
19.7% 31.3%
288292
288292
257 194
23.9% 75.5%
15.6% 32.9%
63194
63194
207 134
19.2% 64.7%
20.1% 29.5%
73134
73134
32 23
3.0% 71.9%
10.0% 25.6%
923
923
both candidates
3,668 460
100.0% 12.5%
61.8% 25.9%
3208460
13942179141404099120531
1,611 217
43.9% 13.5%
54.7% 23.3%
1394217
1394217
1,054 140
28.7% 13.3%
64.2% 23.7%
914140
914140
500 91
13.6% 18.2%
48.6% 20.0%
40991
40991
236 31
6.4% 13.1%
73.8% 34.4%
20531
20531
total
5,939 1,778
100.0% 29.9%
100.0% 100.0%
41611778
2014933105359057545423090
2,947 933
49.6% 31.7%
100.0% 100.0%
2014933
2014933
1,643 590
27.7% 35.9%
100.0% 100.0%
1053590
1053590
1,029 454
17.3% 44.1%
100.0% 100.0%
575454
575454
320 90
5.4% 28.1%
100.0% 100.0%
23090
23090

Fields with (e.g. 155) link to data files. Hover over the field to show these links.

Table 7
commentary

The 2008 debate had +19.3% (717 vs 601) more exclusive words than 2012. The number of nouns and adjectives unique to the debate was +26.0% (368 vs 292) and +25.4% (168 vs 134) higher in 2008 than 2012, respectively. This is a good indication of just how much more vigorous and varied the 2008 debate was than 2012.

Table 7c
legend
a d
b e
c f
4030
40302015105

a :: total number of words in set (e.g. obama \ romney, obama ∩ romney, obama ∪ romney , for a given part of speech

b :: (a) relative to all exclusive words in n+v+adj+adv

c :: (a) relative to all words in n+v+adj+adv

d :: unique words in (a)

e :: (d) relative to (a)

f :: (d) relative to all unique words in n+v+adj+adv

bar1 :: normalized ratio of (a-d):d

bar2 :: absolute ratio of (a-d):d for all POS groups (first column) or POS group (other columns)

Noun Phrase Usage

Noun phrases were extracted from the text and analyzed for frequency, word count, unique word count and richness. Single-word phrases were not counted.

Top-level noun phrases are those without a parent noun phrase (a parent phrase is one that a similar, longer phrase). Derived noun phrases are those with a parent (more details about noun phrase analysis).

The top-level noun phrases can be interpreted as independent concepts. Derived noun phrases can be interpreted as variants on concepts embodied by the top-level phrases.

Noun Phrase Count and length

This table reports the absolute number of noun phrases, which is related to the number of nouns, and their length.

Table 8a
noun phrase count
Counts of noun phrases in words and per noun.
speaker noun phrase count
all top-level
Barack Obama (2008)
527 261
100.0% 49.5%
0.35 0.43
266261
447 257
84.8% 57.5%
0.30 0.42
190257
Barack Obama (2012)
529 240
100.0% 45.4%
0.37 0.44
289240
433 234
81.9% 54.0%
0.30 0.43
199234

Fields with (e.g. 155) link to data files. Hover over the field to show these links.

Table 8b
noun phrase length
Average and 50%/90% cumulative length of noun phrases, in words.
speaker noun phrase length
all top-level
Barack Obama (2008)
2.29 2 3
2.2872.0003.000
2.33 2 3
2.3292.0003.000
Barack Obama (2012)
2.31 2 3
2.3122.0003.000
2.36 2 4
2.3602.0004.000

Fields with (e.g. 155) link to data files. Hover over the field to show these links.

Table 8
commentary

As hinted by the analysis above, the drop in the number of concepts is reflected in the reduced number of noun phrases. Although in 2012 noun phrases were slightly longer, +1.3% (2.36 vs 2.33) for top-level phrases, there were -8.9% (234 vs 257) fewer of them.

Table 8a
legend
a d
b e
c f
1070

a :: number of noun phrases

b :: (a) relative to number of all noun phrases

c :: number of noun phrases per noun

d :: number of unique phrases

e :: (c) relative to (a)

f :: number of unique noun phrases per unique noun

bar :: normalized ratio of (a-c):c

Table 8b
legend
a b c
102080

a :: average noun phrase size, in words

b :: largest noun phrase size in 50% of content

c :: largest noun phrase size in 90% of content

bar :: proportion of a:b:c


Exclusive and Shared Noun Phrase Count and length

Table 9a
exclusive and shared noun phrase count
Counts of exclusive and shared noun phrases in words and per noun.
speaker noun phrase count
all top-level
Barack Obama (2008)
472 256
43.8% 54.2%
216256
411 253
87.1% 61.6%
158253
Barack Obama (2012)
476 234
44.2% 49.2%
242234
406 230
85.3% 56.7%
176230
both candidates
172 31
16.0% 18.0%
14131
74 18
43.0% 24.3%
5618

Fields with (e.g. 155) link to data files. Hover over the field to show these links.

Table 9b
exclusive and shared noun phrase length
Average and 50%/90% cumulative length of noun phrases, in words.
speaker noun phrase length
all top-level
Barack Obama (2008)
2.31 2 3
2.3092.0003.000
2.35 2 4
2.3482.0004.000
Barack Obama (2012)
2.34 2 3
2.3362.0003.000
2.38 2 4
2.3772.0004.000
both candidates
2.08 2 3
2.0762.0003.000
2.18 2 3
2.1762.0003.000

Fields with (e.g. 155) link to data files. Hover over the field to show these links.

Table 9
commentary

The 2008 debate had a greater variety of exclusive noun phrases, by +9.6% (252 vs 230).

Table 9a
legend
a c
b d
1070

a :: number of noun phrases

b :: (a) relative to number of all noun phrases

c :: number of unique phrases

d :: (c) relative to (a)

bar :: normalized ratio of (a-c):c

Table 9b
legend
a b c
102080

a :: average noun phrase size, in words

b :: largest noun phrase size in 50% of content

c :: largest noun phrase size in 90% of content

bar :: proportion of a:b:c


Windbag Index

The Windbag Index is a compound measure that characterizes the complexity of speech. A low index is indicative of succinct speech with low degree of repetition and large number of independent concepts.

Table 10
windbag index
Windbag Index for each speaker. The higher the value, the more repetitive the speech.
speaker Windbag Index
index value index terms
Barack Obama (2008)
330
-29.5%
330.62775476744
0.430 0.382 0.405 0.464 0.566 0.355 0.495 0.985
-1.6% +9.7% +7.8% +9.1% +14.7% -11.7% +9.2% +1.0%
0.4303578555274710.3817619783616690.4046666666666670.4640198511166250.5656370656370660.3554216867469880.4952561669829220.984674329501916
Barack Obama (2012)
468
+41.8%
468.732412593364
0.438 0.348 0.375 0.425 0.493 0.403 0.454 0.975
+1.7% -8.9% -7.3% -8.3% -12.8% +13.3% -8.4% -1.0%
0.43750.3478806907378340.3752591568762960.4253285543608120.4931506849315070.4025974025974030.4536862003780720.975
Table 10
commentary

The 2008 debate delivery was denser and more vigorous. Repetition plagued Obama in 2012, resulting in a +41.8% (468 vs 330) larger Windbag Index. The only terms in 2012 that contributed to a lower index were the fraction of non-stop words and the fraction of unique adverbs. All other quantities indicated greater repetition across all others parts of the delivery.

Table 10
legend
The Windbag Index is 1/(t1*t2*...*t9) where t1,t2,...,t8 are

t1 :: fraction of words which are non-stop

t2 :: fraction of non-stop words which are unique

t3 :: fraction of nouns which are unique

t4 :: fraction of verbs which are unique

t5 :: fraction of adjectives which are unique

t6 :: fraction of adverbs which are unique

t7 :: fraction of noun phrases which are unique

t8 :: fraction of noun phrases which are top-level


Large individual terms t1...t9 contribute to a smaller index.

The percentage values below the index and each term are relative differences to the other speaker's corresponding term (i.e. 100*(a-b)/b where a is the value for one speaker and b for the other).

Word Clouds

In the word clouds below, the size of the word is proportional to the number of times it was used by a candidate (method details).

Not all words from a group used to draw the cloud fit in the image — less frequently used words for large word groups may fall outside the image.

All Words for Each Candidate

Each candidate's debate portion was extracted and frequencies were compiled for each part of speech (noun, verb, adjective, adverb), with words colored by their part of speech category.

The distribution of sizes within a tag cloud follows the frequency distribution of words. However, word size cannot be compared between clouds, since the minimum and maximum size of the words is fixed.

Debate Word Cloud for Barack Obama (2008) - all words

Debate tag cloud for Barack Obama (2008)

Debate Word Cloud for Barack Obama (2012) - all words

Debate tag cloud for Barack Obama (2012)
commentary

Exclusive Words for Each Candidate

The clouds below show words used exlusively by a candidate. For example, if candidate A used the word "invest" (any number of times), but candidate B did not, then the word will appear in the exclusive word tag cloud for candidate A.

Words exclusive to Barack Obama (2008)

Debate tag cloud for Barack Obama (2008)

Words exclusive to Barack Obama (2012)

Debate tag cloud for Barack Obama (2012)
commentary

Part of Speech Word Clouds

In these clouds, words from each major part of speech were colored based on whether they were exclusive to a candidate or shared by the candidates.

The size of the word is relative to the frequency for the candidate — word sizes between candidates should not be used to indicate difference in absolute frequency.

Cloud of noun words, by speaker

commentary

Cloud of verb words, by speaker

commentary

Cloud of adjective words, by speaker

commentary

Cloud of adverb words, by speaker

commentary

Cloud of all words, by speaker

commentary

Word Pair Clouds for Each Candidate

word pairs for Barack Obama (2008)

adjective/adjective by Barack Obama (2008)
adjective/adverb by Barack Obama (2008)
adjective/noun by Barack Obama (2008)
adjective/verb by Barack Obama (2008)
adverb/adverb by Barack Obama (2008)
adverb/noun by Barack Obama (2008)
adverb/verb by Barack Obama (2008)
noun/noun by Barack Obama (2008)
noun/verb by Barack Obama (2008)
verb/verb by Barack Obama (2008)

word pairs for Barack Obama (2012)

adjective/adjective by Barack Obama (2012)
adjective/adverb by Barack Obama (2012)
adjective/noun by Barack Obama (2012)
adjective/verb by Barack Obama (2012)
adverb/adverb by Barack Obama (2012)
adverb/noun by Barack Obama (2012)
adverb/verb by Barack Obama (2012)
noun/noun by Barack Obama (2012)
noun/verb by Barack Obama (2012)
verb/verb by Barack Obama (2012)
commentary

Downloads

Debate transcript

Parsed word lists (word lists, part of speech lists, noun phrases, sentences)

Word clouds

Raw data structure

Please see the methods section for details about these files.