home > results > Nixon vs. Kennedy (combined)

Word Analysis of 1960 U.S. Presidential Debates

Richard Nixon vs. John F. Kennedy (combined debates)

26 September — 21 October 1960



Introduction

Speaking Turns and Interruptions

Here, I look at the length of each turn of uninterrupted speech.

Table 1
length of sections in words
The number of uninterrupted deliveries (sections), mode/median/mean length of sections in words, and the shortest section length in words that composed 10%, 50% and 90% of the debate.
speaker sections section length debate contiguity (L10 L50 L90)
Richard Nixon
49
49
4.0 341.0 394.4
4.000341.00000000394.388
277 438 1,238
277.000438.0001238.000
John F Kennedy
55
55
7.0 325.0 338.0
7.000325.00000000337.964
280 424 1,285
280.000424.0001285.000

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 1
legend
a b c
51025

a — section length (mode), shortest section length in 10% of debate

b — section length (median), shortest section length in 50% of debate

c — section length (mean), shortest section length in 90% of debate

bar — proportion of a:b:c

Table 1
commentary

Flesch-Kincaid Reading Ease and Grade Level

The Flesch-Kincaid reading ease and grade level metrics are designed to indicate how difficult a passage in English is to understand.

Reading ease ranges from 100 (easiest) down to 0 (hardest) and can be interpreted as follows

100 –905th gradeVery easy to read. Easily understood by an average 11-year-old student.
90 – 806th gradeEasy to read. Conversational English for consumers.
80 – 707th gradeFairly easy to read.
70 – 608th & 9th gradePlain English. Easily understood by 13- to 15-year-old students.
60 – 5010th to 12th gradeFairly difficult to read.
50 – 30collegeDifficult to read.
30 – 10college graduateVery difficult to read. Best understood by college/university graduates.
10 – 0professionalExtremely difficult to read. Best understood by college/university graduates.

The grade level corresponds roughly to a U.S. grade level. It has a minimum value of –3.4 and no upper bound.

Two sets of readability scores are calculated. One for the entire debate and one that only considers section with at least 9 words.

Table 2a
readability — entire debate
Flesch-Kincaid reading ease and grade level.
speaker grade level reading ease sections sentences words syllables
Richard Nixon
10.08
0.0%
10.08
62.23
0.0%
62.23
49
0.0%
49
873
0.0%
873
19,325
0.0%
19325
27,900
0.0%
27900
John F Kennedy
9.99
0.0%
9.99
61.04
0.0%
61.04
55
0.0%
55
880
0.0%
880
18,588
0.0%
18588
27,322
0.0%
27322

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 2b
readability — excluding short sections
Flesch-Kincaid reading ease and grade level for sections with at least 9 words.
speaker grade level reading ease sections sentences words syllables
Richard Nixon
10.10
0.0%
10.10
62.16
0.0%
62.16
46
0.0%
46
870
0.0%
870
19,313
0.0%
19313
27,884
0.0%
27884
John F Kennedy
10.04
0.0%
10.04
60.89
0.0%
60.89
49
0.0%
49
874
0.0%
874
18,565
0.0%
18565
27,295
0.0%
27295

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 2
legend
a
b
30

a — value for candidate

b — value relative to Donald Trump

bar — proportion of a

Table 2
commentary

Sentence Size

Table 3
sentence size
Number of sentences spoken by each speaker and sentence word count statistics. Number of words in a sentence is shown by average and 50%/90% cumulative values for all, stop and non-stop words.
speaker number of sentences sentence size
all stop non-stop
Richard Nixon
873
873
21.8 30 60
21.82530.00060.000
12.9 17 37
12.85617.00037.000
9.0 13 26
8.96913.00026.000
John F Kennedy
880
880
20.9 27 56
20.85727.00056.000
11.4 16 32
11.41216.00032.000
9.4 13 26
9.44413.00026.000
total
1,753
1753
23.3 30 60
23.33930.00060.000
14.1 18 36
14.13118.00036.000
11.2 14 27
11.20814.00027.000

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 3
legend
a b c
51025

a — average sentence size

b — largest sentence size for 50% of content

c — largest sentence size for 90% of content

bar — proportion of a:b:c

Table 3
commentary

Word Statistics

Debate Word Count

Summary Word Count

The summary word count reports the total number of words and the number of unique, non-stop words used by each candidate. Word number is expressed as both absolute and relative values.

Table 4a
all words
Number of all words and unique words used by each speaker.
set word count
Richard Nixon
19,053 1,993
50.9% 10.5%
170601993
John F Kennedy
18,354 2,081
49.1% 11.3%
162732081
total
37,407 2,984
100.0% 8.0%
344232984

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 4b
exclusive and shared words
Words exclusive to speaker (e.g. speaker A but not speaker B) and shared by speakers (speaker A and B).
set word count
Richard Nixon
1,674 903
8.8% 53.9%
771903
John F Kennedy
1,969 991
10.7% 50.3%
978991
both candidates
33,764 1,090
90.3% 3.2%
326741090

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 4
legend
a c
b d
3010

a — word count

b — word count, as fraction in total in debate

c — unique words in (a)

d — unique words in (a), as fraction in (a)

bar — proportion of (a-c):c

Table 4
commentary

Stop Word Contribution

In the table below, the candidates' delivery is partitioned into stop and non-stop words. Stop words (full list) are frequently-used bridging words (e.g. pronouns and conjunctions) whose meaning depends entirely on context. The fraction of words that are stop words is one measure of the complexity of speech.

Table 5a
non-stop words
Counts of stop and non-stop words.
speaker all words stop words non-stop words
Richard Nixon
19,053 1,993
100.0% 10.5%
170601993
11,223 151
58.9% 1.3%
11072151
7,830 1,842
41.1% 23.5%
59881842
John F Kennedy
18,354 2,081
100.0% 11.3%
162732081
10,043 154
54.7% 1.5%
9889154
8,311 1,927
45.3% 23.2%
63841927
total
37,407 2,984
100.0% 8.0%
344232984
21,266 163
56.9% 0.8%
21103163
16,141 2,821
43.1% 17.5%
133202821

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 5b
exclusive and shared non-stop words
Non-stop words exclusive to speaker (e.g. speaker A but not speaker B) and shared by speakers (speaker A and B).
set word count
Richard Nixon
1,652 894
21.1% 54.1%
758894
John F Kennedy
1,940 979
23.3% 50.5%
961979
both candidates
12,549 948
77.7% 7.6%
11601948

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 5
legend
a c
b d
3010

a — total number of words, for a given category (all, stop, non-stop)

b — (a) relative to words in the debate if category=all, otherwise relative to words by the candidate

c — number of unique words with set (a)

d — (c) relative to (a)

bar — proportion of (a-c):c

Table 5
commentary

Word frequency

The word frequency table summarizes the frequency with which words were used. I show the average word frequency and the weighted cumulative frequencies at 50 and 90 percentile. The average word frequency indicates how many times, on average, a word is used. For a given fraction of the entire delivery, the weighted cumulative frequency indicates the largest word frequency within this fraction (details about weighted cumulative distribution).

Table 6a
word use frequency
Average and 50%/90% percentile word frequencies.
speaker word frequency
all stop non-stop
Richard Nixon
9.6 69 579
9.56069.000579.000
74.3 264 1,152
74.325264.0001152.000
4.3 10 60
4.25110.00060.000
John F Kennedy
8.8 53 535
8.82053.000535.000
65.2 209 1,276
65.214209.0001276.000
4.3 10 72
4.31310.00072.000
total
12.5 119 1,114
12.536119.0001114.000
130.5 487 2,428
130.466487.0002428.000
5.7 17 114
5.72217.000114.000

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 6b
exclusive and shared non-stop word use frequency
Average and 50%/90% cumulative percentile word frequencies. Non-stop words exclusive to speaker (e.g. speaker A but not speaker B) and shared by speakers (speaker A and B).
set word frequency
Richard Nixon
1.85 2 9
1.8482.0009.000
John F Kennedy
1.98 3 10
1.9823.00010.000
total
5.72 17 114
5.72217.000114.000

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 6
legend
a b c
51025

a — average word frequency

b — largest word frequency in 50% of content

c — largest word frequency in 90% of content

bar — proportion of a:b:c

Table 6
commentary

All further word use statistics represent content that has been filtered for stop words, unless explicitly indicated.

Part of Speech Analysis

In this section, word frequency is broken down by their part of speech (POS). The four POS groups examined are nouns, verbs, adjectives and adverbs. Conjunctions and prepositions are not considered. The first category (n+v+adj+adv) is composed of all four POS groups.

Part of Speech Count

Table 7
part of speech count
Count of words categorized by part of speech (POS).
part of speech
n+v+adj+adv nouns (n) verbs (v) adjectives (adj) adverbs (adv)
Richard Nixon
7,452 1,802
39.1% 24.2%
30419091511630607297346111
3,950 909
53.0% 23.0%
3041909
2,141 630
28.7% 29.4%
1511630
904 297
12.1% 32.9%
607297
457 111
6.1% 24.3%
346111
John F Kennedy
7,826 1,866
42.6% 23.8%
327210251473549753334307113
4,297 1,025
54.9% 23.9%
32721025
2,022 549
25.8% 27.2%
1473549
1,087 334
13.9% 30.7%
753334
420 113
5.4% 26.9%
307113
total
15,278 2,759
40.8% 18.1%
6774147332499141513478707170
8,247 1,473
54.0% 17.9%
67741473
4,163 914
27.2% 22.0%
3249914
1,991 478
13.0% 24.0%
1513478
877 170
5.7% 19.4%
707170

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 7
legend
a c
b d
1535

a — total number of words for a given POS (all, noun, verb, adjective, adverb)

b — (a) relative to all words by candidate

c — unique words in (a)

d — (c) relative to (a)

bar — proportion of (a-c):c

Table 7
commentary

Part of Speech Frequency

Table 8
part of speech frequency
Frequency of words categorized by part of speech (POS).
part of speech frequency
n+v+adj+adv nouns (n) verbs (v) adjectives (adj) adverbs (adv) pronouns (pro)
Richard Nixon
4.13 9 59
4.1359.00059.000
4.34 11 69
4.34511.00069.000
3.40 6 60
3.3986.00060.000
3.04 5 18
3.0445.00018.000
4.12 9 33
4.1179.00033.000
58.94 203 699
58.938203.000699.000
John F Kennedy
4.19 9 70
4.1949.00070.000
4.19 9 58
4.1929.00058.000
3.68 9 96
3.6839.00096.000
3.25 6 31
3.2546.00031.000
3.72 8 21
3.7178.00021.000
45.46 149 487
45.456149.000487.000
total
5.54 16 114
5.53816.000114.000
5.60 17 121
5.59917.000121.000
4.55 11 122
4.55511.000122.000
4.17 10 46
4.16510.00046.000
5.16 15 43
5.15915.00043.000
93.54 302 1,186
93.541302.0001186.000

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 8
legend
a b c
51025

a — average word frequency

b — largest word frequency in 50% of content

c — largest word frequency in 90% of content

bar — proportion of a:b:c

Table 8
commentary

Part of Speech Pairing

Through word pairing, I extract concepts from the text. The number of unique word pairs is a function of sentence length and is one of the measures of complexity.

Table 9a
part of speech pairing — Richard Nixon
Word pairs (total and unique) categorized by part of speech (POS)
part of speech pairings - Richard Nixon
noun verb adjective adverb
noun
540 238
  44.1%
302238
verb
161 133
  82.6%
28133
13 12
  92.3%
112
adjective
523 381
  72.8%
142381
3 3
  100.0%
03
28 24
  85.7%
424
adverb
8 6
  75.0%
26
47 44
  93.6%
344
28 25
  89.3%
325
12 12
  100.0%
012

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 9b
part of speech pairing — John F Kennedy
Word pairs (total and unique) categorized by part of speech (POS)
part of speech pairings - John F Kennedy
noun verb adjective adverb
noun
692 313
  45.2%
379313
verb
127 98
  77.2%
2998
13 12
  92.3%
112
adjective
641 448
  69.9%
193448
9 7
  77.8%
27
48 43
  89.6%
543
adverb
10 9
  90.0%
19
46 37
  80.4%
937
31 27
  87.1%
427
9 8
  88.9%
18

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 9c
unique part of speech pairing — candidate comparison
Unique word pairs categorized by part of speech (POS)
unique part of speech pairings
noun (n) verb (v) adjective (adj) adverb (adv)
noun
238 313
  131.5%
238
313
verb
133 98
  73.7%
133
98
12 12
  100.0%
12
12
adjective
381 448
  117.6%
381
448
3 7
  233.3%
3
7
24 43
  179.2%
24
43
adverb
6 9
  150.0%
6
9
44 37
  84.1%
44
37
25 27
  108.0%
25
27
12 8
  66.7%
12
8

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 9 a,b
legend
a c
  d
3010

a — total number of pairs, for a given category (e.g. verb/noun)

c — number of unique pairs within set (a)

d — (c) relative to (a)

bar — proportion of (a–c):c

Table 9c
legend
a c
  d
50
45

a — unique pairs for Richard Nixon

c — unique pairs for John F Kennedy

d — (c) relative to (a) (i.e. John F Kennedy relative to Richard Nixon)

bars — (a) and (c)

Table 9
commentary

Detailed Part of Speech Tags

You can really get into the weeds here. Parts of speech are counted more granularly in these tables — nouns and verbs are split into classes and many other word types are shown, such as conjunctions and prepositions.

Table 10a
detailed POS tags — nouns and verbs
Count by part of speech tag: NN (noun, singular), NNP (proper noun, singular), NNPS (proper noun, plural), NNS (noun plural), VB (verb, base form), VBD (verb, past tense), VBG (verb, gerund/present participle), VBN (verb, past participle), VBP (verb, sing. present, non-3d), VBZ (verb, 3rd person sing. present)
Penn Treebank part of speech tag
NN NNP NNPS NNS VB VBD VBG VBN VBP VBZ
Richard Nixon
2,201
11.56%
2201
981
5.15%
981
103
0.54%
103
742
3.90%
742
1,074
5.64%
1074
388
2.04%
388
290
1.52%
290
440
2.31%
440
840
4.41%
840
633
3.32%
633
John F Kennedy
2,213
12.07%
2213
1,248
6.81%
1248
171
0.93%
171
732
3.99%
732
924
5.04%
924
457
2.49%
457
242
1.32%
242
365
1.99%
365
758
4.13%
758
498
2.72%
498

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 10b
detailed POS tags — adjectives, pronouns, adverbs and wh-words
Count by part of speech tag: JJ (adjective), JJR (adjective, comparative), JJS (adjective, superlative), PRP (personal pronoun), PRP$ (possessive pronoun), RB (adverb), RBR (adverb, comparative), RBS (adverb, superlative), WDT (wh-determiner), WP (wh-pronoun), WP$ (possessive wh-pronoun), WRB (wh-abverb)
Penn Treebank part of speech tag
JJ JJR JJS PRP PRP$ RB RBR RBS WDT WP WP$ WRB
Richard Nixon
946
4.97%
946
68
0.36%
68
33
0.17%
33
1,633
8.58%
1633
264
1.39%
264
1,152
6.05%
1152
30
0.16%
30
8
0.04%
8
177
0.93%
177
137
0.72%
137
126
0.66%
126
John F Kennedy
1,121
6.11%
1121
72
0.39%
72
29
0.16%
29
1,391
7.59%
1391
276
1.51%
276
926
5.05%
926
21
0.11%
21
12
0.07%
12
134
0.73%
134
99
0.54%
99
1
0.01%
1
65
0.35%
65

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 10c
detailed POS tags — prepositions, conjunctions, determiners and others
Count by part of speech tag: CC (coordinating conjunction), CD (cardinal digit), DT (determiner), EX (existential there), FW (foreign word), IN (preposition/subordinating conjunction), MD (modal), PDT (predeterminer), POS (possessive ending), RP (particle), TO (to), UH (interjection)
Penn Treebank part of speech tag
CC CD DT EX FW IN MD PDT POS RP TO UH
Richard Nixon
583
3.06%
583
174
0.91%
174
2,090
10.98%
2090
57
0.30%
57
1
0.01%
1
2,605
13.68%
2605
480
2.52%
480
26
0.14%
26
69
0.36%
69
99
0.52%
99
579
3.04%
579
9
0.05%
9
John F Kennedy
621
3.39%
621
290
1.58%
290
2,091
11.40%
2091
61
0.33%
61
2,477
13.51%
2477
419
2.29%
419
20
0.11%
20
34
0.19%
34
27
0.15%
27
535
2.92%
535
6
0.03%
6

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 10
legend
a
b
c
10

a — total number of words with a given tag

b — (a) relative to all tagged words

c — (a) relative to number of words with this tag used by Donald Trump

bar — proportion of a

Table 10
commentary

Exclusive and Shared Usage

This section enumerates words that were exclusive to a candidate (e.g. used by one candidate but not the other). This content provides insight into what the candidates' priorities are and reveals differences in perspective on similar topics.

For a given part of speech, the table breaks down the number of words that were spoken by only one of the candidates or both candidates (intersection). The last row includes words spoken by either candidate (union).

Table 11
exclusive word usage
Total and unique words used exclusively by a candidate, or by both.
part of speech
n+v+adj+adv nouns (n) verbs (v) adjectives (adj) adverbs (adv)
Richard Nixon
1,652 893
100.0% 54.1%
10.8% 32.4%
759893
373416208333811335553
789 416
47.8% 52.7%
9.6% 28.2%
373416
373416
541 333
32.7% 61.6%
13.0% 36.4%
208333
208333
214 133
13.0% 62.1%
10.7% 27.8%
81133
81133
108 53
6.5% 49.1%
12.3% 31.2%
5553
5553
John F Kennedy
1,871 957
100.0% 51.1%
12.2% 34.7%
914957
5375201682581251634555
1,057 520
56.5% 49.2%
12.8% 35.3%
537520
537520
426 258
22.8% 60.6%
10.2% 28.2%
168258
168258
288 163
15.4% 56.6%
14.5% 34.1%
125163
125163
100 55
5.3% 55.0%
11.4% 32.4%
4555
4555
both candidates
11,755 909
100.0% 7.7%
76.9% 32.9%
10846909
58064612828265129315359654
6,267 461
53.3% 7.4%
76.0% 31.3%
5806461
5806461
3,093 265
26.3% 8.6%
74.3% 29.0%
2828265
2828265
1,446 153
12.3% 10.6%
72.6% 32.0%
1293153
1293153
650 54
5.5% 8.3%
74.1% 31.8%
59654
59654
total
15,278 2,759
100.0% 18.1%
100.0% 100.0%
125192759
6774147332499141513478707170
8,247 1,473
54.0% 17.9%
100.0% 100.0%
67741473
67741473
4,163 914
27.2% 22.0%
100.0% 100.0%
3249914
3249914
1,991 478
13.0% 24.0%
100.0% 100.0%
1513478
1513478
877 170
5.7% 19.4%
100.0% 100.0%
707170
707170

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 11c
legend
a d
b e
c f
4030
40302015105

a — total number of words in set (e.g. obama \ romney, obama ∩ romney, obama ∪ romney , for a given part of speech

b — (a) relative to all exclusive words in n+v+adj+adv

c — (a) relative to all words in n+v+adj+adv

d — unique words in (a)

e — (d) relative to (a)

f — (d) relative to all unique words in n+v+adj+adv

bar1 — normalized ratio of (a-d):d

bar2 — absolute ratio of (a-d):d for all POS groups (first column) or POS group (other columns)

Table 11
commentary

Pronoun Usage

This section explores pronoun use in detail. Refer to the methods section for details.

Pronoun Count

Fraction of all words that were pronouns.

Table 12a
pronoun fraction
Fraction of words that were pronouns.
speaker all pronouns
Richard Nixon
19,053 1,993
100.0% 10.5%
170601993
3,831 65
20.1% 1.7%
376665
John F Kennedy
18,354 2,081
100.0% 11.3%
162732081
3,091 68
16.8% 2.2%
302368
total
37,407 2,984
100.0% 8.0%
344232984
6,922 74
18.5% 1.1%
684874

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 12b
exclusive and shared pronouns
Pronouns exclusive to speaker (e.g. speaker A but not speaker B) and shared by speakers (speaker A and B).
set word count
Richard Nixon
11 6
0.2% 54.5%
56
John F Kennedy
21 9
0.3% 42.9%
129
both candidates
6,890 59
99.5% 0.9%
683159

Hover over fields with (e.g. 155) to download the corresponding data file.

Pronoun by Person, Gender and Count

Pronoun usage by person (1st, 2nd, 3rd), gender (masculine, feminine, neuter) and count (singular, plural).

Table 13a
Pronoun by person
Count of pronouns by first, second or third person.
pronoun person
all first second third
Richard Nixon
1,907 25
100.0% 1.3%
1094971271714
1,103 9
57.8% 0.8%
10949
73 2
3.8% 2.7%
712
731 14
38.3% 1.9%
71714
John F Kennedy
1,681 26
100.0% 1.5%
11121051349213
1,122 10
66.7% 0.9%
111210
54 3
3.2% 5.6%
513
505 13
30.0% 2.6%
49213

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 13b
Pronoun by gender
Count of pronouns by masculine, feminine or neuter gender.
pronoun gender
all masculine feminine neuter
Richard Nixon
582 9
100.0% 1.5%
2754722913
279 4
47.9% 1.4%
2754
9 2
1.5% 22.2%
72
294 3
50.5% 1.0%
2913
John F Kennedy
369 9
100.0% 2.4%
1264222323
130 4
35.2% 3.1%
1264
4 2
1.1% 50.0%
22
235 3
63.7% 1.3%
2323

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 13c
Pronoun by number
Count of pronouns by singular or plural.
pronoun number
all singular plural
Richard Nixon
3,186 47
100.0% 1.5%
21822795720
2,209 27
69.3% 1.2%
218227
977 20
30.7% 2.0%
95720
John F Kennedy
2,590 48
100.0% 1.9%
15953094718
1,625 30
62.7% 1.8%
159530
965 18
37.3% 1.9%
94718

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 13
legend
a b
c d
153

a — total number of pronouns, by type

b — unique pronouns in (a)

c — (a) as fraction of all pronouns

d — (b) as fraction in (a)

bar — proportion of (a – b):b

Table 13
commentary

First and third person pronouns — a closer look

These tables break pronouns by interesting contrasts. For example, the ratio of singular to plural 1st person pronouns reveals the use of "I/my/myself" vs. "we/our/ours".

Table 14a
1st person pronouns, by count
Count of singular and plural first person pronouns. This table contrasts use of I/my/myself vs. we/our/ours.
pronoun
first first singular first plural
Richard Nixon
1,103 9
100.0% 0.8%
47746175
481 4
43.6% 0.8%
4774
622 5
56.4% 0.8%
6175
John F Kennedy
1,122 10
100.0% 0.9%
49556175
500 5
44.6% 1.0%
4955
622 5
55.4% 0.8%
6175
Table 14b
3rd person pronouns, by count
Count of singular and plural third person pronouns. This table contrasts he/she/his/her/it vs. they/them/theirs.
pronoun
third third singular third plural
Richard Nixon
731 14
100.0% 1.9%
57391445
582 9
79.6% 1.5%
5739
149 5
20.4% 3.4%
1445
John F Kennedy
505 13
100.0% 2.6%
36091324
369 9
73.1% 2.4%
3609
136 4
26.9% 2.9%
1324
Table 14c
Me and you — 1st person singular and second person pronouns
Count of 1st person singular and second person pronouns. This table contrasts me/my/myself vs you/yours/yourself.
pronoun
all 1st singular 2nd
Richard Nixon
554 6
100.0% 1.1%
4774712
481 4
86.8% 0.8%
4774
73 2
13.2% 2.7%
712
John F Kennedy
554 8
100.0% 1.4%
4955513
500 5
90.3% 1.0%
4955
54 3
9.7% 5.6%
513
Table 14d
I, me, myself and my — closer look at 1st person singular pronouns
Count of specific 1st person singular pronouns: I, me, myself and my.
pronoun
all I me myself my
Richard Nixon
476
100.0%
414.00021.0000.00041.000
414
87.0%
414.000
21
4.4%
21.000
0
0.0%
0.000
41
8.6%
41.000
John F Kennedy
498
100.0%
436.00018.0006.00038.000
436
87.6%
436.000
18
3.6%
18.000
6
1.2%
6.000
38
7.6%
38.000
Table 14
legend
a b
c d
153

a — total number of pronouns, by type

b — unique pronouns in (a) (if more than one)

c — (a) as fraction of all pronouns

d — (b) as fraction in (a) (if less than 100%)

bar — proportion of (a – b):b

Table 14
commentary

Pronouns by Category

This table tallies the use of pronoun by category. The categories are personal, demonstrative, indefinite, object, possessive, interrogative, others, relative, reflexive. Note that some pronouns that belong to multiple categories are counted in only one. For a list of pronouns for each category, see the pronoun methods section.

Table 15
Pronouns by cateogry
Count of pronouns by category.
pronoun category
all personal demonstrative indefinite object possessive interrogative others relative reflexive
Richard Nixon
3,831
100.0%
1495.000984.000409.000132.000265.000230.000124.000180.00015.000
1,495
39.0%
14887
984
25.7%
9804
409
10.7%
38524
132
3.4%
1275
265
6.9%
2569
230
6.0%
2255
124
3.2%
1177
180
4.7%
1791
15
0.4%
114
John F Kennedy
3,091
100.0%
1253.000673.000322.000115.000281.000205.000120.00098.00032.000
1,253
40.5%
12467
673
21.8%
6694
322
10.4%
29725
115
3.7%
1105
281
9.1%
2738
205
6.6%
1996
120
3.9%
1137
98
3.2%
971
32
1.0%
266
Table 15
legend
a b
15

a — total number of pronouns, by category

b — (a) as fraction of all pronouns

bar — proportion of (a)

Table 15
commentary

Noun Phrase Usage

Noun phrases were extracted from the text and analyzed for frequency, word count, unique word count and richness. Single-word phrases were not counted.

Top-level noun phrases are those without a parent noun phrase (a parent phrase is one that a similar, longer phrase). Derived noun phrases are those with a parent (more details about noun phrase analysis).

The top-level noun phrases can be interpreted as independent concepts. Derived noun phrases can be interpreted as variants on concepts embodied by the top-level phrases.

Noun Phrase Count and length

This table reports the absolute number of noun phrases, which is related to the number of nouns, and their length.

Table 16a
noun phrase count
Counts of noun phrases in words and per noun.
speaker noun phrase count
all top-level
Richard Nixon
1,103 337
100.0% 30.6%
0.28 0.37
766337
870 330
78.9% 37.9%
0.22 0.36
540330
John F Kennedy
1,349 393
100.0% 29.1%
0.31 0.38
956393
1,077 385
79.8% 35.7%
0.25 0.38
692385

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 16b
noun phrase length
Average and 50%/90% cumulative length of noun phrases, in words.
speaker noun phrase length
all top-level
Richard Nixon
2.11 2 3
2.1092.0003.000
2.13 2 3
2.1342.0003.000
John F Kennedy
2.12 2 3
2.1252.0003.000
2.16 2 3
2.1562.0003.000

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 16a
legend
a d
b e
c f
1070

a — number of noun phrases

b — (a) relative to number of all noun phrases

c — number of noun phrases per noun

d — number of unique phrases

e — (c) relative to (a)

f — number of unique noun phrases per unique noun

bar — normalized ratio of (a–c):c

Table 16b
legend
a b c
102080

a — average noun phrase size, in words

b — largest noun phrase size in 50% of content

c — largest noun phrase size in 90% of content

bar — proportion of a:b:c


Table 16
commentary

Exclusive and Shared Noun Phrase Count and length

Table 17a
exclusive and shared noun phrase count
Counts of exclusive and shared noun phrases in words and per noun.
speaker noun phrase count
all top-level
Richard Nixon
780 304
31.8% 39.0%
476304
699 302
89.6% 43.2%
397302
John F Kennedy
996 358
40.6% 35.9%
638358
914 354
91.8% 38.7%
560354
both candidates
676 63
27.6% 9.3%
61363
334 50
49.4% 15.0%
28450

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 17b
exclusive and shared noun phrase length
Average and 50%/90% cumulative length of noun phrases, in words.
speaker noun phrase length
all top-level
Richard Nixon
2.15 2 3
2.1502.0003.000
2.17 2 3
2.1652.0003.000
John F Kennedy
2.16 2 3
2.1642.0003.000
2.18 2 3
2.1782.0003.000
both candidates
2.01 2 2
2.0132.0002.000
2.02 2 2
2.0212.0002.000

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 17a
legend
a c
b d
1070

a — number of noun phrases

b — (a) relative to number of all noun phrases

c — number of unique phrases

d — (c) relative to (a)

bar — normalized ratio of (a–c):c

Table 17b
legend
a b c
102080

a — average noun phrase size, in words

b — largest noun phrase size in 50% of content

c — largest noun phrase size in 90% of content

bar — proportion of a:b:c


Table 17
commentary

Windbag Index

The Windbag Index is a compound measure that characterizes the complexity of speech. A low index is indicative of succinct speech with low degree of repetition and large number of independent concepts.

Because the index uses the ratio of unique words to all words, it will be larger for longer debates because the fraction of unique words shrinks. Therefore, Windbag Index across debates can only be compared if the number of words is similar.

Table 18
windbag index
Windbag Index for each speaker. The higher the value, the more repetitive the speech.
speaker Windbag Index
index value index terms
Richard Nixon
6,398
+2.6%
6398.08266290185
0.411 0.235 0.230 0.294 0.329 0.243 0.306 0.979
-9.2% +1.5% -3.5% +8.4% +6.9% -9.7% +4.9% -0.0%
0.4109589041095890.2352490421455940.2301265822784810.2942550210182160.328539823008850.2428884026258210.3055303717135090.979228486646884
John F Kennedy
6,233
-2.6%
6233.10513326332
0.453 0.232 0.239 0.272 0.307 0.269 0.291 0.980
+10.2% -1.4% +3.7% -7.7% -6.5% +10.8% -4.6% +0.0%
0.4528168246703720.2318613885212370.2385385152431930.2715133531157270.3072677092916280.2690476190476190.2913269088213490.979643765903308
Table 18
legend
The Windbag Index is 1/(t1*t2*...*t9) where t1,t2,...,t8 are

t1 — fraction of words that are non-stop

t2 — fraction of non-stop words that are unique

t3 — fraction of nouns that are unique

t4 — fraction of verbs that are unique

t5 — fraction of adjectives that are unique

t6 — fraction of adverbs that are unique

t7 — fraction of noun phrases that are unique

t8 — fraction of noun phrases that are top-level


Large individual terms t1...t9 contribute to a smaller index.

The percentage values below the index and each term are relative differences to the other speaker's corresponding term (i.e. 100*(a-b)/b where a is the value for one speaker and b for the other).
Table 18
commentary

Word Clouds

In the word clouds below, the size of the word is proportional to the number of times it was used by a candidate (method details).

Not all words from a group used to draw the cloud fit in the image — less frequently used words for large word groups may fall outside the image.

All Words for Each Candidate

Each candidate's debate portion was extracted and frequencies were compiled for each part of speech (noun, verb, adjective, adverb), with words colored by their part of speech category.

The distribution of sizes within a tag cloud follows the frequency distribution of words. However, word size cannot be compared between clouds, since the minimum and maximum size of the words is fixed.

Debate Word Cloud for Richard Nixon - all words

Debate tag cloud for Richard Nixon
Size proportional to word frequency. Color encodes part of speech: noun verb adjective adverb

Debate Word Cloud for John F Kennedy - all words

Debate tag cloud for John F Kennedy
Size proportional to word frequency. Color encodes part of speech: noun verb adjective adverb
commentary

Exclusive Words for Each Candidate

The clouds below show words used exlusively by a candidate. For example, if candidate A used the word "invest" (any number of times), but candidate B did not, then the word will appear in the exclusive word tag cloud for candidate A.

Words exclusive to Richard Nixon

Debate tag cloud for Richard Nixon
Size proportional to word frequency. Color encodes part of speech: noun verb adjective adverb

Words exclusive to John F Kennedy

Debate tag cloud for John F Kennedy
Size proportional to word frequency. Color encodes part of speech: noun verb adjective adverb
commentary

Pronouns for Each Candidate

Word clouds based on only pronouns.

Pronouns for Richard Nixon

Debate tag cloud for Richard Nixon
Size proportional to word frequency. Color encodes pronoun type: masculine feminine neuter 1st person 2nd person singular plural other

Pronouns for John F Kennedy

Debate tag cloud for John F Kennedy
Size proportional to word frequency. Color encodes pronoun type: masculine feminine neuter 1st person 2nd person singular plural other
commentary

Part of Speech Word Clouds

In these clouds, words from each major part of speech were colored based on whether they were exclusive to a candidate or shared by the candidates.

The size of the word is relative to the frequency for the candidate — word sizes between candidates should not be used to indicate difference in absolute frequency.

Cloud of noun words, by speaker

Words unique to each candidate (Nixon, Kennedy) and those spoken by both.
commentary

Cloud of verb words, by speaker

Words unique to each candidate (Nixon, Kennedy) and those spoken by both.
commentary

Cloud of adjective words, by speaker

Words unique to each candidate (Nixon, Kennedy) and those spoken by both.
commentary

Cloud of adverb words, by speaker

Words unique to each candidate (Nixon, Kennedy) and those spoken by both.
commentary

Cloud of all words, by speaker

Words unique to each candidate (Nixon, Kennedy) and those spoken by both.
commentary

Word Pair Clouds for Each Candidate

Pairs used only once during the debate are not shown.

word pairs for Richard Nixon

JJ/JJ by Richard Nixon
JJ/RB by Richard Nixon
JJ/N by Richard Nixon
JJ/V by Richard Nixon
RB/RB by Richard Nixon
RB/N by Richard Nixon
RB/V by Richard Nixon
N/N by Richard Nixon
N/V by Richard Nixon
V/V by Richard Nixon

word pairs for John F Kennedy

JJ/JJ by John F Kennedy
JJ/RB by John F Kennedy
JJ/N by John F Kennedy
JJ/V by John F Kennedy
RB/RB by John F Kennedy
RB/N by John F Kennedy
RB/V by John F Kennedy
N/N by John F Kennedy
N/V by John F Kennedy
V/V by John F Kennedy
commentary

Downloads

Debate transcript

Parsed word lists and word clouds (word lists, part of speech lists, noun phrases, sentences) (word clouds)

Raw data structure

Please see the methods section for details about these files.