The Flesch-Kincaid grade level for each of the Presidential debates in 1960, 2008, 2012, 2016 and 2020. Things are getting worse, not better.

Word Analysis of 1960 U.S. Presidential Debates

Richard Nixon vs. John F. Kennedy


Introduction

1960 Debate Analysis Results

1st debate (26 September 1960)

2nd debate (7 October 1960)

3rd debate (13 October 1960)

4th debate (21 October 1960)

combined debate (26 September — 21 October 1960)

other debates

2020 Debate Analysis

2020 Trump vs. Biden Debate word analysis

2016 Debate Analysis

2016 Clinton vs. Trump Debate word analysis

2012 Debate Analysis

2012 Obama vs. Romney Debate word analysis

2008 Debate Analysis

2008 Obama vs McCain Debate word analysis

1960 Debate Analysis

1960 Nixon vs. Kennedy Debate word analysis

Other Political Debate Analyses

What Romney's and Obama's Body Language Says to Voters. Watch them cut, point and tilt-and-nod.

He counts your words (even those pronouns), an article in the NYT about Pennebaker's approach to analysis of debates and Al Qaeda communication

Lexical Analysis of Obama's and McCain's Speeches by Jacques Savoy

Presidential word use in State of the Union addresses by Jonathan Corum.

Naming Names, a NYT article about candidates' reference to each other during debates (uses Circos).

Randomly Generated Trump Transcripts

If you want more, get more. The debate continues endlessly with Tripsum: Trump Lorem Ipsum—randomly generated text based transcripts from the 2016 Clinton vs Trump debates.

On these pages, I explore word usage in the 1960 U.S. Presidential debates between Richard Nixon and John F Kennedy.

Impatient? Skip to the full word analysis of the first debate.

Formal debates present a unique opportunity to compare the speech patterns of candidates. The debate's format is controlled — though the debates have been thusfar unruly — and each speaker is subject to the same question (in principle) and is given the same amount of time to respond.

That being said, the dynamics of a debate can be greatly affected by one candidate, who can hijack the conversation and use interruptions to influence their opponent's natural style. Thus, the results of the debate analysis cannot be taken out of the context of the debate.

It's important to stress that this analysis is structural and not semantic. I look in detail of how things are said rather than what is said. However, there is a strong connection between the use of specific words (e.g. pronouns) and the speaker's inner dialogue (Your Use of Pronouns Reveals Your Personality).

I use transcripts from The Commision on Presidential Debates and explore themes such grade level, readability, sentence size, parts of speech usage, pronoun usage, unique and shared words and use of concepts. And I cannot help but draw some word clouds.

The analysis is fully automated and uses the Natural Language Toolkit for tokenizing, tagging and chunking. All data and word lists (tagged and chunked) are available for download in plain-text format — you are welcome to use these files in any manner.

other years

Each year's analysis is a collection of stand-alone pages. For a given year, each of the three Presidential debates and the Vice-Presidential (if available) debate results are structured identically.

Results from other years are available: 2008 2012 2016 2020

Methods

Transcripts by the Washington Post for each debate were parsed to extract sections for each speaker, chunk the text into sentences and words, tag each word with its part of speech (tagging), and identify noun phrases (chunking).

The tagged and chunked transcripts are analyzed to determine

I attempt to quantify the overall complexity and repetition by a metric I call the Windbag Index, which is a product of 8 terms each measuring uniqueness in different aspects of speech (more about Windbag Index).

A full description of each of the steps in the analysis is available in the detailed methods section.

The analysis has some limitations.

Results and Commentary

Each debate analysis report contains a lot of data but is shown in exactly the same format, which should help you with making comparisons between debates. To start, you may find these elements the most interesting

Results are shown in a tabular format. From each table you can download the word list used to generate it. This makes it easy to, for example, grab all the adjectives used by Richard Nixon or all the verbs that John F Kennedy used that Richard Nixon did not use.

detailed results – tables, word clouds and commentary

Analysis of Richard Nixon vs. John F. Kennedy (1st debate)

Analysis of Richard Nixon vs. John F. Kennedy (2nd debate)

Analysis of Richard Nixon vs. John F. Kennedy (3rd debate)

Analysis of Richard Nixon vs. John F. Kennedy (4th debate)

Analysis of Richard Nixon vs. John F. Kennedy (combined debates)

Visualizing the Debates

Each debate is visualized using tables and word clouds — there's obviously a ton more than can be done. The word clouds visually show the words and their frequency and tables provide detailed statistics. You can download each word list directly from the tables.

tables & basic word clouds

Word usage tables describe the structural characteristics of speech by frequency of words, sentence size, proportion of unique and exclusive words and breakdown of words by part-of-speech • see example
Word cloud for Richard Nixon, categorized by parts of speech.
Word clouds, categorized by ownership.

Candidates's Word Usage Profiles

Below are a few of the tables available in the full results section.

Readability and Grade Level

The Flesch–Kincaid readability tests are designed to indicate how difficult a passage in English is to understand. There are two tests, the Flesch Reading Ease, and the Flesch–Kincaid Grade Level.

Table 2a
readability
Flesch-Kincaid reading ease and grade level.
speaker grade level reading ease sections sentences words syllables
Richard Nixon
9.78
0.0%
9.78
63.17
0.0%
63.17
12
0.0%
12
224
0.0%
224
4,810
0.0%
4810
6,929
0.0%
6929
John F Kennedy
9.91
0.0%
9.91
62.16
0.0%
62.16
17
0.0%
17
226
0.0%
226
4,839
0.0%
4839
7,032
0.0%
7032

Hover over fields with (e.g. 155) to download the corresponding data file.

Sentence Size

Sentence size with and without stop words.

Table 3
sentence size
Number of sentences spoken by each speaker and sentence word count statistics. Number of words in a sentence is shown by average and 50%/90% cumulative values for all, stop and non-stop words.
speaker number of sentences sentence size
all words stop words non-stop words
Richard Nixon
224
224
21.2 29 59
21.17929.00059.000
12.5 17 35
12.50417.00035.000
8.7 13 25
8.67413.00025.000
John F Kennedy
226
226
21.1 28 49
21.15028.00049.000
11.7 16 29
11.65916.00029.000
9.5 13 24
9.49113.00024.000
total
450
450
23.2 30 58
23.16430.00058.000
14.1 17 33
14.08017.00033.000
11.1 14 26
11.08414.00026.000

Hover over fields with (e.g. 155) to download the corresponding data file.

Part of Speech

Total and unique nouns, verbs, adjectives and adverbs. The parts of speech are identified by their Penn Treebank tags.

Table 7
part of speech count
Count of words categorized by part of speech (POS).
part of speech
n+v+adj+adv nouns (n) verbs (v) adjectives (adj) adverbs (adv)
Richard Nixon
1,843 704
38.8% 38.2%
6643352682421171205245
999 335
54.2% 33.5%
664335
510 242
27.7% 47.5%
268242
237 120
12.9% 50.6%
117120
97 45
5.3% 46.4%
5245
John F Kennedy
2,023 783
42.3% 38.7%
6824053032221701464550
1,087 405
53.7% 37.3%
682405
525 222
26.0% 42.3%
303222
316 146
15.6% 46.2%
170146
95 50
4.7% 52.6%
4550
total
3,866 1,176
40.6% 30.4%
149958765837734121211478
2,086 587
54.0% 28.1%
1499587
1,035 377
26.8% 36.4%
658377
553 212
14.3% 38.3%
341212
192 78
5.0% 40.6%
11478

Hover over fields with (e.g. 155) to download the corresponding data file.

Pronoun usage

English has many pronouns. Here is an accounting of pronoun use by 1st (e.g. I, we, our), 2nd (e.g. you, yours) or 3rd (e.g. he, she, his, them) person.

Table 13a
Pronoun by person
Count of pronouns by first, second or third person.
pronoun person
all first second third
Richard Nixon
468 21
100.0% 4.5%
251819217711
259 8
55.3% 3.1%
2518
21 2
4.5% 9.5%
192
188 11
40.2% 5.9%
17711
John F Kennedy
448 21
100.0% 4.7%
280924212310
289 9
64.5% 3.1%
2809
26 2
5.8% 7.7%
242
133 10
29.7% 7.5%
12310

Hover over fields with (e.g. 155) to download the corresponding data file.

Pronoun contrasts

These tables break pronouns by interesting contrasts. For example, the ratio of singular to plural 1st person pronouns reveals the use of "I/my/myself" vs. "we/our/ours".

Table 14a
1st person pronouns, by count
Count of singular and plural first person pronouns. This table contrasts use of I/my/myself vs. we/our/ours.
pronoun
first first singular first plural
Richard Nixon
259 8
100.0% 3.1%
12941224
133 4
51.4% 3.0%
1294
126 4
48.6% 3.2%
1224
John F Kennedy
289 9
100.0% 3.1%
14441365
148 4
51.2% 2.7%
1444
141 5
48.8% 3.5%
1365
Table 14b
3rd person pronouns, by count
Count of singular and plural third person pronouns. This table contrasts he/she/his/her/it vs. they/them/theirs.
pronoun
third third singular third plural
Richard Nixon
188 11
100.0% 5.9%
1437344
150 7
79.8% 4.7%
1437
38 4
20.2% 10.5%
344
John F Kennedy
133 10
100.0% 7.5%
896344
95 6
71.4% 6.3%
896
38 4
28.6% 10.5%
344
Table 14c
Me and you — 1st person singular and second person pronouns
Count of 1st person singular and second person pronouns. This table contrasts me/my/myself vs you/yours/yourself.
pronoun
all 1st singular 2nd
Richard Nixon
154 6
100.0% 3.9%
1294192
133 4
86.4% 3.0%
1294
21 2
13.6% 9.5%
192
John F Kennedy
174 6
100.0% 3.4%
1444242
148 4
85.1% 2.7%
1444
26 2
14.9% 7.7%
242
Table 14d
I, me, myself and my — closer look at 1st person singular pronouns
Count of specific 1st person singular pronouns: I, me, myself and my.
pronoun
all I me myself my
Richard Nixon
129
100.0%
112.0002.0000.00015.000
112
86.8%
112.000
2
1.6%
2.000
0
0.0%
0.000
15
11.6%
15.000
John F Kennedy
147
100.0%
133.0002.0000.00012.000
133
90.5%
133.000
2
1.4%
2.000
0
0.0%
0.000
12
8.2%
12.000

Windbag Index

The Windbag Index is a compound measure that characterizes the complexity of speech. A low index is indicative of succinct speech with low degree of repetition and large number of independent concepts. A large number suggests a stream of repeating words.

Table 18
windbag index
Windbag Index for each speaker. The higher the value, the more repetitive the speech.
speaker Windbag Index
index value index terms
Richard Nixon
379
+4.7%
379.627086416997
0.410 0.375 0.335 0.475 0.506 0.464 0.474 0.969
-8.7% -2.0% -10.0% +12.2% +9.6% -11.9% +9.4% +0.0%
0.4095699831365940.3746783324755530.3353353353353350.4745098039215690.5063291139240510.4639175257731960.4740740740740740.96875
John F Kennedy
362
-4.5%
362.600221669188
0.449 0.382 0.373 0.423 0.462 0.526 0.433 0.969
+9.6% +2.0% +11.1% -10.9% -8.8% +13.5% -8.6% -0.0%
0.4487447698744770.3822843822843820.3725850965961360.4228571428571430.4620253164556960.5263157894736840.4332425068119890.968553459119497

Word Clouds

Word clouds below are colored by part of speech:   noun   verb   adjective   adverb  

Words exclusive to Richard Nixon (not spoken by John F Kennedy) in the first debate, colored by part of speech.
Words exclusive to John F Kennedy (not spoken by Richard Nixon) in the first debate, colored by part of speech.

Word clouds below are colored by speaker:   Nixon   Kennedy   both  

All nouns in debates, colored by contributing speaker.
All verbs in debates, colored by contributing speaker.

Downloads

Content of word list archive and data structure syntax is described in the methods section.

Richard Nixon vs. John F. Kennedy (1st debate) transcript word lists and tag clouds data structure

Richard Nixon vs. John F. Kennedy (2nd debate) transcript word lists and tag clouds data structure

Richard Nixon vs. John F. Kennedy (3rd debate) transcript word lists and tag clouds data structure

Richard Nixon vs. John F. Kennedy (4th debate) transcript word lists and tag clouds data structure

Richard Nixon vs. John F. Kennedy (combined debates) transcript word lists and tag clouds data structure

updates

7 Oct 2020. Added Nixon vs Kennedy debates