home > results and commentary > Trump vs. Biden (combined)

Word Analysis of 2020 U.S. Presidential Debates

Donald Trump vs. Joe Biden (combined debates)

29 September — 22 October 2020



Introduction

Speaking Turns and Interruptions

Here, I look at the length of each turn of uninterrupted speech.

Table 1
length of sections in words
The number of uninterrupted deliveries (sections), mode/median/mean length of sections in words, and the shortest section length in words that composed 10%, 50% and 90% of the debate.
speaker sections section length debate contiguity (L10 L50 L90)
Donald Trump
705
705
4.0 11.0 32.3
4.00011.0000000032.338
13 89 169
13.00089.000169.000
Joe Biden
502
502
2.0 15.0 47.4
2.00015.0000000047.380
38 112 182
38.000112.000182.000

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 1
legend
a b c
51025

a — section length (mode), shortest section length in 10% of debate

b — section length (median), shortest section length in 50% of debate

c — section length (mean), shortest section length in 90% of debate

bar — proportion of a:b:c

Table 1
commentary

Flesch-Kincaid Reading Ease and Grade Level

The Flesch-Kincaid reading ease and grade level metrics are designed to indicate how difficult a passage in English is to understand.

This metric does not take repetition into account. A grade level 10 sentence that is repeated 100 times still generates the same metrics because the words per sentence and syllables per word remain constant. To measure how many times a speaker repeats themselves, I use my Windbag Index, below.

Reading ease ranges from 100 (easiest) down to 0 (hardest) and can be interpreted as follows

100 –905th gradeVery easy to read. Easily understood by an average 11-year-old student.
90 – 806th gradeEasy to read. Conversational English for consumers.
80 – 707th gradeFairly easy to read.
70 – 608th & 9th gradePlain English. Easily understood by 13- to 15-year-old students.
60 – 5010th to 12th gradeFairly difficult to read.
50 – 30collegeDifficult to read.
30 – 10college graduateVery difficult to read. Best understood by college/university graduates.
10 – 0professionalExtremely difficult to read. Best understood by college/university graduates.

The grade level corresponds roughly to a U.S. grade level. It has a minimum value of –3.4 and no upper bound.

Two sets of readability scores are calculated. One for the entire debate and one that only considers section with at least 9 words.

Table 2a
readability — entire debate
Flesch-Kincaid reading ease and grade level.
speaker grade level reading ease sections sentences words syllables
Donald Trump
3.84
100.0%
3.84
84.69
100.0%
84.69
705
100.0%
705
2,367
100.0%
2367
22,798
100.0%
22798
30,281
100.0%
30281
Joe Biden
5.45
141.9%
5.45
78.63
92.8%
78.63
502
71.2%
502
1,873
79.1%
1873
23,785
104.3%
23785
32,422
107.1%
32422

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 2b
readability — excluding short sections
Flesch-Kincaid reading ease and grade level for sections with at least 9 words.
speaker grade level reading ease sections sentences words syllables
Donald Trump
4.16
100.0%
4.16
83.70
100.0%
83.70
471
100.0%
471
2,115
100.0%
2115
21,885
100.0%
21885
29,137
100.0%
29137
Joe Biden
5.89
141.6%
5.89
77.33
92.4%
77.33
313
66.5%
313
1,678
79.3%
1678
23,100
105.6%
23100
31,546
108.3%
31546

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 2
legend
a
b
30

a — value for candidate

b — value relative to Donald Trump

bar — proportion of a

Table 2
commentary

Sentence Size

Table 3
sentence size
Number of sentences spoken by each speaker and sentence word count statistics. Number of words in a sentence is shown by average and 50%/90% cumulative values for all, stop and non-stop words.
speaker number of sentences sentence size
all stop non-stop
Donald Trump
2,370
2370
9.7 13 30
9.71113.00030.000
5.6 8 18
5.6498.00018.000
4.1 6 13
4.0626.00013.000
Joe Biden
1,874
1874
12.8 19 40
12.81519.00040.000
7.4 11 24
7.39311.00024.000
5.4 8 18
5.4228.00018.000
total
4,244
4244
13.1 16 36
13.08216.00036.000
8.4 10 22
8.41910.00022.000
6.7 8 17
6.6628.00017.000

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 3
legend
a b c
51025

a — average sentence size

b — largest sentence size for 50% of content

c — largest sentence size for 90% of content

bar — proportion of a:b:c

Table 3
commentary

Word Statistics

Debate Word Count

Summary Word Count

The summary word count reports the total number of words and the number of unique, non-stop words used by each candidate. Word number is expressed as both absolute and relative values.

Table 4a
all words
Number of all words and unique words used by each speaker.
set word count
Donald Trump
23,014 1,998
48.9% 8.7%
210161998
Joe Biden
24,016 2,547
51.1% 10.6%
214692547
total
47,030 3,408
100.0% 7.2%
436223408

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 4b
exclusive and shared words
Words exclusive to speaker (e.g. speaker A but not speaker B) and shared by speakers (speaker A and B).
set word count
Donald Trump
1,746 861
7.6% 49.3%
885861
Joe Biden
2,577 1,410
10.7% 54.7%
11671410
both candidates
42,707 1,137
90.8% 2.7%
415701137

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 4
legend
a c
b d
3010

a — word count

b — word count, as fraction in total in debate

c — unique words in (a)

d — unique words in (a), as fraction in (a)

bar — proportion of (a-c):c

Table 4
commentary

Stop Word Contribution

In the table below, the candidates' delivery is partitioned into stop and non-stop words. Stop words (full list) are frequently-used bridging words (e.g. pronouns and conjunctions) whose meaning depends entirely on context. The fraction of words that are stop words is one measure of the complexity of speech.

Table 5a
non-stop words
Counts of stop and non-stop words.
speaker all words stop words non-stop words
Donald Trump
23,014 1,998
100.0% 8.7%
210161998
13,388 147
58.2% 1.1%
13241147
9,626 1,851
41.8% 19.2%
77751851
Joe Biden
24,016 2,547
100.0% 10.6%
214692547
13,855 154
57.7% 1.1%
13701154
10,161 2,393
42.3% 23.6%
77682393
total
47,030 3,408
100.0% 7.2%
436223408
27,243 160
57.9% 0.6%
27083160
19,787 3,248
42.1% 16.4%
165393248

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 5b
exclusive and shared non-stop words
Non-stop words exclusive to speaker (e.g. speaker A but not speaker B) and shared by speakers (speaker A and B).
set word count
Donald Trump
1,740 855
18.1% 49.1%
885855
Joe Biden
2,537 1,397
25.0% 55.1%
11401397
both candidates
15,510 996
78.4% 6.4%
14514996

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 5
legend
a c
b d
3010

a — total number of words, for a given category (all, stop, non-stop)

b — (a) relative to words in the debate if category=all, otherwise relative to words by the candidate

c — number of unique words with set (a)

d — (c) relative to (a)

bar — proportion of (a-c):c

Table 5
commentary

Word frequency

The word frequency table summarizes the frequency with which words were used. I show the average word frequency and the weighted cumulative frequencies at 50 and 90 percentile. The average word frequency indicates how many times, on average, a word is used. For a given fraction of the entire delivery, the weighted cumulative frequency indicates the largest word frequency within this fraction (details about weighted cumulative distribution).

Table 6a
word use frequency
Average and 50%/90% percentile word frequencies.
speaker word frequency
all stop non-stop
Donald Trump
11.5 92 648
11.51992.000648.000
91.1 295 733
91.075295.000733.000
5.2 12 111
5.20012.000111.000
Joe Biden
9.4 86 642
9.42986.000642.000
90.0 242 869
89.968242.000869.000
4.2 10 90
4.24610.00090.000
total
13.8 163 1,297
13.800163.0001297.000
170.3 539 1,460
170.269539.0001460.000
6.1 19 165
6.09219.000165.000

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 6b
exclusive and shared non-stop word use frequency
Average and 50%/90% cumulative percentile word frequencies. Non-stop words exclusive to speaker (e.g. speaker A but not speaker B) and shared by speakers (speaker A and B).
set word frequency
Donald Trump
2.04 3 9
2.0353.0009.000
Joe Biden
1.82 2 7
1.8162.0007.000
total
6.09 19 165
6.09219.000165.000

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 6
legend
a b c
51025

a — average word frequency

b — largest word frequency in 50% of content

c — largest word frequency in 90% of content

bar — proportion of a:b:c

Table 6
commentary

All further word use statistics represent content that has been filtered for stop words, unless explicitly indicated.

Part of Speech Analysis

In this section, word frequency is broken down by their part of speech (POS). The four POS groups examined are nouns, verbs, adjectives and adverbs. Conjunctions and prepositions are not considered. The first category (n+v+adj+adv) is composed of all four POS groups.

Part of Speech Count

Table 7
part of speech count
Count of words categorized by part of speech (POS).
part of speech
n+v+adj+adv nouns (n) verbs (v) adjectives (adj) adverbs (adv)
Donald Trump
8,945 1,738
38.9% 19.4%
29159752796522784283554116
3,890 975
43.5% 25.1%
2915975
3,318 522
37.1% 15.7%
2796522
1,067 283
11.9% 26.5%
784283
670 116
7.5% 17.3%
554116
Joe Biden
9,494 2,272
39.5% 23.9%
323913002623723776345372116
4,539 1,300
47.8% 28.6%
32391300
3,346 723
35.2% 21.6%
2623723
1,121 345
11.8% 30.8%
776345
488 116
5.1% 23.8%
372116
total
18,439 3,072
39.2% 16.7%
6633179657169481700488978180
8,429 1,796
45.7% 21.3%
66331796
6,664 948
36.1% 14.2%
5716948
2,188 488
11.9% 22.3%
1700488
1,158 180
6.3% 15.5%
978180

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 7
legend
a c
b d
1535

a — total number of words for a given POS (all, noun, verb, adjective, adverb)

b — (a) relative to all words by candidate

c — unique words in (a)

d — (c) relative to (a)

bar — proportion of (a-c):c

Table 7
commentary

Part of Speech Frequency

Table 8
part of speech frequency
Frequency of words categorized by part of speech (POS).
part of speech frequency
n+v+adj+adv nouns (n) verbs (v) adjectives (adj) adverbs (adv) pronouns (pro)
Donald Trump
5.15 12 111
5.14712.000111.000
3.99 8 52
3.9908.00052.000
6.36 24 349
6.35624.000349.000
3.77 7 40
3.7707.00040.000
5.78 17 51
5.77617.00051.000
81.08 402 802
81.078402.000802.000
Joe Biden
4.18 10 90
4.17910.00090.000
3.49 7 55
3.4927.00055.000
4.63 14 396
4.62814.000396.000
3.25 6 51
3.2496.00051.000
4.21 11 71
4.20711.00071.000
68.55 334 495
68.551334.000495.000
total
6.00 18 178
6.00218.000178.000
4.69 12 85
4.69312.00085.000
7.03 34 745
7.03034.000745.000
4.48 10 59
4.48410.00059.000
6.43 21 129
6.43321.000129.000
132.25 819 1,297
132.253819.0001297.000

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 8
legend
a b c
51025

a — average word frequency

b — largest word frequency in 50% of content

c — largest word frequency in 90% of content

bar — proportion of a:b:c

Table 8
commentary

Part of Speech Pairing

Through word pairing, I extract concepts from the text. The number of unique word pairs is a function of sentence length and is one of the measures of complexity.

Table 9a
part of speech pairing — Donald Trump
Word pairs (total and unique) categorized by part of speech (POS)
part of speech pairings - Donald Trump
noun verb adjective adverb
noun
418 228
  54.5%
190228
verb
175 139
  79.4%
36139
16 15
  93.8%
115
adjective
575 399
  69.4%
176399
2 2
  100.0%
02
33 27
  81.8%
627
adverb
9 9
  100.0%
09
118 81
  68.6%
3781
39 27
  69.2%
1227
23 14
  60.9%
914

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 9b
part of speech pairing — Joe Biden
Word pairs (total and unique) categorized by part of speech (POS)
part of speech pairings - Joe Biden
noun verb adjective adverb
noun
531 360
  67.8%
171360
verb
214 177
  82.7%
37177
29 28
  96.6%
128
adjective
604 447
  74.0%
157447
15 9
  60.0%
69
32 23
  71.9%
923
adverb
8 8
  100.0%
08
84 71
  84.5%
1371
25 23
  92.0%
223
10 9
  90.0%
19

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 9c
unique part of speech pairing — candidate comparison
Unique word pairs categorized by part of speech (POS)
unique part of speech pairings
noun (n) verb (v) adjective (adj) adverb (adv)
noun
228 360
  157.9%
228
360
verb
139 177
  127.3%
139
177
15 28
  186.7%
15
28
adjective
399 447
  112.0%
399
447
2 9
  450.0%
2
9
27 23
  85.2%
27
23
adverb
9 8
  88.9%
9
8
81 71
  87.7%
81
71
27 23
  85.2%
27
23
14 9
  64.3%
14
9

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 9 a,b
legend
a c
  d
3010

a — total number of pairs, for a given category (e.g. verb/noun)

c — number of unique pairs within set (a)

d — (c) relative to (a)

bar — proportion of (a–c):c

Table 9c
legend
a c
  d
50
45

a — unique pairs for Donald Trump

c — unique pairs for Joe Biden

d — (c) relative to (a) (i.e. Joe Biden relative to Donald Trump)

bars — (a) and (c)

Table 9
commentary

Detailed Part of Speech Tags

You can really get into the weeds here. Parts of speech are counted more granularly in these tables — nouns and verbs are split into classes and many other word types are shown, such as conjunctions and prepositions.

Table 10a
detailed POS tags — nouns and verbs
Count by part of speech tag: NN (noun, singular), NNP (proper noun, singular), NNPS (proper noun, plural), NNS (noun plural), VB (verb, base form), VBD (verb, past tense), VBG (verb, gerund/present participle), VBN (verb, past participle), VBP (verb, sing. present, non-3d), VBZ (verb, 3rd person sing. present)
Penn Treebank part of speech tag
NN NNP NNPS NNS VB VBD VBG VBN VBP VBZ
Donald Trump
2,208
9.59%
2208
943
4.10%
943
30
0.13%
30
919
3.99%
919
1,226
5.33%
1226
998
4.34%
998
499
2.17%
499
413
1.79%
413
1,459
6.34%
1459
727
3.16%
727
Joe Biden
2,853
11.88%
129.2%
2853
796
3.31%
84.4%
796
58
0.24%
193.3%
58
987
4.11%
107.4%
987
1,425
5.93%
116.2%
1425
748
3.11%
74.9%
748
697
2.90%
139.7%
697
444
1.85%
107.5%
444
1,157
4.82%
79.3%
1157
980
4.08%
134.8%
980

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 10b
detailed POS tags — adjectives, pronouns, adverbs and wh-words
Count by part of speech tag: JJ (adjective), JJR (adjective, comparative), JJS (adjective, superlative), PRP (personal pronoun), PRP$ (possessive pronoun), RB (adverb), RBR (adverb, comparative), RBS (adverb, superlative), WDT (wh-determiner), WP (wh-pronoun), WP$ (possessive wh-pronoun), WRB (wh-abverb)
Penn Treebank part of speech tag
JJ JJR JJS PRP PRP$ RB RBR RBS WDT WP WP$ WRB
Donald Trump
1,045
4.54%
1045
76
0.33%
76
68
0.30%
68
3,379
14.68%
3379
247
1.07%
247
1,637
7.11%
1637
34
0.15%
34
7
0.03%
7
110
0.48%
110
188
0.82%
188
176
0.76%
176
Joe Biden
1,148
4.78%
109.9%
1148
99
0.41%
130.3%
99
35
0.15%
51.5%
35
2,535
10.56%
75.0%
2535
297
1.24%
120.2%
297
1,338
5.57%
81.7%
1338
20
0.08%
58.8%
20
4
0.02%
57.1%
4
115
0.48%
104.5%
115
313
1.30%
166.5%
313
224
0.93%
127.3%
224

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 10c
detailed POS tags — prepositions, conjunctions, determiners and others
Count by part of speech tag: CC (coordinating conjunction), CD (cardinal digit), DT (determiner), EX (existential there), FW (foreign word), IN (preposition/subordinating conjunction), MD (modal), PDT (predeterminer), POS (possessive ending), RP (particle), TO (to), UH (interjection)
Penn Treebank part of speech tag
CC CD DT EX FW IN MD PDT POS RP TO UH
Donald Trump
937
4.07%
937
369
1.60%
369
1,944
8.45%
1944
52
0.23%
52
3
0.01%
3
2,058
8.94%
2058
406
1.76%
406
14
0.06%
14
41
0.18%
41
162
0.70%
162
591
2.57%
591
48
0.21%
48
Joe Biden
800
3.33%
85.4%
800
421
1.75%
114.1%
421
2,353
9.80%
121.0%
2353
101
0.42%
194.2%
101
1
0.00%
33.3%
1
2,514
10.47%
122.2%
2514
389
1.62%
95.8%
389
39
0.16%
278.6%
39
51
0.21%
124.4%
51
176
0.73%
108.6%
176
869
3.62%
147.0%
869
29
0.12%
60.4%
29

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 10
legend
a
b
c
10

a — total number of words with a given tag

b — (a) relative to all tagged words

c — (a) relative to number of words with this tag used by Donald Trump

bar — proportion of a

Table 10
commentary

Exclusive and Shared Usage

This section enumerates words that were exclusive to a candidate (e.g. used by one candidate but not the other). This content provides insight into what the candidates' priorities are and reveals differences in perspective on similar topics.

For a given part of speech, the table breaks down the number of words that were spoken by only one of the candidates or both candidates (intersection). The last row includes words spoken by either candidate (union).

Table 11
exclusive word usage
Total and unique words used exclusively by a candidate, or by both.
part of speech
n+v+adj+adv nouns (n) verbs (v) adjectives (adj) adverbs (adv)
Donald Trump
1,641 800
100.0% 48.8%
8.9% 26.0%
841800
4354561561981621245555
891 456
54.3% 51.2%
10.6% 25.4%
435456
435456
354 198
21.6% 55.9%
5.3% 20.9%
156198
156198
286 124
17.4% 43.4%
13.1% 25.4%
162124
162124
110 55
6.7% 50.0%
9.5% 30.6%
5555
5555
Joe Biden
2,411 1,334
100.0% 55.3%
13.1% 43.4%
10771334
5977632673971161813159
1,360 763
56.4% 56.1%
16.1% 42.5%
597763
597763
664 397
27.5% 59.8%
10.0% 41.9%
267397
267397
297 181
12.3% 60.9%
13.6% 37.1%
116181
116181
90 59
3.7% 65.6%
7.8% 32.8%
3159
3159
both candidates
14,387 938
100.0% 6.5%
78.0% 30.5%
13449938
55264795212297136814088552
6,005 479
41.7% 8.0%
71.2% 26.7%
5526479
5526479
5,509 297
38.3% 5.4%
82.7% 31.3%
5212297
5212297
1,508 140
10.5% 9.3%
68.9% 28.7%
1368140
1368140
937 52
6.5% 5.5%
80.9% 28.9%
88552
88552
total
18,439 3,072
100.0% 16.7%
100.0% 100.0%
153673072
6633179657169481700488978180
8,429 1,796
45.7% 21.3%
100.0% 100.0%
66331796
66331796
6,664 948
36.1% 14.2%
100.0% 100.0%
5716948
5716948
2,188 488
11.9% 22.3%
100.0% 100.0%
1700488
1700488
1,158 180
6.3% 15.5%
100.0% 100.0%
978180
978180

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 11c
legend
a d
b e
c f
4030
40302015105

a — total number of words in set (e.g. obama \ romney, obama ∩ romney, obama ∪ romney , for a given part of speech

b — (a) relative to all exclusive words in n+v+adj+adv

c — (a) relative to all words in n+v+adj+adv

d — unique words in (a)

e — (d) relative to (a)

f — (d) relative to all unique words in n+v+adj+adv

bar1 — normalized ratio of (a-d):d

bar2 — absolute ratio of (a-d):d for all POS groups (first column) or POS group (other columns)

Table 11
commentary

Pronoun Usage

This section explores pronoun use in detail. Refer to the methods section for details.

Pronoun Count

Fraction of all words that were pronouns.

Table 12a
pronoun fraction
Fraction of words that were pronouns.
speaker all pronouns
Donald Trump
23,014 1,998
100.0% 8.7%
210161998
5,189 64
22.5% 1.2%
512564
Joe Biden
24,016 2,547
100.0% 10.6%
214692547
4,730 69
19.7% 1.5%
466169
total
47,030 3,408
100.0% 7.2%
436223408
9,919 75
21.1% 0.8%
984475

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 12b
exclusive and shared pronouns
Pronouns exclusive to speaker (e.g. speaker A but not speaker B) and shared by speakers (speaker A and B).
set word count
Donald Trump
6 6
0.1% 100.0%
06
Joe Biden
33 11
0.3% 33.3%
2211
both candidates
9,880 58
99.6% 0.6%
982258

Hover over fields with (e.g. 155) to download the corresponding data file.

Pronoun by Person, Gender and Count

Pronoun usage by person (1st, 2nd, 3rd), gender (masculine, feminine, neuter) and count (singular, plural).

Table 13a
Pronoun by person
Count of pronouns by first, second or third person.
pronoun person
all first second third
Donald Trump
3,631 24
100.0% 0.7%
1475106962143612
1,485 10
40.9% 0.7%
147510
698 2
19.2% 0.3%
6962
1,448 12
39.9% 0.8%
143612
Joe Biden
2,832 22
100.0% 0.8%
110574863121912
1,112 7
39.3% 0.6%
11057
489 3
17.3% 0.6%
4863
1,231 12
43.5% 1.0%
121912

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 13b
Pronoun by gender
Count of pronouns by masculine, feminine or neuter gender.
pronoun gender
all masculine feminine neuter
Donald Trump
986 8
100.0% 0.8%
29146026272
295 4
29.9% 1.4%
2914
62 2
6.3% 3.2%
602
629 2
63.8% 0.3%
6272
Joe Biden
865 8
100.0% 0.9%
42932424043
432 3
49.9% 0.7%
4293
26 2
3.0% 7.7%
242
407 3
47.1% 0.7%
4043

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 13c
Pronoun by number
Count of pronouns by singular or plural.
pronoun number
all singular plural
Donald Trump
4,012 46
100.0% 1.1%
273728122918
2,765 28
68.9% 1.0%
273728
1,247 18
31.1% 1.4%
122918
Joe Biden
3,547 44
100.0% 1.2%
242127108217
2,448 27
69.0% 1.1%
242127
1,099 17
31.0% 1.5%
108217

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 13
legend
a b
c d
153

a — total number of pronouns, by type

b — unique pronouns in (a)

c — (a) as fraction of all pronouns

d — (b) as fraction in (a)

bar — proportion of (a – b):b

Table 13
commentary

First and third person pronouns — a closer look

These tables break pronouns by interesting contrasts. For example, the ratio of singular to plural 1st person pronouns reveals the use of "I/my/myself" vs. "we/our/ours".

Table 14a
1st person pronouns, by count
Count of singular and plural first person pronouns. This table contrasts use of I/my/myself vs. we/our/ours.
pronoun
first first singular first plural
Donald Trump
1,485 10
100.0% 0.7%
94955265
954 5
64.2% 0.5%
9495
531 5
35.8% 0.9%
5265
Joe Biden
1,112 7
100.0% 0.6%
62834774
631 3
56.7% 0.5%
6283
481 4
43.3% 0.8%
4774
Table 14b
3rd person pronouns, by count
Count of singular and plural third person pronouns. This table contrasts he/she/his/her/it vs. they/them/theirs.
pronoun
third third singular third plural
Donald Trump
1,448 12
100.0% 0.8%
97884584
986 8
68.1% 0.8%
9788
462 4
31.9% 0.9%
4584
Joe Biden
1,231 12
100.0% 1.0%
85783624
865 8
70.3% 0.9%
8578
366 4
29.7% 1.1%
3624
Table 14c
Me and you — 1st person singular and second person pronouns
Count of 1st person singular and second person pronouns. This table contrasts me/my/myself vs you/yours/yourself.
pronoun
all 1st singular 2nd
Donald Trump
1,652 7
100.0% 0.4%
94956962
954 5
57.7% 0.5%
9495
698 2
42.3% 0.3%
6962
Joe Biden
1,120 6
100.0% 0.5%
62834863
631 3
56.3% 0.5%
6283
489 3
43.7% 0.6%
4863
Table 14d
I, me, myself and my — closer look at 1st person singular pronouns
Count of specific 1st person singular pronouns: I, me, myself and my.
pronoun
all I me myself my
Donald Trump
953
100.0%
802.000113.0001.00037.000
802
84.2%
802.000
113
11.9%
113.000
1
0.1%
1.000
37
3.9%
37.000
Joe Biden
631
100.0%
495.00059.0000.00077.000
495
78.4%
495.000
59
9.4%
59.000
0
0.0%
0.000
77
12.2%
77.000
Table 14
legend
a b
c d
153

a — total number of pronouns, by type

b — unique pronouns in (a) (if more than one)

c — (a) as fraction of all pronouns

d — (b) as fraction in (a) (if less than 100%)

bar — proportion of (a – b):b

Table 14
commentary

Pronouns by Category

This table tallies the use of pronoun by category. The categories are personal, demonstrative, indefinite, object, possessive, interrogative, others, relative, reflexive. Note that some pronouns that belong to multiple categories are counted in only one. For a list of pronouns for each category, see the pronoun methods section.

Table 15
Pronouns by cateogry
Count of pronouns by category.
pronoun category
all personal demonstrative indefinite object possessive interrogative others relative reflexive
Donald Trump
5,189
100.0%
3125.000560.000571.000263.000234.000224.000127.00083.0009.000
3,125
60.2%
31187
560
10.8%
5564
571
11.0%
54625
263
5.1%
2585
234
4.5%
2277
224
4.3%
2204
127
2.4%
1207
83
1.6%
821
9
0.2%
45
Joe Biden
4,730
100.0%
2312.000713.000536.000215.000293.000345.000208.00098.00013.000
2,312
48.9%
23057
713
15.1%
7094
536
11.3%
50927
215
4.5%
2105
293
6.2%
2867
345
7.3%
3387
208
4.4%
2008
98
2.1%
971
13
0.3%
94
Table 15
legend
a b
15

a — total number of pronouns, by category

b — (a) as fraction of all pronouns

bar — proportion of (a)

Table 15
commentary

Noun Phrase Usage

Noun phrases were extracted from the text and analyzed for frequency, word count, unique word count and richness. Single-word phrases were not counted.

Top-level noun phrases are those without a parent noun phrase (a parent phrase is one that a similar, longer phrase). Derived noun phrases are those with a parent (more details about noun phrase analysis).

The top-level noun phrases can be interpreted as independent concepts. Derived noun phrases can be interpreted as variants on concepts embodied by the top-level phrases.

Noun Phrase Count and length

This table reports the absolute number of noun phrases, which is related to the number of nouns, and their length.

Table 16a
noun phrase count
Counts of noun phrases in words and per noun.
speaker noun phrase count
all top-level
Donald Trump
1,031 371
100.0% 36.0%
0.27 0.38
660371
943 362
91.5% 38.4%
0.24 0.37
581362
Joe Biden
1,162 465
100.0% 40.0%
0.26 0.36
697465
1,068 453
91.9% 42.4%
0.24 0.35
615453

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 16b
noun phrase length
Average and 50%/90% cumulative length of noun phrases, in words.
speaker noun phrase length
all top-level
Donald Trump
2.13 2 3
2.1262.0003.000
2.14 2 3
2.1382.0003.000
Joe Biden
2.15 2 3
2.1502.0003.000
2.16 2 3
2.1592.0003.000

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 16a
legend
a d
b e
c f
1070

a — number of noun phrases

b — (a) relative to number of all noun phrases

c — number of noun phrases per noun

d — number of unique phrases

e — (c) relative to (a)

f — number of unique noun phrases per unique noun

bar — normalized ratio of (a–c):c

Table 16b
legend
a b c
102080

a — average noun phrase size, in words

b — largest noun phrase size in 50% of content

c — largest noun phrase size in 90% of content

bar — proportion of a:b:c


Table 16
commentary

Exclusive and Shared Noun Phrase Count and length

Table 17a
exclusive and shared noun phrase count
Counts of exclusive and shared noun phrases in words and per noun.
speaker noun phrase count
all top-level
Donald Trump
800 315
36.5% 39.4%
485315
788 314
98.5% 39.8%
474314
Joe Biden
966 413
44.0% 42.8%
553413
938 408
97.1% 43.5%
530408
both candidates
427 55
19.5% 12.9%
37255
285 40
66.7% 14.0%
24540

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 17b
exclusive and shared noun phrase length
Average and 50%/90% cumulative length of noun phrases, in words.
speaker noun phrase length
all top-level
Donald Trump
2.16 2 3
2.1592.0003.000
2.16 2 3
2.1642.0003.000
Joe Biden
2.18 2 3
2.1762.0003.000
2.18 2 3
2.1802.0003.000
both candidates
2.02 2 2
2.0162.0002.000
2.01 2 2
2.0072.0002.000

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 17a
legend
a c
b d
1070

a — number of noun phrases

b — (a) relative to number of all noun phrases

c — number of unique phrases

d — (c) relative to (a)

bar — normalized ratio of (a–c):c

Table 17b
legend
a b c
102080

a — average noun phrase size, in words

b — largest noun phrase size in 50% of content

c — largest noun phrase size in 90% of content

bar — proportion of a:b:c


Table 17
commentary

Windbag Index

The Windbag Index is a compound measure that characterizes the complexity of speech. A low index is indicative of succinct speech with low degree of repetition and large number of independent concepts.

Unlike the Flesch-Kincaid readability metrics, the Windbag Index does not take into account the length of sentences or complexity (e.g. number of syllables) of individual words.

Table 18
windbag index
Windbag Index for each speaker. The higher the value, the more repetitive the speech.
speaker Windbag Index
index value index terms
Donald Trump
19,556
+243.9%
19556.06384501
0.418 0.192 0.251 0.157 0.265 0.173 0.360 0.976
-1.1% -18.4% -12.5% -27.2% -13.8% -27.2% -10.1% +0.2%
0.4182671417398110.1922917099522130.2506426735218510.1573236889692590.265229615745080.1731343283582090.359844810863240.975741239892183
Joe Biden
5,686
-70.9%
5686.15537130611
0.423 0.236 0.286 0.216 0.308 0.238 0.400 0.974
+1.2% +22.5% +14.3% +37.3% +16.0% +37.3% +11.2% -0.2%
0.4230929380413060.2355083161106190.2864066975104650.2160789001793190.3077609277430870.2377049180327870.4001721170395870.974193548387097
Table 18
legend
The Windbag Index is 1/(t1*t2*...*t9) where t1,t2,...,t8 are

t1 — fraction of words that are non-stop

t2 — fraction of non-stop words that are unique

t3 — fraction of nouns that are unique

t4 — fraction of verbs that are unique

t5 — fraction of adjectives that are unique

t6 — fraction of adverbs that are unique

t7 — fraction of noun phrases that are unique

t8 — fraction of noun phrases that are top-level


Large individual terms t1...t9 contribute to a smaller index.

The percentage values below the index and each term are relative differences to the other speaker's corresponding term (i.e. 100*(a-b)/b where a is the value for one speaker and b for the other).
Table 18
commentary

Word Clouds

In the word clouds below, the size of the word is proportional to the number of times it was used by a candidate (method details).

Not all words from a group used to draw the cloud fit in the image — less frequently used words for large word groups may fall outside the image.

All Words for Each Candidate

Each candidate's debate portion was extracted and frequencies were compiled for each part of speech (noun, verb, adjective, adverb), with words colored by their part of speech category.

The distribution of sizes within a tag cloud follows the frequency distribution of words. However, word size cannot be compared between clouds, since the minimum and maximum size of the words is fixed.

Debate Word Cloud for Donald Trump - all words

Debate tag cloud for Donald Trump
Size proportional to word frequency. Color encodes part of speech: noun verb adjective adverb

Debate Word Cloud for Joe Biden - all words

Debate tag cloud for Joe Biden
Size proportional to word frequency. Color encodes part of speech: noun verb adjective adverb
commentary

Exclusive Words for Each Candidate

The clouds below show words used exlusively by a candidate. For example, if candidate A used the word "invest" (any number of times), but candidate B did not, then the word will appear in the exclusive word tag cloud for candidate A.

Words exclusive to Donald Trump

Debate tag cloud for Donald Trump
Size proportional to word frequency. Color encodes part of speech: noun verb adjective adverb

Words exclusive to Joe Biden

Debate tag cloud for Joe Biden
Size proportional to word frequency. Color encodes part of speech: noun verb adjective adverb
commentary

Pronouns for Each Candidate

Word clouds based on only pronouns.

Pronouns for Donald Trump

Debate tag cloud for Donald Trump
Size proportional to word frequency. Color encodes pronoun type: masculine feminine neuter 1st person 2nd person singular plural other

Pronouns for Joe Biden

Debate tag cloud for Joe Biden
Size proportional to word frequency. Color encodes pronoun type: masculine feminine neuter 1st person 2nd person singular plural other
commentary

Part of Speech Word Clouds

In these clouds, words from each major part of speech were colored based on whether they were exclusive to a candidate or shared by the candidates.

The size of the word is relative to the frequency for the candidate — word sizes between candidates should not be used to indicate difference in absolute frequency.

Cloud of noun words, by speaker

Words unique to each candidate (Trump, Biden) and those spoken by both.
commentary

Cloud of verb words, by speaker

Words unique to each candidate (Trump, Biden) and those spoken by both.
commentary

Cloud of adjective words, by speaker

Words unique to each candidate (Trump, Biden) and those spoken by both.
commentary

Cloud of adverb words, by speaker

Words unique to each candidate (Trump, Biden) and those spoken by both.
commentary

Cloud of all words, by speaker

Words unique to each candidate (Trump, Biden) and those spoken by both.
commentary

Word Pair Clouds for Each Candidate

Pairs used only once during the debate are not shown.

word pairs for Donald Trump

JJ/JJ by Donald Trump
JJ/RB by Donald Trump
JJ/N by Donald Trump
JJ/V by Donald Trump
RB/RB by Donald Trump
RB/N by Donald Trump
RB/V by Donald Trump
N/N by Donald Trump
N/V by Donald Trump
V/V by Donald Trump

word pairs for Joe Biden

JJ/JJ by Joe Biden
JJ/RB by Joe Biden
JJ/N by Joe Biden
JJ/V by Joe Biden
RB/RB by Joe Biden
RB/N by Joe Biden
RB/V by Joe Biden
N/N by Joe Biden
N/V by Joe Biden
V/V by Joe Biden
commentary

Downloads

Debate transcript

Parsed word lists and word clouds (word lists, part of speech lists, noun phrases, sentences) (word clouds)

Raw data structure

Please see the methods section for details about these files.