Word Analysis of 2020 U.S. Presidential Debates

Donald Trump vs. Joe Biden (combined debates)

29 September — 22 October 2020

Introduction

Speaking Turns and Interruptions

Here, I look at the length of each turn of uninterrupted speech.

Table 1

length of sections in words

The number of uninterrupted deliveries (sections), mode/median/mean length of sections in words, and the shortest section length in words that composed 10%, 50% and 90% of the debate.

speaker

sections

section length

debate contiguity (L₁₀ L₅₀ L₉₀)

Donald Trump

705 download text •

4.0	11.0	32.3

13	89	169

Joe Biden

502 download text •

2.0	15.0	47.4

38	112	182

Hover over fields with • (e.g. 155•) to download the corresponding data file.

Table 1

legend

a	b	c

a — section length (mode), shortest section length in 10% of debate

b — section length (median), shortest section length in 50% of debate

c — section length (mean), shortest section length in 90% of debate

bar — proportion of a:b:c

Table 1

commentary

Flesch-Kincaid Reading Ease and Grade Level

The Flesch-Kincaid reading ease and grade level metrics are designed to indicate how difficult a passage in English is to understand.

This metric does not take repetition into account. A grade level 10 sentence that is repeated 100 times still generates the same metrics because the words per sentence and syllables per word remain constant. To measure how many times a speaker repeats themselves, I use my Windbag Index, below.

Reading ease ranges from 100 (easiest) down to 0 (hardest) and can be interpreted as follows

100 –90	5th grade	Very easy to read. Easily understood by an average 11-year-old student.
90 – 80	6th grade	Easy to read. Conversational English for consumers.
80 – 70	7th grade	Fairly easy to read.
70 – 60	8th & 9th grade	Plain English. Easily understood by 13- to 15-year-old students.
60 – 50	10th to 12th grade	Fairly difficult to read.
50 – 30	college	Difficult to read.
30 – 10	college graduate	Very difficult to read. Best understood by college/university graduates.
10 – 0	professional	Extremely difficult to read. Best understood by college/university graduates.

The grade level corresponds roughly to a U.S. grade level. It has a minimum value of –3.4 and no upper bound.

Two sets of readability scores are calculated. One for the entire debate and one that only considers section with at least 9 words.

Table 2a

readability — entire debate

Flesch-Kincaid reading ease and grade level.

speaker

grade level

reading ease

sections

sentences

words

syllables

Donald Trump

3.84 download text •
100.0%

84.69
100.0%

705
100.0%

2,367
100.0%

22,798
100.0%

30,281
100.0%

Joe Biden

5.45 download text •
141.9%

78.63
92.8%

502
71.2%

1,873
79.1%

23,785
104.3%

32,422
107.1%

Hover over fields with • (e.g. 155•) to download the corresponding data file.

Table 2b

readability — excluding short sections

Flesch-Kincaid reading ease and grade level for sections with at least 9 words.

speaker

grade level

reading ease

sections

sentences

words

syllables

Donald Trump

4.16 download text •
100.0%

83.70
100.0%

471
100.0%

2,115
100.0%

21,885
100.0%

29,137
100.0%

Joe Biden

5.89 download text •
141.6%

77.33
92.4%

313
66.5%

1,678
79.3%

23,100
105.6%

31,546
108.3%

Hover over fields with • (e.g. 155•) to download the corresponding data file.

Table 2

legend

a
b

a — value for candidate

b — value relative to Donald Trump

bar — proportion of a

Table 2

commentary

Sentence Size

Table 3

sentence size

Number of sentences spoken by each speaker and sentence word count statistics. Number of words in a sentence is shown by average and 50%/90% cumulative values for all, stop and non-stop words.

speaker

number of sentences

sentence size

all

stop

non-stop

Donald Trump

2,370

9.7 download text •	13	30

5.6 download text •	8	18

4.1 download text •	6	13

Joe Biden

1,874

12.8 download text •	19	40

7.4 download text •	11	24

5.4 download text •	8	18

total

4,244

13.1	16	36

8.4	10	22

6.7	8	17

Hover over fields with • (e.g. 155•) to download the corresponding data file.

Table 3

legend

a	b	c

a — average sentence size

b — largest sentence size for 50% of content

c — largest sentence size for 90% of content

bar — proportion of a:b:c

Table 3

commentary

Word Statistics

Debate Word Count

Summary Word Count

The summary word count reports the total number of words and the number of unique, non-stop words used by each candidate. Word number is expressed as both absolute and relative values.

Table 4a

all words

Number of all words and unique words used by each speaker.

set

word count

Donald Trump

23,014 download text •	1,998
48.9%	8.7%

Joe Biden

24,016 download text •	2,547
51.1%	10.6%

total

47,030	3,408
100.0%	7.2%

Hover over fields with • (e.g. 155•) to download the corresponding data file.

Table 4b

exclusive and shared words

Words exclusive to speaker (e.g. speaker A but not speaker B) and shared by speakers (speaker A and B).

set

word count

Donald Trump

1,746 download text •	861
7.6%	49.3%

Joe Biden

2,577 download text •	1,410
10.7%	54.7%

both candidates

42,707 download text •	1,137
90.8%	2.7%

Hover over fields with • (e.g. 155•) to download the corresponding data file.

Table 4

legend

a	c
b	d

a — word count

b — word count, as fraction in total in debate

c — unique words in (a)

d — unique words in (a), as fraction in (a)

bar — proportion of (a-c):c

Table 4

commentary

Stop Word Contribution

In the table below, the candidates' delivery is partitioned into stop and non-stop words. Stop words (full list) are frequently-used bridging words (e.g. pronouns and conjunctions) whose meaning depends entirely on context. The fraction of words that are stop words is one measure of the complexity of speech.

Table 5a

non-stop words

Counts of stop and non-stop words.

speaker

all words

stop words

non-stop words

Donald Trump

23,014 download text •	1,998
100.0%	8.7%

13,388 download text •	147
58.2%	1.1%

9,626 download text •	1,851
41.8%	19.2%

Joe Biden

24,016 download text •	2,547
100.0%	10.6%

13,855 download text •	154
57.7%	1.1%

10,161 download text •	2,393
42.3%	23.6%

total

47,030	3,408
100.0%	7.2%

27,243	160
57.9%	0.6%

19,787	3,248
42.1%	16.4%

Hover over fields with • (e.g. 155•) to download the corresponding data file.

Table 5b

exclusive and shared non-stop words

Non-stop words exclusive to speaker (e.g. speaker A but not speaker B) and shared by speakers (speaker A and B).

set

word count

Donald Trump

1,740 download text •	855
18.1%	49.1%

Joe Biden

2,537 download text •	1,397
25.0%	55.1%

both candidates

15,510 download text •	996
78.4%	6.4%

Hover over fields with • (e.g. 155•) to download the corresponding data file.

Table 5

legend

a	c
b	d

a — total number of words, for a given category (all, stop, non-stop)

b — (a) relative to words in the debate if category=all, otherwise relative to words by the candidate

c — number of unique words with set (a)

d — (c) relative to (a)

bar — proportion of (a-c):c

Table 5

commentary

Word frequency

The word frequency table summarizes the frequency with which words were used. I show the average word frequency and the weighted cumulative frequencies at 50 and 90 percentile. The average word frequency indicates how many times, on average, a word is used. For a given fraction of the entire delivery, the weighted cumulative frequency indicates the largest word frequency within this fraction (details about weighted cumulative distribution).

Table 6a

word use frequency

Average and 50%/90% percentile word frequencies.

speaker

word frequency

all

stop

non-stop

Donald Trump

11.5 download text •	92	648

91.1 download text •	295	733

5.2 download text •	12	111

Joe Biden

9.4 download text •	86	642

90.0 download text •	242	869

4.2 download text •	10	90

total

13.8	163	1,297

170.3	539	1,460

6.1	19	165

Hover over fields with • (e.g. 155•) to download the corresponding data file.

Table 6b

exclusive and shared non-stop word use frequency

Average and 50%/90% cumulative percentile word frequencies. Non-stop words exclusive to speaker (e.g. speaker A but not speaker B) and shared by speakers (speaker A and B).

set

word frequency

Donald Trump

2.04 download text •	3	9

Joe Biden

1.82 download text •	2	7

total

6.09	19	165

Hover over fields with • (e.g. 155•) to download the corresponding data file.

Table 6

legend

a	b	c

a — average word frequency

b — largest word frequency in 50% of content

c — largest word frequency in 90% of content

bar — proportion of a:b:c

Table 6

commentary

All further word use statistics represent content that has been filtered for stop words, unless explicitly indicated.

Part of Speech Analysis

In this section, word frequency is broken down by their part of speech (POS). The four POS groups examined are nouns, verbs, adjectives and adverbs. Conjunctions and prepositions are not considered. The first category (n+v+adj+adv) is composed of all four POS groups.

Part of Speech Count

Table 7

part of speech count

Count of words categorized by part of speech (POS).

part of speech

n+v+adj+adv

nouns (n)

verbs (v)

adjectives (adj)

adverbs (adv)

Donald Trump

8,945 download text •	1,738
38.9%	19.4%

3,890 download text •	975
43.5%	25.1%

3,318 download text •	522
37.1%	15.7%

1,067 download text •	283
11.9%	26.5%

670 download text •	116
7.5%	17.3%

Joe Biden

9,494 download text •	2,272
39.5%	23.9%

4,539 download text •	1,300
47.8%	28.6%

3,346 download text •	723
35.2%	21.6%

1,121 download text •	345
11.8%	30.8%

488 download text •	116
5.1%	23.8%

total

18,439 download text •	3,072
39.2%	16.7%

8,429 download text •	1,796
45.7%	21.3%

6,664 download text •	948
36.1%	14.2%

2,188 download text •	488
11.9%	22.3%

1,158 download text •	180
6.3%	15.5%

Hover over fields with • (e.g. 155•) to download the corresponding data file.

Table 7

legend

a	c
b	d

a — total number of words for a given POS (all, noun, verb, adjective, adverb)

b — (a) relative to all words by candidate

c — unique words in (a)

d — (c) relative to (a)

bar — proportion of (a-c):c

Table 7

commentary

Part of Speech Frequency

Table 8

part of speech frequency

Frequency of words categorized by part of speech (POS).

part of speech frequency

n+v+adj+adv

nouns (n)

verbs (v)

adjectives (adj)

adverbs (adv)

pronouns (pro)

Donald Trump

5.15	12	111

3.99	8	52

6.36	24	349

3.77	7	40

5.78	17	51

81.08	402	802

Joe Biden

4.18	10	90

3.49	7	55

4.63	14	396

3.25	6	51

4.21	11	71

68.55	334	495

total

6.00	18	178

4.69	12	85

7.03	34	745

4.48	10	59

6.43	21	129

132.25	819	1,297

Hover over fields with • (e.g. 155•) to download the corresponding data file.

Table 8

legend

a	b	c

a — average word frequency

b — largest word frequency in 50% of content

c — largest word frequency in 90% of content

bar — proportion of a:b:c

Table 8

commentary

Part of Speech Pairing

Through word pairing, I extract concepts from the text. The number of unique word pairs is a function of sentence length and is one of the measures of complexity.

Table 9a

part of speech pairing — Donald Trump

Word pairs (total and unique) categorized by part of speech (POS)

part of speech pairings - Donald Trump

noun

verb

adjective

adverb

noun

418 download text •	228
	54.5%

verb

175 download text •	139
	79.4%

16 download text •	15
	93.8%

adjective

575 download text •	399
	69.4%

2 download text •	2
	100.0%

33 download text •	27
	81.8%

adverb

9 download text •	9
	100.0%

118 download text •	81
	68.6%

39 download text •	27
	69.2%

23 download text •	14
	60.9%

Hover over fields with • (e.g. 155•) to download the corresponding data file.

Table 9b

part of speech pairing — Joe Biden

Word pairs (total and unique) categorized by part of speech (POS)

part of speech pairings - Joe Biden

noun

verb

adjective

adverb

noun

531 download text •	360
	67.8%

verb

214 download text •	177
	82.7%

29 download text •	28
	96.6%

adjective

604 download text •	447
	74.0%

15 download text •	9
	60.0%

32 download text •	23
	71.9%

adverb

8 download text •	8
	100.0%

84 download text •	71
	84.5%

25 download text •	23
	92.0%

10 download text •	9
	90.0%

Hover over fields with • (e.g. 155•) to download the corresponding data file.

Table 9c

unique part of speech pairing — candidate comparison

Unique word pairs categorized by part of speech (POS)

unique part of speech pairings

noun (n)

verb (v)

adjective (adj)

adverb (adv)

noun

228 download text •	360 download text •
	157.9%

verb

139 download text •	177 download text •
	127.3%

15 download text •	28 download text •
	186.7%

adjective

399 download text •	447 download text •
	112.0%

2 download text •	9 download text •
	450.0%

27 download text •	23 download text •
	85.2%

adverb

9 download text •	8 download text •
	88.9%

81 download text •	71 download text •
	87.7%

27 download text •	23 download text •
	85.2%

14 download text •	9 download text •
	64.3%

Hover over fields with • (e.g. 155•) to download the corresponding data file.

Table 9 a,b

legend

a	c
	d

a — total number of pairs, for a given category (e.g. verb/noun)

c — number of unique pairs within set (a)

d — (c) relative to (a)

bar — proportion of (a–c):c

Table 9c

legend

a	c
	d

a — unique pairs for Donald Trump

c — unique pairs for Joe Biden

d — (c) relative to (a) (i.e. Joe Biden relative to Donald Trump)

bars — (a) and (c)

Table 9

commentary

Detailed Part of Speech Tags

You can really get into the weeds here. Parts of speech are counted more granularly in these tables — nouns and verbs are split into classes and many other word types are shown, such as conjunctions and prepositions.

Table 10a

detailed POS tags — nouns and verbs

Count by part of speech tag: NN (noun, singular), NNP (proper noun, singular), NNPS (proper noun, plural), NNS (noun plural), VB (verb, base form), VBD (verb, past tense), VBG (verb, gerund/present participle), VBN (verb, past participle), VBP (verb, sing. present, non-3d), VBZ (verb, 3rd person sing. present)

Penn Treebank part of speech tag

NNP

NNPS

NNS

VBD

VBG

VBN

VBP

VBZ

Donald Trump

2,208 download text •
9.59%

943 download text •
4.10%

30 download text •
0.13%

919 download text •
3.99%

1,226 download text •
5.33%

998 download text •
4.34%

499 download text •
2.17%

413 download text •
1.79%

1,459 download text •
6.34%

727 download text •
3.16%

Joe Biden

2,853 download text •
11.88%
129.2%

796 download text •
3.31%
84.4%

58 download text •
0.24%
193.3%

987 download text •
4.11%
107.4%

1,425 download text •
5.93%
116.2%

748 download text •
3.11%
74.9%

697 download text •
2.90%
139.7%

444 download text •
1.85%
107.5%

1,157 download text •
4.82%
79.3%

980 download text •
4.08%
134.8%

Hover over fields with • (e.g. 155•) to download the corresponding data file.

Table 10b

detailed POS tags — adjectives, pronouns, adverbs and wh-words

Count by part of speech tag: JJ (adjective), JJR (adjective, comparative), JJS (adjective, superlative), PRP (personal pronoun), PRP$ (possessive pronoun), RB (adverb), RBR (adverb, comparative), RBS (adverb, superlative), WDT (wh-determiner), WP (wh-pronoun), WP$ (possessive wh-pronoun), WRB (wh-abverb)

Penn Treebank part of speech tag

JJR

JJS

PRP

PRP$

RBR

RBS

WDT

WP$

WRB

Donald Trump

1,045 download text •
4.54%

76 download text •
0.33%

68 download text •
0.30%

3,379 download text •
14.68%

247 download text •
1.07%

1,637 download text •
7.11%

34 download text •
0.15%

7 download text •
0.03%

110 download text •
0.48%

188 download text •
0.82%

176 download text •
0.76%

Joe Biden

1,148 download text •
4.78%
109.9%

99 download text •
0.41%
130.3%

35 download text •
0.15%
51.5%

2,535 download text •
10.56%
75.0%

297 download text •
1.24%
120.2%

1,338 download text •
5.57%
81.7%

20 download text •
0.08%
58.8%

4 download text •
0.02%
57.1%

115 download text •
0.48%
104.5%

313 download text •
1.30%
166.5%

224 download text •
0.93%
127.3%

Hover over fields with • (e.g. 155•) to download the corresponding data file.

Table 10c

detailed POS tags — prepositions, conjunctions, determiners and others

Count by part of speech tag: CC (coordinating conjunction), CD (cardinal digit), DT (determiner), EX (existential there), FW (foreign word), IN (preposition/subordinating conjunction), MD (modal), PDT (predeterminer), POS (possessive ending), RP (particle), TO (to), UH (interjection)

Penn Treebank part of speech tag

PDT

POS

Donald Trump

937 download text •
4.07%

369 download text •
1.60%

1,944 download text •
8.45%

52 download text •
0.23%

3 download text •
0.01%

2,058 download text •
8.94%

406 download text •
1.76%

14 download text •
0.06%

41 download text •
0.18%

162 download text •
0.70%

591 download text •
2.57%

48 download text •
0.21%

Joe Biden

800 download text •
3.33%
85.4%

421 download text •
1.75%
114.1%

2,353 download text •
9.80%
121.0%

101 download text •
0.42%
194.2%

1 download text •
0.00%
33.3%

2,514 download text •
10.47%
122.2%

389 download text •
1.62%
95.8%

39 download text •
0.16%
278.6%

51 download text •
0.21%
124.4%

176 download text •
0.73%
108.6%

869 download text •
3.62%
147.0%

29 download text •
0.12%
60.4%

Hover over fields with • (e.g. 155•) to download the corresponding data file.

Table 10

legend

a
b
c

a — total number of words with a given tag

b — (a) relative to all tagged words

c — (a) relative to number of words with this tag used by Donald Trump

bar — proportion of a

Table 10

commentary

Exclusive and Shared Usage

This section enumerates words that were exclusive to a candidate (e.g. used by one candidate but not the other). This content provides insight into what the candidates' priorities are and reveals differences in perspective on similar topics.

For a given part of speech, the table breaks down the number of words that were spoken by only one of the candidates or both candidates (intersection). The last row includes words spoken by either candidate (union).

Table 11

exclusive word usage

Total and unique words used exclusively by a candidate, or by both.

part of speech

n+v+adj+adv

nouns (n)

verbs (v)

adjectives (adj)

adverbs (adv)

Donald Trump

1,641 download text •	800
100.0%	48.8%
8.9%	26.0%

891 download text •	456
54.3%	51.2%
10.6%	25.4%

354 download text •	198
21.6%	55.9%
5.3%	20.9%

286 download text •	124
17.4%	43.4%
13.1%	25.4%

110 download text •	55
6.7%	50.0%
9.5%	30.6%

Joe Biden

2,411 download text •	1,334
100.0%	55.3%
13.1%	43.4%

1,360 download text •	763
56.4%	56.1%
16.1%	42.5%

664 download text •	397
27.5%	59.8%
10.0%	41.9%

297 download text •	181
12.3%	60.9%
13.6%	37.1%

90 download text •	59
3.7%	65.6%
7.8%	32.8%

both candidates

14,387 download text •	938
100.0%	6.5%
78.0%	30.5%

6,005 download text •	479
41.7%	8.0%
71.2%	26.7%

5,509 download text •	297
38.3%	5.4%
82.7%	31.3%

1,508 download text •	140
10.5%	9.3%
68.9%	28.7%

937 download text •	52
6.5%	5.5%
80.9%	28.9%

total

18,439 download text •	3,072
100.0%	16.7%
100.0%	100.0%

8,429 download text •	1,796
45.7%	21.3%
100.0%	100.0%

6,664 download text •	948
36.1%	14.2%
100.0%	100.0%

2,188 download text •	488
11.9%	22.3%
100.0%	100.0%

1,158 download text •	180
6.3%	15.5%
100.0%	100.0%

Hover over fields with • (e.g. 155•) to download the corresponding data file.

Table 11c

legend

a	d
b	e
c	f

a — total number of words in set (e.g. obama \ romney, obama ∩ romney, obama ∪ romney , for a given part of speech

b — (a) relative to all exclusive words in n+v+adj+adv

c — (a) relative to all words in n+v+adj+adv

d — unique words in (a)

e — (d) relative to (a)

f — (d) relative to all unique words in n+v+adj+adv

bar1 — normalized ratio of (a-d):d

bar2 — absolute ratio of (a-d):d for all POS groups (first column) or POS group (other columns)

Table 11

commentary

Pronoun Usage

This section explores pronoun use in detail. Refer to the methods section for details.

Pronoun Count

Fraction of all words that were pronouns.

Table 12a

pronoun fraction

Fraction of words that were pronouns.

speaker

all

pronouns

Donald Trump

23,014 download text •	1,998
100.0%	8.7%

5,189 download text •	64
22.5%	1.2%

Joe Biden

24,016 download text •	2,547
100.0%	10.6%

4,730 download text •	69
19.7%	1.5%

total

47,030 download text •	3,408
100.0%	7.2%

9,919 download text •	75
21.1%	0.8%

Hover over fields with • (e.g. 155•) to download the corresponding data file.

Table 12b

exclusive and shared pronouns

Pronouns exclusive to speaker (e.g. speaker A but not speaker B) and shared by speakers (speaker A and B).

set

word count

Donald Trump

6 download text •	6
0.1%	100.0%

Joe Biden

33 download text •	11
0.3%	33.3%

both candidates

9,880 download text •	58
99.6%	0.6%

Hover over fields with • (e.g. 155•) to download the corresponding data file.

Pronoun by Person, Gender and Count

Pronoun usage by person (1st, 2nd, 3rd), gender (masculine, feminine, neuter) and count (singular, plural).

Table 13a

Pronoun by person

Count of pronouns by first, second or third person.

pronoun person

all

first

second

third

Donald Trump

3,631 download text •	24
100.0%	0.7%

1,485 download text •	10
40.9%	0.7%

698 download text •	2
19.2%	0.3%

1,448 download text •	12
39.9%	0.8%

Joe Biden

2,832 download text •	22
100.0%	0.8%

1,112 download text •	7
39.3%	0.6%

489 download text •	3
17.3%	0.6%

1,231 download text •	12
43.5%	1.0%

Hover over fields with • (e.g. 155•) to download the corresponding data file.

Table 13b

Pronoun by gender

Count of pronouns by masculine, feminine or neuter gender.

pronoun gender

all

masculine

feminine

neuter

Donald Trump

986 download text •	8
100.0%	0.8%

295 download text •	4
29.9%	1.4%

62 download text •	2
6.3%	3.2%

629 download text •	2
63.8%	0.3%

Joe Biden

865 download text •	8
100.0%	0.9%

432 download text •	3
49.9%	0.7%

26 download text •	2
3.0%	7.7%

407 download text •	3
47.1%	0.7%

Hover over fields with • (e.g. 155•) to download the corresponding data file.

Table 13c

Pronoun by number

Count of pronouns by singular or plural.

pronoun number

all

singular

plural

Donald Trump

4,012 download text •	46
100.0%	1.1%

2,765 download text •	28
68.9%	1.0%

1,247 download text •	18
31.1%	1.4%

Joe Biden

3,547 download text •	44
100.0%	1.2%

2,448 download text •	27
69.0%	1.1%

1,099 download text •	17
31.0%	1.5%

Hover over fields with • (e.g. 155•) to download the corresponding data file.

Table 13

legend

a	b
c	d

a — total number of pronouns, by type

b — unique pronouns in (a)

c — (a) as fraction of all pronouns

d — (b) as fraction in (a)

bar — proportion of (a – b):b

Table 13

commentary

First and third person pronouns — a closer look

These tables break pronouns by interesting contrasts. For example, the ratio of singular to plural 1st person pronouns reveals the use of "I/my/myself" vs. "we/our/ours".

Table 14a

1st person pronouns, by count

Count of singular and plural first person pronouns. This table contrasts use of I/my/myself vs. we/our/ours.

pronoun

first

first singular

first plural

Donald Trump

1,485	10
100.0%	0.7%

954	5
64.2%	0.5%

531	5
35.8%	0.9%

Joe Biden

1,112	7
100.0%	0.6%

631	3
56.7%	0.5%

481	4
43.3%	0.8%

Table 14b

3rd person pronouns, by count

Count of singular and plural third person pronouns. This table contrasts he/she/his/her/it vs. they/them/theirs.

pronoun

third

third singular

third plural

Donald Trump

1,448	12
100.0%	0.8%

986	8
68.1%	0.8%

462	4
31.9%	0.9%

Joe Biden

1,231	12
100.0%	1.0%

865	8
70.3%	0.9%

366	4
29.7%	1.1%

Table 14c

Me and you — 1st person singular and second person pronouns

Count of 1st person singular and second person pronouns. This table contrasts me/my/myself vs you/yours/yourself.

pronoun

all

1st singular

2nd

Donald Trump

1,652	7
100.0%	0.4%

954	5
57.7%	0.5%

698	2
42.3%	0.3%

Joe Biden

1,120	6
100.0%	0.5%

631	3
56.3%	0.5%

489	3
43.7%	0.6%

Table 14d

I, me, myself and my — closer look at 1st person singular pronouns

Count of specific 1st person singular pronouns: I, me, myself and my.

pronoun

all

myself

Donald Trump

953
100.0%

802
84.2%

113
11.9%

1
0.1%

37
3.9%

Joe Biden

631
100.0%

495
78.4%

59
9.4%

0
0.0%

77
12.2%

Table 14

legend

a	b
c	d

a — total number of pronouns, by type

b — unique pronouns in (a) (if more than one)

c — (a) as fraction of all pronouns

d — (b) as fraction in (a) (if less than 100%)

bar — proportion of (a – b):b

Table 14

commentary

Pronouns by Category

This table tallies the use of pronoun by category. The categories are personal, demonstrative, indefinite, object, possessive, interrogative, others, relative, reflexive. Note that some pronouns that belong to multiple categories are counted in only one. For a list of pronouns for each category, see the pronoun methods section.

Table 15

Pronouns by cateogry

Count of pronouns by category.

pronoun category

all

personal

demonstrative

indefinite

object

possessive

interrogative

others

relative

reflexive

Donald Trump

5,189
100.0%

3,125
60.2%

560
10.8%

571
11.0%

263
5.1%

234
4.5%

224
4.3%

127
2.4%

83
1.6%

9
0.2%

Joe Biden

4,730
100.0%

2,312
48.9%

713
15.1%

536
11.3%

215
4.5%

293
6.2%

345
7.3%

208
4.4%

98
2.1%

13
0.3%

Table 15

legend

a	b

a — total number of pronouns, by category

b — (a) as fraction of all pronouns

bar — proportion of (a)

Table 15

commentary

Noun Phrase Usage

Noun phrases were extracted from the text and analyzed for frequency, word count, unique word count and richness. Single-word phrases were not counted.

Top-level noun phrases are those without a parent noun phrase (a parent phrase is one that a similar, longer phrase). Derived noun phrases are those with a parent (more details about noun phrase analysis).

The top-level noun phrases can be interpreted as independent concepts. Derived noun phrases can be interpreted as variants on concepts embodied by the top-level phrases.

Noun Phrase Count and length

This table reports the absolute number of noun phrases, which is related to the number of nouns, and their length.

Table 16a

noun phrase count

Counts of noun phrases in words and per noun.

speaker

noun phrase count

all

top-level

Donald Trump

1,031 download text •	371
100.0%	36.0%
0.27	0.38

943 download text •	362
91.5%	38.4%
0.24	0.37

Joe Biden

1,162 download text •	465
100.0%	40.0%
0.26	0.36

1,068 download text •	453
91.9%	42.4%
0.24	0.35

Hover over fields with • (e.g. 155•) to download the corresponding data file.

Table 16b

noun phrase length

Average and 50%/90% cumulative length of noun phrases, in words.

speaker

noun phrase length

all

top-level

Donald Trump

2.13	2	3

2.14	2	3

Joe Biden

2.15	2	3

2.16	2	3

Hover over fields with • (e.g. 155•) to download the corresponding data file.

Table 16a

legend

a	d
b	e
c	f

a — number of noun phrases

b — (a) relative to number of all noun phrases

c — number of noun phrases per noun

d — number of unique phrases

e — (c) relative to (a)

f — number of unique noun phrases per unique noun

bar — normalized ratio of (a–c):c

Table 16b

legend

a	b	c

a — average noun phrase size, in words

b — largest noun phrase size in 50% of content

c — largest noun phrase size in 90% of content

bar — proportion of a:b:c

Table 16

commentary

Exclusive and Shared Noun Phrase Count and length

Table 17a

exclusive and shared noun phrase count

Counts of exclusive and shared noun phrases in words and per noun.

speaker

noun phrase count

all

top-level

Donald Trump

800 download text •	315
36.5%	39.4%

788 download text •	314
98.5%	39.8%

Joe Biden

966 download text •	413
44.0%	42.8%

938 download text •	408
97.1%	43.5%

both candidates

427 download text •	55
19.5%	12.9%

285 download text •	40
66.7%	14.0%

Hover over fields with • (e.g. 155•) to download the corresponding data file.

Table 17b

exclusive and shared noun phrase length

Average and 50%/90% cumulative length of noun phrases, in words.

speaker

noun phrase length

all

top-level

Donald Trump

2.16	2	3

2.16	2	3

Joe Biden

2.18	2	3

2.18	2	3

both candidates

2.02	2	2

2.01	2	2

Hover over fields with • (e.g. 155•) to download the corresponding data file.

Table 17a

legend

a	c
b	d

a — number of noun phrases

b — (a) relative to number of all noun phrases

c — number of unique phrases

d — (c) relative to (a)

bar — normalized ratio of (a–c):c

Table 17b

legend

a	b	c

a — average noun phrase size, in words

b — largest noun phrase size in 50% of content

c — largest noun phrase size in 90% of content

bar — proportion of a:b:c

Table 17

commentary

Windbag Index

The Windbag Index is a compound measure that characterizes the complexity of speech. A low index is indicative of succinct speech with low degree of repetition and large number of independent concepts.

Unlike the Flesch-Kincaid readability metrics, the Windbag Index does not take into account the length of sentences or complexity (e.g. number of syllables) of individual words.

Table 18

windbag index

Windbag Index for each speaker. The higher the value, the more repetitive the speech.

speaker

Windbag Index

index value

index terms

Donald Trump

19,556
+243.9%

0.418	0.192	0.251	0.157	0.265	0.173	0.360	0.976
-1.1%	-18.4%	-12.5%	-27.2%	-13.8%	-27.2%	-10.1%	+0.2%

Joe Biden

5,686
-70.9%

0.423	0.236	0.286	0.216	0.308	0.238	0.400	0.974
+1.2%	+22.5%	+14.3%	+37.3%	+16.0%	+37.3%	+11.2%	-0.2%

Table 18

legend

The Windbag Index is 1/(t1*t2*...*t9) where t1,t2,...,t8 are

t1 — fraction of words that are non-stop

t2 — fraction of non-stop words that are unique

t3 — fraction of nouns that are unique

t4 — fraction of verbs that are unique

t5 — fraction of adjectives that are unique

t6 — fraction of adverbs that are unique

t7 — fraction of noun phrases that are unique

t8 — fraction of noun phrases that are top-level

Large individual terms t1...t9 contribute to a smaller index.

The percentage values below the index and each term are relative differences to the other speaker's corresponding term (i.e. 100*(a-b)/b where a is the value for one speaker and b for the other).

Table 18

commentary

Word Clouds

In the word clouds below, the size of the word is proportional to the number of times it was used by a candidate (method details).

Not all words from a group used to draw the cloud fit in the image — less frequently used words for large word groups may fall outside the image.

All Words for Each Candidate

Each candidate's debate portion was extracted and frequencies were compiled for each part of speech (noun, verb, adjective, adverb), with words colored by their part of speech category.

The distribution of sizes within a tag cloud follows the frequency distribution of words. However, word size cannot be compared between clouds, since the minimum and maximum size of the words is fixed.

Debate Word Cloud for Donald Trump - all words

Size proportional to word frequency. Color encodes part of speech: noun verb adjective adverb

Debate Word Cloud for Joe Biden - all words

Size proportional to word frequency. Color encodes part of speech: noun verb adjective adverb

commentary

Exclusive Words for Each Candidate

The clouds below show words used exlusively by a candidate. For example, if candidate A used the word "invest" (any number of times), but candidate B did not, then the word will appear in the exclusive word tag cloud for candidate A.

Words exclusive to Donald Trump

Size proportional to word frequency. Color encodes part of speech: noun verb adjective adverb

Words exclusive to Joe Biden

Size proportional to word frequency. Color encodes part of speech: noun verb adjective adverb

commentary

Pronouns for Each Candidate

Word clouds based on only pronouns.

Pronouns for Donald Trump

Size proportional to word frequency. Color encodes pronoun type: masculine feminine neuter 1st person 2nd person singular plural other

Pronouns for Joe Biden

Size proportional to word frequency. Color encodes pronoun type: masculine feminine neuter 1st person 2nd person singular plural other

commentary

Part of Speech Word Clouds

In these clouds, words from each major part of speech were colored based on whether they were exclusive to a candidate or shared by the candidates.

The size of the word is relative to the frequency for the candidate — word sizes between candidates should not be used to indicate difference in absolute frequency.