1. The first set of tests: incorrect words present
    1. Table for the words present in the sets
    2. Table for the words uniquely present in each set and in common
    3. Table of the part-of-speech (POS) for the words present in the sets (tokens)
    4. Table of the POS for the words present in the sets (lemmas)
    5. Table of the POS for the words uniquely present in each set and in common (tokens)
    6. Table of the POS for the words uniquely present in each set and in common (lemmas)
  2. The second set of tests: incorrect words removed
    1. Table for the words present in the sets
    2. Table for the words uniquely present in each set and in common
    3. Table of the POS for the words present in the sets (tokens)
    4. Table of the POS for the words present in the sets (lemmas)
    5. Table of the POS for the words uniquely present in each set and in common (tokens)
    6. Table of the POS for the words uniquely present in each set and in common (lemmas)

The first set of tests: incorrect words present

Table of results

Table for the words present in the sets

  War Other
Token 1858 3765
Lemma 1516 2898
Difference 342 867

Table for the words uniquely present in each set and in common

  War Other Common
Token 908 2815 950
Lemma 611 1994 905
Difference 297 821 45

Percentage of common words (tokens) in the total of words (war + other): 16,89%
Percentage of common words (lemmas) in the total of words (war + other): 20,69%

Table of the part-of-speech (POS) for the words present in the sets (tokens)

  War Other
ADJ 429 977
ADP 36 51
ADV 117 184
AUX 47 86
CCONJ 7 13
DET 44 43
NOUN 539 1067
NUM 14 19
PRON 35 63
PROPN 58 145
PUNCT 0 1
SCONJ 3 4
VERB 529 1109
X 0 3
Total 1858 3765

Table of the POS for the words present in the sets (lemmas)

  War Other
ADJ 309 678
ADP 32 49
ADV 112 171
AUX 8 21
CCONJ 7 13
DET 24 29
NOUN 519 972
NUM 15 18
PRON 29 46
PROPN 55 156
PUNCT 0 1
SCONJ 6 5
VERB 400 737
X 0 2
Total 1516 2898

Table of the POS for the words uniquely present in each set and in common (tokens)

  War Other Common
ADJ 239 787 191
ADP 7 21 26
ADV 38 106 75
AUX 12 51 34
CCONJ 0 5 9
DET 7 18 32
NOUN 255 782 293
NUM 3 9 12
PRON 10 21 28
PROPN 29 121 17
PUNCT 0 2 0
SCONJ 0 0 4
VERB 306 882 227
X 2 10 2
Total 908 2815 950

Table of the POS for the words uniquely present in each set and in common (lemmas)

  War Other Common
ADJ 152 516 174
ADP 4 18 25
ADV 39 103 72
AUX 5 14 8
CCONJ 0 5 7
DET 7 14 16
NOUN 201 643 304
NUM 3 9 12
PRON 3 18 23
PROPN 37 126 23
PUNCT 0 3 0
SCONJ 0 0 6
VERB 157 513 235
X 3 12 0
Total 611 1994 905

The second set of tests: incorrect words removed

Table of results

Table for the words present in the sets

  War Other
Token 1813 3603
Lemma 1464 2718
Difference 349 885

Table for the words uniquely present in each set and in common

  War Other Common
Token 866 2656 947
Lemma 563 1817 901
Difference 303 839 46

Percentage of common words (tokens) in the total of words (war + other): 17,49%
Percentage of common words (lemmas) in the total of words (war + other): 21,54%

Table of the POS for the words present in the sets (tokens)

  War Other
ADJ 390 860
ADP 35 41
ADV 108 175
AUX 42 69
CCONJ 7 13
DET 33 49
NOUN 559 1077
NUM 14 23
PRON 36 57
PROPN 46 133
PUNCT 1 0
SCONJ 5 6
VERB 530 1098
X 7 2
Total 1813 3603

Table of the POS for the words present in the sets (lemmas)

  War Other
ADJ 304 598
ADP 31 45
ADV 107 166
AUX 8 20
CCONJ 7 13
DET 19 37
NOUN 502 926
NUM 15 17
PRON 34 41
PROPN 45 127
PUNCT 2 0
SCONJ 4 4
VERB 386 724
X 0 0
Total 1464 2718

Table of the POS for the words uniquely present in each set and in common (tokens)

  War Other Common
ADJ 225 722 186
ADP 4 17 29
ADV 36 104 77
AUX 12 60 35
CCONJ 0 4 7
DET 4 13 32
NOUN 254 759 295
NUM 5 6 13
PRON 10 19 30
PROPN 25 100 20
PUNCT 0 0 1
SCONJ 0 0 4
VERB 289 842 217
X 2 10 1
Total 866 2656 947

Table of the POS for the words uniquely present in each set and in common (lemmas)

  War Other Common
ADJ 126 452 170
ADP 5 17 27
ADV 39 99 72
AUX 5 12 5
CCONJ 0 4 6
DET 4 12 15
NOUN 199 616 307
NUM 2 6 13
PRON 4 14 23
PROPN 30 99 20
PUNCT 0 1 1
SCONJ 0 0 5
VERB 146 475 237
X 3 9 0
Total 563 1816 901