- The first set of tests: incorrect words present
- Table for the words present in the sets
- Table for the words uniquely present in each set and in common
- Table of the part-of-speech (POS) for the words present in the sets (tokens)
- Table of the POS for the words present in the sets (lemmas)
- Table of the POS for the words uniquely present in each set and in common (tokens)
- Table of the POS for the words uniquely present in each set and in common (lemmas)
- The second set of tests: incorrect words removed
- Table for the words present in the sets
- Table for the words uniquely present in each set and in common
- Table of the POS for the words present in the sets (tokens)
- Table of the POS for the words present in the sets (lemmas)
- Table of the POS for the words uniquely present in each set and in common (tokens)
- Table of the POS for the words uniquely present in each set and in common (lemmas)
The first set of tests: incorrect words present
Table of results
Table for the words present in the sets
|
War |
Other |
Token |
1858 |
3765 |
Lemma |
1516 |
2898 |
Difference |
342 |
867 |
Table for the words uniquely present in each set and in common
|
War |
Other |
Common |
Token |
908 |
2815 |
950 |
Lemma |
611 |
1994 |
905 |
Difference |
297 |
821 |
45 |
Percentage of common words (tokens) in the total of words (war + other): 16,89%
Percentage of common words (lemmas) in the total of words (war + other): 20,69%
Table of the part-of-speech (POS) for the words present in the sets (tokens)
|
War |
Other |
ADJ |
429 |
977 |
ADP |
36 |
51 |
ADV |
117 |
184 |
AUX |
47 |
86 |
CCONJ |
7 |
13 |
DET |
44 |
43 |
NOUN |
539 |
1067 |
NUM |
14 |
19 |
PRON |
35 |
63 |
PROPN |
58 |
145 |
PUNCT |
0 |
1 |
SCONJ |
3 |
4 |
VERB |
529 |
1109 |
X |
0 |
3 |
Total |
1858 |
3765 |
Table of the POS for the words present in the sets (lemmas)
|
War |
Other |
ADJ |
309 |
678 |
ADP |
32 |
49 |
ADV |
112 |
171 |
AUX |
8 |
21 |
CCONJ |
7 |
13 |
DET |
24 |
29 |
NOUN |
519 |
972 |
NUM |
15 |
18 |
PRON |
29 |
46 |
PROPN |
55 |
156 |
PUNCT |
0 |
1 |
SCONJ |
6 |
5 |
VERB |
400 |
737 |
X |
0 |
2 |
Total |
1516 |
2898 |
Table of the POS for the words uniquely present in each set and in common (tokens)
|
War |
Other |
Common |
ADJ |
239 |
787 |
191 |
ADP |
7 |
21 |
26 |
ADV |
38 |
106 |
75 |
AUX |
12 |
51 |
34 |
CCONJ |
0 |
5 |
9 |
DET |
7 |
18 |
32 |
NOUN |
255 |
782 |
293 |
NUM |
3 |
9 |
12 |
PRON |
10 |
21 |
28 |
PROPN |
29 |
121 |
17 |
PUNCT |
0 |
2 |
0 |
SCONJ |
0 |
0 |
4 |
VERB |
306 |
882 |
227 |
X |
2 |
10 |
2 |
Total |
908 |
2815 |
950 |
Table of the POS for the words uniquely present in each set and in common (lemmas)
|
War |
Other |
Common |
ADJ |
152 |
516 |
174 |
ADP |
4 |
18 |
25 |
ADV |
39 |
103 |
72 |
AUX |
5 |
14 |
8 |
CCONJ |
0 |
5 |
7 |
DET |
7 |
14 |
16 |
NOUN |
201 |
643 |
304 |
NUM |
3 |
9 |
12 |
PRON |
3 |
18 |
23 |
PROPN |
37 |
126 |
23 |
PUNCT |
0 |
3 |
0 |
SCONJ |
0 |
0 |
6 |
VERB |
157 |
513 |
235 |
X |
3 |
12 |
0 |
Total |
611 |
1994 |
905 |
The second set of tests: incorrect words removed
Table of results
Table for the words present in the sets
|
War |
Other |
Token |
1813 |
3603 |
Lemma |
1464 |
2718 |
Difference |
349 |
885 |
Table for the words uniquely present in each set and in common
|
War |
Other |
Common |
Token |
866 |
2656 |
947 |
Lemma |
563 |
1817 |
901 |
Difference |
303 |
839 |
46 |
Percentage of common words (tokens) in the total of words (war + other): 17,49%
Percentage of common words (lemmas) in the total of words (war + other): 21,54%
Table of the POS for the words present in the sets (tokens)
|
War |
Other |
ADJ |
390 |
860 |
ADP |
35 |
41 |
ADV |
108 |
175 |
AUX |
42 |
69 |
CCONJ |
7 |
13 |
DET |
33 |
49 |
NOUN |
559 |
1077 |
NUM |
14 |
23 |
PRON |
36 |
57 |
PROPN |
46 |
133 |
PUNCT |
1 |
0 |
SCONJ |
5 |
6 |
VERB |
530 |
1098 |
X |
7 |
2 |
Total |
1813 |
3603 |
Table of the POS for the words present in the sets (lemmas)
|
War |
Other |
ADJ |
304 |
598 |
ADP |
31 |
45 |
ADV |
107 |
166 |
AUX |
8 |
20 |
CCONJ |
7 |
13 |
DET |
19 |
37 |
NOUN |
502 |
926 |
NUM |
15 |
17 |
PRON |
34 |
41 |
PROPN |
45 |
127 |
PUNCT |
2 |
0 |
SCONJ |
4 |
4 |
VERB |
386 |
724 |
X |
0 |
0 |
Total |
1464 |
2718 |
Table of the POS for the words uniquely present in each set and in common (tokens)
|
War |
Other |
Common |
ADJ |
225 |
722 |
186 |
ADP |
4 |
17 |
29 |
ADV |
36 |
104 |
77 |
AUX |
12 |
60 |
35 |
CCONJ |
0 |
4 |
7 |
DET |
4 |
13 |
32 |
NOUN |
254 |
759 |
295 |
NUM |
5 |
6 |
13 |
PRON |
10 |
19 |
30 |
PROPN |
25 |
100 |
20 |
PUNCT |
0 |
0 |
1 |
SCONJ |
0 |
0 |
4 |
VERB |
289 |
842 |
217 |
X |
2 |
10 |
1 |
Total |
866 |
2656 |
947 |
Table of the POS for the words uniquely present in each set and in common (lemmas)
|
War |
Other |
Common |
ADJ |
126 |
452 |
170 |
ADP |
5 |
17 |
27 |
ADV |
39 |
99 |
72 |
AUX |
5 |
12 |
5 |
CCONJ |
0 |
4 |
6 |
DET |
4 |
12 |
15 |
NOUN |
199 |
616 |
307 |
NUM |
2 |
6 |
13 |
PRON |
4 |
14 |
23 |
PROPN |
30 |
99 |
20 |
PUNCT |
0 |
1 |
1 |
SCONJ |
0 |
0 |
5 |
VERB |
146 |
475 |
237 |
X |
3 |
9 |
0 |
Total |
563 |
1816 |
901 |