A tagset is a list of part-of-speech tags (POS tags for short), i.e. labels used to indicate the part of speech and sometimes also other grammatical categories (case, tense etc.) of each token in a text corpus.

Czech part-of-speech tagset is available in Czech corpora annotated by Majka or Ajka morphological tagging tools. The tagset was revisited in 2011.

An Example of a tag in the CQL concordance search box[tag="NNS"] finds all nouns in plural, e.g. people, years (note: please make sure that you use straight double quotation marks)

Tagset

k1 – Substantives
x Special paradigm
P půl (half)
g Gender
M Animate masculine
I Inanimate masculine
N Neuter
F Feminine
R Family (surname)
n Number
S Singular
P Plural
c Case
1–7 First–Seventh
w Stylistic flag
A Archaism
B Poeticism
C Only in corpora
E Expressive
H Conversational
K Bookish
O Regional
R Rare
Z Obsolete
z Word Form Type
S -s enclitic
k2 – Adjectives
e Negation
A Affirmation
N Negation
g Gender
M Animate masculine
I Inanimate masculine
N Neuter
F Feminine
n Number
S Singular
P Plural
D Dual
c Case
1–7 First–Seventh
w Stylistic flag
A Archaism
B Poeticism
C Only in corpora
E Expressive
H Conversational
K Bookish
O Regional
R Rare
Z Obsolete
z Word Form Type
S -s enclitic
k3 – Pronomina
x Type (x)
P Personal
O Possessive
D Demonstrative
T Delimitative
y Type (y)
F Reflexive
Q Interrogative
R Relative
N Negative
I Indeterminate
p Person
1 First
2 Second
3 Third
X First, second or third
g Gender
M Animate masculine
I Inanimate masculine
N Neuter
F Feminine
n Number
S Singular
P Plural
D Dual
c Case
1–7 First–Seventh
w Stylistic flag
A Archaism
B Poeticism
C Only in corpora
E Expressive
H Conversational
K Bookish
O Regional
R Rare
Z Obsolete
z Word Form Type
S -s enclitic
k4 – Numerals
x Type (x)
C Cardinal
O Ordinal
R Reproductive
G Grammar
H Grammar
y Type (y)
N Negative
I Indeterminate
g Gender
M Animate masculine
I Inanimate masculine
N Neuter
F Feminine
n Number
S Singular
P Plural
D Dual
c Case
1–7 First–Seventh
w Stylistic flag
A Archaism
B Poeticism
C Only in corpora
E Expressive
H Conversational
K Bookish
O Regional
R Rare
Z Obsolete
t Grammar Terminal
A–F Terminal A–F
I–R Terminal I–R
S Q@
T O@
U L@
V jedno
W sto
X dvě
Y stě
Z tři/čtyři
z Word Form Type
S -s enclitic
k5 – Verbs
e Negation
A Affirmation
N Negation
a Aspect
P Perfect
I Imperfect
B Biaspectual
m Type (Mode)
F Infinitive
I Present indicative
R Imperative
A Active part. (past)
N Passive part.
S Adv. part. (present)
D Adv. part. (past)
B Future indicative
p Person
1 First
2 Second
3 Third
X First, second or third
g Gender
M Animate masculine
I Inanimate masculine
N Neuter
F Feminine
n Number
S Singular
P Plural
D Dual
w Stylistic flag
A Archaism
B Poeticism
C Only in corpora
E Expressive
H Conversational
K Bookish
O Regional
R Rare
Z Obsolete
z Word Form Type
S -s enclitic
k6 – Adverbs
e Negation
A Affirmation
N Negation
x Pron. Adv. Type (x)
D Demonstrative
T Delimitative
M Modal
S Status
y Pron. Adv. Type (y)
Q Interrogative
R Relative
N Negative
I Indeterminate
d Degree
1 Positive
2 Comparative
3 Superlative
w Stylistic flag
A Archaism
B Poeticism
C Only in corpora
E Expressive
H Conversational
K Bookish
O Regional
R Rare
Z Obsolete
z Word Form Type
S -s enclitic
k7 – Preposition
c Case
1 First
2 Second
3 Third
4 Fourth
5 Fifth
6 Sixth
7 Seventh
k8 – Conjunction
x Type
C Coordinate
S Subordinate
z Word Form Type
S -s enclitic
k9 – Particle
z Word Form Type
S -s enclitic
k0 – Interjection
kA – Abbreviation
kY – by, aby, kdyby
m Relation to the Verb Mode
C conditional
p Person
1 First
2 Second
3 Third
n Number
S Singular
P Plural
w Stylistic flag
A Archaism
B Poeticism
C Only in corpora
E Expressive
H Conversational
K Bookish
O Regional
R Rare
Z Obsolete

See: Czech tagset summary


Reference

JAKUBÍČEK, Miloš, Vojtěch KOVÁŘ a Pavel ŠMERK. Czech Morphological Tagset Revisited. In Horák, Rychlý. Proceedings of Recent Advances in Slavonic Natural Language Processing 2011. Brno: Tribun EU, 2011, pp. 29-42, 14 s. ISBN 978-80-263-0077-9.