A tagset is a list of part-of-speech tags (POS tags for short), i.e. labels used to indicate the part of speech and sometimes also other grammatical categories (case, tense etc.) of each token in a text corpus.

Turkish PoS tagset

This Turkish part-of-speech tagset is available in Turkish corpora annotated by the tool TRmorph – a free morphological analyzer for Turkish.

An example of a tag in the CQL concordance search box: [tag="<N[:>].*"] finds all nouns, e.g. çoğu, yapmayı (note: please make sure that you use straight double quotation marks)
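As a small illustration of the note about quotation marks, the following Python sketch wraps a tag pattern from this page in a single-token CQL query and replaces curly double quotes, which word processors often substitute on copy-paste, with straight ones. The helper names tag_query and straighten_quotes are made up for this example and are not part of Sketch Engine or TRmorph.

```python
# Hypothetical helpers for illustration; not part of Sketch Engine or TRmorph.

# Curly double quotes that commonly replace straight ones after copy-paste.
CURLY_DOUBLE_QUOTES = "\u201c\u201d\u201e\u201f"


def straighten_quotes(query: str) -> str:
    """Replace curly double quotes in a query with straight ones."""
    for ch in CURLY_DOUBLE_QUOTES:
        query = query.replace(ch, '"')
    return query


def tag_query(pattern: str) -> str:
    """Wrap a tag pattern in a single-token CQL query on the tag attribute."""
    return f'[tag="{pattern}"]'


noun_query = tag_query("<N[:>].*")

# The same query as it might look after being pasted from a word processor.
pasted = "[tag=\u201c<N[:>].*\u201d]"

print(noun_query)
print(straighten_quotes(pasted) == noun_query)
```

Building the query from the bare pattern keeps the quoting in one place, so every pattern listed in the tables can be passed through the same helper.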

Part-of-speech tagset

Part of speech PoS tag
adjective <Adj[:>].*
adverb <Adv[:>].*
conjunction <Cnj[:>].*
determiner <Det[:>].*
Interjection <Ij[:>].*
noun <N[:>].*
numeral <Num[:>].*
Onomatopoeia <Onom[:>]
postposition <Postp[:>].*
pronoun <Prn[:>].*
Punctuation <Punc>
The words var and yok <exist[:>].*
verb <V[:>].*
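The [:>] in each pattern requires the part-of-speech name to be followed directly by ":" or ">", so that <N does not also match <Num. The Python sketch below checks this with re.fullmatch, on the assumption that a CQL pattern is compared against the whole attribute value. The function name part_of_speech is hypothetical, and the sample tag strings are invented for illustration rather than taken from a real corpus.

```python
import re
from typing import Optional

# A few patterns copied from the part-of-speech table.
POS_PATTERNS = {
    "adjective": r"<Adj[:>].*",
    "noun": r"<N[:>].*",
    "numeral": r"<Num[:>].*",
    "pronoun": r"<Prn[:>].*",
    "verb": r"<V[:>].*",
}


def part_of_speech(tag: str) -> Optional[str]:
    """Return the first part of speech whose pattern matches the whole tag."""
    for name, pattern in POS_PATTERNS.items():
        if re.fullmatch(pattern, tag):
            return name
    return None


# Invented sample tags (assumption: real corpus tags may look different).
for sample in ["<N>", "<N:prop>", "<Num:ara>", "<V><neg>", "<Punc>"]:
    print(sample, part_of_speech(sample))

# Without [:>], a noun pattern would also accept numerals.
print(re.fullmatch(r"<N.*", "<Num:ara>") is not None)
```

The last line shows why the character class matters: the looser pattern <N.* accepts the numeral tag, while <N[:>].* does not.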

Subcategories

Nominal categories

Function surface tag
Possessive (person and number)
First person singular -(I)m
Second person singular -(I)n
Third person singular -(s)I
First person plural -(I)mIz
Second person plural -(I)nIz
Third person plural -lArI
Case
Accusative -(y)I
Dative -(y)A
Ablative -DAn
Locative -DA
Genitive -(n)In
Instrumental/comitative -(y)lA

Nouns

Proper noun
Abbreviation
Abbreviated proper noun

Pronouns

Proper noun
Abbreviation
Abbreviated proper noun

Verbal categories

Suffixes making compound verbs
Express Tag Suffix
ability <abil[:>] -(y)Abil
immediacy <iver[:>] -(y)Iver
habitual <agel[:>] -(y)Agel
repetition/continuity <adur[:>] -(y)Adur
almost <ayaz[:>] -(y)Ayaz
stop/freeze in action <akal[:>] -(y)Akal
somewhat like <agor[:>] -(y)Agör
Other verbal categories
causative <caus[:>]
generalizing modality marker <dir[:>]
negative <neg[:>]
passive <pass[:>]
question <q[:>]
reciprocal <rcp[:>]
reflexive <rfl[:>]

Other verbal categories

causative <V><caus[:>]
copula <V><cpl.*
generalizing modality marker <V><dir[:>]
negative <V><neg[:>]
passive <V><pass[:>]
question <V><q[:>]
reciprocal <V><rcp[:>]
reflexive <V><rfl[:>]

Tense/aspect/modality markers

Express Tag Suffix
evidential past (perfective) <evid[:>]
future <fut[:>] -(y)AcAk
obligative <obl[:>]
imperfective <impf[:>]
imperfective <cont[:>] -(I)yor
past (perfective) <past[:>]
conditional <cond[:>] -sA,-(y)A
optative <opt[:>] -(y)A
imperative <imp[:>]
aorist <aor[:>] -Ar,-Ir,-z,-

Postpositions

adjectival phrase / adverbial phrase

postpositions with ablative complement <Postp:.*:ablC>
postpositions with accusative complement <Postp:.*:accC>
postpositions with dative complement <Postp:.*:datC>
postpositions with genitive complement <Postp:.*:genC>
postpositions with instrumental complement <Postp:.*:insC>
postpositions without complement or more than 1 complement <Postp:.*:nomC>
postpositions with noun phrase suffixed with either -lI or -sIz <Postp:.*:liC>
postpositions with numeric complement <Postp:.*:numC>
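The postposition patterns share one shape: <Postp:, any subtype, then a complement code ending in C>. As a rough sketch, the Python code below generates these patterns from the complement codes in the list above and reports which complement type a given tag indicates. The function names and the sample tags are invented for illustration, and whole-value matching is assumed, as in CQL.

```python
import re
from typing import Optional

# Complement codes from the postposition list, with short descriptions.
COMPLEMENTS = {
    "abl": "ablative",
    "acc": "accusative",
    "dat": "dative",
    "gen": "genitive",
    "ins": "instrumental",
    "nom": "none or more than one",
    "li": "noun phrase with -lI or -sIz",
    "num": "numeric",
}


def complement_pattern(code: str) -> str:
    """Build the tag pattern for one complement code, e.g. 'abl'."""
    return f"<Postp:.*:{code}C>"


def complement_of(tag: str) -> Optional[str]:
    """Return the complement description for a postposition tag, if any."""
    for code, description in COMPLEMENTS.items():
        if re.fullmatch(complement_pattern(code), tag):
            return description
    return None


# Invented sample tags (assumption: the subtype field may differ in practice).
for sample in ["<Postp:adv:ablC>", "<Postp:adj:datC>", "<N>"]:
    print(sample, complement_of(sample))
```

A pattern produced by complement_pattern can be placed inside [tag="..."] to search for postpositions that take one specific complement type.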

Numbers

Arabic numeral <Num:ara[:>]
Roman numeral <Num:rom[:>]
numeral which is the question word <Prn:qst[:>]

Conjunctions

coordinating conjunction <Cnj:coo>
adverbial conjunction <Cnj:adv>
subordinating conjunction <Cnj:sub>
kâh <Cnj:adv:partial>

Determiners

definite determiner <Det:def>
indefinite determiner <Det:indef>

Other

partial word of a multi-word expression .*:partial>.*
reduplication .*:mredup>.*
typo .*:typo>.*

Source: http://coltekin.net/cagri/trmorph/trmorph-manual.pdf

This tagset can be downloaded in Excel format.

Bibliography

Çöltekin, Ç. TRmorph: A morphological analyzer for Turkish.

Turkish text corpora in Sketch Engine

Sketch Engine offers dozens of Turkish corpora.
