A tagset is a list of part-of-speech tags (POS tags for short), i.e. labels used to indicate the part of speech and sometimes also other grammatical categories (case, tense etc.) of each token in a text corpus.

Turkish PoS tagset

This Turkish part-of-speech tagset is used in Turkish corpora annotated with TRmorph, a free morphological analyzer for Turkish.

An example of a tag in the CQL concordance search box: [tag="<n[:>].*"] finds all nouns. (Note: make sure that you use straight double quotation marks.)
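The `[:>]` character class in this pattern matters: it requires the tag name to end right after `n`, so that `<n` does not also match tags such as `<num` or `<neg`. The following Python sketch illustrates this with the standard `re` module. It assumes that CQL compares the regular expression against the whole attribute value (hence `re.fullmatch`), and the sample tag strings are hypothetical, chosen only to show the matching behaviour; actual tag values in a given corpus may look different.

```python
import re

# Pattern from the CQL example above: all nouns.
NOUN = r"<n[:>].*"

# Hypothetical tag values, for illustration only.
samples = ["<n>", "<n:prop>", "<n><pl><acc>", "<num:ara>", "<neg>", "<v><past>"]


def matches(pattern: str, tag: str) -> bool:
    """Return True if the pattern matches the whole tag value."""
    return re.fullmatch(pattern, tag) is not None


# Keep only the sample tags that the noun pattern accepts.
noun_hits = [t for t in samples if matches(NOUN, t)]
print(noun_hits)
```

Without the `[:>]` class, a pattern such as `<n.*` would also accept the numeral and negation tags in the sample list.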

Part-of-speech tagset

Part of speech PoS tag
adjective <adj[:>].*
adverb <adv[:>].*
conjunction <cnj[:>].*
determiner <det[:>].*
interjection <ij[:>].*
noun <n[:>].*
numeral <num[:>].*
onomatopoeia <onom[:>]
postposition <postp[:>].*
pronoun <prn[:>].*
punctuation .*
question particle mI .*
symbols of the alphabet .*
the word değil .*
the words var and yok <exist[:>]
verb <v[:>].*
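Since the main part-of-speech patterns all follow the same shape, a query for any of them can be generated from the tag name. The sketch below is one possible way to do that in Python; the `POS_TAGS` mapping and the `cql_for` helper are hypothetical names introduced here for illustration and are not part of Sketch Engine or TRmorph.

```python
# Main part-of-speech tag names, taken from the table above.
POS_TAGS = {
    "adjective": "adj",
    "adverb": "adv",
    "conjunction": "cnj",
    "determiner": "det",
    "interjection": "ij",
    "noun": "n",
    "numeral": "num",
    "postposition": "postp",
    "pronoun": "prn",
    "verb": "v",
}


def cql_for(pos: str) -> str:
    """Build a one-token CQL query for a part of speech (hypothetical helper)."""
    name = POS_TAGS[pos]
    # Straight double quotes are used, as the note above requires.
    return f'[tag="<{name}[:>].*"]'


print(cql_for("noun"))
print(cql_for("verb"))
```

The resulting strings can be pasted into the CQL concordance search box.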

Subcategories

Nominal categories

Function surface tag
Number
First person singular -(I)m
Second person singular -(I)n
Third person singular -(s)I
First person plural -(I)mIz
Second person plural -(I)nIz
Third person plural -lArI
Case
Accusative -(y)I
Dative -(y)A
Ablative -DAn
Locative -DA
Genitive -(n)In
Instrumental/comitative -(y)lA

Nouns

Proper noun
Abbreviation
Abbreviated proper noun

Pronouns

Proper noun
Abbreviation
Abbreviated proper noun

Verbal categories

Suffixes making compound verbs
Express Tag Suffix
ability <abil[:>] -(y)Abil
immediacy <iver[:>] -(y)Iver
habitual <agel[:>] -(y)Agel
repetition/continuity <adur[:>] -(y)Adur
almost <ayaz[:>] -(y)Ayaz
stop/freeze in action <akal[:>] -(y)Akal
somewhat like <agor[:>] -(y)Agör

Other verbal categories
causative <caus[:>]
copula <cpl.*
generalizing modality marker <dir[:>]
negative <neg[:>]
passive <pass[:>]
question <q[:>]
reciprocal <rcp[:>]
reflexive <rfl[:>]

Tense/aspect/modality markers

Express Tag Suffix
evidential past (perfective) <evid[:>] -mIş
future <fut[:>] -(y)AcAk
obligative <obl[:>] -mAlI
imperfective <impf[:>] -mAktA
imperfective <cont[:>] -(I)yor
past (perfective) <past[:>] -DI
conditional <cond[:>] -sA,-(y)A
optative <opt[:>] -(y)A
imperative <imp[:>]
aorist <aor[:>] -Ar,-Ir,-z,-
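The marker patterns in this table describe only a fragment of a tag, so searching for, say, verbs in the future tense needs a pattern that covers the whole tag value. The Python sketch below shows one way to compose such a pattern. It rests on two assumptions that are not stated in the table: that the verb tag and its tense/aspect/modality markers appear together in one tag value, and that the regular expression must match that whole value. The `verb_with` helper and the sample tag strings are hypothetical.

```python
import re


def verb_with(feature: str) -> str:
    """Pattern for a verb tag that also contains the given marker (hypothetical helper)."""
    # ".*" on both sides lets other markers appear before and after the feature.
    return rf"<v[:>].*<{feature}[:>].*"


future_verbs = verb_with("fut")
print(f'[tag="{future_verbs}"]')

# Hypothetical tag values, for illustration only.
samples = ["<v><fut><3s>", "<v><past><3s>", "<v><neg><fut>", "<n><pl>"]
hits = [t for t in samples if re.fullmatch(future_verbs, t)]
print(hits)
```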

Postpositions

adjectival phrase
adverbial phrase
postpositions with ablative complement <postp:.*:ablc>
postpositions with accusative complement <postp:.*:accc>
postpositions with dative complement <postp:.*:datc>
postpositions with genitive complement <postp:.*:genc>
postpositions with instrumental complement <postp:.*:insc>
postpositions without complement or more than 1 complement <postp:.*:nomc>
postpositions with noun phrase suffixed with either -lI or -sIz <postp:.*:lic>
postpositions with numeric complement <postp:.*:numc>
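The postposition patterns differ only in the final complement code, so a tag can be classified by trying each pattern in turn. The following Python sketch does this with the standard `re` module. The `complement_of` helper and the sample tag strings (including the middle `adv` and `adj` parts) are hypothetical and serve only to show how the `.*` in the middle of each pattern behaves.

```python
import re

# Complement codes and their meanings, taken from the table above.
COMPLEMENTS = {
    "ablc": "ablative",
    "accc": "accusative",
    "datc": "dative",
    "genc": "genitive",
    "insc": "instrumental",
    "nomc": "no complement or more than one",
    "lic": "noun phrase with -lI or -sIz",
    "numc": "numeric",
}


def complement_of(tag: str):
    """Return the complement type of a postposition tag, or None (hypothetical helper)."""
    for code, label in COMPLEMENTS.items():
        # Same shape as the patterns in the table: <postp:.*:CODE>
        if re.fullmatch(rf"<postp:.*:{code}>", tag):
            return label
    return None


# Hypothetical tag values, for illustration only.
print(complement_of("<postp:adv:ablc>"))
print(complement_of("<postp:adj:lic>"))
print(complement_of("<n>"))
```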

Numbers

Arabic numeral <num:ara[:>]
Roman numeral <num:rom[:>]
numeral which is the question word <prn:qst[:>]

Conjunctions

coordinating conjunction
adverbial conjunction
subordinating conjunction

Determiners

definite determiner <det:def[:>]
indefinite determiner <det:indef[:>]

Other

partial word of a multi-word expression .*:partial>.*
reduplication .*:mredup>.*
typo .*:typo>.*
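These patterns can be combined with a part-of-speech pattern, for example to keep nouns while skipping tokens marked as typos. In CQL this would correspond to something like `[tag="<n[:>].*" & tag!=".*:typo>.*"]`, assuming the `&` and `!=` operators are available in the search interface being used. The Python sketch below mimics that filter on hypothetical tag strings, again assuming whole-value matching.

```python
import re

NOUN = r"<n[:>].*"   # noun pattern from the part-of-speech table
TYPO = r".*:typo>.*"  # typo pattern from the table above


def is_clean_noun(tag: str) -> bool:
    """True for noun tags that are not marked as typos (hypothetical helper)."""
    return (re.fullmatch(NOUN, tag) is not None
            and re.fullmatch(TYPO, tag) is None)


# Hypothetical tag values, for illustration only.
samples = ["<n><pl>", "<n:typo>", "<v:typo><past>", "<n:prop>"]
clean = [t for t in samples if is_clean_noun(t)]
print(clean)
```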

Source: http://coltekin.net/cagri/trmorph/trmorph-manual.pdf

This tagset can be downloaded in Excel format.

Bibliography

Çöltekin, Ç. TRmorph: A morphological analyzer for Turkish.

Turkish text corpora in Sketch Engine

Sketch Engine offers dozens of Turkish-language corpora.
