A tagset is a list of part-of-speech tags (POS tags for short), i.e. labels used to indicate the part of speech and sometimes also other grammatical categories (case, tense etc.) of each token in a text corpus.

Korean HanNanum part-of-speech tagset simplified is available in Korean corpora annotated by the Hannanum tool which is a morphological analyzer and a POS tagger

HanNanum – a Korean Morphological Analyzer and POS Tagger – is used for tagging Korean texts. The 22 tags present configuration of HanNanum demo java class is used to assign tags to words. Some own pre/post processing scripts were used to correct/improve lemmatization, add sentence markup, add English tag names.

An Example of a tag in the CQL concordance search box: [tag="V"] searches all verbs, e.g. 하는 (note: please make sure that you use straight double quotation marks)

Tagset

ORIGINAL SIMPLIFIED (in Sketch Engine) DESCRIPTION Example
NOMINALS
NC N noun
NQ NP proper noun
NB N noun
NP Pron pronoun
NN Num numeral
PREDICATES
PV V verb
PA Adj adjective
PX VAux auxiliary verb
MODIFIERS
MM Det determiner
MA Adv adverb
INTERJECTION
II Interj interjection
RELATIONAL SUFFIX
JC Suff
JX Suff
JP Suff
ENDING
EP Suff
EC Suff
ET Suff
EF Suff
AFFIXES
XP Pref 접두사
XS Suff
SYMBOLS
SP Sym 쉼표
SL Sym 여는 따옴표 및 묶음표
SD Sym 이음표
SU Sym 단위 기호
SF Sym 마침표
SR Sym 닫는 따옴표 및 묶음표
SE Sym 줄임표
SY Sym 기타 기호
FOREIGN WORDS
F X

Source: https://www.sketchengine.co.uk/wp-content/uploads/Original-HanNanum-manual.pdf

Korean text corpora in Sketch Engine

Sketch Engine offers dozens Korean language corpora.

or

Other text corpora in Sketch Engine

Sketch Engine offers 350+ language corpora.

Use Sketch Engine in minutes

Generating collocations, frequency lists, examples in contexts, n-grams or extracting terms is easy with Sketch Engine. Use our Quick Start Guide to learn it in minutes.