A tagset is a list of part-of-speech tags (POS tags for short), i.e. labels used to indicate the part of speech and sometimes also other grammatical categories (case, tense etc.) of each token in a text corpus.

Vietnamese part-of-speech tagset is available in Vietnamese corpora annotated by the vnTagger tool developed by Lê Hồng Phương.

The tagset for the Vietnamese language contains 17 main lexical tags and one tag for an unrecognized token.

An Example of a tag in the CQL concordance search box[tag="N.*"] finds all nouns, e.g. người,
(note: please make sure that you use straight double quotation marks)


 Tag  Description
 Np  Proper noun
 Nc  Classifier
 Nu  Unit noun
 N  Common noun
 V  Verb
 A  Adjective
 P  Pronoun
 R  Adverb
 L  Determiner
 M  Numeral
 E  Preposition
 C  Subordinating conjunction
 CC  Coordinating conjunction
 I  Interjection
 T  Auxiliary, modal words
 Y  Abbreviation
 Z  Bound morphemes
 X  Unknown

Source: https://github.com/hakz/vntagger-gate-plugin.vntagger/blob/master/README.txt