A tagset is a list of part-of-speech tags (POS tags for short), i.e. labels used to indicate the part of speech and sometimes also other grammatical categories (case, tense etc.) of each token in a text corpus.
When creating user corpora, the recommended tagset is always preselected. Using a different tagset, if there are more than one for the language, is only recommended for advanced users. Tagsets cannot be normally changed for preloaded corpora.

Since Word Sketches, thesaurus, term extraction and trends make use of POS tagging, their respective settings (e.g. Word Sketch grammar, term grammar) have to be written using the tagset used by the corpus.

Malaysian and Indonesian corpora in Sketch Engine can have these tagsets:

(to check the tagset used, go to Corpus Statistics and details page)

An Example of a tag in the CQL concordance search box[tag=”num” & morph=””] searches for cardinal numerals

TAG DESCRIPTION EXAMPLE EXAMPLE’S ANALYSIS TRANSLATION IN ENGLISH NOTES
adjective lemma besar besar big
noun lemma buku buku book
number lemma tujuh tujuh seven
pronoun kue aku+e I, me, my (depending on the position)
determiner bukunya buku can be ‘his/her book’ or ‘the book’
coordinating conjunction dan dan and
subordinating conjunction ketika ketika when (not the WH question word)
verb lemma ada ada there is
particle lah lah
modal mengerling mengerling ogle
interjection jijik jijik
question word siapa siapa who
preposition lemma di di in/on double preposition: ‘di dalam’ = inside,
‘di atas’ = above, ‘di bawah’ = below

Personal Pronoun

TAG DESCRIPTION EXAMPLE EXAMPLE’S ANALYSIS TRANSLATION IN ENGLISH NOTES
first person kue aku+e I, me, my (depending on the position) Saya is the formal ‘I’. There are plenty of unformal ‘I’ variations
second person anda anda You, your (depending on the position) Anda is the formal ‘You’. There are plenty of unformal ‘You’ variations
third person mereka mereka they, them, their (depending on the position)

Number

TAG DESCRIPTION EXAMPLE EXAMPLE’S ANALYSIS TRANSLATION IN ENGLISH NOTES
singular buku buku book
melompat lompat jump
plural buku-buku buku books this number plurality feature not only occurs in nouns. It is not only show the plurality on number of entity, but also show plurality of the event or referring to plural entities
melompat-lompat lompat jump repeatedly

Numeral Type

TAG DESCRIPTION EXAMPLE EXAMPLE’S ANALYSIS TRANSLATION IN ENGLISH NOTES
cardinal number tujuh tujuh seven
ordinal number ketujuh tujuh the seventh
collective number bertujuh tujuh in group of seven
referential number tujuh-tujuhnya tujuh all the seven of them
ketujuh-tujuhnya tujuh all the seven of them

Verb Type

TAG DESCRIPTION EXAMPLE EXAMPLE’S ANALYSIS TRANSLATION IN ENGLISH NOTES
verb ‘to have’ berpayung payung having umbrella(payung)
verb ‘to be’ perbesar besar making something to be big

Voice Type

TAG DESCRIPTION EXAMPLE EXAMPLE’S ANALYSIS TRANSLATION IN ENGLISH NOTES
active voice mengirim kirim send
passive voice dikirim kirim being sent

Aspect Type

TAG DESCRIPTION EXAMPLE EXAMPLE’S ANALYSIS TRANSLATION IN ENGLISH NOTES
perfective aspect kirim kirim send
imperfective aspect mengirim kirim send

Noun Type

TAG DESCRIPTION EXAMPLE EXAMPLE’S ANALYSIS TRANSLATION IN ENGLISH NOTES
original bare noun type buku buku book
derived abstract type kedinginan dingin coldness
derived act noun pengiriman kirim delivery
pembukuan buku archiving
derived actor noun pengirim kirim deliverer
derived entity noun kiriman kirim package
derived theme noun pertokoan toko something related to shop

Adjective Type

TAG DESCRIPTION EXAMPLE EXAMPLE’S ANALYSIS TRANSLATION IN ENGLISH NOTES
original bare adjective type besar besar big
superlative adjective terbesar besar the biggest
exceed adjective kebesaran besar too big equally grammatical with “terlalu besar”
manner adjective keibuan ibu motherly
union adjective seibu ibu same/one mother
adjectival phrase secepat-cepatnya cepat as quick as possible
sebesar-besarnya besar as big as possible

Clitic Mark

TAG DESCRIPTION EXAMPLE EXAMPLE’S ANALYSIS TRANSLATION IN ENGLISH NOTES
enclitic bukuku buku+aku my book The clitics are pronoun. In this example another form will be “buku aku”, where it also grammatically correct. Sequence of noun followed by pronoun, makes the pronoun becomes possesive pronoun
proclitic kukirim aku+kirim I send The clitics are pronoun. In this example another form will be “aku cari”, where it also grammatically correct. Sequence of pronoun followed by verb mostly makes the pronoun becomes the subject in the sentence

Transitivity Mark

TAG DESCRIPTION EXAMPLE EXAMPLE’S ANALYSIS TRANSLATION IN ENGLISH NOTES
Applicative kirimi kirim send
Causative kirimkan kirim send

Others

TAG DESCRIPTION EXAMPLE EXAMPLE’S ANALYSIS TRANSLATION IN ENGLISH NOTES
capitalized word mark Buku buku Book Capitalized word
possesive bukuku buku+aku my book The clitics are pronoun. In this example another form will be “buku aku”, where it also grammatically correct. Sequence of noun followed by pronoun, makes the pronoun becomes possesive pronoun

Source at: http://wiki.apertium.org/wiki/Indonesian_and_Malaysian/Previous_morphology