O‘zbek tilida pos tegging masalasi:      muammo va takliflar

Ботир Элов; Shahlo Hamroyeva; Oqila Abdullayeva; Mohiyaxon Uzoqova

Authors

Ботир Элов
Shahlo Hamroyeva Tashkent State University of Uzbek Language and Literature named after Alisher Navoi. https://orcid.org/0000-0002-5429-4708
Oqila Abdullayeva Tashkent State University of Uzbek Language and Literature named after Alisher Navoi. https://orcid.org/0000-0002-2524-4832
Mohiyaxon Uzoqova Tashkent State University of Uzbek Language and Literature named after Alisher Navoi. https://orcid.org/0000-0001-7102-0824

Keywords:

Tag, markup, annotation, tagset, NLP, corpus, CLAWS

Abstract

Speaking of a language corpus, the issue of building a linguistic
database becomes the subject of concern because of its complexity
and importance at the same time. The process of assigning appropriate
identifiers to speech fragments in corpus texts is problematic since language
modeling is associated with the rules and patterns of tagging existing in the
language. Tagging, especially grammatical tagging or PoS tagging, is also
a topical issue for Uzbek corpus linguistics. Because a special “encoded”
symbol system serves as the primary key in solving NLP problems related
to the Uzbek language. The article analyzes the studies of tagging and PoS
tagging in world linguistics and considers the current tagging process in
Uzbek linguistics. Based on the rules of the Uzbek language, an alternative

set of tags was proposed taking sets of tags widely used in the world into
consideration.

Evaluation of part of speech tagging in uzbek language: problems and proposals

Authors

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

Most read articles by the same author(s)

Language

Current Issue

Information