Research Article Open Access

UzNER: A Human-Reviewed Benchmark for Uzbek Named Entity Recognition With Gazetteer-Augmented Transformer Models

Bobur Saidov1,2, Vladimir Barakhnin1,3, Zarnigor Fayzullaeva4, Umid Ibragimov1 and Ulugbek Tursunov1
  • 1 Faculty of Mechanics and Mathematics, Novosibirsk State University, Novosibirsk, Russia
  • 2 Faculty of Computer Engineering, Urgench State University, Urgench, Uzbekistan
  • 3 Federal Research Center for Information and Computational Technologies, Novosibirsk, Russia
  • 4 Faculty of Software Engineering, University of Information Technologies, Tashkent, Uzbekistan

Abstract

UzNER-100K is a large-scale human-reviewed benchmark for Uzbek named entity recognition with 100,000 training sentences, 18 fine-grained entity types and 200,083 entity mentions across 114,269 sentences in total. The corpus was constructed through an LLM-assisted, expert-reviewed annotation pipeline that achieved strong reliability on the main audit subset while substantially reducing corpus-construction effort. The benchmark includes a standard test split, a gold-audited subset and a hard subset designed to stress long, ambiguous and structurally complex cases. We evaluate 10 Uzbek NER systems spanning recurrent, monolingual Uzbek, multilingual transformer and hybrid architectures. The best model, XLM-R + Gazetteer + CRF, reaches 91.03 Micro-F1 on the standard test set, 89.67 on the gold-audited subset and 83.21 on the hard subset. Quality control included a dedicated inter-annotator agreement audit, achieving 91.3% span-level agreement, 93.7% entity-type agreement, and a Cohen’s Kappa of 0.914. In addition, a qualitative native-speaker assessment confirmed the linguistic naturalness of the model outputs while highlighting remaining challenges in legal, administrative, and event-related expressions.

Journal of Computer Science
Volume 22 No. 6, 2026, 1894-1911

DOI: https://doi.org/10.3844/jcssp.2026.1894.1911

Submitted On: 21 March 2026 Published On: 22 June 2026

How to Cite: Saidov, B., Barakhnin, V., Fayzullaeva, Z., Ibragimov, U. & Tursunov, U. (2026). UzNER: A Human-Reviewed Benchmark for Uzbek Named Entity Recognition With Gazetteer-Augmented Transformer Models. Journal of Computer Science, 22(6), 1894-1911. https://doi.org/10.3844/jcssp.2026.1894.1911

  • 42 Views
  • 13 Downloads
  • 0 Citations

Download

Keywords

  • Uzbek NER
  • Low-Resource NLP
  • Benchmark Dataset
  • Multilingual Transformers
  • Gazetteer-Enhanced Decoding