Summary

UD_Kyrgyz-KTMU is dependency parsing based treebank in Kyrgyz language. Sentences were selected partly from Kyrgyz story and novel books, partly from Kyrgyz news websites.

Introduction

The treebank consists of 781 sentences (7.4K tokens) for now and its domain is mainly news headlines. Kyrgyz UD treebank follows the Universal Dependencies (UD) annotation standard.

Acknowledgments

We would like to thank all the people who contributed to this corpus: Assoc.Prof.Dr. Bakit Sharshembaev

References

An academic paper describing this resource is pending, for the time being please use the repository URL to cite this dataset.

Changelog

  • 2023-05-15 v2.12
    • Initial release in Universal Dependencies.
=== Machine-readable metadata (DO NOT REMOVE!) ================================
Data available since: UD v2.12
License: CC BY-SA 4.0
Includes text: yes
Genre: news fiction
Lemmas: manual native
UPOS: manual native
XPOS: manual native
Features: manual native
Relations: manual native
Contributors: Benli, İbrahim
Contributing: here
Contact: ibrahimbenli@hotmail.com
===============================================================================