This is Moldir's term project for Data Science for Linguists 2023.

Primary language: Jupyter Notebook. License: Creative Commons Attribution 4.0 International (CC-BY-4.0).

Kazakh-Russian-Code-Switching-Analysis

A term project by Moldir Baidildinova (mob75@pitt.edu, m.baydildinova@gmail.com).

Carried out from February 12, 2023 through May 3, 2023.

Summary

This term project carries out an explanatory analysis of Kazakh-Russian code-switching (CS) based on a conversational dataset and investigates the structural and syntactic types of CS through linguistic annotation. It focuses on two questions: whether Kazakh-Russian bilingualism is balanced, and whether a language shift toward L2 Russian is under way.

  • You can find my guestbook here

Data

The data used in this project come from the IARPA Babel Program Kazakh language collection, release IARPA-babel302b-v1.0a.

Directory

See the License to understand what you may and may not do with this project.