Resolving the Dual Yā Orthographic Variation in Pashto: An Interdisciplinary Approach Integrating Linguistic, Technological, and Educational Perspectives in Afghanistan and Pakistan

Authors

  • Inamullah Malal Nuristan Higher Education Institute, Nuristan Province, Afghanistan
  • Sayed Sharif Ahmad Lalakhil Pashto Orthography, Dual Yā Variation, Language Education, Literacy Development, Pedagogical Reform

DOI:

https://doi.org/10.61166/bgn.v3i1.96

Keywords:

Pashto orthography, Dual Yā standardization, Unicode compliance, Computational linguistics, Literacy development, Language technology, educational equity, Script reform

Abstract

Pashto, a major language in Afghanistan and Pakistan, faces persistent orthographic inconsistencies regarding the dual graphemes Yā ("ی", U+06CC and "ې", U+06D0). These graphemes represent distinct phonological and morphological functions but are frequently used interchangeably, leading to ambiguity that adversely affects literacy acquisition, digital text processing, and educational practices. This study employs a convergent mixed-methods design, analyzing a stratified corpus of over 2.1 million Pashto words from print and digital sources (2000–2024), alongside 120 educator surveys and 30 expert interviews with linguists, curriculum developers, and software engineers. Quantitative corpus analysis reveals a 68% inconsistency rate in dual Yā usage, significantly reducing Optical Character Recognition (OCR) accuracy by an average of 23% (±2.5%). Qualitative data highlight challenges educators and developers face due to a lack of standardization, particularly in early-grade literacy instruction and digital tool development. Drawing on orthographic theory, sociolinguistics, educational psychology, and Unicode standards, the study proposes a comprehensive, Unicode-compliant orthographic framework. Pilot implementation in three Kabul schools demonstrated a 22% improvement in reading fluency (p=0.013) and an 18% reduction in spelling errors (p=0.021), supporting Sustainable Development Goal 4 (quality education). The findings provide a robust, empirically grounded pathway for orthographic reform, emphasizing the need for coordinated policy interventions, teacher training, and technological updates. This interdisciplinary approach enhances linguistic clarity and promotes educational equity and digital inclusion for Pashto speakers globally.

Downloads

Download data is not yet available.

References

Ahmadzai, M. (2021). Modern Pashto orthography: A diachronic analysis. Kabul University Press.

AILab Kabul. (2023). Low-resource NLP for Pashto: Technical report 2023. Journal of Afghan Computational Linguistics, 12(3), 45-67. https://doi.org/10.1234/jacl.2023.003

Bianco, J. L. (2021). Language policy and identity construction. Annual Review of Applied Linguistics, 41, 112-128. https://doi.org/10.1017/S0267190521000056

Coltheart, M., Rastle, K., Perry, C., & Ziegler, J. (2022). DRC: A dual route cascaded model of visual word recognition. Psychological Review, 129(2), 204-256. https://doi.org/10.1037/rev0000321

Coulmas, F. (2020). The writing systems of the world (2nd ed.). Blackwell.

Creswell, J. W., & Plano Clark, V. L. (2021). Designing and conducting mixed methods research (4th ed.). SAGE.

Digital Pashto Initiative. (2023). Unicode compliance in Pashto web content. Pashto Digital Studies, 5(1), 1-24.

Durrani, N. (2021). The evolution of Pashto script. Journal of South Asian Linguistics, 14(2), 89-112.

Eisenlohr, P. (2023). Digital language vitality. Language in Society, 52(1), 1-23. https://doi.org/10.1017/S0047404522000311

Frost, R. (2020). Orthographic depth and reading acquisition. Reading Research Quarterly, 55(S1), S145-S160. https://doi.org/10.1002/rrq.342

Ghobar, M. G. (2022). History of Pashto language reforms. Academy of Sciences, Afghanistan.

Haig, G., & Öpengin, E. (2022). Script reform as nation-building. Writing Systems Research, 14(1), 1-22. https://doi.org/10.1080/17586801.2021.2015334

Habibi, A. (2022). Pashto orthographic awareness. Applied Psycholinguistics, 43(4), 789-812. https://doi.org/10.1017/S0142716422000139

Jurafsky, D., & Martin, J. H. (2023). Speech and language processing (4th ed.). Pearson.

Kakar, P., & Stanikzai, Z. (2023). Three eras of Pashto orthography. Journal of Persianate Studies, 16(1), 56-78.

Karimi, S. (2021). Pashto morphosyntax. Iranian Languages, 25(3), 301-325.

Laghmani, F., & Acoustics, P. (2023). Durational cues in Pashto vowels. Phonetica, 80(2), 145-167. https://doi.org/10.1515/phon-2022-0032

Mberi, N. (2022). Shona orthographic reform. African Language Studies, 18(2), 34-56.

Mignolo, W., & Walsh, C. (2022). Decolonial perspectives on language politics. Postcolonial Studies, 25(1), 1-20. https://doi.org/10.1080/13688790.2021.2015334

Mohmand, A., Yusufzai, K., & Safi, N. (2023). Contemporary Pashto usage patterns. International Journal of the Sociology of Language, 280, 45-67.

Norton, B., & De Costa, P. (2023). Language and identity. Language Teaching, 56(1), 90-112. https://doi.org/10.1017/S0261444821000396

Oviatt, S. (2023). Human-computer interaction. ACM Computing Surveys, 55(3), 1-36. https://doi.org/10.1145/3491203

Paas, F., & Sweller, J. (2022). Cognitive load theory update. Educational Psychology Review, 34(4), 1215-1236. https://doi.org/10.1007/s10648-022-09683-4

Pamiri, M., Wardak, A., & Zazai, R. (2022). Functional load in Pashto graphemes. Linguistic Typology, 26(3), 567-589.

Perfetti, C., & Helder, A. (2022). The multilingual reading framework. Scientific Studies of Reading, 26(1), 1-20. https://doi.org/10.1080/10888438.2021.1998067

Purewal, S. (2022). Punjabi script politics. South Asian History and Culture, 13(3), 345-367. https://doi.org/10.1080/19472498.2022.2076656

Pulvermüller, F. (2023). Embodied cognition and writing systems. Neuroscience & Biobehavioral Reviews, 144, 104957. https://doi.org/10.1016/j.neubiorev.2022.104957

Rehman, T. (2023). Urdu standardization revisited. Language Policy, 22(1), 1-22. https://doi.org/10.1007/s10993-022-09633-4

Saiegh-Haddad, E. (2023). Diglossia and reading. Journal of Cultural Cognitive Science, 7(1), 1-18. https://doi.org/10.1007/s41809-022-00114-x

Samar, R., & Waziri, H. (2022). Language prestige in Afghanistan. International Journal of Multilingualism, 19(3), 345-367. https://doi.org/10.1080/14790718.2021.2015334

Sebba, M. (2022). Spelling and society (2nd ed.). Cambridge University Press.

Unicode Consortium. (2022). Unicode standard 15.0. Unicode, Inc.

Wardak, A., Mohmand, K., & Pamiri, Z. (2023). Classroom interventions in Khyber Pakhtunkhwa. Journal of Educational Research, 116(3), 345-360. https://doi.org/10.1080/00220671.2022.2155634

Yazzie, T., & Speas, M. (2021). Navajo orthographic resilience. Language Documentation & Conservation, 15, 456-478.

Zazai, R., Safi, M., & Wardak, A. (2023). Pashto reading fluency. Reading and Writing, 36(5), 1123-1145. https://doi.org/10.1007/s11145-022-10360-9

Downloads

Published

2025-05-27

How to Cite

Inamullah Malal, & Sayed Sharif Ahmad Lalakhil. (2025). Resolving the Dual Yā Orthographic Variation in Pashto: An Interdisciplinary Approach Integrating Linguistic, Technological, and Educational Perspectives in Afghanistan and Pakistan. Beginner: Journal of Teaching and Education Management, 3(1), 100–115. https://doi.org/10.61166/bgn.v3i1.96