From Digitized Literature to Language Corpuses: Significance, Prospects and Challenges

Authors

  • GAYANE HOVHANNISYAN, Brusov State University
  • SRBUHI AYDINYAN, Brusov State University

DOI:

https://doi.org/10.52027/18294685-hga2023.sp

Keywords:

linguistic database, digital heritage

Abstract

Digitizing natural-language texts implies, first of all, systematically storing extra-linguistic data and knowledge in a form suited to natural language processing. Depending on how language is documented, texts are either oral/auditory or written/visual. Because a natural language and its oral and written texts carry, beyond facts, the historical, cultural, value-oriented and linguistic realities of a people, transforming digitized texts into linguistic databases or corpora is invaluable for effective management in support of state development, international integration and security. Armenian is a medium- or low-resource language, so its development in this respect requires substantial human and software resources. The report discusses several dimensions of this problem, including database design, continuous corpus construction, parallel alignment with other languages, and descriptive markup.

Author Biographies

  • GAYANE HOVHANNISYAN, Brusov State University

    Gayane Hovhannisyan is a Doctor of Sciences in Philology and a Professor. She has taught languages for more than 30 years and has extensive international academic and research experience and numerous publications. Her research interests include patterns of language acquisition and their formalization, as well as problems of linguistic thinking and verbal communication. She currently leads a research group at Brusov State University that is creating parallel Armenian-English digital databases.

  • SRBUHI AYDINYAN, Brusov State University

    Srbuhi Aydinyan is a postgraduate student of the Communication and Translation Studies Department of the English Language at Brusov State University. She is a member of the research group working on the topic “Experimental studies of the foundations, criteria and tools for the creation of Armenian-English parallel corpuses.” She has participated in various scientific conferences and seminars in Armenia and abroad. Her research interests include corpus linguistics, corpus creation and natural language processing.


Published

11-11-2023

How to Cite

From Digitized Literature to Language Corpuses: Significance, Prospects and Challenges. (2023). Bulletin of Armenian Libraries, 6(1), 66-70. https://doi.org/10.52027/18294685-hga2023.sp
