The New Phase of Digitization of the Materials of the National Library of Armenia (OCR)

Authors

  • Naira Meliqbekyan, National Library of Armenia, Head of the Digitization and Publishing Department

DOI:

https://doi.org/10.52027/

Keywords:

digitization, text document, artificial intelligence

Abstract

Optical character recognition (OCR) is the process of converting an image of text into machine-encoded text. For example, when a form or document is scanned, the computer saves the scan as an image file, and a text editor cannot be used to edit, search, or count the words in that file. OCR converts the image into a text document whose content is stored as text data, so the document can be edited and searched. Such data can be used for analysis, optimization, process automation, and productivity improvement. This saves time and reduces errors. We inform our thousands of users that the OCR-processed digital collections of old and new periodicals, and of editions dedicated to the anniversaries of periodicals, are replenished through daily hard work. As of 26.10.2023, the OCR-processed collection (#armocr #ocr) comprises around 93,691 PDFs, or 1,496,939 pages, of newspapers, magazines, abstracts, and NLA publications. These repositories can serve as a unique online library of educational resources.

Published

01-01-2024

How to Cite

The New Phase of Digitization of the Materials of the National Library of Armenia (OCR). (2024). Bulletin of Armenian Libraries, 6(2), 47-51. https://doi.org/10.52027/
