Digital Armenian
(Coordinator: Marat Yavrumyan)
Virtual (digital) reality and “intelligent technologies” that use elements of artificial intelligence are an integral part of our lives today. Under these conditions, the primary task is to ensure that Armenian functions as a language capable of serving virtual (digital) reality, the digital economy, digital public services, digital culture and everyday life, as well as of creating digital reality.
It should be noted that, despite individual projects, Armenian today does not keep pace with current trends in language technology. This leaves the Armenian language, and the cultural heritage created in it, unprepared for modern challenges, with all the ensuing consequences.
To close the gap that has built up over past years and create the necessary preconditions for short-term development, the following four directions of action are proposed:
- Infrastructure building and development
- System-building projects:
  - The “Armenian Treebank” project, which integrates Armenian into the global systems for machine processing of languages that use elements of artificial intelligence (Universal Dependencies, Stanford NLP, Spacy.io, etc.);
  - A national “Ngram” system based on the digital program of the National Library. The system makes it possible to visualize the reality reflected in texts through the chronology, frequency and context of use of word pairs, etc.;
  - Open Text-to-Speech (TTS) and Speech-to-Text (STT) systems (e.g. on Mozilla Foundation tech platforms);
  - Dictionaries of both modern and professional Armenian vocabulary in a digital Wiki environment.
- State (interstate) language policy
- Educational resources
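
As an illustration of the kind of counting that the “Ngram” system mentioned above would rest on, the following minimal Python sketch tallies word pairs per year of publication. It is only a sketch under stated assumptions: the function name `bigram_counts_by_year`, the `(year, text)` input format and the sample texts are hypothetical, and a real system would need proper Armenian tokenization rather than splitting on whitespace.

```python
from collections import Counter, defaultdict


def bigram_counts_by_year(documents):
    """Count adjacent word pairs per year.

    `documents` is an iterable of (year, text) pairs (a hypothetical
    input format chosen for this example).
    Returns a dict mapping each year to a Counter of word pairs.
    """
    counts = defaultdict(Counter)
    for year, text in documents:
        # Naive whitespace tokenization; an assumption made for brevity.
        words = text.lower().split()
        # Pair each word with its successor to form the word pairs.
        counts[year].update(zip(words, words[1:]))
    return dict(counts)


# Invented sample data, for illustration only.
sample = [
    (1990, "հայոց լեզու հայոց պատմություն"),
    (2020, "հայոց լեզու թվային հայոց լեզու"),
]
counts = bigram_counts_by_year(sample)

# Frequency of one word pair across the years present in the sample.
pair = ("հայոց", "լեզու")
timeline = {year: counts[year][pair] for year in sorted(counts)}
```

Plotting a mapping such as `timeline` against the year gives the chronological frequency view; the context of use would additionally require storing the surrounding words or sentence for each occurrence.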