HOHHOT — A corpus of Mongolian language literature containing 120 million words is expected to be completed by 2024, local authorities said Friday.
The Inner Mongolia Academy of Social Sciences said the corpus will contain all traditional Mongolian language literature from between 1225 and 1453, and woodcut literature from modern times.
The contemporary literature will be classified according to subjects, including laws and regulations, literature and art, news and media, medicine, education and military.
It aims to become the largest, most comprehensive and precise online corpus of Mongolian language literature in the world, to protect, develop and make better use of Mongolian language documents.
So far, around 8 million words have been proofread and added.