On August 5, Twigfarm (CEO Sunho Baek), a deep-learning NLP company, was awarded the “Data Construction Project for Machine Learning”, a project led by the Ministry of Science and ICT and the National Information Society Agency (NIA). Twigfarm will undertake two sub-categories: the “Korean-English Colloquial and Technical Science Translation Corpus” and the “Broadcasting and Specialized Multilingual Translation Corpus”.
The project is led by Twigfarm and Activo, with professional translation companies such as Lexcode, Furmo DT, and Eqqui Korea also taking part. Professors and students of Hankuk University of Foreign Studies and Chung-Ang University, both of which have excellent translation and interpretation training programs, will translate the data. Throughout the project, the data will be thoroughly inspected by the Korean Standards Association.
For this project, Twigfarm plans to refine text data useful for training, construct data sets that pair each original text with its translation, and build labeled data that can be used for machine learning. In this way, Twigfarm aims to build 10 million sentences of artificial intelligence training data by the end of this year.
Sunho Baek, CEO of Twigfarm, said, “Data preprocessing has been largely automated, but much of it is still manual work. Together with F&J, we plan to provide education and jobs to socially disadvantaged people and to women whose careers have been interrupted.” F&J is a social enterprise.
CEO Baek added, “The lack of data makes it difficult to research and develop neural network-based machine translators. Our goal is to improve the research environment for Korean machine translation.”
[Editor : Hyun-jin Park]