Corpus is very important for the development of the language tools, I found we have an existing opensource hosted project about KhmerText which is mostly provide free/opensource data, the collection of Khmer Corpus.
About the project
Open data for a Khmer language corpus and lexicographic data that can be used for the development of free language tools for Khmer language, such as automatic translators, dictionaries, linguistic analysis tools, etc.
Website:
http://khmertext.sourceforge.net/
Download:
http://sourceforge.net/projects/khmertext/files/
No comments:
Post a Comment