Showing posts with label Release. Show all posts
Showing posts with label Release. Show all posts

Thursday, April 12, 2018

Segmentation - New Zero Width Space (ZWSP) Online Tool - ondra.cf by Danh Hong

Thank Danh Hong who always be with Khmer Unicode solution from font design, OCR... and now segmentation tool: ondra.cf

Danh Hong's Tool for ZWSP


Online tool and even the API available tools are required for bushing more product related in Khmer.
Mostly I use tool from kheng.info as I've been listing them in my list as I can see both tools are great to have in the community and hope for heavy content organization will support them for continuous development.

kheng.info

I've tried out both tools to see the result, there are some points in yellow remark base on the text:
ondra.cf vs kheng.info

Of course, base on above highlight, it would be better when training data is enough but I could see Danh Hong's tool made correctly for numeric data, although requires more data training to correct some concrete words such as country names as example.

Anyway, the tool will help our community growing.

Thanks everyone for hard work and share to us.

Tuesday, May 26, 2015

KhmerOCR Demo App Released on GitHub

First of all, as I have already stated in my GitHub, do not expect this release app, the full OCR system but it's only my demo at the first sight to answer to my research using Support Vector Machine in 2013 and slightly updated on 2014. Thanks for understanding.

Since I do not commit my time to continue on this topic, I would prefer to publish the demo and soon will make up the source code to public as well.

Currently people are working on TesseractOCR and we are waiting for result, of course some result can be found with the OCR Team at khmerocr.org, please try out and support this team if any.

Here if you're still interesting to see, mine, please download from GitHub: KhmerOCR.NET-App