A Java/.NET GUI frontend for the Tesseract OCR engine. Supports optical character recognition for the Vietnamese language. Released and distributed under the Apache License, v2.0.
Features:
* Multiplatform (Java version only)
  o Windows
  o Solaris
  o Linux/Unix
  o Mac OS X
  o Others
* PDF, TIFF, JPEG, GIF, PNG, BMP image formats
* Multipage TIFF images
* Selection box
* File drag-and-drop
* Paste image from clipboard
* Postprocessing for Vietnamese to boost accuracy rate
* Vietnamese input methods
* Localized user interface
* Integrated scanning support (on Windows only)
* Watch folder monitor for support of batch processing
* Custom text replacement in postprocessing