dynamsoft document capture enable users to convert scanned images and pdf files to editable word document. designed to automate information capture from scanned documents or existing pdfs, users are empowered to extract text for purposes of document management, records archive, etc.
following are the highlights of dynamsoft document capture.
input files for ocrdocument scanning: integrated with dynamic web twain, you can scan documents, books, checks and other paper forms from twain/sane/ica scanners;import local files: you can load pdf files and images from your local machine and upload them to dynamsoft’s server for text recognition;output files for ocrconvert to searchable pdfs: on top of regular pdf format, you can also create pdf/a files, which is suitable for archiving and longterm preservation. to further decrease the file size for storage, you can choose pdf with mrc (mixed raster content), which is a compression technology to minimize the size of pdf and pdf/a files.convert image to text: convert existing pdf and other images to editable text, such as rtf, microsoft® word, powerpoint® presentationsconvert to ebooks: to scan a book and save it as a searchable document, please choose “epub” in the output format list. plus, the layout and formatting of books will be retained.convert to excel®: this feature is useful for forms recognition. you will no longer need to retype the tabular data from scans.other featureslanguage list: the website supports english and 119 other western languages as well as arabic, chinese simplified and traditional, japanese and korean (cckj).search text in the ocr result: after text recognition, users are enabled to search certain keywords for redaction or highlight.send documents to the cloud: the ocr results can be exported to the most popular cloud storage services, including box, dropbox, and microsoft onedrive®.