Soft Solutions, based in Delhi, reportedly launched
a new software -- Image to OCR Converter, for text recognition in images, pdf and scanned documents.
The OCR -- Optical Character Recognition -- software can read text from bmp, pdf, tiff, jpg, gif, png and 'all major image formats,' company officials said, and 'saves the extracted text in word, doc, pdf, html and text formats with accurate text formatting and spacing.'
One can see the advantages, of course: It avoids retyping of scanned documents by converting the scanned image and pdf files back to text based formats.
Image to OCR Converter recognizes more than 40 different languages, company officials say, adding that images, pdf and scanned documents in any supported language can be converted back to the original language text complete with all language fonts and styles.
Image to OCR Converter recognizes English, German, French, Spanish, Italian, UK English, Swedish, Danish, Norwegian, Dutch, Portuguese, Brazilian, Galician, Icelandic, Greek, Czech, Hungarian, Polish, Romanian, Slovak, Croatian, Serbian, Slovenian, Luxembourg, Finnish, Turkish, Russian, Byelorussian, Ukrainian, Macedonian, Bulgarian, Estonian, Lithuanian, Afrikaans, Albanian, Catalan, Irish Gaelic, Scottish Gaelic, and Basque.
And we're betting you didn't know there was enough of a difference between Irish and Scottish Gaelic to merit a separate language designation.
The software also provides security features such as password protection and watermark to the converted documents to prevent unauthorized viewing and copying or illegal distribution. Image to OCR Converter can automatically detect and correct rotated, skewed and tilted documents, company officials claim: 'Broken text and characters is also reconstructed to provide better accuracy and recognition.'
And believe us, they take their Irish and Scottish Gaelic differences seriously: News.Scotsman.com reported
that 'the search for a Gaelic-speaking boy to star in a Hollywood blockbuster has ended in Ireland after the film's makers failed to find a suitable Scottish youngster.'
The news agency writes 'Confusingly for purists of the language both Irish and Scottish Gaelic is to be spoken in the forthcoming historical drama The Eagle of the Ninth. The decision was attacked by the SNP's Westminster Gaelic spokesman as a 'snub' to the Scottish tongue, the news agency reported, and by the national Gaelic development agency as 'disappointing. ''
David Sims is a contributing editor for TMCnet. To read more of David’s articles, please visit his columnist page. He also blogs for TMCnet here.
Edited by Amy Tierney