What software would you recommend for image enhancement prior to OCR (optical character recognition)?

We are researching ways to enhance image quality before submitting images to OCR. The OCR engine we currently use is the Scansoft API from Nuance (v15). We looked into Lead Tools but have since decided to look elsewhere, because the licensing costs associated with Lead Tools are just too high. To start with, we are looking for simple image enhancement features such as deskewing, despeckling, line removal, punch hole removal, sharpening, etc. We run a mix of .NET and Java software, but a Java solution would be preferred.

7 Solutions

#1

Kofax is good for pre-processing, but it may be overkill for the kind of cleanup you describe unless the images are really bad. Unless your specialty is image processing, I'd recommend working with a provider that handles both the image cleanup and the OCR, so you can focus on the value you actually add.

We license the OCR development kit from ABBYY (the ABBYY SDK) and have found it to be superb for both image processing and OCR. The API is quite extensive, and the sample apps, help, and support have been beyond impressive. I definitely recommend taking a look.

#2

Disclaimer: I work for Atalasoft

We have those functions and run-time royalty-free licensing for .NET.

http://www.atalasoft.com/products/dotimage/

We also have OCR components, including .NET wrappers for Abbyy, Tesseract, and others, as well as searchable PDF generation (image on top of text in a PDF).

#3

Not sure if this would be quite up to the standards that you guys would need, but perhaps you should look at some of the Paint.Net APIs. I don't know how easy it would be to extract their image processing algorithms for use in your project, but I believe they do some of the things you are looking for. Plus it is an open source project with an MIT License, so it should be pretty friendly for business use.

#4

Look into Kofax VRS at KOFAX.com.

#5

Maybe JMagick. It is an open source Java interface to ImageMagick, implemented as a thin Java Native Interface (JNI) layer over the ImageMagick API. It's licensed under the LGPL, so it shouldn't be a problem license-wise.

http://sourceforge.net/projects/jmagick/
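
For the simplest operations on the list, plain JDK classes may be enough to experiment with before committing to a native library. Below is a minimal sketch of despeckling with a 3x3 median filter. It assumes 8-bit grayscale input (`BufferedImage.TYPE_BYTE_GRAY`), and the class and method names (`PreOcrFilters`, `despeckle`) are hypothetical; they are not part of JMagick or ImageMagick.

```java
import java.awt.image.BufferedImage;
import java.awt.image.Raster;
import java.awt.image.WritableRaster;
import java.util.Arrays;

// Hypothetical helper class for illustration only.
class PreOcrFilters {

    /**
     * Returns a despeckled copy of an 8-bit grayscale image using a
     * 3x3 median filter. Isolated dark or light pixels are replaced by
     * the median of their neighborhood; border pixels reuse the nearest
     * in-bounds neighbor.
     */
    static BufferedImage despeckle(BufferedImage src) {
        int w = src.getWidth();
        int h = src.getHeight();
        BufferedImage out = new BufferedImage(w, h, BufferedImage.TYPE_BYTE_GRAY);
        Raster in = src.getRaster();
        WritableRaster dst = out.getRaster();
        int[] window = new int[9];

        for (int y = 0; y < h; y++) {
            for (int x = 0; x < w; x++) {
                int n = 0;
                for (int dy = -1; dy <= 1; dy++) {
                    for (int dx = -1; dx <= 1; dx++) {
                        // Clamp coordinates so the window never leaves the image.
                        int xx = Math.min(w - 1, Math.max(0, x + dx));
                        int yy = Math.min(h - 1, Math.max(0, y + dy));
                        window[n++] = in.getSample(xx, yy, 0);
                    }
                }
                Arrays.sort(window);
                dst.setSample(x, y, 0, window[4]); // middle of 9 sorted values
            }
        }
        return out;
    }
}
```

A median filter is a common choice for speckle noise because it removes single-pixel outliers without averaging them into their neighbors, but it can also erase very fine strokes and punctuation, so it is worth comparing OCR results with and without it on a sample of real scans.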

#6

I would suggest Intel for its zero-cost runtime licensing.

#7

It depends on the number and quality of the original images. Managed code and imaging toolkits will work, but they are not always the best solution if you have several million images to process. For small batches and tight budgets, I agree with the previous posters that projects like AForge, Paint.NET, and other open source computer vision libraries will do the trick. Of course, you are on your own if the results do not improve. At least this lets you put everything you need in one application at low cost.

If you are processing several hundred thousand images a month, then I would suggest you divide the process into smaller workflow steps and tweak each one until your cost per image gets as close to zero as you can get it. You will find that the OCR results improve quickly at first and then level off sooner than you expected. (I'm not a big fan of OCR, but it has its place.)

I use a commercial Windows product from Recogniform to process and clean up the images prior to OCR in batch mode, using scripts adjusted for various kinds of images. If an image fails QC or is rejected by the OCR engine, it is "repaired" by hand using a custom .NET application built with Atalasoft's toolkit. Batch process everything and only touch what fails.
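
The "batch process everything and only touch what fails" idea can be sketched roughly as follows. This is only an illustration in Java, not the Recogniform or Atalasoft workflow itself: the class `BatchTriage`, the `cleanup` step, and the `passesQc` check are all hypothetical placeholders for whatever cleanup script and QC/OCR acceptance test you actually use.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.Predicate;
import java.util.function.UnaryOperator;

// Hypothetical batch triage helper, for illustration only.
class BatchTriage {

    /** Outcome of one batch run. */
    static final class Result<T> {
        /** Cleaned items that passed the QC check. */
        final List<T> accepted = new ArrayList<>();
        /** Original items that failed cleanup or QC and need manual repair. */
        final List<T> needsRepair = new ArrayList<>();
    }

    /**
     * Runs the automated cleanup step on every input, keeps the cleaned
     * version when it passes QC, and queues the untouched original for
     * manual repair otherwise.
     */
    static <T> Result<T> run(List<T> inputs,
                             UnaryOperator<T> cleanup,
                             Predicate<T> passesQc) {
        Result<T> result = new Result<>();
        for (T input : inputs) {
            T cleaned;
            try {
                cleaned = cleanup.apply(input);
            } catch (RuntimeException e) {
                // A crash in the automated step is treated like a QC failure.
                result.needsRepair.add(input);
                continue;
            }
            if (passesQc.test(cleaned)) {
                result.accepted.add(cleaned);
            } else {
                result.needsRepair.add(input);
            }
        }
        return result;
    }
}
```

For example, `BatchTriage.run(files, f -> clean(f), f -> ocrConfidence(f) > 0.9)` (where `clean` and `ocrConfidence` are stand-ins you would supply) leaves only the rejected originals in `needsRepair`, so the manual tool sees nothing else.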
