OmniPage Capture SDK: Developer API for integrating OCR into C++, C# or Java

Established as the core technology behind all Kofax imaging products, the OmniPage Capture SDK is widely recognized as the foremost imaging and OCR toolkit on the market today.

Reliable, highly available applications

The OmniPage SDK includes new capabilities that enable developers to create reliable, long-running document processing applications that scale up automatically and use either all of the CPU power of a multicore machine or only a designated share of it. All of this can be achieved with little programming effort.

Powerful form processing capabilities

The OmniPage Capture SDK offers various tools and capabilities required to create complete form processing applications. It enables users to easily define form templates for all the document types they wish to process and contains a convenient API to execute form matching and data extraction.

Automatic and trainable document classification

The new Document Classifier module enables users to separate different documents and sort them based on layout and/or textual similarities. This intelligent and trainable tool can be used in a wide range of business processes, such as email forwarding, mail-room automation, spam filtering, form processing and data extraction.

New features in version 21

Faster, simpler and more intuitive

Developers who want to add powerful OCR and image processing features to their applications quickly and easily have long relied on the OmniPage Capture SDK.

    • Simplified installation and licensing
    • Improved handwriting recognition with new ICR2 engine
    • Optimised automatic language recognition and 3D deskewing
    • Additional support for document output as PDF/A-1a, -2a, -3a, -3b and -3u, and HTML5
    • New, intuitive distribution assistant
    • Additional and highly productive APIs and settings
    • Automatable training tool for document classification
    • OCR processing for Hebrew and formatted text output for Arabic

In version 20, the OmniPage Capture SDK is even better, faster, more efficient and therefore more cost-effective than its predecessors.

Improved recognition

The recognition accuracy for Asian languages has increased enormously in the new version: layout accuracy by 45% and character accuracy by 40%. Additionally, the SDK now contains an OCR engine for Thai.

Form processing made easy
The OmniPage Capture SDK helps with form processing.

The form processing technology has been improved considerably in the new OmniPage Capture SDK. It now contains the Form Template Editor, a powerful tool for creating, editing, testing and managing form templates. Improved programming interfaces ensure reliable template matching and convenient form data extraction.

Improved image processing through metadata
The OmniPage Capture SDK assists when a document is photographed with a mobile phone.

The OmniPage Capture SDK improves the processing of images taken with digital cameras or smartphones by reading the EXIF data stored in these images. A host of new algorithms for automatic resolution calculation and shading correction, together with new workflows, ensure that as much data as possible is captured and converted into text during image processing.

User-friendly APIs

The OmniPage Capture SDK provides software developers with multiple programming interfaces and a large selection of samples. For newcomers, however, the sheer number of modules and functions can make it hard to select the right interface and get started with programming.
All three programming interfaces of the SDK cover almost the same functionality and provide the same OCR accuracy. They do, however, follow different concepts, and some of them have been implemented for multiple programming languages. Depending on which programming language you want to use and whether you need the Visual Controls of the SDK, you can quickly narrow the selection down to the interface that suits you best.

The RecAPI is a classical, function-oriented interface that controls image input, recognition and output. It is divided into a Kernel Level for processing single pages and a Plus Level for handling multi-page documents. It can be used with all native programming languages that support DLL function calls. In C#, it is defined as two large static classes.

When should you use the RecAPI?

  • When using a native programming language able to call functions from a DLL
  • When porting existing Capture SDK projects

Examples:

  • Sample 1 – Converting an image file into a searchable PDF document
  • Sample 2 – Recognising text and barcodes on all pages of an image file

.NET Objects is a modern class library that offers the same functionality as the native RecAPI and is the first choice for C# programmers.

When should you use .NET Objects?

  • For all new C# projects that don’t need Visual Controls (control elements of the Professional Toolbox)

Samples:

  • Sample 1 – Converting an image file into a searchable PDF document
  • Sample 2 – Recognising text and barcodes on all pages of an image file

IPRO is an ActiveX interface that provides an older class library and is the only way to access the control elements (Visual Controls) of the Professional Toolbox.

When should you use IPRO?

  • For projects that need Visual Controls (control elements of the Professional Toolbox)
  • For VB.NET projects
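
The selection rules above can be summarised as a small decision helper. This is only an illustrative sketch in Python, not part of the SDK: the function name, the parameter names and the order in which the rules are applied are assumptions made for this example, and any language other than C# or VB.NET is assumed to be a native language that can call DLL functions.

```python
from enum import Enum


class Interface(Enum):
    """The three programming interfaces described above."""
    RECAPI = "RecAPI"
    DOTNET_OBJECTS = ".NET Objects"
    IPRO = "IPRO"


def choose_interface(language: str,
                     needs_visual_controls: bool = False,
                     porting_existing_project: bool = False) -> Interface:
    """Hypothetical helper: apply the selection rules in an assumed priority order."""
    lang = language.strip().lower()
    # IPRO is the only access to the Visual Controls and the choice for VB.NET.
    if needs_visual_controls or lang in {"vb.net", "visual basic .net"}:
        return Interface.IPRO
    # Existing Capture SDK projects that are being ported use the RecAPI.
    if porting_existing_project:
        return Interface.RECAPI
    # New C# projects that do not need Visual Controls use .NET Objects.
    if lang in {"c#", "csharp"}:
        return Interface.DOTNET_OBJECTS
    # Assumption: every remaining language is native and can call DLL functions.
    return Interface.RECAPI


print(choose_interface("C#").value)  # .NET Objects
```

The Visual Controls rule is checked first because IPRO is described as the only interface that provides them, so no other condition can override it.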

Operating systems

For commercial software vendors looking for OCR precision and sophisticated image processing, the OmniPage Capture SDK offers scalable OCR and comprehensive imaging features to develop high-value, competitive products on all platforms.

Windows

Development environment:

  • Windows 7, 8, 8.1, 10, 32-bit or 64-bit
  • Windows Server 2008 R2, 2012 R2, 2016
  • Intel or AMD processor, 32-bit or 64-bit
  • 4 GB of memory (RAM), 6 GB recommended
  • 4 GB of free hard disk capacity
  • Microsoft Visual C/C++/C# version 2003 or higher
  • Microsoft Visual Basic .NET

Runtime environment:

  • Windows 7, 8, 8.1, 10, 32-bit or 64-bit
  • Windows Server 2008 R2, 2012 R2, 2016
  • Intel or AMD processor, 32-bit or 64-bit
  • 2 GB of memory (RAM), 4 GB recommended
  • 300 MB of free hard disk capacity

Linux

Hardware requirements:

  • Intel or AMD 64-bit CPU

Tested operating systems:

  • Fedora 20, 21
  • Debian 7.5, 7.7 and 8.1
  • Oracle Linux 6.5, 7.0
  • CentOS 6.3

Mac

Hardware requirements:

  • Intel 32-bit or 64-bit CPU

Software requirements:

  • OS X 10.8 or higher

Main Modules and Features (platform columns: Windows, Linux, Mac)

  • Western and Asian OCR
  • Single OCR engine
  • Two-Way voting engine
  • Three-Way voting engine
  • Full Asian engine
  • Fast Asian engine
  • Voting Asian engine
  • Thai OCR engine (Text, XML, Searchable PDF only)
  • Arabic OCR engine (Text, XML, Searchable PDF only)
  • Hebrew OCR engine (Text, XML, Searchable PDF only)
  • Vietnamese OCR engine (Text, XML, Searchable PDF only)
  • Hong Kong character recognition option
  • Textual PDF output for Arabic, Thai, Hebrew
  • Arabic accuracy improvement
  • Asian zoning and accuracy improvement
  • Eastern European languages and Turkish OCR accuracy improvements
  • Support for Form Template Editor format
  • Asian language support for template form processing API
  • Standard 1D/2D Barcode
  • QR and DataMatrix in Standard 2D Barcode
  • ICR module
  • Updated and improved ICR engine
  • OMR module
  • Document Classifier API
  • Command Line Document Classifier training tool
  • Logical Form Recognition (LFR) enhancement and radio button detection
  • Intelligent Workflow Runner (IWR)
  • Intelligent Workflow Runner (IWR) API
  • IWR Customer Workflow
  • Image and searchable PDF output
  • Textual PDF output
  • Language detection: multiple Western languages on same page
  • Language detection: single Western or Asian language for whole page
  • Camera/Smartphone image handling improvement
  • 24/7 stability enhancement
  • Enhanced JPG2K and MRC
  • More JPG2K speed improvements
  • PDF/A-1a, -2a, -3a, -2b, -3b and -3u support
  • PDF font substitution and form field rendering improvements
  • Formatted output: RTF
  • Formatted output: DOCX/XLSX/HTML
  • HTML5 output
  • Formatted OCR output for Arabic, Hebrew
  • Formatted OCR output for Thai, Vietnamese
  • Central Licensing for Scalable Servers
  • Code security enhancement
  • Apple Store compliant

Windows Only Features

  • Professional Visual Toolbox
  • Form Template Editor (FTE)
  • Enhanced Form Template Editor with Asian language support
  • Document Classifier Assistant
  • WIA2 support
  • VB and C# .NET API
  • Scanning Enhancement Tools (SET)

Hardware/OS Configuration Support

  • Native 32-bit library
  • Native 64-bit library

Do you want to stay ahead of your competition?
We are happy to help you!

Licensing options

The licensing options for the OmniPage SDK were designed around user requirements. No dongles are required. The licenses listed below are developer licenses for creating applications. For licensing options and prices of runtime licenses, which cover distributing and installing your applications together with the required SDK components, please send us a short request via our contact page.

Base OCR Kit

  • Developer license for installing the OmniPage Capture SDK. Contains all modules and multiple test runtime licenses to test applications in the target environment.
    • C/C++/.NET libraries for Western OCR
    • OCR-A, OCR-B, E-13B (MICR)
    • PDF technology: PDF 1.7, PDF/A, PDF Normal, image-only PDF, PDF with image-over-text and high PDF-MRC compression rate
    • Professional Visual Toolbox

Recognition Add-on

  • Add-on for advanced document processing technologies
    • Barcode
    • ICR
    • OMR
    • Form Template Editor
    • Document Classifier Assistant

Asian Plus Add-on

  • Add-on for additional support of Asian languages
    • Chinese (Simplified and Traditional)
    • Japanese
    • Korean
    • Arabic
    • Thai