AnyDoc Software, Inc.
P: 800.775.3222
F: 813.222.0018
E: info@anydocsoftware.com

 
Glossary of Terms

 

A


AccuID

An OCR for AnyDoc® method for identifying master form templates. This method works at the form family level to build an identification table based on the unique topology of each master form template in the form family and compare it to incoming data images. Then AccuID sorts and identifies data images from the correct master form template for processing.

 

AccuZip

OCR for AnyDoc® accesses the AccuZip database to validate addresses in the United States of America. It is capable of processing address information from all 50 states, and from virtually all U.S. territories and military installations abroad. AccuZip is available as an add-on feature for OCR for AnyDoc.

 

Address Extraction

The Address Extraction feature of OCR for AnyDoc® allows you to extract U.S. and Canadian address data from your documents and define how the data is displayed in your output.

Address Extraction gives you the option to validate address data using AccuZip.

 

AnyApp Technology

AnyApp™ locates data on template-resistant forms—regardless of where it resides on the document—by searching for defined data labels, such as “amount due” and “invoice number”; data format, such as dd/mm/yyyy; data type, such as alpha, numeric or both; and/or location, such as “just look in the top half of the document for this data.” It then remembers where it found the data when the document type is processed again. AnyApp is the technology behind the AnyDoc Software solutions AnyDoc®EOB™, AnyDoc®INVOICE™, AnyDoc®REMIT™, AnyDoc®NOTICE™, and more.

 

Attachments

In OCR for AnyDoc®, attachments are document images to be archived along with the processed document.

 

Audit Phase

During the Audit phase of OCR for AnyDoc®, the Auditor module allows you to review the work of your verification operators. It allows you to audit specific tasks, do random checks, or review all of an operator’s work.

 

AutoFlow

An automated method used to check available workstations for batch processing jobs that need to be completed.


B

 

Bar Code Zone

A master form template zone that defines the location of bar code data in the document image. OCR for AnyDoc® recognizes these bar codes and converts them into alphanumeric data.

 

Batch Separator Page

Printed for a form family to provide a fast and accurate method of entering control information for a given batch of forms.

 

BCR

Bar Code Recognition. The process of reading and extracting data from bar codes on a document. See also Bar Code Zone.


C

 

Caere

An OCR engine developed by Caere, Incorporated (now Nuance Communications, Incorporated).

 

Capture

Document capture is the method of obtaining the document image (either from scanning or importing) from which OCR for AnyDoc will extract data. Data capture is the extraction of this data, which can be used in a database or back-end system.

 

Character Constraint Boxes

With character constraint boxes, you restrict the amount of data that can be entered on a form by providing a specific number of boxes to be filled in by the user.

 

CMS 1500

The standard form from the Health Care Financing Administration (HCFA, now the Centers for Medicare & Medicaid Services), designated for submitting healthcare claims to insurance companies. Previously known as the HCFA 1500.

 

Commit Phase

The last phase of OCR for AnyDoc® batch processing prior to data output, where output files (e.g., TXT, GTO, XML, PDF) and archive images are written to the appropriate directories.

 

Conditional Procedure

A user-designed routine that features advanced character searches, recognition and replacements. A conditional procedure retains or filters data, based on the specific condition.


D

 

Data Capture

The ability to capture digital data from scanned images of paper documents. This data can then be transmitted to a financial or back-end system for entry into an ODBC-compliant database.

 

Date Extraction

A feature in OCR for AnyDoc® that automatically converts extracted date information from multiple input formats into a user-defined output format.
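As a rough illustration of the general idea (not OCR for AnyDoc's actual implementation), the Python sketch below tries a few assumed input formats and rewrites a matching date in one output format. The function name normalize_date and the list of input formats are hypothetical.

```python
from datetime import datetime

# Hypothetical input formats to try, in order (an assumption for this sketch).
INPUT_FORMATS = ["%m/%d/%Y", "%Y-%m-%d", "%B %d, %Y", "%d %b %Y"]

def normalize_date(text, output_format="%m/%d/%Y"):
    """Return the date in output_format, or None if no input format matches."""
    for fmt in INPUT_FORMATS:
        try:
            parsed = datetime.strptime(text.strip(), fmt)
        except ValueError:
            continue  # this format did not match; try the next one
        return parsed.strftime(output_format)
    return None
```

A value that matches none of the formats returns None, which a caller could treat as a field needing operator review.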

 

Delimiters

Special characters that separate data fields and/or records so the data can be parsed from the file by a program or a script.
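A minimal sketch of how a script might parse a delimited file, assuming a pipe character as the field delimiter and a newline as the record delimiter. The function name parse_delimited and the sample data are illustrative only.

```python
import csv
import io

def parse_delimited(text, delimiter="|"):
    """Split delimited text into records (lists of field values)."""
    reader = csv.reader(io.StringIO(text), delimiter=delimiter)
    # Skip blank lines so a trailing newline does not yield an empty record.
    return [row for row in reader if row]

records = parse_delimited("1001|ACME|250.00\n1002|Globex|75.50\n")
```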

 

Distributed Capture

A means by which organizations can scan documents remotely, either from branch offices around the world or simply downstairs. The scanned images are then transmitted via a secure Internet connection for data capture processing at a centralized location, such as corporate headquarters.

 

Document Set

A group of related documents that need to be processed together as a batch.


E

 

EDI

Electronic Data Interchange. The transfer of data from one business to another over a network.  

 

Endorser

A mechanism found on some scanners that prints an incremental number on an image, which facilitates document indexing.

 

EOB

Explanation of Benefits. A statement from a health insurer that itemizes how benefits were approved or denied for a claim.

 

External Table

A database table connected through an ODBC link.

 

Extract Process

An OCR for AnyDoc® batch control process that de-skews the image, performs form removal functions, enhances images, regenerates characters, applies pre- and post-processing rules that have been set, etc.


F

 

Form Family

One or more master form templates grouped together for batch processing.  Examples of form families include a batch of invoices that must be batch-balanced and a mortgage folder that has pages to be processed, containing information used to index other pages in the folder within an image retrieval system.

Form families can do the following:

  • Archive images
  • Batch balance controls
  • Create a header record
  • Enhance form identification
  • Name directory structure
  • Perform batch verification

H

 

HCFA 1500

See CMS 1500.

 

High Speed Verification

An optional verification phase in OCR for AnyDoc® batch processing in which operators see only the image’s questionable characters. Verification is “high speed” when the operators can correct at once all questionable characters in a batch, rather than tabbing to each questionable character on a data image.


I

 

ICR

Intelligent Character Recognition. The process of converting handwritten characters into ASCII text through the use of a recognition engine.

 

Identify Phase

The phase of OCR for AnyDoc® processing, during which AutoID automatically identifies each document type.  AutoID uses static elements on each document, such as barcodes, literals or graphics, to identify the document.

 

Image Registration

The use of an image on a document (such as a square or a triangle), both contiguous and containable, as a registration point to help OCR for AnyDoc® automatically identify a document type.

 

Import Phase

The process of bringing images into OCR for AnyDoc® with or without the use of a scanner. Import is a batch processing phase in OCR for AnyDoc.

 

Indexing

A means of electronically identifying a scanned document image for archival and retrieval purposes.

 

Intelligent Extraction

On an OCR for AnyDoc® template, Intelligent Extraction recognizes a date, an address, or a currency type and converts that data zone into a user-specified format.
For example, all dates can be output into the format MM/DD/YYYY, no matter how the date is written on the document.

 

Inverted Text

The placement of white text on a black background in a scanned document image. The text and background must be inverted for OCR for AnyDoc® to read the text.
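For illustration, inverting an 8-bit grayscale image amounts to replacing each pixel value v with 255 - v. The sketch below assumes the image is held as a list of rows of integers from 0 (black) to 255 (white); the function name invert_image is hypothetical and this is not the product's own routine.

```python
def invert_image(pixels):
    """Return a copy of an 8-bit grayscale image with every pixel inverted."""
    return [[255 - value for value in row] for row in pixels]

# White text (255) on a black background (0) becomes black text on white.
inverted = invert_image([[0, 255, 0], [255, 255, 0]])
```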


J

 

Job Queue Directory

A temporary network location where OCR for AnyDoc® stores its processing files. These files identify the status of each job/image/page.

 

Job Manager

In OCR for AnyDoc® and AnyDoc®CAPTUREit, Job Manager is a server component, typically installed on a network server, that is used to facilitate automated remote data capture.


K

 

Key

A key (or primary key) is a field that uses a number or character sequence unique to each record in a table (e.g., social security number) for identification purposes.

 

Key-from-Image

The process by which data entry operators view and key data from electronic, rather than paper, versions of documents. The key-from-image approach to data entry is approximately 10% more efficient than traditional data entry methods.
In OCR for AnyDoc®, the key-from-image verification GUI is the default verification method. With the program’s rope and expand capabilities, however, operators key significantly less data.


L

 

Literal

Synonymous with text. A literal can be machine print (OCR) or handprint (ICR). Static literals on a document can help OCR for AnyDoc® with registration points and to identify a form type.

 

Lookup Table

A table that OCR for AnyDoc® accesses to validate specific data residing on a processed document.
For example, OCR for AnyDoc can access a P.O. number table to validate the vendor associated with the P.O. number on an invoice.
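The P.O. number example can be sketched as a simple table lookup. The table contents, the name PO_TABLE, and the function validate_vendor below are all hypothetical; a real deployment would more likely query a database table.

```python
# Hypothetical lookup table mapping P.O. numbers to vendor names.
PO_TABLE = {"PO-1001": "ACME Supply", "PO-1002": "Globex"}

def validate_vendor(po_number, vendor_name, table=PO_TABLE):
    """Return True if vendor_name matches the vendor on file for po_number."""
    expected = table.get(po_number)
    if expected is None:
        return False  # unknown P.O. number: nothing to validate against
    # Compare case-insensitively and ignore surrounding whitespace.
    return expected.casefold() == vendor_name.strip().casefold()
```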


M

 

Manual Indexing

The assignment of areas in a particular document to a particular field in a document or data table.

 

Mark Sense

Data confined to one or more selections in a series, as in a survey. The data is selected by checking a box or filling in a bubble.

For example, a survey may include gender information. A respondent fills in the bubble next to ‘M’ or ‘F’ on the survey to indicate his or her gender. OCR for AnyDoc® searches that mark sense zone for the data and extracts the selected response for that question on the form, based on the pixels present in the selected bubble.

 

Mark Sense Mark

A mark on a form identifying a selection of mark sense data. The mark consists of the presence of pixels (such as a check mark, a filled-in box, a signature, etc.). The recognition engine searches for the presence (a “hit”) or absence (a “miss”) of a mark.
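One simple way to decide between a hit and a miss is to measure what fraction of a zone's pixels are dark. The sketch below assumes a binarized zone (1 for a dark pixel, 0 for background) and an arbitrary 20% threshold; both the function name is_marked and the threshold are assumptions, not the recognition engine's actual logic.

```python
def is_marked(zone, threshold=0.2):
    """Return True (a hit) if the share of dark pixels reaches the threshold."""
    total = sum(len(row) for row in zone)
    if total == 0:
        return False  # an empty zone cannot contain a mark
    dark = sum(sum(row) for row in zone)
    return dark / total >= threshold
```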

 

Master Form Template

Scanned or imported document images used to define the zones and parameters for processing data from structured documents of the same type.


N

 

Noise Filtering

Removes particles (black dots representing noise) from the document image.
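As one possible illustration, the sketch below removes isolated black pixels, meaning dark pixels with no dark neighbor among the eight surrounding positions. It assumes a binarized image stored as rows of 0 and 1; the function name remove_isolated_pixels is hypothetical, and production noise filters generally handle larger specks as well.

```python
def remove_isolated_pixels(image):
    """Return a copy of a binary image with single-pixel specks cleared."""
    height = len(image)
    cleaned = [row[:] for row in image]
    for y in range(height):
        for x in range(len(image[y])):
            if image[y][x] != 1:
                continue
            # Look for any dark pixel among the (up to) eight neighbors.
            has_neighbor = any(
                image[ny][nx] == 1
                for ny in range(max(0, y - 1), min(height, y + 2))
                for nx in range(max(0, x - 1), min(len(image[ny]), x + 2))
                if (ny, nx) != (y, x)
            )
            if not has_neighbor:
                cleaned[y][x] = 0
    return cleaned
```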

 

Note Zone

Note zones define areas of the form containing data that are not processed by an OCR or ICR engine. OCR for AnyDoc® prompts the operator to enter the data during verification. A note zone is useful for obtaining data such as signatures or other unconstrained handprint.


O

 

Omit Zone

With omit zones, you define the areas of a document to be ignored during OCR or ICR evaluation.  Omit zones ensure that preprinted literals in a zone are not recognized as text.

 

OMR

Optical Mark Recognition. The process of data selection from a list of options on a document, based on the presence or absence of a mark next to item(s) on that list. See also Mark Sense.

 

Orientation

The way text is displayed on a page, either vertically (portrait) or horizontally (landscape). The orientation parameters in OCR for AnyDoc® allow users to ensure that text in a page reads from left to right as it is being processed, regardless of the text orientation on the page when it was scanned.

 

Output

Output is the final phase of OCR for AnyDoc® processing. Once the data has been captured, validated and verified, both the data and the document images are then delivered to a company’s back-end system.

 

Output Parameters

Enable the configuration of both ASCII text and images output by OCR for AnyDoc®. They can be configured at the form level or the zone level.

 

Overlay

An image that is superimposed on all data images during verification and/or is archived for a specific master form template.


P

 

Parameter

Parameters are a set of tools that help OCR for AnyDoc® fine-tune form removal and recognition. They also help to define rules and output specifications.

 

Pass 1 Verification

During this phase of OCR for AnyDoc® verification, operators view a data image’s questionable characters in the context of the zone and form in which they appear. Pass 1 Verification also allows the operator to correct any recognition rules implemented by rules parameters, mark sense parameters, table link parameters, etc.

 

Pass 2 Verification

An optional OCR for AnyDoc® verification phase that functions either as a method to verify data not examined by Pass 1 Verification, or as a follow-on supplement to Pass 1 Verification.

 

Patch Code

A parallel pattern of alternating black bars separated by spaces and placed near the leading edge of a paper document. Sometimes used to separate documents and batches or to perform identification.

 

Permissions

Security measures applied to objects (e.g., database tables, etc.), based on defined user rights.

 

Pixels

Picture (pix) elements (els). Filled-in dots in a grid that form text or a picture on a computer screen or on printed output.

 

Process

Process is the hardest-working phase in OCR for AnyDoc®. During processing, OCR for AnyDoc separates data from non-data form elements, such as character boxes, lines and background noise. Once the data is separated, OCR for AnyDoc captures the data and validates it against pre-defined business rules.


Q

 

Quality Assure (QA)

In AnyDoc®CAPTUREit and OCR for AnyDoc®, an additional batch processing phase (off by default) that allows the operator to check and improve the quality of scanned or imported images.
In OCR for AnyDoc, whether and how the quality assurance phase is used depends upon the form family settings.

 

Questionable Character

A data character with a value undetermined by the recognition engine (where the confidence percent level is below the configured value).
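A minimal sketch of the thresholding idea, assuming the recognition engine reports each character with a confidence percentage. The (character, confidence) pair format, the function name flag_questionable, and the default of 80 are assumptions for illustration.

```python
def flag_questionable(characters, min_confidence=80):
    """Return the positions of characters whose confidence is below the minimum."""
    return [
        index
        for index, (_, confidence) in enumerate(characters)
        if confidence < min_confidence
    ]

# Each pair is (recognized character, confidence percent).
flagged = flag_questionable([("A", 99), ("8", 62), ("C", 80)])
```

Only the second character falls below the configured value here, so it alone would be presented to an operator for verification.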

 

QuickApp™

With QuickApp technology, OCR for AnyDoc® users can eliminate key-from-image processes when capturing data from exception or seldom-seen documents – without the need for a template.


R

 

Reader Response Zone

A type of mark sense zone, Reader Response zones define areas of the form to be evaluated for the presence or absence of a mark, which typically takes the form of a circle around a number.

 

Registration Zone

The defined area of a document image that allows OCR for AnyDoc® to determine the image’s length and width so the program can effectively remove skew from the image and align it to the associated Master Form Template. The Registration Zone consists of two or more registration points on the document, defined by an image, a literal, a cross line and/or data.

 

Remote Verification

The ability of a human operator to verify, from an off-site location, characters flagged by OCR for AnyDoc® as questionable.
Both AnyDoc®CAPTUREit and OCR for AnyDoc enable access to data verification from a remote location via a LAN or an Internet connection.

 

Rope and Expand

Roping and expanding magnifies a selected area on a scanned image. The smaller the roped area, the greater the magnification.
During template design, an area must be roped and expanded prior to adding a zone.
During key-from-image processing, roping and expanding text on the document image automatically populates the associated data fields.


S

 

Sticky Note

A tool within OCR for AnyDoc® that verification operators can use, without disrupting verification activities, to notify a supervisor of unexpected results in a particular row or line of data during processing.

 

String

A sequence of data characters.

 

Structured Documents

Forms and documents where the desired data are located in static positions on the page across the document type. Examples of structured documents include surveys and vehicle registration forms.
OCR for AnyDoc® and AnyDoc®CAPTUREit specialize in processing structured documents.

T

 

Table

A file containing organized data (in rows and columns) on a specific topic, such as Vendor ID Number. As OCR for AnyDoc® processes, it can access lookup tables to automatically populate data fields related to the documents and data it captures.

 

Template

See Master Form Template.


U

 

Unstructured Documents

Forms and documents where the desired data can be located in varying positions on the page of the same document type. Examples of unstructured documents include invoices and Explanation of Benefits (EOB) forms.
AnyApp technology was developed to process unstructured documents and is found in AnyDoc®EOB™, AnyDoc®INVOICE™, AnyDoc®REMIT™, AnyDoc®NOTICE™, and more.

 

User Group

A group of OCR for AnyDoc® users with the same access rights. The OCR for AnyDoc administrator defines the user groups and grants rights to them.


V

 

Verify Phase

The phase of OCR for AnyDoc® processing where data characters flagged as questionable by OCR for AnyDoc get verified, either by a separate recognition engine or by a human operator.


W

 

Work Flow Manager

The control panel for all production-level batch processing performed by OCR for AnyDoc®.


Z

 

Zone

An area in the Master Form Template defined as the location of a specific data type. Each zone type is designated by a separate zone boundary color.

 

To learn more, contact us today.
