ELO DocXtractor II Form eliminates the high costs of capturing forms from day one. The intelligent module is the ideal solution for processing structured documents such as forms and features all the functionalities required for this. A large amount of data is thus captured in a short time and at the best quality. This leads to more flexible and, more importantly, quicker running business processes.
Increase your efficiency
with maximum productivity and minimal work
ELO DocXtractor II FORM
>>
>>
Competitive advantages
through time savings
Efficiency from day one
Forms are still a major constituent of the business world. Companies receive all types of different forms, such as orders, requests, replies, etc. every day. These arrive by post, fax or the Internet in varying qualities and with diverse field information. Regardless of whether the information is handwritten or printed; manual capture is generally very time-consuming and prone to errors. With ELO DocXtractor II Form, a large volume of data can be captured in a short time and in better quality. ELO DocXtractor II Form is seamlessly integrated in the ELO ECM Suite. The simplicity of setting up forms and its unique user-friendliness enables this flexibly extendable system to successfully get started with business process optimisation.
ELO DocXtractor II Form provides you with the
following:
O
DocXtractor I
I FOR
M
The complet
e solution for rapid form processing
· Ability to focus on actual profit-making tasks · Cost savings
· Efficient working due to fewer errors
· Faster additional processing through faster availability and higher data quality
· More flexible processes
· A quality-assured data export to databases of downstream systems (e.g. in order processing or at authorities for faster additional processing)
The solution
Functionality of
ELO DocXtractor II Form
Image preprocessing: Sound results
After the documents have been scanned in, the configurable preparation of the scanned documents for the OCR takes place. The following processing steps can be used here:
· Removal of preprint and lines (e.g. on remittance slips)
· Rotation tolerance and upside-down correction for images rotated by 180° · Removal of noise for better image quality · Automated restoration of dots removed for the OCR (e.g. dots in the date or for umlauts in German)
· Angle of rotation correction for images scanned in at the wrong angle
· Removal of punched holes
If the customer has a scan software which offers image preprocessing, this function can simply be deactivated within ELO DocXtractor II Form.
Intelligent classification
In ELO DocXtractor II Form, the classification of documents can be based on the layout or a simple content-related procedure. With the layout-related classification, a document is analysed based on its form and structure. The content-related method interprets the entire content of the document or part of the content.
Possible form classifications:
· Classification based on layout
· Classification based on search patterns
(text at a specific position)
· Classification using barcodes · Classification using page sizes
With layout-based classification, for training purposes the ELO DocXtractor II Form only requires a sample document in order to analyse all the required features. Documents with an almost identical layout are then recognised automatically. By specifying a search pattern, like a specific position in the text, 100% classification can be ensured. Other classification methods are available in ELO DocXtractor II Mailroom.
>>
Competitive advantages
through time savings
>>
Intelligent and structured
Targeted processing
A form comprises different field types. After ELO DocXtractor II FORM has classified the document, class-specific fields are extracted. The following field types are supported:
· Address (for recognising address fields)
· Anchor (for locating an exact position in the document) · Check box (for recognising check boxes)
· Barcode (for barcode decoding within a document) · Search pattern (definition of search and result patterns) · Table (for extracting complete tables)
· Text (for extracting information at specific positions)
· TopDown (function for selecting information from the database) · Various special fields such as batch, transaction and document ID
O
DocXtractor I
I FOR
M
The complet
e solution for rapid form processing
Fig. 2: Document definition and form verifier in ELO DocXtractor II FORM
The solution
The “anchor” field type describes the anchoring of a position within the document for the exact positioning of a specific information field. This is necessary if when printing out the document, for example, it is still only a certain percentage of its original size. The “check box” field type is used to recognise check boxes in a document. ELO DocXtractor II Form automatically recognises whether the box is checked or not and assesses this information.
Using the “barcode” field type, a barcode reader can be used to decode barcodes on a document. The system finds the barcode regardless of its position in the document.
>>
Flexible adaptation
O
DocXtractor I
I F
orm
The complet
e solution for rapid form processing
Using the “search pattern” field type it is possible to define search and result patterns which are linked to one another. Thus in a free letter, for example, the word “date” can represent the search pattern and the following value, e.g. “21.10.2008” can be accepted as the result pattern.
The “table” field type is used to extract complete tables. Tables in a fixed position (e.g. on forms) and free tables (e.g. on invoices) can be defined. Since the structure of tables may vary considerably, different extraction strategies are offered.
The “text” field type is used to extract information with a fixed position in forms. Using regular expressions such as “dd.mm.yy” for the date or the structure of the value for invoice or tax numbers, it is possible to specify the structure of the content in advance and to verify it precisely using these criteria.
The “TopDown” field type (alignment of data) is always used to select information which is filed in a database. The alignment is high performance and tolerant vis-à-vis OCR errors or different spellings. Even OCR results which have been heavily distorted can still result in good alignment values.
Save costs in post-processing
When the field types have been analysed, ELO DocXtractor II Form provides a unique opportunity to check and correct them. ELO DocXtractor II Form Improver determines the best values based on all the information selected and the alternatives. This increases the quality of recognition and considerably reduces post-processing efforts.
The solution
High quality of exported data
All checks carried out during the analysis can be activated in the verifier (post-processing location) at the click of a button. Here it is ensured that the data selected also matches. Carrying out checks in the verifier avoids incorrect entries when manually capturing data and thus significantly increases the quality of the exported data.
Logic check
Using restrictions (determination of requirements/ conditions), complex mathematical and logic checks can be defined for fields. The restrictions are assessed by ELO DocXtractor II Improver. The features of every checked value are determined. Based on these configurable features, it is determined whether a value is to be exported directly or verified first. Via restrictions, customer-specific checks can
Secure testing for maximum success
Processes which cannot be featured with the standards provided can be implemented using the ELO DocXtractor Scripting Programming Language (SPL), an integrated programming language. ELO DocXtractor contains a complete development environment for testing programs for this purpose. A range of language elements is available for the various use cases.
ELO is available through:
t/Germany
. Reproduction, in par
t or in whole, only with written permission
. Item no. A002-DOCX-EN
ELO Digital Office GmbH · Stuttgart (Germany) · www.elo.com · [email protected] ELO Digital Office CH AG · Zürich (Switzerland) · www.elo.ch · [email protected] ELO Digital Office AT GmbH · Linz (Austria) · www.elo.com · [email protected]