|
Document
Image Management
Imaging
101
In the Beginning
Personal computers first emerged at the
onset of the 1980s, many thought they would quickly usher
in the era of the paperless office. Currently over 300 million
tons of paper per year is used. Offices are drowning in paperwork
that sometimes even impedes their ability to compete. One
potential solution: computerize the paper, which means
to turn pages into computer image documents and then use computers
to maintain the information contained therein.
Understanding
Document Images
Document images are the computer-file equivalent
to paper documents. In other words, paper documents are scanned
by computer scanners, these pictures (images) of pages is
then captured in computer memory. These captured pictures
(images) are then stored in electronic form as computer files
which can be viewed on PC monitors, printed, archived electronically
(put in long-term computer storage) or retrieved electronically,
virtually instantaneously, via a number of computerized methods.
Document images differ, for example, from
word processed documents, in that image documents are essentially
computerized images made up of patterns of tiny dots, whereas
word-processed documents code all the numbers and alphabetic
letters used in text, so the text can be edited. With image
documents, at least initially, any text on a page is handled,
as a picture of text, not coded text that can be easily edited.
Some of the potential
benefits of image document processing include:
- Automation of
some of the endless flow of paperwork that passes through
every office
- Reduction of paper
piles on people's desks (and associated problems)
- A logical method
of organizing the papers that everyone uses
- Reduction of time
spent transporting documents from one desk or office to
another plus the time it takes for documents to move from
point to point
- Reduction of time
spent looking for lost and archive documents.
Using
document image systems to create a totally paperless
office is expensive and could very well be counter productive.
There will always be some paper, but using document image
systems to manage certain types of documents for specific
applications can be highly rewarding.
Document Image processing is not a panacea,
however buyers must consider not only whether upfront and
ongoing costs are justified, but also must make sure that
the systems truly provide the benefits they claim to provide.
After all, one doesn't replace one, or even twenty $500 file
cabinets with a $10,000 image document processing system without
good, solid reasoning. The real payoffs of document image
processing are as follows:
- An existing
workforce can handle a larger workload by automating certain
aspects. Sometimes, where the burden of processing paper
documents has become unmanageable and is having an adverse
affect on business operation, image processing can bring
the situation back under control.
- A reduced
workforce can handle existing workloads (i.e., enabling
the dreaded "downsizing").
- Business,
cycles can be accelerated so that certain transactions are
completed more quickly. In some cases, businesses can gain
competitive advantages over their rivals by using image
processing systems to speed customer service.
- Space
can be saved because image document processing systems take
up less room than any physical file cabinets they replace.
- Customer
satisfaction can be greatly increased, as the documents
necessary to respond to requests are made instantly available.
Elements of Document
Image Systems
Document image systems consist of a number
of elements:
- Scanners
to start the process of turning paper documents into electronic
images
- Software
to process and manage image files
- General?purpose
PCs or specially?equipped image processing scan stations
- Monitors
suitable for handling image documents
- Storage
systems, ranging from hard disk drives to optical?disk drives
and CD Jukeboxes
- Miscellaneous
capabilities, such as duplexing (the ability to process
both sides of scanned pages at once); scanning of color
documents; book scanning mode (flatbed scanning) and the
ability to link multiple scanners to a single system.
Computers
Computers are required to run document image
systems. Some systems may require centralized computers, called
servers, where most actual document storage will be located,
but all systems require end?user computers where documents
are indexed, viewed, printed, faxed or otherwise processed.
Originally computers were always bundled
in with turnkey document image processing systems, because
the integrating of various hardware and software was difficult.
Now vendors of document image processing systems offer software
only and leave the rest up to the MIS department of the organization.
There is now a trend to revert back to the turnkey systems
as organizations find this to be less expensive than have
specially trained MIS staff and so that there is only one
place to point to when problems occur.
Monitors
No longer are special monitors required
to deal with images. Monitors used to be a major issue for
document image processing systems but now a regular 17"
monitor is acceptable, however 19" or larger monitor
is preferred only because it provides more area to view multiple
images.
Storage Systems
Storage systems consist of:
- Storage
hardware, which may be comprised of hard drives, optical?disk
drives, other types of computer storage or some hierarchical
mix of all types of storage.
- Software
that manages the storage of image files so they are retrievable
and all files can be backed up.
In
the early days of image document processing systems, optical?disk
drives were the premier choice for system storage because
they alone offered very high capacity storage and random access
for virtually instantaneous retrieval of stored documents.
Today, hard?disk drives have caught up to
optical disks in terms of capacity, plus they tend to provide
even speedier random access to image files. Moreover, hard
drives offer a much better price and performance. For these
reasons, hard disk drives are now becoming widely used for
document image processing.
There is one excellent reason, however,
to work with CD-ROM or optical drives that use write?once,
read?many (WORM) technology and that reason is image document
security. Because of the nature of the ways hard drives work
anything
on them can be erased and replaced with new data in a split?second.
WORM drives, in contrast, allow a file to be "written"
on them only once, and then those files cannot be erased.
They can be "read" unlimited times, which means
they can be retrieved, viewed, printed and so on, but never
natively edited or modified. Thus, when image documents must
be archived and their "integrity" never compromised,
WORM storage is the preferred solution.
Whether choosing fixed or removable storage
technology, it's critical that buyers make certain they can
always allow for enough storage to meet their growing needs.
Having an image document system that runs out of storage space
is like having a car that runs out of gas
you have to
stop until you get some more.
There is a technology, which uses the best
of both worlds. It is writing the files onto CD-ROM as a way
of permanent record and then moving the files to a special
area on the hard disk that is not backed up on tape. If for
some reason the hard disk fails the images can easily be restored
from the permanent CD's. Incidentally, most hard disk storage
uses a RAID technique that allows one disk drive to fail without
any loss of data. A new drive is inserted and the RAID system
automatically loads up (from the other drives) what was missing
on the drive that failed.
Printers
Laser and Inkjet printers are used to print
images. Be aware that even though you have a 20 page a minute
laser printer, when you print images you will probably only
get 4 to 5 pages per minute. This is because an average business
page is made up of over 300,000 bits and these have to be
transmitted to the printer. There are special boards installed
on printers that receive the image in compressed mode (like
on your hard disks) and explode them inside the printer allowing
the printer to print at printer speed.
Software
Imaging software provides a broad range
of functionality supporting such processes as document entry
and indexing, information storage and retrieval, business
form creation, workflow routing and communications. Moreover,
it's the software that provides the visual interface through
which end users execute the all-important functions and system
administrators manage overall system functions. Prospective
buyers should understand the use of the software before buying.
Remember, the reason for archiving a document is so that you
will be able to retrieve it when you want. There are four
basic ways documents are located:
1. By looking in the
folder.
2. By searching for the document by name.
3. By searching on a field with an indexing system to find
the document.
4. By searching for a word or words in the document. This
can only be done if the images have previously been OCR'd.
Here are areas of
software that need to be considered:
Ease of Use
The software should be so easy to use; it
should require very little training for the average user.
Software that requires extensive training for the casual user,
can take away from many of the benefits of the imaging system.
Consider how easily and intuitively the software lets users
retrieve image document files. For example, do people have
to work their way through numerous layers of drop?down menus
and then work their way answering crowded dialog boxes?
- How many
retrieval methods are available?
- Can users
leaf quickly through documents in a file folder or just
see a list of file names?
- Can they
search for indexed key words (which support quick searching)?
- Can they
initiate full-text key-word searches?
Filing
Many systems use the same terminology as
with manual filing cabinets (i.e. filing cabinets, folders,
documents). It should be easy to create Folders and to insert
a Document into them. Documents may need to be cross-referenced,
similar to index cards in a library (i.e. Invoice Number;
Purchase Order; Policy Number; Effective Date; etc.).
OCR
Optical Character Recognition software scans
an image and converts the alphanumeric to computer?coded text.
This text can then be stored in a search software database.
Then the image software can perform full?text key?word searches
of the text to find the document with the matching words.
Annotation Tools
The system should not let users edit or
change the original content of image documents, thus providing
the confidence that these are the unchanged originals. Annotations
might include highlighting, redacting (blacking out), stick'em
notes, etc. For this, the software needs to provide on?screen
annotation tools and it's critical that such tools be convenient
and easy to use. Make sure the annotations do not affect the
original image.
Workflow
When the document image processing system
is connected to a network, it is then possible to create computerized
workflows for document images. Workflow is software that allows
for the preprogrammed routing of documents to the workstations
of the various workers. An example is when a worker needs
an item purchased. They fill out a purchase request form.
This form is sent to the supervisor for approval. It is then
sent to accounting for budget approval. From there, it goes
to purchasing where it is attached to a purchase order that
is sent to the warehouse for receiving. When it is received
the receiving documents are attached and it is sent to accounting.
It is finally paid and all the documents are attached to a
copy of the voucher check. In such a situation, workflow would
automatically route the document without the need of physically
moving paper. BIG saving. Some companies have reported clerical
savings of 70% because of the efficiencies and reduction of
mistakes.
Network Communications
Many document image systems operate on a
CFC Standalone basis. For example, when workers need to retrieve
archived documents, calls are placed to document image system
operators who retrieve the files, print a copy and deliver
them to requesting parties. On the other hand, it increasingly
makes sense for document image processing systems to connect
to LANs and corporate networks so managers, users and other
authorized parties can access them. This is especially true
where document image systems are used, not just to put files
into deep archives, but to move them through a processing
cycle, as with the workflow systems described above. It is
important for the buyer to make sure that all the filing structures,
cross references and filing systems are the same for the CFC
Standalone and the network version of the document image software,
if they plan to start with the CFC Standalone and later migrate
to the network.
What happens to
the paper?
When an organization first installs a document
image system normally they are afraid to destroy the scanned
paper. There is the fear that they will need it and worry
that the computer will lose the scanned image. Since good
systems have total backup, most fears are quickly relieved
about a computer's loss of images. Soon the employees become
comfortable in the retrieving of images and they no longer
fear the loss of paper
because with a click, they can
print it. Now, what about the pages that legally say you MUST
keep the original. Even though the courts and Congress are
eliminating more and more barriers of "legal requirements",
there are some remaining where the "original" must
be produced.
Let's see if a live example can show a solution
to the "original" requirement. An insurance company
gets signed applications with eight or so supporting pages.
They scan in the signed application and the supporting papers.
They separate the signed application from the rest of the
papers. The "rest of the papers" are destroyed at
the end of the day when the backups are completed. The signed
applications are then filed as a group by the date scanned.
The scanning software automatically saves the scan-date with
the image. If ever in the future (1 in 100,000) requires the
signed original they need only check for the date scanned,
go to that box and search through the stack for the paper.
As you can see it is much easier to file by date scanned than
putting into folders, particularly when the original very
rarely is required.
Intangibles
The best use of document image systems is
to automate one or more aspects of the processing of specific
sets of documents. Which documents these are may vary from
business to business and department to department. They can
be sales orders, purchasing, accounting, claims processing,
support, etc. Often, these documents, once filed, will rarely
be retrieved. However, when they are needed efficient retrieval
can greatly increase the efficiency of an operation. Whatever
the application the organization should move slowly into the
various aspects of imaging, letting the employees become comfortable
with this new medium. You will be surprised at the new ways
to benefit your origination your people will discover.
Graduation
Congratulations, you have completed Imaging
101. By now you should at least understand a little bit more
as to a document image system is. When PC's first appeared
people were heard to say, "You don't really believe there
will be a PC on every desk
besides there is just enough
room for the typewriter and you'll never replace that."
We believe that in not too many years people will look back
and try to remember what filing cabinets looked like. Or in
a museum see "IN and OUT" boxes. Think, less than
10 years ago only a very few knew what voicemail was. Less
than 5 years ago who would of thought of sending grandma an
email. Get your feet wet, come join the revolution
Scan
it!
Return
to top
|