Home: Literature: Imaging 101: Printer Friendly Version

Document Image Management

Imaging 101

In the Beginning
   Personal computers first emerged at the onset of the 1980s, many thought they would quickly usher in the era of the paperless office. Currently over 300 million tons of paper per year is used. Offices are drowning in paperwork that sometimes even impedes their ability to compete. One potential solution: computerize the paper, which means to turn pages into computer image documents and then use computers to maintain the information contained therein.

Understanding Document Images
   Document images are the computer-file equivalent to paper documents. In other words, paper documents are scanned by computer scanners, these pictures (images) of pages is then captured in computer memory. These captured pictures (images) are then stored in electronic form as computer files which can be viewed on PC monitors, printed, archived electronically (put in long-term computer storage) or retrieved electronically, virtually instantaneously, via a number of computerized methods.
   Document images differ, for example, from word processed documents, in that image documents are essentially computerized images made up of patterns of tiny dots, whereas word-processed documents code all the numbers and alphabetic letters used in text, so the text can be edited. With image documents, at least initially, any text on a page is handled, as a picture of text, not coded text that can be easily edited.

Some of the potential benefits of image document processing include:

  • Automation of some of the endless flow of paperwork that passes through every office
  • Reduction of paper piles on people's desks (and associated problems)
  • A logical method of organizing the papers that everyone uses
  • Reduction of time spent transporting documents from one desk or office to another plus the time it takes for documents to move from point to point
  • Reduction of time spent looking for lost and archive documents.

   Using document image systems to create a totally paperless office is expensive and could very well be counter productive. There will always be some paper, but using document image systems to manage certain types of documents for specific applications can be highly rewarding.
   Document Image processing is not a panacea, however buyers must consider not only whether upfront and ongoing costs are justified, but also must make sure that the systems truly provide the benefits they claim to provide. After all, one doesn't replace one, or even twenty $500 file cabinets with a $10,000 image document processing system without good, solid reasoning. The real payoffs of document image processing are as follows:

  •   An existing workforce can handle a larger workload by automating certain aspects. Sometimes, where the burden of processing paper documents has become unmanageable and is having an adverse affect on business operation, image processing can bring the situation back under control.
  •   A reduced workforce can handle existing workloads (i.e., enabling the dreaded "downsizing").
  •   Business, cycles can be accelerated so that certain transactions are completed more quickly. In some cases, businesses can gain competitive advantages over their rivals by using image processing systems to speed customer service.
  •   Space can be saved because image document processing systems take up less room than any physical file cabinets they replace.
  •   Customer satisfaction can be greatly increased, as the documents necessary to respond to requests are made instantly available.

Elements of Document Image Systems
   Document image systems consist of a number of elements:

  •   Scanners to start the process of turning paper documents into electronic images
  •   Software to process and manage image files
  •   General?purpose PCs or specially?equipped image processing scan stations
  •   Monitors suitable for handling image documents
  •   Storage systems, ranging from hard disk drives to optical?disk drives and CD Jukeboxes
  •   Miscellaneous capabilities, such as duplexing (the ability to process both sides of scanned pages at once); scanning of color documents; book scanning mode (flatbed scanning) and the ability to link multiple scanners to a single system.

Computers
   Computers are required to run document image systems. Some systems may require centralized computers, called servers, where most actual document storage will be located, but all systems require end?user computers where documents are indexed, viewed, printed, faxed or otherwise processed.
   Originally computers were always bundled in with turnkey document image processing systems, because the integrating of various hardware and software was difficult. Now vendors of document image processing systems offer software only and leave the rest up to the MIS department of the organization. There is now a trend to revert back to the turnkey systems as organizations find this to be less expensive than have specially trained MIS staff and so that there is only one place to point to when problems occur.

Monitors
   No longer are special monitors required to deal with images. Monitors used to be a major issue for document image processing systems but now a regular 17" monitor is acceptable, however 19" or larger monitor is preferred only because it provides more area to view multiple images.

Storage Systems
   Storage systems consist of:

  •   Storage hardware, which may be comprised of hard drives, optical?disk drives, other types of computer storage or some hierarchical mix of all types of storage.
  •   Software that manages the storage of image files so they are retrievable and all files can be backed up.

   In the early days of image document processing systems, optical?disk drives were the premier choice for system storage because they alone offered very high capacity storage and random access for virtually instantaneous retrieval of stored documents.
   Today, hard?disk drives have caught up to optical disks in terms of capacity, plus they tend to provide even speedier random access to image files. Moreover, hard drives offer a much better price and performance. For these reasons, hard disk drives are now becoming widely used for document image processing.
   There is one excellent reason, however, to work with CD-ROM or optical drives that use write?once, read?many (WORM) technology and that reason is image document security. Because of the nature of the ways hard drives work…anything on them can be erased and replaced with new data in a split?second. WORM drives, in contrast, allow a file to be "written" on them only once, and then those files cannot be erased. They can be "read" unlimited times, which means they can be retrieved, viewed, printed and so on, but never natively edited or modified. Thus, when image documents must be archived and their "integrity" never compromised, WORM storage is the preferred solution.
   Whether choosing fixed or removable storage technology, it's critical that buyers make certain they can always allow for enough storage to meet their growing needs. Having an image document system that runs out of storage space is like having a car that runs out of gas…you have to stop until you get some more.
   There is a technology, which uses the best of both worlds. It is writing the files onto CD-ROM as a way of permanent record and then moving the files to a special area on the hard disk that is not backed up on tape. If for some reason the hard disk fails the images can easily be restored from the permanent CD's. Incidentally, most hard disk storage uses a RAID technique that allows one disk drive to fail without any loss of data. A new drive is inserted and the RAID system automatically loads up (from the other drives) what was missing on the drive that failed.

Printers
   Laser and Inkjet printers are used to print images. Be aware that even though you have a 20 page a minute laser printer, when you print images you will probably only get 4 to 5 pages per minute. This is because an average business page is made up of over 300,000 bits and these have to be transmitted to the printer. There are special boards installed on printers that receive the image in compressed mode (like on your hard disks) and explode them inside the printer allowing the printer to print at printer speed.

Software
   Imaging software provides a broad range of functionality supporting such processes as document entry and indexing, information storage and retrieval, business form creation, workflow routing and communications. Moreover, it's the software that provides the visual interface through which end users execute the all-important functions and system administrators manage overall system functions. Prospective buyers should understand the use of the software before buying. Remember, the reason for archiving a document is so that you will be able to retrieve it when you want. There are four basic ways documents are located:

    1. By looking in the folder.
    2. By searching for the document by name.
    3. By searching on a field with an indexing system to find the document.
    4. By searching for a word or words in the document. This can only be done if the images have previously been OCR'd.

Here are areas of software that need to be considered:

Ease of Use
   The software should be so easy to use; it should require very little training for the average user. Software that requires extensive training for the casual user, can take away from many of the benefits of the imaging system. Consider how easily and intuitively the software lets users retrieve image document files. For example, do people have to work their way through numerous layers of drop?down menus and then work their way answering crowded dialog boxes?

  •   How many retrieval methods are available?
  •   Can users leaf quickly through documents in a file folder or just see a list of file names?
  •   Can they search for indexed key words (which support quick searching)?
  •   Can they initiate full-text key-word searches?

Filing
   Many systems use the same terminology as with manual filing cabinets (i.e. filing cabinets, folders, documents). It should be easy to create Folders and to insert a Document into them. Documents may need to be cross-referenced, similar to index cards in a library (i.e. Invoice Number; Purchase Order; Policy Number; Effective Date; etc.).

OCR
   Optical Character Recognition software scans an image and converts the alphanumeric to computer?coded text. This text can then be stored in a search software database. Then the image software can perform full?text key?word searches of the text to find the document with the matching words.

Annotation Tools
   The system should not let users edit or change the original content of image documents, thus providing the confidence that these are the unchanged originals. Annotations might include highlighting, redacting (blacking out), stick'em notes, etc. For this, the software needs to provide on?screen annotation tools and it's critical that such tools be convenient and easy to use. Make sure the annotations do not affect the original image.

Workflow
   When the document image processing system is connected to a network, it is then possible to create computerized workflows for document images. Workflow is software that allows for the preprogrammed routing of documents to the workstations of the various workers. An example is when a worker needs an item purchased. They fill out a purchase request form. This form is sent to the supervisor for approval. It is then sent to accounting for budget approval. From there, it goes to purchasing where it is attached to a purchase order that is sent to the warehouse for receiving. When it is received the receiving documents are attached and it is sent to accounting. It is finally paid and all the documents are attached to a copy of the voucher check. In such a situation, workflow would automatically route the document without the need of physically moving paper. BIG saving. Some companies have reported clerical savings of 70% because of the efficiencies and reduction of mistakes.

Network Communications
   Many document image systems operate on a CFC Standalone basis. For example, when workers need to retrieve archived documents, calls are placed to document image system operators who retrieve the files, print a copy and deliver them to requesting parties. On the other hand, it increasingly makes sense for document image processing systems to connect to LANs and corporate networks so managers, users and other authorized parties can access them. This is especially true where document image systems are used, not just to put files into deep archives, but to move them through a processing cycle, as with the workflow systems described above. It is important for the buyer to make sure that all the filing structures, cross references and filing systems are the same for the CFC Standalone and the network version of the document image software, if they plan to start with the CFC Standalone and later migrate to the network.

What happens to the paper?
   When an organization first installs a document image system normally they are afraid to destroy the scanned paper. There is the fear that they will need it and worry that the computer will lose the scanned image. Since good systems have total backup, most fears are quickly relieved about a computer's loss of images. Soon the employees become comfortable in the retrieving of images and they no longer fear the loss of paper…because with a click, they can print it. Now, what about the pages that legally say you MUST keep the original. Even though the courts and Congress are eliminating more and more barriers of "legal requirements", there are some remaining where the "original" must be produced.
   Let's see if a live example can show a solution to the "original" requirement. An insurance company gets signed applications with eight or so supporting pages. They scan in the signed application and the supporting papers. They separate the signed application from the rest of the papers. The "rest of the papers" are destroyed at the end of the day when the backups are completed. The signed applications are then filed as a group by the date scanned. The scanning software automatically saves the scan-date with the image. If ever in the future (1 in 100,000) requires the signed original they need only check for the date scanned, go to that box and search through the stack for the paper. As you can see it is much easier to file by date scanned than putting into folders, particularly when the original very rarely is required.


Intangibles
   The best use of document image systems is to automate one or more aspects of the processing of specific sets of documents. Which documents these are may vary from business to business and department to department. They can be sales orders, purchasing, accounting, claims processing, support, etc. Often, these documents, once filed, will rarely be retrieved. However, when they are needed efficient retrieval can greatly increase the efficiency of an operation. Whatever the application the organization should move slowly into the various aspects of imaging, letting the employees become comfortable with this new medium. You will be surprised at the new ways to benefit your origination your people will discover.

Graduation
   Congratulations, you have completed Imaging 101. By now you should at least understand a little bit more as to a document image system is. When PC's first appeared people were heard to say, "You don't really believe there will be a PC on every desk…besides there is just enough room for the typewriter and you'll never replace that." We believe that in not too many years people will look back and try to remember what filing cabinets looked like. Or in a museum see "IN and OUT" boxes. Think, less than 10 years ago only a very few knew what voicemail was. Less than 5 years ago who would of thought of sending grandma an email. Get your feet wet, come join the revolution…Scan it!


Home: Literature: Imaging 101: Printer Friendly Version