Pitfalls of
being Paperless
Or
What you needed to know before you buy!
Purchasing a document image management (DIM) system
has many potential pitfalls that one can fall into without realizing it until
its too late. Hopefully this will help you avoid these costly pitfalls. Here
are some of the areas we will briefly cover:
- How to determine the size of the system needed
and what to be concerned with in the area of expandability.
- What to look for in the software and what things
about software to avoid. Not only the DIM software, but the other necessary
software running in the background, such as the database, backup, storage
management, CD authoring, etc.
- A gauge as to your requirements in the capacity
of the various computers and peripheral items, such as scanners.
- Hidden costs that show up after the initial
investment.
- General things to know document image management
(DIM) before you buy.
How to determine the size for both hardware
and software
In order to determine size you will need to know how many pages you plan to
store; what it will take in equipment to keep current on a daily basis; and
how to estimate your future needs.
- Here is how to calculate the number of pages
to be stored.
- Count the number of file drawers you are
using. A full file drawer holds about 3,500 pages. So if you have 10 four-drawer
filing cabinets, that would be 40 file drawers at 3,500 would equal 140,000
pages.
- Arbitrarily, select out 100 pages and count
how many are double sided. Since you will be storing images remember to
increase your counts by this percentage of double-sided pages when calculating
storage requirements. Let's say that 15 of the 100 pages were double sided,
that is 15%. So of the 140,000 pages in the above example would be increased
to 161,000 pages.
- Next, count the number of storage boxes
transferred to remote storage. If you know how many are transferred each
year this will help you in determining how many will be added each year.
Say each year you send 22 boxes to storage that would be 77,000 pages.
Then increase this count by the 15% for double-sided pages giving 88,500.
Then by dividing by the 240 working days in a year you have 369 pages
per day to scan.
- A double check might be to have your mailroom
count all the incoming mail for a week, then multiply that count by 52
weeks.
- Of course you can use our Sizer Program
to calculate needs OR do it yourself using the following:
- Total file drawers times 3,500 equals
"history" pages
- New file drawers (the number added each
year) times 3,500 equals "current" year pages times 3 equals
more "history" pages. To allow for future needs triple (at
a minimum) your current yearly pages.
- Take all the "history" pages
and divide by 25,000 (pages per GB) and this gives an estimate of the
gigabytes (GB) of storage required.
- Next divide "current" year
pages by 240 workdays to determine pages per day to be scanned.
- CFC Standalone versus Network.
- When it comes to determining required capacity
there are three areas:
- The number of concurrent users.
- The pages to be scanned daily.
- The pages (images) to be stored for
a three-year period. Now you can see why the need for all those questions
and page counting.
- If your firm can get by with only ONE person
having access to the files at a time, then a "CFC Standalone" system
is the least costly.
- But, if there is a need for more than one
person to have access to the files, then a "Server" system is
required.
Software considerations
- The images MUST be stored in a standard
industry format. Beware, some firms use a proprietary image formats. I.e.
they take a standard TIFF and embed controls or links that force you to use
their viewer.
- Any database software that is not OPEN architecture
also locks you to that vendor
and if something happens to them
well.
Know what database is used with the DIM software. You want a SQL type of database
and even then, beware of databases that constantly need to be reorganized.
When these types of databases become corrupt, data is normally lost.
- Make sure that the backup software also has
a backup "agent" for the database. This will allow the database,
while active, to have a snapshot for the backup tape.
- The software needs to be "obviously"
easy to use. Even the training of the administrator should not require more
than 1 day. Software that requires "days" of training may be too
complicated for firms with less than 1,000 employees.
- Remember, software not bundled with the hardware
requires a competent staff to install: the DIM software, the database, the
storage management, and the backup software as well as configuring the operating
system on the server. Many extremely qualified MIS people have never handled
some of these areas.
Hidden Costs or overlooked costs
- Software worth having has an annual update
charge. The developer needs this revenue in order to keep the software current
with the constantly changing world of computers. Beware of firms that do not
charge, as they will have to make up the difference in other ways. However,
some firms make it a practice not to inform the buyer of this annual charge
before the sale. Most software bill for the annual update after 30 - 90 days
from the initial sale.
- Support is a separate matter. Most firms will
not provide support unless the customer has installed current upgrades. Support
can be paid for on an annual basis or on a call-by-call basis. Make sure you
know the costs of support before buying.
- Maintenance of scanners and computers. Normally,
workstations and scanners can be down for a day or two, while servers need
rapid response. Most on-site contracts need to be initiated at time of purchase
so make sure your server and scanners are covered when installed.
- Scanners have a number of moving parts; some
will wear out (i.e. feeder pads and rollers and lamps.), make sure you have
coverage of these items.
- Always anticipate that your needs will grow
from the current requirements. Make sure the software can grow to meet those
future needs.
- Make sure the software allows for other interfacing,
either by the developer or by other software programmers. If not now, in the
future you may want to interface other software with you DIM system and not
be able to.
General things to know before buying
- A common question is "We already have
a server why do we need another?" Most data servers do a number of tasks
and the amount of data selected at any one time rarely exceeds 4K, whereas
image servers deal in images ranging in size from 25K to 250K and many times
ten to fifty images are being processed, so you see imaging processing on
an existing server would SLOW it down.
- If a "Server" system is required
then the question is how big a server? How many concurrent users are needed?
Will they ALL be locally attached? The choices are between a "Client-Server"
and a "Terminal Server". On a "Client-Server" only those
workstations with the software loaded can run the software, versus the "Terminal
Server" where all who can logon can use the system. Here is a simple
chart on how to estimate the server capacity.
| Servers |
Viewers |
Terminal Server |
| Regular |
5 - 10 Users |
5 Users |
| Medium |
15 - 25 Users |
15 Users |
| Heavy |
25 - 50 Users |
25 Users |
| Heavy extra memory |
100+ Users |
50 Users |
Regular - single CPU, 256MB, 2-6 36GB
HDD
Medium - single CPU, 512MB, 2-6 36GB HDD
All servers should have RAID controllers.
- You can store an average of 25,000 normal office
documents per gigabyte (GB). Don't buy more disks than you expect to need
over the next three years because three years from now the cost per GB will
be much less. However, do allow for the possibility that your storage may
increase more rapidly than you currently anticipate.
- Selecting which scanner or scanners involves
considering several variables:
- How many pages need to be scanned daily.
- How many indexes need to be key entered
per document.
- On average how many pages make up a document.
- How often will the flatbed be required?
(Too thin or thick for the Automatic Document Feed "ADF")
- What is the maximum size of the page to
be scanned?
- Is color required?
- What is the percentage of double-sided
pages? Rule of thumb over 15% requires a duplex scanner.
- Can all the documents easily be brought
into one location? In other words, do you need a scanner in more than
one location?
- Quality and condition of documents to scan.
I.e. if these are dark copies of an original then special hardware and
software connected to the scanner may be required to produce readable
images.
- There are several methods of setting up scanning
operations.
- The operator job may be to remove staples
and clips and then scan. This is normally referred to as pre-scan prep
work.
- In some offices the pre and post scan work
is assigned to another than the operator, thus allowing the scan operator
to concentrate on the scanning. Post scan work is the assembling of the
pages after the scan, if they are going to be saved instead of destroyed.
- Then consider how many indexes per document
will be required. If for example, the account number is required (and
the rest imported from other databases - discussed later) or if the account
number, date, name, phone number, etc. need to be key entered from each
document, then the scanning throughput is greatly reduced.
- Another option used by some offices is
to scan into a "batch" area and then later index and transfer
the document to an appropriate folder.
- The speed of the scanner is not the most important
factor in selecting a scanner. Actually the factors such as size of document,
color or two-sided pages can force the choice of scanners. The following is
a "guesstimate" of scanner throughput by a single operator doing
minimal pre and post scan work.
|
------------
|
Indexing |
------------ |
| |
Minimum
|
Average
|
Heavy
|
| Keystrokes |
15
|
30
|
60
|
| Pages per Document |
10
|
10
|
10
|
| Daily (6 hr) Throughput Using |
|
|
|
|
15 ppm
|
1,888
|
1,588
|
1,350
|
|
20 ppm
|
2,069
|
1,714
|
1,440
|
|
27 ppm
|
2,236
|
1,827
|
1,519
|
|
29 ppm
|
2,272
|
1,851
|
1,535
|
|
50 ppm
|
2,500
|
2,000
|
1,636
|
These calculations presume that the operator
is very diligent in their work. We know of a customer with a 50-ppm scanner
and only require 16 keystrokes per document and they are pleased to get
1,900 pages per day. With the exception of the pre and post scan there are
other ways to increase the scanner input.
- There are several methods of automating the
filing and filling of the cross-index fields.
- The placing of the document in the DIM
folder and the cross indexing of the document in some cases can be automated.
- By having barcode labels on documents
and letting the DIM system read and automatically create a folder,
name the document and save the image. This type of operation will
place individual pages or the page with the barcode and all the pages
that follow until the next bar coded page.
- Zone OCR (Optical Character Recognition)
of a form is one method. When there is sufficient quantity of the
same form to justify the cost of setting up and using this type of
software. This software recognizes the form and knows to OCR a specific
area on the form for say the account number. Using the account number
the software automatically locates the proper folder and names the
document and places the form and all the pages with that form.
- Another way to increase the input into
the system is to enter the very minimum amount of data at the time of
scanning, then at the end of the day have the computer select all the
new entries, connect to another database (such as customer or accounting)
and retrieve the other information to be used in the cross reference fields.
I.e. an insurance company might only enter the policy number and then,
that night, retrieve the policyholder name, agent, effective date, type
of coverage, etc.
- Some systems even allow for instant online
verification of data as well as the populating of the cross-reference
fields by linking the databases.
- Workflow is the current hot button in imaging.
Workflow developers give examples of firms saving up to 70% of the clerical
costs. This has excited the imaginations of cost conscious managers. Please
DO NOT plan to install a workflow engine at the same time you install a DIM.
People need to become comfortable dealing with images on their workstation.
They need to build confidence in the storage and retrieval of document before
adding additional software. When everyone feels comfortable then consider
Workflow. It can do wonders, but requires a thoroughly thought out implementation.
Workflow in effect puts your business practices (for handling documents) in
the hands of the workflow engine.
This gives you some of the basics
that need to be considered before installing a document image management system.
Our dealers are specially trained to assist you in analyzing your needs. Some
of the reasons as to why we only sell the software and hardware bundled should
be obvious. One that is not obvious is that we wanted our dealers to spend their
time helping you focus on getting the job done and on the behind the scenes
mechanics. Repeatedly our customers are amazed
the system is delivered
and the next day they are using it. |