Over 90 percent of the information in the world is still on paper. Many of those paper documents include color graphics and/or photographs that represent significant invested value, and almost none of that rich content is on the Internet.
 
That's because scanning such documents and getting them onto a website has been problematic at best. At the high resolution necessary to ensure the readability of the text and to preserve the quality of the images, file sizes become far too bulky for acceptable download speed. Reducing resolution to achieve satisfactory download speed means forfeiting quality and legibility. Conventional web formats such as JPEG, GIF, and PNG produce prohibitively large image files at decent resolution. DjVu is a new Image Compression Technology that enables small file sizes for high quality image files.

There are two types of scanning - bitonal or grey scale. It is fairly important for any publication to use one of these methods for consistency, although it is possible to scan a book that has the occasional photograph or lithographic illustration as bitonal with interspersed grey scale scans.

Either type can be used with optical character recognition (OCR) if required. (Subject to a satisfactory original document or book).

Creation of PDF files (for Adobe Acrobat)

It is possible to convert the TIFF image files to PDF format for use with Adobe Acrobat. This includes:

- Creation of page numbered bookmarks for the whole document
- Supply of the PDF file(s) on CD

 

Specification

Details

Bitonal scanning
This is simple black and white and is ideal for most printed books, including those with illustrations that are line engravings.

An exception might be an old printed book that is very inconsistent in print density, i.e. a mixture of grey and black.

The end result graphics files are much smaller than grey scale scans, and therefore more images can be stored on a CD either as graphics files or PDF files.
Grey scale scanning
Grey scale scanning is ideal for documents or books that contain:
- hand written pages
-photographs
- lithographs

Hand written documents can sometimes be scanned as bitonal, but the ink density needs to be very good, and very consistent.

Images scanned in grey scale can result in very large file sizes, and therefore fewer images can be stored on a single CD. Typically about 500 pages, either as TIFF files or PDF files fit on a CD.
Paper Size A2 / A3 / A4
Dpi Resolution 75 / 150 / 200 / 300 / 600 / 1200
Scanning Format Tiff (CCITT G4) Black & White format
  PDF Image
  DjVu

E-book Publishing: Cost effective and environment friendly content delivery

E-book publishing is a cost effective method of digital content delivery. We have wide experience in publishing e-books in a number of formats, and targeted towards various delivery platforms.

We create PDF, REB, OEB and LIT formats destined for computers and hand held devices. We have successfully delivered large e-book creation projects to numerous international clients.

Methodology : Text is extracted from either Hard Copy / TIFF / PDF / page layout formats such as Quark, Pagemaker etc. This is initially converted to XML and then further processed to the Open E-book Standard (OEB 1.0) and from there on to different e-book formats as specified by the client.

We can deliver fully interactive, multimedia enabled e-books in any format that you desire.

Acrobat PDF Conversion

We use most of the Adobe suite of products for PDF creation, selecting tools depending upon output requirements.

This is a list of the most common PDF outputs that we deliver:

 
PDF Normal
(Fully Text searchable with Image and Graphics)
 
PDF Image+Text
(OCR text embedded over original Image)
Clients who do not have big budgets for conversion generally use PDF Image+Text. The output will have full text search though accuracy will be input image dependent. No parsing on converted text is generally done for PDF Image+Text outputs. However we can also deliver parsed (99.995% accurate) text embedded on an image.
 

Full color PDF
(Exact replica of printed books)
E-Book publishers prefer these outputs.

 
PDF Forms
(Blank PDF forms)
Widely used on the internet to collect, submit and transfer information.

In addition to Adobe tools, we also use other color-supporting applications to create high quality color pdf output files.

Forms Processing

We offers high volume forms processing solutions. We employs a combination of the world's best ICR / OMR applications with internally developed Image View applications to offer you competitive pricing and rapid turnaround times. Some of the kinds of forms we have handled are:

 
Hospital Records Indexing
 
Medical Claims Processing
 

Insurance Application Docket Processing

 
Insurance Claims Processing
 

Educational Institutions – Admission Forms and Pre admission process

 
Education Boards – Pre and Post Examination process
 

Company Personnel Records Digitization

 
Company Forms Digitization

Word Processing / Text Conversion

Scope and Objective

Many industries today generate documents in various proprietary formats that cannot be distributed widely due to the requirement of the application, or even a specific version of the application on the target platform. Word Processing enables the conversion of the data in these proprietary formats to universally accepted delivery formats such as RTF and DOC.
 
The finished documents are cross-platform, require freely downloadable viewers and have compact file sizes. They are also printable with no loss in image or data quality.
 
Methodology:

With the help of the latest OCR software, we achieve the highest possible accuracy. This is followed by multiple manual proofreading processes the finished files are created in .doc or .rtf format.
 
The following is a list of formats that we are proficient in converting:
 
Hard Copy (Paper) / Scanned Images of Hard Copy documents:

Paper documents are scanned and converted to an image format which can then be indexed using a search tool or be OCR (Optical Character Recognition) to create editable searchable text documents. eg. TIFF, GIF, PDF, JPEG, and DjVu.

Tagging & Markup

These services are required where highly complicated unstructured data in various disoriented formats need to be structured in a cohesive manner for immediate access at any given point of time through the Internet / Intranet.

SGML (Standard Generalized Markup Language)

The SGML language allows to describe the hierarchical logical structure of a class of textual documents written in a DTD (Document Type Definition). We can create a book DTD or a journal article DTD. HTML (Hypertext Markup Language) is the DTD for the structure of Web pages.

SGML is currently used for data creation, for data exchange, for data storage, for indexing, searching and retrieving, for printing out and for data viewing.