Optical Character Recognition

Unlock the data in your scans! Creekside Digital can run Optical Character Recognition to extract text from your raw images. While the typical output of the OCR process is a searchable PDF file with an invisible layer of text under the original document image, the text content can also be exported as separate “sidecar” files in many other formats including plain text, Excel, and XML. Creekside Digital has OCR’d millions of documents for our customers, and we have the experience and capacity to handle jobs of any size.

Indexing and Data Capture

By default, scans are sequentially named as they come off the scanner (e.g., “0000007.jpg,” “0000008.jpg,” etc.). This preserves the sequence in which each document image appeared in the source book or film, which is important information by itself. However, quite often the images we create from source documents need to be indexed or named according to certain fields or data they contain, or using some predefined naming convention. Creekside Digital can perform this service for you to allow easy, rapid identification and retrieval of specific records within a group of documents.

Another common service we provide is the identification and transcription of specific fields within a given document image, such as handwritten entries on a form. This is called Data Capture. Depending on your specific requirements, Creekside Digital can manually enter these values into a spreadsheet and/or use some or all of them to name each image.
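
As a rough illustration of how captured values can drive image naming, here is a minimal Python sketch. It is not Creekside Digital's actual tooling. The field names (`last_name`, `first_name`), the `scan` key, and the convention of keeping the original sequence number as a suffix are all assumptions made for the example.

```python
import os
import re


def safe_part(value):
    """Reduce a captured field value to characters that are safe in a file name."""
    cleaned = re.sub(r"[^A-Za-z0-9]+", "_", value.strip())
    return cleaned.strip("_")


def build_rename_plan(rows, fields):
    """Map each sequential scan name to a name built from captured fields.

    The original sequence number is kept as a suffix (an assumed convention)
    so the page order from the source book or film is not lost.
    """
    plan = {}
    for row in rows:
        stem, ext = os.path.splitext(row["scan"])
        parts = [safe_part(row[field]) for field in fields]
        plan[row["scan"]] = "_".join(parts + [stem]) + ext
    return plan


# Hypothetical data-capture rows, as they might appear in a spreadsheet export.
rows = [
    {"scan": "0000007.jpg", "last_name": "O'Neil", "first_name": "Mary"},
    {"scan": "0000008.jpg", "last_name": "Smith", "first_name": "John A."},
]
plan = build_rename_plan(rows, ["last_name", "first_name"])
```

The function only builds a mapping from old names to new names, so the plan can be reviewed before any file is actually renamed.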

Multipage File Assembly

Often, more than one document image makes up the original record. For example, a student record file might consist of six separate pages which appear sequentially on a roll of source microfilm. Creekside Digital can create a single multipage TIFF or PDF file which contains all six document images belonging to that particular student’s record. Typically, such multipage files are named according to some unique characteristic of the record (e.g., student name or Social Security Number). This application is common with personnel records and student transcripts.
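
The grouping step described above can be sketched in a few lines of Python. This is an illustration only: it assumes each page has already been indexed with a record key (a hypothetical `student_id` here) and that pages belonging to one record appear consecutively, as they would on a roll of film. Writing the multipage TIFF or PDF itself is left out.

```python
from itertools import groupby


def group_pages_by_record(indexed_pages):
    """Group consecutive (record_key, scan_name) pairs into one page list per record."""
    return [
        (key, [scan for _, scan in group])
        for key, group in groupby(indexed_pages, key=lambda pair: pair[0])
    ]


# Hypothetical index: (student_id, scan file) in film order.
indexed_pages = [
    ("S1001", "0000001.tif"),
    ("S1001", "0000002.tif"),
    ("S1002", "0000003.tif"),
]
records = group_pages_by_record(indexed_pages)
```

Each entry in `records` pairs a record key with the ordered pages that would go into that record's multipage file.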

Creekside Digital can also create multipage “reader” files for 35mm newspaper microfilm conversions. These allow patrons and staff to easily scroll through multiple pages from a given roll of digitized newspapers. Because each project is unique, we adjust the number of pages in each multipage “chunk” to find a size that opens quickly and easily.
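
To make the idea of page “chunks” concrete, here is a minimal Python sketch that splits an ordered list of page images into fixed-size groups. The chunk size of 3 and the file names are placeholders chosen for the example, not recommended values. In practice the size would be tuned per project.

```python
def chunk_pages(pages, pages_per_chunk):
    """Split an ordered list of page images into consecutive chunks.

    The last chunk may be shorter than the others.
    """
    if pages_per_chunk < 1:
        raise ValueError("pages_per_chunk must be at least 1")
    return [
        pages[start:start + pages_per_chunk]
        for start in range(0, len(pages), pages_per_chunk)
    ]


# Hypothetical roll of seven page images, split into chunks of three.
pages = ["%07d.jpg" % n for n in range(1, 8)]
chunks = chunk_pages(pages, 3)
```

Each resulting chunk would then become one multipage reader file.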
