Optical Character Recognition

Unlock the data in your scans! Creekside Digital can run Optical Character Recognition to extract text from your raw images. While the typical output of the OCR process is a searchable PDF file with an invisible layer of text under the original document image, the text content can also be exported as separate “sidecar” files in many other formats including plain text, Excel, and XML. Creekside Digital has OCR’d millions of documents for our customers, and we have the experience and capacity to handle jobs of any size.

Indexing and Data Capture

By default, scans are sequentially named as they come off the scanner (e.g., “0000007.jpg,” “0000008.jpg,” etc. This preserves the sequence in which each document image appeared in the source book or film, which is important information by itself. However, quite often the images we create from source documents need to be indexed or named according to certain fields or data they contain, or using some predefined naming convention. Creekside Digital can perform this service for you to allow easy, rapid identification and retrieval of specific records within a group of documents.

Another common service we provide is the identification and transcription of specific fields within a given document image, such as handwritten entries on a form. This is called Data Capture. Depending on your specific requirements, Creekside Digital can manually enter these values into a spreadsheet and / or use some or all of them to name each image.

Multipage File Assembly

Often, more than one document image makes up the original record. For example, a student record file might consist of six separate pages which appear sequentially on a roll of source microfilm. Creekside Digital can create a single multipage TIFF or PDF file which contains all six document images belonging to that particular student’s record. Typically, such multipage files are named according to some unique characteristic of the record (e.g., student name or Social Security Number). This application is common with personnel records and student transcripts.

Creekside Digital can also create multipage “reader” files for 35mm newspaper microfilm conversions. These allow patrons and staff to easily scroll through multiple pages from a given roll of digitized newspapers. We adjust the number of pages in each multipage “chunk” in order to find a size that opens quickly and easily, as each project is unique.

Latest on Facebook

2 days ago

Creekside Digital

2 million objects digitized! That's pretty cool.TWO MILLIONTH OBJECT DIGITIZED!!!

In June 2016 we let you know our office hit 1 million objects digitized. It took us a little over 3 years to hit that mark. Now, less than 2 years later, we’ve piled on another million and have just hit our TWO MILLIONTH object digitized! We're making progress but we’re still just getting started!

And what object had the honor of being our two-millionth? This beautiful orchid from our ongoing project to digitize Smithsonian Gardens living orchids collection.

This plant, the two-edged Laelia (Laelia anceps) from Mexico, Guatemala, and Honduras belongs to an unusually hardy family. Species have adapted to compete in crowded landscapes and can withstand harsh environments where few other plants can survive.

In early November, Mexicans celebrate the Day of the Dead to honor their dead relatives. Grave-sites are elaborately decorated with brightly colored and highly fragrant Laelia anceps orchids ranging in color from pale lavender to white.

In addition to objects digitized at Smithsonian Gardens, the road to this milestone has also included team members and objects from these museums:

- Smithsonian's National Museum of Natural History
- Cooper Hewitt
- Smithsonian's Freer and Sackler Galleries
- Smithsonian National Museum of African American History and Culture
- National Museum of American History
- National Air and Space Museum, Smithsonian Institution

... with many more to come! Thank you to all museums, object handlers and business partners that made this possible!
... See MoreSee Less

2 million objects digitized!  Thats pretty cool.

2 weeks ago

Creekside Digital

Filming Friday at Smithsonian Gardens with Smithsonian Enterprises. Stay tuned for a new video showing our digitization process of this special living orchid collection! ... See MoreSee Less

Get Started