We Are Moving!

Glen Arm, MD area circa WWI

Glen Arm, MD area circa WWI

Greetings and Happy New Year! 2010 was another great year for our company. Despite a bad recession, we managed to bring on new customers and continue delivering high-quality images to our ever-expanding customer base. Additionally, we announced a new microfilm scanning partnership with LYRASIS, the nation’s largest regional membership organization for libraries and information professionals.

We’ve also made significant investments in our growth and infrastructure, and plan on expanding our suite of services very soon. To that end, we are relocating our operations to Glen Arm, MD in the beautiful Long Green Valley Historic District, about 15 minutes from our current location in Towson.

Effective February 1, 2011, our new address will be:

Creekside Digital
5200 Glen Arm Road
Suite Q
Glen Arm, MD 21057

With triple the square footage of our current location, our main imaging studio will now be housed in a fireproof concrete vault which ironically once functioned as a corporate archives facility. This will allow us to stay focused on our mission of providing the highest quality images available anywhere, at any price.

Please contact us should you be planning to send materials to us before February 1.

Also, if you haven’t already — please visit our Facebook Page. It contains up-to-the-minute happenings around the shop, news and announcements, and links to related articles which may be of interest to our customers, partners, and friends. It’s updated a lot more frequently than this blog, so you may want to “Like It” to keep tabs on us.

Have a great 2011!

– Jim and the team at Creekside Digital

Read More

LYRASIS Partnership Expanded to Include Books on Microfilm, Segmentation Options

Indianapolis Recorder, February 7, 1931 (used with permission -- click for searchable PDF)

Indianapolis Recorder, February 7, 1931 (used with permission -- click for searchable PDF)

We are pleased to announce that Creekside Digital’s partnership with LYRASIS, the nation’s largest regional membership organization for libraries, has been expanded in two key ways:

  • Books on microfilm are now eligible for the Sloan-subsidized pricing under the LYRASIS Mass Digitization Collaborative; and
  • Digitized newspapers may now be optionally segmented (organized) by year, month, or date (issue).

Specifically regarding books on microfilm: all 2-up frames are split into two separate output pages. Page images are cropped to the page edge and deskewed. Additionally, in cases where it would improve image quality, Creekside Digital will apply geometric curve correction to remove any page curvature present on the imaged pages, which usually results in improved OCR accuracy. As with our LYRASIS newspaper conversions, all books-on-microfilm projects yield four files per output page: an uncompressed archival TIFF master, a JPEG2000 derivative, a downscaled searchable PDF reader, and a plain text file of the OCR engine’s raw output. These services are all included in the Sloan-subsidized pricing for digitization of books on microfilm that’s available to all LYRASIS members for one low per-page price. Yes — it’s quite a deal.

In the last month, we have digitized student newspapers on microfilm for Clarion University (PA) and Presbyterian College (SC). We are currently wrapping up the digitization of 39 rolls of microfilm of the Duquesne Duke student weekly newspaper for Duquesne University (PA). All of these projects were digitized to Library of Congress’ NDNP imaging specs, and the Duquesne project is the first under our agreement with LYRASIS to feature issue segmentation, with the reader PDFs from each issue being combined into a single multipage file named according to the issue’s date.

John Coffer, modern wet-plate collodion photographer

John Coffer, modern wet-plate collodion photographer

As always, for questions regarding pricing, logistics, etc., please contact Laurie Gemmill, Mass Digitization Program Manager for LYRASIS, at 800-233-3401 x2908, or email her at laurie.gemmill@lyrasis.org.

Also, for an interesting distraction, take a look at John Coffer’s website. John is a decidely retro photographer in the Finger Lakes area of New York. He lives without electricity, Internet access, or even running water, and has traveled across America in a horse-drawn darkroom / wagon performing his wet-plate collodion photography. Currently, he’s hosting three-day workshops on his farm (teepees included!) where students from all over the world camp out and receive hands-on training to help keep the “lost art” of wet-plate collodion photography alive.

Read More

Just how many books are there? How about 129,864,880!

Yesterday, Google posted on the official Inside Google Books blog that, by their calculation, there are approximately that many books out there. From our perspective, wow, that’s a lot of work! Capturing books at the highest quality possible, as well as scanning microfilm and similar items such as manuscripts, ledgers, scrapbooks, et al, ad naseum, is exactly what we do. In between the larger projects going in and out of Creekside Digital, we work on a lot of smaller orders. For example, in the few days alone, we’ve scanned an oversized scrapbook of letters to the founder of a national banking organization, the first in a series of hopefully many more romance novels, and a small collection of works from an historical foundation in Texas. Our i2S CopiBook HD 600 does a superb job with just about anything we can throw at it. These works are being digitized for conversion into ebook format, or being republished in searchable, high-quality digital format, complete with full-color pictures where applicable. So if you’ve been thinking about converting your books to digital, give us a call . . . we’ll help you move your part of the 130 million from paper to pixels.

One other topic of discussion, especially with school right around the corner: text books. Does Creekside Digital scan text books? Absolutely. However, we are *not* in the business of facilitating copyright infringement, nor do we wish to engage textbook publishers in legal battles, so we require that all textbook orders are accompanied by a signed affadavit stating that either a) you own the rights to the books you’re having scanned, or b) you are having the book scanned for your own personal use and study, you agree not to distribute electronic versions of the digitized book to others, even for free, and furthermore, you agree to pay for our legal defense should we be sued. See Page 2 of our Microfilm Small Order Form for a general idea of what to expect. Having said that, yes, searchable digital versions of textbooks are superior to their heavy, physical counterparts in just about every way, so please contact us for information!

Read More

LYRASIS Partners with Creekside Digital to Offer Expanded Microfilm Scanning Services

LYRASIS logo with tagline
Today, LYRASIS, the United States’ largest regional membership organization serving libraries and information professionals, announced that Creekside Digital is now providing preservation-quality microfilm scanning services to LYRASIS members through the Mass Digitization Collaborative. Here is the text of the press release (official press release on the LYRASIS site here):

LYRASIS Partners with Creekside Digital to Offer Expanded Microfilm Scanning Services

Atlanta, GA, — July 15, 2010 — LYRASIS partners with Creekside Digital, specialists in document digitization, to address members’ needs for expanded microfilm scanning services.

LYRASIS’ Mass Digitization Collaborative enables members to digitize books, serials, and newspapers for a low fee. Initially, the focus of the Collaborative was providing book scanning options. Now, in response to increased requests for scanning newspapers and books on microfilm, LYRASIS is broadening its focus and digital partnerships to afford members extended microfilm scanning services, in addition to optional encoding services. “We are thrilled by the opportunity to offer our preservation-quality microfilm scanning services to LYRASIS members through the Mass Digitization Collaborative,” said Jim Studnicki, President, Creekside Digital.

“While we will continue to leverage the expertise of The Internet Archive for book scanning, our partnership with Creekside Digital enables us to provide affordable, high-quality microfilm scanning and related text encoding services that follow relevant standards, such as the National Digital Newspaper Program guidelines,” commented Robin Dale, Director of Digital & Preservation Services.

LYRASIS member, Indiana University – Purdue University Indianapolis (IUPUI) participated in a pilot project to ensure member satisfaction with Creekside Digital’s conversion services. “We are working with Creekside Digital to digitize a collection of microfilmed Indianapolis newspapers. The project is not yet complete, but thus far their services have far exceeded our expectations, when it comes to communication and quality of work. We have received samples of the microfilm and we are satisfied with the results. Creekside Digital has gone above and beyond to address our questions about microfilming and the digitization process,” said Jenny Johnson, Digital Initiatives Project Coordinator, IUPUI University Library.

About Creekside Digital

Founded in 2006, Creekside Digital specializes in providing archival-quality digitization of microfilm as well as bound and oversized books, newspapers, and documents and manuscripts. For more information, please visit www.creeksidedigital.com.

About LYRASIS

Created in April 2009 by the merger of PALINET and SOLINET and joined shortly thereafter by NELINET, LYRASIS is the nation’s largest regional membership organization serving libraries and information professionals – providing opportunities for networking and collaboration, offering innovative solutions, and significant cost savings through group purchasing for products and services. For more information, please visit www.lyrasis.org.

# # #

Creekside’s Take on the Announcement

This partnership represents a fabulous opportunity for LYRASIS members (and for those institutions contemplating joining LYRASIS or renewing their membership) to get truly top-drawer, archival microfilm digitization at a very reasonable price. How is this possible? For those who don’t know, the LYRASIS Mass Digitization Collaborative is a program made possible by a $1 million grant from the Alfred P. Sloan Foundation. This funding is used to subsidize the cost of the digitization for member institutions. There are essentially two requirements to take advantage of the subsidized pricing:

  • An institution must be a LYRASIS member; and
  • All images digitized via the Mass Digitization Collaboration must be made freely available online, either through the Internet Archive or a similar website.

Various options are available for LYRASIS members, up to and including full National Digital Newspaper Program-specification scanning including METS / ALTO structural XML metadata. At a minimum, all projects digitized by Creekside Digital through the Mass Digitzation Collaborative will include:

  • 300-400dpi true optical resolution 8-bit uncompressed TIFF Digital Master file
  • JPEG2000 derivative
  • Searchable PDF derivative (“reader” files downscaled to 150dpi at Medium quality)
  • “Sidecar” plain text versions of each image’s OCR data (required for loading images into searchable databases or content management systems such as CONTENTdm).

For questions regarding pricing, project procedures, LYRASIS membership, the Mass Digitization Collaborative, etc., please contact Laurie Gemmill, Mass Digitization Program Manager for LYRASIS, at 800-233-3401 x2908, or email her at laurie.gemmill@lyrasis.org.

For questions of a more technical nature regarding the microfilm digitization process, please contact Creekside Digital.

Read More

Santa Paula Historical Society Case Study

Background

Organized in 1977, the Santa Paula Historical Society of Santa Paula, California is dedicated to preserving, protecting and perpetuating the history of the town and the surrounding area. Its founding eleven Directors were descendants of pioneers or long-time residents, and many of them continue to serve the Society today.

When the Santa Paula Historical Society decided to convert its microfilm collection of local newspapers to a searchable digital format, President Mary Alice Orcutt Henderson contacted Creekside Digital. Prior to committing to a project, SPHS sent Creekside its oldest and most difficult to read roll of microfilm in the collection, and Creekside provided free samples in order to demonstrate both the quality of the microfilm scans and the accuracy of the Optical Character Recognition.

Having experienced our library’s microfilm machine’s extremely poor quality and clarity in a printed page, my skepticism continued until receiving the free samples from Creekside Digital. I was astounded! On the screen the images were easily enlarged for readability; the printed page was equally easily read. More importantly, the whole process (of converting SPHS’s microfilm) was facilitated. I was sold!” – Mary Alice Orcutt Henderson, President of SPHS

The Process

Creekside Digital uses high speed rollfilm scanners which utilize latest-generation technology including fiber-optic lighting, Nikkor lenses, CCD cameras and cutting-edge microprocessor designs in order to provide previously unachievable image quality. Combined with “ribbon scanning” – a process by which a roll of microfilm is scanned in its entirety, with frame detection, quality control and image file output occurring after the initial film capture – the potential for common scanning problems including improperly cropped images and skipped frames is eliminated.

At A Glance

  • 240 rolls of 35mm newspaper microfilm spanning 110 years of history
  • Digitized at 300dpi / 256‐color grayscale
  • Optical Character Recognition output to multipage PDF files
  • Fulltext Index creation allows for searching across multiple years of content

Once SPHS’s microfilm rolls were scanned, text was extracted from the raw images using Optical Character Recognition. OCR creates an invisible layer of digital text behind the image of the original newspaper page which may be copied, pasted, and searched like any other digital text. Multipage PDF files (one page for each frame on the original microfilm) were then output from the OCR creation process. Finally, a Fulltext Index was created for each roll of film to allow patrons of the Santa Paula Historical Society to quickly search the newspapers for surnames and place-names specific to their region – a boon to historians and genealogists alike.

Results

To finance our project, the Society and the (Santa Paula) Library jointly shared the expenses. Now both organizations have a full set of the scanned newspapers to offer to the interested public. Yet, better than the product is the willingness of Creekside Digital to assist with any questions – no matter how simple or complex. All in all, it was money well spent for both our community library and the nonprofit historical society.” – Mary Alice Orcutt Henderson

Are you ready to convert your microfilm to digital images and data? Contact Creekside Digital today!

Read More

Mixed Bag

Greetings!

The Baltimore Metro Area hit a new record temperature for today’s date with a high of 105 degrees. Yes — it was miserable, and we’re all hiding in the air conditioning. No break in sight, either. But just five months ago, we literally had five feet of snow piled up in the parking lot!

As I previously mentioned, we’ve had many smaller projects going in and out of the shop. As an example, this morning we scanned a 46 year-old yearbook to searchable PDF for an individual customer in New Jersey, and also processed a small historic microfilm project for a municipal customer in New Hampshire. Both jobs came out very well. The CopiBook HD 600 is *exceedingly* sharp edge-to-edge, and it really shows. Those of you considering doing work yourselves on a $99 Best Buy flatbed scanner versus sending it to us would be shocked at the difference — it is literally apples and oranges. I hope to have some hi-res samples available on the site soon to really demonstrate the machine’s capabilities. But if you want sharp, accurate images, you will be hard-pressed to beat our CopiBook HD 600.

At the ALA show, I saw the Austrian Treventus book scanning robot being marketed here in the US by Indus. Interesting concept — pros and cons to its approach. It uses two CCD sensors embedded in its “splitting wedge”and actually line-scans both sides of each page simultaneously. It probably does the best job of eliminating page curvature of any robot out there, but it’s capped at 300dpi true optical resolution, so it won’t work for true preservation-quality work. Its maximum page size is also limited, and the price tag is a bit high (as with all robots). But it seems this unit would be a monster for rapid digitization of general circulation stacks, for example.

Case studies will be up on this site soon — Santa Paula Historical Society, and John Rigby & Co., Inc., with others to follow.

– Jim

Read More

Creekside Digital’s new website is LIVE!

Yesterday, we “flipped the switch” on our new website. The old homegrown site had served us well throughout the first part of our company’s existence, but we have aggressively grown over the past year and required something which better described our new capabilities to current and potential customers. Kudos to Cloud Nine Labs for working with us to build exactly what we wanted.

We’ve had lots of different kinds of projects going in and out of the shop lately, the largest of which is a 137 roll newspaper conversion including NDNP-specification scanning. This means uncompressed 8-bit TIFF Master files between 300-400dpi (depending on the reduction ratio of the film), which gives us really huge images — 30 to 60 MB each. Then of course, there are several other derivatives, but none are nearly as large as the TIFFs. We conservatively estimate that the final project will consume approximately 5.5 TB of disk space.

Creekside Digital is attending ALA 2010 in DC!

Also, Creekside Digital will be attending the American Library Association 2010 Annual Conference this weekend in nearby Washington DC. We look forward to seeing old friends and making some new ones there!

Read More

A Busy Summer

Frontispiece from a recent project scanned at 400dpi (click for detail)

It’s been awhile since our last update to the blog. We’ve been heads-down with a lot of important digitization projects lately. About a month ago, we performed onsite scanning on a large job for a legal services company here in the Northeast. Currently, we have two large jobs in the shop. One is a 461 roll conversion of 16mm microfilm for a large utility customer in the Western US. This job includes Creekside Digital’s ultra high-quality film scanning, OCR, and PDF compression. At the same time, we’re running a 134 roll 35mm microfilmed newspaper project for a library system (also out west), which includes scanning, OCR, and creation of FullText indices to allow the library’s staff and patrons to keyword search all 134 rolls of film simultaneously. In between, of course, we’ve been running a variety of small jobs for folks all over the US and Canada: single rolls for individuals; orders which are a handful of rolls for engineering firms, hospitals, and churches; and plenty of fiche jobs for parts suppliers, school districts, and litigation service companies.

Very soon, microfiche orders will be accepted through our Small Order Form as well as rollfilm. That means that if you have a small number of sheets / jackets of microfiche (up to 1,000), you’ll be able to simply fill out the form in your browser, entering the number of sheets you have and the type of processing you need (PDF or JPG, OCR, etc.). The form will automatically calculate the exact price for you (including shipping!), and you’ll print it out and mail it to us with you fiche. We’ll process your order and send your fiche back along with a DVD of your digitized fiche images. We expect to have microfiche digitization available through the Small Order Form within the next few weeks.

Have a great 4th of July!

– Jim and the folks at Creekside Digital

Read More

On NARA Publications & working with difficult film

Nobody has more microfilm than the US Government — and no one has created more public domain microfilm publications than the National Archives and Records Administration, or NARA. In fact, Creekside Digital first ventured into the high-quality microfilm digitization industry as part of a software project built around NARA microfilm, and it continues to be a large part of our business today.

Period document from 35mm NARA microfilm (18x) preserving original paper texture and creases (click for detail: 300dpi JPEG @ 60% quality, 1.51 MB)

Currently, we’ve got another project in the shop that is all National Archives microfilm — in this case, period documents from the 1860s. About half is 16mm, and the other half is 35mm. Typical of NARA rollfilm, the original documents on the 16mm rolls are the same size has those on the 35mm rolls, but the reduction ratio is higher — meaning the frames are smaller, and more documents can fit on a single roll. This also means that there’s less image information in each frame with which to work, which means it’s harder to get a nice, tight focus and set the scanner’s critical settings. You really have to take your time. Each roll of film is different, and there’s no “magic button” which automatically configures the best scanner settings whenever a new roll is loaded. That’s up to the operator, and that’s where experience with the equipment and having “an eye” for what great images look like makes all the difference.

One common issue with scanning period docs on NARA microfilm is the tremendous variation in image density on the same roll. Often, a roll will have a sequence of very dark documents, followed by extremely light papers. This is makes it very difficult to optimally set lamp and gamma: too low, and you end up with a bunch of dark and mostly illegible pages (virtually impossible to frame detect) followed by a document scanned at “normal” brightness. Too hot, and yes, the darker frames become legible — but the previously “normal” documents become overexposed and details like the paper grain start to burn out. It takes a lot of extra time to get everything set “just right” in order to output great images across an entire 100′ roll of film.

16mm NARA roll in the Auditor showing tremendous variation in film density

Creekside Digital’s scanning platform offers an unprecedented level of bandwidth to allow it to better capture the full range of image density found in a given roll of film. Grayscale images no longer look like a poor Xerox copy or over-dithered; shades of gray are as true to the original document as the source film allows. And although perfect automatic frame detection remains a pipe dream with film of highly-variable density, we have software tools which allow a person to visually audit and correct the frame boundaries proposed by the machine.

The screenshot shown here illustrates the high variation in film density we’re encountering on this project. Notice the automatic frame detection has done a good job at establishing initial frame boundaries (the yellow boxes). Any part of any yellow box can be grabbed by the operator with the mouse and dragged into the correct position. Frames can also be removed entirely, and dark frames which were cropped or skipped entirely by the machine can be added by a person prior to output. Try that with first-generation technology that outputs the images “on-the-fly” as the film passes through the scanner!

Read More

‘Tis the Season . . . for Archival Conversions

What’s going on at Creekside Digital? Well, it’s unseasonably cold here in Maryland today. It was 12 degrees this morning and it’s up to a balmy 21 now. It’s windy too, and the wind chill has given rise to a Severe Weather alert for our area.

Recently, we’ve been working on a pair of 35mm archival conversions for libraries in Southern California. This means that the rollfilm is scanned at much higher resolution than normal — in this case, 400dpi — and that the output scans are 8-bit grayscale lossless TIFF images. It’s a more expensive, time-consuming process than “ordinary” 200 / 300dpi scanning with lossy compression, but the result is as close to a bit-for-bit copy of your source rollfilm as you can get given the current state of scanner technology.

Archival 400dpi lossless image at 100% zoom (click for detail)

A few things to keep in mind:

  • 8-bit grayscale TIFFs at 400dpi are huge. A single 11″ x 17″ document might be 22 MB in size. Architectural drawings or a single 2-up newspaper frame might be up to 100MB or more. So delivery via external hard drive is pretty much a requirement for anything over a few rolls of film.
  • Creekside Digital delivers its 8-bit grayscale TIFFs using LZW compression. This is a “lossless” compression scheme which uses rather complex mathematical algorithms to shrink down the file size while preserving bit-for-bit perfection of the original scan (i.e., an LZW-compressed file is visually identical to its uncompressed counterpart). The file size savings are marginal compared to “lossy” compression schemes, such as the popular JPEG format, but there’s no degradation of the image’s quality, either.
  • 150dpi Lossy at 350%

    "Reader" 150dpi lossy image at 350% zoom (click for detail)

  • Files this large are clumsy to work with, even on a fast, new computer. They consume vast amounts of disk space and can take awhile to copy or print. Generally, they’re too large to email. Because of this, we usually create downscaled “reader copies” in PDF format. These are lower-resolution files which library staff and patrons work with day-to-day; when they want to examine the original document with ALL of its high-resolution detail, they can pull the corresponding TIFF image. Additionally, it’s possible to deliver multipage reader copies on DVD (preferred by librarians) for ease of use on non-networked PCs, with each multipage file corresponding to an entire source roll of microfilm. And because the reader copies are so much smaller, you might fit fifteen rolls of film on a single DVD.
  • PDF reader copies can be OCR’d, of course, allowing keyword search of typed / printed documents.

To see the difference between archival- and reader-quality images, click each of the two thumbnails above. The top image is a section of a 400dpi archival scan viewed at 100% zoom (no magnification) — you can see that the high resolution of the scan captures a tremendous amount of detail, and that there is no noise or other “artifacts” associated with compression (as lossless compression was used). The bottom image is the same frame, but as the image was a 150dpi reader, it was necessary to zoom to 350% magnification in order to view the same section of the original document at the same size — yes, archival scans are that large. Note the compression artifacts and loss of image detail on the reader copy. Again, readers are great for day-to-day use, but are no substitute for the detail captured with archival scans.

When you want the best-possible microfilm scan quality combined with the portability of the PDF format, contact Creekside Digital about archival scanning of your microfilm.

Happy Holidays!

Read More
Page 2 of 3«123»