PDF Migration from CD-ROM to WEB
The Early Days of Publishing PDF on CD-ROM
When Verity first produced Acrobat Catalog and Acrobat Search for Adobe, R.R. Donnelly was in on the deal and you could buy it from them for $5,000 plus $50.00 per CD produced. Only a few well-funded pioneers published this way.
In 1996, when Acrobat 3.0 was introduced, The Catalog and Search tools became "free" (bundled with the Acrobat application suite). No more seat charge. CDs became an exciting way to deliver paper collections in digital format to users.
I have the CDs that were distributed at the AIIM conventions in 1996 and 1997, but after that, no more free CDs were distributed. Why? My guess is that attention was already shifting to the Web as a delivery vehicle.
I personally own way too many CDs, and keeping track of them is a chore. I prefer that reference material be "stored" on the web for me, accessible on a moments notice.
Pressure to Migrate to the Web
The arrival of Acrobat 4.0 has come and gone now, but Acrobat Catalog has hardly changed from the beginning, and it has limitations.
For one, it limits use on a LAN by using hard-coded drive letters when finding indexes. If your PC happens to be configured with different drive letter assignments, the Catalog search indexes will not be found.
For another, it cannot be used by a web server to search PDF for display in the web browser.
Adobe has produced a free iFilter for Microsoft Index Server (bundled with NT), but this only extracts the text for indexing, and cannot produce search term highlighting or hit-page downloading, as can SearchPDF.
The lack of a full-text search tool from Adobe for searching PDF on the Web has created pressure.
Finally, Acrobat Catalog can not effectively build a search-index that spans multiple CDs. The is no friendly built-in way to say "please insert Disk 2". The root problem is that digital document collections can easily expand beyond the storage capacity of a single CD. Who wants the hassle of swapping CDs? The web is the solution.
False Hopes with SearchPDF
Verity introduced a free beta application called SearchPDF in 1997, that used the Catalog-created search-index to work with PDF on the web. I thought at the time that for sure, Verity would produce a reasonably priced search product for PDF on the Web. But Verity's products became limited to expensive, high-end products.
SearchPDF Provides the Solution
SearchPDF draws upon the native capabilities of the dtSearch engine to correctly handle PDF on the Web. This means Search Term Highlighting and Intelligent Page Downloading, where just the hit pages come down to the browser, and the first one is displayed as soon as it arrives.
SearchPDF then goes beyond the abilites of Acrobat Catalog and Search by using Custom Index (DocInfo) Fields stored in the PDF files, and retrieved for use in the search-index. Fielded, structured queries can be combined with full-text queries, and this gives power and flexibility to the searcher.
Acrobat Search has an interface that is limited to pop-up boxes in the Acrobat Viewer. SearchPDF occupies the rich web-space, and uses this to present elegant search and retrieval screens.
By using Active Server Pages (ASP) and recordsets, the search and retrieval screens take on all the dynamic attributes of a document management system!
SearchPDF also extends beyond search and retrieval screens and can be used with hyperlinks throughout the web portal to create dynamic document listings.
The Future
Digital Publications, once entirely associated with CDs, will move to the Web and to DVDs. The Web will dominate. The competitive advantage will be in how rich and effective the delivery can be. SearchPDF can take you to the leading edge today.
Live Demonstration of CD-ROM to Web Migration
We have created a demonstration migration. We started with 50 PDF technical papers published on CD, and present them to you with all the advanced features of SearchPDF on the web. Click here to visit this demonstration now.
|