Omnipage Pro 14 - How do I save OCR data in TIFF so I can search document with Home XP Documents sea

Discussion in 'Scanners' started by RJ, Nov 4, 2004.

  1. RJ

    RJ Guest

    I want to scan documents, save the image for legal reasons but be able to
    search the documents with the Start/Search/Documents utility.

    I have Omnipage Pro 14 on a Home XP system. Omnipage OCR is much more
    accurate than the MS Document Imaging so I'd like to use it.

    However, when I scan a document, OCR it and save it to TIFF format the file
    does not show up as being OCR'd by XP and documents scanned with Omnipage
    don't show up in Start/Search/Documents search. Documents scanned and OCR'd
    with Office Document Imaging are found correctly by the
    Start/Search/Documents.

    How do I save my OCR'd Omnipage documents in a TIFF so the data is available
    to MS Document search?

    I realize I could save as TIFF and searchable Word document but I'd rather
    have a single source picture/OCR data file. I could save as PDF but I
    believe XP cannot search a PDF file. I'd appreciate any advice.

    thanks
    rj
     
    RJ, Nov 4, 2004
    #1

  2. ScanSoft OmniPage Pro 14 is neither a part of MS Office, nor is it a
    Microsoft product. If you bothered to read ScanSoft OmniPage Pro 14's
    excellent Help files, you would know that....

    TIFF is a GRAPHICS file format -- when you save your OCR as a TIFF, you are
    not saving it as editable / searchable text -- you are saving it as a
    PICTURE of text. Since there is, by definition, no text in a graphic file,
    neither MS Windows, nor any other program, can find text within a graphic
    file.
     
    joust in jest, Nov 4, 2004
    #2

  3. You can't. OCR takes you from an image file (like TIFF) to a text file or
    word processor file. That kind of file can be word-indexed for searches. Going back
    to an image file loses that benefit: you convert words to pixels, and lose the words.
    And you need to OCR it over again to get back at the words.
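
    To illustrate what "word-indexed" means here, below is a minimal sketch,
    assuming the OCR text has been saved as a plain .txt file next to each
    image (for example scan1.tif plus scan1.txt). The function names
    build_word_index and find_documents are made up for this example; this is
    not how the XP search or OmniPage work internally.

```python
import re
from collections import defaultdict
from pathlib import Path


def build_word_index(folder):
    """Map each lowercase word to the set of .txt file names containing it.

    Assumption: every scanned image has a sidecar .txt file holding its OCR text.
    """
    index = defaultdict(set)
    for path in Path(folder).glob("*.txt"):
        text = path.read_text(encoding="utf-8", errors="ignore")
        for word in re.findall(r"[a-z0-9]+", text.lower()):
            index[word].add(path.name)
    return index


def find_documents(index, query):
    """Return the sorted names of files that contain every word of the query."""
    words = re.findall(r"[a-z0-9]+", query.lower())
    if not words:
        return []
    # Intersect the per-word file sets; a missing word yields an empty set.
    hits = set.intersection(*(index.get(w, set()) for w in words))
    return sorted(hits)
```

    Calling find_documents(build_word_index("scans"), "lease agreement")
    would list the text files in a (hypothetical) "scans" folder that contain
    both words. A TIFF on its own gives a function like this nothing to read,
    which is the point made above.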

    If you want to keep the original image, but with the text for searches,
    PDF (as you note) or perhaps DjVu would be the right choices.
    Adobe Reader can search PDF files -- and you're going to need it to access the PDF files anyway.
    If you have very many PDF files, talk to Adobe -- I'm almost sure they have
    a solution. It may not fit your budget, though.

    I believe DjVu now is capable of something similar, but I don't think
    many OCR programs can save to that format directly.
     
    Anders Thulin, Nov 4, 2004
    #3
  4. Graham Jones

    Graham Jones Guest

    For a cheap way to search PDF files, you could try SearchWithin from
    http://www.software995.com/
     
    Graham Jones, Nov 4, 2004
    #4
  5. friend®

    friend® Guest

    *I want to scan documents, save the image for legal reasons but be able to
    *search the documents with the Start/Search/Documents utility.

    Images and RTF or DOC documents are inherently unsafe: they can easily be
    manipulated afterwards. PDF files can be manipulated too, but all versions
    are saved; the file can be amended, not deleted and replaced, so
    all versions can be checked for changes. The best solution is to use
    Adobe Acrobat 6. It scans the document and can OCR within the
    program, giving you several options for saving: as an image, as image
    over text, or as text. For legal reasons, saving image over text is
    best. Acrobat 6 lets you search multiple files in a subdirectory for
    a string of characters without creating indices. Its OCR is not as good as
    FineReader 7, but it all depends on type size and face. Get a demo
    version of Acrobat 6 and try it; if it is work related, claim it against taxes.
    Images saved by Acrobat 6 can use JPEG, JPEG2000 and probably other
    compression methods as well. JPEG is not good for text, but JPEG2000
    is OK, with smaller file size for equal quality.
    As a matter of fact, WinXP is a piece of shit; get 2000 and use real
    third-party programs, not cut-down bloatware from µshit.
    *
    *I have Omnipage Pro 14 on a Home XP system. Omnipage OCR is much more
    *accurate than the MS Document Imaging so I'd like to use it.

    Omnipage is extremely expensive.
     
    friend®, Nov 5, 2004
    #5