Sign Up

Sign Up to our social questions and Answers Engine to ask questions, answer people’s questions, and connect with other people.

Have an account? Sign In

Have an account? Sign In Now

Sign In

Login to our social questions & Answers Engine to ask questions answer people’s questions & connect with other people.

Sign Up Here

Forgot Password?

Don't have account, Sign Up Here

Forgot Password

Lost your password? Please enter your email address. You will receive a link and will create a new password via email.

Have an account? Sign In Now

You must login to ask a question.

Forgot Password?

Need An Account, Sign Up Here

Please briefly explain why you feel this question should be reported.

Please briefly explain why you feel this answer should be reported.

Please briefly explain why you feel this user should be reported.

Sign InSign Up

The Archive Base

The Archive Base Logo The Archive Base Logo

The Archive Base Navigation

  • SEARCH
  • Home
  • About Us
  • Blog
  • Contact Us
Search
Ask A Question

Mobile menu

Close
Ask a Question
  • Home
  • Add group
  • Groups page
  • Feed
  • User Profile
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Buy Points
  • Users
  • Help
  • Buy Theme
  • SEARCH
Home/ Questions/Q 7431569
In Process

The Archive Base Latest Questions

Editorial Team
  • 0
Editorial Team
Asked: May 29, 20262026-05-29T09:18:58+00:00 2026-05-29T09:18:58+00:00

is there anyway to perform OCR while uploading a document? can we index the

  • 0
  • is there anyway to perform OCR while uploading a document?

  • can we index the entire document?

  • can the search engine index the entire document? Even though users are required to pay to view the full document?

  • can the document be displayed as a preview with only the selected excerpt visible and the rest blurry with the format of the document still viewable?

I’ve been trying to find easy solutions to these questions using simple php functions or something that wouldn’t seem like rocket science to accomplish. But everywhere I look I see people talking about ApachePOI and Solr Cell and all these server commands that I have no idea about. For the last question, i could only figure out that we can use PHPGD and generate images with blurred content, but I wasnt sure how to make that work if there was formatted text, images and tables etc in the document.

So if someone has easy solutions, or even complicated solutions buts with EASY instructions, those will do. Something like “php document content extraction for noobs”, that will start from the a-b-c’s of it.

Thank you in advance!

  • 1 1 Answer
  • 0 Views
  • 0 Followers
  • 0
Share
  • Facebook
  • Report

Leave an answer
Cancel reply

You must login to add an answer.

Forgot Password?

Need An Account, Sign Up Here

1 Answer

  • Voted
  • Oldest
  • Recent
  • Random
  1. Editorial Team
    Editorial Team
    2026-05-29T09:18:59+00:00Added an answer on May 29, 2026 at 9:18 am

    Zend_Search_Lucene contains some code to read the docx file, which will run in PHP alone.

    For PDF and doc, you can use command line utilities to extract the plain text content, such as catdoc or pdftotext. You can find such utilities for most file formats out there if you search around. They are usually packaged by most distributions.

    From the raw text format, you can feed it to any full text search engine.

    • 0
    • Reply
    • Share
      Share
      • Share on Facebook
      • Share on Twitter
      • Share on LinkedIn
      • Share on WhatsApp
      • Report

Sidebar

Related Questions

Is there any way I can get jQuery to perform a function when you
Is there anyway to perform a timely action in ASP.NET using a timer but
Is there any way to perform SQL Like Queries or Filtering on Java Data
Is there anyway to have a sort of virtual static member in C++? For
Is there anyway to configure a WCF service with a failover endpoint if the
Is there anyway in Java to delete data (e.g., a variable value, object) and
Is there anyway to upgrade the installed JRE in the system? We are having
Is there anyway to declare an object of a class before the class is
Is there anyway to combine all resources into a single exe file such as
Is there anyway to build a solution to target 64 bit environment in vs2003?

Explore

  • Home
  • Add group
  • Groups page
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Users
  • Help
  • SEARCH

Footer

© 2021 The Archive Base. All Rights Reserved
With Love by The Archive Base

Insert/edit link

Enter the destination URL

Or link to existing content

    No search term specified. Showing recent items. Search or use up and down arrow keys to select an item.