Sign Up

Sign Up to our social questions and Answers Engine to ask questions, answer people’s questions, and connect with other people.

Have an account? Sign In

Have an account? Sign In Now

Sign In

Login to our social questions & Answers Engine to ask questions answer people’s questions & connect with other people.

Sign Up Here

Forgot Password?

Don't have account, Sign Up Here

Forgot Password

Lost your password? Please enter your email address. You will receive a link and will create a new password via email.

Have an account? Sign In Now

You must login to ask a question.

Forgot Password?

Need An Account, Sign Up Here

Please briefly explain why you feel this question should be reported.

Please briefly explain why you feel this answer should be reported.

Please briefly explain why you feel this user should be reported.

Sign InSign Up

The Archive Base

The Archive Base Logo The Archive Base Logo

The Archive Base Navigation

  • SEARCH
  • Home
  • About Us
  • Blog
  • Contact Us
Search
Ask A Question

Mobile menu

Close
Ask a Question
  • Home
  • Add group
  • Groups page
  • Feed
  • User Profile
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Buy Points
  • Users
  • Help
  • Buy Theme
  • SEARCH
Home/ Questions/Q 556453
In Process

The Archive Base Latest Questions

Editorial Team
  • 0
Editorial Team
Asked: May 13, 20262026-05-13T11:55:16+00:00 2026-05-13T11:55:16+00:00

So I have files…. .doc .docx .xls .xlsx and .pdf that are on the

  • 0

So I have files….

.doc
.docx
.xls
.xlsx
and .pdf

that are on the my server.

Is it possible (and if it is, how) to extract the meta data from those files using PHP?
I’m looking for things like Author, keywords, title, etc…

In office documents it’s the information stored along with the document properties (File…Properties…Summary for 2003, Prepare…Properties for 2007).

In PDFs it’s information found in Document Properties.

This is not on a Windows server.

  • 1 1 Answer
  • 0 Views
  • 0 Followers
  • 0
Share
  • Facebook
  • Report

Leave an answer
Cancel reply

You must login to add an answer.

Forgot Password?

Need An Account, Sign Up Here

1 Answer

  • Voted
  • Oldest
  • Recent
  • Random
  1. Editorial Team
    Editorial Team
    2026-05-13T11:55:17+00:00Added an answer on May 13, 2026 at 11:55 am

    I have managed to extract a lot of Meta information using XPDF on a linux system a few years back. Nowadays, though, I would say Zend_PDF is your best bet. Haven’t used it myself but looks good and promises everything you need. Seems to have no library dependencies, either.

    For Word .DOCs, if you don’t find a better way, plug into an OpenOffice server instance / command line and convert the files to ODT, which is XML and parseable. If it’s not possible to extract the meta data per Macro – it should be, but I don’t know how much work it is. This OpenOffice Forum entry gives a ton of starting points for automated conversion.

    The …X formats are some sort of XML, so it should be easily possible to fetch the meta data from them. Alternatively, you should be able to use OpenOffice’s conversion filters here as well, if they transport the meta data.

    • 0
    • Reply
    • Share
      Share
      • Share on Facebook
      • Share on Twitter
      • Share on LinkedIn
      • Share on WhatsApp
      • Report

Sidebar

Related Questions

I have files that are automatically uploaded onto a server from mobile phones, and
I have files on a server that can be accessed from a URL formatted
I have files(pdf,doc,txt,xsl,etc..) stored in my mongo *db*. I want to retrieve and open
I have files with tons of real time data that I process with an
I have files that I want only 'foo' and 'bar' left from split. dn
I have files that I want to delete. Connection can be from file sharing,
I have files like .doc .pdf .excel... and i want to open them externally.
I have files on the server. Originally their names are readable and users put
In my Visual Studio 2010 project I have files with .mm file extension, that
Let's assume that I have files a.cpp b.cpp and file c.h. Both of the

Explore

  • Home
  • Add group
  • Groups page
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Users
  • Help
  • SEARCH

Footer

© 2021 The Archive Base. All Rights Reserved
With Love by The Archive Base

Insert/edit link

Enter the destination URL

Or link to existing content

    No search term specified. Showing recent items. Search or use up and down arrow keys to select an item.