Sign Up

Sign Up to our social questions and Answers Engine to ask questions, answer people’s questions, and connect with other people.

Have an account? Sign In

Have an account? Sign In Now

Sign In

Login to our social questions & Answers Engine to ask questions answer people’s questions & connect with other people.

Sign Up Here

Forgot Password?

Don't have account, Sign Up Here

Forgot Password

Lost your password? Please enter your email address. You will receive a link and will create a new password via email.

Have an account? Sign In Now

You must login to ask a question.

Forgot Password?

Need An Account, Sign Up Here

Please briefly explain why you feel this question should be reported.

Please briefly explain why you feel this answer should be reported.

Please briefly explain why you feel this user should be reported.

Sign InSign Up

The Archive Base

The Archive Base Logo The Archive Base Logo

The Archive Base Navigation

  • SEARCH
  • Home
  • About Us
  • Blog
  • Contact Us
Search
Ask A Question

Mobile menu

Close
Ask a Question
  • Home
  • Add group
  • Groups page
  • Feed
  • User Profile
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Buy Points
  • Users
  • Help
  • Buy Theme
  • SEARCH
Home/ Questions/Q 8156365
In Process

The Archive Base Latest Questions

Editorial Team
  • 0
Editorial Team
Asked: June 6, 20262026-06-06T16:55:58+00:00 2026-06-06T16:55:58+00:00

I have a folder which contains doc, docx, xlsx, pdf and txt files. I

  • 0

I have a folder which contains doc, docx, xlsx, pdf and txt files. I am uploading all these files into Marklogic with this XQuery:-

for $d in xdmp:filesystem-directory("C:\uploads")//dir:entry
return 
  xdmp:document-load($d//dir:pathname,
    <options xmlns="xdmp:document-load">
    <uri>{concat("/documents/", string($d//dir:filename))}</uri>
    <permissions>{xdmp:default-permissions()}</permissions>
    <collections>{xdmp:default-collections()}</collections>
    <format>binary</format>
    </options>)

I have also installed content processing for my database. Now when I upload doc and pdf files they get converted to xml & xhtml files. But docx, xlsx, & txt do not get converted. Can somebody tell me why these files are not getting converted?

  • 1 1 Answer
  • 0 Views
  • 0 Followers
  • 0
Share
  • Facebook
  • Report

Leave an answer
Cancel reply

You must login to add an answer.

Forgot Password?

Need An Account, Sign Up Here

1 Answer

  • Voted
  • Oldest
  • Recent
  • Random
  1. Editorial Team
    Editorial Team
    2026-06-06T16:56:00+00:00Added an answer on June 6, 2026 at 4:56 pm

    Enable the Office OpenXML Extract pipeline to convert the .docx, .xlsx, and .pptx files.

    Files with these extensions are already XML. If you were to change their extension to .zip, you could extract and see the files are just composed of interrelated XML parts.

    The Office OpenXML Extract pipeline will unzip Office 2007/2010 files and store their requisite parts in a directory sibling to the main file, similar to the other conversion pipelines. This pipeline allows you to store the raw Open XML. There is no further conversion to XHTML of DocBook at this time.

    There is no conversion for .txt that I’m aware of. Those are just text files and will be inserted as text in MarkLogic. You could convert to XML by simply wrapping the text in a parent element and changing the file extension to .xml.

    Hope this helps.

    • 0
    • Reply
    • Share
      Share
      • Share on Facebook
      • Share on Twitter
      • Share on LinkedIn
      • Share on WhatsApp
      • Report

Sidebar

Related Questions

I have a folder in c:\program files(x86)\company\application\ which contains all the app files. How
I have a folder which contains the following files: Elephant.19864.archive.other.pdf Elephant.17334.other.something.pdf Turnip.19864.something.knight.pdf Camera.22378.nothing.elf.pdf I
I have subversioned my entire folder which contains all the source files, binaries and
I have a folder which contains few files and some directories which I need
I have a folder, which was a git repo. It contains some files and
I have a source folder which contains 4 csv files with different no of
I have a c:\config folder which contains several configuration files (config_x). I would like
I have a folder which contains some subversion revision checkouts (these are checked out
i have a folder in sd card which contains several files. now i need
I have a folder, which contains html files, images, stylesheets, and js. I have

Explore

  • Home
  • Add group
  • Groups page
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Users
  • Help
  • SEARCH

Footer

© 2021 The Archive Base. All Rights Reserved
With Love by The Archive Base

Insert/edit link

Enter the destination URL

Or link to existing content

    No search term specified. Showing recent items. Search or use up and down arrow keys to select an item.