Sign Up

Sign Up to our social questions and Answers Engine to ask questions, answer people’s questions, and connect with other people.

Have an account? Sign In

Have an account? Sign In Now

Sign In

Login to our social questions & Answers Engine to ask questions answer people’s questions & connect with other people.

Sign Up Here

Forgot Password?

Don't have account, Sign Up Here

Forgot Password

Lost your password? Please enter your email address. You will receive a link and will create a new password via email.

Have an account? Sign In Now

You must login to ask a question.

Forgot Password?

Need An Account, Sign Up Here

Please briefly explain why you feel this question should be reported.

Please briefly explain why you feel this answer should be reported.

Please briefly explain why you feel this user should be reported.

Sign InSign Up

The Archive Base

The Archive Base Logo The Archive Base Logo

The Archive Base Navigation

  • Home
  • SEARCH
  • About Us
  • Blog
  • Contact Us
Search
Ask A Question

Mobile menu

Close
Ask a Question
  • Home
  • Add group
  • Groups page
  • Feed
  • User Profile
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Buy Points
  • Users
  • Help
  • Buy Theme
  • SEARCH
Home/ Questions/Q 45111
In Process

The Archive Base Latest Questions

Editorial Team
  • 0
Editorial Team
Asked: May 10, 20262026-05-10T15:45:28+00:00 2026-05-10T15:45:28+00:00

I need to convert HTML documents into valid XML, preferably XHTML. What’s the best

  • 0

I need to convert HTML documents into valid XML, preferably XHTML. What’s the best way to do this? Does anybody know a toolkit/library/sample/…whatever that helps me to get that task done?

To be a bit more clear here, my application has to do the conversion automatically at runtime. I don’t look for a tool that helps me to move some pages to XHTML manually.

  • 1 1 Answer
  • 0 Views
  • 0 Followers
  • 0
Share
  • Facebook
  • Report

Leave an answer
Cancel reply

You must login to add an answer.

Forgot Password?

Need An Account, Sign Up Here

1 Answer

  • Voted
  • Oldest
  • Recent
  • Random
  1. 2026-05-10T15:45:28+00:00Added an answer on May 10, 2026 at 3:45 pm

    Convert from HTML to XML with HTML Tidy

    Downloadable Binaries

    JRoppert, For your need, i guess you might want to look at the Sources

    c:\temp>tidy -help tidy [option...] [file...] [option...] [file...] Utility to clean up and pretty print HTML/XHTML/XML see http://tidy.sourceforge.net/  Options for HTML Tidy for Windows released on 14 February 2006:  File manipulation -----------------  -output <file>, -o  write output to the specified <file>  <file>  -config <file>      set configuration options from the specified <file>  -file <file>, -f    write errors to the specified <file>  <file>  -modify, -m         modify the original input files  Processing directives ---------------------  -indent, -i         indent element content  -wrap <column>, -w  wrap text at the specified <column>. 0 is assumed if  <column>            <column> is missing. When this option is omitted, the                      default of the configuration option 'wrap' applies.  -upper, -u          force tags to upper case  -clean, -c          replace FONT, NOBR and CENTER tags by CSS  -bare, -b           strip out smart quotes and em dashes, etc.  -numeric, -n        output numeric rather than named entities  -errors, -e         only show errors  -quiet, -q          suppress nonessential output  -omit               omit optional end tags  -xml                specify the input is well formed XML  -asxml, -asxhtml    convert HTML to well formed XHTML  -ashtml             force XHTML to well formed HTML  -access <level>     do additional accessibility checks (<level> = 0, 1, 2, 3).                      0 is assumed if <level> is missing.  Character encodings -------------------  -raw                output values above 127 without conversion to entities  -ascii              use ISO-8859-1 for input, US-ASCII for output  -latin0             use ISO-8859-15 for input, US-ASCII for output  -latin1             use ISO-8859-1 for both input and output  -iso2022            use ISO-2022 for both input and output  -utf8               use UTF-8 for both input and output  -mac                use MacRoman for input, US-ASCII for output  -win1252            use Windows-1252 for input, US-ASCII for output  -ibm858             use IBM-858 (CP850+Euro) for input, US-ASCII for output  -utf16le            use UTF-16LE for both input and output  -utf16be            use UTF-16BE for both input and output  -utf16              use UTF-16 for both input and output  -big5               use Big5 for both input and output  -shiftjis           use Shift_JIS for both input and output  -language <lang>    set the two-letter language code <lang> (for future use)  Miscellaneous -------------  -version, -v        show the version of Tidy  -help, -h, -?       list the command line options  -xml-help           list the command line options in XML format  -help-config        list all configuration options  -xml-config         list all configuration options in XML format  -show-config        list the current configuration settings  Use --blah blarg for any configuration option 'blah' with argument 'blarg'  Input/Output default to stdin/stdout respectively Single letter options apart from -f may be combined as in:  tidy -f errs.txt -imu foo.html For further info on HTML see http://www.w3.org/MarkUp 
    • 0
    • Reply
    • Share
      Share
      • Share on Facebook
      • Share on Twitter
      • Share on LinkedIn
      • Share on WhatsApp
      • Report

Sidebar

Ask A Question

Stats

  • Questions 85k
  • Answers 85k
  • Best Answers 0
  • User 1
  • Popular
  • Answers
  • Editorial Team

    How to approach applying for a job at a company ...

    • 7 Answers
  • Editorial Team

    How to handle personal stress caused by utterly incompetent and ...

    • 5 Answers
  • Editorial Team

    What is a programmer’s life like?

    • 5 Answers
  • Editorial Team
    Editorial Team added an answer What could be a better source of information than the… May 11, 2026 at 5:15 pm
  • Editorial Team
    Editorial Team added an answer It would not be possible to ExecuteCommand against a stopped… May 11, 2026 at 5:15 pm
  • Editorial Team
    Editorial Team added an answer Just realised the problem: if current is not a valid… May 11, 2026 at 5:15 pm

Related Questions

I need to convert HTML documents into valid XML, preferably XHTML. What's the best
I need to convert a Word document into HTML file(s) in Java. The function
When I write papers or documentation it makes think using LaTeX or OpenOffice is
Quick question, If I want to document some code on a basic HTML and
I need to add line breaks in the positions that the browser naturally adds

Trending Tags

analytics british company computer developers django employee employer english facebook french google interview javascript language life php programmer programs salary

Top Members

Explore

  • Home
  • Add group
  • Groups page
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Users
  • Help
  • SEARCH

Footer

© 2021 The Archive Base. All Rights Reserved
With Love by The Archive Base

Insert/edit link

Enter the destination URL

Or link to existing content

    No search term specified. Showing recent items. Search or use up and down arrow keys to select an item.