An example HTML document retrieved over HTTP lacks:
- a HTTP
Content-Typeheader - a HTML
<meta charset="<character encoding>" /> - a HTML
<meta http-equiv='Content-Type' content='Type=text/html; charset=<character encoding>'>
With regards to HTML5, is a default, for example UTF-8, assumed as the character encoding? Or is it entirely up the application reading the HTML document to choose a default?
The charset is determined using these rules:
But the most important thing is:
Sources: