I am working with a large number of HTML files that are mostly encoded as UTF-8. There are accented characters galore, as many of the files are in French. I have been converting them to HTML entities as I go, but I noticed that even in IE 5.5 (according to IETester) the unconverted accented characters display properly.
Should I be concerned with character display and convert them all to HTML entities just to be on the safe side?
If the files are UTF-8 encoded, you should set the Content-Type header to text/html; charset=UTF-8 and have an equivalent meta tag in the head of the page:
<meta http-equiv="Content-Type" content="text/html; charset=UTF-8">
This gives the browser all the information it needs to display UTF-8 characters correctly. There is no need to convert accented characters to HTML entities.
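Since the files are only "mostly" UTF-8, it may be worth checking them before relying on the declared charset. The following is a minimal sketch, not a definitive tool: the function names check_html_bytes and check_tree are made up for illustration, the regex is a rough heuristic rather than a real HTML parser, and it assumes the files end in .html.

```python
import re
from pathlib import Path

# Heuristic match for a meta tag that declares UTF-8, covering both
# <meta http-equiv="Content-Type" content="text/html; charset=UTF-8">
# and the shorter <meta charset="utf-8"> form.
META_CHARSET = re.compile(rb'<meta[^>]+charset\s*=\s*["\']?\s*utf-8', re.IGNORECASE)


def check_html_bytes(data: bytes) -> dict:
    """Report whether the bytes decode as UTF-8 and declare UTF-8 in a meta tag."""
    try:
        data.decode("utf-8")
        valid = True
    except UnicodeDecodeError:
        valid = False
    # Only the start of the file is inspected, since the declaration
    # is expected early in the head (assumption: first 1024 bytes).
    declares = META_CHARSET.search(data[:1024]) is not None
    return {"valid_utf8": valid, "declares_utf8": declares}


def check_tree(root: str) -> dict:
    """Run the check on every .html file under root (hypothetical layout)."""
    return {
        str(path): check_html_bytes(path.read_bytes())
        for path in sorted(Path(root).rglob("*.html"))
    }
```

Any file reported with valid_utf8 set to False is probably saved in another encoding (Latin-1 or Windows-1252 are common for French text) and would need to be re-saved as UTF-8 rather than patched with entities.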