In a request to YQL (select * from html where url="...") I got the following response:
callback({
"query":
{"count":"1","created":"2011-05-09T23:29:05Z","lang":"en-US"
}, "results": ["<body>... we\ufffdll call Mr ...</body>"]
}
This is from the YQL console page.
When I type that sequence into firebug (even on YQL’s page) I get:
... we�ll call Mr ...
What am I doing wrong? Is YQL’s site in a bad encoding? Is there some way to convert symbols like this to their ascii equivalent?
BTW this isn’t my site so it’s not like I can change the meta charset on that site
It seems like that (the question mark in a solid black diamond) is what you should be seeing: http://www.fileformat.info/info/unicode/char/fffd/browsertest.htm
The comment on that character’s page says:
Maybe the answers to these might help get a better answer:
Update
You might want to check out the
charsetoption in thewhereclause of your YQL query – I’m not entirely sure what it does but it looks like it forces the YQL engine to use the specified charset when parsing the page. Perhaps setting it toUTF-8will solve your problem.For example,