Along the lines of my previous question, How do I convert unicode characters to floats in Python? , I would like to find a more elegant solution to calculating the value of a string that contains unicode numeric values.
For example, take the strings “1⅕” and “1 ⅕”. I would like these to resolve to 1.2
I know that I can iterate through the string by character, check for unicodedata.category(x) == “No” on each character, and convert the unicode characters by unicodedata.numeric(x). I would then have to split the string and sum the values. However, this seems rather hacky and unstable. Is there a more elegant solution for this in Python?
I think this is what you want…
or the “comprehensive” solution, for those who prefer that style:
But beware, there are many unicode values that seem to not have a numeric value assigned in python (for example ⅜⅝ don’t work… or maybe is just a matter with my keyboard xD).
Another note on the implementation: it’s “too robust”, it will work even will malformed numbers like “123½3 ½” and will eval it to 1234.0… but it won’t work if there are more than one dots.