I've started collecting some character set resources.
Please feel free to add any you know of that are helpful.
My current problem involves some textual data containing curly quotes (hex 91 and 92 in Windows-1252, which often gets labeled as ISO-8859-1, where those bytes are actually control characters; U+2018 and U+2019 in Unicode). The data is stored in MySQL on my web server, retrieved with PHP, sent over an HTTP-based RPC mechanism to a Python program running on my desktop, and displayed with wxWindows using wxPython. Somewhere in the process the characters are getting screwed up, and they wind up being displayed as boxes in my wxPython application.
The data seems to make it all the way to wxWindows without getting screwed up, but wxTextCtrl seems to have a problem being fed Latin-1. It might expect Unicode. I've been able to paste those same curly quotes in from another source and it worked, though maybe the clipboard holds them as Unicode internally. I have to do some more digging. I tried converting the text to UTF-8 on the PHP side, and the result was even more screwed up, though I suspect Python just didn't handle it right. This stuff's all a mess.
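One possible explanation for the UTF-8 attempt looking worse, sketched in Python 3 (I haven't confirmed this is what actually happened): if UTF-8 bytes get read as though they were Latin-1, each curly quote turns into three junk characters instead of one. The variable names here are made up for illustration.

```python
# A left curly quote, U+2018.
curly = "\u2018"

# UTF-8 encodes it as three bytes.
utf8_bytes = curly.encode("utf-8")

# Reading those bytes as Latin-1 never raises an error, but it yields
# three separate characters (an "a" with circumflex plus two control
# characters) instead of one quote.
misread = utf8_bytes.decode("latin-1")

# Reading them as UTF-8 gives the original character back.
correct = utf8_bytes.decode("utf-8")

print(len(misread), len(correct))
```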
It turns out that wxTextCtrl takes Unicode just fine, but my stuff's in Latin-1. What I can't figure out is how to tell Python "this string is Latin-1, not ASCII, damnit" so that it'll let me convert it to Unicode. In the time I've spent trying to find the "correct" answer, I could have written a conversion routine that took all the common Latin-1 characters that sneak in places (like curly quotes) and converted them to their Unicode equivalents. Early on, I tried converting the text into UTF-8 on the server side, but that didn't work right.
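For what it's worth, here is a minimal sketch of the "this is not ASCII" step in Python 3, where it comes down to calling decode on the raw bytes with an explicit encoding. It assumes the text arrives as bytes and that the curly quotes are really Windows-1252 bytes 0x91 and 0x92; the sample data and the helper name to_text are hypothetical.

```python
# Bytes as they might arrive from the server: curly quotes as 0x91 / 0x92.
raw = b"\x91quoted\x92"

# Decoding as Latin-1 always succeeds, but 0x91 and 0x92 become the
# control characters U+0091 and U+0092, which many widgets draw as boxes.
as_latin1 = raw.decode("latin-1")

# Decoding as Windows-1252 maps the same bytes to the real curly quotes,
# U+2018 and U+2019.
as_cp1252 = raw.decode("cp1252")


def to_text(data, encoding="cp1252"):
    """Decode bytes to str, substituting U+FFFD for undecodable bytes.

    Hypothetical helper; cp1252 leaves a few byte values (such as 0x81)
    undefined, so errors="replace" keeps one stray byte from raising.
    """
    return data.decode(encoding, errors="replace")


print(repr(as_latin1))
print(repr(as_cp1252))
```

The resulting str can then be handed to whatever expects Unicode, which would make the hand-written conversion table unnecessary.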
Stuff like this makes me feel good.
http://radioapologia.com/
Bahnsen is one of my favorite people ever.
Thanks to the person who sent me a link!
This guy (mpeg) is way better than my friend Mark.