The "everything is UTF-8" made my cringe. No it's not. Win32 uses UTF-16. So does JS. Those are just the ones I know off the top of my head. I will give you that a lot of plain text documents are UTF-8. Also worth pointing out that an ASCii string is a UTF-8 string