Code Converter
- From network character code to Arena's internal code
- If the CHARSET parameter is specified:
Follow the CHARSET parameter
- If the CHARSET parameter is NOT specified:
Guess the kind of network character code
How to guess it? - Our assumptions are:
- Assumption 1:
All HTML documents start with the symbol `<' (less-than),
e.g., <HTML>, <HEAD>, etc.
- Assumption 2:
You can say ``it's UNICODE'' if the first byte is 0x00,
because the high-order byte of any Latin-1 character in 16-bit UNICODE,
such as `<' (0x00 0x3C), is 0x00.
How to guess it? - Our rule is:
- If the first byte is 0x00 then:
It's UNICODE.
- Otherwise:
It's ISO-2022.
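
The selection steps above can be sketched roughly as follows. This is only an illustration, not Arena's actual code: the function names are hypothetical, the CHARSET parameter is assumed to arrive in a Content-Type value, and the UNICODE case is assumed to put the high-order byte first.

```python
from typing import Optional


def charset_from_content_type(content_type: Optional[str]) -> Optional[str]:
    """Return the CHARSET parameter of a Content-Type value, or None.

    Hypothetical helper: assumes the parameter looks like
    'text/html; charset=ISO-8859-1'.
    """
    if not content_type:
        return None
    # Parameters follow the media type, separated by ';'.
    for param in content_type.split(";")[1:]:
        name, sep, value = param.partition("=")
        if sep and name.strip().lower() == "charset":
            return value.strip().strip('"') or None
    return None


def guess_network_code(data: bytes) -> str:
    """Apply the guessing rule to the first byte of the document."""
    # Assumption 1: the document starts with `<' (0x3C).
    # Assumption 2: in 16-bit UNICODE, high-order byte first, `<' is
    # 0x00 0x3C, so a leading 0x00 byte means UNICODE.
    if data[:1] == b"\x00":
        return "UNICODE"
    # Otherwise the rule falls back to ISO-2022.
    return "ISO-2022"


def network_code(content_type: Optional[str], data: bytes) -> str:
    """Follow CHARSET when it is specified, otherwise guess."""
    declared = charset_from_content_type(content_type)
    return declared if declared is not None else guess_network_code(data)
```

For example, network_code("text/html", b"\x00<\x00H") follows the guessing branch, while a value such as "text/html; charset=ISO-8859-1" skips the guess entirely. An empty document has no first byte, so this sketch treats it as ISO-2022.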