UTF-8 |
- Inital
EF BB BF is a signature, indicating that the rest
of the file is UTF-8. - Any
EF BF BE is an error. - A real ZWNBSP at the start of a file requires a signature first.
|
UTF-8N |
- All of the text is normal UTF-8; there is no signature.
- Inital
EF BB BF is a ZWNBSP. - Any
EF BF BE is an error.
|
UTF-16 |
- Initial
FE FF is a signature indicating the rest of the
text is big endian UTF-16. - Initial
FF FE is a signature indicating the rest of the
text is little endian UTF-16. - If neither of these are present, all of the text is big endian.
- A real ZWNBSP at the start of a file requires a signature first.
|
UTF-16BE |
- All of the text is big endian: there is no signature.
- Initial
FE FF is a ZWNBSP. - Any
FF FE is an error.
|
UTF-16LE |
- All of the text is little endian: there is no signature.
- Initial
FF FE is a ZWNBSP. - Any
FE FF is an error.
|
UTF-32 |
- Initial
00 00 FE FF is a signature indicating the rest
of the text is big endian UTF-32. - Initial
FF FE 00 00 is a signature indicating the rest
of the text is little endian UTF-32. - If neither of these are present, all of the text is big endian.
- A real ZWNBSP at the start of a file requires a signature first.
|
UTF-32BE |
- All of the text is big endian: there is no signature.
- Initial
00 00 FE FF is a ZWNBSP. - Any
FF FE 00 00 is an error.
|
UTF-32LE |
- All of the text is little endian: there is no signature.
- Initial
FF FE 00 00 is a ZWNBSP. - Initial
00 00 FE FF is an error.
|