2009-05-13 01:56:01 Source: unknown

EUC stands for Extended Unix Code. It is a multibyte encoding standard developed by AT&T and supported on all System V implementations, used to represent large Asian character sets. There are several variants, two of which are for Chinese.
It defines both a fixed-length and a variable-length encoding. It is an 8-bit coding method.
If codeset 0 is ASCII, then the EUC codeset is ASCII-transparent. Often codeset 0 is the local version of ASCII. The rules describing a legal EUC codeset are the following:
1) Each character of an EUC multibyte string is chosen from among four distinct codesets (0, 1, 2, and 3).
2) Codeset 0 must be a 7-bit codeset.
3) No multibyte character of codeset 1 will use either SS2 (0x8E) or SS3 (0x8F) as its first byte.
4) Characters from codeset 2 will be preceded by the byte SS2.
5) Characters from codeset 3 will be preceded by the byte SS3.
6) For codesets 1, 2, and 3, every byte of every character must have the eighth bit set.
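The six rules above can be sketched as a small scanner that splits a packed EUC byte string into per-codeset characters. This is only an illustration under stated assumptions: the function name `split_euc` and the `widths` parameter are hypothetical, and the byte widths passed in (two bytes for codeset 1 in EUC-CN; in EUC-TW, a plane byte plus two bytes after SS2 for codeset 2) are supplied by the caller rather than defined by the rules themselves.

```python
SS2 = 0x8E  # single-shift byte that precedes a codeset 2 character
SS3 = 0x8F  # single-shift byte that precedes a codeset 3 character


def split_euc(data, widths):
    """Split packed EUC bytes into (codeset, raw_bytes) pairs.

    widths (hypothetical parameter) maps codeset 1, 2, 3 to the number
    of bytes in one character, not counting the SS2/SS3 introducer.
    A missing or zero width means the codeset is not used.
    """
    out = []
    i = 0
    while i < len(data):
        b = data[i]
        if b < 0x80:                      # rule 2: codeset 0 is 7-bit
            codeset, start, n = 0, i, 1
        elif b == SS2:                    # rule 4
            codeset, start, n = 2, i + 1, widths.get(2, 0)
        elif b == SS3:                    # rule 5
            codeset, start, n = 3, i + 1, widths.get(3, 0)
        else:                             # rule 3: anything else is codeset 1
            codeset, start, n = 1, i, widths.get(1, 0)
        if n == 0:
            raise ValueError(f"codeset {codeset} is not used in this variant")
        body = data[start:start + n]
        if len(body) != n:
            raise ValueError("truncated character at end of input")
        # rule 6: every byte of codesets 1-3 has the eighth bit set
        if codeset != 0 and any(x < 0x80 for x in body):
            raise ValueError(f"byte without high bit in codeset {codeset}")
        out.append((codeset, bytes(body)))
        i = start + n
    return out


EUC_CN = {1: 2}          # assumed widths: two-byte codeset 1 only
EUC_TW = {1: 2, 2: 3}    # assumed widths: SS2 + plane byte + two bytes

print(split_euc(b"A\xd6\xd0", EUC_CN))
print(split_euc(b"\x8e\xa2\xa1\xa1", EUC_TW))
```

Because the first byte alone decides the codeset, the scanner never needs to look back, which is the practical point of rules 3 through 5.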
EUC-TW:
- codeset 0 : ASCII
- codeset 1 : CNS 11643-1992 plane 1
- codeset 2 : CNS 11643-1992 planes 2-16
- codeset 3 : [not used]
EUC-CN:
- codeset 0 : ASCII
- codeset 1 : GB 2312-80
- codeset 2 : [not used]
- codeset 3 : [not used]
From the above, eucCN is simply GB2312. In FreeBSD 4.11 there is no longer a GB2312 locale; eucCN is GB2312, using an 8-bit, two-byte encoding.
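The same equivalence can be checked outside FreeBSD. The sketch below uses Python's standard codecs, where `euc_cn` is registered as an alias of `gb2312`; the sample string is just an illustration, and nothing here says how FreeBSD's own locale tables are built.

```python
import codecs

text = "中文"  # sample string chosen for illustration

encoded = text.encode("euc_cn")
print(encoded.hex())  # d6d0cec4

# Python resolves the euc_cn alias to its gb2312 codec
codec_name = codecs.lookup("euc_cn").name
print(codec_name)

# Two bytes per character, each with the eighth bit set
two_bytes_each = len(encoded) == 2 * len(text)
high_bits_set = all(b & 0x80 for b in encoded)
print(two_bytes_each, high_bits_set)
```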