Multinational Character Set

From HandWiki
(Redirected from CP 1100)
Short description: DEC character encoding used on VT220 terminals
Multinational Character Set (MCS)
MIME / IANADEC-MCS
Alias(es)IBM1100, CP1100, WE8DEC, csDECMCS, dec
Language(s)English, various others
ExtendsUS-ASCII
Succeeded byISO 8859-1, LICS, BraSCII, Cork encoding

The Multinational Character Set (DMCS or MCS) is a character encoding created in 1983 by Digital Equipment Corporation (DEC) for use in the popular VT220 terminal. It was an 8-bit extension of ASCII that added accented characters, currency symbols, and other character glyphs missing from 7-bit ASCII. It is only one of the code pages implemented for the VT220 National Replacement Character Set (NRCS).[1][2] MCS is registered as IBM code page/CCSID 1100 (Multinational Emulation) since 1992.[3][4] Depending on associated sorting Oracle calls it WE8DEC, N8DEC, DK8DEC, S8DEC, or SF8DEC.[5][6]

Such "extended ASCII" sets were common (the National Replacement Character Set provided sets for more than a dozen European languages), but MCS has the distinction of being the ancestor of ECMA-94 in 1985[7] and ISO 8859-1 in 1987.[8]

The code chart of MCS with ECMA-94, ISO 8859-1 and the first 256 code points of Unicode have many more similarities than differences. In addition to unused code points, differences from ISO 8859-1 are:

MCS code point Unicode mapping Character
0xA8 U+00A4 ¤
0xD7 U+0152 Œ
0xDD U+0178 Ÿ
0xF7 U+0153 œ
0xFD U+00FF ÿ

Character set

DEC Multinational Character Set[3][9][10][11][12][13][14]
0 1 2 3 4 5 6 7 8 9 A B C D E F
0_ NUL SOH STX ETX EOT ENQ ACK BEL  BS   HT   LF   VT   FF   CR   SO   SI  
1_ DLE DC1 DC2 DC3 DC4 NAK SYN ETB CAN  EM  SUB ESC  FS   GS   RS   US 
2_  SP  ! " # $ % & ' ( ) * + , - . /
3_ 0 1 2 3 4 5 6 7 8 9 : ; < = > ?
4_ @ A B C D E F G H I J K L M N O
5_ P Q R S T U V W X Y Z [ \ ] ^ [[Underscore|]]
6_ ` a b c d e f g h i j k l m n o
7_ p q r s t u v w x y z { | } ~ DEL
8_ IND NEL SSA ESA HTS HTJ VTS PLD PLU  RI   SS2 SS3
9_ DCS PU1 PU2 STS CCH MW SPA EPA CSI  ST  OSC  PM  APC
A_ ¡ ¢ £ ¥ § ¤
00A4
© ª «
B_ ° ± ² ³ µ · ¹ º » ¼ ½ ¿
C_ À Á Â Ã Ä Å Æ Ç È É Ê Ë Ì Í Î Ï
D_ Ñ Ò Ó Ô Õ Ö Œ
0152
Ø Ù Ú Û Ü Ÿ
0178
ß
E_ à á â ã ä å æ ç è é ê ë ì í î ï
F_ ñ ò ó ô õ ö œ
0153
ø ù ú û ü ÿ
00FF
  Differences from ISO-8859-1

See also

References

  1. "VT220 Programmer Reference Manual". Digital Equipment Corporation (DEC). 1984. http://vt100.net/docs/vt220-rm/. 
  2. "TinyTERM Emulator — National Replacement Character Set (NRCS)". Century Software. http://www.centurysoftware.com/support/help/ttus_help/ttusNational_Replacement_Character_S.htm.  [sic]
  3. 3.0 3.1 "SBCS code page information - CPGID: 01100 / Name: Multinational Emulation". IBM Software: Globalization: Coded character sets and related resources: Code pages by CPGID: Code page identifiers. IBM. 1992-10-01. https://www-01.ibm.com/software/globalization/cp/cp01100.html.  [1] [2] [3]
  4. "CCSID 1100 information document". http://www-01.ibm.com/software/globalization/ccsid/ccsid1100.html. 
  5. "Appendix A: Locale Data". Oracle9i Database Globalization Support Guide (Release 2 (9.2) ed.). Oracle Corporation. 2002. Oracle A96529-01. https://docs.oracle.com/cd/B10501_01/server.920/a96529.pdf. Retrieved 2017-02-14. 
  6. "Oracle characterset descriptions for 9.2". Daylight Chemical Information Systems. 2017. http://www.daylight.com/meetings/emug04/Delany/charsets.html. 
  7. Standard ECMA-94: 8-bit Single-Byte Coded Graphic Character Set (1 ed.). European Computer Manufacturers Association (ECMA). March 1985. http://www.ecma-international.org/publications/files/ECMA-ST-WITHDRAWN/ECMA-94,%201st%20Edition,%20March%201985.pdf. Retrieved 2016-12-01. "Since 1982 the urgency of the need for an 8-bit single-byte coded character set was recognized in ECMA as well as in ANSI/X3L2 and numerous working papers were exchanged between the two groups. In February 1984 ECMA TC1 submitted to ISO/TC97/SC2 a proposal for such a coded character set. At its meeting of April 1984 SC decided to submit to TC97 a proposal for a new item of work for this topic. Technical discussions during and after this meeting led TC1 to adopt the coding scheme proposed by X3L2. Part 1 of Draft International Standard DTS 8859 is based on this joint ANSI/ECMA proposal.... Adopted as an ECMA Standard by the General Assembly of Dec. 13–14, 1984." 
  8. "ISO 8859-1 and MCS". ISO 8859 Alphabet Soup. 1998. http://czyborra.com/charsets/iso8859.html#ISO-8859-1.  [4] [5]
  9. "VT220 Programmer Reference Manual". Digital Equipment Corporation (DEC). Table 2-3: DEC Multinational Character Set (C1 and GR Codes). http://vt100.net/docs/vt220-rm/table2-3b.html. 
  10. VAX/VMS User's Manual. Digital Equipment Corporation (DEC). April 1986. AI-Y517A-TE. 
  11. DEC (February 1992). "Chapter 2: Character Encoding - DEC Supplemental Graphic Character Set". VT420 Programmer Reference Manual (2 ed.). Digital Equipment Corporation. pp. 24–25. EK–VT420–RM.002. http://manx-docs.org/collections/mds-199909/cd3/term/vt420rm2.pdf. Retrieved 2017-01-29. 
  12. "Locale::RecodeData::DEC_MCS - Conversion routines for DEC_MCS". CPAN libintl-perl. 2016. http://search.cpan.org/~guido/libintl-perl/lib/Locale/RecodeData/DEC_MCS.pm. 
  13. "DEC Multinational Character Set (DEC MCS)". http://www.kostis.net/charsets/decmcs.htm. 
  14. "DEC Multinational Character Set (1987) to Unicode". Unicode, Inc.. 1999-07-07. https://www.math.nmsu.edu/~mleisher/Software/csets/DECMCS.TXT. 

Template:Digital Equipment Corporation