W3C home > Mailing lists > Public > ietf-charsets@w3.org > January to March 2002

Re: Registration of new charset: SAMI-WS2

From: Harald Tveit Alvestrand <harald@alvestrand.no>
Date: Mon, 14 Jan 2002 15:12:10 +0100
To: Gustav Foseid <gustavf@initio.no>, ietf-charsets@iana.org
Cc: i18n-sme@lister.ping.uio.no
Message-id: <1038623862.1011021130@localhost>
Apologies for losing track of this registration for several months....
could you please resubmit the registration with the information required by 
RFC 2278?

I think the data is:

- Use in MIME: YES
- Intended use: LIMITED USE
- Unicode mapping table: In reference

Since the Statskontoret reference does not include any control characters, 
you might also want to include the information that this charset includes 
control charset number 1 (which has CR and LF in the right places).

OK?

           Harald

--On 30. september 2001 17:15 +0200 Gustav Foseid <gustavf@initio.no> wrote:

> Charset name:
>
>   SAMI-WS2
>
>   This is the name character set is know as in GNU libc.
>
> Published specifications:
>
>   A specification is available in "Statskontoret, teknisk norm Nr
>   35:1" Annex B, published by "Statskontoret, Box 2280, SE-1003 17
>   Stockholm, Sweden".  This document is available online from
>   http://skolelinux.ping.uio.no/info/samisk/Tn35.pdf
>
> Contact address for further information:
>
>   Gustav Foseid
>   <gustavf@initio.no>
>
>   Forum for implementation of support for North Sami in free software
>   <i18n-sme@lister.uio.no>
>
> Intended usage:
>
>   SAMI-WS2 is developed as a single byte encoding for all characters
>   used in the Sami languages, of which Northern Sami is the most
>   widely used.  It is developed for use in Microsoft Windows
>   environments, but is also being incorporated in GNU libc.
>
>   Today SAMI-WS2 is the most widely used encoding for Sami languages.
>   Other available single byte encodings are ISO-8859-10, ISO-IR-209
>   and a character set developed for use in Macintosh envionments.
>   SAMI-WS2 is Microsoft Windows Sami version 2.
>
>   This charset is suitable for use in MIME text body parts.
>
> Unicode mapping
>
>   Format: Column #1 is the SAMI-WS2 code (in hex as 0xXX)
>           Column #2 is the Unicode code (in hex as UXXXX)
>           Column #3 is the Unicode name
>
>   0X00       U0000    #   NULL
>   0X01       U0001    #   START OF HEADING
>   0X02       U0002    #   START OF TEXT
>   0X03       U0003    #   END OF TEXT
>   0X04       U0004    #   END OF TRANSMISSION
>   0X05       U0005    #   ENQUIRY
>   0X06       U0006    #   ACKNOWLEDGE
>   0X07       U0007    #   BELL
>   0X08       U0008    #   BACKSPACE
>   0X09       U0009    #   HORIZONTAL TABULATION
>   0X0A       U000A    #   LINE FEED
>   0X0B       U000B    #   LINE TABULATION
>   0X0C       U000C    #   FORM FEED
>   0X0D       U000D    #   CARRIAGE RETURN
>   0X0E       U000E    #   SHIFT OUT
>   0X0F       U000F    #   SHIFT IN
>   0X10       U0010    #   DATA LINK ESCAPE
>   0X11       U0011    #   DEVICE CONTROL ONE
>   0X12       U0012    #   DEVICE CONTROL TWO
>   0X13       U0013    #   DEVICE CONTROL THREE
>   0X14       U0014    #   DEVICE CONTROL FOUR
>   0X15       U0015    #   NEGATIVE ACKNOWLEDGE
>   0X16       U0016    #   SYNCHRONOUS IDLE
>   0X17       U0017    #   END OF TRANSMISSION BLOCK
>   0X18       U0018    #   CANCEL
>   0X19       U0019    #   END OF MEDIUM
>   0X1A       U001A    #   SUBSTITUTE
>   0X1B       U001B    #   ESCAPE
>   0X1C       U001C    #   FILE SEPARATOR
>   0X1D       U001D    #   GROUP SEPARATOR
>   0X1E       U001E    #   RECORD SEPARATOR
>   0X1F       U001F    #   UNIT SEPARATOR
>   0X20       U0020    #   SPACE
>   0X21       U0021    #   EXCLAMATION MARK
>   0X22       U0022    #   QUOTATION MARK
>   0X23       U0023    #   NUMBER SIGN
>   0X24       U0024    #   DOLLAR SIGN
>   0X25       U0025    #   PERCENT SIGN
>   0X26       U0026    #   AMPERSAND
>   0X27       U0027    #   APOSTROPHE
>   0X28       U0028    #   LEFT PARENTHESIS
>   0X29       U0029    #   RIGHT PARENTHESIS
>   0X2A       U002A    #   ASTERISK
>   0X2B       U002B    #   PLUS SIGN
>   0X2C       U002C    #   COMMA
>   0X2D       U002D    #   HYPHEN-MINUS
>   0X2E       U002E    #   FULL STOP
>   0X2F       U002F    #   SOLIDUS
>   0X30       U0030    #   DIGIT ZERO
>   0X31       U0031    #   DIGIT ONE
>   0X32       U0032    #   DIGIT TWO
>   0X33       U0033    #   DIGIT THREE
>   0X34       U0034    #   DIGIT FOUR
>   0X35       U0035    #   DIGIT FIVE
>   0X36       U0036    #   DIGIT SIX
>   0X37       U0037    #   DIGIT SEVEN
>   0X38       U0038    #   DIGIT EIGHT
>   0X39       U0039    #   DIGIT NINE
>   0X3A       U003A    #   COLON
>   0X3B       U003B    #   SEMICOLON
>   0X3C       U003C    #   LESS-THAN SIGN
>   0X3D       U003D    #   EQUALS SIGN
>   0X3E       U003E    #   GREATER-THAN SIGN
>   0X3F       U003F    #   QUESTION MARK
>   0X40       U0040    #   COMMERCIAL AT
>   0X41       U0041    #   LATIN CAPITAL LETTER A
>   0X42       U0042    #   LATIN CAPITAL LETTER B
>   0X43       U0043    #   LATIN CAPITAL LETTER C
>   0X44       U0044    #   LATIN CAPITAL LETTER D
>   0X45       U0045    #   LATIN CAPITAL LETTER E
>   0X46       U0046    #   LATIN CAPITAL LETTER F
>   0X47       U0047    #   LATIN CAPITAL LETTER G
>   0X48       U0048    #   LATIN CAPITAL LETTER H
>   0X49       U0049    #   LATIN CAPITAL LETTER I
>   0X4A       U004A    #   LATIN CAPITAL LETTER J
>   0X4B       U004B    #   LATIN CAPITAL LETTER K
>   0X4C       U004C    #   LATIN CAPITAL LETTER L
>   0X4D       U004D    #   LATIN CAPITAL LETTER M
>   0X4E       U004E    #   LATIN CAPITAL LETTER N
>   0X4F       U004F    #   LATIN CAPITAL LETTER O
>   0X50       U0050    #   LATIN CAPITAL LETTER P
>   0X51       U0051    #   LATIN CAPITAL LETTER Q
>   0X52       U0052    #   LATIN CAPITAL LETTER R
>   0X53       U0053    #   LATIN CAPITAL LETTER S
>   0X54       U0054    #   LATIN CAPITAL LETTER T
>   0X55       U0055    #   LATIN CAPITAL LETTER U
>   0X56       U0056    #   LATIN CAPITAL LETTER V
>   0X57       U0057    #   LATIN CAPITAL LETTER W
>   0X58       U0058    #   LATIN CAPITAL LETTER X
>   0X59       U0059    #   LATIN CAPITAL LETTER Y
>   0X5A       U005A    #   LATIN CAPITAL LETTER Z
>   0X5B       U005B    #   LEFT SQUARE BRACKET
>   0X5C       U005C    #   REVERSE SOLIDUS
>   0X5D       U005D    #   RIGHT SQUARE BRACKET
>   0X5E       U005E    #   CIRCUMFLEX ACCENT
>   0X5F       U005F    #   LOW LINE
>   0X60       U0060    #   GRAVE ACCENT
>   0X61       U0061    #   LATIN SMALL LETTER A
>   0X62       U0062    #   LATIN SMALL LETTER B
>   0X63       U0063    #   LATIN SMALL LETTER C
>   0X64       U0064    #   LATIN SMALL LETTER D
>   0X65       U0065    #   LATIN SMALL LETTER E
>   0X66       U0066    #   LATIN SMALL LETTER F
>   0X67       U0067    #   LATIN SMALL LETTER G
>   0X68       U0068    #   LATIN SMALL LETTER H
>   0X69       U0069    #   LATIN SMALL LETTER I
>   0X6A       U006A    #   LATIN SMALL LETTER J
>   0X6B       U006B    #   LATIN SMALL LETTER K
>   0X6C       U006C    #   LATIN SMALL LETTER L
>   0X6D       U006D    #   LATIN SMALL LETTER M
>   0X6E       U006E    #   LATIN SMALL LETTER N
>   0X6F       U006F    #   LATIN SMALL LETTER O
>   0X70       U0070    #   LATIN SMALL LETTER P
>   0X71       U0071    #   LATIN SMALL LETTER Q
>   0X72       U0072    #   LATIN SMALL LETTER R
>   0X73       U0073    #   LATIN SMALL LETTER S
>   0X74       U0074    #   LATIN SMALL LETTER T
>   0X75       U0075    #   LATIN SMALL LETTER U
>   0X76       U0076    #   LATIN SMALL LETTER V
>   0X77       U0077    #   LATIN SMALL LETTER W
>   0X78       U0078    #   LATIN SMALL LETTER X
>   0X79       U0079    #   LATIN SMALL LETTER Y
>   0X7A       U007A    #   LATIN SMALL LETTER Z
>   0X7B       U007B    #   LEFT CURLY BRACKET
>   0X7C       U007C    #   VERTICAL LINE
>   0X7D       U007D    #   RIGHT CURLY BRACKET
>   0X7E       U007E    #   TILDE
>   0X7F       U007F    #   DELETE
>   0X80       U20AC    #   EURO SIGN
>   0X82       U010C    #   LATIN CAPITAL LETTER C WITH CARON
>   0X83       U0192    #   LATIN SMALL LETTER F WITH HOOK
>   0X84       U010D    #   LATIN SMALL LETTER C WITH CARON
>   0X85       U01B7    #   LATIN CAPITAL LETTER EZH
>   0X86       U0292    #   LATIN SMALL LETTER EZH
>   0X87       U01EE    #   LATIN CAPITAL LETTER EZH WITH CARON
>   0X88       U01EF    #   LATIN SMALL LETTER EZH WITH CARON
>   0X89       U0110    #   LATIN CAPITAL LETTER D WITH STROKE
>   0X8A       U0160    #   LATIN CAPITAL LETTER S WITH CARON
>   0X8B       U2039    #   SINGLE LEFT-POINTING ANGLE QUOTATION MARK
>   0X8C       U0152    #   LATIN CAPITAL LIGATURE OE
>   0X91       U2018    #   LEFT SINGLE QUOTATION MARK
>   0X92       U2019    #   RIGHT SINGLE QUOTATION MARK
>   0X93       U201C    #   LEFT DOUBLE QUOTATION MARK
>   0X94       U201D    #   RIGHT DOUBLE QUOTATION MARK
>   0X95       U2022    #   BULLET
>   0X96       U2013    #   EN DASH
>   0X97       U2014    #   EM DASH
>   0X98       U0111    #   LATIN SMALL LETTER D WITH STROKE
>   0X99       U01E6    #   LATIN CAPITAL LETTER G WITH CARON
>   0X9A       U0161    #   LATIN SMALL LETTER S WITH CARON
>   0X9B       U203A    #   SINGLE RIGHT-POINTING ANGLE QUOTATION MARK
>   0X9C       U0153    #   LATIN SMALL LIGATURE OE
>   0X9F       U0178    #   LATIN CAPITAL LETTER Y WITH DIAERESIS
>   0XA0       U00A0    #   NO-BREAK SPACE
>   0XA1       U01E7    #   LATIN SMALL LETTER G WITH CARON
>   0XA2       U01E4    #   LATIN CAPITAL LETTER G WITH STROKE
>   0XA3       U00A3    #   POUND SIGN
>   0XA4       U00A4    #   CURRENCY SIGN
>   0XA5       U01E5    #   LATIN SMALL LETTER G WITH STROKE
>   0XA6       U00A6    #   BROKEN BAR
>   0XA7       U00A7    #   SECTION SIGN
>   0XA8       U00A8    #   DIAERESIS
>   0XA9       U00A9    #   COPYRIGHT SIGN
>   0XAA       U021E    #   LATIN CAPITAL LETTER H WITH CARON
>   0XAB       U00AB    #   LEFT-POINTING DOUBLE ANGLE QUOTATION MARK
>   0XAC       U00AC    #   NOT SIGN
>   0XAD       U00AD    #   SOFT HYPHEN
>   0XAE       U00AE    #   REGISTERED SIGN
>   0XAF       U021F    #   LATIN SMALL LETTER H WITH CARON
>   0XB0       U00B0    #   DEGREE SIGN
>   0XB1       U00B1    #   PLUS-MINUS SIGN
>   0XB2       U01E8    #   LATIN CAPITAL LETTER K WITH CARON
>   0XB3       U01E9    #   LATIN SMALL LETTER K WITH CARON
>   0XB4       U00B4    #   ACUTE ACCENT
>   0XB5       U00B5    #   MICRO SIGN
>   0XB6       U00B6    #   PILCROW SIGN
>   0XB7       U00B7    #   MIDDLE DOT
>   0XB8       U014A    #   LATIN CAPITAL LETTER ENG
>   0XB9       U014B    #   LATIN SMALL LETTER ENG
>   0XBA       U0166    #   LATIN CAPITAL LETTER T WITH STROKE
>   0XBB       U00BB    #   RIGHT-POINTING DOUBLE ANGLE QUOTATION MARK
>   0XBC       U0167    #   LATIN SMALL LETTER T WITH STROKE
>   0XBD       U00BD    #   VULGAR FRACTION ONE HALF
>   0XBE       U017D    #   LATIN CAPITAL LETTER Z WITH CARON
>   0XBF       U017E    #   LATIN SMALL LETTER Z WITH CARON
>   0XC0       U00C0    #   LATIN CAPITAL LETTER A WITH GRAVE
>   0XC1       U00C1    #   LATIN CAPITAL LETTER A WITH ACUTE
>   0XC2       U00C2    #   LATIN CAPITAL LETTER A WITH CIRCUMFLEX
>   0XC3       U00C3    #   LATIN CAPITAL LETTER A WITH TILDE
>   0XC4       U00C4    #   LATIN CAPITAL LETTER A WITH DIAERESIS
>   0XC5       U00C5    #   LATIN CAPITAL LETTER A WITH RING ABOVE
>   0XC6       U00C6    #   LATIN CAPITAL LETTER AE
>   0XC7       U00C7    #   LATIN CAPITAL LETTER C WITH CEDILLA
>   0XC8       U00C8    #   LATIN CAPITAL LETTER E WITH GRAVE
>   0XC9       U00C9    #   LATIN CAPITAL LETTER E WITH ACUTE
>   0XCA       U00CA    #   LATIN CAPITAL LETTER E WITH CIRCUMFLEX
>   0XCB       U00CB    #   LATIN CAPITAL LETTER E WITH DIAERESIS
>   0XCC       U00CC    #   LATIN CAPITAL LETTER I WITH GRAVE
>   0XCD       U00CD    #   LATIN CAPITAL LETTER I WITH ACUTE
>   0XCE       U00CE    #   LATIN CAPITAL LETTER I WITH CIRCUMFLEX
>   0XCF       U00CF    #   LATIN CAPITAL LETTER I WITH DIAERESIS
>   0XD0       U00D0    #   LATIN CAPITAL LETTER ETH
>   0XD1       U00D1    #   LATIN CAPITAL LETTER N WITH TILDE
>   0XD2       U00D2    #   LATIN CAPITAL LETTER O WITH GRAVE
>   0XD3       U00D3    #   LATIN CAPITAL LETTER O WITH ACUTE
>   0XD4       U00D4    #   LATIN CAPITAL LETTER O WITH CIRCUMFLEX
>   0XD5       U00D5    #   LATIN CAPITAL LETTER O WITH TILDE
>   0XD6       U00D6    #   LATIN CAPITAL LETTER O WITH DIAERESIS
>   0XD7       U00D7    #   MULTIPLICATION SIGN
>   0XD8       U00D8    #   LATIN CAPITAL LETTER O WITH STROKE
>   0XD9       U00D9    #   LATIN CAPITAL LETTER U WITH GRAVE
>   0XDA       U00DA    #   LATIN CAPITAL LETTER U WITH ACUTE
>   0XDB       U00DB    #   LATIN CAPITAL LETTER U WITH CIRCUMFLEX
>   0XDC       U00DC    #   LATIN CAPITAL LETTER U WITH DIAERESIS
>   0XDD       U00DD    #   LATIN CAPITAL LETTER Y WITH ACUTE
>   0XDE       U00DE    #   LATIN CAPITAL LETTER THORN
>   0XDF       U00DF    #   LATIN SMALL LETTER SHARP S
>   0XE0       U00E0    #   LATIN SMALL LETTER A WITH GRAVE
>   0XE1       U00E1    #   LATIN SMALL LETTER A WITH ACUTE
>   0XE2       U00E2    #   LATIN SMALL LETTER A WITH CIRCUMFLEX
>   0XE3       U00E3    #   LATIN SMALL LETTER A WITH TILDE
>   0XE4       U00E4    #   LATIN SMALL LETTER A WITH DIAERESIS
>   0XE5       U00E5    #   LATIN SMALL LETTER A WITH RING ABOVE
>   0XE6       U00E6    #   LATIN SMALL LETTER AE
>   0XE7       U00E7    #   LATIN SMALL LETTER C WITH CEDILLA
>   0XE8       U00E8    #   LATIN SMALL LETTER E WITH GRAVE
>   0XE9       U00E9    #   LATIN SMALL LETTER E WITH ACUTE
>   0XEA       U00EA    #   LATIN SMALL LETTER E WITH CIRCUMFLEX
>   0XEB       U00EB    #   LATIN SMALL LETTER E WITH DIAERESIS
>   0XEC       U00EC    #   LATIN SMALL LETTER I WITH GRAVE
>   0XED       U00ED    #   LATIN SMALL LETTER I WITH ACUTE
>   0XEE       U00EE    #   LATIN SMALL LETTER I WITH CIRCUMFLEX
>   0XEF       U00EF    #   LATIN SMALL LETTER I WITH DIAERESIS
>   0XF0       U00F0    #   LATIN SMALL LETTER ETH
>   0XF1       U00F1    #   LATIN SMALL LETTER N WITH TILDE
>   0XF2       U00F2    #   LATIN SMALL LETTER O WITH GRAVE
>   0XF3       U00F3    #   LATIN SMALL LETTER O WITH ACUTE
>   0XF4       U00F4    #   LATIN SMALL LETTER O WITH CIRCUMFLEX
>   0XF5       U00F5    #   LATIN SMALL LETTER O WITH TILDE
>   0XF6       U00F6    #   LATIN SMALL LETTER O WITH DIAERESIS
>   0XF7       U00F7    #   DIVISION SIGN
>   0XF8       U00F8    #   LATIN SMALL LETTER O WITH STROKE
>   0XF9       U00F9    #   LATIN SMALL LETTER U WITH GRAVE
>   0XFA       U00FA    #   LATIN SMALL LETTER U WITH ACUTE
>   0XFB       U00FB    #   LATIN SMALL LETTER U WITH CIRCUMFLEX
>   0XFC       U00FC    #   LATIN SMALL LETTER U WITH DIAERESIS
>   0XFD       U00FD    #   LATIN SMALL LETTER Y WITH ACUTE
>   0XFE       U00FE    #   LATIN SMALL LETTER THORN
>   0XFF       U00FF    #   LATIN SMALL LETTER Y WITH DIAERESIS
>
> --
> Gustav Foseid, Initio IT-lÝsninger AS
> http://www.initio.no/
>
>
Received on Monday, 14 January 2002 09:49:52 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Monday, 5 June 2006 15:10:52 GMT