Windows-1258


Windows-1258

Windows-1258 is a codepage used in Microsoft Windows to represent Vietnamese texts. It makes use of combining diacritical marks. Windows-1258 is not compatible with VISCII. It is very similar to windows-1252 with the differences being that s-caron and z-caron (which were added to windows-1252 later) are missing, four of the letters with diacritics have been replaced by combining diacritics and a few other letter/diacritic combinations have been replaced.

Use of combining diacritics means that windows-1258 can cover the large number of letter/diacritic combinations in Vietnamese without compromising coverage of control codes or symbols.

UTF-8 is the preferred encoding for Vietnamese in modern applications. Windows 1258 may not always round trip Unicode encoded Vietnamese due to Unicode normalization differences.

Codepage layout

The following table shows Windows-1258. Only the upper half (128–255) of the table is shown, the lower half (0–127) being plain ASCII. Each character is shown with its Unicode equivalent and its decimal code. Differences from Windows-1252 are marked with thick green borders.

Windows-1258
−0 −1 −2 −3 −4 −5 −6 −7 −8 −9 −A −B −C −D −E −F
 
8−
 

20AC
128
 
201A
130
ƒ
0192
131

201E
132

2026
133

2020
134

2021
135
ˆ
02C6
136

2030
137
 
2039
139
Œ
0152
140
     
 
9−
 
 
2018
145

2019
146

201C
147

201D
148

2022
149

2013
150

2014
151
˜
02DC
152

2122
153
 
203A
155
œ
0153
156
    Ÿ
0178
159
 
A−
 
NBSP
00A0
160
¡
00A1
161
¢
00A2
162
£
00A3
163
¤
00A4
164
¥
00A5
165
¦
00A6
166
§
00A7
167
¨
00A8
168
©
00A9
169
ª
00AA
170
«
00AB
171
¬
00AC
172
SHY
00AD
173
®
00AE
174
¯
00AF
175
 
B−
 
°
00B0
176
±
00B1
177
²
00B2
178
³
00B3
179
´
00B4
180
µ
00B5
181

00B6
182
·
00B7
183
¸
00B8
184
¹
00B9
185
º
00BA
186
»
00BB
187
¼
00BC
188
½
00BD
189
¾
00BE
190
¿
00BF
191
 
C−
 
À
00C0
192
Á
00C1
193
Â
00C2
194
Ă
0102
195
Ä
00C4
196
Å
00C5
197
Æ
00C6
198
Ç
00C7
199
È
00C8
200
É
00C9
201
Ê
00CA
202
Ë
00CB
203
̀
0300
204
Í
00CD
205
Î
00CE
206
Ï
00CF
207
 
D−
 
Đ
0110
208
Ñ
00D1
209
̉
0309
210
Ó
00D3
211
Ô
00D4
212
Ơ
01A0
213
Ö
00D6
214
×
00D7
215
Ø
00D8
216
Ù
00D9
217
Ú
00DA
218
Û
00DB
219
Ü
00DC
220
Ư
01AF
221
̃
0303
222
ß
00DF
223
 
E−
 
à
00E0
224
á
00E1
225
â
00E2
226
ă
0103
227
ä
00E4
228
å
00E5
229
æ
00E6
230
ç
00E7
231
è
00E8
232
é
00E9
233
ê
00EA
234
ë
00EB
235
́
0301
236
í
00ED
237
î
00EE
238
ï
00EF
239
 
F−
 
đ
0111
240
ñ
00F1
241
̣
0323
242
ó
00F3
243
ô
00F4
244
ơ
01A1
245
ö
00F6
246
÷
00F7
247
ø
00F8
248
ù
00F9
249
ú
00FA
250
û
00FB
251
ü
00FC
252
ư
01B0
253

20AB
254
ÿ
00FF
255

External links


Wikimedia Foundation. 2010.

Look at other dictionaries:

  • Windows-1258 — Windows Codepages 874  Thai 932  Japanisch 936  Vereinfachtes Chinesisch 949  Koreanisch 950  Traditionelles Chinesisch 1250  Mitteleuropäisch 1251  Kyrillisch 1252 …   Deutsch Wikipedia

  • Windows-1258 — La page de code Windows 1258 (dans le registre IANA des jeux de caractères codés pour l’informatique et les normes Internet, aussi connue comme CP1258) est utilisée dans Microsoft® Windows® pour représenter les textes en quôc ngu, l’actuelle… …   Wikipédia en Français

  • Windows-1252 — ou CP1252 est un jeu de caractères, utilisé historiquement par défaut sur le système d exploitation Microsoft Windows en anglais et dans les principales langues d’Europe de l’Ouest (dont le français). Sommaire 1 Contexte 2 Aspects techniques …   Wikipédia en Français

  • Windows-1252 — ISO 8859 1 Latin 1, Westeuropäisch 2 Latin 2, Mitteleuropäisch 3 Latin 3, Südeuropäisch 4 Latin 4, Baltisch 5 Kyrillisch 6 Arabisch 7 Griechisch 8 …   Deutsch Wikipedia

  • Windows code page — Windows code pages are sets of characters or code pages (known as character encodings in other operating systems) used in Microsoft Windows from the 1980s and 1990s. Windows code pages were gradually superseded when Unicode was implemented in… …   Wikipedia

  • Windows-1251 — набор символов и кодировка, являющаяся стандартной 8 битной кодировкой для всех русских версий Microsoft Windows. Пользуется довольно большой популярностью. Была создана на базе кодировок, использовавшихся в ранних «самопальных» русификаторах… …   Википедия

  • Windows-1254 — Windows 1254  кодовая страница, используемая Microsoft Windows для представления турецкого языка. Символы с кодами от A0 до FF совместимы с ISO 8859 9. Для современных приложений UTF 8 предпочтительней windows 1254. Таблица кодов Символы с… …   Википедия

  • Windows-1252 — Windows 1252, sometimes called incorrectly ANSI . Blue dots indicate unused or control characters Windows 1252 or CP 1252 is a character encoding of the Latin alphabet, used by default in the legacy components of Microsoft Windows in English and… …   Wikipedia

  • Windows-1252 — Windows 1252, sometimes called incorrectly ANSI . Blue dots indicate unused or control characters Windows 1252 or CP 1252 es una codificacion de caracteres del alfabeto latino, usado por defecto en los componentes oficiales de Microsoft Windows… …   Wikipedia Español

  • Windows-1255 — is a codepage used under Microsoft Windows to write Hebrew. It is an almost compatible superset of ISO 8859 8 the symbols are in the same positions (except for A4, which is sheqel sign in Windows 1255 but generic currency sign in ISO 8859 8), but …   Wikipedia


Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”

We are using cookies for the best presentation of our site. Continuing to use this site, you agree with this.