Character Set
After specifying the RTF version, you must declare the default character set used in the document unless it is \
ansi (the default). The control word for the character set must precede any plain text or any table control words. The RTF Specification supports the following document character sets <character set>
<character set> (\
ansi | \mac | \pc | \pca)? \ansicpgN?
where the control words are defined by
Control word Character set
\
ansi ANSI (the default)
\mac Apple Macintosh
\pc IBM PC code page 437
\pca IBM PC code page 850, used by IBM Personal System/2 (not implemented in version 1 of Microsoft Word for
OS/2)
\ansicpgN This keyword represents the default
ANSI code page used to perform the
Unicode to
ANSI conversion when writing RTF text. N represents the code page in decimal. This is typically set to the default
ANSI code page of the
run-time environment (for example, \ansicpg1252 for U.S. Windows). The reader can use the same
ANSI code page to convert
ANSI text back to
Unicode. If it appears, this keyword should be emitted in the RTF header section right after the \
ansi, \mac, \pc or \pca keyword. Possible values include those in the following table.
\fbidis Flag written by RichEdit to indicate a single font is active instead of a set of associated fonts.
Code page Name
437 United States IBM
708 Arabic (ASMO 708)
709 Arabic (ASMO 449+, BCON V4)
710 Arabic (transparent Arabic)
711 Arabic (Nafitha Enhanced)
720 Arabic (transparent ASMO)
819 Windows 3.1 (United States and Western Europe)
850 IBM multilingual
852 Eastern European
860 Portuguese
862 Hebrew
863 French Canadian
864 Arabic
865 Norwegian
866 Soviet Union
874 Thai
932 Japanese
936 Simplified Chinese
949 Korean
950 Traditional Chinese
1250 Eastern European
1251 Cyrillic
1252 Western European
1253 Greek
1254 Turkish
1255 Hebrew
1256 Arabic
1257 Baltic
1258 Vietnamese
1361 Johab
10000 MAC Roman
10001 MAC Japan
10004 MAC Arabic
10005 MAC Hebrew
10006 MAC Greek
10007 MAC Cyrillic
10029 MAC Latin2
10081 MAC Turkish
57002 Devanagari
57003 Bengali
57004 Tamil
57005 Telugu
57006 Assamese
57007 Oriya
57008 Kannada
57009 Malayalam
57010 Gujarati
57011 Punjabi
Note that runs of text marked with a particular font index (see \fN in the Font Table section) use the codepage for that font as given by \cpgN or implied by \fcharsetN, unless they use
Unicode RTF described in the following section.