ជំនួយ:Multilingual support
អត្ថបទនេះត្រូវការបកប្រែទៅជាភាសាខ្មែរ។ អត្ថបទនេះត្រូវបានសរសេរជាភាសាផ្សេង ដែលមិនមែនជាភាសាខ្មែរ។ បើសិនជាអត្ថបទទុកសម្រាប់អ្នកអានមកពី សហគមន៍នៃភាសាមួយនេះ វាគួរតែចែកចាយទៅវិគីភីឌាជាភាសានោះ។ សូមមើល បញ្ជីនៃគម្រោងវិគីភីឌាទាំងអស់។ សូមមើល ច្រកចូលអត្ថបទនេះ លើក្រុមទំព័រដែលត្រូវការបំណកប្រែទៅជាភាសាខ្មែរ ដើម្បីពិភាក្សា ។ ប្រសិនបើ អត្ថបទមិនត្រូវបានសរសេរជាភាសាខ្មែរឡើងវិញទេ ក្នុងរយៈពេលពីរសប្ដាហ៍ទៀត អត្ថបទនឹងត្រូវចុះបញ្ជីដើម្បីលុបចោល និង/ឬ ប្ដូរវាទៅកាន់វិគីភីឌាជាភាសាដើមរបស់វាវិញ ។ បើសិនជាលោកអ្នក គ្រាន់តែចង់បិទស្លាកទំព័រនេះត្រូវការបំណកប្រែ សូមបញ្ចូល {{អាទេស.:ត្រូវការបកប្រែ | ទំ. = Multilingual support | ភាសា = មិនស្គាល់ | ផ្ដល់យោបល់ = }} ~~~~ ទៅខាងក្រោម នៃផ្នែក នៃក្រុមទំព័រនេះត្រូវការបំណកប្រែទៅជាភាសាខ្មែរ ។ |
Articles on the English Wikipedia may contain words or texts written in different languages and scripts. To be able to correctly view and edit these articles requires that you have the appropriate fonts installed and to have correctly configured your operating system and browser. This guide will help you to do so.
Overview
កែប្រែយូនីកូដ
កែប្រែArticles on Wikipedia are encoded using Unicode (specifically UTF-8)[១], an industry standard designed to allow text and symbols from all of the writing systems of the world to be consistently represented and manipulated by computers. Because UTF-8 is backwards compatible with ASCII, and most modern browsers have at least basic Unicode support, most users will experience little difficulty reading and editing Wikipedia.
For older browsers, MediaWiki, the Wikipedia software, serves the wikitext in a safe mode upon editing. Characters that cannot be represented in ASCII are temporarily converted to hexadecimal character references, looking like ሴ. Existing hexadecimal character references get an additional leading zero so they are not converted to actual characters when the page is saved, and look like ሴ. Likewise, to create a hexadecimal character reference in safe mode, not the character itself, a leading zero should be added. One can check whether safe mode is used by editing this section. If M looks like M rather than M, safe mode is used.
Font
កែប្រែMost computers with Microsoft Windows, Apple's OS X and many Linux variants will already have fonts with support for Latin, Greek, Cyrillic, Hebrew, Arabic, Chinese, Japanese, Korean and the International Phonetic Alphabet installed. Many mobile devices, such as the iPhone and iPad also include such fonts. Several historic and accented characters (used in the transliteration of foreign scripts) may be missing, though.
Microsoft fonts include:
Font | Product | Scripts |
---|---|---|
Arial Unicode MS [១] |
|
|
Lucida Sans Unicode [២] | ||
Tahoma [៣] | ||
Microsoft Sans Serif [៤] |
- Arial Unicode MS
- supports a wide number of scripts, but is of a slightly lower quality than Arial because it lacks kerning and is not smoothed. It contains a small bug which causes double-wide diacritics to be placed on the wrong characters.
- Lucida Sans Unicode
- has a slightly smaller character repertoire than that of Arial Unicode MS, but is more legible.
- Tahoma
- has a slightly smaller character repertoire than that of Arial Unicode MS, but is more legible.
- Microsoft Sans Serif
- has better support for historical and accented Latin characters. (Note that this is a different font than MS Sans Serif, a bitmapped font that shipped with older versions of Windows.)
Other available unicode fonts
កែប្រែFont | Typeface | Sample | License | Format | Encoding |
---|---|---|---|---|---|
Aboriginal | sans serif, serif | Freeware | OpenType | Unicode 5.2 | |
Charis SIL | serif | Open Source | OpenType | Unicode 5.1 | |
Code2002 | Freeware (must not be altered) | Unicode, plane 2 | |||
Code2001 0.919 | Freeware (must not be altered) | Unicode, plane 1 | |||
Code2000 1.171 | sans-serif | Shareware (unrestricted) | TrueType | Unicode, plane 0 | |
DejaVu | Sans, Sans Mono and Serif | Open Source | OpenType | Unicode 5.1 | |
Doulos SIL | serif | Open Source | OpenType | Unicode 5.1 | |
Everson Mono 3.2b4 | monospace | Shareware | TrueType | Unicode | |
TITUS Cyberbit Basic | serif | Non-commercial | Unicode 4.0 | ||
Unicode Fonts for Ancient Scripts (Greek, Egyptian, cuneiform...) | Aegean, Aegyptus, Akkadian, Alexander, Analecta... | , Ͱ | |||
Japanese |
Browsers
កែប្រែ- Internet Explorer
- supports Latin (however not all extended sets), Greek, Cyrillic, Arabic and Hebrew. Support for East Asian and some Indic scripts is available if support for this has been installed for Windows. As Internet Explorer will only use the default font for other scripts, those are usually not supported (unless the default font does).
- Firefox
- tries to render any character using all the fonts available on the system so multilingual support is generally good. The default rendering engine does not support complex script rendering, however. Some Linux distributions ship with a Pango-based rendering engine which does, this may currently cause some display glitches with justified text, though.
- Opera
- tries to render any character using all the fonts available on the system so multilingual support is also good.[២] Opera uses the operating system to perform contextual glyph selection, ligature forming, character stacking, combining character support and other character shaping tasks.[៣]
- Chrome
- Renders many, but not all characters... Does not render Oriya, Sinhala and Tibetan scripts from examples below, while Firefox doesn't render Sinhala only.
Scripts
កែប្រែភូមា
កែប្រែAvailable fonts
កែប្រែFont | License | យូនីកូដ | OpenType | AAT | Graphite |
---|---|---|---|---|---|
Padauk 2.6 | OFL | × | × | × | |
Parabaik | OFL, GPL | × | × | ||
Parabaik Sans | OFL, GPL | × | × | ||
Myanmar3 Myanmar3 from BBC website |
LGPL | × | × | ||
Myanmar2 | LGPL | × | × | ||
WinUni Innwa | Freeware | × | × |
Canadian Syllabics (Inuktitut, Cree)
កែប្រែ- Aboriginal Sans (Unicode), from LanguageGeek
Cherokee
កែប្រែ- Digohweli, from LanguageGeek
កូបទិក
កែប្រែThis is the Language used in Egypt before Arabic. It is currently used solely as a liturgical language.
- Quivira 3.5: Use this for the best Coptic letter/ word spacing and sizing. It provides full Unicode support for all Coptic letters.
- GNU FreeSerif
- Antinoou is a new Sahidic Coptic unicode font, which will probably become standard for Sahidic.
- Alphabetum is a commercial unicode font, but it is the only font that provides Bohairic Coptic letters rather than Sahidic.
Deseret
កែប្រែEast Asian
កែប្រែScript | Correct rendering | កុំព្យូទ័ររបស់អ្នក |
---|---|---|
តួអក្សរចិនបុរាណ |
人人生來自由, | |
Simplified Chinese |
人人生来自由, | |
Japanese |
すべての人間は、生まれながらにして自由であり、 | |
ភាសាកូរ៉េ |
모든 인간은 태어날 때부터 |
Ethiopic
កែប្រែThe Ethiopic syllabary is used in central east Africa for Amharic, Bilen, Oromo, Tigré, Tigrinya, and other languages. It evolved from the script for classical Ge'ez, which is now strictly a liturgical language.
Font | គំរូ | License | Format | ការអ៊ិនកូដ |
---|---|---|---|---|
Abyssinica SIL | OFL | OpenType, AAT and Graphite | Unicode 4.1 + SIL PUA | |
Code2000 1.16 | Shareware | TrueType | Unicode | |
Ethiopia Jiret | GPL2 | Unicode 3.0 | ||
Everson Mono | Shareware | TrueType | Unicode | |
GF Zemen Unicode | GPL2 | TrueType | Unicode | |
TITUS Cyberbit | Non-commercial | Unicode 4.0 |
Indic
កែប្រែThe following table compares how a correctly enabled computer would render the following scripts with how your computer renders them:
Script | Correct rendering | កុំព្យូទ័ររបស់អ្នក |
---|---|---|
Bengali | ক + ি → কি | |
Devanāgarī | क + ि → कि | |
Gujarati | ક + િ → કિ | |
Gurmukhī | ਕ + ਿ → ਕਿ | |
Kannada | ಕ + ಿ → ಕಿ | |
Malayalam | ക + െ → കെ | |
Oriya | କ + େ → କେ | |
Sinhala | ඵ + ේ → ඵේ | |
Tibetan | ར + ྐ + ྱ → རྐྱ | |
Tamil | க + ே → கே | |
Telugu | య + ీ → యీ |
Old Persian cuneiform
កែប្រែThe Old Persian cuneiform script was used to write the Old Persian language. The script is encoded in block "Old Persian", code points 103A0–103DF (Unicode.org chart). It is supported by the following fonts:
- Aegean (free font)
Correct rendering | កុំព្យូទ័ររបស់អ្នក | Transliteration |
---|---|---|
𐎣𐎲𐎢𐎪𐎡𐎹 | Kambujiya (Cambyses II) |
Old Tagalog/Baybayin
កែប្រែBaybayin (also known as the Tagalog script in Unicode and Alibata) is a form of pre-Spanish Philippine writing system in which modern minority scripts in the Philippines has descended.
Correct rendering | កុំព្យូទ័ររបស់អ្នក |
---|---|
Download and installation:
- Paul Morrow's Baybayin Fonts. Offers the most extensive list of Baybayin fonts for Windows and Macintosh operating systems.
- PNKL is a free unicode font support which defines own assignment of Baybayin alphabet to a normal keyboard. Available for Windows and Linux users.
Sundanese
កែប្រែThe Sundanese script is used to write the Sundanese language. The script is encoded in block "Sundanese", code points 1B80–1BBF (Unicode.org chart). It is supported by the following fonts:
- Sundanese Unicode (direct download link) (free font)
Syriac/Aramaic script
កែប្រែSyriac and Aramaic scripts like most Semitic scripts flow from right-to-left which can cause letter to appear in the wrong order. The tag {{rtl-lang}} can be used to fix this issue.
- Meltho OpenType™ Syriac Fonts (free font).
Script | Correct rendering | កុំព្យូទ័ររបស់អ្នក |
---|---|---|
Madnḥāyā | ទំព័រគំរូ:Script/Mdnh | |
Serṭā | ទំព័រគំរូ:Script/Serto | |
Estrangelo | ទំព័រគំរូ:Script/Strng |
Most operating system provide support for Syriac script natively, however only the Madnḥāyā variety (ទំព័រគំរូ:Script/Mdnh) is rendered correctly. In order to render the Serṭā (ទំព័រគំរូ:Script/Serto) and Estrangelo (ទំព័រគំរូ:Script/Strng) varieties, additional fonts are needed. This is supported by the following fonts:
ករណីពិសេស
កែប្រែអេស្ប៉ារ៉ាន់តូ
កែប្រែIn edit box | In database and output |
---|---|
S | S |
Sx | Ŝ |
Sxx | Sx |
Sxxx | Ŝx |
Sxxxx | Sxx |
Sxxxxx | Ŝxx |
ការដំឡើង Mediawiki ដែលបានកំណត់រចនាសម្ព័ន្ធសម្រាប់ អេស្ប៉ារ៉ាន់តូ sប្រើ UTF-8 សម្រាប់ការផ្ទុក និងការបង្ហាញ។ ទោះយ៉ាងណាក៏ដោយ នៅពេលកែសម្រួលអត្ថបទត្រូវបានបំប្លែងទៅជាទម្រង់ដែលត្រូវបានរចនាឡើងដើម្បីឱ្យកាន់តែងាយស្រួលក្នុងការកែសម្រួលដោយប្រើក្តារចុចស្តង់ដារ។
តួអក្សរដែលអនុវត្តគឺ៖ Ĉ, Ĝ, Ĥ, Ĵ, Ŝ, Ŭ, ĉ, ĝ, ĥ, ĵ, ŝ, ŭ។ អ្នកអាចបញ្ចូលវាដោយផ្ទាល់នៅក្នុងប្រអប់កែសម្រួល ប្រសិនបើអ្នកមានឧបករណ៍សម្រាប់ធ្វើដូច្នេះ។ ទោះយ៉ាងណាក៏ដោយ នៅពេលអ្នកកែសម្រួលទំព័រម្តងទៀត អ្នកនឹងឃើញពួកវាត្រូវបានអ៊ិនកូដជា Sx ។ ទម្រង់នេះត្រូវបានគេហៅថា "x-sistemo" ឬ "x-kodo" ។ ដើម្បីរក្សាសមត្ថភាពធ្វើដំណើរជុំវិញនៅពេលដែល x មួយ ឬច្រើនធ្វើតាមតួអក្សរទាំងនេះ ឬទម្រង់មិនសង្កត់សំឡេងរបស់ពួកគេ (C, G, H, J, S, U, c, g, h, j, s, u) ចំនួននៃ x ក្នុងប្រអប់កែសម្រួលគឺជាចំនួនទ្វេដងក្នុងអត្ថបទដែលបានរក្សាទុកពិតប្រាកដ។
For example, the interlanguage link [[en:Luxury car]] to en:Luxury car has to be entered in the edit box as [[en:Luxxury car]] on eo:. This has caused problems with interwiki update bots in the past.
Romanian
កែប្រែThe Romanian alphabet contains an S-comma (Ș ș) and T-comma (Ț ț). These characters were added to Unicode 3.0 at the request of the Romanian standardization institute. Font support for these characters is poor, so the Romanian Wikipedia represents these letters with an S-cedilla (Ş ş) and T-cedilla (Ţ ţ) instead.[៤]
សូមមើលផងដែរ
កែប្រែNotes
កែប្រែ- ↑ Until June 2005, when MediaWiki 1.5 came into use on the Wikimedia projects, articles on the English Wikipedia were encoded using ISO/IEC 8859-1 (although the additional characters from the Windows-1252 character set were used in practice.) All characters from the ISO/IEC 10646 Universal Character Set could be accessed through numerical entities, as specified by the HTML 4.01 specification. Since, nearly all pages have been converted to use Unicode directly.
- ↑ http://www.opera.com/support/kb/view/435/
- ↑ http://www.opera.com/docs/specs/#text
- ↑ សូមមើលផងដែរ ro:Wikipedia:Diacritice