INTERNATIONAL STANDARD IS0 9 Second edition 1995 02 I 5 Information and documentation Transliteration of Cyrillic characters into Latin characters Slavic and non Slavic languages Information et docume[.]
INTERNATIONAL STANDARD IS0 Second edition 1995-02-I Information and documentation Transliteration of Cyrillic characters into Latin characters - Slavic and non-Slavic languages Information et documentation - TranslitGration des caracthes caract&es laths - Langues slaves et non slaves cyrilliques en Reference number IS0 9:1995(E) IS0 9:1995(E) Foreword IS0 (the International Organization for Standardization) is a worldwide federation of national standards bodies (IS0 member bodies) The work of preparing International Standards is normally carried out through IS0 technical committees Each member body interested in a subject for which a technical committee has been established has the right to be represented on that committee International organizations, governmentat and non-governmental, in liaison with ISO, also take part in the work IS0 collaborates closely with the International Electrotechnical Commission (IEC) on all matters of electrotechnical standardization Draft International Standards adopted by the technical committees are circulated to the member bodies for voting Publication as an International Standard requires approval by at least 75 % of the member bodies casting a vote International Standard IS0 was prepared by Technical ISO/TC 46, Information and documentation, Subcommittee version of written languages Committee SC 2, Con- This second edition cancels and replaces the first edition (IS0 9:1986), of which it constitutes a technical revision Annexes A to D of this International IS0 Standard are for information only 1995 All nghts reserved Unless otherwrse specified, no part of thus publrcation may be reproduced or utrlrzed In any form or by any means, electronrc or mechanical, including photocopying and microfilm, without permission in writing from the publisher International Organization for Standardization Case postale 56 W-1 211 Genave 20 Swrtzerland Printed in Switzerland ii IS0 IS0 9:1995(E) Introduction This International Standard is one of a series of International Standards, dealing with the conversion of systems of writing The aim of this International Standard and others in the series is to provide a means for international communication of written messages in a form which permits the automatic transmission and reconstitution of these by men or machines The system of conversion, in this case, must be univocal and entirely reversible This means that no consideration should be given to phonetic and aesthetic matters nor to certain national customs: all these considerations are, indeed, ignored by the machine performing the function The adoption of this International Standard for international communication leaves every country free to adopt for its own use a national standard which may be different, on condition that it be compatible with the International Standard The system proposed herein should make this possible, and be acceptable for international use if the graphisms it creates are such that they may be converted automatically into the graphisms used in any national system, so long as it is strict This International Standard may be used by anyone who has a clear understanding of the system and is certain that it can be applied without ambiguity The result obtained will not give a correct pronunciation of the original text in a person’s own language; but it will serve as a means of finding automatically the original graphism and thus allow anyone who has a knowledge of the original language to pronounce it correctly Similarly, one can only pronounce correctly a text written in, for example, English or Polish, if one has a knowledge of English or Polish The adoption of national standards compatible with this International Standard will permit the representation, in an international publication, of the morphemes of each language according to the customs of the country where it is spoken It will be possible to simplify this representation in order to take into account the extent of the character sets available on different kinds of machine INTERNATIONAL STANDARD IS0 9:1995(E) o IS0 - Transliteration of Information and documentation Cyrillic characters into Latin characters - Slavic and nonSlavic languages Scope This International Standard establishes a system for the transliteration into Latin characters of Cyrillic characters constituting the alphabets of Slavic and non-Slavic languages, in accordance with the principles of stringent conversion in order to permit international information exchange, particularly by electronic means For the transliteration of Slavic Cyrillic characters, tables and reproduce the tables published in the first edition of IS0 9:1986; for the transliteration of Cyrillic characters constituting the alphabets of non-Slavic languages, table adopts the transliteration of tables and for all characters similar to those of Slavic languages and gives equivalents for all supplementary characters introduced in the alphabets of non-Slavic languages Table includes in a single sequence, listed in the Cyrillic alphabetic order, the 118 single or diacriticcarrying characters that appear in one or another of the considered alphabets The list of the languages written in these alphabets is given in annex C General principles writing systems of conversion of 2.1 The words in a language, which are written according to a given script (the converted system), sometimes have to be rendered according to a different system (the conversion system) normally used for a different language The procedure is often used for historical or geographical texts, cartographical documents and in particular bibliographical work where characters must be converted from different writing systems into a single alphabet to allow for alphabetical intercalation in bibliographies, catalogues, indexes, toponymic lists, etc It is indispensable in that it permits the univocal transmission of a written message between two countries using different writing systems, or exchanging a message the writing of which is different from their own It thereby permits transmission by manual, mechanical as well as electronic means The two basic methods writing are transliteration of conversion of a system and transcription of 2.2 Transliteration is the process which consists of representing the characters” of an alphabetical or syllabic writing by the characters of a conversion alphabet In principle, the conversion should be made character by character: each character of the converted graphical system is rendered by only one character of the conversion alphabet, this being the easiest way to ensure the complete and unambiguous reversibility of the conversion alphabet in the converted system When the number of characters used in the conversion system is smaller than the number of characters of the converted system, it is necessary to use digraphs or diacritical marks In this case, arbitrary choices and the use of purely conventional marks shall be avoided as far as possible, and a certain phonetic logic shall be maintained in order to give the system a wide acceptance However, it must be accepted that the graphism obtained cannot always be correctly pronounced according to the phonetic habits of the language (or of all the languages) which usually use(s) the conversion alphabet On the other hand this graphism shall be such that the reader who has a knowledge of the converted language may mentally restore unequivocally the original graphism and thus pronounce it 1) A character is an element of an alphabetical or other type of writing system that syllable, a word or even a prosodrcal characteristic of a given language It IS used either tdeographical character, a dig& a punctuation mark) or in combination (e.g an accent, accent or a diacritical mark, for example 8, 8, 6, is therefore a character in the same way graphically alone (e.g a dracntrcal as a basic represents a phoneme, a a letter, a syllabrc sign, an mark) A letter having an letter IS0 9:1995(E) IS0 2.3 Retransliteration is the process whereby the characters of a conversion alphabet are transformed back into those of the converted writing system It is the exact opposite of the transliteration process in that the rules of a transliteration system are applied in reverse in order to reconvert the transliterated word to its original form 2.4 Transcription is the process whereby the pronunciation of a given language is noted by the system of signs of a conversion language A transcription system is of necessity based on the orthographical conventions of the conversion language Transcription is not strictly reversible Transcription may be used for the conversion of writing systems It is the only method that can used for systems that are not entirely alphabetical syllabic and for all ideophonographical systems writing like Chinese all be or of 2.5 To carry out romanization (the conversion of non-Latin writing systems to the Latin alphabet) either transliteration or transcription or a combination of the two may be used depending on the nature of the converted system 2.6 A conversion system proposed for international use may call for compromise and the sacrifice of certain national customs It is therefore necessary for each community of users to accept concessions, fully abstaining in every case from imposing as a matter of course solutions that are actually justified only by national practice (regarding pronunciation and orthography) When a country uses two systems univocally convertible one into the other to write its own language, the system of transliteration thus implemented shall be taken a priori as a basis for the international standardized system, as far as it is compatible with the other principles exposed hereafter 2.7 When necessary, the conversion systems should specify an equivalent for each character, not only the letters but also the punctuation marks, numbers, etc They should similarly take into account the arrangement of the sequence of characters that make up the text, for example the direction of the script, and specify the way of distinguishing words and of using separation signs, following as closely as possible the customs of the language(s) which use the converted writing system 2.8 When romanizing a script which has no uppercase characters, it is usual to capitalize some words, following national usage Principles alphabetical 3.1 of conversion for writing systems The conversion may be made at various levels The first level is that of completely reversible stringent transliteration which is necessary to attain in full the aim given in 2.2 This conversion applies all principles of transliteration without exception It does not permit variants The conventional systems of stringent transliteration should be applied as such without any change to meet national or regional customs as regards pronunciation or orthography They permit the univocal international transmission of messages by mechanical or electronic means To permit an international unequivocal communication, International Standards on transliteration must apply by priority the principle of stringent conversion These can then be used as a basis for the establishment of rules for simplified conversion and for preparation of national standards The second level is that of simplified conversion The simplification can be made necessary, for example, by the use of machines that not accept all the alphabet characters required for stringent conversion The method of conversion may allow national or regional variants, which may not permit complete reversibility The simplified conversion may be the subject of International Standards or agreements The third level is that of popular conversion which, for example, should enable the same foreign names to be written in a uniform manner in the newspapers of a given country It is obliged to take into account phonetic or graphic practices and therefore can only be national 3.2 In cases where the same characters appear in one alphabet used with some differences by different languages, these characters would be transliterated in the same way, irrespective of the language they belong to 3.3 If the converted alphabet gives a different form to the same character according to its place in the word (as is the case for example in the Arabic, Hebrew and Greek alphabets), the conversion alphabet will use only one character of constant form IS0 9:1995(E) IS0 Transliteration table Table - Cyrillic written laAaJ 36 B B r r s JJ, J-J @ E lo e 11 x l2 table for Slavic Cyrillic Transliteration into Latin characters from Cyrillic characters of Slavic alphabets (Bulgarian, Byelorussian, Macedonian, Russian, Serbo-Croatian, Ukrainian) character printed lo General J characters Respective languages Examples a A all anpec adres b B ait 6a6a baba b v v d Bbl VY I-OJIOBa golova t z g G all d y J) d D all na da [ E beru &Ka &lka (f$ E uk TBOE: tVO& z a" XypHan Zumal z a’* 3Be3na zvezda e’ c f G x x # x 3 IS0 IS0 9: 1995(E) Table - Cyrillic character No printed - 13 - 14 s s T - Transliteration into Latin characters from Cyrillic characters of Slavic alphabets (Bulgarian, Eyelorussian, Macedonian, Russian, Setbo-Croatian, Ukrainian) 3espective languages Examples h A mk SBe3Ail -2vczda KHHra kniga z % I I a* I J J J J li K L A L n-l M n N h l IMI vgmk ru sr uk I 15 written (continued) i I +I \ bilij be uk I 16 17 - I I J J uk hAa mk sr jeuaH i’zda V V jedan t l 18 WpBLIfi pew.i all KZlK kak all nuna lipa be bg I-U uk t - 19 - 20 - A 21 - 22 - 23 - 24 - MM H H WY-Ii n N mk $r all all mk sr Jhyt5aB MYX kxlv mui HlixHati ” nlzmJ IbNBil iha IS0 9:1995(E) IS0 Table - Cyrillic character uo 2s 26 27 written printed n rI P P Transliteration into Latin characters from Cyrillic characters of Slavic alphabets (Bulgarian, Byelowssian, Macedonian, Russian, Serbo-Croatian, Ukrainian) 0 l.ln P P P I” 28 33 34 35 F I lespective Ianguages Examples all 36UECTBO ohZestvo all nap pS.l R all phr6a ryba -4 all ccc-rpa sestra TOBapHLU tovaii2 T r‘\ c 29 32 (continued) U all Kyrla k&a mk Kylia all yTp0 be CnoptliK slo%ik all QA3UKa fizika all ;UMUYCCKM himiEeskij all [CHTPaJIbHbI oentral’nyj V utro -u I f 11 H C IS0 IS0 9:1995(E) Table - Cyrillic No written e (J NvJ-Jfy J D 39IIIJTp~jy g s iy s s ” j7 ho Y q 42 qJT[ bl y z cb~% f,I 43bb u u 6 3 all mk sr all bg ru uk rr J7 y r / Respective languages bg ru Examples WCM uawja Easy t?amiya LLIKOXI Skola LLLMT Ziit 06wmnemic oblvlenie by1 be ru 6bHl be bg ru uk anb6oM al’bom 3TO fh EllY~j p beru ~5mJ&o~ fi (J be bg ru uk MXHbIii -uLz~ ii A be bg ru uk 5ih4a bc mk uk ‘ph ‘@a 44 t7 3 * NOTE - For the diacritical Transliteration into Latin characters from Cyrillic characters of Slavic alphabets (Bulgarian, Byelorussian, Macedonian, Russian, Serbo-Croatian, Ukrainian) character printed (concluded) f signs used, see annex A IS0 9:1995(E) IS0 Table - Complementary Cyrillic table for the Slavic Cyrillic characters used by some communities outside the boundaries of their native countries character , No printed written Transliteration into Latin characters from Cyrillic characters of Slavic alphabets i - i; established Examples @CT &-lSt r 5’ 52 v V VVY NOTE - Character 48 is also used in the Ukraine 888 f l? ‘;I 2;CTb &st’ MRiKZl m’ika Kaoenpa MVpO kahxira mire IS0 9:1995(E) Table - Table for Cyrilfic No characters of non-Slavic languages No No No No 25 49 73 97 26 50 74 98 27 51 75 99 28 52 76 100 29 53 77 101 30 54 78 102 31 55 79 103 32 56 80 104 33 57 81 105 10 34 58 82 106 11 35 59 83 107 12 36 60 84 108 13 37 61 85 109 14 38 62 86 110 15 39 63 87 111 16 40 64 88 17 41 65 89 18 42 66 90 19 43 67 91 20 44 68 92 21 45 69 93 22 46 70 94 23 47 71 95 24 48 72 96 W f x h Y b uz C TIC c* v d Y c ‘zi c 112 113 114 115 116 117 118 IS0 9:1995(E) IS0 Annex A (informative) Diacritical The diacritical IS0 5426 signs used for the transliteration of Slavic Cyrillic characters (tables and 2) signs used in tables and of this International Standard are taken from the code table of Table A.1 indicates their position in that code table Table A.1 No Position Diacritical in the code table signs used in tables No Position and in the code table 412 40 43 10 413 41 3114 11 4115 43 3113 13 413 44 4/l 15 4/l 45 413 16 17 41% 4f15 46 1) 413 21 413 48 4/l 24 413 49 4115 30 412 50 4115 31 412 51 411 33 4115 52 411 37 4115 38 4J3 39 4115 1) Character 47 IS an apostrophe coded 2/7 rn ISOIIEC 646 IS0 9:1995(E) IS0 Annex B (informative) Diacritical The diacritical table signs used for the transliteration of Cyrillic of non-Slavic languages (table 3) signs used are taken from the code table of IS0 5426 Table B.l indicates Table B.l No of transliterated 2, 3, 4, 5, 19, 62, 14, 22, characters 26, 34, 59, 79, 98, 107 84 18,23, 56,101 29, 42, 57, 65, 80, 93, 108 Diacritical Position their position in the code signs used in table in the code table 418 418 + 5/S 416 415 7/l 7, 12, 27, 30, 37, 47, 55, 64, 68, 83 8, 114 13, 28, 32, 53, 86 15, 39, 52, 97 102 412 4110 4l7 20, 35, 36,41, 76, 81, 95, 103 21 25,43, 48, 51, 72, 77,91, 96 31, 40, 60, 94, 99, 104, I 13, 115 4/l 4/l f 512 33, 44, 54, 63, 69, 73, 75, 85, 100, 110, 112 61, 82, 111 87 66 105 109 116 117 118 411 4/l 415 + 516 7/l 3/14 3113 316 10 characters 516 416 + 5f3 512 413 4/l 418 Name Diaeresis Diaeresis + Dot below Breve Matron Ligature ae - Small letter Acute accent Circle above Dot above Dot below Breve + Hook to right Hazek HaEek + Hook to left Hook to left Circumflex accent Grave accent Double acute accent Matron + Dot below Ligature oe - Small letter Tverdyj znak Mjagkij znak Double dagger Grave accent (used alone) Diaeresis (used alone) IS0 IS0 9:1995(E) Annex C (informative) List of languages Russian name covered by table English French name name a6a3nHcKtiti 4baza jbaza 36xa3ctW7 4bkhaz 3bkhaze aaapcKuR 4war 3var aflbtreAcKfifi (qepKeccKH@ 4dyge (Clrcasslan) jdygh6 a3ep6atiflmaHcKLdfi 4ze rl 3z& anTailcKL1L? 4ltay 3ltaien 6anKapcKvlA Balkar >alkar 6aWKIIpCKHn Bashklr Jachklr 6ypfl~C~Mfi Buryat 3ouriate raray3cKnri Gagauzi gagaouze naprvlHcKnB Dargwa jargwa JlonraHcKllti Dolgan jolgane nyHraHcKMti Dungan joungane (clrcassien) lngouche LlHQUJCKlnti lngush Ka6apnHHCKUti (V?pKeCCKW?) Kabardlan Ka3axcKMA Kazakh kazakh KaflMbl~Kklti Kalmyk kalmouk KapallMcKMI;I Karalm ca ra?te KapaKannaKcKMti Karakalpak karakalpak KaparaeBcKMh Karachay karatcha’i KapenbcKflti Karellan carelien KeTCKMti Ket k&e KOMhl-3blpRlJKtlti Komi-Zynan koml-zyr@ne (Clrcasslan) kabarde (clrcasslen) KOMM-IlepMRlJKML? Koml-Permian koml-permlen KOPRKCKHti Koryak korlak KpblMCKO-TaTapCKklL? Cnmean Tatar tatar de Cnmee KyMblKCKflti Kumyk koumyk KyPACKMI;( Kurdish kurde Kblprbl3CKmi Klrglz klrghlze naKcKL4ti Lak lak ne3rmHcKml;l Lezglan lezghlen MaHCklikKklh Mans.1 mansi MapLltiCKklti (rOPHblL;1) Mari (high) mari (haut) Mapkli?CKnti (nyrosor;l) Man (low) marl (has) MOnnaBCKllti Moldavlan moldave MOHrOnbCKnil Mongolian mongol MOpROBCKO-MOKUJaHCKMti Mordvin-Moksha mordve-mokcha MOpflOBCKO-3p3RHCKUfi Mordvln-Erza mordve-erza HaHaticKtiti Nanay nana’i HraHacaHcKt?ti Nganasan nganassane HeHeuKLIfi Nenets rknetse HMBXCKML? Nlvkh nlvkhe HoraticKmti Noghay n0ga.i 11 IS0 9:1995(E) Russian IS0 English name name French OCeTMHCKMti Ossetic os&te CaaMcKVlti Saam same CeJl bKynCKn ti Selkup selkoupe TaGacapaHcKLlh Tabasaran tabassarane TaRWlKCKMci Tajk tadji k TaTapCKMh Tatar tatar TaTCUlL? Tat tate To~anapcKMLl Tafalar tofalar TyBtlHCKllfi Tuva touva TypKMeHCKLdl Turkmen turkmene oudmourte yLlMypTCKl4ti Udmurt yn3(re)ticwti Udegey oudegu6 y36eKCKMti Uzbek ouzbek ytirypCKt4ti Uyghur 0uTgour YJlbCICKMl? Ulch oultche XaKaccKHR Khakass khakasse Vakh Khanty khanty de Vakh XaHTbll?CWti (BaXOBCKLlti) XaHTblhCKCIh (Ka3blMCKLdti) Kazym Khanty khanty de Kazym XaHTbltiCKC1h (CypQ’TCUlti) Surgut Khanty khanty de Sourgout XaHTbltiCKMh Shurykshar (lJJypblKUJapCKLlti) Khanty khanty de Chourykchar UblraHCKLlti Tsigan tsigane qereHcKl&l Chechen tch&tch&ne ryBaWCKL4ti Chuvash tchouvache qyKOTCKLlfi Chukcha tchouktche ILIOPCKLI~ Shor char uJyrHaHCKl4ti Shugnan chougnane 3BeHKflfiCKW? Evenkl evenke 3BeHCKMti Even evhne 3CKHMOCCKMh Eskimo esquimau IoKarclpcKtcll? Yukagir loukaguir f13Q4lRMCKL4ti Yazgulam iazgoulame FIKylCKVll? Yakut iakoute 12 name IS0 9:1995(E) IS0 Annex D (informative) Bibliography [ I] ISO/I EC 646: 1991, information [Z] IS0 5426:1983, technology - /SO 7-bit coded character set for information Extension of the Latin alphabet coded character set for bibliographic interchange information Interchange 13 IS0 IS0 9:1995(E) KS 01.140.10 Descriptors: documentatron, Price based on 13 Pages transllteratlon, letters (symbols), Cyrtlk characters, Latin characters