The Unicode Standard contains characters and scripts from all over the world.
These characters can be used not only for modern communication, but also
to represent the classical forms of many languages. The standard includes the
European alphabetic scripts, Asian ideographic characters, Middle Eastern
right-to-left scripts, and African and American characters and scripts.
Many archaic and historic scripts are encoded.
In addition, the Unicode Standard contains many important symbol sets,
including currency symbols, punctuation marks, mathematical symbols,
technical symbols, geometric shapes, dingbats, and emoji.
Version | Date | Characters | Scripts | Details |
16.0 | 2024 Sep | 154 998 (+5185) | 168 (+7) | Garay, Gurung Khema, Kirat Rai, Ol Onal, Sunuwar, Todhri, Tulu-Tigalari |
15.1 | 2023 Sep | 149 813 (+627) | 161 | Additional CJK ideographs |
15.0 | 2022 Sep | 149 186 (+4489) | 161 (+2) | Kawi and Nag Mundari, 20 emoji, 4192 CJK ideographs, control characters for Egyptian hieroglyphs |
14.0 | 2021 Sep | 144 697 (+838) | 159 (+5) | Toto, Cypro-Minoan, Vithkuqi, Old Uyghur, Tangsa, extended IPA, Arabic script additions for use in languages across Africa and in Iran, Pakistan, Malaysia, Indonesia, Java, and Bosnia, additions for honorifics and Quranic use, additions to support languages in North America, the Philippines, India, and Mongolia, U+20C0, Znamenny musical notation, 37 emoji |
13.0 | 2020 Mar | 143 859 (+5930) | 154 (+4) | Chorasmian, Dhives Akuru, Khitan small script, Yezidi, 4969 CJK ideographs, Arabic script additions for writing Hausa, Wolof, and other African languages, additions for writing Hindko and Punjabi in Pakistan, Bopomofo additions for Cantonese, Creative Commons license symbols, graphic characters for compatibility with teletext and home computer systems, 55 emoji |
12.1 | 2019 May | 137 929 (+1) | 150 | U+32FF (Reiwa) |
12.0 | 2019 Mar | 137 928 (+554) | 150 (+4) | Elymaic, Nandinagari, Nyiakeng Puachue Hmong, Wancho, Miao script, hiragana and katakana small letters, Tamil historic fractions and symbols, Lao letters for Pali, Latin letters for Egyptological and Ugaritic transliteration, hieroglyph format controls, 61 emoji |
11.0 | 2018 Jun | 137 374 (+684) | 146 (+7) | Dogra, Georgian Mtavruli capital letters, Gunjala Gondi, Hanifi Rohingya, Indic Siyaq Numbers, Makasar, Medefaidrin, Old Sogdian and Sogdian, Maya numerals, 5 CJK Unified Ideographs, symbols for xiangqi and star ratings, 145 emoji |
10.0 | 2017 Jun | 136 690 (+8518) | 139 (+4) | Zanabazar Square, Soyombo, Masaram Gondi, Nüshu, hentaigana, 7494 CJK Unified Ideographs, 56 emoji, U+20BF (Bitcoin) |
9.0 | 2016 Jun | 128 172 (+7500) | 135 (+6) | Adlam, Bhaiksuki, Marchen, Newa, Osage, Tangut, 72 emoji |
8.0 | 2015 Jun | 120 672 (+7716) | 129 (+6) | Ahom, Anatolian hieroglyphs, Hatran, Multani, Old Hungarian, SignWriting, additional CJK Unified Ideographs, lowercase letters for Cherokee, 5 emoji skin tone modifiers |
7.0 | 2014 Jun | 112 956 (+2834) | 123 (+23) | Bassa Vah, Caucasian Albanian, Duployan, Elbasan, Grantha, Khojki, Khudawadi, Linear A, Mahajani, Manichaean, Mende Kikakui, Modi, Mro, Nabataean, Old North Arabian, Old Permic, Pahawh Hmong, Palmyrene, Pau Cin Hau, Psalter Pahlavi, Siddham, Tirhuta, Warang Citi, and dingbats |
6.3 | 2013 Sep | 110 122 (+5) | 100 | 5 bidirectional formatting characters |
6.2 | 2012 Sep | 110 117 (+1) | 100 | U+20BA (Turkish Lira) |
6.1 | 2012 Jan | 110 116 (+732) | 100 (+7) | Chakma, Meroitic cursive, Meroitic hieroglyphs, Miao, Sharada, Sora Sompeng, and Takri |
6.0 | 2010 Oct | 109 384 (+2088) | 93 (+3) | Batak, Brahmi, Mandaic, playing card symbols, transport and map symbols, alchemical symbols, emoticons and emoji, additional CJK Unified Ideographs |
5.2 | 2009 Oct | 107 296 (+6648) | 90 (+15) | Avestan, Bamum, Gardiner's sign list of Egyptian hieroglyphs, Imperial Aramaic, Inscriptional Pahlavi, Inscriptional Parthian, Javanese, Kaithi, Lisu, Meetei Mayek, Old South Arabian, Old Turkic, Samaritan, Tai Tham and Tai Viet, additional CJK Unified Ideographs, Jamo for Old Hangul, Vedic Sanskrit |
5.1 | 2008 Apr | 100 648 (+1624) | 75 (+11) | Carian, Cham, Kayah Li, Lepcha, Lycian, Lydian, Ol Chiki, Rejang, Saurashtra, Sundanese, and Vai, sets of symbols for the Phaistos Disc, Mahjong tiles, Domino tiles, additions to Burmese, Scribal abbreviations, U+1E9E (Latin Sharp S) |
5.0 | 2006 Jul | 99 024 (+1369) | 64 (+5) | Balinese, cuneiform, N'Ko, 'Phags-pa, Phoenician |
4.1 | 2005 Mar | 97 655 (+1273) | 59 (+7) | Buginese, Glagolitic, Kharosthi, New Tai Lue, Old Persian, Sylheti Nagri, and Tifinagh, Coptic disunified from Greek, ancient Greek numbers and musical symbols, first named character sequences |
4.0 | 2003 Apr | 96 382 (+1226) | 52 (+7) | Cypriot syllabary, Limbu, Linear B, Osmanya, Shavian, Tai Le, and Ugaritic, Hexagram symbols |
3.2 | 2002 Mar | 95 156 (+1016) | 45 (+4) | Philippine scripts (Buhid, Hanunoo, Tagalog, and Tagbanwa) |
3.1 | 2001 Mar | 94 140 (+44946) | 41 (+3) | Deseret, Gothic and Old Italic, sets of symbols for Western and Byzantine music, 42 711 additional CJK Unified Ideographs |
3.0 | 1999 Sep | 49 194 (+10307) | 38 (+13) | Cherokee, Ge'ez, Khmer, Mongolian, Burmese, Ogham, runes, Sinhala, Syriac, Thaana, Canadian Aboriginal syllabics, and Yi Syllables, Braille patterns |
2.1 | 1998 May | 38 887 (+2) | 25 | U+20AC (Euro Sign), U+FFFC (Object Replacement Character) |
2.0 | 1996 Jul | 38 885 (+11373, -6656) | 25 (+1) | Original set of Hangul syllables removed, new set of 11 172 Hangul syllables added at new location, Tibetan added back in a new location and with a different character repertoire |
1.1 | 1993 Jun | 34 168 (+5963, -89) | 24 (-1) | 33 characters reclassified as control characters, 4306 Hangul syllables added, Tibetan removed |
1.0.1 | 1992 Jun | 28 327 (+21204, -6) | 25 (+1) | The initial 20 902 CJK Unified Ideographs |
1.0 | 1991 Oct | 7 129 | 24 | Initial coverage: Arabic, Armenian, Bengali, Bopomofo, Cyrillic, Devanagari, Georgian, Greek and Coptic, Gujarati, Gurmukhi, Hangul, Hebrew, Hiragana, Kannada, Katakana, Lao, Latin, Malayalam, Odia, Tamil, Telugu, Thai, and Tibetan |
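The figure in parentheses in the Characters column is the number of characters added in that version, so each total should equal the previous version's total plus that figure. The following Python sketch cross-checks this for versions 12.0 through 16.0, using values copied from the table. The names `ROWS` and `check_running_totals` are illustrative only and are not part of any Unicode tooling.

```python
# Rows copied from the table: (version, total characters, characters added).
# Listed oldest first so each row can be compared with its predecessor.
ROWS = [
    ("12.0", 137_928, 554),
    ("12.1", 137_929, 1),
    ("13.0", 143_859, 5930),
    ("14.0", 144_697, 838),
    ("15.0", 149_186, 4489),
    ("15.1", 149_813, 627),
    ("16.0", 154_998, 5185),
]


def check_running_totals(rows):
    """Return the versions whose total differs from previous total + added."""
    mismatches = []
    # Pair every row with the row that follows it.
    for (_, prev_total, _), (version, total, added) in zip(rows, rows[1:]):
        if prev_total + added != total:
            mismatches.append(version)
    return mismatches


if __name__ == "__main__":
    # An empty list means every checked total is consistent with its delta.
    print(check_running_totals(ROWS))
```

The sketch handles additions only. Rows that also record removals or reclassifications, such as the earliest versions, would need those amounts subtracted as well.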