Unicode

The Unicode Standard contains characters and scripts from all over the world. These characters can be used not only for modern communication, but also to represent the classical forms of many languages. The standard includes the European alphabetic scripts, Asian ideographic characters, Middle Eastern right-to-left scripts, and African and American characters and scripts. Many archaic and historic scripts are also encoded.

In addition, the Unicode Standard contains many important symbol sets, including currency symbols, punctuation marks, mathematical symbols, technical symbols, geometric shapes, dingbats, and emoji.
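As a small illustration of these symbol sets, the sketch below uses Python's standard `unicodedata` module to look up the official name and general category of a few sample characters. The helper name `describe` and the choice of samples are assumptions made for this example only; which characters resolve depends on the Unicode version bundled with the Python build, reported by `unicodedata.unidata_version`.

```python
import unicodedata


def describe(ch):
    """Return (code point, character name, general category) for one character."""
    return (f"U+{ord(ch):04X}", unicodedata.name(ch), unicodedata.category(ch))


# Hypothetical sample: a currency symbol, a punctuation mark,
# a mathematical symbol, a geometric shape, and an emoji.
samples = ["\u20AC", "\u203D", "\u2211", "\u25A0", "\U0001F600"]

print("Unicode data version:", unicodedata.unidata_version)
for ch in samples:
    print(*describe(ch))
```

The general category codes follow the standard's conventions: `Sc` is a currency symbol, `Sm` a mathematical symbol, `So` another symbol, and `Po` other punctuation.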

Version Date Characters Scripts Details
16.0 2024 Sep 154 998 (+5185) 168 (+7) Garay, Gurung Khema, Kirat Rai, Ol Onal, Sunuwar, Todhri, Tulu-Tigalari
15.1 2023 Sep 149 813 (+627) 161 Additional CJK ideographs
15.0 2022 Sep 149 186 (+4489) 161 (+2) Kawi and Mundari, 20 emoji, 4192 CJK ideographs, control characters for Egyptian hieroglyphs
14.0 2021 Sep 144 697 (+838) 159 (+5) Toto, Cypro-Minoan, Vithkuqi, Old Uyghur, Tangsa, extended IPA, Arabic script additions for use in languages across Africa and in Iran, Pakistan, Malaysia, Indonesia, Java, and Bosnia, additions for honorifics and Quranic use, additions to support languages in North America, the Philippines, India, and Mongolia, U+20C0, Znamenny musical notation, 37 emoji
13.0 2020 Mar 143 859 (+5930) 154 (+4) Chorasmian, Dhives Akuru, Khitan small script, Yezidi, 4969 CJK ideographs, Arabic script additions used to write Hausa, Wolof, and other African languages, additions used to write Hindko and Punjabi in Pakistan, Bopomofo additions for Cantonese, Creative Commons license symbols, graphic characters for compatibility with teletext and home computer systems, 55 emoji
12.1 2019 May 137 929 (+1) 150 U+32FF (Reiwa)
12.0 2019 Mar 137 928 (+554) 150 (+4) Elymaic, Nandinagari, Nyiakeng Puachue Hmong, Wancho, Miao script, hiragana and katakana small letters, Tamil historic fractions and symbols, Lao letters for Pali, Latin letters for Egyptological and Ugaritic transliteration, hieroglyph format controls, 61 emoji
11.0 2018 Jun 137 374 (+684) 146 (+7) Dogra, Georgian Mtavruli capital letters, Gunjala Gondi, Hanifi Rohingya, Indic Siyaq Numbers, Makasar, Medefaidrin, Old Sogdian and Sogdian, Maya numerals, 5 CJK Unified Ideographs, symbols for xiangqi and star ratings, 145 emoji
10.0 2017 Jun 136 690 (+8518) 139 (+4) Zanabazar Square, Soyombo, Masaram Gondi, Nüshu, hentaigana, 7494 CJK Unified Ideographs, 56 emoji, U+20BF (Bitcoin)
9.0 2016 Jun 128 172 (+7500) 135 (+6) Adlam, Bhaiksuki, Marchen, Newa, Osage, Tangut, 72 emoji
8.0 2015 Jun 120 672 (+7716) 129 (+6) Ahom, Anatolian hieroglyphs, Hatran, Multani, Old Hungarian, SignWriting, additional CJK Unified Ideographs, lowercase letters for Cherokee, 5 emoji skin tone modifiers
7.0 2014 Jun 112 956 (+2834) 123 (+23) Bassa Vah, Caucasian Albanian, Duployan, Elbasan, Grantha, Khojki, Khudawadi, Linear A, Mahajani, Manichaean, Mende Kikakui, Modi, Mro, Nabataean, Old North Arabian, Old Permic, Pahawh Hmong, Palmyrene, Pau Cin Hau, Psalter Pahlavi, Siddham, Tirhuta, Warang Citi, and dingbats
6.3 2013 Sep 110 122 (+5) 100 5 bidirectional formatting characters
6.2 2012 Sep 110 117 (+1) 100 U+20BA (Turkish Lira)
6.1 2012 Jan 110 116 (+732) 100 (+7) Chakma, Meroitic cursive, Meroitic hieroglyphs, Miao, Sharada, Sora Sompeng, and Takri
6.0 2010 Oct 109 384 (+2088) 93 (+3) Batak, Brahmi, Mandaic, playing card symbols, transport and map symbols, alchemical symbols, emoticons and emoji, additional CJK Unified Ideographs
5.2 2009 Oct 107 296 (+6648) 90 (+15) Avestan, Bamum, Gardiner's sign list of Egyptian hieroglyphs, Imperial Aramaic, Inscriptional Pahlavi, Inscriptional Parthian, Javanese, Kaithi, Lisu, Meetei Mayek, Old South Arabian, Old Turkic, Samaritan, Tai Tham and Tai Viet, additional CJK Unified Ideographs, Jamo for Old Hangul, Vedic Sanskrit
5.1 2008 Apr 100 648 (+1624) 75 (+11) Carian, Cham, Kayah Li, Lepcha, Lycian, Lydian, Ol Chiki, Rejang, Saurashtra, Sundanese, and Vai, sets of symbols for the Phaistos Disc, Mahjong tiles, Domino tiles, additions to Burmese, Scribal abbreviations, U+1E9E (Latin Sharp S)
5.0 2006 Jul 99 024 (+1369) 64 (+5) Balinese, cuneiform, N'Ko, 'Phags-pa, Phoenician
4.1 2005 Mar 97 655 (+1273) 59 (+7) Buginese, Glagolitic, Kharosthi, New Tai Lue, Old Persian, Sylheti Nagri, and Tifinagh, Coptic disunified from Greek, ancient Greek numbers and musical symbols, introduction of the first named character sequences
4.0 2003 Apr 96 382 (+1226) 52 (+7) Cypriot syllabary, Limbu, Linear B, Osmanya, Shavian, Tai Le, and Ugaritic, Hexagram symbols
3.2 2002 Mar 95 156 (+1016) 45 (+4) Philippine scripts (Buhid, Hanunoo, Tagalog, and Tagbanwa)
3.1 2001 Mar 94 140 (+44946) 41 (+3) Deseret, Gothic and Old Italic, sets of symbols for Western and Byzantine music, 42 711 additional CJK Unified Ideographs
3.0 1999 Sep 49 194 (+10307) 38 (+13) Cherokee, Ge'ez, Khmer, Mongolian, Burmese, Ogham, runes, Sinhala, Syriac, Thaana, Canadian Aboriginal syllabics, and Yi Syllables, Braille patterns
2.1 1998 May 38 887 (+2) 25 U+20AC (Euro Sign), U+FFFC (Object Replacement Character)
2.0 1996 Jul 38 885 (+11373, -6656) 25 (+1) Original set of Hangul syllables removed, new set of 11 172 Hangul syllables added at new location, Tibetan added back in a new location and with a different character repertoire
1.1 1993 Jun 34 168 (+5963, -9) 24 (-1) 33 reclassified as control characters. 4306 Hangul syllables, Tibetan removed
1.0.1 1992 Jun 28 327 (+21204, -9) 25 (+1) The initial 20 902 CJK Unified Ideographs
1.0 1991 Oct 7 129 24 Initial coverage: Arabic, Armenian, Bengali, Bopomofo, Cyrillic, Devanagari, Georgian, Greek and Coptic, Gujarati, Gurmukhi, Hangul, Hebrew, Hiragana, Kannada, Katakana, Lao, Latin, Malayalam, Odia, Tamil, Telugu, Thai, and Tibetan
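The Characters column reads as a running total: each figure is the previous version's total plus the characters added, minus any removed. A minimal sketch of that arithmetic follows, using a handful of rows copied from the table. The function name `check_running_totals` and the tuple layout are assumptions for this example, not part of the standard.

```python
# (version, total characters, added, removed), copied from the table above.
# The first row only supplies the starting total; its deltas are not checked.
rows = [
    ("1.1", 34_168, 5_963, 9),
    ("2.0", 38_885, 11_373, 6_656),
    ("2.1", 38_887, 2, 0),
    ("3.0", 49_194, 10_307, 0),
    ("3.1", 94_140, 44_946, 0),
]


def check_running_totals(rows):
    """Return versions whose total differs from previous total + added - removed."""
    mismatches = []
    for (_, prev_total, _, _), (version, total, added, removed) in zip(rows, rows[1:]):
        if prev_total + added - removed != total:
            mismatches.append(version)
    return mismatches


print(check_running_totals(rows))  # prints [] for the rows listed here
```

For example, version 2.0 follows from 34 168 + 11 373 - 6 656 = 38 885.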

= Older browsers do not support all symbols for HTML 5.