Combining Diacritical Marks

Unicode character block
(1991)66 (+66)1.0.1 (1992)68 (+2)1.1 (1993)72 (+4)3.0 (1999)82 (+10)3.2 (2002)96 (+14)4.0 (2003)107 (+11)4.1 (2005)112 (+5) Unicode documentationCode chart ∣ Web pageNote: Two characters were moved from the Greek and Coptic block to the Combining Diacritical Marks block in version 1.0.1 during the process of unifying with ISO 10646.[1][2][3]

Combining Diacritical Marks is a Unicode block containing the most common combining characters. It also contains the character "Combining Grapheme Joiner", which prevents canonical reordering of combining characters, and despite the name, actually separates characters that would otherwise be considered a single grapheme in a given context. Its block name in Unicode 1.0 was Generic Diacritical Marks.[4]

Block

Combining Diacritical Marks[1]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+030x ◌̀ ◌́ ◌̂ ◌̃ ◌̄ ◌̅ ◌̆ ◌̇ ◌̈ ◌̉ ◌̊ ◌̋ ◌̌ ◌̍ ◌̎ ◌̏
U+031x ◌̐ ◌̑ ◌̒ ◌̓ ◌̔ ◌̕ ◌̖ ◌̗ ◌̘ ◌̙ ◌̚ ◌̛ ◌̜ ◌̝ ◌̞ ◌̟
U+032x ◌̠ ◌̡ ◌̢ ◌̣ ◌̤ ◌̥ ◌̦ ◌̧ ◌̨ ◌̩ ◌̪ ◌̫ ◌̬ ◌̭ ◌̮ ◌̯
U+033x ◌̰ ◌̱ ◌̲ ◌̳ ◌̴ ◌̵ ◌̶ ◌̷ ◌̸ ◌̹ ◌̺ ◌̻ ◌̼ ◌̽ ◌̾ ◌̿
U+034x ◌̀ ◌́ ◌͂ ◌̓ ◌̈́ ◌ͅ ◌͆ ◌͇ ◌͈ ◌͉ ◌͊ ◌͋ ◌͌ ◌͍ ◌͎  CGJ 
U+035x ◌͐ ◌͑ ◌͒ ◌͓ ◌͔ ◌͕ ◌͖ ◌͗ ◌͘ ◌͙ ◌͚ ◌͛ ◌͜◌ ◌͝◌ ◌͞◌ ◌͟◌
U+036x ◌͠◌ ◌͡◌ ◌͢◌ ◌ͣ ◌ͤ ◌ͥ ◌ͦ ◌ͧ ◌ͨ ◌ͩ ◌ͪ ◌ͫ ◌ͬ ◌ͭ ◌ͮ ◌ͯ
Notes
1.^ As of Unicode version 15.1

Character table

Code Glyph Decimal Description
U+0300    ̀ 768 Combining Grave Accent
U+0301    ́ 769 Combining Acute Accent
U+0302    ̂ 770 Combining Circumflex Accent
U+0303    ̃ 771 Combining Tilde
U+0304    ̄ 772 Combining Macron
U+0305    ̅ 773 Combining Overline
U+0306    ̆ 774 Combining Breve
U+0307    ̇ 775 Combining Dot Above
U+0308    ̈ 776 Combining Diaeresis
U+0309    ̉ 777 Combining Hook Above
U+030A    ̊ 778 Combining Ring Above
U+030B    ̋ 779 Combining Double Acute Accent
U+030C    ̌ 780 Combining Caron
U+030D    ̍ 781 Combining Vertical Line Above
U+030E    ̎ 782 Combining Double Vertical Line Above
U+030F    ̏ 783 Combining Double Grave Accent
U+0310    ̐ 784 Combining Candrabindu
U+0311    ̑ 785 Combining Inverted Breve
U+0312    ̒ 786 Combining Turned Comma Above
U+0313    ̓ 787 Combining Comma Above
U+0314    ̔ 788 Combining Reversed Comma Above
U+0315    ̕ 789 Combining Comma Above Right
U+0316    ̖ 790 Combining Grave Accent Below
U+0317    ̗ 791 Combining Acute Accent Below
U+0318    ̘ 792 Combining Left Tack Below
U+0319    ̙ 793 Combining Right Tack Below
U+031A    ̚ 794 Combining Left Angle Above
U+031B    ̛ 795 Combining Horn
U+031C    ̜ 796 Combining Left Half Ring Below
U+031D    ̝ 797 Combining Up Tack Below
U+031E    ̞ 798 Combining Down Tack Below
U+031F    ̟ 799 Combining Plus Sign Below
U+0320    ̠ 800 Combining Minus Sign Below
U+0321    ̡ 801 Combining Palatalized Hook Below
U+0322    ̢ 802 Combining Retroflex Hook Below
U+0323    ̣ 803 Combining Dot Below
U+0324    ̤ 804 Combining Diaeresis Below
U+0325    ̥ 805 Combining Ring Below
U+0326    ̦ 806 Combining Comma Below
U+0327    ̧ 807 Combining Cedilla
U+0328    ̨ 808 Combining Ogonek
U+0329    ̩ 809 Combining Vertical Line Below
U+032A    ̪ 810 Combining Bridge Below
U+032B    ̫ 811 Combining Inverted Double Arch Below
U+032C    ̬ 812 Combining Caron Below
U+032D    ̭ 813 Combining Circumflex Accent Below
U+032E    ̮ 814 Combining Breve Below
U+032F    ̯ 815 Combining Inverted Breve Below
U+0330    ̰ 816 Combining Tilde Below
U+0331    ̱ 817 Combining Macron Below
U+0332    ̲ 818 Combining Low Line
U+0333    ̳ 819 Combining Double Low Line
U+0334    ̴ 820 Combining Tilde Overlay
U+0335    ̵ 821 Combining Short Stroke Overlay
U+0336    ̶ 822 Combining Long Stroke Overlay
U+0337    ̷ 823 Combining Short Solidus Overlay
U+0338    ̸ 824 Combining Long Solidus Overlay
U+0339    ̹ 825 Combining Right Half Ring Below
U+033A    ̺ 826 Combining Inverted Bridge Below
U+033B    ̻ 827 Combining Square Below
U+033C    ̼ 828 Combining Seagull Below
U+033D    ̽ 829 Combining X Above
U+033E    ̾ 830 Combining Vertical Tilde
U+033F    ̿ 831 Combining Double Overline
U+0340    ̀ 832 Combining Grave Tone Mark
U+0341    ́ 833 Combining Acute Tone Mark
U+0342    ͂ 834 Combining Greek Perispomeni
U+0343    ̓ 835 Combining Greek Koronis
U+0344    ̈́ 836 Combining Greek Dialytika Tonos
U+0345    ͅ 837 Combining Greek Ypogegrammeni
U+0346    ͆ 838 Combining Bridge Above
U+0347    ͇ 839 Combining Equals Sign Below
U+0348    ͈ 840 Combining Double Vertical Line Below
U+0349    ͉ 841 Combining Left Angle Below
U+034A    ͊ 842 Combining Not Tilde Above
U+034B    ͋ 843 Combining Homothetic Above
U+034C    ͌ 844 Combining Almost Equal To Above
U+034D    ͍ 845 Combining Left Right Arrow Below
U+034E    ͎ 846 Combining Upwards Arrow Below
U+034F    ͏ 847 Combining Grapheme Joiner
U+0350    ͐ 848 Combining Right Arrowhead Above
U+0351    ͑ 849 Combining Left Half Ring Above
U+0352    ͒ 850 Combining Fermata
U+0353    ͓ 851 Combining X Below
U+0354    ͔ 852 Combining Left Arrowhead Below
U+0355    ͕ 853 Combining Right Arrowhead Below
U+0356    ͖ 854 Combining Right Arrowhead And Up Arrowhead Below
U+0357    ͗ 855 Combining Right Half Ring Above
U+0358    ͘ 856 Combining Dot Above Right
U+0359    ͙ 857 Combining Asterisk Below
U+035A    ͚ 858 Combining Double Ring Below
U+035B    ͛ 859 Combining Zigzag Above
U+035C    ͜ 860 Combining Double Breve Below
U+035D    ͝ 861 Combining Double Breve
U+035E    ͞ 862 Combining Double Macron
U+035F    ͟ 863 Combining Double Macron Below
U+0360    ͠ 864 Combining Double Tilde
U+0361    ͡ 865 Combining Double Inverted Breve
U+0362    ͢ 866 Combining Double Rightwards Arrow Below
U+0363    ͣ 867 Combining Latin Small Letter A
U+0364    ͤ 868 Combining Latin Small Letter E
U+0365    ͥ 869 Combining Latin Small Letter I
U+0366    ͦ 870 Combining Latin Small Letter O
U+0367    ͧ 871 Combining Latin Small Letter U
U+0368    ͨ 872 Combining Latin Small Letter C
U+0369    ͩ 873 Combining Latin Small Letter D
U+036A    ͪ 874 Combining Latin Small Letter H
U+036B    ͫ 875 Combining Latin Small Letter M
U+036C    ͬ 876 Combining Latin Small Letter R
U+036D    ͭ 877 Combining Latin Small Letter T
U+036E    ͮ 878 Combining Latin Small Letter V
U+036F    ͯ 879 Combining Latin Small Letter X

History

The following Unicode-related documents record the purpose and process of defining specific characters in the Combining Diacritical Marks block:

Version Final code points[a] Count UTC ID L2 ID WG2 ID Document
1.0.0 U+0300..0341 66 (to be determined)
N3153 (pdf, doc) Umamaheswaran, V. S. (2007-02-16), "m49.1g", Unconfirmed minutes of WG 2 meeting 49 AIST, Akihabara, Tokyo, Japan; 2006-09-25/29, Correct the glyphs for 0340 COMBINING GRAVE TONE MARK and 0341 COMBINING ACUTE TONE MARK to be the same as their canonical equivalent characters at 0300 and 0301 respectively.
N3353 (pdf, doc) Umamaheswaran, V. S. (2007-10-10), "M51.3c", Unconfirmed minutes of WG 2 meeting 51 Hanzhou, China; 2007-04-24/27, Correction to the glyphs for 0333 and 0347
L2/08-287 Davis, Mark (2008-08-04), Public Review Issue #122: Proposal for Additional Deprecated Characters
L2/08-253R2 Moore, Lisa (2008-08-19), "Consensus 116-C13", UTC #116 Minutes, Change the deprecated property by removing 0340, 0341, 17D3, and adding 0149, 0F77, 0F79, 17A4, 2329, 232A.
L2/16-178 Evans, Lorna; Keating, Patricia (2016-07-06), Representative glyph and annotation additions for U+033B
L2/16-216 Anderson, Deborah; Whistler, Ken; McGowan, Rick; Pournader, Roozbeh; Glass, Andrew; Iancu, Laurențiu; Moore, Lisa (2016-07-30), "18. Combining Diacritical Marks", Recommendations to UTC #148 August 2016 on Script Proposals
L2/16-203 Moore, Lisa (2016-08-18), "Consensus 148-C3", UTC #148 Minutes, Change the glyph for U+033B COMBINING SQUARE BELOW to a horizontal rectangle as documented in L2/16-178.
1.0.1 U+0344..0345 2 (to be determined)
UTC/1999-017 Davis, Mark (1999-06-02), Data cross-checks (for Agenda)
L2/99-176R Moore, Lisa (1999-11-04), "Data Cross-Checks", Minutes from the joint UTC/L2 meeting in Seattle, June 8-10, 1999
L2/07-068 Muller, Eric (2007-02-05), Representation of Greek mute iota
L2/07-015 Moore, Lisa (2007-02-08), "Action item 110-A37", UTC #110 Minutes, Create an FAQ on the Greek mute iota representation and casing issues.
L2/07-198 Hudson, John (2007-06-01), Representation of Greek mute iota (response to L2/07-068)
L2/07-263 Anderson, Deborah (2007-08-07), Input on Greek Mute Iota
1.1 U+0342..0343, 0360..0361 4 (to be determined)
3.0 U+0346..034E, 0362 10 L2/98-299 N1845 Everson, Michael (1998-09-08), Additional IPA "disturbed speech" characters for the UCS
L2/98-372 N1884R2 (pdf, doc) Whistler, Ken; et al. (1998-09-22), Additional Characters for the UCS
L2/98-329 N1920 Combined PDAM registration and consideration ballot on WD for ISO/IEC 10646-1/Amd. 30, AMENDMENT 30: Additional Latin and other characters, 1998-10-28
3.2 U+034F 1 L2/00-274 N2236 Davis, Mark (2000-08-09), Proposal for addition of COMBINING GRAPHEME JOINER
L2/00-187 Moore, Lisa (2000-08-23), "Grapheme Joiner", UTC minutes -- Boston, August 8-11, 2000
L2/01-042 N2317 Everson, Michael (2001-01-19), On the use of JOINERS in ligation
L2/01-050 N2253 Umamaheswaran, V. S. (2001-01-21), "7.8 Proposal for addition of COMBINING GRAPHEME JOINER", Minutes of the SC2/WG2 meeting in Athens, September 2000
L2/00-324 Moore, Lisa (2001-01-29), "Motion 85-M13", Minutes from UTC #85, San Diego
L2/01-344 N2353 (pdf, doc) Umamaheswaran, V. S. (2001-09-09), "7.14", Minutes from SC2/WG2 meeting #40 -- Mountain View, April 2001
L2/05-094 Hallissy, Bob; Priest, Lorna (2005-04-18), Use of CGJ in Latin script diacritics
L2/05-108R Moore, Lisa (2005-08-26), "C.7", UTC #103 Minutes
U+0363..036F 13 L2/00-061 N2160 Küster, Mark (2000-02-02), Proposal concerning combining diacritics for medievalistics [for Middle High German]
L2/00-330 N2266 Küster, Marc; Wojtovicz, Isabel (2000-09-14), Diacritics for medieval studies
L2/01-050 N2253 Umamaheswaran, V. S. (2001-01-21), "7.17 Diacritics for Medieval Studies", Minutes of the SC2/WG2 meeting in Athens, September 2000
4.0 U+0350..0357 8 L2/02-141 N2419 Everson, Michael; et al. (2002-03-20), Uralic Phonetic Alphabet characters for the UCS
L2/02-192 Everson, Michael (2002-05-02), Everson's Reply on UPA
N2442 Everson, Michael; Kolehmainen, Erkki I.; Ruppel, Klaas; Trosterud, Trond (2002-05-21), Justification for placing the Uralic Phonetic Alphabet in the BMP
L2/02-291 Whistler, Ken (2002-05-31), WG2 report from Dublin
L2/02-292 Whistler, Ken (2002-06-03), Early look at WG2 consent docket
L2/02-166R2 Moore, Lisa (2002-08-09), "Scripts and New Characters - UPA", UTC #91 Minutes
L2/02-253 Moore, Lisa (2002-10-21), "Consensus 92-C2", UTC #92 Minutes
U+035D..035F 3 L2/02-176 Whistler, Ken (2002-03-07), How to make "oo" with combining breve/macron over pair?
L2/02-204 N2457 Additional Double Diacritics, 2002-05-10
L2/02-166R2 Moore, Lisa (2002-08-09), "Combining Breve/macron over the oo Pair", UTC #91 Minutes
L2/02-367 Constable, Peter (2002-10-30), Double Combining Breve Below, Double Combining Macron
L2/03-142R N2594 McGowan, Rick; Constable, Peter; Whistler, Ken (2003-07-01), Proposal to encode combining double breve below
4.1 U+0358 1 L2/97-148 N1593 Everson, Michael (1997-05-27), Proposal to add Latin characters by Latinized Taiwanese languages
L2/97-288 N1603 Umamaheswaran, V. S. (1997-10-24), "8.24.8", Unconfirmed Meeting Minutes, WG 2 Meeting # 33, Heraklion, Crete, Greece, 20 June - 4 July 1997
L2/98-089 N1712R Comments on proposal to add Latin characters to Latinized Taiwanese languages, 1998-03-18
L2/98-286 N1703 Umamaheswaran, V. S.; Ksar, Mike (1998-07-02), "8.26", Unconfirmed Meeting Minutes, WG 2 Meeting #34, Redmond, WA, USA; 1998-03-16--20
L2/02-348 N2507 Draft of Proposal to add Latin characters required by Latinized Taiwanese Holo language to ISO/IEC 10646, 2002-03-31
L2/03-317 N2626 Proposal on IPA Extensions & Combining Diacritical Marks for ISO/IEC 10646 in BMP, 2003-09-27
L2/03-318 N2628 Proposal to add Combining Right Dot Above as required by Taiwanese Holo language Romanization to ISO/IEC 10646, 2003-09-29
L2/03-339 N2646 Constable, Peter (2003-10-08), Comments on N2626, Proposal on IPA Extensions and Combining Diacritic Marks for ISO/IEC 10646 in BMP
L2/03-369 N2669 Comments on "Combining dot above", 2003-10-20
L2/04-076 N2699 Tai, Kaihsu; Chen, Pochung; Tan-Tenn, Henry (2003-10-20), Response to WG2-N2669
L2/03-372 N2673 Anderson, Deborah (2003-10-22), Comments on N2626, "Proposal on IPA Extensions & Combining Diacritic Marks for ISO 10646 in BMP"
L2/04-107 N2713 Revised Proposal for encoding A Supplemented Set of IPA Combining Marks, Modifier Letters & Five-Degree Contour Tone Marks, 2004-03-20
L2/04-200 N2770R Tai, Kaihsu; Chen, Pochung; Tan-Tenn, Henry (2004-05-29), Response to WG2-N2713 - Combining right dot above
U+0359 1 L2/03-157 Pantelia, Maria (2003-05-19), Additional Beta Code Characters not in Unicode (WIP)
L2/03-186R N2612-4 Pantelia, Maria (2003-06-11), Proposal to encode one additional combining diacritic in the UCS
U+035A 1 L2/02-364 N2630 Glass, Andrew; Baums, Stefan; Salomon, Richard (2003-09-29), Proposal to add one combining diacritic to the UCS
U+035B 1 L2/03-219 N2597 Proposal to encode one combining character in the UCS, 2003-06-25
U+035C 1 L2/02-031 Anderson, Deborah (2002-01-21), TLG Miscellanea Proposal
L2/02-033 Anderson, Deborah (2002-01-21), TLG Unicode Proposal (draft)
L2/02-053 Anderson, Deborah (2002-02-04), Description of TLG Documents
L2/02-273 Pantelia, Maria (2002-07-31), TLG Unicode Proposal
L2/02-287 Pantelia, Maria (2002-08-09), Proposal Summary Form accompanying TLG Unicode Proposal (L2/02-273)
L2/02-312R Pantelia, Maria (2002-11-07), Proposal to encode additional Greek editorial and punctuation characters in the UCS
L2/03-324 N2642 Pantelia, Maria (2003-10-06), Proposal to encode additional Greek editorial and punctuation characters in the UCS
  1. ^ Proposed code points and characters names may differ from final code points and names

See also

References

  1. ^ "Unicode 1.0.1 Addendum" (PDF). The Unicode Standard. 1992-11-03. Retrieved 2016-07-09.
  2. ^ "Unicode character database". The Unicode Standard. Retrieved 2023-07-26.
  3. ^ "Enumerated Versions of The Unicode Standard". The Unicode Standard. Retrieved 2023-07-26.
  4. ^ "3.8: Block-by-Block Charts" (PDF). The Unicode Standard. version 1.0. Unicode Consortium.