Publications may be listed more than once under different
headings.
[Blocks] |
Blocks data file
For the latest version, see:
http://www.unicode.org/Public/UCD/latest/ucd/Blocks.txt
For the 7.0.0 version, see:
http://www.unicode.org/Public/7.0.0/ucd/Blocks.txt |
[Charts] |
Online Code Charts
http://www.unicode.org/charts/
An index to character names with links to the corresponding chart is
found at
http://www.unicode.org/charts/charindex.html
|
[Charts14] |
Charts for the test files
For the latest version, see:
http://www.unicode.org/Public/UCD/latest/ucd/auxiliary/LineBreakTest.html
For the 7.0.0 version, see:
http://www.unicode.org/Public/7.0.0/ucd/auxiliary/LineBreakTest.html |
[Charts15] |
Normalization Charts
http://www.unicode.org/charts/normalization/ |
[Charts29] |
Charts for the test files
For the latest version, see:
http://www.unicode.org/Public/UCD/latest/ucd/auxiliary/GraphemeBreakTest.html
http://www.unicode.org/Public/UCD/latest/ucd/auxiliary/WordBreakTest.html
http://www.unicode.org/Public/UCD/latest/ucd/auxiliary/SentenceBreakTest.html
For the 7.0.0 version, see:
http://www.unicode.org/Public/7.0.0/ucd/auxiliary/GraphemeBreakTest.html
http://www.unicode.org/Public/7.0.0/ucd/auxiliary/WordBreakTest.html
http://www.unicode.org/Public/7.0.0/ucd/auxiliary/SentenceBreakTest.html |
[CLDR] |
Unicode Locales Project (Unicode Common Locale Data Repository)
http://cldr.unicode.org/ |
[Code9] |
Reference implementations of the Unicode Bidirectional
Algorithm
For C reference code, see:
http://www.unicode.org/Public/PROGRAMS/BidiReferenceC/
For Java reference code, see:
http://www.unicode.org/Public/PROGRAMS/BidiReferenceJava/
|
[Code14] |
Sample implementation of the
Unicode Line Breaking Algorithm
http://www.unicode.org/Public/PROGRAMS/LineBreakSampleCpp/ |
[Corrections] |
Normalization Corrections data file
For the latest version, see:
http://www.unicode.org/Public/UCD/latest/ucd/NormalizationCorrections.txt
For the 7.0.0 version, see:
http://www.unicode.org/Public/7.0.0/ucd/NormalizationCorrections.txt |
[Corrigendum1] |
Corrigendum #1: UTF-8 Shortest Form
http://www.unicode.org/versions/corrigendum1.html |
[Corrigendum2] |
Corrigendum #2: Yod with Hiriq Normalization
http://www.unicode.org/versions/corrigendum2.html |
[Corrigendum3] |
Corrigendum #3: U+F951 Normalization
http://www.unicode.org/versions/corrigendum3.html |
[Corrigendum4] |
Corrigendum #4: Five CJK Canonical Mapping Errors
http://www.unicode.org/versions/corrigendum4.html |
[Corrigendum5] |
Corrigendum #5: Normalization Idempotency
http://www.unicode.org/versions/corrigendum5.html |
[Corrigendum6] |
Corrigendum #6: Bidi Mirroring
http://www.unicode.org/versions/corrigendum6.html |
[Corrigendum7] |
Corrigendum #7: UAX #14, Unicode Line Breaking Algorithm, rule LB8
http://www.unicode.org/versions/corrigendum7.html |
[Corrigendum8] |
Corrigendum #8: Bidi_Class Fix for U+070F Syriac Abbreviation Mark
http://www.unicode.org/versions/corrigendum8.html |
[Corrigendum9] |
Corrigendum #9: Clarification About Noncharacters
http://www.unicode.org/versions/corrigendum9.html |
[Data9] |
Unicode Bidirectional Algorithm property data files
For the latest version, see:
http://www.unicode.org/Public/UCD/latest/ucd/BidiMirroring.txt
http://www.unicode.org/Public/UCD/latest/ucd/BidiBrackets.txt
For the 7.0.0 version, see:
http://www.unicode.org/Public/7.0.0/ucd/BidiMirroring.txt
http://www.unicode.org/Public/7.0.0/ucd/BidiBrackets.txt |
[Data11] |
East Asian Width property data file
For the latest version, see:
http://www.unicode.org/Public/UCD/latest/ucd/EastAsianWidth.txt
For the 7.0.0 version, see:
http://www.unicode.org/Public/7.0.0/ucd/EastAsianWidth.txt |
[Data14] |
Unicode Line Breaking Algorithm property data file
For the latest version, see:
http://www.unicode.org/Public/UCD/latest/ucd/LineBreak.txt
For the 7.0.0 version, see:
http://www.unicode.org/Public/7.0.0/ucd/LineBreak.txt |
[Data24] |
Unicode Script Property data file
For the latest version, see:
http://www.unicode.org/Public/UCD/latest/ucd/Scripts.txt
For the 7.0.0 version, see:
http://www.unicode.org/Public/7.0.0/ucd/Scripts.txt |
[Data34] |
Unicode Named Character Sequences data file
For the latest version, see:
http://www.unicode.org/Public/UCD/latest/ucd/NamedSequences.txt
For the 7.0.0 version, see:
http://www.unicode.org/Public/7.0.0/ucd/NamedSequences.txt |
[Data45] |
U-Source Ideographs data file
For the latest version, see:
http://www.unicode.org/Public/UCD/latest/ucd/USourceData.txt
For the 7.0.0 version, see:
http://www.unicode.org/Public/7.0.0/ucd/USourceData.txt |
[DataProv] |
Provisional Named Sequences data file
For the latest version, see:
http://www.unicode.org/Public/UCD/latest/ucd/NamedSequencesProv.txt
For the 7.0.0 version, see:
http://www.unicode.org/Public/7.0.0/ucd/NamedSequencesProv.txt |
[Demo9] |
Online demo of a reference implementation of the Unicode Bidirectional Algorithm
http://www.unicode.org/cldr/utility/bidi.jsp |
[DerivedBIDI] |
Derived Bidi Properties
For the latest version, see:
http://www.unicode.org/Public/UCD/latest/ucd/extracted/DerivedBidiClass.txt
For the 7.0.0 version, see:
http://www.unicode.org/Public/7.0.0/ucd/extracted/DerivedBidiClass.txt |
[Errata] |
Updates and Errata
http://www.unicode.org/errata |
[Exclusions] |
Composition Exclusion Table
For the latest version, see:
http://www.unicode.org/Public/UCD/latest/ucd/CompositionExclusions.txt
For the 7.0.0 version, see:
http://www.unicode.org/Public/7.0.0/ucd/CompositionExclusions.txt |
[FAQ] |
Unicode Frequently Asked Questions
http://www.unicode.org/faq/
For answers to common questions on technical issues. |
[Feedback] |
Reporting Form
http://www.unicode.org/reporting.html
For reporting errors and requesting information online. |
[Glossary] |
Unicode Glossary
http://www.unicode.org/glossary/
For explanations of terminology used in this and other documents. |
[Glyphs45] |
U-Source Ideographs glyph table
For the latest version, see:
http://www.unicode.org/Public/UCD/latest/ucd/USourceGlyphs.pdf
For the 7.0.0 version, see:
http://www.unicode.org/Public/7.0.0/ucd/USourceGlyphs.pdf
|
[HangulST] |
Hangul Syllable Types
For the latest version, see:
http://www.unicode.org/Public/UCD/latest/ucd/HangulSyllableType.txt
For the 7.0.0 version, see:
http://www.unicode.org/Public/7.0.0/ucd/HangulSyllableType.txt |
[NormProps] |
Derived Normalization Properties
For the latest version, see:
http://www.unicode.org/Public/UCD/latest/ucd/DerivedNormalizationProps.txt
For the 7.0.0 version, see:
http://www.unicode.org/Public/7.0.0/ucd/DerivedNormalizationProps.txt |
[Policies] |
Unicode Policies
http://www.unicode.org/policies/policies.html |
[Props] |
Unicode Text Segmentation property data files
For the latest version, see:
http://www.unicode.org/Public/UCD/latest/ucd/auxiliary/GraphemeBreakProperty.txt
http://www.unicode.org/Public/UCD/latest/ucd/auxiliary/WordBreakProperty.txt
http://www.unicode.org/Public/UCD/latest/ucd/auxiliary/SentenceBreakProperty.txt
For the 7.0.0 version, see:
http://www.unicode.org/Public/7.0.0/ucd/auxiliary/GraphemeBreakProperty.txt
http://www.unicode.org/Public/7.0.0/ucd/auxiliary/WordBreakProperty.txt
http://www.unicode.org/Public/7.0.0/ucd/auxiliary/SentenceBreakProperty.txt |
[PropValue] |
Property Value Aliases data file
For the latest version, see:
http://www.unicode.org/Public/UCD/latest/ucd/PropertyValueAliases.txt
For the 7.0.0 version, see:
http://www.unicode.org/Public/7.0.0/ucd/PropertyValueAliases.txt |
[Reports] |
Unicode Technical Reports
http://www.unicode.org/reports/
For information on the status and development process for technical reports, and for a list of technical reports. |
[Stability] |
Unicode Consortium Stability Policies
http://www.unicode.org/policies/stability_policy.html |
[Tests9] |
Unicode Bidirectional Algorithm test data
For the latest version, see:
http://www.unicode.org/Public/UCD/latest/ucd/BidiTest.txt
http://www.unicode.org/Public/UCD/latest/ucd/BidiCharacterTest.txt
For the 7.0.0 version, see:
http://www.unicode.org/Public/7.0.0/ucd/BidiTest.txt
http://www.unicode.org/Public/7.0.0/ucd/BidiCharacterTest.txt |
[Tests14] |
Unicode Line Breaking Algorithm test data
For the latest version, see:
http://www.unicode.org/Public/UCD/latest/ucd/auxiliary/LineBreakTest.txt
For the 7.0.0 version, see:
http://www.unicode.org/Public/7.0.0/ucd/auxiliary/LineBreakTest.txt |
[Tests15] |
Unicode Normalization Forms test data
For the latest version, see:
http://www.unicode.org/Public/UCD/latest/ucd/NormalizationTest.txt
For the 7.0.0 version, see:
http://www.unicode.org/Public/7.0.0/ucd/NormalizationTest.txt |
[Tests29] |
Unicode Text Segmentation test data
For the latest version, see:
http://www.unicode.org/Public/UCD/latest/ucd/auxiliary/GraphemeBreakTest.txt
http://www.unicode.org/Public/UCD/latest/ucd/auxiliary/WordBreakTest.txt
http://www.unicode.org/Public/UCD/latest/ucd/auxiliary/SentenceBreakTest.txt
For the 7.0.0 version, see:
http://www.unicode.org/Public/7.0.0/ucd/auxiliary/GraphemeBreakTest.txt
http://www.unicode.org/Public/7.0.0/ucd/auxiliary/WordBreakTest.txt
http://www.unicode.org/Public/7.0.0/ucd/auxiliary/SentenceBreakTest.txt |
[UAX9] |
UAX #9: Unicode Bidirectional Algorithm
http://www.unicode.org/reports/tr9/ |
[UAX11] |
UAX #11: East Asian Width
http://www.unicode.org/reports/tr11/ |
[UAX14] |
UAX #14: Unicode Line Breaking Algorithm
http://www.unicode.org/reports/tr14/ |
[UAX15] |
UAX #15: Unicode Normalization Forms
http://www.unicode.org/reports/tr15/ |
[UAX24] |
UAX #24: Unicode Script Property
http://www.unicode.org/reports/tr24/ |
[UAX29] |
UAX #29: Unicode Text Segmentation
http://www.unicode.org/reports/tr29/ |
[UAX31] |
UAX #31: Unicode Identifier and Pattern Syntax
http://www.unicode.org/reports/tr31/ |
[UAX34] |
UAX #34: Unicode Named Character Sequences
http://www.unicode.org/reports/tr34/ |
[UAX38] |
UAX #38: Unicode Han Database (Unihan)
http://www.unicode.org/reports/tr38/ |
[UAX41] |
UAX #41: Common References for Unicode Standard Annexes
http://www.unicode.org/reports/tr41/ |
[UAX42] |
UAX #42:Unicode Character Database in XML
http://www.unicode.org/reports/tr42/ |
[UAX44] |
UAX #44:Unicode Character Database
http://www.unicode.org/reports/tr44/ |
[UAX45] |
UAX #45:U-Source Ideographs
http://www.unicode.org/reports/tr45/ |
[UCD] |
Unicode Character Database
http://www.unicode.org/ucd/
For detailed documentation about the Unicode Character Database, see Unicode Standard Annex #44: Unicode Character Database
http://www.unicode.org/reports/tr44/ |
[Unicode] |
The Unicode Standard For the latest version, see:
http://www.unicode.org/versions/latest/
For the 7.0.0 version, see:
http://www.unicode.org/versions/Unicode7.0.0/ |
[Unicode3.0] |
The Unicode Consortium. The Unicode Standard, Version 3.0
(Reading, MA, Addison-Wesley, 2000. ISBN 0-201-61633-5). |
[Unicode3.1] |
The Unicode Consortium. The Unicode
Standard, Version 3.1.0, defined by: The Unicode Standard, Version
3.0 (Reading, MA, Addison-Wesley, 2000. ISBN 0-201-61633-5), as
amended by the Unicode Standard Annex #27: Unicode 3.1
http://www.unicode.org/reports/tr27/ |
[Unicode3.2] |
The Unicode Consortium. The Unicode
Standard, Version 3.2.0, defined by: The Unicode Standard, Version 3.0
(Reading, MA, Addison-Wesley, 2000. ISBN 0-201-61633-5), as amended by
the Unicode Standard Annex #27: Unicode 3.1 and the Unicode
Standard Annex #28: Unicode 3.2
http://www.unicode.org/reports/tr28/ |
[Unicode4.0] |
The Unicode Consortium.
The Unicode Standard, Version 4.0
(Boston, MA, Addison-Wesley, 2003. ISBN 0-321-18578-1). |
[Unicode4.0.1] |
The Unicode Consortium. The Unicode Standard, Version 4.0.1, defined by:
The Unicode Standard, Version 4.0 (Boston, MA, Addison-Wesley, 2003. ISBN
0-321-18578-1), as amended by
Unicode 4.0.1
http://www.unicode.org/versions/Unicode4.0.1/ |
[Unicode4.1] |
The Unicode Consortium. The Unicode Standard, Version 4.1.0, defined by:
The Unicode Standard, Version 4.0
(Boston, MA, Addison-Wesley, 2003. ISBN 0-321-18578-1), as amended by
Unicode 4.0.1 and by
Unicode 4.1.0
http://www.unicode.org/versions/Unicode4.1.0/ |
[Unicode5.0] |
The Unicode Consortium.
The Unicode Standard, Version
5.0 (Boston, MA, Addison-Wesley, 2007. ISBN 0-321-48091-0). |
[Unicode5.1] |
The Unicode Consortium. The Unicode Standard, Version 5.1.0, defined by: The Unicode Standard, Version 5.0
(Boston, MA, Addison-Wesley, 2007. ISBN 0-321-48091-0), as amended by
Unicode 5.1.0 |
[Unicode5.2] |
The Unicode Consortium. The Unicode Standard, Version 5.2.0, defined
by: The Unicode Standard, Version 5.2 (Mountain View, CA: The Unicode Consortium, 2009. ISBN 978-1-936213-00-9) |
[Unicode6.0] |
The Unicode Consortium. The Unicode Standard, Version 6.0.0
(Mountain View, CA: The Unicode Consortium, 2011. ISBN 978-1-936213-01-6)
http://www.unicode.org/versions/Unicode6.0.0/ |
[Unicode6.1] |
The Unicode Consortium. The Unicode Standard, Version 6.1.0
(Mountain View, CA: The Unicode Consortium, 2012. ISBN 978-1-936213-02-3)
http://www.unicode.org/versions/Unicode6.1.0/ |
[Unicode6.2] |
The Unicode Consortium. The Unicode Standard, Version 6.2.0
(Mountain View, CA: The Unicode Consortium, 2012. ISBN 978-1-936213-07-8)
http://www.unicode.org/versions/Unicode6.2.0/ |
[Unicode6.3] |
The Unicode Consortium. The Unicode Standard, Version 6.3.0
(Mountain View, CA: The Unicode Consortium, 2013. ISBN 978-1-936213-08-5)
http://www.unicode.org/versions/Unicode6.3.0/ |
[Unicode7.0] |
The Unicode Consortium. The Unicode Standard, Version 7.0.0
(Mountain View, CA: The Unicode Consortium, 2014. ISBN 978-1-936213-09-2)
http://www.unicode.org/versions/Unicode7.0.0/ |
[UTC] |
Unicode Technical Committee
http://www.unicode.org/consortium/utc.html |
[UTN5] |
UTN #5: Canonical Equivalences in Applications
http://www.unicode.org/notes/tn5 |
[UTR17] |
UTR #17: Unicode Character Encoding Model
http://www.unicode.org/reports/tr17/ |
[UTR20] |
UTR # 20: Unicode in XML and other Markup Languages
http://www.unicode.org/reports/tr20/ |
[UTR23] |
UTR # 23: The Unicode Character Property Model
http://www.unicode.org/reports/tr23/ |
[UTR25] |
UTR # 25: Unicode Support for Mathematics
http://www.unicode.org/reports/tr25/ |
[UTR33] |
UTR # 33: Unicode Conformance Model
http://www.unicode.org/reports/tr33/ |
[UTR36] |
UTR #36: Unicode Security Considerations
http://www.unicode.org/reports/tr36/ |
[UTR50] |
UTR #50: Unicode Vertical Text Layout
http://www.unicode.org/reports/tr50/ |
[UTS6] |
UTS #6: A Standard Compression Scheme
for Unicode
http://www.unicode.org/reports/tr6/ |
[UTS10] |
UTS #10: Unicode Collation Algorithm
(UCA)
http://www.unicode.org/reports/tr10/ |
[UTS18] |
UTS #18: Unicode Regular Expressions
http://www.unicode.org/reports/tr18/ |
[UTS22] |
UTS #22: Unicode Character Mapping Markup Language
http://www.unicode.org/reports/tr22/ |
[UTS35] |
UTS #35: Unicode Locale Data Markup Language (LDML)
http://www.unicode.org/reports/tr35/ |
[UTS37] |
UTS #37: Unicode Ideographic Variation Database
http://www.unicode.org/reports/tr37/ |
[UTS39] |
UTS #39: Unicode Security Mechanisms
http://www.unicode.org/reports/tr39/ |
[UTS46] |
UTS #46: Unicode IDNA Compatibility Processing
http://www.unicode.org/reports/tr46/ |
[Versions] |
Versions of the Unicode Standard
http://www.unicode.org/versions/
For information on version numbering, and citing and referencing the Unicode Standard,
the Unicode Character Database, and Unicode Technical Reports. |
[Cedar97] |
Cy Cedar, David Veintimilla, Michel Suignard, and Asmus
Freytag, Report from the Trenches: Microsoft Publisher goes Unicode. Proceedings
of the Eleventh International Unicode Conference, San Jose, CA, 1997. |
[CharLint] |
Charlint—A Character Normalization Tool
http://www.w3.org/International/charlint/ |
[CharMod] |
Martin J. Dürst, François Yergeau, Richard Ishida, Misha Wolf, and Tex Texin,
W3C Character Model for the World Wide Web.
(See http://www.w3.org/TR/charmod/.) |
[CharNorm] |
Martin J. Dürst, François Yergeau, Richard Ishida, Misha Wolf, Tex Texin, and Addison
Phillips, Character Model for the World Wide Web 1.0: Normalization,
W3C Working Draft. (See http://www.w3.org/TR/charmod-norm.) |
[CharReq] |
Martin J. Dürst, Requirements
for String Identity Matching and String Indexing, W3C Working
Draft. (See
http://www.w3.org/TR/WD-charreq.) |
[Knuth78] |
Donald E. Knuth and Michael F. Plass, Breaking Lines
into Paragraphs, republished in Digital Typography, CSLI 78
(Stanford, California: CLSI Publications 1997). |
[Suign98] |
Michel Suignard, Worldwide Typography and How to Apply
JIS X 4051-1995 to Unicode. Proceedings of the Twelfth International
Unicode/ISO 10646 Conference, Tokyo, Japan, 1998. |
[TEX] |
Donald E. Knuth, TEX, the Program,
Volume B of Computers & Typesetting (Reading, MA,
Addison-Wesley, 1986). |
Copyright © 2001–2014 Unicode, Inc. All Rights Reserved. The Unicode Consortium makes no expressed
or implied warranty of any kind, and assumes no liability for errors or
omissions. No liability is assumed for incidental and consequential damages in
connection with or arising out of the use of the information or programs
contained or accompanying this technical report. The Unicode
Terms of Use apply.
Unicode and the Unicode logo are trademarks of Unicode,
Inc., and are registered in some jurisdictions.