| | Unicode Explained |
| | Preface |
| | Part I: Working with Characters |
| | | Characters as Data |
| | | Section 1.1. Introduction to Characters and Unicode |
| | | Section 1.2. What's in a Character? |
| | | Section 1.3. Variation of Writing Systems |
| | | Section 1.4. Glyphs and Fonts |
| | | Section 1.5. Definitions of Character Repertoires |
| | | Section 1.6. Numbering Characters |
| | | Section 1.7. Encoding Characters as Octet Sequences |
| | | Section 1.8. Working with Encodings |
| | | Section 1.9. Working with Fonts |
| | | Section 1.10. Summaries |
| | | Writing Characters |
| | | Section 2.1. Method Varieties |
| | | Section 2.2. Keyboard Variation and Settings |
| | | Section 2.3. Virtual Keyboards |
| | | Section 2.4. Program Commands |
| | | Section 2.5. Character Maps |
| | | Section 2.6. Replacements on the Fly |
| | | Section 2.7. Special Techniques |
| | | Section 2.8. Escape Sequences |
| | | Section 2.9. Specialized Editors |
| | | Section 2.10. Exercise |
| | | Character Sets and Encodings |
| | | Section 3.1. Good Old ASCII |
| | | Section 3.2. ISO 8859 Codes |
| | | Section 3.3. Windows Latin 1 and Other Windows Codes |
| | | Section 3.4. Other 8-bit Codes |
| | | Section 3.5. Unicode and UTF-8 |
| | | Section 3.6. Encodings for East Asian Language |
| | | Section 3.7. Converters and Transcoding |
| | | Section 3.8. Using Character Codes |
| | Part II: A Systematic Look at Unicode |
| | | The Structure of Unicode |
| | | Section 4.1. Design Principles |
| | | Section 4.2. Versions of Unicode |
| | | Section 4.3. Coding Space |
| | | Section 4.4. Unicode Terms |
| | | Section 4.5. Guide to the Unicode Standard |
| | | Section 4.6. Unicode and Fonts |
| | | Section 4.7. Criticism of Unicode |
| | | Section 4.8. Questions and Answers |
| | | Properties of Characters |
| | | Section 5.1. Character Classification |
| | | Section 5.2. An Overview of Properties |
| | | Section 5.3. Compositions and Decompositions |
| | | Section 5.4. Normalization |
| | | Section 5.5. Case Properties |
| | | Section 5.6. Collation and Sorting |
| | | Section 5.7. Text Boundaries |
| | | Section 5.8. Directionality |
| | | Section 5.9. Line-Breaking Properties |
| | | Section 5.10. Unicode Conformance Requirements |
| | | Section 5.11. Effects on Choosing Characters |
| | | Unicode Encodings |
| | | Section 6.1. Unicode Encodings in General |
| | | Section 6.2. UTF-32 and UCS-4 |
| | | Section 6.3. UTF-16 and UCS-2 |
| | | Section 6.4. UTF-8 |
| | | Section 6.5. Byte Order |
| | | Section 6.6. Conversions Between Unicode Encodings |
| | | Section 6.7. Other Encodings |
| | | Section 6.8. Auto-Detecting the Encoding |
| | | Section 6.9. Choosing an Encoding |
| | Part III: Advanced Unicode Topics |
| | | Characters and Languages |
| | | Section 7.1. Writing Systems and IT |
| | | Section 7.2. Character Requirements of Languages |
| | | Section 7.3. Transliteration and Transcription |
| | | Section 7.4. Language Metadata |
| | | Section 7.5. Languages and Fonts |
| | | Character Usage |
| | | Section 8.1. Basics of Character Usage |
| | | Section 8.2. ASCII (Basic Latin) |
| | | Section 8.3. Latin-1 Supplement (ISO 8859-1) |
| | | Section 8.4. Other Latin Letters |
| | | Section 8.5. Other European Alphabetic Scripts |
| | | Section 8.6. Diacritic Marks |
| | | Section 8.7. Letterlike Symbols |
| | | Section 8.8. General Punctuation |
| | | Section 8.9. Line Structure Control |
| | | Section 8.10. Mathematical and Technical Symbols |
| | | Section 8.11. Other Blocks |
| | | The Character Level and Above |
| | | Section 9.1. Levels of Text Representation and Processing |
| | | Section 9.2. Characters and Markup |
| | | Section 9.3. Media Types for Text |
| | | Characters in Internet Protocols |
| | | Section 10.1. Information About Encoding |
| | | Section 10.2. Characters in MIME |
| | | Section 10.3. Content Negotiation and Multilingual Sites |
| | | Section 10.4. Characters in Protocol Headers |
| | | Section 10.5. Characters in Domain Names and URLs |
| | | Characters in Programming |
| | | Section 11.1. Characters in Computer Languages |
| | | Section 11.2. Character and String Data |
| | | Section 11.3. The Preparedness Principle |
| | | Section 11.4. Character Input and Output |
| | | Section 11.5. Processing Form Data |
| | | Section 11.6. Identifiers, Patterns, and Regular Expressions |
| | | Section 11.7. International Components for Unicode (ICU) |
| | | Section 11.8. Using Locales |
| | Tables for Writing Characters |
| | | Section A.1. Additional Notes |
| | About the Author |
| | Colophon |
| | Index |