| | Copyright |
| | Preface |
| | | Section 0.1. What Is Text Processing? |
| | | Section 0.2. The Philosophy of Text Processing |
| | | Section 0.3. What You'll Need to Use This Book |
| | | Section 0.4. Conventions Used in This Book |
| | | Section 0.5. A Word on Source Code Examples |
| | | Section 0.6. External Resources |
|
| | Acknowledgments |
| | Chapter 1. Python Basics |
| | | Section 1.1. Techniques and Patterns |
| | | Section 1.2. Standard Modules |
| | | Section 1.3. Other Modules in the Standard Library |
|
| | Chapter 2. Basic String Operations |
| | | Section 2.1. Some Common Tasks |
| | | Section 2.2. Standard Modules |
| | | Section 2.3. Solving Problems |
|
| | Chapter 3. Regular Expressions |
| | | Section 3.1. A Regular Expression Tutorial |
| | | Section 3.2. Some Common Tasks |
| | | Section 3.3. Standard Modules |
|
| | Chapter 4. Parsers and State Machines |
| | | Section 4.1. An Introduction to Parsers |
| | | Section 4.2. An Introduction to State Machines |
| | | Section 4.3. Parser Libraries for Python |
|
| | Chapter 5. Internet Tools and Techniques |
| | | Section 5.1. Working with Email and Newsgroups |
| | | Section 5.2. World Wide Web Applications |
| | | Section 5.3. Synopses of Other Internet Modules |
| | | Section 5.4. Understanding XML |
|
| | Appendix A. A Selective and Impressionistic Short Review of Python |
| | | Section A.1. What Kind of Language Is Python? |
| | | Section A.2. Namespaces and Bindings |
| | | Section A.3. Datatypes |
| | | Section A.4. Flow Control |
| | | Section A.5. Functional Programming |
|
| | Appendix B. A Data Compression Primer |
| | | Section B.1. Introduction |
| | | Section B.2. Lossless and Lossy Compression |
| | | Section B.3. A Data Set Example |
| | | Section B.4. Whitespace Compression |
| | | Section B.5. Run-Length Encoding |
| | | Section B.6. Huffman Encoding |
| | | Section B.7. Lempel Ziv-Compression |
| | | Section B.8. Solving the Right Problem |
| | | Section B.9. A Custom Text Compressor |
| | | Section B.10. References |
|
| | Appendix C. Understanding Unicode |
| | | Section C.1. Some Background on Characters |
| | | Section C.2. What Is Unicode? |
| | | Section C.3. Encodings |
| | | Section C.4. Declarations |
| | | Section C.5. Finding Codepoints |
| | | Section C.6. Resources |
|
| | Appendix D. A State Machine for Adding Markup to Text |
| | Appendix E. Glossary |