Register your product to gain access to bonus material or receive a coupon.
Uses worked examples to illustrate the relative strengths and weaknesses of various approaches. Ex.___
Introduces important methodological tools such as evaluation, wizard of oz techniques, etc. Ex.___
Shows students how the same algorithm can be used for speech recognition and word-sense disambiguation. Ex.___
Gives students an understanding of how language-related algorithms can be applied to important real-world problems. Ex.___
This book takes an empirical approach to language processing, based on applying statistical and other machine-learning algorithms to large corpora.KEY TOPICS:Methodology boxes are included in each chapter. Each chapter is built around one or more worked examples to demonstrate the main idea of the chapter. Covers the fundamental algorithms of various fields, whether originally proposed for spoken or written language to demonstrate how the same algorithm can be used for speech recognition and word-sense disambiguation. Emphasis on web and other practical applications. Emphasis on scientific evaluation. MARKET:Useful as a reference for professionals in any of the areas of speech and language processing.
1. Introduction.
I. WORDS.
2. Regular Expressions and Automata.II. SYNTAX.
8. Word Classes and Part-of-Speech Tagging.III. SEMANTICS.
14. Representing Meaning.IV. PRAGMATICS.
18. Discourse.APPENDICES.
A. Regular Expression Operators.
This is an exciting time to be working in speech and language processing. Historically distinct fields (natural language processing, speech recognition, computational linguistics, computational psycholinguistics) have begun to merge. The commercial availability of speech recognition and the need for Web-based language techniques have provided an important impetus for development of real systems. The availability of very large on-line corpora has enabled statistical models of language at every level, from phonetics to discourse. We have tried to draw on this emerging state of the art in the design of this pedagogical and reference work:
The book is primarily intended for use in a graduate or advanced undergraduate course or sequence. Because of its comprehensive coverage and the large number of algorithms, the book is also useful as a reference for students and professionals in any of the areas of speech and language processing.
The book is divided into four parts in addition to an introduction and end matter. Part I, "Words", introduces concepts related to the processing of words: phonetics, phonology, morphology, and algorithms used to process them: finite automata, finite transducers, weighted transducers, N-grams, and Hidden Markov Models. Part II, "Syntax", introduces parts-of-speech and phrase structure grammars for English and gives essential algorithms for processing word classes and structured relationships among words: part-of-speech taggers based on HMMs and transformation-based learning, the CYK and Earley algorithms for parsing, unification and typed feature structures, lexicalized and probabilistic parsing, and analytical tools like the Chomsky hierarchy and the pumping lemma. Part III, "Semantics", introduces first order predicate calculus and other ways of representing meaning, several approaches to compositional semantic analysis, along with applications to information retrieval, information extraction, speech understanding, and machine translation. Part IV, "Pragmatics", covers reference resolution and discourse structure and coherence, spoken dialogue phenomena like dialogue and speech act modeling, dialogue structure and coherence, and dialogue managers, as well as a comprehensive treatment of natural language generation and of machine translation.
The book provides enough material to be used for a full-year sequence in speech and language processing. It is also designed so that it can be used for a number of different useful one-term courses:
NLP 1 quarter | NLP 1 semester | Speech + NLP 1 semester | Comp. Linguistics 1 quarter |
---|---|---|---|
1. Intro | 1. Intro | 1. Intro | 1. Intro |
2. Regex, FSA | 2. Regex, FSA | 2. Regex, FSA | 2. Regex, FSA |
8. POS tagging | 3. Morph., FST | 3. Morph., FST | 3. Morph., FST |
9. CFGs | 6. N-grams | 4. Comp. Phonol. | 4. Comp. Phonol. |
10. Parsing | 8. POS tagging | 5. Prob. Pronun. | 10. Parsing |
11. Unification | 9. CFGs | 6. N-grams | 11. Unification |
14. Semantics | 10. Parsing | 7. HMMs & ASR | 13. Complexity |
15. Sem. Analysis | 11. Unification | 8. POS tagging | 16. Lex. Semantics |
18. Discourse | 12. Prob. Parsing | 9. CFGs | 18. Discourse |
20. Generation | 14. Semantics | 10. Parsing | 19. Dialogue |
15. Sem. Analysis | 12. Prob. Parsing | ||
16. Lex. Semantics | 14. Semantics | ||
17. WSD and IR | 15. Sem. Analysis | ||
18. Discourse | 19. Dialogue | ||
20. Generation | 21. Mach. Transl. | ||
21. Mach. Transl. |
Selected chapters from the book could also be used to augment courses in Artificial Intelligence, Cognitive Science, or Information Retrieval.