Springer, 2014. — 477 p.
Modern communication technologies, such as television and the Internet, have made massive amounts of information readily available in many languages. More such data is generated in real time, 24 hours a day and 7 days a week, aided by social networking sites such as Facebook and Twitter. This information explosion takes the form of multilingual audio, video, and Web content, and processing it demands effective, scalable, multilingual media processing, monitoring, indexing, and search solutions. Natural Language Processing (NLP) technologies have long been used to address this task, and in the last two decades NLP researchers have developed exciting algorithms for processing large amounts of text in many different languages. Today the English language holds the lion's share of both available resources and developed NLP technical solutions.

In this book, we address another group of interesting and challenging languages for NLP research: the Semitic languages. The Semitic languages have existed in written form since a very early date, with texts written in a script adapted from Sumerian cuneiform. Most scripts used to write Semitic languages are abjads, a type of alphabetic script that omits some or all of the vowels. This is feasible for these languages because the consonants are the primary carriers of meaning. Semitic languages have interesting morphology, in which word roots are not themselves syllables or words, but isolated sets of consonants (usually three). Words are composed from a root by inserting vowels between the root consonants (often with prefixes and suffixes added as well). For example, in Arabic, the root meaning write has the form k-t-b.
From this root, words are formed by filling in the vowels, e.g., kitAb book, kutub books, kAtib writer, kuttAb writers, kataba he wrote, yaktubu he writes, etc.

Semitic languages, as stated in Wikipedia, are spoken by more than 270 million people. The most widely spoken Semitic languages today are Arabic (206 million native speakers), Amharic (27 million), Hebrew (7 million), Tigrinya (6.7 million), Syriac (1 million), and Maltese (419 thousand). NLP research applied to Semitic languages has been the focus of attention of many researchers for more than a decade, and several technical solutions have been proposed. This is especially true of Arabic NLP, where a very large body of research has been accomplished; accordingly, Arabic takes the largest share of this book. Hebrew has also been the center of attention of several NLP research efforts, though to a smaller degree than Arabic; most of the key published research in Hebrew NLP is discussed in this book. For Amharic, Maltese, and Syriac, because very little NLP research is publicly available, we did not limit ourselves to presenting key techniques, but also propose solutions inspired by Arabic and Hebrew. Our aim for this book is to provide a one-stop shop for all the requisite background and practical advice needed when building NLP applications for Semitic languages. While this is quite a tall order, we hope that, at a minimum, you find this book a useful resource.
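The root-and-pattern word formation described above can be sketched as a small program. The following is a toy illustration only: the vocalic patterns and glosses follow the Arabic k-t-b examples in the text, but the interdigitation scheme is a deliberate simplification (it does not handle gemination, as in kuttAb, or prefixes and suffixes).

```python
# Toy illustration of Semitic root-and-pattern morphology.
# A root of consonants is interdigitated with a vocalic pattern,
# where each "C" slot is filled by the next root consonant.

def interdigitate(root, pattern):
    """Fill the C slots of a vocalic pattern with the root consonants."""
    consonants = iter(root)
    return "".join(next(consonants) if ch == "C" else ch for ch in pattern)

root = ("k", "t", "b")  # the Arabic root meaning "write"

# Patterns and glosses taken from the examples in the text.
patterns = {
    "CiCAC":  "book (kitAb)",
    "CuCuC":  "books (kutub)",
    "CACiC":  "writer (kAtib)",
    "CaCaCa": "he wrote (kataba)",
}

for pattern, gloss in patterns.items():
    print(f"{interdigitate(root, pattern):8s} {gloss}")
```

Running this prints kitAb, kutub, kAtib, and kataba, matching the forms listed above; a real morphological analyzer must invert this process, recovering the root and pattern from a surface form.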
Similar to English, the dominant approach in NLP for Semitic languages has been to build a statistical model that can learn from examples. In this way, a model can be robust to changes in the type of text, and even the language of the text, on which it operates. With the right design choices, the same model can be trained to work in a new domain simply by providing new examples in that domain. This approach also obviates the need for researchers to lay out, in painstaking fashion, all the rules that govern the problem at hand and the manner in which those rules must be combined. A statistical system typically allows researchers to provide an abstract expression of possible features of the input, whose relative importance can be learned during the training phase and applied to new text during the decoding, or inference, phase. While this book devotes some attention to cutting-edge algorithms and techniques, its primary purpose is a thorough explication of best practices in the field. Furthermore, every chapter describes how the techniques discussed apply to Semitic languages.
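The train-then-decode workflow described above can be sketched in a few lines. This is a minimal illustration, not a method from the book: the task (spotting a hypothetical "-At"-like suffix), the feature functions, and the toy data are all invented for the example, and a simple perceptron stands in for the more sophisticated learners the book covers.

```python
# Minimal sketch of the feature-based statistical approach:
# the researcher declares abstract features; training learns each
# feature's weight; decoding applies the learned weights to new text.

def features(word):
    """Abstract feature expression; the learner decides how much
    each feature matters (the features here are hypothetical)."""
    return {"suffix=" + word[-2:]: 1.0, "is_long": float(len(word) > 5)}

def train(examples, epochs=10):
    """Simple perceptron: on each mistake, bump weights toward the label."""
    weights = {}
    for _ in range(epochs):
        for word, label in examples:  # label is +1 or -1
            score = sum(weights.get(f, 0.0) * v
                        for f, v in features(word).items())
            if score * label <= 0:    # mistake-driven update
                for f, v in features(word).items():
                    weights[f] = weights.get(f, 0.0) + label * v
    return weights

def predict(weights, word):
    """Decoding/inference phase: score unseen text with learned weights."""
    score = sum(weights.get(f, 0.0) * v for f, v in features(word).items())
    return 1 if score > 0 else -1

# Hypothetical toy task and data: does a word end in the plural-like -At?
train_data = [("kalimAt", 1), ("kitAb", -1), ("mudarrisAt", 1), ("qalam", -1)]
w = train(train_data)
print(predict(w, "sayyArAt"))  # an unseen word ending in -At
```

Retargeting the model to a new domain or language means swapping in new training examples (and possibly new feature functions), with no hand-written rules to rewrite; that is the portability argument made above.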
Natural Language Processing Core Technologies.
Linguistic Introduction: The Orthography, Morphology and Syntax of Semitic Languages.
Morphological Processing of Semitic Languages.
Syntax and Parsing of Semitic Languages.
Semantic Processing of Semitic Languages.
Language Modeling.
Natural Language Processing Applications.
Statistical Machine Translation.
Named Entity Recognition.
Anaphora Resolution.
Relation Extraction.
Information Retrieval.
Question Answering.
Automatic Summarization.
Automatic Speech Recognition.