Using Comparable Corpora For Under Resourced Areas Of Machine Translation

DOWNLOAD
Download Using Comparable Corpora For Under Resourced Areas Of Machine Translation PDF/ePub or read online books in Mobi eBooks. Click Download or Read Online button to get Using Comparable Corpora For Under Resourced Areas Of Machine Translation book now. This website allows unlimited access to, at the time of writing, more than 1.5 million titles, including hundreds of thousands of titles in various foreign languages. If the content not found or just blank you must refresh this page
Using Comparable Corpora For Under Resourced Areas Of Machine Translation
DOWNLOAD
Author : Inguna Skadiņa
language : en
Publisher: Springer
Release Date : 2019-02-06
Using Comparable Corpora For Under Resourced Areas Of Machine Translation written by Inguna Skadiņa and has been published by Springer this book supported file pdf, txt, epub, kindle and other format this book has been release on 2019-02-06 with Computers categories.
This book provides an overview of how comparable corpora can be used to overcome the lack of parallel resources when building machine translation systems for under-resourced languages and domains. It presents a wealth of methods and open tools for building comparable corpora from the Web, evaluating comparability and extracting parallel data that can be used for the machine translation task. It is divided into several sections, each covering a specific task such as building, processing, and using comparable corpora, focusing particularly on under-resourced language pairs and domains. The book is intended for anyone interested in data-driven machine translation for under-resourced languages and domains, especially for developers of machine translation systems, computational linguists and language workers. It offers a valuable resource for specialists and students in natural language processing, machine translation, corpus linguistics and computer-assisted translation, and promotes the broader use of comparable corpora in natural language processing and computational linguistics.
Building And Using Comparable Corpora
DOWNLOAD
Author : Serge Sharoff
language : en
Publisher: Springer Science & Business Media
Release Date : 2013-12-13
Building And Using Comparable Corpora written by Serge Sharoff and has been published by Springer Science & Business Media this book supported file pdf, txt, epub, kindle and other format this book has been release on 2013-12-13 with Computers categories.
The 1990s saw a paradigm change in the use of corpus-driven methods in NLP. In the field of multilingual NLP (such as machine translation and terminology mining) this implied the use of parallel corpora. However, parallel resources are relatively scarce: many more texts are produced daily by native speakers of any given language than translated. This situation resulted in a natural drive towards the use of comparable corpora, i.e. non-parallel texts in the same domain or genre. Nevertheless, this research direction has not produced a single authoritative source suitable for researchers and students coming to the field. The proposed volume provides a reference source, identifying the state of the art in the field as well as future trends. The book is intended for specialists and students in natural language processing, machine translation and computer-assisted translation.
Computational Linguistics And Intelligent Text Processing
DOWNLOAD
Author : Alexander Gelbukh
language : en
Publisher: Springer
Release Date : 2013-03-12
Computational Linguistics And Intelligent Text Processing written by Alexander Gelbukh and has been published by Springer this book supported file pdf, txt, epub, kindle and other format this book has been release on 2013-03-12 with Computers categories.
This two-volume set, consisting of LNCS 7816 and LNCS 7817, constitutes the thoroughly refereed proceedings of the 13th International Conference on Computer Linguistics and Intelligent Processing, CICLING 2013, held on Samos, Greece, in March 2013. The total of 91 contributions presented was carefully reviewed and selected for inclusion in the proceedings. The papers are organized in topical sections named: general techniques; lexical resources; morphology and tokenization; syntax and named entity recognition; word sense disambiguation and coreference resolution; semantics and discourse; sentiment, polarity, subjectivity, and opinion; machine translation and multilingualism; text mining, information extraction, and information retrieval; text summarization; stylometry and text simplification; and applications.
Chinese Lexical Semantics
DOWNLOAD
Author : Donghong Ji
language : en
Publisher: Springer
Release Date : 2013-02-15
Chinese Lexical Semantics written by Donghong Ji and has been published by Springer this book supported file pdf, txt, epub, kindle and other format this book has been release on 2013-02-15 with Computers categories.
This book constitutes carefully reviewed and revised selected papers from the 13th Chinese Lexical Semantics Workshop, CLSW 2012, held in Wuhan, China, in July 2012. The 67 full papers and 17 short papers presented in this volume were carefully reviewed and selected from 169 submissions. They are organized in topical sections named: applications on natural language processing; corpus linguistics; lexical computation; lexical resources; lexical semantics; new methods for lexical semantics; and other topics.
Machine Learning In Translation Corpora Processing
DOWNLOAD
Author : Krzysztof Wolk
language : en
Publisher: CRC Press
Release Date : 2019-02-25
Machine Learning In Translation Corpora Processing written by Krzysztof Wolk and has been published by CRC Press this book supported file pdf, txt, epub, kindle and other format this book has been release on 2019-02-25 with Computers categories.
This book reviews ways to improve statistical machine speech translation between Polish and English. Research has been conducted mostly on dictionary-based, rule-based, and syntax-based, machine translation techniques. Most popular methodologies and tools are not well-suited for the Polish language and therefore require adaptation, and language resources are lacking in parallel and monolingual data. The main objective of this volume to develop an automatic and robust Polish-to-English translation system to meet specific translation requirements and to develop bilingual textual resources by mining comparable corpora.
Routledge Encyclopedia Of Translation Technology
DOWNLOAD
Author : Chan Sin-wai
language : en
Publisher: Taylor & Francis
Release Date : 2023-04-26
Routledge Encyclopedia Of Translation Technology written by Chan Sin-wai and has been published by Taylor & Francis this book supported file pdf, txt, epub, kindle and other format this book has been release on 2023-04-26 with Foreign Language Study categories.
Routledge Encyclopedia of Translation Technology, second edition, provides a state-of-the-art survey of the field of computer-assisted translation. It is the first definitive reference to provide a comprehensive overview of the general, regional, and topical aspects of this increasingly significant area of study. The Encyclopedia is divided into three parts: Part 1 presents general issues in translation technology, such as its history and development, translator training, and various aspects of machine translation, including a valuable case study of its teaching at a major university; Part 2 discusses national and regional developments in translation technology, offering contributions covering the crucial territories of China, Canada, France, Hong Kong, Japan, South Africa, Taiwan, the Netherlands and Belgium, the United Kingdom, and the United States; Part 3 evaluates specific matters in translation technology, with entries focused on subjects such as alignment, concordancing, localization, online translation, and translation memory. The new edition has five additional chapters, with many chapters updated and revised, drawing on the expertise of over 50 contributors from around the world and an international panel of consultant editors to provide a selection of chapters on the most pertinent topics in the discipline. All the chapters are self-contained, extensively cross-referenced, and include useful and up-to-date references and information for further reading. It will be an invaluable reference work for anyone with a professional or academic interest in the subject.
Corpus Use In Cross Linguistic Research
DOWNLOAD
Author : Marlén Izquierdo
language : en
Publisher: John Benjamins Publishing Company
Release Date : 2023-11-02
Corpus Use In Cross Linguistic Research written by Marlén Izquierdo and has been published by John Benjamins Publishing Company this book supported file pdf, txt, epub, kindle and other format this book has been release on 2023-11-02 with Language Arts & Disciplines categories.
Cross-linguistic research is a fruitful field of language inquiry that has benefited enormously from the use of corpora. As sources of linguistic data of various kinds and as tools for language processing, corpora have shaped the development of cross-linguistic research, enabling both language description and practical applications. This volume contains twelve studies that emphasize the usefulness and usability of parallel corpora in accurately exploring the structure and use of seven under-researched languages and language varieties. The first part emphasizes the role of corpus-based descriptive analyses at the lexicogrammatical and discursive levels, as a first step on the way towards concrete applications like translation or language teaching. The second part focuses on the role of parallel-corpus-based language processing techniques and applications that facilitate professional communication. This book will be of interest to scholars in contrastive linguistics, translation studies, discourse analysis, language teaching, and natural language processing.
Neural Machine Translation
DOWNLOAD
Author : Philipp Koehn
language : en
Publisher: Cambridge University Press
Release Date : 2020-06-18
Neural Machine Translation written by Philipp Koehn and has been published by Cambridge University Press this book supported file pdf, txt, epub, kindle and other format this book has been release on 2020-06-18 with Computers categories.
Learn how to build machine translation systems with deep learning from the ground up, from basic concepts to cutting-edge research.
Corpus Analysis For Language Studies At The University Level
DOWNLOAD
Author : Giedrė Valūnaitė Oleškevičienė
language : en
Publisher: Cambridge Scholars Publishing
Release Date : 2021-02-08
Corpus Analysis For Language Studies At The University Level written by Giedrė Valūnaitė Oleškevičienė and has been published by Cambridge Scholars Publishing this book supported file pdf, txt, epub, kindle and other format this book has been release on 2021-02-08 with Language Arts & Disciplines categories.
This book highlights corpora use in teaching foreign languages in university education. It will appeal to both academics and practitioners interested in the process of teaching foreign languages at more advanced levels while applying corpus analysis and building tools for corpus annotation. It provides a detailed case study of analyzing the terminology of constitutional law in both English and Lithuanian as an example to illustrate the possibility of integrating corpus analysis tools into the process of teaching foreign languages in university education. The book reveals that initial linguistic knowledge is essential when teaching and learning foreign languages at more advanced levels while applying corpus annotation. In addition, it shows that, even though the use of new corpus software is perceived as a positive, there are still certain issues to be solved in this regard, such as the constant renewal of public computers in universities and the technical and methodological support for teachers while using corpora tools.
Multilingual Processing In Eastern And Southern Eu Languages
DOWNLOAD
Author : Cristina Vertan
language : en
Publisher: Cambridge Scholars Publishing
Release Date : 2012-04-25
Multilingual Processing In Eastern And Southern Eu Languages written by Cristina Vertan and has been published by Cambridge Scholars Publishing this book supported file pdf, txt, epub, kindle and other format this book has been release on 2012-04-25 with Language Arts & Disciplines categories.
This volume draws attention to many specific challenges of multilingual processing within the European Union, especially after the recent successive enlargement. Most of the languages considered herein are not only ‘less resourced’ in terms of processing tools and training data, but also have features which are different from the well known international language pairs. The 16 contributions address specific problems and solutions for languages from south-eastern and central Europe in the context of multilingual communication, translation and information retrieval.