Parallel Corpora for Contrastive and Translation Studies

Author : Irene Doval
Publisher : John Benjamins Publishing Company
ISBN 13 : 9027262845
Total Pages : 313 pages
Book Rating : 4.0/5 (272 download)

Book Synopsis Parallel Corpora for Contrastive and Translation Studies by : Irene Doval

Parallel Corpora for Contrastive and Translation Studies, written by Irene Doval and published by John Benjamins Publishing Company, was released on 2019-03-20 and has 313 pages. Book excerpt: This volume assesses the state of the art of parallel corpus research as a whole, reporting on advances both in recent developments of parallel corpora – with some particular references to comparable corpora as well – and in ways of exploiting them for a variety of purposes. The first part of the book is devoted to new roles that parallel corpora can and should assume in translation studies and in contrastive linguistics, to the usefulness and usability of parallel corpora, and to advances in parallel corpus alignment, annotation and retrieval. There follows an up-to-date presentation of a number of parallel corpus projects currently being carried out in Europe, some of them multimodal, with certain chapters illustrating case studies developed on the basis of the corpora at hand. In most of these chapters, attention is paid to specific technical issues of corpus building. The third part of the book reflects on specific applications and on the creation of bilingual resources from parallel corpora. This volume will be welcomed by scholars, postgraduate and PhD students in the fields of contrastive linguistics, translation studies, lexicography, language teaching and learning, machine translation, and natural language processing.

Advances in Corpus Linguistics

Author : Karin Aijmer
Publisher : Rodopi
ISBN 13 : 9789042017412
Total Pages : 430 pages
Book Rating : 4.0/5 (174 download)

Book Synopsis Advances in Corpus Linguistics by : Karin Aijmer

Advances in Corpus Linguistics, written by Karin Aijmer and published by Rodopi, was released in 2004 and has 430 pages. Book excerpt: This book provides an up-to-date survey of current issues and approaches in corpus linguistics in the form of twenty-two recent research articles. The articles cover a wide range of topics illustrating the diversity of research that is characteristic of corpus linguistics today. Central themes are the relationship between theory, intuition and corpus data and the role of corpora in linguistic research. The majority of the articles are empirical studies of specific aspects of English, ranging from lexis and grammar to discourse and pragmatics. Other areas explored are language variation, language change and development, language learning, cross-linguistic comparisons of English and other languages, and the development of linguistic software tools. The contributors to the volume include some of the leading figures in the field such as M.A.K. Halliday, John Sinclair, Geoffrey Leech and Michael Hoey. The theoretical and methodological issues addressed in the volume demonstrate clearly the steady advance of an expanding discipline inspired by an empirical, usage-based approach to the study of language. The volume is essential reading for researchers and students interested in the use of computer corpora in linguistic research.

Crosslingual Implementation of Linguistic Taggers Using Parallel Corpora

Author : Hani Safadi
Publisher : Lulu.com
ISBN 13 : 0557448093
Total Pages : 74 pages
Book Rating : 4.5/5 (574 download)

Book Synopsis Crosslingual Implementation of Linguistic Taggers Using Parallel Corpora by : Hani Safadi

Crosslingual Implementation of Linguistic Taggers Using Parallel Corpora, written by Hani Safadi and published by Lulu.com, was released on 2010-04-27 and has 74 pages. Book excerpt: This book addresses the problem of creating linguistic taggers for resource-poor languages using existing taggers in resource-rich languages. Linguistic taggers are classifiers that map individual words or phrases from a sentence to a set of tags. Linguistic taggers are usually trained using supervised learning algorithms. The proposed approach does not require that the input sentence be translated into the source language. Instead, projection of linguistic tags is accomplished through the use of a parallel corpus, which is a collection of texts that are available in a source language and a target language. The correspondence between words of the source and target language makes it possible to project tags from source to target language words. A parallel corpus of the source and target languages might not be readily available for many language pairs. To deal with this problem, we describe a system for automatic acquisition of aligned, bilingual corpora from pre-specified domains on the World Wide Web.
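
The tag projection idea in the excerpt above can be illustrated with a minimal sketch. It is not taken from the book: the function name `project_tags`, the majority-vote rule, the `UNK` fallback tag, and the toy English-French sentence pair with hand-written word alignments are all assumptions made for illustration.

```python
from collections import Counter


def project_tags(source_tags, alignment, target_length, default="UNK"):
    """Project tags from source tokens onto target tokens via word alignments.

    source_tags: list of tags, one per source token.
    alignment: iterable of (source_index, target_index) pairs.
    target_length: number of tokens in the target sentence.
    """
    votes = [Counter() for _ in range(target_length)]
    for src_i, tgt_i in alignment:
        votes[tgt_i][source_tags[src_i]] += 1
    # The most frequent projected tag wins; unaligned target tokens get the default.
    return [v.most_common(1)[0][0] if v else default for v in votes]


# Toy example (hypothetical data): tagged English source, untagged French target.
source_tokens = ["the", "black", "cat"]
source_tags = ["DET", "ADJ", "NOUN"]
target_tokens = ["le", "chat", "noir"]
alignment = [(0, 0), (1, 2), (2, 1)]  # (source index, target index)

projected = project_tags(source_tags, alignment, len(target_tokens))
print(list(zip(target_tokens, projected)))
```

In practice the alignments would come from an automatic word aligner and would be noisy, which is why a voting rule and a fallback tag are included in the sketch.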

Web Corpus Construction

Author : Roland Schäfer
Publisher : Morgan & Claypool Publishers
ISBN 13 : 1627053123
Total Pages : 197 pages
Book Rating : 4.6/5 (27 download)

Book Synopsis Web Corpus Construction by : Roland Schäfer

Web Corpus Construction, written by Roland Schäfer and published by Morgan & Claypool Publishers, was released on 2013-07-01 and has 197 pages. Book excerpt: The World Wide Web constitutes the largest existing source of texts written in a great variety of languages. A feasible and sound way of exploiting this data for linguistic research is to compile a static corpus for a given language. There are several advantages to this approach: (i) Working with such corpora obviates the problems encountered when using Internet search engines in quantitative linguistic research (such as non-transparent ranking algorithms). (ii) Creating a corpus from web data is virtually free. (iii) The size of corpora compiled from the WWW may exceed by several orders of magnitude the size of language resources offered elsewhere. (iv) The data is locally available to the user, and it can be linguistically post-processed and queried with the tools preferred by her/him. This book addresses the main practical tasks in the creation of web corpora up to giga-token size. Among these tasks are the sampling process (i.e., web crawling) and the usual cleanups including boilerplate removal and removal of duplicated content. Linguistic processing and problems with linguistic processing coming from the different kinds of noise in web corpora are also covered. Finally, the authors show how web corpora can be evaluated and compared to other corpora (such as traditionally compiled corpora).
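
As a rough illustration of the "removal of duplicated content" step mentioned above, the following sketch drops exact duplicates by hashing normalized text and scores near-duplicates with word-shingle Jaccard similarity. It is not the authors' method: the function names, the normalization rule, and the shingle size `n=3` are assumptions chosen for the example.

```python
import hashlib
import re


def normalize(text):
    """Lowercase and collapse whitespace so trivially different copies match."""
    return re.sub(r"\s+", " ", text).strip().lower()


def remove_exact_duplicates(documents):
    """Keep the first occurrence of each document, comparing hashes of normalized text."""
    seen = set()
    unique = []
    for doc in documents:
        digest = hashlib.sha1(normalize(doc).encode("utf-8")).hexdigest()
        if digest not in seen:
            seen.add(digest)
            unique.append(doc)
    return unique


def jaccard(a, b, n=3):
    """Jaccard similarity of word n-gram (shingle) sets, usable as a near-duplicate score."""
    def shingles(text):
        words = normalize(text).split()
        return {tuple(words[i:i + n]) for i in range(max(len(words) - n + 1, 1))}

    sa, sb = shingles(a), shingles(b)
    return len(sa & sb) / len(sa | sb)


# Hypothetical crawled pages: the second differs from the first only in case and spacing.
pages = ["Hello  world", "hello world", "Other page"]
print(remove_exact_duplicates(pages))
print(jaccard("a b c d", "a b c e"))
```

Pairwise Jaccard comparison does not scale to giga-token corpora; it is shown here only to make the notion of a near-duplicate concrete.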

Bitext Alignment

Author : Jörg Tiedemann
Publisher : Morgan & Claypool Publishers
ISBN 13 : 1608455106
Total Pages : 168 pages
Book Rating : 4.6/5 (84 download)

Book Synopsis Bitext Alignment by : Jörg Tiedemann

Bitext Alignment, written by Jörg Tiedemann and published by Morgan & Claypool Publishers, was released in 2011 and has 168 pages. Book excerpt: This book provides an overview of various techniques for the alignment of bitexts. It describes general concepts and strategies that can be applied to map corresponding parts in parallel documents on various levels of granularity. Bitexts are valuable linguistic resources for many different research fields and practical applications. The predominant application is machine translation, in particular, statistical machine translation. However, there are various other threads that can be followed which may be supported by the rich linguistic knowledge implicitly stored in parallel resources. Bitexts have been explored in lexicography, word sense disambiguation, terminology extraction, computer-aided language learning and translation studies, to name just a few. The book covers the essential tasks that have to be carried out when building parallel corpora starting from the collection of translated documents up to sub-sentential alignments. In particular, it describes various approaches to document alignment, sentence alignment, word alignment and tree structure alignment. It also includes a list of resources and a comprehensive review of the literature on alignment techniques. Table of Contents: Introduction / Basic Concepts and Terminology / Building Parallel Corpora / Sentence Alignment / Word Alignment / Phrase and Tree Alignment / Concluding Remarks
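
To give a feel for what sentence alignment involves, here is a deliberately simplified sketch of a length-based dynamic program. It is not drawn from the book and is not a faithful implementation of any published aligner: the cost function (absolute difference in character length), the bead types, and the penalty values are assumptions picked for illustration.

```python
def align_sentences(src, tgt, skip_penalty=10.0, merge_penalty=2.0):
    """Align two sentence lists with a simple length-based dynamic program.

    Allowed beads: 1-1, 1-0, 0-1, 2-1 and 1-2. A bead costs the absolute
    difference in character length plus a penalty for non 1-1 beads.
    Returns a list of (source_indices, target_indices) pairs.
    """
    beads = [(1, 1, 0.0), (1, 0, skip_penalty), (0, 1, skip_penalty),
             (2, 1, merge_penalty), (1, 2, merge_penalty)]
    n, m = len(src), len(tgt)
    inf = float("inf")
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    back = [[None] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    # Forward pass: relax every bead leaving cell (i, j).
    for i in range(n + 1):
        for j in range(m + 1):
            if cost[i][j] == inf:
                continue
            for di, dj, penalty in beads:
                ni, nj = i + di, j + dj
                if ni > n or nj > m:
                    continue
                src_len = sum(len(s) for s in src[i:ni])
                tgt_len = sum(len(t) for t in tgt[j:nj])
                c = cost[i][j] + abs(src_len - tgt_len) + penalty
                if c < cost[ni][nj]:
                    cost[ni][nj] = c
                    back[ni][nj] = (i, j)
    # Backward pass: follow the stored pointers from the end to the start.
    alignment = []
    i, j = n, m
    while (i, j) != (0, 0):
        pi, pj = back[i][j]
        alignment.append((list(range(pi, i)), list(range(pj, j))))
        i, j = pi, pj
    return alignment[::-1]


# Hypothetical bitext in which one source sentence was split in translation.
english = ["Hello.", "How are you today?", "Fine."]
french = ["Bonjour.", "Comment vas-tu", "aujourd'hui ?", "Bien."]
for src_ids, tgt_ids in align_sentences(english, french):
    print(src_ids, tgt_ids)
```

Real aligners typically replace the raw length difference with a probabilistic length model or add lexical cues; the sketch only shows the overall dynamic-programming shape.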

Neural Machine Translation

Author : Philipp Koehn
Publisher : Cambridge University Press
ISBN 13 : 1108497322
Total Pages : 409 pages
Book Rating : 4.1/5 (84 download)

Book Synopsis Neural Machine Translation by : Philipp Koehn

Neural Machine Translation, written by Philipp Koehn and published by Cambridge University Press, was released on 2020-06-18 and has 409 pages. Book excerpt: Learn how to build machine translation systems with deep learning from the ground up, from basic concepts to cutting-edge research.

Computational Analysis and Understanding of Natural Languages: Principles, Methods and Applications

Author :
Publisher : Elsevier
ISBN 13 : 0444640436
Total Pages : 540 pages
Book Rating : 4.4/5 (446 download)

Book Synopsis Computational Analysis and Understanding of Natural Languages: Principles, Methods and Applications by :

Computational Analysis and Understanding of Natural Languages: Principles, Methods and Applications, published by Elsevier, was released on 2018-08-27 and has 540 pages. Book excerpt: Computational Analysis and Understanding of Natural Languages: Principles, Methods and Applications, Volume 38, the latest release in this monograph series, provides a cohesive and integrated exposition of recent advances and associated applications. It includes new chapters on Linguistics: Core Concepts and Principles, Grammars, Open-Source Libraries, Application Frameworks, Workflow Systems, Mathematical Essentials, Probability, Inference and Prediction Methods, Random Processes, Bayesian Methods, Machine Learning, Artificial Neural Networks for Natural Language Processing, Information Retrieval, Language Core Tasks, Language Understanding Applications, and more. The synergistic confluence of linguistics, statistics, big data, and high-performance computing is the underlying force for the recent and dramatic advances in analyzing and understanding natural languages, hence making this series all the more important.
- Provides a thorough treatment of open-source libraries, application frameworks and workflow systems for natural language analysis and understanding
- Presents new chapters on Linguistics: Core Concepts and Principles, Grammars, Open-Source Libraries, Application Frameworks, Workflow Systems, Mathematical Essentials, Probability, and more

Natural Language Understanding in a Semantic Web Context

Author : Caroline Barrière
Publisher : Springer
ISBN 13 : 9783319413358
Total Pages : 0 pages
Book Rating : 4.4/5 (133 download)

Book Synopsis Natural Language Understanding in a Semantic Web Context by : Caroline Barrière

Natural Language Understanding in a Semantic Web Context, written by Caroline Barrière and published by Springer, was released on 2016-11-24. Book excerpt: This book serves as a starting point for Semantic Web (SW) students and researchers interested in discovering what Natural Language Processing (NLP) has to offer. NLP can effectively help uncover the large portions of data held as unstructured text in natural language, thus augmenting the real content of the Semantic Web in a significant and lasting way. The book covers the basics of NLP, with a focus on Natural Language Understanding (NLU), referring to semantic processing, information extraction and knowledge acquisition, which are seen as the key links between the SW and NLP communities. Major emphasis is placed on mining sentences in search of entities and relations. In the course of this “quest”, challenges will be encountered for various text analysis tasks, including part-of-speech tagging, parsing, semantic disambiguation, named entity recognition and relation extraction. Standard algorithms associated with these tasks are presented to provide an understanding of the fundamental concepts. Furthermore, the importance of experimental design and result analysis is emphasized, and accordingly, most chapters include small experiments on corpus data with quantitative and qualitative analysis of the results. This book is divided into four parts. Part I “Searching for Entities in Text” is dedicated to the search for entities in textual data. Next, Part II “Working with Corpora” investigates corpora as valuable resources for NLP work. In turn, Part III “Semantic Grounding and Relatedness” focuses on the process of linking surface forms found in text to entities in resources. Finally, Part IV “Knowledge Acquisition” delves into the world of relations and relation extraction.
The book also includes three appendices: “A Look into the Semantic Web” gives a brief overview of the Semantic Web and is intended to bring readers less familiar with the Semantic Web up to speed, so that they too can fully benefit from the material of this book. “NLP Tools and Platforms” provides information about NLP platforms and tools, while “Relation Lists” gathers lists of relations under different categories, showing how relations can be varied and serve different purposes. And finally, the book includes a glossary of over 200 terms commonly used in NLP. The book offers a valuable resource for graduate students specializing in SW technologies and professionals looking for new tools to improve the applicability of SW techniques in everyday life – or, in short, everyone looking to learn about NLP in order to expand his or her horizons. It provides a wealth of information for readers new to both fields, helping them understand the underlying principles and the challenges they may encounter.

Theory And Practice Of Computation - Proceedings Of Workshop On Computation: Theory And Practice (Wctp2015)

Author : Shin-ya Nishizaki
Publisher : World Scientific
ISBN 13 : 9813202823
Total Pages : 250 pages
Book Rating : 4.8/5 (132 download)

Book Synopsis Theory And Practice Of Computation - Proceedings Of Workshop On Computation: Theory And Practice (Wctp2015) by : Shin-ya Nishizaki

Theory And Practice Of Computation - Proceedings Of Workshop On Computation: Theory And Practice (Wctp2015), written by Shin-ya Nishizaki and published by World Scientific, was released on 2017-02-24 and has 250 pages. Book excerpt: This is the proceedings of the Fourth Workshop on Computation: Theory and Practice, WCTP 2015, devoted to theoretical and practical approaches to computation. This workshop was organized by four top universities in Japan and the Philippines: Tokyo Institute of Technology, Osaka University, University of the Philippines - Diliman, and De La Salle University. The proceedings provides a view of the current movement in research in these two countries. The papers included in the proceedings focus on two research areas: theoretical and practical aspects of computation.

Corpus-based Language Studies

Author : Tony McEnery
Publisher : Taylor & Francis
ISBN 13 : 9780415286237
Total Pages : 412 pages
Book Rating : 4.2/5 (862 download)

Book Synopsis Corpus-based Language Studies by : Tony McEnery

Corpus-based Language Studies, written by Tony McEnery and published by Taylor & Francis, was released in 2006 and has 412 pages. Book excerpt: Covering the major approaches to the use of corpus data, this work gathers together influential readings from leading names in the discipline, including Biber, Widdowson, Sinclair, Carter and McCarthy.

Using Comparable Corpora for Under-resourced Areas of Machine Translation

Author : Inguna Skadina
Publisher :
ISBN 13 : 9783319990057
Total Pages : 323 pages
Book Rating : 4.9/5 (9 download)

Book Synopsis Using Comparable Corpora for Under-resourced Areas of Machine Translation by : Inguna Skadina

Using Comparable Corpora for Under-resourced Areas of Machine Translation, written by Inguna Skadina, was released in 2019 and has 323 pages. Book excerpt: This book provides an overview of how comparable corpora can be used to overcome the lack of parallel resources when building machine translation systems for under-resourced languages and domains. It presents a wealth of methods and open tools for building comparable corpora from the Web, evaluating comparability and extracting parallel data that can be used for the machine translation task. It is divided into several sections, each covering a specific task such as building, processing, and using comparable corpora, focusing particularly on under-resourced language pairs and domains. The book is intended for anyone interested in data-driven machine translation for under-resourced languages and domains, especially for developers of machine translation systems, computational linguists and language workers. It offers a valuable resource for specialists and students in natural language processing, machine translation, corpus linguistics and computer-assisted translation, and promotes the broader use of comparable corpora in natural language processing and computational linguistics.

Corpus Linguistics. Volume 1

Author : Anke Lüdeling
Publisher : Walter de Gruyter
ISBN 13 : 3110211424
Total Pages : 797 pages
Book Rating : 4.1/5 (12 download)

Book Synopsis Corpus Linguistics. Volume 1 by : Anke Lüdeling

Corpus Linguistics. Volume 1, written by Anke Lüdeling and published by Walter de Gruyter, was released on 2008-12-10 and has 797 pages. Book excerpt: This volume provides an up-to-date survey of the field of corpus linguistics, a field whose methodology has revolutionized much of the empirical work done in most fields of linguistic study over the past decade. Corpus linguistics investigates human language by starting out from large collections of texts - spoken, written, or recorded. These language corpora, which are now regularly available in electronic form, are the basis for quantitative and qualitative research on almost any question of linguistic interest. Many techniques that are in use in corpus linguistics today are rooted in the tradition of the late 18th and 19th century, when linguistics began to make use of mathematical and empirical methods. Modern corpus linguistics has used and developed these methods in close connection with computer science and computational linguistics. The handbook sketches the history of corpus linguistics, shows its potential, discusses its problems, and describes various methods of collecting, annotating, and searching corpora as well as processing corpus data. It also reports case studies that illustrate the wide range of linguistic research questions addressed in corpus linguistics. The more than 60 articles included in the handbook are divided into five sections: (1) the origins and history of corpus linguistics and surveys of its relationship to central fields of linguistics; (2) corpus compilation; (3) corpus types; (4) preprocessing of corpora; (5) the use and exploitation of corpora. The final section gives an overview of the results of corpus studies obtained in phonetics, phonology, morphology, syntax, semantics, sociolinguistics, historical linguistics, stylometry, dialectology, and discourse analysis.
It also reports on recent advances made in human and machine translation, contrastive studies, computer-assisted language learning, and automatic summarization. The contributors to the volume are internationally known experts in their respective fields. The handbook is intended for a wide audience ranging from teachers, university students, and scholars to anyone interested in the use of computers in linguistic analyses and applications.

Annotation, exploitation and evaluation of parallel corpora: TC3 I

Author : Silvia Hansen-Schirra
Publisher : Language Science Press
ISBN 13 : 3946234852
Total Pages : 165 pages
Book Rating : 4.9/5 (462 download)

Book Synopsis Annotation, exploitation and evaluation of parallel corpora: TC3 I by : Silvia Hansen-Schirra

Annotation, exploitation and evaluation of parallel corpora: TC3 I, written by Silvia Hansen-Schirra and published by Language Science Press, was released on 2017-02-27 and has 165 pages. Book excerpt: Exchange between the translation studies and the computational linguistics communities has traditionally not been very intense. Among other things, this is reflected by the different views on parallel corpora. While computational linguistics does not always strictly pay attention to the translation direction (e.g. when translation rules are extracted from (sub)corpora which actually only consist of translations), translation studies are, among other things, concerned with exactly comparing source and target texts (e.g. to draw conclusions on interference and standardization effects). However, there has recently been more exchange between the two fields – especially when it comes to the annotation of parallel corpora. This special issue brings together the different research perspectives. Its contributions show – from both perspectives – how the communities have come to interact in recent years.

Statistical Machine Translation

Author : Philipp Koehn
Publisher : Cambridge University Press
ISBN 13 : 0521874157
Total Pages : 447 pages
Book Rating : 4.5/5 (218 download)

Book Synopsis Statistical Machine Translation by : Philipp Koehn

Statistical Machine Translation, written by Philipp Koehn and published by Cambridge University Press, was released in 2010 and has 447 pages. Book excerpt: The dream of automatic language translation is now closer thanks to recent advances in the techniques that underpin statistical machine translation. This class-tested textbook from an active researcher in the field provides a clear and careful introduction to the latest methods and explains how to build machine translation systems for any two languages. It introduces the subject's building blocks from linguistics and probability, then covers the major models for machine translation: word-based, phrase-based, and tree-based, as well as machine translation evaluation, language modeling, discriminative training and advanced methods to integrate linguistic annotation. The book also reports the latest research, presents the major outstanding challenges, and enables novices as well as experienced researchers to make novel contributions to this exciting area. Ideal for students at undergraduate and graduate level, or for anyone interested in the latest developments in machine translation.

A Practical Handbook of Corpus Linguistics

Author : Magali Paquot
Publisher : Springer Nature
ISBN 13 : 3030462161
Total Pages : 686 pages
Book Rating : 4.0/5 (34 download)

Book Synopsis A Practical Handbook of Corpus Linguistics by : Magali Paquot

A Practical Handbook of Corpus Linguistics, written by Magali Paquot and published by Springer Nature, was released on 2021-05-04 and has 686 pages. Book excerpt: This handbook is a comprehensive practical resource on corpus linguistics. It features a range of basic and advanced approaches, methods and techniques in corpus linguistics, from corpus compilation principles to quantitative data analyses. The Handbook is organized in six Parts. Parts I to III feature chapters that discuss key issues and the know-how related to various topics around corpus design, methods and corpus types. Parts IV-V aim to offer a user-friendly introduction to the quantitative analysis of corpus data: for each statistical technique discussed, chapters provide a practical guide with R and come with supplementary online material. Part VI focuses on how to write a corpus linguistic paper and how to meta-analyze corpus linguistic research. The volume can serve as a course book as well as for individual study. It will be essential reading for students of corpus linguistics as well as experienced researchers who want to expand their knowledge of the field.

International Conference on Innovative Computing and Communications

Author : Deepak Gupta
Publisher : Springer Nature
ISBN 13 : 9811551138
Total Pages : 1152 pages
Book Rating : 4.8/5 (115 download)

Book Synopsis International Conference on Innovative Computing and Communications by : Deepak Gupta

International Conference on Innovative Computing and Communications, written by Deepak Gupta and published by Springer Nature, was released on 2020-08-01 and has 1152 pages. Book excerpt: This book includes high-quality research papers presented at the Third International Conference on Innovative Computing and Communication (ICICC 2020), which was held at the Shaheed Sukhdev College of Business Studies, University of Delhi, Delhi, India, on 21–23 February, 2020. Introducing the innovative works of scientists, professors, research scholars, students and industrial experts in the field of computing and communication, the book promotes the transformation of fundamental research into institutional and industrialized research and the conversion of applied exploration into real-time applications.

English Corpus Linguistics

Author : Charles F. Meyer
Publisher : Cambridge University Press
ISBN 13 : 1009365428
Total Pages : 211 pages
Book Rating : 4.0/5 (93 download)

Book Synopsis English Corpus Linguistics by : Charles F. Meyer

English Corpus Linguistics, written by Charles F. Meyer and published by Cambridge University Press, was released on 2023-06-30 and has 211 pages. Book excerpt: Corpus linguistics is a research method which draws on authentic language examples, collected and organized into 'corpora', or searchable 'bodies' of data. The method was established in the 1960s, and has rapidly developed since then. Now in its second edition, this book provides a step-by-step guide on how to create and analyze linguistic corpora. It has been extensively updated to reflect the most recent developments in this ever-evolving field, and now covers the empirical foundation of corpus-based research, new methodological considerations that guide the creation of a corpus, new kinds of research that can be conducted on corpora, and the most up-to-date information on how qualitative and quantitative analyses of corpora are conducted. Theoretical approaches are introduced in an accessible, easy-to-read way, and the book is illustrated with a wide range of different linguistic corpora, making it essential reading for researchers and students in a number of subfields of linguistics.