Networks and Knowledge in Roget s Thesaurus

Author: Werner Hüllen
Publisher: OUP Oxford
ISBN: 0191564680
Format: PDF, ePub, Mobi
Download Now
In this book Werner Hüllen examines Roget's Thesaurus in relation to linguistics, philosophy and history. He explores the influence of Roget's Thesaurus abroad (Germany and the Romance countries). He epitomizes its history and compares the various editions of the book. In lexical case studies he evaluates some entries with pertinence to their cultural and political implications. He discusses the didactic potential of thesauri in general and considers the implications of the Thesaurus for the study of scholarly linguistics and psychology. He discusses how Roget's Thesaurus prepared the way for the more recent idea of network semantics. By analyzing retrieval techniques one can show, he claims, how the words of languages were (and are) stored in the minds of those who speak them. Professor Hüllen concludes by considering the role of synonymy in language from a perspective of cognitive linguistics showing that it is indispensable for communication.

Mapping Scientific Frontiers

Author: Chaomei Chen
Publisher: Springer Science & Business Media
ISBN: 9781852334949
Format: PDF, ePub
Download Now
This is a comprehensive introduction to scientific visualization. It provides a complete history of the development of the field with illustrations of how the techniques can be applied in different field, including the history itself.

A History of Roget s Thesaurus Origins Development and Design

Author: Werner Hüllen
Publisher: OUP Oxford
ISBN: 0191555010
Format: PDF, ePub
Download Now
In 1852 Peter Mark Roget eclipsed a rich tradition of topically based dictionaries with the publication of his Thesaurus of English Words and Phrases, Classified and Arranged so as to Facilitate the Expression of Ideas. Based on intuition as much as on specific linguistic principles, Roget's book has been a bestseller ever since and is one of the most widely-used reference works ever published. In this book Werner H--uuml--;llen gives the first history of its genesis and publication, and investigates the principles of its structural design. The author opens with an account of Roget's life and his background in natural science. He then charts the parallel histories of dictionaries of synonyms and concepts within the general context of lexicography. Synonymy, he argues, is a necessary feature of languages without which communication would be impossible. He traces its theory and practice from Plato to the emergence of French and English synonym dictionaries in the seventeenth century. Roget's was the first such book to be arranged by topic and the first to encompass the semantic network of the entire language. The author examines the manner and method of its compilation, the practical outcomes of the traditions on which it was based, and the ways in which the Thesaurus reflects and reveals Roget's beliefs and background. A History of Roget's Thesaurus will interest students and scholars of linguistics, semantics, and lexicography, as well as anyone wishing to know more about a great literary achievement and an astonishing publishing phenomenon.

Big Data Analytics and Knowledge Discovery

Author: Sanjay Madria
Publisher: Springer
ISBN: 3319439464
Format: PDF, ePub
Download Now
This book constitutes the refereed proceedings of the 18th International Conference on Data Warehousing and Knowledge Discovery, DaWaK 2016, held in Porto, Portugal, September 2016. The 25 revised full papers presented were carefully reviewed and selected from 73 submissions. The papers are organized in topical sections on Mining Big Data, Applications of Big Data Mining, Big Data Indexing and Searching, Big Data Learning and Security, Graph Databases and Data Warehousing, Data Intelligence and Technology.

Large Scale Knowledge Resources Construction and Application

Author: Takenobu Tokunaga
Publisher: Springer
ISBN: 3540781595
Format: PDF, ePub
Download Now
Atthestartofthe21stcentury,wearenowwellonthewaytowardsaknowled- intensive society, in which knowledge plays ever more important roles. Thus, research interest should inevitably shift from information to knowledge, with the problems of building, organizing, maintaining and utilizing knowledge - coming centralissues in a wide varietyof ?elds. The 21stCentury COE program “Framework for Systematization and Application of Large-scale Knowledge - sources (COE-LKR)” conducted by the Tokyo Institute of Technology is one of several early attempts worldwide to address these important issues. Inspired by this project, LKR2008 aimed at bringing together diverse contributions in cognitive science, computer science, education and linguistics to explore design, construction, extension, maintenance, validation and application of knowledge. Respondingtoourcallforpapers,wereceived38submissionfromavarietyof researchareas.EachpaperwasreviewedbythreeProgramCommitteemembers. Since we were aiming at an interdisciplinary conference covering a wide range of topics concerning large-scale knowledge resources (LKR), each paper was assigned a reviewer from a topic area outside the main thrust of the paper. This reviewer was asked to assess whether the authors described the moti- tion and importance of their work in a comprehensible manner even for readers in other research areas. Following a rigorous reviewing process, we accepted 14 regular papers and 12 poster papers.

Explorations in Automatic Thesaurus Discovery

Author: Gregory Grefenstette
Publisher: Springer Science & Business Media
ISBN: 1461527104
Format: PDF, ePub
Download Now
Explorations in Automatic Thesaurus Discovery presents an automated method for creating a first-draft thesaurus from raw text. It describes natural processing steps of tokenization, surface syntactic analysis, and syntactic attribute extraction. From these attributes, word and term similarity is calculated and a thesaurus is created showing important common terms and their relation to each other, common verb--noun pairings, common expressions, and word family members. The techniques are tested on twenty different corpora ranging from baseball newsgroups, assassination archives, medical X-ray reports, abstracts on AIDS, to encyclopedia articles on animals, even on the text of the book itself. The corpora range from 40,000 to 6 million characters of text, and results are presented for each in the Appendix. The methods described in the book have undergone extensive evaluation. Their time and space complexity are shown to be modest. The results are shown to converge to a stable state as the corpus grows. The similarities calculated are compared to those produced by psychological testing. A method of evaluation using Artificial Synonyms is tested. Gold Standards evaluation show that techniques significantly outperform non-linguistic-based techniques for the most important words in corpora. Explorations in Automatic Thesaurus Discovery includes applications to the fields of information retrieval using established testbeds, existing thesaural enrichment, semantic analysis. Also included are applications showing how to create, implement, and test a first-draft thesaurus.