LREC 2000 2nd International Conference on Language Resources & Evaluation | |
Conference Papers
Papers by paper title: A B C D E F G H I J K L M N O P Q R S T U V W X Y Z Papers by ID number: 1-50, 51-100, 101-150, 151-200, 201-250, 251-300, 301-350, 351-377. |
Previous Paper Next Paper
Title | A Web-based Advanced and User Friendly System: The Oslo Corpus of Tagged Norwegian Texts |
Authors |
Johannessen Janne Bondi (The Text Laboratory, University of Oslo, P.O. Box 1102 Blindern, N-0317 Oslo, Norway, j.m.b.johannessen@ilf.uio.no) Noklestad Anders (The Text Laboratory, University of Oslo, P.O. Box 1102 Blindern, N-0317 Oslo, Norway, anders.noklestad@ilf.uio.no) Hagen Kristin (The Text Laboratory, University of Oslo, P.O. Box 1102 Blindern, N-0317 Oslo, Norway, kristin.hagen@ilf.uio.no) |
Keywords | Accessibility, Corpus, Grammatical Tagging, User Friendly Search System, Web Interface |
Session | Session WP8 - Corpus Tools |
Abstract | A general purpose text corpus meant for linguists and lexicographers needs to satify quality criteria at at least four different levels. The first two criteria are fairly well established; the corpus should have a wide variety of texts and be tagged according to a fine-grained system. The last two criteria are much less widely appreciated, unfortunately. One has to do with variety of search criteria: the user should be allowed to search for any information contained in the corpus, and with any combination possible. In addition, the search results should be presented in a choice of ways. The fourth criterion has to do with accessability. It is a rather surprising fact that while user interfaces tend to be simple and self explanatory in most areas of life represented electronically, corpus interfaces are still extremely user unfriendly. In this paper, we present a corpus whose interface we have given a lot of thought, and likewise the possible search options, viz. the Oslo Corpus of Tagged Norwegian Texts. |