Title |
Introduction of KIBS (Korean Information Base System) Project |
Authors |
Chae Young-Soog (Korea Terminology Research Center for Language and Knowledge Engineering, Department of Computer Science, Korea Advanced Institute of Science and Technology, 373-1 Kusong-dong Yusong-gu Taejon 305-701 Korea, yschae@korterm.kaist.ac.kr) Choi Key-Sun (Korea Terminology Research Center for Language and Knowledge Engineering, Department of Computer Science, Korea Advanced Institute of Science and Technology, 373-1 Kusong-dong Yusong-gu Taejon 305-701 Korea, kschoi@korterm.kaist.ac.kr) |
Keywords |
Analytic Tools, Corpus, Information Base, Project Introduction, Tree Bank |
Session |
Session WP8 - Corpus Tools |
Full Paper |
239.ps, 239.pdf |
Abstract |
This project has been carried out on the basis of resources and tools for Korean NLP. The main research is the construction of raw corpus of 64 million tokens and Part-of-Speech tagged corpus of about 11 million tokens. And we develop some analytic tools to construct and some supporting tools to navigate them. This paper represents the present state of the work carried out by the KIBS project. We introduce a KAIST tag set of POS and syntax for standard corpus and annotation principles. And we explain several error types represented in tagged corpus. |