Classical Chinese poetry is a major form and a brilliant heritage of Chinese culture, and is therefore worth inheriting and carrying forward. After a long period of development, these poems are numerous and span diverse genres, but the knowledge they contain is mostly hidden in free text, which impedes the spread of Chinese culture. It is therefore urgent to organize and process Chinese poetry more efficiently and to build knowledge sources that can support further mining, analysis, and propagation of the poetry. In this paper, we take a preliminary step towards this goal and construct a geo-tagged Chinese poetry corpus. We first give a basic annotation criterion to guide the tagging process and obtain consistent results. We then present details of the data collection, annotation, and statistics, from which a geo-tagged corpus of 8000 Chinese poems is built. Finally, we use the corpus to generate a geographic visualization, demonstrating its effectiveness in promoting the comprehension of Chinese poetry.
@InProceedings{ZHANG18.12,
  author    = {Weili Zhang and Xianpei Han},
  title     = {A Geo-Tagged Chinese Poetry Corpus},
  booktitle = {Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)},
  year      = {2018},
  month     = {may},
  date      = {7-12},
  location  = {Miyazaki, Japan},
  editor    = {Erhong Yang and Le Sun},
  publisher = {European Language Resources Association (ELRA)},
  address   = {Paris, France},
  isbn      = {979-10-95546-29-0},
  language  = {english}
}