Title |
A Digital Archive of Research Papers in Computer Science |
Authors |
Manuela Sassi, Gabriella Pardelli, Stefania Biagioni, Carlo Carlesi and Sara Goggi |
Abstract |
This paper presents the results of a terminological work conducted by the authors on a Digital Archives Net of the Italian National Research Council (CNR) in the field of Computer Science. In particular, the research tends to analyse the use of certain terms in Computer Science in order to verify their change over the time with the aim of retrieving from the net the very essence of documentation. Its main source is a reference corpus made up of 13,500 documents which collects the scientific productions of CNR. This study is divided in three sections: 1) an introductory one dedicated to the data extracted from the scientific documentation; 2) the second section is devoted to the description of the contents managed by the PUMA system; 3) the third part contains a statistical representation of terms extracted from archive: some comparison tables between the occurrences of the most used terms in the scientific documentation will be created and diagrams with percentages about the most frequently used terms will be displayed too. Indexes and concordances will allow to reflect on the use of certain terms in this field and give possible keys for having access to the extraction of knowledge. |
Topics |
Digital libraries, Document Classification, Text categorisation, Text mining |
Full paper |
A Digital Archive of Research Papers in Computer Science |
Slides |
- |
Bibtex |
@InProceedings{SASSI10.945,
author = {Manuela Sassi and Gabriella Pardelli and Stefania Biagioni and Carlo Carlesi and Sara Goggi}, title = {A Digital Archive of Research Papers in Computer Science}, booktitle = {Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10)}, year = {2010}, month = {may}, date = {19-21}, address = {Valletta, Malta}, editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Bente Maegaard and Joseph Mariani and Jan Odijk and Stelios Piperidis and Mike Rosner and Daniel Tapias}, publisher = {European Language Resources Association (ELRA)}, isbn = {2-9517408-6-7}, language = {english} } |