LREC 2018 Proceedings

Summary of the paper

Title	Semantic Frame Parsing for Information Extraction : the CALOR corpus
Authors	Gabriel Marzinotto, Jeremy Auguste, Frederic Bechet, Géraldine Damnati and Alexis Nasr
Abstract	This paper presents a publicly available corpus of French encyclopedic history texts annotated according to the Berkeley FrameNet formalism. The main difference between our approach and previous work on semantic parsing with FrameNet is that we are interested not in full-text parsing but in partial parsing. The goal is to select from the FrameNet resources the minimal set of frames that will be useful for the targeted application, in our case Information Extraction from encyclopedic documents. Such an approach allows the manual annotation of larger corpora than those obtained through full-text parsing, and therefore opens the door to methods for frame parsing other than those used so far on the FrameNet 1.5 benchmark corpus. The approaches compared in this study rely on an integrated sequence labeling model that jointly optimizes frame identification and semantic role segmentation and identification. The models compared are CRFs and multitask bi-LSTMs.
Topics	Statistical And Machine Learning Methods, Corpus (Creation, Annotation, Etc.), Semantics
Full paper	Semantic Frame Parsing for Information Extraction : the CALOR corpus
Bibtex	@InProceedings{MARZINOTTO18.527,
  author = {Gabriel Marzinotto and Jeremy Auguste and Frederic Bechet and Géraldine Damnati and Alexis Nasr},
  title = "{Semantic Frame Parsing for Information Extraction : the CALOR corpus}",
  booktitle = {Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)},
  year = {2018},
  month = {May 7-12, 2018},
  address = {Miyazaki, Japan},
  editor = {Nicoletta Calzolari (Conference chair) and Khalid Choukri and Christopher Cieri and Thierry Declerck and Sara Goggi and Koiti Hasida and Hitoshi Isahara and Bente Maegaard and Joseph Mariani and Hélène Mazo and Asuncion Moreno and Jan Odijk and Stelios Piperidis and Takenobu Tokunaga},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {979-10-95546-00-9},
  language = {english}
}