LREC 2006 - Proceedings sorted by sessions

Authors	Titles
Session O1-W Lexicons, Semantics & Infrastructural Issues :
Y. Hayashi, T. Ishida	A Dictionary Model for Unifying Machine Readable Dictionaries and Computational Concept Lexicons
C. Soria, M. Tesconi, F. Bertagna, N. Calzolari, A. Marchetti, M. Monachini	Moving to dynamic computational lexicons with LeXFlow
V. Cavalli-Sforza, A. Soudi	IMORPHĒ: An Inheritance and Equivalence Based Morphology Description Compiler
A. Itai, S. Wintner, S. Yona	A Computational Lexicon of Contemporary Hebrew
E. Agirre, I. Aldezabal, J. Etxeberria, E. Izagirre, K. Mendizabal, E. Pociello, M. Quintian	A methodology for the joint development of the Basque WordNet and Semcor
S. Elkateb, W. Black, H. Rodríguez, M. Alkhalifa, E. Aribau, P. Vossen, A. Pease, C. Fellbaum	Building a WordNet for Arabic
V. Tablan, T. Polajnar, H. Cunningham, K. Bontcheva	User-friendly ontology authoring using a controlled language
Session O2-EW Named Entity Recognition & Time Annotation :
L. Nezda, A. Hickl, J. Lehmann, S. Fayyaz	What in the World is a Shahab? Wide Coverage Named Entity Recognition for Arabic
I. Michailidis, K. Diamantaras, S. Vasileiadis, Y. Frère	Greek Named Entity Recognition using Support Vector Machines, Maximum Entropy and Onetime
B. Pouliquen, M. Kimler, R. Steinberger, C. Ignat, T. Oellinger, K. Blackler, F. Fluart, W. Zaghouani, A. Widiger, A. Forslund, C. Best	Geocoding Multilingual Texts: Recognition, Disambiguation and Visualisation
V. Giouli, A. Konstandinidis, E. Desypri, H. Papageorgiou	Multi-domain Multi-lingual Named Entity Recognition: Revisiting & Grounding the resources issue
C. Forăscu, I. Pistol, D. Cristea	Temporality in relation with discourse structure
B. Boguraev, R. Ando	Analysis of TimeBank as a Resource for TimeML Parsing
F. Pan, R. Mulkar, J. Hobbs	An Annotated Corpus of Typical Durations of Events
Session O3-EW Authoring Tools, Information Extraction and Retrieval :
F. Bustamante, A. Arnaiz, M. Ginés	A Spell Checker for a World Language: The New Microsoft’s Spanish Spell Checker
M. Reynaert	Corpus-Induced Corpus Clean-up
F. Bustamante, E. Díaz	Spelling Error Patterns in Spanish for Word Processing Applications
E. Luca, A. Nürnberger	Rebuilding Lexical Resources for Information Retrieval using Sense Folder Detection and Merging Methods
J. Iria, F. Ciravegna	A Methodology and Tool for Representing Language Resources for Information Extraction
N. Habash, C. Mah, S. Imran, R. Calistri-Yeh, P. Sheridan	Design, Construction and Validation of an Arabic-English Conceptual Interlingua for Cross-lingual Information Retrieval
A. Klassmann, F. Offenga, D. Broeder, R. Skiba, P. Wittenburg	Comparison of Resource Discovery Methods
Session O4-S Speech Corpora and Dialogue :
C. Cieri, W. Andrews, J. Campbell, G. Doddington, J. Godfrey, S. Huang, M. Liberman, A. Martin, H. Nakasone, M. Przybocki, K. Walker	The Mixer and Transcript Reading Corpora: Resources for Multilingual, Crosschannel Speaker Recognition Research
E. Grishina	Spoken Russian in the Russian National Corpus (RNC)
E. Barker, R. Higashinaka, F. Mairesse, R. Gaizauskas, M. Walker, J. Foster	Simulating Cub Reporter Dialogues: The collection of naturalistic human-human dialogues for information access to text archives
A. Nimaan, P. Nocera, J. Bonastre	Towards automatic transcription of Somali language
C. Cucchiarini, H. Hamme, O. Herwijnen, F. Smits	JASMIN-CGN: Extension of the Spoken Dutch Corpus with Speech of Elderly People, Children and Non-natives in the Human-Machine Interaction Modality
S. Galliano, E. Geoffrois, G. Gravier, J. Bonastre, D. Mostefa, K. Choukri	Corpus description of the ESTER Evaluation Campaign for the Rich Transcription of French Broadcast News
J. Polifroni, M. Walker	Learning Database Content for Spoken Dialogue System Design
Session O5-E Machine Translation and Evaluation :
D. Mostefa, O. Hamon, K. Choukri	Evaluation of Automatic Speech Recognition and Speech Translation within TC-STAR: Results from the first evaluation campaign
O. Hamon, M. Rajman	X-Score: Automatic Evaluation of Machine Translation Grammaticality
K. Miller, M. Vanni	Formal v. Informal: Register-Differentiated Arabic MT Evaluation in the PLATO Paradigm
E. Macklovitch	TransType2: The Last Word
U. Schäfer, D. Beck	Automatic Testing and Evaluation of Multilingual Language Technology Resources and Components
O. Hamon, A. Popescu-Belis, K. Choukri, M. Dabbadie, A. Hartley, W. Mustafa El Hadi, M. Rajman, I. Timimi	CESTA: First Conclusions of the Technolangue MT Evaluation Campaign
S. Strassel, C. Cieri, A. Cole, D. Dipersio, M. Liberman, X. Ma, M. Maamouri, K. Maeda	Integrated Linguistic Resources for Language Exploitation Technologies
Session O6-WMT Ontologies :
P. Cimiano, M. Hartung, E. Ratsch	Finding the Appropriate Generalization Level for Binary Ontological Relations Extracted from the Genia Corpus
J. Iria, C. Brewster, F. Ciravegna, Y. Wilks	An Incremental Tri-Partite Approach To Ontology Learning
G. Grefenstette, F. Debili, C. Fluhr, S. Zinger	Exploiting text for extracting image processing resources
R. Marinelli, A. Roventini, G. Spadoni	Using Core Ontology for Domain Lexicon Structuring
S. Li, Q. Lu, W. Li, R. Xu	Interaction between Lexical Base and Ontology with Formal Concept Analysis
R. Bartolini, C. Caracciolo, E. Giovannetti, A. Lenci, S. Marchi, V. Pirrelli, C. Renso, L. Spinsanti	Creation and Use of Lexicons and Ontologies for NL Interfaces to Databases
Session O7-GWT Standards :
N. Ide, L. Romary	Representing Linguistic Corpora and Their Annotations
T. Declerck	SynAF: Towards a Standard for Syntactic Annotation
G. Francopoulo, M. George, N. Calzolari, M. Monachini, N. Bel, M. Pet, C. Soria	Lexical Markup Framework (LMF)
M. Assem, A. Gangemi, G. Schreiber	Conversion of WordNet to a standard RDF/OWL representation
L. Henriksen, C. Povlsen, A. Vasiljevs	EuroTermBank - a Terminology Resource based on Best Practice
G. Tummarello, C. Morbidoni, F. Kepler, F. Piazza, P. Puliti	A novel Textual Encoding paradigm based on Semantic Web tools and semantics
Session O8-W Lexicon Acquisition :
P. Chesley, S. Salmon-Alt	Automatic extraction of subcategorization frames for French
A. Berglund, R. Johansson, P. Nugues	Extraction of Temporal Information from Texts in Swedish
M. Schiehlen, K. Spranger	The Mass-Count Distinction: Acquisition and Disambiguation
P. Smrž	Automatic Acquisition of Semantics-Extraction Patterns from Parallel Texts
Y. Zhang, V. Kordoni	Automated Deep Lexical Acquisition for Robust Open Texts Processing
S. Amsalu	Data-driven Amharic-English Bilingual Lexicon Acquisition
Session O9-SE TTS & Units for TTS :
Q. Yang, J. Martens, N. Konings, H. Heuvel	Development of a phoneme-to-phoneme (p2p) converter to improve the grapheme-to-phoneme (g2p) conversion of names
S. Breuer, S. Bergmann, R. Dragon, S. Möller	Set-up of a Unit-Selection Synthesis with a Prominent Voice
D. Oliver, K. Szklanny	Creation and analysis of a Polish speech database for use in unit selection synthesis
J. Pérez, A. Bonafonte, H. Hain, E. Keller, S. Breuer, J. Tian	ECESS Inter-Module Interface Specification for Speech Synthesis
M. Garcia, C. D’Alessandro, G. Bailly, P. Mareüil, M. Morel	A joint prosody evaluation of French text-to-speech synthesis systems
A. Bonafonte, H. Höge, I. Kiss, A. Moreno, U. Ziegenhain, H. Heuvel, H. Hain, X. Wang, M. Garcia	TC-STAR: Specifications of Language Resources and Evaluation for Speech Synthesis
Session O10-E Parsing Evaluation :
P. Paroubek, I. Robba, A. Vilnat, C. Ayache	Data, Annotations and Measures in EASY: the Evaluation Campaign for Parsers of French
G. Neumann, B. Crysmann	Exploring HPSG-based Treebanks for Probabilistic Parsing
R. Subba, B. Eugenio, E. Terenzi	Building lexical resources for PrincPar, a large coverage parser that generates principled semantic representations
B. Roark, M. Harper, E. Charniak, B. Dorr, M. Johnson, J. Kahn, Y. Liu, M. Ostendorf, J. Hale, A. Krasnyanskaya, M. Lease, I. Shafran, M. Snover, R. Stewart, L. Yung	SParseval: Evaluation Metrics for Parsing Speech
A. Denis, M. Quignard, G. Pitel	A Deep-Parsing Approach to Natural Language Understanding in Dialogue System: Results of a Corpus-Based Evaluation
T. Vanrullen, P. Blache, J. Balfourier	Constraint-Based Parsing as an Efficient Solution: Results from the Parsing Evaluation Campaign EASy
Session O11-W Corpus & Lexicon :
C. Fairon, S. Paumier	A translated corpus of 30,000 French SMS
I. Castellón, A. Fernández-Montraveta, G. Vázquez, L. Alonso, J. Capilla	The Sensem Corpus: a Corpus Annotated at the Syntactic and Semantic Level
S. Klatt	A Corpus-based Approach to the Interpretation of Unknown Words with an Application to German
K. Inui, T. Hirano, R. Iida, A. Fujita, Y. Matsumoto	Augmenting a Semantic Verb Lexicon with a Large Scale Collection of Example Sentences
Session O12-W Tokenization and Tagging :
M. Forst, R. Kaplan	The importance of precise tokenizing for deep grammars
F. Wang, X. Deng, F. Zou	Towards Unified Chinese Segmentation Algorithm
F. Pîrvan, D. Tufiş	Tagset Mapping and Statistical Training Data Cleaning-up
J. Hlaváčová	New Approach to Frequency Dictionaries — Czech Example
Session O13-M Multimodal Corpora & Natural Interaction :
N. Campbell, T. Sadanobu, M. Imura, N. Iwahashi, S. Noriko, D. Douxchamps	A Multimedia Database of Meetings and Informal Interactions for Tracking Participant Involvement and Discourse Flow
D. Byron, E. Fosler-Lussier	The OSU Quake 2004 corpus of two-party situated problem-solving dialogs
A. Green, H. Hüttenrauch, E. Topp, K. Severinson	Developing a Contextualized Multimodal Corpus for Human-Robot Interaction
S. Luz, M. Bouamrane, M. Masoodian	Gathering a corpus of multimodal computer-mediated meetings with focus on text and audio interaction
Session O14-W Sentiments & Semantics :
A. Andreevskaia, S. Bergler	Semantic Tag Extraction from WordNet Glosses
A. Esuli, F. Sebastiani	SENTIWORDNET: A Publicly Available Lexical Resource for Opinion Mining
C. Strapparava, A. Valitutti, O. Stock	The Affective Weight of Lexicon
M. Taboada, C. Anthony, K. Voll	Methods for Creating Semantic Orientation Dictionaries
Session O15-W Treebanks and Acquisition :
F. Dell’Orletta, A. Lenci, S. Montemagni, V. Pirrelli	Searching treebanks for functional constraints: cross-lingual experiments in grammatical relation assignment
V. Novak, J. Hajic	Perspectives of Turning Prague Dependency Treebank into a Knowledge Base
M. Maamouri, A. Bies, T. Buckwalter, M. Diab, N. Habash, O. Rambow, D. Tabessi	Developing and Using a Pilot Dialectal Arabic Treebank
M. Marneffe, B. MacCartney, C. Manning	Generating Typed Dependency Parses from Phrase Structure Parses
Session O16-W MWEs and Collocations :
A. Alonge	The Italian Metaphor Database
R. Osswald, H. Helbig, S. Hartrumpf	The Representation of German Prepositional Verbs in a Semantically Based Computer Lexicon
S. Sharoff, B. Babych, A. Hartley	Using collocations from comparable corpora to find translation equivalents
M. Sailer, B. Trawinski	The Collection of Distributionally Idiosyncratic Items: A Multilingual Resource for Linguistic Research
Session O17-TGW Multilingual Tools and Terminology :
D. Ştefănescu, D. Tufiş	Aligning Multilingual Thesauri
G. Ajani, G. Boella, L. Lesmo, M. Martin, A. Mazzei	A Development Tool For Multilingual Ontology-based Conceptual Dictionaries
B. Hughes, T. Baldwin, S. Bird, J. Nicholson, A. Mackinlay	Reconsidering Language Identification for Written Language Resources
X. Ma	Champollion: A Robust Parallel Text Sentence Aligner
Session O18-M Multimodal Corpora Annotation and Tools :
J. Niekrasz, J. Gruenstein	NOMOS: A Semantic Web Software Framework for Annotation of Multimodal Corpora
K. Pastra	Beyond Multimedia Integration: corpora and annotations for cross-media decision mechanisms
N. Bernsen, L. Dybkjær, S. Kiilerich	H. C. Andersen Conversation Corpus
D. Sonntag, M. Romanelli	A Multimodal Result Ontology for Integrated Semantic Web Dialogue Applications
Session O19-GW Annotation and Annotation Tools :
A. Burchardt, K. Erk, A. Frank, A. Kowalski, S. Pado	SALTO - A Versatile Multi-Level Annotation Tool
M. Balasubramanya, M. Higgins, P. Lucas, J. Senn, D. Widdows	Collaborative Annotation that Lasts Forever: Using Peer-to-Peer Technology for Disseminating Corpora and Language Resources
K. Erk, S. Pado	SHALMANESER - A Toolchain For Shallow Semantic Parsing
M. Yaseen, M. Attia, B. Maegaard, K. Choukri, N. Paulsson, S. Haamid, S. Krauwer, C. Bendahman, H. Fersøe, M. Rashwan, B. Haddad, C. Mukbel, A. Mouradi, A. Al-Kufaishi, M. Shahin, N. Chenfour, A. Ragheb	Building Annotated Written and Spoken Arabic LR’s in NEMLAR Project
Session O20-W Multilingual Corpora :
S. Sharoff	A Uniform Interface to Large-Scale Linguistic Resources
D. Ciobanu, T. Hartley, S. Sharoff	Using Richly Annotated Trilingual Language Resources for Acquiring Reading Skills in a Foreign Language
A. Feldman, J. Hana, C. Brew	A Cross-language Approach to Rapid Creation of New Morpho-syntactically Annotated Resources
J. Kuhn, M. Jellinghaus	Multilingual parallel treebanking: a lean and flexible approach
K. Miller, M. Vanni, O. Rambow, B. Dorr, D. Farwell, R. Green, N. Habash, S. Helmreich, E. Hovy, L. Levin, K. Miller, T. Mitamura, F. Reeder, A. Siddharthan	Parallel Syntactic Annotation of Multiple Languages
J. Granfeldt, P. Nugues, M. Ågren, J. Thulin, E. Persson, S. Schlyter	CEFLE and Direkt Profil: a New Computer Learner Corpus in French L2 and a System for Grammatical Profiling
Session O21-TEW Structuring Terminology :
C. Jouis	"Hierarchical Relationships ""is-a"": Distinguishing Belonging, Inclusion and Part/of Relationships "
O. Ferret	Building a network of topical relations from a corpus
A. Meur, M. Derouin	Lemma-oriented dictionaries, concept-oriented terminology and translation memories
Y. Chou, C. Huang	Hantology - A Linguistic Resource for Chinese Language Processing and Studying
J. Couturier, S. Neuvel, P. Drouin	Applying Lexical Constraints on Morpho-Syntactic Patterns for the Identification of Conceptual-Relational Content in Specialized Texts
B. Alex, M. Nissim, C. Grover	The Impact of Annotation on the Performance of Protein Tagging in Biomedical Text
Session O22-GW Infrastructures, Methodologies and Standards :
B. Hughes	Searching for Language Resources on the Web: User Behaviour in the Open Language Archives Community
D. Broeder, A. Claus, F. Offenga, R. Skiba, P. Trilsbeek, P. Wittenburg	LAMUS - the Language Archive Management and Upload System
M. Gavrilidou, P. Labropoulou, S. Piperidis, V. Giouli, N. Calzolari, M. Monachini, C. Soria, K. Choukri	Language Resources Production Models: the Case of the INTERA Multilingual Corpus and Terminology
F. Sasaki	Work within the W3C Internationalization Activity and its Benefit for the Creation and Manipulation of Language Resources
N. Ide, K. Suderman	Integrating Linguistic Resources: The American National Corpus Model
P. Wittenburg, D. Broeder, W. Klein, S. Levinson, L. Romary	Foundations of Modern Language Resource Archives
Session O23-SG Speech Corpora & Annotation :
A. Bies, S. Strassel, H. Lee, K. Maeda, S. Kulick, Y. Liu, M. Harper, M. Lease	Linguistic Resources for Speech Parsing
K. Uchimoto, R. Hamabe, T. Maruyama, K. Takanashi, T. Kawahara, H. Isahara	Dependency-structure Annotation to Corpus of Spontaneous Japanese
O. Baude, M. Jacobson, A. Tchobanov, R. Walter	Interoperability of audio corpora: the case of the French corpora
M. Voghera, F. Cutugno	An observatory on Spoken Italian linguistic resources and descriptive standards
A. Sandrelli, C. Bendazzoli	Tagging a Corpus of Interpreted Speeches: the European Parliament Interpreting Corpus (EPIC)
K. Maeda, C. Cieri, K. Walker	Low-cost Customized Speech Corpus Creation for Speech Technology Applications
Session O24-EW Evaluation of Information Retrieval :
A. Abdelsapor, N. Adly, K. Darwish, O. Emam, W. Magdy, M. Nagi	Building a Heterogeneous Information Retrieval Collection of Printed Arabic Documents
H. Dalianis, B. Jongejan	Hand-crafted versus Machine-learned Inflectional Rules: The Euroling-SiteSeeker Stemmer and CST's Lemmatiser
L. Ku, Y. Liang, H. Chen	Tagging Heterogeneous Evaluation Corpora for Opinionated Tasks
A. Fujii, M. Iwayama, N. Kando	Test Collections for Patent Retrieval and Patent Classification in the Fifth NTCIR Workshop
C. Peters	The Impact of Evaluation on Multilingual Information Retrieval System Development
S. Tongchim, P. Srichaivattana, V. Sornlertlamvanich, H. Isahara	Blind Evaluation for Thai Search Engines
Session O25-WE Machine Translation and Evaluation :
J. Giménez, E. Amigó	IQMT: A Framework for Automatic Machine Translation Evaluation
A. Popescu-belis, P. Estrella, M. King, N. Underwood	A Model for Context-Based Evaluation of Language Processing Systems and its Application to Machine Translation Evaluation
D. Vilar, J. Xu, L. D'Haro, H. Ney	Error Analysis of Statistical Machine Translation Output
K. Uchimoto, N. Hayashida, T. Ishida, H. Isahara	Automatic Detection and Semi-Automatic Revision of Non-Machine-Translatable Parts of a Sentence
A. Patry, F. Gotti, P. Langlais	MOOD: A Modular Object-Oriented Decoder for Statistical Machine Translation
A. Mauser, E. Matusov, H. Ney	Training a Statistical Machine Translation System without GIZA++
Session O26-W Question Answering :
G. Fliedner	Towards Natural Interactive Question Answering
D. Buscaldi, P. Rosso	Mining Knowledge from Wikipedia for the Question Answering task
V. Moriceau	Language Challenges for Data Fusion in Question-Answering
L. Zhou, C. Lin, E. Hovy	Summarizing Answers for Complicated Questions
S. Harabagiu, C. Human	An Answer Bank for Temporal Inference
G. Marton, B. Katz	Using Semantic Overlap Scoring in Answering TREC Relationship Questions
Session O27-G Large Programs, Data Centers and International Cooperation :
S. Itahashi, C. Tseng, S. Nakamura	Oriental COCOSDA: Past, Present and Future
B. Maegaard, J. Fenstad, L. Ahrenberg, K. Kvale, K. Mühlenbock, B. Heid	KUNSTI - Knowledge Generation for Norwegian Language Technology
E. D'Halleweyn, J. Odijk, L. Teunissen, C. Cucchiarini	The Dutch-Flemish HLT Programme STEVIN: Essential Speech and Language Technology Resources
S. Chaudiron, J. Mariani	Techno-langue: The French National Initiative for Human Language Technologies (HLT)
B. Maegaard, S. Krauwer, K. Choukri, L. Jørgensen	The BLARK concept and BLARK for Arabic
C. Cieri, M. Liberman	More Data and Tools for More Languages and Research Areas: A Progress Report on LDC Activities
Session O28-SE ASR, Tools and Evaluation :
M. Rayner, P. Bouillon, B. Hockey, N. Chatzichrisafis	REGULUS: A Generic Multilingual Open Source Platform for Grammar-Based Speech Applications
T. Kato, T. Toda, H. Saruwatari, K. Shikano	Transcription Cost Reduction for Constructing Acoustic Models Using Acoustic Likelihood Selection Criteria
J. Mauclair, Y. Estève, S. Petit-Renaud, P. Deléglise	Automatic Detection of Well Recognized Words in Automatic Speech Transcriptions
D. Gibbon, S. Tseng	Discourse functions of duration in Mandarin: resource design and implementation
J. Fiscus, J. Ajot, N. Radde, C. Laprun	Multiple Dimension Levenshtein Edit Distance Calculations for Evaluating Automatic Speech Recognition Systems During Simultaneous Speech
S. Burger, Z. Sloane, J. Yang	Competitive Evaluation of Commercially Available Speech Recognizers in Multiple Languages
Session O29-E Evaluation Semantics and Senses :
R. Johansson, P. Nugues	Construction of a FrameNet Labeler for Swedish Text
M. Sahlgren	Towards pertinent evaluation methodologies for word-space models
S. Schulte	Human Verb Associations as the Basis for Gold Standard Verb Classes: Validation against GermaNet and FrameNet
R. Passonneau	Measuring Agreement on Set-valued Items (MASI) for Semantic and Pragmatic Annotation
A. Rumshisky, J. Pustejovsky	Inducing Sense-Discriminating Context Patterns from Sense-Tagged Corpora
R. Navigli	Reducing the Granularity of a Computational Lexicon via an Automatic Mapping to a Coarse-Grained Sense Inventory
Session O30-W Parallel Corpora & MT :
A. Eisele	Parallel Corpora and Phrase-Based Statistical Machine Translation for New Language Pairs via Multiple Intermediaries
Y. Nitta, M. Saraki, S. Ikehara	Building Carefully Tagged Bilingual Corpora to Cope with Linguistic Idiosyncrasy
S. Hasan, A. Isbihani, H. Ney	Creating a Large-Scale Arabic to French Statistical Machine Translation System
X. Ma, C. Cieri	Corpus Support for Machine Translation at LDC
Session O31-W Corpus Construction & Annotation :
O. Medelyan, S. Schulz, J. Paetzold, M. Poprat, K. Markó	Language Specific and Topic Focused Web Crawling
D. Tufis, E. Irimia	RoCo-News: A Hand Validated Journalistic Corpus of Romanian
C. Grover, R. Tobin	Rule-Based Chunking and Reusability
E. Hajicova, P. Sgall	Corpus Annotation as a Test of a Linguistic Theory
Session O32-EW Coreference, Summarisation and Evaluation :
R. Catizone, A. Dalli, Y. Wilks	Evaluating Automatically Generated Timelines from the Web
O. Postolache, D. Cristea, C. Orasan	Transferring Coreference Chains through Word Alignment
O. Uryupina	Coreference Resolution with and without Linguistic Knowledge
E. Hovy, C. Lin, L. Zhou, J. Fukumoto	Automated Summarization Evaluation with Basic Elements
Session O33-EMT Evaluation of Multimodality & Dialogue :
J. Soler, P. Cerezo, C. Ávila, D. Merino	Usability evaluation of 3G multimodal services in Telefónica Móviles España
H. Dybkjær, L. Dybkjær	Act-Topic Patterns for Automatically Checking Dialogue Models
D. Mostefa, M. Garcia, K. Choukri	Evaluation of multimodal components within CHIL
H. Bunt	Dimensions in Dialogue Act Annotation
Session O34-WE Morphology & Tagging :
A. Novák	Morphological Tools for Six Small Uralic Languages
J. Vaneyghen, G. Pauw, D. Compernolle, W. Daelemans	A mixed word / morphological approach for extending CELEX for high coverage on contemporary large corpora
M. Mieskes, M. Strube	Part-of-Speech Tagging of Transcribed Speech
B. Hughes, D. Gibbon, T. Trippel	Feature-Based Encoding and Querying Language Resources with Character Semantics
Session O35-T Terminology & Knowledge Acquisition :
W. Mustafa El Hadi, I. Timimi, M. Dabbadie, K. Choukri, O. Hamon, Y. Chiao	Terminological Resources Acquisition Tools: Towards a User-Oriented Evaluation Model
T. Charoenporn, C. Kruengkrai, T. Theeramunkong, V. Sornlertlamvanich, H. Isahara	Word Knowledge Acquisition for Computational Lexicon Construction
M. Lafourcade	Conceptual Vector Learning - Comparing Bootstrapping from a Thesaurus or Induction by Emergence
N. Okazaki, S. Ananiadou	Clustering acronyms in biomedical text for disambiguation
Session O36-W Semantically Annotated Corpora :
B. Magnini, E. Pianta, C. Girardi, M. Negri, L. Romano, M. Speranza, V. Lenzi, R. Sprugnoli	I-CAB: the Italian Content Annotation Bank
A. Burchardt, K. Erk, A. Frank, A. Kowalski, S. Pado	The SALSA Corpus: a German corpus resource for lexical semantics
H. Papageorgiou, E. Desipri, M. Koutsombogera, K. Pouli, P. Prokopidis	Adding multi-layer semantics to the Greek Dependency Treebank
E. Agirre, I. Aldezabal, J. Etxeberria, E. Pociello	A Preliminary Study for Building the Basque PropBank
Session O37-S Lexicon and Pronunciation :
J. Gros, V. Cvetko-Orešnik, P. Jakopin, A. Mihelic	SI-PRON: a Pronunciation Lexicon for Slovenian
S. Schaden, U. Jekosch	“Casselberveetovallarga” and Other Unpronounceable Places: The CrossTowns Corpus
D. Graff, T. Buckwalter, M. Maamouri, H. Jin	Lexicon Development for Varieties of Spoken Colloquial Arabic
T. Pellegrini, L. Lamel	Experimental detection of vowel pronunciation variants in Amharic
Session O38-W Lexicon Syntax and Semantics :
M. Slavcheva	Semantic Descriptors: The Case of Reflexive Verbs
A. Korhonen, Y. Krymolowski, T. Briscoe	A Large Subcategorization Lexicon for Natural Language Processing Applications
P. Saint-Dizier	PrepNet: a Multilingual Lexical Description of Prepositions
K. Kipper, A. Korhonen, N. Ryant, M. Palmer	Extending VerbNet with Novel Verb Classes
Session O39-W Lexicons, Semantics, Clustering Tools :
M. Islam, D. Inkpen	Second Order Co-occurrence PMI for Determining the Semantic Similarity of Words
D. Widdows, A. Toumouh, B. Dorow, A. Lehireche	Ongoing Developments in Automatically Adapting Lexical Resources to the Biomedical Domain
M. Maragoudakis, K. Kermanidis, A. Garbis, N. Fakotakis	Dealing with Imbalanced Data using Bayesian Techniques
B. Medlock	An Introduction to NLP-based Textual Anonymisation
R. Marinelli, R. Bindi	Proper Names and Linguistic Dynamics
V. Rus, A. Graesser	The Look and Feel of a Confident Entailer
Session O40-T Creating Terminologies :
M. L’Homme, H. Bae	A Methodology for Developing Multilingual Resources for Terminology
G. Nenadic, N. Okazaki, S. Ananiadou	Towards a terminological resource for biomedical text mining
B. Dobrov, N. Loukachevitch	Development of Linguistic Ontology on Natural Sciences and Technology
R. Melz, P. Ryu, K. Choi	Compiling large language resources using lexical similarity metrics for domain taxonomy learning
W. Peters, M. Sagri, D. Tiscornia, S. Castagnoli	The LOIS Project
S. Anstein, G. Kremer, U. Reyle	Identifying and Classifying Terms in the Life Sciences: The Case of Chemical Terminology
Session O41-M Emotions :
C. Clavel, I. Vasilescu, L. Devillers, T. Ehrette, G. Richard	Fear-type emotions of the SAFE Corpus: annotation issues
L. Devillers, R. Cowie, J. Martin, E. Douglas-Cowie, S. Abrilian, M. McRorie	Real life emotions in French and English TV video clips: an integrated annotation protocol combining continuous and discrete approaches
K. Laskowski, S. Burger	Annotation and Analysis of Emotionally Relevant Behavior in the ISL Meeting Corpus
D. Reidsma, D. Heylen, R. Ordelman	Annotating Emotion in Meetings
T. Vogt, E. André	Improving Automatic Emotion Recognition from Speech via Gender Differentiation
J. Martin, G. Caridakis, L. Devillers, K. Karpouzis, S. Abrilian	Manual Annotation and Automatic Image Processing of Multimodal Emotional Behaviours: Validating the Annotation of TV Interviews
Session O42-EW Question Answering Evaluation :
L. Gillard, P. Bellot, M. El-Bèze	Question Answering Evaluation Survey
J. Ko, L. Hiyakumoto, E. Nyberg	Exploiting Multiple Semantic Resources for Answer Selection
H. Shima, M. Wang, F. Lin, T. Mitamura	Modular Approach to Error Analysis and Evaluation for Multilingual Question Answering
F. Lacatusu, A. Hickl, S. Harabagiu	Impact of Question Decomposition on the Quality of Answer Summaries
B. Magnini, D. Giampiccolo, L. Aunimo, C. Ayache, P. Osenova, A. Peñas, M. Rijke, B. Sacaleanu, D. Santos, R. Sutcliffe	The Multilingual Question Answering Track at CLEF
C. Ayache, B. Grau, A. Vilnat	EQueR: the French Evaluation campaign of Question-Answering Systems
Session P1-W : Anaphora & Coreference
A. Nøklestad, Ø. Reigem, C. Johansson	Developing a re-usable web-demonstrator for automatic anaphora resolution with support for manual editing of coreference chains
L. Hasler, C. Orasan, K. Naumann	NPs for Events: Experiments in Coreference Annotation
T. Caselli, I. Prodanof	Annotating Bridging Anaphors in Italian: in Search of Reliability
D. Goecke, A. Witt	Exploiting logical document structure for anaphora resolution
H. Suzuki, G. Kacmarcik	RefRef: A Tool for Viewing and Exploring Coreference Space
W. Gegg-Harrison, D. Byron	PYCOT: An Optimality Theory-based Pronoun Resolution Toolkit
M. Poesio, M. Kabadjov, P. Goux, U. Kruschwitz, E. Bishop, L. Corti	An Anaphora Resolution-Based Anonymization Module
G. Puscasu, R. Mitkov	If it were then, then when was it? Establishing the anaphoric role of then
Session P2-W : Corpora & LR Infrastructures
D. Kokkinakis	Collection, Encoding and Linguistic Processing of a Swedish Medical Corpus - The MEDLEX Experience
N. Oostdijk, L. Boves	User requirements analysis for the design of a reference corpus of written Dutch
C. Onelli, D. Poietti, C. Seidenari, F. Tamburini	The DiaCORIS project: a diachronic corpus of written Italian
D. Santos, S. Inácio	Annotating COMPARA, a Grammar-aware Parallel Corpus
D. Guthrie, B. Allison, W. Liu, L. Guthrie, Y. Wilks	A Closer Look at Skip-gram Modelling
A. Chalamandaris, A. Protopapas, P. Tsiakoulis, S. Rapti	All Greek to me! An automatic Greeklish to Greek transliteration system
Session P3-W : Machine Translation
A. Patel, D. Radev	Lexical similarity can distinguish between automatic and manual translations
O. Bojar, M. Prokopova	Czech-English Word Alignment
L. Cyrus	Building a resource for studying translation shifts
A. Wu, K. Lowery	A Hebrew Tree Bank Based on Cantillation Marks
S. Oepen, J. Lønning	Discriminant-Based MRS Banking
I. Kruijff-Korbayová, K. Chvatalova, O. Postolache	Annotation Guidelines for Czech-English Word Alignment
H. Kaji, M. Watanabe	Automatic Construction of Japanese WordNet
R. Rapp, C. Vide	Example-Based Machine Translation Using a Dictionary of Word Pairs
B. Schrader	Non-probabilistic alignment of rare German and English nominal expressions
M. Popovic, H. Ney	POS-based Word Reorderings for Statistical Machine Translation
V. Vandeghinste, I. Schuurman, M. Carl, S. Markantonatou, T. Badia	METIS-II: Machine Translation for Low Resource Languages
R. Ion, A. Ceauşu, D. Tufiş	Dependency-Based Phrase Alignment
M. Miháltz, G. Pohl	Exploiting Parallel Corpora for Supervised Word-Sense Disambiguation in English-Hungarian Machine Translation
A. Ceauşu, D. Ştefănescu, D. Tufiş	Acquis Communautaire Sentence Alignment using Support Vector Machines
Session P4-W : Question Answering, Summarisation, Classification
J. Civera, A. Juan	Bilingual Machine-Aided Indexing
S. Verberne, L. Boves, N. Oostdijk, P. Coppen	Data for question answering: The case of why
H. Saggion	Multilingual Multidocument Summarization Tools and Evaluation
H. Saggion, R. Gaizauskas	Language Resources for Background Gathering
Wei. Li, Wen. Li, Q. Lu	Mining Implicit Entities in Queries
F. Bertagna	Representation and Inference for Open-Domain QA: Strength and Limits of two Italian Semantic Lexicons
R. Sauri, M. Verhagen, J. Pustejovsky	SlinkET: A Partial Modal Parser for Events
Session P5-W : Learning and Acquisition
V. Nováček, P. Smrž, J. Pomikálek	Text Mining for Semantic Relations as a Support Base of a Scientific Portal Generator
D. Kawahara, S. Kurohashi	Case Frame Compilation from the Web using High-Performance Computing
B. Sagot, L. Clément, E. Villemonte de la Clergerie, P. Boullier	The Lefff 2 syntactic lexicon for French: architecture, acquisition, use
K. Kanzaki, Q. Ma, E. Yamamoto, H. Isahara	Semantic Analysis of Abstract Nouns to Compile a Thesaurus of Adjectives
B. Wellner, M. Vilain	Leveraging Machine Readable Dictionaries in Discriminative Sequence Models
N. Bel, S. Espeja, M. Marimon	New tools for the encoding of lexical data extracted from corpus
Y. Ohishi, K. Itou, K. Takeda, A. Fujii	Statistical Analysis for Thesaurus Construction using an Encyclopedic Corpus
Session P6-WT : Corpora: Creation, Annotation
M. Pazienza, M. Pennacchiotti, F. Zanzotto	Mixing WordNet, VerbNet and PropBank for studying verb relations
J. Apresjan, I. Boguslavsky, B. Iomdin, L. Iomdin, A. Sannikov, V. Sizov	A Syntactically and Semantically Tagged Corpus of Russian: State of the Art and Prospects
N. Xue	Annotating the Predicate-Argument Structure of Chinese Nominalizations
S. Džeroski, T. Erjavec, N. Ledinek, P. Pajas, Z. Žabokrtsky, A. Žele	Towards a Slovene Dependency Treebank
J. Nivre, J. Nilsson, J. Hall	Talbanken05: A Swedish Treebank with Phrase Structure and Dependency Annotation
R. Bernardi, A. Bolognesi, C. Seidenari, F. Tamburini	POS tagset design for Italian
T. Ohta, Y. Tateisi, J. Kim, A. Yakushiji, J. Tsujii	Linguistic and Biological Annotations of Biological Interaction Events
N. Areta, A. Gurrutxaga, I. Leturia, Z. Polin, R. Saiz, I. Alegria, X. Artola, A. Diaz de Ilarraza, N. Ezeiza, A. Sologaistoa, A. Soroa, A. Valverde	Structure, Annotation and Tools in the Basque ZT Corpus
N. Masaki, I. Hiroshi, H. Taiichi, T. Takenobu	A new approach to syntactic annotation
Y. Matsumoto, M. Asahara, K. Hashimoto, Y. Tono, A. Ohtani, T. Morita	An Annotated Corpus Management Tool: ChaKi
Y. Xia, K. Wong, W. Li	Constructing A Chinese Chat Language Corpus with A Two-Stage Incremental Annotation Approach
J. Jung, M. Miyake, H. Akama	Recurrent Markov Cluster (RMCL) Algorithm for the Refinement of the Semantic Network
V. Hoste, G. Pauw	KNACK-2002: a Richly Annotated Corpus of Dutch Written Text
F. Barreto, A. Branco, E. Ferreira, A. Mendes, M. Nascimento, F. Nunes, J. Silva	Open Resources and Tools for the Shallow Processing of Portuguese: The TagShare Project
H. Bunt, A. Schiffrin	Methodological Aspects of Semantic Annotation
Session P7-T : Terminology & Ontologies
M. Lin, H. Chen	Constructing a Named Entity Ontology from Web Corpora
K. Ahmad, M. Musacchio, G. Palumbo	Ontological and Terminological Commitments and the Discourse of Specialist Communities
L. Lesmo, L. Robaldo	From Natural Language to Databases via Ontologies
A. Simões, J. Almeida	T2O - Recycling Thesauri into a Multilingual Ontology
L. Bordoni, T. Mazzoli	Towards an Ontology for Art and Colours
M. Lundälv, K. Mühlenbock, B. Farre, A. Brännström	SYMBERED - a Symbol-Concept Editing Tool
Y. Kiyota, H. Nakagawa	A Domain Ontology Production Tool Kit Based on Automatically Constructed Case Frames
H. Brugman, V. Malaisé, L. Gazendam	A Web Based General Thesaurus Browser to Support Indexing of Television and Radio Programs
T. Declerck, A. Pérez, O. Vela, Z. Gantner, D. Manzano-Macho	Multilingual Lexical Semantic Resources for Ontology Translation
A. Estellés, A. Alcina, V. Soler	Retrieving Terminological Data from the TxtCeram Tagged Domain Corpus: A First Step towards a Terminological Ontology
L. Sarmento, B. Maia, D. Santos, A. Pinto, L. Cabral	Corpógrafo V3 - From Terminological Aid to Semi-automatic Knowledge Engineering
Y. Senda, Y. Sinohara, M. Okumura	Automatic Terminology Intelligibility Estimation for Readership-oriented Technical Writing
Session P8-E : Evaluation of Question Answering & Summarisation
A. Moschitti, R. Basili	A Tree Kernel approach to Question and Answer Classification in Question Answering Systems
I. Cramer, J. Leidner, D. Klakow	Building an Evaluation Corpus for German Question Answering by Harvesting Wikipedia
L. Costa, L. Sarmento	Component Evaluation in a Question Answering System
B. Grau, A. Ligozat, I. Robba, A. Vilnat, L. Monceau	FRASQUES : A Question Answering system in the EQueR evaluation campaign
T. Akiba	Exploiting Dynamic Passage Retrieval for Spoken Question Recognition and Context Processing towards Speech-driven Information Access Dialogue
M. Bilotti, E. Nyberg	Evaluation for Scenario Question Answering Systems
M. Hassel, J. Sjöbergh	Towards Holistic Summarization - Selecting Summaries, Not Sentences
C. Orasan, L. Hasler	Computer-aided summarisation - what the user really wants
Session P9-M : Multimodal Tools
P. Berck, A. Russel	ANNEX - a web-based Framework for Exploiting Annotated Media Resources
P. Wittenburg, H. Brugman, A. Russel, A. Klassmann, H. Sloetjes	ELAN: a Professional Framework for Multimodality Research
A. Popescu-Belis, M. Georgescul	TQB: Accessing Multimodal Data Using a Transcript-based Query and Browsing Interface
F. Behrens, J. Milde	The Eclipse Annotator: an extensible system for multimodal corpus creation
K. Maeda, H. Lee, J. Medero, S. Strassel	A New Phase in Annotation Tool Development at the Linguistic Data Consortium: The Evolution of the Annotation Graph Toolkit
K. Ahmad, C. Bennett, T. Oliver	Visual Surveillance and Video Annotation and Description
Session P10-S : Speech Corpora & Annotation
N. Grønnum	DanPASS - A Danish Phonetically Annotated Spontaneous Speech Corpus
Y. Irie, S. Matsubara, N. Kawaguchi, Y. Yamaguchi, Y. Inagaki	Layered Speech-Act Annotation for Spoken Dialogue Corpus
T. Ohno, S. Matsubara, H. Kashioka, N. Kato, Y. Inagaki	A Syntactically Annotated Corpus of Japanese Spoken Monologue
I. Chiari	Slips and errors in spoken data transcription
D. Gibbon, F. Fernandes, T. Trippel	A BLARK extension for temporal annotation mining
P. Paggio	Annotating Information Structure in a Corpus of Spoken Danish
T. Tan, L. Besacier	A French Non-Native Corpus for Automatic Speech Recognition
A. Bogacka, K. Dziubalska-Kołaczyk, G. Krynicki, D. Pietrala, M. Wypych	General and Task-Specific Corpus Resources for Polish Adult Learners of English
A. Zgank, D. Verdonik, A. Markus, Z. Kacic	SINOD - Slovenian non-native speech database
V. Alabau, C. Martinez	Bilingual speech corpus in two phonetically similar languages
M. Kaiser, H. Mögele, F. Schiel	Bikers Accessing the Web: The SmartWeb Motorbike Corpus
A. Moreno, A. Febrer, L. Márquez	Generation of Language Resources for the Development of Speech Technologies in Catalan
J. Benedí, E. Lleida, A. Varona, M. Castro, I. Galiano, R. Justo, I. López de Letona, A. Miguel	Design and acquisition of a telephone spontaneous speech dialogue corpus in Spanish: DIHANA
S. Rosset, S. Petel	The Ritel Corpus - An annotated Human-Machine open-domain question answering spoken dialog corpus
H. Boril, T. Boril, P. Pollák	Methodology of Lombard Speech Database Acquisition: Experiences with CLSD
E. Westerhout, P. Monachesi	A pilot study for a Corpus of Dutch Aphasic Speech (CoDAS)
R. Savy, F. Cutugno, C. Crocco	Multilevel corpus analysis: generating and querying an AGset of spoken Italian (SpIt-MDb)
Session P11-WME : Computational Lexicons
F. Vriend, L. Boves, H. Heuvel, R. Hout, J. Kruijsen, J. Swanenberg	A Unified Structure for Dutch Dialect Dictionary Data
M. Mangeot, A. Chalvin	Dictionary Building with the Jibiki Platform: the GDEF case
V. Trón, P. Halácsy, P. Rebrus, A. Rung, P. Vajda, E. Simon	Morphdb.hu: Hungarian lexical database and morphological grammar
B. Cartoni	Dealing with unknown words by simple decomposition: feasibility studies with Italian prefixes
T. Erjavec, T. Fišer	Building Slovene WordNet
S. Schulz, K. Markó, P. Daumke, U. Hahn, S. Hanser, P. Nohama, R. Andrade, E. Pacheco, M. Romacker	Semantic Atomicity and Multilinguality in the Medical Domain: Design Considerations for the MORPHOSAURUS Subword Lexicon
S. Nimb	LEXADV - a multilingual semantic Lexicon for Adverbs
C. Krstev, R. Stanković, D. Vitas, I. Obradović	WS4LR: A Workstation for Lexical Resources
A. Geyken, N. Schrader	LexikoNet - a lexical database based on type and role hierarchies
J. Pustejovsky, C. Havasi, J. Littman, A. Rumshisky, M. Verhagen	Towards a Generative Lexical Resource: The Brandeis Semantic Ontology
T. Kanamaru, M. Murata, H. Isahara	Creation of a Japanese Adverb Dictionary that Includes Information on the Speaker’s Communicative Intention Using Machine Learning
A. Roventini	Linking Verbal Entries of Different Lexical Resources
N. Ruimy	Merging two Ontology-based Lexical Resources
S. Bosch, L. Pretorius, J. Jones	Towards machine-readable lexicons for South African Bantu languages
M. Lopatkova, Z. Zabokrtsky, K. Skwarska	Valency Lexicon of Czech Verbs: Alternation-Based Model
G. Vetulani, Z. Vetulani, T. Obrębski	Syntactic Lexicon of Polish Predicative Nouns
R. Manurung, D. O’Mara, H. Pain, G. Ritchie, A. Waller	Building a Lexical Database for an Interactive Joke-Generator
Session P12-W : Named Entity
C. Sporleder, M. Erp, T. Porcelijn, A. Bosch, P. Arntzen	Identifying Named Entities in Text Databases from the Natural History Domain
P. Lucas, M. Balasubramanya, D. Widdows, M. Higgins	The Information Commons Gazetteer: A Public Resource of Populated Places and Worldwide Administrative Divisions
P. Mazur, R. Dale	Named Entity Extraction with Conjunction Disambiguation
U. Schäfer	OntoNERdIE - Mapping and Linking Ontologies to Named Entity Recognition and Information Extraction Resources
Session P13-W : Corpora
V. Tablan, W. Peters, D. Maynard, H. Cunningham	Creating Tools for Morphological Analysis of Sumerian
C. Benzmüller, H. Horacek, H. Lesourd, I. Kruijff-Korbayova, M. Schiller, M. Wolska	A corpus of tutorial dialogs on theorem proving; the influence of the presentation of the study-material
C. Bosco, V. Lombardo	Comparing linguistic information in treebank annotations
I. Langkilde-Geary, J. Betteridge	A Factored Functional Dependency Transformation of the English Penn Treebank for Probabilistic Surface Generation
A. Nazarenko, E. Alphonse, J. Derivière, T. Hamon, G. Vauvert, D. Weissenbacher	The ALVIS Format for Linguistically Annotated Documents
L. Sarmento	BACO - A large database of text and co-occurrences
M. Nascimento, J. Gonçalves, L. Pereira, A. Estrela, A. Pereira, R. Santos, S. Oliveira	The African Varieties of Portuguese: Compiling Comparable Corpora and Analysing Data-derived Lexicon
L. Dinu, A. Dinu	On the data base of Romanian syllables and some of its quantitative and cryptographic aspects
U. Quasthoff, M. Richter, C. Biemann	Corpus Portal for Search in Monolingual Corpora
I. Berlocher, K. Huh, E. Laporte, J. Nam	Morphological annotation of Korean with Directly Maintainable Resources
A. Bosch, I. Schuurman, V. Vandeghinste	Transferring PoS-tagging and lemmatization tools from spoken to written Dutch corpus development
G. Noord, I. Schuurman, V. Vandeghinste	Syntactic Annotation of Large Corpora in STEVIN
M. Wong	Skeleton Parsing in Chinese: Annotation Scheme and Guidelines
K. Tenfjord, P. Meurer, K. Hofland	The ASK Corpus - a Language Learner Corpus of Norwegian as a Second Language
A. Cole	Corpus Development and Publication
M. Sousa, T. Trippel	Building a historical corpus for Classical Portuguese: some technological aspects
T. Obrebski, M. Stolarski	UAM Text Tools - a flexible NLP architecture
Session P14-GW : General Issues and Large Programs
R. Ivanovska-Naskova	Development of the First LRs for Macedonian: Current Projects
T. Wandmacher, J. Antoine	Training Language Models without Appropriate Language Resources: Experiments with an AAC System for Disabled People
N. Cucurullo, S. Montemagni, M. Paoli, E. Picchi, E. Sassolini	Dialectal resources on-line: the ALT-Web experience
M. Monachini, N. Calzolari, K. Choukri, J. Friedrich, G. Maltese, M. Mammini, J. Odijk, M. Ulivieri	Unified Lexicon and Unified Morphosyntactic Specifications for Written and Spoken Italian
F. Calzolari, E. Sassolini, M. Sassi, S. Cucurullo, E. Picchi, F. Bertagna, A. Enea, M. Monachini, C. Soria, N. Calzolari	Next Generation Language Resources using Grid
M. Kemps-Snijders, M. Nederhof, P. Wittenburg	LEXUS, a web-based tool for manipulating lexical resources
F. Offenga, D. Broeder, P. Wittenburg, J. Ducret, L. Romary	Metadata Profile in the ISO Data Category Registry
H. Uszkoreit, F. Xu, J. Steffen, I. Aslan	The pragmatic combination of different cross-lingual resources for multilingual information services
I. Maks, B. Boelhouwer	Exploring opportunities for comparability and enrichment by linking lexical databases
Session P15-W : Multiwords & Collocations
R. Xu, Q. Lu, S. Li	The Design and Construction of A Chinese Collocation Bank
L. Wanner, M. Ramos	Local Document Relevance Clustering in IR Using Collocation Information
S. Cinkova, P. Pecina, P. Podvesky, P. Schlesinger	Semi-automatic Building of Swedish Collocation Lexicon
N. Grégoire	Elaborating the parameterized Equivalence Class Method for Dutch
A. Mendes, S. Antunes, M. Nascimento, J. Casteleiro, L. Pereira, T. Sá	COMBINA-PT: A Large Corpus-extracted and Hand-checked Lexical Database of Portuguese Multiword Expressions
W. Anderson, P. Kotzé	Finite state tokenisation of an orthographical disjunctive agglutinative language: The verbal segment of Northern Sotho
Session P16-T : Terminology Multiwords & Collocations
M. Brekke, K. Innselset, M. Kristiansen, K. Øvsthus	KB-N: Automatic Term Extraction from Knowledge Bank of Economics
P. Alessandro, F. Marco, M. Massimo	Integrating Methods and LRs for Automatic Keyword Extraction from Open Domain Texts
H. Hjelm	Extraction of Cross Language Term Correspondences
J. Ritz, U. Heid	Extraction tools for collocations and their morphosyntactic specificities
R. Kongkachandra, K. Chamnongthai	Semantic-Based Keyword Recovery Function for Keyword Extraction System
Session P17-E : Evaluation: annotation, acquisition, alignment, recognition
P. Wagacha, G. Pauw, P. Githinji	A Grapheme-Based Approach for Accent Restoration in Gikuyu
J. Semecký	On Automatic Assignment of Verb Valency Frames in Czech
K. Kageura, G. Kikui	A Self-Referring Quantitative Evaluation of the ATR Basic Travel Expression Corpus (BTEC)
R. Passonneau, N. Habash, O. Rambow	Inter-annotator Agreement on a Multilingual Semantic Annotation Task
G. Szarvas, R. Farkas, L. Felföldi, A. Kocsor, J. Csirik	A highly accurate Named Entity corpus for Hungarian
E. Bick	Turning a Dependency Treebank into a PSG-style Constituent Treebank
M. Ueyama	Evaluation of Web-based Corpora: Effects of Seed Selection and Time Interval
D. Kokkinakis, D. Dannélls	Recognizing Acronyms and their Definitions in Swedish Medical Texts
Y. Chiao, O. Kraif, D. Laurent, T. Nguyen, N. Semmar, F. Stuck, J. Véronis, W. Zaghouani	Evaluation of multilingual text alignment systems: the ARCADE II project
M. Salmenkivi	Finding representative sets of dialect words for geographical regions
D. Santos, N. Seco, N. Cardoso, R. Vilela	HAREM: An Advanced NER Evaluation Contest for Portuguese
Session P18-M : Multimodal Corpora and Annotation
J. Maas, B. Wrede	BITT: A Corpus for Topic Tracking Evaluation on Multimodal Human-Robot-Interaction
J. Segouat, A. Braffort, E. Martin	Sign Language corpus analysis: Synchronisation of linguistic annotation and numerical data
J. Bungeroth, D. Stein, P. Dreuw, M. Zahedi, H. Ney	A German Sign Language Corpus of the Domain Weather Report
S. Abrilian, L. Devillers, J. Martin	Annotation of Emotions in Real-Life Video Interviews: Variability between Coders
G. Lechenadec, V. Maffiolo, N. Chateau, J. Colletta	Creation of a Corpus of Multimodal Spontaneous Expressions of Emotions in Human-Machine Interaction
P. Strauß, H. Hoffman, W. Minker, H. Neumann, G. Palm, S. Scherer, F. Schwenker, H. Traue, W. Walter, U. Weidenbacher	Wizard-of-Oz Data Collection for Perception and Interaction in Multi-User Environments
I. Kruijff-Korbayova, T. Becker, N. Blaylock, C. Gerstenberger, M. Kaisser, P. Poller, V. Rieser, J. Schehl	The SAMMIE Corpus of Multimodal Dialogues with an MP3 Player
Session P19-E : Evaluation: Speech
J. Ko, F. Murase, T. Mitamura, E. Nyberg, M. Tateishi, I. Akahori	Analyzing the Effects of Spoken Dialog Systems on Driving Behavior
A. Raake, B. Katz	SUS-based Method for Speech Reception Threshold Measurement in French
P. Mareüil, C. D'Alessandro, A. Raake, G. Bailly, M. Garcia, M. Morel	A joint intelligibility evaluation of French text-to-speech systems: the EvaSy SUS/ACR campaign
M. Przybocki, G. Sanders, A. Le	Edit Distance: A Metric for Machine Translation Evaluation
G. Hodász	Evaluation Methods of a Linguistically Enriched Translation Memory System
J. Laoudi, C. Tate, C. Voss	Task-based MT Evaluation: From Who/When/Where Extraction to Event Understanding
H. Bonneau-Maynard, C. Ayache, F. Bechet, A. Denis, A. Kuhn, F. Lefevre, D. Mostefa, M. Quignard, S. Rosset, C. Servan, J. Villaneau	Results of the French Evalda-Media evaluation campaign for literal understanding
B. Andrassy, H. Hoege	Human and machine recognition as a function of SNR
J. Hoidekr, J. Psutka, A. Pražák, J. Psutka	Benefit of a Class-based Language Model for Real-time Closed-captioning of TV Ice-hockey Commentaries
N. Bernsen, T. Hansen, S. Kiilerich, T. Madsen	Field Evaluation of a Single-Word Pronunciation Training System
R. Banchs, A. Bonafonte, J. Pérez	Acceptance Testing of a Spoken Language Translation System
E. Coussé, S. Gillis	Regional Bias in the Broad Phonetic Transcriptions of the Spoken Dutch Corpus
D. Buehler, W. Minker	Stochastic Spoken Natural Language Parsing in the Framework of the French MEDIA Evaluation Campaign
J. Soler, P. Cerezo, D. Merino, A. Sánchez	MEDUSA: User-Centred Design and usability evaluation of Automatic Speech Recognition telephone services in Telefónica Móviles España
Session P20-S : Speech Corpora for TTS Systems and Emotion and Speech tools
D. Braga, L. Coelho, J. Teixeira, D. Freitas	ProGmatica: a Prosodic and Pragmatic Database for European Portuguese
I. Amdal, T. Svendsen	FonDat1: A Speech Synthesis Corpus for Norwegian
M. Umbert, A. Moreno, P. Agüero, A. Bonafonte	Spanish Synthesis Corpora
H. Mögele, M. Kaiser, F. Schiel	SmartWeb UMTS Speech Data Collection - The SmartWeb Handheld Corpus
C. Draxler, K. Jänsch	Speech Recordings in Public Schools in Germany - the Perfect Show Case for Web-based Recordings and Annotation
Z. Huang, L. Chen, M. Harper	An Open Source Prosodic Feature Extraction Tool
B. Tsou, T. Lai, K. Sin, L. Cheung	Court Stenography-To-Text (“STT”) in Hong Kong: A Jurilinguistic Engineering Effort
I. Saratxaga, E. Navas, I. Hernáez, I. Aholab	Designing and Recording an Emotional Speech Database for Corpus Based Synthesis in Basque
Session P21-W : Corpora, Lexicons & Multilinguality
B. Megyesi, A. Hein, É. Johanson	Building a Swedish-Turkish Parallel Corpus
T. Erjavec	The English-Slovene ACQUIS corpus
R. Steinberger, B. Pouliquen, A. Widiger, C. Ignat, T. Erjavec, D. Tufiş, D. Varga	The JRC-Acquis: A Multilingual Aligned Parallel Corpus with 20+ Languages
B. Maegaard, L. Offersgaard, L. Henriksen, H. Jansen, X. Lepetit, C. Navarretta, C. Povlsen	The MULINCO corpus and corpus platform
J. Tiedemann	ISA & ICA - Two Web Interfaces for Interactive Alignment of Bitexts
E. Guevara, S. Scalise, A. Bisetto, C. Melloni	MORBO/COMP: a multilingual database of compound words
A. Sansò	Documenting variation across Europe and the Mediterranean: The Pavia Typological Database
S. Cinková	From PropBank to EngValLex: Adapting PropBank-Lexicon to the Valency Theory of the Functional Generative Description
D. Samy, A. Sandoval, J. Guirao, E. Alfonseca	Building a Parallel Multilingual Corpus (Arabic-Spanish-English)
W. Ma, C. Huang	Uniform and Effective Tagging of a Heterogeneous Giga-word Corpus
M. Geilfuss, J. Milde	SAM - an annotation editor for parallel texts
Session P22-W : Tools for Corpora and Lexicons
J. Kinoshita, L. Salvador, C. Menezes	CoGrOO: a Brazilian-Portuguese Grammar Checker based on the CETENFOLHA Corpus
P. Nemec	Tree Searching/Rewriting Formalism
P. Le, T. Nguyen, L. Romary, A. Roussanaly	A Lexicalized Tree-Adjoining Grammar for Vietnamese
C. Rohrer, M. Forst	Improving coverage and parsing quality of a large-scale LFG for German
T. Baldwin, S. Awab	Open Source Corpus Analysis Tools for Malay
J. Nivre, J. Hall, J. Nilsson	MaltParser: A Data-Driven Parser-Generator for Dependency Parsing

Authors

Titles

Session O1-W Lexicons, Semantics & Infrastructural Issues :

Y. Hayashi, T. Ishida

A Dictionary Model for Unifying Machine Readable Dictionaries and Computational Concept Lexicons

C. Soria, M. Tesconi, F. Bertagna, N. Calzolari, A. Marchetti, M. Monachini

Moving to dynamic computational lexicons with LeXFlow

V. Cavalli-sforza, A. Soudi

IMORPH?: An Inheritance and Equivalence Based Morphology Description Compiler

A. Itai, S. Wintner, S. Yona

A Computational Lexicon of Contemporary Hebrew

E. Agirre, I. Aldezabal, J. Etxeberria, E. Izagirre, K. Mendizabal, E. Pociello, M. Quintian

A methodology for the joint development of the Basque WordNet and Semcor

S. Elkateb, W. Black, H. Rodríguez, M. Alkhalifa, E. Aribau, P. Vossen, A. Pease, C. Fellbaum

Building a WordNet for Arabic

V. Tablan, T. Polajnar, H. Cunningham, K. Bontcheva

User-friendly ontology authoring using a controlled language

Session O2-EW Named Entity Recognition & Time Annotation :

L. Nezda, A. Hickl, J. Lehmann, S. Fayyaz

What in the World is a Shahab? Wide Coverage Named Entity Recognition for Arabic

I. Michailidis, K. Diamantaras, S. Vasileiadis, Y. Frère

Greek Named Entity Recognition using Support Vector Machines, Maximum Entropy and Onetime

B. Pouliquen, M. Kimler, R. Steinberger, C. Ignat, T. Oellinger, K. Blackler, F. Fluart, W. Zaghouani, A. Widiger, A. Forslund, C. Best

Geocoding Multilingual Texts: Recognition, Disambiguation and Visualisation

V. Giouli, A. Konstandinidis, E. Desypri, H. Papageorgiou

Multi-domain Multi-lingual Named Entity Recognition: Revisiting & Grounding the resources issue

C. Forăscu, I. Pistol, D. Cristea

Temporality in relation with discourse structure

B. Boguraev, R. Ando

Analysis of TimeBank as a Resource for TimeML Parsing

F. Pan, R. Mulkar, J. Hobbs

An Annotated Corpus of Typical Durations of Events

Session O3-EW Authoring Tools, Information Extraction and Retrieval :

F. Bustamante, A. Arnaiz, M. Ginés

A Spell Checker for a World Language: The New Microsoft’s Spanish Spell Checker

M. Reynaert

Corpus-Induced Corpus Clean-up

F. Bustamante, E. Díaz

Spelling Error Patterns in Spanish for Word Processing Applications

E. Luca, A. Nürnberger

Rebuilding Lexical Resources for Information Retrieval using Sense Folder Detection and Merging Methods

J. Iria, F. Ciravegna

A Methodology and Tool for Representing Language Resources for Information Extraction

N. Habash, C. Mah, S. Imran, R. Calistri-yeh, P. Sheridan

Design, Construction and Validation of an Arabic-English Conceptual Interlingua for Cross-lingual Information Retrieval

A. Klassmann, F. Offenga, D. Broeder, R. Skiba, P. Wittenburg

Comparison of Resource Discovery Methods

Session O4-S Speech Corpora and Dialogue :

C. Cieri, W. Andrews, J. Campbell, G. Doddington, J. Godfrey, S. Huang, M. Liberman, A. Martin, H. Nakasone, M. Przybocki, K. Walker

The Mixer and Transcript Reading Corpora: Resources for Multilingual, Crosschannel Speaker Recognition Research

E. Grishina

Spoken Russian in the Russian National Corpus (RNC)

E. Barker, R. Higashinaka, F. Mairesse, R. Gaizauskas, M. Walker, J. Foster

Simulating Cub Reporter Dialogues: The collection of naturalistic human-human dialogues for information access to text archives

A. Nimaan, P. Nocera, J. Bonastre

Towards automatic transcription of Somali language

C. Cucchiarini, H. Hamme, O. Herwijnen, F. Smits

JASMIN-CGN: Extension of the Spoken Dutch Corpus with Speech of Elderly People, Children and Non-natives in the Human-Machine Interaction Modality

S. Galliano, E. Geoffrois, G. Gravier, J. Bonastre, D. Mostefa, K. Choukri

Corpus description of the ESTER Evaluation Campaign for the Rich Transcription of French Broadcast News

J. Polifroni, M. Walker

Learning Database Content for Spoken Dialogue System Design

Session O5-E Machine Translation and Evaluation :

D. Mostefa, O. Hamon, K. Choukri

Evaluation of Automatic Speech Recognition and Speech Translation within TC-STAR: Results from the first evaluation campaign

O. Hamon, M. Rajman

X-Score: Automatic Evaluation of Machine Translation Grammaticality

K. Miller, M. Vanni

Formal v. Informal: Register-Differentiated Arabic MT Evaluation in the PLATO Paradigm

E. Macklovitch

TransType2: The Last Word

U. Schäfer, D. Beck

Automatic Testing and Evaluation of Multilingual Language Technology Resources and Components

O. Hamon, A. Popescu-belis, K. Choukri, M. Dabbadie, A. Hartley, W. Mustafa El Hadi, M. Rajman, I. Timimi

CESTA: First Conclusions of the Technolangue MT Evaluation Campaign

S. Strassel, C. Cieri, A. Cole, D. Dipersio, M. Liberman, X. Ma, M. Maamouri, K. Maeda

Integrated Linguistic Resources for Language Exploitation Technologies

Session O6-WMT Ontologies :

P. Cimiano, M. Hartung, E. Ratsch

Finding the Appropriate Generalization Level for Binary Ontological Relations Extracted from the Genia Corpus

J. Iria, C. Brewster, F. Ciravegna, Y. Wilks

An Incremental Tri-Partite Approach To Ontology Learning

G. Grefenstette, F. Debili, C. Fluhr, S. Zinger

Exploiting text for extracting image processing resources

R. Marinelli, A. Roventini, G. Spadoni

Using Core Ontology for Domain Lexicon Structuring

S. Li, Q. Lu, W. Li, R. Xu

Interaction between Lexical Base and Ontology with Formal Concept Analysis

R. Bartolini, C. Caracciolo, E. Giovannetti, A. Lenci, S. Marchi, V. Pirrelli, C. Renso, L. Spinsanti

Creation and Use of Lexicons and Ontologies for NL Interfaces to Databases

Session O7-GWT Standards :

N. Ide, L. Romary

Representing Linguistic Corpora and Their Annotations

T. Declerck

SynAF: Towards a Standard for Syntactic Annotation

G. Francopoulo, M. George, N. Calzolari, M. Monachini, N. Bel, M. Pet, C. Soria

Lexical Markup Framework (LMF)

M. Assem, A. Gangemi, G. Schreiber

Conversion of WordNet to a standard RDF/OWL representation

L. Henriksen, C. Povlsen, A. Vasiljevs

EuroTermBank - a Terminology Resource based on Best Practice

G. Tummarello, C. Morbidoni, F. Kepler, F. Piazza, P. Puliti

A novel Textual Encoding paradigm based on Semantic Web tools and semantics

Session O8-W Lexicon Acquisition :

P. Chesley, S. Salmon-alt

Automatic extraction of subcategorization frames for French

A. Berglund, R. Johansson, P. Nugues

Extraction of Temporal Information from Texts in Swedish

M. Schiehlen, K. Spranger

The Mass-Count Distinction: Acquisition and Disambiguation

P. Smrž

Automatic Acquisition of Semantics-Extraction Patterns from Parallel Texts

Y. Zhang, V. Kordoni

Automated Deep Lexical Acquisition for Robust Open Texts Processing

S. Amsalu

Data-driven Amharic-English Bilingual Lexicon Acquisition

Session O9-SE TTS & Units for TTS :

Q. Yang, J. Martens, N. Konings, H. Heuvel

Development of a phoneme-to-phoneme (p2p) converter to improve the grapheme-to-phoneme (g2p) conversion of names

S. Breuer, S. Bergmann, R. Dragon, S. Möller

Set-up of a Unit-Selection Synthesis with a Prominent Voice

D. Oliver, K. Szklanny

Creation and analysis of a Polish speech database for use in unit selection synthesis

J. Pérez, A. Bonafonte, H. Hain, E. Keller, S. Breuer, J. Tian

ECESS Inter-Module Interface Specification for Speech Synthesis

M. Garcia, C. D’alessandro, G. Bailly, P. Mareüil, M. Morel

A joint prosody evaluation of French text-to-speech synthesis systems

A. Bonafonte, H. Höge, I. Kiss, A. Moreno, U. Ziegenhain, H. Heuvel, H. Hain, X. Wang, M. Garcia

TC-STAR: Specifications of Language Resources and Evaluation for Speech Synthesis

Session O10-E Parsing Evaluation :

P. Paroubek, I. Robba, A. Vilnat, C. Ayache

Data, Annotations and Measures in EASY: the Evaluation Campaign for Parsers of French

G. Neumann, B. Crysmann

Exploring HPSG-based Treebanks for Probabilistic Parsing

R. Subba, B. Eugenio, E. Terenzi

Building lexical resources for PrincPar, a large coverage parser that generates principled semantic representations

B. Roark, M. Harper, E. Charniak, B. Dorr, M. Johnson, J. Kahn, Y. Liu, M. Ostendorf, J. Hale, A. Krasnyanskaya, M. Lease, I. Shafran, M. Snover, R. Stewart, L. Yung

SParseval: Evaluation Metrics for Parsing Speech

A. Denis, M. Quignard, G. Pitel

A Deep-Parsing Approach to Natural Language Understanding in Dialogue System: Results of a Corpus-Based Evaluation

T. Vanrullen, P. Blache, J. Balfourier

Constraint-Based Parsing as an Efficient Solution: Results from the Parsing Evaluation Campaign EASy

Session O11-W Corpus & Lexicon :

C. Fairon, S. Paumier

A translated corpus of 30,000 French SMS

I. Castellón, A. Fernández-montraveta, G. Vázquez, L. Alonso, J. Capilla

The Sensem Corpus: a Corpus Annotated at the Syntactic and Semantic Level

S. Klatt

A Corpus-based Approach to the Interpretation of Unknown Words with an Application to German

K. Inui, T. Hirano, R. Iida, A. Fujita, Y. Matsumoto

Augmenting a Semantic Verb Lexicon with a Large Scale Collection of Example Sentences

Session O12-W Tokenization and Tagging :

M. Forst, R. Kaplan

The importance of precise tokenizing for deep grammars

F. Wang, X. Deng, F. Zou

Towards Unified Chinese Segmentation Algorithm

F. Pîrvan, D. Tufi

Tagset Mapping and Statistical Training Data Cleaning-up

J. Hlavácová

New Approach to Frequency Dictionaries — Czech Example

Session O13-M Multimodal Corpora & Natural Interaction :

N. Campbell, T. Sadanobu, M. Imura, N. Iwahashi, S. Noriko, D. Douxchamps

A Multimedia Database of Meetings and Informal Interactions for Tracking Participant Involvement and Discourse Flow

D. Byron, E. Fosler-lussier

The OSU Quake 2004 corpus of two-party situated problem-solving dialogs

A. Green, H. Hüttenrauch, E. Topp, K. Severinson

Developing a Contextualized Multimodal Corpus for Human-Robot Interaction

S. Luz, M. Bouamrane, M. Masoodian

Gathering a corpus of multimodal computer-mediated meetings with focus on text and audio interaction

Session O14- W Sentiments & Semantics :

A. Andreevskaia, S. Bergler

Semantic Tag Extraction from WordNet Glosses

A. Esuli, F. Sebastiani

SENTIWORDNET: A Publicly Available Lexical Resource for Opinion Mining

C. Strapparava, A. Valitutti, O. Stock

The Affective Weight of Lexicon

M. Taboada, C. Anthony, K. Voll

Methods for Creating Semantic Orientation Dictionaries

Session O15-W Treebanks and Acquisition :

F. Dell’orletta, A. Lenci, S. Montemagni, V. Pirrelli

Searching treebanks for functional constraints: cross-lingual experiments in grammatical relation assignment

V. Novak, J. Hajic

Perspectives of Turning Prague Dependency Treebank into a Knowledge Base

M. Maamouri, A. Bies, T. Buckwalter, M. Diab, N. Habash, O. Rambow, D. Tabessi

Developing and Using a Pilot Dialectal Arabic Treebank

M. Marneffe, B. Maccartney, C. Manning

Generating Typed Dependency Parses from Phrase Structure Parses

Session O16-W MWEs and Collocations :

A. Alonge

The Italian Metaphor Database

R. Osswald, H. Helbig, S. Hartrumpf

The Representation of German Prepositional Verbs in a Semantically Based Computer Lexicon

S. Sharoff, B. Babych, A. Hartley

Using collocations from comparable corpora to find translation equivalents

M. Sailer, B. Trawinski

The Collection of Distributionally Idiosyncratic Items: A Multilingual Resource for Linguistic Research

Session O17-TGW Multilingual Tools and Terminology :

D. Stefãnescu, D. Tufi

Aligning Multilingual Thesauri

G. Ajani, G. Boella, L. Lesmo, M. Martin, A. Mazze

A Development Tool For Multilingual Ontology-based Conceptual Dictionaries

B. Hughes, T. Baldwin, S. Bird, J. Nicholson, A. Mackinlay

Reconsidering Language Identification for Written Language Resources

X. Ma

Champollion: A Robust Parallel Text Sentence Aligner

Session O18-M Multimodal Corpora Annotation and Tools :

J. Niekrasz, J. Gruenstein

NOMOS: A Semantic Web Software Framework for Annotation of Multimodal Corpora

K. Pastra

Beyond Multimedia Integration: corpora and annotations for cross-media decision mechanisms

N. Bernsen, L. Dybkjær, S. Kiilerich

H. C. Andersen Conversation Corpus

D. Sonntag, M. Romanelli

A Multimodal Result Ontology for Integrated Semantic Web Dialogue Applications

Session O19-GW Annotation and Annotation Tools :

A. Burchardt, K. Erk, A. Frank, A. Kowalski, S. Pado

SALTO - A Versatile Multi-Level Annotation Tool

M. Balasubramanya, M. Higgins, P. Lucas, J. Senn, D. Widdows

Collaborative Annotation that Lasts Forever: Using Peer-to-Peer Technology for Disseminating Corpora and Language Resources

K. Erk, S. Pado

SHALMANESER - A Toolchain For Shallow Semantic Parsing

M. Yaseen, M. Attia, B. Maegaard, K. Choukri, N. Paulsson, S. Haamid, S. Krauwer, C. Bendahman, H. Fersøe, M. Rashwan, B. Haddad, C. Mukbel, A. Mouradi, A. Al-kufaishi, M. Shahin, N. Chenfour, A. Ragheb

Building Annotated Written and Spoken Arabic LR’s in NEMLAR Project

Session O20-W Multilingual Corpora :

S. Sharoff

A Uniform Interface to Large-Scale Linguistic Resources

D. Ciobanu, T. Hartley, S. Sharoff

Using Richly Annotated Trilingual Language Resources for Acquiring Reading Skills in a Foreign Language

A. Feldman, J. Hana, C. Brew

A Cross-language Approach to Rapid Creation of New Morpho-syntactically Annotated Resources

J. Kuhn, M. Jellinghaus

Multilingual parallel treebanking: a lean and flexible approach

K. Miller, M. Vanni, O. Rambow, B. Dorr, D. Farwell, R. Green, N. Habash, S. Helmreich, E. Hovy, L. Levin, K. Miller, T. Mitamura, F. Reeder, A. Siddharthan

Parallel Syntactic Annotation of Multiple Languages

J. Granfeldt, P. Nugues, M. Ågren, J. Thulin, E. Persson, S. Schlyter

CEFLE and Direkt Profil: a New Computer Learner Corpus in French L2 and a System for Grammatical Profiling

Session O21-TEW Structuring Terminology :

C. Jouis

Hierarchical Relationships "is-a": Distinguishing Belonging, Inclusion and Part/of Relationships

O. Ferret

Building a network of topical relations from a corpus

A. Meur, M. Derouin

Lemma-oriented dictionaries, concept-oriented terminology and translation memories

Y. Chou, C. Huang

Hantology - A Linguistic Resource for Chinese Language Processing and Studying

J. Couturier, S. Neuvel, P. Drouin

Applying Lexical Constraints on Morpho-Syntactic Patterns for the Identification of Conceptual-Relational Content in Specialized Texts

B. Alex, M. Nissim, C. Grover

The Impact of Annotation on the Performance of Protein Tagging in Biomedical Text

Session O22-GW Infrastructures, Methodologies and Standards :

B. Hughes

Searching for Language Resources on the Web: User Behaviour in the Open Language Archives Community

D. Broeder, A. Claus, F. Offenga, R. Skiba, P. Trilsbeek, P. Wittenburg

LAMUS - the Language Archive Management and Upload System

M. Gavrilidou, P. Labropoulou, S. Piperidis, V. Giouli, N. Calzolari, M. Monachini, C. Soria, K. Choukri

Language Resources Production Models: the Case of the INTERA Multilingual Corpus and Terminology

F. Sasaki

Work within the W3C Internationalization Activity and its Benefit for the Creation and Manipulation of Language Resources

N. Ide, K. Suderman

Integrating Linguistic Resources: The American National Corpus Model

P. Wittenburg, D. Broeder, W. Klein, S. Levinson, L. Romary

Foundations of Modern Language Resource Archives

Session O23-SG Speech Corpora & Annotation :

A. Bies, S. Strassel, H. Lee, K. Maeda, S. Kulick, Y. Liu, M. Harper, M. Lease

Linguistic Resources for Speech Parsing

K. Uchimoto, R. Hamabe, T. Maruyama, K. Takanashi, T. Kawahara, H. Isahara

Dependency-structure Annotation to Corpus of Spontaneous Japanese

O. Baude, M. Jacobson, A. Tchobanov, R. Walter

Interoperability of audio corpora: the case of the French corpora

M. Voghera, F. Cutugno

An observatory on Spoken Italian linguistic resources and descriptive standards

A. Sandrelli, C. Bendazzoli

Tagging a Corpus of Interpreted Speeches: the European Parliament Interpreting Corpus (EPIC)

K. Maeda, C. Cieri, K. Walker

Low-cost Customized Speech Corpus Creation for Speech Technology Applications

Session O24-EW Evaluation of Information Retrieval :

A. Abdelsapor, N. Adly, K. Darwish, O. Emam, W. Magdy, M. Nagi

Building a Heterogeneous Information Retrieval Collection of Printed Arabic Documents

H. Dalianis, B. Jongejan

Hand-crafted versus Machine-learned Inflectional Rules: The Euroling-SiteSeeker Stemmer and CST's Lemmatiser

L. Ku, Y. Liang, H. Chen

Tagging Heterogeneous Evaluation Corpora for Opinionated Tasks

A. Fujii, M. Iwayama, N. Kando

Test Collections for Patent Retrieval and Patent Classification in the Fifth NTCIR Workshop

C. Peters

The Impact of Evaluation on Multilingual Information Retrieval System Development

S. Tongchim, P. Srichaivattana, V. Sornlertlamvanich, H. Isahara

Blind Evaluation for Thai Search Engines

Session O25-WE Machine Translation and Evaluation :

J. Giménez, E. Amigó

IQMT: A Framework for Automatic Machine Translation Evaluation

A. Popescu-Belis, P. Estrella, M. King, N. Underwood

A Model for Context-Based Evaluation of Language Processing Systems and its Application to Machine Translation Evaluation

D. Vilar, J. Xu, L. D'Haro, H. Ney

Error Analysis of Statistical Machine Translation Output

K. Uchimoto, N. Hayashida, T. Ishida, H. Isahara

Automatic Detection and Semi-Automatic Revision of Non-Machine-Translatable Parts of a Sentence

A. Patry, F. Gotti, P. Langlais

MOOD: A Modular Object-Oriented Decoder for Statistical Machine Translation

A. Mauser, E. Matusov, H. Ney

Training a Statistical Machine Translation System without GIZA++

Session O26-W Question Answering :

G. Fliedner

Towards Natural Interactive Question Answering

D. Buscaldi, P. Rosso

Mining Knowledge from Wikipedia for the Question Answering task

V. Moriceau

Language Challenges for Data Fusion in Question-Answering

L. Zhou, C. Lin, E. Hovy

Summarizing Answers for Complicated Questions

S. Harabagiu, C. Human

An Answer Bank for Temporal Inference

G. Marton, B. Katz

Using Semantic Overlap Scoring in Answering TREC Relationship Questions

Session O27-G Large Programs, Data Centers and International Cooperation :

S. Itahashi, C. Tseng, S. Nakamura

Oriental COCOSDA: Past, Present and Future

B. Maegaard, J. Fenstad, L. Ahrenberg, K. Kvale, K. Mühlenbock, B. Heid

KUNSTI - Knowledge Generation for Norwegian Language Technology

E. D'Halleweyn, J. Odijk, L. Teunissen, C. Cucchiarini

The Dutch-Flemish HLT Programme STEVIN: Essential Speech and Language Technology Resources

S. Chaudiron, J. Mariani

Techno-langue: The French National Initiative for Human Language Technologies (HLT)

B. Maegaard, S. Krauwer, K. Choukri, L. Jørgensen

The BLARK concept and BLARK for Arabic

C. Cieri, M. Liberman

More Data and Tools for More Languages and Research Areas: A Progress Report on LDC Activities

Session O28-SE ASR, Tools and Evaluation :

M. Rayner, P. Bouillon, B. Hockey, N. Chatzichrisafis

REGULUS: A Generic Multilingual Open Source Platform for Grammar-Based Speech Applications

T. Kato, T. Toda, H. Saruwatari, K. Shikano

Transcription Cost Reduction for Constructing Acoustic Models Using Acoustic Likelihood Selection Criteria

J. Mauclair, Y. Estève, S. Petit-Renaud, P. Deléglise

Automatic Detection of Well Recognized Words in Automatic Speech Transcriptions

D. Gibbon, S. Tseng

Discourse functions of duration in Mandarin: resource design and implementation

J. Fiscus, J. Ajot, N. Radde, C. Laprun

Multiple Dimension Levenshtein Edit Distance Calculations for Evaluating Automatic Speech Recognition Systems During Simultaneous Speech

S. Burger, Z. Sloane, J. Yang

Competitive Evaluation of Commercially Available Speech Recognizers in Multiple Languages

Session O29-E Evaluation Semantics and Senses :

R. Johansson, P. Nugues

Construction of a FrameNet Labeler for Swedish Text

M. Sahlgren

Towards pertinent evaluation methodologies for word-space models

S. Schulte

Human Verb Associations as the Basis for Gold Standard Verb Classes: Validation against GermaNet and FrameNet

R. Passonneau

Measuring Agreement on Set-valued Items (MASI) for Semantic and Pragmatic Annotation

A. Rumshisky, J. Pustejovsky

Inducing Sense-Discriminating Context Patterns from Sense-Tagged Corpora

R. Navigli

Reducing the Granularity of a Computational Lexicon via an Automatic Mapping to a Coarse-Grained Sense Inventory

Session O30-W Parallel Corpora & MT :

A. Eisele

Parallel Corpora and Phrase-Based Statistical Machine Translation for New Language Pairs via Multiple Intermediaries

Y. Nitta, M. Saraki, S. Ikehara

Building Carefully Tagged Bilingual Corpora to Cope with Linguistic Idiosyncrasy

S. Hasan, A. Isbihani, H. Ney

Creating a Large-Scale Arabic to French Statistical Machine Translation System

X. Ma, C. Cieri

Corpus Support for Machine Translation at LDC

Session O31-W Corpus Construction & Annotation :

O. Medelyan, S. Schulz, J. Paetzold, M. Poprat, K. Markó

Language Specific and Topic Focused Web Crawling

D. Tufis, E. Irimia

RoCo-News: A Hand Validated Journalistic Corpus of Romanian

C. Grover, R. Tobin

Rule-Based Chunking and Reusability

E. Hajicova, P. Sgall

Corpus Annotation as a Test of a Linguistic Theory

Session O32-EW Coreference, Summarisation and Evaluation :

R. Catizone, A. Dalli, Y. Wilks

Evaluating Automatically Generated Timelines from the Web

O. Postolache, D. Cristea, C. Orasan

Transferring Coreference Chains through Word Alignment

O. Uryupina

Coreference Resolution with and without Linguistic Knowledge

E. Hovy, C. Lin, L. Zhou, J. Fukumoto

Automated Summarization Evaluation with Basic Elements

Session O33-EMT Evaluation of Multimodality & Dialogue :

J. Soler, P. Cerezo, C. Ávila, D. Merino

Usability evaluation of 3G multimodal services in Telefónica Móviles España

H. Dybkjær, L. Dybkjær

Act-Topic Patterns for Automatically Checking Dialogue Models

D. Mostefa, M. Garcia, K. Choukri

Evaluation of multimodal components within CHIL

H. Bunt

Dimensions in Dialogue Act Annotation

Session O34-WE Morphology & Tagging :

A. Novák

Morphological Tools for Six Small Uralic Languages

J. Vaneyghen, G. Pauw, D. Compernolle, W. Daelemans

A mixed word / morphological approach for extending CELEX for high coverage on contemporary large corpora

M. Mieskes, M. Strube

Part-of-Speech Tagging of Transcribed Speech

B. Hughes, D. Gibbon, T. Trippel

Feature-Based Encoding and Querying Language Resources with Character Semantics

Session O35-T Terminology & Knowledge Acquisition :

W. Mustafa el Hadi, I. Timimi, M. Dabbadie, K. Choukri, O. Hamon, Y. Chiao

Terminological Resources Acquisition Tools: Towards a User-Oriented Evaluation Model

T. Charoenporn, C. Kruengkrai, T. Theeramunkong, V. Sornlertlamvanich, H. Isahara

Word Knowledge Acquisition for Computational Lexicon Construction

M. Lafourcade

Conceptual Vector Learning - Comparing Bootstrapping from a Thesaurus or Induction by Emergence

N. Okazaki, S. Ananiadou

Clustering acronyms in biomedical text for disambiguation

Session O36-W Semantically Annotated Corpora :

B. Magnini, E. Pianta, C. Girardi, M. Negri, L. Romano, M. Speranza, V. Lenzi, R. Sprugnoli

I-CAB: the Italian Content Annotation Bank

A. Burchardt, K. Erk, A. Frank, A. Kowalski, S. Pado

The SALSA Corpus: a German corpus resource for lexical semantics

H. Papageorgiou, E. Desipri, M. Koutsombogera, K. Pouli, P. Prokopidis

Adding multi-layer semantics to the Greek Dependency Treebank

E. Agirre, I. Aldezabal, J. Etxeberria, E. Pociello

A Preliminary Study for Building the Basque PropBank

Session O37-S Lexicon and Pronunciation :

J. Gros, V. Cvetko-Orešnik, P. Jakopin, A. Mihelic

SI-PRON: a Pronunciation Lexicon for Slovenian

S. Schaden, U. Jekosch

“Casselberveetovallarga” and Other Unpronounceable Places: The CrossTowns Corpus

D. Graff, T. Buckwalter, M. Maamouri, H. Jin

Lexicon Development for Varieties of Spoken Colloquial Arabic

T. Pellegrini, L. Lamel

Experimental detection of vowel pronunciation variants in Amharic

Session O38-W Lexicon Syntax and Semantics :

M. Slavcheva

Semantic Descriptors: The Case of Reflexive Verbs

A. Korhonen, Y. Krymolowski, T. Briscoe

A Large Subcategorization Lexicon for Natural Language Processing Applications

P. Saint-Dizier

PrepNet: a Multilingual Lexical Description of Prepositions

K. Kipper, A. Korhonen, N. Ryant, M. Palmer

Extending VerbNet with Novel Verb Classes

Session O39-W Lexicons, Semantics, Clustering Tools :

M. Islam, D. Inkpen

Second Order Co-occurrence PMI for Determining the Semantic Similarity of Words

D. Widdows, A. Toumouh, B. Dorow, A. Lehireche

Ongoing Developments in Automatically Adapting Lexical Resources to the Biomedical Domain

M. Maragoudakis, K. Kermanidis, A. Garbis, N. Fakotakis

Dealing with Imbalanced Data using Bayesian Techniques

B. Medlock

An Introduction to NLP-based Textual Anonymisation

R. Marinelli, R. Bindi

Proper Names and Linguistic Dynamics

V. Rus, A. Graesser

The Look and Feel of a Confident Entailer

Session O40-T Creating Terminologies :

M. L’Homme, H. Bae

A Methodology for Developing Multilingual Resources for Terminology

G. Nenadic, N. Okazaki, S. Ananiadou

Towards a terminological resource for biomedical text mining

B. Dobrov, N. Loukachevitch

Development of Linguistic Ontology on Natural Sciences and Technology

R. Melz, P. Ryu, K. Choi

Compiling large language resources using lexical similarity metrics for domain taxonomy learning

W. Peters, M. Sagri, D. Tiscornia, S. Castagnoli

The LOIS Project

S. Anstein, G. Kremer, U. Reyle

Identifying and Classifying Terms in the Life Sciences: The Case of Chemical Terminology

Session O41-M Emotions :

C. Clavel, I. Vasilescu, L. Devillers, T. Ehrette, G. Richard

Fear-type emotions of the SAFE Corpus: annotation issues

L. Devillers, R. Cowie, J. Martin, E. Douglas-Cowie, S. Abrilian, M. McRorie

Real life emotions in French and English TV video clips: an integrated annotation protocol combining continuous and discrete approaches

K. Laskowski, S. Burger

Annotation and Analysis of Emotionally Relevant Behavior in the ISL Meeting Corpus

D. Reidsma, D. Heylen, R. Ordelman

Annotating Emotion in Meetings

T. Vogt, E. André

Improving Automatic Emotion Recognition from Speech via Gender Differentiation

J. Martin, G. Caridakis, L. Devillers, K. Karpouzis, S. Abrilian

Manual Annotation and Automatic Image Processing of Multimodal Emotional Behaviours: Validating the Annotation of TV Interviews

Session O42-EW Question Answering Evaluation :

L. Gillard, P. Bellot, M. El-Bèze

Question Answering Evaluation Survey

J. Ko, L. Hiyakumoto, E. Nyberg

Exploiting Multiple Semantic Resources for Answer Selection

H. Shima, M. Wang, F. Lin, T. Mitamura

Modular Approach to Error Analysis and Evaluation for Multilingual Question Answering

F. Lacatusu, A. Hickl, S. Harabagiu

Impact of Question Decomposition on the Quality of Answer Summaries

B. Magnini, D. Giampiccolo, L. Aunimo, C. Ayache, P. Osenova, A. Peñas, M. Rijke, B. Sacaleanu, D. Santos, R. Sutcliffe

The Multilingual Question Answering Track at CLEF

C. Ayache, B. Grau, A. Vilnat

EQueR: the French Evaluation campaign of Question-Answering Systems

Session P1-W : Anaphora & Coreference

A. Nøklestad, Ø. Reigem, C. Johansson

Developing a re-usable web-demonstrator for automatic anaphora resolution with support for manual editing of coreference chains

L. Hasler, C. Orasan, K. Naumann

NPs for Events: Experiments in Coreference Annotation

T. Caselli, I. Prodanof

Annotating Bridging Anaphors in Italian: in Search of Reliability

D. Goecke, A. Witt

Exploiting logical document structure for anaphora resolution

H. Suzuki, G. Kacmarcik

RefRef: A Tool for Viewing and Exploring Coreference Space

W. Gegg-Harrison, D. Byron

PYCOT: An Optimality Theory-based Pronoun Resolution Toolkit

M. Poesio, M. Kabadjov, P. Goux, U. Kruschwitz, E. Bishop, L. Corti

An Anaphora Resolution-Based Anonymization Module

G. Puscasu, R. Mitkov

If it were then, then when was it? Establishing the anaphoric role of then

Session P2-W : Corpora & LR Infrastructures

D. Kokkinakis

Collection, Encoding and Linguistic Processing of a Swedish Medical Corpus - The MEDLEX Experience

N. Oostdijk, L. Boves

User requirements analysis for the design of a reference corpus of written Dutch

C. Onelli, D. Poietti, C. Seidenari, F. Tamburini

The DiaCORIS project: a diachronic corpus of written Italian

D. Santos, S. Inácio

Annotating COMPARA, a Grammar-aware Parallel Corpus

D. Guthrie, B. Allison, W. Liu, L. Guthrie, Y. Wilks

A Closer Look at Skip-gram Modelling

A. Chalamandaris, A. Protopapas, P. Tsiakoulis, S. Rapti

All Greek to me! An automatic Greeklish to Greek transliteration system

Session P3-W : Machine Translation

A. Patel, D. Radev

O. Bojar, M. Prokopova

Czech-English Word Alignment

L. Cyrus

Building a resource for studying translation shifts

A. Wu, K. Lowery

A Hebrew Tree Bank Based on Cantillation Marks

S. Oepen, J. Lønning

Discriminant-Based MRS Banking

I. Kruijff-Korbayová, K. Chvatalova, O. Postolache

Annotation Guidelines for Czech-English Word Alignment

H. Kaji, M. Watanabe

Automatic Construction of Japanese WordNet

R. Rapp, C. Vide

Example-Based Machine Translation Using a Dictionary of Word Pairs

B. Schrader

Non-probabilistic alignment of rare German and English nominal expressions

M. Popovic, H. Ney

POS-based Word Reorderings for Statistical Machine Translation

V. Vandeghinste, I. Schuurman, M. Carl, S. Markantonatou, T. Badia

METIS-II: Machine Translation for Low Resource Languages

R. Ion, A. Ceauşu, D. Tufiş

Dependency-Based Phrase Alignment

M. Miháltz, G. Pohl

Exploiting Parallel Corpora for Supervised Word-Sense Disambiguation in English-Hungarian Machine Translation

A. Ceauşu, D. Ştefănescu, D. Tufiş

Acquis Communautaire Sentence Alignment using Support Vector Machines

Session P4-W : Question Answering, Summarisation, Classification

J. Civera, A. Juan

Bilingual Machine-Aided Indexing

S. Verberne, L. Boves, N. Oostdijk, P. Coppen

Data for question answering: The case of why

H. Saggion

Multilingual Multidocument Summarization Tools and Evaluation

H. Saggion, R. Gaizauskas

Language Resources for Background Gathering

Wei. Li, Wen. Li, Q. Lu

Mining Implicit Entities in Queries

F. Bertagna

Representation and Inference for Open-Domain QA: Strength and Limits of two Italian Semantic Lexicons

R. Sauri, M. Verhagen, J. Pustejovsky

SlinkET: A Partial Modal Parser for Events

Session P5-W : Learning and Acquisition

V. Nováček, P. Smrž, J. Pomikálek

Text Mining for Semantic Relations as a Support Base of a Scientific Portal Generator

D. Kawahara, S. Kurohashi

Case Frame Compilation from the Web using High-Performance Computing

B. Sagot, L. Clément, E. Villemonte de la Clergerie, P. Boullier

The Lefff 2 syntactic lexicon for French: architecture, acquisition, use

K. Kanzaki, Q. Ma, E. Yamamoto, H. Isahara

Semantic Analysis of Abstract Nouns to Compile a Thesaurus of Adjectives

B. Wellner, M. Vilain

Leveraging Machine Readable Dictionaries in Discriminative Sequence Models

N. Bel, S. Espeja, M. Marimon

New tools for the encoding of lexical data extracted from corpus

Y. Ohishi, K. Itou, K. Takeda, A. Fujii

Statistical Analysis for Thesaurus Construction using an Encyclopedic Corpus

Session P6-WT : Corpora: Creation, Annotation

M. Pazienza, M. Pennacchiotti, F. Zanzotto

Mixing WordNet, VerbNet and PropBank for studying verb relations

J. Apresjan, I. Boguslavsky, B. Iomdin, L. Iomdin, A. Sannikov, V. Sizov

A Syntactically and Semantically Tagged Corpus of Russian: State of the Art and Prospects

N. Xue

Annotating the Predicate-Argument Structure of Chinese Nominalizations

S. Džeroski, T. Erjavec, N. Ledinek, P. Pajas, Z. Žabokrtsky, A. Žele

Towards a Slovene Dependency Treebank

J. Nivre, J. Nilsson, J. Hall

Talbanken05: A Swedish Treebank with Phrase Structure and Dependency Annotation

R. Bernardi, A. Bolognesi, C. Seidenari, F. Tamburini

POS tagset design for Italian

T. Ohta, Y. Tateisi, J. Kim, A. Yakushiji, J. Tsujii

Linguistic and Biological Annotations of Biological Interaction Events

N. Areta, A. Gurrutxaga, I. Leturia, Z. Polin, R. Saiz, I. Alegria, X. Artola, A. Diaz de Ilarraza, N. Ezeiza, A. Sologaistoa, A. Soroa, A. Valverde

Structure, Annotation and Tools in the Basque ZT Corpus

N. Masaki, I. Hiroshi, H. Taiichi, T. Takenobu

A new approach to syntactic annotation

Y. Matsumoto, M. Asahara, K. Hashimoto, Y. Tono, A. Ohtani, T. Morita

An Annotated Corpus Management Tool: ChaKi

Y. Xia, K. Wong, Wen. Li

Constructing A Chinese Chat Language Corpus with A Two-Stage Incremental Annotation Approach

J. Jung, M. Miyake, H. Akama

Recurrent Markov Cluster (RMCL) Algorithm for the Refinement of the Semantic Network

V. Hoste, G. Pauw

KNACK-2002: a Richly Annotated Corpus of Dutch Written Text

F. Barreto, A. Branco, E. Ferreira, A. Mendes, M. Nascimento, F. Nunes, J. Silva

Open Resources and Tools for the Shallow Processing of Portuguese: The TagShare Project

H. Bunt, A. Schiffrin

Methodological Aspects of Semantic Annotation

Session P7-T : Terminology & Ontologies

M. Lin, H. Chen

Constructing a Named Entity Ontology from Web Corpora

K. Ahmad, M. Musacchio, G. Palumbo

Ontological and Terminological Commitments and the Discourse of Specialist Communities

L. Lesmo, L. Robaldo

From Natural Language to Databases via Ontologies

A. Simões, J. Almeida

T2O - Recycling Thesauri into a Multilingual Ontology

L. Bordoni, T. Mazzoli

Towards an Ontology for Art and Colours

M. Lundälv, K. Mühlenbock, B. Farre, A. Brännström

SYMBERED - a Symbol-Concept Editing Tool

Y. Kiyota, H. Nakagawa

A Domain Ontology Production Tool Kit Based on Automatically Constructed Case Frames

H. Brugman, V. Malaisé, L. Gazendam

A Web Based General Thesaurus Browser to Support Indexing of Television and Radio Programs

T. Declerck, A. Pérez, O. Vela, Z. Gantner, D. Manzano-Macho

Multilingual Lexical Semantic Resources for Ontology Translation

A. Estellés, A. Alcina, V. Soler

Retrieving Terminological Data from the TxtCeram Tagged Domain Corpus: A First Step towards a Terminological Ontology

L. Sarmento, B. Maia, D. Santos, A. Pinto, L. Cabral

Corpógrafo V3 - From Terminological Aid to Semi-automatic Knowledge Engineering

Y. Senda, Y. Sinohara, M. Okumura

Automatic Terminology Intelligibility Estimation for Readership-oriented Technical Writing

Session P8-E : Evaluation of Question Answering & Summarisation

A. Moschitti, R. Basili

A Tree Kernel approach to Question and Answer Classification in Question Answering Systems

I. Cramer, J. Leidner, D. Klakow

Building an Evaluation Corpus for German Question Answering by Harvesting Wikipedia

L. Costa, L. Sarmento

Component Evaluation in a Question Answering System

B. Grau, A. Ligozat, I. Robba, A. Vilnat, L. Monceau

FRASQUES : A Question Answering system in the EQueR evaluation campaign

T. Akiba

Exploiting Dynamic Passage Retrieval for Spoken Question Recognition and Context Processing towards Speech-driven Information Access Dialogue

M. Bilotti, E. Nyberg

Evaluation for Scenario Question Answering Systems

M. Hassel, J. Sjöbergh

Towards Holistic Summarization - Selecting Summaries, Not Sentences

C. Orasan, L. Hasler

Computer-aided summarisation - what the user really wants

Session P9-M : Multimodal Tools

P. Berck, A. Russel

ANNEX - a web-based Framework for Exploiting Annotated Media Resources

P. Wittenburg, H. Brugman, A. Russel, A. Klassmann, H. Sloetjes

ELAN: a Professional Framework for Multimodality Research

A. Popescu-Belis, M. Georgescul

TQB: Accessing Multimodal Data Using a Transcript-based Query and Browsing Interface

F. Behrens, J. Milde

The Eclipse Annotator: an extensible system for multimodal corpus creation

K. Maeda, H. Lee, J. Medero, S. Strassel

A New Phase in Annotation Tool Development at the Linguistic Data Consortium: The Evolution of the Annotation Graph Toolkit

K. Ahmad, C. Bennett, T. Oliver

Visual Surveillance and Video Annotation and Description

Session P10-S : Speech Corpora & Annotation

N. Grønnum

DanPASS - A Danish Phonetically Annotated Spontaneous Speech Corpus

Y. Irie, S. Matsubara, N. Kawaguchi, Y. Yamaguchi, Y. Inagaki

Layered Speech-Act Annotation for Spoken Dialogue Corpus

T. Ohno, S. Matsubara, H. Kashioka, N. Kato, Y. Inagaki

A Syntactically Annotated Corpus of Japanese Spoken Monologue

I. Chiari

Slips and errors in spoken data transcription

D. Gibbon, F. Fernandes, T. Trippel

A BLARK extension for temporal annotation mining

P. Paggio

Annotating Information Structure in a Corpus of Spoken Danish

T. Tan, L. Besacier

A French Non-Native Corpus for Automatic Speech Recognition

A. Bogacka, K. Dziubalska-Kołaczyk, G. Krynicki, D. Pietrala, M. Wypych

General and Task-Specific Corpus Resources for Polish Adult Learners of English

A. Zgank, D. Verdonik, A. Markus, Z. Kacic

SINOD - Slovenian non-native speech database

V. Alabau, C. Martinez

Bilingual speech corpus in two phonetically similar languages

M. Kaiser, H. Mögele, F. Schiel

Bikers Accessing the Web: The SmartWeb Motorbike Corpus

A. Moreno, A. Febrer, L. Márquez

Generation of Language Resources for the Development of Speech Technologies in Catalan

J. Benedí, E. Lleida, A. Varona, M. Castro, I. Galiano, R. Justo, I. López de Letona, A. Miguel

Design and acquisition of a telephone spontaneous speech dialogue corpus in Spanish: DIHANA

S. Rosset, S. Petel

The Ritel Corpus - An annotated Human-Machine open-domain question answering spoken dialog corpus

H. Boril, T. Boril, P. Pollák

Methodology of Lombard Speech Database Acquisition: Experiences with CLSD

E. Westerhout, P. Monachesi

A pilot study for a Corpus of Dutch Aphasic Speech (CoDAS)

R. Savy, F. Cutugno, C. Crocco

Multilevel corpus analysis: generating and querying an AGset of spoken Italian (SpIt-MDb)

Session P11-WME : Computational Lexicons

F. Vriend, L. Boves, H. Heuvel, R. Hout, J. Kruijsen, J. Swanenberg

A Unified Structure for Dutch Dialect Dictionary Data

M. Mangeot, A. Chalvin

Dictionary Building with the Jibiki Platform: the GDEF case

V. Trón, P. Halácsy, P. Rebrus, A. Rung, P. Vajda, E. Simon

Morphdb.hu: Hungarian lexical database and morphological grammar

B. Cartoni

Dealing with unknown words by simple decomposition: feasibility studies with Italian prefixes

T. Erjavec, T. Fišer

Building Slovene WordNet

S. Schulz, K. Markó, P. Daumke, U. Hahn, S. Hanser, P. Nohama, R. Andrade, E. Pacheco, M. Romacker

Semantic Atomicity and Multilinguality in the Medical Domain: Design Considerations for the MORPHOSAURUS Subword Lexicon

S. Nimb

LEXADV - a multilingual semantic Lexicon for Adverbs

C. Krstev, R. Stanković, D. Vitas, I. Obradović

WS4LR: A Workstation for Lexical Resources

A. Geyken, N. Schrader

LexikoNet - a lexical database based on type and role hierarchies

J. Pustejovsky, C. Havasi, J. Littman, A. Rumshisky, M. Verhagen

Towards a Generative Lexical Resource: The Brandeis Semantic Ontology

T. Kanamaru, M. Murata, H. Isahara

Creation of a Japanese Adverb Dictionary that Includes Information on the Speaker’s Communicative Intention Using Machine Learning

A. Roventini

Linking Verbal Entries of Different Lexical Resources

N. Ruimy

Merging two Ontology-based Lexical Resources

S. Bosch, L. Pretorius, J. Jones

Towards machine-readable lexicons for South African Bantu languages

M. Lopatkova, Z. Zabokrtsky, K. Skwarska

Valency Lexicon of Czech Verbs: Alternation-Based Model

G. Vetulani, Z. Vetulani, T. Obrębski

Syntactic Lexicon of Polish Predicative Nouns

R. Manurung, D. O’Mara, H. Pain, G. Ritchie, A. Waller

Building a Lexical Database for an Interactive Joke-Generator

Session P12-W : Named Entity

C. Sporleder, M. Erp, T. Porcelijn, A. Bosch, P. Arntzen

Identifying Named Entities in Text Databases from the Natural History Domain

P. Lucas, M. Balasubramanya, D. Widdows, M. Higgins

The Information Commons Gazetteer: A Public Resource of Populated Places and Worldwide Administrative Divisions

P. Mazur, R. Dale

Named Entity Extraction with Conjunction Disambiguation

U. Schäfer

OntoNERdIE - Mapping and Linking Ontologies to Named Entity Recognition and Information Extraction Resources

Session P13-W : Corpora

V. Tablan, W. Peters, D. Maynard, H. Cunningham

Creating Tools for Morphological Analysis of Sumerian

C. Benzmüller, H. Horacek, H. Lesourd, I. Kruijff-Korbayová, M. Schiller, M. Wolska

A corpus of tutorial dialogs on theorem proving; the influence of the presentation of the study-material

C. Bosco, V. Lombardo

Comparing linguistic information in treebank annotations

I. Langkilde-Geary, J. Betteridge

A Factored Functional Dependency Transformation of the English Penn Treebank for Probabilistic Surface Generation

A. Nazarenko, E. Alphonse, J. Derivière, T. Hamon, G. Vauvert, D. Weissenbacher

The ALVIS Format for Linguistically Annotated Documents

L. Sarmento

BACO - A large database of text and co-occurrences

M. Nascimento, J. Gonçalves, L. Pereira, A. Estrela, A. Pereira, R. Santos, S. Oliveira

The African Varieties of Portuguese: Compiling Comparable Corpora and Analysing Data-derived Lexicon

L. Dinu, A. Dinu

On the data base of Romanian syllables and some of its quantitative and cryptographic aspects

U. Quasthoff, M. Richter, C. Biemann

Corpus Portal for Search in Monolingual Corpora

I. Berlocher, K. Huh, E. Laporte, J. Nam

Morphological annotation of Korean with Directly Maintainable Resources

A. Bosch, I. Schuurman, V. Vandeghinste

Transferring PoS-tagging and lemmatization tools from spoken to written Dutch corpus development

G. Noord, I. Schuurman, V. Vandeghinste

Syntactic Annotation of Large Corpora in STEVIN

M. Wong

Skeleton Parsing in Chinese: Annotation Scheme and Guidelines

K. Tenfjord, P. Meurer, K. Hofland

The ASK Corpus - a Language Learner Corpus of Norwegian as a Second Language

A. Cole

Corpus Development and Publication

M. Sousa, T. Trippel

Building a historical corpus for Classical Portuguese: some technological aspects

T. Obrebski, M. Stolarski

UAM Text Tools - a flexible NLP architecture

Session P14-GW : General Issues and Large Programs

R. Ivanovska-Naskova

Development of the First LRs for Macedonian: Current Projects

T. Wandmacher, J. Antoine

Training Language Models without Appropriate Language Resources: Experiments with an AAC System for Disabled People

N. Cucurullo, S. Montemagni, M. Paoli, E. Picchi, E. Sassolini

Dialectal resources on-line: the ALT-Web experience

M. Monachini, N. Calzolari, K. Choukri, J. Friedrich, G. Maltese, M. Mammini, J. Odijk, M. Ulivieri

Unified Lexicon and Unified Morphosyntactic Specifications for Written and Spoken Italian

F. Calzolari, E. Sassolini, M. Sassi, S. Cucurullo, E. Picchi, F. Bertagna, A. Enea, M. Monachini, C. Soria, N. Calzolari

Next Generation Language Resources using Grid

M. Kemps-Snijders, M. Nederhof, P. Wittenburg

LEXUS, a web-based tool for manipulating lexical resources

F. Offenga, D. Broeder, P. Wittenburg, J. Ducret, L. Romary

Metadata Profile in the ISO Data Category Registry

H. Uszkoreit, F. Xu, J. Steffen, I. Aslan

The pragmatic combination of different cross-lingual resources for multilingual information services

I. Maks, B. Boelhouwer

Exploring opportunities for comparability and enrichment by linking lexical databases

Session P15-W : Multiwords & Collocations

R. Xu, Q. Lu, S. Li

The Design and Construction of A Chinese Collocation Bank

L. Wanner, M. Ramos

Local Document Relevance Clustering in IR Using Collocation Information

S. Cinkova, P. Pecina, P. Podvesky, P. Schlesinger

Semi-automatic Building of Swedish Collocation Lexicon

N. Grégoire

Elaborating the parameterized Equivalence Class Method for Dutch

A. Mendes, S. Antunes, M. Nascimento, J. Casteleiro, L. Pereira, T. Sá

COMBINA-PT: A Large Corpus-extracted and Hand-checked Lexical Database of Portuguese Multiword Expressions

W. Anderson, P. Kotzé

Finite state tokenisation of an orthographical disjunctive agglutinative language: The verbal segment of Northern Sotho

Session P16-T : Terminology Multiwords & Collocations

M. Brekke, K. Innselset, M. Kristiansen, K. Øvsthus

KB-N: Automatic Term Extraction from Knowledge Bank of Economics

P. Alessandro, F. Marco, M. Massimo

Integrating Methods and LRs for Automatic Keyword Extraction from Open Domain Texts

H. Hjelm

Extraction of Cross Language Term Correspondences

J. Ritz, U. Heid

Extraction tools for collocations and their morphosyntactic specificities

R. Kongkachandra, K. Chamnongthai

Semantic-Based Keyword Recovery Function for Keyword Extraction System

Session P17-E : Evaluation: annotation, acquisition, alignment, recognition

P. Wagacha, G. Pauw, P. Githinji

A Grapheme-Based Approach for Accent Restoration in Gikuyu

J. Semecký

On Automatic Assignment of Verb Valency Frames in Czech

K. Kageura, G. Kikui

A Self-Referring Quantitative Evaluation of the ATR Basic Travel Expression Corpus (BTEC)

R. Passonneau, N. Habash, O. Rambow

Inter-annotator Agreement on a Multilingual Semantic Annotation Task

G. Szarvas, R. Farkas, L. Felföldi, A. Kocsor, J. Csirik

A highly accurate Named Entity corpus for Hungarian

E. Bick

Turning a Dependency Treebank into a PSG-style Constituent Treebank

M. Ueyama

Evaluation of Web-based Corpora: Effects of Seed Selection and Time Interval

D. Kokkinakis, D. Dannélls

Recognizing Acronyms and their Definitions in Swedish Medical Texts

Y. Chiao, O. Kraif, D. Laurent, T. Nguyen, N. Semmar, F. Stuck, J. Véronis, W. Zaghouani

Evaluation of multilingual text alignment systems: the ARCADE II project

M. Salmenkivi

Finding representative sets of dialect words for geographical regions

D. Santos, N. Seco, N. Cardoso, R. Vilela

HAREM: An Advanced NER Evaluation Contest for Portuguese

Session P18-M : Multimodal Corpora and Annotation

J. Maas, B. Wrede

BITT: A Corpus for Topic Tracking Evaluation on Multimodal Human-Robot-Interaction

J. Segouat, A. Braffort, E. Martin

Sign Language corpus analysis: Synchronisation of linguistic annotation and numerical data

J. Bungeroth, D. Stein, P. Dreuw, M. Zahedi, H. Ney

A German Sign Language Corpus of the Domain Weather Report

S. Abrilian, L. Devillers, J. Martin

Annotation of Emotions in Real-Life Video Interviews: Variability between Coders

G. Lechenadec, V. Maffiolo, N. Chateau, J. Colletta

Creation of a Corpus of Multimodal Spontaneous Expressions of Emotions in Human-Machine Interaction

P. Strauß, H. Hoffman, W. Minker, H. Neumann, G. Palm, S. Scherer, F. Schwenker, H. Traue, W. Walter, U. Weidenbacher

Wizard-of-Oz Data Collection for Perception and Interaction in Multi-User Environments

I. Kruijff-Korbayova, T. Becker, N. Blaylock, C. Gerstenberger, M. Kaisser, P. Poller, V. Rieser, J. Schehl

The SAMMIE Corpus of Multimodal Dialogues with an MP3 Player

Session P19-E : Evaluation: Speech

J. Ko, F. Murase, T. Mitamura, E. Nyberg, M. Tateishi, I. Akahori

Analyzing the Effects of Spoken Dialog Systems on Driving Behavior

A. Raake, B. Katz

SUS-based Method for Speech Reception Threshold Measurement in French

P. Mareüil, C. D'Alessandro, A. Raake, G. Bailly, M. Garcia, M. Morel

A joint intelligibility evaluation of French text-to-speech systems: the EvaSy SUS/ACR campaign

M. Przybocki, G. Sanders, A. Le

Edit Distance: A Metric for Machine Translation Evaluation

G. Hodász

Evaluation Methods of a Linguistically Enriched Translation Memory System

J. Laoudi, C. Tate, C. Voss

Task-based MT Evaluation: From Who/When/Where Extraction to Event Understanding

H. Bonneau-Maynard, C. Ayache, F. Bechet, A. Denis, A. Kuhn, F. Lefevre, D. Mostefa, M. Quignard, S. Rosset, C. Servan, J. Villaneau

Results of the French Evalda-Media evaluation campaign for literal understanding

B. Andrassy, H. Hoege

Human and machine recognition as a function of SNR

J. Hoidekr, J. Psutka, A. Pražák, J. Psutka

Benefit of a Class-based Language Model for Real-time Closed-captioning of TV Ice-hockey Commentaries

N. Bernsen, T. Hansen, S. Kiilerich, T. Madsen

Field Evaluation of a Single-Word Pronunciation Training System

R. Banchs, A. Bonafonte, J. Pérez

Acceptance Testing of a Spoken Language Translation System

E. Coussé, S. Gillis

Regional Bias in the Broad Phonetic Transcriptions of the Spoken Dutch Corpus

D. Buehler, W. Minker

Stochastic Spoken Natural Language Parsing in the Framework of the French MEDIA Evaluation Campaign

J. Soler, P. Cerezo, D. Merino, A. Sánchez

MEDUSA: User-Centred Design and usability evaluation of Automatic Speech Recognition telephone services in Telefónica Móviles España

Session P20-S : Speech Corpora for TTS Systems and Emotion and Speech tools

D. Braga, L. Coelho, J. Teixeira, D. Freitas

ProGmatica: a Prosodic and Pragmatic Database for European Portuguese

I. Amdal, T. Svendsen

FonDat1: A Speech Synthesis Corpus for Norwegian

M. Umbert, A. Moreno, P. Agüero, A. Bonafonte

Spanish Synthesis Corpora

H. Mögele, M. Kaiser, F. Schiel

SmartWeb UMTS Speech Data Collection - The SmartWeb Handheld Corpus

C. Draxler, K. Jänsch

Speech Recordings in Public Schools in Germany - the Perfect Show Case for Web-based Recordings and Annotation

Z. Huang, L. Chen, M. Harper

An Open Source Prosodic Feature Extraction Tool

B. Tsou, T. Lai, K. Sin, L. Cheung

Court Stenography-To-Text (“STT”) in Hong Kong: A Jurilinguistic Engineering Effort

I. Saratxaga, E. Navas, I. Hernáez, I. Aholab

Designing and Recording an Emotional Speech Database for Corpus Based Synthesis in Basque

Session P21-W : Corpora, Lexicons & Multilinguality

B. Megyesi, A. Hein, É. Johanson

Building a Swedish-Turkish Parallel Corpus

T. Erjavec

The English-Slovene ACQUIS corpus

R. Steinberger, B. Pouliquen, A. Widiger, C. Ignat, T. Erjavec, D. Tufiş, D. Varga

The JRC-Acquis: A Multilingual Aligned Parallel Corpus with 20+ Languages

B. Maegaard, L. Offersgaard, L. Henriksen, H. Jansen, X. Lepetit, C. Navarretta, C. Povlsen

The MULINCO corpus and corpus platform

J. Tiedemann

ISA & ICA - Two Web Interfaces for Interactive Alignment of Bitexts

E. Guevara, S. Scalise, A. Bisetto, C. Melloni

MORBO/COMP: a multilingual database of compound words

A. Sansò

Documenting variation across Europe and the Mediterranean: The Pavia Typological Database

S. Cinková

From PropBank to EngValLex: Adapting PropBank-Lexicon to the Valency Theory of the Functional Generative Description

D. Samy, A. Sandoval, J. Guirao, E. Alfonseca

Building a Parallel Multilingual Corpus (Arabic-Spanish-English)

W. Ma, C. Huang

Uniform and Effective Tagging of a Heterogeneous Giga-word Corpus

M. Geilfuss, J. Milde

SAM - an annotation editor for parallel texts

Session P22-W : Tools for Corpora and Lexicons

J. Kinoshita, L. Salvador, C. Menezes

CoGrOO: a Brazilian-Portuguese Grammar Checker based on the CETENFOLHA Corpus

P. Nemec

Tree Searching/Rewriting Formalism

P. Le, T. Nguyen, L. Romary, A. Roussanaly

A Lexicalized Tree-Adjoining Grammar for Vietnamese

C. Rohrer, M. Forst

Improving coverage and parsing quality of a large-scale LFG for German

T. Baldwin, S. Awab

Open Source Corpus Analysis Tools for Malay

J. Nivre, J. Hall, J. Nilsson

MaltParser: A Data-Driven Parser-Generator for Dependency Parsing

B. Sagot, P. Boullier

Deep non-probabilistic parsing of large corpora

S. Frota, M. Vigário, F. Martin

FreP: An Electronic Tool for Extracting Frequency Information of Phonological Units from Portuguese Written Text

R. Levy, G. Andrew

Tregex and Tsurgeon: tools for querying and manipulating tree data structures

U. Heid, E. Taljard, D. Prinsloo

Grammar-based tools for the creation of tagging resources for an unresourced language: the case of Northern Sotho

E. Dhonnchadha, J. Genabith

A Part-of-speech tagger for Irish using Finite-State Morphology and Constraint Grammar Disambiguation

P. Halácsy, A. Kornai, C. Oravecz, V. Trón, D. Varga

Using a morphological analyzer in high precision POS tagging of Hungarian

M. Verhagen, R. Knippen, I. Mani, J. Pustejovsky

Annotation of Temporal Relations with Tango

C. Havasi, J. Pustejovsky, M. Verhagen

BULB: A Unified Lexical Browser

B. Waldron, A. Copestake, U. Schäfer, B. Kiefer

Preprocessing and Tokenisation Standards in DELPH-IN Tools

Y. Kato, S. Matsubara, Y. Inagaki

A Corpus Search System Utilizing Lexical Dependency Structure

C. Fairon, S. Paumier

A framework for real-time dictionary updating

E. Alfonseca, A. Moreno-Sandoval, J. Guirao, M. Ruiz-Casado

The wraetlic NLP suite

J. Atserias, B. Casas, E. Comelles, M. González, L. Padró, M. Padró

FreeLing 1.3: Syntactic and semantic services in an open-source NLP library

Session P23-GW : Web services and Digital Archives and Libraries

R. Bernardi, D. Calvanese, L. Dini, V. Tomaso, E. Frasnelli, U. Kugler, B. Plank

Multilingual Search in Libraries. The case-study of the Free University of Bozen-Bolzano

D. Broeder, F. Offenga, P. Wittenburg, P. Kamp, D. Nathan, S. Strömqvist

Technologies for a Federation of Language Resource Archives

P. Berck, H. Bibiko, M. Kemps-Snijders, A. Russel, P. Wittenburg

Ontology-based Language Archive Utilization

M. Kemps-Snijders, J. Ducret, L. Romary, P. Wittenburg

An API for accessing the Data Category Registry

M. Boekestein, G. Depoorter, R. Veenendaal

Functioning of the Centre for Dutch Language and Speech Technology

E. Suzuki, T. Suzuki, K. Kakihana

On the Web Trilingual Sign Language Dictionary to Learn the foreign Sign Language without Learning a Target Spoken Language

Session P24-WM : Semantics, Ontologies, Semantic Web

A. Oltramari

LexiPass methodology: a conceptual path from frames to senses and back

P. Morărescu

Principles for annotating and reasoning with spatial information

P. Buitelaar, P. Cimiano, S. Racioppa, M. Siegel

Ontology-based Information Extraction with SOBA

E. Yamamoto, K. Kanzaki, H. Isahara

Detection of inconsistencies in concept classifications in a large dictionary — Toward an improvement of the EDR electronic dictionary —

D. Dutoit

Alexandria: A Powerful Multilingual Resource For web

A. Kawtrakul, C. Pechsiri, T. Permpool, D. Thamvijit, P. Sornprasert, C. Yingsaeree, M. Suktarachan

Ontology Driven K-Portal Construction and K-Service Provision

P. Bouquet, L. Serafini, S. Zanobini

The role of lexical resources in matching classification schemas

K. Simov, P. Osenova

Shallow Semantic Annotation of Bulgarian

T. Declerck, M. Vela

Generic NLP Tools for Supporting Shallow Ontology Building

D. Beermann, L. Hellan

Word Sense Disambiguation and Semantic Disambiguation for Construction Types in Deep Processing Grammars

P. Filipe, N. Mamede

A Framework to Integrate Ubiquitous Knowledge Modeling

Session P25-T : Terminology Lexicons & Tools

H. Ozaku, A. Abe, K. Sagara, N. Kuwahara, K. Kogure

Features of Terms in Actual Nursing Activities

A. Storrer, S. Wellinghoff

Automated detection and annotation of term definitions in German text corpora

J. Feliu, J. Vivaldi, M. Cabré

SKELETON: Specialised knowledge retrieval on the basis of terms and conceptual relations

C. Yirong, L. Qin, L. Wenjie, S. Zhifang, J. Luning

A Study on Terminology Extraction Based on Classified Corpora

A. Mulloni, V. Pekar

Automatic Detection of Orthographic Cues for Cognate Recognition

B. Tsou, O. Kwong

Toward a Pan-Chinese Thesaurus

G. Pardelli, M. Sassi, S. Goggi, P. Orsolini

Natural Language Processing: A Terminological And Statistical Approach

M. Itagaki, A. Aue, T. Aikawa

Detecting Inter-domain Semantic Shift using Syntactic Similarity

Session P26-EW : Evaluation: Lexicons, Tools

K. Smaïli, C. Lavecchia, J. Haton

Linguistic features modeling based on Partial New Cache

N. Ruimy

Structuring a Domain Vocabulary in a General Knowledge Environment

R. Passonneau, R. Blitz, D. Elson, A. Giral, J. Klavans

CLiMB ToolKit: A Case Study of Iterative Evaluation in a Multidisciplinary Project

C. Kruengkrai, V. Sornlertlamvanich, H. Isahara

A Conditional Random Field Framework for Thai Morphological Analysis

K. Kuroda, M. Utiyama, H. Isahara

Getting Deeper Semantics than Berkeley FrameNet with MSFA

B. Alex

Integrating Language Knowledge Resources to Extend the English Inclusion Classifier to a New Language

M. Kouylekov, B. Magnini

Building a Large-Scale Repository of Textual Entailment Rules

S. Schaden

Evaluation of Automatically Generated Transcriptions of Non-Native Pronunciations using a Phonetic Distance Measure

Ž. Agić, M. Tadić

Evaluating Morphosyntactic Tagging of Croatian Texts

U. Petersen

Querying Both Parallel And Treebank Corpora: Evaluation Of A Corpus Query System

J. Medero, K. Maeda, S. Strassel, C. Walker

An Efficient Approach to Gold-Standard Annotation: Decision Points for Complex Tasks

A. Sinopalnikova, P. Smrž

Intelligent Dictionary Interfaces: Usability Evaluation of Access-Supporting Enhancements

H. Halpin

Automatic Evaluation and Composition of NLP Pipelines with Web Services

A. Barbu, E. Ionescu, V. Mititelu

Romanian Valence Dictionary in XML Format

Session P27-E : Evaluation methodologies & Evaluation of Language Technologies

A. Lisowska, N. Underwood

Rote: A Tool to Support Users in Defining the Relative Importance of Quality Characteristics

M. King, N. Underwood

Evaluating Symbiotic Systems: the challenge

N. Underwood, A. Lisowska

The Evolution of an Evaluation Framework for a Text Mining System

F. Ibekwe-sanjuan

A task-oriented framework for evaluating theme detection systems: A discussion paper

L. Sitbon, P. Bellot

Tools and methods for objective or contextual evaluation of topic segmentation

F. Zou, F. Wang, X. Deng, S. Han

Evaluation of Stop Word Lists in Chinese Language

A. Shehata, F. Zanzotto

A Dependency-based Algorithm for Grammar Conversion

N. Semmar, M. Laib, C. Fluhr

A Deep Linguistic Analysis for Cross-language Information Retrieval

W. Abramowicz, A. Filipowska, J. Piskorski, K. Węcel, K. Wieloch

Linguistic Suite for Polish Cadastral System

K. Ahmad, L. Gillam, D. Cheng

Sentiments on a Grid: Analysis of Streaming News and Views

B. Pedersen

Query Expansion on Compounds

D. Rojas, T. Aikawa

Predicting MT Quality as a Function of the Source Language

Session P28-M : Multimodal: Methodologies & Natural Interaction

M. Rajman, M. Ailomaa, A. Lisowska, M. Melichar, S. Armstrong

Extending the Wizard of Oz Methodology for Language-enabled Multimodal Systems

S. Raidt, G. Bailly, F. Elisei

Does a Virtual Talking Face Generate Proper Multimodal Cues to Draw User’s Attention to Points of Interest?

R. Taib, N. Ruiz

Tangible Objects for the Acquisition of Multimodal Interaction Patterns

Session P29-S : S2S Translation

D. Verdonik, M. Rojc

Are you ready for a call? - Spontaneous conversations in tourism for speech-to-speech translation systems

J. Pérez, A. Bonafonte

GAIA: Common Framework for the Development of Speech Translation Technologies

H. Tohyama, S. Matsubara

Collection of Simultaneous Interpreting Patterns by Using Bilingual Spoken Monologue Corpus

H. Heuvel, K. Choukri, C. Gollan, A. Moreno, D. Mostefa

TC-STAR: New language resources for ASR and SLT purposes

Session P30-S : Speech: TTS & ASR

B. Williams, R. Jones, I. Uemlianin

Tools and resources for speech synthesis arising from a Welsh TTS project

H. Somers, G. Evans, Z. Mohamed

Developing Speech Synthesis for Under-Resourced Languages by “Faking it”: An Experiment with Somali

D. Athanassia-lida, A. Chalamandaris

Language identification from suprasegmental cues: Speech synthesis of Greek utterances from different dialectal variations

H. Kawanami, T. Kitamura, K. Shikano

Long-term Analysis of Prosodic Features of Spoken Guidance System User Speech

M. Bahrani, H. Sameti, N. Hafezi, H. Movassagh

Building and Incorporating Language Models for Persian Continuous Speech Recognition Systems

J. Talley

Bootstrapping New Language ASR Capabilities: Achieving Best Letter-to-Sound Performance under Resource Constraints

P. Ircing, J. Hoidekr, J. Psutka

Exploiting Linguistic Knowledge in Language Modeling of Czech Spontaneous Speech