ID |
Title |
Authors |
1 |
CLiPS Stylometry Investigation (CSI) corpus: A Dutch corpus for the detection of age, gender, personality, sentiment and deception in text |
Ben Verhoeven and Walter Daelemans |
2 |
Amazigh Verb Conjugator |
Fadoua Ataa Allah and Siham Boulaknadel |
3 |
Extracting News Web Page Creation Time with DCTFinder |
Xavier Tannier |
4 |
WordNet–Wikipedia–Wiktionary: Construction of a Three-way Alignment |
Tristan Miller and Iryna Gurevych |
6 |
Expanding n-gram analytics in ELAN and a case study for sign synthesis |
Rosalee Wolfe, John McDonald, Larwan Berke and Marie Stumbo |
7 |
New Spanish speech corpus database for the analysis of people suffering from Parkinson’s disease |
Juan Rafael Orozco-Arroyave, Julián David Arias-Londoño, Jesús Francisco Vargas-Bonilla, María Claudia Gonzalez-Rátiva and Elmar Nöth |
8 |
A LDA-Based Topic Classification Approach from Highly Imperfect Automatic Transcriptions |
Mohamed Morchid, Richard Dufour and Georges Linares |
9 |
ELRA Services |
Victoria Arranz and Khalid Choukri |
12 |
REFRACTIVE: An Open Source Tool to Extract Knowledge from Syntactic and Semantic Relations |
Peter Exner and Pierre Nugues |
14 |
Digital Library 2.0: Source of Knowledge and Research Collaboration Platform |
Włodzimierz Gruszczyński and Maciej Ogrodniczuk |
16 |
Linguistic Evaluation of Support Verb Construction Translations by OpenLogos and Google Translate |
Anabela Barreiro, Johanna Monti, Brigitte Orliac, Susanne Preuß, Kutz Arrieta, Wang Ling, Fernando Batista and Isabel Trancoso |
18 |
Towards Building A Kashmiri Treebank: Setting Up the Treebanking Pipeline |
Riyaz Ahmad Bhat, Shahid Mushtaq Bhat and Dipti Misra Sharma |
19 |
Characterizing and Predicting Bursty Events: The Buzz Case Study on Twitter |
Mohamed Morchid, Georges Linares and Richard Dufour |
21 |
Constructing a Chinese–Japanese Parallel Corpus from Wikipedia |
Chenhui Chu, Toshiaki Nakazawa and Sadao Kurohashi |
23 |
Causality in Biomedical Scientific Discourse: A Meta-knowledge Perspective |
Claudiu Mihăilă and Sophia Ananiadou |
24 |
ML-Optimization of Ported Constraint Grammars |
Eckhard Bick |
25 |
A Study on Crowdsourcing Enterprise Question Collection and Annotation |
Yuan Luo, Thomas Boucher, Tolga Oral, David Osofsky and Sara Weber |
27 |
The Impact of Cohesion Errors in Extraction Based Summaries |
Evelina Rennes and Arne Jonsson |
28 |
Can the Crowd be Controlled?: A Case Study on Crowd Sourcing and Automatic Validation of Completed Tasks based on User Modeling |
Balamurali A.R and Pratibha Moogi |
29 |
Using Twitter and Sentiment Analysis for event detection |
Georgios Paltoglou |
31 |
A decade of HLT Agency activities in the Low Countries: from resource maintenance (BLARK) to service offerings (BLAISE) |
Peter Spyns and Remco van Veenendaal |
32 |
An Effortless Way to Create Large-Scale Datasets for Famous Speakers |
François Salmon and Félicien Vallet |
34 |
Disambiguating Verbs by Collocation: Corpus Lexicography meets Natural Language Processing |
Ismail El Maarouf, Jane Bradbury and Vít Baisa |
39 |
Three dimensions of the so-called "interoperability" of annotation schemes |
Eva Hajičová |
41 |
German Alcohol Language Corpus - the Question of Dialect |
Florian Schiel |
42 |
A Japanese Word Dependency Corpus |
Shinsuke Mori, Hideki Ogura and Tetsuro Sasada |
43 |
An Iterative Approach for Mining Parallel Sentences in a Comparable Corpus |
Lise Rebout and Philippe Langlais |
45 |
French Resources for Extraction and Normalization of Temporal Expressions with HeidelTime |
Véronique Moriceau and Xavier Tannier |
46 |
Exploiting Portuguese Lexical Knowledge Bases for Answering Open Domain Cloze Questions Automatically |
Hugo Gonçalo Oliveira and Paulo Gomes |
47 |
A Database for Measuring Linguistic Information Content |
Richard Sproat |
48 |
The N2 Corpus: A Semantically-Annotated Collection of Islamist Extremist Stories |
Mark Finlayson, Jeffry Halverson and Steven Corman |
50 |
Is Obligatoriness in Valency Important for Word Order in Czech? |
Jiří Mírovský and Kateřina Rysová |
51 |
Representing Multimodal Linguistic Annotated data |
Brigitte Bigi, Tatsuya Watanabe and Laurent Prévot |
52 |
How to evaluate a corpus |
Adam Kilgarriff, Pavel Rychlý, Milos Jakubicek, Vojtěch Kovář and Vitek Baisa |
53 |
Hashtag Occurrences, Layout and Translation: A Corpus-driven Analysis of Tweets Published by the Canadian Government |
Fabrizio Gotti, Philippe Langlais and Atefeh Farzindar |
58 |
GLÀFF, a Large Versatile French Lexicon |
Nabil Hathout, Franck Sajous and Basilio Calderone |
59 |
TaLAPi – A Thai Linguistically Annotated Corpus for Language Processing |
Ai Ti Aw, Sharifah Mahani Aljunied, Nattadaporn Lertcheva and Sasiwimon Kalunsima |
60 |
Sentence Rephrasing for Parsing Sentences with OOV Words |
Hen-Hsen Huang, Huan-Yuan Chen, Chang-Sheng Yu and Hsin-Hsi Chen |
62 |
Pruning the Search Space of the Wolof LFG Grammar Using a Probabilistic and a Constraint Grammar Parser |
Cheikh Bamba Dione |
63 |
A Compact Interactive Visualization of Dependency Treebank Query Results |
Chris Culy, Marco Passarotti and Ulla König-Cardanobile |
66 |
The Development of Dutch and Afrikaans Language Resources for Compound Boundary Analysis. |
Menno van Zaanen, Gerhard Van Huyssteen, Suzanne Aussems, Chris Emmery and Roald Eiselen |
67 |
Who cares about sarcastic tweets? Investigating the impact of sarcasm on sentiment analysis. |
Diana Maynard |
70 |
Thomas Aquinas in the TüNDRA: Integrating the Index Thomisticus Treebank into CLARIN-D |
Scott Martens and Marco Passarotti |
71 |
Automatic detection of other-repetition occurrences: application to French conversational Speech |
Brigitte Bigi, Roxane Bertrand and Mathilde Guardiola |
73 |
How Could Veins Speed Up The Process Of Discourse Parsing |
Daniel Anechitei, Elena Mitocariu and Dan Cristea |
74 |
Overview of Todai Robot Project and Evaluation Framework of its NLP-based Problem Solving |
Akira Fujita, Akihiro Kameda, Ai Kawazoe and Yusuke Miyao |
77 |
A Model for Processing Illocutionary Structures and Argumentation in Debates |
Kasia Budzynska, Mathilde Janier, Chris Reed, Patrick Saint-Dizier, Manfred Stede and Olena Yaskorska |
79 |
Verbs of Saying with a Textual Connecting Function in the Prague Dependency Treebank |
Magdalena Rysova |
80 |
ProsoReportDialog: a tool for temporal variables description in dialogues |
Jean-Philippe Goldman |
83 |
Clustering tweets using Wikipedia concepts |
Guoyu Tang, Weizhi Wang, Yunqing Xia, Raymond Lau and Fang Zheng |
85 |
The USAGE review corpus for fine grained multi lingual opinion analysis |
Roman Klinger and Philipp Cimiano |
88 |
Comparability improves the co-clustering and co-categorization of thematic bi-lingual corpora |
Guiyao Ke, Pierre-Francois Marteau and Gildas Ménier |
89 |
Vulnerability in Acquisition, Language Impairments in Dutch: Creating a VALID Data Archive |
Jetske Klatter, Roeland Van Hout, Henk van den Heuvel, Paula Fikkert, Anne Baker, Jan De Jong, Frank Wijnen, Eric Sanders and Paul Trilsbeek |
90 |
Single Classifier Approach for Verb Sense Disambiguation based on Generalized Features |
Daisuke Kawahara and Martha Palmer |
94 |
When Transliteration Met Crowdsourcing : An Empirical Study of Transliteration via Crowdsourcing using Efficient, Non-redundant and Fair Quality Control |
Mitesh M. Khapra, Ananthakrishnan Ramanathan, Anoop Kunchukuttan, Karthik Visweswariah and Pushpak Bhattacharyya |
95 |
Exploiting networks in Law |
Livio Robaldo, Guido Boella, Luigi Di Caro and Andrea Violato |
96 |
Parsing Chinese Synthetic Words with a Character-based Dependency Model |
Fei Cheng, Kevin Duh and Yuji Matsumoto |
99 |
Multilingual Test Sets for Machine Translation of Search Queries for Cross-Lingual Information Retrieval in the Medical Domain |
Zdenka Uresova, Jan Hajic, Pavel Pecina and Ondrej Dusek |
101 |
Phoneme Set Design Using English Speech Database by Japanese for Dialogue-Based English CALL Systems |
Xiaoyun Wang, Jinsong Zhang, Masafumi Nishida and Seiichi Yamamoto |
102 |
Bilingual Dictionary Construction with Transliteration Filtering |
John Richardson, Toshiaki Nakazawa and Sadao Kurohashi |
103 |
Large SMT data-sets extracted from Wikipedia |
Dan Tufiș |
104 |
How to Use less Features and Reach Better Performance in Author Gender Identification |
Juan Soler and Leo Wanner |
105 |
DeLex, a freely-available, large-scale and linguistically grounded morphological lexicon for German |
Benoît Sagot |
106 |
Grounding language in source code with application to API retrieval |
Huijing Deng and Grzegorz Chrupała |
107 |
Production of Phrase Tables in 11 European Languages using an Improved Sub-sentential Aligner |
Juan Luo and Yves Lepage |
110 |
Evaluation of Simple Distributional Compositional Operations on Longer Texts |
Tamara Polajnar, Laura Rimell and Stephen Clark |
113 |
First Insight into Quality-Adaptive Dialogue |
Stefan Ultes, Hüseyin Dikme and Wolfgang Minker |
115 |
TMO -- The Federated Ontology of the TrendMiner Project |
Hans-Ulrich Krieger and Thierry Declerck |
116 |
Zmorge: A German Morphological Lexicon Extracted from Wiktionary |
Rico Sennrich and Beat Kunz |
117 |
Weighting Schemes for Phrase Extraction |
Saab Mansour and Hermann Ney |
118 |
The Sweet-Home speech and multimodal corpus for home automation interaction |
Michel Vacher, Benjamin Lecouteux, Pedro Chahuara, François Portet, Brigitte Meillon and Nicolas Bonnefond |
119 |
Tools for Arabic Natural Language Processing: a case study in qalqalah prosody |
Claire Brierley, Majdi Sawalha and Eric Atwell |
120 |
Variations on quantitative comparability measures and their evaluations on synthetic French-English comparable corpora |
Guiyao Ke, Pierre-Francois Marteau and Gildas Ménier |
121 |
Automatic creation of WordNets from parallel corpora |
Antoni Oliver and Salvador Climent |
122 |
Mapping WordNet Domains, WordNet Topics and Wikipedia Categories to Generate Multilingual Domain Specific Resources |
Spandana Gella, Carlo Strapparava and Vivi Nastase |
124 |
Annotating Question Decomposition on Complex Clinical Queries |
Kirk Roberts, Halil Kilicoglu, Marcelo Fiszman and Dina Demner-Fushman |
125 |
Boosting Open Information Extraction with Noun-Based Relations |
Clarissa Xavier and Vera Lima |
127 |
Bootstrapping Open-Source English-Bulgarian Computational Dictionary |
Krasimir Angelov |
128 |
MotàMot project: conversion of a French-Khmer published dictionary for building a multilingual lexical system |
Mathieu Mangeot |
130 |
JUST.ASK, a QA system that learns to answer new questions from previous interactions |
Sérgio Curto, Ana C. Mendes, Pedro Curto and Luísa Coheur |
132 |
Design and Development of an Online Computational Framework to Facilitate Language Comprehension Research on Indian Languages |
Manjira Sinha and Tirthankar Dasgupta |
134 |
The Nijmegen Corpus of Casual Czech |
Mirjam Ernestus, Lucie Kočková-Amortová and Petr Pollák |
138 |
Enhancing Archaic Language Processing via the Properties Shared in Modern and Archaic Corpora: A Case Study on Chinese |
Yan Song and Fei Xia |
143 |
Open-domain Interaction and Online Content in the Sami Language |
Kristiina Jokinen |
144 |
A Character-based Approach to Distributional Semantic Models: Exploiting Kanji Characters for Constructing Japanese Word Vectors |
Akira Utsumi |
145 |
Cross-Language Authorship Attribution |
Dasha Bogdanova and Angeliki Lazaridou |
147 |
Using Transfer Learning to Assist Exploratory Corpus Annotation |
Paul Felt, Eric Ringger, Kevin Seppi and Kristian Heal |
148 |
MODAR 2.0: A Modality Annotated Corpus of Egyptian and Modern Standard Arabic Tweets |
Rania Al-Sabbagh, Roxana Girju and Jana Diesner |
150 |
ANCOR_Centre, a large free spoken French coreference corpus: description of the resource and reliability measures |
Judith Muzerelle, Anaïs Lefeuvre, Emmanuel Schang, Jean-Yves Antoine, Aurore Pelletier, Denis Maurel, Iris Eshkol and Jeanne Villaneau |
151 |
Guampa: a Toolkit for Collaborative Translation |
Alex Rudnick, Taylor Skidmore and Michael Gasser |
153 |
Experiences with the ISOcat Data Category Registry |
Daan Broeder, Ineke Schuurman and Menzo Windhouwer |
154 |
RELISH LMF: Unlocking the Full Power of the Lexical Markup Framework |
Menzo Windhouwer, Justin Petro and Shakila Shayan |
155 |
Building a Corpus of Manually Revised Texts from Discourse Perspective |
Ryu Iida and Takenobu Tokunaga |
156 |
The CMD Cloud |
Matej Durco and Menzo Windhouwer |
159 |
Linguistic landscaping of South Asia using digital language resources: Genetic vs. areal linguistics |
Lars Borin, Anju Saxena, Taraka Rama and Bernard Comrie |
160 |
Single-Person and Multi-Party 3D Visualizations for Nonverbal Communication Analysis |
Michael Kipp, Levin Freiherr von Hollen, Michael Christopher Hrstka and Franziska Zamponi |
162 |
Collection of a Simultaneous Translation Corpus for Comparative Analysis |
Hiroaki Shimizu, Graham Neubig, Sakriani Sakti, Tomoki Toda and Satoshi Nakamura |
163 |
The AV-LASYN Database : A synchronous corpus of audio and 3D facial marker data for audio-visual laughter synthesis |
Huseyin Cakmak, Jerome Urbain and Thierry Dutoit |
165 |
Interoperability of dialogue corpora through ISO 24617-2 based querying |
Volha Petukhova, Andrei Malchanau and Harry Bunt |
168 |
ASR-based CALL systems and learner speech data: new resources and opportunities for research and development in second language learning |
Catia Cucchiarini, Steve Bodnar and Bart Penning de Vries |
169 |
The DBOX corpus collection of spoken human-human and human-machine dialogues |
Volha Petukhova, Martin Gropp, Dietrich Klakow, Gregor Eigner, Mario Topf, Stefan Srb, Petr Motlicek, Blaise Potard, John Dines, Olivier Deroo, Ronny Egeler, Uwe Meinz and Steffen Liersch |
171 |
The Database for Spoken German – DGD2 |
Thomas Schmidt |
175 |
Building a Dataset of Multilingual Cognates for the Romanian Lexicon |
Liviu Dinu and Alina Maria Ciobanu |
176 |
Benchmarking the Extraction and Disambiguation of Named Entities on the Semantic Web |
Giuseppe Rizzo, Marieke van Erp and Raphaël Troncy |
178 |
Automatic Expansion of the MRC Psycholinguistic Database Imageability Ratings |
Ting Liu, Kit Cho, G. Aaron Broadwell, Samira Shaikh, Tomek Strzalkowski, John Lien, Sarah Taylor, Laurie Feldman, Boris Yamrom, Nick Webb, Umit Boz and Ignacio Cases |
180 |
SenTube: A Corpus for Sentiment Analysis on YouTube Social Media |
Olga Uryupina, Barbara Plank, Aliaksei Severyn, Agata Rotondi and Alessandro Moschitti |
182 |
CIEMPIESS: A New Open-Sourced Mexican Spanish Radio Corpus |
Carlos Daniel Hernandez Mena and Abel Herrera Camacho |
183 |
SAVAS: Collecting, Annotating and Sharing Audiovisual Language Resources for Automatic Subtitling |
Arantza del Pozo, Carlo Aliprandi, Aitor Álvarez, Carlos Mendes, Joao P. Neto, Sérgio Paulo, Nicola Piccinini and Matteo Raffaelli |
185 |
Exploring and Visualizing Variation in Language Resources |
Peter Fankhauser, Jörg Knappen and Elke Teich |
186 |
Simple Effective Microblog Named Entity Recognition: Arabic as an Example |
Kareem Darwish and Wei Gao |
187 |
A New Multi-Document Summarization Corpus for European Portuguese |
Miguel B. Almeida, Mariana Almeida, André F. T. Martins, Helena Figueira, Pedro Mendes and Cláudia Pinto |
188 |
Hope and Fear: How Opinions Influence Factuality |
Chantal van Son, Marieke van Erp, Antske Fokkens and Piek Vossen |
189 |
Linking Pictographs to Synsets: Sclera2Cornetto |
Vincent Vandeghinste and Ineke Schuurman |
190 |
Information Extraction from German Patient Records via Hybrid Parsing and Relation Extraction Strategies |
Hans-Ulrich Krieger, Christian Spurk, Hans Uszkoreit, Feiyu Xu, Yi Zhang, Frank Müller and Thomas Tolxdorff |
192 |
The MMASCS multi-modal annotated synchronous corpus of audio, video, facial motion and tongue motion data of normal, fast and slow speech |
Dietmar Schabus, Michael Pucher and Phil Hoole |
195 |
Genres in the Prague Discourse Treebank |
Lucie Poláková, Pavlína Jínová and Jiří Mírovský |
196 |
Speech recognition web services for Dutch |
Joris Pelemans, Kris Demuynck, Hugo Van hamme and Patrick Wambacq |
199 |
Translation errors from English to Portuguese: an annotated corpus |
Angela Costa, Tiago Luís and Luísa Coheur |
202 |
The liability of service providers in e-Research Infrastructures: killing the messenger? |
Pawel Kamocki |
203 |
Adapting VerbNet to French using existing resources |
Quentin Pradet, Laurence Danlos and Gaël de Chalendar |
205 |
English-French Verb Phrase Alignment in Europarl for Tense Translation Modeling |
Sharid Loaiciga, Thomas Meyer and Andrei Popescu-Belis |
206 |
The evolving infrastructure for language resources and the role for data scientists |
Nelleke Oostdijk and Henk van den Heuvel |
207 |
A New Form of Humor – Mapping Constraint-Based Computational Morphologies to a Finite-State Representation |
Attila Novák |
208 |
Unsupervised Translation Model Adaptation |
Saab Mansour and Hermann Ney |
209 |
SLMotion - An extensible sign language oriented video analysis tool |
Matti Karppa, Ville Viitaniemi, Marcos Luzardo, Jorma Laaksonen and Tommi Jantunen |
210 |
CFT13: A new resource for research into the post-editing process |
Michael Carl, Mercedes García Martínez and Bartolomé Mesa-Lao |
211 |
Mörkum Njálu. An annotated corpus to analyse and explain grammatical divergences between 13th-century manuscripts of Njál's saga. |
Ludger Zeevaert |
212 |
Edda, an Icelandic-Bulgarian Multimodal Parallel Corpus: Building and Application |
Georgi Iliev, Ægir Sverrisson and Zhivko Terziivanov |
214 |
A Crowdsourcing Smartphone Application for Swiss German: Putting Language Documentation in the Hands of the Users |
Jean-Philippe Goldman, Adrian Leeman, Marie-José Kolly, Ingrid Hove, Ibrahim Almajai, Volker Dellwo and Steven Moran |
218 |
ClearTK 2.0: Design Patterns for Machine Learning in UIMA |
Steven Bethard, Philip Ogren and Lee Becker |
219 |
A Conventional Orthography for Tunisian Arabic |
Inès Zribi, Rahma Boujelbane, Abir Masmoudi, Mariem Ellouze, Lamia Belguith and Nizar Habash |
220 |
Creating a massively parallel Bible corpus |
Thomas Mayer and Michael Cysouw |
221 |
Corpus-Based Computation of Reverse Associations |
Reinhard Rapp |
222 |
LexTec – a rich language resource for technical domains in Portuguese |
Palmira Marrafa, Raquel Amaro and Sara Mendes |
225 |
Boosting the creation of a treebank |
Blanca Arias, Nuria Bel, Mercè Lorente, Montserrat Marimón, Alba Milà and Jorge Vivaldi |
227 |
The Dutch LESLLA Corpus |
Eric Sanders, Ineke van de Craats and Vanja de Lint |
228 |
Improvements to Dependency Parsing Using Automatic Simplification of Data |
Tomáš Jelínek |
230 |
Discosuite - A parser test suite for German discontinuous structures |
Wolfgang Maier, Miriam Kaeshammer, Peter Baumann and Sandra Kübler |
231 |
Twitter Irony Detection: Feature Design and Evaluation |
Francesco Barbieri and Horacio Saggion |
233 |
DBpedia Domains: augmenting DBpedia with domain information |
Gregor Titze, Volha Bryl and Simone Paolo Ponzetto |
235 |
Mining a multimodal corpus for non-verbal behavior sequences conveying attitudes |
Mathieu Chollet, Magalie Ochs and Catherine Pelachaud |
236 |
Biomedical entity extraction using machine-learning based approaches |
Cyril Grouin |
239 |
Parsing Heterogeneous Corpora with a Rich Dependency Grammar |
Achim Stein |
240 |
DB-CoM: A database and web interface for corpora multilevel searches |
Heliana Mello, Tommaso Raso, Maryualê Mittmann and Diego Souza |
241 |
A Multi-Cultural Repository of Automatically Discovered Linguistic and Conceptual Metaphors |
Samira Shaikh, Tomek Strzalkowski, Ting Liu, George Aaron Broadwell, Boris Yamrom, Sarah Taylor, Laurie Feldman, Kit Cho, Umit Boz, Ignacio Cases, Yuliya Peshkova and Ching-Sheng Lin |
242 |
First approach toward Semantic Role Labeling for Basque |
Haritz Salaberri, Olatz Arregi and Beñat Zapirain |
247 |
Generating a Lexicon of Errors in Portuguese to Support an Error Identification System for Spanish Native Learners |
Lianet Sepúlveda Torres, Magali Sanches Duran and Sandra Aluísio |
248 |
xLiD-Lexica: Cross-lingual Linked Data Lexica |
Lei Zhang, Michael Färber and Achim Rettinger |
250 |
Annotating Relation Mentions in Tabloid Press |
Hong Li, Sebastian Krause, Feiyu Xu, Hans Uszkoreit, Robert Hummel and Veselina Mironova |
251 |
Efficient Reuse of Structured and Unstructured Resources for Ontology Population |
Chetana Gavankar, Ashish Kulkarni and Ganesh Ramakrishnan |
252 |
Mapping Diachronic and Diatopic Variation in Spoken Czech: The DIALEKT and ORAL2016 Corpora |
Marie Kopřivová, Hana Goláňová, Petra Klimešová and David Lukeš |
253 |
Corpus and Evaluation of Handwriting Recognition of Historical Genealogical Records |
Patrick Schone, Heath Nielson and Mark Ward |
254 |
Reusing Swedish FrameNet for training semantic roles |
Ildikó Pilán and Elena Volodina |
255 |
A Database of Freely Written Texts of German School Students for the Purpose of Automatic Spelling Error Classification |
Kay Berkling, Johanna Fay, Masood Ghayoomi, Katrin Hein and Sebastian Stüker |
257 |
Graph-based Dependency Parsing with Self-Adaptation of Different Factored Models |
Xuezhe Ma, Chunyang Wu and Hai Zhao |
258 |
PACE Corpus: a multilingual corpus of Polarity-annotated textual data from the domains Automotive and CEllphone |
Christian Haenig, Andreas Niekler and Carsten Wuensch |
262 |
Szeged Corpus 2.5: Morphological Modifications in a Manually POS-tagged Hungarian Corpus |
Veronika Vincze, Viktor Varga, Katalin Ilona Simkó, János Zsibrita, Ágoston Nagy, Richárd Farkas and János Csirik |
263 |
Linked Open Data and Web Corpus Data for noun compound bracketing |
Pierre André Ménard and Caroline Barriere |
264 |
A Multimodal Corpora for Silent Speech Interaction |
João Freitas, António Teixeira and Miguel Dias |
267 |
Constructing a Corpus of Japanese Predicate Phrases for Synonym/Antonym Relations |
Tomoko Izumi, Tomohide Shibata, Hisako Asano, Yoshihiro Matsuo and Sadao Kurohashi |
270 |
An Open Discourse Corpus for Chinese with Annotated Explicit Discourse Connectives |
Lanjun Zhou, Binyang Li, Zhongyu Wei and Kam-Fai Wong |
271 |
Extraction of Referent Changing Words for Question Answering |
Kugatsu Sadamitsu, Ryuichiro Higashinaka and Yoshihiro Matsuo |
272 |
MTWatch: A Tool for the Analysis of Noisy Parallel Data |
Sandipan Dandapat and Declan Groves |
274 |
Distributed Distributional Similarities of Google Books over the Centuries |
Martin Riedl, Richard Steuer and Chris Biemann |
275 |
The CLE Urdu POS Tagset |
Saba Urooj, Sarmad Hussain, Asad Mustafa, Rahila Parveen, Farah Adeeba, Tafseer Ahmed and Miriam Butt |
276 |
NoSta-D Named Entity Annotation for German: Guidelines and Dataset |
Darina Benikova, Chris Biemann and Marc Reznicek |
277 |
Phone Boundary Annotation in Conversational Speech |
Yi-Fen Liu, Shu-Chuan Tseng and J.-S. Roger Jang |
278 |
Colloquial Corpus of Japanese Sign Language: A Design of Language Resources for Observing Sign Language Conversation |
Mayumi Bono, Kouhei Kikuchi and Yutaka Osugi |
279 |
Walenty: Towards a comprehensive valence dictionary of Polish |
Adam Przepiórkowski, Filip Skwarski, Elżbieta Hajnicz, Agnieszka Patejuk, Marek Świdziński and Marcin Woliński |
280 |
Computational Narratology: Extracting Tense Clusters from Narrative Texts |
Thomas Bögel, Jannik Strötgen and Michael Gertz |
284 |
Designing the Latvian Speech Recognition Corpus |
Mārcis Pinnis, Ilze Auziņa and Kārlis Goba |
285 |
Aligning parallel texts with InterText |
Pavel Vondřička |
286 |
Corpus for Coreference Resolution on Scientific Papers |
Panot Chaimongkol and Akiko Aizawa |
288 |
From Non Word to New Word: Automatically Identifying Neologisms in French Newspapers |
Ingrid Falk, Delphine Bernhard and Christophe Gérard |
289 |
Evaluating the effects of interactivity in a post-editing workbench |
Michael Carl, Mercedes García Martínez, Bartolomé Mesa-Lao, Nancy Underwood, Vicent Alabau, Jesús González-Rubio, Luis A. Leiva, Daniel Ortíz-Martínez, Germán Sanchis-Trilles and Francisco Casacuberta |
290 |
The Research and Teaching Corpus of Spoken German – FOLK |
Thomas Schmidt |
291 |
Data Mining with Shallow vs. Linguistic Features to Study Diversification of Scientific Registers |
Stefania Degaetano-Ortlieb, Peter Fankhauser, Hannah Kermes, Ekaterina Lapshinova-Koltunski, Noam Ordan and Elke Teich |
292 |
On Stopwords, Filtering and Data Sparsity for Sentiment Analysis of Twitter |
Hassan Saif, Miriam Fernandez, Yulan He and Harith Alani |
293 |
Adapting Freely Available Resources to Build an Opinion Mining Pipeline in Portuguese |
Patrik Lambert and Carlos Rodriguez-Penagos |
294 |
The SYN-series corpora of written Czech |
Milena Hnátková, Michal Křen, Pavel Procházka and Hana Skoumalová |
298 |
ParCor: A Parallel Pronoun-Coreference Corpus to Support Statistical MT |
Liane Guillou, Christian Hardmeier, Aaron Smith, Jörg Tiedemann and Bonnie Webber |
299 |
A Benchmark Database of Phonetic Alignments in Historical Linguistics and Dialectology |
Johann-Mattis List and Jelena Prokić |
300 |
Corpus of 19th-century Czech Texts: Problems and Solutions |
Karel Kučera and Martin Stluka |
302 |
Investigating the Image of Entities in Social Media: Dataset Design and First Results |
Julien Velcin, Young-Min Kim, Caroline Brun, Jean-Yves Dormagen, Eric SanJuan, Leila Khouas, Anne Peradotto, Stéphane Bonnevay, Claude Roux, Julien Boyadjian and Alejandro Molina |
303 |
The Norwegian Dependency Treebank |
Per Erik Solberg, Arne Skjærholt, Lilja Øvrelid, Kristin Hagen and Janne Bondi Johannessen |
308 |
Extending standoff annotation |
Maik Stührenberg |
309 |
An efficient language independent toolkit for complete morphological disambiguation |
László Laki and György Orosz |
311 |
A Corpus of Spontaneous Speech in Lectures: The KIT Lecture Corpus for Spoken Language Processing and Translation |
Eunah Cho, Sarah Fünfer, Sebastian Stüker and Alex Waibel |
312 |
Free Acoustic and Language Models for Large Vocabulary Continuous Speech Recognition in Swedish |
Niklas Vanhainen and Giampiero Salvi |
316 |
Turkish Resources for Visual Word Recognition |
Begum Erten, Cem Bozsahin and Deniz Zeyrek |
317 |
An Arabic Twitter Corpus for Subjectivity and Sentiment Analysis |
Eshrag Refaee and Verena Rieser |
318 |
The IMAGACT Visual Ontology. An Extendable Multilingual Infrastructure for the representation of lexical encoding of Action |
Massimo Moneglia, Susan Brown, Francesca Frontini, Gloria Gagliardi, Fahad Khan, Monica Monachini and Alessandro Panunzi |
319 |
Collaboration in the Production of a Massively Multilingual Lexicon |
Martin Benjamin |
320 |
Bridging the gap between speech technology and natural language processing: an evaluation method for spoken term discovery |
Bogdan Ludusan, Maarten Versteegh, Aren Jansen, Guillaume Gravier, Mark Johnson and Emmanuel Dupoux |
321 |
Modeling and evaluating dialog success in the LAST MINUTE corpus |
Dietmar Rösner, Rafael Friesen, Stephan Günther and Rico Andrich |
322 |
Emotions Are A Personal Thing: Towards Speaker-adaptive Emotion Recognition |
Maxim Sidorov, Stefan Ultes and Alexander Schmitt |
323 |
The pragmatic annotation of a corpus of academic lectures |
Sian Alsop and Hilary Nesi |
325 |
Using TEI, CMDI and ISOcat in CLARIN-DK |
Dorte Haltrup Hansen, Lene Offersgaard and Sussi Olsen |
327 |
Comparative analysis of verbal alignment in human-human and human-agent interactions |
Sabrina Campano, Jessica Durand and Chloé Clavel |
328 |
Transliteration and alignment of parallel texts from Cyrillic to Latin |
Petic Mircea and Daniela Gîfu |
329 |
How to tell a Schneemann from a Milchmann: An Annotation Scheme for Compound-Internal Relations |
Corina Dima, Verena Henrich, Erhard Hinrichs and Christina Hoppermann |
330 |
Can Numerical Expressions Be Simpler? Implementation and Demonstration of a Numerical Simplification System for Spanish |
Susana Bautista and Horacio Saggion |
331 |
4FX: Light Verb Constructions in a Multilingual Parallel Corpus |
Anita Rácz, István Nagy T. and Veronika Vincze |
332 |
The e-Identity Exploration Workbench |
Fritz Kliche, Andre Blessing, Jonathan Sonntag and Uli Heid |
334 |
Bodily Emotional Actions Behavior Corpus |
Nesrine Fourati and Catherine Pelachaud |
335 |
Using Stem-Templates to improve Arabic POS and Gender/Number Tagging |
Kareem Darwish |
337 |
Construction of Diachronic Ontologies from People's Daily of Fifty Years |
Shaoda He, Xiaojun Zou, Liumingjing Xiao and Junfeng Hu |
338 |
ROOTS: a toolkit for easy, fast and consistent processing of large sequential annotated data collections |
Jonathan Chevelu, Gwénolé Lecorvé and Damien Lolive |
339 |
Computer-Aided Quality Assurance of an Icelandic Pronunciation Dictionary |
Martin Jansche |
340 |
Automatic Error Detection concerning the Definite and Indefinite Conjugation in the HunLearner Corpus |
Veronika Vincze, János Zsibrita, Péter Durst and Martina Katalin Szabó |
341 |
Speech-Based Emotion Recognition: Feature Selection by Self-Adapted Multi-Criteria Genetic Algorithm |
Maxim Sidorov, Kristina Brester, Wolfgang Minker and Eugene Semenkin |
345 |
Constructing and exploiting an automatically annotated resource of legislative texts |
Stefan Höfler and Kyoko Sugisaki |
346 |
GenitivDB – a Corpus-Generated Database for German Genitive Classification |
Roman Schneider |
349 |
Resources in Conflict: A Bilingual Valency Lexicon vs. a Bilingual Treebank vs. a Linguistic Theory |
Jana Sindlerova, Zdenka Uresova and Eva Fucikova |
350 |
A Unifying Opinion Annotation Framework across Genres and in Multiple Languages: The NewSoMe Corpus |
Roser Saurí, Judith Domingo and Toni Badia |
353 |
Buy one get one free: Distant annotation of Chinese tense, event type and modality |
Nianwen Xue and Yuchen Zhang |
354 |
Multimodal dialogue segmentation with gesture post-processing |
Kodai Takahashi and Masashi Inoue |
356 |
The Dangerous Myth of the Star System |
André Bittar, Luca Dini, Sigrid Maurel and Mathieu Ruhlmann |
358 |
Comparison of the Impact of Word Segmentation on Name Tagging for Chinese and Japanese |
and Heng Ji |
360 |
CoRoLa – The Reference Corpus of Contemporary Romanian Language |
Verginica Barbu Mititelu, Elena Irimia and Dan Tufiș |
361 |
Building a reference corpus for countability in English |
Tibor Kiss, Francis Jeffry Pelletier and Tobias Stadtfeld |
362 |
The LIMA Multilingual Analyzer Made Free: FLOSS Resources Adaptation and Correction |
Gaël de Chalendar |
363 |
A SICK cure for the evaluation of compositional distributional semantic models |
Marco Marelli, Stefano Menini, Marco Baroni, Luisa Bentivogli, Raffaella Bernardi and Roberto Zamparelli |
364 |
Twente Debate Corpus – A Multimodal Corpus for Head Movement Analysis |
Bayu Rahayudi, Ronald Poppe and Dirk Heylen |
365 |
The EASR Corpora of European Portuguese, French, Hungarian and Polish Elderly Speech |
Annika Hämäläinen, Jairo Avelar, Silvia Rodrigues, Miguel Sales Dias, Artur Kolesiński, Tibor Fegyó, Géza Németh, Petra Csobánka, Karine Lan and David Hewson |
368 |
Sharing Cultural Heritage: the Clavius on the Web Project |
Matteo Abrate, Angelo Mario Del Grosso, Emiliano Giovannetti, Angelica Lo Duca, Damiana Luzzi, Lorenzo Mancini, Andrea Marchetti, Irene Pedretti and Silvia Piccini |
370 |
3D Face Tracking and Multi-Scale, Spatio-temporal Analysis of Linguistically Significant Facial Expressions and Head Positions in ASL |
Bo Liu, Jingjing Liu, Dimitris Metaxas and Carol Neidle |
371 |
Exploring factors that contribute to successful fingerspelling comprehension |
Leah Geer and Jonathan Keane |
372 |
The DARE Corpus: A Resource for Anaphora Resolution in Dialogue Based Intelligent Tutoring Systems |
Nobal Niraula, Vasile Rus, Rajendra Banjade, Dan Stefanescu, William Baggett and Brent Morgan |
373 |
On the annotation of TMX translation memories for advanced leveraging in computer-aided translation |
Mikel Forcada |
374 |
The D-ANS corpus: the Dublin-Autonomous Nervous System corpus of biosignal and multimodal recordings of conversational speech |
Shannon Hennig, Ryad Chellali and Nick Campbell |
375 |
Annotating the MASC Corpus with BabelNet |
Andrea Moro, Roberto Navigli, Francesco Maria Tucci and Rebecca J. Passonneau |
376 |
All Fragments Count in Parser Evaluation |
Joost Bastings |
377 |
TexAFon 2.0: A text processing tool for the generation of expressive speech in TTS applications |
Juan-María Garrido, Yesika Laplaza, Benjamin Kolz and Miquel Cornudella |
378 |
A Persian Treebank with Stanford Typed Dependencies |
Mojgan Seraji, Carina Jahani, Beáta Megyesi and Joakim Nivre |
379 |
A Language-independent Approach to Extracting Derivational Relations from an Inflectional Lexicon |
Marion Baranes and Benoît Sagot |
380 |
Named Entity Recognition on Turkish Tweets |
Dilek Kucuk, Guillaume Jacquet and Ralf Steinberger |
381 |
Rhapsodie: a Prosodic-Syntactic Treebank for Spoken French |
Anne Lacheret, Julie Beliao, Anne Dister, Kim Gerdes, Jean-Philippe Goldman, Sylvain Kahane, Nicolas Obin and Atanas Tchobanov |
382 |
The IULA Spanish LSP Treebank |
Montserrat Marimon, Núria Bel, Beatriz Fisas, Blanca Arias, Silvia Vázquez, Jorge Vivaldi, Carlos Morell and Mercè Lorente |
383 |
Morpho-Syntactic Study of Errors from Speech Recognition System |
Maria Goryainova, Cyril Grouin, Sophie Rosset and Ioana Vasilescu |
384 |
Not an Interlingua, But Close: Comparison of English AMRs to Chinese and Czech |
Nianwen Xue, Ondrej Bojar, Jan Hajic, Martha Palmer, Zdenka Uresova and Xiuhong Zhang |
385 |
Annotating Relations in Scientific Articles |
Adam Meyers, Giancarlo Lee, Angus Grieve-Smith, Yifan He and Harriet Taber |
386 |
Annotating Clinical Events in Text Snippets for Phenotype Detection |
Prescott Klassen, Fei Xia, Lucy Vanderwende and Meliha Yetisgen |
387 |
Phoneme Similarity Matrices to Improve Long Audio Alignment for Automatic Subtitling |
Pablo Ruiz, Aitor Álvarez and Haritz Arzelus |
389 |
Use of unsupervised word classes for entity recognition: Application to the detection of disorders in clinical reports |
Maria Evangelia Chatzimina, Cyril Grouin and Pierre Zweigenbaum |
390 |
On Complex Word Alignment Configurations |
Miriam Kaeshammer and Anika Westburg |
391 |
HFST-SweNER – A New NER Resource for Swedish |
Dimitrios Kokkinakis, Jyrki Niemi, Sam Hardwick, Krister Lindén and Lars Borin |
392 |
Building and Modelling Multilingual Subjective Corpora |
Motaz Saad, David Langlois and Kamel Smaili |
394 |
GRASS: the Graz corpus of Read And Spontaneous Speech |
Barbara Schuppler, Martin Hagmueller, Juan Andres Morales-Cordovilla and Hannes Pessentheiner |
396 |
Linguistic resources and cats: how to use ISOcat, RELcat and SCHEMAcat |
Menzo Windhouwer and Ineke Schuurman |
397 |
Bring vs. MTRoget: Evaluating automatic thesaurus translation |
Lars Borin, Jens Allwood and Gerard de Melo |
398 |
Introducing a Framework for the Evaluation of Music Detection Tools |
Paula Lopez-Otero, Laura Docio-Fernandez and Carmen Garcia-Mateo |
400 |
Cross-linguistic annotation of narrativity for English/French verb tense disambiguation |
Cristina Grisot and Thomas Meyer |
401 |
The taraXÜ corpus of human-annotated machine translations |
Eleftherios Avramidis, Aljoscha Burchardt, Sabine Hunsicker, Maja Popović, Cindy Tscherwinka, David Vilar and Hans Uszkoreit |
402 |
Detecting Document Structure in a Very Large Corpus of UK Financial Reports |
Mahmoud El-Haj, Paul Rayson, Steve Young and Martin Walker |
403 |
Latent Semantic Analysis Models on Wikipedia and TASA |
Dan Stefanescu, Rajendra Banjade and Vasile Rus |
405 |
The Strategic Impact of META-NET on the Regional, National and International Level |
Georg Rehm, Hans Uszkoreit, Sophia Ananiadou, Núria Bel, Audronė Bielevičienė, Lars Borin, António Branco, Gerhard Budin, Nicoletta Calzolari, Walter Daelemans, Radovan Garabík, Carmen Garcia-Mateo, Josef van Genabith, Jan Hajic, Inma Hernaez, John Judge, Svetla Koeva, Simon Krek, Cvetana Krstev and Krister Linden |
409 |
The Weltmodell: A Data-Driven Commonsense Knowledge Base |
Alan Akbik and Thilo Michael |
410 |
CLARA: A New Generation of Researchers in Common Language Resources and Their Applications |
Koenraad De Smedt, Erhard Hinrichs, Detmar Meurers, Inguna Skadina, Bolette Pedersen, Costanza Navarretta, Núria Bel, Krister Linden, Marketa Lopatkova, Jan Hajic, Gisle Andersen and Przemyslaw Lenkiewicz |
413 |
A Large Opinion Corpus in Portuguese - Tackling Out-Of-Vocabulary Words |
Nathan Hartmann, Lucas Avanço, Pedro Balage, Magali Duran, Maria das Graças Volpe Nunes, Thiago Pardo and Sandra Aluísio |
414 |
Shata-Anuvadak: Tackling Multiway Translation of Indian Languages |
Anoop Kunchukuttan, Abhijit Mishra, Rajen Chatterjee, Ritesh Shah and Pushpak Bhattacharyya |
415 |
The CLARIN Research Infrastructure: Resources and Tools for eHumanities Scholars |
Erhard Hinrichs and Steven Krauwer |
416 |
Sprinter: Language Technologies for Interactive and Multimedia Language Learning |
Renlong Ai, Marcela Charfuelan, Walter Kasper, Tina Klüwer, Hans Uszkoreit, Feiyu Xu, Sandra Gasber and Philip Gienandt |
417 |
A Constraint Approach to Pivot-based Bilingual Dictionary Induction |
Wushouer Mairidan, Toru Ishida, Donghui Lin and Katsutoshi Hirayama |
419 |
Two Approaches to Metaphor Detection |
Brian MacWhinney and Davida Fromm |
421 |
Crowdsourcing and annotating NER for Twitter #drift |
and Anders Søgaard |
422 |
Language Resources and Annotation Tools for Cross-Sentence Relation Extraction |
Sebastian Krause, Hong Li, Feiyu Xu, Hans Uszkoreit, Robert Hummel and Luise Spielhagen |
424 |
Evaluating corpora documentation with regards to the Ethics and Big Data Charter |
Alain Couillault, Karën Fort, Gilles Adda and Hugues de Mazancourt |
425 |
Bootstrapping Term Extractors for Multiple Languages |
Ahmet Aker, Monica Paramita and Robert Gaizauskas |
426 |
Evaluation of Automatic Hypernym Extraction from Technical Corpora in English and Dutch |
Els Lefever, Marjan Van de Kauter and Veronique Hoste |
427 |
Measuring Readability of Polish Texts: Baseline Experiments |
Bartosz Broda, Bartłomiej Nitoń, Włodzimierz Gruszczyński and Maciej Ogrodniczuk |
429 |
Introducing a web application for labeling, visualizing speech and correcting derived speech signals |
Raphael Winkelmann and Georg Raess |
430 |
Development of a TV Broadcasts Speech Recognition System for Qatari Arabic |
Mohamed Elmahdy, Mark Hasegawa-Johnson and Eiman Mustafawi |
431 |
Can Crowdsourcing be used for Effective Annotation of Arabic? |
Wajdi Zaghouani |
432 |
Design and development of RDB version of Corpus of Spontaneous Japanese |
Hanae Koiso, Yasuharu Den, Ken'ya Nishikawa and Kikuo Maekawa |
434 |
Automatic Long Audio Alignment and Confidence Scoring for Conversational Arabic Speech |
Mohamed Elmahdy, Mark Hasegawa-Johnson and Eiman Mustafawi |
435 |
Vocabulary-Based Language Similarity using Web Corpora |
Dirk Goldhahn and Uwe Quasthoff |
436 |
NewsReader: recording history from daily news streams |
Piek Vossen, German Rigau, Luciano Serafini, Pim Stouten, Francis Irving and Willem Van Hage |
437 |
A set of open source tools for Turkish natural language processing |
Çağrı Çöltekin |
438 |
The Gulf of Guinea creole corpora |
Tjerk Hagemeijer, Michel Généreux, Iris Hendrickx, Amália Mendes, Abigail Tiny and Armando Zamora |
440 |
S-pot - a benchmark in spotting signs within continuous signing |
Ville Viitaniemi, Tommi Jantunen, Leena Savolainen, Matti Karppa and Jorma Laaksonen |
441 |
Converting an HPSG Treebank into its Parallel Dependency Treebank |
Masood Ghayoomi and Jonas Kuhn |
442 |
TweetNorm_es: an annotated corpus for Spanish microtext normalization |
Iñaki Alegria, Nora Aranberri, Pere Comas, Victor Fresno, Pablo Gamallo, Lluís Padró, Iñaki San Vicente, Jordi Turmo and Arkaitz Zubiaga |
443 |
The American Sign Language STEM Vocabulary Learning Tool |
Jon Henner, Jeanne Reis, Jonathan Laurence and Robert Hoffmeister |
444 |
The Procedure of Lexico-Semantic Annotation of Składnica Treebank |
Elżbieta Hajnicz |
445 |
MASS - Modal Annotation in Spontaneous Speech: semantic annotation scheme for modality in a spontaneous speech Brazilian Portuguese corpus |
Luciana Avila and Heliana Mello |
449 |
Adding the highly inflected agglutinative language Hungarian to a multilingual set of text mining tools |
Júlia Pajzs, Ralf Steinberger, Maud Ehrmann, Mohamed Ebrahim, Leonida Della Rocca, Stefano Bucci, Eszter Simon and Tamás Váradi |
452 |
Legal aspects of text mining |
Maarten Truyens and Patrick Van Eecke |
453 |
Treelet Probabilities for HPSG Parsing and Error Correction |
Angelina Ivanova and Gertjan van Noord |
454 |
The Tunisian Dialect phonetic dictionary for speech recognition |
Abir Masmoudi, Mariem Ellouze Khmekhem, Yannick Esteve, Lamia Hadrich Belguith and Nizar Habash |
455 |
Discovering frames in specialized domains |
Marie-Claude L'Homme, Benoît Robichaud and Carlos Subirats Rüggeberg |
457 |
Resources for the Detection of Conventionalized Metaphors in Four Languages |
Lori Levin, Teruko Mitamura, Brian MacWhinney, Davida Fromm, Jaime Carbonell, Weston Feely, Robert Frederking, Anatole Gershman and Carlos Ramirez |
459 |
CLARIN-NL: Major results |
Jan Odijk |
461 |
Annotation of Computer Science Papers for Semantic Relation Extraction |
Yuka Tateisi, Yo Shidahara, Yusuke Miyao and Akiko Aizawa |
462 |
Identifying Idioms in Translated Chinese |
Wan Yu Ho, Christine Kng, Shan Wang and Francis Bond |
463 |
Machine Translation for Subtitling: A Large-Scale Evaluation |
Thierry Etchegoyhen, Lindsay Bywood, Panayota Georgakopoulou, Mirjam Sepesy Maucec, Arantza del Pozo, Mark Fishel, Martin Volk, Jie Jiang, Gerard van Loenhout and Anja Turner |
465 |
T-PAS; A resource of Typed Predicate Argument Structures for linguistic analysis and semantic processing |
Elisabetta Jezek, Bernardo Magnini, Anna Feltracco, Alessia Bianchini and Octavian Popescu |
466 |
NARROWING THE GAP BETWEEN TERMBASES AND CORPORA IN COMMERCIAL ENVIRONMENTS |
Kara Warburton |
467 |
Author-Specific Sentiment Aggregation for Rating Prediction of Reviews |
Subhabrata Mukherjee and Sachindra Joshi |
468 |
Clustering of Multi-Word Named Entity variants: Multilingual Evaluation |
Guillaume Jacquet, Maud Ehrmann and Ralf Steinberger |
470 |
Designing and evaluating a reliable corpus of web genres |
Noushin Rezapour Asheghi, Serge Sharoff and Katja Markert |
471 |
Crowdsourcing as a preprocessing for complex semantic annotation tasks |
Héctor Martínez Alonso and Lauren Romeo |
473 |
Automatic Annotation of Machine Translation Datasets with Binary Quality Judgements |
Marco Turchi and Matteo Negri |
475 |
Semantic clustering of pivot paraphrases |
Marianna Apidianaki, Emilia Verzeni and Diana McCarthy |
476 |
When POS data sets don’t add up: Combatting sample bias |
Dirk Hovy, Barbara Plank and Anders Søgaard |
479 |
Out in the Open: Finding and Categorising Errors in the Lexical Simplification Pipeline |
Matthew Shardlow |
480 |
Learning from Domain Complexity |
Robert Remus and Dominique Ziegelmayer |
483 |
Benchmarking Twitter Sentiment Analysis Tools |
Ahmed Abbasi, Ammar Hassan and Milan Dhar |
484 |
Designing a bilingual speech corpus for French and German language learners: a two-step process |
Camille Fauth, Anne Bonneau, Frank Zimmerer, Juergen Trouvain, Bistra Andreeva, Vincent Colotte, Dominique Fohr, Denis Jouvet, Jeanin Jügler, Yves Laprie, Odile Mella and Bernd Möbius |
486 |
Creative language explorations through a high-expressivity N-grams query language |
Carlo Strapparava, Lorenzo Gatti, Marco Guerini and Oliviero Stock |
489 |
@iuvaris - A workbench for Latin |
Steffen Eger and Ineta Sejane |
492 |
Using Word Familiarities and Word Associations to Measure Corpus Representativeness |
Reinhard Rapp |
494 |
Deep Syntax Annotation of the Sequoia French Treebank |
Marie Candito, Guy Perrier, Bruno Guillaume, Corentin Ribeyre, Karën Fort, Djamé Seddah and Eric de la Clergerie |
496 |
Developing a French FrameNet: Methodology and First results |
Marie Candito, Pascal Amsili, Lucie Barque, Farah Benamara, Gaël de Chalendar, Marianne Djemaa, Pauline Haas, Richard Huyghe, Yvette Yannick Mathieu, Philippe Muller, Benoît Sagot and Laure Vieu |
497 |
Corpus Annotation through Crowdsourcing: Towards Best Practice Guidelines |
Marta Sabou, Kalina Bontcheva, Leon Derczynski and Arno Scharl |
498 |
Locating Requests among Open Source Software Communication Messages |
Ioannis Korkontzelos and Sophia Ananiadou |
500 |
Harmonizing German Language Resources for Sentiment Analysis |
Thierry Declerck and Hans-Ulrich Krieger |
501 |
Word-Formation Network for Czech |
Magda Sevcikova and Zdenek Zabokrtsky |
502 |
An Analysis of Older Users' Interactions with Spoken Dialogue Systems |
Jamie Bost and Johanna Moore |
504 |
Innovations in Parallel Corpus Search Tools |
Martin Volk, Johannes Graën and Elena Callegaro |
506 |
Machine Translationness: Machine-likeness in Machine Translation Evaluation |
Joaquim Moré and Salvador Climent |
507 |
Towards an environment for the production and the validation of lexical semantic resources |
Mikaël Morardo and Eric De La Clergerie |
508 |
The Distress Analysis Interview Corpus of human and computer interviews |
Jonathan Gratch, Ron Artstein, Gale Lucas, Stefan Scherer, Angela Nazarian, Rachel Wood, Jill Boberg, David DeVault, Stacy Marsella, David Traum, Albert "Skip" Rizzo and Louis-Philippe Morency |
510 |
SWIFT Aligner, A Multifunctional Tool for Parallel Corpora: Visualization, Word Alignment, and (Morpho)-Syntactic Cross-Language Transfer |
Timur Gilmanov, Olga Scrivner and Sandra Kübler |
511 |
Semi-automatic annotation of the UCU accents speech corpus |
Rosemary Orr, Marijn Huijbregts, Roeland van Beek, Lisa Teunissen, Kate Backhouse and David van Leeuwen |
513 |
Comparative Analysis of Portuguese Named Entities Recognition Tools |
Daniela Amaral, Evandro Fonseca, Lucelene Lopes and Renata Vieira |
514 |
A corpus of European Portuguese child and child-directed speech |
Ana Lúcia Santos, Michel Généreux, Aida Cardoso, Celina Agostinho and Silvana Abalada |
515 |
Using C5.0 for Boosting Frame-Semantic Parsing Accuracy |
Guntis Barzdins, Didzis Gosko, Laura Rituma and Peteris Paikens |
517 |
‘interHist’ – an interactive visual interface for corpus exploration |
Verena Lyding, Lionel Nicolas and Egon Stemle |
518 |
Identification of Multiword Expressions in the brWaC |
Rodrigo Boos, Kassius Prestes and Aline Villavicencio |
519 |
Collocation or Free Combination? – Applying Machine Translation Techniques to identify collocations in Japanese |
Lis Pereira, Elga Strafella and Yuji Matsumoto |
520 |
AusTalk: an audio-visual corpus of Australian English |
Dominique Estival, Steve Cassidy, Felicity Cox and Denis Burnham |
521 |
Comprehensive Annotation of Multiword Expressions in a Social Web Corpus |
Nathan Schneider, Spencer Onuffer, Nora Kazour, Emily Danchik, Michael Mordowanec, Henrietta Conrad and Noah A. Smith |
522 |
Automatic semantic relation extraction from Portuguese texts |
Leonardo Sameshima Taba and Helena Caseli |
523 |
A Multidialectal Parallel Corpus of Arabic |
Houda Bouamor, Nizar Habash and Kemal Oflazer |
525 |
Transfer learning of feedback head expressions in comparable multimodal Danish and Polish corpora |
Costanza Navarretta and Magdalena Lis |
529 |
Comparing two acquisition systems for automatically building an English–Croatian parallel corpus from multilingual websites |
Miquel Esplà-Gomis, Filip Klubička, Nikola Ljubešić, Sergio Ortiz-Rojas, Vassilis Papavassiliou and Prokopis Prokopidis |
530 |
Annotation of temporal structure in Estonian texts |
Siim Orasmaa |
531 |
HuRIC: a Human Robot Interaction Corpus |
Emanuele Bastianelli, Giuseppe Castellucci, Danilo Croce, Roberto Basili and Daniele Nardi |
532 |
On the origin of errors: A fine-grained analysis of MT and PE errors and their relationship |
Joke Daems, Lieve Macken and Sonia Vandepitte |
533 |
The WaveSurfer Automatic Speech Recognition Plugin |
Giampiero Salvi and Niklas Vanhainen |
535 |
Vystadial 2013: Free English and Czech telephone speech corpus |
Matěj Korvas, Ondřej Plátek, Ondřej Dušek, Lukáš Žilka and Filip Jurčíček |
536 |
Creating and Automatically Expanding a Gold Standard for Chemistry-Disease Relations in Patent Texts |
Antje Schlaf, Claudia Bobach and Matthias Irmer |
537 |
The SSPNet-Mobile Corpus: Social Signal Processing over mobile phones. |
Anna Polychroniou, Hugues Salamin and Alessandro Vinciarelli |
538 |
Hybrid Annotation of a Polish Dependency Treebank |
Alina Wróblewska and Adam Przepiórkowski |
541 |
Bootstrapping an Italian VerbNet: data-driven analysis of verbal alternations |
Veronica Viola, Gianluca Lebani and Alessandro Lenci |
543 |
Self-training a Constituency Parser using n-gram Trees |
Arda Celebi and Arzucan Özgür |
544 |
A Tagged Corpus and a Tagger for Urdu |
Bushra Jawaid, Amir Kamran and Ondrej Bojar |
545 |
Lexical Substitution Dataset for German |
Kostadin Cholakov, Chris Biemann, Judith Eckle-Kohler and Iryna Gurevych |
546 |
A cascade approach for complex-type classification |
Lauren Romeo, Sara Mendes and Núria Bel |
549 |
Recognition of Products and Brandnames. Application to the Cosmetic domain. |
Cédric Lopez, Frédérique Segond and Olivier Hondermarck |
552 |
Annotation of specialized corpora using a comprehensive entity and relation scheme |
Louise Deleger, Anne-Laure Ligozat, Cyril Grouin, Pierre Zweigenbaum and Aurelie Neveol |
553 |
Annotation Pro + TGA: automation of speech timing analysis |
Katarzyna Klessa and Dafydd Gibbon |
556 |
Polysemy index for nouns: an experiment on Italian using the PAROLE SIMPLE CLIPS lexical database |
Francesca Frontini, Valeria Quochi, Sebastian Padó, Monica Monachini and Jason Utt |
558 |
YouDACC: the Youtube Dialectal Arabic Comment Corpus |
Ahmed Salama, Houda Bouamor, Behrang Mohit and Kemal Oflazer |
562 |
Towards an Encyclopedia of Compositional Semantics: Documenting the Interface of the English Resource Grammar |
Dan Flickinger, Emily M. Bender and Stephan Oepen |
563 |
ENRICHING THE “SENSO COMUNE” PLATFORM WITH AUTOMATICALLY ACQUIRED DATA. |
Tommaso Caselli, Laure Vieu, Carlo Strapparava and Guido Vetere |
564 |
Online experiments with the percy software framework - experiences and some early results |
Christoph Draxler |
567 |
Improving the exploitation of linguistic annotations in ELAN |
Onno Crasborn and Han Sloetjes |
568 |
A deep context grammatical model for authorship attribution |
Simon Fuller, Phil Maguire and Philippe Moser |
569 |
Manual Analysis of Structurally Informed Reordering in German-English Machine Translation |
Teresa Herrmann, Jan Niehues and Alex Waibel |
570 |
Presenting a system of man-machine interaction for performing map tasks. |
Gabriele Pallotti, Francesca Frontini, Fabio Affè, Monica Monachini and Stefania Ferrari |
574 |
Automatic Extraction of Synonyms for German Particle Verbs from Parallel Data with Distributional Similarity as a Reranking Feature |
Moritz Wittmann, Marion Weller and Sabine Schulte im Walde |
575 |
NASTIA: Negotiating Appointment Setting Interface |
Layla El Asri, Rémi Lemonnier, Romain Laroche, Olivier Pietquin and Hatim Khouzaimi |
576 |
DINASTI: Dialogues with a Negotiating Appointment Setting Interface |
Layla El Asri, Romain Laroche and Olivier Pietquin |
577 |
An Open Educational Resource: The SOURCe Project |
Fryni Kakoyianni-Doa and Eleni Tziafa |
578 |
LQVSumm: A corpus of linguistic quality violations in multi-document summarization |
Annemarie Friedrich, Marina Valeeva and Alexis Palmer |
579 |
Potsdam Commentary Corpus 2.0: Annotation for Discourse Research |
Manfred Stede and Arne Neumann |
582 |
DENSE COMPONENTS IN THE STRUCTURE OF WORDNET |
Ahti Lohk, Kaarel Allik, Heili Orav and Leo Võhandu |
583 |
Choosing which to use? A study of distributional models for nominal lexical semantic classification |
Lauren Romeo, Gianluca Lebani, Núria Bel and Alessandro Lenci |
585 |
Extensions of the Sign Language Recognition and Translation Corpus RWTH-PHOENIX-Weather |
Jens Forster, Christoph Schmidt, Oscar Koller, Martin Bellgardt and Hermann Ney |
587 |
HESITA(te) in Portuguese |
Sara Candeias, Dirce Celorico, Jorge Proença, Arlindo Veiga, Carla Lopes and Fernando Perdigão |
588 |
MUHIT: A Multilingual Harmonized Dictionary |
Sameh Alansary |
589 |
Predicate Matrix: extending SemLink through WordNet mappings |
German Rigau, Maddalen Lopez de Lacalle and Egoitz Laparra |
590 |
T2K: a Tool for Automatically Extracting Domain-Specific Information from Texts |
Felice Dell'Orletta, Giulia Venturi, Andrea Cimino and Simonetta Montemagni |
591 |
EMOVO CORPUS: AN ITALIAN EMOTIONAL SPEECH DATABASE |
Giovanni Costantini, Iacopo Iaderola, Andrea Paoloni and Massimiliano Todisco |
593 |
MADAMIRA: A Fast, Comprehensive Tool for Morphological Analysis and Disambiguation of Arabic |
Ryan Roth, Mohamed Al-Badrashiny, Ahmed El Kholy, Ramy Eskander, Mona Diab, Nizar Habash, Arfath Pasha, Manoj Pooleery and Owen Rambow |
594 |
Developing Politeness Annotated Corpus of Hindi Blogs |
Ritesh Kumar |
596 |
The CMU METAL Farsi NLP Approach |
Weston Feely, Mehdi Manshadi, Robert Frederking and Lori Levin |
597 |
Recognising suicidal messages in Dutch social media |
Bart Desmet and Véronique Hoste |
599 |
A Rank-based Distance Measure to Detect Polysemy and to Determine Salient Vector-Space Features for German Prepositions |
Maximilian Köper and Sabine Schulte im Walde |
602 |
Mapping the Lexique des Verbes du Français (Lexicon of French Verbs) to a NLP lexicon using examples |
Bruno Guillaume, Karën Fort, Guy Perrier and Paul Bédaride |
604 |
Language Resources for French in the Biomedical Domain |
Aurelie Neveol and Pierre Zweigenbaum |
605 |
MOBILE TURK: PORTABLE SPEECH HARVESTING VIA MOBILE CROWD SOURCING |
Jingjing Liu, Jeff Chan, David Gaddy, Ian McGraw, Scott Cyphers and Jim Glass |
606 |
MERLIN: A trilingual learner corpus |
Adriane Boyd, Jirka Hana, Lionel Nicolas, Detmar Meurers, Andrea Abel, Karin Schöne, Barbora Štindlová, Chiara Vettori and Katrin Wisniewski |
607 |
Computer-aided morphology expansion for Old Swedish |
Yvonne Adesam, Malin Ahlberg, Peter Andersson, Gerlof Bouma, Markus Forsberg and Mans Hulden |
610 |
Two-Step Machine Translation with Lattices |
Bushra Jawaid and Ondrej Bojar |
611 |
The Munich BioVoice Corpus: Effects of Physical Exercising, Heart Rate, and Skin Conductance on Human Speech Production |
Björn Schuller, Felix Friedmann and Florian Eyben |
612 |
DysList: An Annotated Resource of Dyslexic Errors |
Luz Rello, Ricardo Baeza-Yates and Joaquim Llisterri |
616 |
Estimation of Speaking Style in Speech Corpora Focusing on speech transcriptions |
Raymond SHEN and Hideaki KIKUCHI |
617 |
Evaluation of different strategies for domain adaptation in opinion mining |
Anne Garcia-Fernandez, Olivier Ferret and Marco Dinarelli |
618 |
Clinical Data-Driven Probabilistic Graph Processing |
Travis Goodwin and Sanda Harabagiu |
619 |
Comparing Similarity Measures for Distributional Thesauri |
Muntsa Padró, Marco Idiart, Aline Villavicencio and Carlos Ramisch |
621 |
AraNLP: a Java-based Library for the Processing of Arabic Text. |
Maha Althobaiti, Udo Kruschwitz and Massimo Poesio |
623 |
Identifying a Topic Mention in Everyday Conversations For a Clinical Application |
Golnar Sheikhshab and Izhak Shafran |
624 |
Criteria for Identifying and Annotating Caused Motion Constructions in Corpus Data |
Jena D. Hwang, Annie Zaenen and Martha Palmer |
627 |
Web-imageability of the Behavioral Features of Basic-level Concepts |
Yoshihiko Hayashi |
628 |
The Human Communication Science Virtual Laboratory: A Web Based Repository API |
Steve Cassidy, Dominique Estival, Timothy Jones, Denis Burnham and Jared Burghold |
630 |
Building a Terminological Resource for Crisis Management: The Case of Floods and Protests |
Irina Temnikova, Andrea Varga and Dogan Biyikli |
632 |
ILLINOISCLOUDNLP: Text Analytics Services in the Cloud |
Hao Wu, Zhiye Fei, Aaron Dai, Mark Sammons and Dan Roth |
633 |
Text Readability and Word Distribution in Japanese |
Satoshi Sato |
634 |
The Use of a FileMaker Pro Database in Evaluating Sign Language Notation Systems |
Julie Hochgesang |
636 |
Mapping CPA onto OntoNotes Senses |
Octavian Popescu, Martha Palmer and Patrick Hanks |
639 |
Language CoLLAGE: Grammatical Description with the LinGO Grammar Matrix |
Emily M. Bender |
640 |
Applying CNL rules in text alternatives authoring and evaluation for images in the web |
Silvia Rodríguez Vázquez, Pierrette Bouillon and Anton Bolfing |
641 |
Collection of Dialectal Arabic Data From Social Media |
Chris Callison-Burch |
643 |
Reconstructing the Semantic Landscape of Natural Language Processing |
Elisa Omodei, Jean-Philippe Cointet and Thierry Poibeau |
645 |
Discovering and visualising stories in news |
Marieke van Erp, Marit Nijsen, Gleb Satyukov and Piek Vossen |
646 |
Supervised Within-Document Event Coreference using Information Propagation |
Zhengzhong Liu, Jun Araki, Eduard Hovy and Teruko Mitamura |
647 |
VOCE Corpus: Ecologically Collected Speech Annotated with Physiological and Psychological Stress Assessments |
Ana Aguiar, Mariana Kaiseler, Hugo Meinedo and Pedro Almeida |
648 |
Language Resource Addition: Dictionary or Corpus? |
Shinsuke Mori and Graham Neubig |
650 |
The DIRHA simulated corpus |
Luca Cristoforetti, Mirco Ravanelli, Maurizio Omologo, Alessandro Sosi, Alberto Abad, Martin Hagmueller and Petros Maragos |
656 |
The Slovak Categorized News Corpus |
Daniel Hladek, Jan Stas and Jozef Juhar |
657 |
High Quality Word Lists as a Resource for Multiple Purposes |
Uwe Quasthoff, Dirk Goldhahn, Thomas Eckart, Erla Hallsteinsdóttir and Sabine Fiedler |
658 |
Quality-based active sample selection strategy for statistical machine translation |
Varvara Logacheva and Lucia Specia |
659 |
The Multilingual Paraphrase Database |
Juri Ganitkevitch and Chris Callison-Burch |
660 |
Language Processing Infrastructure in the XLike Project |
Lluís Padró, Zeljko Agic, Xavier Carreras, Blaz Fortuna, Esteban García-Cuesta, Zhixing Li, Tadej Stajner and Marko Tadić |
661 |
Conceptual transfer: Using local classifiers for transfer selection |
Gregor Thurmair |
664 |
Metadata as Linked Open Data: mapping disparate XML metadata registries into one RDF/OWL registry. |
Marta Villegas, Maite Melero and Núria Bel |
665 |
Sharing resources between free/open-source rule-based machine translation systems: Grammatical Framework and Apertium |
Grégoire Détrez, Víctor M. Sánchez-Cartagena and Aarne Ranta |
669 |
Annotating Arguments: The NOMAD Collaborative Annotation Tool |
Georgios Petasis |
670 |
A stream computing approach towards scalable NLP |
Xabier Artola, Zuhaitz Beloki and Aitor Soroa |
672 |
ISLEX – a Multilingual Web Dictionary |
Þórdís Úlfarsdóttir |
674 |
Exploiting catenae in parallel treebank alignment |
Manuela Sanguinetti and Cristina Bosco |
675 |
Sublanguage Corpus Analysis Toolkit: A tool for assessing the representativeness and sublanguage characteristics of corpora |
Irina Temnikova, William A. Baumgartner Jr. and K. Bretonnel Cohen |
676 |
A Large-Scale Evaluation of Pre-editing Strategies for Improving User-Generated Content Translation |
Violeta Seretan, Johanna Gerlach and Pierrette Bouillon |
677 |
Correcting Errors in a New Gold Standard for Tagging Icelandic Text |
Sigrún Helgadóttir, Hrafn Loftsson and Eiríkur Rögnvaldsson |
679 |
Semi-compositional Method for Synonym Extraction of Multi-Word Terms |
Béatrice DAILLE and Amir HAZEM |
680 |
TUKE-BNews-SK Slovak Broadcast News Corpus Construction and Evaluation |
Matus Pleva and Jozef Juhar |
681 |
The Hungarian Gigaword Corpus |
Csaba Oravecz, Tamás Váradi and Bálint Sass |
682 |
Hindi to English Machine Translation: Using Effective Selection in Multi-Model SMT |
Kunal Sachdeva, Rishabh Srivastava, Sambhav Jain and Dipti Sharma |
686 |
From Natural Language to Ontology Population in the Cultural Heritage Domain. A Computational Linguistics-based approach. |
Maria Pia di Buono and Mario Monteleone |
687 |
Experiences with Parallelisation of an Existing NLP Pipeline: Tagging Hansard |
Stephen Wattam, Paul Rayson, Marc Alexander and Jean Anderson |
688 |
Named Entity Corpus Construction using Wikipedia and DBpedia Ontology |
Younggyun Hahm, Kyungtae Lim, Jungyeul Park, Youngsik Kim, Dosam Hwang and Key-Sun Choi |
689 |
A model to generate adaptive multimodal job interviews with a virtual recruiter |
Zoraida Callejas, Brian Ravenet, Magalie Ochs and Catherine Pelachaud |
690 |
The SETimes.HR Linguistically Annotated Corpus of Croatian |
Željko Agić and Nikola Ljubešić |
691 |
ACTIV-ES: a comparable, cross-dialect corpus of ‘everyday’ Spanish from Argentina, Mexico, and Spain |
Jerid Francom, Mans Hulden and Adam Ussishkin |
692 |
Multiple Choice Question Corpus Analysis for Distractor Characterization |
Van-Minh Pho, Thibault André, Anne-Laure Ligozat, Brigitte Grau, Gabriel Illouz and Thomas Francois |
694 |
Croatian Dependency Treebank 2.0: New Annotation Guidelines for Improved Parsing |
Željko Agić, Daša Berović, Danijela Merkler and Marko Tadić |
695 |
Euronews: a multilingual speech corpus for ASR |
Roberto Gretter |
696 |
Constituency Parsing of Bulgarian: Word- vs Class-based Parsing |
Masood Ghayoomi, Kiril Simov and Petya Osenova |
697 |
A Multimodal Corpus of Rapid Dialogue Games |
Maike Paetzel, David Nicolas Racca and David DeVault |
701 |
Identification of Technology Terms in Patents |
Peter Anick, Marc Verhagen and James Pustejovsky |
703 |
Towards Linked Hypernyms Dataset 2.0: complementing DBpedia with hypernym discovery |
Tomáš Kliegr and Milan Dojchinovski |
704 |
Automatic Mapping Lexical Resources: A Lexical Unit as the Keystone |
Eduard Bejček, Václava Kettnerová and Marketa Lopatkova |
706 |
TermWise: A CAT-tool with Context-Sensitive Terminological Support. |
Kris Heylen, Stephen Bond, Dirk De Hertog, Ivan Vulić and Hendrik Kockaert |
708 |
Measuring the Impact of Spelling Errors Upon the Quality of Machine Translation |
Irina Galinskaya, Valentin Gusev, Elena Mescheryakova and Mariya Shmatova |
709 |
Towards Multilingual Conversations in the Medical Domain: Development of Multilingual Medical Data and A Network-based ASR System |
Sakriani Sakti, Keigo Kubo, Sho Matsumiya, Graham Neubig, Tomoki Toda, Satoshi Nakamura, Fumihiro Adachi and Ryosuke Isotani |
710 |
The Slovene BNSI Broadcast News database and reference speech corpus GOS: Towards the uniform guidelines for future work |
Andrej Zgank, Ana Zwitter Vitez and Darinka Verdonik |
712 |
From videos to synsets: enriching ItalWordNet multimodally |
Roberto Bartolini, Valeria Quochi, Irene De Felice, Irene Russo and Monica Monachini |
714 |
Language Editing Dataset of Academic Texts |
Vidas Daudaravicius |
715 |
A Toolkit for Efficient Learning of Lexical Units for Speech Recognition |
Matti Varjokallio and Mikko Kurimo |
716 |
Towards Automatic Transformation between Different Transcription Conventions: Prediction of Intonation Markers from Linguistic and Acoustic Features |
Yuichi Ishimoto, Tomoyuki Tsuchiya, Hanae Koiso and Yasuharu Den |
717 |
A Japanese conversation corpus for training and evaluation of prediction model of backchannels. |
Hiroaki Noguchi, Yasuhiro Katagiri and Yasuharu Den |
719 |
Aix Map Task corpus: The French multimodal corpus of task-oriented dialogue |
Jan Gorisch, Corine Astésano, Ellen Gurman Bard, Brigitte BIGI and Laurent Prévot |
721 |
Adapting a part-of-speech tagset to non-standard text: the case of STTS |
Heike Zinsmeister, Ulrich Heid and Kathrin Beck |
722 |
Alert! … Calm down, there is nothing to worry about. Warning and soothing speech synthesis. |
Milan Rusko, Sakhia Darjaa, Marian Trnka, Marian Ritomsky and Robert Sabo |
723 |
Multiword Expressions in Machine Translation |
Valia Kordoni and Iliana Simova |
726 |
CROMER: a Tool for Cross-Document Event and Entity Coreference |
Christian Girardi, Manuela Speranza, Rachele Sprugnoli and Sara Tonelli |
727 |
RSS-TOBI - A Prosodically Enhanced Romanian Speech Corpus |
Tiberiu Boroș, Adriana Stan, Oliver Watts and Stefan Daniel Dumitrescu |
728 |
Nederlab, towards a Virtual Research Environment for textual data |
Marc Kemps-Snijders |
729 |
Coreference Resolution for Latvian |
Arturs Znotins and Peteris Paikens |
730 |
How to construct a multi-lingual domain ontology |
Nitsan Chrizman and Alon Itai |
732 |
Automatic language identity tagging on word and sentence-level in multilingual text sources: a case-study on Luxembourgish |
Thomas Lavergne, Gilles Adda and Martine Adda-Decker |
733 |
Towards Shared Datasets for Normalization Research |
Orphee De Clercq, Sarah Schulz, Bart Desmet and Veronique Hoste |
735 |
Rule-based Reordering Space in Statistical Machine Translation |
Nicolas Pécheux, Alexander Allauzen and François Yvon |
736 |
KALAKA-3: a database for the recognition of spoken European languages on YouTube audios |
Luis Javier Rodriguez-Fuentes, Mikel Penagarikano, Amparo Varona, Mireia Diez and German Bordel |
737 |
Mining Online Discussion Forums for Metaphors |
Andrew Gargett and John Barnden |
738 |
TagNText: A parallel corpus for the induction of resource-specific non-taxonomical relations from tagged images |
Theodosia Togia and Ann Copestake |
739 |
CORILGA: a Galician Multilevel Annotated Speech Corpus for Linguistic Analysis |
Carmen Garcia-Mateo, Antonio Cardenal, Xose Luis Regueira, Elisa Fernández Rei, Marta Martinez, Roberto Seara, Rocío Varela and Noemí Basanta |
741 |
A Database of Full Body Virtual Interactions Annotated with Expressivity Scores |
Virginie Demulier, Elisabetta Bevacqua, Florian Focone, Tom Giraud, Pamela Carreno, Brice Isableu, Sylvie Gibet, Pierre De Loor and Jean-Claude Martin |
743 |
Managing complex access scenarios to large and heterogeneous corpora |
Piotr Banski, Nils Diewald, Michael Hanl, Marc Kupietz and Andreas Witt |
744 |
Basque Speecon-like and Basque SpeechDat MDB-600: speech databases for the development of ASR technology for Basque |
Igor Odriozola, Inma Hernaez, María Inés Torres, Luis Javier Rodriguez-Fuentes, Mikel Penagarikano and Eva Navas |
745 |
DiVE-Arabic: Gulf Arabic Dialogue in a Virtual Environment |
Andrew Gargett and Sam Hellmuth |
747 |
A multimodal interpreter for 3D visualization and animation of verbal concepts |
Coline Claude-Lachenaud, Eric Charton, Benoit Ozell and Michel Gagnon |
748 |
Erlangen-CLP: A Large Annotated Corpus of Speech from Children with Cleft Lip and Palate |
Tobias Bocklet, Andreas Maier, Korbinian Riedhammer, Ulrich Eysholdt and Elmar Nöth |
750 |
Classifying Inconsistencies in DBpedia Multilingual Chapters |
Elena Cabrio, Serena Villata, Julien Cojan and Fabien Gandon |
751 |
TVD: A Reproducible and Multiply Aligned TV Series Dataset |
Anindya Roy, Camille Guinaudeau, Herve Bredin and Claude Barras |
753 |
Towards an SMS Dictionary Construction: An Alignment-based Approach |
Reda Bestandji, Cédric Lopez and Mathieu Roche |
754 |
Compounds and distributional thesauri |
Olivier Ferret |
755 |
TALC-sef A Manually-Revised pos-TAgged Literary Corpus in Serbian, English and French |
Antonio Balvet, Dejan Stosic and Aleksandra Miletic |
756 |
Crowdsourcing for Evaluating Translation Quality |
Shinsuke Goto, Donghui Lin and Toru Ishida |
758 |
The Halliday Centre Tagger: An Online Platform for Semi-automatic Text Annotation and Analysis |
Billy T.M. Wong |
763 |
Flow Graph Corpus from Recipe Texts |
Shinsuke Mori, Hirokuni Maeta, Yoko Yamakata and Tetsuro Sasada |
764 |
Freepal: A Large Collection of Deep Lexico-Syntactic Patterns for Relation Extraction |
Johannes Kirschnick, Alan Akbik and Holmer Hemsen |
766 |
Correcting and Validating Syntactic Dependency in the Spoken French Treebank Rhapsodie |
Rachel Bawden, Marie-Amélie Botalla, Kim Gerdes and Sylvain Kahane |
768 |
Morfeusz – Revolutions |
Marcin Woliński |
769 |
Modeling, Managing, Exposing, and Linking Ontologies with a Wiki-based Tool |
Mauro Dragoni, Alessio Bosca, Matteo Casu and Andi Rexha |
770 |
Student achievement and French sentence repetition test scores |
Deryle Lonsdale and Benjamin Millard |
771 |
Human annotation of ASR error regions: Is "gravity" a sharable concept for human annotators? |
Daniel Luzzati, Cyril Grouin, Ioana Vasilescu, Martine Adda-Decker, Eric Bilinski, Nathalie Camelin, Juliette Kahn, Carole Lailler, Lori Lamel and Sophie Rosset |
772 |
SwissAdmin: A multilingual tagged parallel corpus of press releases |
Yves Scherrer, Luka Nerima, Lorenza Russo, Maria Ivanova and Eric Wehrli |
773 |
To pay or to get paid: Enriching a Valency Lexicon with Diatheses |
Anna Vernerová, Václava Kettnerová and Marketa Lopatkova |
774 |
UM-Corpus: A Large English-Chinese Parallel Corpus for Statistical Machine Translation |
Liang Tian, Derek F. Wong, Lidia S. Chao, Paulo Quaresma and Francisco Oliveira |
775 |
IXA pipeline: Efficient and Ready to Use Multilingual NLP tools |
Rodrigo Agerri, Josu Bermudez and German Rigau |
777 |
Annotating the Focus of Negation in Japanese Text |
Suguru Matsuyoshi, Ryo Otsuki and Fumiyo Fukumoto |
780 |
NIF4OGGD - NLP Interchange Format for Open German Governmental Data |
Mohamed Sherif, Sandro Coelho, Ricardo Usbeck, Sebastian Hellmann, Jens Lehmann, Martin Brümmer and Andreas Both |
781 |
A Gold Standard for CLIR evaluation in the Organic Agriculture Domain |
Alessio Bosca, Matteo Casu, Matteo Dragoni and Nikolaos Marianos |
784 |
Optimising Morphological Analyser Transducers Using Morphological Structure and Automatically Induced Flag Diacritics |
Senka Drobac, Krister Lindén, Tommi Pirinen and Miikka Silfverberg |
786 |
META-SHARE: One year after |
Stelios Piperidis, Harris Papageorgiou, Christian Spurk, Georg Rehm, Khalid Choukri, Olivier Hamon, Nicoletta Calzolari, Riccardo Del Gratta, Bernardo Magnini and Christian Girardi |
787 |
Using a Serious Game to Collect a Child Learner Speech Corpus |
Claudia Baur and Manny Rayner |
788 |
The LREMap disclosed |
Riccardo Del Gratta |
789 |
The Development of the Multilingual LUNA Corpus for Spoken Language System Porting |
Evgeny Stepanov, Giuseppe Riccardi and Ali Orkan Bayer |
791 |
Ranking Job Offers for Candidates: learning hidden knowledge from Big Data |
Marc Poch, Núria Bel, Sergio Espeja and Felipe Navio |
792 |
An Open-Source Heavily Multilingual Translation Graph Extracted from Wiktionaries and Parallel Corpora |
Valérie Hanoka and Benoît Sagot |
793 |
Crowd-sourcing evaluation of automatically acquired, morphologically related word groupings |
Claudia Borg and Albert Gatt |
795 |
An Innovative World Language Centre: Challenges for the Use of Language Technology |
Auður Hauksdóttir |
797 |
A language-independent and fully unsupervised approach to lexicon induction and part-of-speech tagging based on closely related languages |
Yves Scherrer and Benoît Sagot |
799 |
New bilingual speech databases for audio diarization |
David Tavarez, Eva Navas, Daniel Erro, Ibon Saratxaga and Inma Hernaez |
801 |
A new part-of-speech tagger for Norwegian: Building on existing language resources |
Cristina Sánchez Marco |
803 |
Bilingual dictionaries for all EU languages |
Ahmet Aker, Monica Paramita, Marcis Pinnis and Robert Gaizauskas |
804 |
Synergy of Nederlab and @PhilosTEI: diachronic and multilingual Text-Induced Corpus Clean-up |
Martin Reynaert |
807 |
Quality Estimation for Synthetic Parallel Data Generation |
Raphael Rubino and Antonio Toral |
808 |
Adding a Third Language to a Lexical Resource Describing Legal Terminology: the assignment of equivalents |
Janine Pimentel |
809 |
An Out-of-Domain Test Suite for Dependency Parsing of German |
Wolfgang Seeker and Jonas Kuhn |
810 |
Representing Multilingual Data as Linked Data: the Case of BabelNet 2.0 |
Maud Ehrmann, Daniele Vannella, John Philip McCrae, Philipp Cimiano and Roberto Navigli |
813 |
NOMAD: Linguistic Resources and Tools aimed at Policy Formulation and Validation through non-Moderated Crowd-sourcing |
George Kiomourtzis, George Giannakopoulos, Pythagoras Karampiperis and Vangelis Karkaletsis |
814 |
Encompassing a spectrum of LT users in the CLARIN-DK Infrastructure |
Lina Henriksen, Dorte Haltrup Hansen, Bente Maegaard, Bolette Sandford Pedersen and Claus Povlsen |
816 |
Automatically enriching spoken corpora with syntactic information for linguistic studies |
Alexis Nasr, Frederic Bechet, Benoit Favre, Thierry Bazillon, Jose Deulofeu and Andre Valli |
817 |
Propa-L: a semantic filtering service from a lexical network created using Games With A Purpose |
Mathieu Lafourcade and Karën Fort |
818 |
A reduced inventory of categories for training a model for the Italian Stanford Dependencies |
Maria Simi, Cristina Bosco and Simonetta Montemagni |
819 |
A Preprocessing Pipeline for Chinese-English Patent Machine Translation |
Minwei Feng and Hermann Ney |
820 |
Meta-Classifiers Easily Improve Commercial Sentiment Detection Tools |
Mark Cieliebak, Oliver Dürr and Fatih Uzdilli |
823 |
UnixMan Corpus: A Resource for Grounded Language Learning in the Unix Domain |
Kyle Richardson and Jonas Kuhn |
824 |
GraPAT: a Tool for Graph Annotations |
Jonathan Sonntag and Manfred Stede |
825 |
Providing Searchability Using Broker Architecture on an Evolving Infrastructure |
Matthijs Brouwer, Hennie Brugman, Marc Kemps-Snijders, Jan Pieter Kunst, Maarten van der Peet, Rob Zeeman and Junte Zhang |
826 |
Standardisation and Interoperation of Morphosyntactic and Syntactic Annotation Tools for Spanish and their Annotations |
Antonio Pareja-Lora, Guillermo Cárcamo-Escorza and Alicia Ballesteros-Calvo |
828 |
A Framework for Compiling High Quality Knowledge Bases From Raw Corpora |
Gongye Jin, Daisuke Kawahara and Sadao Kurohashi |
829 |
Fuzzy V-Measure - An Evaluation Method for Cluster Analyses of Ambiguous Data |
Jason Utt, Sylvia Springorum and Sabine Schulte im Walde |
832 |
The Tutorbot Corpus – A Corpus for Studying Tutoring Behaviour in Multiparty Face-to-Face Spoken Dialogue |
Maria Koutsombogera, Samer Al Moubayed, Bajibabu Bollepalli, Ahmed Hussen Abdelaziz, Martin Johansson, José David Aguas Lopes, Jekaterina Novikova, Catharine Oertel, Kalin Stefanov and Gül Varol |
834 |
TweetCaT: a tool for building Twitter corpora of smaller languages |
Nikola Ljubešić, Darja Fišer and Tomaž Erjavec |
835 |
HindEnCorp - Hindi-English and Hindi-only Corpus for Machine Translation |
Ondrej Bojar, Vojtěch Diatka, Pavel Stranak and Daniel Zeman |
837 |
Combining dependency information and generalization in a pattern-based approach to the classification of lexical-semantic relation instances |
Silvia Necsulescu, Sara Mendes and Núria Bel |
838 |
Using Audio Books for Training a Text-to-Speech System |
Aimilios Chalamandaris, Pirros Tsiakoulis, Sotiris Karabetsos and Spyros Raptis |
840 |
Using a sledgehammer to crack a nut? Lexical diversity and event coreference resolution. |
Agata Cybulska and Piek Vossen |
841 |
caWaC – A web corpus of Catalan |
Nikola Ljubešić and Antonio Toral |
842 |
Recent Developments in DeReKo |
Marc Kupietz and Harald Lüngen |
843 |
Evaluating Chinese Web-as-Corpus: Or: How Big Data is Killing Chinese Corpus Linguistics |
Shu-Kai Hsieh |
844 |
Automatic acquisition of Urdu nouns (along with gender and irregular plurals) |
Tafseer Ahmed |
845 |
Re-using an Argument Corpus to Aid in the Curation of Social Media Collections |
Clare Llewellyn, Claire Grover, Jon Oberlander and Ewan Klein |
846 |
Billions of Parallel Words for Free |
Jörg Tiedemann, Raivis Skadiņš, Roberts Rozis and Daiga Deksne |
847 |
Generating Polarity Lexicons with WordNet propagation in 5 languages |
Isa Maks, Ruben Izquierdo, Francesca Frontini, Montse Cuadros, Rodrigo Agerri and Piek Vossen |
848 |
Online optimisation of log-linear weights in interactive machine translation |
Mara Chinea-Rios, Germán Sanchis Trilles, Daniel Ortiz-Martínez and Francisco Casacuberta |
849 |
Extending HeidelTime for Temporal Expressions Referring to Historic Dates |
Jannik Strötgen, Thomas Bögel, Julian Zell and Michael Gertz |
850 |
An Exercise in Reuse of Resources: Adapting General Discourse Coreference Resolution for Detecting Lexical Chains in Patent Documentation |
Nadjet Bouayad-Agha, Alicia Burga, Gerard Casamayor, Joan Codina, Rogelio Nazar and Leo Wanner |
851 |
POL: Visualizing, Manipulating and Evaluating Ontology Alignments |
Bernardo Severo, Cassia Trojahn and Renata Vieira |
852 |
A 500 Million Word POS-Tagged Icelandic Corpus |
Thomas Eckart, Erla Hallsteinsdóttir, Sigrún Helgadóttir, Uwe Quasthoff and Dirk Goldhahn |
853 |
A Collection of Scholarly Book Reviews from the Platforms of electronic sources in Humanities and Social Sciences OpenEdition.org |
Chahinez Benkoussas, Hussam Hamdan, Patrice Bellot, Frédéric Béchet and Elodie Faath |
855 |
Rapid Deployment of Phrase Structure Parsing for Related Languages: A Case Study of Insular Scandinavian |
Anton Karl Ingason, Hrafn Loftsson, Eiríkur Rögnvaldsson, Einar Freyr Sigurðsson and Joel C. Wallenberg |
856 |
N³ - A Collection of Datasets for Named Entity Recognition and Disambiguation in the NLP Interchange Format |
Michael Röder, Ricardo Usbeck, Sebastian Hellmann, Daniel Gerber and Andreas Both |
857 |
Assessment of non-native spoken Spanish using quantitative scores and perceptual evaluation |
Valentín Cardeñoso-Payo, César González-Ferreras and David Escudero |
858 |
Exploiting the large-scale German Broadcast Corpus to boost the Fraunhofer IAIS Speech Recognition System |
Michael Stadtschnitzer, Jochen Schwenninger, Daniel Stein and Joachim Koehler |
859 |
Summarizing news clusters on the basis of thematic chains |
Natalia Loukachevitch and Aleksey Alekseev |
860 |
The Hamburg Dependency Treebank |
Kilian A. Foth, Niels Beuck, Arne Köhn and Wolfgang Menzel |
862 |
Discovering the Italian literature: interactive access to audio indexed text resources |
Vincenzo Galatà, Alberto Benin, Piero Cosi, Giuseppe Riccardo Leone, Giulio Paci, Giacomo Sommavilla and Fabio Tesser |
863 |
Enabling Language Resources to Expose Translations as Linked Data on the Web |
Jorge Gracia, Elena Montiel-Ponsoda, Daniel Vila-Suero and Guadalupe Aguado-de-Cea |
864 |
Pivot-based multilingual dictionary building using Wiktionary |
Judit Ács |
865 |
Exploring the utility of coreference chains for improved identification of personal names |
Andrea Glaser and Jonas Kuhn |
866 |
Multilingual eXtended WordNet Knowledge Base: Semantic Parsing and Translation of Glosses |
Tatiana Erekhinskaya, Meghana Satpute and Dan Moldovan |
867 |
Relation Inference in Lexical Network ... with Refinements |
Manel Zarrouk and Mathieu Lafourcade |
869 |
A Multimodal Dataset for Deception Detection |
Veronica Perez-Rosas, Rada Mihalcea, Alexis Narvaez and Mihai Burzo |
870 |
C-PhonoGenre: a 7-hours corpus of 7 speaking styles in French: relations between situational features and prosodic properties |
Jean-Philippe Goldman, Tea Prsir and Antoine Auchlin |
877 |
The AMARA Corpus: Building parallel language resources for the educational domain |
Ahmed Abdelali, Francisco Guzman, Hassan Sajjad and Stephan Vogel |
879 |
Dependency parsing representation effects on the accuracy of semantic applications — an example of an inflective language |
Lauma Pretkalniņa, Artūrs Znotiņš and Laura Rituma |
882 |
Ruled-based, Interlingual Motivated Mapping of plWordNet onto SUMO Ontology |
Paweł Kędzia and Maciej Piasecki |
883 |
Extending the coverage of a MWE database for Persian CPs exploiting valency alternations |
Pollet Samvelian and Pegah Faghiri |
887 |
Finding a Tradeoff between Accuracy and Rater's Workload in Grading Clustered Short Answers |
Andrea Horbach, Alexis Palmer and Magdalena Wolska |
889 |
Macrosyntactic Segmenters of a spoken French Corpora |
Ilaine Wang, Sylvain Kahane and Isabelle Tellier |
891 |
The Extended DIRNDL Corpus as a Resource for Coreference and Bridging Resolution |
Anders Björkelund, Kerstin Eckart, Arndt Riester, Nadja Schauffler and Katrin Schweitzer |
892 |
A flexible language learning platform based on language resources and web services |
Elena Volodina, Ildikó Pilán, Lars Borin and Therese Lindström Tiedemann |
893 |
Towards interoperable discourse annotation. Discourse features in the Ontologies of Linguistic Annotation |
Christian Chiarcos |
896 |
Morphological parsing of Swahili using crowdsourced lexical resources |
Patrick Littell, Kaitlyn Price and Lori Levin |
899 |
Improving Entity Linking using Surface Form Refinement |
Eric Charton, Marie-Jean Meurs, Ludovic Jean-Louis and Michel Gagnon |
900 |
Extracting semantic relations from Portuguese corpora using lexical-syntactic patterns |
Raquel Amaro |
902 |
LinkedHealthAnswers: Towards Linked Data-driven Question Answering for the Health Care Domain |
Ulli Waltinger and Artem Ostankov |
904 |
An analysis of ambiguity in word sense annotations |
David Jurgens |
906 |
VOLIP: a corpus of spoken Italian and a virtuous example of reuse of linguistic resources |
Iolanda Alfano, Francesco Cutugno, Aurelio De Rosa, Claudio Iacobini, Renata Savy and Miriam Voghera |
909 |
Chasing the Perfect Splitter: A Comparison of Different Compound Splitting Tools |
Carla Parra Escartín |
911 |
A Comparison of MT Errors and ESL Errors |
Homa B. Hashemi and Rebecca Hwa |
912 |
New functions for a multipurpose multimodal tool for phonetic and linguistic analysis of very large speech corpora |
Philippe Martin |
913 |
Hot Topics and Schisms in NLP: Community and Trend Analysis with Saffron on ACL and LREC Proceedings |
Paul Buitelaar, Georgeta Bordea and Barry Coughlan |
914 |
The American News Corpus |
Ann Irvine, Joshua Langfus and Chris Callison-Burch |
915 |
Thirty Dependency Treebanks Stanfordized |
Rudolf Rosa, Jan Mašek, David Mareček, Martin Popel, Daniel Zeman and Zdeněk Žabokrtský |
916 |
Building Sense-Tagged Multilingual Corpora |
Shan Wang and Francis Bond |
918 |
Multilingual corpora with coreferential annotation of person entities |
Marcos Garcia and Pablo Gamallo |
919 |
A Large Scale, Multi-Genre, Multi-Dialect Lexicon for Arabic Sentiment Analysis |
Muhammad Abdul-Mageed and Mona Diab |
920 |
Evaluation of Random Indexing for Technology Term Recognition. |
Behrang Qasemizadeh and Siegfried Handschuh |
921 |
Optimizing a distributional semantic model for the prediction of German Particle Verb Compositionality |
Stefan Bott and Sabine Schulte im Walde |
922 |
A Hindi-English Code-Switching Corpus |
Anik Dey and Pascale Fung |
926 |
The Language Application Grid |
Nancy Ide, James Pustejovsky, Christopher Cieri, Eric Nyberg, Di Wang, Keith Suderman, Marc Verhagen and Jonathan Wright |
929 |
DisMo: A Morphosyntactic, Disfluency and Multi-Word Unit Annotator. An Evaluation on a Corpus of French Spontaneous and Read Speech |
George Christodoulides, Mathieu Avanzi and Jean-Philippe Goldman |
930 |
Integration of Workflow and Pipeline for Composite Language Services |
Trang Mai Xuan, Yohei Murakami, Donghui Lin and Toru Ishida |
931 |
Segmentation evaluation metrics, a comparison grounded on prosodic and discourse units |
Klim Peshkov and Laurent Prévot |
934 |
KoKo: An L1 learner corpus for German |
Andrea Abel, Aivars Glaznieks, Lionel Nicolas and Egon Stemle |
935 |
Improving MT Evaluation of Czech through Paraphrasing |
Petra Barancikova, Rudolf Rosa and Ales Tamchyna |
936 |
How to Make Use of Confidential Corpora without Seeing the Raw Data |
Erik Faessler, Johannes Hellrich and Udo Hahn |
940 |
Open Philology at the University of Leipzig |
Frederik Baumgardt, Monica Berti, Giuseppe Celano, Gregory R. Crane, Stella Dee, Maryam Foradi, Emily Franzini, Greta Franzini, Simon Frazier, Monica Lent, Maria Moritz and Simona Stoyanova |
941 |
A Modular System for Rule-based Text Categorisation |
Malvina Nissim |
943 |
DCEP - Digital Corpus of the European Parliament |
Najeh Hajlaoui, David Kolovratnik, Jaakko Väyrynen, Ralf Steinberger and Daniel Varga |
944 |
Evaluating Human-Robot Interaction |
Kristinn R. Thorisson and Gudberg K. Jonsson |
945 |
Facing the Identification Problem in Language-Related Scientific Data Analysis. |
Joseph Mariani |
947 |
Smile and Laughter in Human-Machine Interaction: a study of engagement |
Mariette Soury and Laurence Devillers |
954 |
Utilizing constituent structure for compound analysis |
Kristín Bjarnadóttir and Jón Daðason |
956 |
Large-scale Arabic Error Annotation: Guidelines and Framework |
Wajdi Zaghouani, Behrang Mohit, Nizar Habash, Ossama Obeid, Nadi Tomeh and Kemal Oflazer |
959 |
El-WOZ: a client-server wizard-of-oz open-source interface |
Thomas Pellegrini, Vahid Hedayati and Angela Costa |
960 |
ETER: a new metric for the evaluation of hierarchical named entity recognition |
Mohamed Ben Jannet, Martine Adda-Decker, Olivier Galibert, Juliette Kahn and Sophie Rosset |
963 |
Detecting Subevent Structure for Event Coreference Resolution |
Jun Araki, Zhengzhong Liu, Eduard Hovy and Teruko Mitamura |
964 |
An efficient and user-friendly tool for machine translation quality estimation |
Kashif Shah, Marco Turchi and Lucia Specia |
965 |
Resource Creation and Evaluation for Multilingual Sentiment Analysis in Social Media Texts |
Alexandra Balahur, Marco Turchi, Ralf Steinberger, Jose Manuel Perea-Ortega, Guillaume Jacquet, Dilek Kucuk, Vanni Zavarella and Adil El Ghali |
967 |
Named Entity Tagging a Very Large Unbalanced Corpus: Training and Evaluating NE Classifiers |
Joachim Bingel and Thomas Haider |
969 |
Per-Fide - Parallel Annotated Corpora |
Rui Brito, José João Almeida and Alberto Simões |
970 |
Validation Issues induced by an Automatic Pre-Annotation Mechanism in the Building of Non-projective Dependency Treebanks |
Ophélie Lacroix and Denis Béchet |
971 |
MAT: a tool for L2 pronunciation errors annotation |
Renlong Ai and Marcela Charfuelan |
973 |
Word Semantic Similarity for Morphologically Rich Languages |
Elias Iosif, Kalliopi Zervanou, Maria Giannoudaki and Alexandros Potamianos |
975 |
LexTerm: A Design for Merged Lexicographical and Terminological Databases |
Joshua Elliot, Logan Kearsley and Alan Melby |
977 |
Focusing Annotation for Semantic Role Labeling |
Daniel Peterson, Martha Palmer and Shumin Wu |
978 |
Off-Road LAF: Encoding and Processing Annotations in NLP Workflows |
Emanuele Lapponi, Erik Velldal and Stephan Oepen |
979 |
Developing a Framework for Describing Relations among Language Resources |
Penny Labropoulou, Christopher Cieri and Maria Gavrilidou |
980 |
Evaluating Web-as-corpus Topical Document Retrieval with an Index of the OpenDirectory |
Clément de Groc and Xavier Tannier |
982 |
Word Alignment-Based Reordering of Source Chunks in PB-SMT |
Santanu Pal, Sudip Kumar Naskar and Sivaji Bandyopadhyay |
983 |
Taalportaal: a language portal for the grammar of Dutch and Frisian |
Frank Landsbergen |
985 |
A Framework for Public Health Surveillance |
Andrew Yates, Jon Parker, Nazli Goharian and Ophir Frieder |
990 |
A tool suite for creating question answering benchmarks |
Axel-Cyrille Ngonga Ngomo, Norman Heino and René Speck |
991 |
Thematic Cohesion: measuring terms discriminatory power toward themes |
Clément de Groc, Xavier Tannier and Claude de Loupy |
992 |
Terminology Resources and Terminology Work Benefit from Cloud Services |
Tatiana Gornostay and Andrejs Vasiļjevs |
995 |
Bidirectional converter between syntactic annotations: from French Treebank Dependencies to PASSAGE annotations, and back |
Munshi Asadullah, Patrick Paroubek and Anne Vilnat |
996 |
VarClass: An Open-source Language Identification Tool for Language Varieties |
Marcos Zampieri and Binyam Gebre |
998 |
RECSA: Resource for Evaluating Cross-lingual Semantic Annotation |
Achim Rettinger, Lei Zhang, Daša Berović, Danijela Merkler, Matea Srebačić and Marko Tadić |
1000 |
On Paraphrase Identification Corpora |
Vasile Rus and Rajendra Banjade |
1001 |
A Gold Standard Corpus of Comparisons in Product Reviews |
Wiltrud Kessler and Jonas Kuhn |
1003 |
Automatic building of probabilistic morphological resources for the biomedical domain |
Vincent Claveau and Ewa Kijak |
1005 |
A System for Experiments with Dependency Parsers for Bulgarian |
Kiril Simov, Ginka Ivanova, Maria Mateva and Petya Osenova |
1007 |
Sockpuppet Detection in Wikipedia: A Corpus of Real-World Deceptive Writing for Linking Identities |
Thamar Solorio, Ragib Hasan and Mainul Mizan |
1009 |
Textual Emigration Analysis (TEA) |
Andre Blessing and Jonas Kuhn |
1010 |
Toward a unifying model for Opinion, Sentiment and Emotion information extraction |
Amel Fraisse and Patrick Paroubek |
1011 |
Towards automatic quality assessment of component metadata |
Thorsten Trippel, Daan Broeder, Matej Durco and Oddrun Ohren |
1012 |
PropBank: Semantics of New Predicate Types |
Claire Bonial |
1016 |
RESTful Annotation and Efficient Collaboration |
Jonathan Wright |
1017 |
ALICO: a multimodal corpus for the study of active listening |
Joanna Skubisz, Zofia Malisz, Marcin Wlodarczak, Hendrik Buschmeier, Ipke Wachsmuth, Stefan Kopp and Petra Wagner |
1018 |
PoliTa: A multitagger for Polish |
Łukasz Kobyliński |
1019 |
A Corpus of Participant Roles in Contentious Discussions |
Siddharth Jain, Archna Bhatia, Angelique Rein and Eduard Hovy |
1020 |
Revising the annotation of a Broadcast News corpus: a linguistic approach |
Vera Cabarrão, Helena Moniz, Fernando Batista, Ricardo Ribeiro, Nuno Mamede, Hugo Meinedo, Isabel Trancoso, Ana Isabel Mata and David Martins de Matos |
1021 |
Gold-standard for topic-specific sentiments in economic texts |
Pyry Takala, Pekka Malo, Ankur Sinha and Oskar Ahlgren |
1022 |
Visualization of Language Relations and Families: MultiTree |
Damir Cavar and Malgorzata Cavar |
1023 |
HiEve: A Corpus for Extracting Event Hierarchies from News Stories |
Goran Glavaš and Jan Šnajder |
1026 |
Image Annotation with ISO-Space: Distinguishing Content from Structure |
James Pustejovsky and Zachary Yocum |
1027 |
The ETAPE speech processing evaluation |
Olivier Galibert, Jeremy Leixa, Gilles Adda, Khalid Choukri and Guillaume Gravier |
1029 |
PanLex: Building a Resource for Panlingual Lexical Translation |
David Kamholz and Jonathan Pool |
1031 |
NomLex-BR: A Lexicon of Portuguese Nominalizations |
Valeria de Paiva, Livy Real, Alexandre Rademaker and Gerard de Melo |
1032 |
VERTa: Facing a Multilingual Experience of a Linguistic MT Evaluation |
Elisabet Comelles, Jordi Atserias, Victoria Arranz, Irene Castellon and Jordi Sesé |
1036 |
Corpus and Method for Identifying Citations in Non-Academic Text |
Yifan He and Adam Meyers |
1037 |
Building a Dataset for Summarization and Keyword Extraction from Emails |
Vanessa Loza, Shibamouli Lahiri, Rada Mihalcea and Sean Lai |
1038 |
Improving Open Relation Extraction via Sentence Re-structuring |
Jordan Schmidek and Denilson Barbosa |
1042 |
Transcribing and Annotating a Corpus of Read Speech: Evaluating a Three-Step Crowdsourcing Approach |
Annika Hämäläinen, Fernando Pinto Moreira, Jairo Avelar, Daniela Braga and Miguel Sales Dias |
1043 |
Semi-supervised methods for expanding psycholinguistics norms by integrating distributional similarity with the structure of WordNet |
Michael Mohler, Marc Tomlinson, David Bracewell and Bryan Rink |
1044 |
Collaborative language documentation with networked smartphones |
Florian R. Hanke, Steven Bird and Haejoong Lee |
1046 |
Languagesindanger.eu - including multimedia language resources to disseminate knowledge and create educational material on less‑resourced languages |
Dagmar Jung, Katarzyna Klessa, Zsuzsa Duray, Beatrix Oszkó, Mária Sipos, Sándor Szeverényi, Zsuzsa Várnai, Paul Trilsbeek and Tamás Váradi |
1048 |
A Cross-language Corpus for Studying the Phonetics and Phonology of Prominence |
Bistra Andreeva, William Barry and Jacques Koreman |
1051 |
Using Resource-Rich Languages to Improve Morphological Analysis of Under-Resourced Languages |
Peter Baumann and Janet Pierrehumbert |
1052 |
Accommodations in Tuscany as Linked Data |
Clara Bacciu, Angelica Lo Duca, Andrea Marchetti and Maurizio Tesconi |
1053 |
The DWAN framework: Application of a web annotation framework for the general humanities to the domain of language resources |
Przemyslaw Lenkiewicz, Olha Shkaravska, Twan Goosen, Daan Broeder, Menzo Windhouwer, Stephanie Roth and Olof Olsson |
1056 |
Croatian Memories |
Arjan van Hessen, Franciska de Jong and Stef Scagliola |
1057 |
Combining sentence repetition and fluency features for oral proficiency measurement |
Deryle Lonsdale and Carl Christensen |
1058 |
Semantic Search in Documents Enriched by LOD-based Annotations |
Pavel Smrz |
1059 |
A meta-data driven platform for semi-automatic configuration of ontology mediators |
Manuel Fiorelli, Maria Teresa Pazienza and Armando Stellato |
1060 |
The Ellogon Pattern Engine: Context-free Grammars over Annotations |
Georgios Petasis |
1061 |
Missed opportunities in translation memory matching |
Friedel Wolff, Laurette Pretorius and Paul Buitelaar |
1062 |
Universal Stanford dependencies: A cross-linguistic typology |
Marie-Catherine de Marneffe, Natalia Silveira, Timothy Dozat and Christopher D. Manning |
1063 |
Getting Reliable Annotations for Sarcasm in Online Dialogues |
Reid Swanson, Marilyn Walker, Luke Eisenberg and Thomas Corcoran |
1064 |
Identifying Biomedical Term Candidates from Multilingual Parallel Corpora |
Johannes Hellrich, Simon Clematide, Udo Hahn and Dietrich Rebholz-Schuhmann |
1065 |
On the Importance of Text Analysis for Stock Price Prediction |
Heeyoung Lee, Mihai Surdeanu, Bill MacCartney and Dan Jurafsky |
1066 |
On the use of a fuzzy classifier to speed the Sp ToBI labeling of the Glissando Spanish corpus |
David Escudero, Lourdes Aguilar-Cuevas, César González-Ferreras, Yurena Gutiérrez-González and Valentín Cardeñoso-Payo |
1067 |
Definition patterns for predicative terms in specialized lexical resources |
Antonio San Martín and Marie-Claude L'Homme |
1068 |
Native Language Identification Using Large, Longitudinal Data |
Xiao Jiang, Yufan Guo, Jeroen Geertzen, Dora Alexopoulou and Anna Korhonen |
1070 |
Construction and Annotation of a French Folkstale Corpus |
Anne Garcia-Fernandez and Anne-Laure Ligozat |
1071 |
The Making of AncientGreekWordNet |
Yuri Bizzoni, Federico Boschetti, Harry Diakoff, Riccardo Del Gratta, Monica Monachini and Gregory Crane |
1072 |
Enriching ODIN |
Fei Xia, William Lewis, Michael Wayne Goodman, Joshua Crowgey and Emily M. Bender |
1073 |
Turkish Treebank as a Gold Standard for Morphological Disambiguation and Its Influence on Parsing |
Ozlem Cetinoglu |
1074 |
CroDeriV: a new resource for processing Croatian morphology |
Krešimir Šojat, Matea Srebačić, Marko Tadić and Tin Pavelić |
1075 |
Building a Database of Japanese Adjective Examples from Special Purpose Web Corpora |
Masaya Yamaguchi |
1078 |
Praaline: Integrating Tools for Speech Corpus Research |
George Christodoulides |
1079 |
A multilingual FrameNet-based grammar for language technology |
Dana Dannells and Normunds Grūzītis |
1080 |
Mapping Between English Strings and Reentrant Semantic Graphs |
Fabienne Braune, Daniel Bauer and Kevin Knight |
1081 |
The KiezDeutsch Korpus (KiDKo) Release 1.0 |
Ines Rehbein, Sören Schalowski and Heike Wiese |
1083 |
Etymological Wordnet: Tracing The History of Words |
Gerard de Melo |
1085 |
The Interplay Between Lexical and Syntactic Resources in Incremental Parsebanking |
Victoria Rosén |
1086 |
Interoperability and Customisation of Annotation Schemata in Argo |
Rafal Rak, Andrew Rowley, Jacob Carter, Riza Theresa Batista-Navarro and Sophia Ananiadou |
1088 |
Polish Coreference Corpus in Numbers |
Maciej Ogrodniczuk, Mateusz Kopeć and Agata Savary |
1089 |
A Gold Standard Dependency Corpus for English |
Natalia Silveira, Timothy Dozat, Marie-Catherine de Marneffe, John Bauer, Samuel Bowman and Chris Manning |
1090 |
DerivBase.hr: A High-Coverage Derivational Morphology Resource for Croatian |
Jan Šnajder |
1092 |
Extracting Information for Context-aware Meeting Preparation |
Simon Scerri, Behrang Qasemizadeh, Ismael Rivera and Maciej Dabrowski |
1093 |
A Repository of State of the Art and Competitive Baseline Summaries for Generic News Summarization |
Kai Hong, John Conroy, Benoit Favre, Alex Kulesza, Hui Lin and Ani Nenkova |
1094 |
Collecting Natural SMS and Chat Conversations in Multiple Languages: The BOLT Phase 2 Corpus |
Stephanie Strassel, Zhiyi Song, Haejoong Lee, Kevin Walker, Jonathan Wright, Jennifer Garland, Dana Fore, Brian Gainor, Preston Cabe, Thomas Thomas, Brendan Callahan and Ann Sawyer |
1095 |
Comparing the Quality of Focused Crawlers and of the Linguistic Resources Obtained from Them |
Bruno Laranjeira, Viviane Moreira, Aline Villavicencio, Carlos Ramisch and Maria José Finatto |
1096 |
Augmenting English Adjective Senses with Supersenses |
Yulia Tsvetkov, Nathan Schneider, Dirk Hovy, Archna Bhatia, Manaal Faruqui and Chris Dyer |
1097 |
N-gram Counts and Language Models from the Common Crawl |
Christian Buck, Kenneth Heafield and Bas van Ooyen |
1099 |
ColLex.en: Automatically Generating and Evaluating a Full-form Lexicon for English |
Tim vor der Brück, Alexander Mehler and Zahurul Islam |
1102 |
Demonstrating SUMMA, Creating Summarization Systems |
Horacio Saggion |
1103 |
BiographyNet: Methodological Issues when NLP supports historical research |
Antske Fokkens, Serge Ter Braake, Niels Ockeloen, Piek Vossen, Susan Legêne and Guus Schreiber |
1104 |
Enhancing the TED-LIUM corpus with selected data for language modeling and more TED Talks |
Anthony Rousseau, Paul Deléglise and Yannick Estève |
1105 |
Enrichment of Bilingual Dictionary through News Stream Data |
Ajay Dubey, Parth Gupta, Vasudeva Varma and Paolo Rosso |
1106 |
sloWCrowd: A crowdsourcing tool for lexicographic tasks |
Darja Fišer, Aleš Tavčar and Tomaž Erjavec |
1107 |
A Large Scale Database of Strongly-related Events in Japanese |
Tomohide Shibata, Shotaro Kohama and Sadao Kurohashi |
1108 |
FLELex: a graded lexical resource for French foreign learners |
Thomas Francois, Nùria Gala, Patrick Watrin and Cédrick Fairon |
1112 |
Building Domain Specific Bilingual Dictionaries |
Lucas Hilgert, Lucelene Lopes, Artur Freitas, Renata Vieira, Denise Hogetop and Aline Vanin |
1115 |
A Corpus of Machine Translation Errors Extracted from Translation Students Exercises |
Guillaume Wisniewski and François Yvon |
1116 |
Finding Romanized Arabic Dialect in Code-Mixed Tweets |
Clare Voss, Stephen Tratz, Jamal Laoudi and Douglas Briesch |
1119 |
Annotating a Music Corpus Using Semi-Supervised Learning |
Nicolas Auguin and Pascale Fung |
1120 |
#mygoal: Finding Motivations on Twitter |
Marc Tomlinson, David Bracewell, Wayne Krug and David Hinote |
1125 |
The RATS Collection: Supporting HLT Research with Degraded Audio Data |
David Graff, Stephanie Strassel, Kevin Walker, Karen Jones, Xiaoyi Ma and Ann Sawyer |
1126 |
Modeling Language Proficiency Using Implicit Feedback |
Chris Hokamp, Rada Mihalcea and Peter Schuelke |
1127 |
Event Extraction Using Distant Supervision |
Kevin Reschke, Martin Jankowiak, Mihai Surdeanu, Christopher Manning and Daniel Jurafsky |
1134 |
TLAXCALA: a multilingual corpus of independent news |
Antonio Toral |
1135 |
Creating and using large monolingual parallel corpora for sentential paraphrase generation |
Sander Wubben, Antal van den Bosch and Emiel Krahmer |
1137 |
Benchmarking of English-Hindi parallel corpora |
Jayendra Rakesh Yeka, Prasanth Kolachina and Dipti Misra Sharma |
1138 |
New Framework for Sign Recognition based on 3D Handshape Identification and Linguistic Modeling |
Mark Dilsizian, Polina Yanovich, Shu Wang, Carol Neidle and Dimitris Metaxas |
1139 |
Evaluating Improvised Hip Hop Lyrics - Challenges and Observations |
Karteek Addanki and Dekai Wu |
1143 |
Votter Corpus: A Corpus of Social Polling Language |
Nathan Green and Septina Dian Larasati |
1144 |
SinoCoreferencer: An End-to-End Chinese Event Coreference Resolver |
Chen Chen and Vincent Ng |
1145 |
Developing a dialectal Egyptian Arabic Treebank: Impact of Morphology and Syntax on Annotation and Tool Development |
Mohamed Maamouri, Ann Bies, Seth Kulick, Michael Ciul, Nizar Habash and Ramy Eskander |
1146 |
A German Twitter Snapshot |
Tatjana Scheffler |
1147 |
A Comparative Evaluation Framework for NLG in Interactive Systems |
Anja Belz and Helen Hastie |
1149 |
Relating Frames and Constructions in Japanese FrameNet |
Kyoko Ohara |
1150 |
A Graph-Based Approach for Computing Free Word Associations |
Gemma Bel Enguix, Reinhard Rapp and Michael Zock |
1151 |
Developing text resources for ten South African languages |
Roald Eiselen and Martin Puttkammer |
1153 |
A Bayesian Model for Multi-Annotator Document Labeling |
Robbie Haertel, Paul Felt, Eric Ringger and Kevin Seppi |
1154 |
Automatic Detection of Narrative Structure |
Jessica Ouyang and Kathy McKeown |
1155 |
OpenLogos Semantico-Syntactic Knowledge-Rich Bilingual Dictionaries |
Anabela Barreiro, Fernando Batista, Ricardo Ribeiro and Isabel Trancoso |
1156 |
Using Large Biomedical Databases as Gold Annotations for Automatic Relation Extraction |
Tilia Ellendorff, Fabio Rinaldi and Simon Clematide |
1157 |
Crowdsourcing for the identification of event nominals: an experiment |
Rachele Sprugnoli |
1158 |
Automatic Annotation of Syntactic Categories in Chinese Word Structures |
Jianqiang Ma |
1159 |
Incorporating alternate translations into English Translation Treebank |
Ann Bies, Justin Mott, Seth Kulick, Jennifer Garland and Colin Warner |
1161 |
Towards Compiling a large scale three-way Egyptian Arabic Dictionary |
Mona Diab, Mohamed AlBadrashiny, Maryam Aminian, Mohammed Attia, Heba Elfardy, Nizar Habash and Abdelati Hawwari |
1163 |
A SKOS-based Schema for TEI encoded Dictionaries at ICLTT |
Thierry Declerck, Karlheinz Mörth and Eveline Wandl-Vogt |
1164 |
Using LDA Topic Modeling to Quantify Idiomaticity: A Pilot Study |
Lurdes Ruela, Anna Feldman and Jing Peng |
1166 |
Semantic Technologies for Querying Linguistic Annotations: An Experiment Focusing on Graph-Structured Data |
Milen Kouylekov and Stephan Oepen |
1167 |
Eliciting and Annotating Uncertainty in Spoken Language |
Heather Pon-Barry |
1168 |
A hierarchical taxonomy for classifying hardness of inference tasks |
Martin Gleize and Brigitte Grau |
1169 |
Automatic Methods for the Extension of a Bilingual Dictionary using Comparable Corpora |
Michael Rosner and Kurt Sultana |
1170 |
A Method for Building Burst-Annotated Co-Occurrence Networks for Analysing Trends in Textual Data |
Yutaka Mitsuishi, Vit Novacek and Pierre-Yves Vandenbussche |
1174 |
Casa de la Lhéngua: a set of language resources and natural language processing tools for Mirandese |
José Pedro Ferreira, Cristiano Chesi, Daan Baldewijns, Fernando Miguel Pinto, Margarita Correia, Daniela Braga, Hyongsil Cho, Amadeu Ferreira and Miguel Dias |
1176 |
Untrained Forced Alignment of Transcriptions and Audio for Language Documentation Corpora using WebMAUS |
Jan Strunk, Florian Schiel and Frank Seifart |
1179 |
MultiVal - Towards a multilingual valence lexicon |
Lars Hellan, Dorothee Beermann, Tore Bruland, Mary Esther Kropp Dakubu and Montserrat Marimon |
1182 |
Global Intelligent Content: Active Curation of Language Resources using Linked Data |
David Lewis, Rob Brennan, Leroy Finn, Dominic Jones, Alan Meehan, Declan O'Sullivan, Sebastian Hellmann and Felix Sasaki |
1183 |
On the Romance Languages Mutual Intelligibility |
Liviu Dinu and Alina Maria Ciobanu |
1184 |
Aggregation methods for efficient collocation detection |
Anca Dinu, Liviu Dinu and Ionut Sorodoc |
1185 |
Annotating Inter-Sentence Temporal Relations based on Linguistic Relations |
Jennifer D'Souza and Vincent Ng |
1189 |
Terminology localization guidelines in the national scenario |
Juris Borzovs, Ilze Ilziņa, Iveta Keiša, Mārcis Pinnis and Andrejs Vasiļjevs |
1193 |
Teenage and adult speech in school context: building and processing a corpus of European Portuguese |
Ana Isabel Mata, Helena Moniz, Fernando Batista and Julia Hirschberg |
1194 |
A Unified Annotation Scheme for the Semantic/Pragmatic Components of Definiteness |
Archna Bhatia, Mandy Simons, Lori Levin, Yulia Tsvetkov, Chris Dyer and Jordan Bender |
1195 |
Aligning Predicate-Argument Structures for Paraphrase Fragment Extraction |
Michaela Regneri and Rui Wang |
1197 |
An evaluation of the role of statistical measures and frequency for MWE identification |
Sandra Antunes and Amália Mendes |
1198 |
Reliable human MT evaluation metric with high inter-annotator agreement |
Chi-kiu Lo and Dekai Wu |
1199 |
Dual Subtitles as Parallel Corpora |
Shikun Zhang, Wang Ling and Chris Dyer |
1200 |
Using a machine learning model to assess the complexity of stress systems |
Liviu Dinu, Alina Maria Ciobanu, Ioana Chitoran and Vlad Niculae |
1201 |
Prosodic, syntactic, semantic guidelines for topic structures across domains and corpora |
Ana Isabel Mata, Helena Moniz, Telmo Móia, Anabela Gonçalves, Fátima Silva, Fernando Batista, Inês Duarte, Fátima Oliveira and Isabel Falé |
1203 |
Collaborative Machine-Assisted Corpus-Dictionary Linkage |
Kevin Black, Eric Ringger, Paul Felt, Kevin Seppi, Kristian Heal and Deryle Lonsdale |
1207 |
Finite-state morphological transducers for three Kypchak languages |
Jonathan Washington, Ilnar Salimzyanov and Francis Tyers |
1211 |
The Corpus of Polish Manual Extractive and Abstractive Summaries |
Maciej Ogrodniczuk and Mateusz Kopeć |
1212 |
GlobalPhone: Pronunciation Dictionaries in 20 Languages |
Tanja Schultz |
1213 |
Pre-ordering of phrase-based machine translation input in translation workflow |
Alexandru Ceausu |
1214 |
A Vector Space Model for Syntactic Distances Between Dialects |
Emanuele Di Buccio, Giorgio Maria Di Nunzio and Gianmaria Silvello |
1216 |
A finite-state morphological analyzer for a Lakota HPSG grammar |
Christian Curtis |
1217 |
A Wikipedia-based Corpus for Contextualized Machine Translation |
and Pushpendre Rastogi |
1218 |
Alignments and other Semantic Web technologies to exploit Lexical Linked Data. The example of three ANR projects |
David Rouquet |
1219 |
Part-of-Speech Tagging for Resource-Poor Languages using Interlinear Glossed Text |
Ryan Georgi, Fei Xia and William Lewis |
1222 |
Annotating Events in an Emotion Corpus |
Sophia Lee, Shoushan Li and Chu-Ren Huang |
1226 |
Statistical Analysis of Multilingual Text Corpus and Development of Language Models |
Shyam Sundar Agrawal, Abhimanue, Shweta Bansal and Minakshi Mahajan |
1227 |
LDC activities |
Christopher Cieri |