Apologies for cross-posting

2nd Workshop on Resources and Technologies for Indigenous, Endangered and Lesser-resourced Languages in Eurasia (EURALI) @ LREC-COLING 2024

Date: 20-25 May, 2024

Venue: Lingotto Conference Centre - Torino (Italia)

Main website: https://sites.google.com/view/eurali/

LREC-COLING 2024 website: https://lrec-coling-2024.org/

——————————————————————————————————

Workshop overview and objectives

This workshop will focus on the development of language technology resources and tools for indigenous, endangered and lesser-resourced languages on the Eurasian continent.

In a media-centric world where language technology allows people to break cultural and language barriers, it is important that speakers of endangered and indigenous languages can be empowered to use this technology to continue to share their knowledge and culture with the world. With the hope of bridging this gap, the goal of this workshop is to increase visibility and promote research for lesser-resourced and underrepresented languages in Europe and Asia. Through collaboration between NLP researchers, language experts and linguists working for the benefit of endangered languages in these communities, we aim to create language technology resources that will help to preserve and revive these languages for future generations. Furthermore, the workshop aims to promote the emergence of new methods that benefit linguists, for instance for automation of analysis and validation processes, field linguists, the facilitation of data collection and analysis processes, and computational linguists by developing new techniques necessary for linguistic analysis, development of supervised or weakly supervised methods for the analysis of poorly written or undocumented languages.

The main objective of the workshop is to create basic resources and develop tools for Eurasiatic languages, including but not limited to the following topics:

identifying languages and variants spoken in these regions
creation of language resources and applications, e.g. sentiment analysis, named entity recognition, and syntactic parsing
standardization for endangered languages
automatic identification and classification of lexical variation and language varieties
adaptation of fundamental NLP tools for these languages, e.g., morphological analysis, taggers and parsers
reusability of language resources in NLP applications, e.g. machine translation, and POS tagging
machine translation between closely related languages

evaluation of language resources and tools when applied to lesser-resourced languages in the same language families

corpora, resources, and tools for closely related languages
linguistic and textual similarities among languages in Eurasia
digitalization of endangered languages
challenges in the creation of language resources and tools from linguistic perspectives (which includes any perspective formal theory)

Submissions

We are seeking submissions under the following category:

Full papers: 8 pages+unlimited reference

Short papers (work in progress): 4 pages+unlimited reference

Posters (innovative ideas/proposals, a research idea of students): 4 pages+unlimited reference

Demo (of working online/standalone systems): 2 pages

Papers must describe original, completed or in progress, and unpublished work. The accepted papers will be given up for full/short paper and poster in the workshop proceedings and will be presented as an oral presentation or poster.

Papers should be formatted according to the LREC-COLING style sheet (https://lrec-coling-2024.org/authors-kit/), which is provided on the LREC-COLING 2024 website(https://lrec-coling-2024.org/). Please submit papers in PDF format to the START account (the submission link will be available soon). For further information on this initiative, please refer to the https://sites.google.com/view/eurali/.

Important Dates (tentative)

February 23, 2024: Paper submissions due

March 22, 2024: Paper notification of acceptance

May 20-25, 2024: Workshop

Workshop Chair:

Atul Kr. Ojha, Sina Ahmadi,

Chao-Hong Liu, Potamu Research Ltd, Dublin (Ireland)

John P. McCrae, University of Galway, Galway (Ireland)

Theodorus Fransen, Università Cattolica del Sacro Cuore, Milan (Italy)

Silvie Cinková, Charles University, Prague (Czech Republic)

Programme Committee (to be updated):

Abigail Walsh*, Dublin City University, Dublin (Ireland)

Agata Savary, University of Paris-Saclay, Paris-Saclay (France)

A. Seza Doğruöz, Ghent University, Ghent (Belgium)

Alina Karakanta, University of Leiden, Leiden (Netherlands)

Alina Wróblewska, Institute of Computer Science, Jana Kazimierza, Warszawa (Poland)

Akanksha Bansal, Panlingua, Delhi (India)

Anabela Barreiro*, INESC-ID, Lisboa (Portugal)

Atul Kr. Ojha, University of Galway, Galway (Ireland) & Panlingua, (India)

Bharathi Raja Chakravarthi, University of Galway, Galway (Ireland)

Bogdan Babych, Heidelberg University, Heidelberg (Germany)

Chao-Hong Liu, Potamu Research Ltd, Dublin (Ireland)

Daan van Esch, Google, Amsterdam (Netherlands)

Daniel Zeman, Charles University, Prague (Czech Republic)

Deepak Alok, IIT-Delhi, Delhi (India)

Dorothee Beermann, Norwegian University of Science and Technology, Trøndelag (Norway)

Esha Banerjee, J.P. Morgan, Bengaluru (India)

Ekaterina Vylomova, University of Melbourne, Melbourne (Australia)

George Rehm, GmbH, Berlin (Germany)

Jamal Abdul Nasir, University of Galway, Galway (Ireland)

Joakim Nivre, Uppsala University, (Sweden)

John P. McCrae, University of Galway, (Ireland)

Jonathan Washington, Swarthmore College, Swarthmore (USA)

Joseph Mariani, LIMSI-CNRS, Pairs (France)

Kaja Dobrovoljc, University of Ljubljana, Ljubljana (Slovenia)

Katharina Kann*, University of Colorado at Boulder, USA

Kevin Patrick Scannell, Cadhan Aonair, LLC, Missouri (USA)

Khalid Choukri, ELDA/ELRA, Paris (France)

Marie-Catherine de Marneffe, UCLouvainCollège Léon Durpiez, (Belgium)

Massimo Monaglia, University of Florence, (Italy)

Nicoletta Calzolari, CNR-ILC, (Italy)

Olesea Caftanatov, Vladimir Andrunachievici Institute of Mathematics and Computer Science, Chişinău (Moldova)

Richard Sproat, Google, Tokyo (Japan)

Rico Sennrich, University of Zurich, Zurich (Switzerland)

Ritesh Kumar, Agra University, Agra (India)

Saliha Muradoglu, Australian National University, Canberra (Australia)

Silvie Cinková, Charles University, Prague (Czech Republic)

Sina Ahmadi, George Mason University, (USA)

Stella Markantonatou, Athena RC, Athens (Greece)

Sourabrata Mukherjee, Charles University, Prague (Czech Republic)

Sunipa Dev, Google, Washington (USA)

Theodorus Fransen, Università Cattolica del Sacro Cuore, Milan (Italy)

Valentin Malykh, MTS AI / ITMO University

Verginica Barbu Mititelu, Research Institute for Artificial Intelligence, Bucharest (Romania)

Voula Giouli, Institute for Language and Speech Processing, Athens (Greece)