SIGTYP -- Workshop 2022 Schedule

SIGTYP2022 — July 14th — 508-Tahuya/Hybrid

We kindly invite everyone to join the virtual part of SIGTYP 2022! Below you can explore papers, slides, and recorded talks. All discussions take place in our Rocket.Chat, where each paper has its own discussion channel. Our Rocket.Chat and Google Group also list a single Zoom link that will be used on the day of the workshop.
SIGTYP2022 Proceedings are now available here and on the ACL Anthology website.
The Best Paper Award goes to "Typological Word Order Correlations with Logistic Brownian Motion"! Congratulations!

Time zone: America/Seattle

8:30 – 8:40 Opening Session

By SIGTYP2022 Organizing Committee

Opening remarks: the SIGTYP 2022 workshop, SIGTYP development, MRL and FieldMatters 2022! Slides are available here.

✻ Keynote Talk ✻

8:40 – 9:30 Kristen Howell: Grammar Inference for Local Languages. Leveraging Typology for Automatic Grammar Generation

Kristen Howell is a data scientist at LivePerson Inc. in Seattle, Washington. Her research interests range from grammar engineering and grammar inference to conversational NLP. Throughout this research, the common thread is multilingual NLP across typologically diverse languages. Kristen received her PhD from the University of Washington in 2020, where she engaged with typological literature to develop technology for automatically generating grammars for local languages. Recent work at LivePerson has focused on multilingual NLP, leveraging deep learning techniques for conversational AI.
Abstract: In this talk Kristen will describe the benefits of implemented grammars as well as the challenges involved in creating them. She will present an inference system that can be used to automatically generate such grammars on the basis of interlinear glossed text (IGT) corpora. The inference system, called BASIL (Building Analyses from Syntactic Inference in Local Languages), leverages typologically informed heuristics to infer syntactic and morphological information from linguistic corpora and to select analyses that model the language. She will engage with the question of whether and to what extent typological features are apparent in IGT data and how effectively grammars generated with these features can model human language.


✻ Multilingual Representations (Long Talks) ✻

09:30 – 09:45 Multilingualism Encourages Recursion: a Transfer Study with mBERT

By Andrea Gregor De Varda and Roberto Zamparelli

The present work constitutes an attempt to investigate the relational structures learnt by mBERT, a multilingual transformer-based network, with respect to different cross-linguistic regularities proposed in the fields of theoretical and quantitative linguistics. We pursued this objective by relying on a zero-shot transfer experiment, evaluating the model's ability to generalize its native task to artificial languages that could either respect or violate some proposed language universal, and comparing its performance to the output of BERT, a monolingual model with an identical configuration. We created four artificial corpora through a Probabilistic Context-Free Grammar by manipulating the distribution of tokens and the structure of their dependency relations. We showed that while both models were favoured by a Zipfian distribution of the tokens and by the presence of head-dependency type structures, the multilingual transformer network exhibited a stronger reliance on hierarchical cues compared to its monolingual counterpart.
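As a rough illustration of what sampling an artificial corpus from a Probabilistic Context-Free Grammar with Zipf-distributed tokens could look like, here is a minimal Python sketch. The toy grammar, category names, vocabulary size, and exponent are all invented for illustration; they are assumptions and not the authors' actual setup.

```python
import random

# Hypothetical toy PCFG: nonterminal -> list of (right-hand side, probability).
# These rules are illustrative only, not the grammar used in the paper.
TOY_PCFG = {
    "S": [(("NP", "VP"), 1.0)],
    "NP": [(("N",), 0.7), (("N", "PP"), 0.3)],
    "VP": [(("V",), 0.5), (("V", "NP"), 0.5)],
    "PP": [(("P", "NP"), 1.0)],
}


def zipf_weights(n, exponent=1.0):
    """Unnormalised Zipfian weights: the token at rank r gets 1 / r**exponent."""
    return [1.0 / (rank ** exponent) for rank in range(1, n + 1)]


def make_lexicon(categories, vocab_size):
    """Build artificial word forms such as 'n0', 'n1', ... per lexical category."""
    return {cat: [f"{cat.lower()}{i}" for i in range(vocab_size)] for cat in categories}


def sample(symbol, grammar, lexicon, weights, rng, depth=0, max_depth=20):
    """Recursively expand `symbol` into a list of tokens."""
    if symbol in lexicon:
        # Lexical category: draw one word with Zipfian probabilities.
        return [rng.choices(lexicon[symbol], weights=weights, k=1)[0]]
    rules = grammar[symbol]
    if depth >= max_depth:
        # Safety guard: pick the shortest expansion to force termination.
        rhs = min((r for r, _ in rules), key=len)
    else:
        rhs = rng.choices([r for r, _ in rules], weights=[p for _, p in rules], k=1)[0]
    tokens = []
    for child in rhs:
        tokens.extend(sample(child, grammar, lexicon, weights, rng, depth + 1, max_depth))
    return tokens


def generate_corpus(n_sentences, vocab_size=10, seed=0):
    """Sample `n_sentences` sentences from the toy grammar with a fixed seed."""
    rng = random.Random(seed)
    lexicon = make_lexicon(["N", "V", "P"], vocab_size)
    weights = zipf_weights(vocab_size)
    return [sample("S", TOY_PCFG, lexicon, weights, rng) for _ in range(n_sentences)]


if __name__ == "__main__":
    for sentence in generate_corpus(5):
        print(" ".join(sentence))
```

A control corpus that violates the Zipfian assumption could be obtained in this sketch by passing uniform weights instead of `zipf_weights(vocab_size)`; manipulating the dependency structure would require changing the rules themselves.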

✻ Keynote Talk ✻

✻ Multilingual Representations (Long Talks) ✻

✻ Typology (Short Talks) ✻

✻ Keynote Talk ✻

✻ Shared Task Session ✻

✻ Linguistic Trivia ✻

✻ Keynote Talk ✻

✻ Databases and Corpora ✻