SIGTYP -- Workshop 2020 Schedule

❦ SIGTYP2020 ✻ Nov, 19th ✻ ONLINE ❦

Time zone: America/New_York

8:30 – 8:40 Opening Session (13:30 -- 13:40 UTC)

By SIGTYP2020 Organizing Committee

Opening remarks: general comments about SIGTYP development, SIGTYP2020 submissions, shared task, etc.

✻ Keynote Talks ✻

8:40 – 9:20 Richard Sproat: Taxonomy of Writing Systems: How to Measure How Logographic a System is. Live Q&A Session: 9:20 – 9:30

Richard is a computational linguist and a Research Scientist at Google, formerly in New York, now in Tokyo. From January, 2009, through October 2012, he was a professor at the Center for Spoken Language Understanding at the Oregon Health and Science University. At Google he has been mostly working on text normalization, where his former group has been developing various machine learning approaches to the problem of normalizing non-standard words in text and he has been particularly interested in the promise (and limitations) of approaches using recurrent neural nets. As of September 2019, he has moved to Google Tokyo, and is working on end-to-end speech understanding. Richard continues to maintain some "side-bar interests" including computational models of the early evolution of writing, the statistical properties of non-linguistic symbol systems, and collaborating on a translation of Wolfgang von Kempelen's Mechanismus der menschlichen Sprache, which was published in 2017. In this talk, Richard will present his most recent joint work with Alexander Gutkin on taxonomy of writing systems and computational approaches to evaluation of how logographic a system is.

Slides Slideslive Talk RocketChat Richard's Website

Discussion

9:30 – 10:10 Miriam Butt: Building Resources: Language Comparison and Analysis. Live Q&A Session: 10:10 – 10:20

Miriam Butt is Professor of Linguistics at the Department of Linguistics at the University of Konstanz. Currently, Miriam is concentrating on the history and distribution of case and complex predicates in South Asian languages. She is also interested in issues of grammar architecture and investigate interface issues (syntax-semantics, morphology-syntax/semantics, prosody-syntax) from both a theoretical and a computational perspective.

Slideslive Talk RocketChat Miriam's Webpage

Discussion

10:20 – 10:30 Coffee Break

Access the venue (Gather.Town) https://www.virtualchair.net/events/emnlp2020

✻ Shared Task Session ✻

10:30 – 10:45 Johannes Bjerva: Shared Task Overview

By Johannes Bjerva, Elizabeth Salesky, Sabrina J. Mielke, Aditi Chaudhary, Celano Giuseppe, Edoardo Maria Ponti, Ekaterina Vylomova, Ryan Cotterell and Isabelle Augenstein

Typological knowledge bases (KBs) such as WALS (Dryer and Haspelmath, 2013) contain information about linguistic properties of the world's languages. They have been shown to be useful for downstream applications, including cross-lingual transfer learning and linguistic probing. A major drawback hampering broader adoption of typological KBs is that they are sparsely populated, in the sense that most languages only have annotations for some features, and skewed, in that few features have wide coverage. As typological features often correlate with one another, it is possible to predict them and thus automatically populate typological KBs, which is also the focus of this shared task. Overall, the task attracted 8 submissions from 5 teams, out of which the most successful methods make use of such feature correlations. However, our error analysis reveals that even the strongest submitted systems struggle with predicting feature values for languages where few features are known.

✻ Keynote Talks ✻

✻ Shared Task Session ✻

✻ Keynote Talks ✻

✻ Oral Session 1 ✻

✻ Findings 1 ✻

✻ Oral Session 2 ✻

✻ Findings 2 ✻

✻ Keynote Talks ✻