NLP research by & for local communities

Episode: 207 of 325
Length: 36 min
Language: English
Category: Non-fiction

While at EMNLP 2022, Daniel got a chance to sit down with an amazing group of researchers creating NLP technology that actually works for their local language communities. Just Zwennicker (Universiteit van Amsterdam) discusses his work on a machine translation system for Sranan Tongo, a creole language spoken in Suriname. Andiswa Bukula (SADiLaR), Rooweither Mabuya (SADiLaR), and Bonaventure Dossou (Lanfrica, Mila) discuss their work with Masakhane to strengthen and spur NLP research in African languages, for Africans, by Africans.

The group emphasized the need for more linguistically diverse NLP systems that work in scenarios of data scarcity, non-Latin scripts, rich morphology, etc. You don’t want to miss this one!

Featuring:

• Just Zwennicker – LinkedIn
• Andiswa Bukula – X
• Rooweither Mabuya – X
• Bonaventure Dossou – Website, GitHub, LinkedIn, X
• Daniel Whitenack – Website, GitHub, X

Show Notes:

EMNLP 2022 papers from the guests:

• Towards a general purpose machine translation system for Sranantongo
• MasakhaNER 2.0: Africa-centric Transfer Learning for Named Entity Recognition
• AfroLM: A Self-Active Learning-based Multilingual Pretrained Language Model for 23 African Languages

Other links relevant to the discussion:

• Masakhane
• Lanfrica
• The South African Centre for Digital Language Resources (SADiLaR)

