Dr. YANG JANET LIU

🏢 213, Akademiestraße 7 München, Bavaria 80799, DEU

prof_pic.jpg

📷 in Dubrovnik, Croatia 🇭🇷

May 2023 🌇 sunset lover 🎧

I am currently a Postdoctoral Researcher at the MaiNLP research lab at the Center for Information and Language Processing (CIS) at LMU Munich led by Prof. Dr. Barbara Plank. I am also affiliated with the Munich Center for Machine Learning (MCML).

I obtained my Ph.D. in Computational Linguistics from the Department of Linguistics at Georgetown University, where I was advised by Amir Zeldes, Ph.D. and was a member of Corpling@GU and Computational Linguistics @ Georgetown (GUCL). I was also a student research affiliate of NERT, directed by Nathan Schneider, Ph.D.

research interests involve:

  • tackling text variation in NLP (broadly construed)
  • studying model internals for discourse-level linguistic phenomena and generalization
  • discourse-level linguistic phenomena across genres using computational, statistical, and corpus-based methods
  • NLP applications involving discourse structure and understanding
  • cross-framework discourse understanding and unifying discourse resources (co-organizer of the DISRPT shared task)
  • multilingual annotation projects involving discourse-level phenomena

📧 yliu [@] cis [dot] lmu [dot] de

news

Jun 23, 2025 🛎️ new preprint on examing the role and impact of referemce set choice on summarization metrics!
Jun 03, 2025 🛎️ presented our ACL 2025 paper on discourse generalization at the CIS PhD seminar at LMU Munich
May 27, 2025 🏰 invited talk at the Computational Linguistics Colloquium of Heidelberg University and the Heidelberg Institute for Theoretical Studies
May 16, 2025 🛎️ 2 papers accepted to ACL 2025 (main) 🎶 See y’all in Vienna, Austria 🇦🇹

selected publications

  1. ACL
    Probing LLMs for Multilingual Discourse Generalization Through a Unified Label Set
    Florian Eichin*, Yang Janet Liu*, Barbara Plank, and Michael A. Hedderich
    In Proceedings of the 63nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025
    (*equal contribution)
  2. EMNLP
    GDTB: Genre Diverse Data for English Shallow Discourse Parsing across Modalities, Text Types, and Domains
    Yang Janet* Liu, Tatsuya * Aoyama, Wesley* Scivetti, Yilun* Zhu, Shabnam Behzad, and 4 more authors
    In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, Nov 2024
    (*equal contribution)
  3. SIGDIAL
    What’s Hard in RST Parsing? Predictive Models for Error Analysis
    Yang Janet Liu, Tatsuya Aoyama, and Amir Zeldes
    In Proceedings of the 24th Meeting of the Special Interest Group on Discourse and Dialogue, Sep 2023
  4. Findings
    GUMSum: Multi-Genre Data and Evaluation for English Abstractive Summarization
    Yang Janet Liu and Amir Zeldes
    In Findings of the Association for Computational Linguistics: ACL 2023, Jul 2023
  5. EACL
    Why Can’t Discourse Parsing Generalize? A Thorough Investigation of the Impact of Data Diversity
    Yang Janet Liu and Amir Zeldes
    In Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, May 2023