Dr. YANG JANET LIU

🏢 2831 Cathedral of Learning, University of Pittsburgh, PA, USA

prof_pic.jpg

📷 in Dubrovnik, Croatia 🇭🇷

May 2023 🌇 sunset lover 🎧

I am an Assistant Professor at the Department of Linguistics at University of Pittsburgh. I was a Postdoctoral Researcher at the MaiNLP research lab at the Center for Information and Language Processing (CIS) at LMU Munich led by Prof. Dr. Barbara Plank. I was also affiliated with the Munich Center for Machine Learning (MCML).

I obtained my Ph.D. in Computational Linguistics from the Department of Linguistics at Georgetown University, where I was advised by Amir Zeldes, Ph.D. and was a member of Corpling@GU and Computational Linguistics @ Georgetown (GUCL). I was also a student research affiliate of NERT, directed by Nathan Schneider, Ph.D.

Research Interests

My research focuses on computational approaches to discourse-level phenomena across text types, including how language models encode discourse information or generate coherent text, how to evaluate models on discourse-level tasks like summarization, and how to account for linguistic or label variation (broadly construed).

note to prospective students: I’m always looking for motivated students! Please see the “working with me” page for more info.

📧 jal787 [@] pitt [dot] edu

news

Feb 13, 2026 🎤 invited talk at the Intelligent Systems Program Forum at the University of Pittsburgh
Jan 23, 2026 🎤 invited talk at the Language Technologies Institute Colloquium at Carnegie Mellon University
Jan 21, 2026 🛎️ 1 paper titled “Which course? Discourse! Teaching Discourse and Generation in the Era of LLMs” accepted to the TeachingNLP workshop, co-located with EACL 2026
Oct 10, 2025 🎉 successfully organized the First Workshop on Bridging NLP and Public Opinion Research with my amazing co-organizers at COLM 2025 in Montreal, Canada!

selected publications

  1. INLG
    References Matter: Investigating the Impact of Reference Set Variation on Summarization Evaluation
    Silvia Casola*, Yang Janet Liu*, Siyao Peng*, Oliver Kraus, Albert Gatt, and 1 more author
    In Proceedings of the 18th International Natural Language Generation Conference, Oct 2025
    (*equal contribution)
  2. ACL
    Probing LLMs for Multilingual Discourse Generalization Through a Unified Label Set
    Florian Eichin*, Yang Janet Liu*, Barbara Plank, and Michael A. Hedderich
    In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Jul 2025
    (*equal contribution)
  3. EMNLP
    GDTB: Genre Diverse Data for English Shallow Discourse Parsing across Modalities, Text Types, and Domains
    Yang Janet* Liu, Tatsuya * Aoyama, Wesley* Scivetti, Yilun* Zhu, Shabnam Behzad, and 4 more authors
    In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, Nov 2024
    (*equal contribution)
  4. SIGDIAL
    What’s Hard in RST Parsing? Predictive Models for Error Analysis
    Yang Janet Liu, Tatsuya Aoyama, and Amir Zeldes
    In Proceedings of the 24th Meeting of the Special Interest Group on Discourse and Dialogue, Sep 2023
  5. Findings
    GUMSum: Multi-Genre Data and Evaluation for English Abstractive Summarization
    Yang Janet Liu and Amir Zeldes
    In Findings of the Association for Computational Linguistics: ACL 2023, Jul 2023
  6. EACL
    Why Can’t Discourse Parsing Generalize? A Thorough Investigation of the Impact of Data Diversity
    Yang Janet Liu and Amir Zeldes
    In Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, May 2023