publications

2025

  1. preprint
    Evaluation Should Not Ignore Variation: On the Impact of Reference Set Choice on Summarization Metrics
    Silvia Casola*, Yang Janet Liu*, Siyao Peng*, Oliver Kraus, Albert Gatt, and 1 more author
    2025
    (*equal contribution)
  2. preprint
    Threading the Needle: Reweaving Chain-of-Thought Reasoning to Explain Human Label Variation
    Beiduo Chen, Yang Janet Liu, Anna Korhonen, and Barbara Plank
    2025
  3. ACL
    Probing LLMs for Multilingual Discourse Generalization Through a Unified Label Set
    Florian Eichin*, Yang Janet Liu*, Barbara Plank, and Michael A. Hedderich
    In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025
    (*equal contribution)
  4. ACL
    Pragmatics in the Era of Large Language Models: A Survey on Datasets, Evaluation, Opportunities and Challenges
    Bolei Ma*, Yuting Li*, Wei Zhou*, Ziwei Gong*, Yang Janet Liu, and 5 more authors
    In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025
    (*equal contribution)

2024

  1. EMNLP
    GDTB: Genre Diverse Data for English Shallow Discourse Parsing across Modalities, Text Types, and Domains
    Yang Janet Liu*, Tatsuya Aoyama*, Wesley Scivetti*, Yilun Zhu*, Shabnam Behzad, and 4 more authors
    In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, Nov 2024
    (*equal contribution)
  2. CL Journal
    eRST: A Signaled Graph Theory of Discourse Relations and Organization
    Amir Zeldes, Tatsuya Aoyama, Yang Janet Liu, Siyao Peng, Debopam Das, and 1 more author
    Computational Linguistics, Sep 2024
  3. LREC-COLING
    DISRPT: A Multilingual, Multi-domain, Cross-framework Benchmark for Discourse Processing
    Chloé Braud, Amir Zeldes, Laura Rivière, Yang Janet Liu, Philippe Muller, and 2 more authors
    In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), May 2024

2023

  1. SIGDIAL
    What’s Hard in RST Parsing? Predictive Models for Error Analysis
    Yang Janet Liu, Tatsuya Aoyama, and Amir Zeldes
    In Proceedings of the 24th Meeting of the Special Interest Group on Discourse and Dialogue, Sep 2023
  2. INTERSPEECH
    Lightweight and Efficient Spoken Language Identification of Long-form Audio
    Winstead Zhu*, Md Iftekhar Tanveer*, Yang Janet Liu*, Seye Ojumu, and Rosie Jones
    In Proc. INTERSPEECH 2023, Sep 2023
    (*equal contribution)
  3. Findings
    GUMSum: Multi-Genre Data and Evaluation for English Abstractive Summarization
    Yang Janet Liu and Amir Zeldes
    In Findings of the Association for Computational Linguistics: ACL 2023, Jul 2023
  4. CODI-DISRPT
    The DISRPT 2023 Shared Task on Elementary Discourse Unit Segmentation, Connective Detection, and Relation Classification
    Chloé Braud, Yang Janet Liu, Eleni Metheniti, Philippe Muller, Laura Rivière, and 2 more authors
    In Proceedings of the 3rd Shared Task on Discourse Relation Parsing and Treebanking (DISRPT 2023), Jul 2023
  5. LAW
    GENTLE: A Genre-Diverse Multilayer Challenge Set for English NLP and Linguistic Evaluation
    Tatsuya Aoyama, Shabnam Behzad, Luke Gessler, Lauren Levine, Jessica Lin, and 4 more authors
    In Proceedings of the 17th Linguistic Annotation Workshop (LAW-XVII), Jul 2023
  6. EACL
    Why Can’t Discourse Parsing Generalize? A Thorough Investigation of the Impact of Data Diversity
    Yang Janet Liu and Amir Zeldes
    In Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, May 2023

2022

  1. AACL
    GCDT: A Chinese RST Treebank for Multigenre and Multilingual Discourse Parsing
    Siyao Peng, Yang Janet Liu, and Amir Zeldes
    In Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing (Volume 2: Short Papers), Nov 2022
  2. LAW
    Putting Context in SNACS: A 5-Way Classification of Adpositional Pragmatic Markers
    Yang Janet Liu, Jena D. Hwang, Nathan Schneider, and Vivek Srikumar
    In Proceedings of the 16th Linguistic Annotation Workshop (LAW-XVI) within LREC2022, Jun 2022

2021

  1. CODI-DISRPT
    The DISRPT 2021 Shared Task on Elementary Discourse Unit Segmentation, Connective Detection, and Relation Classification
    Amir Zeldes, Yang Janet Liu, Mikel Iruskieta, Philippe Muller, Chloé Braud, and 1 more author
    In Proceedings of the 2nd Shared Task on Discourse Relation Parsing and Treebanking (DISRPT 2021), Nov 2021
  2. CODI-DISRPT
    DisCoDisCo at the DISRPT2021 Shared Task: A System for Discourse Segmentation, Classification, and Connective Detection
    Luke Gessler, Shabnam Behzad, Yang Janet Liu, Siyao Peng, Yilun Zhu, and 1 more author
    In Proceedings of the 2nd Shared Task on Discourse Relation Parsing and Treebanking (DISRPT 2021), Nov 2021

2020

  1. LREC
    AMALGUM – A Free, Balanced, Multilayer English Web Corpus
    Luke Gessler, Siyao Peng, Yang Janet Liu, Yilun Zhu, Shabnam Behzad, and 1 more author
    In Proceedings of the Twelfth Language Resources and Evaluation Conference, May 2020
  2. D&D
    A Neural Approach to Discourse Relation Signal Detection
    Amir Zeldes and Yang Janet Liu
    Dialogue and Discourse, May 2020
  3. LREC
    A Corpus of Adpositional Supersenses for Mandarin Chinese
    Siyao Peng, Yang Janet Liu, Yilun Zhu, Austin Blodgett, Yushi Zhao, and 1 more author
    In Proceedings of the Twelfth Language Resources and Evaluation Conference, May 2020

2019

  1. DISRPT
    Beyond the Wall Street Journal: Anchoring and Comparing Discourse Signals across Genres
    Yang Janet Liu
    In Proceedings of the Workshop on Discourse Relation Parsing and Treebanking 2019, Jun 2019
  2. DISRPT
    A Discourse Signal Annotation System for RST Trees
    Luke Gessler, Yang Janet Liu, and Amir Zeldes
    In Proceedings of the Workshop on Discourse Relation Parsing and Treebanking 2019, Jun 2019
  3. DISRPT
    GumDrop at the DISRPT2019 Shared Task: A Model Stacking Approach to Discourse Unit Segmentation and Connective Detection
    Yue Yu, Yilun Zhu, Yang Janet Liu, Yan Liu, Siyao Peng, and 2 more authors
    In Proceedings of the Workshop on Discourse Relation Parsing and Treebanking 2019, Jun 2019
  4. MS THESIS
    Signaling of Discourse Relations: Anchoring Discourse Signals across Genres
    Yang Janet Liu
    Georgetown University, Jun 2019
  5. SCiL
    Discourse Relations and Signaling Information: Anchoring Discourse Signals in RST-DT
    Yang Janet Liu and Amir Zeldes
    In Proceedings of the Society for Computation in Linguistics, Jun 2019
  6. SCiL
    Adpositional Supersenses for Mandarin Chinese
    Yilun Zhu, Yang Janet Liu, Siyao Peng, Austin Blodgett, Yushi Zhao, and 1 more author
    In Proceedings of the Society for Computation in Linguistics, Jun 2019

2017

  1. LSA
    Scalar Implicature in Chitonga-Speaking Children
    Jodi Reich, Kelly Nedwick, Teodora Niculae-Caxi, Yang Janet Liu, and Elena L Grigorenko
    In Proceedings of the Linguistic Society of America, Jun 2017