YANG JANET LIU

prof_pic.jpg

I study Computational Linguistics at Georgetown University with the Department of Linguistics, where I’m advised by Amir Zeldes, Ph.D. and a member of Corpling@GU and Computational Linguistics @ Georgetown (GUCL). I also work on research with Nathan Schneider, Ph.D. as a student affiliate of NERT. I obtained my M.S. in Computational Linguistics from Georgetown University in May 2019.

My primary research interests are centered around discourse-level linguistic phenomena (i.e. transcending sentence boundaries) across genres using computational, statistical, and corpus-based methods. In addition, my work also involves the creation of discourse resources spanning different genres to inform model development and facilitate targeted evaluation. In addition, I have been working on initiatives for facilitating cross-framework discourse understanding and unifying discourse resources by co-organizing the Discourse Relation Parsing and Treebanking shared task. I have also contributed to multilingual annotation projects such as the development of the largest multi-genre RST treebank for Mandarin Chinese and the creation of the first Chinese corpus annotated with adposition semantics that makes parallel analysis possible.

Before coming to Georgetown, I majored in Linguistics at Temple University in Philadelphia, PA from 2015 to 2017, where I was a research assistant in Temple University’s Multilingual Research Group.

news

Feb 23, 2024 I passed my dissertation defense :laughing:
Dec 01, 2023 Invited talk (online) at Prof. Dr. Dirk Hovy’s MilaNLP Lab at Bocconi University 🇮🇹
Sep 25, 2023 Invited talk at Prof. Dr. Barbara Plank’s MaiNLP research lab at the Center for Information and Language Processing (CIS) at LMU in Munich, Germany about The Pivotal Role of Genres: Insights from English RST Parsing and Abstractive Summarization 🕺🏻
Sep 20, 2023 Invited talk at Prof. Dr. Manfred Stede’s Applied CL Discourse Lab at Universität Potsdam about English RST Parsing in Potsdam, Germany 🇩🇪
Jul 11, 2023 One paper accepted to SIGDIAL 2023 🙌🏼 See you in Prague 🇨🇿 in September
Jun 23, 2023 Area Chair of Discourse and Pragmatics at EMNLP 2023 🇸🇬
May 30, 2023 Started a Research Scientist internship at Spotify USA Inc. 🤠🕺🏻🎸🎧🥁
May 17, 2023 One paper accepted to INTERSPEECH 2023 in Dublin, Ireland 🇮🇪
May 02, 2023 One paper accepted to the Findings of ACL 2023 See y’all in Toronto, Canada 🇨🇦
Apr 22, 2023 Invited talk at MASC-SLL 2023 at Georgetown Mason University (Arlington campus) 🤠
Jan 21, 2023 One paper accepted to the 17th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2023) 🕺🏻 See y’all in Dubrovnik, Croatia 🇭🇷
Jan 11, 2023 Co-organizing the DISRPT2023 Shared Task on Discourse Segmentation, Connective and Relation Identification across Formalisms in conjunction with ACL2023 and the CODI2023 Workshop 🤠 More languages and discourse treebanks available 🙌🏼
Dec 14, 2022 Awarded a Fall 2023 GSAS Conference Travel Grant!
Dec 14, 2022 Awarded a Fall 2022 GSAS Conference Travel Grant and a Fall 2022 GSAS-GradGov Research Project Award!
Sep 21, 2022 One co-authored paper accepted at the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing (AACL-IJCNLP) 👏🏼
May 09, 2022 One paper on Adpositional Pragmatic Markers accepted at the 16th Lingusitic Annotation Workshop (LAW-XVI) Workshop, co-located with LREC 2022 in Marseille, France 🇫🇷
Nov 05, 2021 Passed my dissertation proposal defense ✌🏼
Jun 07, 2021 Started the Research Scientist, PhD - Summer Internship with the Lab in Language Technologies at Spotify Research 🎸🎧🥁
May 11, 2021 Passed the 2nd Qualifying Review ✌🏼
Feb 18, 2021 Co-organizing the DISRPT2021 Shared Task on Discourse Segmentation, Connective and Relation Identification across Formalisms in conjunction with EMNLP2021 and the CODI2021 Workshop 🤠
Dec 16, 2020 Done with PhD Coursework ✌🏼
Jul 01, 2020 A journal paper on detecting signals of discourse relations by Amir and I now published in Dialogue & Discourse 🕵🏻‍♀️
May 18, 2020 Started my first internship at Alexa AI @ Amazon as a Language Data Researcher Intern (VIRTUAL) 🔍
May 17, 2019 Happy Graduation 🎓 M.S. in Computational Linguistics, Georgetown University 🎉
Apr 15, 2019 Awarded a Spring 2019 GSAS Conference Travel Grant and a Spring 2019 GSAS-GradGov Research Project Award!

selected publications

  1. SIGDIAL
    What’s Hard in RST Parsing? Predictive Models for Error Analysis
    Yang Janet Liu, Tatsuya Aoyama , and Amir Zeldes
    In Proceedings of the 24th Meeting of the Special Interest Group on Discourse and Dialogue , Sep 2023
  2. ACL
    GUMSum: Multi-Genre Data and Evaluation for English Abstractive Summarization
    Yang Janet Liu, and Amir Zeldes
    In Findings of the Association for Computational Linguistics: ACL 2023 , Jul 2023
  3. EACL
    Why Can’t Discourse Parsing Generalize? A Thorough Investigation of the Impact of Data Diversity
    Yang Janet Liu, and Amir Zeldes
    In Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics , May 2023