Accepted Papers

Long papers

  • Beyond calories: evaluating how tailored communication reduces emotional load in diet-coaching
    Simone Balloccu, Ehud Reiter

  • Human evaluation of web-crawled parallel corpora for machine translation
    Gema Ramírez-Sánchez, Marta Bañón, Jaume Zaragoza-Bernabeu, Sergio Ortiz-Rojas

  • Vacillating Human Correlation of SacreBLEU in Unprotected Languages
    Ahrii Kim, Jinhyeon Kim

  • Human Judgement as a Compass to Navigate Automatic Metrics for Formality Transfer
    Huiyuan Lai, Jiali Mao, Antonio Toral, Malvina Nissim

  • A Study on Manual and Automatic Evaluation for Text Style Transfer: The Case of Detoxification
    Varvara Logacheva, Daryna Dementieva, Irina Krotova, Alena Fenogenova, Irina Nikishina, Tatiana Shavrina, Alexander Panchenko

  • Toward More Effective Human Evaluation for Machine Translation
    Belén C Saldías Fuentes, George Foster, Markus Freitag, Qijun Tan

  • The Human Evaluation Datasheet: A Template for Recording Details of Human Evaluation Experiments in NLP
    Anastasia Shimorina, Anya Belz

Short papers

  • A Methodology for the Comparison of Human Judgments With Metrics for Coreference Resolution
    Mariya Borovikova, Loïc Grobol, Anaïs Lefeuvre-Halftermeyer, Sylvie Billot

  • Perceptual Quality Dimensions of Machine-Generated Text with a Focus on Machine Translation
    Vivien Macketanz, Babak Naderi, Steven Schmidt, Sebastian Möller

  • Towards Human Evaluation of Mutual Understanding in Human-Computer Spontaneous Conversation: An Empirical Study of Word Sense Disambiguation for Naturalistic Social Dialogs in American English
    Alex Lưu