Accepted Papers
Long papers
-
Beyond calories: evaluating how tailored communication reduces emotional load in diet-coaching
Simone Balloccu, Ehud Reiter -
Human evaluation of web-crawled parallel corpora for machine translation
Gema Ramírez-Sánchez, Marta Bañón, Jaume Zaragoza-Bernabeu, Sergio Ortiz-Rojas -
Vacillating Human Correlation of SacreBLEU in Unprotected Languages
Ahrii Kim, Jinhyeon Kim -
Human Judgement as a Compass to Navigate Automatic Metrics for Formality Transfer
Huiyuan Lai, Jiali Mao, Antonio Toral, Malvina Nissim -
A Study on Manual and Automatic Evaluation for Text Style Transfer: The Case of Detoxification
Varvara Logacheva, Daryna Dementieva, Irina Krotova, Alena Fenogenova, Irina Nikishina, Tatiana Shavrina, Alexander Panchenko -
Toward More Effective Human Evaluation for Machine Translation
Belén C Saldías Fuentes, George Foster, Markus Freitag, Qijun Tan -
The Human Evaluation Datasheet: A Template for Recording Details of Human Evaluation Experiments in NLP
Anastasia Shimorina, Anya Belz
Short papers
-
A Methodology for the Comparison of Human Judgments With Metrics for Coreference Resolution
Mariya Borovikova, Loïc Grobol, Anaïs Lefeuvre-Halftermeyer, Sylvie Billot -
Perceptual Quality Dimensions of Machine-Generated Text with a Focus on Machine Translation
Vivien Macketanz, Babak Naderi, Steven Schmidt, Sebastian Möller