Belz, AnyaThomson, CraigReiter, EhudMille, SimonRogers, AnnaBoyd-Graber, JordanOkazaki, Naoaki2023-09-292023-09-292023-07-01Belz, A, Thomson, C, Reiter, E & Mille, S 2023, Non-Repeatable Experiments and Non-Reproducible Results : The Reproducibility Crisis in Human Evaluation in NLP. in A Rogers, J Boyd-Graber & N Okazaki (eds), Findings of the Association for Computational Linguistics: ACL 2023. Association for Computational Linguistics, Toronto, Canada, pp. 3676-3687. https://doi.org/10.18653/v1/2023.findings-acl.226Bibtex: belz-etal-2023-nonORCID: /0000-0002-7548-9504/work/143414311https://hdl.handle.net/2164/2180012218802engQA75 Electronic computers. Computer scienceQA75Non-Repeatable Experiments and Non-Reproducible Results : The Reproducibility Crisis in Human Evaluation in NLPBook item10.18653/v1/2023.findings-acl.226