State of What Art? A Call for Multi-Prompt LLM Evaluation

Bibliographic Details
Main Authors:	Moran Mizrahi, Guy Kaplan, Dan Malkin, Rotem Dror, Dafna Shahaf, Gabriel Stanovsky
Format:	Article
Language:	English
Published:	The MIT Press 2024-08-01
Series:	Transactions of the Association for Computational Linguistics
Online Access:	http://dx.doi.org/10.1162/tacl_a_00681

Collection:	DOAJ
Institution:	OA Journals
ISSN:	2307-387X
Record ID:	doaj-art-175836b0017c4e128aa7c6a99da58716

Similar Items