State of What Art? A Call for Multi-Prompt LLM Evaluation

Bibliographic Details
Main Authors: Moran Mizrahi, Guy Kaplan, Dan Malkin, Rotem Dror, Dafna Shahaf, Gabriel Stanovsky
Format: Article
Language: English
Published: The MIT Press 2024-08-01
Series: Transactions of the Association for Computational Linguistics
Online Access: http://dx.doi.org/10.1162/tacl_a_00681
ISSN: 2307-387X