State of What Art? A Call for Multi-Prompt LLM Evaluation
Saved in:
| Main Authors: | , , , , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
The MIT Press
2024-08-01
|
| Series: | Transactions of the Association for Computational Linguistics |
| Online Access: | http://dx.doi.org/10.1162/tacl_a_00681 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| _version_ | 1850264760211210240 |
|---|---|
| author | Moran Mizrahi Guy Kaplan Dan Malkin Rotem Dror Dafna Shahaf Gabriel Stanovsky |
| author_facet | Moran Mizrahi Guy Kaplan Dan Malkin Rotem Dror Dafna Shahaf Gabriel Stanovsky |
| author_sort | Moran Mizrahi |
| collection | DOAJ |
| format | Article |
| id | doaj-art-175836b0017c4e128aa7c6a99da58716 |
| institution | OA Journals |
| issn | 2307-387X |
| language | English |
| publishDate | 2024-08-01 |
| publisher | The MIT Press |
| record_format | Article |
| series | Transactions of the Association for Computational Linguistics |
| spelling | doaj-art-175836b0017c4e128aa7c6a99da587162025-08-20T01:54:38ZengThe MIT PressTransactions of the Association for Computational Linguistics2307-387X2024-08-011210.1162/tacl_a_00681State of What Art? A Call for Multi-Prompt LLM EvaluationMoran MizrahiGuy KaplanDan MalkinRotem DrorDafna ShahafGabriel Stanovskyhttp://dx.doi.org/10.1162/tacl_a_00681 |
| spellingShingle | Moran Mizrahi Guy Kaplan Dan Malkin Rotem Dror Dafna Shahaf Gabriel Stanovsky State of What Art? A Call for Multi-Prompt LLM Evaluation Transactions of the Association for Computational Linguistics |
| title | State of What Art? A Call for Multi-Prompt LLM Evaluation |
| title_full | State of What Art? A Call for Multi-Prompt LLM Evaluation |
| title_fullStr | State of What Art? A Call for Multi-Prompt LLM Evaluation |
| title_full_unstemmed | State of What Art? A Call for Multi-Prompt LLM Evaluation |
| title_short | State of What Art? A Call for Multi-Prompt LLM Evaluation |
| title_sort | state of what art a call for multi prompt llm evaluation |
| url | http://dx.doi.org/10.1162/tacl_a_00681 |
| work_keys_str_mv | AT moranmizrahi stateofwhatartacallformultipromptllmevaluation AT guykaplan stateofwhatartacallformultipromptllmevaluation AT danmalkin stateofwhatartacallformultipromptllmevaluation AT rotemdror stateofwhatartacallformultipromptllmevaluation AT dafnashahaf stateofwhatartacallformultipromptllmevaluation AT gabrielstanovsky stateofwhatartacallformultipromptllmevaluation |