State of What Art? A Call for Multi-Prompt LLM Evaluation
Saved in:
| Main Authors: | Moran Mizrahi, Guy Kaplan, Dan Malkin, Rotem Dror, Dafna Shahaf, Gabriel Stanovsky |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
The MIT Press
2024-08-01
|
| Series: | Transactions of the Association for Computational Linguistics |
| Online Access: | http://dx.doi.org/10.1162/tacl_a_00681 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
Mapping Machine Learning Trends in Chemistry Research using LLM with Multi-Turn Prompting
by: Andreo Yudertha, et al.
Published: (2025-03-01) -
From Prompts to Motors: Man-in-the-Middle Attacks on LLM-Enabled Vacuum Robots
by: Asif Shaikh, et al.
Published: (2025-01-01) -
Use me wisely: AI-driven assessment for LLM prompting skills development
by: Dimitri Ognibene, Gregor Donabauer, Emily Theophilou, Cansu Koyuturk, Mona Yavari, Sathya Bursic, Alessia Telari, Alessia Testa, Raffaele Boiano, Davide Taibi, Davinia Hernandez-Leo, Udo Kruschwitz and Martin Ruskov
Published: (2025-07-01) -
LPITutor: an LLM based personalized intelligent tutoring system using RAG and prompt engineering
by: Zhensheng Liu, et al.
Published: (2025-08-01) -
Efficient Prompt Optimization for Relevance Evaluation via LLM-Based Confusion Matrix Feedback
by: Jaekeol Choi
Published: (2025-05-01)