3

Auto-ARGUE: LLM-Based Report Generation Evaluation
Report generation (RG) is a RAG task that aims to produce a long-form, citation-attributed response to a complex user query. We present the first public, automated, LLM-based implementation of the ARGUE evaluation framework for RG.
Auto-ARGUE: LLM-Based Report Generation Evaluation