TREC 2025 Proceedings
Team02_Run03_100SegmentsNoExpansion
Submission Details
- Organization
- SCIAI
- Track
- Detection, Retr., and Gen for Understanding News
- Task
- Report Generation Task
- Date
- 2025-08-15
Run Description
- Is this run manual or automatic?
- automatic
- Is this run based on the provided starter kit?
- no
- Briefly describe this run
- A set of 60 questions is generated from the article contents via three LLM calls. These questions are narrowed down to 10 by ranking them with a pretrained model and removing questions that are too similar to others. The 10 questions are used to retrieve the top 100 segments from MS MARCO V2.1 (Segmented). Reranking techniques are then applied, and an LLM selects the most relevant segments for each question. An LLM then answers as many questions as possible using the retrieved segments before reaching the 250-word limit of the final report.
- What other datasets or services (e.g. Google/Bing web search, ChatGPT, Perplexity, etc.) were used in producing the run?
- ChatGPT (gpt-4o model)
- Briefly describe LLMs used for this run (optional)
- Please give this run a priority for inclusion in manual assessments.
- 3
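The question-narrowing and report-packing steps in the run description could be sketched roughly as follows. This is a minimal illustration, not the team's implementation: the function names `select_questions` and `pack_report`, the `max_sim` threshold, and the use of `difflib.SequenceMatcher` as the similarity measure are all assumptions, and the ranking scores are assumed to come from the pretrained question ranker mentioned above. Stopping at the first answer that would exceed the limit is also an assumption about how "as many questions as possible" is interpreted.

```python
from difflib import SequenceMatcher


def select_questions(scored, k=10, max_sim=0.8):
    """Greedily keep the k highest-scored questions that are not near-duplicates.

    scored: list of (question, score) pairs; scores are assumed to come from
    a pretrained question ranker. max_sim is a hypothetical threshold.
    """
    kept = []
    for question, _ in sorted(scored, key=lambda pair: pair[1], reverse=True):
        # String similarity is a stand-in for whatever measure the run used.
        is_duplicate = any(
            SequenceMatcher(None, question.lower(), other.lower()).ratio() >= max_sim
            for other in kept
        )
        if not is_duplicate:
            kept.append(question)
        if len(kept) == k:
            break
    return kept


def pack_report(answers, word_limit=250):
    """Append answers in order until the next one would exceed word_limit."""
    parts, used = [], 0
    for answer in answers:
        n_words = len(answer.split())
        if used + n_words > word_limit:
            break  # assumption: stop at the first answer that does not fit
        parts.append(answer)
        used += n_words
    return " ".join(parts)
```

With the run's settings, `select_questions` would be called on the 60 generated questions with `k=10`, and `pack_report` on the per-question answers with `word_limit=250`.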