TREC 2025 Proceedings
IDACCS_nugget_4.1
Submission Details
- Organization
- IDACCS
- Track
- RAG TREC Instrument for Multilingual Evaluation
- Task
- Report Generation Task
- Date
- 2025-08-19
Run Description
- Document collection
- ['English subset', 'Arabic subset', 'Chinese subset', 'Russian subset']
- Machine translation of documents
- ['Yes we used the organizer-provided machine translations']
- Write a short description of your retrieval process
- The following steps were done.
1. The organizers' search service was used to retrieve the top 30 documents, with the background and problem statement as the query.
2. We reranked the documents to obtain the top 10 using mxbai-rerank-large-v1 on 10-sentence chunks with an overlap of 5, using a query generated by gpt-4o based on the title, background, and problem statement.
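The chunk-and-rerank idea in step 2 can be sketched as below. This is a minimal illustration and not the submitted system. It assumes the overlap is counted in sentences, that sentence splitting has already been done, and that a document's score is the maximum of its chunk scores; the run description does not state the aggregation rule. The names `chunk_sentences`, `rerank_documents`, and `score_fn` are hypothetical, and `score_fn` stands in for a cross-encoder such as mxbai-rerank-large-v1.

```python
from typing import Callable, Dict, List


def chunk_sentences(sentences: List[str], size: int = 10, overlap: int = 5) -> List[List[str]]:
    """Split a sentence list into windows of `size` sentences that share `overlap` sentences."""
    if not 0 <= overlap < size:
        raise ValueError("overlap must satisfy 0 <= overlap < size")
    stride = size - overlap
    chunks: List[List[str]] = []
    start = 0
    while start < len(sentences):
        chunks.append(sentences[start:start + size])
        # Stop once the current window already reaches the end of the document.
        if start + size >= len(sentences):
            break
        start += stride
    return chunks


def rerank_documents(
    query: str,
    docs: Dict[str, List[str]],
    score_fn: Callable[[str, str], float],
    top_k: int = 10,
) -> List[str]:
    """Rank documents by their best-scoring chunk (assumed aggregation) and keep the top_k ids."""
    doc_scores: Dict[str, float] = {}
    for doc_id, sentences in docs.items():
        scores = [score_fn(query, " ".join(chunk)) for chunk in chunk_sentences(sentences)]
        doc_scores[doc_id] = max(scores) if scores else float("-inf")
    return sorted(doc_scores, key=lambda d: doc_scores[d], reverse=True)[:top_k]


def toy_score(query: str, passage: str) -> float:
    """Toy scorer (word overlap count), used only to make the sketch runnable."""
    return float(len(set(query.lower().split()) & set(passage.lower().split())))
```

In practice `toy_score` would be replaced by a call to the reranker model; the chunking and aggregation logic stays the same.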
- Write a short description of your generation process
- 3. An occams extractive summary was produced at twice the target length. The target length was 2500 for the 10000-long summaries, since generation was done per language, and 4000 for the 2000-long summaries.
4. GPT-4.1, with a prompt to form "nuggets" not to exceed the target length, was used to generate the report.
5. Attribution was done using our "blame" semantic similarity method using a t5-base model.
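The run description does not detail the "blame" method in step 5. As a rough illustration of similarity-based attribution in general, the sketch below assigns each report sentence to the source sentence whose embedding has the highest cosine similarity. The function names and the toy vectors are hypothetical; in a real pipeline the vectors would come from an encoder such as a t5-base model.

```python
import math
from typing import List, Sequence


def cosine(u: Sequence[float], v: Sequence[float]) -> float:
    """Cosine similarity between two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def attribute(
    report_vecs: List[Sequence[float]],
    source_vecs: List[Sequence[float]],
) -> List[int]:
    """For each report sentence vector, return the index of the most similar source vector."""
    if not source_vecs:
        raise ValueError("at least one source vector is required")
    return [
        max(range(len(source_vecs)), key=lambda j: cosine(r, source_vecs[j]))
        for r in report_vecs
    ]


# Toy embeddings (hypothetical): two report sentences, two source sentences.
report = [[1.0, 0.0], [0.0, 1.0]]
sources = [[0.0, 2.0], [3.0, 0.1]]
citations = attribute(report, sources)
```

A production system would likely add a similarity threshold so that weakly supported sentences receive no citation; that choice is not described in the source.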
- Which LLM(s) were used by your system?
- gpt-4o, gpt-4.1
- Open repository link
- na
- Assessing priority
- 1 (highest)
Evaluation Files
Paper