TREC 2025 Proceedings

qwen_splade

Submission Details

Organization
UTokyo
Track
Retrieval-Augmented Generation
Task
Retrieval Only Task
Date
2025-08-16

Run Description

Is this a manual (human intervention) or automatic run?
automatic
Does this run leverage neural networks?
yes
Does this run leverage proprietary models in any step of the retrieval pipeline?
yes
Does this run leverage open-weight LLMs (> 5B parameters) in any step of the retrieval pipeline?
no
Does this run leverage smaller open-weight language models in any step of the retrieval pipeline?
yes
Was this run padded with results from a baseline run?
no
What would you categorize this run as?
Generation-in-the-loop Pipeline
Please provide a short description of this run
Hybrid retrieval pipeline leveraging HyDE (Hypothetical Document Embeddings) with Qwen3-Embedding-0.6B for dense retrieval (query:HyDE 0.3:0.7 weighted combination) and SPLADE sparse retrieval. Results are fused using Reciprocal Rank Fusion (RRF, k=60) to combine complementary signals from dense and sparse methods. Final ranking performed by GPT-4.1-mini with sliding window reranking (window=10, stride=5, 3 passes) incorporating document title, URL, and segment content for enhanced contextual understanding.
Please give this run a priority for inclusion in manual assessments.
3

Evaluation Files

Paper