TREC 2025 Proceedings
bge-m3
Submission Details
- Organization
- DS@GT
- Track
- Tip-of-the-Tongue Search
- Task
- Retrieval Task
- Date
- 2025-09-01
Run Description
- Please describe in details how this run was generated
- This is a dense retrieval run. We directly use the Wikipedia embeddings from https://huggingface.co/datasets/Upstash/wikipedia-2024-06-bge-m3 We use the bge-m3 model (https://huggingface.co/BAAI/bge-m3) to embed all the queries and cosine similarity is computed to retrieval top 1000 passages.
- Specify datasets used in this run.
- ["This year's TREC TOT training data"]
- (if you checked "other", describe here)
- Are you 100% confident that no data from https://github.com/microsoft/Tip-of-the-Tongue-Known-Item-Retrieval-Dataset-for-Movie-Identification or iRememberThisMovie.com (besides the training data provided as part of this year's track) was used for producing this run (including any data used for pretraining models that you are building on top of)?
- no
- Did you use any of the official baseline runs in any way to produce this run?
- no
- If you did use any of the official baseline runs in any way to produce this run, please describe how below in sufficient detail (e.g., as reranking candidates or in ensemble with other approaches).
Evaluation Files
Paper