Paper page - Exploring $\ell_0$ Sparsification for Inference-free Sparse Retrievers

Published on Apr 21, 2025

Abstract

Inference-free retrieval achieves state-of-the-art performance using an $\ell_0$-inspired sparsification method, offering a balance between effectiveness and efficiency.

With increasing demands for efficiency, information retrieval has developed a branch of sparse retrieval, further advancing towards inference-free retrieval, where documents are encoded at indexing time and no model inference is needed for queries. Existing sparse retrieval models rely on FLOPS regularization for sparsification. Because this mechanism was originally designed for Siamese encoders, it is considered suboptimal in inference-free scenarios, which are asymmetric. Previous attempts to adapt FLOPS for inference-free scenarios have been limited to rule-based methods, leaving the potential of sparsification approaches for inference-free retrieval models largely unexplored. In this paper, we explore an $\ell_0$-inspired sparsification approach for inference-free retrievers. Through comprehensive out-of-domain evaluation on the BEIR benchmark, our method achieves state-of-the-art performance among inference-free sparse retrieval models and is comparable to leading Siamese sparse retrieval models. Furthermore, we provide insights into the trade-off between retrieval effectiveness and computational efficiency, demonstrating practical value for real-world applications.
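To make the terms in the abstract concrete, the following is a minimal sketch rather than the paper's implementation. It assumes documents are stored as term-to-weight dictionaries, shows the standard FLOPS penalty (the sum over terms of the squared mean absolute weight in a batch), counts non-zero entries as the quantity an $\ell_0$ measure refers to, and scores a query by lookup only, with no model call. All function names, the `SparseVec` alias, and the optional per-token weight table are hypothetical illustrations; the paper's actual $\ell_0$-inspired objective is not reproduced here.

```python
from typing import Dict, Iterable, List, Optional

# Hypothetical representation: a sparse vector maps a term to its weight.
SparseVec = Dict[str, float]


def flops_regularizer(batch: List[SparseVec]) -> float:
    """FLOPS penalty: sum over terms of (mean absolute weight in the batch)^2."""
    n = len(batch)
    totals: Dict[str, float] = {}
    for vec in batch:
        for term, weight in vec.items():
            totals[term] = totals.get(term, 0.0) + abs(weight)
    # Terms absent from a document contribute zero to that term's mean.
    return sum((total / n) ** 2 for total in totals.values())


def l0_norm(vec: SparseVec) -> int:
    """Number of non-zero entries, i.e. how many terms the document activates."""
    return sum(1 for weight in vec.values() if weight != 0.0)


def inference_free_score(
    query_tokens: Iterable[str],
    doc_vec: SparseVec,
    token_weights: Optional[Dict[str, float]] = None,
) -> float:
    """Score a query without running a model on it.

    Each query token gets a weight from a precomputed lookup table
    (assumed here; defaults to 1.0) and is matched against the
    document vector that was encoded at indexing time.
    """
    weights = token_weights or {}
    return sum(
        weights.get(tok, 1.0) * doc_vec.get(tok, 0.0) for tok in query_tokens
    )


if __name__ == "__main__":
    docs = [{"a": 1.0, "b": 2.0}, {"a": 3.0}]
    print(flops_regularizer(docs))
    print(l0_norm({"a": 1.0, "b": 0.0, "c": 2.0}))
    print(inference_free_score(["a", "c", "z"], {"a": 0.5, "c": 1.5}))
```

The sketch illustrates the asymmetry the abstract mentions: only the document side carries learned weights, so any sparsity penalty acts on document vectors alone, while the query side is a cheap tokenization and lookup.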


Get this paper in your agent:

hf papers read 2504.14839

Don't have the latest CLI?

curl -LsSf https://hf.co/cli/install.sh | bash

Models citing this paper 4

opensearch-project/opensearch-neural-sparse-encoding-doc-v3-gte Feature Extraction • 0.1B • Updated Jul 22, 2025 • 2.93k • 13

opensearch-project/opensearch-neural-sparse-encoding-doc-v3-distill Feature Extraction • 67M • Updated Jun 30, 2025 • 5.85k • 10

raul3820/opensearch-neural-sparse-encoding-doc-v3-distill-onnx Feature Extraction • Updated Jan 4 • 5

seerware/opensearch-neural-sparse-encoding-doc-v3-distill Feature Extraction • 67M • Updated 17 days ago • 13

Datasets citing this paper 2

opensearch-project/msmarco-hard-negatives-llm-scores Viewer • Updated Aug 14, 2025 • 503k • 13 • 4

opensearch-project/msmarco-hard-negatives Viewer • Updated Aug 14, 2025 • 503k • 10 • 4

Spaces citing this paper 1
