rasyosef/splade-tiny · Hugging Face (original) (raw)

This is a SPLADE sparse retrieval model based on BERT-Tiny (4M) that was trained by distilling a Cross-Encoder on the MSMARCO dataset. The cross-encoder used was ms-marco-MiniLM-L6-v2.

This Tiny SPLADE model beats BM25 by 65.6% on the MSMARCO benchmark. While this model is 15x smaller than Naver's official splade-v3-distilbert, is posesses 80% of it's performance on MSMARCO. This model is small enough to be used without a GPU on a dataset of a few thousand documents.

Performance

The splade models were evaluated on 55 thousand queries and 8.84 million documents from the MSMARCO dataset.

Size (# Params) MRR@10 (MS MARCO dev)
BM25 - 18.0
rasyosef/splade-tiny 4.4M 30.9
rasyosef/splade-mini 11.2M 34.1
rasyosef/splade-small 28.8M 35.4
naver/splade-v3-distilbert 67.0M 38.7

Usage

Direct Usage (Sentence Transformers)

First install the Sentence Transformers library:

pip install -U sentence-transformers

Then you can load this model and run inference.

from sentence_transformers import SparseEncoder

# Download from the 🤗 Hub
model = SparseEncoder("rasyosef/splade-tiny")
# Run inference
queries = [
    "what do i need to change my name on my license in ma",
]
documents = [
    'Change your name on MA state-issued ID such as driver’s license or MA ID card. All documents you bring to RMV need to be originals or certified copies by the issuing agency. PAPERWORK NEEDED: Proof of legal name change — A court order showing your legal name change. Your Social Security Card with your new legal name change',
    "See below: 1. Get your marriage license. Before you can change your name, you'll need the original (or certified) marriage license with the raised seal and your new last name on it. Call the clerk's office where your license was filed to get copies if one wasn't automatically sent to you. 2. Change your Social Security card.",
    "You'll keep the same number—just your name will be different. Mail in your application to the local Social Security Administration office. You should get your new card within 10 business days. 3. Change your license at the DMV. Take a trip to the local Department of Motor Vehicles office to get a new license with your new last name. Bring every form of identification you can get your hands on—your old license, your certified marriage certificate and, most importantly, your new Social Security card.",
]
query_embeddings = model.encode_query(queries)
document_embeddings = model.encode_document(documents)
print(query_embeddings.shape, document_embeddings.shape)
# [1, 30522] [3, 30522]

# Get the similarity scores for the embeddings
similarities = model.similarity(query_embeddings, document_embeddings)
print(similarities)
# tensor([[16.6297, 13.4552, 10.1923]])

Model Details

Model Description

Model Sources

Full Model Architecture

SparseEncoder(
  (0): MLMTransformer({'max_seq_length': 512, 'do_lower_case': False, 'architecture': 'BertForMaskedLM'})
  (1): SpladePooling({'pooling_strategy': 'max', 'activation_function': 'relu', 'word_embedding_dimension': 30522})
)

More

Click to expand

Evaluation

Metrics

Sparse Information Retrieval

Metric Value
dot_accuracy@1 0.4772
dot_accuracy@3 0.793
dot_accuracy@5 0.8964
dot_accuracy@10 0.96
dot_precision@1 0.4772
dot_precision@3 0.2713
dot_precision@5 0.1864
dot_precision@10 0.1009
dot_recall@1 0.4617
dot_recall@3 0.7799
dot_recall@5 0.8874
dot_recall@10 0.9559
dot_ndcg@10 0.7217
dot_mrr@10 0.649
dot_map@100 0.6447
query_active_dims 18.3342
query_sparsity_ratio 0.9994
corpus_active_dims 121.653
corpus_sparsity_ratio 0.996

Training Details

Training Dataset

Unnamed Dataset

{  
    "loss": "SparseMarginMSELoss",  
    "document_regularizer_weight": 0.2,  
    "query_regularizer_weight": 0.3  
}  

Training Hyperparameters

Non-Default Hyperparameters

All Hyperparameters

Click to expand

Training Logs

Epoch Step Training Loss dot_ndcg@10
1.0 37500 11.4095 0.7103
2.0 75000 10.5305 0.7139
3.0 112500 9.5368 0.7197
4.0 150000 8.717 0.7216
5.0 187500 8.3094 0.7217

Framework Versions

Citation

BibTeX

Sentence Transformers

@inproceedings{reimers-2019-sentence-bert,
    title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
    author = "Reimers, Nils and Gurevych, Iryna",
    booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
    month = "11",
    year = "2019",
    publisher = "Association for Computational Linguistics",
    url = "https://arxiv.org/abs/1908.10084",
}

SpladeLoss

@misc{formal2022distillationhardnegativesampling,
      title={From Distillation to Hard Negative Sampling: Making Sparse Neural IR Models More Effective},
      author={Thibault Formal and Carlos Lassance and Benjamin Piwowarski and Stéphane Clinchant},
      year={2022},
      eprint={2205.04733},
      archivePrefix={arXiv},
      primaryClass={cs.IR},
      url={https://arxiv.org/abs/2205.04733},
}

SparseMarginMSELoss

@misc{hofstätter2021improving,
    title={Improving Efficient Neural Ranking Models with Cross-Architecture Knowledge Distillation},
    author={Sebastian Hofstätter and Sophia Althammer and Michael Schröder and Mete Sertkan and Allan Hanbury},
    year={2021},
    eprint={2010.02666},
    archivePrefix={arXiv},
    primaryClass={cs.IR}
}

FlopsLoss

@article{paria2020minimizing,
    title={Minimizing flops to learn efficient sparse representations},
    author={Paria, Biswajit and Yeh, Chih-Kuan and Yen, Ian EH and Xu, Ning and Ravikumar, Pradeep and P{'o}czos, Barnab{'a}s},
    journal={arXiv preprint arXiv:2004.05665},
    year={2020}
}

Downloads last month

8,441

Model tree for rasyosef/splade-tiny

Dataset used to train rasyosef/splade-tiny

microsoft/ms_marco Viewer • Updated Jan 4, 2024• 1.11M • 21.9k • 238

Collection including rasyosef/splade-tiny

Papers for rasyosef/splade-tiny

Evaluation results