4 months ago

Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks

Nils Reimers; Iryna Gurevych

Abstract

BERT (Devlin et al., 2018) and RoBERTa (Liu et al., 2019) has set a new state-of-the-art performance on sentence-pair regression tasks like semantic textual similarity (STS). However, it requires that both sentences are fed into the network, which causes a massive computational overhead: Finding the most similar pair in a collection of 10,000 sentences requires about 50 million inference computations (~65 hours) with BERT. The construction of BERT makes it unsuitable for semantic similarity search as well as for unsupervised tasks like clustering. In this publication, we present Sentence-BERT (SBERT), a modification of the pretrained BERT network that use siamese and triplet network structures to derive semantically meaningful sentence embeddings that can be compared using cosine-similarity. This reduces the effort for finding the most similar pair from 65 hours with BERT / RoBERTa to about 5 seconds with SBERT, while maintaining the accuracy from BERT. We evaluate SBERT and SRoBERTa on common STS tasks and transfer learning tasks, where it outperforms other state-of-the-art sentence embeddings methods.

Code Repositories

aneesha/SiameseBERT-Notebook

Mentioned in GitHub

p208p2002/Sentence-BERT-mean-operation

Mentioned in GitHub

datcancode/sentence-transformers

pytorch

Mentioned in GitHub

rafaljanwojcik/SentenceBERT_vs_SiameseLSTM

pytorch

Mentioned in GitHub

projeto-de-algoritmos/Grafos1_Joao_Lucas_Leonardo_Miranda

pytorch

Mentioned in GitHub

BM-K/KoSentenceBERT_ETRI

pytorch

Mentioned in GitHub

sjtu-lit/syncse

pytorch

Mentioned in GitHub

reoneo97/wutr-buildon-2021

pytorch

Mentioned in GitHub

BM-K/KoSentenceBERT

pytorch

Mentioned in GitHub

princeton-nlp/SimCSE

pytorch

Mentioned in GitHub

fangrouli/Document-embedding-generation-models

pytorch

Mentioned in GitHub

asgaardlab/test-case-similarity-technique

Mentioned in GitHub

varun-suresh/experiments-with-gpt2/tree/main/language_models

pytorch

OctopusMind/longBert

pytorch

oto-labs/librarian

Mentioned in GitHub

Walid-Rahman2/modified_sentence_transfomers

pytorch

Mentioned in GitHub

kihohan/NLP_Reference

pytorch

Mentioned in GitHub

zhihaillm/wisdominterrogatory

pytorch

Mentioned in GitHub

RaviTejaMaddhini/SBERT-Tensorflow-implementation

Mentioned in GitHub

FreddeFrallan/Contrastive-Tension

Mentioned in GitHub

jcyk/mse-amr

pytorch

Mentioned in GitHub

imperialite/BERT-Embeddings-For-ARA

Mentioned in GitHub

yjiangcm/dcpcse

pytorch

Mentioned in GitHub

croitorualin/reverse-stable-diffusion

pytorch

Mentioned in GitHub

rmslick/SummarySearch

pytorch

Mentioned in GitHub

eric11eca/NeuralLog

Mentioned in GitHub

gmcgoldr/theissues

pytorch

Mentioned in GitHub

brightjade/CS492E-CiteRec

pytorch

Mentioned in GitHub

Siamul/NLP-Project

pytorch

Mentioned in GitHub

skojaku/Practical-Guide-to-Sentence-Transformers

Mentioned in GitHub

lambert-x/prolab

pytorch

Mentioned in GitHub

PaddlePaddle/PaddleNLP/tree/develop/examples/text_matching/sentence_transformers

paddle

idiap/analogy_learning

pytorch

Mentioned in GitHub

dmmiller612/bert-extractive-summarizer

pytorch

Mentioned in GitHub

abhilash1910/ClusterTransformer

pytorch

puerrrr/focal-infonce

pytorch

Mentioned in GitHub

yjiangcm/promcse

pytorch

Mentioned in GitHub

hkust-nlp/syncse

pytorch

Mentioned in GitHub

Alexey-Borisov/3_course_diary

Mentioned in GitHub

nuochenpku/sscl

pytorch

Mentioned in GitHub

thisisclement/STS-Benchmark-SentEval

Mentioned in GitHub

BinWang28/SBERT-WK-Sentence-Embedding

pytorch

Mentioned in GitHub

saulhazelius/transformer-clustering

Mentioned in GitHub

Danqi7/584-final

pytorch

Mentioned in GitHub

BinWang28/BERT_Sentence_Embedding

pytorch

Mentioned in GitHub

law-ai/summarization

pytorch

Mentioned in GitHub

max-planck-innovation-competition/sentence-transformers

pytorch

Mentioned in GitHub

UKPLab/sentence-transformers

Official

pytorch

Mentioned in GitHub

BM-K/KoSentenceBERT_SKT

pytorch

Mentioned in GitHub

AnzorGozalishvili/sentence_transformers_serving

Mentioned in GitHub

asreview/asreview-multilingual-feature-extractor

Mentioned in GitHub

autumn0409/Log-based-Anomaly-Detection-System

Mentioned in GitHub

valdecy/pybibx

Mentioned in GitHub

BM-K/KoSentenceBERT_SKTBERT

pytorch

Mentioned in GitHub

hhzrd/BEFAQ

Mentioned in GitHub

Susheel-1999/Sentence_Similarity

Mentioned in GitHub

yur7nd/ptss

pytorch

Mentioned in GitHub

TheNeuromancer/SentEmb

pytorch

Mentioned in GitHub

eelenadelolmo/WordVectors

pytorch

Mentioned in GitHub

bm-k/kosentencebert-skt

pytorch

Mentioned in GitHub

martinomensio/spacy-sentence-bert

pytorch

Mentioned in GitHub

InsaneLife/dssm

Mentioned in GitHub

xiaoouwang/frenchnlp

pytorch

Mentioned in GitHub

ClaudiuChelcea/2NHACK2021-CoverLetter-Generator-ML

Mentioned in GitHub

Benchmarks

Benchmark	Methodology	Metrics
semantic-textual-similarity-on-sick	SRoBERTa-NLI-large	Spearman Correlation: 0.7429
semantic-textual-similarity-on-sick	SRoBERTa-NLI-base	Spearman Correlation: 0.7446
semantic-textual-similarity-on-sick	SBERT-NLI-base	Spearman Correlation: 0.7291
semantic-textual-similarity-on-sick	SBERT-NLI-large	Spearman Correlation: 0.7375
semantic-textual-similarity-on-sick	SentenceBERT	Spearman Correlation: 0.7462
semantic-textual-similarity-on-sts-benchmark	SRoBERTa-NLI-STSb-large	Spearman Correlation: 0.8615
semantic-textual-similarity-on-sts-benchmark	SBERT-NLI-base	Spearman Correlation: 0.7703
semantic-textual-similarity-on-sts-benchmark	SRoBERTa-NLI-base	Spearman Correlation: 0.7777
semantic-textual-similarity-on-sts-benchmark	SBERT-NLI-large	Spearman Correlation: 0.79
semantic-textual-similarity-on-sts-benchmark	SBERT-STSb-base	Spearman Correlation: 0.8479
semantic-textual-similarity-on-sts-benchmark	SBERT-STSb-large	Spearman Correlation: 0.8445
semantic-textual-similarity-on-sts12	SRoBERTa-NLI-large	Spearman Correlation: 0.7453
semantic-textual-similarity-on-sts13	SBERT-NLI-large	Spearman Correlation: 0.7846
semantic-textual-similarity-on-sts14	SBERT-NLI-large	Spearman Correlation: 0.7490000000000001
semantic-textual-similarity-on-sts15	SRoBERTa-NLI-large	Spearman Correlation: 0.8185
semantic-textual-similarity-on-sts16	SRoBERTa-NLI-large	Spearman Correlation: 0.7682

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started

Hyper Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Command Palette

Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks

Nils Reimers; Iryna Gurevych

Abstract

Code Repositories

Benchmarks

Build AI with AI

Hyper Newsletters