by ZQ-Dev8 544 days ago
Jeremy, this is awesome! Personally excited for a new wave of sentence transformers built off ModernBERT. A poster below provided the link to a sample ST training script in the ModernBERT repo, so that's great.

Do you expect the ModernBERT STs to carry the same advantages over ModernBERT that BERT STs had over the original BERT? Or would you expect caveats based on ModernBERT's updated architecture and capabilities?

2 comments

Yes, absolutely the same advantages -- in fact, the maintainer of ST is on the paper team, and it's been a key goal from day one to make this work well.
What does ST stand for here? I googled and only got results for BERT STS (semantic textual similarity).
Sentence Transformers (https://sbert.net/), the most used library for embedding models (similarity, retrieval).
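
For anyone new to the idea, here is a rough sketch of what an embedding model does for similarity: pool an encoder's per-token vectors into one sentence vector, then compare sentences by cosine similarity. This is plain Python with made-up toy vectors, not the Sentence Transformers API. The helper names `mean_pool` and `cosine_similarity` are hypothetical, and mean pooling is assumed here as one common pooling choice.

```python
import math

def mean_pool(token_vectors, attention_mask):
    """Average the vectors of unmasked tokens into one sentence vector."""
    kept = [vec for vec, keep in zip(token_vectors, attention_mask) if keep]
    if not kept:
        raise ValueError("attention mask excludes every token")
    dim = len(kept[0])
    return [sum(vec[i] for vec in kept) / len(kept) for i in range(dim)]

def cosine_similarity(a, b):
    """Cosine of the angle between two vectors of equal length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

# Toy per-token vectors (assumed values, not real encoder output).
# The third token of the first sentence is padding, so its mask entry is 0.
sent_a = mean_pool([[1.0, 0.0], [0.0, 1.0], [9.0, 9.0]], [1, 1, 0])
sent_b = mean_pool([[1.0, 1.0], [1.0, 1.0]], [1, 1])

score = cosine_similarity(sent_a, sent_b)
print(sent_a, sent_b, score)
```

The masked padding token is ignored, so both sentences pool to vectors pointing in the same direction and score as near-identical. In a real setup the token vectors would come from an encoder such as ModernBERT, fine-tuned so that this score tracks semantic similarity.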