whylabs/langkit

Support multiple embedding models

Closed this issue · 1 comments

zmacks commented

The Universal Sentence Encoder model is a multipurpose sentence embeddings model for semantic similarity.

The aim is to provide a single encoder that can support as wide a variety of applications as possible, including paraphrase detection, relatedness, clustering and custom text classification.

I'd love to see the model swappable/configurable wherever embeddings are generated.

We have the first part of this request in PR, covering the prompt/response similarity score and themes metric to use a swappable embeddings encode method. Great suggestion, thanks for filing the request @zmacks !