Issue Classification example

In this example we run a text similarity search over a set of GitHub issues to predict the labels for newly entered tickets.

The data is gathered from the quarkusio/quarkus repository. Each issue in the dataset has a title and a body and is tagged with labels (e.g. area/devmode or kind/bug).

SBERT sentence transformers are used to compute the embeddings, which are stored in a vector database (Qdrant in our case).
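The idea behind the approach can be sketched in a few lines of plain Python. This is only an illustration, not the code from the notebooks: the `embed` function is a toy bag-of-words stand-in for an SBERT encoder, the in-memory `INDEX` list stands in for the vector database, and the issue texts, the `area/documentation` label, and the helper names (`embed`, `cosine`, `predict_labels`) are hypothetical.

```python
import math
from collections import Counter


def embed(text):
    """Toy stand-in for an SBERT encoder: a bag-of-words count vector."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[token] * b[token] for token in a if token in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def predict_labels(query, index, k=3, top_n=1):
    """Vote over the labels of the k indexed issues most similar to the query."""
    q = embed(query)
    ranked = sorted(index, key=lambda item: cosine(q, item["vector"]), reverse=True)
    votes = Counter(label for item in ranked[:k] for label in item["labels"])
    return [label for label, _ in votes.most_common(top_n)]


# Hypothetical, hand-written issues; the real data comes from quarkusio/quarkus.
ISSUES = [
    {"text": "dev mode hot reload fails after config change",
     "labels": ["area/devmode", "kind/bug"]},
    {"text": "hot reload in dev mode does not pick up new classes",
     "labels": ["area/devmode"]},
    {"text": "add documentation for the hibernate extension",
     "labels": ["area/documentation"]},
]

# In-memory stand-in for the vector database: one vector per labeled issue.
INDEX = [{"vector": embed(issue["text"]), "labels": issue["labels"]} for issue in ISSUES]

predicted = predict_labels("dev mode reload is broken", INDEX, k=2)
print(predicted)
```

With these toy inputs the two nearest issues both carry area/devmode, so that label wins the vote. A real encoder would capture semantic similarity rather than shared words, and a vector database would replace the linear scan over `INDEX`.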

How to use this repo

The code is split across several Jupyter notebooks, which must be run in the following order:

data_aquisition.ipynb
embeddings.ipynb
query.ipynb

