DmitryKey
I build search engines. Host of the Vector Podcast: https://www.youtube.com/c/VectorPodcast
Insider SolutionsEspoo
DmitryKey's Stars
Significant-Gravitas/AutoGPT
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
mermaid-js/mermaid
Generation of diagrams like flowcharts or sequence diagrams from text in a similar manner as markdown
stanfordnlp/dspy
DSPy: The framework for programming—not prompting—foundation models
KevinMusgrave/pytorch-metric-learning
The easiest way to use deep metric learning in your application. Modular, flexible, and extensible. Written in PyTorch.
google/flax
Flax is a neural network library for JAX that is designed for flexibility.
MaartenGr/BERTopic
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
pyannote/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
evidentlyai/evidently
Evaluate and monitor ML models from validation to production. Join our Discord: https://discord.com/invite/xZjKRaNp8b
srbhr/Resume-Matcher
Resume Matcher is an open source, free tool to improve your resume. It works by using language models to compare and rank resumes with job descriptions.
yoheinakajima/instagraph
Converts text input or URL into knowledge graph and displays
determined-ai/determined
Determined is an open-source machine learning platform that simplifies distributed training, hyperparameter tuning, experiment tracking, and resource management. Works with PyTorch and TensorFlow.
metarank/metarank
A low code Machine Learning personalized ranking service for articles, listings, search results, recommendations that boosts user engagement. A friendly Learn-to-Rank engine
swirlai/swirl-search
SWIRL AI Connect: AI infrastructure software that powers your Search & Retrieval Augmented Generation (RAG) applications. Simplify and enhance your AI pipelines with seamless integration of large language models (LLMs) and data sources.
NotJoeMartinez/yt-fts
YouTube Full Text Search - Search all of a YouTube channel from the command line
veekaybee/what_are_embeddings
A deep dive into embeddings starting from fundamentals
projectblacklight/blacklight
Blacklight provides a discovery interface for any Solr (http://lucene.apache.org/solr) index.
AdolfVonKleist/Phonetisaurus
Phonetisaurus G2P
veekaybee/viberary
Good books, good vibes
terrier-org/pyterrier
A Python framework for performing information retrieval experiments, building on http://terrier.org/
esteininger/vector-search
The definitive guide to using Vector Search to solve your semantic search production workload needs.
fergusq/tampio
Tampio: An object-oriented programming language made to resemble Finnish
m-wrzr/streamlit-searchbox
Streamlit searchbox that dynamically updates and provides a list of suggestions based on a provided function
chrislit/abydos
Abydos NLP/IR library for Python
Lednik7/CLIP-ONNX
It is a simple library to speed up CLIP inference up to 3x (K80 GPU)
softwaredoug/searcharray
Full text search in your Pandas dataframe
DmitryKey/bert-solr-search
Search with BERT vectors in Solr, Elasticsearch, OpenSearch and GSI APU
steveash/NETransliteration-COLING2018
Code and data used in named entity transliteration experiments
thakur-nandan/sprint
SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset.
pulijon/Sttcast
Transcription from mp3 files to html with or without embedded player
fginter/simstring-cuda
A quick implementation of cosine-based string fuzzy lookup (a little bit like the famous simstring library) using sklearn, torch, and GPU acceleration. Can hold its own with an index of few million strings, batched queries, and GPU. Otherwise loses in speed to simstring, but is easy to install OTOH. I leave this here in case anyone wants it