Pinned Repositories
awesome-selfhosted
A list of Free Software network services and web applications which can be hosted on your own servers
datasets
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
beir-qdrant
Qdrant integration with BEIR, simplifying quality checks on standard datasets
django-semantic-search
Bringing semantic search to Django. Integrates seemlessly with Django ORM.
langchain
🦜🔗 Build context-aware reasoning applications
chatgpt-retrieval-plugin
The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.
openai-cookbook
Examples and guides for using the OpenAI API
qdrant
Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
kacperlukawski's Repositories
kacperlukawski/django-semantic-search
Bringing semantic search to Django. Integrates seemlessly with Django ORM.
kacperlukawski/beir-qdrant
Qdrant integration with BEIR, simplifying quality checks on standard datasets
kacperlukawski/discord-open-source
List of open source communities living on Discord
kacperlukawski/awesome-chatgpt-plugins
A curated list of awesome ChatGPT plugins, demos and Posts
kacperlukawski/qdrant-examples
A collection of examples and tutorials for Qdrant vector search engine
kacperlukawski/real-wordpiece
A score-based implementation of WordPiece tokenization training, compatible with HuggingFace tokenizers.
kacperlukawski/langchain
âš¡ Building applications with LLMs through composability âš¡
kacperlukawski/autogen
A programming framework for agentic AI 🤖
kacperlukawski/beir
A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.
kacperlukawski/chatgpt-retrieval-plugin
The ChatGPT Retrieval Plugin lets you easily search and find personal or work documents by asking questions in everyday language.
kacperlukawski/ColBERT
ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)
kacperlukawski/computer-vision-course
This repo is the homebase of a community driven course on Computer Vision with Neural Networks. Feel free to join us on the Hugging Face discord: hf.co/join/discord
kacperlukawski/docarray
🧬 The data structure for multimodal data · Neural Search · Vector Search · Document Store
kacperlukawski/generative-ai-for-beginners
18 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
kacperlukawski/haystack
:mag: LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
kacperlukawski/haystack-integrations
🚀 A list of Haystack Integrations, maintained by the community or deepset.
kacperlukawski/huggingface-cookbook
Open-source AI cookbook
kacperlukawski/huggingface-tokenizers
💥 Fast State-of-the-Art Tokenizers optimized for Research and Production
kacperlukawski/landing_page
Landing page for qdrant.tech
kacperlukawski/llama_index
An index created by GPT to organize external information and answer queries!
kacperlukawski/mcp-servers
Model Context Protocol Servers
kacperlukawski/mteb
MTEB: Massive Text Embedding Benchmark
kacperlukawski/myscale-vector-db-benchmark
Framework for benchmarking vector search engines
kacperlukawski/openai-cookbook
Examples and guides for using the OpenAI API
kacperlukawski/ossinsight
Open Source Software Insights - Analysis, Comparison, Trends, Rankings of Open Source Software, you can also get insight from more than 5 billion with natural language (powered by OpenAI). Follow us on Twitter: https://twitter.com/ossinsight
kacperlukawski/prompttools
Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chroma, Weaviate).
kacperlukawski/pull-request-another-repo
A Github action that creates a pull request in another repository using branch of the current repository
kacperlukawski/pyserini
Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.
kacperlukawski/sentence-transformers
Multilingual Sentence & Image Embeddings with BERT
kacperlukawski/ZoomVideoComposer
Pyhton script for generating zoom in/out videos from a set of images