prrao87
AI Engineer @kuzudb 🇨🇦. Building workflows using relational/graph/vector databases and LLMs.
@kuzudbToronto, Canada
Pinned Repositories
db-hub-fastapi
Async bulk data ingestion and querying in various document, graph and vector databases via their Python clients
duckdb-study
Compare DuckDB, Polars and Pandas for generating an artificial dataset of persons and companies
fine-grained-sentiment
A comparison and discussion of different NLP methods for 5-class sentiment classification on the SST-5 dataset.
kuzudb-study
Benchmark study on KùzuDB, an embedded OLAP graph database, on an artificial social network dataset
lancedb-study
Benchmark study on LanceDB, an embedded vector DB, for full-text search and vector search
pydantic-benchmarks
Benchmarks testing the performance of various releases of Pydantic v2 🦀
tweet-stance-prediction
Applying NLP transfer learning techniques to predict Tweet stance toward a topic
GenderGapTracker
Scrape news articles and analyze them using NLP to quantify the gender gap in Canadian mainstream media
rustinpieces
Journeys between the two worlds of Python 🐍 and Rust 🦀
prrao87's Repositories
prrao87/fine-grained-sentiment
A comparison and discussion of different NLP methods for 5-class sentiment classification on the SST-5 dataset.
prrao87/db-hub-fastapi
Async bulk data ingestion and querying in various document, graph and vector databases via their Python clients
prrao87/duckdb-study
Compare DuckDB, Polars and Pandas for generating an artificial dataset of persons and companies
prrao87/kuzudb-study
Benchmark study on KùzuDB, an embedded OLAP graph database, on an artificial social network dataset
prrao87/lancedb-study
Benchmark study on LanceDB, an embedded vector DB, for full-text search and vector search
prrao87/pydantic-benchmarks
Benchmarks testing the performance of various releases of Pydantic v2 🦀
prrao87/topic-modelling
Comparing the scalability and quality of topic models in Gensim and PySpark
prrao87/prrao87.github.io
Archived. My blog is now moved to https://github.com/thedataquarry
prrao87/rag-data-ops
Code for data ops when building RAG applications using LangChain and LlamaIndex
prrao87/mteb-validation
Compare different embedding models from MTEB leaderboard
prrao87/langchain
🦜🔗 Build context-aware reasoning applications
prrao87/meilisearch-python-sdk
An async and sync Python client for the Meilisearch API
prrao87/qdrant-client
Python client for Qdrant vector search engine
prrao87/RBIR
prrao87/uv-demo
Demo of uv for a streamlined Python package management experience
prrao87/AeonG
AeonG: An Efficient Built-in Temporal Support in Graph Databases
prrao87/awesome-duckdb
🦆 A curated list of awesome DuckDB resources
prrao87/gh-action-test
Test GitHub actions and pre-commit hooks for experimenting with CI/CD and auto-linting workflows.
prrao87/knowledge-table
Knowledge Table is an open-source package designed to simplify extracting and exploring structured data from unstructured documents.
prrao87/kuzu-docs
prrao87/kuzu-rdflib
An integration of KùzuDB and RDFlib.
prrao87/kuzu-ui
Browser-based user interface for Kùzu graph database
prrao87/lancedb
Serverless, low-latency vector database for AI applications
prrao87/llama_index
LlamaIndex is a data framework for your LLM applications
prrao87/rustworkx
A high performance Python graph library implemented in Rust.
prrao87/show-notes
Changelog episode show notes in Markdown format 📝
prrao87/spacy-nlp
Natural Language Processing experiments using the spaCy library
prrao87/this-week-in-rust
Data for this-week-in-rust.org
prrao87/VectorHub
VectorHub is a free, open-source learning website for people (software developers to senior ML architects) interested in adding vector retrieval to their ML stack.
prrao87/weaviate-io
Website for the Weaviate vector database