prrao87
AI Engineer @kuzudb 🇨🇦. Building workflows using relational/graph/vector databases and LLMs. 🐍 + 🦀 = 💪🏽
@kuzudb
Toronto, Canada
Pinned Repositories
db-hub-fastapi
Async bulk data ingestion and querying in various document, graph and vector databases via their Python clients
duckdb-study
Compare DuckDB, Polars and Pandas for generating an artificial dataset of persons and companies
fine-grained-sentiment
A comparison and discussion of different NLP methods for 5-class sentiment classification on the SST-5 dataset.
kuzudb-study
Benchmark study on KùzuDB, an embedded OLAP graph database, on an artificial social network dataset
lancedb-study
Benchmark study on LanceDB, an embedded vector DB, for full-text search and vector search
pydantic-benchmarks
Benchmarks testing the performance of various releases of Pydantic v2
tweet-stance-prediction
Applying NLP transfer learning techniques to predict Tweet stance toward a topic
GenderGapTracker
Scrape news articles and analyze them using NLP to quantify the gender gap in Canadian mainstream media
rustinpieces
Journeys between the two worlds of Python 🐍 and Rust 🦀
prrao87's Repositories
prrao87/fine-grained-sentiment
A comparison and discussion of different NLP methods for 5-class sentiment classification on the SST-5 dataset.
prrao87/db-hub-fastapi
Async bulk data ingestion and querying in various document, graph and vector databases via their Python clients
prrao87/kuzudb-study
Benchmark study on KùzuDB, an embedded OLAP graph database, on an artificial social network dataset
prrao87/duckdb-study
Compare DuckDB, Polars and Pandas for generating an artificial dataset of persons and companies
prrao87/lancedb-study
Benchmark study on LanceDB, an embedded vector DB, for full-text search and vector search
prrao87/neo4j-python-fastapi
Bulk ingest data into Neo4j using sync or async Python, and expose the data via FastAPI
prrao87/fine-grained-sentiment-app
A Flask LIME explainer app for fine-grained sentiment classification.
prrao87/pydantic-benchmarks
Benchmarks testing the performance of various releases of Pydantic v2
prrao87/topic-modelling
Comparing the scalability and quality of topic models in Gensim and PySpark
prrao87/prrao87.github.io
Archived. My blog has moved to https://github.com/thedataquarry
prrao87/rag-data-ops
Code for data ops when building RAG applications using LangChain and LlamaIndex
prrao87/langchain
🦜🔗 Build context-aware reasoning applications
prrao87/meilisearch-python-sdk
An async and sync Python client for the Meilisearch API
prrao87/mteb-validation
Compare different embedding models from MTEB leaderboard
prrao87/qdrant-client
Python client for Qdrant vector search engine
prrao87/sqlmodel
SQL databases in Python, designed for simplicity, compatibility, and robustness.
prrao87/AeonG
AeonG: An Efficient Built-in Temporal Support in Graph Databases
prrao87/awesome-duckdb
🦆 A curated list of awesome DuckDB resources
prrao87/gh-action-test
Test GitHub actions and pre-commit hooks for experimenting with CI/CD and auto-linting workflows.
prrao87/kuzu-docs
prrao87/kuzu-rdflib
An integration of KùzuDB and RDFlib.
prrao87/kuzu-ui
Browser-based user interface for Kùzu graph database
prrao87/lancedb
Serverless, low-latency vector database for AI applications
prrao87/llama_index
LlamaIndex is a data framework for your LLM applications
prrao87/rustworkx
A high performance Python graph library implemented in Rust.
prrao87/spacy-experimental-coref
Testing the new coreference resolver in spaCy v3.x
prrao87/spacy-nlp
Natural Language Processing experiments using the spaCy library
prrao87/this-week-in-rust
Data for this-week-in-rust.org
prrao87/VectorHub
VectorHub is a free, open-source learning website for people (software developers to senior ML architects) interested in adding vector retrieval to their ML stack.
prrao87/weaviate-io
Website for the Weaviate vector database