Pinned Repositories
active-testing
Active and Sample-Efficient Model Evaluation
ALiPy
ALiPy (Active Learning in Python) is an active learning toolbox for Python that lets users conveniently evaluate, compare, and analyze the performance of active learning methods.
alp
Active learning in Python
applied-ml
📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
argilla
✨ Open-source tool for data-centric NLP. Argilla helps domain experts and data teams to build better NLP datasets in less time.
ASTRA
Self-training with Weak Supervision (NAACL 2021)
aum
autogen
Enable Next-Gen Large Language Model Applications. Join our Discord: https://discord.gg/pAbnFJrkgZ
awesome-data-labeling
A curated list of awesome data labeling tools
Awesome-explainable-AI
A collection of research materials on explainable AI/ML
pnrajan's Repositories
pnrajan/applied-ml
📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
pnrajan/argilla
✨ Open-source tool for data-centric NLP. Argilla helps domain experts and data teams to build better NLP datasets in less time.
pnrajan/autogen
Enable Next-Gen Large Language Model Applications. Join our Discord: https://discord.gg/pAbnFJrkgZ
pnrajan/Awesome-explainable-AI
A collection of research materials on explainable AI/ML
pnrajan/awesome-fairness-in-ai
A curated list of awesome Fairness in AI resources
pnrajan/awesome-knowledge-graph
A curated list of Knowledge Graph related learning materials, databases, tools and other resources
pnrajan/Awesome-LLM
Awesome-LLM: a curated list of Large Language Model resources
pnrajan/awesome-llm-interpretability
A curated list of Large Language Model (LLM) Interpretability resources.
pnrajan/awesome-ocr
pnrajan/awesome-open-source-llmops
An awesome, curated list of the best open-source MLOps/LLMOps tools for data scientists.
pnrajan/awesome-sentiment-attitude-extraction
A curated list of awesome sentiment analysis studies, in which an attitude is the position that a Subject conveys in the text toward another Object mentioned there, such as an entity or event.
pnrajan/chirpycardinal
Stanford's Alexa Prize socialbot
pnrajan/dataprep
Open-source low-code data preparation library in Python. Collect, clean, and visualize your data in Python with a few lines of code.
pnrajan/DeepSpeed-MII
MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.
pnrajan/donut
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
pnrajan/facet
Human-explainable AI.
pnrajan/fastdup
FastDup is a tool for gaining insights from a large image collection. It can find anomalies, duplicate and near-duplicate images, and clusters of similarity, and learn the normal behavior and temporal interactions between images. It can be used for smart subsampling of a higher quality dataset, outlier removal, novelty detection of new information to be
pnrajan/fugue
A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark, Dask and Ray without any rewrites.
pnrajan/GPTZero
An open-source implementation of GPTZero
pnrajan/handprint
Apply different text recognition services to images of handwritten documents.
pnrajan/haystack
🔍 LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
pnrajan/Img2Mol
pnrajan/interpret
Fit interpretable models. Explain blackbox machine learning.
pnrajan/lm-evaluation-harness
A framework for few-shot evaluation of language models.
pnrajan/mislabel-detection
pnrajan/OpenNRE
An Open-Source Package for Neural Relation Extraction (NRE)
pnrajan/PaddleOCR
Awesome multilingual OCR toolkits based on PaddlePaddle (a practical, ultra-lightweight OCR system that supports recognition of 80+ languages, provides data annotation and synthesis tools, and supports training and deployment on server, mobile, embedded, and IoT devices)
pnrajan/RapidOCR
A cross-platform OCR library based on PaddleOCR, OnnxRuntime, and OpenVINO.
pnrajan/responsible-ai-toolbox
Responsible AI Toolbox is a suite of tools providing model and data exploration and assessment user interfaces and libraries that enable a better understanding of AI systems. These interfaces and libraries empower developers and stakeholders of AI systems to develop and monitor AI more responsibly, and take better data-driven actions.
pnrajan/tutorials-for-data-scientists