dharmesh-soni's Stars
kedro-org/kedro
Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable, and modular.
NirDiamant/RAG_Techniques
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and contextually rich responses.
taverntesting/tavern
A command-line tool, Python library, and Pytest plugin for automated testing of RESTful APIs, with a simple, concise, and flexible YAML-based syntax
explodinggradients/ragas
Supercharge Your LLM Application Evaluations 🚀
mlabonne/llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
timescale/pgvectorscale
A complement to pgvector for high performance, cost efficient vector search on large workloads.
w5teams/w5
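Security Orchestration, Automation and Response (SOAR) Platform. A security orchestration and automated response platform offering security automation without writing code; using SOAR lets teams work more efficiently.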
nektos/act
Run your GitHub Actions locally 🚀
transferwise/pipelinewise-tap-mysql
Singer.io Tap for MySQL - PipelineWise compatible
transferwise/pipelinewise-target-snowflake
Singer.io Target for Snowflake - PipelineWise compatible
garden-io/garden
Automation for Kubernetes development and testing. Spin up production-like environments for development, testing, and CI on demand. Use the same configuration and workflows at every step of the process. Speed up your builds and test runs via shared result caching
google-research-datasets/paws
This dataset contains 108,463 human-labeled and 656k noisily labeled pairs that feature the importance of modeling structure, context, and word order information for the problem of paraphrase identification.
Netflix/metaflow-service
🚀 Metadata tracking and UI service for Metaflow!
akshatdalton/zulip
Zulip server - powerful open source team chat
Pylons/pyramid_openapi3
Pyramid addon for OpenAPI3 validation of requests and responses.
princeton-nlp/ALCE
[EMNLP 2023] Enabling Large Language Models to Generate Text with Citations. Paper: https://arxiv.org/abs/2305.14627
hegelai/prompttools
Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chroma, Weaviate, LanceDB).
pengfei-luo/multimodal-knowledge-graph
A collection of resources on multimodal knowledge graph, including datasets, papers and contests.
eihli/image-table-ocr
Turn images of tables into CSV data. Detect tables from images and run OCR on the cells.
AI21Labs/in-context-ralm
ZihengZZH/awesome-multimodal-knowledge-graph
A curated list of AWESOME papers, datasets and tutorials within Multimodal Knowledge Graph.
MeltanoLabs/target-snowflake
Singer Target for the Snowflake cloud Data Warehouse
apple/ml-mkqa
We introduce MKQA, an open-domain question answering evaluation set comprising 10k question-answer pairs aligned across 26 typologically diverse languages (260k question-answer pairs in total). The goal of this dataset is to provide a challenging benchmark for question answering quality across a wide set of languages. For details, please refer to our paper, MKQA: A Linguistically Diverse Benchmark for Multilingual Open Domain Question Answering.
uber-research/PPLM
Plug and Play Language Model implementation. Allows steering the topic and attributes of GPT-2 models.
Forward-Operators/prr
prr - command-line LLM prompt runner
brexhq/prompt-engineering
Tips and tricks for working with Large Language Models like OpenAI's GPT-4.
openai/openai-cookbook
Examples and guides for using the OpenAI API
aws-samples/amazon-macie-results-analytics
This is a repository of information to help with ideas and examples for performing analytics on the results of Amazon Macie classification jobs.
VertaAI/modeldb
Open Source ML Model Versioning, Metadata, and Experiment Management
opendatadiscovery/odd-collector
Open-source metadata collector based on ODD Specification