chempku's Stars
NVIDIA/NeMo-Guardrails
NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.
meta-llama/llama
Inference code for Llama models
PacktPublishing/Causal-Inference-and-Discovery-in-Python
Causal Inference and Discovery in Python by Packt Publishing
databricks-demos/dbdemos
Demos to implement your Databricks Lakehouse
freedmand/semantra
Multi-tool for semantic search
pymc-labs/CausalPy
A Python package for causal inference in quasi-experimental settings
napsternxg/awesome-causality
Resources related to causality
skdeshpande91/flexBCF
Faster and more flexible implementation of Bayesian Causal Forests
Future-House/paper-qa
High accuracy RAG for answering questions from scientific documents with citations
kelvins/awesome-mlops
:sunglasses: A curated list of awesome MLOps tools
run-llama/llama_index
LlamaIndex is a data framework for your LLM applications
openai/chatgpt-retrieval-plugin
The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.
python-poetry/poetry
Python packaging and dependency management made easy
willwulfken/MidJourney-Styles-and-Keywords-Reference
A reference containing Styles and Keywords that you can use with MidJourney AI. There are also pages showing resolution comparison, image weights, and much more!
CreativeInquiry/terrapattern
Enabling journalists, citizen scientists, humanitarian workers and others to detect “patterns of interest” in satellite imagery.
HumanSignal/label-studio
Label Studio is a multi-type data labeling and annotation tool with standardized output format
wkzs111/phm-ieee-2012-data-challenge-dataset
Dataset that was used during the PHM IEEE 2012 Data Challenge, built by the FEMTO-ST Institute
seasmith/HoustonCrimeViewer
An interactive map of Houston crime data
hardikkamboj/An-Introduction-to-Statistical-Learning
This repository contains the exercises and its solution contained in the book "An Introduction to Statistical Learning" in python.
vaexio/vaex
Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀
microsoft/LightGBM
A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks.
cs231n/cs231n.github.io
Public facing notes page
sloria/TextBlob
Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.
andkret/Cookbook
The Data Engineering Cookbook
descarteslabs/DL-COVID-19
Mobility changes in response to COVID-19, provided by Descartes Labs
Qiskit/qiskit
Qiskit is an open-source SDK for working with quantum computers at the level of extended quantum circuits, operators, and primitives.
benweet/stackedit
In-browser Markdown editor
WattTime/pyiso
Python client libraries for ISO and other power grid data sources.
JifuZhao/DS-Take-Home
My solution to the book A Collection of Data Science Take-Home Challenges
awslabs/amazon-sagemaker-mlops-workshop
Machine Learning Ops Workshop with SageMaker: lab guides and materials.