konradsemsch's Stars
aws-samples/amazon-bedrock-workshop
This is a workshop designed for Amazon Bedrock a foundational model service.
ollama/ollama
Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.
aws-samples/sagemaker-serverless-inference-benchmarking
dastergon/postmortem-templates
A collection of postmortem templates
docker/genai-stack
Langchain + Docker + Neo4j + Ollama
gkamradt/langchain-tutorials
Overview and tutorial of the LangChain Library
assafelovic/gpt-researcher
LLM based autonomous agent that does online comprehensive research on any given topic
feast-dev/feast
The Open Source Feature Store for Machine Learning
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
opentofu/opentofu
OpenTofu lets you declaratively manage your cloud infrastructure.
iterative/dvc
🦉 ML Experiments and Data Management with Git
pantsbuild/pants
The Pants Build System
interpretml/interpret
Fit interpretable models. Explain blackbox machine learning.
Lightning-AI/litgpt
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
promptfoo/promptfoo
Test your prompts, agents, and RAGs. Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line and CI/CD integration.
confident-ai/deepeval
The LLM Evaluation Framework
h2oai/h2ogpt
Private chat with local GPT with document, images, video, etc. 100% private, Apache 2.0. Supports oLLaMa, Mixtral, llama.cpp, and more. Demo: https://gpt.h2o.ai/ https://gpt-docs.h2o.ai/
UKPLab/sentence-transformers
State-of-the-Art Text Embeddings
99designs/aws-vault
A vault for securely storing and accessing AWS credentials in development environments
NVIDIA/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
InternLM/lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
CarperAI/trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
bitsandbytes-foundation/bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
bentoml/OpenLLM
Run any open-source LLMs, such as Llama 3.1, Gemma, as OpenAI compatible API endpoint in the cloud.
phlippe/uvadlc_notebooks
Repository of Jupyter notebook tutorials for teaching the Deep Learning Course at the University of Amsterdam (MSc AI), Fall 2023
huggingface/optimum
🚀 Accelerate training and inference of 🤗 Transformers and 🤗 Diffusers with easy to use hardware optimization tools
aws-samples/aiml-genai-multimodal-agent
msaroufim/ml-design-patterns
Software Architecture for ML engineers
Sentdex/TermGPT
Giving LLMs like GPT-4 the ability to plan and execute terminal commands
aws-powertools/powertools-lambda-python
A developer toolkit to implement Serverless best practices and increase developer velocity.