konradsemsch

Cloud Solution Architect & ML Engineer

@AWS · Essen, Germany

konradsemsch's Stars

aws-samples/amazon-bedrock-workshop
This is a workshop designed for Amazon Bedrock, a foundation model service.
Language: Jupyter Notebook · 1.4k · 607
ollama/ollama
Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.
Language: Go · 92.1k · 7.3k
aws-samples/sagemaker-serverless-inference-benchmarking
Language: Python · 13
dastergon/postmortem-templates
A collection of postmortem templates
1.3k · 421
docker/genai-stack
LangChain + Docker + Neo4j + Ollama
Language: Python · 3.9k · 828
gkamradt/langchain-tutorials
Overview and tutorial of the LangChain Library
Language: Jupyter Notebook · 6.7k · 1.9k
assafelovic/gpt-researcher
LLM-based autonomous agent that performs comprehensive online research on any given topic
Language: Python · 14.3k · 1.9k
feast-dev/feast
The Open Source Feature Store for Machine Learning
Language: Python · 5.5k · 990
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
Language: C++ · 8.3k · 936
opentofu/opentofu
OpenTofu lets you declaratively manage your cloud infrastructure.
Language: Go · 22.8k · 878
iterative/dvc
🦉 ML Experiments and Data Management with Git
Language: Python · 13.7k · 1.2k
pantsbuild/pants
The Pants Build System
Language: Python · 3.3k · 628
interpretml/interpret
Fit interpretable models. Explain blackbox machine learning.
Language: C++ · 6.2k · 725
Lightning-AI/litgpt
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
Language: Python · 10.2k · 1k
promptfoo/promptfoo
Test your prompts, agents, and RAGs. Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line and CI/CD integration.
Language: TypeScript · 4.4k · 324
confident-ai/deepeval
The LLM Evaluation Framework
Language: Python · 3.1k · 246
h2oai/h2ogpt
Private chat with local GPT with document, images, video, etc. 100% private, Apache 2.0. Supports oLLaMa, Mixtral, llama.cpp, and more. Demo: https://gpt.h2o.ai/ https://gpt-docs.h2o.ai/
Language: Python · 11.3k · 1.2k
UKPLab/sentence-transformers
State-of-the-Art Text Embeddings
Language: Python · 15k · 2.4k
99designs/aws-vault
A vault for securely storing and accessing AWS credentials in development environments
Language: Go · 8.4k · 816
NVIDIA/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Language: Python · 11.7k · 2.4k
InternLM/lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
Language: Python · 4.3k · 390
CarperAI/trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
Language: Python · 4.5k · 471
bitsandbytes-foundation/bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
Language: Python · 6.1k · 615
bentoml/OpenLLM
Run any open-source LLM, such as Llama 3.1 or Gemma, as an OpenAI-compatible API endpoint in the cloud.
Language: Python · 9.8k · 626
phlippe/uvadlc_notebooks
Repository of Jupyter notebook tutorials for teaching the Deep Learning Course at the University of Amsterdam (MSc AI), Fall 2023
Language: Jupyter Notebook · 2.5k · 566
huggingface/optimum
🚀 Accelerate training and inference of 🤗 Transformers and 🤗 Diffusers with easy-to-use hardware optimization tools
Language: Python · 2.5k · 447
aws-samples/aiml-genai-multimodal-agent
Language: Jupyter Notebook · 4515
msaroufim/ml-design-patterns
Software Architecture for ML engineers
38231
Sentdex/TermGPT
Giving LLMs like GPT-4 the ability to plan and execute terminal commands
Language: Jupyter Notebook · 40795
aws-powertools/powertools-lambda-python
A developer toolkit to implement Serverless best practices and increase developer velocity.
Language: Python · 2.8k · 389
