justinthelaw's Stars
ollama/ollama
Get up and running with Llama 3.3, Mistral, Gemma 2, and other large language models.
ggerganov/llama.cpp
LLM inference in C/C++
open-webui/open-webui
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
microsoft/AI-For-Beginners
12 Weeks, 24 Lessons, AI for All!
AIHawk-FOSS/Auto_Jobs_Applier_AI_Agent
Auto_Jobs_Applier_AI_Agent aims to easy job hunt process by automating the job application process. Utilizing artificial intelligence, it enables users to apply for multiple jobs in an automated and personalized way.
timescale/timescaledb
An open-source time-series SQL database optimized for fast ingest and complex queries. Packaged as a PostgreSQL extension.
DS4SD/docling
Get your documents ready for gen AI
BerriAI/litellm
Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]
mrdbourke/pytorch-deep-learning
Materials for the Learn PyTorch for Deep Learning: Zero to Mastery course.
canonical/microk8s
MicroK8s is a small, fast, single-package Kubernetes for datacenters and the edge.
langchain-ai/langgraph
Build resilient language agents as graphs.
sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
Portkey-AI/gateway
A blazing fast AI Gateway with integrated guardrails. Route to 200+ LLMs, 50+ AI Guardrails with 1 fast & friendly API.
k8sgpt-ai/k8sgpt
Giving Kubernetes Superpowers to everyone
huggingface/datatrove
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
tensorchord/pgvecto.rs
Scalable, Low-latency and Hybrid-enabled Vector Search in Postgres. Revolutionize Vector Search, not Database.
ray-project/kuberay
A toolkit to run Ray applications on Kubernetes
KruxAI/ragbuilder
A toolkit to create optimal Production-readyRetrieval Augmented Generation(RAG) setup for your data
s3ql/s3ql
a full featured file system for online data storage
triton-inference-server/tensorrtllm_backend
The Triton TensorRT-LLM Backend
aidar-freeed/ai-codereviewer
AI Code Reviewer: Enhance your GitHub workflow with AI-powered code review! Get intelligent feedback and suggestions on pull requests using OpenAI's GPT-4 API, improving code quality and saving developers time.
distantmagic/paddler
Stateful load balancer custom-tailored for llama.cpp 🏓🦙
tailscale/github-action
A GitHub Action to connect your workflow to your Tailscale network.
arcee-ai/DistillKit
An Open Source Toolkit For LLM Distillation
arcee-ai/DALM
Domain Adapted Language Modeling Toolkit - E2E RAG
arcee-ai/PruneMe
Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models
akx/ggify
Tool to download models from Huggingface Hub and convert them to GGML/GGUF for llama.cpp
mozilla-ai/lumigator
Source code for Mozilla.ai's Lumigator platform
gpustack/gguf-parser-go
Review/Check GGUF files and estimate the memory usage and maximum tokens per second.
NickCrews/llama-cpp-server-python
Bootstrap a server from llama-cpp in a few lines of python