danitico's Stars
run-llama/llama_index
LlamaIndex is a data framework for your LLM applications
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Mozilla-Ocho/llamafile
Distribute and run LLMs with a single file.
dolthub/dolt
Dolt – Git for Data
ml-explore/mlx
MLX: An array framework for Apple silicon
valkey-io/valkey
A flexible distributed key-value datastore that is optimized for caching and other realtime workloads.
huggingface/candle
Minimalist ML framework for Rust
BerriAI/litellm
Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]
FlagOpen/FlagEmbedding
Retrieval and Retrieval-augmented LLMs
microsoft/promptbase
All things prompt engineering
nmslib/hnswlib
Header-only C++/python library for fast approximate nearest neighbors
pytorch/serve
Serve, optimize and scale PyTorch models in production
huggingface/optimum
🚀 Accelerate training and inference of 🤗 Transformers and 🤗 Diffusers with easy to use hardware optimization tools
bclavie/RAGatouille
Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.
pytest-dev/pytest-xdist
pytest plugin for distributed testing and loop-on-failures testing modes.
Lightning-AI/lightning-thunder
Make PyTorch models up to 40% faster! Thunder is a source to source compiler for PyTorch. It enables using different hardware executors at once; across one or thousands of GPUs.
aws/deep-learning-containers
AWS Deep Learning Containers are pre-built Docker images that make it easier to run popular deep learning frameworks and tools on AWS.
pyspark-ai/pyspark-ai
English SDK for Apache Spark
BerriAI/reliableGPT
Get 100% uptime, reliability from OpenAI. Handle Rate Limit, Timeout, API, Keys Errors
youtype/mypy_boto3_builder
Type annotations builder for boto3 compatible with VSCode, PyCharm, Emacs, Sublime Text, pyright and mypy.
oughtinc/ice
Interactive Composition Explorer: a debugger for compositional language model programs
run-llama/finetune-embedding
Fine-Tuning Embedding for RAG with Synthetic Data
Cranial-XIX/llm-pddl
castorini/docTTTTTquery
docTTTTTquery document expansion model
RUC-GSAI/YuLan-Rec
cansik/onnxruntime-silicon
ONNX Runtime prebuilt wheels for Apple Silicon (M1 / M2 / M3 / ARM64)
brendenlake/MLC
Meta-Learning for Compositionality (MLC) for modeling human behavior
newrelic/nr-openai-observability
Easy to install OpenAI GPT monitoring tool.
vispana/vispana
Web client for Vespa.ai
pehrs/vscode-vespa
Vespa AI Extension for Visual Studio Code