sara4dev's Stars
langchain-ai/langchain
🦜🔗 Build context-aware reasoning applications
papers-we-love/papers-we-love
Papers from the computer science community to read and discuss.
ollama/ollama
Get up and running with Llama 3, Mistral, Gemma 2, and other large language models.
ggerganov/llama.cpp
LLM inference in C/C++
meta-llama/llama
Inference code for Llama models
ray-project/ray
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
microsoft/autogen
A programming framework for agentic AI. Discord: https://aka.ms/autogen-dc. Roadmap: https://aka.ms/autogen-roadmap
apache/skywalking
APM: Application Performance Monitoring System
karpathy/llm.c
LLM training in simple, raw C/CUDA
qdrant/qdrant
Qdrant - High-performance, massive-scale Vector Database for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
chroma-core/chroma
the AI-native open-source embedding database
AI4Finance-Foundation/FinGPT
FinGPT: Open-Source Financial Large Language Models 🔥. We release the trained models on Hugging Face.
ludwig-ai/ludwig
Low-code framework for building custom LLMs, neural networks, and other AI models
meta-llama/llama-recipes
Scripts for fine-tuning Meta Llama 3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supports a number of candidate inference solutions such as HF TGI and vLLM for local or cloud deployment. Demo apps showcase Meta Llama 3 for WhatsApp & Messenger.
jdx/mise
dev tools, env vars, task runner
rapidsai/cudf
cuDF - GPU DataFrame Library
triton-inference-server/server
The Triton Inference Server provides an optimized cloud and edge inference solution.
Syllo/nvtop
GPU & Accelerator process monitoring for AMD, Apple, Huawei, Intel, NVIDIA and Qualcomm
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
rancher-sandbox/rancher-desktop
Container Management and Kubernetes on the Desktop
NVIDIA/cutlass
CUDA Templates for Linear Algebra Subroutines
ROCm/ROCm
AMD ROCm™ Software - GitHub Home
diggerhq/digger
Digger is an open source IaC orchestration tool. Digger allows you to run IaC in your existing CI pipeline ⚡️
Netflix/bpftop
bpftop provides a dynamic real-time view of running eBPF programs. It displays the average runtime, events per second, and estimated total CPU % for each program.
predibase/lorax
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
ImplFerris/LearnRust
Rust Learning Resources
huggingface/optimum-nvidia
nixys/nxs-universal-chart
A Helm chart you can use to install any of your applications into Kubernetes/OpenShift
bazel-contrib/rules_oci
Bazel rules for building OCI containers
anyscale/ray-summit-2023-training