yueyericardo's Stars
myshell-ai/OpenVoice
Instant voice cloning by MIT and MyShell.
meta-llama/llama3
The official Meta Llama 3 GitHub site
karpathy/llm.c
LLM training in simple, raw C/CUDA
karpathy/llama2.c
Inference Llama 2 in one file of pure C
ml-explore/mlx
MLX: An array framework for Apple silicon
ItzCrazyKns/Perplexica
Perplexica is an AI-powered search engine. It is an open-source alternative to Perplexity AI.
naklecha/llama3-from-scratch
llama3 implementation one matrix multiplication at a time
khoj-ai/khoj
Your AI second brain. Get answers to your questions, whether they are online or in your own notes. Use online AI models (e.g. gpt4) or private, local LLMs (e.g. llama3). Self-host locally or use our cloud instance. Access from Obsidian, Emacs, Desktop app, Web or WhatsApp.
phidatahq/phidata
Build AI Assistants with memory, knowledge and tools.
nlpxucan/WizardLM
LLMs built upon Evol-Instruct: WizardLM, WizardCoder, WizardMath
karpathy/minbpe
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
LargeWorldModel/LWM
ml-explore/mlx-examples
Examples in the MLX framework
google-deepmind/graphcast
bbycroft/llm-viz
3D Visualization of a GPT-style LLM
open-mpi/ompi
Open MPI main development repository
tairov/llama2.mojo
Inference Llama 2 in one file of pure 🔥
google-deepmind/penzai
A JAX research toolkit for building, editing, and visualizing neural networks.
CaliCastle/cali.so
Open-source project for Cali's personal website
google-deepmind/materials_discovery
pytorch/benchmark
TorchBench is a collection of open source benchmarks used to evaluate PyTorch performance.
ur-whitelab/chemcrow-public
Chemcrow
olcf/cuda-training-series
Training materials associated with NVIDIA's CUDA Training Series (www.olcf.ornl.gov/cuda-training-series/)
Holmeswww/AgentKit
An intuitive LLM prompting framework for multifunctional agents, by explicitly constructing a complex "thought process" from simple natural language prompts.
westpa/westpa
WESTPA: The Weighted Ensemble Simulation Toolkit with Parallelization and Analysis
usyd-fsalab/fp6_llm
Efficient GPU support for LLM inference with x-bit quantization (e.g. FP6, FP5).
AlibabaResearch/flash-llm
Flash-LLM: Enabling Cost-Effective and Highly-Efficient Large Generative Model Inference with Unstructured Sparsity
bhosmer/mm
joey00072/ohara
Collection of autoregressive model implementations