yihong-chen's Stars
meta-llama/llama
Inference code for Llama models
xai-org/grok-1
Grok open release
karpathy/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
meta-llama/llama3
The official Meta Llama 3 GitHub site
google/python-fire
Python Fire is a library for automatically generating command line interfaces (CLIs) from absolutely any Python object.
Sinaptik-AI/pandas-ai
Chat with your database (SQL, CSV, pandas, polars, mongodb, noSQL, etc). PandasAI makes data analysis conversational using LLMs (GPT 3.5 / 4, Anthropic, VertexAI) and RAG.
triton-lang/triton
Development repository for the Triton language and compiler
openai/tiktoken
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
facebookresearch/seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
skypilot-org/skypilot
SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 12+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.
joerick/pyinstrument
🚴 Call stack profiler for Python. Shows you why your code is slow!
Lightning-AI/lit-llama
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
camel-ai/camel
🐫 CAMEL: Finding the Scaling Law of Agents. The first and the best multi-agent framework. https://www.camel-ai.org
google/gemma_pytorch
The official PyTorch implementation of Google's Gemma models
loro-dev/loro
Make your JSON data collaborative and version-controlled with CRDTs
adbar/trafilatura
Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML
mark-when/markwhen
Make a cascading timeline from markdown-like text. Supports simple American/European date styles, ISO8601, images, links, locations, and more.
dashingsoft/pyarmor
A tool used to obfuscate python scripts, bind obfuscated scripts to fixed machine or expire obfuscated scripts.
freedmand/semantra
Multi-tool for semantic search
microsoft/DiskANN
Graph-structured Indices for Scalable, Fast, Fresh and Filtered Approximate Nearest Neighbor Search
yaodongC/awesome-instruction-dataset
A collection of open-source dataset to train instruction-following LLMs (ChatGPT,LLaMA,Alpaca)
srush/llama2.rs
A fast llama2 decoder in pure Rust.
loro-dev/crdt-richtext
Rich text CRDT that implements Peritext and Fugue
lindermanlab/S5
DeepGraphLearning/AStarNet
Official implementation of A* Networks
openhackathons-org/nways_accelerated_programming
N-Ways to GPU Programming Bootcamp
april-tools/gekcs
How to Turn Your Knowledge Graph Embeddings into Generative Models
apartresearch/Neuron2Graph
Tools for exploring Transformer neuron behaviour, including input pruning and diversification.
biochunan/AsEP-dataset
NeurIPS 2024 Dataset and Benchmark Submission "AsEP: Benchmarking Deep Learning Methods for Antibody-specific Epitope Prediction"
bill-shen-BS/PISTOL