SwordElucidator
UC Berkeley Computer Science, working for AI startup researching LLMs & AI Agents. AI researches at Stanford University on LM structure & Gen AI.
AI StartupShanghai
SwordElucidator's Stars
Significant-Gravitas/AutoGPT
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
langchain-ai/langchain
🦜🔗 Build context-aware reasoning applications
run-llama/llama_index
LlamaIndex is a data framework for your LLM applications
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
RVC-Boss/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
microsoft/autogen
A programming framework for agentic AI 🤖
Lightning-AI/pytorch-lightning
Pretrain, finetune and deploy AI models on multiple GPUs, TPUs with zero code changes.
statelyai/xstate
Actor-based state management & orchestration for complex app logic.
huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
microsoft/graphrag
A modular graph-based Retrieval-Augmented Generation (RAG) system
THUDM/ChatGLM2-6B
ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型
BlinkDL/RWKV-LM
RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.
bmild/nerf
Code release for NeRF (Neural Radiance Fields)
openai/consistency_models
Official repo for consistency models.
HVision-NKU/StoryDiffusion
Create Magic Story!
NVIDIA/FasterTransformer
Transformer related optimization, including BERT, GPT
baichuan-inc/Baichuan-7B
A large-scale 7B pretraining language model developed by BaiChuan-Inc.
SCIR-HI/Huatuo-Llama-Med-Chinese
Repo for BenTsao [original name: HuaTuo (华驼)], Instruction-tuning Large Language Models with Chinese Medical Knowledge. 本草(原名:华驼)模型仓库,基于中文医学知识的大语言模型指令微调
TigerResearch/TigerBot
TigerBot: A multi-language multi-task LLM
THUDM/P-tuning-v2
An optimized deep prompt tuning strategy comparable to fine-tuning across scales and tasks
yang-song/score_sde_pytorch
PyTorch implementation for Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021, Oral)
horseee/DeepCache
[CVPR 2024] DeepCache: Accelerating Diffusion Models for Free
PharMolix/OpenBioMed
MiAO-AI-Lab/LARP
giannisdaras/ambient-diffusion
[NeurIPS 2023] Official Implementation: "Ambient Diffusion: Learning Clean Distributions from Corrupted Data"
john-hewitt/backpacks-flash-attn
The original Backpack Language Model implementation, a fork of FlashAttention
locuslab/deq-ddim
jzhoubu/vsearch
An Extensible Framework for Retrieval-Augmented LLM Applications: Learning Relevance Beyond Simple Similarity.
SwordElucidator/nanoBackpackLM
The simplest repository for training medium-sized BackpackLM for cs224n
sundamu/aws-sagemaker-llm