jshi31's Stars
Significant-Gravitas/AutoGPT
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
CompVis/stable-diffusion
A latent text-to-image diffusion model
binary-husky/gpt_academic
为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, moss等。
comfyanonymous/ComfyUI
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
facebookresearch/segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Stability-AI/stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Hannibal046/Awesome-LLM
Awesome-LLM: a curated list of Large Language Model
tloen/alpaca-lora
Instruct-tune LLaMA on consumer hardware
facebookresearch/ImageBind
ImageBind One Embedding Space to Bind Them All
Instruction-Tuning-with-GPT-4/GPT-4-LLM
Instruction Tuning with GPT-4
baichuan-inc/Baichuan2
A series of large language models developed by Baichuan Intelligent Technology
mosaicml/llm-foundry
LLM training code for Databricks foundation models
mlfoundations/open_flamingo
An open-source framework for training large multimodal models.
Docta-ai/docta
A Doctor for your data
google/prompt-to-prompt
Alpha-VLLM/LLaMA2-Accessory
An Open-source Toolkit for LLM Development
lamini-ai/lamini
The Official Python Client for Lamini's API
Yutong-Zhou-cv/Awesome-Text-to-Image
(ෆ`꒳´ෆ) A Survey on Text-to-Image Generation/Synthesis.
bloc97/CrossAttentionControl
Unofficial implementation of "Prompt-to-Prompt Image Editing with Cross Attention Control" with Stable Diffusion
AGI-Edgerunners/LLM-Adapters
Code for our EMNLP 2023 Paper: "LLM-Adapters: An Adapter Family for Parameter-Efficient Fine-Tuning of Large Language Models"
PRIV-Creation/Awesome-Controllable-T2I-Diffusion-Models
A collection of resources on controllable generation with text-to-image diffusion models.
allenai/mmc4
MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.
SunzeY/AlphaCLIP
[CVPR 2024] Alpha-CLIP: A CLIP Model Focusing on Wherever You Want
mlfoundations/datacomp
DataComp: In search of the next generation of multimodal datasets
loonghao/photoshop-python-api
Python API for Photoshop.
NVlabs/genvs
allenai/unified-io-2
qnzhou/Mosaic
Simple greedy image packing