WilliamLLee's Stars
pengsongyou/openscene
[CVPR'23] OpenScene: 3D Scene Understanding with Open Vocabularies
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
ollama/ollama
Get up and running with Llama 3, Mistral, Gemma, and other large language models.
dwave-examples/3d-bin-packing
Use a hybrid solver to pack items with different dimensions into the minimum number of bins
FoundationVision/VAR
[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
kyegomez/Python-Package-Template
An easy, reliable, fluid template for Python packages, complete with docs, testing suites, READMEs, GitHub workflows, linting, and much more
lizongying/my-tv
My TV: live TV streaming software, ready to use once installed
kyegomez/ScreenAI
Implementation of the ScreenAI model from the paper: "A Vision-Language Model for UI and Infographics Understanding"
OpenBMB/MiniCPM
MiniCPM-2B: An end-side LLM outperforming Llama2-13B.
AGI-Edgerunners/LLM-Agents-Papers
A repo listing papers related to LLM-based agents
greyovo/PicQuery
🔍 Search local images with natural language on Android, powered by OpenAI's CLIP model.
QwenLM/Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
state-spaces/mamba
Mamba SSM architecture
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
HumanAIGC/EMO
Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
karpathy/minbpe
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
windingwind/zotero-actions-tags
Customize your Zotero workflow.
Ikaros-521/AI-Vtuber
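AI Vtuber is a virtual streamer [Live2D/UE/xuniren] driven by [ChatterBot/ChatGPT/claude/langchain/chatglm/text-gen-webui/Wenda (闻达)/Qwen (千问)/kimi/ollama] that can interact with viewers in real time during live streams on [Bilibili/Douyin/Kuaishou/WeChat Channels/Pinduoduo/Douyu/YouTube/twitch/TikTok], or chat directly on a local machine. It uses TTS [edge-tts/VITS/elevenlabs/bark/bert-vits2/睿声] to voice its replies, with optional voice conversion via [so-vits-svc/DDSP-SVC]; commands can also trigger SD image generation.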
VL-Group/PENET
[CVPR 2023] Official PyTorch code for the paper "Prototype-based Embedding Network for Scene Graph Generation"
AIGCDesignGroup/ReplaceAnything
HKUDS/LLMRec
[WSDM'2024 Oral] "LLMRec: Large Language Models with Graph Augmentation for Recommendation"
stefan-jansen/zipline-reloaded
Zipline, a Pythonic Algorithmic Trading Library
Pythagora-io/gpt-pilot
The first real AI developer
AI4Finance-Foundation/FinRL-Meta
FinRL-Meta: Dynamic datasets and market environments for FinRL.
AI4Finance-Foundation/FinGPT
FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.
AI4Finance-Foundation/FinRL
FinRL: Financial Reinforcement Learning. 🔥
rlqja1107/torch-LLM4SGG
Official PyTorch implementation of LLM4SGG: Large Language Models for Weakly Supervised Scene Graph Generation, accepted at CVPR 2024
fudan-zvg/Semantic-Segment-Anything
Automated dense category annotation engine that serves as the initial semantic labeling for the Segment Anything dataset (SA-1B).
enoche/MultimodalRecSys
A curated list of awesome resources about multimodal recommender systems.
enoche/MMRec
A Toolbox for MultiModal Recommendation. Integrating 10+ Models...