erow
I am pursuing Vision, Speech and Signal Processing PhD, at the University of Surrey, working with self-supervised learning with vision transformers.
erow's Stars
ranjan-mohanty/vfs-appointment-bot
VFS Appointment Bot - This script automates checking for appointments at VFS Global offices in a specified country.
mengjian-github/copilot-analysis
AntixK/PyTorch-Model-Compare
Compare neural networks by their feature similarity
madebyollin/taesd
Tiny AutoEncoder for Stable Diffusion
state-spaces/mamba
Mamba SSM architecture
radarFudan/Awesome-state-space-models
Collection of papers on state-space models
hustvl/Vim
Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
UCSC-VLAA/CLIPA
[NeurIPS 2023] This repository includes the official implementation of our paper "An Inverse Scaling Law for CLIP Training"
AhmedBourouis/Scene-Sketch-Segmentation
Open Vocabulary Semantic Scene Sketch Understanding
MuiseDestiny/zotero-gpt
GPT Meet Zotero.
linfengWen98/CAP-VSTNet
[CVPR 2023] CAP-VSTNet: Content Affinity Preserved Versatile Style Transfer
qiuyu96/CoDeF
[CVPR 2024 Highlight] Official PyTorch implementation of CoDeF: Content Deformation Fields for Temporally Consistent Video Processing
vturrisi/solo-learn
solo-learn: a library of self-supervised methods for visual representation learning powered by Pytorch Lightning
indussky8/awesome-few-shot-learning
A review for latest few-shot learning works
songquanpeng/one-api
OpenAI 接口管理 & 分发系统,支持 Azure、Anthropic Claude、Google PaLM 2 & Gemini、智谱 ChatGLM、百度文心一言、讯飞星火认知、阿里通义千问、360 智脑以及腾讯混元,可用于二次分发管理 key,仅单可执行文件,已打包好 Docker 镜像,一键部署,开箱即用. OpenAI key management & redistribution system, using a single API for all LLMs, and features an English UI.
Explosion-Scratch/claude-unofficial-api
Unofficial API for Claude-2 via Claude Web (Also CLI)
lucidrains/x-transformers
A simple but complete full-attention transformer with a set of promising experimental features from various papers
kennethleungty/Neural-Network-Architecture-Diagrams
Diagrams for visualizing neural network architecture (Created with diagrams.net)
s0md3v/roop
one-click face swap
dair-ai/Prompt-Engineering-Guide
🐙 Guides, papers, lecture, notebooks and resources for prompt engineering
huggingface/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
binary-husky/gpt_academic
为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, moss等。
ashawkey/stable-dreamfusion
Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion.
IDEA-Research/Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
ai-forever/Kandinsky-2
Kandinsky 2 — multilingual text2image latent diffusion model
iperov/DeepFaceLive
Real-time face swap for PC streaming or video calls
langchain-ai/langchain
🦜🔗 Build context-aware reasoning applications
deepset-ai/haystack
:mag: LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
Vision-CAIR/MiniGPT-4
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
Significant-Gravitas/AutoGPT
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.