namhkoh's Stars
ocornut/imgui
Dear ImGui: Bloat-free Graphical User interface for C++ with minimal dependencies
XingangPan/DragGAN
Official Code for DragGAN (SIGGRAPH 2023)
babysor/MockingBird
🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time
tatsu-lab/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
Vision-CAIR/MiniGPT-4
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
mml-book/mml-book.github.io
Companion webpage to the book "Mathematics For Machine Learning"
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
PaddlePaddle/PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
mlfoundations/open_clip
An open source implementation of CLIP.
smol-ai/GodMode
AI Chat Browser: Fast, Full webapp access to ChatGPT / Claude / Bard / Bing / Llama2! I use this 20 times a day.
InternLM/InternLM-XComposer
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
nateshmbhat/pyttsx3
Offline Text To Speech synthesis for python
lucidrains/CoCa-pytorch
Implementation of CoCa, Contrastive Captioners are Image-Text Foundation Models, in Pytorch
acl-org/acl-style-files
Official style files for papers submitted to venues of the Association for Computational Linguistics
jayleicn/ClipBERT
[CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning on image-text and video-text tasks.
NVIDIAGameWorks/Streamline
Streamline Integration Framework
LAION-AI/CLIP-based-NSFW-Detector
jaywonchung/reason
A shell for research papers
cpeikert/TheoryOfCryptography
Lecture notes for Chris Peikert's graduate-level Theory of Cryptography course
danielgordon10/thor-iqa-cvpr-2018
Repository containing code for the paper "IQA: Visual Question Answering in Interactive Environments"
PKU-Alignment/beavertails
BeaverTails is a collection of datasets designed to facilitate research on safety alignment in large language models (LLMs).
icbcbicc/IQA-Dataset
A unified interface for downloading and loading popular Image Quality Assessment (IQA) datasets.
apple/pytorch-speech-features
poori-nuna/HOD-Benchmark-Dataset
HOD: A Benchmark Dataset for Harmful Object Detection
Maxlinn/CHAIR-metric-standalone
CHAIR metric is a rule-based metric for evaluating object hallucination in caption generation.
Heidelberg-NLP/MM-SHAP
This is the official implementation of the paper "MM-SHAP: A Performance-agnostic Metric for Measuring Multimodal Contributions in Vision and Language Models & Tasks"
bryant1410/slurm-cheatsheet
namhkoh/BAD-BiAs-Detection-in-LLMs
BAD: BiAs Detection for Large Language Models in the context of candidate screening (EECS 692)
MichiganNLP/Scalable-VLM-Probing
Probe Vision-Language Models