namhkoh

Machine Learning Engineer @NVIDIA & Ph.D. Candidate @KAIST-AILab

NVIDIA, Mountain View

namhkoh's Stars

ocornut/imgui
Dear ImGui: Bloat-free Graphical User interface for C++ with minimal dependencies
Language: C++ 62.1k 1k 6.1k 10.4k
XingangPan/DragGAN
Official Code for DragGAN (SIGGRAPH 2023)
Language: Python 35.8k 990 1903.5k
babysor/MockingBird
🚀 AI voice cloning: clone a voice in 5 seconds and generate arbitrary speech content in real time
Language: Python 35.6k 306 8855.2k
tatsu-lab/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
Language: Python 29.7k 344 2694.1k
Vision-CAIR/MiniGPT-4
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
Language: Python 25.5k 218 4692.9k
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Language: Python 20.9k 158 1.6k 2.3k
mml-book/mml-book.github.io
Companion webpage to the book "Mathematics For Machine Learning"
Language: Jupyter Notebook 13.4k 479 7762.5k
BradyFU/Awesome-Multimodal-Large-Language-Models
✨✨ Latest Advances on Multimodal Large Language Models
13.3k 257 128839
PaddlePaddle/PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
Language: Python 11.3k 185 1.9k 1.9k
mlfoundations/open_clip
An open source implementation of CLIP.
Language: Python 10.7k 80 5041k
smol-ai/GodMode
AI Chat Browser: Fast, Full webapp access to ChatGPT / Claude / Bard / Bing / Llama2! I use this 20 times a day.
Language: TypeScript 4.2k 42 161329
InternLM/InternLM-XComposer
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
Language: Python 2.7k 43 400161
nateshmbhat/pyttsx3
Offline Text To Speech synthesis for python
Language: Python 2.2k 40 282338
lucidrains/CoCa-pytorch
Implementation of CoCa, Contrastive Captioners are Image-Text Foundation Models, in Pytorch
Language: Python 1.1k 14 1889
acl-org/acl-style-files
Official style files for papers submitted to venues of the Association for Computational Linguistics
Language: TeX 804 8 27194
jayleicn/ClipBERT
[CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning on image-text and video-text tasks.
Language: Python 713 10 5986
NVIDIAGameWorks/Streamline
Streamline Integration Framework
Language: C 419 53 4293
LAION-AI/CLIP-based-NSFW-Detector
Language: Python 326 4 1328
jaywonchung/reason
A shell for research papers
Language: Rust 193 4 55
cpeikert/TheoryOfCryptography
Lecture notes for Chris Peikert's graduate-level Theory of Cryptography course
Language: TeX 164 15 236
danielgordon10/thor-iqa-cvpr-2018
Repository containing code for the paper "IQA: Visual Question Answering in Interactive Environments"
Language: Python 125 7 729
PKU-Alignment/beavertails
BeaverTails is a collection of datasets designed to facilitate research on safety alignment in large language models (LLMs).
Language: Makefile 117 6 75
icbcbicc/IQA-Dataset
A unified interface for downloading and loading popular Image Quality Assessment (IQA) datasets.
Language: Python 115 1 410
apple/pytorch-speech-features
Language: Python 85 8 011
poori-nuna/HOD-Benchmark-Dataset
HOD: A Benchmark Dataset for Harmful Object Detection
Language: Jupyter Notebook 30 3 42
Maxlinn/CHAIR-metric-standalone
CHAIR metric is a rule-based metric for evaluating object hallucination in caption generation.
Language: Python 25 1 00
Heidelberg-NLP/MM-SHAP
This is the official implementation of the paper "MM-SHAP: A Performance-agnostic Metric for Measuring Multimodal Contributions in Vision and Language Models & Tasks"
Language: Python 22 3 94
bryant1410/slurm-cheatsheet
12 3 12
namhkoh/BAD-BiAs-Detection-in-LLMs
BAD: BiAs Detection for Large Language Models in the context of candidate screening (EECS 692)
Language: Jupyter Notebook 12 1 02
MichiganNLP/Scalable-VLM-Probing
Probe Vision-Language Models
Language: Python 5 2 01
