GregxmHu

PhD student in Computer Science.

The Chinese University of Hong KongHong Kong SAR

Pinned Repositories

AutoDAN
The official implementation of our paper "AutoDAN: Generating Stealthy Jailbreak Prompts on Aligned Large Language Models".
Language:Python00
Deeplearning-Lab4
LSTM, Text Classification, Chinese Word Embeddings, Rnn Encoder-Decoder, Temperature Series Prediction
Language:Python1 1 01
Gradient-Cuff
Code for our NeurIPS2024 accepted paper: Gradient Cuff: Detecting Jailbreak Attacks on Large Language Models by Exploring Refusal Loss Landscapes
Language:Python4 1 00
llm_attack
Language:Python1 1 00
OpenMatch
An Open-Source Package for Information Retrieval.
Language:Python10
promptbench
A robustness evaluation framework for large language models on adversarial prompts
Language:Python1 0 00
Unified-Prompt-based-Framework-for-Multi-task-training
T5 Prompt Tuning
Language:Python8 2 02
Gradient-Cuff
Repo for NeurIPS 2024 paper "Gradient Cuff: Detecting Jailbreak Attacks on Large Language Models by Exploring Refusal Loss Landscapes"
Language:Python10
RADAR
Code for our NeurIPS2023 accepted paper: RADAR: Robust AI-Text Detection via Adversarial Learning. We tested RADAR on 8 LLMs including Vicuna and LLaMA. The results show that RADAR can attain good detection performance on LLM-generated AI-text while being robust against paraphrasing.
Language:Jupyter Notebook41 6 41
P3Ranker
Code for our SIGIR 2022 accepted paper : P3 Ranker: Mitigating the Gaps between Pre-training and Ranking Fine-tuning with Prompt-based Learning and Pre-finetuning
Language:Python17 1 02

GregxmHu's Repositories

GregxmHu/Unified-Prompt-based-Framework-for-Multi-task-training
T5 Prompt Tuning
Language:Python8 2 02
GregxmHu/Gradient-Cuff
Code for our NeurIPS2024 accepted paper: Gradient Cuff: Detecting Jailbreak Attacks on Large Language Models by Exploring Refusal Loss Landscapes
Language:Python4 1 00
GregxmHu/Deeplearning-Lab4
LSTM, Text Classification, Chinese Word Embeddings, Rnn Encoder-Decoder, Temperature Series Prediction
Language:Python1 1 01
GregxmHu/llm_attack
Language:Python1 1 00
GregxmHu/promptbench
A robustness evaluation framework for large language models on adversarial prompts
Language:Python1 0 00
GregxmHu/AutoDAN
The official implementation of our paper "AutoDAN: Generating Stealthy Jailbreak Prompts on Aligned Large Language Models".
Language:Python00
GregxmHu/ConvDR
Code repo for SIGIR 2021 paper "Few-Shot Conversational Dense Retrieval"
Language:Python0 0 00
GregxmHu/CSrankings-1
A web app for ranking computer science departments according to their research output in selective venues.
Language:Python0 0 00
GregxmHu/Deeplearning-Lab3
VGG, Resnet, SE-Block.
Language:Python0 1 00
GregxmHu/detect-gpt
DetectGPT: Zero-Shot Machine-Generated Text Detection using Probability Curvature
Language:Python0 0
GregxmHu/gpl
Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: Generative Pseudo Labeling for Unsupervised Domain Adaptation of Dense Retrieval" https://arxiv.org/abs/2112.07577
Language:Python0 0
GregxmHu/gpt-2-output-dataset
Dataset of GPT-2 outputs for research in detection, biases, and more
GregxmHu/GregxmHu.github.io
Language:HTML1 0
GregxmHu/IO-FP
Intelligent Optimization - Final Project
Language:Python1 0
GregxmHu/JavaFamily
【Java面试+Java学习指南】一份涵盖大部分Java程序员所需要掌握的核心知识。
0 0
GregxmHu/KB-QA
KB-QA, Knowledge Graph. Neo-4j, Jena , Graph Database
Language:Python1 0
GregxmHu/latent-jailbreak
0 0
GregxmHu/llm-sp
Papers and resources related to the security and privacy of LLMs 🤖
GregxmHu/loss-landscape
Code for visualizing the loss landscape of neural nets
GregxmHu/ml-visuals
🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.
GregxmHu/my_dr
Dense Retrieval pipeline grounding with sentence-transformers.
Language:Python1 01
GregxmHu/P3Ranker
Code for our SIGIR 2022 accepted paper : P3 Ranker: Mitigating the Gaps between Pre-training and Ranking Fine-tuning with Prompt-based Learning and Pre-finetuning
GregxmHu/pyserini
Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.
Language:Python0 0
GregxmHu/Quick-BM25
Language:Python
GregxmHu/sgpt
SGPT: GPT Sentence Embeddings for Semantic Search
Language:Jupyter Notebook0 0
GregxmHu/TAP
TAP: An automated jailbreaking method for black-box LLMs
Language:Python0 0
GregxmHu/tevatron
Tevatron - A flexible toolkit for dense retrieval research and development.
Language:Python0 0
GregxmHu/TextGAIL
Language:Python0 0
GregxmHu/TransE
Implementation of TransE (nips 2013), and its simple usage
Language:Python1 0
GregxmHu/VecJBDet