LancoPKU
Language Computing and Machine Learning Group (Xu Sun's group) at Peking University
Peking University, Beijing
Pinned Repositories
Chinese-Literature-NER-RE-Dataset
A Discourse-Level Named Entity Recognition and Relation Extraction Dataset for Chinese Literature Text
DPGAN
Diversity-Promoting Generative Adversarial Network for Generating Informative and Diversified Text (EMNLP2018)
Global-Encoding
Global Encoding for Abstractive Summarization (ACL 2018)
Graph-to-seq-comment-generation
Code for the paper ``Coherent Comments Generation for Chinese Articles with a Graph-to-Sequence Model''
label-words-are-anchors
Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning
pkuseg-python
pkuseg多领域中文分词工具; The pkuseg toolkit for multi-domain Chinese word segmentation
SGM
Sequence Generation Model for Multi-label Classification (COLING 2018)
SU4MLC
Code for the article "Semantic-Unit-Based Dilated Convolution for Multi-Label Text Classification" (EMNLP 2018)
superAE
Code for "Autoencoder as Assistant Supervisor: Improving Text Representation for Chinese Social Media Text Summarization"
text-autoaugment
[EMNLP 2021] Text AutoAugment: Learning Compositional Augmentation Policy for Text Classification
LancoPKU's Repositories
lancopku/pkuseg-python
pkuseg多领域中文分词工具; The pkuseg toolkit for multi-domain Chinese word segmentation
lancopku/label-words-are-anchors
Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning
lancopku/text-autoaugment
[EMNLP 2021] Text AutoAugment: Learning Compositional Augmentation Policy for Text Classification
lancopku/AdaMod
Adaptive and Momental Bounds for Adaptive Learning Rate Methods.
lancopku/meProp
meProp: Sparsified Back Propagation for Accelerated Deep Learning (ICML 2017)
lancopku/Prime
A simple module consistently outperforms self-attention and Transformer model on main NMT datasets with SoTA performance.
lancopku/agent-backdoor-attacks
Code&Data for the paper "Watch Out for Your Agents! Investigating Backdoor Threats to LLM-Based Agents" [NeurIPS 2024]
lancopku/Explicit-Sparse-Transformer
code for Explicit Sparse Transformer
lancopku/well-classified-examples-are-underestimated
Code for the AAAI 2022 publication "Well-classified Examples are Underestimated in Classification with Deep Neural Networks"
lancopku/DynamicKD
Code for EMNLP 2021 main conference paper "Dynamic Knowledge Distillation for Pre-trained Language Models"
lancopku/codable-watermarking-for-llm
Repository for Towards Codable Watermarking for Large Language Models
lancopku/IAIS
[ACL 2021] Learning Relation Alignment for Calibrated Cross-modal Retrieval
lancopku/RAP
Code for the paper "RAP: Robustness-Aware Perturbations for Defending against Backdoor Attacks on NLP Models" (EMNLP 2021)
lancopku/CGM
Code for IJCAI 2021 main conference paper "Long-term, Short-term and Sudden Event: Trading Volume Movement Prediction with Graph-based Multi-view Modeling"
lancopku/clip-openness
[ACL 2023] Delving into the Openness of CLIP
lancopku/SOS
Code for the paper "Rethinking Stealthiness of Backdoor Attack against NLP Models" (ACL-IJCNLP 2021)
lancopku/MUKI
[Findings of EMNLP22] From Mimicking to Integrating: Knowledge Integration for Pre-Trained Language Models
lancopku/Avg-Avg
[Findings of EMNLP 2022] Holistic Sentence Embeddings for Better Out-of-Distribution Detection
lancopku/ChineseNER
Code for "Cross-Domain and Semi-Supervised Named Entity Recognition in Chinese Social Media: A Unified Model"
lancopku/DCKD
Code and data for Distributional Correlation–Aware Knowledge Distillation for Stock Trading Volume Prediction (ECML-PKDD 22)
lancopku/CascadeBERT
Code for CascadeBERT, Findings of EMNLP 2021
lancopku/FedMNMT
[Findings of ACL 2023] Communication Efficient Federated Learning for Multilingual Machine Translation with Adapter
lancopku/DAN
[Findings of EMNLP 2022] Expose Backdoors on the Way: A Feature-Based Efficient Defense against Textual Backdoor Attacks
lancopku/GKD
lancopku/Attention-Augmentation
lancopku/GNOME
Code of the EACL 2023 Paper: Fine-Tuning Deteriorates General Textual Out-of-Distribution Detection by Distorting Task-Agnostic Features
lancopku/MR-VPC
Towards Multimodal Video Paragraph Captioning Models Robust to Missing Modality
lancopku/FedGLAD
lancopku/LaDiC
[NAACL 2024] LaDiC: Are Diffusion Models Really Inferior to Autoregressive Counterparts for Image-to-text Generation?
lancopku/lancopku.github.io
Lanco Lab website