LancoPKU

Language Computing and Machine Learning Group (Xu Sun's group) at Peking University

Peking University, Beijing

Pinned Repositories

Chinese-Literature-NER-RE-Dataset
A Discourse-Level Named Entity Recognition and Relation Extraction Dataset for Chinese Literature Text
418 19 885
DPGAN
Diversity-Promoting Generative Adversarial Network for Generating Informative and Diversified Text (EMNLP 2018)
Language: Python 145 12 1037
Global-Encoding
Global Encoding for Abstractive Summarization (ACL 2018)
Language: Python 273 12 3665
Graph-to-seq-comment-generation
Code for the paper "Coherent Comments Generation for Chinese Articles with a Graph-to-Sequence Model"
Language: Python 174 11 1437
label-words-are-anchors
Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning
Language: Python 160 2 2913
pkuseg-python
pkuseg: a toolkit for multi-domain Chinese word segmentation
Language: Python 6.6k 206 168989
SGM
Sequence Generation Model for Multi-label Classification (COLING 2018)
Language: Python 437 15 34112
SU4MLC
Code for the article "Semantic-Unit-Based Dilated Convolution for Multi-Label Text Classification" (EMNLP 2018)
Language: Python 153 9 1229
superAE
Code for "Autoencoder as Assistant Supervisor: Improving Text Representation for Chinese Social Media Text Summarization"
Language: Python 136 8 1846
text-autoaugment
[EMNLP 2021] Text AutoAugment: Learning Compositional Augmentation Policy for Text Classification
Language: Python 127 3 1016

LancoPKU's Repositories

lancopku/pkuseg-python
pkuseg: a toolkit for multi-domain Chinese word segmentation
Language: Python 6.6k 206 168989
lancopku/label-words-are-anchors
Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning
Language: Python 160 2 2913
lancopku/text-autoaugment
[EMNLP 2021] Text AutoAugment: Learning Compositional Augmentation Policy for Text Classification
Language: Python 127 3 1016
lancopku/AdaMod
Adaptive and Momental Bounds for Adaptive Learning Rate Methods.
Language: Python 126 9 224
lancopku/meProp
meProp: Sparsified Back Propagation for Accelerated Deep Learning (ICML 2017)
Language: C# 110 11 320
lancopku/Prime
A simple module that consistently outperforms self-attention and the Transformer model on major NMT datasets, with SoTA performance.
Language: Python 86 11 99
lancopku/agent-backdoor-attacks
Code and data for the paper "Watch Out for Your Agents! Investigating Backdoor Threats to LLM-Based Agents" [NeurIPS 2024]
Language: Python 62 3 62
lancopku/Explicit-Sparse-Transformer
Code for Explicit Sparse Transformer
Language: Python 60 5 212
lancopku/well-classified-examples-are-underestimated
Code for the AAAI 2022 publication "Well-classified Examples are Underestimated in Classification with Deep Neural Networks"
Language: Jupyter Notebook 50 4 02
lancopku/DynamicKD
Code for EMNLP 2021 main conference paper "Dynamic Knowledge Distillation for Pre-trained Language Models"
Language: Python 40 5 16
lancopku/codable-watermarking-for-llm
Repository for Towards Codable Watermarking for Large Language Models
Language: Python 35 3 53
lancopku/IAIS
[ACL 2021] Learning Relation Alignment for Calibrated Cross-modal Retrieval
Language: Python 30 3 04
lancopku/RAP
Code for the paper "RAP: Robustness-Aware Perturbations for Defending against Backdoor Attacks on NLP Models" (EMNLP 2021)
Language: Python 24 3 12
lancopku/CGM
Code for IJCAI 2021 main conference paper "Long-term, Short-term and Sudden Event: Trading Volume Movement Prediction with Graph-based Multi-view Modeling"
Language: Python 23 1 310
lancopku/clip-openness
[ACL 2023] Delving into the Openness of CLIP
Language: Python 23 2 21
lancopku/SOS
Code for the paper "Rethinking Stealthiness of Backdoor Attack against NLP Models" (ACL-IJCNLP 2021)
Language: Jupyter Notebook 22 2 14
lancopku/MUKI
[Findings of EMNLP 2022] From Mimicking to Integrating: Knowledge Integration for Pre-Trained Language Models
Language: Python 19 2 20
lancopku/Avg-Avg
[Findings of EMNLP 2022] Holistic Sentence Embeddings for Better Out-of-Distribution Detection
Language: Python 18 2 25
lancopku/ChineseNER
Code for "Cross-Domain and Semi-Supervised Named Entity Recognition in Chinese Social Media: A Unified Model"
Language: Python 17 7 44
lancopku/DCKD
Code and data for "Distributional Correlation–Aware Knowledge Distillation for Stock Trading Volume Prediction" (ECML-PKDD 2022)
Language: Python 14 3 02
lancopku/CascadeBERT
Code for CascadeBERT, Findings of EMNLP 2021
Language: Python 12 2 11
lancopku/FedMNMT
[Findings of ACL 2023] Communication Efficient Federated Learning for Multilingual Machine Translation with Adapter
Language: Python 12 2 01
lancopku/DAN
[Findings of EMNLP 2022] Expose Backdoors on the Way: A Feature-Based Efficient Defense against Textual Backdoor Attacks
Language: Python 10 1 10
lancopku/GKD
Language: Python 5 2 00
lancopku/Attention-Augmentation
Language: Python 2 2 0
lancopku/GNOME
Code for the EACL 2023 paper "Fine-Tuning Deteriorates General Textual Out-of-Distribution Detection by Distorting Task-Agnostic Features"
1 1 00
lancopku/MR-VPC
Towards Multimodal Video Paragraph Captioning Models Robust to Missing Modality
1 1 0
lancopku/FedGLAD
1 0
lancopku/LaDiC
[NAACL 2024] LaDiC: Are Diffusion Models Really Inferior to Autoregressive Counterparts for Image-to-text Generation?
Language: Python 0 0
lancopku/lancopku.github.io
Lanco Lab website
Language: SCSS 3 0