ICTNLP
Natural Language Processing Group, Institute of Computing Technology, Chinese Academy of Sciences
Beijing, China
Pinned Repositories
awesome-transformer
A collection of Transformer guides, implementations, and variants.
BayLing
BayLing (百聆) is a LLaMA-based English/Chinese large language model with enhanced language alignment, achieving 90% of ChatGPT's performance on multilingual and general-task benchmarks. It shows superior capability in English/Chinese generation, instruction following, and multi-turn interaction.
DASpeech
Code for NeurIPS 2023 paper "DASpeech: Directed Acyclic Transformer for Fast and High-quality Speech-to-Speech Translation".
DialoFlow
Code for ACL 2021 main conference paper "Conversations Are Not Flat: Modeling the Dynamic Information Flow across Dialogue Utterances".
DSTC8-AVSD
We ranked 1st in the DSTC8 Audio-Visual Scene-Aware Dialog challenge. This is the source code for our IEEE/ACM TASLP (AAAI2020-DSTC8-AVSD) paper "Bridging Text and Video: A Universal Multimodal Transformer for Video-Audio Scene-Aware Dialog".
LLaMA-Omni
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
NAST-S2x
A fast speech-to-speech & speech-to-text translation model that supports simultaneous decoding and offers 28× speedup.
OR-NMT
Source code for ACL 2019 paper "Bridging the Gap between Training and Inference for Neural Machine Translation"
StreamSpeech
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
TruthX
Code for ACL 2024 paper "TruthX: Alleviating Hallucinations by Editing Large Language Models in Truthful Space"
ICTNLP's Repositories
ictnlp/LLaMA-Omni
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
ictnlp/StreamSpeech
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
ictnlp/BayLing
BayLing (百聆) is a LLaMA-based English/Chinese large language model with enhanced language alignment, achieving 90% of ChatGPT's performance on multilingual and general-task benchmarks. It shows superior capability in English/Chinese generation, instruction following, and multi-turn interaction.
ictnlp/TruthX
Code for ACL 2024 paper "TruthX: Alleviating Hallucinations by Editing Large Language Models in Truthful Space"
ictnlp/DASpeech
Code for NeurIPS 2023 paper "DASpeech: Directed Acyclic Transformer for Fast and High-quality Speech-to-Speech Translation".
ictnlp/NAST-S2x
A fast speech-to-speech & speech-to-text translation model that supports simultaneous decoding and offers 28× speedup.
ictnlp/DiSeg
Source code for ACL 2023 paper "End-to-End Simultaneous Speech Translation with Differentiable Segmentation"
ictnlp/ComSpeech
Code for ACL 2024 main conference paper "Can We Achieve High-quality Direct Speech-to-Speech Translation Without Parallel Speech Data?".
ictnlp/HMT
Source code for ICLR 2023 spotlight paper "Hidden Markov Transformer for Simultaneous Machine Translation"
ictnlp/PLUVR
Code for ACL 2022 main conference paper "Neural Machine Translation with Phrase-Level Universal Visual Representations".
ictnlp/CRESS
Code for ACL 2023 main conference paper "Understanding and Bridging the Modality Gap for Speech Translation".
ictnlp/SiLLM
SiLLM is a Simultaneous Machine Translation (SiMT) framework. It uses a large language model as the translation model and a traditional SiMT model for policy decisions, achieving SiMT through their collaboration.
ictnlp/CMOT
Code for ACL 2023 main conference paper "CMOT: Cross-modal Mixup via Optimal Transport for Speech Translation"
ictnlp/TACS
Source code for the paper "Truth-Aware Context Selection: Mitigating the Hallucinations of Large Language Models Being Misled by Untruthful Contexts"
ictnlp/BT4ST
Code for ACL 2023 main conference paper "Back Translation for Speech-to-text Translation Without Transcripts".
ictnlp/Convex-Learning
Code for NeurIPS 2023 paper "Beyond MLE: Convex Learning for Text Generation"
ictnlp/PCFG-NAT
Code for NeurIPS 2023 paper "Non-autoregressive Machine Translation with Probabilistic Context-free Grammar".
ictnlp/CTC-S2UT
Code for ACL 2024 findings paper "CTC-based Non-autoregressive Textless Speech-to-Speech Translation"
ictnlp/SU4MT
Code for EMNLP 2023 paper "Enhancing Neural Machine Translation with Semantic Units"
ictnlp/Multiscale-Contextualization
Code for ACL 2024 paper "Integrating Multi-scale Contextualized Information for Byte-based Neural Machine Translation"
ictnlp/DST
DST is a decoder-only simultaneous machine translation model that performs policy decisions and translation concurrently.
ictnlp/Auto-RAG
ictnlp/SAMMT
Code for EMNLP 2023 paper "Bridging the Gap between Synthetic and Authentic Images for Multimodal Machine Translation"
ictnlp/ComSpeech-Site
ictnlp/SemLing-MNMT
Code for ACL 2024 paper "Improving Multilingual Neural Machine Translation by Utilizing Semantic and Linguistic Features".
ictnlp/StreamSpeech-site
ictnlp/MoCE
Code for the paper "MoCE: Adaptive Mixture of Contextualization Experts for Byte-based Neural Machine Translation"
ictnlp/TruthX-site
ictnlp/LengthBiasDNMT
ictnlp/TA-AT
Official code for AAAI 2024 paper "TA&AT: Enhancing Task-Oriented Dialog with Turn-Level Auxiliary Tasks and Action-Tree Based Scheduled Sampling"