Issues
Linker Failing when Building Python Wheel
#1838 opened by ab-tools - 1
adopt modern bert please!
#1837 opened by BBC-Esq - 1
falcon3 support please?
#1834 opened by BBC-Esq - 0
add exaone pretty please?
#1836 opened by BBC-Esq - 0
Phi4 support please?
#1835 opened by BBC-Esq - 0
FP16 support for CPU devices
#1833 opened by Narfi03 - 0
[QUESTION] [MULTI-TO-MULTI] How to specify source and target language with mBART (multi-to-multi) from C++ to translate text...
#1831 opened by wcdr - 1
Reintroduce support for Compute Capability 5.0
#1765 opened by giuliopaci - 11
CUDNN 9 support
#1780 opened by AndrewMead10 - 25
Cannot install CTranslate2
#1817 opened by victorwoo - 0
OPENCL / CLBLAST support?
#1829 opened by 0wwafa - 0
Support X-ALMA 🤖 Quality Translation at Scale
#1828 opened by carolinaxxxxx - 2
Support EuroLLM 🤖
#1827 opened by carolinaxxxxx - 7
No logits returned after generate
#1779 opened by LAnCeBabY - 2
libcudnn_cnn problem
#1826 opened by 781574155 - 2
Whisper model returns empty logits array
#1822 opened by MahmoudAshraf97 - 1
Use multiple GPUs to process queue
#1816 opened by theodufort - 6
Performance Regression in Whisper models when timestamp generation is enabled
#1783 opened by MahmoudAshraf97 - 1
RuntimeError: CUDA failed with error CUDA driver version is insufficient for CUDA runtime version
#1814 opened by SwAt1563 - 1
Possible premature temporary removal of flash attention?
#1809 opened by BBC-Esq - 2
Hope Moshi model can be supported
#1813 opened by YLQY - 0
pkg_resources going to be phased out I think?
#1812 opened by BBC-Esq - 3
correct docs regarding flash attention
#1777 opened by BBC-Esq - 20
A question regarding Faster Whisper
#1802 opened by MahmoudAshraf97 - 0
Use C++ for text generation tasks
#1804 opened by JocelynPanPan - 1
How to use 4-bit AWQ?
#1776 opened by BBC-Esq - 3
Release 4.4.0 and flash attention with python [WIP]
#1775 opened by BBC-Esq - 0
How to deploy the NLLB model exported from CTranslate2 framework to Android?
#1799 opened by LiPengtao0504 - 3
Mistral-Nemo not working
#1793 opened by BBC-Esq - 0
NLLB translation mask or skip some tokens
#1798 opened by ekmekovski - 0
OpenBLAS ERROR
#1797 opened by JocelynPanPan - 2
Error when converting llama-3.2-11b-vision-instruct
#1794 opened by AlexMisiulia - 6
Using a partitioned A100 GPU via MIG with device_index and faster_index causing ctranslate2 error
#1788 opened by johnrisby - 0
Missing converter : XLMRobertaFlashConfig
#1790 opened by ExtReMLapin - 0
Model is twice as large on first load
#1787 opened by winstxnhdw - 2
How to early stop an encoding call?
#1768 opened by mariano54 - 1
build failed on jetson agx orin (Error generating file: build/CMakeFiles/ctranslate2.dir/src/ops/flash-attention/./ctranslate2_generated_flash_fwd_split_hdim96_fp16_sm80.cu.o)
#1771 opened by cyu021 - 0
Can I convert a ctranslate2 model to onnx?
#1773 opened by ashwingopinath - 2
Error while converting to CTranslate2 from OpenNMT-py.
#1767 opened by aryan1165 - 0
Failed to convert microsoft/Phi-3-medium-128k-instruct
#1764 opened by rbgo404 - 0
Support for DeepSeek models
#1763 opened by ByteForge786 - 0