1398listener's Stars
amazon-science/m3t-multi-modal-translation-bench
OpenBMB/MiniCPM-V
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
EurekaForNLP/SM2
Source Code for ACL 2024 main conference paper "Self-Modifying State Modeling for Simultaneous Machine Translation".
Candinya/full-stack-in-7-days
⚡ 7天全栈计划
ylsung/Ladder-Side-Tuning
PyTorch codes for "LST: Ladder Side-Tuning for Parameter and Memory Efficient Transfer Learning"
ictnlp/SiLLM
SiLLM is a Simultaneous Machine Translation (SiMT) Framework. It utilizes a Large Language model as the translation model and employs a traditional SiMT model for policy-decision to achieve SiMT through collaboration.
X-PLUG/mPLUG-DocOwl
mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding
SALT-NLP/LLaVAR
Code/Data for the paper: "LLaVAR: Enhanced Visual Instruction Tuning for Text-Rich Image Understanding"
Yuliang-Liu/Monkey
【CVPR 2024 Highlight】Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models
Ucas-HaoranWei/Vary-toy
Official code implementation of Vary-toy (Small Language Model Meets with Reinforced Vision Vocabulary)
nttmdlab-nlp/InstructDoc
InstructDoc: A Dataset for Zero-Shot Generalization of Visual Document Understanding with Instructions (AAAI2024)
Ucas-HaoranWei/Vary
[ECCV 2024] Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.
Ultramarine-spec/huggingface_downloader
EriCongMa/E2E_TIT_With_MT
We will prepare our codes and datasets in this repository. Codes and Datasets are utilized to train end-to-end text image translation model with auxiliary text translation task.
AlibabaResearch/AdvancedLiterateMachinery
A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.
paperswithcode/ai-deadlines
:alarm_clock: AI conference deadline countdowns
yuchenlin/rebiber
A simple tool to update bib entries with their official information (e.g., DBLP or the ACL anthology).
timtadh/zhang-shasha
Tree edit distance using the Zhang Shasha algorithm
arxiv-vanity/engrafo
Convert LaTeX documents into beautiful responsive web pages using LaTeXML.
nidhaloff/deep-translator
A flexible free and unlimited python tool to translate between different languages in a simple way using multiple translators.
ssut/py-googletrans
(unofficial) Googletrans: Free and Unlimited Google translate API for Python. Translates totally free of charge.
lukasschwab/arxiv.py
Python wrapper for the arXiv API
braun-steven/arxiv-downloader
A command line interface to download PDF files from https://arxiv.org.
krahets/hello-algo
《Hello 算法》:动画图解、一键运行的数据结构与算法教程。支持 Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, Dart 代码。简体版和繁体版同步更新,English version ongoing
Unbabel/COMET
A Neural Framework for MT Evaluation
cwang621/blsp
BLSP: Bootstrapping Langauge-Speech Pre-training via Behavior Alignment of Continuation Writing
clovaai/donut
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
facebookresearch/nougat
Implementation of Nougat Neural Optical Understanding for Academic Documents
mymusise/ChatGLM-Tuning
基于ChatGLM-6B + LoRA的Fintune方案
nttmdlab-nlp/VisualMRC
VisualMRC: Machine Reading Comprehension on Document Images (AAAI2021)