StupidDebugger's Stars
Vision-CAIR/MiniGPT-4
Open-source code for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
UKPLab/sentence-transformers
State-of-the-Art Text Embeddings
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
PKU-YuanGroup/Open-Sora-Plan
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.
facebookresearch/seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model with performance approaching GPT-4o.
pliang279/awesome-multimodal-ml
Reading list for research topics in multimodal machine learning
tickstep/aliyunpan
Command-line client for Aliyun Drive (阿里云盘), with support for JavaScript plugins and sync/backup.
GangZhuo/BaiduPCS
The terminal utility for Baidu Network Disk (百度网盘).
DAMO-NLP-SG/Video-LLaMA
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
lucidrains/audiolm-pytorch
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
zhanyong-wan/dongbei
A programming language based on the Northeastern Chinese (Dongbei) dialect.
ly0/baidupcsapi
Baidu Network Disk (百度网盘) API
sooftware/conformer
[Unofficial] PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)
apachecn/rate-my-supervisor
felixonmars/BaiduPCS-Go
Re-upload of iikira/BaiduPCS-Go
acl-org/acl-style-files
Official style files for papers submitted to venues of the Association for Computational Linguistics
ga642381/speech-trident
Awesome speech/audio LLMs, representation learning, and codec models
PeterDing/BaiduPCS-Py
BaiduPCS API & App: a Baidu Network Disk (百度网盘) client and API
Vchitect/Vlogger
[CVPR2024] Make Your Dream A Vlog
RunpeiDong/DreamLLM
[ICLR 2024 Spotlight] DreamLLM: Synergistic Multimodal Comprehension and Creation
kahne/SpeechTransProgress
Tracking the progress in end-to-end speech translation
dyhBUPT/iKUN
[CVPR 2024] iKUN: Speak to Trackers without Retraining
acl-org/responsibleNLPresearch
Templates and other documents regarding responsible NLP research
ReneeYe/ConST
Code for the paper "Cross-modal Contrastive Learning for Speech Translation" (NAACL 2022)
Rongjiehuang/awesome-speech-to-speech-translation
List of direct speech-to-speech translation papers.
mgaido91/FBK-fairseq-ST
A repository containing the code for speech translation papers.
ictnlp/CRESS
Code for ACL 2023 main conference paper "Understanding and Bridging the Modality Gap for Speech Translation".
ictnlp/Wait-info
Source code for our EMNLP 2022 paper "Wait-info Policy: Balancing Source and Target at Information Level for Simultaneous Machine Translation"
zhangshaolei1998/MyArxiv