StupidDebugger

StupidDebugger's Stars

Vision-CAIR/MiniGPT-4
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
Language:Python25.5k 218 4692.9k
UKPLab/sentence-transformers
State-of-the-Art Text Embeddings
Language:Python15.7k 143 2.2k2.5k
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
13.3k 258 128840
PKU-YuanGroup/Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
Language:Python11.8k 154 3661k
facebookresearch/seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
Language:Jupyter Notebook11k 144 3701.1k
OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
Language:Python6.6k 56 690514
pliang279/awesome-multimodal-ml
Reading list for research topics in multimodal machine learning
6.2k 179 16858
tickstep/aliyunpan
阿里云盘命令行客户端，支持JavaScript插件，支持同步备份功能。
Language:Go4.3k 35 459356
GangZhuo/BaiduPCS
百度网盘命令行工具。The terminal utility for Baidu Network Disk.
Language:Roff3.5k 211 278720
DAMO-NLP-SG/Video-LLaMA
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
Language:Python2.9k 33 158264
lucidrains/audiolm-pytorch
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
Language:Python2.5k 62 175266
zhanyong-wan/dongbei
东北方言编程语言
Language:Python2.3k 19 93133
ly0/baidupcsapi
百度网盘api
Language:Python1.2k 70 90237
sooftware/conformer
[Unofficial] PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)
Language:Python977 9 37179
apachecn/rate-my-supervisor
Language:JavaScript965 9 0151
felixonmars/BaiduPCS-Go
Re-upload of iikira/BaiduPCS-Go
Language:Go932 18 47307
acl-org/acl-style-files
Official style files for papers submitted to venues of the Association for Computational Linguistics
Language:TeX806 8 27194
ga642381/speech-trident
Awesome speech/audio LLMs, representation learning, and codec models
796 44 348
PeterDing/BaiduPCS-Py
BaiduPCS API & App 百度网盘客户端和 API
Language:Python719 12 112116
Vchitect/Vlogger
[CVPR2024] Make Your Dream A Vlog
Language:Python418 10 1543
RunpeiDong/DreamLLM
[ICLR 2024 Spotlight] DreamLLM: Synergistic Multimodal Comprehension and Creation
Language:Python406 16 267
kahne/SpeechTransProgress
Tracking the progress in end-to-end speech translation
255 26 225
dyhBUPT/iKUN
[CVPR 2024] iKUN: Speak to Trackers without Retraining
Language:Python112 1 352
acl-org/responsibleNLPresearch
templates and other documents regarding responsible NLP research
Language:TeX64 4 029
ReneeYe/ConST
code for paper "Cross-modal Contrastive Learning for Speech Translation" (NAACL 2022)
Language:Python62 2 86
Rongjiehuang/awesome-speech-to-speech-translation
List of direct speech-to-speech translation papers.
36 5 02
mgaido91/FBK-fairseq-ST
A repository containing the code for speech translation papers.
Language:Python21 2 17
ictnlp/CRESS
Code for ACL 2023 main conference paper "Understanding and Bridging the Modality Gap for Speech Translation".
Language:Python17 3 72
ictnlp/Wait-info
Source code for our EMNLP 2022 paper "Wait-info Policy: Balancing Source and Target at Information Level for Simultaneous Machine Translation"
Language:Python7 2 1
zhangshaolei1998/MyArxiv
Language:CSS1 1 0