Pinned Repositories
asr-trainer
one script for xls-r/xlsr/whisper/hubert alpaca fine-tuning
awesome-chatgpt-dataset
Unlock the Power of LLM: Explore These Datasets to Train Your Own ChatGPT!
ChatTTS
ChatTTS is a generative speech model for daily dialogue.
Codec-SUPERB
Audio Codec Speech processing Universal PERformance Benchmark
COMBO-AVS
[CVPR 2024 Highlight] Official implementation of the paper: Cooperation Does Matter: Exploring Multi-Order Bilateral Relations for Audio-Visual Segmentation
dynamic-superb
The official repository of Dynamic-SUPERB.
ESLab_TermProject
Term project of Embedded System Lab
GenAI_hw6_dataset
groundingLMM
[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.
lectureCode-sp18
lecture code
Baiiiiiiiiii's Repositories
Baiiiiiiiiii/GenAI_hw6_dataset
Baiiiiiiiiii/asr-trainer
one script for xls-r/xlsr/whisper/hubert alpaca fine-tuning
Baiiiiiiiiii/awesome-chatgpt-dataset
Unlock the Power of LLM: Explore These Datasets to Train Your Own ChatGPT!
Baiiiiiiiiii/ChatTTS
ChatTTS is a generative speech model for daily dialogue.
Baiiiiiiiiii/Codec-SUPERB
Audio Codec Speech processing Universal PERformance Benchmark
Baiiiiiiiiii/COMBO-AVS
[CVPR 2024 Highlight] Official implementation of the paper: Cooperation Does Matter: Exploring Multi-Order Bilateral Relations for Audio-Visual Segmentation
Baiiiiiiiiii/dynamic-superb
The official repository of Dynamic-SUPERB.
Baiiiiiiiiii/ESLab_TermProject
Term project of Embedded System Lab
Baiiiiiiiiii/groundingLMM
[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.
Baiiiiiiiiii/lectureCode-sp18
lecture code
Baiiiiiiiiii/ML2022-spring
Baiiiiiiiiii/python-project
Baiiiiiiiiii/MMLM
Toward Multi Modality Language Model - implementation of GPT-4o/Project Astra
Baiiiiiiiiii/organicmaps
🍃 Organic Maps is a free Android & iOS offline maps app for travelers, tourists, hikers, and cyclists. It uses crowd-sourced OpenStreetMap data and is developed with love by MapsWithMe (MapsMe) founders and our community. No ads, no tracking, no data collection, no crapware. Please donate to support the development!
Baiiiiiiiiii/PandaLM
Baiiiiiiiiii/ray
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Baiiiiiiiiii/segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Baiiiiiiiiii/segment-anything-2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Baiiiiiiiiii/SeViLA-test
[NeurIPS 2023] Self-Chained Image-Language Model for Video Localization and Question Answering
Baiiiiiiiiii/speech-trident
Awesome speech/audio LLMs, representation learning, and codec models
Baiiiiiiiiii/SpeechT5
Unified-Modal Speech-Text Pre-Training for Spoken Language Processing
Baiiiiiiiiii/StyleTalk
Official release of StyleTalk dataset.
Baiiiiiiiiii/UASR