Pinned Repositories
2020NTUCSIE-CNHW3
2024-Spring-HW0
audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
Caption-Anything
Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences.
CLAP
Contrastive Language-Audio Pretraining
guitar_effect_removal
Demo page and evaluation code for "Distortion Recovery: A Two-Stage Method for Guitar Effect Removal"
NTUCSIE-CNHW2
NTUCSIE_OS_2020spring
For the 2020 Operating Systems project. Project 1 instructions: https://hackmd.io/@Ue96nvjESj2XsDXw532-qA/ryYqceUrU
OS_Project2
NTU CSIE 108-2 OS Project #2
Scientific-Computing-AICUP
Singing voice transcription competition (rule-based)
y10ab1's Repositories
y10ab1/guitar_effect_removal
Demo page and evaluation code for "Distortion Recovery: A Two-Stage Method for Guitar Effect Removal"
y10ab1/2024-Spring-HW0
y10ab1/audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
y10ab1/Caption-Anything
Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences.
y10ab1/CogVLM
A state-of-the-art-level open visual language model | Multimodal pretrained model
y10ab1/compound-word-transformer
Official implementation of compound word transformer (AAAI'21)
y10ab1/DATA5009_2023fall
Computational Methods for Data Science
y10ab1/DeepMIR_2023fall
y10ab1/demucs
Code for the paper Hybrid Spectrogram and Waveform Source Separation
y10ab1/ggml
Tensor library for machine learning
y10ab1/hifi-gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
y10ab1/Homework1
y10ab1/Homework2
FinTech Homework 2
y10ab1/Homework3
Fintech HW3
y10ab1/HPBDAIS_final_project
y10ab1/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
y10ab1/llama-cpp-python
Python bindings for llama.cpp
y10ab1/llama.cpp
Port of Facebook's LLaMA model in C/C++
y10ab1/midi-model
Midi event transformer for music generation
y10ab1/MU-LLaMA
MU-LLaMA: Music Understanding Large Language Model
y10ab1/NTU_ML_2023spring
y10ab1/pytorch-lightning-template
An easy-to-adapt PyTorch Lightning template. It is a simple, easy-to-use wrapper template: with small changes to your existing PyTorch code, it adapts to Lightning. You can port your previous PyTorch code much more easily using this template while keeping the freedom to edit all the functions. It is also friendly to big projects. No need to rewrite your config in Hydra.
y10ab1/pytorch-metric-learning
The easiest way to use deep metric learning in your application. Modular, flexible, and extensible. Written in PyTorch.
y10ab1/Retina_project
y10ab1/RoomPlan-2D
y10ab1/Sklearn-genetic-opt
ML hyperparameters tuning and features selection, using evolutionary algorithms.
y10ab1/StableDiffusionReconstruction
Takagi and Nishimoto, CVPR 2023
y10ab1/SwiFT
y10ab1/whisper.cpp
Port of OpenAI's Whisper model in C/C++
y10ab1/y10ab1