Phuriches

Tokyo, Japan

Pinned Repositories

audiolm-pytorch
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
Language:Python0 0 00
dl-tutorial
a quick tutorial of deep learning
Language:Jupyter Notebook0 0 00
DUAL-textless-SQA
Textless (ASR-transcript free) Spoken Question Answering. The official release of NMSQA dataset and the implementation of "DUAL: Textless Spoken Question Answering with Speech Discrete Unit Adaptive Learning" paper.
Language:Python0 0 00
fromage
🧀 Code and models for the ICML 2023 paper "Grounding Language Models to Images for Multimodal Inputs and Outputs".
Language:Jupyter Notebook0 0 00
GenRepASD
Pytorch implementation of Deep Generic Representations for Domain-Generalized Anomalous Sound Detection: https://arxiv.org/abs/2409.05035
Language:Python5 1 10
OCR_RaspberryPi_edgetpu
Language:Python14 1 14
speech-tutorial
Language:Jupyter Notebook0 2 00
spolacq
Language:Jupyter Notebook0 0 00
TVLT
PyTorch code for “TVLT: Textless Vision-Language Transformer” (NeurIPS 2022 Oral)
Language:Jupyter Notebook0 0 00
word-discovery
Word Discovery in Visually Grounded, Self-Supervised Speech Models
Language:Jupyter Notebook0 0 00

Phuriches/OCR_RaspberryPi_edgetpu
Language:Python14 1 14
Phuriches/GenRepASD
Pytorch implementation of Deep Generic Representations for Domain-Generalized Anomalous Sound Detection: https://arxiv.org/abs/2409.05035
Language:Python5 1 10
Phuriches/audiolm-pytorch
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
Language:Python0 0 00
Phuriches/dl-tutorial
a quick tutorial of deep learning
Language:Jupyter Notebook0 0 00
Phuriches/DUAL-textless-SQA
Textless (ASR-transcript free) Spoken Question Answering. The official release of NMSQA dataset and the implementation of "DUAL: Textless Spoken Question Answering with Speech Discrete Unit Adaptive Learning" paper.
Language:Python0 0 00
Phuriches/fromage
🧀 Code and models for the ICML 2023 paper "Grounding Language Models to Images for Multimodal Inputs and Outputs".
Language:Jupyter Notebook0 0 00
Phuriches/speech-tutorial
Language:Jupyter Notebook0 2 00
Phuriches/spolacq
Language:Jupyter Notebook0 0 00
Phuriches/TVLT
PyTorch code for “TVLT: Textless Vision-Language Transformer” (NeurIPS 2022 Oral)
Language:Jupyter Notebook0 0 00
Phuriches/word-discovery
Word Discovery in Visually Grounded, Self-Supervised Speech Models
Language:Jupyter Notebook0 0 00