Pinned Repositories
ai-deadlines
:alarm_clock: AI conference deadline countdowns
ASL-Alphabet-Translation
Using CNN to translate the ASL Alphabet
CodeMirror
In-browser code editor
CPC_audio
An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion.
discrete-repr
Accompanying code for the paper "Discrete representations in neural models of spoken language" (https://aclanthology.org/2021.blackboxnlp-1.11)
textual-supervision
Code for the paper "Textual supervision for visually grounded spoken language understanding".
zr-2021vg_baseline
Baselines for the Zero-Resources Speech Challenge using VisuallyGrounded Models of Spoken Language, 2021 edition
analyzing-analytical-methods
Code for the paper "Analyzing analytical methods" http://dx.doi.org/10.18653/v1/2020.acl-main.381
natural-speech
This repository contains a codebase to build automatic speech recognition (ASR) systems for iCub and run them within YARP. It also proposes new articulatory-based and unsupervised models for ASR.
platalea
Library for training visually-grounded models of spoken language understanding.
bhigy's Repositories
bhigy/zr-2021vg_baseline
Baselines for the Zero-Resources Speech Challenge using VisuallyGrounded Models of Spoken Language, 2021 edition
bhigy/textual-supervision
Code for the paper "Textual supervision for visually grounded spoken language understanding".
bhigy/discrete-repr
Accompanying code for the paper "Discrete representations in neural models of spoken language" (https://aclanthology.org/2021.blackboxnlp-1.11)
bhigy/ai-deadlines
:alarm_clock: AI conference deadline countdowns
bhigy/ASL-Alphabet-Translation
Using CNN to translate the ASL Alphabet
bhigy/CodeMirror
In-browser code editor
bhigy/CPC_audio
An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion.
bhigy/datasets
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
bhigy/deepspeech.pytorch
Speech Recognition using DeepSpeech2.
bhigy/dropbox-restore
Restore any dropbox folder to a previous state
bhigy/espnet
End-to-End Speech Processing Toolkit
bhigy/kaldi
This is now the official location of the Kaldi project.
bhigy/liveoverflow
Following liveoverflow binary exploitation playlist
bhigy/matutils
Utilities for matlab
bhigy/minimal-mistakes
:triangular_ruler: Jekyll theme for building a personal site, blog, project documentation, or portfolio.
bhigy/natural-speech
This repository contains a codebase to build automatic speech recognition (ASR) systems for iCub and run them within YARP. It also proposes new articulatory-based and unsupervised models for ASR.
bhigy/nbdev
Create delightful software with Jupyter Notebooks
bhigy/OTFR
On the fly tactile and visual recognition for iCub
bhigy/pafy
Python library to download YouTube content and retrieve metadata
bhigy/polyglot
Color, ASCII-only Git prompt for zsh, bash, ksh93, mksh, pdksh, dash, and busybox ash
bhigy/pydub
Manipulate audio with a simple and easy high level interface
bhigy/revai-python-sdk
Rev AI Python SDK
bhigy/silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
bhigy/StreamingSpeakerDiarization
Lightweight python library for speaker diarization in real time implemented in pytorch
bhigy/symbolic-bias
Code for the paper: Symbolic inductive bias for visually grounded learning of spoken language
bhigy/tactile_objrec
Modules for tactile object recognition
bhigy/tactile_objrec_mat
bhigy/tiddlywiki_to_joplin
Script to convert a TiddlyWiki note export CSV file into a Joplin JEX file for import.
bhigy/yarp
YARP - Yet Another Robot Platform
bhigy/ZeroSpeech
VQ-VAE for Acoustic Unit Discovery and Voice Conversion