ivangtorre

AI Researcher. Working on Automatic Speech Recognition, NLP, DNNs, Linguistic Laws, Complex Systems and Nonlinear Dynamics

Language and Speech Laboratory-EHU, UPMSpain

ivangtorre's Stars

openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
Language:Python67.9k 571 08k
NVIDIA/DeepLearningExamples
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
Language:Jupyter Notebook13.3k 297 8363.2k
zalandoresearch/fashion-mnist
A MNIST-like fashion product database. Benchmark :point_down:
Language:Python11.8k 331 1023k
kimiyoung/transformer-xl
Language:Python3.6k 83 133762
fastai/course-nlp
A Code-First Introduction to NLP course
Language:Jupyter Notebook3.4k 133 321.5k
BenjiKCF/Neural-Net-with-Financial-Time-Series-Data
This solution presents an accessible, non-trivial example of machine learning (Deep learning) with financial time series using TensorFlow
Language:Jupyter Notebook753 91 16312
archinetai/surgeon-pytorch
A library to inspect and extract intermediate layers of PyTorch models.
Language:Python469 3 316
kensho-technologies/pyctcdecode
A fast and lightweight python-based CTC beam search decoder for speech recognition.
Language:Python421 22 6489
mightydeveloper/Deep-Compression-PyTorch
PyTorch implementation of 'Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding' by Song Han, Huizi Mao, William J. Dally
Language:Python409 11 13111
mphilli/English-to-IPA
Converts English text to IPA notation
Language:Python362 13 3076
Kozea/Pyphen
Hy-phen-ation made easy
Language:Python198 32 4324
Vicomtech/hate-speech-dataset
Hate speech dataset from Stormfront forum manually labelled at sentence level.
161 10 457
qute012/Wav2Keyword
Wav2Keyword is keyword spotting(KWS) based on Wav2Vec 2.0. This model shows state-of-the-art in Speech commands dataset V1 and V2.
Language:Python98 5 1128
nokpil/AgentNet
Pytorch implementation of AgentNet, which is designed for reveal hidden interactions and predict future dynamics of the unknown complex system.
Language:Python24 4 25
ivangtorre/multifrac
This is a plugin for ImageJ2 for multifractal analysis of 2D and 3D images. Cite: MULTIFRAC: An ImageJ plugin for multiscale characterization of 2D and 3D stack images . IG Torre, R. J. Heck and AM Tarquis. SoftwareX, 12, 100574.
Language:Java12 2 00
maxidl/wav2vec2
Language:Python10 1 24
Vicomtech/itzuli-api-lib
Itzuli® Machine Translation Engine API libraries
Language:Go101
ivangtorre/compression-principle-and-Zipf-s-law-of-brevity-in-infochemical-communication
This repository implements all the scripts used for processing data, computing and figure generation for the scientific paper: https://royalsocietypublishing.org/doi/10.1098/rsbl.2022.0162 "Compression principle and Zipf’s Law of brevity in infochemical communication". Please cite: Antoni Hernández-Fernández and Ivan G. Torre. Compression principle and Zipf’s Law of brevity in infochemical communication.
Language:Jupyter Notebook2 2 00
ivangtorre/physical-origin-of-lw
This package implements all the scripts used for computing and representing the resuls of the scientific paper: "On the physical origin of linguistic laws and lognormality in speech". If use, cite: Torre, I. G., Luque, B., Lacasa, L., Kello, C. T., & Hernández-Fernández, A. (2019). On the physical origin of linguistic laws and lognormality in speech. Royal Society Open Science, 6(8), 191023.
Language:Jupyter Notebook2 1 00
ivangtorre/pythreshold
This package implements the threshold algorithm for decimation and collapsing of time series. If use, please cite: "Torre, I. G., Luque, B., Lacasa, L., Luque, J., & Hernández-Fernández, A. (2017). Emergence of linguistic laws in human voice. Scientific reports, 7, 43862."
Language:Python2 2 00
ivangtorre/Speech-pause-distribution-as-an-early-marker-for-Alzheimers-disease
The speech pauses duration corpus and scripts that ensure reproducibility of all results presented in the research paper. P. Pastoriza, I.G. Torre, F. Dieguez, I. Gomez, S. Gelado, J. Bello, A. Avila, J. Matias, V. Pytell, A. Hernandez-Fernandez (2022). Speech pause distribution as an early marker for Alzheimer’s disease. Speech Communication. 136, 107-117
Language:Jupyter Notebook2 2 00
Vicomtech/ASVspoophone
The ASVspoophone corpus is the telephonic version of the ASV Spoof 2019 corpus found at https://www.asvspoof.org It contains the telephonic versions of the audios used for the countermeasure (CM) ASV Spoof 2019 challenge, which have been created by transferring each of them through real land-land, mobile-land and land-mobile telephonic channels. The results are the corresponding 8 kHz 8 bit A-Law versions of the originial audios, which can be used to train anti-spoofing systems that will be used on real telephonic scenarios such as call and contact centres.
10

ivangtorre

ivangtorre's Stars

openai/whisper

NVIDIA/DeepLearningExamples

zalandoresearch/fashion-mnist

kimiyoung/transformer-xl

fastai/course-nlp

BenjiKCF/Neural-Net-with-Financial-Time-Series-Data

archinetai/surgeon-pytorch

kensho-technologies/pyctcdecode

mightydeveloper/Deep-Compression-PyTorch

mphilli/English-to-IPA

Kozea/Pyphen

Vicomtech/hate-speech-dataset

qute012/Wav2Keyword

nokpil/AgentNet

ivangtorre/multifrac

maxidl/wav2vec2

Vicomtech/itzuli-api-lib

ivangtorre/compression-principle-and-Zipf-s-law-of-brevity-in-infochemical-communication

ivangtorre/physical-origin-of-lw

ivangtorre/pythreshold

ivangtorre/Speech-pause-distribution-as-an-early-marker-for-Alzheimers-disease

Vicomtech/ASVspoophone