jwijffels

www.bnosac.be

www.bnosac.beBrussels, Belgium

Pinned Repositories

audio.whisper
Transcribe audio files using the "Whisper" Automatic Speech Recognition model from R
Language:C119 4 4913
BTM
Biterm Topic Modelling for Short Text with R
Language:C++95 8 1615
image
Computer Vision and Image Recognition algorithms for R users
Language:C++278 23 2565
taskscheduleR
Schedule R scripts/processes with the Windows task scheduler.
Language:R335 27 9872
udpipe
R package for Tokenization, Parts of Speech Tagging, Lemmatization and Dependency Parsing Based on the UDPipe Natural Language Processing Toolkit
Language:C++215 16 11033
word2vec
Distributed Representations of Words using word2vec
Language:C++70 10 175
ETLUtils
Utilities for easily loading big data from relational databases directly into ffdf objects in R.
Language:R20 5 57
Myrrix-R-interface
Let R talk to Myrrix. Myrrix is a complete, real-time, scalable clustering and recommender system, evolved from Apache Mahout.
Language:HTML11 4 16
RMOA
Connect R to MOA for massive online data stream mining
Language:R37 18 1519
udpipe-spacy-comparison
Compare accuracies of udpipe models and spacy models which can be used for NLP annotation
Language:Python14 3 21

jwijffels's Repositories

jwijffels/ETLUtils
Utilities for easily loading big data from relational databases directly into ffdf objects in R.
Language:R20 5 57
jwijffels/page_dewarp
Text page dewarping using a "cubic sheet" model
Language:Python1 1 01
jwijffels/activelearning.nlp
ActiveLearning for training NLP models in R
Language:R2 01
jwijffels/ANMS-Codes
Efficient adaptive non-maximal suppression algorithms for homogeneous spatial keypoint distribution
Language:C++1 0
jwijffels/av
Working with Video in R
Language:C1 0
jwijffels/berkeley-stat-157
Homepage for STAT 157 at UC Berkeley
Language:Jupyter Notebook1 0
jwijffels/Bi-Sent2Vec
Robust Cross-lingual Embeddings from Parallel Sentences
Language:C++1 0
jwijffels/cloudfront-authorization-at-edge-keycloak
Language:JavaScript1 0
jwijffels/cpp-fstlib
A single file C++17 header-only Minimal Acyclic Subsequential Transducers, or Finite State Transducers
Language:C++1 0
jwijffels/DETM
Language:Python1 0
jwijffels/dhSegment
Generic framework for historical document processing
Language:Python1 0
jwijffels/EasyOCR
Ready-to-use OCR with 40+ languages supported including Chinese, Japanese, Korean and Thai
Language:Python1 0
jwijffels/fuzzy-search
Fuzzy search modules for searching lists of words in low quality OCR and HTR text.
Language:Python1 0
jwijffels/koan
A word2vec negative sampling implementation with correct CBOW update.
Language:C++1 0
jwijffels/librnnvad
Voice activity detection (VAD) library, based on WebRTC's VAD engine
jwijffels/LSTM-CRF-pytorch-faster
A more than 1000X faster paralleled LSTM-CRF implementation modified from the slower version in the Pytorch official tutorial (URL:https://pytorch.org/tutorials/beginner/nlp/advanced_tutorial.html).
Language:Python1 0
jwijffels/neural-acoustic-distance
Code associated with the paper: Neural Representations for Modeling Variation in English Speech.
Language:Python1 0
jwijffels/nnutils
CPU & CUDA implementation of several neural network utils
Language:C++1 0
jwijffels/ocr-fileformat
Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)
Language:JavaScript1 0
jwijffels/open-speech-corpora
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
1 0
jwijffels/pageDistanceBasedContourGenerator
Program that calculates the extraction polygon of present text lines given an existing baseline in the page file
Language:C++1 0
jwijffels/phonfieldwork
R package for phonetic research and experimenting
Language:HTML1 0
jwijffels/rticles
LaTeX Journal Article Templates for R Markdown
Language:TeX1 0
jwijffels/sent2vec
General purpose unsupervised sentence representations
Language:C++1 0
jwijffels/speech-representations
Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020)
Language:Python1 0
jwijffels/test_doc2vec
Compare doc2vec R implementation (PVDM, PVDBOW) with mean of word embedding in a classification task.
Language:R1 0
jwijffels/text_analysis_for_social_science
Code for the book on python for social scientists
Language:Jupyter Notebook1 0
jwijffels/v4py.github.io
E-learning materials for the V4Py summer school (Python for linguists).
Language:Shell1 0
jwijffels/wav2letter
Facebook AI Research's Automatic Speech Recognition Toolkit
Language:C++1 0
jwijffels/weirdai
Weird A.I. Yankovic neural-net based lyrics parody generator
Language:Jupyter Notebook1 0