Pinned Repositories
audio.whisper
Transcribe audio files using the "Whisper" Automatic Speech Recognition model from R
BTM
Biterm Topic Modelling for Short Text with R
image
Computer Vision and Image Recognition algorithms for R users
taskscheduleR
Schedule R scripts/processes with the Windows task scheduler.
udpipe
R package for Tokenization, Parts of Speech Tagging, Lemmatization and Dependency Parsing Based on the UDPipe Natural Language Processing Toolkit
word2vec
Distributed Representations of Words using word2vec
ETLUtils
Utilities for easily loading big data from relational databases directly into ffdf objects in R.
Myrrix-R-interface
Let R talk to Myrrix. Myrrix is a complete, real-time, scalable clustering and recommender system, evolved from Apache Mahout.
RMOA
Connect R to MOA for massive online data stream mining
udpipe-spacy-comparison
Compare accuracies of udpipe models and spacy models which can be used for NLP annotation
jwijffels's Repositories
jwijffels/android-vad
Android Voice Activity Detection (VAD) library. Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.
jwijffels/audio.whisper
Transcribe audio files using the "Whisper" Automatic Speech Recognition model from R
jwijffels/audioplayRmd
Provides one function: include_audio()
jwijffels/awesome-compose
Awesome Docker Compose samples
jwijffels/contributions
A community hub to contribute to the R-releases R universe
jwijffels/example
jwijffels/factgenie
Lightweight self-hosted span annotation tool
jwijffels/hfhub
Download and cache HuggingFace Hub files
jwijffels/htmlwidgetsgallery
jwijffels/inception-external-recommender
Get annotation suggestions for the INCEpTION text annotation platform from spaCy, Sentence BERT, scikit-learn and more. Runs as a web-service compatible with the external recommender API of INCEpTION.
jwijffels/inception-reporting-dashboard
jwijffels/instructor
structured outputs for llms
jwijffels/itables
Pandas DataFrames as Interactive DataTables
jwijffels/libfvad
Voice activity detection (VAD) library, based on WebRTC's VAD engine
jwijffels/LibtorchSegmentation
A c++ trainable semantic segmentation library based on libtorch (pytorch c++). Backbone: VGG, ResNet, ResNext. Architecture: FPN, U-Net, PAN, LinkNet, PSPNet, DeepLab-V3, DeepLab-V3+ by now.
jwijffels/llm.c
LLM training in simple, raw C/CUDA
jwijffels/marvin
✨ Build AI interfaces that spark joy
jwijffels/modded-nanogpt
NanoGPT (124M) in 5 minutes
jwijffels/ModernBERT
Bringing BERT into modernity via both architecture changes and scaling
jwijffels/officer
:cop: officer: office documents from R
jwijffels/prefect
Prefect is a workflow orchestration tool empowering developers to build, observe, and react to data pipelines
jwijffels/py-webrtcrnnvad
Python interface to the RNNoise VAD(Voice Activity Detection) component inside webrtc
jwijffels/pycaprio
Python client to the INCEpTION annotation tool
jwijffels/r_dev_projects
Boilerplate and configs for my R development projects
jwijffels/reactable
Interactive data tables for R
jwijffels/rsyntax_dutch_quotes
Rules for quote extraction in Dutch using rsyntax + udpipe
jwijffels/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
jwijffels/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
jwijffels/whisper.cpp
Port of OpenAI's Whisper model in C/C++
jwijffels/word2vec
Distributed Representations of Words using word2vec