Pinned Repositories
bigvgan-mirror
A mirror of BigVGAN and HiFi-GAN for access via PyTorch Hub.
chinese-subtitle-ocr
Optical character recognition for Chinese subtitles using SSD and CNN
fastspeech2-clean
Clean and modernized implementation of FastSpeech2/LightSpeech using IPA
forced-alignment-chinese
Mandarin Chinese audio datasets aligned with Montreal Forced Aligner
kmeans-anchor-boxes
k-means clustering with the Intersection over Union (IoU) metric as described in the YOLO9000 paper
llm-cn-en-dict
Using LLMs to generate a synthetic Chinese-English dictionary
object-localization
Object localization in images using simple CNNs and Keras
pitch-benchmark
Comprehensive benchmark suite comparing pitch detection algorithms across multiple datasets.
story-evaluation-llm
LLM-generated story dataset with quality evaluations across 15 models for training and benchmarking creative writing capabilities.
swift-f0
Fast and accurate fundamental frequency (F0) detector using convolutional neural networks
lars76's Repositories
lars76/kmeans-anchor-boxes
k-means clustering with the Intersection over Union (IoU) metric as described in the YOLO9000 paper
lars76/object-localization
Object localization in images using simple CNNs and Keras
lars76/chinese-subtitle-ocr
Optical character recognition for Chinese subtitles using SSD and CNN
lars76/swift-f0
Fast and accurate fundamental frequency (F0) detector using convolutional neural networks
lars76/pitch-benchmark
Comprehensive benchmark suite comparing pitch detection algorithms across multiple datasets.
lars76/fastspeech2-clean
Clean and modernized implementation of FastSpeech2/LightSpeech using IPA
lars76/forced-alignment-chinese
Mandarin Chinese audio datasets aligned with Montreal Forced Aligner
lars76/bigvgan-mirror
A mirror of BigVGAN and HiFi-GAN for access via PyTorch Hub.
lars76/segmentation_activations
Code for the paper "Effect of the output activation function on the probabilities and errors in medical image segmentation"
lars76/story-evaluation-llm
LLM-generated story dataset with quality evaluations across 15 models for training and benchmarking creative writing capabilities.
lars76/pysais-utf8
Python C module for creating suffix, LCP and BWT arrays with UTF-8 text.
lars76/BigVGAN
Official PyTorch implementation of BigVGAN (ICLR 2023)
lars76/cameraview
lars76/config
Simple config library in C
lars76/helloworld
helloworld program using JSF, Maven, Glassfish, Java EE.
lars76/llm-cn-en-dict
Using LLMs to generate a synthetic Chinese-English dictionary
lars76/spaced_repetition_benchmark_test
lars76/archlinux_configs
lars76/g2pW
Chinese Mandarin Grapheme-to-Phoneme Converter. 中文轉注音或拼音 (INTERSPEECH 2022)
lars76/hifi-gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
lars76/woodyolo
A specialized object detection model originally designed for microscopic wood vessel identification but applicable to any high-recall detection task.