lars76

Pinned Repositories

bigvgan-mirror
A mirror of BigVGAN and HiFi-GAN for access via PyTorch Hub.
Language:Python5 1 01
chinese-subtitle-ocr
Optical character recognition for Chinese subtitles using SSD and CNN
Language:Python108 8 430
fastspeech2-clean
Clean and modernized implementation of FastSpeech2/LightSpeech using IPA
Language:Python16 2 47
forced-alignment-chinese
Mandarin Chinese audio datasets aligned with Montreal Forced Aligner
Language:Python12 1 03
kmeans-anchor-boxes
k-means clustering with the Intersection over Union (IoU) metric as described in the YOLO9000 paper
Language:Python533 10 21196
llm-cn-en-dict
Using LLMs to generate a synthetic Chinese-English dictionary
Language:Python00
object-localization
Object localization in images using simple CNNs and Keras
Language:Python137 5 2060
pitch-benchmark
Comprehensive benchmark suite comparing pitch detection algorithms across multiple datasets.
Language:Python38 1 09
story-evaluation-llm
LLM-generated story dataset with quality evaluations across 15 models for training and benchmarking creative writing capabilities.
Language:Python20
swift-f0
Fast and accurate fundamental frequency (F0) detector using convolutional neural networks
Language:Python91 5 212

lars76's Repositories

lars76/kmeans-anchor-boxes
k-means clustering with the Intersection over Union (IoU) metric as described in the YOLO9000 paper
Language:Python533 10 21196
lars76/object-localization
Object localization in images using simple CNNs and Keras
Language:Python137 5 2060
lars76/chinese-subtitle-ocr
Optical character recognition for Chinese subtitles using SSD and CNN
Language:Python108 8 430
lars76/swift-f0
Fast and accurate fundamental frequency (F0) detector using convolutional neural networks
Language:Python91 5 212
lars76/pitch-benchmark
Comprehensive benchmark suite comparing pitch detection algorithms across multiple datasets.
Language:Python38 1 09
lars76/fastspeech2-clean
Clean and modernized implementation of FastSpeech2/LightSpeech using IPA
Language:Python16 2 47
lars76/forced-alignment-chinese
Mandarin Chinese audio datasets aligned with Montreal Forced Aligner
Language:Python12 1 03
lars76/bigvgan-mirror
A mirror of BigVGAN and HiFi-GAN for access via PyTorch Hub.
Language:Python5 1 01
lars76/segmentation_activations
Code for the paper "Effect of the output activation function on the probabilities and errors in medical image segmentation"
Language:Python2 2 00
lars76/story-evaluation-llm
LLM-generated story dataset with quality evaluations across 15 models for training and benchmarking creative writing capabilities.
Language:Python20
lars76/pysais-utf8
Python C module for creating suffix, LCP and BWT arrays with UTF-8 text.
Language:C1 0 00
lars76/BigVGAN
Official PyTorch implementation of BigVGAN (ICLR 2023)
Language:Python0 0 00
lars76/cameraview
Language:Java0 1 00
lars76/config
Simple config library in C
Language:C0 1 00
lars76/helloworld
helloworld program using JSF, Maven, Glassfish, Java EE.
Language:HTML0 1 00
lars76/llm-cn-en-dict
Using LLMs to generate a synthetic Chinese-English dictionary
Language:Python00
lars76/spaced_repetition_benchmark_test
Language:Python0 1 00
lars76/archlinux_configs
Language:Shell
lars76/g2pW
Chinese Mandarin Grapheme-to-Phoneme Converter. 中文轉注音或拼音 (INTERSPEECH 2022)
Language:Python0 0
lars76/hifi-gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Language:Python0 0
lars76/woodyolo
A specialized object detection model originally designed for microscopic wood vessel identification but applicable to any high-recall detection task.
Language:Jupyter Notebook