Pinned Repositories
audalign
Package for aligning audio files through audio fingerprinting
DoctorGPT
DoctorGPT is an LLM that can pass the US Medical Licensing Exam. It works offline, it's cross-platform, & your health data stays private.
FreeVC
FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion
gpt-neox-bak
An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
k8s-device-plugin
NVIDIA device plugin for Kubernetes
kaldi-long-audio-alignment
Long audio alignment using Kaldi
lhotse
Tools for handling speech data in machine learning projects.
llama
Inference code for LLaMA models
llm
Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
SpeechOceanTech's Repositories
SpeechOceanTech/k8s-device-plugin
NVIDIA device plugin for Kubernetes
SpeechOceanTech/audalign
Package for aligning audio files through audio fingerprinting
SpeechOceanTech/DoctorGPT
DoctorGPT is an LLM that can pass the US Medical Licensing Exam. It works offline, it's cross-platform, & your health data stays private.
SpeechOceanTech/FreeVC
FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion
SpeechOceanTech/gpt-neox-bak
An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
SpeechOceanTech/kaldi-long-audio-alignment
Long audio alignment using Kaldi
SpeechOceanTech/lhotse
Tools for handling speech data in machine learning projects.
SpeechOceanTech/llama
Inference code for LLaMA models
SpeechOceanTech/llm
SpeechOceanTech/Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
SpeechOceanTech/wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
SpeechOceanTech/whisper.cpp
Port of OpenAI's Whisper model in C/C++
SpeechOceanTech/yolov7
Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors