karthik19967829
ML Research @ Nexusflow.ai. Previously : Amazon Alexa, CMU Speech WavLab CMU SCS Alum
ML Researcher @ nexusflow.ai Sanfrancisco, USA
Pinned Repositories
16-884.github.io
apiwebchat
BOT Framework style custom web chat for api.ai agent.
AudioDec
An Open-source Streaming High-fidelity Neural Audio Codec
Ballerina
This repo has code base thats a fusion of BOLAA and Webarena to be build hyper-personalized agents that are aligned to your life-goals
BOLAA
chatbot-facebook-nodejs
codingInterview
coding interview brushup
DSTC11-Benchmark
InferDoc
Generate SQUAD style dataset from raw text file and train a transformer based question answering model .This repo has code from https://github.com/facebookresearch/UnsupervisedQA and https://github.com/deepset-ai/haystack
Unsupervised-Learning
This repository consists of the unsupervised clustering techniques to used to develop a social networking platform in an educational institute.
karthik19967829's Repositories
karthik19967829/InferDoc
Generate SQUAD style dataset from raw text file and train a transformer based question answering model .This repo has code from https://github.com/facebookresearch/UnsupervisedQA and https://github.com/deepset-ai/haystack
karthik19967829/DSTC11-Benchmark
karthik19967829/16-884.github.io
karthik19967829/AudioDec
An Open-source Streaming High-fidelity Neural Audio Codec
karthik19967829/Ballerina
This repo has code base thats a fusion of BOLAA and Webarena to be build hyper-personalized agents that are aligned to your life-goals
karthik19967829/BOLAA
karthik19967829/codingInterview
coding interview brushup
karthik19967829/denoiser
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a causal speech enhancement model working on the raw waveform that runs in real-time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip-connections. It is optimized on both time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise including stationary and non-stationary noises, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly on the raw waveform which further improve model performance and its generalization abilities.
karthik19967829/espnet
End-to-End Speech Processing Toolkit
karthik19967829/espnet_onnx
Onnx wrapper for espnet infrernce model
karthik19967829/externalcolabcode
karthik19967829/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
karthik19967829/hexa
Discovering and Achieving Goals via World Models, NeurIPS 2021
karthik19967829/hexa-benchmark
karthik19967829/karthik19967829.github.io
karthik19967829/LongLoRA
Code and documents of LongLoRA and LongAlpaca
karthik19967829/mmml-course
karthik19967829/MultiBench
[NeurIPS 2021] Multiscale Benchmarks for Multimodal Representation Learning
karthik19967829/NexusRaven
NexusRaven-13B, a new SOTA Open-Source LLM for function calling. This repo contains everything for reproducing our evaluation on NexusRaven-13B and baselines.
karthik19967829/NexusRaven-V2
karthik19967829/pydmps
karthik19967829/Retrieval-based-Voice-Conversion-WebUI
Voice data <= 10 mins can also be used to train a good VC model!
karthik19967829/s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit.
karthik19967829/sharedtask-dialdoc2021
doc2dial data includes a set of documents from multiple domains; and conversations between an assisting agent and an end user that are grounded in the associated documents.
karthik19967829/shinjiwlab.github.io
karthik19967829/soundstorm-speechtokenizer
Implementation of SoundStorm built upon SpeechTokenizer.
karthik19967829/SpeechTokenizer
This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples are presented on
karthik19967829/vocode-python
🤖 Build voice-based LLM agents. Modular + open source.
karthik19967829/WCN-BERT
Jointly encoding word confusion networks (WCNs) and dialogue context with BERT for spoken language understanding (SLU).
karthik19967829/zeno-build
Build, evaluate, analyze, and understand LLM-based apps