jqueguiner's Stars
apache/superset
Apache Superset is a Data Visualization and Data Exploration Platform
rust-lang/rustlings
:crab: Small exercises to get you used to reading and writing Rust code!
jesseduffield/lazydocker
The lazier way to manage everything docker
astral-sh/uv
An extremely fast Python package and project manager, written in Rust.
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
guidance-ai/guidance
A guidance language for controlling large language models.
deepset-ai/haystack
AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
Rudrabha/Wav2Lip
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs
triton-inference-server/server
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
ubicloud/ubicloud
Open source alternative to AWS. Elastic compute, block storage (non replicated), firewall and load balancer, managed Postgres, and IAM services in public beta.
urchade/GLiNER
Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024
hplt-project/sacremoses
Python port of Moses tokenizer, truecaser and normalizer
nuaazs/VAF_2
Aims to create a comprehensive voice toolkit for training, testing, and deploying speaker verification systems.
vasistalodagala/whisper-finetune
Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.
p0p4k/pflowtts_pytorch
Unofficial implementation of NVIDIA P-Flow TTS paper
NavodPeiris/speechlib
speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with actual speaker names
X-E-Speech/X-E-Speech-code
X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion
kurianbenoy/whisper_normalizer
A python package for whisper normalizer
SpeechResearch/speechresearch.github.io
ECNU-Cross-Innovation-Lab/ShiftSER
[ICASSP 2023] Mingling or Misalignment? Temporal Shift for Speech Emotion Recognition with Pre-trained Representations
BennyKok/leaked-zoom
zhihanyang2022/gender-audio-classification
A speaker gender classifier. MFC feature engineering and a pre-trained ResNet-50. GradCAM interpretation.
amritkromana/disfluency_detection_from_audio
nuaazs/AskLLM
AI-driven, adaptive customer service agent.
iajaykarthick/NER-medical-text
This project is to develop a named entity recognition (NER) model to identity medical entities such as diseases, symptoms, treatments in the unstructured medical text written in natural language.
JinchaoLove/AffectiveVocalBurstRecognition
A Hierarchical Regression Chain Framework for Affective Vocal Burst Recognition
SCNU-RISLAB/CNN-Transforemr-and-Multidimensional-Attention-Mechanism
julien-c/hyllama
llama.cpp gguf file parser for javascript
MaxDkn/ShooterAI-test
Teaching genetic algorithms to play 2D top-down team shooting game. Made with Pygame (Python)
nuaazs/StyleTTS2
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models