neuronalbit's Stars
gildas-lormeau/SingleFile
Web Extension for saving a faithful copy of a complete web page in a single HTML file
darkreader/darkreader
Dark Reader Chrome and Firefox extension
josh-berry/tab-stash
Firefox extension to save and restore tabs as bookmarks. Clear your tabs, clear your mind.
miguelvalente/whisperer
Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.
TEAMuP-dev/audacitorch
PyTorch wrappers for using your model in audacity!
mozilla/DeepSpeech
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
jaywalnut310/vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
flutydeer/audio-slicer
A simple GUI application that slices audio with silence detection
voicepaw/so-vits-svc-fork
so-vits-svc fork with realtime support, improved interface and more features.
svc-develop-team/so-vits-svc
SoftVC VITS Singing Voice Conversion
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
CorentinJ/Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
spipm/Depix
Recovers passwords from pixelized screenshots
Anjok07/ultimatevocalremovergui
GUI for a Vocal Remover that uses Deep Neural Networks.
iperov/DeepFaceLive
Real-time face swap for PC streaming or video calls
massgravel/Microsoft-Activation-Scripts
A Windows and Office activator using HWID / Ohook / KMS38 / Online KMS activation methods, with a focus on open-source code and fewer antivirus detections.
0x90d/videoduplicatefinder
Video Duplicate Finder - Crossplatform
LibrePhotos/librephotos
A self-hosted open source photo management service. This is the repository of the backend.
HumanSignal/label-studio
Label Studio is a multi-type data labeling and annotation tool with standardized output format
cvat-ai/cvat
Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.
NVIDIA/pix2pixHD
Synthesizing and manipulating 2048x1024 images with conditional GANs
sundowndev/phoneinfoga
Information gathering framework for phone numbers
Stability-AI/stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
facebookresearch/svoice
We provide a PyTorch implementation of the paper Voice Separation with an Unknown Number of Multiple Speakers In which, we present a new method for separating a mixed audio sequence, in which multiple voices speak simultaneously. The new method employs gated neural networks that are trained to separate the voices at multiple processing steps, while maintaining the speaker in each output channel fixed. A different model is trained for every number of possible speakers, and the model with the largest number of speakers is employed to select the actual number of speakers in a given sample. Our method greatly outperforms the current state of the art, which, as we show, is not competitive for more than two speakers.
shanteacontrols/OpenDeck
Software and hardware platform for simpler building of MIDI controllers.
Ademking/BetterViewer
a replacement for the image viewing mode built into Firefox and Chrome-based web browsers.
marzme/PowerShell_ISE_Themes
A collection of themes for the Windows PowerShell ISE
agmmnn/awesome-blender
🪐 A curated list of awesome Blender addons, tools, tutorials; and 3D resources for everyone.
VictorRobellini/pfSense-Dashboard
A functional and useful dashboard for pfSense that utilizes influxdb, grafana and telegraf
unmade/audiomatch
Find similar audio files easily