awentzonline
Formerly machine learning eng @VodyTV, data eng for Dollar Shave Club, full stack web for The Onion / ClickHole / AVClub.
Olympia, WA
awentzonline's Stars
facebookresearch/segment-anything
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
suno-ai/bark
🔊 Text-Prompted Generative Audio Model
s0md3v/roop
one-click face swap
facebookresearch/audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
huggingface/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
facebookresearch/AnimatedDrawings
Code to accompany "A Method for Animating Children's Drawings of the Human Figure"
ggerganov/ggml
Tensor library for machine learning
kkroening/ffmpeg-python
Python bindings for FFmpeg - with complex filtering support
openai/consistency_models
Official repo for consistency models.
microsoft/DeepSpeedExamples
Example models using DeepSpeed
princeton-vl/infinigen
Infinite Photorealistic Worlds using Procedural Generation
facebookincubator/AITemplate
AITemplate is a Python framework which renders neural networks into high-performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
facebookresearch/encodec
State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.
serp-ai/bark-with-voice-clone
🔊 Text-prompted Generative Audio Model - With the ability to clone voices
lucidrains/vector-quantize-pytorch
Vector (and Scalar) Quantization, in PyTorch
MilesCranmer/PySR
High-Performance Symbolic Regression in Python and Julia
davabase/whisper_real_time
Real-time transcription with OpenAI Whisper.
vchoutas/smplx
SMPL-X
danijar/dreamerv3
Mastering Diverse Domains through World Models
uiri/toml
Python lib for TOML
JonathanFly/bark
🚀 BARK INFINITY GUI CMD 🎶 Powered Up Bark Text-prompted Generative Audio Model
nghorbani/human_body_prior
VPoser: Variational Human Pose Prior
kuleshov-group/llmtools
Finetuning Large Language Models on One Consumer GPU in 2 Bits
m-bain/webvid
Large-scale text-video dataset. 10 million captioned short videos.
facebookresearch/AudioDec
An Open-source Streaming High-fidelity Neural Audio Codec
lmas/opensimplex
This repo has been migrated to https://code.larus.se/lmas/opensimplex
m-bain/CondensedMovies
Story-Based Retrieval with Contextual Embeddings. Largest freely available movie video dataset. [ACCV'20]
microsoft/cliffordlayers
facebookresearch/RCDM
Visualizing representations with a diffusion-based conditional generative model.
wesselb/neuralprocesses
A framework for composing Neural Processes in Python