alex625051
Ананьев Александр Владимирович. Postgraduate studies at D. Mendeleev University of Chemical Technology of Russia (РХТУ)
Yandex, Moscow
Pinned Repositories
-
-test_137
Python test assignment
3WiFi
3WiFi Wireless Database
AI-Video-Translation
A simple Google Colab notebook which can translate an original video into multiple languages along with lip sync.
alex625051.github.io
audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
MC_stroke
Computer implementation of a mathematical model of ischemic stroke based on a cellular automaton using the Monte Carlo method
pythonLessons
Lessons on automation with Python
Translumo
Advanced real-time screen translator for games, hardcoded subtitles in videos, static text, etc.
videoTranslater_v1_0_portable
This project is designed for translating videos from Russian to English with automatic subtitle generation. It operates locally on your computer using the CPU.
alex625051's Repositories
alex625051/MC_stroke
Computer implementation of a mathematical model of ischemic stroke based on a cellular automaton using the Monte Carlo method
alex625051/Translumo
Advanced real-time screen translator for games, hardcoded subtitles in videos, static text, etc.
alex625051/videoTranslater_v1_0_portable
This project is designed for translating videos from Russian to English with automatic subtitle generation. It operates locally on your computer using the CPU.
alex625051/AI-Video-Translation
A simple Google Colab notebook which can translate an original video into multiple languages along with lip sync.
alex625051/alex625051.github.io
alex625051/audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
alex625051/autotranslate
Videos Transcription and Translation with Faster Whisper and ChatGPT
alex625051/audiobook_maker
alex625051/bi-magic-resources
alex625051/ebook2audiobookXTTS
Generates an audiobook with chapters and ebook metadata using Calibre and XTTS from Coqui TTS, with optional voice cloning and support for multiple languages
alex625051/grok-1
Grok open release
alex625051/kafka-workshop
Intensive course "Kafka in 90 Minutes" for DevOpsConf 2023
alex625051/large-v2-002.pt
This repository contains the large-v2 model for the Whisper neural network.
alex625051/large_model_v2
alex625051/model_lab1
alex625051/postgres-kafka-demo
Fully reproducible, Dockerized, step-by-step, demo on how to stream tables from Postgres to Kafka/KSQL back to Postgres. Detailed blog post published on Medium.
alex625051/ProPainter
[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting
alex625051/pyannote-audio_2.1.1
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
alex625051/pytorch_2.0.1
Tensors and Dynamic neural networks in Python with strong GPU acceleration
alex625051/real-time-voice-translator
A desktop application that uses AI to translate voice between languages in real time, while preserving the speaker's tone and emotion.
alex625051/seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
alex625051/skills-introduction-to-github
Another test repository of mine
alex625051/speechbrain_0.5.15
A PyTorch-based Speech Toolkit
alex625051/torch_audio_2.0.2
Data manipulation and transformation for audio signal processing, powered by PyTorch
alex625051/torch_text_15.0.2
Models, data loaders and abstractions for language processing, powered by PyTorch
alex625051/tourch_vision_0.15.2
Datasets, Transforms and Models specific to Computer Vision
alex625051/transcribation_with_speakers
alex625051/video-retalking
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
alex625051/ViDove
🐦ViDove: End-to-end Video Translation Toolkit
alex625051/whisper
Robust Speech Recognition via Large-Scale Weak Supervision