Pinned Repositories
AI-reads-books-page-by-page
AI reads books: Page-by-Page PDF Knowledge Extractor & Summarizer. The script performs an intelligent page-by-page analysis of PDF books, methodically extracting knowledge points and generating progressive summaries at specified intervals.
aider
aider is AI pair programming in your terminal
Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
AnimatedDrawings
Code to accompany "A Method for Animating Children's Drawings of the Human Figure"
Applio
VITS-based Voice Conversion focused on simplicity, quality and performance.
coding-task
Coding Task
rsenthilkumar6's Repositories
rsenthilkumar6/AI-reads-books-page-by-page
AI reads books: Page-by-Page PDF Knowledge Extractor & Summarizer. The script performs an intelligent page-by-page analysis of PDF books, methodically extracting knowledge points and generating progressive summaries at specified intervals.
rsenthilkumar6/aider
aider is AI pair programming in your terminal
rsenthilkumar6/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
rsenthilkumar6/AnimatedDrawings
Code to accompany "A Method for Animating Children's Drawings of the Human Figure"
rsenthilkumar6/Applio
VITS-based Voice Conversion focused on simplicity, quality and performance.
rsenthilkumar6/clapper
Clapper.app, the video editor designed for the age of AI cinema
rsenthilkumar6/claude-engineer
Claude Engineer is an interactive command-line interface (CLI) that leverages the power of Anthropic's Claude-3.5-Sonnet model to assist with software development tasks. This tool combines the capabilities of a large language model with practical file system operations and web search functionality.
rsenthilkumar6/Auto_Tor_IP_changer
Changes your IP address automatically. This tool is based on the Tor project.
rsenthilkumar6/ComfyUI
The most powerful and modular stable diffusion GUI, api and backend with a graph/nodes interface.
rsenthilkumar6/deepface
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
rsenthilkumar6/DiffSynth-Studio
Enjoy the magic of Diffusion models!
rsenthilkumar6/enchanted
Enchanted is an iOS and macOS app for chatting with private self-hosted language models such as Llama2, Mistral, or Vicuna using Ollama.
rsenthilkumar6/espeak-ng
eSpeak NG is an open source speech synthesizer that supports more than a hundred languages and accents.
rsenthilkumar6/IP-Adapter
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images from an image prompt.
rsenthilkumar6/it-tools
Collection of handy online tools for developers, with great UX.
rsenthilkumar6/kokoro-onnx
TTS with kokoro and onnx runtime
rsenthilkumar6/llama.cpp
LLM inference in C/C++
rsenthilkumar6/MeloTTS
High-quality multi-lingual text-to-speech library by MyShell.ai. Supports English, Spanish, French, Chinese, Japanese, and Korean.
rsenthilkumar6/metavoice-src
Foundational model for human-like, expressive TTS
rsenthilkumar6/mirai
Mirai is a Server-Driven UI (SDUI) framework for Flutter. Mirai allows you to build beautiful cross-platform applications with JSON in real time.
rsenthilkumar6/open-computer-use
Secure AI computer use powered by E2B Desktop Sandbox
rsenthilkumar6/Open-Interface
Control Any Computer Using LLMs
rsenthilkumar6/openpilot
openpilot is an operating system for robotics. Currently, it upgrades the driver assistance system on 275+ supported cars.
rsenthilkumar6/python-interpreter-node
Execute python code in a custom ComfyUI node. Write code to alter the node's input/outputs. Embed mini-scripts into your saved workflows.
rsenthilkumar6/Real-time-voice-ultra-simple-template-with-function-calling
OpenAI real-time voice FastAPI template with function calling, built for maximum simplicity. Comes with an arXiv paper function as an example and full event capture.
rsenthilkumar6/screen-pipe
Record your screen & mic 24/7 and connect it to LLMs. Inspired by adept.ai, rewind.ai, Apple Shortcut. Written in Rust. Free. You own your data.
rsenthilkumar6/StabilityMatrix
Multi-Platform Package Manager for Stable Diffusion
rsenthilkumar6/stack_board
Stacked (layered) widget placement
rsenthilkumar6/tabby
Self-hosted AI coding assistant
rsenthilkumar6/TTS-coqui-ai-TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production