XxA7medxX's Stars
ethz-asl/ssc_exploration
Incremental 3D Scene Completion for Safe and Efficient Exploration Mapping and Planning
JaidedAI/EasyOCR
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
aqntks/Easy-Yolo-OCR
Proceed with text detection only in the selected area of the image
ultralytics/ultralytics
Ultralytics YOLO11 🚀
mesutpiskin/id-card-detector
:credit_card: Detecting the National Identification Cards with Deep Learning (Faster R-CNN)
tailtq/identity-card-info-extraction
buiquangmanhhp1999/extract-information-from-identity-card
From identity card image, this repo detect 4 corners, align by OpenCV, then detect word in image and recognize word by Transformer OCR.
ARBML/klaam
Arabic speech recognition, classification and text-to-speech.
openai/openai-python
The official Python library for the OpenAI API
openai-translator/openai-translator
基于 ChatGPT API 的划词翻译浏览器插件和跨平台桌面端应用 - Browser extension and cross-platform desktop application for translation based on ChatGPT API.
mpcabd/python-arabic-reshaper
Reconstruct Arabic sentences to be used in applications that don't support Arabic
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
mistralai/FastChat-release
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
mistralai/client-python
Python client library for Mistral AI platform
mistralai/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
vedderb/bldc
The VESC motor control firmware
neoxic/ESCape32
BLDC motor control firmware for 32-bit ESCs
dennybritz/reinforcement-learning
Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.
Azure-Samples/Cognitive-Speech-TTS
Microsoft Text-to-Speech API sample code in several languages, part of Cognitive Services.
Emotional-Text-to-Speech/dl-for-emo-tts
:computer: :robot: A summary on our attempts at using Deep Learning approaches for Emotional Text to Speech :speaker:
espnet/espnet
End-to-End Speech Processing Toolkit
NVIDIA/OpenSeq2Seq
Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
mozilla/DeepSpeech
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
PaddlePaddle/PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
SamirPaulb/real-time-voice-translator
A desktop application that uses AI to translate voice between languages in real time, while preserving the speaker's tone and emotion.
IAHispano/Applio
A simple, high-quality voice conversion tool focused on ease of use and performance.
google-ai-edge/mediapipe
Cross-platform, customizable ML solutions for live and streaming media.
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
alexdada555/Modelling-Simulation-and-Implementation-of-Linear-Control-for-Asymmetric-Multirotor-UAVs
Master's Thesis Project: Design, Development, Modelling and Simulating of a Y6 Multi-Rotor UAV, Imlementing Control Schemes such as Proportional Integral Derivative Control, Linear Quadratic Gaussian Control and Model Predictive Control on a BeagleBone Blue