recognition

There are 1080 repositories under the recognition topic.

  • HumanSignal/labelImg

    LabelImg is now part of the Label Studio community. The popular image annotation tool created by Tzutalin is no longer actively being developed, but you can check out Label Studio, the open source data labeling tool for images, text, hypertext, audio, video and time-series data.

    Language:Python24.2k4087666.5k
  • jofpin/trape

    People tracker on the Internet: OSINT analysis and research tool by Jose Pino

    Language:Python8.5k3553341.3k
  • all-contributors/all-contributors

    ✨ Recognize all contributors, not just the ones who push code ✨

    Language:HTML7.9k823061.7k
  • clovaai/deep-text-recognition-benchmark

    Text recognition (optical character recognition) with deep learning methods, ICCV 2019

    Language:Jupyter Notebook3.9k854001.1k
  • meijieru/crnn.pytorch

    Convolutional recurrent network in pytorch

    Language:Python2.5k53239658
  • detectRecog/CCPD

    [ECCV 2018] CCPD: a diverse and well-annotated dataset for license plate detection and recognition

    Language:Python2.4k62111576
  • julius-speech/julius

    Open-Source Large Vocabulary Continuous Speech Recognition Engine

    Language:C1.9k103159306
  • Sierkinhane/CRNN_Chinese_Characters_Rec

    (CRNN) Chinese Characters Recognition.

    Language:Python1.9k36311536
  • chrismattmann/tika-python

    Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.

    Language:Python1.6k40295244
  • jasmcaus/opencv-course

    Learn OpenCV in 4 Hours - Code used in my Python and OpenCV course on freeCodeCamp.

    Language:Python1.3k31211k
  • sdkcarlos/artyom.js

    A JavaScript library for voice control, voice commands, speech recognition, and speech synthesis. Create your own Siri, Google Now, or Cortana with Google Chrome within your website.

    Language:JavaScript1.3k75103367
  • jenly1314/MLKit

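    🌝 MLKit is a powerful, easy-to-use toolkit. With ML Kit you can easily implement text recognition, barcode recognition, image labeling, face detection, object detection, and more.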

    Language:Java1.1k1357187
  • sooftware/conformer

    [Unofficial] PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)

    Language:Python1.1k737187
  • AddictedCS/soundfingerprinting

    Open source audio fingerprinting in .NET. An efficient algorithm for acoustic fingerprinting written purely in C#.

    Language:C#1k71201203
  • xinntao/facexlib

    FaceXlib aims at providing ready-to-use face-related functions based on current SOTA open-source methods.

    Language:Python9491337161
  • Breta01/handwriting-ocr

    OCR software for recognition of handwritten text

    Language:Jupyter Notebook81428147244
  • nyrahealth/CrisperWhisper

    Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection

    Language:Python814164145
  • yuxitong/TensorFlowAndroidDemo

    TensorFlow Android demo: recognition and detection of lane lines, vehicles, faces, actions, skeletons, smoking, phone calls, closed eyes, and open eyes

    Language:Java7383218204
  • bgshih/aster

    Recognizing cropped text in natural images.

    Language:Python73520113197
  • openspeech-team/openspeech

    Open-Source Toolkit for End-to-End Speech Recognition leveraging PyTorch-Lightning and Hydra.

    Language:Python70918177114
  • leondgarse/keras_cv_attention_models

    Keras beit,caformer,CMT,CoAtNet,convnext,davit,dino,efficientdet,edgenext,efficientformer,efficientnet,eva,fasternet,fastervit,fastvit,flexivit,gcvit,ghostnet,gpvit,hornet,hiera,iformer,inceptionnext,lcnet,levit,maxvit,mobilevit,moganet,nat,nfnets,pvt,swin,tinynet,tinyvit,uniformer,volo,vanillanet,yolor,yolov7,yolov8,yolox,gpt2,llama2, alias kecam

    Language:Python618227997
  • all-contributors/app

    🤖 A GitHub App to automate acknowledging contributors to your open source projects

    Language:JavaScript59817196158
  • Murgio/Food-Recipe-CNN

    Food image to recipe with deep convolutional neural networks.

    Language:Jupyter Notebook582283131
  • gisbi-kim/PyICP-SLAM

    Full-python LiDAR SLAM using ICP and Scan Context

    Language:Python56113986
  • clovaai/synthtiger

    Official Implementation of SynthTIGER (Synthetic Text Image Generator), ICDAR 2021

    Language:Python544542106
  • lkuza2/java-speech-api

    The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.

    Language:Java5439690299
  • taosir/cnn_handwritten_chinese_recognition

    Online recognition of handwritten Chinese with a CNN.

    Language:Python5431210134
  • gotev/android-speech

    Android speech recognition and text to speech made easy

    Language:Java5262162166
  • JeffersonQin/YuzuMarker.FontDetection

    ✨ First-ever CJK (Chinese Japanese Korean) Font Recognition and Style Extractor: YuzuMarker's font recognition model and implementation, a side project of YuzuMarker

    Language:Python50733022
  • Canjie-Luo/Text-Image-Augmentation

    Geometric Augmentation for Text Image

    Language:C++490191390
  • php-opencv/php-opencv-examples

    Tutorial for computer vision and machine learning in PHP 7/8 by opencv (installation + examples + documentation)

    Language:PHP488243093
  • zagum/SpeechRecognitionView

    "Google Now" style animation for Speech Recognizer.

    Language:Java4852111103
  • haoranD/Awesome-Embodied-AI

    A curated list of awesome papers on Embodied AI and related research/industry-driven resources.

  • all-contributors/cli

    Tool to help automate adding contributor acknowledgements according to the all-contributors specification ✨

    Language:JavaScript4198134146
  • LBH1024/CAN

    When Counting Meets HMER: Counting-Aware Network for Handwritten Mathematical Expression Recognition (ECCV’2022 Poster).

    Language:Python380184361
  • ShoufaChen/AdaptFormer

    [NeurIPS 2022] Implementation of "AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition"

    Language:Python36873521