recognition

There are 987 repositories under recognition topic.

  • HumanSignal/labelImg

    LabelImg is now part of the Label Studio community. The popular image annotation tool created by Tzutalin is no longer actively being developed, but you can check out Label Studio, the open source data labeling tool for images, text, hypertext, audio, video and time-series data.

    Language:Python22k4007676.2k
  • jofpin/trape

    People tracker on the Internet: OSINT analysis and research tool by Jose Pino

    Language:Python7.9k3513221.3k
  • all-contributors/all-contributors

    ✨ Recognize all contributors, not just the ones who push code ✨

    Language:HTML7.5k802451.7k
  • clovaai/deep-text-recognition-benchmark

    Text recognition (optical character recognition) with deep learning methods, ICCV 2019

    Language:Jupyter Notebook3.7k853831.1k
  • meijieru/crnn.pytorch

    Convolutional recurrent network in pytorch

    Language:Python2.3k56238655
  • detectRecog/CCPD

    [ECCV 2018] CCPD: a diverse and well-annotated dataset for license plate detection and recognition

    Language:Python2.2k64105563
  • julius-speech/julius

    Open-Source Large Vocabulary Continuous Speech Recognition Engine

    Language:C1.8k106158295
  • Sierkinhane/CRNN_Chinese_Characters_Rec

    (CRNN) Chinese Characters Recognition.

    Language:Python1.8k36310538
  • chrismattmann/tika-python

    Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.

    Language:Python1.4k38278234
  • sdkcarlos/artyom.js

    A voice control - voice commands - speech recognition and speech synthesis javascript library. Create your own siri,google now or cortana with Google Chrome within your website.

    Language:JavaScript1.2k74103370
  • jasmcaus/opencv-course

    Learn OpenCV in 4 Hours - Code used in my Python and OpenCV course on freeCodeCamp.

    Language:Python1k3020914
  • soundfingerprinting

    AddictedCS/soundfingerprinting

    Open source audio fingerprinting in .NET. An efficient algorithm for acoustic fingerprinting written purely in C#.

    Language:C#91573193187
  • sooftware/conformer

    [Unofficial] PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)

    Language:Python888936173
  • xinntao/facexlib

    FaceXlib aims at providing ready-to-use face-related functions based on current STOA open-source methods.

    Language:Python7681433138
  • Breta01/handwriting-ocr

    OCR software for recognition of handwritten text

    Language:Jupyter Notebook72529153237
  • yuxitong/TensorFlowAndroidDemo

    TensorFlow android demo 车道线 车辆 人脸 动作 骨架 识别 检测 抽烟 打电话 闭眼 睁眼

    Language:Java7223118200
  • bgshih/aster

    Recognizing cropped text in natural images.

    Language:Python72021113192
  • openspeech-team/openspeech

    Open-Source Toolkit for End-to-End Speech Recognition leveraging PyTorch-Lightning and Hydra.

    Language:Python66217172113
  • leondgarse/keras_cv_attention_models

    Keras beit,caformer,CMT,CoAtNet,convnext,davit,dino,efficientdet,edgenext,efficientformer,efficientnet,eva,fasternet,fastervit,fastvit,flexivit,gcvit,ghostnet,gpvit,hornet,hiera,iformer,inceptionnext,lcnet,levit,maxvit,mobilevit,moganet,nat,nfnets,pvt,swin,tinynet,tinyvit,uniformer,volo,vanillanet,yolor,yolov7,yolov8,yolox,gpt2,llama2, alias kecam

    Language:Python569237387
  • Food-Recipe-CNN

    Murgio/Food-Recipe-CNN

    food image to recipe with deep convolutional neural networks.

    Language:Jupyter Notebook562293129
  • app

    all-contributors/app

    🤖 A GitHub App to automate acknowledging contributors to your open source projects

    Language:JavaScript55217188143
  • lkuza2/java-speech-api

    The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.

    Language:Java5329890304
  • taosir/cnn_handwritten_chinese_recognition

    CNN在线识别手写中文。

    Language:Python4931210127
  • Canjie-Luo/Text-Image-Augmentation

    Geometric Augmentation for Text Image

    Language:C++478201389
  • zagum/SpeechRecognitionView

    "Google Now" style animation for Speech Recognizer.

    Language:Java4772211104
  • gotev/android-speech

    Android speech recognition and text to speech made easy

    Language:Java4662261156
  • php-opencv/php-opencv-examples

    Tutorial for computer vision and machine learning in PHP 7/8 by opencv (installation + examples + documentation)

    Language:PHP466252988
  • gisbi-kim/PyICP-SLAM

    Full-python LiDAR SLAM using ICP and Scan Context

    Language:Python46312978
  • clovaai/synthtiger

    Official Implementation of SynthTIGER (Synthetic Text Image Generator), ICDAR 2021

    Language:Python43663984
  • JeffersonQin/YuzuMarker.FontDetection

    ✨ 首个CJK(中日韩)字体识别以及样式提取模型 YuzuMarker的字体识别模型与实现 / First-ever CJK (Chinese Japanese Korean) Font Recognition and Style Extractor, side project of YuzuMarker

    Language:Python41342717
  • all-contributors/cli

    Tool to help automate adding contributor acknowledgements according to the all-contributors specification ✨

    Language:JavaScript4128130145
  • LBH1024/CAN

    When Counting Meets HMER: Counting-Aware Network for Handwritten Mathematical Expression Recognition (ECCV’2022 Poster).

    Language:Python347234057
  • php-opencv/php-opencv

    opencv 4.5+ with dnn module for php 7/8

    Language:C++335144543
  • Feghal/ImageDetect

    ✂️ Detect and crop faces, barcodes and texts in image with iOS 11 Vision api.

    Language:Swift30210030
  • ShoufaChen/AdaptFormer

    [NeurIPS 2022] Implementation of "AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition"

    Language:Python30173517
  • cansik/architectural-floor-plan

    AFPlan is an architectural floor plan analysis and recognition system to create extended plans for building services.

    Language:Kotlin298196174