Capture2Text

fork of https://sourceforge.net/projects/capture2text/

Capture2Text Readme

Capture2Text enables users to quickly OCR a portion of the screen using a keyboard shortcut.

For more information visit: http://capture2text.sourceforge.net/

Version History:

[Version 4.6.3 (3-18-2022)]

Ticket #147: Possible fix to prevent capture box and preview from being displayed beneath other windows, especially after running for long periods of time or returning from sleep.
Improved the quality of the Text Line Capture feature, especially in the case were the last character is close to the boundary of a speech bubble which is itself close to some kind of high contrast foreground element.
Ticket #173: Fixed "${timestamp}" format option not working.
Ticket #182: Added tooltip to the tray icon.
Ticket #179: Replace Unicode single quote (’) with an ASCII single quote ('). Also replace (“) and (”) with (").
Ticket #163: Added note to the Hotkeys settings page regarding the proper method to disable hotkeys.
Ticket #175: BOM is no longer added to text files that are output by Capture2Text.
Ticket #162: Updated copyright date in the About dialog.

[Version 4.6.2 (8-10-2019)]

Ticket #49, #72: Fix error when using CLI to OCR 8 bpp images.
Ticket #76: Fix "\u200C" character being added when replacing ligatures.
Ticket #68: Fix typo in About dialog.
Fix typo: "Keep lines breaks" -> "Keep line breaks".

[Version 4.6.1 (7-3-2019)]

Ticket #97: Fixed issue where hex characters were appended to the translation.

[Version 4.6.0 (4-21-2018)]

Ticket #48: \t, \r, and \n can now be used in the "Replace with" column.
Ticket #43: Replace nuisance ligatures (ﬁ, ﬂ).
Ticket #30: Non-32bpp images now supported in CLI. Note: 1bpp images will not be pre-processed.
Added "Settings > Output > Call Executable" option.
In Settings dialog, show tab menu as a list box.

[Version 4.5.1 (11-4-2017)]

Ticket #27: Fixed text-to-speech feature not working due to missing qtexttospeech_sapi.dll.
Fixed bug that caused some save data to be stored in the registry.

[Version 4.5.0 (10-22-2017)]

Ticket #26: Added text-to-speech feature.
Ticket #23: Added "scale factor" option to CLI and Settings dialog.
Ticket #21: Fixed occasional column merge issue for Japanese vertical text.
Update to Tesseract 4.00alpha (Note: Capture2Text will continue to be packaged with legacy traineddata until newer LSTM fast/best traineddata is more mature)
Update to QT 5.9.2 and Leptonica 1.74.4.

[Version 4.4.0 (7-28-2017)]

Ticket #16: Fixed issue where only first line of multi-line capture was translated.
Ticket #14: Added CLI option --clipboard.
Ticket #13: You may now call Capture2Text.exe with the --portable option to place the .ini settings file in the same directory as the .exe.
Ticket #12: Added "Trim capture" option to the Setting dialog.
Ticket #12: Added CLI option --trim-capture.
Added CLI option --deskew.

[Version 4.3.0 (6-2-2017)]

Ticket #6: For CLI, output after each file is processed instead of outputting after all files have been processed.
Ticket #6: Added new CLI --output-format token: ${file}.
Ticket #5: Added CLI option --debug-timestamp.

[Version 4.2.0 (5-13-2017)]

Ticket #4: Added option to log captures to file.
Ticket #4: Added option to append timestamp to debug images.
Ticket #4: Added CLI options --output-file-append and --output-format.

[Version 4.1.0 (4-14-2017)]

Ticket #2: Fixed bug that caused CLI option "--screen-rect" to output an error.
Ticket #1: Added hotkey to toggle whitelist on/off. By default this hotkey is unmapped.
Ticket #1: Added hotkey to toggle blacklist on/off. By default this hotkey is unmapped.
Ticket #1: Added option to specify a Tesseract config file to both GUI and CLI.
Added whitelist and blacklist options to CLI.
Increased default lengths for text line captures.
Show help text when no options are provided to Capture2Text_CLI.exe.
Added suffix to some of spin boxes in the settings dialog.
Reduced border width in popup dialog.
Add version number to the .ini file.

[Version 4.0 (4-2-2017)]

Complete re-implementation in QT/C++.
Added Translation feature (powered by Google Translate).
Added Re-Capture Last hotkey.
Added Text Line Capture hotkey.
Added Forward Text Line Capture hotkey.
Added Bubble Capture hotkey.
Added more Preview position options.
Added blacklist setting.
Added "Reset to defaults" links in Settings dialog.
Capture Box and Preview Box may now have outlines.
Better interface for specifying hotkeys in the Settings dialog.
Custom tray icon "balloon" window.
Added "Replace" tab to the Settings dialog. Substitutions/Replacements are now stored in the settings .ini instead of substitutions.txt.
Added sample Capture Box to Settings dialog.
Added sample Preview box to Settings dialog.
Added deskew option.
Added debug options.
Popup dialog now enabled by default.
Size of Popup dialog is now saved automatically.
Added "Topmost" option to Popup dialog.
Added "Font" option to Popup dialog.
Removed the "Enable OCR pre-processing" option (now always enabled).
Removed the "Strip furigana" option (now always enabled).
Removed the "OCR method" option.
Removed "Prepended/Appended Text" setting.
Removed "Send to Cursor" setting.
Removed "Send to Control" setting.
"Preserve newline characters" setting renamed to "Keep linebreaks".
"Preferences" dialog renamed to "Settings".
Added to Capture2Text_CLI.exe for command line usage.
Settings .ini file now stored in %appdata%\Capture2Text.
Changed some of the hotkey defaults.
Added Russian and Korean to default package and removed Italian.
Added icons to some of the items in the tray menu.
Added more information in the About dialog.

[Version 3.9 (6-5-2016)]

Updated active selection corner logic. (Thanks R. Webster-Noble!).

[Version 3.8 (1-15-2016)]

Updated Tesseract (3.05.00dev).
Support for additional languages.
Added the "OCR Method" setting.
NHocr is no longer packaged (but may still be copied from previous versions to the Utils folder)

[Version 3.7 (7-04-2015)]

Text entered into the popup window will now be saved to the clipboard when the OK button is clicked and the Save to Clipboard option is checked.

[Version 3.6 (5-15-2015)]

Removed the experimental speech recognition feature due to new Google Speech API v2 quota restrictions.
Fixed DPI scale issue with the capture box. (Thanks rocker7!).
Now compiled with AutoHotkey 32-bit Unicode v1.1.22.00 (was v1.1.14.03).

[Version 3.5 (7-17-2014)]

Capture box should be less jumpy.
Preview will now only update when the user has stopped moving the capture box for at least 400 milliseconds.
When preview is setting to "Dynamic", the positioning should be less jumpy.

[Version 3.4 (7-10-2014)]

Added option to strip furigana from Japanese text.
Added the "Auto" choice to the "Text direction" preference.
Removed the option to toggle "OCR pre-processing" from the Preferences. It may still be edited in settings.ini.
Changed the default "OCR pre-processing" hotkey to Shift-Ctrl-Windows-B.

[Version 3.3 (3-2-2014)]

More minor tweaks to the Preferences dialog.

[Version 3.2 (3-1-2014)]

Minor tweaks to the Preferences dialog.

[Version 3.1 (2-28-2014)]

Improved OCR accuracy through use of better image pre-preprocessing (leptonica_util).
Now supports text and backgrounds of any color when OCR pre-processing is enabled. (In the previous version, only dark text on a light background was supported).
Added option to place the preview text beside the capture box.
Japanese (Tesseract) accuracy is now vastly improved through use of a Japanese-specific Tesseract config file. Also using this config file with Chinese (Tesseract).
Using Tesseract v3.02.02 for Japanese (was v3.01).
Replaced the binarize option with the OCR pre-processing option.
Removed "Send to Control" from the right-click menu.
Removed the Chinese (NHocr) language pack from default distribution. (You can still download it from https://code.google.com/p/nhocr/downloads/list).
Added the Italian language pack to the default distribution.
Removed setting of PreviewRemoveCaptureBox from the GUI.
Removed ConvertImageFormat (replaced with leptonica_util).
Now compiled with AutoHotkey 32-bit Unicode v1.1.14.03 (was v1.1.11.01).

[Version 3.0 (8-27-2013)]

Added option to binarize captured image before sending it to the OCR engine.

[Version 2.5 (7-5-2013)]

Updated NHocr from v0.20 to v0.21.
Now compiled with Ahk2Exe v1.1.11.01 instead of v1.1.05.06.

[Version 2.4 (12-29-2012)]

Added support for Arabic, Danish (Alternate), Esperanto (Alternate), German (Alternate) and Slovakian (Alternate).

[Version 2.3 (11-9-2012)]

Added option to remove the capture box before a preview OCR. This is more accurate, particularly with NHocr, but causes the capture box to flicker.
Changed the default image scale factor from 300% to 320% to meet Tesseract's minimum recommended DPI.
When using Japanese, revert to Tesseract v3.01. It is MUCH more accurate than v3.02.02.
Now passing a .ppm image to NHocr instead of a .pgm image to better handle non-grayscale captures.
Increased update rate of the capture box to make it appear more fluid.
Fixed text direction being ignored bug for Chinese/Japanese that was introduced in v2.2.
Fixed bug that caused the capture box to stick around after it was supposed to be removed.

[Version 2.2 (11-4-2012)]

Upgraded to Tesseract v3.02.02. For details, see: http://code.google.com/p/tesseract-ocr/wiki/ReleaseNotes
Added whitelist option to the OCR tab.
Simplified substitution tokens and fixed whitespace bug.

[Version 2.1 (10-7-2012)]

Added the substitutions feature.
Added command line options.

[Version 2.0 (3-10-2012)]

Added the Preferences dialog. No more editing settings.ini by hand.
The popup window is now multi-lined.
Added option to preserve newline characters.
Limited preview to 150 characters. A trailing "..." will appear if necessary.
Added Speech Recognition Language option to right-click menu.
Cleaned up the right-click menu.
On the first run, inform user how to access the Preferences dialog.

[Version 1.10a (2-18-2012)]

Removed GdiPlus.dll from distribution.

[Version 1.10 (12-31-2011)]

Added preview box (and corresponding settings)

[Version 1.09 (11-10-2011)]

Fixed speech recording stopping in the middle of a sentence.
Fixed VoiceMaxResults not working correctly. Also increased to 9 as default.

[Version 1.08 (11-06-2011)]

Upgraded Tesseract to version 3.01 (it has better vertical text support and doesn't ignore small captures as much)
When using Tesseract Chinese or Japanese, you can now select the text direction (vertical or horizontal). To support this, added TextDirectionToggleKey and textDirection to settings.ini.
Changed default for ScaleFactor from 4.0 to 3.0 in settings.ini.
Changed menu text for Chinese and Japanese to reflect the OCR engine being used.

[Version 1.07 (11-05-2011)]

Added voice recognition support via unofficial Google voice recognition service
Added the "Send To Cursor" option to menu. The setting.ini file includes: SendToCursor SendToCursorApplyBeforeAndAfterCommands
Renamed OCRAdjustment to OCRSpecific in settings.ini
Moved the CaptureBox section in settings.ini to the OCRSpecific section
Added VoiceSpecific to settings.ini. Section includes: VoiceMaxResults VoiceResultsWindowWidth VoiceResultsWindowFont VoiceResultsWindowFontSize VoiceSilenceBeforeStop VoiceLanguage
Added StartVoiceCapture to Hotkey section in settings.ini
Added VoiceLanguageToggleKey to Hotkey section in settings.ini
Removed scaleFilter from settings.ini
Removed the scaleFactor option from the menu (it's still in settings.ini)

[Version 1.06 (12-12-2010)]

Added language quick access keys.
For Chinese and Japanese delete newlines. For other languages replace newlines with spaces.

[Version 1.05 (12-04-2010)]

Fixed issue where the checkboxes in the language menu wouldn't disappear.

[Version 1.04 (12-04-2010)]

Added ability to move the capture box by right-clicking
Added languages supported by the Tesseract OCR tool
Created a right-click menu that allow the user to select language, output type, capture box settings and scale factor
Removed unnecessary items from settings.ini

[Version 1.03 (11-27-2010)]

Added ability to change dictionary when the Dictionary setting in settings.ini
Added Chinese dictionary

[Version 1.02 (11-27-2010)]

Changed CaptureKey to StartAndEndCaptureKey in settings.ini
Added EndOnlyCaptureKey to settings.ini
Added ToggleActiveCaptureCornerKey to setting.ini

[Version 1.01 (11-27-2010)]

Added ReplaceControlText to settings.ini
Added ability to use linefeeds, carriage returns and tabs in PrependText and AppendText
Added an "About" item to the tray menu.
Removed the capture box showing up in the taskbar
Removed the PassThruKey settings in settings.ini. They are no longer needed.
Changed the tray tooltip text
Cleaned up code and put the ScreenCapture routines in a separate file

[Version 1.00 (11-26-2010)]

Initial version

davidwed/Capture2Text

Capture2Text

Capture2Text Readme

Version History: