A screen translator/OCR translator built with Python and Tesseract; the user interface is made with Tkinter. All code is written in Python.

Last update: Dec 30, 2022

Overview

About

An OCR translator tool that I built on top of Tesseract and compiled to .exe with PyInstaller. I made this program to learn more about Python.

Inspired by Visual Novel Reader (VNR), Visual Novel OCR, and QTranslate

Requirements

For User

Tesseract. You only need to install it together with its language tessdata.
An internet connection, for the translation engines.

For Dev

Python 3.5+, as checked with vermin (I am using Python 3.9.6)
Libraries from Python: os, sys, functools, json, webbrowser, subprocess, datetime, Mbox, tkinter, pathlib, asyncio
External libraries: pyperclip, pytesseract, pyautogui, pillow, deepl_scraper_pp, deep_translator, keyboard

You can install them by running pip_install.bat or by installing them yourself; full details are in requirements.txt.
*If I missed anything, please let me know.
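
As a quick sanity check after installing, a short script can report which third-party modules are still missing. This is only a sketch: the import names below are assumptions (a package's import name can differ from its pip name, for example pillow is imported as PIL), and deepl_scraper_pp is left out because its import name is not stated here.

```python
from importlib.util import find_spec

# Assumed import names for the external libraries listed above.
THIRD_PARTY_MODULES = [
    "pyperclip",
    "pytesseract",
    "pyautogui",
    "PIL",              # provided by the pillow package
    "deep_translator",
    "keyboard",
]


def missing_modules(names):
    """Return the top-level module names that cannot be found."""
    return [name for name in names if find_spec(name) is None]


def report(names=THIRD_PARTY_MODULES):
    """Print which of the given modules still need to be installed."""
    missing = missing_modules(names)
    if missing:
        print("Missing modules:", ", ".join(missing))
    else:
        print("All listed modules are importable.")
    return missing
```

Calling report() prints the modules that pip still needs to install, if any.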

Tutorial on How To Install and Setup

For User

Download the latest release of this program.
Install Tesseract; make sure to select all the language packs when prompted.
Open ScreenTranslate.exe.
Check the settings and make sure the Tesseract location is correct.
Set monitor scaling to 100% so that the image is captured accurately (recommended). If scaling is not set to 100%, you will need to set an offset in the settings.
Set an offset if you use multiple monitors (optional).
Try capturing an image to see whether it works. If it doesn't, check the captured image in the img_cache folder. If it still doesn't work, try changing the offset.
Now that everything is set, the app should be ready. Feel free to open a new issue on the GitHub repository if you encounter any bugs.
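
If you are unsure whether the Tesseract location in the settings is correct, a small helper like the one below can check it. It is a minimal sketch, not the app's own code: find_tesseract is a hypothetical helper, and the Windows path in the usage note is only the commonly used default install location, which may differ on your machine.

```python
import os
import shutil


def find_tesseract(candidates=()):
    """Return the first usable Tesseract path, or None if none is found.

    `candidates` are paths to try first (for example the location entered in
    the settings); after that the system PATH is searched.
    """
    for path in candidates:
        if path and os.path.isfile(path):
            return path
    return shutil.which("tesseract")
```

For example, find_tesseract([r"C:\Program Files\Tesseract-OCR\tesseract.exe"]) returns that path if the file exists. When using pytesseract directly, the result can be assigned to pytesseract.pytesseract.tesseract_cmd.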

For Dev

Clone the repo or download the source code of the latest release
Install Tesseract; make sure to select all the language packs when prompted
Install all the dependencies used by the project
Run and test the source code
If everything works, you can run the app normally by running the ScreenTranslate.py file or by using TempRun.bat

If everything works and you have a suggestion or improvement, you can submit a pull request on the GitHub repository. I will check whether it's a good idea to add it.

How To Compile It To .exe Yourself

You can use py2exe or many other tools. I use PyInstaller to compile it.
The command used is:

# On Source Code Directory
pyinstaller ScreenTranslate.spec

Read this stackoverflow post to learn more on how to do it.
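
If you prefer to start the build from a Python script rather than typing the command, the same call can be wrapped as below. This is a sketch under the assumption that PyInstaller is installed in the current interpreter; pyinstaller_command and build are hypothetical helper names.

```python
import subprocess
import sys


def pyinstaller_command(spec_file="ScreenTranslate.spec"):
    """Build the argument list for running PyInstaller on a spec file."""
    return [sys.executable, "-m", "PyInstaller", spec_file]


def build(source_dir=".", spec_file="ScreenTranslate.spec"):
    """Run PyInstaller inside the source directory; raises if the build fails."""
    subprocess.run(pyinstaller_command(spec_file), cwd=source_dir, check=True)
```

Calling build() from the source code directory is equivalent to running the command shown above.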

Tutorial on How To Use

Select Language
Translate or Capture Image using the capture window
Set hotkeys and delays as needed
Set offset if needed (Usually when scaling is not 100% or when using multiple monitors)
Done
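
To give an idea of what the offset does, the sketch below turns the capture window's position into the screen region that is actually grabbed. How the app combines scaling and offset internally is not documented here, so the formula is an assumption, and capture_box is a hypothetical helper.

```python
def capture_box(x, y, w, h, offset=(0, 0, 0, 0), scale=1.0):
    """Return (left, top, width, height) in screen pixels.

    x, y, w, h  -- position and size of the capture window as reported by the UI
    offset      -- assumed (X, Y, W, H) corrections, like the offset settings
    scale       -- monitor scaling factor (1.0 means 100%, 1.25 means 125%)
    """
    off_x, off_y, off_w, off_h = offset
    return (
        round(x * scale) + off_x,
        round(y * scale) + off_y,
        round(w * scale) + off_w,
        round(h * scale) + off_h,
    )
```

The resulting tuple has the shape that pyautogui.screenshot(region=...) expects. If the image saved in img_cache is shifted relative to the text, adjusting the X and Y offsets moves the grabbed region back into place.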

Disclaimer

This is free, open-source software; you can use it for any purpose. However, I am not responsible for any damage caused by this software. Use it at your own risk. (Not that it will do anything to you, it's just a tool to help you translate text lol)

This is also a non-profit project; I make no money from creating it.

Comments

Thank you for creating the software. But can you help me to solve the following problems.

I tried translating the game Tales of Arise. It doesn't seem to work well with large fonts, and "Please enter some text" often appears. Does opacity affect the effectiveness of text detection in the game? How should I adjust the offset X, Y, W, H for the software to work best?

opened by nonamebatbai 25
Multiple improvements, view comment
Add hotkey for capture and translate

Uses python module 'keyboard'

Example: Set hotkey to 'Enter' such that pressing enter can both advance the VN and capture

The user can set the hotkey in the settings by pressing a button and then pressing the desired hotkey

Instead of reading Setting.json every time the screen is captured, settings cached in memory are read, to reduce reads from the hard drive

Auto copy resource and user_manual from source through .spec file; Removed redundant files from copy_after_compiling

Fix typo

Add files to .gitignore
opened by laggykiller 8
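
The settings caching described in the pull request above can be sketched roughly as follows. This is not the project's actual code: SettingsCache is a hypothetical class that only illustrates reading the JSON file once and serving later reads from memory.

```python
import json


class SettingsCache:
    """Load a JSON settings file once and serve later reads from memory."""

    def __init__(self, path):
        self.path = path
        self._data = None

    def get(self):
        # Read from disk only on first use (or after invalidate()).
        if self._data is None:
            with open(self.path, encoding="utf-8") as fh:
                self._data = json.load(fh)
        return self._data

    def save(self, data):
        # Write through to disk and refresh the in-memory copy.
        with open(self.path, "w", encoding="utf-8") as fh:
            json.dump(data, fh, indent=2)
        self._data = data

    def invalidate(self):
        # Force the next get() to read the file again.
        self._data = None
```

With this pattern, each capture calls get() and touches the disk only the first time; the file is written again only when the user saves new settings.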
Changing directory structure
Change directory structure

Use os.path.join() instead of string joining for handling paths

Use path variables instead of joining the path every time it is used
opened by laggykiller 4
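
As an illustration of the two points above, the paths can be computed once with os.path.join() and then reused. The directory layout in this sketch is an assumption pieced together from names mentioned elsewhere on this page (resource/json, img_cache, Setting.json); build_paths is a hypothetical helper.

```python
import os


def build_paths(base_dir):
    """Compute the project's paths once so callers can reuse them."""
    resource_dir = os.path.join(base_dir, "resource")
    json_dir = os.path.join(resource_dir, "json")
    return {
        "resource": resource_dir,
        "json": json_dir,
        # Assumed location of the settings file.
        "settings": os.path.join(json_dir, "Setting.json"),
        "img_cache": os.path.join(base_dir, "img_cache"),
    }
```

Because os.path.join() inserts the separator of the current platform, the same code works with both "\" and "/" style paths.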
i have an idea

Pretty good results. My idea is that you could create an extra window that darkens the surroundings so that the text is easier to see and the "Text Capture Area" works better. The "Text Capture Area" window must be overlaid on the new darkening window for this to work properly. I darkened the "Text Capture Area" and used "Capture Area Settings" to translate, and got the exact same result as the picture above.

opened by nonamebatbai 3
Can you add more OCR engines to your software?

This dialogue with a black border translates very accurately. For the dark scene, the translation is also very accurate, but the light scene cannot be translated. Can you solve this problem? Why does the software not work when the scene is bright?

opened by nonamebatbai 3
thank you very much

thank you very much . The software is considered complete. I'm so grateful for your hard work to create great software like this. And this is also the last version you updated?

opened by nonamebatbai 0

Releases(V1.8.5)

V1.8.5(Apr 7, 2022)
As mentioned in #12 by @vtminhh if monitor scaling is not 100%, the setting window will appear cropped. This release fixes that by making the frame bigger and the window resizable.

Changelog [V1.8.5 Resizable setting window]

To counter the scaling problem, you can now resize the setting window. It is also bigger now.

Requirements

Tesseract, needed for the OCR. Install it with all the language packs.

LibreTranslate for offline translation (Optional).

Internet connection for translation if not using LibreTranslate.

Full Changelog: https://github.com/Dadangdut33/Screen-Translate/compare/V1.8.4...V1.8.5

For detailed installation information please take a look at the readme page
Source code(tar.gz)
Source code(zip)
Changelog.txt(4.98 KB)
Installer.ScreenTranslate.1_8_5_Console.exe(49.85 MB)
Installer.ScreenTranslate.1_8_5_No.Console.exe(49.84 MB)
readme.txt(1.55 KB)
ScreenTranslate.1.8.5.Console.Portable.zip(71.13 MB)
ScreenTranslate.1.8.5.No.Console.Portable.zip(71.14 MB)
V1.8.4(Dec 12, 2021)
I realized that when the captured words are separated by new lines, the translation gets a little messy, so I added an option to enable or disable this. By default, new lines are now replaced by a space. I also added a setting for the LibreTranslate API key.

Changelog [V1.8.4 Minor Update]

Added a setting for replacing new lines with spaces in captured text.

Added a setting for the LibreTranslate API key.
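
The new-line replacement can be pictured with the short sketch below. clean_captured_text and its replace_newline flag are hypothetical names that mirror the setting described above; the app's real implementation may differ.

```python
def clean_captured_text(text, replace_newline=True):
    """Optionally join OCR lines into a single line before translating."""
    if not replace_newline:
        return text
    # Drop empty lines and surrounding whitespace, then join with one space.
    lines = (line.strip() for line in text.splitlines())
    return " ".join(line for line in lines if line)
```

Joining the lines first lets the translation engine see one sentence instead of several fragments.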

Requirements

Tesseract, needed for the OCR. Install it with all the language packs.

LibreTranslate for offline translation (Optional).

Internet connection for the other engines (Needed if not using LibreTranslate).

Full Changelog: https://github.com/Dadangdut33/Screen-Translate/compare/V1.8.3...V1.8.4
Source code(tar.gz)
Source code(zip)
Installer.ScreenTranslate.1_8_4_Console.exe(44.45 MB)
Installer.ScreenTranslate.1_8_4_No.Console.exe(44.46 MB)
ScreenTranslate.1.8.4.Console.Portable.rar(49.32 MB)
ScreenTranslate.1.8.4.No.Console.Portable.rar(49.33 MB)
V1.8.3(Dec 11, 2021)
Thanks to user fnx4, offline translation is now possible. There is now a LibreTranslate engine that you can use either by hosting it yourself or by using an available dedicated server. If you host it yourself, you can use it without an internet connection.

Preview

Changelog [V1.8.3 Added offline translation support by using LibreTranslate]

Offline translation is now supported through LibreTranslate. You can also use the online version if you are not interested in hosting it yourself. Link: https://github.com/LibreTranslate/LibreTranslate

Added a connection indicator in the main menu.

If you start the program without an internet connection, you can now reconnect by pressing the signal logo at the top right of the main menu.
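
For reference, a request to a LibreTranslate server looks roughly like the sketch below. It uses only the standard library and assumes a self-hosted server on http://localhost:5000; build_translate_request and translate are hypothetical helpers, not the app's own functions.

```python
import json
import urllib.request


def build_translate_request(text, source, target,
                            host="http://localhost:5000", api_key=""):
    """Create a POST request for the LibreTranslate /translate endpoint."""
    payload = {"q": text, "source": source, "target": target, "format": "text"}
    if api_key:
        # Only needed when the server requires an API key.
        payload["api_key"] = api_key
    return urllib.request.Request(
        host.rstrip("/") + "/translate",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def translate(text, source, target, **kwargs):
    """Send the request and return the translated text."""
    request = build_translate_request(text, source, target, **kwargs)
    with urllib.request.urlopen(request, timeout=10) as response:
        return json.load(response)["translatedText"]
```

With a local server running, translate("Hola", "es", "en") sends the text to it without needing an internet connection.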

Requirements

Tesseract, needed for the OCR. Install it with all the language packs.

LibreTranslate for offline translation (Optional).

Internet connection for the other engines (Needed if not using LibreTranslate).

Full Changelog: https://github.com/Dadangdut33/Screen-Translate/compare/V1.8.2...V1.8.3
Source code(tar.gz)
Source code(zip)
Installer.ScreenTranslate.1_8_3_Console.exe(44.45 MB)
Installer.ScreenTranslate.1_8_3_No.Console.exe(44.46 MB)
ScreenTranslate.1.8.3.Console.Portable.rar(49.32 MB)
ScreenTranslate.1.8.3.No.Console.Portable.rar(49.33 MB)
V1.8.2(Nov 21, 2021)
To improve the user experience, I have added some status indicators and symbols/emojis to the program.

Preview

Changelog [V1.8.2 Minor Bug Fix and Update]

Fixed the alt Chinese language not working when capturing

Fixed the logo for the mask UI (I forgot to add the logo before)

Added a status indicator at the top right of the main menu to show the current state of the program (Ready/Busy/Error/Warning); each state is colored differently.

Added a log window that shows the program's running log; you can turn it on in the settings.

Added symbols/emojis to some buttons to make them clearer.

Added a button to delete all captured images in the settings.

Changed the cursor's look after snipping to indicate the loading status.

Requirements

Tesseract, needed for the OCR. Install it with all the language packs.

Internet connection

Full Changelog: https://github.com/Dadangdut33/Screen-Translate/compare/V1.8.1...V1.8.2
Source code(tar.gz)
Source code(zip)
Installer.ScreenTranslate.1_8_2_Console.exe(44.45 MB)
Installer.ScreenTranslate.1_8_2_No.Console.exe(44.45 MB)
ScreenTranslate.1.8.2.Console.Portable.rar(49.32 MB)
ScreenTranslate.1.8.2.No.Console.Portable.rar(49.33 MB)
V1.8.1(Nov 16, 2021)
By request, I have added a mask window that can help when trying to capture text. It works by making the area around the image darker or lighter depending on the colors that you choose (the default is dark).

Preview

Changelog [V1.8.1 Minor Bug Fix and Update]

Fixed the Chinese language bug

Fixed the debug mode checkbox not syncing between the two UIs

Added a setting to control how many last characters to delete

Requirements

Tesseract, needed for the OCR. Install it with all the language packs.

Internet connection

Full Changelog: https://github.com/Dadangdut33/Screen-Translate/compare/V1.8...V1.8.1
Source code(tar.gz)
Source code(zip)
Installer.ScreenTranslate.1_8_1_Console.exe(44.40 MB)
Installer.ScreenTranslate.1_8_1_No.Console.exe(44.39 MB)
ScreenTranslate.1.8.1.Console.Portable.rar(49.27 MB)
ScreenTranslate.1.8.1.No.Console.Portable.rar(49.27 MB)
V1.8(Nov 3, 2021)
Finally another update. The app looks like it's in its final stage now; I have reached what I envisioned when I first created it. I don't know whether there will be more features to add, but you can still submit a feature request if you have one.

With this update, there is now a snip and cap feature, which makes the program more practical to use. I hope this helps. Also, thanks to user nonamebatbai for requesting the feature. I almost didn't want to create it at first, but the request showed me that someone needs and wants it. Thanks once again! :D

Changelog: [V1.8 Added snip and cap]

Added snip and cap

Added more options to the capture window

Added detached setting window for capturing

Added an option to auto-detect the background type

Added debug mode for capture enhancement (This will show how the image is processed, might help on getting the best result)

Fix keybind not unbinding on restore default

Requirements

Tesseract, needed for the OCR. Install it with all the language packs.

Full Changelog: https://github.com/Dadangdut33/Screen-Translate/compare/V1.7.2...V1.8

P.S. I have now also uploaded this program to SourceForge; you can download it from there. Leave a rating if you can xD
Source code(tar.gz)
Source code(zip)
Installer.ScreenTranslate.1_8_Console.exe(44.69 MB)
Installer.ScreenTranslate.1_8_No.Console.exe(44.69 MB)
ScreenTranslate.1.8.Console.rar(49.83 MB)
ScreenTranslate.1.8.No.Console.rar(49.84 MB)
V1.7.2(Oct 20, 2021)
V1.7.2, I found out that you need to specify whether the background is dark or light when using the cv2 enhancement, so I decided to upload this small but useful update.

Changelog: [V1.7.2 Bug fix and background option]

Fix reversed word

Added an option to choose whether the background to be captured is light or dark
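
To show why the background type matters, here is a dependency-free sketch of thresholding a grayscale image so that the text always ends up dark on a light background. The app itself uses cv2 for its enhancement; this pure-Python version with hypothetical helper names only illustrates the idea, and the fixed threshold of 128 is an assumption.

```python
def is_dark_background(gray):
    """Guess the background type from the mean brightness (0-255)."""
    pixels = [value for row in gray for value in row]
    return sum(pixels) / len(pixels) < 128


def binarize(gray, dark_background, threshold=128):
    """Return a black-on-white image (0 = text, 255 = background)."""
    result = []
    for row in gray:
        if dark_background:
            # Light text on a dark background: bright pixels are text.
            result.append([0 if value >= threshold else 255 for value in row])
        else:
            # Dark text on a light background: dark pixels are text.
            result.append([255 if value >= threshold else 0 for value in row])
    return result
```

If the wrong background type is chosen, the output is inverted (white text on black), which is the kind of result the option above is meant to avoid. In cv2 terms the two branches roughly correspond to thresholding with THRESH_BINARY_INV and THRESH_BINARY.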

Source code(tar.gz)
Source code(zip)
Changelog.txt(3.15 KB)
Installer_Screen.Translate.1_7_2_No.Console.exe(45.78 MB)
Installer_ScreenTranslate.1_7_2_Console.exe(45.78 MB)
readme.txt(1.32 KB)
ScreenTranslate.1.7.2.Console.rar(50.66 MB)
ScreenTranslate.1.7.2.No.Console.rar(50.67 MB)
V1.7.1(Oct 20, 2021)
V1.7.1, a minor update. By default, the app now won't show any alert if no text is entered.

Changelog: [V1.7.1 Added option to show or hide the alert if no text is obtained]

By default, the app will now skip the alert if no text is obtained. You can make the alert show again in the settings.

Source code(tar.gz)
Source code(zip)
Changelog.txt(3.00 KB)
readme.txt(1.32 KB)
Screen.Translate.1_7_1_No.Console.Installer.exe(45.80 MB)
ScreenTranslate.1.7.1.Console.rar(50.68 MB)
ScreenTranslate.1.7.1.No.Console.rar(50.69 MB)
ScreenTranslate.1_7_1_Console.Installer.exe(45.80 MB)
V1.7(Oct 20, 2021)
V1.7 Preview

V1.7 is here. We now have an installer, built with Inno Setup. The setting window is now organized into categories, you can now customize the query and result boxes, and there is also an option to improve the OCR by using python-opencv.

Changelog: [V1.7 OCR Enhancement and setting window improvement]

Fix the language code bug

Added some more languages that Tesseract supports

The OCR now utilizes python-opencv to improve text detection. You can turn it on or off in the settings

There are now settings for window transparency, textbox font, textbox foreground, and textbox background color for each detached window (the query and result windows)

You can now choose whether to check for updates on app startup

The setting UI is no longer resizable, but it is now organized into categories and looks much better, with more hints and tips

Every button and spinbox now uses ttk instead of tk (it looks more modern now)

The table in the history window now expands

Added more tooltip text to the settings

The app now has an installer!

*Note: App size might be bigger now that it uses cv2

Note

You can choose between the installer version and the rar version; the only difference is that the installer version is fancier and has an uninstaller.

Source code(tar.gz)
Source code(zip)
Screen.Translate.1_7_NoConsole.Installer.exe(45.78 MB)
ScreenTranslate.1.7.Console.rar(50.66 MB)
ScreenTranslate.1.7.No.Console.rar(50.66 MB)
ScreenTranslate.1_7_WithConsole.Installer.exe(45.78 MB)
V1.6(Oct 12, 2021)
V1.6 Preview

V1.6 is finally here. With this update, the source code is also more readable, which makes it easier to maintain and improve, so you can expect more updates in the future. Also, thanks to Mdika for the help with the logo.

Requirements:

Tesseract

Internet connection

Changelog: [V1.6 Added optional separate window for the translation]

There are now 2 additional windows, 1 for the query and 1 for the result, that you can generate from the generate menu

You can now check for updates from the check for update menu and from the about window

The app now has a logo, thanks to Mdika

The app now has 2 versions, 1 with a console window and 1 without. There is no major difference between the 2; the only difference is that the one with the console window is more useful for debugging

The modules/libraries in the build folder are now hidden, so they are not visible unless show hidden items is on.

File size has been reduced. The app now only includes the necessary modules/libraries in the build

The UI source code has been restructured to make it more readable and easier to maintain

Edit Also :

There are now also shortcut keys for opening about, settings, history, and captured

The slider now syncs between the two windows

Source code(tar.gz)
Source code(zip)
Changelog.txt(1.96 KB)
Readme.txt(1.41 KB)
ScreenTranslate.1.6.No.Console.rar(20.78 MB)
ScreenTranslate.1.6.With.Console.rar(20.78 MB)
V1.5(Sep 5, 2021)
Once again, thanks to user @laggykiller, who added options to use hotkeys for capturing the text, making the app easier to use.

Requirements:

Tesseract

Internet connection

Changelog: [V1.5 Added hotkeys for capture and translate]

Added keybinding for the "Capture and translate" button

Added time delay for capturing using the keybinding

Minor improvement/fix to the settings loading by reducing the read/load of Setting.json
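
A hotkey with a capture delay can be sketched with the keyboard module like this. with_delay and register_capture_hotkey are hypothetical helpers; only keyboard.add_hotkey is the library's own function, and the hotkey string in the usage note is just an example.

```python
import time


def with_delay(action, delay_ms):
    """Wrap `action` so it runs after waiting `delay_ms` milliseconds."""
    def wrapper():
        time.sleep(delay_ms / 1000)
        return action()
    return wrapper


def register_capture_hotkey(hotkey, action, delay_ms=0):
    """Bind `hotkey` so that pressing it runs `action` after the delay."""
    import keyboard  # third-party module, imported only when binding

    return keyboard.add_hotkey(hotkey, with_delay(action, delay_ms))
```

For example, register_capture_hotkey("enter", capture_and_translate, delay_ms=500) would wait half a second after Enter is pressed, giving a visual novel time to show the next line before the capture. Here capture_and_translate stands for whatever function performs the capture.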

Source code(tar.gz)
Source code(zip)
Changelog.txt(1.15 KB)
Readme.txt(1.31 KB)
STL_V1.5-Add.Hotkeys.rar(61.47 MB)
V1.4(Sep 5, 2021)
Thanks to user @laggykiller for the help and contribution. There is now an option for just capturing by setting the engine to none.

[V1.4 Add Option For Not Translating]

Added option for not translating by using the 'none' engine

Realign refresh button padding on history ui

Remove the check on 0, as it can cause an error on the first language for engines that do not have auto-detect

Release tag will now follow the version number

Source code(tar.gz)
Source code(zip)
Changelog.txt(930 bytes)
Readme.txt(1.27 KB)
STL_V.1.4-Add.Option.For.Not.Translating.rar(61.47 MB)
v1-update_3(Sep 2, 2021)
[V1.3.1 Bug fix]

Fix bug where program won't run if resource/json directory doesn't exist, program will now create it automatically if such directory doesn't exist.

Also fix bug where program won't save img if img_cache directory doesn't exist, program will now create it automatically if such directory doesn't exist.
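
Creating the folders on startup is a one-liner per directory with os.makedirs. The sketch below assumes the two directories named above live under the program folder; ensure_directories is a hypothetical helper.

```python
import os


def ensure_directories(base_dir,
                       names=("img_cache", os.path.join("resource", "json"))):
    """Create each directory under base_dir if it does not exist yet."""
    created = []
    for name in names:
        path = os.path.join(base_dir, name)
        # exist_ok=True makes this safe to call on every startup.
        os.makedirs(path, exist_ok=True)
        created.append(path)
    return created
```

Calling it again when the folders already exist does nothing, so no separate existence check is needed.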

Source code(tar.gz)
Source code(zip)
Changelog.txt(618 bytes)
Readme.txt(1.07 KB)
STL_V1.3.1-MinorBugFix.rar(58.18 MB)
v1-update_2(Sep 2, 2021)
[V1.3 Improvement + New Feature]

Added auto copy to clipboard on setting

The 'to' language is now carried over when changing engines, just like the 'from' language (I forgot this in the last release)

Added readme and changelog to folder download

Source code(tar.gz)
Source code(zip)
changelog.txt(329 bytes)
Readme.txt(1.07 KB)
STL_V1.3-Improvement+New.Feature.rar(58.20 MB)
v1-update_1(Sep 1, 2021)

I realized that it is kind of annoying that when you switch engines, the selected 'from' language resets to the top one. Therefore I made some improvements so that when you change the engine, the 'from' language stays the same if it exists in the new engine. The 'to' language still defaults to English; no change was made there.

TL;DR - The 'from' language is now carried over when changing engines
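
The carry-over rule can be written as a small function. This is an illustrative sketch: carry_over_language is a hypothetical name, and falling back to the first entry in the list is an assumption based on the "top one" behavior described above.

```python
def carry_over_language(current, supported, default=None):
    """Pick the language to select after switching to another engine.

    current   -- language selected before the switch
    supported -- languages offered by the new engine, in display order
    default   -- optional preferred fallback (for example "English")
    """
    if current in supported:
        return current
    if default is not None and default in supported:
        return default
    # Fall back to the top entry of the list.
    return supported[0]
```

For instance, switching engines with "Japanese" selected keeps "Japanese" whenever the new engine also lists it.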
Source code(tar.gz)
Source code(zip)
V1.2-Improvement.rar(56.70 MB)
release(Sep 1, 2021)

After testing it out many times, it finally works as intended. Please report any bugs you find.

Don't forget to install Tesseract; without it the OCR won't work. If you are still confused, you can check the visual tutorial in the user_manual folder.
Source code(tar.gz)
Source code(zip)
V1-Fixed.rar(56.72 MB)

Owner

Fauzan F A

An Informatics Engineering Student at UIN Syarif Hidayatullah Jakarta

GitHub Repository

