Optical Character Recognition (OCR) with Python and Tesseract

Tesseract is one of the most widely used character recognition tools. It was originally developed at Hewlett-Packard and was later sponsored by Google. Tesseract lets you extract text from digital images, either from the command line or through API wrappers. It is more than a plain text extractor: it also handles more advanced character recognition tasks, such as getting bounding boxes for recognized characters, converting images to different output formats, applying your own custom configurations, producing orientation and script detection reports, and returning tables of verbose analysis data. Tesseract supports Unicode (UTF-8) and works with more than 100 languages, which is very helpful whenever you need to handle a language other than English.
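As a rough illustration of the API route, here is a minimal sketch using the pytesseract wrapper. The helper names build_config and extract_text and the file name sample.png are hypothetical, and the sketch assumes Tesseract, pytesseract, and Pillow are already installed.

```python
def build_config(psm=3, oem=3):
    """Build a Tesseract option string.

    psm is the page segmentation mode and oem is the OCR engine mode;
    3 is Tesseract's default for both.
    """
    return f"--oem {oem} --psm {psm}"


def extract_text(image_path, lang="eng", psm=3, oem=3):
    """Return the text Tesseract recognizes in the image at image_path."""
    # Imported inside the function so this file still loads on a machine
    # where the OCR stack is not installed yet.
    from PIL import Image
    import pytesseract

    with Image.open(image_path) as image:
        return pytesseract.image_to_string(
            image, lang=lang, config=build_config(psm, oem)
        )


# Hypothetical usage; "sample.png" is a placeholder for your own image:
# print(extract_text("sample.png", lang="eng"))
```

Several languages can be combined with a plus sign in the lang argument, for example lang="eng+fra", provided the matching language data files are installed.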

After watching this tutorial and working through all of its implementations, you will be well prepared to work as an expert in optical character recognition!

Highly recommended for enthusiastic Pythonistas all over the world :)

Chapters
=========
1) Introduction to Tesseract and installation: 0:01:24
2) Introduction to Pytesseract and installation: 0:06:48
3) Configure tesseract path: 0:12:34
4) Check available languages: 0:14:17
5) Extract text from an image
5.1) Simple text extraction: 0:15:51
5.2) Specified language text extraction: 0:18:37
5.3) Multiple image text extraction: 0:32:05
5.4) Timeout text extraction: 0:35:53
6) Get and draw bounding boxes around characters: 0:40:19
7) Get report of verbose data: 0:48:22
8) Orientation and script detection: 0:51:49
9) Working with output formats
9.1) PDF: 0:57:02
9.2) HOCR: 0:59:21
9.3) XML: 1:00:40
10) Assigning Custom Configurations: 1:02:26
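For the bounding box chapter, one detail is worth sketching. pytesseract's image_to_boxes returns one line per character in the form "char left bottom right top page", with coordinates measured from the bottom-left corner of the image, while most drawing libraries measure from the top-left. The helper below is a minimal sketch of that conversion; the name parse_boxes and the sample string are made up for illustration and are not taken from the video.

```python
def parse_boxes(box_text, image_height):
    """Convert Tesseract box lines into top-left-origin rectangles.

    Returns a list of (char, x1, y1, x2, y2) tuples, where (x1, y1) is the
    top-left corner and (x2, y2) is the bottom-right corner.
    """
    boxes = []
    for line in box_text.splitlines():
        parts = line.split()
        if len(parts) < 5:
            continue  # skip blank or malformed lines
        char = parts[0]
        left, bottom, right, top = (int(value) for value in parts[1:5])
        # Flip the y axis: Tesseract counts up from the bottom edge.
        boxes.append((char, left, image_height - top, right, image_height - bottom))
    return boxes


# Made-up sample in the box format, for an image that is 100 pixels tall.
sample = "H 10 20 30 60 0\ni 35 20 45 55 0"
boxes = parse_boxes(sample, image_height=100)
print(boxes)
```

Each resulting tuple can be handed to a rectangle-drawing call that takes two corner points.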

Download the project
====================
Google Drive: https://drive.google.com/drive/folders/1HZqIh3I3nSAzflMWAUdFSVT2h4tWuxE6?usp=sharing

References
==========
Tesseract: https://github.com/tesseract-ocr/tesseract
Pytesseract: https://github.com/madmaze/pytesseract
Multiple config options: https://www.py4u.net/discuss/10850
Getting bounding box coordinates: https://stackoverflow.com/questions/20831612/getting-the-bounding-box-of-the-recognized-words-using-python-tesseract

Subscribe: https://www.youtube.com/@TheSineth/featured

#python