Computer Vision: OCR using Python - GenAI with LLM & RAG
Become a Computer Vision Expert & Learn OCR with Tesseract, OpenCV, Deep Learning, GenAI, LLMs, & RAG
Rating: 4.31 (268 reviews)
Students: 1,247
Content: 8.5 hours
Last update: Mar 2025
Regular price: $54.99
What you will learn
A quick starter on OCR Architecture, Commercial Solutions and Use Cases in Industry
Learn to implement OCR - Text Detection with OpenCV and Deep Learning Models
Use Tesseract and EasyOCR to implement OCR - Text Recognition
Work with OCR - Text Labelling using spaCy and Regular Expressions
Discover the concepts of RAG and its architecture, and extract deeper insights from text
Integrate OCR outputs into RAG pipelines for advanced document understanding and information extraction
Build OCR Solutions for Invoice Processing (with Text Labelling and XML output) and Vehicle Nameplate Recognition
Get Executable Code of CTPN and EAST Model implementations for Text Detection and Text Recognition
Learn to train the CTPN and EAST Deep Learning Models on the ICDAR dataset
Understand Image Basics and apply them to Image Processing
Use OpenCV and Tesseract to apply Noise Removal Techniques including Thresholding, Rescaling, Dilation, Erosion and Deskewing
Learn to develop web-based OCR applications using Flask - Business Card Recognition and KYC Digitization
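As a rough illustration of the text labelling and XML output steps listed above, the sketch below labels a few invoice fields in OCR text with regular expressions and serializes them to XML. It is not taken from the course: the sample text, the field names (invoice_number, date, total), the patterns, and the helper names label_fields and to_xml are all hypothetical, and real invoices would need patterns tuned to their layout.

```python
import re
import xml.etree.ElementTree as ET

# Hypothetical OCR output; in practice this string would come from an OCR engine.
SAMPLE_OCR_TEXT = """ACME Supplies Ltd
Invoice No: INV-2041
Date: 12/03/2021
Total: $1,284.50
"""

# Illustrative patterns only (assumed field names and layouts).
FIELD_PATTERNS = {
    "invoice_number": re.compile(r"Invoice\s*No\.?\s*[:#]?\s*([A-Z0-9-]+)", re.IGNORECASE),
    "date": re.compile(r"Date\s*:?\s*(\d{1,2}/\d{1,2}/\d{4})", re.IGNORECASE),
    "total": re.compile(r"Total\s*:?\s*\$?\s*([\d,]+\.\d{2})", re.IGNORECASE),
}


def label_fields(text):
    """Return a dict mapping each field name to the first match found in text."""
    labels = {}
    for name, pattern in FIELD_PATTERNS.items():
        match = pattern.search(text)
        if match:
            labels[name] = match.group(1)
    return labels


def to_xml(labels):
    """Serialize the labelled fields as children of a single <invoice> element."""
    root = ET.Element("invoice")
    for name, value in labels.items():
        ET.SubElement(root, name).text = value
    return ET.tostring(root, encoding="unicode")


labels = label_fields(SAMPLE_OCR_TEXT)
xml_output = to_xml(labels)
print(xml_output)
```

Fields that no pattern matches are simply left out of the XML, which keeps the sketch short; a production pipeline would more likely flag them for review.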
Udemy ID: 3885252
Course created date: 3/2/2021
Course indexed date: 4/6/2021
Course submitted by: Bot