Computer Vision : OCR using Python - GenAI with LLM & RAG

Become a Computer Vision Expert & Learn OCR with Tesseract, OpenCV, Deep Learning, GenAI, LLMs, & RAG
4.31 (268 reviews)
Udemy
platform
English
language
Data Science
category
Computer Vision : OCR using Python - GenAI with LLM & RAG
1,247
students
8.5 hours
content
Mar 2025
last update
$54.99
regular price

What you will learn

A quick starter on OCR Architecture, Commercial Solutions and Use Cases in Industry

Learn to implement OCR - Text Detection with OpenCV and Deep Learning Models

Use Tesseract and EasyOCR to implement OCR - Text Recognition

Work with OCR - Text Labelling using Spacy and Regular Expression

Discover the concepts of RAG, its architecture and extract deeper insights from text

Integrating OCR outputs into RAG pipelines for advanced document understanding and information extraction

Build OCR Solutions for Invoice Processing with Text Labelling and XML output & Vehicle Nameplate Recognition

Executable Code of CTPN and EAST Model implementation for Text Detection and Text Recognition

Learn to train Deep Learning Models of CTPN and EAST on ICDAR dataset

Understand the Image Basics and apply it for Image Processing

Use OpenCV and Tesseract to apply Noise Removal Techniques including Thresholding, Rescaling, Dilation, Erosion and Deskewing

Learn to develop web-based applications - Business Card Recognition and KYC Digitization for OCR using Flask

Screenshots

Computer Vision : OCR using Python - GenAI with LLM & RAG - Screenshot_01Computer Vision : OCR using Python - GenAI with LLM & RAG - Screenshot_02Computer Vision : OCR using Python - GenAI with LLM & RAG - Screenshot_03Computer Vision : OCR using Python - GenAI with LLM & RAG - Screenshot_04
Related Topics
3885252
udemy ID
3/2/2021
course created date
4/6/2021
course indexed date
Bot
course submited by