Modern Computer Vision GPT, PyTorch, Keras, OpenCV4 in 2024!

Next-Gen Computer Vision: YOLOv8, DINO-GPT4V, OpenCV4, Face Recognition, GenerativeAI, Diffusion Models & Transformers

4.46 (1376 reviews)
Udemy
platform
English
language
Data Science
category
instructor
Modern Computer Vision GPT, PyTorch, Keras, OpenCV4 in 2024!
11,827
students
28.5 hours
content
Nov 2023
last update
$84.99
regular price

What you will learn

All major Computer Vision theory and concepts (updated in late 2023!)

Learn to use PyTorch, TensorFlow 2.0 and Keras for Computer Vision Deep Learning tasks

YOLOv8: Cutting-edge Object Recognition

DINO-GPT4V: Next-Gen Vision Models

Learn all major Object Detection Frameworks from YOLOv8, R-CNNs, Detectron2, SSDs, EfficientDetect and more!

Deep Segmentation with Segment Anything, U-Net, SegNet and DeepLabV3

Understand what CNNs 'see' by Visualizing Different Activations and applying GradCAM

Generative Adverserial Networks (GANs) & Autoencoders - Generate Digits, Anime Characters, Transform Styles and implement Super Resolution

Training, fine tuning and analyzing your very own Classifiers

Facial Recognition along with Gender, Age, Emotion and Ethnicity Detection

Neural Style Transfer and Google Deep Dream

Transfer Learning, Fine Tuning and Advanced CNN Techniques

Important Modern CNNs designs like ResNets, InceptionV3, DenseNet, MobileNet, EffiicentNet and much more!

Tracking with DeepSORT

Siamese Networks, Facial Recognition and Analysis (Age, Gender, Emotion and Ethnicity)

Image Captioning, Depth Estimination and Vision Transformers

Point Cloud (3D data) Classification and Segmentation

Making a Computer Vision API and Web App using Flask

OpenCV4 in detail, covering all major concepts with lots of example code

All Course Code works in accompanying Google Colab Python Notebooks

Meta CLIP for Enhanced Image Analysis

Why take this course?

Welcome to Modern Computer Vision Tensorflow, Keras & PyTorch!

AI and Deep Learning are transforming industries and one of the most intriguing parts of this AI revolution is in Computer Vision!


Update for 2024: Modern Computer Vision Course

  • We're excited to bring you the latest updates for our 2024 modern computer vision course. Dive into an enriched curriculum covering the most advanced and relevant topics in the field:

  • YOLOv8: Cutting-edge Object Recognition

  • DINO-GPT4V: Next-Gen Vision Models

  • Meta CLIP for Enhanced Image Analysis

  • Detectron2 for Object Detection

  • Segment Anything

  • Face Recognition Technologies

  • Generative AI Networks for Creative Imaging

  • Transformers in Computer Vision

  • Deploying & Productionizing Vision Models

  • Diffusion Models for Image Processing

  • Image Generation and Its Applications

  • Annotation Strategy for Efficient Learning

  • Retrieval Augmented Generation (RAG)

  • Zero-Shot Classifiers for Versatile Applications

  • Using Roboflow: Streamlining Vision Workflows


What is Computer Vision?

But what exactly is Computer Vision and why is it so exciting? Well, what if Computers could understand what they’re seeing through cameras or in images? The applications for such technology are endless from medical imaging, military, self-driving cars, security monitoring, analysis, safety, farming, industry, and manufacturing! The list is endless.

Job demand for Computer Vision workers are skyrocketing and it’s common that experts in the field are making USD $200,000 and more salaries. However, getting started in this field isn’t easy. There’s an overload of information, many of which is outdated, and a plethora of tutorials that neglect to teach the foundations. Beginners thus have no idea where to start.



This course aims to solve all of that!


  • Taught using Google Colab Notebooks (no messy installs, all code works straight away)

  • 27+ Hours of up-to-date and relevant Computer Vision theory with example code

  • Taught using both PyTorch and Tensorflow Keras!

In this course, you will learn the essential very foundations of Computer Vision, Classical Computer Vision (using OpenCV) I then move on to Deep Learning where we build our foundational knowledge of CNNs and learn all about the following topics:

Computer vision applications involving Deep Learning are booming!


Having Machines that can see will change our world and revolutionize almost every industry out there. Machines or robots that can see will be able to:


  • Perform surgery and accurately analyze and diagnose you from medical scans.

  • Enable self-driving cars

  • Radically change robots allowing us to build robots that can cook, clean, and assist us with almost any task

  • Understand what's being seen in CCTV surveillance videos thus performing security, traffic management, and a host of other services

  • Create Art with amazing Neural Style Transfers and other innovative types of image generation

  • Simulate many tasks such as Aging faces, modifying live video feeds, and realistically replacing actors in films


Detailed OpenCV Guide covering:

  • Image Operations and Manipulations

  • Contours and Segmentation

  • Simple Object Detection and Tracking

  • Facial Landmarks, Recognition and Face Swaps

  • OpenCV implementations of Neural Style Transfer, YOLOv3, SSDs and a black and white image colorizer

  • Working with Video and Video Streams

Our Comprehensive Deep Learning Syllabus includes:

  • Classification with CNNs

  • Detailed overview of CNN Analysis, Visualizing performance, Advanced CNNs techniques

  • Transfer Learning and Fine Tuning

  • Generative Adversarial Networks - CycleGAN, ArcaneGAN, SuperResolution, StyleGAN

  • Autoencoders

  • Neural Style Transfer and Google DeepDream

  • Modern CNN Architectures including Vision Transformers (ResNets, DenseNets, MobileNET, VGG19, InceptionV3, EfficientNET and ViTs)

  • Siamese Networks for image similarity

  • Facial Recognition (Age, Gender, Emotion, Ethnicity)

  • PyTorch Lightning

  • Object Detection with YOLOv5 and v4, EfficientDetect, SSDs, Faster R-CNNs,

  • Deep Segmentation - MaskCNN, U-NET, SegNET, and DeepLabV3

  • Tracking with DeepSORT

  • Deep Fake Generation

  • Video Classification

  • Optical Character Recognition (OCR)

  • Image Captioning

  • 3D Computer Vision using Point Cloud Data

  • Medical Imaging - X-Ray analysis and CT-Scans

  • Depth Estimation

  • Making a Computer Vision API with Flask

  • And so much more

This is a comprehensive course, is broken up into two (2) main sections. This first is a detailed OpenCV (Classical Computer Vision tutorial) and the second is a detailed Deep Learning

This course is filled with fun and cool projects including these Classical Computer Vision Projects:

  1. Sorting contours by size, location, using them for shape matching

  2. Finding Waldo

  3. Perspective Transforms (CamScanner)

  4. Image Similarity

  5. K-Means clustering for image colors

  6. Motion tracking with MeanShift and CAMShift

  7. Optical Flow

  8. Facial Landmark Detection with Dlib

  9. Face Swaps

  10. QR Code and Barcode Reaching

  11. Background removal

  12. Text Detection

  13. OCR with PyTesseract and EasyOCR

  14. Colourize Black and White Photos

  15. Computational Photography with inpainting and Noise Removal

  16. Create a Sketch of yourself using Edge Detection

  17. RTSP and IP Streams

  18. Capturing Screenshots as video

  19. Import Youtube videos directly


Deep Learning Computer Vision Projects:

  1. PyTorch & Keras CNN Tutorial MNIST

  2. PyTorch & Keras Misclassifications and Model Performance Analysis

  3. PyTorch & Keras Fashion-MNIST with and without Regularisation

  4. CNN Visualisation - Filter and Filter Activation Visualisation

  5. CNN Visualisation Filter and Class Maximisation

  6. CNN Visualisation GradCAM GradCAMplusplus and FasterScoreCAM

  7. Replicating LeNet and AlexNet in Tensorflow2.0 using Keras

  8. PyTorch & Keras Pretrained Models - 1 - VGG16, ResNet, Inceptionv3, MobileNetv2, SqueezeNet, WideResNet, DenseNet201, MobileMNASNet, EfficientNet and MNASNet

  9. Rank-1 and Rank-5 Accuracy

  10. PyTorch and Keras Cats vs Dogs PyTorch - Train with your own data

  11. PyTorch Lightning Tutorial - Batch and LR Selection, Tensorboards, Callbacks, mGPU, TPU and more

  12. PyTorch Lightning - Transfer Learning

  13. PyTorch and Keras Transfer Learning and Fine Tuning

  14. PyTorch & Keras Using CNN's as a Feature Extractor

  15. PyTorch & Keras - Google Deep Dream

  16. PyTorch Keras - Neural Style Transfer + TF-HUB Models

  17. PyTorch & Keras Autoencoders using the Fashion-MNIST Dataset

  18. PyTorch & Keras - Generative Adversarial Networks - DCGAN - MNIST

  19. Keras - Super Resolution SRGAN

  20. Project - Generate_Anime_with_StyleGAN

  21. CycleGAN - Turn Horses into Zebras

  22. ArcaneGAN inference

  23. PyTorch & Keras Siamese Networks

  24. Facial Recognition with VGGFace in Keras

  25. PyTorch Facial Similarity with FaceNet

  26. DeepFace - Age, Gender, Expression, Headpose and Recognition

  27. Object Detection - Gun, Pistol Detector - Scaled-YOLOv4

  28. Object Detection - Mask Detection - TensorFlow Object Detection - MobileNetV2 SSD

  29. Object Detection  - Sign Language Detection - TFODAPI - EfficientDetD0-D7

  30. Object Detection - Pot Hole Detection with TinyYOLOv4

  31. Object Detection - Mushroom Type Object Detection - Detectron 2

  32. Object Detection - Website Screenshot Region Detection - YOLOv4-Darknet

  33. Object Detection - Drone Maritime Detector - Tensorflow Object Detection Faster R-CNN

  34. Object Detection - Chess Pieces Detection - YOLOv3 PyTorch

  35. Object Detection - Hardhat Detection for Construction sites - EfficientDet-v2

  36. Object DetectionBlood Cell Object Detection - YOLOv5

  37. Object DetectionPlant Doctor Object Detection - YOLOv5

  38. Image Segmentation - Keras, U-Net and SegNet

  39. DeepLabV3 - PyTorch_Vision_Deeplabv3

  40. Mask R-CNN Demo

  41. Detectron2 - Mask R-CNN

  42. Train a Mask R-CNN - Shapes

  43. Yolov5 DeepSort Pytorch tutorial

  44. DeepFakes - first-order-model-demo

  45. Vision Transformer Tutorial PyTorch

  46. Vision Transformer Classifier in Keras

  47. Image Classification using BigTransfer (BiT)

  48. Depth Estimation with Keras

  49. Image Similarity Search using Metric Learning with Keras

  50. Image Captioning with Keras

  51. Video Classification with a CNN-RNN Architecture with Keras

  52. Video Classification with Transformers with Keras

  53. Point Cloud Classification - PointNet

  54. Point Cloud Segmentation with PointNet

  55. 3D Image Classification CT-Scan

  56. X-ray Pneumonia Classification using TPUs

  57. Low Light Image Enhancement using MIRNet

  58. Captcha OCR Cracker

  59. Flask Rest API - Server and Flask Web App

  60. Detectron2 - BodyPose

Screenshots

Modern Computer Vision GPT, PyTorch, Keras, OpenCV4 in 2024! - Screenshot_01Modern Computer Vision GPT, PyTorch, Keras, OpenCV4 in 2024! - Screenshot_02Modern Computer Vision GPT, PyTorch, Keras, OpenCV4 in 2024! - Screenshot_03Modern Computer Vision GPT, PyTorch, Keras, OpenCV4 in 2024! - Screenshot_04

Reviews

Mayank
September 4, 2023
I have done multiple courses on Udemy. In all the courses the instructor makes the learner code with them but this practice is something different and has not helped me with my learning.
Syed
August 28, 2023
Sometimes instructure is not aware about his topic. Totally waste of money. The instructure just read code. He should do coding while explaining a topic
Pierre
July 26, 2023
This course is a copy-paste of FREE tutorials you can find on internet. If you are looking for a collection of codes, then it makes sense. But you cannot call this a course on deep learning as there are no explanations of what is the meaning of the parameters, what are the principles of the algorithms used, why use this method, what are the drawbacks. Overall, it appears CLEARLY that the "teacher" doesn't know what he is talking about (at least for deep learning, I did not go further than that as I am really not happy). A course should not be a collection of free tutorials, period. Honestly, this is a shame...
Umair
July 25, 2023
The code is not properly line by line explained most of the indepth understand is left for the student to search even though in the start of this course it was said that it doesnt require much undertand ing of python language. However this course is still very helpful im glad the instructor uploaded it. Its interesting and very unique skill. Thank you!
Nader
July 14, 2023
by far, this is the most organized and most well explained and most informative course I have taken on uDemy (Have over 50 courses on uDemy)
Fuchi
May 29, 2023
Many techniques related to computer vision are introduced, but not much about algorithms or theories behind these techniques. So usually I have to look up for more info on Google. But in generally, it is a good starting point for learning CV broadly.
Visothkun
May 17, 2023
Well the course itself is OK, But at least give some deep explain. Cause as for me I'm absolutely beginner. and i don't understand all the code and i need an explaination but it seems that you weren't explaining it deeply, you just rush through the code, To be honest i never learn OPENCV or NUMPY before, so i want to understand those module but you weren't explaining it at all. So i have to go to youtube to learn Numpy and OPENCV in order to understand this course ill give you a three star for now, But hope you would do better in your next course.
Megan
May 11, 2023
Coming from a data science job where I have learned and used computer vision and deep-learning by consultancy, this course provides detail that can fill holes in knowledge from those who are self taught data scientists or machine learning engineers
Enos
May 10, 2023
Too much topic, Too less value. But it's okay. This could have been done much better or may be name the course as overview of computer vision
Mdem
May 10, 2023
And finally - I score it 2 star because: 1 star for explanation of convolutional neural networks and convolutional operations - this part was really looks like a training, and author really works over it, 1/2 star for braod examples (however more then half are not working due to obsolete version of framework or some mistakes in code), 1/2 star for collection of references to articles and web sites concerning the topics. It is really not a training in traditional meaning. So If you expect it - it is not your choice. If you searching for examples and use cases - it is OK. And one more 'spoon of tar' - do not really expect author answer your questions - there are lots of them that have not been answered for 6 - 7 and more months.
MOULIK
April 20, 2023
If I could, I'd give it no stars at all. The instructor just blindly threw all the topics of modern computer vision and just glimpsed over them all for like 2-3 videos. This is NOT how you teach someone about something new. Absolutely trash course. Please save your precious bucks and find some other course.
Felipe
April 12, 2023
Good course, here you will learn a lot about image processing with OpenCV and about what CNN can do nowadays. If you know nothing about Computer Vision ou CNN, this is a vey good starting point, the main concepts are well explained and there a lot of examples. What is not so good is the implementation, the instructor will explain you how its done once or twice then he will breeze through a lot of code already developed. If you know enought Python proabably this will not be a problem.
Rohan
April 11, 2023
This is the best course available on Udemy for learning Computer Vision from scratch. It provides a comprehensive coverage of all the Computer Vision concepts (classical as well as modern) along with hands-on experience in Keras and Pytorch. The instructor provides clear and concise explanations of complex concepts, making it easy to understand even the most challenging topics. Highly recommended course for building CV skills and developing understanding on the same.
Cao
April 4, 2023
I learned a lot from this course, from classical computer vision algorithms to machine learning. It's very useful. However, some of the code has become invalid.
Thomas
March 27, 2023
The instructor rambles on with no explanation of hwy he is doing what he is doing. He also seems to be a bit uncertain of his own code.

Charts

Price

Modern Computer Vision GPT, PyTorch, Keras, OpenCV4 in 2024! - Price chart

Rating

Modern Computer Vision GPT, PyTorch, Keras, OpenCV4 in 2024! - Ratings chart

Enrollment distribution

Modern Computer Vision GPT, PyTorch, Keras, OpenCV4 in 2024! - Distribution chart
3214523
udemy ID
6/8/2020
course created date
2/27/2022
course indexed date
Ignacio Castro
course submited by