Free Course Image Convolutional Neural Networks for Computer Vision: Deep Learning Lecture Series

Free online courseConvolutional Neural Networks for Computer Vision: Deep Learning Lecture Series

Duration of the online course: 19 hours and 30 minutes

New

Free Stanford lecture series on CNNs for computer vision: classification, training, architectures, detection, segmentation, generative models, and robustness.

In this free course, learn about

  • Foundations of Visual Recognition
  • Neural Networks and Convolutional Layers
  • Practical Training and Tooling
  • Modern CNN Architectures and Sequence Models
  • Recognition Tasks and Model Interpretability
  • Generative and Reinforcement Learning Methods
  • Efficiency, Hardware, and Robustness

Course Description

Explore the foundations and modern practice of convolutional neural networks for computer vision with this free online deep learning lecture series from Stanford. Designed for learners who want a rigorous path into visual recognition, it connects core ideas to the methods used in real-world AI systems for understanding images and videos.

You will build intuition for image classification and the principles that make deep models trainable, including loss functions, optimization, and the essentials of neural networks. The course then dives into convolutional neural networks in depth, covering practical training techniques, common pitfalls, and strategies for improving performance and generalization.

Beyond the CNN basics, the series broadens into the software ecosystem for deep learning, influential CNN architectures, and extensions such as recurrent neural networks. It also introduces key computer vision applications like detection and segmentation, alongside techniques for visualizing and interpreting what models learn.

Later lessons expand to generative modeling, deep reinforcement learning, efficiency considerations, and hardware-aware methods. You will also encounter adversarial examples and adversarial training, helping you understand robustness and security issues that affect deployed AI. Ideal for students and practitioners in technology and programming, this course offers a comprehensive roadmap through contemporary artificial intelligence and machine learning for vision.

Course content

  • Video class: Lecture 1 | Introduction to Convolutional Neural Networks for Visual Recognition 57m
  • Exercise: What key event is highlighted as the 2012 breakthrough that accelerated the adoption of convolutional neural networks for image classification?
  • Video class: Lecture 2 | Image Classification 59m
  • Exercise: Why is choosing hyperparameters (like K in k-NN) based on test-set accuracy considered a bad practice?
  • Video class: Lecture 3 | Loss Functions and Optimization 1h14m
  • Exercise: In the softmax (multinomial logistic regression) loss for image classification, what quantity is minimized for each training example?
  • Video class: Lecture 4 | Introduction to Neural Networks 1h13m
  • Exercise: Why are non-linear activation functions (e.g., ReLU) necessary between linear layers in a neural network?
  • Video class: Lecture 5 | Convolutional Neural Networks 1h08m
  • Exercise: In a convolutional layer, what determines the depth (number of channels) of the output volume?
  • Video class: Lecture 6 | Training Neural Networks I 1h20m
  • Exercise: Why can the sigmoid activation function make training deep networks difficult?
  • Video class: Lecture 7 | Training Neural Networks II 1h15m
  • Exercise: What is the main reason Adam uses bias-correction terms for its moment estimates early in training?
  • Video class: Lecture 8 | Deep Learning Software 1h18m
  • Exercise: Why are GPUs typically much faster than CPUs for convolution and matrix multiplication in deep learning?
  • Video class: Lecture 9 | CNN Architectures 1h17m
  • Exercise: In an Inception module, what is the main purpose of inserting 1×1 convolutions (bottleneck layers) before expensive 3×3 and 5×5 convolutions?
  • Video class: Lecture 10 | Recurrent Neural Networks 1h13m
  • Exercise: In CNN architectures like GoogLeNet and ResNet, what change helped significantly reduce the number of parameters compared to AlexNet/VGG?
  • Video class: Lecture 11 | Detection and Segmentation 1h14m
  • Exercise: In semantic segmentation, what is the model expected to output for an input image?
  • Video class: Lecture 12 | Visualizing and Understanding 1h15m
  • Exercise: Why is visualizing the learned weights of the first convolutional layer often interpretable as showing what the filters detect?
  • Video class: Lecture 13 | Generative Models 1h17m
  • Exercise: In PixelCNN, how is the image likelihood p(X) modeled for training?
  • Video class: Lecture 14 | Deep Reinforcement Learning 1h04m
  • Exercise: In deep Q-learning for Atari games, what is the main purpose of experience replay?
  • Video class: Lecture 15 | Efficient Methods and Hardware for Deep Learning 1h16m
  • Exercise: Why does the lecture emphasize reducing memory access when designing efficient CNN inference hardware?
  • Video class: Lecture 16 | Adversarial Examples and Adversarial Training 1h21m
  • Exercise: What is the key idea behind the Fast Gradient Sign Method (FGSM) for generating adversarial examples in CNN-based classifiers?

This free course includes:

19 hours and 30 minutes of online video course

Digital certificate of course completion (Free)

Exercises to train your knowledge

100% free, from content to certificate

Ready to get started?Download the app and get started today.

Install the app now

to access the course
Icon representing technology and business courses

Over 5,000 free courses

Programming, English, Digital Marketing and much more! Learn whatever you want, for free.

Calendar icon with target representing study planning

Study plan with AI

Our app's Artificial Intelligence can create a study schedule for the course you choose.

Professional icon representing career and business

From zero to professional success

Improve your resume with our free Certificate and then use our Artificial Intelligence to find your dream job.

You can also use the QR Code or the links below.

QR Code - Download Cursa - Online Courses

More free courses at Artificial Intelligence and Machine Learning

Free Ebook + Audiobooks! Learn by listening or reading!

Download the App now to have access to + 5000 free courses, exercises, certificates and lots of content without paying anything!

  • 100% free online courses from start to finish

    Thousands of online courses in video, ebooks and audiobooks.

  • More than 60 thousand free exercises

    To test your knowledge during online courses

  • Valid free Digital Certificate with QR Code

    Generated directly from your cell phone's photo gallery and sent to your email

Cursa app on the ebook screen, the video course screen and the course exercises screen, plus the course completion certificate