Free Course Image Deep Learning for Audio in Python: Neural Networks, CNNs and LSTMs

Free online courseDeep Learning for Audio in Python: Neural Networks, CNNs and LSTMs

Duration of the online course: 9 hours and 9 minutes

New

Free course on deep learning for audio in Python. Build neural nets from scratch, use TensorFlow, and create CNN and LSTM models for genre classification.

In this free course, learn about

  • Course Overview and Deep Learning Foundations
  • Neural Networks from Scratch: Neurons, Linear Algebra, and Forward Pass
  • Training Neural Networks: Backpropagation and Gradient Descent
  • TensorFlow / Keras Workflow for Supervised Learning
  • Audio Understanding and Preprocessing for Deep Learning
  • Music Genre Classification with Feedforward Networks
  • Convolutional Neural Networks for Audio Classification
  • Recurrent Neural Networks and LSTMs for Audio/Time-Series

Course Description

Learn how to build deep learning systems that understand audio using Python, with a practical path from core concepts to working models. This free online course guides you through the foundations of AI, machine learning, and neural networks, then moves into hands-on implementation so you can confidently translate theory into code.

You will start by creating key building blocks from scratch, including an artificial neuron and a basic neural network, while practicing essential vector and matrix operations used in modern deep learning. You will also explore how computation flows through neural networks and how training works via backpropagation and gradient descent, reinforcing the mechanics behind learning.

After mastering fundamentals, you will transition to building models with TensorFlow 2 and focus on audio as a data type. You will learn how audio is represented, how to preprocess it effectively for deep learning, and how to prepare a dataset for music genre classification.

From there, you will develop and improve audio classification models, including strategies to reduce overfitting. You will also dive into architectures that are especially effective for sound, such as Convolutional Neural Networks for learning patterns in spectrogram-like inputs and Recurrent Neural Networks with LSTMs for sequence-based modeling. By the end, you will be equipped to design, train, and evaluate neural network approaches for real-world audio tasks in Python.

Course content

  • Video class: 1- Deep Learning (for Audio) with Python: Course Overview 08m
  • Exercise: Which high-level interface on top of TensorFlow is highlighted as enabling complex neural networks with very little code?
  • Video class: 2- AI, machine learning and deep learning 31m
  • Exercise: What makes deep learning different from traditional machine learning in terms of feature handling?
  • Video class: 3- Implementing an artificial neuron from scratch 19m
  • Exercise: In an artificial neuron, what happens immediately after computing the net input (weighted sum) H?
  • Video class: 4- Vector and matrix operations 25m
  • Exercise: In an artificial neuron, how can the net input (weighted sum) be written using linear algebra?
  • Video class: 5- Computation in neural networks 23m
  • Exercise: In a multi-layer perceptron, how is the net input vector for a layer computed?
  • Video class: 6- Implementing a neural network from scratch in Python 21m
  • Exercise: In the forward propagation of the MLP, how are the net inputs for a layer computed in NumPy?
  • Video class: 7- Training a neural network: Backward propagation and gradient descent 21m
  • Exercise: During neural network training, what does gradient descent do with the gradient to reduce the error?
  • Video class: 8- TRAINING A NEURAL NETWORK: Implementing backpropagation and gradient descent from scratch 1h03m
  • Exercise: In an MLP implementation, why is the derivatives data structure created with one fewer element than the number of layers?
  • Video class: 9- How to implement a (simple) neural network with TensorFlow 2 24m
  • Exercise: What is the correct sequence of steps when building and using a TensorFlow/Keras model for a simple supervised learning task?
  • Video class: 10 - Understanding audio data for deep learning 32m
  • Exercise: What is the main advantage of using the Short-Time Fourier Transform (STFT) instead of a single Fourier Transform for audio analysis?
  • Video class: 11- Preprocessing audio data for Deep Learning 25m
  • Exercise: Why are the same n_fft and hop_length parameters commonly passed when extracting MFCCs?
  • Video class: 12- Music genre classification: Preparing the dataset 37m
  • Exercise: Why is each 30-second track split into multiple segments when preparing data for a music genre classifier?
  • Video class: 13- Implementing a neural network for music genre classification 33m
  • Exercise: In a music genre classifier with 10 genres, which output layer setup is most appropriate?
  • Video class: 14- SOLVING OVERFITTING in neural networks 26m
  • Exercise: Which pattern in the accuracy and error curves is a strong sign of overfitting during training?
  • Video class: 15- Convolutional Neural Networks Explained Easily 35m
  • Exercise: When using MFCCs as input to a CNN, why might the data shape include a third dimension of 1 (e.g., 100 × 13 × 1)?
  • Video class: 16- How to Implement a CNN for Music Genre Classification 49m
  • Exercise: Why is a validation set used in addition to train and test sets when tuning a CNN for music genre classification?
  • Video class: 17- Recurrent Neural Networks Explained Easily 28m
  • Video class: 18- Long Short Term Memory (LSTM) Networks Explained Easily 28m
  • Exercise: What is the main advantage of an LSTM over a simple RNN for audio/time-series data?
  • Video class: 19- How to Implement an RNN-LSTM Network for Music Genre Classification 14m
  • Exercise: When switching from a CNN to an LSTM-based RNN for MFCC genre classification, what change is made to the input data shape?

This free course includes:

9 hours and 9 minutes of online video course

Digital certificate of course completion (Free)

Exercises to train your knowledge

100% free, from content to certificate

Ready to get started?Download the app and get started today.

Install the app now

to access the course
Icon representing technology and business courses

Over 5,000 free courses

Programming, English, Digital Marketing and much more! Learn whatever you want, for free.

Calendar icon with target representing study planning

Study plan with AI

Our app's Artificial Intelligence can create a study schedule for the course you choose.

Professional icon representing career and business

From zero to professional success

Improve your resume with our free Certificate and then use our Artificial Intelligence to find your dream job.

You can also use the QR Code or the links below.

QR Code - Download Cursa - Online Courses

More free courses at Artificial Intelligence and Machine Learning

Free Ebook + Audiobooks! Learn by listening or reading!

Download the App now to have access to + 5000 free courses, exercises, certificates and lots of content without paying anything!

  • 100% free online courses from start to finish

    Thousands of online courses in video, ebooks and audiobooks.

  • More than 60 thousand free exercises

    To test your knowledge during online courses

  • Valid free Digital Certificate with QR Code

    Generated directly from your cell phone's photo gallery and sent to your email

Cursa app on the ebook screen, the video course screen and the course exercises screen, plus the course completion certificate