Deep Learning for Speech and Artificial Vision

Focuses on deep learning architectures for speech processing and computer vision. Covers CNNs, transformers, and attention mechanisms, with hands-on projects using PyTorch and Hugging Face.

DL4SV · UKE · PhD in Computer Engineering · English · February · 2024–ongoing
Deep Learning · CNNs · Transformers · Attention Mechanisms · PyTorch · HF Transformers

Resources

Jupyter Book
Course Material
Full course notes, code examples, and exercises in an interactive Jupyter Book format.