Slides
Andrey Kurenkov's blog post on the history of deep learning.
See Goodfellow et al. sections 6.1 through 6.3