There Will Be a Scientific Theory of Deep Learning
54 points
3 hours ago
| 4 comments
| arxiv.org
| HN
RyanShook
16 minutes ago
[-]
Here's where I'm missing understanding: for decades the idea of neural networks had existed with minimal attention. Then in 2017 Attention Is All You Need gets released and since then there is an exponential explosion in deep learning. I understand that deep learning is accelerated by GPUs but the concept of a transformer could have been used on much slower hardware much earlier.
reply
whateverboat
6 minutes ago
[-]
The same thing happened with matrices. We had matrices for 400 years, but the field of linear algebra and especially numerical linear algebra exploded only with advent of computers.

In olden days, the correct way to solve a linear system of equations was to use theory of minors. With advent of computers, you suddenly had a huge theory of gaussian elimination, or Krylov spaces and what not.

reply
BigTTYGothGF
14 minutes ago
[-]
The modern neural net revival got kicked off long before 2017.
reply
noosphr
2 minutes ago
[-]
Alex net in 2012 is only 5 years earlier.
reply
embedding-shape
12 minutes ago
[-]
> I understand that deep learning is accelerated by GPUs but the concept of a transformer could have been used on much slower hardware much earlier

But they don't give the same results at those smaller scales. People imagined, but no one could have put into practice because the hardware wasn't there yet. Simplified, LLMs is basically Transformers with the additional idea of "and a shitton of data to learn from", and for making training feasible with that amount of data, you do need some capable hardware.

reply
teekert
8 minutes ago
[-]
If you are in the radiology field it started “exploding” much earlier, with CNNs.
reply
wslh
5 minutes ago
[-]
Don't understimate the massive data you need to make those networks tick. Also, impracticable in slow training algorithms, beyond if they were in GPUs or CPUs.
reply
adzm
1 hour ago
[-]
I'm only partially through this paper, but it's written in a very engaging and thoughtful manner.

There is so much to digest here but it's fascinating seeing it all put together!

reply
4b11b4
42 minutes ago
[-]
wow.. this would be cool. Instead of just.. guessing "shapes"
reply
NitpickLawyer
13 minutes ago
[-]
tbf, we've learned (ha!) more from smashing teeny tiny particles and "looking" at what comes out than from say 40 years of string theory. Sometimes doing stuff works, and the theory (hopefully) follows.
reply
amelius
14 minutes ago
[-]
"A New Kind of Science" ...
reply