In the olden days, the correct way to solve a linear system of equations was the theory of minors (Cramer's rule). With the advent of computers, you suddenly had a huge theory of Gaussian elimination, Krylov subspace methods, and what not.
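To make the contrast concrete, here's a tiny sketch of both approaches on a 2x2 system (function names and the example system are mine, and real solvers use pivoting and floating point rather than exact fractions):

```python
from fractions import Fraction

def cramer_2x2(a, b, c, d, e, f):
    """Solve [[a, b], [c, d]] @ [x, y] = [e, f] via Cramer's rule (minors)."""
    det = a * d - b * c
    if det == 0:
        raise ValueError("singular system")
    x = Fraction(e * d - b * f, det)
    y = Fraction(a * f - e * c, det)
    return x, y

def gauss_2x2(a, b, c, d, e, f):
    """Same system by Gaussian elimination: eliminate x from row 2, back-substitute."""
    m = Fraction(c, a)      # row-2 multiplier (assumes pivot a != 0, no pivoting here)
    d2 = d - m * b          # updated pivot in row 2
    f2 = f - m * e          # updated right-hand side in row 2
    y = f2 / d2
    x = (Fraction(e) - b * y) / a
    return x, y

# Both agree on: x + 2y = 5, 3x + 4y = 11  ->  x = 1, y = 2
print(cramer_2x2(1, 2, 3, 4, 5, 11))
print(gauss_2x2(1, 2, 3, 4, 5, 11))
```

At 2x2 they cost about the same, but Cramer's rule scales factorially with the matrix size while elimination scales cubically, which is why computers changed the "correct" answer.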
But they don't give the same results at those smaller scales. People imagined it, but no one could have put it into practice because the hardware wasn't there yet. Simplified, LLMs are basically Transformers with the additional idea of "and a shitton of data to learn from", and to make training feasible with that amount of data, you do need some capable hardware.
There is so much to digest here, but it's fascinating seeing it all put together!