I doubt it.
And what if the technology to run these systems locally, without relying on the cloud, becomes commonplace, as it already is with open-source models? The expensive part is training these models, more than running inference.
I agree. Right now a lot of AI tools are underpriced to get customers hooked, and then the prices will get jacked up later. The flaw is that AI does not have the ubiquitous utility internet access has, and a lot of people are not happy with the performance per dollar TODAY, much less after an 80% price hike. We already see companies like Google raising prices citing "AI," and we customers can't opt out of the AI to avoid the fee.
At my company we've already decided to leave Google Workspace in the spring. GW is a terrible product with no advanced features, garbage admin tools, uncompetitive pricing, and now AI shoved in everywhere and no way to granularly opt out of a lot of it. Want spell check? Guess what, you need to leave Gemini enabled! Shove off, Google.
Absolutely. Not only are most AI services free, but much of the paid usage comes from executives mandating that their employees use AI. It's a heavily distorted market.
And a majority of those workers do not reveal their AI usage, so they either take credit for the faster work or use the extra time for other activities, which further confounds any measurement of AI's impact.
This is also distorting the market, but in other ways.
Past successes like Google encourage hope in this strategy. Sure, it mostly doesn't work. Most of everything that VCs do doesn't work. Returns follow a power law, and a handful of successes in the tail drive the whole portfolio.
The key problem here isn't that this strategy is being pursued. The key problem is that first-mover advantages rarely last with new technologies. That's why Netscape and Yahoo! aren't among the FAANGs today. The long-term wins go to whoever successfully creates a moat sufficient to protect lasting excess returns. And each generation of AI leapfrogs the capabilities of the last so thoroughly that nobody has figured out how to build such a moat.
Today, 3 years after launching the first LLM chatbot, OpenAI is nowhere near as dominant as Netscape was in late 1997, 3 years after launching Netscape Navigator. I see no reason to expect that 30 years from now OpenAI will be any more dominant than Netscape is today.
Right now companies are pouring money into their candidates to win the AI race. But if the history of browsers repeats itself, the company that wins in the long term would launch about a year from now, focused on applications on top of AI. And its entrant into the AI wars wouldn't launch until a decade after that! (Yes, that is the right timeline for the launch of Google, and for Google's launch of Chrome.)
Investing in Silicon Valley is like buying a positive-EV lottery ticket. An awful lot of people are going to be reminded the hard way that it is wiser to buy a lot of lottery tickets than to sink a fortune into a single big one.
Incorrect. There were about 150 million Internet users in 1998, or 3.5% of the world population, and the number grew tenfold by 2008 [0]. Netscape had about 50% of the browser market at the time [1]. In other words, Netscape dominated a small base and couldn't keep it up.
ChatGPT has about 800 million monthly users, already 10% of the current world population. Granted, not exclusively. ChatGPT is already a household name. Outside of early internet adopters, very few people knew who Netscape was or what Navigator was.
[0] https://archive.globalpolicy.org/component/content/article/1...
[1] https://www.wired.com/1999/06/microsoft-leading-browser-war/...
People are missing the forest for the trees here. Being the go-to consumer gen-AI provider is a trillion-plus-dollar business. How many tens of billions you waste on building unnecessary data centers is a rounding error. The important number is your odds of becoming that default provider in the minds of consumers.
I used ChatGPT for everyday stuff, but in my experience its responses got worse and I had to wait much longer for a poorer answer. I switched to Gemini, and its answers were better and came back much faster.
I don’t have any loyalty to Gemini though. If it gets slow or another provider gives better answers, I’ll change. They all have the same UI and they all work the same (from a user’s perspective).
There is no moat for consumer genAI. And did I mention I’m not paying for any of it?
It’s like quick commerce: sure, it’s easy to get users by offering them something expensive on VC money. The second they raise prices or degrade the experience to make the service profitable, the users will leave for an alternative.
I haven't seen any evidence that any Gen AI provider will be able to build a moat that allows for this.
Some are better than others at certain things over certain time periods, but they are all relatively interchangeable for most practical uses and the small differences are becoming less pronounced, not more.
I use LLMs fairly frequently now and I just bounce around between them to stay within their free tiers. Short of some actual large breakthrough, I never need to commit to one. I can take advantage of their massive spending and wait it out a couple of years, until I'm running a local model self-hosted behind a Cloudflare tunnel if I need to access it from my phone.
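For reference, that self-hosted setup is already pretty trivial. Here's a minimal sketch, assuming a local Ollama server on its default port reached through a Cloudflare tunnel; the tunnel hostname and model name are placeholders, not anything specific I'm running:

```python
# Minimal sketch of the setup described above: a local model served by
# Ollama (default port 11434), optionally reached through a Cloudflare
# tunnel from a phone. Hostname and model name are placeholders.
import requests

BASE_URL = "http://localhost:11434"  # or the https hostname cloudflared gives you

resp = requests.post(
    f"{BASE_URL}/api/generate",
    json={"model": "llama3", "prompt": "Summarize this email: ...", "stream": False},
    timeout=120,
)
print(resp.json()["response"])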
And yes, most people won't do that, but there will be plenty of opportunity for cheap providers to offer it as a service with some data center spend, though nowhere near the massive amounts OpenAI, Google, Meta, et al. are burning now.
LLMs complete text. Every query they answer is giving away the secret ingredient in the shape of tokens.
So voice assistants backed by very large LLMs over the network are going to win even if we solve the (substantial) battery usage issue.
The local open source argument doesn't hold water for me -- why does anyone buy Windows, Dropbox, etc. when there are free alternatives?
Installing an OS is seen as a hard/technical task still. Installing a local program, not so much. I suspect people install LLM programs from app stores without knowing if they are calling out to the internet or running locally.
See also how all (?) Brits pronounce Gen Z in the American way (ie zee, not zed).
You sometimes see this with real live humans who have lived in multiple countries.
Pay no attention to those fopheads from Kent. We speak proper British English here in Essex
Some people are not from the USA or England.
Bullet-point hell, and a table that feels like it came straight out of Grok.
IMHO the investors are betting on a winner-takes-all market and on some magic AGI coming out of OpenAI or Anthropic.
The questions are:
How much money can they make by integrating advertising and/or selling user profiles?
What is the model competition going to be?
What is the future AI hardware going to be - TPUs, ASICs?
Will more people have powerful laptops/desktops to run mid-sized models locally, and be happy with it?
The internet didn't stop after the dotcom crash, and AI won't stop either should there be a market correction.
By itself, this doesn't tell us much.
The more interesting metric would be token use comparison across free users, paid users, API use, and Azure/Bedrock.
I'm not sure if these numbers are available anywhere. It's very possible B2B use could be a much bigger market than direct B2C (and the free users are currently providing value in terms of training data).
But the AI providers are betting, correctly in my opinion, that many companies will find uses for LLMs that add up to trillions of tokens per day.
Think less of “a bunch of people want to get recipe ideas.”
Think more of “a pharma lab wants to explore all possible interactions for a particular drug” or “an airline wants its front-line customer service fully managed by LLM.”
It’s unusual that individuals and industry get access to basically similar tools at the same time, but we should think of tools like ChatGPT as "foot in the door" products which create appetite and room to explore exponentially larger token use in industry.
Let's estimate 200 million office workers globally as the TAM, each running an average of 250k tokens a day. That's 50 trillion tokens DAILY. Not sure what model-provider profit per token is, but let's say it's .001 cents.
That's $500M per day in profit.
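A quick back-of-the-envelope in Python, using only the assumptions above (200M workers, 250k tokens each per day, .001 cents of profit per token); none of these inputs are measured figures:

```python
# Back-of-the-envelope for the figures above. All inputs are the
# comment's assumptions, not measured data.
office_workers = 200_000_000          # assumed global TAM of office workers
tokens_per_worker_per_day = 250_000   # assumed average daily token use
profit_per_token = 0.001 / 100        # ".001 cents" expressed in dollars

daily_tokens = office_workers * tokens_per_worker_per_day
daily_profit = daily_tokens * profit_per_token

print(f"{daily_tokens:.2e} tokens/day")     # 5.00e+13, i.e. 50 trillion
print(f"${daily_profit:,.0f} per day")      # $500,000,000 per day
```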
Pharma does not trust OpenAI with their data, and they don't work on tokens for any of the protein or chemical modeling.
There will undoubtedly be tons of deep nets used by pharma, with many $1-10k buys replacing more expensive physical assays, but it won't be through OpenAI, and it won't be as big as a consumer business.
Of course there may be other new markets opened up but current pharma is not big enough to move the needle in a major way for a company with an OpenAI valuation.
Many companies have already experimented with this in recent years, most notably Klarna, which was among the earliest guinea pigs and later had to backtrack on the "novel" idea when the results came out.
- OverUtilized/UnderCharged: doesn't matter because...
- Lead Time vs. TCO vs. IRS Asset Depreciation: The moment you get it fully built, it's already obsolete. So from a CapEx point of view, if you can lease your compute (including GPUs) and similarly optimize the rest of the inputs, your overall CapEx is much lower and tied to the real estate, not the technology. The rest is cost of doing business and deductible in and of itself.
- The "X" factor: Someone mentioned TPU/ASIC but then there is the DeepSeek factor - what if we figure out a better way of doing the work that can shortcut the workflow?
- AGI partnerships: Right now you see a lot of Mega X giving billions to Mega Y, because all of them are trying to get their version of Linux or Apache or whatever to parity with the rest. Once AGI is settled and confirmed, most of these partnerships will be severed, because it then becomes a question of which company gets its AI model into that high-prestige Montessori school and into the right Ivy League schools - like any other rich parent would for their "bot" offspring.
So what will it look like when it crashes? A bunch of bland, empty "warehouses," the mobile PDUs that once filled their parking lots gone. Whatever "paradise" was there may come back... once you bulldoze all that concrete and steel. The money will go do something else, like in a Don McLean song.
On Amazon, buying a 5090 costs $3000 [2].
That's a payback time of 212 days. And Runpod is one of the cheaper cloud providers; for the GPUs I compared, EC2 was twice the price for an on-demand instance.
Rental prices for GPUs are pretty darn high.
[1] https://www.runpod.io/pricing [2] https://www.amazon.com/GIGABYTE-Graphics-WINDFORCE-GV-N5090G...
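Rough sketch of the payback arithmetic; the hourly rental rate here is an assumption backed out of the 212-day figure, not a quoted Runpod price, and it ignores electricity, cooling, and anything less than 100% utilization:

```python
# Payback arithmetic for buying vs. renting a GPU. The $3000 purchase
# price is from the comment; the hourly rate is an assumption backed
# out of the stated ~212-day payback, not a quoted Runpod price.
purchase_price = 3000.0   # RTX 5090 on Amazon, per the comment
hourly_rate = 0.59        # assumed rental $/hr; yields ~212 days

payback_days = purchase_price / (hourly_rate * 24)
print(f"{payback_days:.0f} days")  # ~212 days at 100% utilization,
                                   # ignoring power and idle time
```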
Many of the legacy systems still running today are IBM or Solaris servers that are 20 or 30 years old. There's no reason to believe GPUs won't still be in use in some capacity (e.g. inference) a decade from now.
Even if all of the GPUs inside burn out and you want to put something else entirely inside of the building, that's all still ready to go.
Although there is the possibility they all become dilapidated buildings, like abandoned factories.
If the most valuable part is quickly depreciating and goes unused within the first few years, it won't have a chance at long-term value like fiber did. If data centers become, I don't know, battery grid storage, it will be very, very expensive grid storage.
Which is to say that while the early rush to lay fiber produced something eventually useful, overallocation of capital to GPUs goes to pure waste.
Giant telecoms bought big regional telecoms which came about from local telecoms merging and acquiring other local telecoms. A whole bunch of them were construction companies that rode the wave, put in resources to run dark fiber all over the place. Local energy companies and the like sometimes participated.
There were no standard ways of documenting runs, and it was beneficial to keep things relatively secret: if you could provide fiber capabilities in a key region while your competition was rolling out DSL and investing lots of money, you could pounce and make them waste resources, and so on. This led to enormous waste and fraud, and we're now at the outer edge of usability for most of the fiber that was laid - 29-30 years after it was run, most of it never has been and never will be used.
The 90s and early 2000's were nuts.
At the local level, there is generally a cable provider with existing rights of way. To get a fiber provider, there are four possible outcomes: universal service with subsidy (funded by direct subsidy), cherry-picked service (they install where convenient), universal service (capitalized by the telco), and "fuck you," where they refuse to operate (i.e. Verizon in urban areas).
The privately capitalized card was played out by cable operators in the 80s (they were innovators then, and AT&T had just been broken up and was in chaos). They have franchise agreements whose exclusivity was used as loan collateral.
Forget about San Diego, there are neighborhoods in Manhattan with the highest population density in the country where Verizon claims it’s unprofitable to operate.
I served on a city commission where the mayor and county were very interested in getting our city wired, especially as legacy telco services are on the way out and cable costs are escalating and will accelerate as the merger agreement that formed Spectrum expires. The idea was to capitalize last mile with public funds and create an authority that operated both the urban network and the rural broadband in the county funded by the Federal legislation. With the capital raised with grants and low cost bonding (public authority bonds are cheap and backed by revenue and other assets), it would raise a moderate amount of income in <10 years.
We had the ability to get the financing in place, but we would have needed legislation passed to get access to rights of way. Utilities have lots of ancient rights and laws that make disruption difficult. The politicians behind it turned over before that could be changed.
If there's ever a glut in GPUs that formula might change, but it sure hasn't happened yet. Also, people deeply underestimate how long it would take a competing technology to displace them. It took GPUs nearly a decade, and the fortunate occurrence of the AI boom, to displace CPUs in the first place, despite bountiful evidence in HPC that they were already a big deal.
* The GPUs in use in data centers typically aren’t built for consumer workloads, power systems, or enclosures.
* Data Centers often shred their hardware for security purposes, to ensure any residual data is definitively destroyed
* Tax incentives and corporate structures make it cheaper/more profitable to write off the kit entirely via disposal than to attempt to sell it after the fact or run it at a discount to recoup some costs
* The Hyperscalers will have use for the kit inside even if AI goes bust, especially the CPUs, memory, and storage for added capacity
That’s my read, anyway. They learned a lot from the telecoms crash and adjusted business models accordingly to protect themselves in the event of a bubble crash.
We will not benefit from this failure, but they will benefit regardless of its success.
If someone can afford an 8 GPU server, they should be able to afford some #6 wire, a 50A 2P breaker, and a 50A receptacle. It has the same exact power requirements as an L2 EV charger.
In reality, if you have a dryer outlet, you have a good fraction of 10 kW available.
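Rough circuit math behind those numbers, assuming North American 240 V split-phase service and the usual 80% continuous-load derating:

```python
# Circuit math behind the comment above. Assumes North American 240 V
# split-phase service and an 80% continuous-load derating on the breaker.
def continuous_kw(volts: float, breaker_amps: float, derate: float = 0.8) -> float:
    """Continuous power available on a circuit, in kW."""
    return volts * breaker_amps * derate / 1000

print(continuous_kw(240, 50))  # 50 A circuit (#6 wire / L2 EV charger): 9.6 kW
print(continuous_kw(240, 30))  # typical 30 A dryer circuit: 5.76 kW
```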
> You can already use Claude Code for non engineering tasks in professional services and get very impressive results without any industry specific modifications
After clicking on the link, and finding that Claude Code failed to accurately answer the single example tax question given, very impressive results! After all, why pay a professional to get something right when you can use Claude Code to get it wrong?
What do you think LLM tuned GPUs or TPUs are going to be used for that is completely different and not AI related?
The key dynamic: X were Y while A was merely B. While C needed to be built, there was enormous overbuilding that D ...
Why Forecasting Is Nearly Impossible
Here's where I think the comparison to telecoms becomes both interesting and concerning.
[lists exactly three difficulties with forecasting, the first two of which consist of exactly three bullet points]
...
What About a Short-Term Correction?
Could there still be a short-term crash? Absolutely.
Scenarios that could trigger a correction:
1. Agent adoption hits a wall ...
[continues to list exactly three "scenarios"]
The Key Difference From S:
Even if there's a correction, the underlying dynamics are different. E did F, then watched G. The result: H.
If we do I and only get J, that's not K - that's just L.
A correction might mean M, N, and O as P. But that's fundamentally different from Q while R. ...
The key insight people miss ...
If it's not AI slop, it's a human who doesn't know what they're talking about: "enormous strides were made on the optical transceivers, allowing the same fibre to carry 100,000x more traffic over the following decade. Just one example is WDM multiplexing..." when in fact wavelength division multiplexing is the entirety of those enormous strides.
Although it constantly uses the "rule of three" and the "negative parallelisms" I've quoted above, it completely avoids most of the overused AI words (other than "key", which occurs six times in only 2257 words, all six times as adjectival puffery), and it substitutes single hyphens for em dashes even when em dashes were obviously meant (in 20 separate places—more often than even I use em dashes), so I think it's been run through a simple filter to conceal its origin.
Other than that, I'd rather read a comprehensive article than a summary.
On topic: It is always quite easy to be the cynical skeptic, but a better question in my view is: is the current AI boom closer to telecoms in 2000 or to video hosting in 2005? The parallels to both are strong, and the outcomes vastly different (Cisco has only barely recovered to its 1999 level, while YouTube is printing money).
What about the possibility of improvements in training and inference algorithms? Or do we know we won't get any better than gradient descent/Hessians/etc.?
This is a kind of risk that finance people are completely blind to. OpenAI won't tell them, because it keeps capital cheap. Startups whose bets depend on hardware capability remaining centralized won't even bother analyzing the possibility. With so many actors incentivized not to know, or not to bother asking the question, that's where the biggest systemic risk lies.
The real whiplash will come from extrapolation. If an algorithm advance shows up promising to halve hardware requirements, finance heads will reason that we haven't hit the floor yet. A lot of capital will eventually re-deploy, but in the meantime, a great deal of it will slow down, stop, or reverse gears and get un-deployed.