In the future they will come with their 'ready' solution, already 'working' and be even less receptive to look at design and architecture holistically. Just make it like that. And why do you need to spend X man hours? The thing is basically already done!
> In July 2025, the Securities and Exchange Board of India (SEBI) alleged that Jane Street used multiple entities for market manipulation and barred it from accessing the market.
https://www.bloomberg.com/opinion/newsletters/2025-07-07/jan... (https://archive.fo/0PQ2E)
I think the designer here takes a wrong approach and sort of falls into engineer envy where he wants to make prototypes as deep and realistic as possible. And that is not really the most important part of the design job.
The most important thing is that the right thing gets build (why do we need a JSQL input box? what do we actually want? what are other ways to do this?). And this is often better done with pen and paper sketches, meetings, observation, discussions ... rather than too quickly narrowing on a particular design and spinning into discussions on whether buttons should be on the left or on the right, LLM intricacies etc.
As a non-designer I want to see a thought through 1-2 ideas, not 10 ideas you coded with llm and now the burden of thinking which one is better and why is for some reason is on me.
Not really. Maybe its effective from designer side but from other side that has to review all those ideas it is exhausting and counterproductive. Especially when it is in code.
You really don't need a real prototype for most things. Simple visual is more than enough to present an idea and explain etc. In fact, it is often much better as you can see all flows in canvas at once.
With interactive prototype to see all flows of a feature you have to go through each one, which again is very uneffective.
Give it some time. We’ll figure out what LLMs are good and bad at. I think vibe engineering will eventually go up on the wall next to static vs dynamic typing and vi vs eMacs.
At least, that assumes AI models won’t keep improving by leaps and bounds. We’re in a transition period. It’s gonna be chaos.
But a lot of times they don't. Devs building websites for small businesses used to be a thing, and it was almost universally a crap product. Practically every restaurant in my small town has been able to take control of their own website with AI, and it's a better experience for everyone.
Rapid digitisation meant a lot of low-value work wound up being done by high-value people. The economy is pivoting away from that configuration. The high-value folks getting displaced are pissed, partly rightly, partly out of spite. The folks who had to pay those bills are ecstatic, mostly overexuberantly, but in part correctly.
Your comment is valid, but not for the reason you think. What you should be talking about is the grunt work done in our field to non-trivial applications. Adding a search box to a table, or add an extra field to a form, etc.
So you think about a ticket that often might take a few hours, but in badly architected system might take a week, add a field to a class, edit the DB structure (maybe manually, maybe through via an ORM generated migration), add it to DTOs, add a validator, add it to the FE definitions, edit the page layout, etc..
Low value work that until now had to be performed by high-value employees.
I think there's still a lot more custom work than you'd expect. A recent example: My mum is ex-president of a small international NGO. When I visited earlier today, she was bemoaning how the lady who runs their NGO's website charges them an arm and a leg for small changes. As a result, the website is constantly out of date. The current site is entirely built on top of wordpress. There is, of course, politics. This lady holds the login credentials to everything. And she holds them close to her chest.
I showed my mum what we could do with claude design. Claude whipped up a much better looking, working version of (part) of the website in about 5 minutes. I think LLMs aren't quite at the point that my mum could manage the website herself. You do need a little technical knowledge. My mum just doesn't know what to ask for. But we're really close. I've offered to be a human frontend to LLMs for them going forward, if she can wrest control from the "webmaster".
I suspect conversations like this are happening everywhere at the moment. There are an incredible number of small websites out there. Most of them, claude or chatgpt could reimplement in 5 minutes tops.
The current trend is a proliferation of internal tooling. I've honestly found quite a bit of it useful. The rest is the usual should-have-been-a-Google-form nonsense.
Besides I remember what kind of code we had when we first started coding with AI and now with all these coding agents etc. it has become quite impressive.
However, at the end it is still a tool and needs to be used accordingly.
I have a feeling at some point we will have a "Windows 95" moment (when computing really became personal for the masses) in AI, and things will significantly change shape again.
It's boring and it's viral and if you refuse to participate, there are still people annoying you with their beliefs.
It's a fucking tool, like a fucking shovel.
In the name of "loyalty" or "faith" cults require members to burn their bridges, actively cutting off their own potential to escape to any other social support network. That may mean creating enemies out of former associates, humiliating rituals or blackmail material, or simply ruining their reputations through obvious lies.
Unless you invade a country again and we have to deal with the Fallout for years to come, that is.
For example: I have banned all news about Trump administration from my home.
It's simply not interesting. He is a toddler, and I have a toddler of my own.
I myself, I pump my investments at least twice a day. Once in the morning, right after I work out. And then once right after lunch.
I want to, that's not why I do it. I do it cause I fuckin' need to.
I'm not saying AI isn't a good tool. However the less you understand what you're using and what you're doing the further you stand to geopardize the business you're working for.
Do you not pay for Claude?
I think they mean that specifically in working with 3rd party per-project / freelance designers you usually get a "first draft + one adjustment" price, then every modification costs more. Similar for small design shops. Prices aren't necessarily per hour, as you'd get from developers.
However, designing in code is technology-first. One could argue that the purpose of design - to shape the artifacts for human purpose - is better done NOT starting with the strict rules of code. Pen and paper is still hard to beat, not for anything that looks nice, but for helping your mind forward.
It helps to understand the constraints of a medium, but you really don't need to know every level down to the electrons moving through the silicon.
That’s solves an issue I have with all POC - a really good approach
Keen to hear if anyone has had unconventional creative adventures with it.
I find it funny about meeting requirements when you give them, and making safe choices when you don't give direction. So if you're going to rate the output aesthetics and UX/content, but you don't prompt especially much around the aesthetics, you're only getting the safe assumed defaults. It's good at making bootstrap/tailwind clone designs unless you work that angle. For simple web pages, I've started making this the only focus for initial iteration.
I don’t know what prompted to get it away from the base model and any of like the non-standard web design like styles are a little bit too harsh for me if that makes sense. Like for example I like brutalist design, but it just feels heavy sometimes on the apps that I’m making.
I’ve tried to get the AI to describe a style based on the product name or you know it seem that I wanna have for example travel. But then it creates us like steal, amorphic design where everything looks like a boarding pass airplane ticket.
There’s still value in it for me, I get decent enough output to convey my intent and I sometimes manually tweak the HTML.
Defining fonts is a good way to at least not get the same typography as everyone else.
By the way, you can always tune your prompts to not be generic, if you ask for generic UIs without detailed styling prompts you will get generic designs.
For me building a quick (not production quality) frontend demo in code was already often faster than getting the right interaction working in Figma. And it allowed to make it fully interactive so you can catch much more edge cases on the UX side.
Now with Claude Code it's even faster to build the throw away prototype. But not a huge difference since discussing with the users and thinking about how it should work is 80% of the time. Claude maybe halves the other 20% compared to quickly doing it yourself. Faster to first version, slower to iterate if it didn't fully get it.
It's much harder to RL out design taste because it's not self-grounding, and human labelers have no real skin in the game, so this (having a human with a vested outcome in the process directing a model's work) is the best way to get LLMs better at design/"taste"/aesthetic judgment themselves. We were working on the same thing 7 months ago and then I realized that winning over designers to do this would be a huge uphill battle setting up an inevitable fall from grace later on.
What makes me most suspicious of Claude Design is that when you disconnect and reconnect later, it loses context and nags you that the product doesn't work like that. Bullshit. It's at best an anti-abuse/implementation detail (to keep you from launching 10 at once and coming back to them later) or product shortcoming that just so happens to be optimized for keeping you from continuing your design in better tools than theirs for the inevitable followups.
It's great for one shots and it makes sense when you're trying to build a vertical product development stack like Anthropic but I'm disappointed it feels more like a tool optimized for keeping you in their product than for what you're working on. If a company other than Anthropic had shipped this - it's not that hard to build a visual self-eval loop, just use Chrome Devtools Protocol to run headless chrome and take screenshots -> feed into a judge LLM for feedback -> continue - I don't think it would really have seen much adoption.
That said, AI trained on Actor-Critic with a tight human feedback loop definitely seems like the right approach to solving the problem, just not something I want to spend my time training for someone else unless I can do so with higher "entropy" ie high parallelism/optionality
Also, you're mentioning a lot of unrelated tech. DPO, PPO, actor-critic, visual self-eval loops, Anthropic's "vertical product development stack" may be interesting, but they are mostly orthogonal. The article's point is simply that a designer can now turn design proposals into working prototypes faster than with Figma.
Also, you mention what seems to be a random product bug about disconnect and reconnect that doesn't have anything to do with this workflow. It seems to me that you're post-rationalising some insights that are not really there.
Good to think things through and in public, not discouraging it. I hope this reads as constructive.
Written specifications are being reduced in favor of these working prototypes, and now there’s this extra cognitive burden of reading the code and trying to determine what were the intended changes, and what’s the slop that needs to be tossed aside.
We also have to figure out, should we take over this generated PR and make any needed changes? Or do we start over from scratch? There’s often a sense of friction either way.
There have been times where a bunch of unintended changes were generated and I took time to port them over on my reimplementation, and then later on it’s “oops! Sorry! We didn’t mean to change that.”
I get it’s empowering but it does take away from some of the joy I used to find in my work and replaced it with some headaches.
The reason, I believe, is we’ve lost something along the way. Thinking. A non-trivial amount of which is now outsourced to a language model. It paints over the cracks in the prompt, hallucinates how things should work when unspecified, what would normally make someone stop and say “this isn’t quite working”, “how should I communicate this idea” or “what happens when…” has gone. Now, these details are left for after it’s been built proper.
Sure, we can improve the process and reflect more on how to utilise this new technique … but is it better than how things used to be? Yeah nah
Do you look at generated assembly that comes out of your compiler? You don't. So why are you looking at this code?
We have pushed up the abstraction layer.
Because I am getting the call to fix it when it breaks. I don't have to fix assembly by hand because compilers are deterministic and I have maybe encountered a single real compiler bug in my whole career. Compilers have earned my trust. LLMs are eroding that trust more and more every day I work with them. I encounter LLM-created problems in basically every single diff they surface for me, just over the months the diffs are getting bigger and harder to review and uncover the problems.
LLMs are not an abstraction(not even a bad one) because by design what they are doing is disambiguation. Compilers are not doing that, what you put IN the compiler has to be unambiguous in the first place.
Disambiguation is not a functionality of an abstraction layer. A good abstraction layer is the one I don't have to understand and can trust, if I have to understand its inner workings to use it it ceases to be an abstraction. Except with LLMs you can't even do that, they are a black box you can have no hope of understanding.
And it is not to say LLMs and agentic coding tools are not useful, they are absolutely very useful. They are just not an abstraction layer.
no thanks. BE boys do FE now
https://www.bloomberg.com/news/articles/2026-06-04/jane-stre...
In other words, it is an AI booster. Abusing the goodwill of programmers for their OCaml involvement even though most of it was convoluted bloat and inferior to INRIA code is devious.
It happens all the time now and people need to inoculate themselves against it:
A single famous open source person or an open source involved company invested in AI suddenly posts "organic" testimonies in favor of AI. It means nothing. The person is not the same person, the company is not the same company (or is now overtly evil).
It's a shame because if they'd taken an iterative approach of automating various parts of a normal Figma workflow to speed things up for users, that would have helped them discover where the value was; lots of little ideas, failing fast, testing and updating.
Maybe they just got too big and lost their mojo.
Yeah you don't say. High of $124, currently $22. But hey: that's ~~gambling~~ stock trading for you.
That's pretty mean. I disagree with a lot of people on their investment choices and evangelism. That doesn't mean I want them to suffer for it.
What if they're choosing correctly? Like, I've been pretty consistently critical of crypto. Yet I'm glad my index funds have to buy Coinbase.
Klankers will fix everything. Right?
When they paywall hard, I’ll use a local model on my laptop processor instead,
or phone,
and wait a few hours as needed.
from 6 sessions and 5 projects only one template that I choose anything else is really really bad
This is a very disappointing post from Jane Street.
Using AI for things you aren't good at, or not experienced with, is literally the worst way to use AI. You WANT to struggle when learning a new language, and use reliable documentation to solve your problems, not circumvent them entirely by using AI.
This is extreme incompetence, I'm shocked that Jane Street would advertise it.
In a way it's not much different from copy-pasting components from templates or whatever, just with more customisability. And for stuff that isn't HTML-based like React it does worse. It's also not great at building component libraries, I still write those myself with little LLM involvement, but that makes sense because the architecture is actually relevant with that, unlike generating CSS and xml-derived components, which is mostly just declarative templating anyways.
I've had decent success writing the core logic myself and then delegating the UI to AI. I think if I didn't write the core logic it would not work very well, but since it's designed well by myself the AI has a much smaller scope to work in which constrains it enough where vibe coding works. Pretty cool.