FilterHN

Show HN: I built an AI Agent that uses the iPhone

52 points

by rounak

5 days ago

| past

| 5 comments

| github.com

| HN

It’s powered by OpenAI’s GPT 4.1 model.

Uses Xcode UI tests + accessibility tree to look into apps, and performs swipes, taps, etc to get things done.

▲

totetsu

5 days ago

[-]

> "It would need access to our browser, an ability to drive that. It would need our credit card information to pay for the tickets. It would need access to our calendar, everything we're doing, everyone we're meeting. It would need access to Signal to open and send that message to our friends," she said. "It would need to be able to drive that across our entire system with something that looks like root permission, accessing every single one of those databases, probably in the clear because there's no model to do that encrypted."

Whittaker added that an AI agent powerful enough to do that would "almost certainly" process data off-device by sending it to a cloud server and back.

"So there's a profound issue with security and privacy that is haunting this sort of hype around agents, and that is ultimately threatening to break the blood-brain barrier between the application layer and the OS layer by conjoining all of these separate services, muddying their data, and doing things like undermining the privacy of your Signal messages," she said.

--Meredith Whittaker earlier this year.

▲

katsura

5 days ago

[-]

I've been thinking about building a robot that can use a camera to look around, use motors to go in different directions, and when it sees a human, it could also ask if they've seen John Connor, and if the person is being "difficult" then press a button to terminate them.

The interesting thing is that the three laws of robotics says that robots shouldn't harm humans, but I don't really see a way for an AI agent to understand that by "pressing a button" they actually hurt the human.

▲

voidUpdate

5 days ago

[-]

You have stumbled upon the point of the three laws of robotics, which is that they are part of a series of stories showing why they don't necessarily work

▲

gryfft

5 days ago

[-]

To wit, the three laws are actually a formulation of three laws of tool design: a tool must not harm its user; a tool must be fit for purpose and do what the user wishes, as long as that doesn't harm the user; and a tool should be sturdy and reusable so long as that doesn't interfere with the tool's safety or usability.

These design principles make sense when you are talking about a non-sentient object, but intelligent, adaptable beings cannot be so easily constrained.

▲

rvnx

5 days ago

[-]

At some point (~50 years from now ?) they could even form their own type of life. If they can mine for resources, think, do actions and reproduce. "synthetic life"

▲

diggan

5 days ago

[-]

> If they can mine for resources, think, do actions and reproduce. "synthetic life"

Essentially the story of the Horizon series of video games: https://en.wikipedia.org/wiki/Horizon_(video_game_series), and I'm sure many other sci-fi novels.

▲

rvnx

5 days ago

[-]

Or like in Futurama, the apparition of "Robosexuals"

▲

M4v3R

5 days ago

[-]

I underground that this is nothing more than a proof of concept but imagine what Apple itself could do with this idea if they truly embraced the concept and cut all the internal red tape that currently prevents them from doing so. This is what “Apple Intelligence” should be but never materialized (and at this point I have doubts it ever will, although I am curious what they’ll show off at WWDC this year).

▲

BossingAround

5 days ago

[-]

> I am curious what they’ll show off at WWDC this year

Apparently, not much is planned, per [1]. I'd be very cautious about AI agents like these; from a user level, this has so many security vulnerabilities.

[1] https://www.macrumors.com/2025/05/30/the-macrumors-show-last...

▲

jen729w

5 days ago

[-]

> I am curious what they’ll show off at WWDC this year

Fool me once...

▲

simianwords

5 days ago

[-]

Interesting project, if anything it shows what Android or IOS may support in the near future.

>iOS apps are sandboxed, so this project uses Xcode's UI testing harness to inspect and interact with apps and the system. (no jailbreak required).

What are practical limitations of this? Maybe you can't submit this app to the store?

▲

sunbum

5 days ago

[-]

It's not an app that runs on device at all. It's an program that runs on your mac.

▲

astrodude

5 days ago

[-]

in case if anyone wants to understand how it works: https://github.com/kiranz/phoneagent/blob/add-docs/explanati...