I automated opt-outs for 500 data broker sites (open source)
240 points
2 hours ago
| 26 comments
| github.com
| HN
pards
1 hour ago
[-]
I tried this (as a Canadian):

     1. It asks you to optionally sign up for a bunch of other services like Spokeo
     2. It asks for access to your email via Apple's Mail app which I don't use
     3. I got a lot of 404s anyway
     4. Many sites require manual intervention to work
Nice idea, but it needs a LOT of TLC to make it generally useful. I suspect that having a non-numeric "zip" code and a non-US address might be breaking a lot of the automation.
reply
nixass
1 hour ago
[-]
> 2. It asks for access to your email via Apple's Mail app which I don't use

Assumption that people use Apple services by default is wild

reply
NoNotTheDuo
52 minutes ago
[-]
Mail isn't documented as a requirement, but the first item in the Requirements section is "macOS (uses launchd for scheduling and Messages for iMessage)".
reply
jonhohle
7 minutes ago
[-]
Is Messages to automate responding to broker texts or just for notification?
reply
tedd4u
17 minutes ago
[-]
The mail app is a native Mac app but it can use any email account. It's an SMTP/IMAP/POP client.
reply
oofbey
1 hour ago
[-]
They probably built it just for themselves. More the first person in the post title.
reply
amarcheschi
23 minutes ago
[-]
Given oop info I can found online, they built it to advertise themselves, I think show hn would have been much more appropriate

Mac in requirements is wild tho

reply
Forgeties79
14 minutes ago
[-]
If it was windows or Linux only I feel like no one would bat an eye. Plenty of software is OS dependent. Am I missing something?

I do think they should’ve put that in the title, however. Save a lot of people time

reply
amarcheschi
10 minutes ago
[-]
I agree with you for the no one would bat an eye had it been windows or linux

I do find the project cool, just a bit too sensationalized given the title

reply
dfxm12
56 minutes ago
[-]
They probably wouldn't have shared the GitHub repository with hn if this was the case.
reply
michaelcampbell
37 minutes ago
[-]
There are "show hn" submissions for things that people just want to show off multiple times a week.
reply
xnickb
1 hour ago
[-]
So you're saying the phrase "vibe-coded" should've been used somewhere in the title? :-)
reply
IgorPartola
1 hour ago
[-]
Back in 2011 or so the Yellow Pages still delivered physical phone books to ever address in the state where we were. My city literally sent out an extra off cycle recycling truck the next day to pick them all up. Everyone threw them out.

Well my coworkers and I realized that the opt out form just needed an address. We contemplated pulling all known addresses for the entire country and automating submitting them all over several months to opt everyone out. I don’t think it ever materialized but we had a good chuckle about the emergency meeting the Yellow Pages web devs would have had and at what percentage of opt outs.

reply
trollbridge
1 hour ago
[-]
Around the same time frame, my brother rented some rooms in his house to people who had the occupation of actually delivering those phone books. (This was in a different country, but apparently the Yellow Pages existed everywhere.)

The delivery-people got overwhelmed and eventually just resorted to putting the stacks and stacks of phone books into piles and burning them. It took a long time until they got caught because nobody really misses a phone book.

reply
detourdog
3 minutes ago
[-]
The mailroom of my apartment building in college in 1988 was full of phone books that were unclaimed. I took enough to make a platform for my futon.
reply
notwhereyouare
42 minutes ago
[-]
I think dad wanted some extra money one year and he took my brother and I out and delivered 100s of phonebooks in our area.

i think we got a season pass to 6 flags out of it, but i'm not positive

reply
opengrass
1 hour ago
[-]
I sure do! Calling all local contractors for a quote VS falling for the SEO king.
reply
Gregaros
1 hour ago
[-]
whitepages vs yellowpages
reply
ilamont
18 minutes ago
[-]
> I don’t think it ever materialized but we had a good chuckle about the emergency meeting the Yellow Pages web devs would have had and at what percentage of opt outs.

They would just pretend they didn't receive the opt outs, like half of the direct mailers and spammers out there.

I've gone through the trouble of trying to get Uline to stop sending gigantic paper catalogs to my PO Box two or three times per year. They have a form, they just ignore the requests:

https://www.uline.com/CustomerService/ULINE_FAQ_Ans?FAQ_ID=4...

reply
detourdog
8 minutes ago
[-]
I managed to get off uline’s list I think they have a phone number. If one ever orders from them again the process needs to be repeated. This was a physical address not a PO Box.
reply
ctippett
40 minutes ago
[-]
Similarly I remember being at Australia Post discussing data privacy for a project and I couldn't help but make the wisecrack remark "don't y'all routinely distribute millions of individual's personal data every year and just leave the information lying about on people's doorstops for anyone to access?"
reply
airstrike
1 hour ago
[-]
> 4. Solves CAPTCHAs via CapSolver (AI-powered, ~$0.001/solve)

Right, so my suspicion was correct: I'm the only one being inconvenienced by the same old captchas.

reply
jeroenhd
1 hour ago
[-]
It depends on the CAPTCHA, but there's a reason why Apple, Cloudflare, and Google are shifting towards remote attestation for proof-of-humanity.

The reCAPTCHA v3 Enterprise version and MtCaptcha cost a whopping 3x as much ($3 per 1000 solves). Seems like they're the best CAPTCHAs to go for.

reply
heroh
4 minutes ago
[-]
recaptcha v3 will require the human to have a Google certified android device (i.e. no GrapheneOS or LineageOS etc.) and a dedicated iOS app which leaks device ID and other data.

Google will get to know every user browsing the web and link it to a smartphone. Since they’re rolling out government issue ID verification at the OS level, this change will allow Google to identify a random web visitor to a govt ID.

https://support.google.com/recaptcha/answer/16609652?hl=en

reply
rib3ye
16 minutes ago
[-]
Roblox, by far, has the strangest and most difficult to solve.
reply
queuebert
14 minutes ago
[-]
Some chess sites make you solve a checkmate problem for a captcha. Are those automated now, or is that a good method?
reply
muyuu
44 minutes ago
[-]
Captchas are getting so annoying and puzzling they will soon prove you're unlikely to be human if you pass them.
reply
jorvi
35 minutes ago
[-]
Its only Google's ReCaptcha that sucks, with its eternal gaslighting.

"Select stairs": okay, does that mean the railing too? And probably some percentage of people clicked rails, so now I have meta it and guess if that percentage is enough to throw off my guess.

"Select motorbike": okay, but you're showing me a bicycle. I'll click "skip". FAIL. TRY AGAIN. Sighs.. okay, I guess the average person is so dim-witted they will misidentify a bicycle for a motorbike.

reply
GJim
9 minutes ago
[-]
> "Select stairs"

And the "correct" pictures all shows steps, not stairs.

> "Select motorbike"

And the "correct" pictures all show mopeds, not motorbikes.

Christ, don't get me stated on taxis that aren't black, fire hydrants that aren't a yellow H sign (apparently I'm supposed to look for something like a yellow painted R2D2) and WTF is a "crosswalk" (a pedestrian crossing?).

reply
mynameisash
39 minutes ago
[-]
I think my browsing habits may have changed, as I rarely see captchas. However, just the other day, my son was frustrated by one that he said had taken him fifteen or more tries, and he still hadn't succeeded.
reply
weberer
8 minutes ago
[-]
Yeah, that is a very common complaint about Google's recaptcha. If they don't like you, they actually just send you through an infinite failure loop, even though you keep solving them correctly.
reply
thesimon
1 hour ago
[-]
Makes it tempting to buy paid captcha solving just to enjoy life more
reply
aembleton
1 hour ago
[-]
Looks like they have a browser extension! https://www.capsolver.com/products/browser-extension
reply
amelius
1 hour ago
[-]
I'm wondering if this isn't a nice automated way to send your information to 500 data brokers.
reply
sameg14
1 hour ago
[-]
I had this exact thought
reply
somewhatgoated
20 minutes ago
[-]
100%

I haven’t checked but wonder what info you need to provide in step 3 (Fills and submits the opt-out form automatically)

I assume it’s gonna be more than just the name and address?

A much better way to solve this would be to fight for GDPR-like legislation in the US.

reply
_joel
9 minutes ago
[-]
I feel like this is just a way to mark yourself as "active". Do we honestly think evey one of those shady companies sticks to the rules?
reply
SamuelAdams
11 minutes ago
[-]
I use Optery for about two months a year, seems to do a good enough job for most of the data brokers. There are also discounts or promo codes to lower the price as well.

https://www.optery.com/

HN Launch: https://news.ycombinator.com/item?id=30605010

Promo codes: https://www.optery.com/optery-promo-codes/

reply
bborud
31 minutes ago
[-]
Nothing they do actually improves society so in a healthy society we would be able to outlaw what they do. But we don't. So we can't.
reply
somewhatgoated
15 minutes ago
[-]
You can definitely outlaw this. Under GDPR it’s much harder to lawfully collect and sell personal data large scale. Not saying it doesnt happen still but it gives you a legal basis to fight against it — noyb.eu / Max Schremscand others do some excellent important work on that front
reply
rib3ye
20 minutes ago
[-]
At least in California, the DROP form is scheduled to come online this fall.
reply
ramon156
1 hour ago
[-]
The only thing that is tied to MacOS is launchd, seems like that's useful info to add to the docs. I don't know if you can just do a run from the CLI.

Supporting Systemd should be easy. Not sure what windows uses.

reply
jeroenhd
1 hour ago
[-]
Creating a Windows service is a bit harder (as Windows actually uses a real API for services rather than just relying on process spawning and scripting around that), but with task scheduler you can schedule tasks to run once a month in all kinds of ways.
reply
flexagoon
24 minutes ago
[-]
> a real API for services rather than just relying on process spawning and scripting around that

What's the difference? Aren't services always just spawned processes?

reply
jeroenhd
16 minutes ago
[-]
On Windows, services use this API: https://learn.microsoft.com/en-us/windows/win32/services/ser...

Services are executables, but they have dedicated entrypoints/"signals" for interaction with the service manager. That means you can't point a service at a batch file or powershell script, because those applications don't have the symbols to respond to the signalling from Windows.

reply
b40d-48b2-979e
1 hour ago
[-]
sc.exe or tasksched
reply
3RTB297
34 minutes ago
[-]
Having done data broker opt-outs manually using the Big Ass Data Broker Opt Out List (https://github.com/yaelwrites/Big-Ass-Data-Broker-Opt-Out-Li...), very little of the process can be automated. Intentionally.

A few of these services ask you to go find your record among their lists first, so you can confirm which record you want removed using the URL of the record. So either it has to guess on that, or simply isn't doing it.

reply
hash872
32 minutes ago
[-]
Has anyone had any luck deleting themselves from the data brokers who sell cell data to political texters and/or survey companies? Those are the ones I really want to opt out of
reply
projektfu
1 hour ago
[-]
Interesting. Have you been using it a while and is it working to reduce spam?
reply
Waffle2180
1 hour ago
[-]
The state tracking and manual fallback are the most interesting parts to me. For a tool like this, I’d really want a dry-run/audit mode that shows which fields would be submitted to which broker before anything is sent. The awkward threat model is that the tool reduces exposure, but a broken selector could also leak personal data to the wrong place.
reply
stephenlthorn
16 minutes ago
[-]
Thanks for the lively feedback and comments - this is very much a beta/first attempt.

I hate spam = the only reason I built it. No other intention behind it.

I posted here to get support on making it better so others can use it.

I'll take some of these comments and start iterating on them.

Feel free to submit anything directly to the repo or fork and make it better for your own set up.

reply
guidedlight
2 hours ago
[-]
> Name, city, state, ZIP, email, phone

Does this work for anyone outside the US as well? e.g. Will it work for an Australian?

reply
samieljabali
24 minutes ago
[-]
Why does one need to be removed from these sites on a monthly schedule?
reply
LadyCailin
15 minutes ago
[-]
Because the opt out is a useless fig leaf they can point to in the unlikely event that public outcry forces regulators to do something.
reply
LoganDark
2 hours ago
[-]
I got tired of spammers having my information, so I built a tool that submits an up-to-date copy of my information to over 500 websites. Surely this will help.

Jokes aside, I unironically suspect the purpose of many opt-out forms is merely to record the up-to-date information.

reply
hotsauceror
1 hour ago
[-]
Agreed. Any time I click an “Unsubscribe” link in an email, that takes me to a site where I have to provide my email or indeed, do anything more than click “confirm,” I leave. I assume it either resets some kind of consent trigger or sells my data to a new third-party vendor. The assumption of bad faith is now baked into my interaction with almost every corporate entity.
reply
somewhatgoated
13 minutes ago
[-]
Thankfully most let you opt out with a single click - but if not I will put the whole domain in my killfile, so I won’t get any emails from them ever again
reply
Saris
1 hour ago
[-]
Sometimes the people who set up the email service just forget or don't bother to add the receivers email to the URL parameter when you click unsubscribe, so it'll ask for your email again which is always an annoying step.
reply
hotsauceror
1 hour ago
[-]
I refuse to believe that “someone just forgot” to implement a user-friendly feature whose omission coincidentally benefits their company. It is not a coincidence, and it was not done unintentionally. The same way that it is not a coincidence that the “unsubscribe” link is always in six-point font the same color as the rest of the email footer. Code does not happen in a vacuum. Code does not get pushed to production without vetting and approval. As I say, the assumption of bad faith is baked in.
reply
nkrisc
1 hour ago
[-]
That’s their mistake, and any other email I receive from them will be flagged as spam and sent to the junk folder.

I’m not in the business of fixing their mistakes for free.

I will click the unsubscribe link and that’s it.

reply
baggachipz
1 hour ago
[-]
It's a dark pattern which adds friction to the process, in order to reduce the number of unsubscribes.
reply
jackp96
1 hour ago
[-]
There are plenty of dark patterns in digital marketing, and you're generally right about the thinking.

But there is a (somewhat plausible) defense here: if someone forwards you an email and you hit the unsubscribe link, then it unsubscribes them; not you. Requiring the user to enter their email helps ensure you don't accidentally unsubscribe the wrong person.

That said — the most impactful thing anyone can do to punish dark pattern digital marketing behavior is to report the message as SPAM in your email client. That'll hurt their delivery rates and damage their sending reputation with email providers.

reply
calyhre
51 minutes ago
[-]
Could be a way of saving computation, this way the email content is the exact same for everyone receiving it
reply
dangus
1 hour ago
[-]
I think they’re doing it because of your exact behavior: one-click unsubscribe links are easy to do even if you’re on mobile and aren’t giving the process your full attention. Making you enter your email is a barrier.

They already know your email, I don’t see why getting it again would sell it to a new vendor. Clicking an unsubscribe link already verifies you are a real person.

reply
hotsauceror
1 hour ago
[-]
Very true, the act of unsubscribing itself signifies that the email is still live; more bad faith. As to why not sell it to a new vendor, because that would allow them to check a box that says “we offer a feature that allows users to opt out of data sharing agreement with the partners defined in the TOS and onboarding process.”
reply
londons_explore
1 hour ago
[-]
How many of the forms have captchas etc?

How many require you to make an account or confirm your email address/phone?

reply
Saris
1 hour ago
[-]
Looks like it uses AI to solve the captchas, but yeah some do require making an account in my experience.
reply
tedd4u
7 minutes ago
[-]
FYI it could be that people are the "AI."

https://surejob.in/captcha-entry-work.html

reply
victorbjorklund
1 hour ago
[-]
Sweet, I've been wondering why it doesn't doesn't exist as an open source solution.
reply
WhitneyLand
1 hour ago
[-]
Could this task be a nice benchmark for computer use models?

Would interesting to see the success rate for Claude Cowork or Codex’s equivalent feature.

reply
pulse-dev
41 minutes ago
[-]
Good point, could be a solid benchmark. Sites are adversarially built to resist automation and success is verifiable later when records actually disappear, so harder to game than WebArena.
reply
IgorPartola
1 hour ago
[-]
Any chance of this not needing to run on a Mac? I would try it out but want to run it in a Docker container.
reply
LatencyKills
1 hour ago
[-]
Why not just comment out the macNotify() calls in watcher.js and then run it periodically? There are also a few calls to send iMessages that you should remove.
reply
ur-whale
25 minutes ago
[-]
Wow, you will be giving away a shitload of PIID when using this tool, the exact opposite of what it's supposed to do.
reply
somewhatgoated
6 minutes ago
[-]
I said it a couple of times already in this thread but the only thing that will stop these “businesses” is stronger data privacy regulation.

Sometimes it feels like US-Americans have lost all faith in their government’s ability to improve their lives -i can understand it but at the same time where will this lead?

reply
mixtureoftakes
1 hour ago
[-]
you ever look on a title and just immediately know that its going on the frontpage + staying there
reply
Imustaskforhelp
1 hour ago
[-]
There are times where I immediately guess it, the recent mitchell post of AI psychosis was something that I recognized (which is now at 2k upvotes)

But there are other times where I am wrong too and I even comment on threads with less upvotes because the topic is so interesting yet my comment just ends up being isolated.

It's really more like a 50/50.

Even the one post of mine which had reached the front page of Hackernews was something that I absolutely knew could reach front page but then there weren't much responses for a few days but then after a few days, I saw that it was re-uploaded (I think that Hn selects a few submissions which are interesting, I forgot how that mechanism worked) and then I reached the front page of Hackernews ;)

Either way, I think people should just make what they feel is interesting but I remember reading some article once which said a few things which this article follows:

1. I built XYZ... gets more frontpage than we built XYZ...

2. having (Open source) in the title increases the chances too

This article has both of them so its definitely interesting to see it on front page, either way its an really interesting project :-D

reply
7777777phil
2 hours ago
[-]
cool idea, happy to try it out

> Searches each data broker site for your name + state

Is this US only or would it also work for international profiles (and if so what would be the "state" equivalent)?

reply
lolpython
1 hour ago
[-]
The mention of states is because (besides the author likely being located in the States) many of the opt out forms are US only and filter on US state. You could probably just use an uncommon state or territory like Guam and try it, it would still submit opt outs for matching records on sites that are international. For example https://www.familytreenow.com/optout is listed in the broker list, and that seems to work for international profiles.
reply
CodeCompost
1 hour ago
[-]
Now this is a good use of AI
reply
stephenlthorn
2 hours ago
[-]
I got tired spam calls and text, so I built a script that automates the opt-out process across 500+ data brokers on a monthly schedule.

Where I need help: The heuristic approach misses a lot. Many of the generic sites have unique flows the four generic strategies don't catch. I'm looking for people who want to:

- Verify which generic sites are actually succeeding vs. silently failing - Add explicit broker definitions for high-value sites that are currently on the generic path - Test on non-macOS (launchd scheduling is macOS-only; cron fallback would help Linux/Windows users) - Handle email verification flows (script submits the form but can't click confirmation links in your inbox) Repo: https://github.com/stephenlthorn/auto-identity-remove No personal data in the repo — setup script prompts for your info locally and keeps it gitignored.

reply
lolpython
1 hour ago
[-]
Does this current approach succeed for many sites? I see that this repo was clearly vibe coded or at least heavily used AI to write it. That can be fine, it just makes it more difficult to follow how much was done already and how much is left to get this properly working. As for email verification, a stopgap solution could be to just tell me to click confirm on the emails and which senders to look out for. Properly reading the actual inbox on record across providers could be difficult, it requires an actual email client. Also, forgive me if I'm off base on this one, but your comment appears to be AI generated. If so, that violates site guidelines.

> Don't post generated comments or AI-edited comments. HN is for conversation between humans.

https://news.ycombinator.com/newsguidelines.html#generated

reply
ramshorst
2 hours ago
[-]
Nice ! Is it a command line tool ? What info does it need to operate ?
reply
bilekas
1 hour ago
[-]
Its literally in the README front and center.

Requirements

macOS (uses launchd for scheduling and Messages for iMessage)

Node.js 18+

Playwright browsers installed

reply
ramshorst
1 hour ago
[-]
Did you read my questions ?
reply