https://github.com/blobcache/blobcache
Trusting a server to store an application's state is a different thing from trusting it to author changes or to read the data. Servers should become dumber, and clients should become smarter. When I use an app, I want the app to load E2E encrypted state from storage (possibly on another machine, possibly not owned by me) make whatever changes and produce new encrypted data to send back to the server. The server should just be trusted for durability, and to prevent unauthorized access, but not to tell the truth about doing either of those things. Blobcache provides an API to facilitate transactions on E2EE state between a dumb storage server and any smart client.
Blobcache can be installed on old hardware along with a VPN like Tailscale and then loaded up with data from other devices. Configuration is like SSH, drop a key in a configuration file to grant access. It removes most of the friction associated with consuming and producing storage as a resource.
I'm using it to build E2EE version control like Git, but for your whole home directory.
Looking through the docs on Peergos, it looks like it's built on top of IPFS. I've been meaning to write some documentation for Blobcache comparing it to IPFS. I can give a quick gist here.
Blobcache Volumes are similar to an IPNS name, and the set of IPFS blocks that can be transitively reached from it. A significant difference is that Blobcache Volumes expose a transaction API with serializable isolation semantics. IPFS provides distributed, available-but-inconsistent, cryptographically signed cells. IPFS chooses availability, and Blobcache chooses consistency. A Blobcache Volume corresponds to a specific entity maintained and controlled by a specific Node. An IPFS name exists as a distributed entity on the network.
Most applications need some sort of consistent transactional cell (even if they don't realize it), but in order to be useful, inconsistent-but-available cells have to be used carefully in an application specific way. I blame this required application-specific care for the lack of adoption of CRDTs.
There's a long tail of other differences too. IPFS was pretty badly behaved the last time I used it, trying to configure my router, and creating lots of connections to other nodes. Blobcache is more like a web browser; it creates transient connections in immediate response to user actions.
That whole ecosystem is filled with complicated abstractions. Just as an example, the Multihash format is pervasive. It amounts to a tag for the algorithm used to create a hash, and then the hash output. I'd rather not have that indirection. All the hashes in Blobcache are 256 bits, and you set the algorithm per Volume. In Go that means the hashes can just be `[32]byte` instead of a slice and a tag and a table of algorithms.
I haven't used IPFS in a while, but I became pretty familiar with it awhile ago. Had I been able to build any of the stuff I was interested in on top of it, I probably wouldn't have written Blobcache.
The good news is Peergos also has serializable transactional modifications. This comes from us storing signed roots in a db on your home server (not ipns). We also have our own minimal ipfs implementation that uses 1000x fewer resources than kubo, aka go-ipfs.
(censorship resistant distributed storage with the same API)
The public storage networks are targeting a different use case than Blobcache though, which I think of as a private or web-of-trust storage network. To use a cryptocurrency backed storage solution, one must manage accounts, or a wallet of transaction outputs, connect to unknown parties on the internet, and pay for the increased redundancy. There's also legal risk, depending on the jurisdiction, when allowing untrusted parties to store arbitrary information on one's devices.
I don't want to consult the global economy in order to make use of my extra hard drives, which would otherwise be idle.
What's the story for people who don't know what an SSH hey is?
The good news is that we don’t have to wonder if a better way is possible. The technology is already here! Projects like Solid (Pods) and AT Protocol (PDS) have proven we can separate your information from the applications you use. You can put your data into your own secure digital "locker" or vault.
The difficulty now is not the technology, but getting people to actually use it:
1- It’s Too Hard to Use: Setting up and managing your personal data locker is currently as complicated as managing a super-secret password for a crypto account. For everyone to adopt it, it needs to be way simpler than just clicking "Log in with Google." If it’s too much work for regular people, it will fail.
2- Big Companies Don't Want to Change (The Incentive Problem): The biggest tech companies make billions by collecting and using your data. They have no reason to switch to a system where they have to ask permission to use data they don't own, unless a major law forces them to, or a new competitor steals their users.
3- Privacy Isn't Enough (The Benefit Problem): Most people won't switch just for "privacy." The new system must offer clear, positive benefits, like letting you move all your friends to a new social app instantly, or securely filling out long forms with a single click from your data locker.
The key to success is building user-friendly tools that hide all the complexity and make this new, secure way of managing data simple for everyone.
We are at (near) a significant local maximum, and (again, as far as I've read, which is not all of it for sure) the people pitching this form of information control have given no set of steps from here to there without significant cost/effort.
Of course they don't have to have the whole path in mind. By definition they just need the first step or two. But they must be steps up.
You don't get wings by wanting to fly; first you need feathers to keep warm (I am not an evolutionary biologist, I don't know if that's a valid theory).
Those users have credible exit to take their data off BlueSky's hosting to someplace else (and as of a week or two ago to move back to BlueSky if they want).
Those users can put whatever kind of data they want in their PDS. They can host their git data via https://tangled.org . They can store their music listening scrobbles with https://teal.fm . They can blog on https://leaflet.pub .
And there's been rapidly advancing host it yourself options. Plenty of folk individually or collectively host PDS. There are alternate relays that collect &n syndicate out everyone's PDS data as that changes. Hosting the aggregation layer is significantly harder especially if you are trying to fully connect the network but there are a couple & progress is good.
it feels like a huge improvement over the status quo, and there's extremely visible developer energy building forward & rolling with the concepts. The breakdown on architecture allows for wins and work in various areas. The base seems solid, the core seems coherent & well built, built to scale not as one big thing but coherent layers. I think it's doing what you are asking for, and the signs of advancement & uptake warm my heart to see.
I highly, highly doubt this, even in the narrowest sense of how many BlueSky users still actively post on X.
* I use Bluesky to chat as a Twitter replacement, which gets me into the Fediverse and gets me a PDS
* I use my PDS to store my payment details, giving me a (at first client-side) way to submit stored payment details that feels similar to storing it in the browser, but stores it in my "server"
* From there, it's a natural step to giving the retailer a token that can be used to pull payment details from my PDS; early adopter retailers are incentivized to do this because it frees them from the burden of storing and updating PII/PCI
* After some subset of users and retailers do this, users see the benefit of controlling their data as a viable alternative to some of the worst user-hostile patterns, e.g. the New York Times' "we don't have a cancel subscription page, you have to call an 800 number" nonsense.
* To the extent that storing PCI/PII in a PDS is as easy as storing it in the browser but with perceived additional benefits, user demand drives wider adoption
* Once it's technically feasible for sites to maintain their business model without storing any PII/PCI, it is much more realistic to write laws that proscribe it effectively for those users who choose that
I wonder how many years need to pass after a company removes a user-hostile pattern before it should stop being lambasted for it. I don't know how long they did what you say, but I could see that 5 years might not yet be enough.
Why would there be a statute of limitations on this. A company doesn’t have an inherent right to customers. A lot of us gave up on Windows with XP (a second strike, after ME). Maybe it’s better now, maybe not, but why would I be motivated to give a company who screwed up already (due to making choices I hate, not just incompetence) another chance? NYT absolutely permanently lost customers by placing revenue above civility; which of us benefits from hoping people forget that?
That's okay.
I agree. And looking at the average web user specifically, is "owning your own data" enough of a UX improvement? Maybe paired with less ads and products that optimize for the end-user rather than advertisers? I think... maybe. I hope so. It's going to take a lot of work done for little money, which is concerning, but I'm optimistic.
Meanwhile - Nothing changes, everything generally gets worse and younger generations come into the world with no memories of the 90s internet or the world before mobile devices or surveillence everywhere.
Applying for a job or apartment or anything today means creating endless pointless copies of your pesonal information in databases across the world that will eventually be neglected, hacked, exploited, sold off etc
I dont know the way out if there is one, I guess we can keep fantasizing and thinking about it. It just feels like it would be easier to get the earth to start spinning the other way sometimes.
This problem is practically fixed in the EU (to the extent that legislation can fix it). Data protection laws have enough teeth that real companies can’t afford to keep or sell customer information illegally.
But people only see the tip of the iceberg and think EU data protection is something to do with annoying cookie banners. We need to do a better job of celebrating Europe’s real achievements in making the digital world better for its citizens. Instant zero-fee bank transfers are another example.
How are real people's lives being effected by these problems?
Free services funded by ads have been a boon for the poor.
For any given ad supported service, one of two things must be true:
(1) the ad spend was more than or equal to the cost of the service for those users
(2) the ad spend was less than the cost of the service for those users
From fork (2), it follows that the service isn't sustainable anyway.
From fork (1), it follows that the buyers of the ad slots in turn only make a profit if those ads led to sales higher than the ad spend.
But for any given poor person, buying that which was advertised on the ad supported service necessarily means spending more than they would have on a non-ad-supported version of the same ad supported services.
thinking of stuff like facebook here...
Endless investing is, depending how you look at it, either not (just) ad supported and preceeds the premise, or it still is ad supported (and hence (1)) just with extra steps to badly hide who is doing it.
Hmm… I suppose the purchase of a vote in a democracy is something that a poor person might not otherwise be able to sell, and where "we advertised and convinced you" is (depending on campaign finance etc. rules) one of the legitimate ways to do it… but even then, for reasons too long to type on my phone, I'd say in this case it would still make the poor poorer.
It doesn't matter how much you think my attention is "really worth". If I want the service now, have no cash, but can pay with my attention, I am strictly more enabled than if the service only accepts cash.
The fork between (1), (2) is how much cash their attention is actually turned into.
To put it another way: what's the attention of a poor person really worth, in dollars? Answer is always less than or equal to the amount they can spend.
I mean I supposed every transaction leaves someone poorer of something and richer in something else. I'm not sure of the point though.
I concede that if the ad companies are willing to forgo collecting X dollars in exchange for showing you an ad then it must be worth >=X dollars to the ad company for the person to see the ad.
But it remains true that the poor person has no way to convert their attention directly into X dollars, and all that taking away the free tier does is make it so that someone who would have made a trade (of their attention for a service) cannot do so.
Starting as a subscription service at least doesn't feel like a broken promise.
Not even close to the case for any big player. It just exists as a moat for smaller companies.
But I've noticed there are two kinds of people when it comes to entrepreneurship and regulations. There are people who go all gung-ho and do what they want and ignore the law as much as they can get away with. And there are people who are so scared of things like laws that they never become entrepreneurs. I don't see much of a middle ground in practice.
Frankly, in here EU did a good job, certainly better then USA does. It would be neat if USA made similar laws too.
Megacorps do get bigger fines then small companies, actually. Megacorps existence is also literally result of winner takes all and rich are untouchable legal system cranked to 11 Americans are proud of.
And the regulatory environment 100% advantages large businesses who can afford to hire dozens of compliance attorneys, and who can handle the risks of noncompliance fines.
PS: I'm not saying US regulates anything effectively either. We just allow every merger until 2 remain in a given market, and then say "Good. We still have competition. Everything must be fine!"
People in the EU are still using Instagram/Facebook/WhatsApp. Zuckerberg did a "ok, if you don't want us to track you, you can pay 12€/month" and everyone just smashed the "I consent to get my data mined forever" button.
Not to mention that we *still* have lobbying for chat control.
Every measure from the EU is, as always, meant to look like our beloved bureaucrats are doing something but absolute ineffective at changing the status quo.
What do you suggest instead?
Things elsewhere are bad, but the EU is worse because it lies to people about the efficacy of its regulations and the whole apparatus only exist to make lawmakers and lobbyists a justification for their existence.
Let's stop pretending that the EU has done anything more than political theater.
> What do you suggest instead?
Break apart any company that has more than 150 employees (by employee, also count individuals working more than 50% of the time to the same company): https://news.ycombinator.com/item?id=31317641
We need to do what we preach: sure, things are worse in certain things but for sure setting up a local network with top-level open source self-hosted alternatives is the easiest it has ever been ever.
Also I think people forget to realise that the type of people who were online in the 90s are still online, many still does exactly the same things. The Internet just got so much easier to use for the rest of the people who doesn't really see the magic of it all. And that's ok.
People always complaining how bad things currently are, they are doing a disservice to all the services and communities still around. They are not sexy or cool but they exist.
Routing to your home address could be hard, but it’s also pretty easy and cheap to set up a reverse proxy from a server you can rent. Routing through a public CDN is also easy and cheap and solves a lot of problems like DDoS.
Understand your enthusiasm but to relate the discussion back to Tim Berners-Lee idea for SOLID data storage protocol... Running self-hosted things like email, NextCloud, Plex, sandstorm.io, etc -- are not relevant to the gp's "nothing changes" complaint.
Without dissecting the SOLID protocol, the basic idea is that transactional data is stored on a separate user-specified "storage pod". It's not just simplistic sharing of "name/address" profile data. Imagining some idealized scenarios might help:
- Spotify music : instead of "playlists, listening history" being stored on Spotify's servers, it is stored on the user's storage pod. Spotify makes API calls to constantly save that data to the user-controlled data location. If the user then cancels Spotify and switches to Apple Music service, Apple can just read the "music playlists data storage pod" and all the recommendations work as expected. No import/export.
- Amazon shopping: instead of order history being in a data silo on Amazon servers. It could be stored in user's "ecommerce orders storage pod". The user can then give permission to Walmart.com to read it to provide product recommendations.
The user "doesn't own their own data" continues with the current AI chat tools. The users' ChatGPT "prompts history" is stored at OpenAI instead of a user-controlled "storage pod".
The walled-garden and data silos don't just restrict consumers. Businesses have the same issue. They use SAP accounting software package or a SaaS tool and their data is locked up in those services. Exports are sometimes possible but cumbersome.
Therefore, self-hosting Plex on local server for a personal music library instead of using Spotify cloud doesn't affect the "nothing changes" narrative. TBL still wants people to have the flexibility/convenience of using cloud services but somehow still keep "ownership of their data".
On the other hand, if you were self-hosting a SOLID Storage Pod at home, and a company like Spotify wrote listening data to it, that's when the narrative changes.
It should be obvious that companies are not incentivized to write transactional data to users' storage pods which explains why the SOLID protocol doesn't seem to gain much traction for the last 9 years.
Not simply "not incentived" but actually decentivized. It's not just that companies lose the ability to have a better algorithm to recommend products, but the data itself is worth a fortune. Google, Facebook, etc are worth as much as they are because of the give amount of personal data they've gathered. And, the reason it's worth so much (well, one reason, and probably the least-scary one) is advertising.
Online advertising is the keystone keeping this pile of shit upright and I can't wait until that bubble finally pops. That is when the narrative will change. None of the ideas in this article will come to pass until all of the data that Google hoards is suddenly useless.
it comes down to the rights to own the data you produce, and have it easily accesible. Solid is just a way of giving people option to excercise this right
Most comsumers just want websites to work. Something like SOLID would add friction. People who care about privacy are a vocal minority.
i believe that this is rising tide, maybe those who care are minority, but not for long
This can't happen until there's another viable revenue stream. Which requires smoothing out everything about microtransactions, creating a culture where people now expect to pay, and building trust that it won't get stuffed with ads anyway.
Sometimes HN makes me feel like I'm the literal last remaining person on the planet who just... uses a desktop computer, and stores data on SSDs and HDDs, all physically connected to the machine, and never worries about how to access this data from another device because there are no other devices from which it should be accessed.
I mean, okay, fine, I do things like publishing to GitHub. But I still have a local copy, and I'm in control.
You start.
edit: I have no idea what people think they're talking about when they're like "people should just" and "you should just." The cage is not all in your mind, dude; it's an actual cage, guarded by people with guns.
So yea, some of us are practicing what we preach.
It's always fun to read articles about how urgently we need to go back to local-this and self-hosted that, knowing I never left!
In what country?
In all the ones I know of, regulations are enforced by courts, without the use of guns or violence.
Posting these kinds of hot takes every day are probably why you got shadowbanned.
Sometimes the last stage in a chain of potential escalations is some kind of deprivation instead of violence. For example, if I get money taken from my bank account to pay a fine, and I only planned to use that money to buy a really big TV online, then now I don't get a really big TV, which is a punishment, but not a violent one.
But that's actually quite rare. It doesn't work with a brick-and-mortar store, because there would still be more stages of escalation available, where I could take the TV from the store without paying and then men with guns would come after me. It also doesn't work if I was going to buy food with the money, since starvation is a form of torture. It also doesn't work if I was going to pay rent with the money, since eviction is violent. Only relatively few escalation chains end in non-violent deprivation.
With fictitious legal entities it's more likely to end without harm to any natural entities. The last stages of the chain of enforcement against a corporation can be to transfer ownership to a different natural person, followed by dissolving it entirely. Both of those are just pushing words around on paper, and nobody gets a black eye. On the other hand, one could argue that dissolution is to a legal person what the death penalty is to a natural person, and we only just don't care as much legal people aren't real. I don't think have any ethical qualms with metaphorically murdering a corporation by writing a legal document saying it no longer exists, but it actually supports my point, that even against fictitious entities, escalation chains end with something analogous to shooting the corporation in the head.
The data needs to be viewed by the holder of that data as a dangerous liability, not an asset. If there were headlines about “Megabank Files Bankruptcy Over Data Breach, Executives Jailed” instead of the general sentiment of “LOL another data breach, here’s a free trial of LifeLock,” there would be changing attitudes about storing arbitrary user data.
Don't get me wrong, I'm in the tech industry and generally more online then likely 95% of the population, but ime ... Nobody even knows what bluesky is?
(They also don't know what X is, though they DO know what Twitter is)
And even more niche products like mostodon, the fediverse altogether etc are entirely unknown to most of the tech industry too.
For example Twitter and Facebook didn’t result in a bunch of Democracies springing up after the Arab Spring, it resulted in the complete opposite. Tech simply amplifies the culture that was already there.
I know the topic of mental health and social media is different from the topic of independence vs the monolithic web. But that doesn’t mean that there isn’t significant overlap in terms of those who are willing to boycott Meta for privacy reasons are also the kinds of people who likely dislike social media for other societal reasons too.
This is a victim fantasy, and if being under intense attack from the state meant you were rebelling against the authoritarian system, then you would be capping for Parler, Gab, X and Tiktok. Bluesky, however, is only under attack from its own users, who are authoritarian trolls. At least the management seem to be getting sick of them, because it is actively inhibiting their growth* that they've been used as a base for the angriest, most entitled, least interesting people on the planet. It must be hell trying to manage a site filled with people demanding to speak to the manager.
It is also just a centralized twitter clone backed by VC looking for a return; not a revolution.
[*] Of course, it was their strategy to cater to that group because of all the free advertising they'd get from the media. But it had and has nothing to do with Dorsey's hopeful redemption arc, which was only about decentralization (i.e. not having speech under the control of people like him) and resilience. Bluesky was supposed to be bittorrent.
https://en.wikipedia.org/wiki/Solid_(web_decentralization_pr...
You’d think that if anybody could pull off reshaping how data is stored and shared on the Internet, it would be him. And the technology is, well, solid.
Unfortunately, it doesn’t have as much traction as I would hope. Probably because it requires a new way of thinking about many parts of the tech stack. It’s not as simple as swapping out one library for another one. The existing web has so much momentum, and so many of today’s tools and frameworks have assumptions built into them that aren’t necessarily convenient for building a web where users have true data ownership.
Still, I’m rooting for Solid and the team behind it. They clearly understand these issues. They’ve been building libraries and scaffolding tools to make it easier to adopt Solid, For new projects, it’s pretty easy these days.
Well, TFA, and sibling posts to mine, point out some ways in which federated networks are leading the change in this direction. I would add that alongside SOLID and the AT Protocol, ActivityPub also encourages people taking ownership of their own data.
So probably you need to focus your attention to where the change happens instead of waiting for large, ad filled, for profit networks to act on it. Because indeed they have no incentive.
I think I agree. I know I started re-evaluating my internet presence as a whole. I accept that a lot can't or won't do much, but the same was true, when firefox was new and no one wanted to jump ship, but the people, who liked privacy focus and extensions. Those that can, will move. The herd will follow if they see it can work.
One such candidate is cryptocurrency and personal finances. The cryptocurrency wallet will necessarily need to be cryptographically secure, so this at least provides an opening for privacy. Tying it to finances means that there's an immediate application, payment processing, that people might want to use and put up with clunky behavior, at least initially.
All this lacks specificity and finances, cryptocurrency or no, bring their own drawbacks, but it does seem like it's possible to me.
The Internet's attention can be fickle and it's easy to forget that sometimes. IBM used to be a titan before Microsoft supplanted it. Proprietary server operating system, including web servers and databases used to deeply embedded until they were supplanted by FOSS alternatives. Digg, Friendster, Myspace, Yahoo, etc. used to fixtures of the Internet until they weren't.
Strong regulations is the answer. To think that big corporations are going to do anything for us out of their good heart is naive and dangerous.
If a society wants nice things then they need to fight for it. Get elected officials that care to fix things, that fights against big corporations, and that help to split their monopolies.
The USA thinks that they can get a better Internet by doing nothing, like by magic. The reality is that government and civil society are going to need to put a lot of effort to reign in the big tech monopolies.
Let along actually Living in the apartment or working at the job...
A friend's apartment required you to sign up with a third party to get your packages. They made you create an account and accept that they would make pictures and videos of you to access the package room.
Don't even get me started on connected appliances/wifi and app access for doors.
The big majority goes with the comfort of the mainstream, almost by definition.
Then the people who have not viewed an ad or paid a subscription in 20 years complain that the internet sucks and we need to go back to IRC and chan boards. As if ideologically non-paying customers have a voice worth listening to.
They buy servers to self-host services, extra hardware to store data locally and domain names to let others find them. Those who cannot afford it sometimes join niche communities like the Tildeverse as an outlet for the interest.
In my experience it's largely the 'just not interested' camp who always go for the free webmail and whatever free messaging service comes with their phone.
Do people who ideologically refuse to spend money on meat-foods have nothing worth listening to about animal welfare? Who don't spend money on airline flights have nothing worth listening to about climate change? Who avoid companies which use slave labour in their supply chains have nothing worth listening to about human rights?
'Money talks' but that doesn't automatically mean money has anything worth listening to; markets are manipulated by money as well as using it for signalling, and as a goal-seeking mechanism they are prone to local maxima like other things are.
So to follow your analogy, they eat meat by stealing it, and feel like they are sending a message about animal welfare.
You are arguing on the premise that ads would somehow be a fair exchange. That is simply the opposite of the truth. Ads are parasitic. Services with ads are almost always worse than services without, not just by having ads but also in every other way. Ads do not incentive quality, they incentive treating your users as prey and feeding them SEO slop.
I want to compensate people for actual beneficial work they do. But with most for profit internet services that is simply not possible. If you give them a finger they will take your whole arm. For exampme I want to buy good movies. But I simply cannot. All I can "buy" is a pinky promise from them to let me watch a movie under their conditions which they can change at any time under their sole discretion and they can just revoke that possibility for me completely at any time. Would I pay for Netflix they would only give me 720p no matter how much money I give them, because I have to much control over my own hardware for them.
There are exceptions to this that I happily pay for, but those are all niche services that cater to the small group of people like me.
But they went bankrupt in 2017.
Why? Because people don't want to view ads and they don't want to pay a subscription. Vid.me was unable to monetize and collapsed.
Nebula is a more recent example. Creators falling over themselves to promote it, yet conversation rates are still <1%.
It's not anti-competitive practices killing these companies. It's childish entitled users who get offended when asked to compensate.
But I suppose expecting ad-free video streaming 'just because I paid for it' is also entitled and childish, because to people who use those things as putdowns, everything other people want, is. It's like "everything I don't like is woke" in that sense.
The ad-free version is available if you cover the cost of lost ad revenue.
Just because you pay doesn't mean there will be no ads. And just because there are ads doesn't mean there is an ad free service available.
This is how smart TVs can be bought for $300. It's a $600 TV but you pay for half of it in smart TV ads.
Apparently so. But that isn't what I signed up for. That isn't the product I started out paying for. And that isn't the product I agreed to switch to, except by some weasel words on their part.
Your original argument is that people don't pay for things because people are crybabies. My counter argument is that people do and did pay for things and companies abused that, and now people are "once bitten, twice shy" not "entitled". People paid for NetFlix and then when that became a success, content companies pulled their content and made their own streaming services. People paid for YouTube Premium to avoid ads and then YouTube showed 'sponsorships' which are ads in all but name.
https://old.reddit.com/r/youtube/comments/18ll7y6/i_have_you...
Also there isn't a way for people to pay their share of server cost for services like that. For your average non-video communication service your options are paying 0x or paying 50x.
No, they're not native to the protocol, nor are they required. However it's an open protocol. You are free to pick from a number of solutions that compose that goal.
Commercial IRC services? IRC Cloud comes to mind as one I've seen others use. Couldn't tell you how much it costs, how good it is, or if it leaks data.
And some networks provide bouncers so they basically do have that. And maybe some IRCv3 networks, I haven't looked into that much lately.
And hope you never have your identity stolen, or an account hijacked, since that was the only proof of who you are.
That feels like something that could displace other social media in a way that's difficult for for-profit businesses to replicate since it goes against every product manager's instinct to leave engagement on the table, and would stand in stark contrast to the current social media landscape.
I suppose social subscriptions have to overcome network effects and a plethora of “free” alternatives - ranging from iMessage to facebook.
So the idea of paying for the infrastructure needed to see the content produced by your social network doesn't feel like a good deal.
What happens when the rising tide stops but the boats still have to rise?
My bet is that we will hate Google, Facebook, Amazon, modern Microsoft a lot more than people in the 80s and 90s hated IBM and old Microsoft.
or unless you don’t comply quickly enough when they say “jump” and they unilaterally take away “your” gvoice number.
Giving all your data for better services is easily hijackable.
"over the past 20 years" is not the same as next 20 years
You can export your data from Google or Facebook today, but then you need to write a copy of the source UI that faithfully replicates the way all those data fields are supposed to display. And tomorrow the source makes a change so what used to be one field is now two fields, oh and they also removed another field entirely so that data is just gone. Well, in future dumps anyway. Are you going to use the old schema or the new schema for your display? Is it possible to do both?
When everything is in data silos, you can freely and safely change data format, which is something that needs to happen a lot as applications evolve. Even in a data silo, doing this is pretty tricky and bugs and data loss are significant risks. If you're trying to sync between an unbounded number of data repositories where each repository has potentially conflicting relationships with the data schema, data loss is practically assured.
Another big problem is schema permissions and identity. I might have some piece of data that says "person A is allowed to see this set of fields" and another piece that says "person A is blocked from seeing this other set of fields." This gets synced to 3 different servers, one of those servers has no idea that userA is in fact person A. So you fail closed, but then the data on that server practically does not exist if the goal of this data repository is sharing some data with person A. You really can't do any sort of fine-grained access controls in a system where trust/identity/auditing is decentralized.
We have not solved decentralisation in an accessible and useful way yet, and the incentives won’t change until we do. If ever.
I, and many like me, would pay for centralised service or any other service if it meant that we own our data and can tune the algorithms to our own preferences. I wont pay for doom scrolling, but would gladly pay for algorithm to serve me content that would better my human experience.
Governments have given corporation to much power, people need to rise up agains that, if it remains the same in AI age, we humans, and our collective mind would erode to the point of no return.
Unequivocally, users water plants that deliver in demand fruit while being most convenient and cheapest.
So, why can't I have that?
During my standard install of my favourite distro, I would only need to enter my name, subdomain and email password for everything to be magically installed, so I have a standard web site, some file sharing and email out of the box.
However, it would take me a fortnight to get this setup and I wouldn't have a clue how the email actually worked, if it worked. This wouldn't be my first rodeo either, so I wouldn't be starting entirely from scratch. I am also sure that there are some that have setup umpteen virtual linux machines that they could get everything done by tea-time.
Whether two hours or two weeks, it is still not that much work in the bigger scheme of things, which makes me wonder, why haven't I got some all-singing and all-dancing bash script that automates the whole process? But why has nobody else done it either, to make it fully open source and as easy to obtain as it can be?
Also, why can't I buy a glorified router box that does all of this? It could take the mainboard and power circuitry from any laptop, and, out the box, provide a decent web server, mail server and whatever else.
There is a suspicious absence of products in this space.
Step 0 is to secure that box, as routers are obvious targets, even before they have self-hosted data. There are some products based on RPi, NAS and router form factors.
> suspicious absence of products in this space
Earlier efforts:
Apache Wave (federated)
Chandler
Diaspora
FreedomBox
Microsoft Groove (p2p)
Urbit.org
Sandstorm.io
Active OSS projects include Proxmox (https://community-scripts.github.io/ProxmoxVE/), Paperless-NGX (docs), Immich (photos), NextCloud and others, https://github.com/awesome-selfhosted/awesome-selfhostedAs for security, it is all a bit meh. If you have a box that only runs https: with no other ports open, you are half the way there. If you are just running static pages then you are done. If you run a NextCloud type of beast then you are opening things up, but my hunch is that it works just fine with nobody losing sleep on it.
There was no choice but to use someone else’s computers for moving around large files. Plus CGNAT and whatnot making people have to use dynamic DNS. If a turnkey solution could have existed 20 years ago, maybe a market for it would have developed before the big companies locked it down.
Even if I’m wildly in favor of user control over data, I’d venture to say that there still is no choice but to use someone else’s computers, and not just for performance reasons. If applications have to gather every individual user’s data that gets shown to another user from somewhere outside their servers every time, won’t reliability and consistency and UX likely become nonexistent, in addition to the unusable performance?
Decentralized does not need to be slow like that. And very limited upload does get to be a problem if you want more than a couple people/servers to be able to access your media posts at the same time.
The person you replied to is assuming a reasonable distributed system.
But self-hosting machines are susceptible to the "I can only upload pictures and videos at 5-10mbps" problem. That requires more difficult peer-to-peer systems.
The first problem only requires getting small bits of data onto the same machine. The second problem requires getting large amounts of data onto many machines. Or reasonably symmetrical upload speeds.
That some people don't want to spend time learning the thing that you happen to find interesting doesn't mean they're wasting their lives.
So the next best thing is trying to operate in the constraints that apply, such as most people being unwilling to learn new things and going down the path of least resistance.
Even today, I doubt I could get anyone to just give me a smartphone.
Plex is obviously not true self hosting, but it’s a lot closer to it than a Netflix subscription, and the number of people who I do not consider very tech savvy who have not only been joining other people servers but trying to set up their own is staggering lately. And they’re not simply doing it because they want free movies or something. A lot of them have done it for the same reason I initially started: their kids.
I am concerned about the media that is put in front of my kids. I care about what shows they are watching. Kids are going to get their hands on screens there almost is no getting around it, so I would rather not trust YouTube et al with deciding what my kids do and don’t see. I can’t realistically be there to catch literally everything they watch, but if they’re using my server I know they only have access to a certain Library at all times so I can rest a lot easier. In a lot of ways I imagine this is how our parents felt when we were kids. On cable television growing up there were only so many “weird” or troubling things that could pop up, definitely nothing as extreme as we see today, and you could be reasonably aware of what most of those things were and know what channels to forbid/what times your kids should not have free access to the TV.
I found a lot of other parents feel the same way here. They’re just tired of feeling like the Internet is such an incredibly hostile place and want to find ways to take a little power back into their own hands.
I don’t know hopefully something useful popped up in that rant above. I have a lot of disjointed thoughts about this I really haven’t been able to bring together.
I started with CasaOS and Jellyfin. Quickly outgrew Casa and moved to learning Docker and setting up my own container stack, moving from media self-hosting to adding new containers of stuff like budgeting apps. I’m still working on building out my server but every container I add, the goal is basically to self-host a version of something I’m doing on a centralized service on the web and ultimately take my data and privacy back.
I will say some peoples’ elitist attitudes about stuff can be annoying and discouraging; it’s the same general spillover attitude from the Linux supremacy crowd. When I started with Casa I had someone basically tell me I was wasting my time and if I wasn’t running everything in VMs why bother. Which is entirely the opposite attitude to get “normies” and low technical literacy people on board, they need easy one-click install solutions like CasaOS. And if they decide to move onto something more complex, well I’m sure they can figure out how to reimage and rebuild their server in ProxMox or Docker as part of that.
And I still don't get the "VM for everything" crowd. Why would you do that when you have containers?
It's much simpler and lighter. Any 200€ old corporate PC can run a dozen containers easily. What it can't do is run that same stuff in 12 VMs.
Definitely agree about the elitist attitude problem. The amount of people who dunk on people for using Plex when I think it’s a fantastic jumping off point for true self hosting…it’s just so unnecessary and becomes a missed opportunity.
I don't see how this follows. The moment you create/share data with a site, what's to prevent them from reselling it?
The only thing this seems to attempt to solve is portability/interop (and moving control of and responsibility for blocking/moderation/spam to users rather than sites).
I don't see how it helps at all with privacy or you "controlling" who gets your data. If you give it to site A but not data collector B, what's preventing A from selling it to B? As far as I can tell, the situation will remain identical to how it is today.
Your data will never be in one place unless you never share it. The moment you use it with other sites or services, it is stored there too, out of your control.
All that is much, much better than what we have now.
But it doesn't? Obviously every site's TOS will say that by providing them with your data they can use it for all sorts of purposes. If you sued, you'd lose.
And you're generally going to want to make your data available to the various services requesting it, because otherwise most people won't see your posts and comments on their preferred platform.
Facebook couldn't enforce a TOS because the hosting user had never gone to facebook.com and created an account, so the user never agreed to a contract. But a user couldn't enforce a TOS either because the crawling was automated, so Facebook wouldn't be agreeing to a contract either. But Facebook would be allowed to use the data because that's what a user is inviting by making it publicly available to crawlers and not doing anything to restrict access to Facebook.
When Meta (or any other company) decides to destroy them, they go away forever. You have no "control" over it.
https://indieweb.org/POSSE is the way to go.
You want to write a long post on a 3rd party platform? Write it on your own device, that you control. Then you save it, copy the content and post wherever you like.
If your 3rd party blogging or social media platform goes tits up and everything disppears, you still have your own copy you can just Ctrl-C Ctrl-V anywhere.
You can go as fancy with this as you like, depending on your nerd-level. You can have a self-hosted N8N system that automatically reposts everything to new sites you add to the flow. Or you can just have your stuff in a directory in Obsidian.
If I can clearly assert origin and personal ownership of my data, I can forbid further reselling of it.
EU legislation shows that we can actually have the right to demand that a company forgets about us. Asserting such rights become easier the more accurately we define what data is ours.
Can you? A site's TOS will say that by sharing your data, you grant them the right to display, reuse and redistribute it, the same as you do now. And that would take precedence because your host provided the data. They requested and you provided.
The only thing that would change that is actual legislation. But then the legislation is orthogonal to personal data storage. If you want legislation for that, pursue legislation for that. Personal data storage is completely separate, and the two shouldn't be confused with each other.
I mean, a TOS could be written that way. But they're generally not, because companies don't want to self-impose limits like that.
The TOS usually has something like "grant the platform a perpetual, worldwide, royalty-free, non-exclusive license to host, display, distribute, modify, and otherwise use that content in connection with the service".
See the word "perpetual"? That's standard.
It sounded to me like you were making a general statement about TOS's.
also by having ability to enable/disable access to your data, you have the power of who gets what and for which purpose
also reselling of your data should become illegal to start with, would you be OKAY if your lawyer sells your data? or your colorectal surgeon? off course not, we have laws in place for that, and same laws should be applied to whoever handles your personal data
Not true -- advertising profiles are vastly more valuable when based on a lifetime of data.
> also by having ability to enable/disable access to your data, you have the power of who gets what
But realistically, when are you ever going to disable access? If you want people to be able to read your replies no matter what social network they're using, you're going to make those replies available to every social network.
> and for which purpose also reselling of your data should become illegal to start with
This is my point. The solution here is legal, not technological. Personal data storage doesn't change anything legally, and changing the law would prevent reselling even if you didn't have personal data storage.
It seems important not to confuse the two, in order not to give people false hopes.
Solid idea is more in line with revolution and demand for our representatives to give their people internet that can push the humanity forward, and not just let us waist countless hours on doom scrolling.
[1] https://bsky.app/profile/byarielm.fyi/post/3lz4vzzhybk2b
I have started to self host quite a lot of stuff but eve then every storage solution has a life of 5-6 years in which atleast one of the components would fail. We click enormous amounts of photos but they do not have any impact like printed photo albums. With ever growing storage costs (both cloud based and self hosted) I’m thinking of going back to keep only important stuff that too in print format.
As I find the size of current drives bigger than my yearly additions (personal pictures and movies), I am quite happy with a 10 year lifetime at low usage. I would love some reliable and affordable long term offline storage, but backup tapes and a reader are not affordable and not in common use for end users. Otherwise I would build a tiered storage system with more reliability and even performance (nvme hot tier? maybe).
I'm amazed that with all these technologies we haven't figured out how to store data long term (atleast couple decades) without changing the underlying components. Like you said tape drives aren't for end users.Also with how technology is evolving (fast and disorganised) I'm not even sure if you would be able read a drive in 20 years. Very tech is that backward compatible.
It is not necessarily bad we don't have a very long term storage solution. Imagine you took backups on 360kb FDD 40 years ago, you were drastically limited on how much data you could store and if we assume you had 1 GB of data back then, that is a huge pile of floppy disks to copy at very slow speed. Now imagine you have 10 TB of data today and that will be a tiny fraction of a microSD in 40 years, but reading your 10 TB from HDD will be painfully slow in the year 2065. At the same time if you replace the storage medium every 10 years you keep up better with capacity and performance.
If not, Amazon Glacier is cheap-ish, as is Backblaze B2 and Hetzner storage boxes.
True offline media like tapes and DVD-RW is mostly dead nowadays as far as I can see.
That’s not my experience at all.
It's kind of fun to go through the thousands of photos in our digital photo libraries and pick the best and most impactful ones to print and save "forever".
The W3C Linked Web Storage (LWS) working group is transforming Solid into a web standard: https://www.w3.org/groups/wg/lws/
Why not dozens of apps running over the "web filesystem" like happens on the desktop? Two reasons: 1. Amazon pricing for transit/bandwidth is way higher than storage, and so it makes accessing your own data quite expensive if it is not in the same datacenter. 2. And there is a huge security and usability gap between "pick one photo" vs "give me [scoped] access to your Dropbox" Often the general-purpose mode does not work that well, is quite slow, or just costs a lot in bandwidth, a thing nobody wants to pay extra for when they're already paying for storage.
> the platforms should be asking us what kinds of data they may copy from our servers, and only with strictly temporary allowances.
Until practical homomorphic encryption arrives, I don't see how this temporariness can be enforced. If we rely on promises or regulation instead of the technical ability to enforce this, how is that any better than today's social media companies promising not to do anything bad with the data they have on us?
Aka: I agree it can’t be dine with technology; it has to be done with regulation, and the EU example already models a lot of it.
price of intelligence is dropping day by day like it or not, sooner or later price incentives for someone to host such social media experience could become financially viable
It is true that full data sovereignty isn't something most people are interested in, but this is more about a cooperative model for data ownership and access. Having your data identifier be JackDaniels@yahoo.com isn't particularly different from it being jackdaniels.is.technically.bourbon.com. In both cases another organization owns some of the path to your identifier and could potentially lock you out of it. In both cases, verizon is near the top of that list (.com).
As far as the domain name system being centralized, I'm not sure I agree. DNS is like a feudal system with hundreds of kings (top level domains) who all work together with one pope (ICANN), and various lords and ladies occupying positions under those kings. If ICANN goes completely bonkers the kings can get a new pope, some of them are literally sovereign because they are nation states. Just for fun, some of those states are ruled by literal kings, too. There are experiments to run a TLD by Decentralized Autonomous Organization (DAO), but I think for the most part nobody really cares because the current system happens to work pretty OK. If you have an idea for a more decentralized way to organize a namespace that doesn't involve your grandmother typing in a massive UUID or onion address, and doesn't result in someone being able to domain squat literally everything; I would love to hear about it.
> foo.bar.baz.bim.bim.bap.com
is owned by the owner of bap.com, under the current system.
Top level domains can change pricing, terms, or cease operation. Freenom is a great case study, as they previously operated TLDs. At the edges, a well-operated subdomain service could offer stronger ownership-like behavior than a top level domain.
Well, either that or someone else hosting their identity (see did:plc), which seems to be the part you say should exist?
Probably DNS is the most decentralized centralized system we have available today that most people can actually use, unless I'm missing some obviously better way of doing the same thing?
But from a practical point of view a decentralised system should not rely on domain name ownership. Any computer can generate a private/public key pair, which is all you need for identify.
Right, but once you've generated those, then what? You need a global registry of sorts so people can lookup each others keys for example, which is why DNS kind of is the best we have available today.
I don't think there is any perfect solution here, but it's hard to come up with something that has better trade-offs than DNS. Sure, ICANN might be based in the US, but so far DNS been relatively safe to rely on, and if ends up not reliable in the future, I'm not sure social media profiles is the biggest worry at that point.
Wouldn't that turn into did:plc:facebook all over again?
We can work to make DNS /ICANN et.al. more democratically operated and people-owned while at the same time devising wholly alternate paradigms like Handshake and similar: https://blog.webb.page/2025-08-21-dap-the-handshake-successo...
That is something that could be feathered in gradually -- your country, region, city, neighbourhood, etc could have their own domains, and you could be anon237@milan.italy or whatever, until you find it necessary or inspiring to obtain your own domain.
DNS is not perfect but I think the best we have for now.
This idea is an incremental improvement over "everyone is posting x.com"
Most companies have no incentive to let you hold your data when they can just hold it for you.
If they do this they can mine it for data to improve their product as well as sell or otherwise indirectly profit from it. And, it's easier.
Also, while the market for privacy focused products isnt nothing, the number of people willing to pay a lot extra to compensate for the missed opportunities companies get by collecting your data is, i think, smaller than many people imagine. Which is sad.
I think the only way it will grow to an appreciable size is by seeing up close and personal what a really vicious stasi-like secret police does with dragnet surveillance and come out the other side, with scars. I believe we've only seen a small taste of this.
This is understating it honestly.
The software industry has become completely reliant on renting data access back to users to maintain subscription revenue. One effect of this is it has devalued the actual software in the eyes of users to such a degree that virtually no one will pay for alternatives, certainly not enough to compensate the development cost.
I'm still hoping they release an Apple TV Pro with fully local LLM capability that's shared with everyone in the family - adding a few TB of disk space to it for local data storage and backups wouldn't be a massive thing.
Most people have no incentive of owning their data. Otherwise the companies which don't give you that would die out because people wouldn't use them if they cared.
Same fallacy as believing smartphones are giant and with non-user swappable batteries because somehow smartphone making companies are forcing this on the market, instead of the real reason which is that it's what consumers want.
I agree that people don't care enough about owning their data for it to matter more than what the companies want to push, which is of course monetizing the data and maximizing user lock-in.
Similarly, I think it's in the companies' interests to use non-swappable batteries: simpler and cheaper to manufacture (I think this is the main reason) and the device is made obsolete earlier which is an added bonus. Maybe small improvements in size etc., but that's a very small difference. Modern phones are already larger even with non-swappable batteries so I'm not sure it mattered. But again, having a non-swappable battery has to be weighed against other features, and availability of alternatives. In the end, people just care more about the other features, even though swappable battery would be a good thing.
Just to conclude: I don't believe markets work to fully cater to what customers actually want. It's more like customers (and other parties) get a compromise between what different parties in the market want.
Consumers want what they're told to want by a constant barrage of commercial propaganda.
Devices are large and non-serviceable because this way they can be sold with a higher profit margin. Side effect being that the larger screens make the embedded commercial propaganda more effective and easy to deliver.
People want vendor lock in...otherwise they wouldnt pay for it.
People want bait and switch sales tactics...otherwise they wouldnt work.
People are perfectly fine with high rents...if they didnt, they would not pay them.
People want their smartphones to be deliberately slowed down when they get old...otherwise theyd vote against it with their wallet.
All other services would read/write from your Pod.
Before 2014 I would have thought Apple to potentially take this route for Time Capsule. Instead they doubled down on iCloud. Google will never take this route. Microsoft is not interested. Amazon should have done this and bundled with cold storage back up but their track record are not good enough. I doubt people trust Meta enough even if the solution was perfect.
In pre 2012 you could at least bet on Apple to be somewhat customer centric.
May be UniFi will do it. They just announced their 2 Bay UNAS and I only just discovered, they are a 40B market cap company. ( I thought they were much smaller )
Its confusing if you mean the NAS will stop working if you stop paying for the subscription or not. If you can no longer access your data on the NAS without a subscription, then the NAS just becomes the cloud with an extra up front cost plus the cost of your own electricity.
Personally I have started moving as much of my data out of the cloud as possible. I've got a Synology and a few single board computers running various services with a Synology in my parent's home for their photos. Their photos back up to my NAS and my data to their Synology.
Its a shame Synology decided to enshitify this year for all products going forward, but UGreen looks like a suitable replacement when I outgrow my current NAS.
What are you doing to your hard drives that the bits are rioting?
I've been running a RAIDZ2 NAS (with ECC RAM) for like 5 years with no data loss/corruption issues. Are you saying if it was just regular RAIDZ there would be data integrity issues?
Misses the point entirely.
Right now everyone is only talking about options that are extreme in both ends.
This is a false contrast. Corporations are institutions governed by people - specifically a board of directors, elected by shareholders. They aren't governed by aliens nor are they self-sentient. https://en.wikipedia.org/wiki/Institution#Examples , https://en.wikipedia.org/wiki/Institution#Examples
Perhaps you meant that you are against for-profit corporations where the customer (who stores data) has no vote in the operation of the corporation? If so, then say that and don't imply it.
People often use "corporation" as a pejorative, often in contrast to individual people. But they forget that a corporation is composed of people and ultimately owned by (some) people - but the kind of people that the writer does not like (shareholders, profit-makers, etc.).
> Notice that Alice’s handle is now @alice.com.
It's funny you're using .com as the example, because:
> The domain com is a top-level domain (TLD) in the Domain Name System (DNS) of the Internet. Created in the first group of Internet domains in March of 1985, its name is derived from the word commercial, indicating its original intended purpose for subdomains registered by commercial organizations. Later, the domain opened for general purposes. -- https://en.wikipedia.org/wiki/.com
Even when you're arguing against commercial organizations for storing personal data. Now you're just naming individual people as if they were companies.
I think the context of “encouraging people to switch” to a pds/solid/data coop, how they operate IS important. For two reasons:
- data coop and controlling data opens the door to a new market if we’re going to join data coops, then we may as well try to share the profits from said coop fairly. Otherwise Facebook can step in as a “data-coop” and keep-on-keeping-on
- a secondary effect is that now there is an incentive to move off facebook. If I can join my local Nowheresville.USA.town data coop and benefit directly to my community by storing data together then I am encouraged to switch to this new paradigm
That is the major undiscussed shift to me. I believe the only way out of the Big Tech dystopia is to incentivize the switch. Even if the reward is pennies. Invest in the community oil well.
When someone uses a service like Dropbox or iCloud Drive or Google Drive, they really aren’t experiencing any kind of problem where their data “isn’t theirs” or is “trapped.” It’s not that hard to migrate to something else and the services themselves are reasonably low-friction.
In terms of social data, users don’t really have a major issue with the status quo, and those who do have already developed relatively popular solutions like Mastodon and BlueSky.
Even “proprietary” photos applications like Apple Photos and Google Photos have very easy migration paths to other services.
So what exactly is the problem we’re trying to solve here? Giving me an @Bob handle? Did I want that or need that?
That's exactly it. And with social media (unlike files and photo storage) migration isn't really something people care about, because it's about the present not the past.
If you move from Twitter to Bluesky, does anyone care about moving their tweet history? They just want their list of followers to migrate over as much as possible, which happens relatively organically anyways.
that's why we all need to exercise our rights and freedoms. I'm scared that if we fail to do this in next few years. And let the AI be used in similar ways like it has been used to create social media algorithms. Then we are all fucked!
Whoever owns your AI owns you, so it better be you who owns it!
The trouble isn't a lack of the right technologies - I'd argue it's a problem in the go-to-market strategy of those building these products/technologies.
Ideas flow along lines carved out by power/influence. Facebook's early strategy was to start with restricting its usage to people at Harvard University - arguably a highly influential institution - and then expand outwards to other highly influential institutions. Only once the "who's who" from those institutions were already onboard did they let down the walls to allow us plebs in, and we all rushed in head-first.
X's current strategy leverages Musk's visibility and influence (for better or worse).
Get the most prominent influencers onboard with your decentralized social network, and others will follow (dramatically easier said than done, of course). But without a significant contingent of influencers/powerful people, your network's DoA.
That's sort of a contradiction, no? Or at least it assumes transplanting the same mechanisms into a new milieu -- which I argue is something to leave behind, because it's those very mechanisms that have ruined the current internet.
I think instead of tapping into the same addictive attention economy schemes, the distributed / decentralized socials could onboard people en-masse by providing what's missing there, and filling a real need.
I'm sure Tim Berners-Lee is much smarter than me, but I kind of feel there are some parallels between the idea of "owning" posts you made in a platform and the ludicrous idea of "owning" game items as NFTs in a blockchain. The latter promises interoperability that games would never deliver. I wonder about the former.
At least I feel the major dealbreaker with this technology is just that it's not worth it for both parties involved.
Right now, Facebook hosts all the posts and monetizes them with ads. So long as they are making money with ads, they have no reason to delete the posts they're hosting, as the posts are their money maker.
But what happens if Facebook no longer "owns" the posts?
So now your posts are in your "personal cloud", which means that unless they are encrypted any website or local app can display them, even without any ads. This means Facebook is no longer making money off the posts. Why would they accept this?
On the flip side, who is paying for the hosting? Facebook? It's no longer their servers hosting the content, so I don't think so? Is Facebook supposed to pay the cloud service for metered API access? Can a cloud service offer different rates to different companies? Is the user supposed to pay for their cloud storage? So you're going to make users pay money to use facebook?
What happens if a post violates the ToS? Can facebook delete my post in my cloud storage against my will? What happens if content that is legal where facebook operates is illegal where the cloud servers operate?
Can I manually edit the data in my cloud storage like I'd be able with a file and then facebook has to treat every post as if it were untrusted input?
What happens if my cloud storage closes my account? I just lose everything? Will I be able to back up my cloud to my hard disk and reupload it to another cloud so facebook can access it? How is facebook going to handle a single user with 2 clouds that have different content?
I feel like this is a very complex thing and there are infinite questions that we can have about how this would be implemented in practice, while it's presented as simply "you own your data."
likewise things like email etc instead of all of us being on gmail we could have community email servers etc.
- Who can see my personal data storage posts? Can someone with Twitter see them?
- No, but you'll own your data
- Bye
So maybe start with something which backs-up what you post on Twitter/Instagram/Discord to your personal data storage through APIs/data export.... This has no downside if it's easy to "activate"
There are SO MANY bots on both Twitter and Instagram that a legit developer shouldn't have any issues automating posts.
Discord is a bit harder, you an post as a "bot" easily, but if you want the posts coming from your actual user, you need to poke the actual client.
The irony of ad supported free services is that if you just let the advertisers pay you directly for eyeball time then paid for your services, it'd be better for you financially while keeping the web pure outside of the "paid to consume ads" app.
People getting into Solid and ATproto today are like people using own XMPP servers decades ago, or Mastodon years ago, or Matrix. Some projects like that will succeed, others will fade. But one day, you won't be able to post to Discord due to some policy changes and you'll have to reevaluate options.
Also, you can't backup from Twitter anymore. Or Discord. Or google photos. Or many others - they cut off that option once they're big enough.
I've been waiting a long time. Over that time, the closed services have only gotten more popular and no regular person is ever complaining that they are "hostile".
Regular people don't like ads, but they dislike paying even more, so they're pretty OK with the status quo. They certainly don't want to be paying for a domain name and paying for hosting.
also your government, your service providers and many other entities are creating data on your behalf
Or I see malicious actors would wreck the federation mechanism.
This is already the case with Email SMTPs
But unforunately it will never take off in a huge way because convenience is king. Average Joe and Jane want to install things with as little efforts as possible.
The incentives do not make sense.
Any utopian future that requires a party to put in a lot of effort to change something in a way that would be a net negative for them, is just not going to happen.
People do not spend money to change the world in a way that would be worse for them but better for other people.
Commercial incentives, no. If this preference exists, it would need to be pursued civically.
But lets say you get them on board and pass some law. Unless its a huge market like the EU or USA, probably what immediately happens is everyone pulls out of that market. Not out of malice but because they suddenly have to rewrite their app and that's probably quite expensive.
Opera Unite was such an awesome idea. https://arstechnica.com/information-technology/2009/06/opera...
There was a neat idea a bit back to allow Service Workers to work across origin: foreign fetch. It wasn't on the internet, was only in the scope of your browser, but I thought it was such a neat advancement. Would have done so much to allow the offline web to weave itself. Alas, deprecated. https://developer.chrome.com/blog/foreign-fetch
https://LMNO.lol is my grain of sand.
I wasn't happy the state of blogging (tracking, bloat, ads, paywalls...), so I built https://LMNO.lol. It's offline first and you can browse blogs from anywhere (even terminal). Your blog is a single Markdown file. Drag and drop it to the browser and your entire blog is generated.
Custom domains are welcome. My blog is running off LMNO.lol that https://xenodium.com
Next, please.
Next, please.
Some things are fire, some things are warm, and some things are DOA.
And I’m typing this on my Linux desktop (f’real).
https://www.schneier.com/blog/archives/2024/07/data-wallets-...
I think it’s entirely unfair to dismiss technology because it hasn’t demanded immediate adoption by society. Solid is attempting to help define a better data future. We have working mechanisms in place but everyone is at a disadvantage except the people loyal to these giant corps. Attempting to give people the power to organize their data as they wish and to be used as they wish is worth it. Even if it doesn’t bring a renaissance.
Market share matters, critical mass matters, adoption matters. I'm suggesting that mindshare goes negative over time if these things aren't achieved, and when you have long-tail blog posts trying to pump life into it, it's pivot time.
Righteousness alone doesn't win any of those things. It's been a very long time since Solid was released and it's like a whisper in the wind.
Arguably it hasn’t taken off because no one has incentivized using it.
The business model of cloud service providers makes a lot of sense- we have a system which stores and operates on your data, you pay some rental fee for us to store it and operate on it, easy peasy. The cost is related to both the utility of the operations the operator performs (to both the operator and the user) and the amount of data the user stores.
Fundamentally this is how everything from Dropbox to Facebook is governed- Dropbox does not devise much utility per GB and users store a lot, so you rent per GB, but at Facebook, they don't store lots of your stuff, and on the data side maybe you don't get much value from it as it's a cesspit, but the data is valuable to Facebook to sell ads, etc, so they can provide the service for free.
Importantly, you don't need to improve the product to continue extracting this rent, because the product you are selling is not Dropbox v4, Facebook v2.3, rather you are selling ongoing access to the rental.
As soon as you introduce even simply a federated system where a few corporate operators are involved, it becomes very hard to justify extracting rent there as the network designer, as the operators are taking on the cost of actually storing the data. You have to really be iterating on the core product to use a SaaS business model here. Some things simply don't need a v4, does Dropbox really need that much iteration?
Meanwhile as the system designer, life has become a lot more complex for you. Suddenly you cannot push unilateral sweeping changes to APIs, you need to version things in a way that is compatible between, say, one university updating their system but not the other. Since your users are a few large operators rather than millions of individuals, you lose the network effect advantage of being able to screw over a few users for the "greater good", since if you irritate one corporate client, you lose a lot of your install base. Why would you voluntarily choose this harder path as a company?
Things get even worse as you increase the level of decentralization. The reality is users expect the polished experience that the rental companies can give you; they want their data always accessible so that their friend can see the pic they shared without needing to keep their own computers running, they want the "like counter" to go up without their personal node subscribing to messages from other nodes, etc. The only users that will accept a worse experience are people who have are motivated by their philosophy re: personal data ownership, and this crowd will want a FOSS solution, so you can say goodbye to charging them for Dropbox v4, they are simply not interested if you're not giving them the source code for free. (I suspect this is where the author sits, but fundamentally I don't think it will get mass appeal, most people simply do not care about data ownership above something that "just works".)
So now you are dealing with problems like dynamic generation of redundant data and fault- and Byzantine-tolerant consensus algorithms so that your system can maintain function even when the user turns their computer off, and you have to deal with wrapped-key cryptography so that the redundant data can be split across all these user nodes without you worrying that an unauthorized user can read it, and then you have issues like how do you deal with nodes that are too slow to process updates (perhaps some user data needs to be stored in this conflict-free replicated datatype you devise), and eventually you go through all of this to... create a system that is less monetizable than the rental model, because you can't extract that rent for ongoing data storage, and we know users are not interested in actually paying for software.
The browser controller actually runs its own local server that handles indexing and archiving on your disk, while the front end lives inside your browser as a dashboard or control pane. So it’s both a locally hosted app and a browser extension of sorts.
This is still a work in progress, but one direction I want to push further is allowing users to publish curated collections or search indexes of their browsing history.
More likely, though, you’d create a separate archive centered on a topic you care about, and as you browse you selectively add pages to that topic. Over time, you end up with a niche search engine tied to your expertise.
If that archive is good, others might find it valuable—and you might choose to publish it from your own machine. With tunneling tech (Cloudflare, Tor, etc.), you can expose your local box to the public internet. The vision is: user-sovereign data, but still shareable.
You could even federate groups of topic-based archives into a shared search ecosystem, useful for domains like biotech or other specialized fields.
Another crucial point: DownloadNet archives your browsing in real time. It doesn’t crawl externally; it captures exactly what you see, including sites you access via institutional credentials (e.g. research journals behind paywalls). Then you can optionally share those archives with a trusted group.
I’m also exploring a web-document bundle format: package an interactive set of web pages (not just one) into a self-contained snapshot you can send (e.g. via email). The recipient can browse that snapshot locally, with all internal links intact, as of a particular moment in time. It’s a simple but powerful idea, and I think it has real growth potential in the data-sovereignty space. I started this as a passion project, and I believe many others care deeply about these ideas too. If you’re interested or want to get involved, head to the repository.
One way my vision differs from something like Solid is the philosophy of adoption: rather than launching with a full-blown protocol, you start with a simple tool that users adopt, extend, and share. Over time, emergent use cases and community practices shape the system. It’s bottom-up rather than top-down.
I’m not dissing Solid — I understand its aims and don’t see this as strictly competitive or exclusive. But I feel the incremental, user-led route is likelier to produce something sustainable. You grow it in the wild, learn what users actually need, and adapt. Instead of trying to design for all cases in advance, you let real-world use teach you what matters.
Anyway, that’s the gist of my vision—and how it diverges from other approaches like the one in the article you referenced. While it may seem as a condemnation of other ideas, it's not. So please don't take it that way.
If this is something you could get into, I encourage you come on over to the repo and share your contribution. I also riff more on Solid, this article and the approach of DN if you're interested, here: https://github.com/DO-SAY-GO/dn/wiki/What-is-DiskerNet-and-h...
How about we go back 20yr and train a generation of unix sysadmins and self host at companies and at home.