Respectfully, please, this below is an absolute joke--has it changed in a decade?
Image: Apple provides easy changing for Google, Yahoo, Bing, DuckDuckGo, and Ecosia. (That poor paid search engine that starts with a K has to mess around with extensions; I'm unaffiliated.)
Plus, even though I have it set to DuckDuckGo, when I ask Siri to "search {query}", it searches Google. So even the default I have set is not actually truly the all-around default. Embarrassing to have locked this down as if I didn't drop a grand on the phone.
-
Fun fact: for years now, asking Siri to "search Google Images" results in whitelabeled Bing Images (thankfully, exceptionally easy to remedy with the excellent Shortcuts: "Picture-Search" for Google, quality difference night & day unfortunately... anyway go SearXNG!, it lets you keep your soul).
—-
Oops btw I said “exceptionally easy to remedy” when that isn’t true because I can’t say “{shortcut name} {query}”; must be two separate commands. Not first class unless I want Bing Images hehe (“search images {query}”, single dictation to Siri).
Google is very interested in knowing about whatever you're interested in, and in knowing when, how often, and for how long you're interested in those things. In addition to looking at their search engine, their ads, and their recommendations, you're also feeding them more and more data about you.
I think I remember Kagi was working on a way to allow users to create an anonymous account and if so I'll have to take another look at them.
One secret here is to have a good UX for adding metadata. For example, in obsidian a search window pops up when you write `#[[`. Or when you type `#` to create a tag, a window with all preexisting tags shows up.
However, lately I've been working on a new side project in order to additionally automatically record/collect what I am doing on digital devices. Basically I am building a "personal" spyware/data collection software suite. Kind of in the same realm as ms recall but more focused on security/privacy with sensible cryptographic defaults where needed.
https://timelinize.com/ https://github.com/timelinize/timelinize https://news.ycombinator.com/item?id=45504973 https://perkeep.org/ https://github.com/perkeep/perkeep https://news.ycombinator.com/item?id=45896130
Links are Website, Repo, HN discussion
Although usually a bottom-up approach using automatically updating `Map of Content` notes (Bases) work well for me for finding content.
```
---
aliases:
- <%tp.file.title%>
tags:---
%%
[[<%tp.file.creation_date("YYYY-MM-DD")%>]]
%%
```
<%tp.file.title%> for aliases, it is necessary to always refer to the alias [[note|note alias]] notes in the text (if I refer without a pseudonym, then by accidentally or intentionally changing the name of the note, I can ruin the text in all places where it occurs
For example, when I add a link to the author to a book note and use keyboard shortcuts to create a note page for the author, the following template is used:
```md
---
aliases:
- <%tp.file.title%>
tags:- t3/books
- people
- t3/author
---
%%
[[<%tp.file.creation_date("YYYY-MM-DD")%>]]
%%
### Works
```base
views:
- type: table
name: Table filters:
and:
- file.hasLink("<% tp.file.find_tfile(tp.file.folder(true) + "/" + tp.file.title + ".md").path %>")
- file.hasTag("t3/books")
sort: - property: file.name
direction: ASC
```Tags in metadata do not need the # symbol, although you can use it if you enclose the entire tag in quotation marks.
They have a bookmarklet that sits on my bookmarks toolbar and if I like a page/tweet/video etc I just hit the "Add pin", enter some tags and hit enter.
This works so well that I went through and bookmarked and tagged all of my LinkedIn connections as well (inspired by a post from Derek Sivers [1]).
People are generally amazed at how quickly I can go from talking about a subject to "oh, I have this article you would love" to "here it is!"
https://www.stavros.io/posts/i-made-a-voice-note-taker/
I usually forget what steps I've taken, but using the recorder above, I can dictate short clips of the steps. An LLM assistant I've built takes the clips and adds them to my Joplin, which then gets published:
It's been extremely helpful for keeping logs.
For example, Syncthing on Debian notes [1] or using Spleeter AI to remove background sound from a long audio track [2]. This is why I switched back from static site to a Wordpress-like site [3], so that I can quickly publish notes from my phone.
[1]: https://huijzer.xyz/posts/149/setup-a-syncthing-service-on-d...
[2]: https://huijzer.xyz/posts/146/installing-and-running-spleete...
https://shapeokoenthusiasts.gitbook.io/shapeoko-cnc-a-to-z/
which re-worked the essentials from that wiki, discarded the chaff, and has become a reference which a number of projects have re-purposed. I did resurrect the notes aspect on the /r/shapeoko wiki though.
Similarly, when I wanted to set up the ultimate commuter/long-haul mountain bike, I put down all the gear I learned about at:
http://old.reddit.com/r/bicyclegear/wiki
(probably out-of-date now, but I found the notes useful)
Unfortunately, I've lost access to the two e-mail archives from when I worked as a graphic designer/typographer --- really should have forwarded any notable e-mails (which I would have wanted to refer to later) to myself --- at least one of them wound up being printed out by a startup composition house and distributed to new employees.... maybe one of these days I'll finish the type composition book I was asked to write by an editor at a major publishing house.
For now, I've been working on:
https://willadams.gitbook.io/design-into-3d and https://github.com/WillAdams/gcodepreview
(I know Tiddlywiki is just an html file) but it syncs it and makes using it quite smooth. I think it can be synced with iCloud or whatever
Tried Evernote and tagging and so on and it turns out cataloging stuff is hard, and the lazy recourse is to over-tag, and then I end up doing a brute force search.
Rather than just coalescing to markdown files, the memory-zet plugin looks for actionable durable information and files it inside the existing zettelkasten system with embeddings - a quick no-LLM step (well 300m parameter query embed, it’s fast) is run against incoming chats or as a tool - this returns cards (zettels).
Zettels are somewhat unique in that the original methodology included a post-writing categorization and linking step - I have the system doing this as well. Result - cards can give you a (possibly cyclic) directed graph of connectivity. I built it for ‘centaur’ mode, so I can edit, link, unlink, move, etc through a nice little web interface.
The auto links are not the same quality I would make. But they are genuinely useful; upshot is for anything incoming, the LLM can see information directly about the query (if we have it), stuff that relates whether or not it embeds similarly, and can follow up links if they look promising with a fast tool call.
I made this memory system my daily driver yesterday; so far it is a significant improvement over the core memory extension (write to markdown files, don’t worry about compaction bro, it will be fine)!
It’s already building out people and organizational card bases for things that come in via email and whatsapp - this is a dream, basically. I think it will scale over time - but it’s at least scaling nicely over a few days of work right now.
I’d like to add, that by forcing myself to look up the answer every time I have happy accidents where I learn new ways to do things.
It’s a skill to be willing to unlearn and always presume yourself ignorant, even if you do know how to do it. It’s like confirming “is my way still best practice.”
I think the essay will be something like: adding structure post-hoc lets you build intelligence into the datastore as an architectural matter, not just rely on connections being made during use-time inference, using an embedding with links like this is much different than bulk embedding search, and we need some sort of tests to understand if this helps in practice, although it a) feels pretty good and b) it’s VERY nice to be able to refer to and modify the agents “mid term” memory directly in any event.
Anyway you’ve triggered me enough to say I’ll try and get the repo published today so people can look at it.
Some 80,000+ files in a directory represents an awesome database of knowledge. "$ ls inux" to find anything Linux-related, etc.
One of these days I'll get around to setting up some ML tool that will tell me all the things I didn't already osmose from the archive .. and maybe long after I'm gone, in some hole in a wall of some grimy back alley somewhere, there'll be a ML version of me embedded in a brick, ready to have the conversation well into the future ..
A lot of other info is stored in the PDF metadata, too ..
Whenever I do something and realize I might need it in the future, I just store it on corresponding projects.
Seems to be serving well to me for some time.
Anything else is a bandaid.
Check for yourself: mmap does not occur in the C standard document: https://www.dii.uchile.cl/~daespino/files/Iso_C_1999_definit...
I also try to add any other commands to it as they come up. So much easier to run 'make install' whenever I pull a project than have to remember the commands.
Even if I can't always add the process I will use a bunch of echo's to bring me through the steps.
I haven't found a way to automate this import of my data, but most of the magic is in the history not in the present. It really is incredible. I'll ask the claw to find what I said about the SFPD cruiser I once saw in the TL and boom! It's there! A mild annoyance with using my Mediawiki-based blog (which I chose because it has good support for allowing users to edit it) is that authoring is still a lot of work and I keep forgetting Draft namespace articles.
and you fell for it ;)
Am I the only one who gets physically ill listening to themselves speak? =)
I imagine it sort of like an eye test. Play two samples, A/B test in rapid succession until you get a match.