FilterHN

Show HN: Terminal Phone – E2EE Walkie Talkie from the Command Line

322 points

1 month ago

| 16 comments

TerminalPhone is a single, self-contained Bash script that provides anonymous, end-to-end encrypted voice and text communication between two parties over the Tor network. It operates as a walkie-talkie: you record a voice message, and it is compressed, encrypted, and transmitted to the remote party as a single unit. You can also send encrypted text messages during a call. No server infrastructure, no accounts, no phone numbers. Your Tor hidden service .onion address is your identity.

▲

Pinkert

1 month ago

[-]

Using a v3 onion address as both the cryptographic identity and the NAT traversal layer is such a clean architectural choice. No STUN/TURN servers, no hole punching, you just boot the script and Tor handles routing.

For those who use Tor regularly for things other than web browsing: how bad is the real-world latency for pushing a ~20KB Opus audio chunk over Tor these days? Are we talking a 2-3 second delay, or is it much worse?

▲

smalltorch

1 month ago

[-]

The real world delay is about 2-3 seconds your spot on. I initially started with a full duplex version but it was absolutely terrible. Walkie talkie kinda forces the recieve, listen, response from the users so the latency isn't as much of an issue.

▲

ale42

1 month ago

[-]

Is audio transmitted while it is being recorded or afterwards? Is it played before everything is received or is everything buffered? In the later case, I find it more akin an audio message on Signal or similar, than as a walkie-talkie, which is much more "dynamic".

▲

smalltorch

1 month ago

[-]

It's not streamed. It gets recorded, compressed, (voice effects if you want), encrypted on device, then piped through, reverse process, auto played on reciever end.

Also, once it's decrypted and played back, the message gets destroyed.

▲

iamnothere

1 month ago

[-]

Small suggestion, maybe you should send a “key down” notice when you begin recording, that generates a subtle sound on the receiving end. This would act as something like a typing indicator on a text messaging client.

▲

smalltorch

1 month ago

[-]

Thats a great idea.

▲

bzmrgonz

1 month ago

[-]

Can you tell us which ai minion(s) helped you with this?

▲

smalltorch

1 month ago

[-]

This is included in 1.1.4. call interface now displays when other side is recording and optionally configure a preset chimes or record a custom notification sound.

When remote is detected as recording this sound will play if the setting is enabled.

▲

jetbalsa

1 month ago

[-]

there are ways to help with lag a bit, you can choose the number of hops a HS uses when meeting up. but of course that comes with downsides

▲

smalltorch

1 month ago

[-]

I'm going to look into this. I guess you loose anonymity, but in scenerios where that doesn't matter could be a fine trade off for speed.

▲

observationist

1 month ago

[-]

Could you use tor to establish a better realtime link using a different protocol like Veilid while maintaining the relative anonymity and security?

edit: https://veilid.com/

Added link for clarity. Seems like you could get more or less realtime, udp streaming, full duplex communication . Once you have the first part of that built, then adding things like voip or video calls or what have you becomes a lot easier.

▲

smalltorch

1 month ago

[-]

The problem is the UDP packets from the application will ultimately be in a TCP wrapper over tor. Which will add even more latency.

I know you can set up mumble over Tor but it's going to have the same latency drawbacks.

Another commenter noted the ability to configure only 1 hop instead of the standard 3. I wonder how much latency would be gained back. I want to play around with this.

▲

observationist

1 month ago

[-]

Sorry if I was unclear, this stuff does get tangled, lol. I was talking about setting up a parallel veilid connection - a private route exchange between the two ends that goes straight over the veilid network, outside of tor. It's got comparable privacy and security assurance. There only a few thousand nodes/peers right now, but it's gaining steam.

You could use tor to anchor a common relay, then do key and route exchange to establish veilid, then the realtime app uses that for a very secure private route unique to the two endpoints.

From what I can tell, because of the design, veilid would be excellent, comparable to many commercial voip offerings, with 150-500ms latency . More nodes and users would quickly ramp that up. There are a ton of downstream benefits of something like this; faster file exchange for ipfs, bittorent, etc, that doesn't gum up the tor layer, a kind of implicit defense in depth, but also low latency, efficient peer to peer routing.

It'd almost be zero knowledge by construction; you could build a multi-hop escrow style key exchange.

If Musk integrated tor relays and veilid peers throughout Starlink, you could get near-native latency for wherever you connect in the world, more than enough to add timing attack noise and layer in other security features.

▲

nunobrito

1 month ago

[-]

STUN/TUN are important because of bandwidth. With STUN the bandwidth used is only between the two connected devices, with VPN like Tor there is a bandwidth cost on all the servers where this data is passing. This is a big blocker for anyone hosting the service on a VPS with a few GB of traffic data per month.

▲

medi8r

1 month ago

[-]

Why not stream anyway? adding to the latency by turning it to audio messages sounds more frustrating. At some point a message would be better.

Modulo cool project love show HN etc.

▲

idiotsecant

1 month ago

[-]

Beep boop

▲

iamnothere

1 month ago

[-]

Very cool, happy to see more IRL applications of onion services as a backend. Arti onion client support should soon be available, which will make Tor embeddable in applications as a Rust library. Hopefully this encourages even more usage.

More applications using the network means more cover traffic as well.

▲

xnyan

1 month ago

[-]

> More applications using the network means more cover traffic as well.

Agree. The biggest barrier for me using Tor is the perception held by many IT admins is that Tor is synonymous with nefarious. It makes using it inconvenient or impossible in many highly controlled network environments such as enterprise, public access wifi, etc.

▲

lxgr

1 month ago

[-]

> 21 curated ciphers are available

Why!? That sounds like approximately 20 too many.

▲

smalltorch

1 month ago

[-]

The library is openssl and that comes with all these ciphers available. No other reason than because we can!

I wish AES-GCM was available...but openssl can't do it on its own without further dependencies to parse the authentication correctly.

Really this whole layer is complelty redundant actually. It's already E2EE without openssl via Tor. I like that it's encrypted before I hit the network pipe though.

▲

john_strinlai

1 month ago

[-]

>No other reason than because we can!

great attitude for approximately everything except, perhaps, cryptography.

especially since the initial encryption is mostly redundant, i would encourage that you, at some point, consider reducing the number of ciphers.

▲

inigyou

1 month ago

[-]

If a library doesn't do what you need, you need a different library, but this is impossible from a short bash script, so it's one of the tradeoffs of your design.

▲

lxgr

1 month ago

[-]

> No other reason than because we can!

Then maybe your scientists should spend some time to stop and consider whether they should ;)

But seriously, I'd just limit this to one option on the selection side, even if you continue supporting more than that at the protocol level for cryptographic agility.

▲

fc417fc802

1 month ago

[-]

I don't see the issue. "Anything that openssl actively supports" plus providing a default seems like an extremely reasonable stance to take.

▲

lxgr

1 month ago

[-]

“Supported by OpenSSL” is not a seal of quality in any sense.

It still supports a bunch of outdated crap including (on my system) RC4, RC2(!) and DES (yes, the 56 bit key one, not just 3DES).

▲

fc417fc802

1 month ago

[-]

Fair point. But what I'm getting at is that if you aren't an expert on cryptography (and perhaps even if you are!) rather than imposing your personal preferences on others simply deferring to a trusted third party library can make a lot of sense.

So in addition to a sensible default I guess it would also be a good idea to tag choices that you believe to be outdated with a large warning. That way you haven't rolled your own crypto, you haven't forced your views on others, but you have done your best to enable end users to operate your tool in a sensible manner.

▲

xnyan

1 month ago

[-]

>reasonable stance

Within the last 12 months, I had to write a script for a buddy at work that turned off availability of freaking freaking 56 bit DES in OpenSSH, which was available because was provided by openssl. I'm certain it was still there to provide compatibility for something(s) critical out there that depends on it, and while I can't imagine why anybody would choose to use it, it's there and it's awful.

▲

Bender

1 month ago

[-]

I would rather avoid cipher fixation. Give me thousands of protocol / cipher / mac / mode combinations. Fixation only benefits nations wanting to crack something.

▲

inigyou

1 month ago

[-]

Agility benefits nations wanting to crack something, because they can force you to pick an insecure combination. This has happened in the real world several times before.

▲

Bender

1 month ago

[-]

And what they will get is "haha! It was only 230 hours and we cracked their ... oh, there's another dozen sparse files inside that and it's a different combo..." I can automate encrypted sparse files all the way down each showing a size between 5PB and 40PB which a lot of forensic software will try to copy as a non sparse file.

▲

Bender

1 month ago

[-]

I think that's great. Cipher fixation is a vulnerability as the enemy knows what to attack.

▲

lxgr

1 month ago

[-]

This understanding of cryptography is so outdated that we don't even have a color photograph of the person first refuting it: https://en.wikipedia.org/wiki/Kerckhoffs%27s_principle

▲

Bender

1 month ago

[-]

Adding to that, cryptography is just mathematical obfuscation and often repeated here is that security through obscurity is not security at all. I will stick with my own principals of not fixating on a cipher. The only people that benefit are lazy spooks.

Rather than what is accepted as the strongest ciphers I prefer ciphers not optimized by CPU's and GPU's. Spooks will have to cycle through every combination of protocol + cipher + mac + mode + passphrase + whatever other obfuscation I shim inside that tunnel. Keep 'em on their toes. Even better I will also cycle through these encoding methods [1] if I am in a good mood otherwise I will rot13 their ass and then force them to use a Drogan’s Decoder Wheel.

[1] - https://github.com/qntm/base2048

▲

sadeshmukh

1 month ago

[-]

https://xkcd.com/538/ comes to mind

▲

aitchnyu

1 month ago

[-]

Tangential, did Gitlab become faster than a while back or is it an illusion from their lazy loading?

▲

marcosqanil

1 month ago

[-]

I love this. In your view, how would users go about securely swapping credentials ? PGP over email ?

▲

smalltorch

1 month ago

[-]

Thanks! My realistic use case is that I am already speaking to someone who I know and trust, so ideally exchange credentials in person. A preferred out of band secure messanger of choice is probably fine.

▲

deadbabe

1 month ago

[-]

What do you guys talk about?

▲

smalltorch

1 month ago

[-]

I have my wife's phone set up on autolisten running in the background, so I just pop in and ask how her days going and crack jokes.

▲

clouedoc

1 month ago

[-]

That's funny but it must absolutely drain the battery of her phone, no?

▲

smalltorch

1 month ago

[-]

So far it's lasted all week with maybe 10% -15% loss per day. It's not her main, actually just a old phone I had laying around.

I think it's a pretty light background process.

▲

rsync

1 month ago

[-]

You could put your onion address into an “oh by code”[1] and just write it down … or chalk it on the sidewalk for someone to see … post it on a physical bulletin board.. hold it up on a sign…

This way you could establish communication with an unknown future party, totally offline.

[1] https://0x.co

▲

fc417fc802

1 month ago

[-]

Trying to repurpose hex literal notation as a "recognizable" URL shortener seems like a questionable idea. At least write it as 0x.co/FFFF so it's obvious to readers how to interpret it.

If you're printing something why not go with a QR code?

▲

rsync

1 month ago

[-]

If you can use a QR code you probably should.

However, if you're walking down the street and need to quickly generate and apply a message, how will you pass along a QR code to an unknown future viewer ?

Can you draw a QR code with chalk or freehand with a pen, etc. ?

I will admit that the use-cases for "oh by codes" are weird and infrequent but I am convinced they will emerge ...

▲

fc417fc802

1 month ago

[-]

I don't disagree that URL shortening is incredibly useful at times. Merely that writing out the whole url is almost certainly a better approach and that any sufficiently short domain name is fit for purpose.

▲

rustyhancock

1 month ago

[-]

> Exclude Countries -- Exclude specific countries from your Tor circuits. Presets for Five Eyes, Nine Eyes, and Fourteen Eyes alliances, or enter custom country codes. Uses ExcludeNodes with StrictNodes in the torrc.

Interesting that people do this, I wonder how much it improves security? Afterall, any serious surveillance would involve running relays and exits in foreign lands.

▲

smalltorch

1 month ago

[-]

This was another one of those things I built in because we can. I really don't know... But the Tor developers built this in as an option on the torrc so there must be something to it. We know there are definitely compromised nodes...I think it's just neat that you can have that level of control regardless if it's effective.

▲

kortilla

1 month ago

[-]

It might not help for controlled nodes, but it does help avoid ISPs controlled by said governments from seeing it

▲

smalltorch

1 month ago

[-]

For anyone following 1.1.5 now has support for group calls. Now use terminalphone as a dedicated relay. All users connect to the relay and the messages will be broadcasted to everyone in the group.

A mobile relay should be able to handle 3-5 users nicely. A dedicated machine with a stable connection should be preferred.

You can act as the relay and caller by running two instances, you will need to change your socks port of the second instance so that you can have two addresses.

Relay does not need to have the shared secret, it is simply forwarding payloads, and broadcasting connected client counts.

▲

nullc

1 month ago

[-]

Since you're not realtime you could also have a configurable playback speed on the rx side or processing that removes gaps to make it go faster. This would improve the latency while maintaining the whole store/forward design, and would also let a recipient get more than 100% audio (e.g. from multiple people sending to them).

You're using opus but you might be interested in abusing the DRED error correcting scheme (which is an experimental part of opus) in it as a codec, as it does pretty good sounding speech at 2kbit/sec. You could send the dred first then the opus compressed audio so that if tor craps out before the transmission completes the receiver still get the audio. (A step further would be to run automatic speech recognition, a send text, dred, then opus. :P ).

▲

smalltorch

1 month ago

[-]

I am totally interested in a 2kbs encoding if that's possible. I didn't think opus could encode below 6. Making this as slim as possible is definitely what I'm aiming for. Really though the latency basically all coming from Tor.

Someone also suggested that you can configure Tor to take only one hop. You loose anonimity but gain speed right away. May be something to look into as optional setting.

I also learned today I can pipe direct binary without encoding base64. This will chop overhead right away. I didn't think it was possible to pipe through bash but I was using the wrong command.

I do plan to continue to optimize that's great feedback thanks!

▲

mrexcess

1 month ago

[-]

Looks awesome in many ways. The use of a shared secret instead of PKI limits the real-world applications pretty severely, but adding PKI support doesn't seem too difficult. If the PKI key was only used to establish the session "shared secret", virtually no changes would be needed in the main code.

Thanks for contributing!

▲

smalltorch

1 month ago

[-]

This would be a great improvement and I'm going to look into how to implement!

The most obvious path is just integrating the authorized clients Tor has already built in. A way to export these keys efficiently to your intended recipient.

▲

chasd00

1 month ago

[-]

Forgive my ignorance, but can this be setup for a group like how a group can all be on the same frequency with walkie talkies? Or it is strictly one to one. Either way, it’s a really cool concept.

▲

smalltorch

1 month ago

[-]

It's strickly 1 on 1 for now but I do plan on exploring the group call scenario.

▲

bzmrgonz

1 month ago

[-]

I don't think E2EE works that way.

▲

smalltorch

1 month ago

[-]

It actually can since its just symmetric encryption. Any key holder could decrypt the payload. In fact, the channel could simply be the shared secret.

Let's say we have 10 people in a call, 5 share a key and the other 5 share a different key. Without the shared key audio simply will not decrypt. You could have two private channels with one host.

▲

cl3misch

1 month ago

[-]

I think it does? How would Whatsapp or Signal group chats work then?

▲

oybng

1 month ago

[-]

Looks fun, I've yet to test it but I did skim it.

'|| true' 76 matches 'echo ""' 50 matches ' [ ' 261 matches '=$(' 90 matches

▲

nebezb

1 month ago

[-]

Oh I’m curious. Love bash, and learning new things about it.

I can understand why [ is not ideal. Can you explain the rest to me? I use || true for custom error handling often (with the right set -euo pipefail of course)

▲

m00dy

1 month ago

[-]

I tried using https://letscage.com for this. Almost same design but in rust

▲

sailorganymede

1 month ago

[-]

I worked on text chat ages ago over TOR. Honestly so happy to see that the ecosystem is still going!

▲

smalltorch

1 month ago

[-]

You may like this one then. It's kinda the same thing, but text only and multiple people can connect at once. It's setup so anyone can be a host, or a client.

Basically IRC, but for Tor.

https://gitlab.com/here_forawhile/torch

▲

Tepix

1 month ago

[-]

Interesting to implement this as a shell script.

Still: Using a line based protocol and base64 encoding the audio data? Not my first choice.

The README doesn't mention it, but I assume both parties have to be online at the same time?

Regarding encryption - what's the point? When communicating with a tor hidden service, the data is already encrypted.

Only starting the sending audio data after the speaker has stopped talking means much longer delays than necessary. Imagine someone talking for a minute.

▲

smalltorch

1 month ago

[-]

To expound on the other questions.

To receive a call, you either need to be online and actively listening for calls, or optionally, you can enable auto listening. When another user calls you it will automatically put you in the call. On end call you will be put back in listening mode. I'm not really sure a great way to get around this without overly complicating it.

I believe because of the small overhead that's added there is just no reason not to layer encryption. At the end of the day I just wanted to see the bits I'm sending over the wire with my own eyes for assurance it's protected regardless of the fact that tor is protecting the data.

The streaming would be a nice improvement for latency. I would have to look into how this would work for the optional audio processing. Having one set file for transport also simplifys the some of the flow with encryption like salting and optional hmac authentication as these are derived from the sum of the entire file, not a portion of it.

▲

fc417fc802

1 month ago

[-]

> salting

Do you mean IVs? Can't you (for most algorithms) just use a monotonic counter when streaming blocks?

> optional hmac authentication

Wouldn't that just be done per-chunk instead of per-file?

▲

Bender

1 month ago

[-]

the data is already encrypted

by the spooks that wrote it. no harm in having another turtle in the stack.

▲

Tepix

1 month ago

[-]

If you don’t trust tor, why use it?

▲

Bender

1 month ago

[-]

I don't trust anything or anyone. Rather I just use defense in depth and assume someone at some point will get access to the data.

▲

Tepix

1 month ago

[-]

Well, it's not adding post-quantum crypto which might be more important than yet another layer of AES.

▲

smalltorch

1 month ago

[-]

The base64 encoding adds about 30% overhead. It's not ideal but it was a limitation of bash. Passing raw binary does not work in bash (or I couldn't get it to work).

▲

extraduder_ire

1 month ago

[-]

What exactly was the problem you ran into? I've run binary through pipes just fine before.

▲

smalltorch

1 month ago

[-]

your right it's not a problem. This has been implemented since v1 and I haven't really been focused to much this. Trying to decide if I should remove this step for future versions. It's a clear optimization but Im thinking it should at least be backwards compatible with old versions.

It's not really a latency saver but it definitely reduces load on the network.

▲

snthpy

1 month ago

[-]

I was also curious about the base64 encoding in the stack. I'm not knowledgeable in this area though so it was more for my own education than questioning the choices.

▲

smalltorch

1 month ago

[-]

Its a product of me troubleshooting why my audio pipe wasn't working in early prototyping. I tried quite a few things and the first time I got the successful loopback on a remote device I had implemented the base64 and it solved the piping errors I was getting.

Turns out bash totally can pipe raw binary you just need to appropriately wrap the blob with the correct command.

By the time I had the working pipe I was in feature building mode.

▲

ProofHouse

1 month ago

[-]

This is rad

▲

sourcegrift

1 month ago

[-]

Sorry for hijacking but I came across a firefox send replacement which worked in linux command line. Anyone know what it was? (It was online though, as in no storage for later)

▲

zie

1 month ago

[-]

Firefox Send is still around in hobbyist land:

send.vis.ee along with ffsend[0] maybe?

0: https://github.com/timvisee/ffsend

I love and use ffsend every day.