OpenAI model for masking personally identifiable information (PII) in text
38 points
13 hours ago
| 7 comments
| openai.com
| HN
stratos123
3 hours ago
[-]
There are some interesting technical details in this release:

> Privacy Filter is a bidirectional token-classification model with span decoding. It begins from an autoregressive pretrained checkpoint and is then adapted into a token classifier over a fixed taxonomy of privacy labels. Instead of generating text token by token, it labels an input sequence in one pass and then decodes coherent spans with a constrained Viterbi procedure.

> The released model has 1.5B total parameters with 50M active parameters.

> [To build it] we converted a pretrained language model into a bidirectional token classifier by replacing the language modeling head with a token-classification head and post-training it with a supervised classification objective.

reply
hiAndrewQuinn
10 hours ago
[-]
I'm surprised nobody else has commented on this. This is a very straightforward and useful thing for a small locally runnable model to do.
reply
apothegm
9 hours ago
[-]
And also something that it’s dangerous to try to do stochastically.
reply
hiAndrewQuinn
9 hours ago
[-]
It's going to be stochastic in some sense whether you want it to be or not, human error never reaches zero percent. I would bet you a penny you'd get better results doing one two-second automated pass + your usual PII redaction than your PII redaction alone.
reply
cyanydeez
3 hours ago
[-]
I think the problem is most secrets aren't stochastic; they're deterministic. When a user types the wrong password, it should be blocked, full stop. A probabilistic filter implies an attacker no longer needs to be exactly right, just really close.

Sure, there's some math that says the gap between really close and exact isn't a big deal; but then you're also saying your secrets don't need to match exactly when they're checked, and right now they absolutely do.

It looks like a privacy veil that sorta works for some things, like frosted glass. But picture a toilet stall made entirely of frosted glass: are you still comfortable going to the bathroom in there?

reply
ashwindharne
10 hours ago
[-]
Same here, this is an incredibly useful thing to have in the toolkit
reply
mplanchard
2 hours ago
[-]
It would be nice if their examples weren’t mostly things that are easy to catch with regex, but it’s cool to see it released as an open, local model.
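For context on the "easy to catch with regex" point: the structured formats in most of these demos can be redacted with a few patterns. A sketch assuming simple US-style formats (real redaction needs far more care, which is the model's selling point for free-form PII like names and addresses):

```python
import re

# Hypothetical pattern-based redactor for rigidly formatted PII.
# Patterns assume US-style formats and are illustrative only.
PATTERNS = {
    "EMAIL": re.compile(r"\b[\w.+-]+@[\w-]+\.[\w.-]+\b"),
    "SSN": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
    "PHONE": re.compile(r"\b\d{3}[-.]\d{3}[-.]\d{4}\b"),
}

def redact(text: str) -> str:
    """Replace each matched span with its label in brackets."""
    for label, pat in PATTERNS.items():
        text = pat.sub(f"[{label}]", text)
    return text
```

Where a learned model earns its keep is the cases no fixed pattern covers, e.g. distinguishing a person's name from a company name in running text.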
reply
Havoc
3 hours ago
[-]
50M effective parameters is impressively light. Is there a similarly light model on the prompt injection side? Most of the mainstream ones seem heavier
reply
ndom91
2 hours ago
[-]
Where's the gguf from Unsloth and co?
reply
y0eswddl
9 hours ago
[-]
crying_laughing_emoji.gif

yes, please, feed daddy AI all your PII

reply
klauserc
9 hours ago
[-]
Was my first thought as well, but this is an open weights model. You can run it on your own hardware.
reply
7777777phil
7 hours ago
[-]
> The model is available today under the Apache 2.0 license on Hugging Face and Github.

Bringing the Open back to OpenAI...

reply