A polynomial autoencoder beats PCA on transformer embeddings (ivanpleshkov.dev)
18 points | 2 days ago | 3 comments
yobbo
14 minutes ago
My understanding after scanning the code examples is that the technique expands each data point with the quadratic terms of its existing dimensions, which is why it sounded to me like kernel PCA.
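Roughly what I have in mind, sketched in scikit-learn terms (this is not the article's code; the data, dimensions, and component counts are made up): explicit degree-2 feature expansion followed by plain PCA works in essentially the same feature space that kernel PCA with a polynomial kernel uses implicitly.

    # Sketch only: explicit quadratic expansion + PCA vs. kernel PCA with a
    # degree-2 polynomial kernel on the raw vectors. Shapes are illustrative.
    import numpy as np
    from sklearn.decomposition import PCA, KernelPCA
    from sklearn.preprocessing import PolynomialFeatures

    rng = np.random.default_rng(0)
    X = rng.normal(size=(1000, 32))          # stand-in for transformer embeddings

    # Expand each vector with its quadratic terms (x_i * x_j), then reduce.
    X_quad = PolynomialFeatures(degree=2, include_bias=False).fit_transform(X)
    Z_explicit = PCA(n_components=16).fit_transform(X_quad)

    # Kernel PCA with a polynomial kernel uses the same kind of feature space
    # implicitly, without materializing the expanded vectors.
    Z_kernel = KernelPCA(n_components=16, kernel="poly", degree=2).fit_transform(X)

    print(Z_explicit.shape, Z_kernel.shape)  # (1000, 16) (1000, 16)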
magicalhippo
1 hour ago
I'm just a casual LLM user, but your description of the anisotropy made me think of recent work on KV-cache quantization such as TurboQuant, where a random rotation is applied to each vector before quantizing, precisely (as I understood it) to make the distribution more isotropic.

But for RAG that might be too much work per vector?
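For reference, the rotate-then-quantize idea as I understood it, sketched with a plain random orthogonal matrix and int8 scalar quantization (this is not the actual TurboQuant algorithm; the dimension and scaling scheme are made up):

    # Sketch of rotate-then-quantize: one shared random rotation spreads energy
    # across coordinates, so per-coordinate scalar quantization copes better
    # with anisotropic vectors. Not the TurboQuant implementation.
    import numpy as np

    rng = np.random.default_rng(0)
    d = 64
    # Random orthogonal matrix via QR decomposition of a Gaussian matrix.
    Q, _ = np.linalg.qr(rng.normal(size=(d, d)))

    def quantize_int8(v):
        scale = np.abs(v).max() / 127.0
        scale = scale if scale > 0 else 1.0
        return np.round(v / scale).astype(np.int8), scale

    def dequantize(q, scale):
        return q.astype(np.float32) * scale

    x = rng.normal(size=d) * np.linspace(0.1, 5.0, d)   # deliberately anisotropic
    q, s = quantize_int8(Q @ x)                         # rotate, then quantize
    x_hat = Q.T @ dequantize(q, s)                      # dequantize, rotate back

    print(np.linalg.norm(x - x_hat) / np.linalg.norm(x))  # relative error

A dense rotation costs one d x d matrix-vector multiply per vector; structured rotations (e.g. a randomized Hadamard transform) are the usual way to make that cheaper, which is presumably what the per-vector cost question comes down to.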

pleshkov
2 days ago
Author here — questions and pushback both welcome.