tried https://github.com/unum-cloud/uform which i do like, especially they also support languages other than English. Any recommendations on other alternatives?
https://github.com/mlfoundations/open_clip/blob/main/docs/mo...
When embeddings are quantized to int8 they still work very well for similarity (no differences in top 10 search on my test set). I haven't tried quantizing the models themselves.
My immediate question is: Why not classify among the entire hierarchy of all Wordnet synsets?
---
https://github.com/glassroom/heinsen_tree#sample-usage-with-...
It worked for me, but I had to modify the code to use all hypernym paths, giving me 147,200 classes, one per path. English only. For synsets with more than one path, I split target probability mass over their paths. For prediction, I added the predicted probs of hypernym paths ending at each synset.