Scaling Embeddings Outperforms Scaling Experts in Language Models
1 points
1 hour ago
| 0 comments
| arxiv.org
| HN
No one has commented on this post.