FilterHN
vLLM: Easy, Fast, and Cheap LLM Serving with PagedAttention
20 points by jxmorris12 4 days ago | 2 comments | blog.vllm.ai
mdaniel 1 day ago
With all the claims of 10x, I wish they'd point the AIntern at their docs, because they're just shameful:
https://docs.vllm.ai/en/stable/cli/index.html#serve
downrightmike 2 days ago
*2023