Bringing Up DeepSeek-V4-Flash on AMD MI300X
75 points
by kkm
6 hours ago
| 5 comments
| fergusfinn.com
| HN
maCDzP
3 hours ago
[-]
I train on AMD MI250X and managed to get Gemma 4 31B to work - but it took a lot of work on the software side.
reply
kkm
3 hours ago
[-]
This is very interesting, planning to write about it?
reply
mezark
5 hours ago
[-]
We at doubleword are bullish for AMD for low-interactivity inference - it does just take a bigger lift on the software side...
reply
brcmthrowaway
3 hours ago
[-]
Are you long AMD?
reply
kkm
5 hours ago
[-]
Also the vllm patch accompanying the blogpost: https://github.com/doublewordai/vllm-amd-blog-doubleword
reply
latchkey
17 minutes ago
[-]
Nice work and thanks for being a customer.

(CEO Hot Aisle)

reply
benlm
5 hours ago
[-]
Nice work! Would DeepSeek V4 Pro on 8xMI300X work with these patches?
reply