Qwen3-VL can scan two-hour videos and pinpoint nearly every detail
6 points
by thm
3 hours ago
| 1 comment
| the-decoder.com
| HN
thot_experiment
1 hour ago
[-]
anyone have a tl;dr for me on what the best way to get the video comprehension stuff going is? i use qwen-30b-vl all the time locally as my goto model because it's just so insanely fast, curious to mess with the video stuff, the vision comprehension works great and i use it for OCR and classification all the time
reply