FilterHN

Qwen3-VL can scan two-hour videos and pinpoint nearly every detail

6 points

by thm

3 hours ago

| past

| 1 comment

| the-decoder.com

| HN

▲

thot_experiment

1 hour ago

[-]

anyone have a tl;dr for me on what the best way to get the video comprehension stuff going is? i use qwen-30b-vl all the time locally as my goto model because it's just so insanely fast, curious to mess with the video stuff, the vision comprehension works great and i use it for OCR and classification all the time