This is the first decent video generation model that runs on consumer hardware. Big deal and I expect ControlNet pose support soon too.
With static images, we always look for eyes.
With video, we always look for dancing.
It can leave LLMs behind...
'Cause LLMs don't dance, and if they don't dance, well, they're no friends of mine.
Edit: I didn't realize that this was actually a reference to Men Without Hats - The Safety Dance. I was referencing a different parody/allusion to that song!