End-to-end model that listens, sees, thinks and responds on video in real time
1 points
2 hours ago
| 1 comment
| twitter.com
| HN
linzhangrun
2 hours ago
[-]
How is the first token latency for real-time scene processing being addressed?
reply