GPT-Realtime-1.5 Released
3 points
1 hour ago
| 1 comment
| twitter.com
| HN
hectormalot
1 hour ago
We're doing a lot with the realtime models. Happy to see a new release.

Initial feel from a few calls is that it performs better with alphanumeric inputs. Voice seems consistent. Recognition seems somewhat better on a few tests; in particular, it did much better on the two 8-bit, 8 kHz mu-law calls I tried.
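
For anyone unfamiliar with the format: 8-bit mu-law is the companded telephony encoding from G.711, where each byte expands to roughly 14 bits of dynamic range. A minimal sketch of the decode step, assuming standard G.711 mu-law (the function names are my own and have nothing to do with the Realtime API):

```python
BIAS = 0x84  # bias constant used by the standard G.711 mu-law expansion


def mulaw_to_pcm16(byte: int) -> int:
    """Expand one 8-bit mu-law code to a signed linear PCM sample."""
    u = ~byte & 0xFF              # mu-law bytes are stored bit-inverted
    sign = u & 0x80               # top bit: 1 means negative
    exponent = (u >> 4) & 0x07    # 3-bit segment number
    mantissa = u & 0x0F           # 4-bit step within the segment
    magnitude = (((mantissa << 3) + BIAS) << exponent) - BIAS
    return -magnitude if sign else magnitude


def decode_mulaw(data: bytes) -> list[int]:
    """Decode a buffer of mu-law bytes into a list of PCM samples."""
    return [mulaw_to_pcm16(b) for b in data]


print(decode_mulaw(bytes([0xFF, 0x80, 0x00])))  # [0, 32124, -32124]
```

At 8 kHz that is 8000 of these bytes per second, so there is not much signal for a recognizer to work with.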

It does still struggle a bit with some specifics in other languages (e.g., Dutch and German say 53 units-first, effectively 'three-and-fifty' rather than 'fifty-three').
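
To make the word order concrete, here is a small sketch that spells out two-digit numbers the way Dutch and German speakers say them. It only covers 21 to 99 excluding round tens, and the helper names are hypothetical; it illustrates the pattern and is not how the model handles it.

```python
UNITS_DE = ["", "ein", "zwei", "drei", "vier", "fünf", "sechs", "sieben", "acht", "neun"]
TENS_DE = {2: "zwanzig", 3: "dreißig", 4: "vierzig", 5: "fünfzig",
           6: "sechzig", 7: "siebzig", 8: "achtzig", 9: "neunzig"}

UNITS_NL = ["", "een", "twee", "drie", "vier", "vijf", "zes", "zeven", "acht", "negen"]
TENS_NL = {2: "twintig", 3: "dertig", 4: "veertig", 5: "vijftig",
           6: "zestig", 7: "zeventig", 8: "tachtig", 9: "negentig"}


def _split(n: int) -> tuple[int, int]:
    """Return (tens, unit) for 21..99, rejecting round tens."""
    if not 21 <= n <= 99 or n % 10 == 0:
        raise ValueError("sketch only covers 21..99 excluding round tens")
    return divmod(n, 10)


def spoken_german(n: int) -> str:
    tens, unit = _split(n)
    # Unit comes first, joined to the tens with "und".
    return UNITS_DE[unit] + "und" + TENS_DE[tens]


def spoken_dutch(n: int) -> str:
    tens, unit = _split(n)
    # Unit comes first; the joiner is "ën" after a unit ending in "e".
    joiner = "ën" if UNITS_NL[unit].endswith("e") else "en"
    return UNITS_NL[unit] + joiner + TENS_NL[tens]


print(spoken_german(53))  # dreiundfünfzig
print(spoken_dutch(53))   # drieënvijftig
```

So a caller reading out "53" says the 3 before the 50, and the transcript has to swap them back.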
