Undoubtedly, AI represents the disruptive force of our era. In May, OpenAI unveiled their most recent Chat GPT iteration, dubbed ChatGPT 4o, with the 'o' signifying 'Omni' due to its capability for audio and video interactions.
This model is remarkable for its swift response times and concurrent voice and video interaction capabilities, positioning it as a versatile personal assistant.
The prospect of OpenAI's partner, Be My Eyes, adopting this new model is particularly thrilling for me, as hinted at in their teaser video.The capability to inquire about visual elements in the world through my phone has greatly enhanced my autonomy. With the latest model, the responses are quicker, and it eliminates the need to frame and snap a photo, as the AI now interprets the live video feed directly.
“The new voice (and video) mode is the best computer interface I’ve ever used,” OpenAI CEO Sam Altman said in a blog post following the announcement. “It feels like AI from the movies; and it’s still a bit surprising to me that it’s real. Getting to human-level response times and expressiveness turns out to be a big change.”
https://edition.cnn.com/2024/05/13/tech/openai-altman-new-ai-model-gpt-4o/index.html