ChatGPT now can see what you see and explain it in Santa’s voice
OpenAI has introduced two new features for ChatGPT. First up, the company has finally rolled out the real-time video capabilities it teased almost seven months ago. During one of its livestreams, OpenAI revealed that Advanced Voice Mode – ChatGPT’s conversational feature that mimics human-like responses – is now equipped with vision.
If you’re using the ChatGPT app and subscribed to Plus, Team, or Pro, you can simply point your phone at an object, and ChatGPT will give you an almost instant response. No need to upload photos or type out detailed descriptions anymore.
What’s more, Advanced Voice Mode with vision can also interpret what’s on your device’s screen through screen sharing. Whether it’s explaining your settings menu or walking you through a tricky math problem, it’s got you covered. Once it rolls out, if you want to try it out, follow these steps:
- Tap the voice icon next to the chat bar in the app.
- Hit the video icon at the bottom left to start using video.
Meanwhile, for screen sharing, just tap the three-dot menu and select “Share Screen.”
OpenAI says the rollout for Advanced Voice Mode with vision kicks off Thursday and should be completed within a week. However, not everyone will get to try it right away. ChatGPT Enterprise and Edu subscribers will have to wait until January, and there’s no set timeline yet for users in the EU, Switzerland, Iceland, Norway, or Liechtenstein.
Video credit – OpenAI
And as you can see from the video, OpenAI is getting into the holiday spirit. There is one more feature now: Santa mode. It’s a festive preset voice for ChatGPT that users can access by tapping or clicking the snowflake icon next to the app prompt bar.
Say ho ho ho to Santa in Voice Mode
Santa is rolling out today to everyone across all ChatGPT platforms and is available until the end of the month…then he will retire back to the North Pole. pic.twitter.com/NVS9bRok4r
— OpenAI (@OpenAI) December 12, 2024
I think that by rolling out features like video and screen sharing, OpenAI is pulling ahead of its main rival, Google, which just unveiled its latest AI model, Gemini 2.0. The battle between these tech giants is indeed pushing AI innovation at lightning speed.