Now You Can Chat With ChatGPT Using Your Voice

OpenAI, the company behind ChatGPT, has introduced upgrades that enhance the AI chatbot’s capabilities. Users can now speak their queries to ChatGPT, which responds in a synthesised voice.

Additionally, the updated ChatGPT app supports image recognition, allowing users to upload or capture photos and receive descriptions and context, much like Google Lens.

These enhancements reflect OpenAI’s commitment to continuously improving its AI models, treating them as evolving products. ChatGPT is becoming more like popular consumer AI assistants such as Apple’s Siri and Amazon’s Alexa.

The new voice interaction feature uses two models: Whisper converts the user’s speech to text, and a text-to-speech model voices ChatGPT’s responses.
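The flow described above (speech in, text to the chatbot, speech out) can be pictured as three stages chained together. The sketch below is only an illustration under stated assumptions: the class name `VoicePipeline` and the three stand-in functions are hypothetical, and they do not call Whisper, ChatGPT, or any OpenAI API. They simply pass text through so the shape of the pipeline is visible and the example runs on its own.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class VoicePipeline:
    """Hypothetical three-stage voice pipeline (names are illustrative only)."""

    transcribe: Callable[[bytes], str]   # speech-to-text stage (the role Whisper plays)
    reply: Callable[[str], str]          # chatbot stage (the role ChatGPT plays)
    synthesize: Callable[[str], bytes]   # text-to-speech stage

    def respond(self, audio_in: bytes) -> bytes:
        # 1. Turn the user's spoken query into text.
        text_in = self.transcribe(audio_in)
        # 2. Let the chatbot produce a text answer.
        text_out = self.reply(text_in)
        # 3. Turn the answer back into audio.
        return self.synthesize(text_out)


# Stand-in stages so the sketch runs without any model or network access.
# Here "audio" is just UTF-8 bytes; a real system would handle waveforms.
def fake_transcribe(audio: bytes) -> str:
    return audio.decode("utf-8")


def fake_reply(prompt: str) -> str:
    return f"You said: {prompt}"


def fake_synthesize(text: str) -> bytes:
    return text.encode("utf-8")


pipeline = VoicePipeline(fake_transcribe, fake_reply, fake_synthesize)
audio_out = pipeline.respond(b"hello")
```

Because each stage is just a function, a real speech recognizer or speech synthesizer could in principle be swapped in for the stand-ins without changing `respond`.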

OpenAI developed the synthetic voices by training on professional actors’ voices, with the goal of making them enjoyable for extended listening. Users may even be able to create their own voices in the future, a sign of the company’s focus on user comfort and engagement.

