ChatGPT's Latest Advancement: Voice Integration and Image Interaction Upgrade

Yesterday, OpenAI announced on its website that it is releasing new features that let users interact with ChatGPT by voice or with images. The features are meant to give users more ways to express what they want and get a more detailed answer.


“We’re rolling out voice and images in ChatGPT to Plus and Enterprise users over the next two weeks. Voice is coming on iOS and Android (opt-in in your settings) and images will be available on all platforms,” OpenAI said.


With the voice feature provided by OpenAI, you can now engage in a back-and-forth conversation with the AI assistant.


To access this feature, go to "Settings" in the mobile app, tap "New Features", and opt in to "Voice conversation".


Then tap the headphone button in the top-right corner of the home screen and choose your preferred voice from the five available.



To use the image feature, tap on the photo button to capture or choose an image. If you're an iOS or Android user, tap on the plus button first. 


With this image feature, you can discuss multiple pictures or make use of the drawing tool to guide your AI assistant.


Let’s Talk About ChatGPT’s Update

The introduction of voice capabilities brings OpenAI’s chatbot closer to voice assistants such as Apple’s Siri and Amazon’s Alexa.


With the new voice feature, ChatGPT can tell bedtime stories, help settle a dinner-table debate, and much more.


Spotify is using the same underlying technology to let podcasters on its platform translate their content into different languages, OpenAI said.


With the image feature, users can upload one or more images and ask ChatGPT about them, for example to troubleshoot why a grill won’t start, explore the contents of a fridge to plan a meal, or analyze a complex graph for work-related data.

How People Are Responding to the Latest OpenAI Update

The OpenAI update has sparked widespread debate on X, formerly known as Twitter. While some users are celebrating the release, others have raised concerns.


Al Jazeera reported that, speaking to WIRED, Trevor Darrell, a professor at UC Berkeley and co-founder of Prompt AI, described the fear of AI becoming too human-like as the “uncanny valley gap”.


Some users are more concerned about the lawsuit filed against the tech company over alleged copyright and intellectual property infringement, and have advised others not to use ChatGPT.


Others worried that the update could eventually displace smaller AI startups, software engineers, and even educators.


AI-generated voices have also raised the threat of deepfakes, voice scams, and identity theft.


Voice recognition might also make the feature less accessible to people who do not speak with mainstream accents, according to Joel Fischer, who studies human-computer interaction at the University of Nottingham in the UK, Al Jazeera reported.

What Is OpenAI Saying?

OpenAI acknowledged that the voice feature could be misused for malicious purposes such as fraud and impersonation. To counter this, the company said it is “using this technology to power a specific use case”, namely voice chat.


OpenAI also acknowledged that using images in AI has limitations, including image hallucinations, where the AI generates false information about an image.


To counter this, OpenAI said it has taken technical measures to limit ChatGPT’s ability to analyze and make direct statements about people.

