How To Use ChatGPT's Advanced Voice Mode?

December 18th, 2024

4 minutes

🟢easy Reading Level

Advanced Voice Mode in ChatGPT enhances your AI-driven conversations by providing a more natural, speech-based interaction.

Advanced voice leverages natively multimodal models (such as GPT-4o) to directly “hear” and generate audio. This allows for more fluid, real-time conversations, capturing nuances such as speaking pace and emotional tone.

With Advanced Voice Mode, you can also share video, screens, and images during your conversation—making it a versatile tool for immersive communication and richer content exchange.

In this article, we'll cover how to access ChatGPT Advanced Voice Mode, its benefits, main features and their applications.

How to Access Advanced Voice Mode

  1. Eligibility and Availability: Advanced Voice Mode is available to Plus, Pro, and Team users, with a monthly preview for Free users.

  2. On Mobile (iOS and Android):

    • Update to the latest app version.
    • Tap the Voice icon at the bottom-right of the main screen.
    • If advanced voice is available, you’ll see a blue orb in the center of the conversation screen. The standard voice interface shows a black circle.
    • Grant microphone permissions if prompted.
    • If it’s your first time using advanced voice, you’ll be asked to pick a voice. You can change voices anytime in settings or from the customization menu within Advanced Voice Mode.
  3. On Desktop Web:

    • Go to ChatGPT website and sign in.
    • Click the Voice icon at the bottom-right of the input box.
    • Grant your browser permission to access the microphone if prompted.
    • A blue orb indicates Advanced Voice Mode.
    • Set or change your chosen voice any time in settings or via the customization menu.

Feature 1: Real-Time, Lifelike Voice Conversations

What it is:
Advanced Voice Mode uses multimodal models that directly process audio, creating more natural, responsive, and emotionally nuanced interactions.

How to use it:

  • Start a voice chat and speak naturally at your own pace.
  • ChatGPT will respond with lifelike voices that you select, capable of capturing mood and tone.
  • Adjust the selected voice and conversation style at any time through settings or the in-chat customization menu.

Feature 2: 9 Distinct Voices + Seasonal Voices

What it is:
You can choose from nine lifelike output voices—each with its own character and tone—plus seasonal or event-specific voices (like Santa, available until early 2025).

How to use it:

  • On your first advanced voice conversation, you’ll be prompted to pick a voice.
  • Switch voices any time in settings or via the customization menu in Advanced Voice Mode.
  • Changing voices starts a new conversation, giving you flexibility to find the voice that best fits your style.

Feature 3: Video Sharing (Mobile Apps Only)

What it is:
On iOS and Android devices, you can share live video while chatting in Advanced Voice Mode. This lets you display visual information in real-time—ideal for demonstrations, presentations, or face-to-face style interactions.

How to use it:

  • During a voice chat, tap the Camera button at the bottom of the screen to start sharing your video.
  • Tap the same button again to stop sharing.
  • ChatGPT can respond to what it sees in your camera feed, and may reference it later in the same conversation.

Feature 4: Screen Sharing and Image Uploads (Mobile Apps Only)

What it is:
You can share images from your phone’s gallery or capture new photos. You can also share your screen with ChatGPT to show slides, documents, or in-app workflows.

How to use it:

  • Tap the Three Dots button and select Share Screen from the menu.
  • Choose to take a photo, upload a photo, or share your screen.
  • Stop sharing anytime by tapping the screenshare button again.
  • ChatGPT may analyze and reference shared images or screen content during the conversation.

Feature 5: Background Conversations and Resuming Chats

What it is:
You can keep voice conversations going in the background, even if your phone is locked or you’re using another app. Advanced voice chats can be resumed as text or standard voice sessions later, though standard sessions cannot be resumed as advanced voice.

How to use it:

  • Enable “Background Conversations” in settings.
  • Switch back to your ChatGPT app to continue where you left off.
  • Note: Standard voice conversations can’t be upgraded to Advanced Voice Mode if resumed.

Additional Tips for Advanced Voice Mode

  • Optimizing Audio Quality:
    Use headphones and enable “Voice Isolation” on iPhone to reduce background noise.

  • Controlling Content:
    Ask ChatGPT to speak your preferred language if voice detection is off. In standard voice mode, you can specify a language in the app settings for better accuracy.

  • Data Retention and Memories:
    Memories and custom instructions work in both advanced and standard voice modes. Check your chat history and settings to manage memory preferences.

Conclusion

Advanced Voice Mode brings a new dimension to interacting with ChatGPT—combining speech, video, images, and even screen sharing into a seamless, dynamic experience. Whether you’re presenting a project, collaborating on visuals, or just enjoying a more lifelike conversation with an AI, advanced voice offers richer, more intuitive interactions. With flexible voice options, privacy controls, and multimodal capabilities, Advanced Voice Mode sets a new standard for conversational AI.

Valeriia Kuka

Valeriia Kuka, Head of Content at Learn Prompting, is passionate about making AI and ML accessible. Valeriia previously grew a 60K+ follower AI-focused social media account, earning reposts from Stanford NLP, Amazon Research, Hugging Face, and AI researchers. She has also worked with AI/ML newsletters and global communities with 100K+ members and authored clear and concise explainers and historical articles.


© 2024 Learn Prompting. All rights reserved.