Pro-Voice Recording Guide

This guide outlines the requirements for capturing client audio recordings to create high-quality voice clones for their Digital Mind. By following these guidelines, you can help us gather the necessa

Audio Recording Specifications

To ensure the best possible voice clone, please adhere to the following specifications when gathering audio from your clients:


Recording Length

The duration of the audio recording is crucial for capturing a comprehensive vocal profile.

  • 30 minutes: This is a good minimum length.

  • 1 hour: This is better and provides more data.

  • 2 hours: This is ideal and will allow for the most accurate and nuanced voice clone.
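For anyone who wants to check a recording against these tiers before sending it, the thresholds above can be sketched in a few lines of Python. The function name and the tier labels ("minimum", "better", "ideal", "too short") are illustrative assumptions, not part of any official tool.

```python
# Thresholds in minutes, taken from the list above.
# The label strings are hypothetical names chosen for this sketch.
TIERS = [(120, "ideal"), (60, "better"), (30, "minimum")]


def rate_recording_length(total_minutes: float) -> str:
    """Return the tier label for a total recording length in minutes."""
    for threshold, label in TIERS:
        if total_minutes >= threshold:
            return label
    # Anything under 30 minutes falls below the guide's minimum.
    return "too short"


print(rate_recording_length(45))   # a 45-minute recording
print(rate_recording_length(130))  # a recording just over 2 hours
```

A 45-minute recording meets the minimum, while anything at or beyond 2 hours lands in the ideal tier.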


Recording Style

The nature of the recording should reflect natural conversational speech.

  • Conversation/Interview: The recording should capture only the client's side of a back-and-forth conversation, similar to an interview. This allows us to understand their natural speaking patterns, intonation, and rhythm in a dynamic exchange, without the other person's voice.

  • Avoid:

    • Speeches

    • Readings

    • Memorized lines

    • Recordings of the client talking to no one, made only to produce a clip

These types of recordings often lack the natural conversational flow, which can result in a voice clone that sounds "off" or unnatural.


Speaking Tone

The client should be speaking in their normal, everyday voice.

  • Normal Speech: The recording should capture the client speaking normally, without any exaggerated or specific emotions in their tone.

  • Avoid: Recordings where the client is speaking in an unusual context or with a heightened emotional tone, as this can lead to an unrepresentative voice clone.


Audio Quality

Clean audio is essential for accurate voice cloning.

  • Clean Audio: Please ensure the audio is as clean as possible. This includes removing or minimizing:

    • Room echo

    • Random noises

    • Audio glitches

    • Background chatter from other people

  • Audio Cleaning: You are encouraged to clean the audio if necessary to remove any unwanted disturbances.

  • Professional Equipment: For the best voice clone quality, use professional recording equipment. It minimizes background noise, echo, and other disturbances, which gives us a cleaner and more accurate vocal profile.


Clip Segments

Multiple clips are acceptable and often preferred.

  • Multiple Clips: You do not need to provide a single continuous audio clip. You can send multiple clips, which we can combine on our end. This offers flexibility in recording sessions.
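Because clips are combined, what matters is their total length rather than the length of any single file. The sketch below adds up the duration of several clips using Python's standard `wave` module. It assumes the clips are uncompressed WAV files, and the function names and file names are hypothetical; other formats such as MP3 or M4A would need a different library.

```python
import wave


def clip_seconds(path) -> float:
    """Return the duration of one WAV file in seconds."""
    with wave.open(str(path), "rb") as wav:
        # Duration = number of audio frames / frames per second.
        return wav.getnframes() / wav.getframerate()


def total_minutes(paths) -> float:
    """Return the combined duration of all clips in minutes."""
    return sum(clip_seconds(p) for p in paths) / 60


# Example usage (file names are placeholders):
# print(total_minutes(["session1.wav", "session2.wav"]))
```

The combined figure can then be compared with the recording lengths listed earlier in this guide.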


Authenticity

Preserve natural speech patterns.

  • Do Not Cut Out Authenticity: Retain the natural speech elements that make a voice sound authentic. Do not edit out disfluencies such as "uh" and "um," pauses, or other natural conversational fillers and hesitations.

These elements are vital for capturing the true cadence and personality of the client's voice.


Thank you for helping us create exceptional Digital Minds!
