Pro-Voice Recording Guide
This guide outlines the requirements for capturing client audio recordings to create high-quality voice clones for their Digital Mind. By following these guidelines, you can help us gather the necessa
Audio Recording Specifications
To ensure the best possible voice clone, please adhere to the following specifications when gathering audio from your clients:
Recording Length
The duration of the audio recording is crucial for capturing a comprehensive vocal profile.
30 minutes: This is a good minimum length.
1 hour: This is better and provides more data.
2 hours: This is ideal and will allow for the most accurate and nuanced voice clone.
Recording Style
The nature of the recording should reflect natural conversational speech.
Conversation/Interview: The recording should capture only the client's side of a back-and-forth conversation, similar to an interview. This allows us to understand their natural speaking patterns, intonation, and rhythm in a dynamic exchange, without the other person's voice.
Avoid:
Speeches
Readings
Memorized lines
Recordings of them just talking in an attempt to generate a clip
These types of recordings often lack the natural conversational flow, which can result in a voice clone that sounds "off" or unnatural.
Speaking Tone
The client should be speaking in their normal, everyday voice.
Normal Speech: The recording should capture the client speaking normally, without any exaggerated or specific emotions in their tone.
Avoid: Recordings where the client is speaking in a strange context or with a heightened emotional tone, as this can lead to an unrepresentative voice clone.
Audio Quality
Clean audio is essential for accurate voice cloning.
Clean Audio: Please ensure the audio is as clean as possible. This includes removing or minimizing:
Room echo
Random noises
Audio glitches
Background chatter from other people
Audio Cleaning: You are encouraged to clean the audio if necessary to remove any unwanted disturbances.
Professional Equipment: For the best voice clone quality, use professional recording equipment to minimize background noise, echoes, and other audio disturbances, ensuring a cleaner and more accurate vocal profile.
Clip Segments
Multiple clips are acceptable and often preferred.
Multiple Clips: You do not need to provide a single continuous audio clip. You can send multiple clips, which we can combine on our end. This offers flexibility in recording sessions.
Authenticity
Preserve natural speech patterns.
Do Not Cut Out Authenticity: It is important to retain natural speech elements that contribute to an authentic voice. Avoid editing out disfluencies such as "uh," "um," pauses, and other natural conversational fillers or hesitations to maintain the authentic flow of speech.
These elements are vital for capturing the true cadence and personality of the client's voice.
Thank you for helping us create exceptional Digital Minds!
Last updated