New – Long-Form voices for Amazon Polly | Amazon Web Services

ByContributor November 16, 2023December 24, 2023

We are launching three new voices for Polly. Powered by a new long-form engine, the voices are natural and expressive, with appropriate pauses, emphasis, and tone.

New Voices
The new long-form voices are perfect for blog posts, news articles, training videos, and marketing content. The underlying Machine Learning model extracts meaning from the text, learning about speech segments, prosody (the pattern of rhythm and pauses), intonation, and other aspects of expressive speech, allowing the synthesized audio to express emotions, especially in dialogs. The new long-form engine uses a deep learning text-to-speech (TTS) model trained to acquire a contextual understanding of the text that allows it to express prosody in an appropriate way. This allows the intention of the story to drive the vocal performance and create the correct emphasis, pauses, and tones of a realistic human voice.

Here are the new voices:

Name Locale Gender Language Sample

Danielle

en_US

Female

English (US)

AWS Management Console, AWS Command Line Interface (AWS CLI), or the AWS SDKs. Using the CLI, I start by listing the voices that use the new long-form engine:

$ aws --region us-east-1 polly describe-voices --output json 
  | jq -r '.Voices[] | select(.SupportedEngines | index("long-form")) | .Name'
Danielle
Gregory
Ruth

I can pick one, or I can try all of them:

for v in `aws polly describe-voices --output json 
          | jq -r '.Voices[] | select(.SupportedEngines | index("long-form")) | .Name'`; do
    Text="Hello my name is $v and I can read blog posts, articles, 
and other long-form content for you. I am the best!"
    aws polly synthesize-speech --output-format 'mp3' 
    --text "$Text" --voice-id $v $v.mp3 --engine long-form; 
    aws s3 cp $v.mp3 s3://jbarr-voices; 
done

My shell script had a small quoting bug, but the resulting audio was too funny not to include!

Programmatically, you can reproduce my example by writing code that calls the DescribeVoices and SynthesizeSpeech functions.

Things to Know
Here are some interesting things that you should know about the new voices:

Pricing – Long-form voices are priced at $100 per million characters or Speech Marks requests. Check out the Amazon Polly Pricing page to learn more.

Engines & Voices – Some of the voices that I listed above can be used with more than one engine. For example, the Danielle voice can be used with the new long-form engine and the existing neural engine.

Regions – The new engine and voices are available in the US East (N. Virginia) Region.

Check out the new voices, build something awesome, and let me know what you think!

— Jeff;

Maximizing Your Forms with RSForm! Pro – The Ultimate Guide

Joomla plugins are vital tools that enhance the functionality of a website, with RSForm! Pro distinguished as a robust form-building solution. This overview aims to outline the key features, benefits, and straightforward installation process of RSForm! Pro. It will explore various customization options that enable users to tailor forms to their specific requirements, review common… […]

Boost Your Website Speed with Our Performance Optimization Plugin

In the rapidly evolving digital landscape, the performance of a Joomla website significantly influences user engagement. Performance optimization plugins serve as essential tools aimed at enhancing the speed and efficiency of a website by combining, minifying, and compressing assets such as CSS, JavaScript, and images. This article examines the advantages these plugins provide, ranging from… […]

How to Upgrade from Ubuntu 22.04 LTS to Ubuntu 24.04 LTS

The stable version of Ubuntu 24.04 LTS (code-named Noble Numbat) is released on April 25th 2024, if you are curious to know what is in it, you can now upgrade to the version of it… The post How to Upgrade from Ubuntu 22.04 LTS to Ubuntu 24.04 LTS appeared first on FAST DOMAINS.

16 Best Linux Distributions for Older Computers

Do you have an old laptop that has gathered layers of dust over time and you don’t exactly what to do with it? A good place to start would be to install a Linux distribution… The post 16 Best Linux Distributions for Older Computers appeared first on FAST DOMAINS.

New – Long-Form voices for Amazon Polly | Amazon Web Services

WebKitGTK+ 2.22.2 and 2.22.3, Media Source Extensions, and YouTube

Veeam Backup for Microsoft Azure: configuration – pt.3

Tech Mahindra and Google Cloud team up to boost generative AI adoption

Email Signature Design Guide, Best Practices, and Examples

99% of firms face challenges due to multiple cloud platforms

Microsoft outperforms Amazon and Google in cloud AI

© Copyright

VMware ESXi Power Optimization Overview

WiredGorilla

Similar Posts

© Copyright