US English male speaker ("BDL") voice for Festival. This is an HMM-based Speech Synthesis System (HTS) voice from the Nagoya Institute of Technology, trained using the CMU ARCTIC database. The voice is based on 1132 utterances spoken by a US English male speaker who is experienced in building synthetic voices. The recordings were made at 16-bit, 32 kHz in a soundproof room, in stereo: one channel carries the waveform, the other the EGG (electroglottograph) signal. The database was labelled automatically with CMU Sphinx, using the FestVox labelling scripts. No hand correction has been made.
US English female speaker ("CLB") voice for Festival. This is an HMM-based Speech Synthesis System (HTS) voice from the Nagoya Institute of Technology, trained using the CMU ARCTIC database. The voice is based on 1132 utterances spoken by a US English female speaker who is experienced in building synthetic voices. The recordings were made at 16-bit, 32 kHz in a soundproof room, in stereo: one channel carries the waveform, the other the EGG (electroglottograph) signal. The database was labelled automatically with CMU Sphinx, using the FestVox labelling scripts. No hand correction has been made.
US English male speaker ("JMK") voice for Festival. JMK is a native Canadian English speaker, but the voice uses the US English front end. This is an HMM-based Speech Synthesis System (HTS) voice from the Nagoya Institute of Technology, trained using the CMU ARCTIC database. The voice is based on 1138 utterances spoken by a Canadian English male speaker who is experienced in building synthetic voices. The recordings were made at 16-bit, 32 kHz in a soundproof room, in stereo: one channel carries the waveform, the other the EGG (electroglottograph) signal. The database was labelled automatically with CMU Sphinx, using the FestVox labelling scripts. No hand correction has been made.
American English male speaker ("Kevin") voice for Festival. This voice uses a residual-excited LPC diphone synthesis method and takes its pronunciations from the CMU Lexicon. Prosodic phrasing is provided by a statistically trained model that uses part of speech and the local distribution of breaks. Intonation is provided by a CART tree that predicts ToBI accents, with an F0 contour generated from a model trained on natural speech. The duration model is also a CART tree trained from data.
American English male speaker ("Kurt") voice for Festival. This voice uses a residual-excited LPC diphone synthesis method and takes its pronunciations from the CMU Lexicon. Prosodic phrasing is provided by a statistically trained model that uses part of speech and the local distribution of breaks. Intonation is provided by a CART tree that predicts ToBI accents, with an F0 contour generated from a model trained on natural speech. The duration model is also a CART tree trained from data.
US English male speaker ("RMS") voice for Festival. This is an HMM-based Speech Synthesis System (HTS) voice from the Nagoya Institute of Technology, trained using the CMU ARCTIC database. The voice is based on 1132 utterances spoken by a US English male speaker who is experienced in building synthetic voices. The recordings were made at 16-bit, 32 kHz in a soundproof room, in stereo: one channel carries the waveform, the other the EGG (electroglottograph) signal. The database was labelled automatically with EHMM, an HMM labeler that is included in the FestVox distribution. No hand correction has been made.
US English female speaker ("SLT") voice for Festival. This is an HMM-based Speech Synthesis System (HTS) voice from the Nagoya Institute of Technology, trained using the CMU ARCTIC database. The voice is based on 1132 utterances spoken by a US English female speaker who is experienced in building synthetic voices. The recordings were made at 16-bit, 32 kHz in a soundproof room, in stereo: one channel carries the waveform, the other the EGG (electroglottograph) signal. The database was labelled automatically with CMU Sphinx, using the FestVox labelling scripts. No hand correction has been made.
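As a rough illustration of how one of the Festival voices above might be selected, the sketch below builds a small Festival Scheme script that activates a voice and speaks a sentence. The helper name festival_script is hypothetical, and the voice identifier nitech_us_slt_arctic_hts is an assumption about how the installed package names the SLT voice; check the voices actually present on your system before relying on it.

```python
def festival_script(voice: str, text: str) -> str:
    """Return Festival Scheme code that selects a voice and speaks `text`.

    `voice` is the part after "voice_" in Festival's voice function name.
    The exact identifier depends on the installed package (assumption).
    """
    # Escape backslashes and double quotes so the text stays a valid
    # Scheme string literal.
    escaped = text.replace("\\", "\\\\").replace('"', '\\"')
    return f'(voice_{voice})\n(SayText "{escaped}")\n'


if __name__ == "__main__":
    # Assumed voice name for the SLT package; adjust to your installation.
    print(festival_script("nitech_us_slt_arctic_hts", "Hello from Festival."))
```

The generated text can be saved to a file and loaded by Festival, or typed at the Festival prompt.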
Fetchmail is a remote mail retrieval and forwarding utility intended for use over on-demand TCP/IP links, such as SLIP or PPP connections. Fetchmail supports the remote-mail retrieval protocols in use on the Internet (POP2, POP3, RPOP, APOP, KPOP, all IMAP variants, and ESMTP ETRN) and also works with IPv6 and IPsec. Fetchmail then forwards the retrieved mail through SMTP so that you can read it with your favorite mail client. Install fetchmail if you need to retrieve mail over SLIP or PPP connections.
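To make the retrieval setup more concrete, here is a minimal sketch that generates one line for a fetchmail run-control file (typically ~/.fetchmailrc). The helper name fetchmailrc_entry, the server name, and the credentials are hypothetical; the sketch does not escape quotes inside the user name or password, and it only covers the password-based protocols named in the description.

```python
# Protocol keywords accepted by this sketch (a subset of what fetchmail knows).
ALLOWED_PROTOCOLS = {"pop2", "pop3", "apop", "rpop", "kpop", "imap"}


def fetchmailrc_entry(server: str, protocol: str, user: str, password: str) -> str:
    """Return a single 'poll' line for a fetchmail run-control file."""
    proto = protocol.lower()
    if proto not in ALLOWED_PROTOCOLS:
        raise ValueError(f"unsupported protocol for this sketch: {protocol}")
    return f'poll {server} protocol {proto} username "{user}" password "{password}"\n'


if __name__ == "__main__":
    # Hypothetical server and account, for illustration only.
    print(fetchmailrc_entry("mail.example.com", "POP3", "jdoe", "secret"), end="")
```

Because the file holds a password in clear text, fetchmail expects it to be readable only by its owner.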
FFmpeg is a very fast video and audio converter. It can also grab from a live audio/video source. The command line interface is designed to be intuitive, in the sense that ffmpeg tries to work out all the parameters itself whenever possible, so you usually only have to give the target bitrate you want. FFmpeg can also convert from any sample rate to any other, and resize video on the fly with a high-quality polyphase filter.
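The sketch below shows what such a conversion command might look like when assembled from Python. It only builds the argument list and does not run ffmpeg. The helper name ffmpeg_command and the file names are hypothetical; the options used (-i for the input, -b:v for the video bitrate, -ar for the audio sample rate, and -vf scale=W:H for resizing) are standard ffmpeg options.

```python
import shlex
from typing import List, Optional, Tuple


def ffmpeg_command(
    src: str,
    dst: str,
    video_bitrate: str,
    sample_rate: Optional[int] = None,
    size: Optional[Tuple[int, int]] = None,
) -> List[str]:
    """Build an ffmpeg argument list; ffmpeg infers everything not given."""
    cmd = ["ffmpeg", "-i", src, "-b:v", video_bitrate]
    if sample_rate is not None:
        # Resample the audio to the requested rate in Hz.
        cmd += ["-ar", str(sample_rate)]
    if size is not None:
        # Resize the video with the scale filter.
        width, height = size
        cmd += ["-vf", f"scale={width}:{height}"]
    cmd.append(dst)
    return cmd


if __name__ == "__main__":
    # Hypothetical file names, for illustration only.
    cmd = ffmpeg_command("input.avi", "output.mp4", "1M", sample_rate=44100, size=(640, 360))
    print(shlex.join(cmd))
```

Passing the list to subprocess.run would execute the conversion, provided ffmpeg is installed and on the PATH.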