[VINUX-DEVELOPMENT] Next speech project?
sbahram at nc.rr.com
Sat Nov 6 02:22:19 CET 2010
I'd usually agree with this opinion; however, there are two separate sets of semantics. One set is a variable which has a defined
range, while the other set is a coefficient, or multiplier, also with a range, but this one is defineable as opposed to predefined,
which modifies said variable.
From: vinux-development at googlegroups.com [mailto:vinux-development at googlegroups.com] On Behalf Of Trevor Saunders
Sent: Friday, November 05, 2010 9:01 PM
To: vinux-development at googlegroups.com
Subject: Re: [VINUX-DEVELOPMENT] Next speech project?
> SonicMin, and SonicMax: When set, these parameters would define what
> is meant by "0" speed in Orca, and "100" speed. In this case, we
> would intercept the speed setting commands from Orca and adjust the
> sonic speed multiplier, rather than forwarding any speed change
> command to the TTS synth. So, for example, I like to listen between
> 1X and 4X speed, so I could set these to 1.0 and 4.0, and adjust the
> actual speed in Orca settings to any speed in this range. If not set,
> then the behavior would revert to how it works now, where the TTS
> synth is responsible for the speed.
> SonicDefaultSpeed: If sonic is enabled, it is sometimes desirable to
> use a particular speech synthesis speed in the TTS synth, even though
> we're speeding it up or slowing it down as a post process to
> synthesis. For example, espeak slowly removes pauses between words
> and phrases as you increase speed to -s 450. So, speeding up espeak
> output that was generated with -s 200 sounds different, and perhaps
> better, than speeding up the espeak default speed output. This option
> would allow users to set the default speed for the TTS synth.
> Do these sound right?
I'm not sure I'm comfortable with two different ways to change rate and two sets of rate setting options and commands. I'd sugguest
getting this merged in to the tts systems.
> > Presumable other members of the vinux community could also record a
> > set of words for you - I have a good quality recorder - you would
> > just have to let us know which words you need and we could start
> > contributing to the language database - I suppose the English
> > language accents such as US, UK, Australian etc would be the first
> > step, before trying any foreign languages etc?
> > On 11/5/10, Bill Cox <waywardgeek at gmail.com> wrote:
> >> I think I've basically nailed speeding up speech, and now it's time
> >> to move onto the next phase of open-source speech I/O. There's the
> >> headache of actually getting this technology into common use, but
> >> since it's available for free under the GPL, I think it's use will
> >> spread over time.
> >> So, next project. I've ordered a Yeti mic, and will record a
> >> database of words in English in my own voice, and use this data to
> >> start building a new open-source TTS engine, using an approach that
> >> will hopefully lead to a usable speech recognition engine as well.
> >> What should the new TTS engine be called? Billspeak? How about
> >> OpenTTS, now that OpenTTS has been abandoned? A primary feature
> >> will be that anyone will be able to create a new voice for the TTS
> >> engine by recording themselves. For example, espeak has a nice
> >> British accent, but it's American accent isn't so great. I'll bet
> >> there are people in India who wish espeak would do a better job of
> >> using their accent. This TTS engine will enable support for any
> >> accent users feel strongly enough about to record.
> >> I also need to pick a coding methodology for the new TTS engine. I
> >> specifically packaged sonic using nothing but plain ANSI C, and
> >> ancient coding practices, to accelerate it's adoption. I wont have
> >> to explain the coding style to anyone. However, TTS is going to be
> >> a lot more complex, and I'd like to use the same code base in
> >> developing speech recognition, adding much more complexity.
> >> Performance in terms of minimising CPU load will be a top priority.
> >> So, I'm thinking of using the DataDraw tool I developed and used
> >> for nearly 20 years, as it's excellent for large complex projects
> >> with hundreds of classes, and the resulting code runs faster than
> >> traditional C code based on pointers to structures to model objects
> >> (DataDraw uses integer index into property arrays, which is much faster).
> >> Bill
> >> --
> >> You received this message because you are subscribed to the Google
> >> Groups "The Vinux Development Forum" group.
> >> To post to this group, send email to
> >> vinux-development at googlegroups.com
> >> To unsubscribe from this group, send email to
> >> vinux-development+unsubscribe at googlegroups.com
> >> For more options, visit this group at
> >> http://groups.google.com/group/vinux-development?hl=en?hl=en
> >> Vinux Home Page: http://vinux.org.uk/ Vinux Wiki
> >> Documentation:http://wiki.vinuxvirtual.org.uk/index.php?title=Main_
> >> Page
> > --
> > drbongo
> > Dig that crazy beat on the drums: <http://vinux.org.uk> The best is
> > getting better!
> > --
> > You received this message because you are subscribed to the Google
> > Groups "The Vinux Development Forum" group.
> > To post to this group, send email to
> > vinux-development at googlegroups.com
> > To unsubscribe from this group, send email to
> > vinux-development+unsubscribe at googlegroups.com
> > For more options, visit this group at
> > http://groups.google.com/group/vinux-development?hl=en?hl=en
> > Vinux Home Page: http://vinux.org.uk/ Vinux Wiki
> > Documentation:http://wiki.vinuxvirtual.org.uk/index.php?title=Main_P
> > age
> You received this message because you are subscribed to the Google
> Groups "The Vinux Development Forum" group.
> To post to this group, send email to
> vinux-development at googlegroups.com
> To unsubscribe from this group, send email to
> vinux-development+unsubscribe at googlegroups.com
> For more options, visit this group at
> Vinux Home Page: http://vinux.org.uk/ Vinux Wiki
More information about the Speechd