Further problems with speech implementation.
Henrik Nilsen Omma
henrik at ubuntu.com
Thu Feb 16 13:29:10 GMT 2006
Hi all,
To broaden the discussion I've also added in the -devel list.
Luke Yelavich wrote:
> Hi all
> For those who weren't at the meeting, we discussed various aspects of
> implementation for the live CD, and the GUI install. A couple of issues
> were raised conserning the implementation of speech for gnopernicus.
> From further discussion with Daniel Holbach, it seems that there is more
> to do than first thought. The following is a summary of what has been
> discussed so far, including the first implementations, up to my
> conversation this evening.
>
> * An option on the gfxboot screen was created for people to select
> various accessibility profiles to be loaded when the live CD boots.
> * One of these involves loading a screen reader with synthesized speech
> for access to GNOME and the installer.
> * The necessary settings for these profiles to be activated are not yet
> implemented, but it was decided to put flite on the CD rather than
> festival, due to disk space constraints, as flite is much smaller.
> * It was determined that gnome-speech does not have a driver for flite,
> so an alternate way of interfacing flite with gnopernicus was suggested,
> this being to use a newly developed driver for gnome-speech to interface
> with speech-dispatcher. Speech-dispatcher can interface with flite. This
> would require either waiting for a new gnome-speech release, or patching
> the current package with CVS code containing the speech-dispatcher
> driver.
> * Speech-dispatcher 0.6 has not yet entered Debian sid, and 0.6 is the
> only version that the gnome-speech driver will work with. In order for
> us to use it, we need to upgrade the package in universe to 0.6, and get
> it promoted to main.
> * In order to prevent festival from being pulled in when gnome-speech is
> included, we need to split one or both gnome-speech drivers from the
> core gnome-speech packages to prevent this from happening.
> Speech-dispatcher itself doesn't have many dependancies, the biggest one
> being flite.
> * Daniel mentioned that feature freeze is fast approaching, and all this
> needs to be packaged up, and tested on all three architectures.
>
> So as you can see, this might be more difficult than we first thought.
> Daniel, if I have missed anything, please bring it up or correct me.
>
> Henrik, I am very interested in your opinion about all of this. If
> anybody else has anything to say, please speak up, as we need to get
> something sorted for the release ASAP.
>
I'd like to bring the core developers into the discussion at this point
to since we are approaching a crunch point.
As I see it, we have 3 options:
1. Pursue the path you describe above of packaging and promoting speech
dispatcher along with the gnome drivers. This would land us on
approximately the solution we had planned, though it will be tight to
get it working in time. Perhaps we can pursue this if we also agree on a
reasonable fall-back plan.
2. Upgrade from Festival-lite to the full Festival - This would give us
better speech quality and add 6-7 languages, but would require 40MB
instead of the 8MB used by f-lite. While this is theoretically possible
by removing language packs and/or Windows-FOSS, I doubt it will be given
serious consideration because it will have a significant impact for a
large number of users (and space is just always tight). Since the space
calculations get more detailed as we approach the release date and other
things settle in, I also don't think this would be a possible fall-back
option. I would of course love to be wrong :)
3. Remove support for screen readers on the Dapper Live CD. This would
also mean removing the 'blindness' option from the menu. This is an
unfortunate thing to do since the visually impaired community is by far
the most active AT user group on Linux and are as such our main target
group at this stage (there are other, larger groups, such as the elderly
with mild motoric difficulties, but these may not be early adopters in
the same way.) It would also not allow us to claim to have the best
default AT support among distros (perhaps we still would in some ways,
but it wouldn't be such a clear case to make).
Orthogonally to these option, we also have the opportunity of creating
an AT derivative. This could easily have the full Festival package and
perhaps a few additional AT packages like dasher. This could be released
shortly after dapper and be made available for download (space can be
made by removing the Win-FOSS). There are benefits to this approach, but
getting the AT infrastructure support onto the main CDs is still the big
prize.
Personally, I would like to see us push for option #1 (assuming #2 to be
unrealistic) and have #3 as a fall-back. I'm not the best suited to
judge what is required to make #1 a reality though. I'd like to hear
from Matt and Colin if possible what you thing WRT timeframes and the
suitability of the less-than-completely-mature speech dispatcher et al.
- Henrik
More information about the ubuntu-devel
mailing list