|
|  |
Lo’ Speech Heads! From high atop Mount Apple, the sentence was handed down. Today, Eric Schmidt, Google’s CEO, will have to find a seat outside Apple’s boardroom. Schmidt had served on the board for three years. In a statement from Apple, Steve Jobs noted Schmidt’s contributions and said that the departure was mutual.
“Unfortunately, as Google enters more of Apple’s core businesses, with Android and now Chrome OS, Eric’s effectiveness as an Apple Board member will be significantly diminished, since he will have to recuse himself from even larger portions of our meetings due to potential conflicts of interest,” Jobs said.
There may be more to the story than just that, Speech Heads. Schmidt’s departure follows a murmur of rumors that the FTC was investigating whether his position on the Apple board would constitute a violation of anti-trust laws. It also follows Apple rejection of Google Voice from the App Store—meow!—which, according to Techcrunch, is being looked at by the FCC. Apparently, Apple, Google, and AT&T were all given letters of inquiry on Friday asking about the decision. Citing pending proceedings regarding wireless access and handset exclusivity the FCC wants to know what role AT&T plays in deciding what makes it into the App Store.
No clue what the answer is, but by brother Adam B. says it’s bound to be juicy!
|
|  |

James Larson, Ph.D., is co-program chair for the SpeechTEK 2008 Conference, co-chair of the World Wide Web Consortium’s Voice Browser Working Group, and author of the home-study guide The VoiceXML Guide. He can be reached at jim@larson-tech.com. He was kind enough to submit some thoughts on the recent Voice Search Conference in San Diego.
1. Voice search can be defined as (a) using voice to search text information, and (b) using voice to search voice information. There was little discussion the second type of voice search. There were many talks about the first type of voice search, especially for directory assistance, customer info lookup, and music “jukebox” applications.
2. While much of the conference dealt with voice search, several sessions addressed other speech technology topics. For example, 5. The folks form Spoken talked about Secret Agents. A secret agent is a human who monitors several ongoing IVR dialogs. The agent is notified when the speech recognition engine fails to understand what the user said. The user’s utterance is replayed to the agent, who selects the appropriate word from the grammar, or causes the dialog to transfer to a regular human agent. The overall effect to the user is the dialog works better.
I note that AT&T did this some time ago for directory assistance calls.
The goal of secret agents is to contain the user within the automated IVR system. As we saw from Paul English and the gethuman.com web site, users hate containment, especially if they have a difficult request that they feel can only be handled by a live agent. I wonder how these users will feel if they knew that a secret agent is listening to them but is not allowed to speak directly with them.
3. Mike Phillips, Vlingo, has a nice demonstration for accessing textual data by voice. Vlingo has done a lot of usability testing, and it shows when you use the UI, which I think is very good. Check out the UI by going to http://www.vlingo.com/ and clicking “watch the demo.”
4. Three hot topics of discussion were:
(1) multimodal user interfaces
(2) analytics
(3) video and voice dialog. Most conversations delt with how cool these new technologies were and how to make money using them.
5. I had a chat with David Thomson, who gave a talk about how phones can be used in social web sites. We see opportunities for speech technology in social web sites:
(1) Provide simple authoring tools so teen can create speech dialogs to their portrayed personas.
(2) Viewers could call a phone number and leave messages, which could be converted to text by general purpose dictation recognition.
(3) The virtual equivalent of an answering machine that could accept VoIP calls, filter them, and route them according to instructions by the social web site owner. I think there are many opportunities for speech technologies in social web sites.
|
|  |
This is a transcript of the morning keynote panel between
Bill Meisel, President, TMA Associates
Mike Cohen, Manager, Speech Technology Group, Google
Victor Melfi, Chief Strategy Officer & Senior Vice President, Marketing, VoiceBox Technologies
Neal Bernstein, Senior Director, Local & Mobile Search, Microsoft
Michael Wehrs, Vice President, Evangelism & Industry Affairs [Ed: Yes, that is his real title], Nuance Communications
John Tadlock, Lead Technical Architect, Consumer Application Architecture, AT&T
I typed the discussion out as I listened. So if it’s a little sloppy in parts…tough tamales, readers.
I did this for two reasons. First, it was a really good debate and the panelists gave some great information that I was unable to include in the daily Speech Tech news story. I just didn’t have enough space.
The second reason is because one panelist calls another an “ignorant slut” [Ed: According to staffers here, this phrase originates from SNL's Weekend Update editions with Jane Curtin...and is hilarious in the context of voice search.]. I wanted to share that.
And yes. Yes. You’re welcome.
Meisel: Voice search is vague and we’ve kept it vague for this conference. Voice search is a way of implying, like web search, that you can get things quickly and easily. How do you see this paradigm? What can we do with it?
Answers after the jump!
(more…)
|