Speech Technology Magazine SpeechTEK Conference
 
Eric B.   —   March 25, 2009 @ 10:42 am

Que pasa, baby?In our continuing examination of speech technology in popular culture, I bring to you one of the weirder things we’ve stumbled across.

You’re all doubtlessly familiar with Loquendo TTS work my brother, Adam B., does every week for our news items on the mother site, but apparently its popular use runs far deeper. Our investigations show that there is an entire community of Spanish-speaking YouTube users who are using Loquendo’s TTS to make video críticas or “criticisms. The críticas are rants chockfull of curses and insults leveled on their subject which range from Dragonball Z to emo kids, a subculture of much maligned, droopy-haired teeners who patron a genre of existentially sentimental rock music also known as “emo,” that are delivered by Loquendo TTS products.

The  videos are pretty similar to one another and vary only in target. All, as best we can tell, use the Castilian Spanish male voice font, “Jorge,” and many seem to take advantage of the free demo that Loquendo offers on their website, marked as such because they have this creepy music that Loquendo puts in the background of their demo files–a chorus of synthetic TTS sirens singing Loquendo! over and over again. You’re going to want to check that out for yourself.

For the most part,  críticas level their attacks at popular television institutions House, Pokemon, and the Disney Channel (there seems to be an entire subgenre of just Disney Channel críticasI found heaps of them on YouTube), but there are also some vaguely offensive works like “How to Seduce a Woman,” (in which the narrator explains the importance of body language) and other such lessons.

Here’s a typical specimen:

[youtube]http://www.youtube.com/watch?v=ERueevnv-YU&feature=related[/youtube]

While these things are easy to write off as just a bunch of immature adolescents or, at worst, adults slagging around, críticas still offer some insights about the future of speech.

We mostly think of Loquendo’s TTS offerings as being business oriented, allowing companies to generate a spoken interface on-the-fly in IVR phone systems or whatever. In these instances, the software essentially allows a big company, an abstract collective, to give a single voice, separate from any real living entity in the world, to itself. This, however, cuts right back in the other direction, and allows individuals to deliver spoken messages anonymously, assuming a synthetic and collectively created voice. Individuals can hide their gender, their age, their nationality-any number of things which might be inadvertently revealed in the expression of their biological voice.

The anonymity in críticas, pretty much authorizes users to curse left and right and traffic in the most backward kind of homophobia. That is, it lets them spout all the words they wouldn’t normally say in public–like a vocalized internet flame–so you get a lot of puta madre this and that. In point of fact, just about every other word in these videos is puta, a Spanish insult for a female sex-worker. The TTS writers get pretty creative with it, transmogrifying the word into just about every noun, adjective, and verb form imaginable. Putanizada, putilla, and putón are just some of the kinds of the flourishes that they luxuriate in.

Also interesting, the otherwise coarse language is couched in fairly complex grammatical structures. The work of one 2Alfredo2, in particular, makes heavy use of interjected clauses. When combined with colorful grammatical plays on common Spanish insult words like jilipolla, the overall effect is both formal and pruriently vulgar. It kind of sounds like a high school English essay gone wrong.

They say Montana is the "Cyrus State."Granted, this work is pretty limited in scope to say the least. There’s only so many kicks you can get out of listening to a machine tell off Hannah Montana with every curse word out of the Real Academia Diccionario de Palabortas. But TTS is an artistic medium in its infancy. There is real potential for the anonymity afforded to users to do good.

TTS might allow human rights activists under repressive regimes, and other marginalized voices, to express their deepest feelings without compromising themselves. It, moreover, gives them access to auditory-dependent media infrastructures like podcasting. Likewise, the cadences in TTS are still, for all Loquendo’s immense advancements in recent years, still sometimes jerky, and, in their strangeness, embody a certain set of aesthetic values that can probably be capitalized by artists willing to engage with them.

There are also potentially harmful effects. The technology could be used to anonymize all sorts of ill-intent and to dissemination any hateful message one might care to pass along. TTS is just a tool, like any other, but as speech starts making its way out of the rarified corridors of business, it is likely to begin to be plied for all sorts of artistic and political ends.

If you have your own TTS art you’d like to share, please leave us a comment!

Adam B.   —   January 27, 2009 @ 12:33 pm

loquendo logoYou may have read the recent news brief online at Speech Technology about Loquendo adding yet another voice–this time that of Mikko–to their already vast Text-To-Speech Family.

And while any Speech-Head worth her salt is already well aware of Loquendo’s TTS, many of us–my Speech Brother Eric B included–are unaware of the Loquendo TTS Family Tree.

All told, Loquendo offers sixty-two different voices from an array of different countries in what amounts to an United Nations of Speech Technology.

Among those voice are:

And while we’re talking TTS, don’t forget to check out the TTS version of our daily News Features starring the aforementioned Allison–a feature that we recently expanded to let us deliver even more Speechified News.

Adam B.   —   December 1, 2008 @ 11:39 am

ilane logoIf you read yesterday’s Speech Technology news feature, you already know about iLane–the new hands-free, voice-enabled, vehicular email solution from Loquendo and Canada’s Intelligent Mechatronic Systems.

But, have you seen The Video?

If not, then check out the following links for clips of iLane in action:

Video One

Video Two

Video Three

Adam B.   —   October 22, 2008 @ 10:54 am

I just want to remind everyone that we at Speech Technology magazine, convert our daily news features into audio files via the magic of Loquendo’s TTS Director!

So, if you want to listen to “Allison” read the latest in speech technology news–or even this blog entry–we’ve got the text-to-speech capabilities to make it happen for you!

STM Blog   —   May 1, 2008 @ 8:31 am

Good morning! This post is coming to you super-early, because I have way too much to do later today. So, before my boss creeps in, here’s all the news for the week so far. Our faithful, intrepid managing editor Len hath returned from the Genesys G-Force conference (read his posts about G-Force here), and he’ll have more to report to you in coming days. Such as what he did with the Genesys-emblazoned belt buckle … and how the rest of the office kind of wants it to add to our wall of “free stuff vendors send us.” So, unless you’re totally intrigued by how Xerox (haha, remember photocopies?) plans to compete with Google and Salesforce, or that giant squid is still freaking you out (hello, it has the world’s largest eyes), follow the jump for less-disgusting news stories! [Speech Tech Blog, Information Week, Associated Press]

(more…)

STM Blog   —   March 12, 2008 @ 1:07 pm

The Voice Search Conference winds down around 5pm tonight, but by then I’m on the flight back to NY.

Another journalist I spoke with said he got all excited when he listened to the opening keynotes, but was inevitably disappointed towards the day’s end due to the overall lack of focus on voice search itself.

I feel similarly; many of the panels didn’t have a lot to do with voice search (and they all could have used more live demos. If you’re going to hawk a solution, I’d like to see it in play). Loquendo’s Paolo Baggia told me during lunch that he was going to give a talk called Improving the user experience.

“It’s not really about voice search, though…” he said.

Both Bill Scholz and Bill Meisel stated in the panels they moderated that they’d defined voice search “very broadly.” To what end? To what extent is it beneficial to have such a vague definition? If you’re going to devote an entire three-day conference to a topic, shouldn’t that topic be clearly defined?

The conference did highlight issues regarding voice search (whatever that may entail). Click below for the rest of the article.

(more…)

STM Blog   —   March 7, 2008 @ 3:40 pm

If you go to our website, you’ll notice that some of our webstories are speechified. We’re using Loquendo’s TTS engine. The voice is called Alison. We chose her because we liked her concatenation and she didn’t sound too threatening. We also convinced our editor-in-chief that she was actually Cokie Roberts. Tell us what you think in the comments below!

We also have RSS feeds. Hold on to your hat!

Finally, we’d love for some of you clever people to contribute to this blog once every two weeks. If you’re interested, send us an email. Let us know your name, title, and favorite speech tech-related deployment over the last year. Also your favorite superhero and/or Greek god.

Previous Posts
Keyword Tags
Archives
© 2008 - 2010 Speech Technology Media, a division of Information Today, Inc. About/Contacts | PRIVACY POLICY