In our continuing examination of speech technology in popular culture, I bring to you one of the weirder things we’ve stumbled across.
You’re all doubtlessly familiar with Loquendo TTS work my brother, Adam B., does every week for our news items on the mother site, but apparently its popular use runs far deeper. Our investigations show that there is an entire community of Spanish-speaking YouTube users who are using Loquendo’s TTS to make video críticas or “criticisms. The críticas are rants chockfull of curses and insults leveled on their subject which range from Dragonball Z to emo kids, a subculture of much maligned, droopy-haired teeners who patron a genre of existentially sentimental rock music also known as “emo,” that are delivered by Loquendo TTS products.
The videos are pretty similar to one another and vary only in target. All, as best we can tell, use the Castilian Spanish male voice font, “Jorge,” and many seem to take advantage of the free demo that Loquendo offers on their website, marked as such because they have this creepy music that Loquendo puts in the background of their demo files–a chorus of synthetic TTS sirens singing Loquendo! over and over again. You’re going to want to check that out for yourself.
For the most part, críticas level their attacks at popular television institutions House, Pokemon, and the Disney Channel (there seems to be an entire subgenre of just Disney Channel críticas–I found heaps of them on YouTube), but there are also some vaguely offensive works like “How to Seduce a Woman,” (in which the narrator explains the importance of body language) and other such lessons.
Here’s a typical specimen:
[youtube]http://www.youtube.com/watch?v=ERueevnv-YU&feature=related[/youtube]
While these things are easy to write off as just a bunch of immature adolescents or, at worst, adults slagging around, críticas still offer some insights about the future of speech.
We mostly think of Loquendo’s TTS offerings as being business oriented, allowing companies to generate a spoken interface on-the-fly in IVR phone systems or whatever. In these instances, the software essentially allows a big company, an abstract collective, to give a single voice, separate from any real living entity in the world, to itself. This, however, cuts right back in the other direction, and allows individuals to deliver spoken messages anonymously, assuming a synthetic and collectively created voice. Individuals can hide their gender, their age, their nationality-any number of things which might be inadvertently revealed in the expression of their biological voice.
The anonymity in críticas, pretty much authorizes users to curse left and right and traffic in the most backward kind of homophobia. That is, it lets them spout all the words they wouldn’t normally say in public–like a vocalized internet flame–so you get a lot of puta madre this and that. In point of fact, just about every other word in these videos is puta, a Spanish insult for a female sex-worker. The TTS writers get pretty creative with it, transmogrifying the word into just about every noun, adjective, and verb form imaginable. Putanizada, putilla, and putón are just some of the kinds of the flourishes that they luxuriate in.
Also interesting, the otherwise coarse language is couched in fairly complex grammatical structures. The work of one 2Alfredo2, in particular, makes heavy use of interjected clauses. When combined with colorful grammatical plays on common Spanish insult words like jilipolla, the overall effect is both formal and pruriently vulgar. It kind of sounds like a high school English essay gone wrong.
Granted, this work is pretty limited in scope to say the least. There’s only so many kicks you can get out of listening to a machine tell off Hannah Montana with every curse word out of the Real Academia Diccionario de Palabortas. But TTS is an artistic medium in its infancy. There is real potential for the anonymity afforded to users to do good.
TTS might allow human rights activists under repressive regimes, and other marginalized voices, to express their deepest feelings without compromising themselves. It, moreover, gives them access to auditory-dependent media infrastructures like podcasting. Likewise, the cadences in TTS are still, for all Loquendo’s immense advancements in recent years, still sometimes jerky, and, in their strangeness, embody a certain set of aesthetic values that can probably be capitalized by artists willing to engage with them.
There are also potentially harmful effects. The technology could be used to anonymize all sorts of ill-intent and to dissemination any hateful message one might care to pass along. TTS is just a tool, like any other, but as speech starts making its way out of the rarified corridors of business, it is likely to begin to be plied for all sorts of artistic and political ends.
If you have your own TTS art you’d like to share, please leave us a comment!

Eric B. —
March 25, 2009 @ 10:42 am
You may have read the recent
If you read yesterday’s Speech Technology
Good morning! This post is coming to you super-early, because I have way too much to do later today. So, before my boss creeps in, here’s all the news for the week so far. Our faithful, intrepid managing editor Len hath returned from the Genesys G-Force conference (read his posts about G-Force
The Voice Search Conference winds down around 5pm tonight, but by then I’m on the flight back to NY.