Translation and Control (Of Information)

Here are some links I've found in a morning of web-wide wanderings.

CIA interest in blogs seems to be rising.

Machine Translation is beginning to develop phrase-based systems, supposedly to help eliminate ambiguity.

What’s the connection? Both seem to me to open the doors for some pretty scary stuff. The CIA monitoring blog content is one thing, but with the rising rightwing’s neglect of things like, say, civil rights and freedoms, I think the possibility of Chinese-styled censorship is not necessarily a thing only of SF… not in the mid-range future, anyway. Of course, it needn’t be full-on suppression, just creative approaches to reducing the visibility and searchability of privately produced content that doesn’t agree with the oligarchic agenda.

And what about this translation stuff? I’ve heard a lot of chatter for years about how instantaneous Machine Translation is the holy grail of translation technology. I don’t buy it! The problem with instantaneous translation by machine is that it cannot clarify. Watching the translators at the film festival this week, translating questions by audience members into languages that directors speak, and translating answers into Korean (and sometimes into English), I was struck by the amount of clarification that the translators needed. They checked on all kinds of things, from specific words to the intended meanings of words and the scopes of those meanings. Sometimes they needed something rephrased in order to be sure of how to translate it, and sometimes they seemed unsure of the connection between two parts of an answer and needed to make sure they understood before conveying the intended meaning of a statement.

This shows us something absolutely crucial to the issue of translation: translation involves the transmission of meanings. That is, it takes sets of concepts that have been rendered transmissible within one set of arbitrary signs (such as English words) and melts them down to their signified meanings; thereafter, the translation is a re-casting of the same concepts in another set of arbitrary signs (such as Korean words). The fact that the translator’s understanding of the ideas conveyed by the content is so crucial suggests to me that a machine that cannot understand cannot truly translate.

Secondly, the fact that a phrase-based translation system is envisaged seems dangerous to me. There are, after all, only a limited number of phrases one can punch into the database. Sometimes idioms have direct translations, such as “Yuyu Sangjong” in Korean and “Birds of a feather flock together” in English. But whole phrases? It seems to me this will profoundly limit the kinds of things that will be readable across a language barrier, and it’s not too big a leap of the imagination to think that the selection of which phrases get translative priority will probably be, at least in part, determined by someone’s political agenda(s). The nightmare of Orwell’s Newspeak from the novel 1984 is really unlikely as a spoken language, but if a “translatability standard” is set up for the Web (as Ray Kurzweil suggests is possible in some of his writings) then the kinds of things we can say will be limited not only for foreign readers in other languages but also in the original document itself.

And no, this is not better than the document being inaccessible as it would have been before translation tech came along, because the combination of limited documents and seemingly powerful translation tech that itself necessitates limited content is a really dangerous thing.

So I think, for the time being, we’d better stick to multilinguality and to using human translators: they may also have political agendas, but those are individual, and not implicit in the whole process of translation itself. It’s a lot safer to let humans do this very human work.

3 thoughts on “Translation and Control (Of Information)”

  1. translation or interpretation?
    neither are sciences and one is less scientific than the other. machines make the mistakes humans tell them to make.

  2. What rob said does apply: translation and interpretation are two different things, just as spoken and written language are two different things. Spoken language generally requires more clarification than written language because you usually have one chance and one chance only to get it right in writing–thus people take more care in writing than they do in speaking. Well, that’s the theory, anyway.

    The “one is less scientific than the other” part, though, sounds to me like an expression of the bias toward written language as being more academic/scientific than the spoken language. As a student of oral literature, I would have to disagree. There is just as much “science” to spoken language as there is to written language.

    And as a translator, of course, I’d like to think that computers will never be able to take my place…

  3. interesting that you should find the bias on the side of written contra spoken, charles.
    i wonder whether i placed it there or you did ;-)

    i tend to think that oral communication is much more complete than written, with one proviso. oral communication between two (or more) people in the same physical space tends to be much richer than tele- or written communication. some interesting studies have been done of humans communicating, as we are now, and how they physically behave. our faces are deeply animated as we speak to one another in the same physical space; remove the two people from each other’s ‘sight’ and our faces become much less animated, our eyes less focused and more ‘restless’ and our posture and motion much less vocal, so to speak.
    remove those two people further, in space and time, and you would find that my face right now is entirely expressionless, with the only movement my eyes make being between screen and keyboard; my hands are more than obviously employed.
    the language i use is less inflected, tends to a more reticent tone and is careful to clarify itself as thoughtful, non-antagonistic and socially capable. i’ve even used a smiley.
    one researcher considered the use of smilies as a given of any advanced social usage of the internet, a necessary shorthand that the written word lacks in comparison to the spoken word.
    advances in high speed broadband and a proliferation of webcameras would be and are the next logical evolution, away from writing and back to spoken (and visual) social interaction.

    the written word tends to lend itself to exploitation by those wishing to track the usage of words and phrases and their context. google’s built an empire upon it, and i’m sure the security agencies of many nations will build programs that sift out words critical to their mission, yet any program of that ilk would surely have to take into account the ‘body language’ (smileys, full caps, lowercase, abbreviations and so on) being used by the writer for an accurate interpretation to be made.

    it’ll be interesting to watch develop.


