Just for Fun: Bidirectional Multi-passes of Google Translate

ainsoph9
Osaka-ben Gaijin-Sama
Posts: 13824
Joined: Fri Jan 12, 2007 11:30 am
Favourite Light Novel: Ahouka!
Location: leave a message at the beep

Re: Just for Fun: Bidirectional Multi-passes of Google Trans

Post by ainsoph9 »

EusthEnoptEron wrote:I wonder why on earth Google replaces "死" with "市" in this test.

死はそれがないと言っていますよ!!
Same sound?
EusthEnoptEron
Project Translator
Posts: 836
Joined: Mon Jul 13, 2009 10:39 am
Favourite Light Novel: Ahouka!
Location: Switzerland

Post by EusthEnoptEron »

ainsoph9 wrote:
EusthEnoptEron wrote:I wonder why on earth Google replaces "死" with "市" in this test.

死はそれがないと言っていますよ!!
Same sound?
Yeah, but from a technical perspective it makes no sense to use another kanji if you're given it, as they presumably search for the meaning by the kanji and not its sound—unless there is some kind of auto-correction function, which I consider rather useless at the current level of quality. :D

Post by ainsoph9 »

From what I can tell, the program works iteratively, feeding each pass's output into the next, which can have one of two effects.

1. It improves the translation.
2. It compounds the mistakes of the first mis-translation.
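
To make the second effect concrete, here is a minimal sketch of a bidirectional multi-pass loop. The two "translators" are toy lookup tables invented for illustration (they are not Google's API, and `multi_pass`, `toy_forward`, and `toy_backward` are hypothetical names). The toy forward table deliberately maps both homophones to "city", so the first mistake gets locked in on every later pass.

```python
from typing import Callable, List

def multi_pass(text: str,
               forward: Callable[[str], str],
               backward: Callable[[str], str],
               max_passes: int = 10) -> List[str]:
    """Round-trip text through forward/backward until it stops changing."""
    history = [text]
    for _ in range(max_passes):
        text = backward(forward(text))
        if text == history[-1]:
            break  # fixed point: further passes change nothing
        history.append(text)
    return history

# Toy tables, made up for this sketch only.
JA_TO_EN = {"死": "city", "市": "city"}  # the forward pass confuses the homophones
EN_TO_JA = {"city": "市"}

def toy_forward(text: str) -> str:
    return JA_TO_EN.get(text, text)

def toy_backward(text: str) -> str:
    return EN_TO_JA.get(text, text)

history = multi_pass("死", toy_forward, toy_backward)
print(history)  # ['死', '市']
```

After one round trip the input has turned into "市", and every further pass reproduces the same wrong answer, which is the "compounding" case in miniature.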
larethian
I.D.S.E Humanoid Interface [LSB]
Posts: 2191
Joined: Tue Aug 31, 2010 10:50 pm
Favourite Light Novel:

Post by larethian »

There may be a neural network or some other AI system behind it that does not just parse blindly. It might allow for the user having made a typo or being unsure of the actual kanji, and figure that the sentence would not make sense if the word 'death' were used.
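
A rough sketch of what such a correction step could look like, purely as a guess at the idea and not a description of Google's system: collect the kanji that share a reading with the input, then keep whichever one scores best in context. The reading table and the context scores below are invented numbers, and `homophones` and `pick_by_context` are hypothetical helpers.

```python
from typing import Dict, List, Tuple

# Hypothetical readings and context scores, made up for illustration only.
READINGS: Dict[str, str] = {"死": "shi", "市": "shi", "氏": "shi", "犬": "inu"}
CONTEXT_SCORE: Dict[Tuple[str, str], float] = {
    ("市", "は"): 0.6,
    ("死", "は"): 0.3,
    ("氏", "は"): 0.1,
}

def homophones(kanji: str) -> List[str]:
    """All known kanji sharing a reading with `kanji` (itself included)."""
    reading = READINGS.get(kanji)
    if reading is None:
        return [kanji]
    return [k for k, r in READINGS.items() if r == reading]

def pick_by_context(kanji: str, next_token: str) -> str:
    """Keep the homophone with the highest (toy) score before `next_token`."""
    return max(homophones(kanji),
               key=lambda k: CONTEXT_SCORE.get((k, next_token), 0.0))

print(pick_by_context("死", "は"))  # 市, because the toy scores prefer it
```

With these made-up scores the given kanji gets overridden by a same-sounding one, which would explain the swap; whether the real system does anything like this is only speculation.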
Mystes
Heaven's Blade Successor
Posts: 15932
Joined: Thu Aug 05, 2010 6:54 am
Favourite Light Novel:
Contact:

Post by Mystes »

larethian wrote:There may be a neural network or some other AI system behind it that does not just parse blindly. It might allow for the user having made a typo or being unsure of the actual kanji, and figure that the sentence would not make sense if the word 'death' were used.
That's more what I think.
Kira0802

#campione at rizon for some #campione discussions~~ And other stuffs.
rpapo
I.D.S.E Humanoid Interface [LSB]
Posts: 1530
Joined: Mon Dec 21, 2009 5:15 am
Favourite Light Novel: Ahouka!
Location: Michigan, USA
Contact:

Post by rpapo »

kira0802 wrote:
larethian wrote:There may be a neural network or some other AI system behind it that does not just parse blindly. It might allow for the user having made a typo or being unsure of the actual kanji, and figure that the sentence would not make sense if the word 'death' were used.
That's more what I think.
From an e-mail conversation with Mr. Jim Breen (the guy behind WWWJDICT), I was told the following:
Back in the 80s parsing Japanese was considered to be too hard for computers.
Then Nagao showed with Juman that it could be done fairly reliably if you use a bit of science (Juman uses the Hidden-Markov-Model AI technique.) Things have moved on a bit with its successors - MeCab uses a combination of HMMs and CRFs (Conditional Random Fields), the latter being one of the latest bits of AI applied to machine-learning.

Google used to use a commercial parser (http://www.basistech.com/base-linguistics/) but moved to an in-house one for Japanese. Taku Kudo, who wrote MeCab, now works for Google 8-)}
My parser contains the Juman code (version 5.1), but I use it to generate an alternate parsing of the text. The interesting thing I find is that where Juman misses, so does Google. That leads me to believe that the current Google code incorporates the Juman code.

My parser uses the WWWJDICT file EDICT as the basic dictionary, so it has a much larger vocabulary than Juman, but it is nowhere near as smart. It uses a "hit it with a bigger hammer" approach. Both approaches have their pluses and their minuses.
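
For anyone curious what the "bit of science" amounts to, here is a minimal sketch of the general idea behind cost-based parsers: list every dictionary word that could start at each position, then pick the split with the lowest total cost using dynamic programming (Viterbi-style). This is not the Juman or MeCab code, and it is not my parser either. The costs are invented, `segment` is a hypothetical helper, and real analyzers also charge a cost for each pair of adjacent words, which is left out here.

```python
from typing import Dict, List, Optional, Tuple

def segment(text: str, word_cost: Dict[str, int]) -> Optional[List[str]]:
    """Return the lowest-total-cost split of text into dictionary words,
    or None if no split exists."""
    n = len(text)
    max_len = max(map(len, word_cost), default=0)
    # best[i] = (cost of the cheapest split of text[:i], start of its last word)
    best: List[Optional[Tuple[int, int]]] = [None] * (n + 1)
    best[0] = (0, 0)
    for end in range(1, n + 1):
        for start in range(max(0, end - max_len), end):
            word = text[start:end]
            if best[start] is None or word not in word_cost:
                continue
            cost = best[start][0] + word_cost[word]
            if best[end] is None or cost < best[end][0]:
                best[end] = (cost, start)
    if best[n] is None:
        return None
    # Walk the back-pointers from the end of the text to recover the words.
    words, end = [], n
    while end > 0:
        start = best[end][1]
        words.append(text[start:end])
        end = start
    return words[::-1]

# Toy costs, made up for this sketch: lower means "more plausible".
TOY_COSTS = {"東": 5, "京": 5, "都": 3, "東京": 2, "京都": 2}
print(segment("東京都", TOY_COSTS))  # ['東京', '都'] (cost 5 beats 東 + 京都 at cost 7)
```

A bigger dictionary gives the search more candidate words to choose from, while better costs make it choose more sensibly among them, which is roughly the trade-off between the two approaches.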

Post by ainsoph9 »

So, it might be somewhat more ideal to see a cross between the two.

Post by rpapo »

ainsoph9 wrote:So, it might be somewhat more ideal to see a cross between the two.
Perhaps. I am not an ivory tower computer person, but rather a practical one. I can imagine a program with a large dictionary and set of rules, and a collection of "hints" to help it out of hard spots. But I have relatively limited free time, and I've chosen, for the time being, to work on translating the hard way to improve my own skills. It rather helps to know how to do something yourself before you try and teach a computer to do it . . .

Post by Mystes »

I noticed that the automatic translators place the time clause at the front.

Post by ainsoph9 »

rpapo wrote:
ainsoph9 wrote:So, it might be somewhat more ideal to see a cross between the two.
Perhaps. I am not an ivory tower computer person, but rather a practical one. I can imagine a program with a large dictionary and set of rules, and a collection of "hints" to help it out of hard spots. But I have relatively limited free time, and I've chosen, for the time being, to work on translating the hard way to improve my own skills. It rather helps to know how to do something yourself before you try and teach a computer to do it . . .
That is quite true. Sometimes, I do get the feeling that translation programs, among other things, are not made by the foremost experts in the field.

Post by Mystes »

Though if automatic translation were perfect, translators would lose their jobs.

Post by ainsoph9 »

No, they would become computer programmers.

Post by Mystes »

Not my cousins though.