Same sound?EusthEnoptEron wrote:I wonder why on earth Google replaces "死" with "市" in this test.
死はそれがないと言っていますよ!!
Just for Fun: Bidirectional Multi-passes of Google Translate
Moderators: thelastguardian, Fringe Security Bureau, Senior Editors, Senior Translators, Alt. Language Translator/Editor, Executive Council, Project Translators, Project Editors
- ainsoph9
- Osaka-ben Gaijin-Sama
- Posts: 13824
- Joined: Fri Jan 12, 2007 11:30 am
- Favourite Light Novel: Ahouka!
- Location: leave a message at the beep
Re: Just for Fun: Bidirectional Multi-passes of Google Trans
- EusthEnoptEron
- Project Translator
- Posts: 836
- Joined: Mon Jul 13, 2009 10:39 am
- Favourite Light Novel: Ahouka!
- Location: Switzerland
Re: Just for Fun: Bidirectional Multi-passes of Google Trans
Yeah, but from a technical perspective it makes no sense to use another kanji if you're given it, as they presumably search for the meaning by the kanji and not its sound—unless there is some kind of auto-correction function, which I consider rather useless at the current level of quality.ainsoph9 wrote:Same sound?EusthEnoptEron wrote:I wonder why on earth Google replaces "死" with "市" in this test.
死はそれがないと言っていますよ!!
- ainsoph9
- Osaka-ben Gaijin-Sama
- Posts: 13824
- Joined: Fri Jan 12, 2007 11:30 am
- Favourite Light Novel: Ahouka!
- Location: leave a message at the beep
Re: Just for Fun: Bidirectional Multi-passes of Google Trans
From what I can tell, the program runs on an iteration basis, which can have one of two effects.
1. It improves the translation.
2. It compounds the mistakes of the first mis-translation.
1. It improves the translation.
2. It compounds the mistakes of the first mis-translation.
- larethian
- I.D.S.E Humanoid Interface [LSB]
- Posts: 2191
- Joined: Tue Aug 31, 2010 10:50 pm
- Favourite Light Novel:
Re: Just for Fun: Bidirectional Multi-passes of Google Trans
there may be a neural network or other AI system behind that not just parses blindly. it might take into account that the user makes a typo or is unsure of the actual kanji, and figured that the sentence will not make sense if the word 'death' is used.
- Mystes
- Heaven's Blade Successor
- Posts: 15932
- Joined: Thu Aug 05, 2010 6:54 am
- Favourite Light Novel:
- Contact:
Re: Just for Fun: Bidirectional Multi-passes of Google Trans
That's more what I think.larethian wrote:there may be a neural network or other AI system behind that not just parses blindly. it might take into account that the user makes a typo or is unsure of the actual kanji, and figured that the sentence will not make sense if the word 'death' is used.
Kira0802
#campione at rizon for some #campione discussions~~ And other stuffs.
#campione at rizon for some #campione discussions~~ And other stuffs.
- rpapo
- I.D.S.E Humanoid Interface [LSB]
- Posts: 1530
- Joined: Mon Dec 21, 2009 5:15 am
- Favourite Light Novel: Ahouka!
- Location: Michigan, USA
- Contact:
Re: Just for Fun: Bidirectional Multi-passes of Google Trans
From an e-mail conversation with Mr. Jim Breen (the guy behind WWWJDICT), I was told the following:kira0802 wrote:That's more what I think.larethian wrote:there may be a neural network or other AI system behind that not just parses blindly. it might take into account that the user makes a typo or is unsure of the actual kanji, and figured that the sentence will not make sense if the word 'death' is used.
My parser contains the Juman code (version 5.1), but I use it to generate an alternate parsing of the text. The interesting thing I find is that where Juman misses, so does Google. That leads me to believe that the current Google code incorporates the Juman code.Back in the 80s parsing Japanese was considered to be too hard for computers.
Then Nagao showed with Juman that it could be done fairly reliably if you use a bit of science (Juman uses the Hidden-Markov-Model AI technique.) Things have moved on a bit with its successors - MeCab uses a combination of HMMs and CRFs (Complex Random Fields), the latter being one of the latest bits of AI applied to machine-learning.
Google used to use a commercial parser (http://www.basistech.com/base-linguistics/) but moved to an in-house one for Japanese. Taku Kudo, who wrote MeCab, now works for Google }
My parser uses the WWWJDICT file EDICT as the basic dictionary, so it has a much larger vocabulary than Juman, but it is nowhere near as smart. It uses a "hit it with a bigger hammer" approach. Both approaches have their pluses and their minuses.
- ainsoph9
- Osaka-ben Gaijin-Sama
- Posts: 13824
- Joined: Fri Jan 12, 2007 11:30 am
- Favourite Light Novel: Ahouka!
- Location: leave a message at the beep
Re: Just for Fun: Bidirectional Multi-passes of Google Trans
So, it might be somewhat more ideal to see a cross between the two.
- rpapo
- I.D.S.E Humanoid Interface [LSB]
- Posts: 1530
- Joined: Mon Dec 21, 2009 5:15 am
- Favourite Light Novel: Ahouka!
- Location: Michigan, USA
- Contact:
Re: Just for Fun: Bidirectional Multi-passes of Google Trans
Perhaps. I am not an ivory tower computer person, but rather a practical one. I can imagine a program with a large dictionary and set of rules, and a collection of "hints" to help it out of hard spots. But I have relatively limited free time, and I've chosen, for the time being, to work on translating the hard way to improve my own skills. It rather helps to know how to do something yourself before you try and teach a computer to do it . . .ainsoph9 wrote:So, it might be somewhat more ideal to see a cross between the two.
- Mystes
- Heaven's Blade Successor
- Posts: 15932
- Joined: Thu Aug 05, 2010 6:54 am
- Favourite Light Novel:
- Contact:
Re: Just for Fun: Bidirectional Multi-passes of Google Trans
I noticed that the auromatic trasnlators place the time clause in the fornt.
Kira0802
#campione at rizon for some #campione discussions~~ And other stuffs.
#campione at rizon for some #campione discussions~~ And other stuffs.
- ainsoph9
- Osaka-ben Gaijin-Sama
- Posts: 13824
- Joined: Fri Jan 12, 2007 11:30 am
- Favourite Light Novel: Ahouka!
- Location: leave a message at the beep
Re: Just for Fun: Bidirectional Multi-passes of Google Trans
That is quite true. Sometimes, I do get the feeling that translation programs among other things are not done by the foremost experts in the field.rpapo wrote:Perhaps. I am not an ivory tower computer person, but rather a practical one. I can imagine a program with a large dictionary and set of rules, and a collection of "hints" to help it out of hard spots. But I have relatively limited free time, and I've chosen, for the time being, to work on translating the hard way to improve my own skills. It rather helps to know how to do something yourself before you try and teach a computer to do it . . .ainsoph9 wrote:So, it might be somewhat more ideal to see a cross between the two.
- Mystes
- Heaven's Blade Successor
- Posts: 15932
- Joined: Thu Aug 05, 2010 6:54 am
- Favourite Light Novel:
- Contact:
Re: Just for Fun: Bidirectional Multi-passes of Google Trans
Though if automatic translaion was perfect, translators would lose thier jobs.
Kira0802
#campione at rizon for some #campione discussions~~ And other stuffs.
#campione at rizon for some #campione discussions~~ And other stuffs.
- ainsoph9
- Osaka-ben Gaijin-Sama
- Posts: 13824
- Joined: Fri Jan 12, 2007 11:30 am
- Favourite Light Novel: Ahouka!
- Location: leave a message at the beep
Re: Just for Fun: Bidirectional Multi-passes of Google Trans
No, they would become computer programmers.
- Mystes
- Heaven's Blade Successor
- Posts: 15932
- Joined: Thu Aug 05, 2010 6:54 am
- Favourite Light Novel:
- Contact:
Re: Just for Fun: Bidirectional Multi-passes of Google Trans
Not my cousins though.
Kira0802
#campione at rizon for some #campione discussions~~ And other stuffs.
#campione at rizon for some #campione discussions~~ And other stuffs.