+  RHDN Forum Archive
|-+  Romhacking
| |-+  ROM Hacking Discussion
| | |-+  Monster Hunter Dos english translation
Pages: [1]
Author Topic: Monster Hunter Dos english translation  (Read 2 times)
burango
Guest
« on: December 01, 2009, 07:25:50 pm »

Hi guys  Grin
during my work I ran into 'chewed' sections of dialogues... I don't know if they are compressed or what... this is what I mean:
ボタンで「ケ。instead of ボタンで出発です。as it appears in the game.
Nevertheless I tried to translate this section but the game gone mad...
have you some hints about this?  :huh:
Tauwasser
Guest
« Reply #1 on: December 01, 2009, 07:42:19 pm »

So what are you viewing the text with? A regular hex editor set to CP932 or what? Have you checkes Shift JIS and EUC-JP yet?
It might be worthwhile if you posted the full hexadecimal sequence instead of a string, since that undergoes many conversions until it's here.
Right now your string is U+30DC, U+30BD, U+30F3, U+3067, U+FF62, U+0012, U+FF79, U+0010, U+3002. I gather this is probably not what you are seeing or what you are converting from.

cYa,

Tauwasser
burango
Guest
« Reply #2 on: December 02, 2009, 02:37:11 pm »

Thanks Tauwasser for your reply, yes I'm using the CP392 set with Mad Edit
I wasn't able to post here the image I made from PCSX2 and my hex editor...  Shocked
well... it doesn't matter
I have different values in my hex editor:

83 7B = ボ
83 5E = ã‚¿
83 93 = ン
82 C5 = で
A2 12 B9 10  = 「ケ   
81 42 = 。

instead of what I see in the game:

83 7B = ボ
83 5E = ã‚¿
83 93 = ン
82 C5 = で
(8F 6F) = 出
(94 AD) = 発
(82 C5) = で
(82 B7) = す
81 42 = 。

So I took the game has encrypted the section 8F 6F 94 AD 82 C5 82 B7 with A2 12 B9 10 (yes... in the editor they aren't 00 A2 00 12... etc.)
this seems really odd to me... only the dialogue section is so randomly encrypted... for the rest of the game I haven't encountered any problem (I already released 4 patches for this game)

村の‚PヘれアUA
~B00ボタンで「ケ。
気h   îï¾‚けて忽ォて
-)カAり待gワ~す

this is the entire section and as you can see is completely messed up...  :huh:
Tauwasser
Guest
« Reply #3 on: December 02, 2009, 04:41:01 pm »

It looks like it may be a dictionary. 出発 and です are both very common words on their own, so they would merit a dictionary entry, while ボタン might not. If you can access the part via a savestate or something, try editing the A212 to A213 and see if a whole different word appears. If so, search for the word 出発 and the other word in the file and see if you can find the dictionary. Right now it would seem that both values are much much higher than the standard 81-83 range the other stuff uses.

I checked with the other tables I mentioned and they didn't return any positives, also the data length you have there would suggest it is not a clever compession per sé.

cYa,

Tauwasser
burango
Guest
« Reply #4 on: December 03, 2009, 03:38:53 pm »

Editing the A212 to A213 I had がた instead of 出発  :thumbsup: this shows me that the 'chewed' section can be modified somehow... my previous tests about this were too radical
I've done the same thing with B910 with no results... I'm starting to think that this section is only a part 1 of the whole section and I need to discover yet the part 2 to comlplete the 'grid'... in this case I can't translate it manually but I need an interface...
I tried to seek the dictionary as you suggested but I found nothing... sometimes 出発 are encoded sometimes not
...however I'll make some attempts with the rest of the section I posted to try to figure out what to do... thanks  Smiley
Tauwasser
Guest
« Reply #5 on: December 03, 2009, 03:52:23 pm »

Simply change B910 to A212 to confirm the dictionary compression. After that, it's just finding the dictionary or getting a debugger or something for DOS.

cYa,

Tauwasser
burango
Guest
« Reply #6 on: December 05, 2009, 11:07:40 am »

Ok  Wink do you know where I can find some LZSS or LZR decompression tools?
aishsha
Guest
« Reply #7 on: December 05, 2009, 09:16:35 pm »

Quote from: burango on December 05, 2009, 11:07:40 am
Ok  Wink do you know where I can find some LZSS or LZR decompression tools?
Nowhere. There's no universal tool for that. As for dictionary: find the dictionary, see the pattern - change it.
Tauwasser
Guest
« Reply #8 on: December 06, 2009, 12:17:16 pm »

Just because you have a dictionary doesn't mean it uses dictionary compression. That's something totally different. Dictionary compression usually builds its own dictionary while decompression and reuses parts of the stream that it has already written. Aishsha is right that there is no universal tool for that, though there are some specialized for many games (I have like three that I coded myself, it's really not hard).
What you are looking for is the dictionary. That is a table in the game that contains the words written as usual ans is usually referenced by pointers (some do have a static size,so every x bytes will be another entry etc.).

cYa,

Tauwasser
burango
Guest
« Reply #9 on: December 13, 2009, 03:37:51 am »

Thanks for your time Tauwasser, still I don't think it's a dictionary... depending on I put the A212 the results are too different, sometimes the game freezes
This is what I see on the screen during the game:

村のはじっこから、
~B00ボタンで出発です。
気をつけて行ってきて
くださいね。
お帰り待ってま~す!

So the theoric string is:

91BA 82CC 82CD 82B6 82C1 82B1 82A9 82E7 8141
7E 42 30 30 837B 835E 8393 82C5 8F6F 94AD 82C5 82B7 8142
8B43 82F0 82C2 82AF 82C4 8D73 82C1 82C4 82AB 82C4
82AD 82BE 82B3 82A2 82CB 8142
82A8 8B41 82E8 91D2 82C1 82C4 82DC 8160 82B7 8149

while I have in the hex:

村の‚PヘれアUA
~B00ボタンで「ケ。
気h   îï¾‚けて忽ォて
-)カAり待gワ~す

91BA 82CC 82 18 50 CD 82 EA 10 B1 82 74 16 41
7E 42 30 30 837B 835E 8393 82C5 A2 12 B9 10 8142
8B43 82 88 09 F0 82 C2 82 AF 82C4 8D 9A 1E AB 82C4
2D 29 B6 18 41 82 E8 91 D2 82 87 17 DC 81 60 82 B7 81 49

I think it's a strange partial encryption... the game uses at times to insert in the string 2 hexadecimal like in first Monster Hunter ( 82 18 50 CD for 82CD or 82 88 09 F0 for 82F0 ) allowing however to modify the characters, sometimes uses to convert the characters in some 'untouchable' hexadecimals ( changing them all the dialogues are altered not only those changed), modifying occasionally the hex reading like in the 4° row, where the entire string is readen like something different but the hexadecimals are most the same...

I hope I made a quite 'clear' explanation of the problem I ran into but... well it's quite a mess Shocked
 
Pages: [1]  


Powered by SMF 1.1.4 | SMF © 2006-2007, Simple Machines LLC