Page 7 of 144

Re: Homophonic substitution

PostPosted: Sat Aug 08, 2015 2:41 pm
by daikon
Jarlve wrote:I can't even solve your "unwildcarded" cipher with AZdecrypt, it probably suffers from the same problem as daikon3 (word entropy, same words repeating over and over again). This troubles me a bit.


I would guess the reason for this cipher being harder to solve is that it's the song lyrics. It doesn't have proper sentences. It has words next to each other that usually wouldn't pair, like "blowing blowing". It has contractions that are usually not present in a written text ([actin'] and ['scuse]). Basically, it is likely to be underrepresented in the corpus that was used to create the N-gram stats.

Jarlve wrote:With ZKDecrypto or your own solver? I can't seem to solve it with AZdecrypt096.


My own. It takes a few restarts, but it converges on the correct solution about every 5th restart. The "hard cipher" benchmarks, tonyb 1 and 2, both require more restarts for my solver, so it's not the hardest.

Re: Homophonic substitution

PostPosted: Sat Aug 08, 2015 2:46 pm
by smokie treats
PH.key.stats.png


I don't exactly understand some of the things that you are talking about, but I started out usually masking the bigrams near the top of the message in general, until I realized that I needed to distribute them a bit more evenly. I also generally substituted the first symbol in the bigram and not the second symbol, if that helps.

Maybe I can do another one without song lyrics later. Purple Haze has 339 letters, so it was a good fit, and I couldn't think of anything else at the time.

Re: Homophonic substitution

PostPosted: Sat Aug 08, 2015 2:51 pm
by daikon
smokie treats wrote:I started with symbol 37 and kept working with that for a while until trying to mask a bigram was easy with 37. Then I switched to 49, then to 51, then to 59. The first several of each wildcard symbol is really easy. Most of it is pretty easy.


So it was mostly random. I was hoping your process would create a cycle of 37, 49, 51, 59, 37, 49, 51, 59, etc. throughout the cipher, which could be detected. But now that I think about it, it is rather unlikely as it would mean each wildcard would be present roughly the same number of times.

Re: Homophonic substitution

PostPosted: Sat Aug 08, 2015 4:40 pm
by smokie treats
Jarlve, there were three high count 1:1.

13 maps to M, count 18;
17 maps to R, count 15;
24 maps to Y, count 14.

Yes, it was random. It would be fairly difficult to select the wildcards in the masking process so that they would show up in the message as a cycle because the bigrams just appear in different places. But, the +, the B, and the F don't cycle well with other symbols. This process would explain why.

If I am right about Zodiac using the wildcards to mask the bigram repeats (Jarlve thought of it before I did), then it may be possible to verify. A + where a cycle symbol is missing will be sitting between two adjacent symbols. We may be able to trace to the other bigram that was not masked using those adjacent symbols. We may be able to reconstruct the cycles with some sort of a tracing process.

EDIT: Here would be an example. Cycle 6 30 37, which starts in row 1, ends in row 20, and is missing only one 30. Maybe the most reliable cycle that we have. Identify the possible wildcards for 30 where 30 is missing. Then check where 30 ( the * ) is elsewhere in the message and compare adjacent symbols. Was the 51 here a 30 before Zodiac masked the bigram 29 30? What are the odds of finding two adjacent 29's like that (I don't know)?

bigraphtracing30.png

What about comparing all trigrams where + is in the middle to all of the trigrams where the B's and F's are in the middle. See if there is any evidence that he could have shifted wildcards to avoid creating a new bigram.

Re: Homophonic substitution

PostPosted: Sat Aug 08, 2015 6:21 pm
by smokie treats
I made another post that didn't post, but it was short.

Here is another prospective sister bigram that I found right away. It's for cycle 16 14. EDIT 16 40 - a typo; I only found one of two.

bigram.40.png


I wonder if further study of prospective sister bigrams can show a statistical significance?

Smokie

Re: Homophonic substitution

PostPosted: Sat Aug 08, 2015 7:39 pm
by smokie treats
Here, two out of three prospective sister bigrams for cycle 11 36, which starts in Row 1 and ends in Row 20:

bigraphs.11.36.png


You guys, I want to get this solved.

Re: Homophonic substitution

PostPosted: Sun Aug 09, 2015 4:15 am
by Jarlve
I decided to take a break for a while. Everything is okay, just proctecting myself from overextension. May the doge be with you.

Re: Homophonic substitution

PostPosted: Sun Aug 09, 2015 4:19 am
by glurk
Wow. Such exile. So sad. Much sleeping.

Re: Homophonic substitution

PostPosted: Sun Aug 09, 2015 5:03 am
by Jarlve
glurk wrote:Wow. Such exile. So sad. Much sleeping.

Hehehe :D

Re: Homophonic substitution

PostPosted: Sun Aug 09, 2015 6:17 am
by smokie treats
No problem, Jarlve. I was actually thinking about taking a break myself. Regroup and refresh. Sometimes a break has a really good positive effect because you have the time to quietly mull things over for a while and come up with good new ideas. I should probably hold my horses on the "prospective sister bigram cycle reconstruction" idea. There are just so many uncertainties. Thanks for making the suite, there is plenty for me to do when you are gone if I feel like working.