I made a spreadsheet with all of Jarlve's 100 plaintext messages, which can be found here:
viewtopic.php?f=81&t=2435Then I made it so that I adjust two variables, the minimum count of plaintext in a message and the maximum count of plaintext that land on prime positions.
For example, if I make the minimum count of plaintext 24 and the maximum count that land on primes 1, then here are the results:
There are 542 plaintext that have a count of 24 or more. They are all high frequency plaintext, as can be expected. A, E, H, I, N, O, R, S and T. And there are a few messages that have equal to or more than 24 of D and L.
Of the 542 occurrences where there are minimum 24 plaintext in a message, there are only 3 where 1 or fewer of those plaintext land on primes.
Upper left is message #27, where there are 24 of the letter O, and only 1 lands on a prime position.
Upper right is message #84, where there are 32 of the letter A, and only 1 lands on a prime.
Lower left is message #93, where there are 25 of the letter H, and 0 lands on a prime.
plaintext.100.primephobia.1.png
So that is 3% of the messages, or 3 / 542 = 0.6% of the occurrences where there a minimum of 24 plaintext. So it does happen, but not frequently. Why Zodiac would diffuse with 63 symbols and then make one of them a high count 1:1 substitute like the + symbol I don't know. But even if he did, the chances of having only one land on a prime position seems statistically significant. And since he did diffuse with 63 symbols, it seems even more statistically significant.I can change the variables, and you might find it interesting. Will show more thorough statistics soon.
EDIT: So I may be comparing apples to oranges instead of apples to apples. I don't know, but this may give some more perspective.
I made a table and changed the variables.
Blue box upper left: There are 705 occurrences in Jarlve's 100 message plaintext library where there are 21 or more of the same plaintext in the same message. Of those, there are 2 where those plaintext land on 0 prime positions.
Blue box lower left: Same 705 occurrences with 21 or more of the same plaintext in the same message. Of those, there are 162 where the plaintext land on 3 or fewer prime positions. So you can see, if I change the variable for prime positions from 0 to 3, the count of occurrences changes dramatically from 2 to 162.
Red box: Again, in Jarlve's library, there are 542 occurrences where the same plaintext count is 24 or more in the same message. And of those, there are 3 where there are 1 or fewer of those plaintext that land on prime positions.
plaintext.100.primephobia.2.png
3% of random shuffles of the 340 results shows the + landing on only one prime position. And 3% of Jarlve's messages show similar statistics. I am not an expert in interpreting statistics. But because Zodiac used 63 symbols to diffuse and there is only one symbol with a count of 24, this may be statistically important. I am just trying to think of a cipher model that would explain prime phobia, the period 19 bigram repeats, three symbols cycling together on exclusively odd positions, and the cycle statistics in general.
One final thought for the night. I made the 340 into a message that is 18 columns by 19 rows, which is conducive to a period 19 transposition scheme. The prime positions line up with each other in columns.
plaintext.100.primephobia.3.png
The 19 is the + symbol. Can anyone think of a simple relationship between transposition and prime phobia?