My work

Re: My work

Postby Jarlve » Wed Sep 09, 2015 2:00 am

Hey Quicktrader,

1) If set to exclude selected squares will become red and those will not be submitted to the solver. Can be used to exclude rows, columns if you click on the row or column number. You can also select normally (green selection) and press the delete-key and then the r-key. This will delete a selection and also remove the empty place it left. To clear any selection use the c-key.

2a) Mode 1: character. Just selected one character at a time. Mode 2: symbol. Select all characters with the same symbol (useful if you want to select all "+" symbols at once).

2b) FPT stands for free path tool. It allows you to (un)transposition a cipher, and the result is automatically fed into the solver as you go. It's a bit complicated and unwieldy. I made a small guide in: viewtopic.php?f=81&t=2432 (my first post).

3) You can use other n-gram files but they need to be in the same format. So no dictionaries.

I don't fully understand the system/method you are describing but good luck with it. Well it seems you want to exploit the possible 1:1 substitute "+" symbol in the cipher. That's not a bad idea. You can lock the symbol to the letter "l" and work your way from there. About character n-grams, a 5-gram could consist of up to three 3-grams.

Therefore a function in the examine tool would be cool to load a variety of quintgrams in line 13 (starting from row 9). I've chosen all quintgrams with a frequency of at least 2, one may also choose a bigger number (approximately 2,000 with a frequency of 1, even more if choosing a frequency expectation of e.g. 0.5). Started with the top list due to lack of computer ressources.

How does this functionality need to behave exactly? Can't promise I will add it.

The solver assigns letters to symbols and then uses 5-grams to score the resulting text, if it's an improvement it keeps the change and otherwise it discards it.
User avatar
Jarlve
 
Posts: 2544
Joined: Sun Sep 07, 2014 9:51 am
Location: Belgium

Re: My work

Postby Quicktrader » Thu Sep 10, 2015 8:56 am

Jarlve wrote:Hey Quicktrader,

1) If set to exclude selected squares will become red and those will not be submitted to the solver. Can be used to exclude rows, columns if you click on the row or column number. You can also select normally (green selection) and press the delete-key and then the r-key. This will delete a selection and also remove the empty place it left. To clear any selection use the c-key.

HI...DID NOT MEAN TO EXCLUDE SYMBOLS BUT ALPHABETICAL LETTERS, E.G. FOR THE + SYMBOL NOT TO USE LETTERS SUCH AS Q,Z,Y,..THEREFORE INCLUDING/EXCLUDING CERTAIN LETTERS FOR EACH SYMBOL, IF REQUIRED (AS IT REDUCES THE AMOUNT OF VARIETIES, MAY THUS INCREASE THE CHANCE TO RECEIVE A SOLUTION).

2a) Mode 1: character. Just selected one character at a time. Mode 2: symbol. Select all characters with the same symbol (useful if you want to select all "+" symbols at once).

NICE

2b) FPT stands for free path tool. It allows you to (un)transposition a cipher, and the result is automatically fed into the solver as you go. It's a bit complicated and unwieldy. I made a small guide in: viewtopic.php?f=81&t=2432 (my first post).

3) You can use other n-gram files but they need to be in the same format. So no dictionaries.

FORMAT SUCH AS? WITH NUMBERS ON IT OR SIMPLY A .TXT FILE?

I don't fully understand the system/method you are describing but good luck with it. Well it seems you want to exploit the possible 1:1 substitute "+" symbol in the cipher. That's not a bad idea. You can lock the symbol to the letter "l" and work your way from there.

THAT'S EXACTLY WHAT I AM GOING FOR.

About character n-grams, a 5-gram could consist of up to three 3-grams.

IT INDEED DOES, BUT ONLY TWO OF THEM 'TOUCH' EACH OTHER IN ONE 5-GRAM AND REPEAT TWICE IN THE CIPHER (EACH). THIS IS WHY I´D LIKE TO PLACE A LIST OF THE MOST FREQUENT E.G. 200 5-GRAMS INTO THIS SPECIFIC AREA OF THE CIPHER.

ALSO SAW THAT THE 5-GRAM LIST IS A COMPLETE ONE, AAAAA TO ZZZZZ, IT MIGHT BE USEFUL TO MODIFY IT TO E.G. THE 10,000 MOST FREQUENT 5-GRAMS.

Therefore a function in the examine tool would be cool to load a variety of quintgrams in line 13 (starting from row 9). I've chosen all quintgrams with a frequency of at least 2, one may also choose a bigger number (approximately 2,000 with a frequency of 1, even more if choosing a frequency expectation of e.g. 0.5). Started with the top list due to lack of computer ressources.

How does this functionality need to behave exactly? Can't promise I will add it.

The solver assigns letters to symbols and then uses 5-grams to score the resulting text, if it's an improvement it keeps the change and otherwise it discards it.


THANKS

QT
*ZODIACHRONOLOGY*
User avatar
Quicktrader
 
Posts: 2384
Joined: Mon Apr 01, 2013 11:23 am
Location: Vienna, Austria (Europe)

Re: My work

Postby Jarlve » Sat Sep 12, 2015 3:09 am

Quicktrader wrote:HI...DID NOT MEAN TO EXCLUDE SYMBOLS BUT ALPHABETICAL LETTERS, E.G. FOR THE + SYMBOL NOT TO USE LETTERS SUCH AS Q,Z,Y,..THEREFORE INCLUDING/EXCLUDING CERTAIN LETTERS FOR EACH SYMBOL, IF REQUIRED (AS IT REDUCES THE AMOUNT OF VARIETIES, MAY THUS INCREASE THE CHANCE TO RECEIVE A SOLUTION).

I have experimented with such things (like assigning letters randomly biased by their frequency) but it did more bad than good. The hill climber needs freedom to move things around, things need to flow, constrictions interrupt the flow. The exploration aspect.

Quicktrader wrote:FORMAT SUCH AS? WITH NUMBERS ON IT OR SIMPLY A .TXT FILE?

Exactly the same format as the n-gram files the program comes with. ABCDE frequency.

Quicktrader wrote:ALSO SAW THAT THE 5-GRAM LIST IS A COMPLETE ONE, AAAAA TO ZZZZZ, IT MIGHT BE USEFUL TO MODIFY IT TO E.G. THE 10,000 MOST FREQUENT 5-GRAMS.

It's not complete but the quality is rather high, source: http://practicalcryptography.com/
It is not a good idea to use a limited n-gram list because this will make the search space less smooth, introduce more gaps/etc. My program uses an algorithm close to simulated annealing and it's known to work best when the search space is smooth.

You want to place a custom list of 5-grams in a specific area of the cipher. But this would also fill in all the associated symbols right?

I think you're looking for a non-optimal solution in the 340, which could be the reason why our solvers don't get it. So any solver will basicly work against you unless it's engineered from to ground up to your specifications.
User avatar
Jarlve
 
Posts: 2544
Joined: Sun Sep 07, 2014 9:51 am
Location: Belgium

Re: My work

Postby Paul_Averly » Tue Sep 15, 2015 11:21 pm

What were the (or were there) test results for Playfair?

There was a link for "Playfair encipherment prior to homophonic substitution"
But it no longer works.

http://zodiackillerciphers.com/wiki/ind ... ter_cipher
User avatar
Paul_Averly
 
Posts: 839
Joined: Sat Apr 06, 2013 12:10 am

Re: My work

Postby glurk » Wed Sep 16, 2015 2:55 am

Paul_Averly wrote:What were the (or were there) test results for Playfair?

There was a link for "Playfair encipherment prior to homophonic substitution"
But it no longer works.

http://zodiackillerciphers.com/wiki/ind ... ter_cipher

I'm quite certain that you mean my post at http://www.zodiackillerfacts.com/forum/ ... 902#p20902
Which does no longer exist. Unfortunately, I did not save this work, and I quite honestly don't recall exactly what I did. However. I am certain that the 340 is NOT a Playfair cipher, homophonically enciphered.

I wish that every single post of mine at zodiackillerfacts.com had not been deleted, but they have been. Far out of my control.

Not really a big deal, since that entire board is now as dead as buried bones. If it vanished, no one would even notice.

-glurk
--------------------------------
I don't believe in monsters.
User avatar
glurk
 
Posts: 720
Joined: Mon Apr 01, 2013 6:35 am
Location: Location, Location.

Re: My work

Postby Quicktrader » Wed Sep 16, 2015 4:51 am

Jarlve, I do appreciate your answers as they really get it to the point. Please, for a minute, let's go a little bit deeper into the issue of how your examine tool can help to solve the cipher. Please do also read it carefully and do not hesitate to ask, if any questions may occur:

1.) The complexity of the 340 is higher than that of the 408. The cleartext-to-homophone ratio is - unqually distributed - 5.397 compared to 7.418, making the 340 much harder to crack.
2.) A similar complexity ratio of the 340 would require to 'guess' 18 symbols correctly, thus leading to 340/45 = 7.555
3.) The 340 has 63^26 variations, thus 60,653,000,000,000,000,000,000,000,000,000,000,000,000,000,000 or 6.06E+46. The examine tool can handle about 100,000,000 variations in one run.
4.) Donald Gene and Bettye June Harden had cracked the 408 by applying certain cipher structures ('kill', 'I like killing')
5.) Such cipher structures do exist in the 340, too. Double letters, two trigrams appearing twice each (although additional ones might be hidden behind different homophones), etc.
6.) Based on the number of homophones Z had used (according to a separate (!) frequency table), Bernoulli formula shows that the odds for + being an 'L' is 71.1% (with 'S' being the second highest with a value of 2.3%). Other letters either are expected to have more than 3 homophones (e.g. 'S', leading to the low value) or are not expected to appear that often as a double letter (e.g. 'VV')

Therefore I do believe that due to the amount of variations (6 Billions^5 or 6.06E+46), such existing cipher structures should be considered when using the examine tool. This step may enable the examine tool to crack the cipher, which is currently not yet the case. Such considerations of cipher structures may be e.g.:

- considering e.g. + being an 'L' due to Bernoulli (or trying + to be an L, S, R,...)
- considering frequent trigrams, e.g. expectations >1 for the two at least twice reappearing trigrams (> expected frequency of trigrams)
- considering related structures, e.g. a vowel previous to '++' or two frequent trigrams in combination as being a 5-gram (line 13, row 9)

otherwise the Examine tool possibly would have to run for a very long time..

In addition to that: Hillclimb method is the correct one, however regularly locks in to a local optima (value of ~13,000), while we are actually trying to force it to the global optima (the correct or nearly-correct solution). To force hill climb / examine tool doing so, it imo is necessary to configure additional assumptions to reduce the cipher's complexity. This can be done, preferrably according to the cipher structure, by various methods such as but not limited to:

a.) Assuming + being an 'L' (or R, S,..)
b.) Assuming the first symbol of the cipher to be a vowel (or a consonant)
c.) Considering the two frequent trigrams to be statistically 'frequent' ones (therefore not being e.g. 'ZZG' or 'GQM' but rather 'THE' or 'ERE')
d.) Considering the 5-gram in line 13, row 9, being a combination of two of such frequent trigrams ('THERE' rather than 'ZZGQM')
e.) Considering a frequent symbol (e.g. the reversed P symbol) to not be a non-frequent letter (e.g. X, Q, Y would most likely not appear with a frequency of 5% in the cipher)
f.) Considering the bigram appearing three times in the cipher to be a frequent bigram, too.

Therefore the examine tool ideally would be adaptable so that certain symbols (or symbol combinations, e.g. trigrams) can exclude or include certain cleartext letters (e.g. exclude X, Y, Q for the reverse P symbol; using a list of frequent cleartext trigrams for the repeating and therefore frequent, too, cipher trigrams).

I would like to give an example of how I'd love to apply the Examine tool:

1. Setting the + symbol (e.g. 'L')
2. Setting the trigrams to a list of frequent trigrams (e.g. all trigrams that would appear 1.5 in a 340 cipher [44 per 10,000])
3. EASIER: Setting the 5-gram (line 13, row 9) to a list of (possibly frequent) 5-grams consisting of two frequent trigrams ('THERE', 'THANT', 'WHICH',..)
4. Setting the bigram in line one to a list of frequent bigrams ('EN', 'TH',..)
5. Excluding certain letters for certain symbols (e.g. X, Y, Q for the reverse P symbol)
6. RUN the examine tool and let it then check out some 100,000,000 variations..

It is obvious that especially step no. 3 is dramatically reducing the amount of variations. My expectation is that, considering this approach, the examine tool should become able to crack the 340. As I said, however, I do believe that it is a necessary step to solve it (if not accidentially).

Regarding your point that such modification would 'interrupt' the process:
The process is indeed influenced in a negative way: The solving process is now definitely slower than before (is it really?). The program can't just use any 5-gram to be placed in, but has to use one that e.g. has one L on the third position and one T on the fifth position. But this exactly is the solving process we actually need. The other two methods, trying all variations or solving the cipher like a newspaper puzzle, both don't work out. The first one because of the huge amount of variations (too high..), the second one because there is a lack of cipher structures such as more 4-grams or trigrams.

By using millions of variations under the pre-condition of existing cipher structures in combination with certain frequencies, the example tool should become successful.

Criticism:

Q: What if the reappearing trigrams or the 5-gram are not among the e.g. 2,000 most frequent ones?
A: Not very likely but tThere is the possibility to try the second most frequent 2,000 trigrams/5-grams, too. It will end up that the 5-gram is not ZZDZQ.

Q: What if the + is not an 'L'? Or the reverse P symbol in fact is a Y?
A: Exclusions and inclusions could be done by anyone as he/she prefers. E.g. trying a certain group of letters to be excluded for the reverse P, then a second group of letters.

Q: What if the homophones are not distributed according a certain frequency table?
A: Nevertheless the preconditions helps to reduce the amount of variations to be checked through.

Q: What if the cipher then still cannot be solved?
A: Do not think so. But additional steps are possible, e.g. focussing on other cipher structures. Or it may even be not solvable, but I clearly doubt it.

My estimation is that the method described above leads to a reduction of the present 6 billion^5 or 6.06E+46 variations to approximately

63
minus 5 (5-gram list)
minus 2 (bigram list)
minus 1 (+ symbol)
minus 1 (e.g. vowel before plus symbol)
minus 1 (exclusion of revers P letters)

leading to an overall of 53^26 variations (still to be multiplied with the number of 5-grams, bigrams etc. used - which can actually be one by trying one run after another). This still is a number, but the complexity ratio has then increased to 6.415 (which is good, meaning a reuced complexity) instead of 5.397. It is therefore at least closer to a value of 7.418 (408 cipher), latter one has been proven to be solvable by methods of computation in the past.

In addition to that, the most frequent variations are covered instead of rare ones, too. This one I like most. A comparison of the amount of variations before and after the modification shows the advantage:

340 (unsolved):
63^26=
6.06E+46 variations

408 (solved):
55^26=
1.78E+45 variations

340 with modification:
53^26=
6.78E+44 variations

So the modifications in fact are leading to a 340 that has even less variations to be hill-climbed than the - already solvable - 408.

Again, I most appreciate your work and guess you therefore will be the first person to become able to read the cleartext. Would love if you share this idea (and the solution).

QT
Last edited by Quicktrader on Wed Sep 16, 2015 5:21 am, edited 1 time in total.
*ZODIACHRONOLOGY*
User avatar
Quicktrader
 
Posts: 2384
Joined: Mon Apr 01, 2013 11:23 am
Location: Vienna, Austria (Europe)

Re: My work

Postby glurk » Wed Sep 16, 2015 5:17 am

QT-

While I would NEVER say that ZKDecrypto is a panacea, or a great solver, doesn't it do what you ask? It seems, in a way, that we are asking programmers to do the same thing over and over again.

-glurk
--------------------------------
I don't believe in monsters.
User avatar
glurk
 
Posts: 720
Joined: Mon Apr 01, 2013 6:35 am
Location: Location, Location.

Re: My work

Postby Quicktrader » Wed Sep 16, 2015 6:04 am

glurk wrote:QT-

While I would NEVER say that ZKDecrypto is a panacea, or a great solver, doesn't it do what you ask? It seems, in a way, that we are asking programmers to do the same thing over and over again.

-glurk


Think it actually is a great solving tool, but a bit slow.

The main point is that it is hard to try one 5-gram after another in combination with let's say 40 frequent bigrams one after another. Although this would indeed work. With only 200 5-grams, however, it would mean to 4,000 times select seven symbols. Then run it with the slow program, not being sure that it finds the correct solution on the first run. I actually tried some, but it ends up in quite a lot of work.

But in generally it does, except that you can't lay a complete list of n-grams behind these specific cipher fractions.

Below you find a pic on which cipher the examine tool would run if only three steps were taken as a pre-condition: The leading + symbol, the 5-gram (consisting of frequent trigrams) and the bigrams (being frequent, too). It should be obvious that based on this cipher structure running through 100,000,000 variations makes way more sense:

340structure_6.jpg


If that doesn't do it, by the way, there still is the possibility to reduce the 5-grams loaded (NOT [updated] IoFBc but the ones doing the hill climb work, the corpora). Just to let the solver be faster and using a more frequent corpora, even if only parts of the cipher might be solved.

Update:
Bigrams expected to appear at least 1.5 times in 340-2=338 trigrams:
th 1.52
he 1.28
in 0.94
er 0.94
an 0.82
re 0.68
nd 0.63
at 0.59
on 0.57
nt 0.56
ha 0.56
es 0.56
st 0.55
en 0.55
ed 0.53
to 0.52
it 0.50
ou 0.50
ea 0.47
hi 0.46
is 0.46

(Corrected) list of 5-grams consisting of two trigrams (IoFBc) with both trigrams appearing at least >1 in the 340 (updated: out of the 263,342 most frequent 5-grams):

ANDTH
THERE
THING
DTHAT
ETHAT
ENTHE
ETHER
HATHE
DTHER
NTHAT
NTHER
ERETH
THETH
NTHIS
HISHE
DTHIS
ETHIS
HATHA
THAND
SHERE
THENT
ERENT
ENTHA
HATHI
ENTHI
SHETH
ERERE
SHENT

Combining these with the + symbol leads to the structure in the picture above. Using more bigrams and 5-grams lead to a bigger sample, although statistically this would be an outlier.

Currently I do combine all those above with three additional symbols,
sym.jpg
and
sym1.jpg
to try to cross-check it with text fractions in line 17/18 of the cipher..works quite well, but still a variety of potential solutions and have not found the final one yet. LETINSTALLTIDBIT or LETINSTALLTINYIT is such potential text fraction, so far (based on the n-gram 'SHENT').

QT
You do not have the required permissions to view the files attached to this post.
*ZODIACHRONOLOGY*
User avatar
Quicktrader
 
Posts: 2384
Joined: Mon Apr 01, 2013 11:23 am
Location: Vienna, Austria (Europe)

Re: My work

Postby Jarlve » Wed Sep 16, 2015 11:58 am

@Quicktrader, I'm not sure the search space argument holds much weight in relation to the 340. AZdecrypt has solved a 618 character cipher with 221 symbols (try 221^26) and longer ciphers with even more symbols for that matter.

Quicktrader wrote:I would like to give an example of how I'd love to apply the Examine tool:

1. Setting the + symbol (e.g. 'L')
2. Setting the trigrams to a list of frequent trigrams (e.g. all trigrams that would appear 1.5 in a 340 cipher [44 per 10,000])
3. EASIER: Setting the 5-gram (line 13, row 9) to a list of (possibly frequent) 5-grams consisting of two frequent trigrams ('THERE', 'THANT', 'WHICH',..)
4. Setting the bigram in line one to a list of frequent bigrams ('EN', 'TH',..)
5. Excluding certain letters for certain symbols (e.g. X, Y, Q for the reverse P symbol)
6. RUN the examine tool and let it then check out some 100,000,000 variations..

1) This functionality is already available (locking symbols to letters).

2,3,4) The way I'd like to add the n-gram functionality is that you select a symbol and then press a key which prompts you which n-gram size you like to use. If you for instance choose 4-gram the solver will then lock the symbol and the other 3 symbols that follow it to use a secondary 4-gram list. So that it will not change any of the involved symbols individually, but rather as a whole, picking a random 4-gram from the secondary list.

5) The way I'd like to add the letter excluding functionality is that you select a symbol and then press a key which prompts you to enter a string of letters the symbol can use. For instance you enter "aeiou" or "etaoinshrdlu" and if you leave it empty it'll revert to "abcde...".

Does that seem okay for you?
User avatar
Jarlve
 
Posts: 2544
Joined: Sun Sep 07, 2014 9:51 am
Location: Belgium

Re: My work

Postby Quicktrader » Thu Sep 17, 2015 5:47 am

Jarlve wrote:@Quicktrader, I'm not sure the search space argument holds much weight in relation to the 340. AZdecrypt has solved a 618 character cipher with 221 symbols (try 221^26) and longer ciphers with even more symbols for that matter.

Quicktrader wrote:I would like to give an example of how I'd love to apply the Examine tool:

1. Setting the + symbol (e.g. 'L')
2. Setting the trigrams to a list of frequent trigrams (e.g. all trigrams that would appear 1.5 in a 340 cipher [44 per 10,000])
3. EASIER: Setting the 5-gram (line 13, row 9) to a list of (possibly frequent) 5-grams consisting of two frequent trigrams ('THERE', 'THANT', 'WHICH',..)
4. Setting the bigram in line one to a list of frequent bigrams ('EN', 'TH',..)
5. Excluding certain letters for certain symbols (e.g. X, Y, Q for the reverse P symbol)
6. RUN the examine tool and let it then check out some 100,000,000 variations..

1) This functionality is already available (locking symbols to letters).

2,3,4) The way I'd like to add the n-gram functionality is that you select a symbol and then press a key which prompts you which n-gram size you like to use. If you for instance choose 4-gram the solver will then lock the symbol and the other 3 symbols that follow it to use a secondary 4-gram list. So that it will not change any of the involved symbols individually, but rather as a whole, picking a random 4-gram from the secondary list.

5) The way I'd like to add the letter excluding functionality is that you select a symbol and then press a key which prompts you to enter a string of letters the symbol can use. For instance you enter "aeiou" or "etaoinshrdlu" and if you leave it empty it'll revert to "abcde...".

Does that seem okay for you?


Ok? This is perfect :D.

Please rethink the 'random' selection of n-grams. This is crucial because it might be more useful to back it with a specific (e.g.) 5-gram list. The one I use is a combination of

a.) trigrams sorted according to a specific frequency value and
b.) the 5-grams sorted according to their own frequency, too:

n-grams.jpg


In this file I first set the precondition that both trigrams appear at least e.g. 1.5 times in the cipher. This value may be modified, e.g. to a value of 1.0 only, too (as shown in the picture).

Setting up such precondition leads to a marker ('1') according to which I sort first, thus eliminating all other (statistically outlier) 5-grams consisting of various non-frequent trigrams. After that, however, I do sort the 5-grams, too, according to their own frequency. This list may finally be checked 'one-after-another'. If your tool selects any n-gram randomly, chances are very high (>99.95%) that the tool chooses any out of e.g. 263,320 5-grams. The ones consisting of frequent trigrams (~150 or so} would not really be checked first although exactly those do fulfill the criteria of 'consisting out of two frequent trigrams'.

Thinks for your interest into this issue

QT
You do not have the required permissions to view the files attached to this post.
*ZODIACHRONOLOGY*
User avatar
Quicktrader
 
Posts: 2384
Joined: Mon Apr 01, 2013 11:23 am
Location: Vienna, Austria (Europe)

PreviousNext

Return to Zodiac Cipher Mailings & Discussion

Who is online

Users browsing this forum: BDHOLLAND, Chaucer, Goodkidmaadtoschi, Jarlve, Shawn, tGkTcy2W9B4p60o and 51 guests

cron