/Crypto

Practicing Encryption/Decryption Algorithms

Primary LanguagePython

Crypto

Practicing Encryption/Decryption Algorithms

crypto.py

python crypto.py [ISENCRYPT] [ENCRYPTION] [KEY] '[INPUT]'

  • Where [ISENCRYPT] sets the program to either encrypt or decrypt the given input, this argument can either be 'encrypt' or 'decrypt'
  • Where [ENCRYPTION] is an integer ID corresponding to the encryption algorithm to be used (shown in table below)
  • Where [KEY] is a key in the format shown in the table below
  • The output will be the hexadecimal ciphertext or ASCII plaintext, except for ceasar cipher which always outputs ASCII

key.py

python key.py [ENCRYPTION]

  • Where [ENCRYPTION] is the ID of the algorithm to use or it's string name
  • [ENCRYPTION] can be followed by -[KEYLEN] for algorithms with multiple key lengths
  • [KEYLEN] is an integer, referring to number of bits in the key
  • For 3DES, [KEYLEN] can be number of keys or number of bits in the combined keys

benchmark.py

python benchmark.py [ENCRYPTION] [NTRIES] [KEYLEN]

  • Where [ENCRYPTION] is an integer ID corresponding to the encryption algorithm to be used (shown in table below)
  • Where [NTRIES] is an optional integer referring to the number of decryptions to complete, default 1000
  • Where [KEYLEN is number of bits to use, for algorithms with multiple possible key lengths, in bits, defaults to shortest option

This repository will consist of a self project in which i will attempt to:

    • Create a program that can encrypt sentences with basic algorithms
    • Create a program that can decrypt sentences with basic algorithms
    • Improve the encryption program to more advanced algorithms
    • Improve the decryption program to more advanced algorithms
    • Develop a key generation algorithm implementation
    • Develop a key-sharing algorithm implementation?

Encryption Algorithm List

Algorithm Symmetry Completed ID Key Format Key Length (bits : hex chars)
Caesar Symmetric
1 Decimal INT_MAX
DES Symmetric
2 Hexadecimal 64 : 16
3DES Symmetric
3 Hexadecimal 128 : 32 or 192 : 48
AES Symmetric
4 Hexadecimal 128 : 32 or 192 : 48 or 256 : 64
RSA Asymmetric
5 Filepath to Hexadecimal Text File 1024 : 256 or 2048 : 512
ECC Asymmetric
6 N/A N/A

Estimation of Time to Brute-Force

In the case of multiple key sizes, [CIPHER]-[KEYVARIATION] has been used. The final three columns show the estimated time taken to test the given percentage of all possible keys; e.g. For DES there are 288230376151711744 keys to test, so in the column labelled '50%', there will be the estimated time taken to test 144115188075855872 keys. In the column labelled 'Av Time To Decrypt' the average time to decrypt is calculated using the average of 1000 decryptions of a sample block of text, which is kept constant across the ciphers, to give an average of the time required to brute force.

Algorithm No. Of Keys Av Time To Decrypt (seconds) 50% keys tested (years) 90% keys tested (years) 100% keys tested (years)
Caesar 25 0.15515875816345215 0.0000005 0.0000009 0.000001
DES 256 24.41735816001892 27,876,899,412 50,178,418,942 55,753,798,824
3DES-2KEY 2112 74.09152507781982 6.095 * 1027 1.097 * 1028 1.219 * 1028
3DES-3KEY 2168 72.97424387931824 4.326 * 1044 7.787 * 1044 8.652 * 1044
AES-128 2128 52.747599840164185 2.844 * 1032 5.119 * 1032 5.688 * 1032
AES-192 2192 62.59677004814148 6.226 * 1051 1.121 * 1052 1.245 * 1052
AES-256 2256 74.68215489387512 1.370 * 1071 2.466 * 1071 2.740 * 1071
RSA-1024 3.778 * 10151 N/A N/A N/A N/A
RSA-2048 N/A N/A N/A N/A N/A
ECC N/A N/A N/A N/A N/A

Obviously using a laptop CPU to solve an embarassingly parallelisable problem such as decryption is by no means the most efficient method to crack any of these ciphers, and the table shows how long a brute force attack might take on this device. However this does pose a starting point for how long a sophisticated attack might take on such a cipher. Following a normal distribution, the probabillity of the key being found by the time that 50% of keys have been tried is quite high, and multiple cores testing keys simultaneously would divide the problem massively and can make breaking these codes much quicker, but brute force attacks are still by no means feasible in most cases. For example, the 'record' for cracking DES currently sits at 23 hours, using far more sophisticated attacks than brute forcing every key possible, exploiting weaknesses in the encryption algorithm itself. However thease measurements of the time taken to decrypt give an indication to compare algorithms with each other.

It should slo be considered that this does not take into account testing output strings. If a program decrypted ciphertext using every single key the user would have to read all of the output strings until they find one that is valid. The program could then make this process shorter by testing keys for grammar, punctuation patterns, ascii readability etc.. While this would extend the length of time per encryption, but once a string was found that had >95% chance of being a valid input string, the program could exit, meaning not all of the keys would need to be tested. In addition to this, ordering keys by likelihood, in case the keys were human generated would save time, for example, testing keys like '1234567890ABCDEF' first would improve efficiency.

AES encryption is yet to be repeatably broken. While there are exploits to reduce the time taken to test possible keys, or to minimie the key space, the length of time required to test a reasonable proportion of the key space is still far too large for this to be possible. This is reflected in the above table where AES-256 requires over 2.4*10^68 years to solve which would still be a ridiculously large number, even with unlimited funds to complete.

RSA encryption uses, in theory any length of key. For this project i will benchmark using only 1024 and 2048 bit key lengths. These keys are generated by multiplying primes of half that length. In order to establish the number of possible keys (which will be an approximation), I have used the approximation of funciton π(n) to calculate the number of primes possible for each key length. This funciton can be approximated by n/ln(n-1) with n being half of the length of the key. For a key length of 1024 this results in (2^512)/ln((2^512)-1)=3.778004e+151, and for a key length 2048 the result is (2^1024)/ln((2^1024)-1)=abignumber.

Milestone Log

2021/08/11

  • Skeleton code written to take user input, clean it and call algorithm depending on user request
  • Currently, user input accepted for plaintext is limited to alphabet and basic punctuation incl. \n ( ) , . ? ! ;
  • User input is cleaned as follows:
    • All characters capitalised
    • All punctuation removed
    • All spaces removed
  • Need to update to allow punctuation and numbers to be included in plaintext before the encryption algorithms are written

2021/08/12

  • Identical/mirror version of skeleton code created for the decryption algorithms (necessary to test the encryption)
  • User input is now allowed for all characters with char values 32-126 incl.
  • User input now no longer needs to be thoroughly cleaned as the character codes are used
  • Both programs can be run with a single line using command line args as follows:
    • python encrypt.py [ENCRYPTION] [KEY] '[PLAINTEXT]'
    • Where:
      • [ENCRYPTION] and [KEY] are both integer values with encryption corresponding to the dictionary key-value pairs of the algorithms
      • [PLAINTEXT] is written surrounded in single quotes so that the terminal ignores special characters
  • Todo:
    • Find out how to make sure the plaintext input is surrounded by single quotes, and request the user re-enters their command if so
    • Write an encryption and decryption algorithm to implement DES encryption

2021/08/13

2021/08/14

  • 3DES implemented with 2 and 3 key variations
  • DES encryption correction made as key variable name caused mismatched usage
    • Was not a problem for DES encryption as the key variable was constant throughout program
    • 3DES implementation calls the DES Encryption algorithm multiple times with different keys so made the error apparent
  • Todo:

2021/08/22

  • Simple pseudo-random key generation implemented
  • Should be useful for more in depth testing of the encryption algorithms
  • Usage is as follows:
    • python key.py [KEYLEN] [KEYBASE]
    • Where:
      • Both [KEYLEN] and [KEYBASE] are integer values
      • [KEYLEN] is in bits
      • [KEYLEN] is a multiple of 8
      • [KEYBASE] is one of 2, 8, 10 or 16
      • The [KEYBASE] argument is optional, will default to 16
  • The secrets.py module is used as opposed to the random.py module
  • The secrets library function used is secrets.token_hex([nbytes=None]) as this generates a random text string in hexadecimal format

2021/08/30

  • Skeleton code for functions of AES implementation added
  • Main AES loop calling encryption functions implemented
  • Included a timer library to time completion of encryption/decryption functions

2021/08/31

  • AES encryption and decryption implemented
  • Key length can be of 128, 192 or 256 bits
  • Uses mix_columns and inv_mix_columns algorithms from https://github.com/boppreh/aes/blob/master/aes.py for more concise code
  • Using Time Taken addition, it is possible to calculate how long it might take to brute force the key using this machine, added to README.md
  • Todo:
    • Develop asymmetric key cipher algorithms, starting with either RSA or ECC
    • Implement asymmetric key generation into the key.py program
    • Develop a key-sharing algorithm??

2021/09/02

  • Redesigned the program to separate code corresponding to different algorithms
  • The AES and DES algorithms now exist in separate libraries aes.py and des.py to the main executable, crypto.py
  • Caesar Cipher, as a single short function still exists within crypto.py
  • crypto.py is now able to encrypt and decrypt without having to call different files
  • Changes completed in a different branch, needs to be merged to main
  • Todo:
    • Further algorithms that are implemented will be done as separate libraries too
    • RSA will be next to implement
    • Need to devise how key.py can output asymmetric key-pairs

2021/09/06

  • key.py program edited to provide a framework for the asymmetric key generation program
    • Changed input format to allow multiple key lengths to be generated for different algorithms
    • Single input argument is in the form [ENCRYPTION]-[KEYLEN] where the hyphen and [KEYLEN] argument are optional, default is shortest possible key length for that algorithm
    • RSA keys will likely be stored in individual files, then when calling the algorithm the key files will be read instead of copy pasting the long key strings
  • RSA library rsa.py created with empty method as skeleton
  • Todo:
    • RSA key generation implementation
    • RSA encryption to be implemented

2021/09/15

  • rsa.py library implemented
    • Original implementation had key generation in key.py, but due to the relatively shorter encryption/decryption imlpementation, asymmetric key generation is in rsa.py
    • Original implementation with C = (M^k) mod N was unsuccessful for sufficiently large keys
      • Resultant implementation consists of the python pow(a, b, n) function which uses much faster modular exponentiation to complete the calculation quicker
  • benchmark.py now creates asymmetric key variables before the main execution loop, where symmetric keys have equal values for these variables
  • Todo: