Useful English Language Statistics

Useful English Language Statistics


Order and Frequency of Single Letters
  E 12.31%     L 4.03%    B 1.62%
  T  9.59      D 3.65     G 1.61
  A  8.05      C 3.20     V 0.93
  O  7.94      U 3.10     K 0.52
  N  7.19      P 2.29     Q 0.20
  I  7.18      F 2.28     X 0.20
  S  6.59      M 2.25     J 0.10
  R  6.03      W 2.03     Z 0.09
  H  5.14      Y 1.88      

Letter Groups Percentages

  A E I O U           38.58%
  L N R S T           33.43%
  J K Q X Z            1.11%
  E T A O N           45.08%
  E T A O N I S R H   70.02%  

Order and Frequency of Leading DIGRAMS

  TH  3.15%  TO  1.11%  SA  0.75%  MA  0.56%
  HE  2.51   NT  1.10   HI  0.72   TA  0.56
  AN  1.72   ED  1.07   LE  0.72   CE  0.55
  IN  1.69   IS  1.06   SO  0.71   IC  0.55
  ER  1.54   AR  1.01   AS  0.67   LL  0.55
  RE  1.48   OU  0.96   NO  0.65   NA  0.54
  ES  1.45   TE  0.94   NE  0.64   RO  0.54
  ON  1.45   OF  0.94   EC  0.64   OT  0.53
  EA  1.31   IT  0.88   IO  0.63   TT  0.53
  TI  1.28   HA  0.84   RT  0.63   VE  0.53
  AT  1.24   SE  0.84   CO  0.59   NS  0.51
  ST  1.21   ET  0.80   BE  0.58   UR  0.49
  EN  1.20   AL  0.77   DI  0.57   ME  0.48
  ND  1.18   RI  0.77   LI  0.57   WH  0.48
  OR  1.13   NG  0.75   RA  0.57   LY  0.47 

Order of Leading TRIGRAMS

THE AND THA ENT ION TIO FOR NDE HAS NCE EDT TIS OFT STH MEN

Chart of CONTACT Percentages

[Chart showing Letter Contact %'s]

Chart of DIGRAM Frequencies

[Table showing Digram Frequencies]


Source: H.F. Gaines, Cryptanalysis; a study of ciphers and their solution, Dover, New York, 1956.
Return to index (non-frame version)

wcherowi@carbon.cudenver.edu