Information theory's fundamental contribution to natural language processing and computational linguistics was further established in 1951, in his article "Prediction and Entropy of Printed English", showing upper and lower bounds of entropy on the statistics of English – giving a statistical foundation to language analysis. In addition, he proved that treating whitespace as the 27th letter of the alphabet actually lowers uncertainty in written language, providing a clear quantifiable link between cultural practice and probabilistic cognition.
What were the main results of applying statistical analysis to the English language?
The main results of applying statistical analysis to the English language were establishing upper and lower bounds of entropy for it and that treating whitespace as a 27th letter of the alphabet lowers uncertainty.