The guesses were markedly non-normally distributed.

Galton's publication of Natural Inheritance sparked the interest of a brilliant mathematician, Karl Pearson, [29] then working at University College London, and he went on to found the discipline of mathematical statistics. His work grew to encompass the fields of biology, epidemiology, anthropometry, medicine and social history. With Walter Weldon, founder of biometry, and Galton, he founded the journal Biometrika as the first journal of mathematical statistics and biometry.

Pearson's work, together with Galton's, underpins many of the 'classical' statistical methods in common use today, including the correlation coefficient, defined as a product-moment; [31] the method of moments for fitting distributions to samples; Pearson's system of continuous curves, which forms the basis of the now conventional continuous probability distributions; the chi distance, a precursor and special case of the Mahalanobis distance; [32] and the p-value, defined as the probability measure of the complement of the ball with the hypothesized value as center point and the chi distance as radius. He also founded statistical hypothesis testing theory [32] and introduced Pearson's chi-squared test and principal component analysis.

The second wave of mathematical statistics was pioneered by Ronald Fisher, who wrote two textbooks, Statistical Methods for Research Workers and The Design of Experiments, that were to define the academic discipline in universities around the world. He also systematized previous results, putting them on a firm mathematical footing. His seminal paper The Correlation between Relatives on the Supposition of Mendelian Inheritance was the first to use the statistical term variance. At Rothamsted Experimental Station he started a major study of the extensive collections of data recorded over many years.
This resulted in a series of reports under the general title Studies in Crop Variation. He also published The Genetical Theory of Natural Selection, in which he applied statistics to evolution.

Over the next seven years, he pioneered the principles of the design of experiments (see below) and elaborated his studies of analysis of variance. He furthered his studies of the statistics of small samples. Perhaps even more important, he began his systematic approach of using the analysis of real data as the springboard for the development of new statistical methods. He developed computational algorithms for analyzing data from his balanced experimental designs. This work resulted in the publication of his first book, Statistical Methods for Research Workers, which was followed by The Design of Experiments, also widely used.

In addition to analysis of variance, Fisher named and promoted the method of maximum likelihood estimation. Fisher also originated the concepts of sufficiency, ancillary statistics, Fisher's linear discriminant and Fisher information. His article On a distribution yielding the error functions of several well known statistics presented Pearson's chi-squared test and William Sealy Gosset's t in the same framework as the Gaussian distribution, together with his own parameter in the analysis of variance, Fisher's z-distribution (more commonly used decades later in the form of the F distribution).

Previously, deviations exceeding three times the probable error were considered significant. For a symmetrical distribution the probable error is half the interquartile range. Other important contributions at this time included Charles Spearman's rank correlation coefficient, a useful extension of the Pearson correlation coefficient.
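The probable-error convention mentioned above can be made concrete with a short calculation. The sketch below assumes normally distributed errors (an assumption made here for illustration; the statement about the interquartile range only requires symmetry) and uses `statistics.NormalDist` from the Python standard library. The helper name `probable_error` is hypothetical.

```python
from statistics import NormalDist


def probable_error(sigma: float = 1.0) -> float:
    """Probable error of a normal distribution with standard deviation sigma.

    Half of all deviations from the mean fall within +/- one probable error,
    so it equals the upper quartile of the centred distribution.
    """
    return NormalDist(mu=0.0, sigma=sigma).inv_cdf(0.75)


std_normal = NormalDist()  # assumed error model: standard normal

pe = probable_error()
q1, q3 = std_normal.inv_cdf(0.25), std_normal.inv_cdf(0.75)
half_iqr = (q3 - q1) / 2  # equals the probable error by symmetry

# Two-sided chance of a deviation larger than three probable errors
threshold = 3 * pe
tail_probability = 2 * (1 - std_normal.cdf(threshold))

print(f"probable error     = {pe:.4f} standard deviations")
print(f"half the IQR       = {half_iqr:.4f}")
print(f"3 x probable error = {threshold:.4f} standard deviations")
print(f"two-sided tail     = {tail_probability:.4f}")
```

Under this normality assumption one probable error is about 0.6745 standard deviations, so the three-probable-error rule corresponds to roughly two standard deviations and a two-sided tail probability of a little over 4%.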
William Sealy Gosset, the English statistician better known under his pseudonym of Student, introduced Student's t-distribution, a continuous probability distribution useful in situations where the sample size is small and the population standard deviation is unknown. Jerzy Neyman showed that stratified random sampling was in general a better method of estimation than purposive (quota) sampling.

James Lind carried out the first ever clinical trial, in an effort to find a treatment for scurvy.

Laplace noted that the frequency of an error could be expressed as an exponential function of its magnitude once its sign was disregarded. Lagrange proposed a parabolic distribution of errors. Laplace published his second law of errors, wherein he noted that the frequency of an error was proportional to the exponential of the negative square of its magnitude. This was subsequently rediscovered by Gauss and is now best known as the normal distribution, which is of central importance in statistics. This distribution was first referred to as the normal distribution by Peirce, who was studying measurement errors when an object was dropped onto a wooden base. Lagrange also suggested two other distributions for errors: a raised cosine distribution and a logarithmic distribution. Laplace gave a formula for the law of facility of error (a term due to Joseph Louis Lagrange), but one which led to unmanageable equations. Daniel Bernoulli introduced the principle of the maximum product of the probabilities of a system of concurrent errors.

William Playfair introduced the idea of graphical representation into statistics. He invented the line chart, bar chart and histogram and incorporated them into his works on economics, the Commercial and Political Atlas. This was followed by his invention of the pie chart and circle chart, which he used to display the evolution of England's imports and exports.
These latter charts came to general attention when he published examples in his Statistical Breviary.

Laplace, in an investigation of the motions of Saturn and Jupiter, generalized Mayer's method by using different linear combinations of a single group of equations. Laplace also estimated the population of France from a sample: census data for a group of communities gave both the number of persons and the number of births and, assuming that these samples were representative of France, Laplace produced his estimate for the entire population.

The method of least squares, which was used to minimize errors in data measurement, was published independently by Adrien-Marie Legendre, Robert Adrain, and Carl Friedrich Gauss. Gauss had used the method in his famous prediction of the location of the dwarf planet Ceres. The observations on which Gauss based his calculations were made by the Italian monk Piazzi.

The method of least squares was preceded by the use of a median regression slope. This method minimizes the sum of the absolute deviations. A method of estimating this slope was invented by Roger Joseph Boscovich, who applied it to astronomy.

The term probable error (der wahrscheinliche Fehler), meaning the median deviation from the mean, was introduced by the German astronomer Friedrich Wilhelm Bessel. Other contributors to the theory of errors were Ellis, De Morgan, Glaisher, and Giovanni Schiaparelli. In the 19th century, authors on statistical theory included Laplace, among others.

Gustav Theodor Fechner used the median (Centralwerth) in sociological and psychological phenomena. Francis Galton used the English term median for the first time, having earlier used the terms middle-most value and the medium. The only data sets available to him that he was able to show were normally distributed were birth rates.
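To illustrate the difference between the two fitting criteria described above, least squares and the minimization of absolute deviations, the following sketch fits a straight line to a small made-up data set containing one outlier. The data and the function names are hypothetical, and the brute-force search is a simple illustration rather than Boscovich's own procedure. It relies on the fact that, for a straight-line fit, some line minimizing the sum of absolute deviations passes through at least two of the data points.

```python
from itertools import combinations


def least_squares_line(xs, ys):
    """Slope and intercept minimizing the sum of squared residuals."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def least_absolute_line(xs, ys):
    """Slope and intercept minimizing the sum of absolute residuals.

    Brute force: try the line through every pair of points with distinct x.
    """
    best = None
    for (x1, y1), (x2, y2) in combinations(zip(xs, ys), 2):
        if x1 == x2:
            continue  # a vertical line cannot be written as y = a*x + b
        slope = (y2 - y1) / (x2 - x1)
        intercept = y1 - slope * x1
        cost = sum(abs(y - (slope * x + intercept)) for x, y in zip(xs, ys))
        if best is None or cost < best[0]:
            best = (cost, slope, intercept)
    return best[1], best[2]


# Hypothetical data: y = 2x + 1, except for one gross error in the last point.
xs = [0, 1, 2, 3, 4, 5]
ys = [1, 3, 5, 7, 9, 30]

ls_slope, ls_intercept = least_squares_line(xs, ys)
lad_slope, lad_intercept = least_absolute_line(xs, ys)

print(f"least squares:             y = {ls_slope:.3f} x + {ls_intercept:.3f}")
print(f"least absolute deviations: y = {lad_slope:.3f} x + {lad_intercept:.3f}")
```

On this data the least-squares slope is pulled well above 2 by the single outlier, while the absolute-deviation fit recovers slope 2 and intercept 1, which suggests why the two criteria can disagree noticeably on data with gross errors.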
Development of modern statistics

Although the origins of statistical theory lie in the 18th-century advances in probability, the modern field of statistics only emerged in the late 19th and early 20th century, in three stages. The first wave, at the turn of the century, was led by the work of Francis Galton and Karl Pearson, who transformed statistics into a rigorous mathematical discipline used for analysis, not just in science, but in industry and politics as well.