Preview

Dependability

Advanced search

Estimation of quality of a small sampling biometric data using a more efficient form of the chi-square test

https://doi.org/10.21683/1729-2646-2016-16-2-43-48

Abstract

Aim. The purpose is to increase the power of the Pearson’s chi-square test so that this test will become efficient on small test samplings. . It is necessary to reduce the scope of a test sample from 200 examples to 20 examples while maintaining the probability of errors of the first and the second kind. Selection of 20 examples of biometric images is considered by users to be a comfortable level of effort. The need to select more examples is perceived by users negatively.

Methods. The article offers one more (the second) form of the Pearson test that is much less sensitive to the scope of data in a test sampling. It is shown that a traditional form of the chi-square test is more sensitive to the scope of a test sampling than the Cramer-von Mises test. The offered (second) form of the chi-square test is less sensitive to the scope of a test sampling than a classical form of the chi-square test and less sensitive than the Cramer-von Mises test as well. This effect is achieved by the transition from the space of frequency of occurrence of events and probabilities of a group of similar events occurring in the space of more accurately evaluated junior statistical moments (mean and standard deviation). The fractal dimension of the new synthetic form of chi-square test coincides with the fractal dimension of the classical form of the chi-square test.

Results. The offered second variant of the chisquare test is presumably one of the most powerful of all existing statistical tests. The analytical description of correlation of standard deviations of a classical form of the chi-square test and a new form of the chi-square test is given. The standard deviation of the second form of the chi-square test decreases by half on retention of a statistical expectation on samplings of the same scope. The latter is equivalent to a four-time reduction of the requirements to the scope of a test sampling within the interval from 16 to 20 examples. Power gain as the result of the application of a new test is growing with the growth of a test sampling scope.

Conclusions. When creating a classical chi-square test in 1900, Pearson was guided by limited computing opportunities of the existing computer facilities, and he had to rely on the analytical relations that he found. Today the situation has changed and there are no more restrictions in relation to the engaged computing resources. However we continue to rely on those created with computing resources of 1900 by inertia. Probably, we should try to consider modern opportunities of computer facilities and to build more powerful options of statistical tests. Even if new tests will require a search of large number of possible states (they will have big tables calculated in advance instead of analytical relations), it is not a constraining factor today. When data is insufficient (in biometrics, in medicine, in economy) a computing complexity of statistical tests does not play a special role if the result of estimations is more accurate.

About the Authors

B. B. Akhmetov
International Informatization Academy (IIA), Turkestan, Kazakhstan
Russian Federation

PhD., Member of the International Informatization Academy (IIA), Vice-president of the International Hoca Ahmet Yesevi Turkish-Kazakh University. 29 B.Sattarkhanov Avenue, Bld. of Rectorate, Turkestan, 161200, Kazakhstan, tel.: +7 (72533) 3-35-77



A. I. Ivanov
Laboratory of biometric and neural network technologies, JSC Penza Research Electric and Technical Institute, Penza, Russia
Russian Federation

Dr. Sci., Associate Professor – Head of Laboratory of biometric and neural network technologies, JSC Penza Research Electric and Technical Institute. 9 Sovetskaya Str., Penza, 440000, Russia, tel.: +7 (841-2) 59-33-10



References

1. Ramírez-Ruiz J., Pfeiffer C., Nolazco-Flores J. Cryptographic Keys Generation Using FingerCodes. //Advances in Artificial Intelligence – IBERAMIA-SBIA 2006 (LNCS 4140), p. 178-187, 2006

2. Monrose F., Reiter M., Li Q., Wetzel S. Cryptographic key generation from voice. In Proc. IEEE Symp. on Security and Privacy, 2001

3. Feng Hao, Ross Anderson and John Daugman. Crypto with Biometrics Effectively, IEEE TRANSACTIONS ON COMPUTERS, VOL. 55, NO. 9, SEPTEMBER 2006.

4. Yazov Y.K. and others. Neural network protection of personal biometric data. // Y.K. Yazov (editor and author), co-authors V.I. Volchikhin, A.I. Ivanov, V.A. Funtikov, I.G. Nazarov // М.: Radiotechnics, 2012. 157 p. IBSN 978-5- 88070-044-8.

5. Akhmetov B.S., Ivanov A.I., Funtikov V.A., Bezyaev A.V., Malygina E.A. Technology of use of large neural networks to transform indistinct biometric data into the access key code. Monograph, Kazakhstan, Almaty, LLP Publishing House LEM, 2014. -144 p.

6. Akhmetov B.S., Volchikhin V.I, Ivanov A.I., Malygin A.Y. Algorithms of testing of biometric and neural network mechanisms of information security, Kazakhstan, Almaty,

7. KazNTU after K.I.Satpayev, 2013.- 152 p. ISBN 978-101- 228-586-4.

8. Kobzar A.I. Practical mathematical statistics for engineers and researchers. М. FIZMATLIT, 2006, 816 p.

9. R 50.1.037-2002 Standartization recommendations. Practical statistics. Rules of check of fitting of practical distribution to the theoretical one. Part I. χ2 criteria. State Standard of Russia. Moscow-2001, 140 p.

10. Akhmetov B.S., Ivanov A.I., Serikova N.I., Funtikova Y.V. Algorithm of imitative increase of number of degrees of freedom at the analysis of biometric data by the goodness-of-fit of the chi-square test. Reporter of the National academy of sciences of the Republic of Kazakhstan. No.5, 2014. p. 28-:-34.

11. Serikova N.I., Ivanov A.I., Kachalin S.V. Biometrical statistics: smoothing of histograms constructed on a small learning sampling. /Reporter of SibSAU 2014 No. 3(55) p.146-150.

12. Akhmetov B.B., Ivanov A.I., Bezyaev A.V., Funtikova Y.V. Multivariate statistical analysis of biometric data by the network of partial Pearson test. // Reporter of the National academy of sciences of the Republic of Kazakhstan. – Almaty, 2015. No.1. P. 5-11.


Review

For citations:


Akhmetov B.B., Ivanov A.I. Estimation of quality of a small sampling biometric data using a more efficient form of the chi-square test. Dependability. 2016;16(2):43-48. https://doi.org/10.21683/1729-2646-2016-16-2-43-48

Views: 974


Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 License.


ISSN 1729-2646 (Print)
ISSN 2500-3909 (Online)