Discriminant analysis to predict the clinical diagnosis of primary immunodeficiencies: a preliminary report
PDF (Spanish)
PubMed

Supplementary Files

HTML (Spanish)

Keywords

discriminant analysis
clinical diagnosis
primary immunodeficiencies
automatic learning
computed-assised
expert vs machine

How to Cite

Discriminant analysis to predict the clinical diagnosis of primary immunodeficiencies: a preliminary report. (2015). Revista Alergia México , 62(2), 125-133. https://doi.org/10.29262/ram.v62i2.66

Plaudit

Abstract

Background: The features in a clinical history from a patient with suspected primary immunodeficiency (PID) direct the differential diagnosis through pattern recognition. PIDs are a heterogeneous group of more than 250 congenital diseases with increased susceptibility to infection, inflammation, autoimmunity, allergy and malignancy. Linear discriminant analysis (LDA) is a multivariate supervised classification method to sort objects of study into groups by finding linear combinations of a number of variables.

Objective: To identify the features that best explain membership of pediatric PID patients to a group of defect or disease.

Material and method: An analytic cross-sectional study was done with a pre-existing database with clinical and laboratory records from 168 patients with PID, followed at the National Institute of Pediatrics during 1991-2012, it was used to build linear discriminant models that would explain membership of each patient to the different group defects and to the most prevalent PIDs in our registry. After a preliminary run only 30 features were included (4 demographic, 10 clinical, 10 laboratory, 6 germs), with which the training models were developed through a stepwise regression algorithm. We compared the automatic feature selection with a selection made by a human expert, and then assessed the diagnostic usefulness of the resulting models (sensitivity, specificity, prediction accuracy and kappa coefficient), with 95% confidence intervals. 

Results: The models incorporated 6 to 14 features to explain membership of PID patients to the five most abundant defect groups (combined, antibody, well-defined, dysregulation and phagocytosis), and to the four most prevalent PID diseases (X-linked agammaglobulinemia, chronic granulomatous disease, common variable immunodeficiency and ataxia-telangiectasia). In practically all cases of feature selection the machine outperformed the human expert. Diagnosis prediction using the equations created had a global accuracy of 83 to 94%, with sensitivity of 60 to 100%, specificity of 83 to 95% and kappa coefficient of 0.37 to 0.76. 

Conclusions: In general, the selection of features has clinical plausibility, and the practical advantage of utilizing only clinical attributes, infecting germs and routine lab results (blood cell counts and serum immunoglobulins). The performance of the model as a diagnostic tool was acceptable. The study’s main limitations are a limited sample size and a lack of cross validation. This is only the first step in the construction of a machine learning system, with a wider approach that includes a larger database and different methodologies, to assist the clinical diagnosis of primary immunodeficiencies.

PDF (Spanish)
PubMed

References

Al-Herz W, Bousfiha A, Casanova J-L, Chapel H, et al. Primary immunodeficiency diseases: an update on the classification from the international union of immunological societies expert committee for primary immunodeficiency. Front Immunol [Internet]. 2011 Jan;2:54. Available from: http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=3342372&tool=pmcentrez&rendertype=abstract

Sherbino J, Dore KL, Siu E, Norman GR. The effectiveness ofcognitive forcing strategies to decrease diagnostic error: an exploratory study. Teach Learn Med [Internet]. 2011/01/18 ed. 2011;23:78-84. Available from: http://www.ncbi.nlm.nih.gov/pubmed/21240788

Groves M, O’Rourke P, Alexander H. The clinical reasoning characteristics of diagnostic experts. Med Teach [Internet].2003/07/26 ed. 2003;25:308-313. Available from: http://www.ncbi.nlm.nih.gov/pubmed/12881056

Sajda P. Machine learning for detection and diagnosis of disease. Annu Rev Biomed Eng [Internet]. 2006 [cited 2014 May 3]; Available from: http://www.annualreviews.org/doi/abs/10.1146/annurev.bioeng.8.061505.095802

Cleophas TJ, Zwinderman AH. Machine Learning in Medicine [Internet]. Vasa. Springer; 2013 [cited 2014 Feb 14]. Available from: http://medcontent.metapress.com/index/A65RM03P4874243N.pdf

Pardo PJ, Georgopoulos AP, Kenny JT, Stuve TA, et al. Classification of adolescent psychotic disorders using linear discriminant analysis. Schizophr Res [Internet]. 2006 Oct [cited 2014 Nov 7];87:297-306. Available from: http://www.ncbi.nlm.nih.gov/pubmed/16797923

Kamel MI, Alhaddad MJ, Malibary HM, Thabit K, et al. EEG based autism diagnosis using regularized Fisher Linear Discriminant Analysis. IJ Image, Graph Signal Process.2012;3:35-41.

Ye J, Li T, Xiong T, Janardan R. Using uncorrelated discriminant analysis for tissue classification with gene expression data. IEEE/ACM Trans Comput Biol Bioinform 2004;1:181-190.

Krawczyk B, Simić D, Simić S, Woźniak M. Automatic diagnosis of primary headaches by machine learning methods.

Cent Eur J Med [Internet]. 2012 Nov 16 [cited 2014 Jan 29];8:157-165. Available from: http://www.springerlink.com/index/10.2478/s11536-012-0098-5

Chen H, Yang B, Wang G. Support vector machine based diagnostic system for breast cancer using swarm intelligence. 2012;(February 2011):2505–19.

Cherkassky M. Application of machine learning methods to medical diagnosis. Chance [Internet]. 2009 Feb 15;22:42-50. Available from: http://link.springer.com/10.1007/ s144-009-0007-0

Creative Commons License

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

Copyright (c) 2015 Revista Alergia México

Downloads

Download data is not yet available.