Preview

Meditsinskiy sovet = Medical Council

Advanced search

From sequencing data to disease understanding: How can a doctor process patient’s NGS data on their own computer

https://doi.org/10.21518/ms2025-351

Abstract

Introduction. In modern medicine, physicians are increasingly required to be versatile specialists, combining in-depth medical knowledge with technical expertise. While the accessibility of genomic research has dramatically increased over the past few decades, its full integration into medical practice still faces significant challenges. Given the rapid proliferation of new knowl­edge regarding the associations between genomic data and human diseases, there is a growing clinical need for physicians to be able to analyze this data themselves. This is especially true for subsequent medico-genetic studies, particularly when patients already have existing Next-Generation Sequencing (NGS) data (e.g., from exome sequencing).

Aim. The objective of this study is to develop and provide a detailed guide for medical specialists to independently perform bioinformatics analysis of a patient’s NGS data.

Materials and methods. The source data for this study are examples of NGS data files provided to patients following a medicogenetic examination. We used both established and custom-developed software algorithms for read alignment against a refer­ence genome, variant discovery, variant filtering based on quality criteria and specific genes (and their transcripts), and assess­ing their potential health impact.

Results. We developed a comprehensive algorithm and a bioinformatics processing pipeline for sequencing data analysis. This pipeline utilizes a Linux command-line interface, along with Docker containers for established bioinformatics tools such as bwa, gatk, samtools, and bcftools, as well as R scripts based on the Bioconductor project and our own proprietary developments. This algorithm allows medical professionals to independently obtain and interpret genetic variants from a patient’s NGS data.

Conclusion. The information obtained through this pipeline can serve as a foundation for further work in diagnosing hereditary diseases, personalized medicine, and pharmacogenetics. The proposed algorithm effectively achieves the study’s objective, enabling the retrieval of patient genomic sequence variants (exomes) suitable for subsequent analysis and interpretation on a personal computer. We anticipate that a physician’s computer can handle this task in a reasonable amount of time, ensuring reliable and reproducible data processing.

About the Authors

A. A. Korneenkov
Saint Petersburg Research Institute of Ear, Throat, Nose and Speech
Russian Federation

Aleksei A. Korneenkov, Dr. Sci. (Med.), Professor, Head of the Research Laboratory of Clinical Informatics and Biostatistics

9, Bronnitskaya St., St Petersburg, 190013



Yu. K. Yanov
Military Medical Academy named after S.M. Kirov; North-Western State Medical University named after I.I. Mechnikov
Russian Federation

Yuri K. Yanov, Dr. Sci. (Med.), Professor, Member of the Russian Academy of Sciences, Military Medical Academy named after S.M. Kirov; Professor, Department of Otolaryngology, North-Western State Medical University named after I.I. Mechnikov

6, Akademik Lebedev St., St Petersburg, 194044,

41, Kirochnaya St., St Petersburg, 191015



E. E. Vyazemskaya
Saint Petersburg Research Institute of Ear, Throat, Nose and Speech
Russian Federation

Elena E. Vyazemskaya, Engineer of the Research Laboratory of Clinical Informatics and Biostatistics

9, Bronnitskaya St., St Petersburg, 190013



A. Yu. Medvedeva
Saint Petersburg Research Institute of Ear, Throat, Nose and Speech
Russian Federation

Anna Y. Medvedeva, Engineer of the Research Laboratory of Clinical Informatics and Biostatistics

9, Bronnitskaya St., St Petersburg, 190013



References

1. Vicente AM, Ballensiefen W, Jönsson JI. How personalised medicine will transform healthcare by 2030: the ICPerMed vision. J Transl Med. 2020;18(1):180. https://doi.org/10.1186/s12967-020-02316-w.

2. Khan A, Barapatre AR, Babar N, Doshi J, Ghaly M, Patel KG et al. Genomic medicine and personalized treatment: a narrative review. Ann Med Surg. 2025;87(3):1406–1414. https://doi.org/10.1097/MS9.0000000000002965.

3. Desmond-Hellmann S. Toward precision medicine: a new social contract? Sci Transl Med. 2012;4(129):129ed3. https://doi.org/10.1126/scitranslmed.3003473.

4. Collins FS, Varmus H. A new initiative on precision medicine. N Engl J Med. 2015;372(9):793–795. https://doi.org/10.1056/NEJMp1500523.

5. Manolio TA, Narula J, Bult CJ, Chisholm RL, Deverka PA, Ginsburg GS et al. Genomic medicine year in review: 2023. Am J Hum Genet. 2023;110(12):1992–1995. https://doi.org/10.1016/j.ajhg.2023.11.001.

6. Beg A, Parveen R. Role of bioinformatics in cancer research and drug development. In: Translational Bioinformatics in Healthcare and Medicine. Elsevier. 2021;141–148. https://doi.org/10.1016/b978-0-323-89824-9.00011-2.

7. Attwood TK, Blackford S, Brazas MD, Davies A, Schneider MV. A global perspective on evolving bioinformatics and data science training needs. Brief Bioinform. 2019;20(2):398–404. https://doi.org/10.1093/bib/bbx100.

8. Chatterjee A, Ahn A, Rodger EJ, Stockwell PA, Eccles MR. A guide for designing and analyzing RNA-Seq data. In: Raghavachari N, Garcia-Reyero N, eds. Gene Expression Analysis. Methods MolBio. 2018;1783:35–80. https://doi.org/10.1007/978-1-4939-7834-2_3.

9. Kolanu ND. CRISPR-Cas9 Gene Editing: Curing Genetic Diseases by Inherited Epigenetic Modifications. Glob Med Genet. 2024;11(1):113–122. https://doi.org/10.1055/s-0044-1785234.

10. Hu T, Chitnis N, Monos D, Dinh A. Next-generation sequencing technologies: An overview. Hum Immunol. 2021;82(11):801–811. https://doi.org/10.1016/j.humimm.2021.02.012.

11. Toriello HV, Smith SD. Hereditary Hearing Loss and Its Syndromes. 3rd ed. Oxford University Press; 2013. Available at: https://global.oup.com/academic/product/hereditary-hearing-loss-and-its-syndromes9780199731961?cc=ru&lang=en&.

12. Owusu Obeng A, Fei K, Levy KD, Elsey AR, Pollin TI, Ramirez AH et al. Physician-Reported Benefits and Barriers to Clinical Implementation of Genomic Medicine: A Multi-Site IGNITE-Network Survey. J Pers Med. 2018;8(3):24. https://doi.org/10.3390/jpm8030024.

13. Carroll JC, Makuwaza T, Manca DP, Sopcak N, Permaul JA, O’Brien MA et al. Primary care providers’ experiences with and perceptions of personalized genomic medicine. Can Fam Physician. 2016;62:e626–e635. Available at: https://www.cfp.ca/content/62/10/e626.

14. Campion M, Goldgar C, Hopkin RJ, Prows CA, Dasgupta S. Genomic education for the next generation of health-care providers. Genet Med. 2019;21(11):2422–2430. https://doi.org/10.1038/s41436-019-0548-4.

15. Kohzaki H. A proposal for clinical genetics (genetics in medicine) education for medical technologists and other health professionals in Japan. Front Public Health. 2014;2:128. https://doi.org/10.3389/fpubh.2014.00128.

16. Kudron EL, Deininger KM, Aquilante CL. Are Graduate Medical Trainees Prepared for the Personalized Genomic Medicine Revolution? Trainee Perspectives at One Institution. J Pers Med. 2023;13(7):1025. https://doi.org/10.3390/jpm13071025.

17. Lee-Barber J, Kulo V, Lehmann H, Hamosh A, Bodurtha J. Bioinformatics for medical students: a 5-year experience using OMIM® in medical student education. Genet Med. 2019;21(2):493–497. https://doi.org/10.1038/s41436-018-0076-7.

18. Schwartz JE, Ko P, Freed S, Safdar N, Christman M, Page R et al. Integrating an interprofessional educational exercise into required medical student clerkships – a quantitative analysis. BMC Med Educ. 2025;25(1):168. https://doi.org/10.1186/s12909-024-06499-4.

19. Hoffman JD, Thompson R, Swenson KB, Dasgupta S. Complexities of Clinical Genetics Consultation: An Interprofessional Clinical Skills Workshop. MedEdPORTAL. 2020;16:10869. https://doi.org/10.15766/mep_2374-8265.10869.

20. Kopel J, Brower GL. Perspectives on Consumer and Clinical Genetic Testing Education among Medical Students in West Texas. J Community Hosp Intern Med Perspect. 2022;12(3):28–32. https://doi.org/10.55729/2000-9666.1050.

21. Kopel J. Clinical genetic testing in medical education. Proc. 2019;32(1):165–166. https://doi.org/10.1080/08998280.2018.1528937.

22. Nightingale KP, Bishop M, Avitabile N, Simpson S, Freidoony L, Buckley S, Tatton-Brown K. Evaluation of the Master’s in Genomic Medicine framework: A national, multiprofessional program to educate health care professionals in NHS England. Genet Med. 2025;27(1):101277. https://doi.org/10.1016/j.gim.2024.101277.

23. French EL, Kader L, Young EE, Fontes JD. Physician Perception of the Importance of Medical Genetics and Genomics in Medical Education and Clinical Practice. Med Educ Online. 2023;28(1):2143920. https://doi.org/10.1080/10872981.2022.2143920.

24. McCorkell G, Nisselle A, Halton D, Bouffler SE, Patel C, Christodoulou J et al. A national education program for rapid genomics in pediatric acute care: Building workforce confidence, competence, and capability. Genet Med. 2024;26(10):101224. https://doi.org/10.1016/j.gim.2024.101224.

25. McGrath SP, Walton N, Williams MS, Kim KK, Bastola K. Are providers prepared for genomic medicine: interpretation of Direct-to-Consumer genetic testing (DTC-GT) results and genetic self-efficacy by medical professionals. BMC Health Serv Res. 2019;19(1):844. https://doi.org/10.1186/s12913-019-4679-8.

26. Sepulveda JL. Using R and Bioconductor in Clinical Genomics and Transcriptomics. J Mol Diagn. 2020;22(1):3–20. https://doi.org/10.1016/j.jmoldx.2019.08.006.

27. Kadri S, Sboner A, Sigaras A, Roy S. Containers in Bioinformatics: Applications, Practical Considerations, and Best Practices in Molecular Pathology. J Mol Diagn. 2022;24(5):442–454. https://doi.org/10.1016/j.jmoldx.2022.01.006.

28. Kang K, Yang Y, Wu Y, Luo R. Integrating Large Language Models in Bioinformatics Education for Medical Students: Opportunities and Challenges. Ann Biomed Eng. 2024;52(9):2311–2315. https://doi.org/10.1007/s10439-024-03554-5.

29. Korneenkov AA, Yanov YuK, Ryazantsev SV, Vyazemskaya EE, Astashchenko SV, Ryazantseva ES. A meta-analysis of clinical studies in otorhinolaryngology. Vestnik Oto-Rino-Laringologii. 2020;85(2):26–30. (In Russ.) https://doi.org/10.17116/otorino20208502126.

30. Korneenkov AA, Ryazantsev SV, Vyazemskaya EE, Budkovaya MA. The measures of informativeness of diagnostic medical technologies in otorhinolaryngology: calculation and interpretation. Rossiiskaya Otorinolaringologiya. 2020;19(1):46–55. (In Russ.) https://doi.org/10.18692/1810-4800-2020-1-46-55.

31. Korneenkov AA, Levina EA, Vyazemskaya EE, Levin SV, Skirpichnikov IN. Spatial cluster modeling of access of elderly patients with hearing loss to medical services. Rossiiskaya Otorinolaringologiya . 2021;20(6):8–19. (In Russ.) https://doi.org/10.18692/1810-4800-2021-6-8-19.

32. Korneenkov AA, Ryazantsev SV, Levin SV, Khramov AV, Vyazemskaya EE, Skirpichnikov IN et al. Spatial and statistical analysis of hearing impairment data of Chelyabinsk region residents. Rossiiskaya Otorinolaringologiya. 2021;20(3):39–50. (In Russ.) https://doi.org/10.18692/1810-4800-2021-3-39-50.

33. Korneenkov AA, Yanov YuK, Vyazemskaya EE, Medvedeva AYu. Issues of integrating medical bioinformatics technologies into otorhinolaryngology: challenges and software solutions. Rossiiskaya Otorinolaringologiya. 2024;23(6):8–19. (In Russ.) Available at: https://www.elibrary.ru/kevooo.

34. Korneenkov AA, Yanov YuK, Dvoryanchikov VV, Vyazemskaya EE, Medvedeva AYu. Medical bioinformatics: operations with nucleotide and amino acid sequences in the R. Rossiiskaya Otorinolaringologiya. 2025;24(4):13–27. (In Russ.) https://doi.org/10.18692/1810-4800-2025-4-13-27.

35. Taylor C. The Future of Bioinformatics. Science. 2000;8. https://doi.org/10.1126/article.63850.


Review

For citations:


Korneenkov AA, Yanov YK, Vyazemskaya EE, Medvedeva AY. From sequencing data to disease understanding: How can a doctor process patient’s NGS data on their own computer. Meditsinskiy sovet = Medical Council. 2025;(18):108-121. (In Russ.) https://doi.org/10.21518/ms2025-351

Views: 9


Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 License.


ISSN 2079-701X (Print)
ISSN 2658-5790 (Online)