Artificial intelligence in clinical physiology: How to improve learning agility

Cover Page


Cite item

Abstract

Clinical physiology involves a complete, comprehensive, multilateral study of the functions of both affected and healthy organs, which allows us to assess the compensatory capabilities of the body.

Artificial intelligence is increasingly being used in medicine, including in clinical physiology. This is facilitated by the increase in computing processing power, development of cloud services and datasets, and numerous scientific articles demonstrating the effectiveness and viability of such intelligent solutions.

Although the approach to medical dataset development is generally similar, there are a number of key features and significant differences in clinical physiology. Artificial intelligence systems in clinical physiology may be effectively trained and applied in practice by following the recommendations in this study.

The national standard of the Russian Federation GOST R 59921.9-2022, which has entered into force, is included in the set of standards “Artificial Intelligence systems in clinical medicine” and establishes additional requirements for data analysis algorithms and test methods of artificial intelligence systems used in the field of clinical physiology. A crucial feature of the created standard is its qualimetric type (i.e., it has a mandatory set of demonstration data).

Russia is one of the first countries to start developing quasi-metric standards worldwide, and 15 industry standards in the field of artificial intelligence (2 of them in medicine) will come into force this year.

Full Text

INTRODUCTION

Clinical physiology is a branch of medicine that studies the role and nature of physiological changes in the body during pre-pathological and pathological conditions. Clinical physiology is a complete, comprehensive, multifaceted study of the body’s functions (not only of affected organs but also of healthy ones), allowing us to assess the body’s compensatory capabilities.[1]

Artificial intelligence (AI) systems are increasingly being used in almost all fields of medicine: [2] there is a significant number of works on electrocardiography (ECG) evaluation, including through smart watches, [3–7] and an increasing amount of research in the field of computer vision around the world,[8, 9] as well as the development of various smart solutions1. PhysioNet2, for example, includes a large number of open data sets pertaining to various pathologies. The largest open ECG data sets contain 21,837 [10] and 10,646 ECGs, [11] respectively; however, despite the importance of the issue, the formation of such data sets remains a major challenge that necessitates a thorough approach.

We identified the following major issues while analyzing public open ECG data sets:

  1. Differences in technical conditions for ECG recording: sampling rate, least significant digit value, analog-to-digital converter bit depth, recording duration, and number of channels
  2. Incompatible descriptive languages (thesauri): different “schools,” different patiet populations, and different end-user goals
  3. Imbalance in ECG disorder classes within the data set and in data sets with the general population
  4. Concerns about the quality of the annotation/classification
  5. Lack or absence of clinical data (metadata)

These issues can be worsened when other diagnostic and control methods in clinical physiology are used. This is due to the fact that the following data can be used to create a data set and then train AI systems in clinical physiology3:

  1. Physiological parameter values (blood pressure, heart rate, and saturation level)
  2. Digitized biological signals (electrocardiogram and vessel pressure indicator)
  3. Induced and returned signals (neuromyogram, rheogram, Doppler curve, and ultrasound M-scan)
  4. Dynamic images (cine loops)
  5. Complex data

DATA SET FORMATION METHODOLOGY: ARE THERE ANY DIFFERENCES?

Data set formation methodology in clinical physiology is broadly similar to that of radiodiagnosis [12]: planning; creation of a thesaurus or glossary and inclusion and exclusion criteria; selection of experts and moderators; data analysis for compliance with inclusion criteria; annotation approval; and multilevel moderation. However, there are several key differences as follows:

  1. The order in which the data array is processed differs significantly. The work sequence for preparing a data set (number series, graphs, and individual measurements) is as follows:
    • Data segmentation (separation)
    • Data measurement
    • Data annotation: a method of providing verbal (semantics) meaning to an object or set of data
    • Data classification
  2. A dictionary (glossary) is sufficient for classifying simple (binary) properties of objects, and a thesaurus is required for multiclass objects.
  3. Moreover, some less evident and difficult-to-categorize factors can lead to significant errors when creating a data set4, 5:
    • Highly qualified operators are required to conduct clinical physiology research; one of the most important factors in source data generation is operator dependence.
    • When forming the final data set, the presented array of studies should be analyzed for the following factors: sufficient recording duration, number of channels, disabling signal filtering, as well as compliance with accepted technical parameters, dynamic range, signal-to-noise ratio, and results storage format.
    • Experts and moderators involved in data separation must be qualified: while data anonymization is permissible, details of their qualifications and contributions must be included in the AI system test report.
    • A set of equipment and software for AI system tests in clinical physiology should be developed; at the same time, the characteristics of hardware and software must exceed the minimum requirements set by the AI system manufacturer and consider the typical characteristics of a specific or potential user’s computing facilities.

INCLUSION AND EXCLUSION CRITERIA FOR RECORD SELECTION IN CLINICAL PHYSIOLOGY DATA SET FORMATION

Exclusion criteria (absolute; only one is required):

  • The records are provided in a proprietary format, and the manufacturer refuses to create a matching layer
  • Noncompliance with technical specifications for saved data (for example, the recording duration for a digital ECG is less than 10 s, the sampling rate is less than 500 Hz, the least significant digit value is greater than 5 µV, and the analog-to-digital converter bit depth is less than 10 bits).
  • Access to metadata is either impossible or significantly restricted
  • Less than 70% of the ECGs in the final data set are annotated and classified appropriately

Inclusion criteria (all must be met):

  • The records are provided in one of the following formats: WDBF, EDF, aECG (HL-7), SCP-ECG, DICOM-ECG, and XML
  • Compliance with the technical specifications for saved data (for example, for a digital ECG, the recording duration must be at least 10 s, the sampling rate must be at least 500 Hz, the least significant digit value must be 5 µV, and the analog-to-digital converter bit depth must be at least 10 bits)
  • Access to metadata is not restricted
  • At least 90% of the ECGs in the final data set are annotated and classified appropriately

It appears that data sets for AI system training should include the full range of possible phenomena (syndromes, diagnoses, and outcomes) from the most rare (casual) to the most common. The type of data set determines the need to respect the variability of gender and racial differences in patients (for example, these metadata are required for assessing respiratory function parameters). The incidence of phenomena (syndromes) in a population is given less weight in the data set formation. It is recommended to use additional metrics when using unbalanced data sets for rare (casual) phenomena.

NORMATIVE DOCUMENTS REGULATING THE DEVELOPMENT AND APPLICATION OF DATA ANALYSIS ALGORITHMS AND TEST METHODS OF ARTIFICIAL INTELLIGENCE SYSTEMS IN CLINICAL PHYSIOLOGY

The national standard of the Russian Federation GOST R 59921.9-20226, which became effective on January 1, 2023, is included in the set of standards known as “Artificial Intelligence Systems in Clinical Medicine” and establishes additional requirements for data analysis algorithms and test methods of AI systems in clinical physiology.

Developers of AI systems for clinical physiology and other interested parties will be able to study the requirements:

  • Data set generation, preparation, segmentation, measurement, detection, annotation, and classification for AI system testing
  • Data set structure, application procedures, and access conditions
  • Organizing terminological resources and presenting data analysis results
  • Information exchange between medical devices, intelligent systems, and other healthcare automation systems
  • Technical, bench, laborator, and clinical test processes and results, as well as postregistration and operational control of software and hardware–software systems based on artificial intelligence technologies
  • The form and content of software and hardware–software systems based on artificial intelligence technologies, in accordance with the tasks being solved in the field of medicine and healthcare

The prescribed requirements for data sets distinguish the new national standard from other GOST R standards and English-language counterparts. Three scenarios are proposed in particular: clinical trials conducted only on a test site (bench) using data sets; clinical trials conducted within a health facility; and combined clinical trials. All scenarios are illustrated with flowcharts (Figure 1).

 

Figure 1. Flowchart for conducting clinical trials with data sets (one implementation option)

 

The standard also includes test options for assessing AI system resistance to errors in input data and testing using synthetic and combined data. The new GOST К standard allows the testing of AI systems that are compatible with various data types and presentation formats. The following can be used for AI system testing in clinical physiology:

  • Measured physiological parameter values (blood pressure, heart rate, and saturation level)
  • Digitized biological signals (electrocardiogram and vessel pressure indicator)
  • Induced and returned signals (neuromyogram, rheogram, Doppler curve, and ultrasound M-scan)
  • Dynamic images (cine loops, for example, in an ultrasound examination mode and motion video recordings)
  • Complex data containing data of several types listed above (synchronized and in-phase)

The data can represent the results of single measurements (patient studies), or they can be chosen to systematically represent the development of pathological processes (a time series of homogeneous measurements), or they can reflect changes upon presentation of graduated stimuli, or they can reflect changes in parameters depending on external conditions (during sleep, at rest, during physical or mental stress, distress, etc.).

The fact that the new standard is a quasimetric GOST R (i.e., it comes with a mandatory set of demo data) is also an important feature (Figure 2).

 

Figure 2. An example file from the demo data set of GOST R 59921.9-2022, Artificial Intelligence Systems in Clinical Medicine. Algorithms for data analysis in clinical physiology. Testing methods

 

Russia was one of the world’s first countries to develop quasimetric standards. In the field of artificial intelligence, 15 industrial standards will come into force in 2023, with two of them in medicine.7, 8

CONCLUSION

Compliance with the aforementioned rules will allow for the acquisition of a data set for AI system training in such a way that all three phases of clinical trials are potentially passed, namely, (a) testing to ensure the accuracy of the input data (recognition of signals received with a violation of the study technology, as well as those containing artifacts and noise); (b) testing to ensure the accuracy of the recognition of syndromes, phenomena, clinical equivalents, and/or the formation of a conclusion (annotation) according to an agreed thesaurus or glossary; and (c) testing on synthetic and combined data (recognition of a synthetic stimulus signal that initiates or amplifies natural signals and evaluation of stimulation efficiency or inefficiency).

ADDITIONAL INFORMATION

Funding source. This article was not supported by any external sources of funding.

Competing interests. The authors declare that they have no competing interests.

Authors’ contribution. All authors made a substantial contribution to the conception of the work, acquisition, analysis, interpretation of data for the work, drafting and revising the work, final approval of the version to be published and agree to be accountable for all aspects of the work. D.V. Shutov, D.V. Drozdov ― work concept and design, editing and approval of the final version of the manuscript, advisory support; D.E. Sharova ― work concept and design, data analysis, writing the text of the article, editing and approval the final version of the manuscript; L.R. Abuladze ― writing the text of the article, editing.

×

About the authors

Dmitry V. Shutov

Moscow Center for Diagnostics and Telemedicine

Author for correspondence.
Email: ShutovDV@zdrav.mos.ru
ORCID iD: 0000-0003-1836-3689
SPIN-code: 9381-2456

MD, Dr. Sci. (Med.)

Russian Federation, Moscow

Dariya E. Sharova

Moscow Center for Diagnostics and Telemedicine

Email: ShutovDV@zdrav.mos.ru
ORCID iD: 0000-0001-5792-3912
SPIN-code: 1811-7595
Russian Federation, Moscow

Liya R. Abuladze

Moscow Center for Diagnostics and Telemedicine

Email: AbuladzeLR@zdrav.mos.ru
ORCID iD: 0000-0001-6745-1672
SPIN-code: 8640-9989

Junior Research Associate

Russian Federation, Moscow

Dmitrii V. Drozdov

National Medical Research Center of Cardiology

Email: cardioexp@gmail.com
ORCID iD: 0000-0001-7374-3604
SPIN-code: 2279-9657

MD, Cand. Sci. (Med.)

Russian Federation, Moscow

References

  1. Kurzanov AN. Clinical physiology: formation, goals, tasks, limits of competence, place in the system of higher professional medical education. International journal of experimental education. 2012;(4):128–130. (In Russ).
  2. Gusev AV, Vladzimirsky AV, Sharova DE, et al. Development of research and development in the field of artificial intelligence technologies for healthcare in the Russian Federation: results of 2021. Digital Diagnostics. 2022;3(3):178–194. (In Russ). doi: 10.17816/DD107367
  3. Al-Mousily MF, Baker GH, Jackson L, et al. The use of a traditional nonlooping event monitor versus a loan-based program with a smartphone ECG device in the pediatric cardiology clinic. Cardiovasc Digit Heal J. 2021;2(1):71–75. doi: 10.1016/j.cvdhj.2020.11.008
  4. Ding EY, Pathiravasan CH, Schramm E, et al. Design, deployment, and usability of a mobile system for cardiovascular health monitoring within the electronic Framingham Heart Study. Cardiovasc Digit Heal J. 2021;2(3):171–178. doi: 10.1016/j.cvdhj.2021.04.001
  5. Bashar SK, Hossain MB, Lázaro J, et al. Feasibility of atrial fibrillation detection from a novel wearable armband device. Cardiovasc Digit Heal J. 2021;2(3):179–191. doi: 10.1016/j.cvdhj.2021.05.004
  6. Goodwin AJ, Eytan D, Greer RW, et al. A practical approach to storage and retrieval of high-frequency physiological signals. Physiol Meas. 2020;41(3):035008. doi: 10.1088/1361-6579/ab7cb5
  7. Bartlett VL, Ross JS, Shah ND, et al. Physical activity, patient-reported symptoms, and clinical events: Insights into postprocedural recovery from personal digital devices. Cardiovasc Digit Heal J. 2021;2(4):212–221. doi: 10.1016/j.cvdhj.2021.06.002
  8. Mishra S, Khatwani G, Patil R, et al. ECG paper record digitization and diagnosis using deep learning. J Med Biol Eng. 2021;41(4):422–432. doi: 10.1007/s40846-021-00632-0
  9. Kashou AH, Mulpuru SK, Deshmukh AJ, et al. An artificial intelligence-enabled ECG algorithm for comprehensive ECG interpretation: Can it pass the ‘Turing test’? Cardiovasc Digit Heal J. 2021;2(3):164–170. doi: 10.1016/j.cvdhj.2021.04.002
  10. Wagner P, Strodthoff N, Bousseljot RD, et al. PTB-XL, a large publicly available electrocardiography dataset. Sci Data. 2020;7(1):154. doi: 10.1038/s41597-020-0495-6
  11. Zheng J, Zhang J, Danioko S, et al. A 12-lead electrocardiogram database for arrhythmia research covering more than 10,000 patients. Sci Data. 2020;7(1):48. doi: 10.1038/s41597-020-0386-x
  12. M 80 Regulations for the preparation of data sets with a description of approaches to the formation of a representative sample of data. Part 1. Methodological recommendations. Ed by S.P. Morozov, A.V. Vladzimirsky, A.E. Andreichenko, et al. Moscow; 2022. 40 р. (The series “Best practices of radiation and instrumental diagnostics”). (In Russ).

Supplementary files

Supplementary Files
Action
1. JATS XML
2. Figure 1. Flowchart for conducting clinical trials with data sets (one implementation option)

Download (117KB)
3. Figure 2. An example file from the demo data set of GOST R 59921.9-2022, Artificial Intelligence Systems in Clinical Medicine. Algorithms for data analysis in clinical physiology. Testing methods

Download (409KB)

Copyright (c) 2023 Eco-Vector

Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.

СМИ зарегистрировано Федеральной службой по надзору в сфере связи, информационных технологий и массовых коммуникаций (Роскомнадзор).
Регистрационный номер и дата принятия решения о регистрации СМИ: серия ПИ № ФС 77 - 79539 от 09 ноября 2020 г.


This website uses cookies

You consent to our cookies if you continue to use our website.

About Cookies