Comparison of the duration of generating radiological protocols with keyboard and voice input

Cover Page


Cite item

Full Text

Abstract

BACKGROUND: Speech recognition is becoming increasingly common in the national healthcare system. One of the first specialties to implement this technology on a large scale was radiology. However, the efficiency of voice input and its effect on the length of time required to complete medical records remain unresolved.

AIM: To assess the efficiency of speech recognition in generating radiological protocols of different modalities and types.

METHODS: The retrospective study was conducted at the Moscow Reference Center of the Research and Practical Clinical Center for Diagnostics and Telemedicine Technologies of the Moscow Department of Health. A total of 12,912 radiological reports on fluorography, mammography, chest computed tomography (CT), contrast-enhanced magnetic resonance imaging (MRI) of the brain, and contrast-enhanced CT of the abdomen and pelvis were included in the study by simple random sampling. The size of all samples exceeded 766 reports, calculated with regard to the size of the general population of over 100,000 reports. The Voice2Med software was used to fill in the radiological protocols. Intergroup comparison was performed using the Mann–Whitney U-test with a statistical significance level of 0.05.

RESULTS: The average duration of generating fluorographic protocols in the keyboard and voice input groups was 189.9 s (0:03:09) and 236.2 s (0:03:56), respectively (p <0.0001). For mammographic reports, the duration was 387.1 s (0:06:27) and 444.8 s (0:07:24), respectively (p <0.0001). For radiographic reports, it amounted to 247.8 s (0:04:07) and 189.0 s (0: 03:09), respectively (p <0.0001), and for chest CT, it was 379.7 s (0:06:19) and 382.7 s (0:06:22), respectively (p=0.12). For MRI of the brain, the protocols were generated for 709.9 s (0:11:49) and 559.9 s (0: 09:19), respectively (p <0.0001), and for contrast-enhanced chest, abdominal, and pelvic CT scans, it took 2714.6 s (0:45:15) and 1778.4 s (0:29:38), respectively. Voice input slowed down the preparation time of mammographic and fluorographic protocols. This is due to the use of a structured electronic medical document in medical facilities to describe the results of the examinations. Speech recognition showed the greatest efficiency in generating MRI and CT protocols. Such reports contain a large number of pathological changes, both target and incidental findings, which requires a detailed description by the radiologist in the examination protocol.

CONCLUSIONS: Speech recognition in generating radiological protocols showed different efficiency depending on the modality and type of the radiological protocol filled in using the voice input system. This approach is optimal for describing CT and MRI scans.

Full Text

BACKGROUND: Speech recognition is becoming increasingly common in the national healthcare system. One of the first specialties to implement this technology on a large scale was radiology. However, the efficiency of voice input and its effect on the length of time required to complete medical records remain unresolved.

AIM: To assess the efficiency of speech recognition in generating radiological protocols of different modalities and types.

METHODS: The retrospective study was conducted at the Moscow Reference Center of the Research and Practical Clinical Center for Diagnostics and Telemedicine Technologies of the Moscow Department of Health. A total of 12,912 radiological reports on fluorography, mammography, chest computed tomography (CT), contrast-enhanced magnetic resonance imaging (MRI) of the brain, and contrast-enhanced CT of the abdomen and pelvis were included in the study by simple random sampling. The size of all samples exceeded 766 reports, calculated with regard to the size of the general population of over 100,000 reports. The Voice2Med software was used to fill in the radiological protocols. Intergroup comparison was performed using the Mann–Whitney U-test with a statistical significance level of 0.05.

RESULTS: The average duration of generating fluorographic protocols in the keyboard and voice input groups was 189.9 s (0:03:09) and 236.2 s (0:03:56), respectively (p <0.0001). For mammographic reports, the duration was 387.1 s (0:06:27) and 444.8 s (0:07:24), respectively (p <0.0001). For radiographic reports, it amounted to 247.8 s (0:04:07) and 189.0 s (0: 03:09), respectively (p <0.0001), and for chest CT, it was 379.7 s (0:06:19) and 382.7 s (0:06:22), respectively (p=0.12). For MRI of the brain, the protocols were generated for 709.9 s (0:11:49) and 559.9 s (0: 09:19), respectively (p <0.0001), and for contrast-enhanced chest, abdominal, and pelvic CT scans, it took 2714.6 s (0:45:15) and 1778.4 s (0:29:38), respectively. Voice input slowed down the preparation time of mammographic and fluorographic protocols. This is due to the use of a structured electronic medical document in medical facilities to describe the results of the examinations. Speech recognition showed the greatest efficiency in generating MRI and CT protocols. Such reports contain a large number of pathological changes, both target and incidental findings, which requires a detailed description by the radiologist in the examination protocol.

CONCLUSIONS: Speech recognition in generating radiological protocols showed different efficiency depending on the modality and type of the radiological protocol filled in using the voice input system. This approach is optimal for describing CT and MRI scans.

×

About the authors

Nikita D. Kudryavtsev

Research and Practical Clinical Center for Diagnostics and Telemedicine Technologies

Author for correspondence.
Email: KudryavtsevND@zdrav.mos.ru
ORCID iD: 0000-0003-4203-0630
SPIN-code: 1125-8637
Russian Federation, Moscow

Daria E. Sharova

Research and Practical Clinical Center for Diagnostics and Telemedicine Technologies

Email: SharovaDE@zdrav.mos.ru
ORCID iD: 0000-0001-5792-3912
SPIN-code: 1811-7595
Russian Federation, Moscow

Anton V. Vladzymyrskyy

Research and Practical Clinical Center for Diagnostics and Telemedicine Technologies

Email: VladzimirskijAV@zdrav.mos.ru
ORCID iD: 0000-0002-2990-7736
SPIN-code: 3602-7120
Russian Federation, Moscow

References

  1. Khramtsov AI, Nasyrov RA, Khramtsova GF. Application of digital technology in the work of a pathologist: guidelines for learning how to use speech recognition systems. Pediatrician (St. Petersburg). 2021;12(3):63–68. (In Russ). doi: 10.17816/PED12363-68
  2. Kudryavtsev N D, Sergunova K A, Ivanova G.V, et al. Evaluation of the effectiveness of the implementation of speech recognition technology for the preparation of radiological protocol. Vrach i informatsionnye tekhnologii. 2020;(S1):58–64. (In Russ). doi: 10.37690/1811-0193-2020-S1-58-64
  3. Tekhnologiya raspoznavaniya rechi pomogla vracham zapolnit’ bolee 210 tysyach protokolov luchevykh issledovanii [Internet]. Moscow Mayor official website [cited 2023 Apr 18]. Available from: https://www.mos.ru/news/item/118060073/. (In Russ).
  4. Kudryavtsev ND, Semenov DS, Kozhikhina DD, Vladzymyrskyy AV. Speech recognition technology: results of a survey of radiologists at the Moscow reference center for diagnostic radiology. Healthcare Management: News. Views. Education. Bulletin of VSHOUZ. 2022;8(3):95–104. (In Russ). doi: 10.33029/2411-8621-2022-8-3-95-104

Supplementary files

Supplementary Files
Action
1. JATS XML

Copyright (c) 2023 Eco-Vector

Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.

СМИ зарегистрировано Федеральной службой по надзору в сфере связи, информационных технологий и массовых коммуникаций (Роскомнадзор).
Регистрационный номер и дата принятия решения о регистрации СМИ: серия ПИ № ФС 77 - 79539 от 09 ноября 2020 г.


This website uses cookies

You consent to our cookies if you continue to use our website.

About Cookies