Comparison of the duration of generating radiological protocols with keyboard and voice input

封面


如何引用文章

全文:

详细

BACKGROUND: Speech recognition is becoming increasingly common in the national healthcare system. One of the first specialties to implement this technology on a large scale was radiology. However, the efficiency of voice input and its effect on the length of time required to complete medical records remain unresolved.

AIM: To assess the efficiency of speech recognition in generating radiological protocols of different modalities and types.

METHODS: The retrospective study was conducted at the Moscow Reference Center of the Research and Practical Clinical Center for Diagnostics and Telemedicine Technologies of the Moscow Department of Health. A total of 12,912 radiological reports on fluorography, mammography, chest computed tomography (CT), contrast-enhanced magnetic resonance imaging (MRI) of the brain, and contrast-enhanced CT of the abdomen and pelvis were included in the study by simple random sampling. The size of all samples exceeded 766 reports, calculated with regard to the size of the general population of over 100,000 reports. The Voice2Med software was used to fill in the radiological protocols. Intergroup comparison was performed using the Mann–Whitney U-test with a statistical significance level of 0.05.

RESULTS: The average duration of generating fluorographic protocols in the keyboard and voice input groups was 189.9 s (0:03:09) and 236.2 s (0:03:56), respectively (p <0.0001). For mammographic reports, the duration was 387.1 s (0:06:27) and 444.8 s (0:07:24), respectively (p <0.0001). For radiographic reports, it amounted to 247.8 s (0:04:07) and 189.0 s (0: 03:09), respectively (p <0.0001), and for chest CT, it was 379.7 s (0:06:19) and 382.7 s (0:06:22), respectively (p=0.12). For MRI of the brain, the protocols were generated for 709.9 s (0:11:49) and 559.9 s (0: 09:19), respectively (p <0.0001), and for contrast-enhanced chest, abdominal, and pelvic CT scans, it took 2714.6 s (0:45:15) and 1778.4 s (0:29:38), respectively. Voice input slowed down the preparation time of mammographic and fluorographic protocols. This is due to the use of a structured electronic medical document in medical facilities to describe the results of the examinations. Speech recognition showed the greatest efficiency in generating MRI and CT protocols. Such reports contain a large number of pathological changes, both target and incidental findings, which requires a detailed description by the radiologist in the examination protocol.

CONCLUSIONS: Speech recognition in generating radiological protocols showed different efficiency depending on the modality and type of the radiological protocol filled in using the voice input system. This approach is optimal for describing CT and MRI scans.

全文:

BACKGROUND: Speech recognition is becoming increasingly common in the national healthcare system. One of the first specialties to implement this technology on a large scale was radiology. However, the efficiency of voice input and its effect on the length of time required to complete medical records remain unresolved.

AIM: To assess the efficiency of speech recognition in generating radiological protocols of different modalities and types.

METHODS: The retrospective study was conducted at the Moscow Reference Center of the Research and Practical Clinical Center for Diagnostics and Telemedicine Technologies of the Moscow Department of Health. A total of 12,912 radiological reports on fluorography, mammography, chest computed tomography (CT), contrast-enhanced magnetic resonance imaging (MRI) of the brain, and contrast-enhanced CT of the abdomen and pelvis were included in the study by simple random sampling. The size of all samples exceeded 766 reports, calculated with regard to the size of the general population of over 100,000 reports. The Voice2Med software was used to fill in the radiological protocols. Intergroup comparison was performed using the Mann–Whitney U-test with a statistical significance level of 0.05.

RESULTS: The average duration of generating fluorographic protocols in the keyboard and voice input groups was 189.9 s (0:03:09) and 236.2 s (0:03:56), respectively (p <0.0001). For mammographic reports, the duration was 387.1 s (0:06:27) and 444.8 s (0:07:24), respectively (p <0.0001). For radiographic reports, it amounted to 247.8 s (0:04:07) and 189.0 s (0: 03:09), respectively (p <0.0001), and for chest CT, it was 379.7 s (0:06:19) and 382.7 s (0:06:22), respectively (p=0.12). For MRI of the brain, the protocols were generated for 709.9 s (0:11:49) and 559.9 s (0: 09:19), respectively (p <0.0001), and for contrast-enhanced chest, abdominal, and pelvic CT scans, it took 2714.6 s (0:45:15) and 1778.4 s (0:29:38), respectively. Voice input slowed down the preparation time of mammographic and fluorographic protocols. This is due to the use of a structured electronic medical document in medical facilities to describe the results of the examinations. Speech recognition showed the greatest efficiency in generating MRI and CT protocols. Such reports contain a large number of pathological changes, both target and incidental findings, which requires a detailed description by the radiologist in the examination protocol.

CONCLUSIONS: Speech recognition in generating radiological protocols showed different efficiency depending on the modality and type of the radiological protocol filled in using the voice input system. This approach is optimal for describing CT and MRI scans.

×

作者简介

Nikita Kudryavtsev

Research and Practical Clinical Center for Diagnostics and Telemedicine Technologies

编辑信件的主要联系方式.
Email: KudryavtsevND@zdrav.mos.ru
ORCID iD: 0000-0003-4203-0630
SPIN 代码: 1125-8637
俄罗斯联邦, Moscow

Daria Sharova

Research and Practical Clinical Center for Diagnostics and Telemedicine Technologies

Email: SharovaDE@zdrav.mos.ru
ORCID iD: 0000-0001-5792-3912
SPIN 代码: 1811-7595
俄罗斯联邦, Moscow

Anton Vladzymyrskyy

Research and Practical Clinical Center for Diagnostics and Telemedicine Technologies

Email: VladzimirskijAV@zdrav.mos.ru
ORCID iD: 0000-0002-2990-7736
SPIN 代码: 3602-7120
俄罗斯联邦, Moscow

参考

  1. Khramtsov AI, Nasyrov RA, Khramtsova GF. Application of digital technology in the work of a pathologist: guidelines for learning how to use speech recognition systems. Pediatrician (St. Petersburg). 2021;12(3):63–68. (In Russ). doi: 10.17816/PED12363-68
  2. Kudryavtsev N D, Sergunova K A, Ivanova G.V, et al. Evaluation of the effectiveness of the implementation of speech recognition technology for the preparation of radiological protocol. Vrach i informatsionnye tekhnologii. 2020;(S1):58–64. (In Russ). doi: 10.37690/1811-0193-2020-S1-58-64
  3. Tekhnologiya raspoznavaniya rechi pomogla vracham zapolnit’ bolee 210 tysyach protokolov luchevykh issledovanii [Internet]. Moscow Mayor official website [cited 2023 Apr 18]. Available from: https://www.mos.ru/news/item/118060073/. (In Russ).
  4. Kudryavtsev ND, Semenov DS, Kozhikhina DD, Vladzymyrskyy AV. Speech recognition technology: results of a survey of radiologists at the Moscow reference center for diagnostic radiology. Healthcare Management: News. Views. Education. Bulletin of VSHOUZ. 2022;8(3):95–104. (In Russ). doi: 10.33029/2411-8621-2022-8-3-95-104

补充文件

附件文件
动作
1. JATS XML

版权所有 © Eco-Vector, 2023

Creative Commons License
此作品已接受知识共享署名-非商业性使用-禁止演绎 4.0国际许可协议的许可。

СМИ зарегистрировано Федеральной службой по надзору в сфере связи, информационных технологий и массовых коммуникаций (Роскомнадзор).
Регистрационный номер и дата принятия решения о регистрации СМИ: серия ПИ № ФС 77 - 79539 от 09 ноября 2020 г.


##common.cookie##