Comparison of the duration of generating radiological protocols with keyboard and voice input

Capa


Citar

Texto integral

Resumo

BACKGROUND: Speech recognition is becoming increasingly common in the national healthcare system. One of the first specialties to implement this technology on a large scale was radiology. However, the efficiency of voice input and its effect on the length of time required to complete medical records remain unresolved.

AIM: To assess the efficiency of speech recognition in generating radiological protocols of different modalities and types.

METHODS: The retrospective study was conducted at the Moscow Reference Center of the Research and Practical Clinical Center for Diagnostics and Telemedicine Technologies of the Moscow Department of Health. A total of 12,912 radiological reports on fluorography, mammography, chest computed tomography (CT), contrast-enhanced magnetic resonance imaging (MRI) of the brain, and contrast-enhanced CT of the abdomen and pelvis were included in the study by simple random sampling. The size of all samples exceeded 766 reports, calculated with regard to the size of the general population of over 100,000 reports. The Voice2Med software was used to fill in the radiological protocols. Intergroup comparison was performed using the Mann–Whitney U-test with a statistical significance level of 0.05.

RESULTS: The average duration of generating fluorographic protocols in the keyboard and voice input groups was 189.9 s (0:03:09) and 236.2 s (0:03:56), respectively (p <0.0001). For mammographic reports, the duration was 387.1 s (0:06:27) and 444.8 s (0:07:24), respectively (p <0.0001). For radiographic reports, it amounted to 247.8 s (0:04:07) and 189.0 s (0: 03:09), respectively (p <0.0001), and for chest CT, it was 379.7 s (0:06:19) and 382.7 s (0:06:22), respectively (p=0.12). For MRI of the brain, the protocols were generated for 709.9 s (0:11:49) and 559.9 s (0: 09:19), respectively (p <0.0001), and for contrast-enhanced chest, abdominal, and pelvic CT scans, it took 2714.6 s (0:45:15) and 1778.4 s (0:29:38), respectively. Voice input slowed down the preparation time of mammographic and fluorographic protocols. This is due to the use of a structured electronic medical document in medical facilities to describe the results of the examinations. Speech recognition showed the greatest efficiency in generating MRI and CT protocols. Such reports contain a large number of pathological changes, both target and incidental findings, which requires a detailed description by the radiologist in the examination protocol.

CONCLUSIONS: Speech recognition in generating radiological protocols showed different efficiency depending on the modality and type of the radiological protocol filled in using the voice input system. This approach is optimal for describing CT and MRI scans.

Texto integral

BACKGROUND: Speech recognition is becoming increasingly common in the national healthcare system. One of the first specialties to implement this technology on a large scale was radiology. However, the efficiency of voice input and its effect on the length of time required to complete medical records remain unresolved.

AIM: To assess the efficiency of speech recognition in generating radiological protocols of different modalities and types.

METHODS: The retrospective study was conducted at the Moscow Reference Center of the Research and Practical Clinical Center for Diagnostics and Telemedicine Technologies of the Moscow Department of Health. A total of 12,912 radiological reports on fluorography, mammography, chest computed tomography (CT), contrast-enhanced magnetic resonance imaging (MRI) of the brain, and contrast-enhanced CT of the abdomen and pelvis were included in the study by simple random sampling. The size of all samples exceeded 766 reports, calculated with regard to the size of the general population of over 100,000 reports. The Voice2Med software was used to fill in the radiological protocols. Intergroup comparison was performed using the Mann–Whitney U-test with a statistical significance level of 0.05.

RESULTS: The average duration of generating fluorographic protocols in the keyboard and voice input groups was 189.9 s (0:03:09) and 236.2 s (0:03:56), respectively (p <0.0001). For mammographic reports, the duration was 387.1 s (0:06:27) and 444.8 s (0:07:24), respectively (p <0.0001). For radiographic reports, it amounted to 247.8 s (0:04:07) and 189.0 s (0: 03:09), respectively (p <0.0001), and for chest CT, it was 379.7 s (0:06:19) and 382.7 s (0:06:22), respectively (p=0.12). For MRI of the brain, the protocols were generated for 709.9 s (0:11:49) and 559.9 s (0: 09:19), respectively (p <0.0001), and for contrast-enhanced chest, abdominal, and pelvic CT scans, it took 2714.6 s (0:45:15) and 1778.4 s (0:29:38), respectively. Voice input slowed down the preparation time of mammographic and fluorographic protocols. This is due to the use of a structured electronic medical document in medical facilities to describe the results of the examinations. Speech recognition showed the greatest efficiency in generating MRI and CT protocols. Such reports contain a large number of pathological changes, both target and incidental findings, which requires a detailed description by the radiologist in the examination protocol.

CONCLUSIONS: Speech recognition in generating radiological protocols showed different efficiency depending on the modality and type of the radiological protocol filled in using the voice input system. This approach is optimal for describing CT and MRI scans.

×

Sobre autores

Nikita Kudryavtsev

Research and Practical Clinical Center for Diagnostics and Telemedicine Technologies

Autor responsável pela correspondência
Email: KudryavtsevND@zdrav.mos.ru
ORCID ID: 0000-0003-4203-0630
Código SPIN: 1125-8637
Rússia, Moscow

Daria Sharova

Research and Practical Clinical Center for Diagnostics and Telemedicine Technologies

Email: SharovaDE@zdrav.mos.ru
ORCID ID: 0000-0001-5792-3912
Código SPIN: 1811-7595
Rússia, Moscow

Anton Vladzymyrskyy

Research and Practical Clinical Center for Diagnostics and Telemedicine Technologies

Email: VladzimirskijAV@zdrav.mos.ru
ORCID ID: 0000-0002-2990-7736
Código SPIN: 3602-7120
Rússia, Moscow

Bibliografia

  1. Khramtsov AI, Nasyrov RA, Khramtsova GF. Application of digital technology in the work of a pathologist: guidelines for learning how to use speech recognition systems. Pediatrician (St. Petersburg). 2021;12(3):63–68. (In Russ). doi: 10.17816/PED12363-68
  2. Kudryavtsev N D, Sergunova K A, Ivanova G.V, et al. Evaluation of the effectiveness of the implementation of speech recognition technology for the preparation of radiological protocol. Vrach i informatsionnye tekhnologii. 2020;(S1):58–64. (In Russ). doi: 10.37690/1811-0193-2020-S1-58-64
  3. Tekhnologiya raspoznavaniya rechi pomogla vracham zapolnit’ bolee 210 tysyach protokolov luchevykh issledovanii [Internet]. Moscow Mayor official website [cited 2023 Apr 18]. Available from: https://www.mos.ru/news/item/118060073/. (In Russ).
  4. Kudryavtsev ND, Semenov DS, Kozhikhina DD, Vladzymyrskyy AV. Speech recognition technology: results of a survey of radiologists at the Moscow reference center for diagnostic radiology. Healthcare Management: News. Views. Education. Bulletin of VSHOUZ. 2022;8(3):95–104. (In Russ). doi: 10.33029/2411-8621-2022-8-3-95-104

Arquivos suplementares

Arquivos suplementares
Ação
1. JATS XML

Declaração de direitos autorais © Eco-Vector, 2023

Creative Commons License
Este artigo é disponível sob a Licença Creative Commons Atribuição–NãoComercial–SemDerivações 4.0 Internacional.

СМИ зарегистрировано Федеральной службой по надзору в сфере связи, информационных технологий и массовых коммуникаций (Роскомнадзор).
Регистрационный номер и дата принятия решения о регистрации СМИ: серия ПИ № ФС 77 - 79539 от 09 ноября 2020 г.


Este site utiliza cookies

Ao continuar usando nosso site, você concorda com o procedimento de cookies que mantêm o site funcionando normalmente.

Informação sobre cookies