ГОЛОСОВЫЕ ТЕХНОЛОГИИ ПРИ СЕРДЕЧНО-СОСУДИСТЫХ ЗАБОЛЕВАНИЯХ

Дарья  Хеммерлинг; Бенедетта Синьорелли; Войцех  Вояковски; Михал  Тендера; Томаш  Ядчик

doi:10.26739.2181-0974-2021-4-1

ЖУРНАЛ КАРДИОРЕСПИРАТОРНЫХ ИССЛЕДОВАНИЙ | JOURNAL OF CARDIORESPIRATORY RESEARCH

№4 | 2021

9

ОБЗОРНЫЕ СТАТЬИ/ ADABIYOTLAR SHARHI/ REVIEW ARTICLES

Дарья Хеммерлинг,

Университет науки и технологий, факультет

электротехники инженерии, автоматики, информатики

и биомедицинской инженерии, Краков, Польша

Бенедетта Синьорелли,

Кафедра наук о здоровье человека, факультет медицины

и хирургии, Университет Флоренции, Италия

Войцех Вояковски,

Отделение кардиологии и структурных болезней сердца,

Медицинский университет Силезии, Катовице, Польша

Михал Тендера,

Отделение кардиологии и структурных болезней сердца,

Медицинский университет Силезии, Катовице, Польша

Томаш Ядчик

Отделение кардиологии и структурных болезней сердца,

Медицинский университет Силезии, Катовице, Польша,

Международный центр клинических исследований,

Университетская клиника Святой Анны Брно, Брно, Чехия

ГОЛОСОВЫЕ ТЕХНОЛОГИИ ПРИ СЕРДЕЧНО-СОСУДИСТЫХ ЗАБОЛЕВАНИЯХ

For citation:

Daria Hemmerling, Benedetta Signorelli, Wojciech Wojakowski, Michał Tendera, Tomasz Jadczyk. Voice technology in

cardiovascular diseases.

Journal of cardiorespiratory research. 2021, vol. 2, issue 4, pp. 9-12

http://dx.doi.org/10.26739/2181-0974-2021-4-1

Ключевые слова:

голосовые технологии, вокальные биомаркеры, искусственный интеллект, голосовые боты, персонализированная

медицина

Daria Hemmerling

,

AGH University of Science and Technology,

Faculty of Electrical Engineering, Automatics,

Computer Science and Biomedical Engineering, Kraków, Poland

Benedetta Signorelli

,

Department of Human Health Science,

Faculty of Medicine and Surgery,

University of Florence, Italy

Wojciech Wojakowski

,

Division of Cardiology and Structural Heart Diseases,

Medical University of Silesia, Katowice, Poland

Michał Tendera,

Division of Cardiology and Structural Heart Diseases,

Medical University of Silesia, Katowice, Poland

Tomasz Jadczyk

Division of Cardiology and Structural Heart Diseases,

Medical University of Silesia, Katowice, Poland,

International Clinical Research Center,

St. Anne's University Hospital Brno, Brno, Czech Republic

ЖУРНАЛ КАРДИОРЕСПИРАТОРНЫХ ИССЛЕДОВАНИЙ | JOURNAL OF CARDIORESPIRATORY RESEARCH

№4 | 2021

10

VOICE TECHNOLOGY IN CARDIOVASCULAR DISEASES

Keywords

: voice technology, vocal biomarkers, artificial intelligence, voice chatbots, personalised medicine

Daria Hemmerling,

Fan va Texnologiya Universiteti,

Elektrotexnika, аvtomatika kompyuter fanlari va

biotibbiyot muhandisligi fakulteti, Krakov, Polsha

Benedetta Signorelli,

Inson salomatligi fanlari bo'limi,

Tibbiyot va jarrohlik fakulteti,

Florensiya universiteti, Italiya

Voytsex Vojakovski,

Kardiologiya va yurakning strukturaviy kasalliklari bo'limi,

Sileziya tibbiyot universiteti, Katovitse, Polsha

Mixal Tendera,

Kardiologiya va yurakning strukturaviy kasalliklari bo'limi,

Sileziya tibbiyot universiteti,Katovitse, Polsha

Tomaş Jadchik

Kardiologiya va yurakning strukturaviy kasalliklari bo'limi,

Sileziya tibbiyot universiteti,Katovitse, Polsha,

Xalqaro klinik tadqiqotlar markazi,

Brno Sent-Anna universiteti kasalxonasi, Brno, Chexiya

YURAK QON TOMIR KASALLIKLARIDA OVOZ TEXNOLOGIYASI

Kalit so'zlar:

ovoz texnologiyasi, vokal biomarkerlar, sun'iy intellekt, ovozli chatbotlar, moslashtirilgan tibbiyot

Introduction

Despite great efforts, cardiovascular diseases (CVD) remain the

leading  cause  of  death  worldwide  [1].  Thus,  novel  diagnostic  and
treatment solutions are highly demanded to address current challenges
in the field of clinical medicine. Interestingly, recent studies indicate a
potential  use  of  voice  technology  which  covers  a  wide  spectrum  of
artificial  intelligence  (AI)  techniques  allowing  for  human  language
understanding  as  well  as  for  predictive  analysis  of  vocal  biomarkers.
Physiologically, voice is the sound produced with the usage of  the lungs
and  the

vocal folds

in the

larynx

. The vibration of vocal folds is

generated  when  the  air  is  pushed  through  vocal  folds  with  sufficient
pressure. On the one hand, the spoken language is the easiest and fastest
way of communication. On the other hand, generation of voice requires
using  a  series  of  coordinated,  complex  movements  in  the  head,  neck,
chest and  abdomen muscles,  which impact the signals’  frequency and
amplitude  resulting  in  specific,  decodable  sounds.  By  its  complex
nature,  voice  is  an  unique  bio-print  characteristic  for  each  person
conveying information about individual's personality, mood and health
status. From a diagnostic point of view, voice is a bio-signal that can be
acquired non-invasively and in an easy, economically-sound manner [

1

,

2

]. Subsequently, a correlation between CVD and alterations in speech

characteristics  open  new  diagnostic  opportunities  based  on deviations
of voice features associated with CVD-mediated systemic inflammatory
process  which  impacts  anatomical  structures  responsible  for  voice
generation [2]. Despite the very well-developed digital technology there
is still the challenge to extract specific, important information about the
patient's  health condition. Especially due  to  its complex and dynamic
characteristic, voice can be pronounced in different intonations and with
different emotions. AI-driven digital solutions are still being sought on
how  to  non-invasively  evaluate  patient's  voice  organs  and  effectively
distinguish  between  patients  with  existing  disorders  and  healthy
individuals.

Moreover, the advancements in the field of computer science

leveraged application of human-computer voice interfaces (also called
voice  assistants,  voice  chatbots  or  conversational  agents)  allowing
machines  to  understand  spoken  language  and  generate  human-like
voice.  The  aforementioned  implementation  of  voice  technology  in
clinical  field  provides  interesting  tools  which  useability  is  currently
being evaluated and tested [

3]

.

This article covers the application of AI-based voice chatbots and

the potential application of vocal biomarkers in the field of
cardiovascular medicine.

Artificial intelligence-driven voice technology in medicine

Definition of voice assistant

Voice assistants (VA) powered with the advanced algorithm of AI

and  natural  language  processing  allow  for  verbal  communication
between  humans  and  computers.  These  conversational  agents  (i.e.,
Amazon Alexa, Apple’s  Siri  or  Google Assistant)  can  be  installed  on
standalone devices called smart speakers or deployed on smartphones.
The  emulated  human-machine  conversations  are  based  on  the
application of neural networks which perform voice-to-text analysis and
text-to-voice computation generating natural human voice transforming
day-to-day clinical practice [3, 4].

Application of voice technology in clinical practice

Voice-enabled technologies have the potential to influence everyday

cardiovascular medicine practice by:

(1) Foreign language interpretation and real-time language

translation,

(2) Patient education,

(3) Medication reminders and prescription refills,

(4) Continuity of care,

(5) Automated and paperless collection of medical data,

(6) Remote long-term monitoring,

(7) Diagnostic value of vocal biomarkers.

Foreign language interpretation

There is an incremental need to address language barriers for

patients whose health care workers do not speak their primary language.
Voice technology provides tools that facilitate communication in a safe
and  effective  manner.  Panayiotou  et  al.  reviewed  digital  language
translation solutions in health care settings. Among 15 iPad-compatible
applications  including  8  voice-to-voice  and  voice-to-text  translation
apps,  2  services  (Assist  and  Talk  to  Me)  were  found  to  be  clinically
adequate  for  everyday  conversations  on  subject  matters  that  do  not
require a professional interpreter [5].

Patient education

There are numerous potential applications for the use of VA in the

field of patient education and guidelines. Specifically, this Alexa-based
applications  can  be  used  to  provide  information  on  the  cardio-
pulmonary  resuscitation  (i.e.,  The  Mayo  Clinic  First  Aid)  [6]  or
information  from  Mayo  Clinic  experts  on  topics  related  to
cardiovascular  diseases  providing  an  access  to  the  verified  medical
knowledge  [7].  Furthermore,  the  Answers  by  Cigna  application
available on Amazon Alexa provides health coach programs supporting
treatment  plans.  Furthermore,  users  can  ask  a  wide  range  of  health-

ЖУРНАЛ КАРДИОРЕСПИРАТОРНЫХ ИССЛЕДОВАНИЙ | JOURNAL OF CARDIORESPIRATORY RESEARCH

№4 | 2021

11

related questions receiving easy-to-understand responses [8]. A similar
approach is exemplified by the Orbita ENGAGE designed for patients
who can communicate verbally with a VA for medically associated
frequently asked questions, especially based on symptom screening [9].

Medication reminders

Among CVD patients, medication non-adherence is a perceptible

challenge  both  in  the  period  immediately  following  an  acute
cardiovascular  event  as  well  as  during  long-term  follow-up  [10].  VA
have  been  successfully  implemented  to  support  pharmacotherapy
management.  By  saying to Alexa  "Manage my medication"  or “Refill
my  prescription”,  registered  patients  can  set  reminders  to  take
medication and request prescriptions with home drug delivery through
the Giant Eagle Pharmacy voice application [11].

Continuity of care

In the broader spectrum, some VA solutions like Orbit Connect are

engineered  for  long-term  follow-up  as  well  as  for  pre-  and  post-visit
through  digital  coaching,  assessments, and  care team  communication.
Furthermore, a holistic approach to patients with CVD should include
mental status  evaluation. Importantly,  the  prevalence  of  depression  in
this group is 3-fold higher in comparison to the general population [12].
Correspondingly,  the  Talk  space  voice  application  for Amazon Alexa
allows  users to access  depression  assessments tools  as  well  as guided
mindfulness techniques [13].

Automated and paperless collection of medical data

Integration of the medical voice AI chatbots with hospital electronic

health systems (EHR) leverages advances in voice technology allowing
for  seamless  and  automatic  population  of  electronic  forms  [14].
Noteworthy, it is crucial to ensure adequate level of security and privacy
during  transmission  and  computation  of  patient's  protected  health
information  (PHI). Accordingly,  the  GDPR  (EU)  and  HIPAA  (USA)
regulations  must  be  implemented  for  each  software  solution  dealing
with PHI.

Practical application of voice chatbot in clinical settings was

exemplified by the CardioCube

®

service deployed on Amazon Echo

smart speaker for automatic collection of patient-reported medical
history at the Cardiology Outpatient Clinic of the Cedars-Sinai Medical
Center (Los Angeles, CA, USA) [14]. Initialization of CardioCube

®

voice  assistant  was  evoked  by  a  verbal  command  “Computer,  open
CardioCube”.  Furthermore,  patients  answered  pre-defined  clinical
questions which corresponded to the hospital intake form i.e. “Do you
have high blood pressure?”, “Have you ever had a heart attack?”, “Have
you  been  diagnosed  with  diabetes?”.  The  answers  provided  verbally
were  translated  into  text  using  cloud-based  AI  systems  and
automatically populated a patient's record in the hospital EHR system.
Healthcare  providers  could  access  the  complete  report  through  a
standard web-based interface. This interactive approach was shown to
streamline  repetitive  and  time-consuming  tasks  during  patient
registration  providing  a secure and  high  accuracy  (97.5%)  digital  tool
automatically generating medical reports.

Remote long-term monitoring

The FCNcare by CardioCube

®

solution was implemented at the

Family  Care  Network  (Bellingham,  WA,  USA)  for  remote  long-term
follow-up  of  patients  with  diabetes  and  heart  failure  [15].  Individuals
enrolled  in  the  pilot  study  received  Amazon  Echo-deployed
CardioCube

®

software for home use based on reporting actual clinical

status during scheduled conversation sessions between patient and
CardioCube

®

. The voice-based questionnaire consisted of eight

questions:  (1)  “In  the  past  week,  have  you  missed  any  dose  of  your
medication?”, (2) “Are  you  needing  a  medication  refill?”,  (3)  Do you
have any medication-related questions that you need your care team to
answer?”,  (4) A caring reminder, eating more carbohydrates  increases
your blood sugar. All sugary foods contain carbohydrates, as do bread,
rice,  pasta,  and  potatoes.  Have  you  been  carefully  managing  your
carbohydrate intake in the past week?”, (5) “And how about exercises,
how many times in the past week have you exercised?”, (6) “As for this
past  week,  were  you  able  to  check  your  sugar  levels  with  a
glucometer?”, (7) “And how many times in the past week did you check

your  blood  sugar  level?”,  (8) Were  the majority of  your readings  in  a
good  range?”.  Obtained  results  were  analysed  and  automatically
transferred to the Family Care Network EHR system for review by the
nurse.  Importantly,  in  case  of  health  status  deterioration  (i.e.  patient
reports  dyspnea)  red-flagging  notifications  were  implemented  to
improve  useability  of  the  service  giving  healthcare  providers  a  quick
access to the most crucial reports.

Diagnostic value of vocal biomarkers

In the literature, there are only a few studies that analysed the voice

and  speech  signals  in  an  acoustic  parametrized  manner  for  heart
diseases.  The  researchers  from  Mayo  Clinic  reported  a  possible
relationship  between  specific  vocal  biomarkers  and  coronary  artery
disease (CAD) underscoring the potential use of this simple biomarker
to identify patients at risk [2]. The authors have analysed if patient voice
signal  characteristics  are  associated  with  the  presence  of  CAD.  They
performed  detailed  acoustic  analysis  to  describe  the  overall  shape  of
signal's spectral envelope. With further analyses, authors identified five-
voice features that were associated with CAD. Combining data with the
Atherosclerotic  Cardiovascular  Disease  risk  scores,  it  was  possible  to
identified  two  voice  features  that  were  independently  associated  with
CAD (odds ratio OR = 0.37; 95% CI, interquartile range IQR = 0.18-
0.79;  and  OR  =  4.01;  95%  CI,  IQR  =  1.25-12.84;  p=.009  and  p=.02,
respectively).  Both  features  were more  strongly associated  with  CAD
when  patients  were  asked  to  describe  an  emotionally  significant
experience. The work was further developed and described by Maor et
al.  [16],  where  the  authors  have  analysed  if  the  vocal  biomarker  is
associated  with  hospitalization  and  mortality  among  patients  with
congestive  heart  failure  (CHF).  By  extracting  a  total  of  223  acoustic
features for each patient, the main novel finding of this study was that
non-invasive  voice  signal  characteristics  are  associated  with  adverse
clinical  outcomes  among  patients  with  symptomatic  CHF  [16].
Moreover,  Pareek  et  al.  [17]  have  also  evaluated  CAD  patients.  The
results revealed significant variations in spectrograms and specific voice
analyses between active and control group including jitter, shimmer, and
complex parameters such as  Relative Average  Perturbation being as a
quantitative measure of the voice.

Extraction of acoustic parameters enables an objective assessment

of the voice and speech quality. The registration of the signal might be
done in various manners. New technologies in digital signals processing
enable the recordings without the requirement for access to an anechoic
chamber.  The  sessions  might  be  done  at  the  doctor’s  office,  at  home,
with  a  relatively  low  level  of  noise.  Most  smart  speakers  and  VA-
deployed on smartphones have a circular microphone array to provide
voice-only interaction from a distance in standard room conditions. To
perform the analysis with desired goals such as automatic diagnostic or
highlighting  health  impairments  using  voice  it  must  be  stated  what
should  be  recorded.  Voice  signals  might  be  recorded  in  different
manners,  depending  on  what  features  are  desired.  The  phonation  of
sustained  vowels  with  continuous  phonation  over  a  certain  time  are
helpful to find discontinuities in signal's amplitudes and frequencies as
well as  changes in  loudness levels. The  speech  recordings  bring more
information about the speech speed, pauses length, pitch and loudness
changes. Accordingly, the speech might be acquired from a text read, a
story-tell,  a  question-answer  scenario,  repetition  of  specific  syllables
and  conglomerate  of  words.  This  enables  the  semantic  voice  analysis
and extraction of meaningful words, enabling syntax analysis for natural
language processing.

Future directions

The aforementioned use cases confirm the feasibility of using

voice chatbots and vocal biomarker application in the field of
cardiovascular medicine. Noteworthy, VA can be integrated with the
existing

healthcare

ecosystems

leveraging

clinical

adoption

opportunities of voice technology. The further development will enable
constant patient monitoring with an immediate warning in case system
detects health status deterioration including analysis of “invisible” vocal
biomarkers. Such approach might be useful in predicting risk of the
occurrence of health- and life-threatening conditions.

ЖУРНАЛ КАРДИОРЕСПИРАТОРНЫХ ИССЛЕДОВАНИЙ | JOURNAL OF CARDIORESPIRATORY RESEARCH

№4 | 2021

12

Список литературы/ References/ Iqtiboslar

1.

Kones R, Rumana U. Cardiometabolic diseases of civilization: history and maturation of an evolving global threat. An update and call to
action. Annals of Medicine. 2017;49(3):260-74. Epub 2016/12/13.

2.

Maor E, Sara JD, Orbelo DM, Lerman LO, Levanon Y, Lerman A. Voice Signal Characteristics Are Independently Associated With Coronary
Artery Disease. Mayo Clinic Proceedings. 2018;93(7):840-7. Epub 2018/04/17.

3.

Hoy MB. Alexa, Siri, Cortana, and More: An Introduction to Voice Assistants. Medical Reference Services Quarterly. 2018;37(1):81-8.
Epub 2018/01/13.

4.

Suta P, Lan X, Wu B, Mongkolnam P, Chan J. An overview of machine learning in chatbots. International Journal of Mechanical Engineering
and Robotics Research. 2020;9(4):502-10.

5.

Panayiotou A, Gardner A, Williams S, Zucchi E, Mascitti-Meuter M, Goh AM, et al. Language Translation Apps in Health Care Settings:
Expert Opinion. JMIR mHealth and uHealth. 2019;7(4):e11316. Epub 2019/04/10.

6.

Mayo Clinic First Aid. Amazon. [21 NOV 2021]; Available from: https://www.amazon.com/mayo-clinic-first-aid/dp/b0744ljcv2.

7.

Mayo Clinic News Network. Amazon. [21 NOV 2021]; Available from: https://www.amazon.com/mayo-clinic-news-
network/dp/b0711hkbb6

8.

Answers by Cigna. Amazon. [cited 21 NOV 2021]; Available from: https://www.amazon.com/Cigna-Answers-by/dp/B079SMKCZB

9.

Employee Health Manager. Orbita. . [21 NOV 2021]; Available from: https://orbita.ai/orbita-employee-health-manager/

10.

Simon ST, Kini V, Levy AE, Ho PM. Medication adherence in cardiovascular medicine. British Medical Journal. 2021;374:n1493. Epub
2021/08/13.

11.

New ways to manage your medications at home using Alexa. [21 NOV 2021]; Available from:

https://www.aboutamazon.com/news/devices/new-ways-to-manage-your-medications-at-home-using-alexa.

12.

Chaddha A, Robinson EA, Kline-Rogers E, Alexandris-Souphis T, Rubenfire M. Mental Health and Cardiovascular Disease. The American
Journal of Medicine. 2016;129(11):1145-8. Epub 2016/10/25.

13.

Talkspace: Online therapy for all. Amazon. [21 NOV 2021]; Available from: https://www.amazon.com/Talkspace-Online-therapy-for-
all/dp/B085VHRW33.

14.

Jadczyk T, Kiwic O, Khandwalla RM, Grabowski K, Rudawski S, Magaczewski P, et al. Feasibility of a voice-enabled automated platform
for medical data collection: CardioCube. International Journal of Medical Informatics. 2019;129:388-93. Epub 2019/08/25.

15.

FCNcare by CardioCube. [21 NOV 2021]; Available from:

https://jmir.org/api/download?alt_name=jmir_v23i5e22959_app1.mov&filename=62989d3b55a0fbfb33d8e0aac8c1a6bd.mov.

16.

Maor E, Perry D, Mevorach D, Taiblum N, Luz Y, Mazin I, et al. Vocal Biomarker Is Associated With Hospitalization and Mortality Among
Heart Failure Patients. Journal of the American Heart Association. 2020;9(7):e013359. Epub 2020/04/03.

17.

Pareek V, Sharma R, editors. Coronary heart disease detection from voice analysis. 2016 IEEE Students’ Conference on Electrical:
Electronics and Computer Science (SCEECS); 2016.

ГОЛОСОВЫЕ ТЕХНОЛОГИИ ПРИ СЕРДЕЧНО-СОСУДИСТЫХ ЗАБОЛЕВАНИЯХ

Ключевые слова:

Аннотация

Похожие статьи

Библиографические ссылки