BME TMIT 1 speechlab.tmit.bme.hu Status Report of Hungarian Speech Technology (A magyar beszédtechnológia helyzete és távlatai). Németh Géza, Budapest University of Technology and Economics (BME), Department of Telecommunications and Media Informatics (TMIT), Speech Technology Laboratory
Overview: What is it? Why is it important in general? Why is it important in Hungary? History; Recent results; Available resources; Research challenges; Application challenges
What is it? Artificial replacement of any element of the human speech chain. Relies on … mathematics, information technology, physics, neurology, linguistics, psychology and electrical engineering
Why is it important in general? Language is not the same as text; Speech is the main modality for expressing language; It is the most efficient; Disadvantage of loss of speech vs. loss of sight; The preferred communication channel in some contexts (in-car, manufacturing, …); Big data source (natural, real, …)
Why is it important in general? [Figure: Gartner hype cycle on emerging technologies, July 2012] Related to speech technology
Why is it important in Hungary? We have a unique language (agglutinative, free word order); Extra effort, middle-sized market (73rd in the world [Ethnologue]); Multinationals are getting interested (Google, Nuance, …), but tailor-made, high-quality solutions cost too much, as opposed to just-sufficient effort. Prominent participants: Maróth Miklós (vice-president, MTA, linguist); Gróh Gáspár (on behalf of Áder János, President of the Republic; publicist); Kelemen Csaba (deputy head of department, ICT development, delivering the greeting of Minister Németh Lászlóné, NFM); Csizmadia Norbert (state secretary for planning coordination, NGM); L. Simon László (state secretary for culture, EMMI); Hoffmann Rózsa (state secretary for education, EMMI), written greeting; Bába Iván (state secretary for administrative affairs, KülügyM); Korányi László (vice-president for external and internal relations, electrical engineer, NIH)
History of vehicle and speech technology
Recent real-life results of Hungarian speech technology: MailMondó Westel BME TMIT 1999 T-Mobile Westel BME TMIT 2003 T-Mobile MIT Systems Digital Natives BME TMIT 2008 AITIA MonSpeech Vodafone Montana, AITIA, 2012 BME TMIT, MTA Nytud Freedom BME TMIT 2002 Scientific Informatika a Látássérültekért
Available resources: World-class language and speech technology co-operative R&D know-how; SMEs (AITIA, Morphologic, Nextent, …); International networks; Lack of large industrial R&D centers; Lack of focused attention, quality requirements; META-NET
Research challenges 1: Accurate reference speech processing infrastructure; Processing of spontaneous interactions; Collecting and labelling enough (?) data; Unfunded international efforts (e.g. U-STAR); Combination of rule-based and data-driven approaches; Cognitive Infocommunications; Cognitive Robotics; Eto-communications; Just ripe applications
Research challenges 2: How to avoid the "uncanny valley"
Application challenges 1: 62% of the Hungarian population uses the internet. What about the rest (38%)? Equal access to information??? Speech technology may help (magyarorszag.hu, 112, MÁV, BKV, Volán). Example: disability applications; Screen readers for the visually impaired; Electronic access to teaching and other written material. Examples: VoxAid, www.robobraille.org
Application challenges 2: Speech technology in education; Games for kindergarten and schoolchildren; Example: GOH hearing screening at age 3; Interactive multimodal teaching material; Motivating Hungarian children living in minority communities; Rehabilitation of aphasia, autism, problems…
Application challenges 3: Speech technology in the health industry; Automation of operations (instructions, note-taking); Automated dictation of findings; Early diagnosis and rehabilitation of larynx problems, depression, etc. by voice; Remote health applications (e.g. reminders about medication, closing windows, etc.); Supervision of patients with dementia, Alzheimer's disease, …
Application challenges 4: Speech technology in the content industry; Interdisciplinary integration; Speech technology, medical education, social workers (IBM and the Hungarian government?); Digital public education and intelligent home program (Microsoft and the Hungarian government?); Multi-modal content analytics (polls??); Banks, retail industry information services; Car infotainment (Audi, Daimler and the Hungarian government?); Speech-controlled home; Smartphone, smart TV; Smart washing machine, …
Application challenges 5: Speech technology in manufacturing; Warehouse automation; Production warnings; Speech instructions; Talking user manuals; 3DICC (3D Internet Based Control and Communication)
For those interested in more detail: Thank you for the support. (Teleauto, BelAmi, EtoCom - TÁMOP /1/KMR, BME Research University - TÁMOP-4.2.1/B-09/1/KMR, CIP CESAR, AAL PAELIFE projects) Comments, questions