As an addendum to my previous post called “Dictation Etiquette“, I would like to direct you guys to the Philips web site, which now has a whole section dedicated to dictation guidelines aimed at optimizing speech recognition results. A very well done site, where all the DO’s and DON’Ts are further detailed, with a few video clips adding a fun note to the educational purpose of the exercise. For those in a rush, all tips were compiled in a downloadable PDF.
DO’s:
> Before you start dictating
> How to dictate
> Correcting your dictation
DON’Ts
> Wrong distance to the mike
Q: “Dude, that mike doesn’t work. I’m only 3 corridors away.”
A: We’re talking speech recognition here, not speech miracle. We can’t figure how to teleport humans yet, I guess that applies to sound waves too.
> Background noise
Q: “But hospitals are not exactly silent environnements, are they?”
A: Well said. The noise cancellation features present both at the mike and SpeechMagic software levels are here to take care of the background noise inherent to any healthcare setting. We are just saying here: try to avoid non-healthcare-related sounds…
> Talking slow, then fast, then slow, etc…
Q: “Err, who would do that?”
A: Well, well, among all the things our mother forgot to teach us is: the importance of being self-conscious during the course of a dictation. As a result, we don’t always realize the pain we cause to those whose job it is to listen…
I came across the following article from Health Imaging & IT magazine this morning. According to a recent study conducted at the University of Maryland Medical Center, the introduction of white noise as part of the acoustic background has a positive impact on speech recognition accuracy…


How many times have I heard physicians voice concern over the initial time required to “train” a speech recognition system in those words: “too long” and “not worth the effort”. Well, that might have been true 10 years ago. And that might still be true for consumer products, which are not tailored for a specific profile of users, like professional speech recognition is for healthcare. Sit back and relax, as here come the good news:
Let’s first take a look at the terminology. As always, Wikipedia clears up any potential confusion with one of those efficient, 3-line definitions: “Digital dictation is different from Speech Recognition where audio is analyzed by a computer using speech algorithms in an attempt to automatically transcribe the document. With digital dictation the process of converting digital audio to text is done via a typist using a digital transcription software application (…)”
Let’s cover the definitions first. Front-end speech recognition is a particularly attractive feature for physicians who prefer to look after the full report generation process. Text is generated on-screen from their dictations in real-time, allowing physicians to edit and finalize documents themselves.
Recent Comments