UC Berkeley researchers detect 'silent speech' with electrodes and AI

UC Berkeley researchers say they are the first to train AI using silently mouthed words and sensors that collect muscle activity. Silent speech is detected using electromyography (EMG), with electrodes placed on the face and throat. The model focuses on what researchers call digital voicing to predict words and generate synthetic speech.

Researchers believe their method can enable a number of applications for people who are unable to produce audible speech and could support speech detection for AI assistants or other devices that respond to voice commands.

“Digitally voicing silent speech has a wide array of potential applications,” the team’s paper reads. “For example, it could be used to create a device analogous to a Bluetooth headset that allows people to carry on phone conversations without disrupting those around them. Such a device could also be useful in settings where the environment is too loud to capture audible speech or where maintaining silence is important.”

Another example of AI that can capture words from silent speech — lip-reading AI — can power surveillance tools or support use cases for people who are deaf.

For their silent speech prediction, the UC Berkeley researchers used an approach “where audio output targets are transferred from vocalized recordings to silent recordings of the same utterances.” A WaveNet decoder is then used to generate audio speech predictions.
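To make the idea of transferring targets concrete, here is a minimal sketch of one way it could work. It assumes the silent and vocalized recordings of an utterance are aligned frame by frame with dynamic time warping, a standard alignment technique chosen here for illustration; the article does not say how the researchers perform the alignment. The function names, array shapes, and toy features are all hypothetical.

```python
import numpy as np

def dtw_path(silent, vocal):
    """Align two feature sequences (frames x dims) with dynamic time warping.

    Returns a list of (silent_index, vocal_index) pairs in time order.
    """
    n, m = len(silent), len(vocal)
    # Pairwise Euclidean distance between every silent and vocalized frame.
    dist = np.linalg.norm(silent[:, None, :] - vocal[None, :, :], axis=-1)
    cost = np.full((n + 1, m + 1), np.inf)
    cost[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            cost[i, j] = dist[i - 1, j - 1] + min(
                cost[i - 1, j - 1], cost[i - 1, j], cost[i, j - 1]
            )
    # Backtrack from the end of both sequences to recover the cheapest path.
    path = []
    i, j = n, m
    while i > 0 and j > 0:
        path.append((i - 1, j - 1))
        step = int(np.argmin([cost[i - 1, j - 1], cost[i - 1, j], cost[i, j - 1]]))
        if step == 0:
            i, j = i - 1, j - 1
        elif step == 1:
            i -= 1
        else:
            j -= 1
    path.reverse()
    return path

def transfer_targets(silent_emg, vocal_emg, vocal_audio_feats):
    """Give each silent EMG frame the audio features of its aligned vocalized frame.

    Assumption: vocal_emg and vocal_audio_feats have one row per frame and
    are already time-synchronized with each other.
    """
    targets = np.zeros((len(silent_emg), vocal_audio_feats.shape[1]))
    for i, j in dtw_path(silent_emg, vocal_emg):
        targets[i] = vocal_audio_feats[j]  # the last aligned vocalized frame wins
    return targets

# Toy example: the silent recording lingers one extra frame on the first sound.
silent_emg = np.array([[0.0], [0.0], [1.0], [2.0]])
vocal_emg = np.array([[0.0], [1.0], [2.0]])
vocal_audio = np.array([[10.0], [11.0], [12.0]])
print(transfer_targets(silent_emg, vocal_emg, vocal_audio).ravel())
```

In this toy case the repeated silent frame simply reuses the audio target of the first vocalized frame, so a model trained on silent EMG would have an audio target for every input frame.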

Compared to a baseline trained with vocalized EMG data, the approach cuts the word error rate in transcriptions of sentences from books from 64% to 4%, an error reduction of 95% relative to the baseline. To fuel additional work in this area, the researchers open-sourced a dataset of nearly 20 hours of facial EMG data.
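For readers unfamiliar with the metric, word error rate is conventionally the number of word substitutions, insertions, and deletions needed to turn a transcription into the reference sentence, divided by the number of words in the reference. The sketch below shows that standard definition with made-up sentences; it is not the researchers' evaluation code.

```python
def word_error_rate(reference, hypothesis):
    """Word-level edit distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # prev[j] holds the edit distance between the reference words seen so far
    # and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            curr.append(min(
                prev[j] + 1,                           # deletion
                curr[j - 1] + 1,                       # insertion
                prev[j - 1] + (ref_word != hyp_word),  # substitution or match
            ))
        prev = curr
    return prev[-1] / len(ref)

# Hypothetical example: one wrong word out of six.
print(word_error_rate("the cat sat on the mat", "the cat sat on a mat"))
```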

A paper about the model titled “Digital Voicing of Silent Speech” by David Gaddy and Dan Klein received the Best Paper award at the Empirical Methods in Natural Language Processing (EMNLP) event held online last week. The company Hugging Face received the Best Demo Paper award from organizers for its work on the open source Transformers library. In other EMNLP works, members of the Masakhane open source project for translating African languages published a case study on low-resourced machine translation, and researchers from China introduced a sarcasm detection model that achieved state-of-the-art performance on a multimodal Twitter dataset.

UC Berkeley researchers detect ‘silent speech’ with electrodes and AI | VentureBeat
