Senior Research Associate in Machine Learning for Speech Processing

Lancaster University
Bailrigg, United Kingdom
3 days ago

Role details

Contract type
Temporary contract
Employment type
Part-time / full-time
Working hours
Shift work
Languages
English
Experience level
Senior
Compensation
£ 40K

Job location

Bailrigg, United Kingdom

Tech stack

Artificial Intelligence
Artificial Neural Networks
Audio Signal Processing
Computational Linguistics
Python
Machine Learning
PyTorch
Information Technology

Job description

Modern machine learning can predict vocal tract shapes from audio recordings of the voice with remarkable accuracy, but most of these models are black boxes. This Royal Society-funded project aims to crack open the black box and solve one of the most compelling challenges in speech science: understanding the mapping between vocal tract movements and the acoustic speech signal. Using state-of-the-art MRI recordings of the vocal tract during speech, we aim to develop machine learning approaches that don't just predict acoustic output from articulatory configurations, but reveal why and how these mappings work. We need approaches that combine predictive power with scientific insight: models whose internal representations align with phonetic and physical knowledge. This requires hybrid machine learning (ML) approaches that integrate domain knowledge with data-driven learning, as well as explainable AI (xAI) techniques that make model behaviour transparent and scientifically meaningful. You will apply these approaches to a large database of real-time MRI and acoustic recordings of the vocal tract. Solving this problem will help to drive fundamental progress on critical applications, such as articulatory biofeedback for language learning and speech therapy. Your Role Working with Dr Sam Kirkham (Lancaster, Speech Science), Dr Anton Ragni (Sheffield, Computer Science) and Professor Aneta Stefanovska (Lancaster, Physics) you'll develop and validate interpretable ML approaches for modelling acoustic-articulatory relations using MRI vocal tract data. The position is available for 18 months from 1 July 2026 (start date negotiable). Key objectives o Develop hybrid ML architectures that incorporate phonetic and physical constraints. o Apply and extend explainable AI techniques for speech production modelling. o Validate model interpretations against established knowledge. o Lead and/or contribute to publications at the intersection of speech science and machine learning. This is a methodologically creative role with genuine intellectual ownership. You'll have access to rich MRI datasets and Lancaster's high-performance computing facilities., # Your most relevant ML project involving interpretability, hybrid approaches, or sequential data.

Your perspective on making ML models scientifically interpretable (not just explainable to end users).

Why this project interests you.

Requirements

o PhD in Speech Processing, Computational Linguistics, Machine Learning, Computer Science, or related field (PhD must have been submitted by start date). o Strong experience with machine learning for time-series data. o Excellent Python skills with PyTorch. o Ability to work independently on complex, open-ended problems. o Effective communication skills for interdisciplinary collaboration. o Demonstrated interest in interpretable ML, explainable AI, or hybrid approaches. Desirable o Experience with speech/audio processing or articulatory data. o Knowledge of physics-informed neural networks, neural ODEs, or other hybrid architectures. o Knowledge of speech production, speech science, or biomechanics. o Publications in ML, speech technology, or computational linguistics.

Benefits & conditions

Lancaster University is highly ranked and research-led and situated near the historic city of Lancaster. The North West of England offers high standards of living, beautiful countryside, including the Lake District, and excellent national and international transport connectivity. See Jobs - Lancaster University.

Apply for this position