Gakuto Kurata

Title

Distinguished Engineer and Chief Scientist for Spoken Conversational Systems

Bio

Dr. Gakuto KURATA is a Distinguished Engineer and Chief Scientist for Spoken Conversational Systems at IBM Research. He is the senior manager of AI technologies at IBM Research - Tokyo and currently leads research and development of Large Language Models and Large Speech Models in Japan in collaboration with global IBM Research and Development teams. By closely collaborating with Sales and Consulting teams, he accelerates AI adoption in business by translating customer requirements into products.

He joined IBM in April 2004, after obtaining M.S. in Information Science and Technology from the University of Tokyo. He received a Ph.D. in Information Science and Technology from the University of Tokyo in 2013. He was the Technical Assistant to the Director of IBM Research - Tokyo in 2014. He is an IBM Master Inventor and a member of IBM Academy of Technology. He served as an elected member of the IEEE SLTC (Speech and Language Processing Technical Committee) from 2018 to 2024. He has more than 20 years of research and development experiences in speech technology, natural language processing, and their combinations. He earned Technology Development Award from Acoustical Society of Japan in 2018 and Industrial Achievement Award from Information Processing Society of Japan in 2021.

Publications

Voice Activity-based Text Segmentation for ASR Text Denormalization
- - Sashi Novitasari
  - Takashi Fukuda
  - et al.
- 2025
- INTERSPEECH 2025
Improving End-to-end Mixed-case ASR with Knowledge Distillation and Integration of Voice Activity Cues
- - Sashi Novitasari
  - Takashi Fukuda
  - et al.
- 2025
- INTERSPEECH 2025
LLM based Text Generation for Improved Low-resource Speech Recognition Models
- - Tohru Nagano
  - Gakuto Kurata
  - et al.
- 2025
- ICASSP 2025
Knowledge Distillation Based Training of Unified Conformer CTC Models for Multi-form ASR
- - Takashi Fukuda
  - Gakuto Kurata
  - et al.
- 2025
- ICASSP 2025
SocialStigmaQA Spanish and Japanese - Towards Multicultural Adaptation of Social Bias Benchmarks
- - Clara Higuera Cabañes
  - Ryo Iwaki
  - et al.
- 2024
- NeurIPS 2024
Robust ASR Error Correction with Conservative Data Filtering
- - Takuma Udagawa
  - Masayuki Suzuki
  - et al.
- 2024
- EMNLP 2024
MULTIPLE REPRESENTATION TRANSFER FROM LARGE LANGUAGE MODELS TO END-TO-END ASR SYSTEMS
- - Takuma Udagawa
  - Masayuki Suzuki
  - et al.
- 2024
- ICASSP 2024
Speech-enriched Memory for Inference-time Adaptation of ASR Models to Word Dictionaries
- - Ashish Mittal
  - Sunita Sarawagi
  - et al.
- 2023
- EMNLP 2023
Improving Generalization of Deep Neural Network Acoustic Models with Length Perturbation and N-best Based Label Smoothing
- - Xiaodong Cui
  - George Saon
  - et al.
- 2022
- INTERSPEECH 2022
Improving ASR Robustness in Noisy Condition Through VAD Integration
- - Sashi Novitasari
  - Takashi Fukuda
  - et al.
- 2022
- INTERSPEECH 2022

Projects

Top collaborators

Gakuto Kurata

Title

Bio

Publications

Voice Activity-based Text Segmentation for ASR Text Denormalization

Improving End-to-end Mixed-case ASR with Knowledge Distillation and Integration of Voice Activity Cues

LLM based Text Generation for Improved Low-resource Speech Recognition Models

Knowledge Distillation Based Training of Unified Conformer CTC Models for Multi-form ASR

SocialStigmaQA Spanish and Japanese - Towards Multicultural Adaptation of Social Bias Benchmarks

Robust ASR Error Correction with Conservative Data Filtering

MULTIPLE REPRESENTATION TRANSFER FROM LARGE LANGUAGE MODELS TO END-TO-END ASR SYSTEMS

Speech-enriched Memory for Inference-time Adaptation of ASR Models to Word Dictionaries

Improving Generalization of Deep Neural Network Acoustic Models with Length Perturbation and N-best Based Label Smoothing

Improving ASR Robustness in Noisy Condition Through VAD Integration

Patents

Method For Improving Acoustic Model, Computer For Improving Acoustic Model And Computer Program Thereof

Detecting Customers With Low Speech Recognition Accuracy By Investigating Consistency Of Conversation In Call-center

Training Deep Neural Network For Acoustic Modeling In Speech Recognition

System, Method And Program For Improving Pronunciation Accuracy In Speech Recognition

Speech Recognition Model Construction Method, Speech Recognition Method, Computer System, Speech Recognition Apparatus, Program, And Recording Medium

Immersive Interactive Telepresence

Unsupervised Training Method, Training Apparatus, And Training Program For N-gram Language Model

Speech Recognition Model Construction Method, Speech Recognition Method, Computer System, Speech Recognition Apparatus, Program, And Recording Medium

Method Of Selecting Training Text For Language Model, And Method Of Training Language Model Using The Training Text, And Computer And Computer Program For Executing The Methods

Method For Improving Acoustic Model, Computer For Improving Acoustic Model And Computer Program Thereof

Projects

AI in Tokyo

Speech Technologies

Top collaborators

Takashi Fukuda

George Saon

Masayasu Muraoka

Hiroshi Kanayama