About

Mao Saeki / 佐伯真於

I am a founding member and Research Scientist at Equmenopolis Inc., where I lead the development of the InteLLA virtual agent. I am also pursuing a Ph.D. in Computer Science at Waseda University under the supervision of Prof. Tetsunori Kobayashi. My interest lies in multimodal conversational AI, especially the understanding and generation of non-verbal cues. I have built AI agents from the ground up and scaled them to serve thousands of users, while conducting cutting-edge research on turn-taking and gesture generation, integrating them into real-world applications.

Projects

InteteLLA - Intelligent Language Learning Assistant

InteLLA automatically measures the conversational profiency of English learners through naturalistic conversation, enabled by gesture generation, turn-taking and proficiency assessment models. By adapting the conversational difficulty to each user though real time assessment, it is able to extract user's full potential, and give accurate assessment. InteLLA has previosly won the Bronze award at the QS-Wharton Reimagine Education Award 2021 in the Learning Assessment Category.

We are actively looking for collaboraters! Contact me if you are interested.

Experiences

  • Equmenopolis Inc., Research Scientist, 2022 - Present
  • Waseda University, Research Associate, Perceptual Computing Group, 2020 - 2023
    Developed conversational AI agent for language learning assistance
  • National Institute of Advanced Industrial Science and Technology, Research Assistant, 2018 - 2019
    Worked on anomaly detection for industrial machinery
  • LPixel Inc., Development Intern, 2018 - 2019
    Worked on lung cancer detection from CT scan images

Education

  • Ph.D. in Computer Science and Engineering, Waseda University, 2020 - Present (Advisor: Tetsunori Kobayashi)
  • M.E. in Computer Science and Engineering, Waseda University, 2020 (Advisor: Tetsunori Kobayashi)
  • B.E. in Mechanical Engineering, Waseda University, 2018

Awards

  • Best Paper, SIGdial, September 2024
  • Best Student Paper, Interspeech, August 2023
  • Best presentation, The Japanese Society for Artificial Intelligence, June 2022
  • Reimagine Education Award, Learning Assessment Category Bronze, Quarrelli Simmons (QS) and The Wharton School of the University of Pennsylvania (MBA), December 2021
  • Young Researcher Award for Excellent Research, The Japanese Society for Artificial Intelligence SIG-SLUD (Special Interest Group of Speech, Language Understanding and Discourse Processing), October 2021

Publications

  • Mao Saeki, Hiroaki Takatsu, Fuma Kurata, Shungo Suzuki, Masaki Eguchi, Ryuki Matsuura, Kotaro Takizawa, Sadahiro Yoshikawa, Yoichi Matsuyama, "InteLLA: Intelligent Language Learning Assistant for Assessing Language Proficiency through Interviews and Roleplays", Proc. The 25th Annual Meeting of the Special Interest Group on Discourse and Dialogue (SIGdial), September 2024.
  • Fuma Kurata, Mao Saeki, Shinya Fujie, and Yoichi Matsuyama, "Multimodal Turn-Taking Model Using Visual Cues for End-of-Utterance Prediction in Spoken Dialogue Systems", Proc. The 24th Annual Conference of the International Speech Communication Association (INTERSPEECH), August 2023.
  • Mao Saeki, Kotoka Miyagi, Shinya Fujie, Shungo Suzuki, Tetsuji Ogawa, Tetsunori Kobayashi, and Yoichi Matsuyama, "Confusion detection for adaptive conversational strategies of an oral proficiency assessment interview agent", Proc. The 23rd Annual Conference of the International Speech Communication Association (INTERSPEECH), September 2022.
  • Shungo Suzuki, Ryuki Matsuura, Mao Saeki, and Yoichi Matsuyama, "How is dialogic fluency different from monologic fluency? The case of oral proficiency interview", 9th International Conference on Task-based language teaching (TBLT), August, 2022.
  • Shungo Suzuki, Ryuki Matsuura, Mao Saeki, and Yoichi Matsuyama, "Temporal features distinguish between second language oral proficiency levels? The case of Japanese learners of English", 31st annual conference of the European Second Language Association (EUROSLA), August 2022.
  • Shungo Suzuki, Ryuki Matsuura, Mao Saeki, and Yoichi Matsuyama, "Revisiting the assessment potential of read-aloud speech performance: Cognitive validity and predictive validity", 43rd Language Testing Research Colloquium (LTRC), March 2022
  • Ryuki Matsuura, Shungo Suzuki, Mao Saeki, Tetsuji Ogawa, and Yoichi Matsuyama, "Automated scoring of L2 fluency based on detection of disfluency words and pause locations", Acoustic Society of Japan (ASJ), March 2022,
  • Mao Saeki, Ryuki Matsuura, Shungo Suzuki, Kotoka Miyagi, Tetsunori Kobayashi, and Yoichi Matsuyama, "InteLLA: A Speaking Proficiency Assessment Conversational Agent with Adaptive Interview Strategy", The Japanese Society for Artificial Intelligence (JSAI), SIG-SLUD, October 2021.
  • Mao Saeki, Weronika Demkow, Tetsunori Kobayashi, and Yoichi Matsuyama, “A WoZ Study for an Incremental Proficiency Scoring Interview Agent Eliciting Ratable Samples“, Proc. The 12th International Workshop on Spoken Dialog System Technology (IWSDS 2021), November 2021
  • Mao Saeki, Yoichi Matsuyama, Satoshi Kobashikawa, Tetsuji Ogawa, and Tetsunori Kobayashi, “Analysis of Multimodal Features for Speaking Proficiency Scoring in An Interview Dialogue“, Proc. The 8th IEEE Spoken Language Technology Workshop (SLT 2021), pp.629-635, January 2021.

Contact

email: saeki[at]equ.ai