Ye Bai
Ph.D. Student in Artificial Intelligence for Healthcare
The University of Melbourne
About Me

I am currently a first-year PhD student at The University of Melbourne, supervised by Prof. David O'Neal .

Previously, I worked for 5+ years in the trust industry, managing 300+ financial projects exceeding ¥50 billion, which strengthened my analytical and cross-disciplinary problem-solving skills.

Research Interests: Multi-Agent Systems • Conversational RAG • AI for Healthcare • Long-term Memory Systems

Education
  • The University of Melbourne
    The University of Melbourne
    Ph.D. in Artificial Intelligence
    Oct. 2025 - present
  • Monash University
    Monash University
    Master of Artificial Intelligence
    Mar. 2023 - Jun. 2025
  • Beijing Institute of Technology
    Beijing Institute of Technology
    Master of Professional Accounting
    Sep. 2016 - Jun. 2018
  • Beijing Institute of Technology
    Beijing Institute of Technology
    Bachelor of Accounting
    Sep. 2012 - Jun. 2016
Experience
  • Huaneng Guicheng Trust Co., Ltd.
    Huaneng Guicheng Trust Co., Ltd.
    Trust Project Accountant
    Jun. 2017 - Jul. 2022
Honors & Awards
  • Full Ph.D. Scholarship, The University of Melbourne
    2025
  • Master of AI, awarded with Distinction, Monash University
    2025
  • National Scholarship(top 2.5%), BIT-Beijing
    2016
  • Outstanding Graduate Award, BIT-Beijing
    2016
Selected Publications (view all )
MAPLE: Multi-Agent Adaptive Planning with Long-Term Memory for Table Reasoning
MAPLE: Multi-Agent Adaptive Planning with Long-Term Memory for Table Reasoning

Ye Bai, Minghan Wang, Thuy-Trang Vu

The 23rd Annual Workshop of the Australasian Language Technology Association (ALTA 2025) 2025

Table-based question answering requires complex reasoning capabilities that current LLMs struggle to achieve with single-pass inference. Existing approaches, such as Chain-of-Thought reasoning and question decomposition, lack error detection mechanisms and discard problem-solving experiences, contrasting sharply with how humans tackle such problems. In this paper, we propose MAPLE (Multi-agent Adaptive Planning with Long-term mEmory), a novel framework that mimics human problem-solving through specialized cognitive agents working in a feedback-driven loop. MAPLE integrates 4 key components: (1) a Solver using the ReAct paradigm for reasoning, (2) a Checker for answer verification, (3) a Reflector for error diagnosis and strategy correction, and (4) an Archiver managing long-term memory for experience reuse and evolution. Experiments on WiKiTQ and TabFact demonstrate significant improvements over existing methods, achieving state-of-the-art performance across multiple LLM backbones.

MAPLE: Multi-Agent Adaptive Planning with Long-Term Memory for Table Reasoning

Ye Bai, Minghan Wang, Thuy-Trang Vu

The 23rd Annual Workshop of the Australasian Language Technology Association (ALTA 2025) 2025

Table-based question answering requires complex reasoning capabilities that current LLMs struggle to achieve with single-pass inference. Existing approaches, such as Chain-of-Thought reasoning and question decomposition, lack error detection mechanisms and discard problem-solving experiences, contrasting sharply with how humans tackle such problems. In this paper, we propose MAPLE (Multi-agent Adaptive Planning with Long-term mEmory), a novel framework that mimics human problem-solving through specialized cognitive agents working in a feedback-driven loop. MAPLE integrates 4 key components: (1) a Solver using the ReAct paradigm for reasoning, (2) a Checker for answer verification, (3) a Reflector for error diagnosis and strategy correction, and (4) an Archiver managing long-term memory for experience reuse and evolution. Experiments on WiKiTQ and TabFact demonstrate significant improvements over existing methods, achieving state-of-the-art performance across multiple LLM backbones.

Discrete Minds in a Continuous World: Do Language Models Know Time Passes?
Discrete Minds in a Continuous World: Do Language Models Know Time Passes?

Minghan Wang, Ye Bai, Thuy-Trang Vu, Ehsan Shareghi, Gholamreza Haffari

Conference on Empirical Methods in Natural Language Processing (EMNLP) 2025

While Large Language Models (LLMs) excel at temporal reasoning tasks like event ordering and duration estimation, their ability to perceive the actual passage of time remains unexplored. We investigate whether LLMs perceive the passage of time and adapt their decision-making accordingly through three complementary experiments. First, we introduce the Token-Time Hypothesis, positing that LLMs can map discrete token counts to continuous wall-clock time, and validate this through a dialogue duration judgment task. Second, we demonstrate that LLMs could use this awareness to adapt their response length while maintaining accuracy when users express urgency in question answering tasks. Finally, we develop BombRush, an interactive navigation challenge that examines how LLMs modify behavior under progressive time pressure in dynamic environments. Our findings indicate that LLMs possess certain awareness of time passage, enabling them to bridge discrete linguistic tokens and continuous physical time, though this capability varies with model size and reasoning abilities. This work establishes a theoretical foundation for enhancing temporal awareness in LLMs for time-sensitive applications.

Discrete Minds in a Continuous World: Do Language Models Know Time Passes?

Minghan Wang, Ye Bai, Thuy-Trang Vu, Ehsan Shareghi, Gholamreza Haffari

Conference on Empirical Methods in Natural Language Processing (EMNLP) 2025

While Large Language Models (LLMs) excel at temporal reasoning tasks like event ordering and duration estimation, their ability to perceive the actual passage of time remains unexplored. We investigate whether LLMs perceive the passage of time and adapt their decision-making accordingly through three complementary experiments. First, we introduce the Token-Time Hypothesis, positing that LLMs can map discrete token counts to continuous wall-clock time, and validate this through a dialogue duration judgment task. Second, we demonstrate that LLMs could use this awareness to adapt their response length while maintaining accuracy when users express urgency in question answering tasks. Finally, we develop BombRush, an interactive navigation challenge that examines how LLMs modify behavior under progressive time pressure in dynamic environments. Our findings indicate that LLMs possess certain awareness of time passage, enabling them to bridge discrete linguistic tokens and continuous physical time, though this capability varies with model size and reasoning abilities. This work establishes a theoretical foundation for enhancing temporal awareness in LLMs for time-sensitive applications.

All publications