Table 4 Comparison of Q-values for each state-action pair.

From: Optimization of dynamic incentive strategies for public transportation based on reinforcement learning and network synergy effect

 

State      0.5 CNY   1.0 CNY   1.5 CNY   2.0 CNY
Peak       2.1794    2.3452    2.5913    2.0539
Peak-Off   1.3733    1.5996    1.3298    1.6519
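
One way to read these values is to pick, for each state, the incentive level with the largest Q-value (the greedy choice). The following is a minimal sketch in Python, assuming the rows (Peak, Peak-Off) are the states and the CNY amounts are the actions. The names `Q_TABLE`, `INCENTIVES_CNY`, and `greedy_incentive` are hypothetical and introduced here only for illustration; they do not come from the source paper.

```python
# Q-values copied from Table 4.
# Assumption: rows are states, columns are incentive levels (actions) in CNY.
INCENTIVES_CNY = [0.5, 1.0, 1.5, 2.0]
Q_TABLE = {
    "Peak": [2.1794, 2.3452, 2.5913, 2.0539],
    "Peak-Off": [1.3733, 1.5996, 1.3298, 1.6519],
}


def greedy_incentive(state):
    """Return (incentive, q_value) for the largest Q-value in the given state."""
    q_values = Q_TABLE[state]
    # Index of the maximum Q-value in this row.
    best = max(range(len(q_values)), key=q_values.__getitem__)
    return INCENTIVES_CNY[best], q_values[best]


if __name__ == "__main__":
    for state in Q_TABLE:
        incentive, q_value = greedy_incentive(state)
        print(f"{state}: {incentive} CNY (Q = {q_value})")
```

Under this reading, the greedy choice is 1.5 CNY in the Peak state (Q = 2.5913) and 2.0 CNY in the Peak-Off state (Q = 1.6519).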