T. G. Dietterich, The MAXQ method for hierarchical reinforcement learning, ICML, 1998.

C. Diuk, A. L. Strehl, and M. L. Littman, A hierarchical approach to efficient reinforcement learning in deterministic domains, AAMAS, 2006.

M. Ghavamzadeh and S. Mahadevan, A multiagent reinforcement learning algorithm by dynamically merging Markov decision processes, AAMAS, 2002.

M. Ghavamzadeh and S. Mahadevan, Learning to communicate and act using hierarchical reinforcement learning, AAMAS, 2004.

T. Hester and P. Stone, Generalized model learning for reinforcement learning in factored domains, AAMAS, 2009.

M. Qu, H. Zhu, J. Liu, G. Liu, and H. Xiong, A cost-effective recommender system for taxi drivers, KDD, 2014.

R. S. Sutton and A. G. Barto, Introduction to Reinforcement Learning, 1998.

C. Tan, Driverless vehicles hit the road in trials around singapore. Straits Times, 2015.

T. K. Tan-cheon and . Kheong, Autonomous vehicles, next stop: Singapore. Journeys, 2014.

C. J. Watkins and P. Dayan, Technical note: Q-learning, Mach. Learn, vol.8, pp.3-4, 1992.

J. Yuan, Y. Zheng, X. Xie, and G. Sun, T-drive: Enhancing driving directions with taxi drivers' intelligence, TKDE, vol.25, issue.1, 2013.

N. J. Yuan, Y. Zheng, L. Zhang, and X. Xie, T-finder: A recommender system for finding passengers and vacant taxis, TKDE, vol.25, issue.10, 2013.

Y. Zheng, Trajectory data mining: An overview, ACM Trans. Intell. Syst. Technol, vol.6, issue.3, 2015.