TY - JOUR
T1 - An intelligent financial portfolio trading strategy using deep Q-learning
AU - Park, Hyungjun
AU - Sim, Min Kyu
AU - Choi, Dong Gu
N1 - Publisher Copyright:
© 2020 Elsevier Ltd
PY - 2020/11/15
Y1 - 2020/11/15
N2 - Portfolio traders strive to identify dynamic portfolio allocation schemes that can allocate their total budgets efficiently through the investment horizon. This study proposes a novel portfolio trading strategy in which an intelligent agent is trained to identify an optimal trading action using deep Q-learning. We formulate a Markov decision process model for the portfolio trading process that adopts a discrete combinatorial action space and determines the trading direction at a prespecified trading size for each asset, thus ensuring practical applicability. Our novel portfolio trading strategy takes advantage of three features to outperform other strategies in real-world trading. First, a mapping function is devised to handle and transform any action that is initially proposed but found to be infeasible into a similar and valuable feasible action. Second, by overcoming the dimensionality problem, this study establishes agent and Q-network models to derive a multi-asset trading strategy in the predefined action space. Last, this study introduces a technique that can derive a well-fitted multi-asset trading strategy by designing an agent to simulate all feasible actions in each state. To validate our approach, we conduct backtesting for two representative portfolios and demonstrate superior results over the benchmark strategies.
AB - Portfolio traders strive to identify dynamic portfolio allocation schemes that can allocate their total budgets efficiently through the investment horizon. This study proposes a novel portfolio trading strategy in which an intelligent agent is trained to identify an optimal trading action using deep Q-learning. We formulate a Markov decision process model for the portfolio trading process that adopts a discrete combinatorial action space and determines the trading direction at a prespecified trading size for each asset, thus ensuring practical applicability. Our novel portfolio trading strategy takes advantage of three features to outperform other strategies in real-world trading. First, a mapping function is devised to handle and transform any action that is initially proposed but found to be infeasible into a similar and valuable feasible action. Second, by overcoming the dimensionality problem, this study establishes agent and Q-network models to derive a multi-asset trading strategy in the predefined action space. Last, this study introduces a technique that can derive a well-fitted multi-asset trading strategy by designing an agent to simulate all feasible actions in each state. To validate our approach, we conduct backtesting for two representative portfolios and demonstrate superior results over the benchmark strategies.
KW - Deep neural network
KW - Deep Q-learning
KW - Markov decision process
KW - Portfolio trading
KW - Reinforcement learning
UR - http://www.scopus.com/inward/record.url?scp=85085467248&partnerID=8YFLogxK
U2 - 10.1016/j.eswa.2020.113573
DO - 10.1016/j.eswa.2020.113573
M3 - Article
AN - SCOPUS:85085467248
SN - 0957-4174
VL - 158
JO - Expert Systems with Applications
JF - Expert Systems with Applications
M1 - 113573
ER -