Special Issue Papers

Learning multi-market microstructure from order book data

Geonhwan Ju (corresponding author), Department of Industrial and Systems Engineering, Korea Advanced Institute of Science and Technology (KAIST), 291 Daehak-ro, Yuseong-gu, Daejeon 305-701, South Korea

https://orcid.org/0000-0001-9661-5162

Kyoung-Kuk Kim, Department of Industrial and Systems Engineering, Korea Advanced Institute of Science and Technology (KAIST), 291 Daehak-ro, Yuseong-gu, Daejeon 305-701, South Korea

https://orcid.org/0000-0002-9661-8707

Dong-Young Lim, Department of Industrial and Systems Engineering, Korea Advanced Institute of Science and Technology (KAIST), 291 Daehak-ro, Yuseong-gu, Daejeon 305-701, South Korea

https://orcid.org/0000-0002-4677-965X

Abstract

In this paper, we investigate high-frequency market behavior using neural networks trained on order book data. Extensive experiments are conducted on 110 asset pairs covering 97% of the spot-futures pairs on the Korea Exchange. We propose an efficient training scheme that improves performance and training stability. Using this scheme, we measure the lead–lag relationship between spot and futures markets by comparing the performance gain that each market's data provides in predicting the other. In addition, we analyze the gradients of the trained model to understand which market features the neural networks learn during training, revealing characteristics of the market microstructure. Our results show that highly complex neural network models can successfully learn market features such as order imbalance, spread-volatility correlation, and mean reversion.

Acknowledgments

The authors thank their project counterpart for providing valuable datasets. Constructive comments from Prof. Jinwoo Shin are greatly appreciated. Author names are in alphabetical order.

Disclosure statement

No potential conflict of interest was reported by the authors.

ORCID

Geonhwan Ju https://orcid.org/0000-0001-9661-5162

Kyoung-Kuk Kim https://orcid.org/0000-0002-9661-8707

Dong-Young Lim http://orcid.org/0000-0002-4677-965X

Notes

1 Since the amount of data varies by asset, more training epochs are required for assets with less market activity. We find that 160 epochs are enough to ensure convergence for all assets.

2 From the cross-validation results, we found that using training data dated after the test set gives no performance gain in predicting the micro-movements. This is mainly due to the highly localized nature of short-term price dynamics.

3 We tried longer time delays up to 60 s, and seven labels were enough to improve the training stability. Since labels with longer time delay are less correlated with short-term price movements, using more labels with longer time delays results in underfitting.
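Note 3 describes training against several labels that differ in their time delay. The following is a minimal sketch of how such labels might be constructed, assuming each label is the sign of the mid-price change over a given delay. The function name `multi_horizon_labels`, the sign-based labeling, and the example delays are illustrative assumptions, not the authors' implementation.

```python
from bisect import bisect_right


def sign(x):
    """Return -1, 0, or 1 according to the sign of x."""
    return (x > 0) - (x < 0)


def multi_horizon_labels(times, mids, delays):
    """Build one label per delay for every observation (hypothetical helper).

    times  : sorted observation timestamps in seconds
    mids   : mid-prices observed at those timestamps
    delays : time delays in seconds, one label per delay

    The label for observation i and delay d is the sign of
    mid(t_i + d) - mid(t_i), where mid(t) is the last mid-price
    observed at or before t. It is None when t_i + d lies beyond
    the last timestamp, so no future price is available.
    """
    last_time = times[-1]
    labels = []
    for t, m in zip(times, mids):
        row = []
        for d in delays:
            target = t + d
            if target > last_time:
                row.append(None)
            else:
                # Index of the last observation at or before the target time.
                j = bisect_right(times, target) - 1
                row.append(sign(mids[j] - m))
        labels.append(row)
    return labels


# Toy data; the two delays here are arbitrary and chosen only for illustration.
times = [0.0, 1.0, 2.0, 5.0]
mids = [100.0, 100.5, 100.0, 99.5]
labels = multi_horizon_labels(times, mids, delays=[1.0, 3.0])
```

Under this construction, labels with longer delays depend on prices further from the current observation, which is consistent with the note's remark that they are less correlated with short-term price movements.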

Additional information

Funding

This work was supported by the National Research Foundation of Korea (NRF-2019R1A2C1003144).
