25th International Conference on Database Systems for Advanced Applications

Sep. 24-27, 2020, Jeju, South Korea

Click following URL

http://dasfaa2020.sigongji.com

to visit DASFAA 2020 Online Event Site

Paper details

Title: A Fast Automated Model Selection Approach Based on Collaborative Knowledge

Authors: Zhenyuan Sun, Zixuan Chen, Zhenying He, Yinan Jing and X. Sean Wang

Abstract: Great attention has been paid to data science in recent years. Besides data science experts, plenty of researchers from other domains are conducting data analysis as well because big data is becoming more easily accessible. However, for those non-expert researchers, it can be quite difficult to find suitable models to conduct their analysis tasks because of their lack of expertise and the existence of excessive models. In the meantime, existing model selection approaches rely too much on the content of data sets and take quite long time to make the selection, which makes these approaches inadequate to recommend models to non-experts online. In this paper, we present an efficient approach to conducting automated model selection efficiently based on analysis history and knowledge graph embeddings. Moreover, we introduce exterior features of data sets to enhance our approach as well as address the cold start issue. We conduct several experiments on competition data from Kaggle, a well-known online community of data researchers. Experimental results show that our approach can improve model selection efficiency dramatically and retain high accuracy as well.

Video file:

Slide file:

Sponsors