IMV-LSTM: Interpretable Forecasting and Knowledge Mining with Multi-Variable LSTMs

EXPLORING THE INTERPRETABILITY OF LSTM NEURAL NETWORKS OVER MULTI-VARIABLE DATA

Anonymous authors
Paper under double-blind review

ABSTRACT

In learning a predictive model over multivariate time series consisting of target and exogenous variables, both the forecasting performance and the interpretability of the model are essential for deployment and for uncovering knowledge behind the data. To this end, we propose the interpretable multi-variable LSTM recurrent neural network (IMV-LSTM), which provides accurate forecasting as well as importance interpretation at both the temporal and the variable level. In particular, IMV-LSTM is equipped with tensorized hidden states and a matching update process, so that it learns variable-wise hidden states. On top of this, we develop a mixture attention mechanism and associated summarization methods to quantify the temporal and variable importance in data. Extensive experiments on real datasets demonstrate the prediction performance and interpretability of IMV-LSTM in comparison to a variety of baselines. It also shows promise as an end-to-end framework for both forecasting and knowledge extraction over multi-variate data.

1 INTRODUCTION

Our daily life is now surrounded by various types of sensors, ranging from smart phones, video cameras, and the Internet of Things to robots. The observations yielded by such devices over time are naturally organized as time series data (Qin et al., 2017; Yang et al., 2015). In this paper, we focus on multi-variable time series consisting of target and exogenous variables. Each variable corresponds to one monitored quantity in the physical world. A predictive model over such multi-variable data aims to predict the future values of the target series using historical values of the target and exogenous series. In addition to forecasting, the interpretability of prediction models is essential for deployment and knowledge extraction (Hu et al., 2018; Foerster et al., 2017; Lipton, 2016).
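The forecasting setup described above can be made concrete with a small sliding-window sketch. It is only an illustration: the function name `make_windows`, the one-step-ahead horizon, and the convention that column 0 holds the target are assumptions, not details taken from the paper.

```python
import numpy as np

def make_windows(series, window, target_col=0):
    """Turn a (T, N) multi-variable series into one-step-ahead training pairs.

    X[k] holds `window` consecutive observations of all N variables
    (target and exogenous); y[k] is the target value at the next time step.
    """
    series = np.asarray(series, dtype=float)
    T = series.shape[0]
    if T <= window:
        raise ValueError("series must be longer than the window")
    X = np.stack([series[k:k + window] for k in range(T - window)])
    y = series[window:, target_col]
    return X, y

# Toy data: 6 time steps, 2 variables (column 0 = target, column 1 = exogenous).
data = np.arange(12).reshape(6, 2)
X, y = make_windows(data, window=3)
print(X.shape, y)
```

Each `X[k]` contains the history of both the target and the exogenous variable, so the auto-regressive part and the exogenous part enter the model through the same input window.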
For the multi-variable time series in this paper, we focus on two types of importance interpretation. (1) Variable-wise temporal importance: exogenous variables exert different temporal influence on the target (Kirchgässner et al., 2012). For instance, if an exogenous variable has an instant effect on the target, its historical data at short time lags is expected to have high importance values. (2) Overall variable importance: exogenous variables and the auto-regressive part of the target variable differ in predictive power, which is reflected in different variable importance w.r.t. the prediction of the target (Feng et al., 2018; Riemer et al., 2016). The ability to unveil such knowledge through predictive models enables a fundamental understanding of the effect of exogenous variables on the target.

Recently, recurrent neural networks (RNNs), especially long short-term memory (LSTM) (Hochreiter & Schmidhuber, 1997) and the gated recurrent unit (GRU) (Cho et al., 2014), have proven to be powerful sequence modeling tools in a variety of tasks such as language modelling, machine translation, health informatics, time series, and speech (Ke et al., 2018; Lin et al., 2017; Lipton et al., 2015; Sutskever et al., 2014; Bahdanau et al., 2014). However, current RNNs fall short of the aforementioned interpretability for multi-variable data because of their opaque internal states. Specifically, when fed with the multi-variable observations of the target and exogenous variables, RNNs blindly blend the information of all variables into the memory cells and hidden states that are used for prediction. It is then intractable to distinguish the contribution of individual variables to the prediction through the hidden states (Zhang et al., 2017).
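The blending problem can be illustrated with a toy comparison between a dense recurrent step and a variable-wise one. This is a sketch under stated assumptions, not the IMV-LSTM update rule: it uses a plain tanh recurrence without gates, and the sizes `N` and `d`, the 0.3 weight scale, and all function names are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)
N, d = 3, 4  # hypothetical sizes: N input variables, d hidden units per variable

# Dense step: one matrix maps all N inputs into every hidden unit (blending).
W_dense = 0.3 * rng.normal(size=(N * d, N))
U_dense = 0.3 * rng.normal(size=(N * d, N * d))

def dense_step(h, x):
    return np.tanh(U_dense @ h + W_dense @ x)

# Variable-wise step: row n of the hidden matrix H only ever sees variable n.
W_var = 0.3 * rng.normal(size=(N, d))     # per-variable input weights
U_var = 0.3 * rng.normal(size=(N, d, d))  # per-variable recurrent weights

def variable_wise_step(H, x):
    # H has shape (N, d); x has shape (N,)
    recurrent = np.einsum("nij,nj->ni", U_var, H)
    return np.tanh(recurrent + W_var * x[:, None])

def run(step, h0, xs):
    h = h0
    for x in xs:
        h = step(h, x)
    return h

xs = rng.normal(size=(5, N))
xs_perturbed = xs.copy()
xs_perturbed[:, 2] += 1.0  # change only variable 2

H_a = run(variable_wise_step, np.zeros((N, d)), xs)
H_b = run(variable_wise_step, np.zeros((N, d)), xs_perturbed)
h_a = run(dense_step, np.zeros(N * d), xs)
h_b = run(dense_step, np.zeros(N * d), xs_perturbed)

# Which hidden rows / units react to the perturbation of variable 2?
rows_changed = np.any(~np.isclose(H_a, H_b), axis=1)
dense_changed = ~np.isclose(h_a, h_b)
print(rows_changed, int(dense_changed.sum()))
```

In the variable-wise version only the hidden row belonging to the perturbed variable can change, so each row can be attributed to one input variable. In the dense version the perturbation spreads across hidden units, which is why per-variable contributions cannot be read off the hidden state.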
Recently, attention-based neural networks have been proposed to enhance both the ability of RNNs to selectively use long-term memory and their interpretability (Vaswani et al., 2017; Qin et al., 2017; Choi et al., 2016; Vinyals et al., 2015; Chorowski et al., 2015; Bahdanau et al., 2014). Nevertheless, current
