基于深度强化学习的矿井通风阻力系数反演研究

Inversion of mine ventilation resistance coefficient based on deep reinforcement learning

  • 摘要: 矿井通风阻力系数对矿井通风系统安全管理、诊断和智能化发展至关重要,气流通常作为通风阻力系数反演的基础,但常规的非线性优化方法存在非唯一性的问题,影响了反演的精度。为此,基于深度强化学习(DRL)的新型优化方法来反演阻力系数,反演被视为1个马尔可夫决策过程,通风网络求解模型(VNSM)被嵌入到DRL环境中,采用近端策略优化对代理人的策略进行优化。现场试验表明,计算气流与实测气流的MAE为0.354,MSE为0.287,RMSE为0.536,MRE为0.013。与标准遗传算法、差分进化算法和进化策略算法相比,DRL方法的MREMAERMSEMSE值分别降低了23.5%、15.3%、14.1%和26.4%,与其他算法相比,DRL方法对不同巷道的敏感性差异较小。

     

    Abstract: Mine ventilation resistance coefficient is crucial to the safety management, diagnosis and intelligent development of mine ventilation system, and the airflow is usually used as the basis for the inversion of ventilation resistance coefficient, but the conventional nonlinear optimization method has the problem of non-uniqueness, which affects the accuracy of the inversion.A novel optimization method based on deep reinforcement learning(DRL)is used to invert the resistance coefficients, where the inversion is regarded as 1 Markov decision process, and the ventilation network solution model(VNSM)is embedded into the DRL environment, and the proximal strategy optimization is used to optimize the agent's strategy.Field tests show that MAE of the computed airflow versus the measured airflow is 0.354,MSE is 0.287,RMSE is 0.536,and MRE is 0.013.Compared with the standard genetic algorithm, differential evolutionary algorithm, and evolutionary strategy algorithms, DRL method reduces the values of MRE,MAE,RMSE and MSE by 23.5%,15.3%,14.1%,and 26.4%,respectively, and the sensitivity difference of DRL method to different roadways is small compared with other algorithms.

     

/

返回文章
返回