論文検索サイト

システム/制御/情報 Vol. 27 (2014), No. 8

ISIJ International
belloff
オンライン版ISSN: 2185-811X
冊子版ISSN: 1342-5668
発行機関: THE INSTITUTE OF SYSTEMS, CONTROL AND INFORMATION ENGINEERS (ISCIE)

Backnumber

  1. Vol. 37 (2024)

  2. Vol. 36 (2023)

  3. Vol. 35 (2022)

  4. Vol. 34 (2021)

  5. Vol. 33 (2020)

  6. Vol. 32 (2019)

  7. Vol. 31 (2018)

  8. Vol. 30 (2017)

  9. Vol. 29 (2016)

  10. Vol. 28 (2015)

  11. Vol. 27 (2014)

  12. Vol. 26 (2013)

  13. Vol. 25 (2012)

  14. Vol. 24 (2011)

  15. Vol. 23 (2010)

  16. Vol. 22 (2009)

  17. Vol. 21 (2008)

  18. Vol. 20 (2007)

  19. Vol. 19 (2006)

  20. Vol. 18 (2005)

  21. Vol. 17 (2004)

  22. Vol. 16 (2003)

  23. Vol. 15 (2002)

  24. Vol. 14 (2001)

  25. Vol. 13 (2000)

  26. Vol. 12 (1999)

  27. Vol. 11 (1998)

  28. Vol. 10 (1997)

  29. Vol. 9 (1996)

  30. Vol. 8 (1995)

  31. Vol. 7 (1994)

  32. Vol. 6 (1993)

  33. Vol. 5 (1992)

  34. Vol. 4 (1991)

  35. Vol. 3 (1990)

  36. Vol. 2 (1989)

  37. Vol. 1 (1988)

システム/制御/情報 Vol. 27 (2014), No. 8

状態非依存の方策を用いた新しい強化学習手法の提案

中野 太智, 前田 新一, 石井 信

pp. 327-332

抄録

Usually, reinforcement learning (RL) algorithms have a difficulty to learn the optimal control policy as the dimensionality of the state (and action) becomes large, because of the explosive increase in the search space to optimize. To avoid such an unfavorable explosive increase, in this study, we propose BASLEM algorithm (Blind Action Sequence Learning with EM algorithm) which acquires a state-independent and time-dependent control policy starting from a certain fixed initial state. Numerical simulation to control a non-holonomic system shows that RL of state-independent and time-dependent policies attain great improvement in efficiency over the existing RL algorithm.

ブックマーク

SNSによる共有

論文タイトル

状態非依存の方策を用いた新しい強化学習手法の提案

Tuning of Nonlinear Model Predictive Controller for Parameter-Dependent Systems and its Application to the Speed Control of Spark Ignition Engines

Fatima Tahir, Toshiyuki Ohtsuka

pp. 333-342

抄録

In this paper, we propose a systematic method for the efficient tuning of the performance index in Nonlinear Model Predictive Control (NMPC) of parameter-dependent systems. The quadratic cost function in NMPC is tuned by applying the inverse optimality conditions on the linear quadratic regulator designed for the linearized model using the Inverse Linear Quadratic (ILQ) regulator design method. This approach provides some tuning parameters that give a trade-offbetween the speed of the system’s response and the magnitude of the control input. We propose two systematic methods for the selection of parameter-dependent tuning parameter. This approach is applied to the speed control of mean-value model of Spark Ignition (SI) engines. Effectiveness of the proposed methods is elaborated in simulation results.

ブックマーク

SNSによる共有

論文タイトル

可観測正準型デルタオペレータモデルによる同定の簡便化

岩元 麗之, 熊澤 典良, 市原 裕之, 阿部 直人, 福井 泰好

pp. 343-351

抄録

A delta operator model is usually applied to identification problems with short sampling periods. To identify the system, we have to compute the difference of the input and output data in discrete time and design a noise filter. In this paper, we propose the delta operator model with the observable canonical form. By using this model, we can easily estimate the parameters of the delta operator model without the difference of the input and output data. We derive the Cramer Rao inequality, which evaluates the performance for estimating parameters in the proposed model. Finally, we show the results of the identification experiments for an inertia rotor pendulum and a nonminimum phase system.

ブックマーク

SNSによる共有

論文タイトル

可観測正準型デルタオペレータモデルによる同定の簡便化

この機能はログイン後に利用できます。
下のボタンをクリックしてください。

詳細検索

論文タイトル

著者

抄録

ジャーナル名

出版日を西暦で入力してください(4桁の数字)。

検索したいキーワードを入力して下さい