論文検索サイト

システム/制御/情報 Vol. 35 (2022), No. 12

ISIJ International
belloff
オンライン版ISSN: 2185-811X
冊子版ISSN: 1342-5668
発行機関: THE INSTITUTE OF SYSTEMS, CONTROL AND INFORMATION ENGINEERS (ISCIE)

Backnumber

  1. Vol. 37 (2024)

  2. Vol. 36 (2023)

  3. Vol. 35 (2022)

  4. Vol. 34 (2021)

  5. Vol. 33 (2020)

  6. Vol. 32 (2019)

  7. Vol. 31 (2018)

  8. Vol. 30 (2017)

  9. Vol. 29 (2016)

  10. Vol. 28 (2015)

  11. Vol. 27 (2014)

  12. Vol. 26 (2013)

  13. Vol. 25 (2012)

  14. Vol. 24 (2011)

  15. Vol. 23 (2010)

  16. Vol. 22 (2009)

  17. Vol. 21 (2008)

  18. Vol. 20 (2007)

  19. Vol. 19 (2006)

  20. Vol. 18 (2005)

  21. Vol. 17 (2004)

  22. Vol. 16 (2003)

  23. Vol. 15 (2002)

  24. Vol. 14 (2001)

  25. Vol. 13 (2000)

  26. Vol. 12 (1999)

  27. Vol. 11 (1998)

  28. Vol. 10 (1997)

  29. Vol. 9 (1996)

  30. Vol. 8 (1995)

  31. Vol. 7 (1994)

  32. Vol. 6 (1993)

  33. Vol. 5 (1992)

  34. Vol. 4 (1991)

  35. Vol. 3 (1990)

  36. Vol. 2 (1989)

  37. Vol. 1 (1988)

システム/制御/情報 Vol. 35 (2022), No. 12

個別の学習機構を持つ階層システムのためのアーキテクチャ

菊谷 彪文, 定本 知徳

pp. 289-299

抄録

In this paper, we propose an architecture for realizing distributed reinforcement learning of distributed controllers for a class of unknown hierarchical systems, where homogeneous subsystems are interconnected through a complete graph. All these controllers consist of two sub-controllers for average and difference dynamics of the system, respectively. First, we show that optimal sub-controllers can be trained individually by a reinforcement learning (RL) method for average/difference data. Due to the smaller-scale of the data, the learning time of the proposed method can be drastically reduced compared to existing RL methods. However, the computation for obtaining the average data requires all-to-all communication among subsystems, which is undesirable in terms of communication costs and security. Hence, by exploiting a distributed consensus observer, we propose an architecture that enables us to learn distributed optimal controllers in a distributed manner. The control performance of the trained controller is shown to be ideally optimal. Moreover, the proposed architecture is completely scalable, i.e., its computational cost is independent from the number of subsystems. The effectiveness is shown through numerical simulations.

ブックマーク

SNSによる共有

論文タイトル

個別の学習機構を持つ階層システムのためのアーキテクチャ

ルール導出法を改善したSTRIMの提案と部分一致仮説データへの適用

加藤 裕一, 佐伯 徹郎

pp. 300-310

抄録

Various data mining models and/or methods have been proposed to date. A statistical test rule induction method (STRIM) has been proposed as one of them, that induces if-then rules hidden in a dataset known as the decision table generated based on a simple hypothesis. This study improves the previous data generation model using a hypothesis similar to human rating and the rule induction method to adapt to real-world datasets. Specifically, 1) the hypothesis is expanded from a complete correspondence hypothesis to a partial correspondence hypothesis. 2) The previous rule induction method is developed into a Bayesian STRIM, that infers and/or explores the causes based on the results. The applied rule induction method’s validity and usefulness are confirmed using a verification system. The relationship and difference between Bayesian STRIM against a maximum a posteriori probability estimate and a Bayesian network method are also studied in the rule induction problem.

ブックマーク

SNSによる共有

論文タイトル

ルール導出法を改善したSTRIMの提案と部分一致仮説データへの適用

この機能はログイン後に利用できます。
下のボタンをクリックしてください。

詳細検索

論文タイトル

著者

抄録

ジャーナル名

出版日を西暦で入力してください(4桁の数字)。

検索したいキーワードを入力して下さい