• Rasmus Eckholdt Andersen
  • Emil Blixt Hansen
  • Steffen Madsen
4. semester, Virksomhedsteknologi, Kandidat (Kandidatuddannelse)
The need for adaptable models, e.g. reinforcement learning (RL), have in recent years been more present within the industry. However, the number of commercial solutions using RL is limited, one reason being the complexity related to the design of RL. Therefore, a method to identify complexities of RL for industrial applications is presented in this thesis. It was used on 15 applications inspired from four industrial companies. Complexity was especially identified in relation to the reward functions. Thus two Linear Inverse RL (IRL) algorithms in which the reward function is represented as a linear combination of features, was tested using expert data. Some of the tests indicated a visual better result than tests carried out using RL. The process of designing features shared similarities with the process of designing a reward function. The added complexity of implementing Linear IRL and constructing expert data is thus not always a simpler approach. The IRL method GAIL, which requires no feature construction, was furthermore tested showing potential.
Udgivelsesdato3 jun. 2019
Antal sider109
ID: 304987415