Fig 2 : Proposed Approach Framework to Control Telepresence Robot during Delay in Communication
The hybrid network divides into an online and targets part in the proposed approach structure comparable to DDPG alone. Although the two networks have the same topology, their update patterns are different. Figure 3 illustrates the hybrid network topology, where an LSTM component along with the decision component make up the main network. The main network’s job is identifying the best policy for converting the state-to-action mapping.