• No results found

Lenient multi-agent deep reinforcement learning

N/A
N/A
Protected

Academic year: 2021

Share "Lenient multi-agent deep reinforcement learning"

Copied!
9
0
0

Loading.... (view fulltext now)

Full text

Loading

Figure

Figure 1: Lenient-DQN Architecture. We build on the stan- stan-dard Double-DQN architecture [33] by adding a lenient loss function (top right, see Section 4.1)
Table 1: Hyper-parameters
Figure 4: TMC and TDS schedules used during analysis.
Figure 6: Noisy Stochastic CMOTP Average Reward

References

Related documents

By extracting hydrocarbons from drilling fluid returns at surface, the FlairFlex service provides fluid characterization and early insight into C 1 –C 6 reservoir fluid

Bhatnagar University Institute of Chemical Engineering & Technology, Panjab University, Sector-14, Chandigarh for admission under Foreign Nationals/NRI

The significant and negative effect obtained for the industry com- mon support group (group 6 in Table 10) originates from the year the experiment started in Kainuu (2005), where

This research was aimed at developing the use of journals can develop the ability of the eleventh grade students at SMA Negeri 1 Sojol in writing Recount text. It was

Chirurgia brzucha, narzędzia do chirurgii żołądka, jelit i odbytu Abdominale Chirurgie,Magen-, Darm- und Rektum-Instrumente Abdominal Surgery, Intestinal- and Rectal

The Staff Associations have, over the years, taken a very proactive stance in all discipline and complaint matters affecting the standards of police officers and

Overall, the Group’s total sales increased by 30.2 % in the first three months of 2014 to EUR 126.3 m compared to EUR 97.0 m in the same period of the previous year.. This

Vertebrate hearts made of specially arranged striated muscle fibers Composed of interconnected cells, each with its own nucleus. Interconnections appear as lines called