Dedale

Dedale is an environment for studying multi-agents coordination, learning and decision-making problems under realistic hypotheses. Highly parametrizable, Dedale allows to tackle either cooperative or competitive exploration, patrolling, treasure(s) or agent(s) hunt problems with teams from one to dozens of heterogeneous agents in discrete or continuous environments, local or distributed over the network.

In Dedale the environment is (initially) unknown to the agents. It is dynamic, partially observable and everything is asynchronous (agents and communication). No turn-per-turn here, we consider real-life hypotheses.

Citing Dedale

Cédric Herpson. Dedale: A Realistic Simulation Framework for Decentralized Multi-Agents Problems. (to appear - draft version :hal-02392383)

1. Environnement

In the discrete case, the environment is a graph. Nodes (each one associated to a unique identifier) are places were your agents find objects and the edges are the corridors between them. You can automatically generate topologies, import OpenStreetMap areas or design yours :

Dogoronev generated env — Dogoronev based

The size of generated environnments is parametrizable. Two agents cannot currently occupy one location at the same time, this is done in order to make you deal with interlocking through coordination.

Elements within the environments

All elements are optional and configurables

1.1 Wells

Wells are dangerous node where any agent coming-in falls and dies. Wells generate wind in their surrounding nodes (the distance is parametrizable), giving a warning to any passing agent. Depending of the situation, your agents may, or should (not ?) take the risk to explore further.

1.2 Treasures

Treasures of various types, value and difficulty can be automatically distributed or manually set in the environment. A treasure is stored within a safe, which can be locked or not, and added to a node.

If this feature is activated, your agents have to find them and collect as much as possible. When an agent (see next section) is in a room where a treasure is, and if its backPack and expertise allows it, then the agent can grab it.

If the safe is locked and the agent does not possess sufficient expertise to open it, he needs to call for help.
If the agent backPack is full or the treasure is of a different type than his, then he grab nothing
If the agent cannot grab all the treasure, the remaining treasure stays on the map. Another agent can come and take it.

The agent drops a portion of the treasure by picking it up. The more picking actions are made on a treasure, the more the loss is important. We will therefore opt for picking up a single treasure in a minimum of actions.

1.3 Dynamical environment

You can activate adversaries on the map. Thus, several teams can evoluate on the same graph.. But, to begin, we invite you to deal with the Golem. It is currently armless but will move (or steal) the treasures he finds. The coordination of your team will become critical to optimize the picking of the treasures.

2. Agents

At the beginning, agents are deployed (randomly or not) on the map. The agents do not know the topology beforehand. Thus, they have to discover and learn it.

Agents can be of 3(+1) different types :

Explorer : It cannot collect, only explore
Tanker : It possess a nearly unlimited storage capacity, but cannot pick anything
Collector : It possess a backpack specific to a given type of treasure, and a limited carrying capacity. It can pick a treasure, and drop it in the Tanker if the latter is in reach
HumanControlled : Only one agent can be of this type, it is used to locally test the behaviour of your team facing a keyboard-controlled agent

You can decide to activate one or several type of agents. The more types, the more ~~difficulties~~ fun ;)

2.1 Limited percepts

When an agent is in a room, it can perceives :

The id, unique - of the room
The occurrence, if any, of a treasure, its type, value and opening conditions (if any)
The links to the neighbouring rooms (and associated observations, if any)

2.2 Limited communication range

As your agents are cooperative, they should share their knowledge in order to coordinate and optimize their actions. In order to do that, your agent can communicate through messages. Nevertheless, the reach is limited. An Agent can only communicate with the ones in the neighbourhood (distance that can vary with each agent). The coordination solutions should thus take into account this fact. To control the number (and the size) of the messages exchanged can be a good idea.

2.3 Actions and Observations

The complete description of an agent’s interaction capabilities is available here

3. These configuration possibilities allow you to adress differents research problems :

3.1 Distributed exploration

Consider an unknown finite map, a set of agents and a limited communication range.

How to efficiently explore it ?
If we add wells
- How to explore it ?
- What is the minimal number of agents required to locate with certainty the well(s) and fully explore the map ?

Tips :
Start with one agent, then 2, 5 and beyond.
Prefer DFS to BFS.

3.2 Cooperative patrolling and pursuit-evasion games : Hunt the Golem(s)

Consider an unknown finite map, a set of agents and a limited communication range. We release the Golem.

What strategy can you propose to guarantee to block it on a given node (predetermined or not) in a finite time ?
- What is the minimal number of required agents ?
- Easy on a (finite) binary tree with unlimited communication range, it becomes a little bit more difficult when you set the communication range to 2, use a graph and a annoyingly clever Golem..
  - What is the minimal number of required agents on a graph ?
  - What are the best strategies ?

Tips :
Start with only one golem on a little binary tree
Draw a picture and remember that the golem will try to get behind your agents

3.3 Treasure(s) hunt and pickup and delivery problems

Consider an unknown map, a set of agents, a limited communication range and treasures to pick (and deliver).

How to maximise the treasures grabbed ?
- You will need to deal with interlocking
If the safes have opening conditions
- How to maximise the treasures grabbed ?
If we release the Golem, he will move the treasures (creating a dynamic environment)
- What strategy can you propose to maximise the treasures grabbed ?

Tips :
The Tanker(s) can be in area(s) around a strategical(s) node(s).

Of course, you can activate all of these options at the same time and let other teams compete against you as well..