An Example of Implementing a Complex Function As an Agent . 22

4. The Language-Independent Agent Architecture

4.1 A Model for the Language-Independent Agent Architecture

4.1.3 An Example of Implementing a Complex Function As an Agent . 22

Complex cross-application functionality also maps into agent. Section 3.1.2 contains a list of various functionality in Trinity. Figure 4.3, adapted from [19], shows the agent and the infrastructure it requires for the last function in that list, the model transformation into EMF format. The “Modeling UI” in Figure 4.3 means same as MMA in this thesis.

As before, the dashed line represents the route an agent travels during the model transformation. At ﬁrst, the agent is created as a result from user input in MMA.

The agent then reads the desired model into its data container at stage two. Then, a language-independent agent transportation occurs at third stage. In stage four, the actual model transformation into EMF format happens. The transformation utilizes Eclipse API for creating EMF objects, and uses agent’s data container as input. Finally, at stage ﬁve, the testing and validation tool is started and EMF model is passed to it. After the tool completes, it will display results to the user.

More detailed description is provided in [19].

4. The Language-Independent Agent Architecture 23

Figure 4.3: The applications, agent architecture infrastructure, agent, and tasks of the EMF model transformation.

4.2 Language-Independent Deﬁnition of the Agent State 4.2.1 The Formal Deﬁnition of the Agent State

The diﬀerent implementations, one for each language, of the agent architecture presented in this thesis must be compatible between language boundaries. Therefore, the full state of any agent must be deﬁned in a language-independent fashion. First the basic components used throughout deﬁnitions concerning agent state are deﬁned in Deﬁnition 4.2.1.

Deﬁnition 4.2.1. Let the set of all Unicode symbols [29] be Σ. Then 1. the set of all possibly usable task outputs isTO = Σ^∗,

2. the set of all possibly usable instance IDs isI = Σ⁺, and 3. the set of all possibly usable tags is T = Σ⁺×Σ⁺.

A function to retrieve agent state, when agent’s instance ID is given, is deﬁned next. This function might return diﬀerent results, depending at which point in lifecycle of agent the function is called, since the state of the agent most likely is changing as the agent is being executed. One way to map this function to reality is that this function is interpreted as the agent architecture. The function consumes the instance ID of the agent, and returns the state for such agent.

Deﬁnition 4.2.2 (The function to retrieve an agent state). To begin, a set of all possible execution states for any agent is deﬁned to beES, and a set of all possible data states for any agent is deﬁned to be DS. Let I be as in Deﬁnition 4.2.1 (2).

The function to retrieve an agent state with given agent instance ID is AS : I → {nil} ∪(ES ×DS). AS produces nil if the agent with given instance ID does not

4. The Language-Independent Agent Architecture 24

exist in agent architecture environment. However, if the agent exists, it will return the state of the agent as∈ES ×DS.

Next, constraints forES andDS are deﬁned, so that results produced byAS are meaningful. First, the constraints for a set of all possible execution states ES are deﬁned.

Deﬁnition 4.2.3 (Constraints for execution states). All elements of ES are of the form(V,E,v,ˆ VT,ET,EX), where

1. V is a set, and E ⊆V ×V. Together V and E are interpreted as a directed graph. This graph is called theagent task graph, since every vertex represents a single task to execute, and edges are transitions between tasks.

2. ˆv ∈ V is a root vertex of the task graph. It must hold that every vertex is reachable from the root vertex, that is,∀v ∈V : ˆv →^∗ v.

3. VT is a function such thatVT :V → P(T), whereT is as in Deﬁnition 4.2.1 (3), and ∀v ∈V : |VT(v)| <∞. The meaning of VT is that it returns a set of tags, which the agent task must have so that the task represented by this vertex gets executed.

4. ET is a function so thatET :E → P(TO), whereTO is as in Deﬁnition 4.2.1 (1), and ∀e ∈ E : |ET(e)| < ∞. The meaning of ET is that in order for the task represented by target vertex v_t to be executed, the task represented by source vertex v_s must return task output to_vs such thatto_vs ∈ET(v_s, v_t).

5. EX ∈ P(EP ×ST) is a set of the execution lifelines of the agent. The EP is a set of all possible paths starting at vˆ so that EP = {ep₀ep₁. . . ep_n | n ≥ 0∧ ep₀ = ˆv ∧ ∀i; 1 ≤ i ≤ n : (ep_i₋₁, ep_i) ∈ E}, and ST is a set of execution statuses such that ST = {executing,in_transition,stopped}. The meaning of EX is that it is a set of all current execution lifelines of the agent.

Additionally, the elements V,E,ˆv,VT, and ET are ﬁxed for each agent. That is, given speciﬁc agent instance ID, they must always stay the same regardless of the call time of AS.

Each time the task completes, the vertex representing this task is added to the path of the current execution lifeline. Furthermore, for allep_i+1 in all paths, the task executed by ep_i returned task output to so that to ∈ ET(ep_i, ep_i+1). This means that execution path must traverse only those edges, for which there exist task output inET.

4. The Language-Independent Agent Architecture 25

If EX = ∅, then agent execution is considered to be stopped. If there are many execution lifelines with status other than stopped, then it means that the lifelines are executingconcurrently. Next, the constraints for all possible data statesDS are deﬁned.

Deﬁnition 4.2.4 (Constraints for data states). All elements ofDS are of the form (aid, hlid,N,VN), where

1. aid ∈I, where I is as in Deﬁnition 4.2.1 (2), is the instance ID of the agent.

It must hold that AS(id)produces the state which has the instance IDaid of same value as id given to AS.

2. The hlid∈I is the instance ID of the home location of the agent,

3. N is a domain-speciﬁc set of names for domain-speciﬁc data contained by this agent. Formally,N ⊆Σ⁺ and |N|<∞.

4. VN is a function VN : N → LI. It returns the language-independent data structure for each name inN. All elements of LI are software objects.

Additionally, the elementsaid and hlid are constants for each agent.

4.2.2 Agent State During Transportation

In order to specify more concretely what sort of data is transported in agent trans-portation, a set of properties for agent state during transportation is deﬁned. First, we decompose this state into two parts: the execution state, and the data state.

Additionally, we assume that all areas, locations, and agents have an ID unique in the scope of their type. This means that an area and an agent may have the same IDs.

Data state — agent architecture related Agent instance ID The ID of the agent.

Home location ID The ID of location, where agent was originally created.

History information An ordered list, in ascending order, of history infor-mation items. The order is determined by completion time of each task.

After execution of a task has been completed, the history information item is appended to this set. This item is composed of the tags of the task, the ID of location, and an UTC time of completion of the task.

Destination

Criteria for area tags A set of tags that area must have in order to execute the next task. Optional.

4. The Language-Independent Agent Architecture 26

Criteria for area ID The ID that area must have in order to execute the next task. Optional.

Criteria for location tags A set of tags that location must have in order to execute the next task. Optional.

Criteria for location ID The ID that location must have in order to execute the next task. Optional.

Criteria for agent history True if the location ID may be in history information of this agent. Falseotherwise.

Data state — domain-speciﬁc

Data container The contents of the data container of the agent.

Execution state Task graph

Vertices A set of task graph vertices. Each vertex represents a task to be executed, and has an ID, unique in scope of this graph, and a set of tags. The actual task must have all the tags of this vertex to be executed.

Edges All the transitions between vertices. Each transition has the pre-vious and next vertices, and the task output (a single string), which the previous task must return in order the transition to be used.

Next task The ID of the vertex in task graph, which will be used to search for the next task to execute.

In the execution state, there is only one vertex for the task to be executed next.

This is because transportation occurs one execution lifeline at a time. Thus, only one task at a time may be executing at any point in the lifeline. There is a possibility for execution lifelines to perform forks (where a single lifeline spawns at least one new lifeline) andjoins (where at least two lifelines merge into one). Hence, there is a need to deﬁne what exactly happens during such join. That is, what to do when two lifelines with diﬀerent data container are merged? The answer for this question is not presented in this thesis, since the agent architecture is still in prototype stage.

4.3 Language-Independent Data Structures

4.3.1 Language-Independent Data Structure Types

In order to have a uniform way of handling the language-independent data in dif-ferent programming languages, we deﬁne here a set of language-independent data

4. The Language-Independent Agent Architecture 27

structure types. The types are further decomposed into atomic types and composed types. Each atomic type contains a single datum easily transformable and under-standable in a type systems of most programming languages. Correspondingly, each composed type is a container for other language-independent data structures, and also easily transformable and understandable in a type systems of most programming languages.

Next we list all language-independent data structure types required for agent architecture operation. Formally, we deﬁne types for the software objects in set LI deﬁned in Deﬁnition 4.2.4.

Atomic types

Int32 and Int64: data structures representing signed 32-bit and 64-bit in-tegers, respectively.

Real32 and Real64: data structures representing 32-bit and 64-bit ﬂoating point values [4], respectively.

String: data structure representing a string of n characters, where n ≥0.

Boolean: data structure having two possible values, either true or false.

Binary: data structure representing sequence of n binary numbers, where n ≥0.

Composed types

List: data structure representing an ordered list of n other data structures, where n≥0.

Map: data structure representing an unordered list of named other data structures. Each name is unique in scope of a single Map.

These data structure types are enough to present data common to all program-ming languages. However, the user of the agent architecture is able to extend these types and provide her own custom types.

4.3.2 Serialization Rules for Language Independent Data Structures

Each language-independent data structure type has a serialization rule. This rule istwo-way in the sense that unambiguous deserialization must be possible from the serialized format without any loss of data. The serialization format is binary in order to be as compact as possible. The rules have two variations: one variation should be used when schema is available during serialization of data structure, and the other

4. The Language-Independent Agent Architecture 28

variation should be used when such schema is not available during serialization of data structure.

Now follows a list of serialization rules for all deﬁned language-independent data structure types.

Int32 and Int64:

Serialized with schema: variable-length [2] ZigZag [1] encoding, in little-endian byte order.