Thread extraction for polyadic instruction sequences

Bergstra, J.A.; Middelburg, C.A.

Publication date
2009

Document Version
Submitted manuscript

Citation for published version (APA):

General rights
It is not permitted to download or to forward/distribute the text or part of it without the consent of the author(s) and/or copyright holder(s), other than for strictly personal, individual use, unless the work is under an open content license (like Creative Commons).

Disclaimer/Complaints regulations
If you believe that digital publication of certain material infringes any of your rights or (privacy) interests, please let the Library know, stating your reasons. In case of a legitimate complaint, the Library will make the material inaccessible and/or remove it from the website. Please Ask the Library: https://uba.uva.nl/en/contact, or a letter to: Library of the University of Amsterdam, Secretariat, Singel 425, 1012 WP Amsterdam, The Netherlands. You will be contacted as soon as possible.
Abstract. Instruction sequences are often fragmented. An important reason for instruction sequence fragmentation is that the execution architecture at hand to execute instruction sequences sets bounds to the size of instruction sequences. In this paper, we study instruction sequences that have been split into fragments. The purpose is to develop a theoretical understanding of this matter. The possible joint behaviours exhibited by a collection of fragments on execution are explained in terms of threads as considered in basic thread algebra. In this way, a setting is provided in which the slow-down results of instruction sequence fragmentation can be analysed.

Keywords: instruction sequence fragmentation, polyadic instruction sequence, thread extraction, basic thread algebra.


1 Introduction

In this paper, we consider fragmentation of sequential programs that take the form of instruction sequences. With that we carry on the line of research with which a start was made in [3]. This line of research concerns the development of a theoretical understanding of possible forms of sequential programs and associated ways of programming. It is a fact of life that instruction sequences are often fragmented. An important reason for instruction sequence fragmentation is that the execution architecture at hand to execute instruction sequences sets bounds to the size of instruction sequences. However, there may also be other reasons for instruction sequence fragmentation, for instance business economical reasons. In this paper, we look for a direct approach to the formalization of instruction sequences that have been split into fragments and we investigate their relationship with sequential programs.

The question is how a joint behaviour of the fragments in a collection of fragments is achieved. The view of this matter is that there can only be a single

* This research has been partly carried out in the framework of the Jacquard-project Symbiosis, which is funded by the Netherlands Organisation for Scientific Research (NWO).
fragment being executed at any stage, but the fragment in question may make any fragment in the collection the one being executed by means of a special instruction for switching over execution to another fragment. This does not fit in very well with the conception that the collection of fragments constitutes a sequential program. To our knowledge, a theoretical understanding of instruction sequences that have been split into fragments has not been developed. This has motivated us to take up this topic.

As the view is taken in the line of research carried on in this paper that the behaviour exhibited by a sequential program on execution is a thread as considered in basic thread algebra [3], so the view is taken here that the possible joint behaviours exhibited by the fragments on execution are threads as considered in basic thread algebra. In execution architectures, a fragment must be loaded in order to become the one being executed. Hence, making a fragment the one being executed can be looked upon as loading that fragment for execution.

The instruction sequences taken for fragments are called polyadic instruction sequences in this paper. We introduce polyadic instruction sequences in the setting of program algebra [3], an algebra of programs in which programs are looked upon as instruction sequences. In [3], a hierarchy of program notations rooted in program algebra is presented as well. Included in this hierarchy are very simple program notations which are close to existing assembly languages up to and including simple program notations that support structured programming by offering a rendering of conditional and loop constructs. All of these program notations are referred to in this paper, but only one of them is actually used. That program notation is introduced under the name PGLD in [3].

This paper is organized as follows. First, we review basic thread algebra, program algebra, and the program notation PGLD (Sections 2, 3, and 4). Next, we introduce polyadic instruction sequences in the setting of program algebra, explain the possible joint behaviours of a collection of polyadic instruction sequences using basic thread algebra, and give an example of the use of polyadic instruction sequences (Sections 5 and 6). Following this, we extend basic thread algebra to allow for threads to make use of services and give a description of instruction register file services (Sections 7 and 8). After that, we show that, for each possible joint behaviour of a collection of polyadic instruction sequences, a sequential program can be synthesized from the collection of polyadic instruction sequences that exhibits on execution essentially the behaviour in question by making use of an instruction register file service (Section 9). Finally, we make some concluding remarks (Section 10).

In this paper, we do not give models of BTA and its extensions. For the preferred models, called projective limit models, the reader is referred to [10].

1 In [3], basic thread algebra is introduced under the name basic polarized process algebra. Prompted by the development of thread algebra [6], which is a design on top of it, basic polarized process algebra has been renamed to basic thread algebra.
2 Basic Thread Algebra

In this section, we review BT A (Basic Thread Algebra). BT A is a form of process algebra which is concerned with the behaviour that sequential programs exhibit on execution. Those behaviours are called threads.

In BT A, it is assumed that fixed but arbitrary finite sets $A$ and $I$ with $A \cap I = \emptyset$ and $\tau \in I$ have been given. The members of $A$ are called basic actions and the members of $I$ are called internal actions. The members of $A \cup I$ are referred to as actions. In previous work, we take in essence the singleton set $\{\tau\}$ for $I$. The generalization made here permits internal actions with differences relevant for analysis to be distinguished.

A thread performs actions in a sequential fashion. Upon each action performed, a reply from the execution environment of the thread determines how it proceeds. The possible replies are $T$ and $F$. Performing an internal action, always leads to the reply $T$.

Although BT A is one-sorted, we make this sort explicit. The reason for this is that we will extend BT A with an additional sort in Section 7.

BT A has one sort: the sort $T$ of threads. To build terms of sort $T$, BT A has the following constants and operators:

- the deadlock constant $D : T$
- the termination constant $S : T$
- for each $a \in A \cup I$, the binary postconditional composition operator $\preceq a \succeq$:
  $T \times T \to T$.

We assume that there are infinitely many variables of sort $T$, including $x, y, z$. Terms of sort $T$ are built as usual (see e.g. [11, 12]). We use infix notation for the postconditional composition operator. We introduce basic action prefixing as an abbreviation: $a \circ p$ abbreviates $p \preceq a \succeq p$.

The thread denoted by a closed term of the form $p \preceq a \succeq q$ will first perform $a$, and then proceed as the thread denoted by $p$ if the reply from the execution environment is $T$ and proceed as the thread denoted by $q$ if the reply from the execution environment is $F$. The threads denoted by $D$ and $S$ will become inactive and terminate, respectively.

BT A has only one axiom. This axiom is given in Table 1. In this table, $\iota$ stands for an arbitrary member of $I$.

Notice that each closed BT A term denotes a thread that will become inactive or terminate after it has performed finitely many actions. Infinite threads can be described by guarded recursion.

A guarded recursive specification over BT A is a set of recursion equations $E = \{ X = t_X \mid X \in V \}$, where $V$ is a set of variables of sort $T$ and each $t_X$ is a BT A term of the form $D, S$ or $t \preceq a \succeq t'$ with $t$ and $t'$ that contain only variables

\[ x \preceq \iota \succeq y = x \preceq \iota \succeq x \top \]
Table 2. Axioms for guarded recursion
\[ \langle X | E \rangle = \langle t_X | E \rangle \] if \( X = t_X \in E \) \hspace{1cm} \text{RDP}
\[ E \Rightarrow X = \langle X | E \rangle \] if \( X \in V(E) \) \hspace{1cm} \text{RSP}

Table 3. Approximation induction principle
\[ \prod_{n \geq 0} \pi_n(x) = \pi_n(y) \Rightarrow x = y \] \hspace{1cm} \text{AIP}
\[ \pi_0(x) = D \] \hspace{1cm} \text{P0}
\[ \pi_{n+1}(S) = S \] \hspace{1cm} \text{P1}
\[ \pi_{n+1}(D) = D \] \hspace{1cm} \text{P2}
\[ \pi_{n+1}(x \preceq a \succeq y) = \pi_n(x) \preceq a \succeq \pi_n(y) \] \hspace{1cm} \text{P3}

from \( V \). We write \( V(E) \) for the set of all variables that occur in \( E \). We are only interested in models of BTA in which guarded recursive specifications have unique solutions, such as the projective limit model of BTA presented in [1]. A thread that is the solution of a finite guarded recursive specification over BTA is called a \textit{finite-state} thread.

For each guarded recursive specification \( E \) and each \( X \in V(E) \), we introduce a constant \( \langle X | E \rangle \) of sort \( T \) standing for the unique solution of \( E \) for \( X \). The axioms for these constants are given in Table 2. In this table, we write \( \langle t_X | E \rangle \) for \( t_X \) with, for all \( Y \in V(E) \), all occurrences of \( Y \) in \( t_X \) replaced by \( \langle Y | E \rangle \). \( X \), \( t_X \) and \( E \) stand for an arbitrary variable of sort \( T \), an arbitrary BTA term of sort \( T \) and an arbitrary guarded recursive specification over BTA, respectively. Side conditions are added to restrict what \( X \), \( t_X \) and \( E \) stand for.

We will use the following abbreviation: \( a^\omega \), where \( a \in \mathcal{A} \cup \mathcal{I} \), abbreviates \( \langle X | \{ X = a \circ X \} \rangle \).

We will write BTA+REC for BTA extended with the constants for solutions of guarded recursive specifications and axioms RDP and RSP.

Closed terms of sort \( T \) from the language of BTA+REC that denote the same infinite thread cannot always be proved equal by means of the axioms of BTA+REC. We introduce AIP (Approximation Induction Principle) to remedy this. AIP is based on the view that two threads are identical if their approximations up to any finite depth are identical. The approximation up to depth \( n \) of a thread is obtained by cutting it off after performing a sequence of actions of length \( n \). In AIP, the approximation up to depth \( n \) is phrased in terms of the unary \textit{projection} operator \( \pi_n : \mathcal{T} \to \mathcal{T} \). AIP and the axioms for the projection operators are given in Table 3. In this table, \( a \) stands for an arbitrary member of \( \mathcal{A} \cup \mathcal{I} \).

We will write BTA+REC+AIP for BTA+REC extended with the projection operators and the axioms from Table 3.
3 Program Algebra

In this section, we review PGA (ProGram Algebra). The perception of a program as a single-pass instruction sequence is the starting-point of PGA.

In PGA, it is assumed that a fixed but arbitrary set $\mathcal{A}$ of basic instructions has been given. PGA has the following primitive instructions:

- for each $a \in \mathcal{A}$, a plain basic instruction $a$;
- for each $a \in \mathcal{A}$, a positive test instruction $+a$;
- for each $a \in \mathcal{A}$, a negative test instruction $-a$;
- for each $l \in \mathbb{N}$, a forward jump instruction $\#l$;
- a termination instruction $!$.

We write $\mathcal{I}$ for the set of all primitive instructions.

The intuition is that the execution of a basic instruction $a$ produces either $T$ or $F$ at its completion. In the case of a positive test instruction $+a$, $a$ is executed and execution proceeds with the next primitive instruction if $T$ is produced. Otherwise, the next primitive instruction is skipped and execution proceeds with the primitive instruction following the skipped one. If there is no next instruction to be executed, deadlock occurs. In the case of a negative test instruction $-a$, the role of the value produced is reversed. In the case of a plain basic instruction $a$, execution always proceeds as if $T$ is produced. The effect of a forward jump instruction $\#l$ is that execution proceeds with the $l$-th next instruction. If $l$ equals $0$ or the $l$-th next instruction does not exist, deadlock occurs. The effect of the termination instruction $!$ is that execution terminates.

PGA has the following constants and operators:

- for each $u \in \mathcal{I}$, an instruction constant $u$;
- the binary concatenation operator $\cdot$;
- the unary repetition operator $\omega$.

We assume that there are infinitely many variables, including $x, y, z$. Terms are built as usual. We use infix notation for the concatenation operator and postfix notation for the repetition operator.

A closed PGA term is considered to denote a non-empty, finite or periodic infinite sequence of primitive instructions.\(^2\) Closed PGA terms are considered equal if they denote the same instruction sequence. The axioms for instruction sequence equivalence are given in Table 4. In this table, $n$ stands for an arbitrary natural number greater than $0$. For each PGA term $P$, the term $P^n$ is defined by induction on $n$ as follows: $P^1 = P$ and $P^{n+1} = P;P^n$. The equation $X^\omega = X;X^\omega$ is derivable. Each closed PGA term is derivably equal to one of the form $P$ or $P;Q^\omega$, where $P$ and $Q$ are closed PGA terms in which the repetition operator does not occur.

Each closed PGA term $P$ is considered to denote an instruction sequence of which the behaviour is a finite-state thread, called the thread produced by

\(^2\) A periodic infinite sequence is an infinite sequence with only finitely many sub-sequences.
Table 4. Axioms of PGA

\begin{align*}
(x; y); z & = x; (y; z) \quad \text{PGA1} \\
(x^n \omega) & = x^\omega \quad \text{PGA2} \\
x^n ; y & = x^n \quad \text{PGA3} \\
(x; y)^\omega & = x; (y; x)^\omega \quad \text{PGA4}
\end{align*}

Table 5. Defining equations for thread extraction operation of PGA

\begin{align*}
|a| & = a \circ D \\
|a; x| & = a \circ |x| \\
|+a| & = a \circ |x| \\
|+a; x| & = |x| \preceq a \succeq |\#2; x| \\
|--a| & = a \circ D \\
|--a; x| & = |\#2; x| \preceq a \succeq |x| \\
|D| & = D \\
|D| & = D \\
|\#l| & = D \\
|\#l; x| & = |\#l + 2; u; x| = |\#l + 1; x| \\
|\#l + 2; u; x| & = |\#l + 1; x| \\
|\#l + 2; u; x| & = |\#l + 1; x| \\
|\#l + 2; u; x| & = |\#l + 1; x| \\
|\#l + 2; u; x| & = |\#l + 1; x| \\
|\#l + 2; u; x| & = |\#l + 1; x| \\
|\#l + 2; u; x| & = |\#l + 1; x| \\
|\#l + 2; u; x| & = |\#l + 1; x| \\
|\#l + 2; u; x| & = |\#l + 1; x| \\
|\#l + 2; u; x| & = |\#l + 1; x| \\
|\#l + 2; u; x| & = |\#l + 1; x|
\end{align*}

The set \( A \) of basic instructions is taken for the set \( \mathcal{A} \) of basic actions. The thread extraction operation \( |\cdot| \) determines, for each closed PGA term \( P \), a closed BTA+REC term that denotes the thread produced by \( P \). The thread extraction operation is defined by the equations given in Table 5 (for \( a \in \mathcal{A} \), \( l \in \mathbb{N} \) and \( u \in \mathcal{J} \)) and the rule that \( |\#l; X| = D \) if \( \#l \) is the beginning of an infinite jump chain. This rule is formalized in e.g. [7].

The behaviour of each closed PGA term, is a thread that is definable by a finite guarded recursive specification over BTA. The other way round, each finite guarded recursive specification over BTA in which no internal actions occur can be translated into a closed PGA term of which the behaviour is the solution of the finite guarded recursive specification concerned.

Closed PGA terms are considered to denote programs and therefore they constitute an elementary program notation. Closed PGA terms are also called PGA programs.

In [3], a hierarchy of program notations rooted in PGA is presented. In this hierarchy, the program notations PGLA, PGLB, PGLC, PGLD, PGLDg, PGLE, and PGLS appear. PGLA programs are translated into PGA programs by means of a function \( \text{pga}2\text{pgla} \). PGLB programs are translated into PGLA programs by means of a function \( \text{pglb}2\text{pgla} \). PGLC programs are translated into PGLB programs by means of a function \( \text{pglc}2\text{pglb} \), etcetera. These functions are called projections in [3]. Each of them translates each of the programs from a program notation higher in the hierarchy into a program from a program notation lower in the hierarchy that exhibits the same behaviour on execution. Moreover, PGA programs are translated into PGLA programs by means of a function \( \text{pga}2\text{pgla} \). PGLA programs are translated into PGLB programs by means of a function \( \text{pgla}2\text{pglb} \). PGLB programs are translated into PGLC programs by means of a
function \textit{pglb2pglc}, etcetera. These functions are called embeddings in [3]. Each of them translates each of the programs from a program notation lower in the hierarchy into a program from a program notation higher in the hierarchy that exhibits the same behaviour on execution. We remark that there does not exist a function by which each PGLE program is translated into a PGLS program that exhibits the same behaviour on execution: PGLS is strictly weaker than PGLE.

In Section 5, we will take special versions of the above-mentioned program notations for the ones that may be used for fragments. Moreover, we will use the function \textit{pgla2pga} and in addition the functions \textit{pglb2pga}, \textit{pglc2pga}, . . . defined by \textit{pglb2pga}(P) = \textit{pgla2pga}(\textit{pglb2pgla}(P)) for all PGLB programs \( P \), \textit{pglc2pga}(P) = \textit{pglb2pga}(\textit{pglc2pglb}(P)) for all PGLC programs \( P \), etcetera. In Section 9, we will use the function \textit{pglc2pgld} and in addition the function \textit{pga2pglc} defined by \textit{pga2pglc}(P) = \textit{pglb2pglc}(\textit{pgla2pglb}(\textit{pga2pgla}(P))) for all PGA programs \( P \).

PGLC, PGLD and PGLS are the most interesting program notations of the ones mentioned above. PGLC and PGLD are close to existing assembly languages. The main difference between them is that PGLC has relative jump instructions and PGLD has absolute jump instructions. PGLS supports structured programming by offering a rendering of conditional and loop constructs instead of (unstructured) jump instructions.

4 The Program Notation PGLD

In this section, we review the program notation PGLD. This program notation is reviewed because it will be used later on in Sections 6 and 9.

In PGLD, like in PGA, it is assumed that there is a fixed but arbitrary finite set \( \mathfrak{A} \) of basic instructions. Again, the intuition is that the execution of a basic instruction \( a \) may modify a state and produces \( T \) or \( F \) at its completion.

PGLD has the following primitive instructions:

- for each \( a \in \mathfrak{A} \), a plain basic instruction \( a \);
- for each \( a \in \mathfrak{A} \), a positive test instruction \( +a \);
- for each \( a \in \mathfrak{A} \), a negative test instruction \( -a \);
- for each \( l \in \mathbb{N} \), an absolute jump instruction \( \#\#l \).

PGLD programs have the form \( u_1; \ldots; u_k \), where \( u_1, \ldots, u_k \) are primitive instructions of PGLD.

The plain basic instructions, the positive test instructions, and the negative test instructions are as in PGA. The effect of an absolute jump instruction \( \#\#l \) is that execution proceeds with the \( l \)-th instruction of the program concerned. If \( \#\#l \) is itself the \( l \)-th instruction, then deadlock occurs. If \( l \) equals 0 or \( l \) is greater than the length of the program, then termination occurs.

The function \textit{pgld2pga} from the set of all PGLD programs to the set of all PGA programs can be defined directly as follows:

\[
\text{pgld2pga}(u_1; \ldots; u_k) = (\psi_1(u_1); \ldots; \psi_k(u_k); !; !; \#0; \#0)^\omega,
\]


where the auxiliary functions $\psi_j$ from the set of all primitive instructions of PGLD to the set of all primitive instructions of PGA are defined as follows $(1 \leq j \leq k)$:

\[
\begin{align*}
\psi_j(##l) &= \#l - j & \text{if } j \leq l \leq k, \\
\psi_j(##l) &= \#k + 2 - (j - l) & \text{if } 0 < l < j, \\
\psi_j(##l) &= ! & \text{if } l = 0 \lor l > k, \\
\psi_j(u) &= u & \text{if } u \text{ is not a jump instruction}.
\end{align*}
\]

5 Polyadic Instruction Sequences

In this section, we look for a direct approach to the formalization of instruction sequences that have been split into fragments. The fragments resulting from a splitting are considered instruction sequences with special instructions for switching over execution from one fragment to another. The instruction sequences concerned are called polyadic instruction sequences.

It is assumed that a special version of PGLA, PGLB, PGLC, PGLD, PGLDg, PGLE or PGLS is used for each polyadic instruction sequence. Moreover, it is assumed that a collection of polyadic instruction sequences between which execution can be switched takes the form of a sequence, called a polyadic instruction sequence vector, in which each polyadic instruction sequence is coupled with the program notation used for it.

Our general view on the way of achieving a joint behaviour of the polyadic instruction sequences in a polyadic instruction sequence vector is as follows:

- there can only be a single polyadic instruction sequence being executed at any stage;
- the polyadic instruction sequence in question may make any polyadic instruction sequence in the vector the one being executed;
- making another polyadic instruction sequence the one being executed is effected by executing a special instruction for switching over execution;
- any polyadic instruction sequence can be taken for the one being executed initially.

In addition to special instructions for switching over execution, polyadic instruction sequences may contain special instructions for putting instructions into instruction registers and special instructions which are actually instruction place-holders: each of them is replaced by the instruction contained in one of the instruction registers on making a polyadic instruction sequence the one being executed. The presence of special instructions of the latter kind turns a polyadic instruction sequence into a parameterized instruction sequence of which the parameters are filled in each time it is made the one being executed. This feature accounts for the use of the prefix polyadic. Its merit is primarily that it allows for execution to proceed in effect from different positions each time a polyadic instruction sequence is loaded for execution. An example of this is given in Section 6.
We take the line that different program notations can be used for different polyadic instruction sequences in a polyadic instruction sequence vector. On making a polyadic instruction sequence in the vector the one being executed, it is considered to be translated into a PGA\textsubscript{p} program.

PGA\textsubscript{p} is a variant of PGA in which the above-mentioned special instructions are incorporated. In PGA\textsubscript{p}, it is assumed that there is a fixed but arbitrary finite set \( \mathbb{A}_c \) of \textit{core basic instructions}. In PGA\textsubscript{p}, a basic instruction is either a core basic instruction or a supplementary basic instruction. PGA\textsubscript{p} has the following \textit{core primitive instructions}:

- for each \( a \in \mathbb{A}_c \), a \textit{plain basic instruction} \( a \);
- for each \( a \in \mathbb{A}_c \), a \textit{positive test instruction} \(+a\)
- for each \( a \in \mathbb{A}_c \), a \textit{negative test instruction} \(-a\)
- for each \( l \in \mathbb{N} \), a \textit{forward jump instruction} \(#l\)
- a \textit{termination instruction} \(!\).

We write \( \mathcal{I}_c \) for the set of all core primitive instructions. The core primitive instructions of PGA\textsubscript{p} are the counterparts of the primitive instructions of PGA.

PGA\textsubscript{p} has the following \textit{supplementary basic instructions}:

- for each \( i \in \mathbb{N} \), a \textit{switch-over instruction} ###\( i \);
- for each \( i \in \mathbb{N} \) and \( u \in \mathcal{I}_c \), a \textit{put instruction} \$put\( i; u\);
- for each \( i \in \mathbb{N} \), a \textit{get instruction} \$get\( i\).

We write \( \mathcal{A}_s \) for the set of all supplementary basic instructions. In the presence of a polyadic instruction sequence vector, a switch-over instruction ###\( i \) is the instruction for switching over execution to the \( i \)-th polyadic instruction sequence in the vector. A put instruction \$put\( i; u\) is the instruction for putting instruction \( u \) in the instruction register with number \( i \). A get instruction \$get\( i\) is the instruction place-holder of which each occurrence in a polyadic instruction sequence is replaced by the contents of the instruction register with number \( i \) on switching over execution to that polyadic instruction sequence.

The supplementary basic instructions of PGA\textsubscript{p} can be viewed as built-in basic instructions. However, as laid down below, supplementary basic instructions do not occur in positive or negative test instructions. Thus, the core primitive instructions and supplementary basic instructions make up the primitive instructions of PGA\textsubscript{p}.

PGA\textsubscript{p} has the following constants and operators:

- for each \( u \in \mathcal{I}_c \cup \mathcal{A}_s \), an \textit{instruction constant} \( u \);
- the binary \textit{concatenation} operator \( \cdot \);
- the unary \textit{repetition} operator \( ^\omega \).

The axioms of PGA\textsubscript{p} are the same as the axioms of PGA. Suppose that in PGA the restriction is dropped that \( \mathbb{A} \) must be a finite set. Then PGA\textsubscript{p} can be viewed as the specialization of PGA obtained by taking the set \( \mathbb{A}_c \cup \mathcal{A}_s \) for \( \mathbb{A} \) and excluding terms in which basic instructions from \( \mathcal{A}_s \) occur in positive or negative test instructions. Henceforth, we actually drop the restriction
that $\mathfrak{A}$ must be a finite set. This simplifies the definitions of the different program notations that can be used for polyadic instruction sequences and also enables the use of the functions $\text{pga2pga}$, $\text{pglb2pga}$, etcetera for translating programs in those program notations into PGA programs.

The different program notations that can be used for polyadic instruction sequences are $\text{PGLA}_p$, $\text{PGLB}_p$, $\text{PGLC}_p$, $\text{PGLD}_p$, $\text{PGLDg}_p$, $\text{PGLE}_p$, and $\text{PGLS}_p$. The set of all $\text{PGLA}_p$ programs is the subset of the set of all $\text{PGLA}$ programs, taking the set $\mathfrak{A}_c \cup \mathfrak{A}_s$ for $\mathfrak{A}$, in which the basic instructions from $\mathfrak{A}_s$ do not occur in positive or negative test instructions. The other program notations are defined similarly. If the set $\mathfrak{A}_c \cup \mathfrak{A}_s$ is taken for $\mathfrak{A}$, the function $\text{pga2pga}$ translates each $\text{PGLA}_p$ program into a PGA $p$ program that exhibits the same behaviour on execution. Similar remarks apply to the other program notations.

A polyadic instruction sequence is either a $\text{PGLA}_p$ program, a $\text{PGLB}_p$ program, a $\text{PGLC}_p$ program, a $\text{PGLD}_p$ program, a $\text{PGLDg}_p$ program, a $\text{PGLE}_p$ program or a $\text{PGLS}_p$ program.

A polyadic instruction sequence vector is a sequence of pairs consisting of a polyadic instruction sequence and a member of the set \{A, B, C, D, Dg, E, S\} of program notation indices.

Let $\alpha$ be a polyadic instruction sequence vector, let $P_1, \ldots, P_n$ and $c_1, \ldots, c_n$ be polyadic instruction sequences and program notation indices, respectively, such that $\alpha = \langle (P_1, c_1) \rangle \cdots \langle (P_n, c_n) \rangle$ and let $i \in [1, n]$. Then we write $pg(\alpha, i)$ and $pgn(\alpha, i)$ for $P_i$ and $c_i$, respectively.

Let $\alpha$ be a polyadic instruction sequence vector of length $n$, and let $i \in [1, n]$. Then program notation index $pgn(\alpha, i)$ indicates which program notation is used for polyadic instruction sequence $pg(\alpha, i)$. $A$ stands for $\text{PGLA}_p$, $B$ stands for $\text{PGLB}_p$, etcetera. The program notation used is made explicit because it cannot always be determined uniquely from the polyadic instruction sequence concerned, whereas the behaviour that this polyadic instruction sequence exhibits on execution may be different for each of the program notations in question.

The set of instruction registers that contain an instruction and the contents of each of those registers matter when a polyadic instruction sequence is made the one being executed. That makes us introduce the notion of an instruction register file state and special notation relating to this notion.

An instruction register file state is a function $\sigma : I \rightarrow \mathfrak{I}_c$, where $I$ is a finite subset of $\mathbb{N}$.

Let $p$ be a $\text{PGA}_p$ program and $\sigma$ be an instruction register file state. Then we write $p[\sigma]$ for $p$ with, for all $i \in \text{dom}(\sigma)$, all occurrences of $\text{get}:i$ in $p$ replaced by $\sigma(i)$.

Let $i, n \in \mathbb{N}$ be such that $1 \leq i \leq n$, let $\alpha$ be a polyadic instruction sequence vector of length $n$, and let $\sigma$ be an instruction register file state. Then we write $\sigma \cdots \sigma$ for the concatenation of finite sequences $\sigma$ and $\sigma'$, and $\text{len}(\sigma)$ for the length of finite sequence $\sigma$.

3 We write $D^*$ for the set of all finite sequences with elements from set $D$. We use the following notation for finite sequences: $\langle \rangle$ for the empty sequence, $\langle d \rangle$ for the sequence having $d$ as sole element, $\sigma \cdots \sigma'$ for the concatenation of finite sequences $\sigma$ and $\sigma'$, and $\text{len}(\sigma)$ for the length of finite sequence $\sigma$. 4
valid(\alpha, i, \sigma) to indicate that instructions of the form \$get:i do not occur in 
\text{prj}_{pqn(\alpha, i)}(pq(\alpha, i))[\sigma].

We also use special notation relating to the program notation indices. Let \( c \) be a program notation index. Then we write \text{prj}_c for the projection \text{pgla2pga} if \( c = A \), the projection \text{pglb2pga} if \( c = B \), etcetera.

An obvious choice of the thread extraction operation of \( \text{PGA}_p \) is the thread extraction operation of \( \text{PGA} \), taking the set \( A_c \) \( \cup \) \( A_s \) for \( A \), restricted to the set of closed terms of \( \text{PGA}_p \). This thread extraction operation is considered not to be the proper one, because it treats the supplementary basic instructions as arbitrary basic instructions and thus disregards the fixed effects that they produce on execution. Moreover, this thread extraction operation requires that in BTA the restriction is dropped that \( A \) must be a finite set.

As regards the proper thread extraction for \( \text{PGA}_p \), the idea is that it yields, for each \( \text{PGA}_p \) program \( P \), a function that gives, for each polyadic instruction sequence vector \( \alpha \), the thread that is the joint behaviour of \( P \) and the polyadic instruction sequences in \( \alpha \) if \( P \) is the polyadic instruction sequence being executed initially. Because this behaviour depends upon the set of instruction registers that contain an instruction and the contents of each of those registers, we need a thread extraction operation for each instruction register file state.

For each instruction register file state \( \sigma \), we introduce the thread extraction operation \( \mid_{\sigma} \). These thread extraction operations are defined by the equations given in Table 6 (for \( a \in \mathbb{A}, l, i \in \mathbb{N}, u \in I_c \cup A_u \) and \( v \in I_c \)) and the rule that \( \mid_{\sigma}(\alpha) = D \) if \( \#l \) is the beginning of an infinite jump chain. Here, it is assumed that \( gl \in I \). The internal action \( gl \) (generate and load) represents the internal activity involved in switching over execution. The internal actions \( tau \) and \( gl \) are distinguished because \( tau \) is considered to represent a negligible internal activity, whereas \( gl \) is considered to represent a substantial internal activity.

We can couple nominal indices as labels with some of the polyadic instruction sequences in a polyadic instruction sequence vector. This would permit the use of alternative switch-over instructions with nominal indices instead of ordinal indices, like with the goto instructions from PGLDg. In the notational style of [2], the form of those alternative switch-over instructions would be ###\[i\].

### Example

In this section, we consider the splitting of a PGLD program \( P \) of 10000 instructions into two fragments.

We write \( \nu_1(l) \) for the number of absolute jump instructions ###\(l'\) with \( l' > 5000 \) from position 1 up to position \( l \) and \( \nu_2(l) \) for the number of absolute jump instructions ###\(l'\) with \( l' \leq 5000 \) from position 5001 up to position \( l \).

The polyadic instruction sequence \( P' \) corresponding to the first half of \( P \) is obtained from the first half of \( P \) as follows:

- the instruction \$get:1 is prefixed to it;
Table 6. Defining equations for thread extraction operations of PGA

| $\alpha \,|\, \sigma (\alpha) = a \circ D$ |
| $\alpha \,|\, x_\sigma (\alpha) = a \circ |x|_\sigma (\alpha)$ |
| $+$ $\alpha \,|\, x_\sigma (\alpha) = a \circ D$ |
| $-$ $\alpha \,|\, x_\sigma (\alpha) = |x|_\sigma (\alpha) \leq a \geq |\# l \,|\, x_\sigma (\alpha)$ |
| $+$ $\alpha \,|\, x_\sigma (\alpha) = a \circ D$ |
| $-$ $\alpha \,|\, x_\sigma (\alpha) = |\# l \,|\, x_\sigma (\alpha) \leq a \geq |x|_\sigma (\alpha)$ |
| $\# 1 \,|\, x_\sigma (\alpha) = |x|_\sigma (\alpha)$ |
| $\# 0 \,|\, x_\sigma (\alpha) = D$ |
| $\# l + 2 ; \# l_\sigma (\alpha) = D$ |
| $\# l + 2 ; \# l_\sigma (\alpha) = |\# l + 1 ; x_\sigma (\alpha)$ |
| $!! \,|\, \sigma (\alpha) = S$ |
| $! \,|\, x_\sigma (\alpha) = S$ |

- each absolute jump instructions $\# l$ with $l \leq 5000$ is replaced by the absolute jump instructions $\# l'$, where $l' = l + \nu_1 (l) + 1$;
- each absolute jump instructions $\# l$ with $l > 5000$ is replaced by the instruction sequence $\texttt{put}_2: \# l' ; \# l'$, where $l' = (l - 5000) + \nu_2 (l - 5000)$;

and the polyadic instruction sequence $P''$ corresponding to the second half of $P$ is obtained from the second half of $P$ as follows:

- the instruction $\texttt{get}_2$ is prefixed to it;
- each absolute jump instructions $\# l$ with $l > 5000$ is replaced by the absolute jump instructions $\# l'$, where $l' = (l - 5000) + \nu_2 (l - 5000) + 1$;
- each absolute jump instructions $\# l$ with $l \leq 5000$ is replaced by the instruction sequence $\texttt{put}_1: \# l' ; \# l'$, where $l' = l + \nu_1 (l)$. 

12
Notice that the positions occurring in jump instructions are adapted to the prefixing of a get instruction to each half of \( P \) and the replacement of each jump instructions that gives rise to a jump into the other half of \( P \) by two instructions.

For any instruction register file state \( \sigma \), we have that \(|P|\) coincides with \([\text{put}:1;\#1;\#\#1;1]|\langle \langle P', D \rangle \rangle \sim \langle \langle P'', D \rangle \rangle\) after the presence of the internal actions \( \tau \) and \( \eta \) in the latter behaviour has been concealed. In Section 7, we will introduce operators to conceal the presence of internal actions.

In this section, we have illustrated by means of an example that splitting a program into fragments is relatively simple. In Section 9, we will show that synthesizing a program from a collection of fragments is fairly complicated.

### 7 Threads-Services Interaction and Abstraction

A thread may make use of services. That is, it may perform certain actions for the purpose of having itself affected by a service that takes them as commands to be processed. The processing of an action may involve a change of state of the service and at completion of the processing of the action the service returns a reply value to the thread. The reply value determines how the thread proceeds. In this section, we review the use operators, which are concerned with threads making such use of services.\(^4\) We also introduce abstraction operators. They serve for concealment of the presence of internal actions, which arise among other things from the use operators. Both use operators and abstraction operators will be used later on in Section 9.

It is assumed that a fixed but arbitrary finite set \( \mathcal{F} \) of foci and a fixed but arbitrary finite set \( \mathcal{M} \) of methods have been given. Each focus plays the role of a name of a service provided by the execution environment that can be requested to process a command. Each method plays the role of a command proper. For the set \( \mathcal{A} \) of basic actions, we take the set \( \{f.m \mid f \in \mathcal{F}, m \in \mathcal{M}\} \). Performing a basic action \( f.m \) is taken as making a request to the service named \( f \) to process command \( m \).

A service \( H \) consists of

- a set \( S \) of states;
- an effect function \( \text{eff} : \mathcal{M} \times S \rightarrow S \);
- a yield function \( \text{yld} : \mathcal{M} \times S \rightarrow \{T,F,B\} \);
- an initial state \( s_0 \in S \);

satisfying the following condition:

\[
\forall m \in \mathcal{M}, s \in S \cdot (\text{yld}(m, s) = B \Rightarrow \forall m' \in \mathcal{M} \cdot \text{yld}(m', \text{eff}(m, s)) = B).
\]

The set \( S \) contains the states in which the service may be, and the functions \( \text{eff} \) and \( \text{yld} \) give, for each method \( m \) and state \( s \), the state and reply, respectively,

\(^4\) This version of the use mechanism was first introduced in [6]. In later papers, it is also called thread-service composition.
Table 7. Axioms for use operators

<table>
<thead>
<tr>
<th>Axiom</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>$S/f_H = S$</td>
<td>TSU1</td>
</tr>
<tr>
<td>$D/f_H = D$</td>
<td>TSU2</td>
</tr>
<tr>
<td>$(i \circ x)/f_H = i \circ (x/f_H)$</td>
<td>TSU3</td>
</tr>
<tr>
<td>$(x \leq g.m \geq y)/f_H = (x/f_H) \leq g.m \geq (y/f_H)$ if $f \neq g$</td>
<td>TSU4</td>
</tr>
<tr>
<td>$(x \leq f.m \geq y)/f_H = \tau \circ (x/f_H \frac{\partial}{\partial m} H)$ if $H((m)) = T$</td>
<td>TSU5</td>
</tr>
<tr>
<td>$(x \leq f.m \geq y)/f_H = \tau \circ (y/f_H \frac{\partial}{\partial m} H)$ if $H((m)) = F$</td>
<td>TSU6</td>
</tr>
<tr>
<td>$(x \leq f.m \geq y)/f_H = D$ if $H((m)) = B$</td>
<td>TSU7</td>
</tr>
</tbody>
</table>

that result from processing $m$ in state $s$. By the condition imposed on services, once the service has returned $B$ as reply, it keeps returning $B$ as reply.

Let $H = (S, eff, yld, s_0)$ be a service and let $m \in M$. Then the derived service of $H$ after processing $m$, written $\frac{\partial}{\partial m} H$, is the service $(S, eff, yld, eff(m, s_0))$; and the reply of $H$ after processing $m$, written $H(m)$, is $yld(m, s_0)$.

When a thread makes a request to the service to process $m$:

- if $H(m) \neq B$, then the request is accepted, the reply is $H(m)$, and the service proceeds as $\frac{\partial}{\partial m} H$;
- if $H(m) = B$, then the request is rejected and the service proceeds as a service that rejects any request.

We introduce the sort $S$ of services and, for each $f \in F$, the binary use operator $p/f_H : T \times S \rightarrow T$. The axioms for these operators are given in Table 7. In this table, $\iota$ stands for an arbitrary member of $I$, $f$ and $g$ stand for arbitrary foci from $F$, and $m$ stands for an arbitrary method from $M$.

Intuitively, $p/f_H$ is the thread that results from processing all basic actions performed by thread $p$ that are of the form $f.m$ by service $H$. When a basic action of the form $f.m$ performed by thread $p$ is processed by service $H$, it is turned into the internal action $\tau$ and postconditional composition is removed in favour of action prefixing on the basis of the reply value produced. This intuition is covered by axioms TSU5 and TSU6. Axioms TSU3 and TSU4 express that internal actions and basic actions of the form $g.m$ with $f \neq g$ are not processed. Axiom TSU7 expresses that deadlock takes place when a basic action to be processed is not accepted.

Let $T$ stand for either BTA, BTA+REC or BTA+REC+AIP. Then we will write $T+TSU$ for $T$, taking the set $\{f.m | f \in F, m \in M\}$ for $A$, extended with the use operators and the axioms from Table 7.

Both basic actions and internal actions are actions whose presence matters. For each $\iota \in I$, we introduce the unary abstraction operator $\tau_{\iota} : T \rightarrow T$ to conceal the presence of internal action $\iota$ in the case where its presence does not matter after all.

The axioms for the abstraction operators are given in Table 8. In this table, $a$ stands for an arbitrary action from $A \cup I$. 
Table 8. Axioms for abstraction operators

<table>
<thead>
<tr>
<th>Axiom</th>
<th>Symbol</th>
</tr>
</thead>
<tbody>
<tr>
<td>$\tau_i(S) = S$</td>
<td>TT1</td>
</tr>
<tr>
<td>$\tau_i(D) = D$</td>
<td>TT2</td>
</tr>
<tr>
<td>$\tau_i(\iota \circ x) = \tau_i(x)$</td>
<td>TT3</td>
</tr>
<tr>
<td>$\tau_i(x \sqsubseteq a \sqsupseteq y) = \tau_i(x) \sqsubseteq a \sqsupseteq \tau_i(y)$ if $a \neq \iota$</td>
<td>TT4</td>
</tr>
<tr>
<td>$\bigwedge_{n \geq 0} \tau_i(\pi_n(x)) = \tau_i(\pi_n(y)) \Rightarrow \tau_i(x) = \tau_i(y)$</td>
<td>TT5</td>
</tr>
</tbody>
</table>

A main difference between the version of the use mechanism introduced here and the version of the use mechanism introduced in [9] is that the former version does not incorporate abstraction and the latter version incorporates abstraction.

Let $T$ stand for either $\text{BTA}$, $\text{BTA+REC}$, $\text{BTA+REC+AIP}$, $\text{BTA+TSU}$, $\text{BTA+REC+TSU}$ or $\text{BTA+REC+AIP+TSU}$. Then we will write $T+\text{ABSTR}$ for $T$ extended with the abstraction operators and the axioms from Table 8.

For each $\iota \in \mathcal{I}$, the equation $\tau_i(\iota x) = D$ is derivable from the axioms of $\text{BTA+REC+AIP+ABSTR}$.

8 Instruction Register File Services

In this section, we describe services that make up register files with a finite set of registers that can contain instructions from a finite set of core primitive instructions. These services will be used in Section 9.

It is assumed that a fixed but arbitrary set $I \subseteq \mathbb{N}$ such that $I = [1,n]$ for some $n \in \mathbb{N}$ and a fixed but arbitrary finite set $U \subseteq \mathcal{I}_c$ have been given. The set $I$ is considered to contain the positions of the registers in the instruction register file and the set $U$ is considered to contain the instructions that can be put in those registers.

We write $\text{IRFS}$ for the set $\bigcup_{I' \subseteq I} I' \rightarrow U$. The members of $\text{IRFS}$ are considered to be the possible instruction register file states. It is assumed that a fixed but arbitrary bijection $\theta : \text{IRFS} \rightarrow [1, \text{card(IRFS)}]$ has been given.

The instruction register file services accept the following methods:

- for each $i \in I$ and $u \in U$, a register put method $\text{put:i:u}$;
- for each $j \in \text{rng}(\theta)$, a register file test method $\text{eq:j}$.

We write $\mathcal{M}_{\text{IRF}}$ for the set $\{\text{put:i:u} \mid i \in I \land u \in U\} \cup \{\text{eq:j} \mid j \in \text{rng}(\theta)\}$.

It is assumed that $\mathcal{M}_{\text{IRF}} \subseteq \mathcal{M}$.

The methods accepted by instruction register file services can be explained as follows:

- $\text{put:i:u}$: the contents of register $i$ becomes instruction $u$ and the reply is $T$;
- $\text{eq:j}$: if the state of the instruction register file equals $\theta^{-1}(j)$, then nothing changes and the reply is $T$; otherwise nothing changes and the reply is $F$.
Let $s \in \text{IRFS} \cup \{\uparrow\}$, where $\uparrow \notin \text{IRFS}$. Then the instruction register file service with initial state $s$, written $\text{IRFS}_s$, is the service $(\text{IRFS}, \text{eff}, \text{yld}, s)$, where the functions $\text{eff}$ and $\text{yld}$ are defined as follows ($\sigma \in \text{IRFS} \cup \{\uparrow\}$):

\[
\begin{align*}
\text{eff} \left(\text{put}:i:u, \sigma\right) &= \sigma \oplus [i \mapsto u], \\
\text{eff} \left(\text{eq}:j, \sigma\right) &= \sigma, \\
\text{eff} \left(m, \sigma\right) &= \uparrow \text{ if } m \notin \text{M}_{irf}, \\
\text{eff} \left(m, \uparrow\right) &= \uparrow,
\end{align*}
\]

\[
\begin{align*}
\text{yld} \left(\text{put}:i:n, \sigma\right) &= T, \\
\text{yld} \left(\text{eq}:j, \sigma\right) &= T \text{ if } \theta(\sigma) = j, \\
\text{yld} \left(m, \sigma\right) &= B \text{ if } m \notin \text{M}_{irf}, \\
\text{yld} \left(m, \uparrow\right) &= B.
\end{align*}
\]

9 Program Synthesis

In order to establish a connection between collections of fragments and sequential programs, we show in this section that, for each possible joint behaviour of a collection of fragments, a sequential program can be synthesized from the collection of fragments that exhibits on execution essentially the behaviour in question by making use of an instruction register file service. More precisely, we show that, for each PGA$_p$ program $P$ and polyadic instruction sequence vector $\alpha$, a PGA$_p$ program $P'$ can be synthesized from $P$ and $\alpha$ such that, for all relevant instruction register file states $\sigma$, $|P'|_{\text{irf}}$ coincides with $|P|_{\sigma}(\alpha)$ after the presence of the internal actions $\tau$ and $\text{gl}$ has been concealed.

Let $P$ be a PGA$_p$ program and $\alpha$ be a polyadic instruction sequence vector. The general idea is that:

- each polyadic instruction sequence in $\alpha$ is translated into a PGA$_p$ program and an appropriate finite collection of instances of this PGA$_p$ program in which occurrences of get instructions are replaced by core primitive instructions is generated;
- $P$ and all the generated programs are translated into PGLC$_p$ programs and these PGLC$_p$ programs are concatenated;
- the resulting PGLC$_p$ program is translated into a PGLD$_p$ program and this program is translated into a PGLD program by replacing all occurrences of the supplementary instructions by core primitive instructions as follows:
  - a switch-over instruction $###i$ is replaced by an absolute jump instruction whose effect is a jump to the beginning of an appended instruction sequence whose execution leads, after the state of the instruction register file has been found by a linear search, to a jump to the beginning of the right instance of the PGA$_p$ program that corresponds to the $i$th polyadic instruction sequence in $\alpha$;
  - a put instruction $\text{put}:i:u$ is simply replaced by the plain basic instruction $\text{irf. put}:i:u$;

\footnote{We use the following notation for functions: $[]$ for the empty function; $[d \mapsto r]$ for the function $f$ with $\text{dom}(f) = \{d\}$ such that $f(d) = r$; and $f \oplus g$ for the function $h$ with $\text{dom}(h) = \text{dom}(f) \cup \text{dom}(g)$ such that, for all $d \in \text{dom}(h)$, $h(d) = f(d)$ if $d \notin \text{dom}(g)$ and $h(d) = g(d)$ otherwise.}
- a get instruction $\text{get} : i$ is simply replaced by the absolute jump instruction whose effect is a jump to the position of the instruction itself.

A collection of instances of the PGA program corresponding to a polyadic instruction sequence in $\alpha$ is considered appropriate if it includes all instances that may become the one being executed. $P$ and all the generated programs are translated into PGLC programs because PGLC programs are relocatable: they can be concatenated without disturbing the meaning of jump instructions. The PGLC program resulting from the concatenation is translated into a PGLD program before the supplementary instructions are replaced because the replacement of a switch-over instruction by an absolute jump instruction is simpler than its replacement by a relative jump instruction.

Following the general idea outlined above, we define a function $\text{pgap2pgld}$ that yields, for each PGA program $P$, a function that gives, for each polyadic instruction sequence vector $\alpha$, a PGLD program $P'$ such that, for each relevant instruction register file service state $\sigma$, $|\text{pgld2pga}(P')|_{IRF(\sigma)}$ coincides with $|P|_{\sigma}(\alpha)$ after the presence of the internal actions tau and gl has been concealed.

The function $\text{pgap2pgld}$ from the set of all PGA programs to the set all functions from the set of all polyadic instruction sequence vectors to the set of all PGLD programs is defined as follows:

$$\text{pgap2pgld}(x)(\alpha) =
\text{translate}(\text{pglc2pgld}(\text{expand}(x)(\alpha))) ;
+\text{irf.eq:1} ; \#l_1,1 ; \ldots ; +\text{irf.eq:n'} ; \#l_{1,n'} ;
\vdots
+\text{irf.eq:1} ; \#l_{n,1} ; \ldots ; +\text{irf.eq:n'} ; \#l_{n,n'},$$

where $n = \text{len}(\alpha)$, $n' = \text{max(rng}(\theta))$, the function $\text{expand}$ from the set of all PGA programs to the set all functions from the set of all polyadic instruction sequence vectors to the set of all PGLC programs is defined as follows:

$$\text{expand}(x)(\alpha) =
\text{pga2pglc}(x) ;
\text{pga2pglc}(\text{gen}(\alpha, 1, \theta^{-1}(1))) ; \ldots ; \text{pga2pglc}(\text{gen}(\alpha, 1, \theta^{-1}(n'))) ;
\vdots
\text{pga2pglc}(\text{gen}(\alpha, n, \theta^{-1}(1))) ; \ldots ; \text{pga2pglc}(\text{gen}(\alpha, n, \theta^{-1}(n'))),$$

where $n = \text{len}(\alpha)$, $n' = \text{max(rng}(\theta))$, and the function $\text{gen}$ from the set of all polyadic instruction sequence vectors, the set of all natural numbers and the set of all instruction register file states to the set of all PGA programs is defined as follows:

$$\text{gen}(\alpha, i, \sigma) = \text{prj}_{\text{pgn}(\alpha, i)}(\text{pg}(\alpha, i))|\sigma| \text{ if } 1 \leq i \leq \text{len}(\alpha) \land \text{valid}(\alpha, i, \sigma),$$
$$\text{gen}(\alpha, i, \sigma) = \#0 \text{ if } 1 \leq i \leq \text{len}(\alpha) \land \neg \text{valid}(\alpha, i, \sigma),$$
$$\text{gen}(\alpha, i, \sigma) = ! \text{ if } i = 0 \lor i > \text{len}(\alpha),$$
the function \textit{translate} from the set of all PGLD_{p} programs to the set of all PGLD programs is defined as follows:

\[
\text{translate}(u_1; \ldots; u_k) = \psi_1(u_1) ; \ldots ; \psi_1(u_k),
\]

where the functions \(\psi_j\) from the set of all primitive instructions of PGLD_{p} to the set of all primitive instructions of PGLD are defined as follows \((1 \leq j \leq k)\):

\[
\begin{align*}
\psi_j(#\#i) &= #\#i, & &\text{if } 1 \leq i \leq \text{len}(\alpha), \\
\psi_j(#\#i) &= !, & &\text{if } i = 0 \lor i > \text{len}(\alpha), \\
\psi_j(\$\text{put}::i:\!u) &= \text{irf}.\text{put}::i:\!u, \\
\psi_j(\$\text{get}::i) &= #\#j, \\
\psi_j(u) &= u & &\text{if } u \text{ is not a supplementary basic instruction},
\end{align*}
\]

where for each \(i \in [1, \text{len}(\alpha)]\):

\[
l_i = \text{len}(\text{pga2pglc}(x)) + \sum_{h \in [1, \text{len}(\alpha)], h' \in \text{rng}(\theta)} \text{len}(\text{pga2pglc}(\text{prj_pgn}(\alpha, h)(\text{pg}(\alpha, h))[\theta^{-1}(h')]))) \\
+ 2 \cdot \text{max(rng}(\theta)) \cdot (i - 1),
\]

and for each \(i \in [1, \text{len}(\alpha)]\) and \(j \in \text{rng}(\theta)\):

\[
l_{i,j} = \text{len}(\text{pga2pglc}(x)) + \sum_{h \in [1, i] \cup [1, \text{len}(\alpha) \setminus h], h' \in \text{rng}(\theta)} \text{len}(\text{pga2pglc}(\text{prj_pgn}(\alpha, h)(\text{pg}(\alpha, h))[\theta^{-1}(h')]))) \\
+ \sum_{h' \in [1, j - 1]} \text{len}(\text{pga2pglc}(\text{prj_pgn}(\alpha, i)(\text{pg}(\alpha, i))[\theta^{-1}(h')]))) .
\]

The following theorem states rigorously that, for any PGA_{p} program \(P\) and polyadic instruction sequence vector \(\alpha\), for all relevant instruction register file states \(\sigma\), \(|\text{pgld2pga}(\text{pgap2pgld}(P)(\alpha))|_{|_{\text{IRF}}\sigma}\) coincides with \(|P|_{\sigma}(\alpha)\) after the presence of the internal actions \(\tau\) and \(\text{gl} has been concealed.

**Theorem 1.** Let \(P\) be a PGA_{p} program and \(\alpha\) be a polyadic instruction sequence vector, and let \(n\) be the highest number occurring in instructions of the form \(\$\text{put}::i:\!u\) or \(\$\text{get}::i\) in \(P\) or \(\alpha\). Take the interval \([1, n]\) for \(I\) and the set of all core primitive instructions occurring in instructions of the form \(\$\text{put}::i:\!u\) for \(U\), and let \(\sigma \in \text{IRF}\). Then \(\tau_{\alpha}(\text{pgld2pga}(\text{pgap2pgld}(P)(\alpha)))|_{|_{\text{IRF}}\sigma}\) coincides with \(|P|_{\sigma}(\alpha)\).

**Proof.** The proof of Theorem 1 follows the same line as the proof of Theorem 1 from [4] given in that paper. Here, we give only a brief outline of the proof of the current theorem.

The proof of this theorem proceeds as follows: (i) we give a set \(T\) of closed terms of sort \(T\) with \(\tau_{\alpha}(\text{pgld2pga}(\text{pgap2pgld}(P)(\alpha)))|_{|_{\text{IRF}}\sigma}\) \(\in T\), a set \(T'\) of closed terms of sort \(T\) with
τ_{\text{tau}}(\text{pgld2pga}(\text{pgap2pgld}(P)))(\alpha)) /_{\mathcal{IRF}_\sigma} \in T', and a bijection \( \beta : T \to T' \);

(ii) we show that there exists a set \( E \) consisting of one derivable equation \( p = p' \) for each \( p \in T \) such that, for all equations \( p = p' \) in \( E \):

- \( \beta(p) = p'' \) is a derivable equation if \( p'' \) is \( p' \) with, for all \( q \in T \), all occurrences of \( q \) in \( p' \) replaced by \( \beta(q) \);
- \( p' \in T \) only if \( p' \) can be rewritten to a \( q' \notin T \) using the equations in \( E \) from left to right.

This means that \( \tau_{\text{tau}}(\tau_{\text{gl}}(|P|\sigma(\alpha))) \) and \( \tau_{\text{tau}}(|\text{pgld2pga}(\text{pgap2pgld}(P))(\alpha))| /_{\mathcal{IRF}_\sigma} \) denote solutions of the same guarded recursive specification. Because guarded recursive specifications have unique solutions, it follows immediately that \( \tau_{\text{tau}}(|\text{pgld2pga}(\text{pgap2pgld}(P))(\alpha))| /_{\mathcal{IRF}_\sigma} = \tau_{\text{tau}}(\tau_{\text{gl}}(|P|\sigma(\alpha))) \). □

In the proof outlined above, an apposite indexing of the closed terms in the sets \( T \) and \( T' \) facilitates the definition of the bijection \( \beta \). Yet, this definition is much more complicated than the definition of the bijection needed in the proof from [4] referred to.

The definition of \( \text{pgap2pgld} \) shows that the synthesis of sequential programs from collections of fragments is fairly complicated.

## 10 Conclusions

We have given a formalization of instruction sequences that have been split into fragments. Thread extraction provides the possible joint behaviours of a collection of fragments. We have shown that, for each possible joint behaviour of a collection of fragments, a sequential program can be synthesized from the collection of fragments that exhibits on execution essentially the behaviour in question by making use of a service. This program synthesis is reminiscent of the service-based variant of projection semantics for program notations used in [5]. However, we consider the program synthesis too complicated to serve a semantical purpose.

In this paper, a fragment in a collection of fragments has two attributes: an ordinal index (position) and a program notation index. We have also mentioned that a nominal index (label) could be an optional attribute. Many other attributes that are relevant in practice can be imagined, e.g. modification date, author, tester, owner, user, and security level. In this paper, we have restricted ourselves to attributes that are indispensable for a theoretical understanding of instruction sequence fragmentation.

The question arises whether all aspects of instruction sequence fragmentation covered in this paper can be dealt with at the level of threads. This issue has been investigated in [8]. It happens that not all aspects can be dealt with at the level of threads. In particular, the ability to replace special instructions in an instruction sequence fragment by different ordinary instructions every time execution is switched over to that fragment cannot be dealt with at the level of threads. Threads turn out to be too abstract to deal with instruction sequence fragmentation in full.
References


