On the expressiveness of single-pass instruction sequences

Bergstra, J.A.; Middelburg, C.A.

DOI
10.1007/s00224-010-9301-8

Publication date
2012

Document Version
Final published version

Published in
Theory of Computing Systems

Citation for published version (APA):

General rights
It is not permitted to download or to forward/distribute the text or part of it without the consent of the author(s) and/or copyright holder(s), other than for strictly personal, individual use, unless the work is under an open content license (like Creative Commons).

Disclaimer/Complaints regulations
If you believe that digital publication of certain material infringes any of your rights or (privacy) interests, please let the Library know, stating your reasons. In case of a legitimate complaint, the Library will make the material inaccessible and/or remove it from the website. Please Ask the Library: https://uba.uva.nl/en/contact, or a letter to: Library of the University of Amsterdam, Secretariat, Singel 425, 1012 WP Amsterdam, The Netherlands. You will be contacted as soon as possible.

UvA-DARE is a service provided by the library of the University of Amsterdam (https://dare.uva.nl)
On the Expressiveness of Single-Pass Instruction Sequences

J.A. Bergstra · C.A. Middelburg

Published online: 11 November 2010
© The Author(s) 2010. This article is published with open access at Springerlink.com

Abstract We perceive programs as single-pass instruction sequences. A single-pass instruction sequence under execution is considered to produce a behaviour to be controlled by some execution environment. Threads as considered in basic thread algebra model such behaviours. We show that all regular threads, i.e. threads that can only be in a finite number of states, can be produced by single-pass instruction sequences without jump instructions if use can be made of Boolean registers. We also show that, in the case where goto instructions are used instead of jump instructions, a bound to the number of labels restricts the expressiveness.

Keywords Single-pass instruction sequence · Regular thread · Expressiveness · Jump-free instruction sequence

1 Introduction

The work presented in this paper is part of a research program which is concerned with different subjects from the theory of computation and the area of computer architectures where we come across the relevancy of the notion of instruction sequence. The working hypothesis of this research program is that this notion is a central notion of computer science. It is clear that instruction sequence is a key concept in practice, but strangely enough it has as yet not come prominently into the picture in theoretical circles.
Program algebra [3], which is intended as a setting suited for developing theory from the above-mentioned working hypothesis, is taken for the basis of the development of theory under the research program. The starting-point of program algebra is the perception of a program as a single-pass instruction sequence, i.e. a finite or infinite sequence of instructions of which each instruction is executed at most once and can be dropped after it has been executed or jumped over. This perception is simple, appealing, and links up with practice. A single-pass instruction sequence under execution is considered to produce a behaviour to be controlled by some execution environment. Threads as considered in basic thread algebra [3] model such behaviours: upon each action performed by a thread, a reply from the execution environment determines how the thread proceeds. A thread may make use of services, i.e. components of the execution environment.

Each Turing machine can be simulated by means of a thread that makes use of a service. The thread and service correspond to the finite control and tape of the Turing machine. The threads that correspond to the finite controls of Turing machines are examples of regular threads, i.e. threads that can only be in a finite number of states. The behaviours of all single-pass instruction sequences considered in program algebra are regular threads and each regular thread is produced by some single-pass instruction sequence. In this paper, we show that each regular thread can be produced by some single-pass instruction sequence without jump instructions if use can be made of services that make up Boolean registers.

The primitive instructions of program algebra include jump instructions. An interesting variant of program algebra is obtained by leaving out jump instructions and adding labels and goto instructions. It is easy to see that each regular thread can also be produced by some single-pass instruction sequence with labels and goto instructions. In this paper, we show that a bound to the number of labels restricts the expressiveness of this variant.

As part of the research program of which the work presented in this paper is part, issues concerning the following subjects from the theory of computation have been investigated from the viewpoint that a program is an instruction sequence: semantics of programming languages [4, 12], expressiveness of programming languages [9, 23], computability [10, 13], and computational complexity [7]. Performance related matters of instruction sequences have also been investigated in the spirit of the theory of computation [11, 12]. In the area of computer architectures, basic techniques aimed at increasing processor performance have been studied as part of this research program (see e.g. [5, 8]).

The work referred to above provides evidence for our hypothesis that the notion of instruction sequence is a central notion of computer science. To say the least, it shows that instruction sequences are relevant to diverse subjects. In addition, it is to be expected that the emerging developments with respect to techniques for high-performance program execution on classical or non-classical computers require that programs are considered at the level of instruction sequences. All this has motivated us to continue the above-mentioned research program with the work on expressiveness presented in this paper.

1In [3], basic thread algebra is introduced under the name basic polarized process algebra.
This paper is organized as follows. First, we review basic thread algebra and program algebra (Sects. 2 and 3). Next, we present a mechanism for interaction of threads with services and give a description of Boolean register services (Sects. 4 and 5). After that, we show that each regular thread can be produced by some single-pass instruction sequence without jump instructions if use can be made of Boolean register services (Sect. 6). Then, we introduce the variant of program algebra obtained by leaving out jump instructions and adding labels and goto instructions (Sect. 7). Following this, we show that a bound to the number of labels restricts the expressiveness of this variant (Sect. 8). Finally, we make some concluding remarks (Sect. 9).

2 Basic Thread Algebra

In this section, we review BTA (Basic Thread Algebra), which is concerned with the behaviours that sequential programs exhibit on execution. These behaviours are called threads.

In BTA, it is assumed that a fixed but arbitrary set $\mathcal{A}$ of basic actions has been given. A thread performs actions in a sequential fashion. Upon each action performed, a reply from the execution environment of the thread determines how it proceeds. To simplify matters, there are only two possible replies: $T$ and $F$.

BTA has one sort: the sort $T$ of threads. To build terms of sort $T$, it has the following constants and operators:

- the deadlock constant $D : T$;
- the termination constant $S : T$;
- for each $a \in \mathcal{A}$, the binary postconditional composition operator $\_ \triangleleft a \triangleright \_ : T \times T \rightarrow T$.

We assume that there are infinitely many variables of sort $T$, including $x, y, z$. We introduce action prefixing as an abbreviation: $a \circ p$ abbreviates $p \triangleleft a \triangleright p$.

The thread denoted by a closed term of the form $p \triangleleft a \triangleright q$ will first perform $a$, and then proceed as the thread denoted by $p$ if the reply from the execution environment is $T$ and proceed as the thread denoted by $q$ if the reply from the execution environment is $F$. The threads denoted by $D$ and $S$ will become inactive and terminate, respectively. This implies that each closed BTA term denotes a thread that will become inactive or terminate after it has performed finitely many actions. Infinite threads can be described by guarded recursion.

A guarded recursive specification over BTA is a set of recursion equations $E = \{X = t_X \mid X \in V\}$, where $V$ is a set of variables of sort $T$ and each $t_X$ is a BTA term of the form $D$, $S$ or $t \triangleleft a \triangleright t'$ with $t$ and $t'$ that contain only variables from $V$. We write $V(E)$ for the set of all variables that occur in $E$. We are only interested in models of BTA in which guarded recursive specifications have unique solutions, such as the projective limit model of BTA presented in [2].

For each guarded recursive specification $E$ and each $X \in V(E)$, we introduce a constant $\langle X \mid E \rangle$ of sort $T$ standing for the unique solution of $E$ for $X$. The axioms for these constants are given in Table 1. In this table, we write $\langle t_X \mid E \rangle$ for $t_X$ with, for all $Y \in V(E)$, all occurrences of $Y$ in $t_X$ replaced by $\langle Y \mid E \rangle$. $X$, $t_X$ and $E$ stand for an
arbitrary variable of sort T, an arbitrary BTA term of sort T and an arbitrary guarded recursive specification over BTA, respectively. Side conditions are added to restrict what X, tX and E stand for.

Closed terms that denote the same infinite thread cannot always be proved equal by means of the axioms given in Table 1. We introduce AIP (Approximation Induction Principle) to remedy this. AIP is based on the view that two threads are identical if their approximations up to any finite depth are identical. The approximation up to depth n of a thread is obtained by cutting it off after it has performed n actions. In AIP, the approximation up to depth n is phrased in terms of the unary projection operator πₙ : T → T. AIP and the axioms for the projection operators are given in Table 2.

### 3 Program Algebra

In this section, we review PGA (ProGram Algebra). The perception of a program as a single-pass instruction sequence is the starting-point of PGA.

In PGA, it is assumed that a fixed but arbitrary set A of basic instructions has been given. PGA has the following primitive instructions:

- for each a ∈ A, a plain basic instruction a;
- for each a ∈ A, a positive test instruction +a;
- for each a ∈ A, a negative test instruction −a;
- for each l ∈ N, a forward jump instruction #l;
- a termination instruction !.

We write J for the set of all primitive instructions.

The intuition is that the execution of a basic instruction a produces either T or F at its completion. In the case of a positive test instruction +a, a is executed and execution proceeds with the next primitive instruction if T is produced. Otherwise, the next primitive instruction is skipped and execution proceeds with the primitive instruction following the skipped one. If there is no next instruction to be executed, deadlock occurs. In the case of a negative test instruction −a, the role of the value produced is reversed. In the case of a plain basic instruction a, execution always proceeds as if T is produced. The effect of a forward jump instruction #l is that execution proceeds with the l-th next instruction. If l equals 0 or the l-th next instruction does
not exist, deadlock occurs. The effect of the termination instruction ! is that execution terminates.

PGA has the following constants and operators:

- for each \( u \in \mathcal{I} \), an instruction constant \( u \);
- the binary concatenation operator \( _{;} \); 
- the unary repetition operator \( _{ω} \).

We assume that there are infinitely many variables, including \( X, Y, Z \).

A closed PGA term is considered to denote a non-empty, finite or eventually periodic infinite sequence of primitive instructions.\(^2\) Closed PGA terms are considered equal if they denote the same instruction sequence. The axioms for instruction sequence equivalence are given in Table 3. In this table, \( n \) stands for an arbitrary natural number greater than 0. For each PGA term \( P \), the term \( P^n \) is defined by induction on \( n \) as follows: \( P^1 = P \) and \( P^{n+1} = P \cdot P^n \). The equation \( X^ω = X \cdot X^ω \) is derivable. Each closed PGA term is derivably equal to one of the form \( P \) or \( P^ω \cdot Q \), where \( P \) and \( Q \) are closed PGA terms in which the repetition operator does not occur.

The repetition operator renders backward jump instructions superfluous. In [3], it is shown how programs in a program notation that is close to existing assembly languages with forward and backward jump instructions can be translated into closed PGA terms.

The behaviours of the instruction sequences denoted by closed PGA terms are considered threads, with basic instructions taken for basic actions. The thread extraction operation \( |_{;} | \) determines, for each closed PGA term \( P \), a closed term of BTA with guarded recursion that denotes the behaviour of the instruction sequence denoted by \( P \). The thread extraction operation is defined by the equations given in Table 4 (for \( a \in \mathcal{A} \), \( l \in \mathbb{N} \) and \( u \in \mathcal{I} \)) and the rule that \( |_{#l}; X | = D \) if \( #l \) is the beginning of an infinite jump chain. This rule is formalized in e.g. [9].

\[^2\]An eventually periodic infinite sequence is an infinite sequence with only finitely many distinct suffixes.
4 Interaction of Threads with Services

A thread may make use of services. That is, a thread may perform an action for the purpose of interacting with a service that takes the action as a command to be processed. The processing of an action may involve a change of state of the service and at completion of the processing of the action the service returns a reply value to the thread. In this section, we introduce the use operators, which are concerned with this kind of interaction between threads and services.

It is assumed that a fixed but arbitrary set $F$ of foci and a fixed but arbitrary set $M$ of methods have been given. Each focus plays the role of a name of some service provided by an execution environment that can be requested to process a command. Each method plays the role of a command proper. For the set $A$ of actions, we take the set $\{f.m | f \in F, m \in M\}$. Performing an action $f.m$ is taken as making a request to the service named $f$ to process command $m$.

A service $H$ consists of

- a set $S$ of states;
- an effect function $\text{eff} : M \times S \rightarrow S$;
- a yield function $\text{yld} : M \times S \rightarrow \{T, F, B\}$;
- an initial state $s_0 \in S$;

satisfying the following condition:

$$\forall m \in M, s \in S \cdot (\text{yld}(m, s) = B \Rightarrow \forall m' \in M \cdot \text{yld}(m', \text{eff}(m, s)) = B).$$

The set $S$ contains the states in which the service may be, and the functions $\text{eff}$ and $\text{yld}$ give, for each method $m$ and state $s$, the state and reply, respectively, that result from processing $m$ in state $s$.

Let $H = (S, \text{eff}, \text{yld}, s_0)$ be a service and let $m \in M$. Then the derived service of $H$ after processing $m$, written $\partial \partial m H$, is the service $(S, \text{eff}, \text{yld}, \text{eff}(m, s_0))$; and the reply of $H$ after processing $m$, written $H(m)$, is $\text{yld}(m, s_0)$.

When a thread makes a request to service $H$ to process $m$:

- if $H(m) \neq B$, then the request is accepted, the reply is $H(m)$, and the service proceeds as $\partial \partial m H$;
- if $H(m) = B$, then the request is rejected.

We introduce the sort $S$ of services and, for each $f \in F$, the binary use operator $\_ / f \_ : T \times S \rightarrow T$. The axioms for these operators are given in Table 5.

Intuitively, $p / f H$ is the thread that results from processing all actions performed by thread $p$ that are of the form $f.m$ by service $H$. When an action of the form $f.m$ performed by thread $p$ is processed by service $H$, the postconditional composition concerned is eliminated on the basis of the reply value produced. No internal action is left as a trace of the processed action, like with the use operators found in papers on thread interleaving (see e.g. [6]).

Combining TSU2 and TSU7, we obtain $\bigwedge_{n \geq 0} \pi_n(x) / f H = D \Rightarrow x / f H = D$. 

 Springer
Table 5  Axioms for use operators

\[
\begin{align*}
S /_f H &= S & \text{TSU1} \\
D /_f H &= D & \text{TSU2} \\
(x \leq g, m \geq y) /_f H &= (x /_f H) \leq g, m \geq (y /_f H) & \text{if } f \neq g \text{ TSU3} \\
(x \leq f, m \geq y) /_f H &= x /_f D_{\delta m} H & \text{if } H(m) = T \text{ TSU4} \\
(x \leq f, m \geq y) /_f H &= y /_f D_{\delta m} H & \text{if } H(m) = F \text{ TSU5} \\
(x \leq f, m \geq y) /_f H &= D & \text{if } H(m) = B \text{ TSU6} \\
\bigwedge_{n \geq 0} \pi_n(x) /_f H &= \pi_n(y) /_f H \Rightarrow x /_f H = y /_f H & \text{TSU7}
\end{align*}
\]

5 Instruction Sequences Acting on Boolean Registers

Our study of jump-free instruction sequences in Sect. 6 is concerned with instruction sequences that act on Boolean registers. In this section, we describe services that make up Boolean registers.

A Boolean register service accepts the following methods:

- a set to true method set:T;
- a set to false method set:F;
- a get method get.

We write \( M_{\text{BR}} \) for the set \{set:T, set:F, get\}. It is assumed that \( M_{\text{BR}} \subseteq M \).

The methods accepted by Boolean register services can be explained as follows:

- set:T: the contents of the Boolean register becomes T and the reply is T;
- set:F: the contents of the Boolean register becomes F and the reply is F;
- get: nothing changes and the reply is the contents of the Boolean register.

Let \( s \in \{T, F, B\} \). Then the Boolean register service with initial state \( s \), written \( BR_s \), is the service \((\{T, F, B\}, eff, eff, s)\), where the function \( eff \) is defined as follows \((b \in \{T, F\})\):

\[
\begin{align*}
\text{eff(set:T, b)} &= T, & \text{eff}(m, b) &= B & \text{if } m \notin M_{\text{BR}}, \\
\text{eff(set:F, b)} &= F, & \text{eff}(m, B) &= B. \\
\text{eff(get, b)} &= b,
\end{align*}
\]

Notice that the effect and yield functions of a Boolean register service are the same.

6 Jump-Free Instruction Sequences

In this section, we show that each thread that can only be in a finite number of states can be produced by some single-pass instruction sequence without jump instructions if use can be made of Boolean register services.

First, we make precise what it means that a thread can only be in a finite number of states. We assume that a fixed but arbitrary model \( \mathfrak{M} \) of BTA extended with guarded recursion and the use mechanism has been given, we use the term thread only for the
elements from the domain of \( M \), and we denote the interpretations of constants and operators in \( M \) by the constants and operators themselves.

Let \( p \) be a thread. Then the set of states or residual threads of \( p \), written \( \text{Res}(p) \), is inductively defined as follows:

- \( p \in \text{Res}(p) \);
- if \( q \sqsubseteq a \sqsupseteq r \in \text{Res}(p) \), then \( q \in \text{Res}(p) \) and \( r \in \text{Res}(p) \).

We say that \( p \) is a regular thread if \( \text{Res}(p) \) is finite.

We will make use of the fact that being a regular thread coincides with being the solution of a finite guarded recursive specification of a restricted form.

A linear recursive specification over BTA is a guarded recursive specification
\[
E = \{ X = tX \mid X \in V \}
\]
where each \( tX \) is a term of the form \( D, S \) or \( Y \sqsubseteq a \sqsupseteq Z \) with \( Y, Z \in V \).

**Proposition 1** Let \( p \) be a thread. Then \( p \) is a regular thread iff there exists a finite linear recursive specification \( E \) and a variable \( X \in V(E) \) such that \( p \) is the solution of \( E \) for \( X \).

**Proof** This proposition generalizes Theorem 1 from [23] from the projective limit model to an arbitrary model. However, the proof of that theorem is applicable to any model. \(\square\)

In the proof of the next theorem, we associate a closed PGA term \( P \) in which jump instructions do not occur with a finite linear recursive specification
\[
E = \{ X_i = X_{l(i)} \sqsubseteq a_i \sqsupseteq X_{r(i)} \mid i \in [1,n] \} \cup \{ X_{n+1} = S, X_{n+2} = D \}.
\]

In \( P \), a number of Boolean register services is used for specific purposes. The purpose of each individual Boolean register is reflected in the focus that serves as its name:

- for each \( i \in [1,n+2] \), \( s:i \) serves as the name of a Boolean register that is used to indicate whether the current state of \( \langle X_1 \mid E \rangle \) is \( \langle X_i \mid E \rangle \);
- \( rt \) serves as the name of a Boolean register that is used to indicate whether the reply upon the action performed by \( \langle X_1 \mid E \rangle \) in its current state is \( T \);
- \( rf \) serves as the name of a Boolean register that is used to indicate whether the reply upon the action performed by \( \langle X_1 \mid E \rangle \) in its current state is \( F \);
- \( e \) serves as the name of a Boolean register that is used to achieve that instructions not related to the current state of \( \langle X_1 \mid E \rangle \) are passed correctly;
- \( f \) serves as the name of a Boolean register that is used to achieve with the instruction \( +f.set:F \) that the following instruction is skipped.

Now we turn to the theorem announced above. It states rigorously that the solution of every finite linear recursive specification can be produced by an instruction sequence without jump instructions if use can be made of Boolean register services.

**Theorem 1** Let a finite linear recursive specification
\[
E = \{ X_i = X_{l(i)} \sqsubseteq a_i \sqsupseteq X_{r(i)} \mid i \in [1,n] \} \cup \{ X_{n+1} = S, X_{n+2} = D \}
\]
be given. Then there exists a closed PGA term \( P \) in which jump instructions do not occur such that

\[
<X_1|E> = ((((\ldots (|P| /_{s:1} BR_F) \ldots /_{s:n+2} BR_F) /_{t} BR_F) /_{\emptyset} BR_F) /_{\hat{t}} BR_F).
\]

Proof We associate a closed PGA term \( P \) in which jump instructions do not occur with \( E \) as follows:

\[
P = s:1.set:T; (Q_1; \ldots; Q_{n+1})^o,
\]

where, for each \( i \in [1, n] \):

\[
Q_i = +s:i.get; e.set:T;
+e.get; s:i.set:F;
+e.get; s:i.set:F; rt.set:T;
+e.get; s:i.set:F; rt.set:T;
+rt.get; s:i.set:T;
+rt.get; s:i.set:T;
\]

and

\[
Q_{n+1} = +s:n+1.get; !.
\]

We use the following abbreviations (for \( i \in [1, n+1] \) and \( j \in [1, n+2] \)):

\[
P_i'^{br} \quad \text{for} \quad Q_i; \ldots; Q_{n+1}; (Q_1; \ldots; Q_{n+1})^o;
\]

\[
|x_{i,j}^{br} \quad \text{for} \quad ((((\ldots (|P_i'| /_{s:1} BR_F) \ldots /_{s:n+2} BR_F) /_{t} BR_F) /_{\emptyset} BR_F) /_{\hat{t}} BR_F);
\]

where \( b_j = T \) and, for each \( j' \in [1, n+2] \) such that \( j' \neq j, b_{j'} = F \).

From the definition of thread extraction, the definition of Boolean register services, and axiom TSU4, it follows that

\[
((((\ldots (|P| /_{s:1} BR_F) \ldots /_{s:n+2} BR_F) /_{t} BR_F) /_{\emptyset} BR_F) /_{\hat{t}} BR_F) /_{\hat{t}} BR_F = |P_1'^{br}|_1.
\]

This leaves us to show that \( <X_1|E> = |P_1'^{br}|_1 \).

Using the definition of thread extraction, the definition of Boolean register services, and axioms P0, P2, TSU1, TSU2, TSU4, TSU5 and TSU7, we easily prove the following:

\[
|x_{i,j}^{br} = |P_i'^{br}|_j \quad \text{if} \quad 1 \leq i \leq n \land 1 \leq j \leq n+1 \land i \neq j \quad (1)
\]

\[
|x_{i,j}^{br} = |P_i'^{br}|_j \quad \text{if} \quad i = n+1 \land 1 \leq j \leq n+1 \land i \neq j \quad (2)
\]

\[
|x_{i,j}^{br} = |P_i'^{br}|_i(l(i)) \leq a_i \geq |P_{i+1}'^{br}|_{r(i)} \quad \text{if} \quad 1 \leq i \leq n \quad (3)
\]

\[
|x_{i,j}^{br} = S \quad \text{if} \quad i = n+1 \quad (4)
\]

\[
|x_{i,j}^{br} = D \quad \text{if} \quad 1 \leq i \leq n+1 \land j = n+2 \quad (5)
\]

\[\square\] Springer
From Properties 1 and 2, it follows that

$$|P'_i|_j^{br} = |P'_j|_j^{br}$$ if $1 \leq i \leq n + 1 \land 1 \leq j \leq n + 1 \land i \neq j$.

From this and Property 3, it follows that

$$|P'_i|_i^{br} = |P'_{l(i)}|_{l(i)}^{br} \leq a_i \geq |P'_{r(i)}|_{r(i)}^{br}$$ if $1 \leq i \leq n$.

From this and Properties 4 and 5, it follows that $|P'_1|_1^{br}$ is a solution of $E$ for $X_1$. Because linear recursive specifications have unique solutions, it follows that $\langle X_1 | E \rangle = |P'_1|_1^{br}$.

Theorem 1 goes through in the case where $E = \{X_1 = D\}$: a witnessing $P$ is $(f.g)^\omega$. It follows from the proof of Proposition 1 given in [23] that, for each regular thread $p$, either $p$ is the solution of $\{X_1 = D\}$ for $X_1$ or there exists a finite linear recursive specification $E$ of the form considered in Theorem 1 such that $p$ is the solution of $E$ for $X_1$. Hence, we have the following corollary of Proposition 1 and Theorem 1:

**Corollary 1** For each regular thread $p$, there exists a closed PGA term $P$ in which jump instructions do not occur such that $p$ is the thread denoted by

$$(((\ldots (|P| /_{s:1} BR_F) \ldots /_{s:n+2} BR_F) /_{l} BR_F) /_{r} BR_F) /_{e} BR_F) /_{l} BR_F.$$

In other words, each regular thread can be produced by an instruction sequence without jump instructions if use can be made of Boolean register services.

The construction of such instructions sequences given in the proof of Theorem 1 is weakly reminiscent of the construction of structured programs from flow charts found in [14]. However, our construction is more extreme: it yields programs that contain neither unstructured jumps nor a rendering of the conditional and loop constructs used in structured programming.

### 7 Program Algebra with Labels and Goto’s

In this section, we introduce PGA$_g$, a variant of PGA obtained by leaving out jump instructions and adding labels and goto instructions.

In PGA$_g$, like in PGA, it is assumed that a fixed but arbitrary set $\mathcal{A}$ of basic instructions has been given. PGA$_g$ has the following **primitive instructions**:

- for each $a \in \mathcal{A}$, a *plain basic instruction* $a$;
- for each $a \in \mathcal{A}$, a *positive test instruction* $+a$;
- for each $a \in \mathcal{A}$, a *negative test instruction* $-a$;
- for each $l \in \mathbb{N}$, a *label instruction* $[l]$.
– for each \( l \in \mathbb{N} \), a goto instruction \(#[l]\);
– a termination instruction !.

We write \( J_g \) for the set of all primitive instructions of \( \text{PGA}_g \).

The plain basic instructions, the positive test instructions, the negative test instructions, and the termination instruction are as in PGA. Upon execution, a label instruction \([l]\) is simply skipped. If there is no next instruction to be executed, deadlock occurs. The effect of a goto instruction \(#[l]\) is that execution proceeds with the occurrence of the label instruction \([l]\) next following if it exists. If there is no occurrence of the label instruction \([l]\), deadlock occurs.

\( \text{PGA}_g \) has a constant \( u \) for each \( u \in J_g \). The operators of \( \text{PGA}_g \) are the same as the operators as PGA. Likewise, the axioms of \( \text{PGA}_g \) are the same as the axioms as PGA.

Just like in the case of PGA, the behaviours of the instruction sequences denoted by closed \( \text{PGA}_g \) terms are considered threads. The behaviours of the instruction sequences denoted by closed \( \text{PGA}_g \) terms are indirectly given by the behaviour preserving function \( \text{pgag2pga} \) from the set of all closed \( \text{PGA}_g \) terms to the set of all closed PGA terms defined by

\[
\text{pgag2pga}(u_1; \ldots; u_n) = \text{pgag2pga}(u_1; \ldots; u_n; (#[1]^o),
\]

\[
\text{pgag2pga}(u_1; \ldots; u_n; (u_{n+1}; \ldots; u_m)^o)
= \phi_1(u_1); \ldots; \phi_n(u_n); (\phi_{n+1}(u_{n+1}); \ldots; \phi_m(u_m))^o,
\]

where the auxiliary functions \( \phi_j : J_g \rightarrow I \) are defined as follows (\( 1 \leq j \leq m \)):

\[
\phi_j([l]) = #1,
\]

\[
\phi_j([#l]) = #tg_{j}(l),
\]

\[
\phi_j(u) = u \quad \text{if} \ u \text{ is not a label or goto instruction},
\]

where

\[
tg_{j}(l) = i \quad \text{if the leftmost occurrence of} \ [l] \ \text{in} \ u_j; \ldots; u_m; u_{n+1}; \ldots; u_m \text{ is the} \ i-th \text{instruction};
\]

\[
tg_{j}(l) = 0 \quad \text{if there are no occurrences of} \ [l] \ \text{in} \ u_j; \ldots; u_m; u_{n+1}; \ldots; u_m.
\]

Let \( P \) be a closed \( \text{PGA}_g \) term. Then the behaviour of \( P \) is \( |\text{pgag2pga}(P)| \). The approach to semantics followed here is introduced under the name projection semantics in [3]. The function \( \text{pgag2pga} \) is called a projection.

8 A Bounded Number of Labels

In this section, we show that a bound to the number of labels restricts the expressiveness of \( \text{PGA}_g \). We will refer to \( \text{PGA}_g \) terms that do not contain label instructions \([l]\) with \( l > k \) as \( \text{PGA}_g^k \) terms. Moreover, we will write \( J_g^k \) for the set \( J_g \setminus \{[l] | l > k\} \).
We define an alternative projection for closed PGA\(^k\) terms, which takes into account that these terms contain only label instructions \([l]\) with \(1 \leq l \leq k\). The alternative projection \(\text{pgag2pga}^k\) from the set of all closed PGA\(^k\) terms to the set of all closed PGA terms is defined by

\[
\text{pgag2pga}^k(u_1; \ldots; u_n) = \text{pgag2pga}^k(u_1; \ldots; u_n; (#[1])^o),
\]

\[
\text{pgag2pga}^k(u_1; \ldots; u_n; (u_{n+1}; \ldots; u_m)^o)
\]

\[
= \psi(u_1, u_2); \ldots; \psi(u_n, u_{n+1}); (\psi(u_{n+1}, u_{n+2}); \ldots;
\]

\[
; \psi(u_{m-1}, u_m); \psi(u_m, u_{m+1}))^o,
\]

where the auxiliary function \(\psi: \mathcal{J}_g^k \times \mathcal{J}_g^k \rightarrow \mathcal{J}\) is defined as follows:

\[
\psi(u', u'') = \psi'(u') \# k + 2 \# k + 2 \psi''(u''),
\]

where the auxiliary functions \(\psi', \psi'': \mathcal{J}_g^k \rightarrow \mathcal{J}\) are defined as follows:

\[
\psi'([l]) = #1,
\]

\[
\psi'([#l]) = #l + 2 \quad \text{if } l \leq k,
\]

\[
\psi'([#l]) = #0 \quad \text{if } l > k,
\]

\[
\psi'(u) = u \quad \text{if } u \text{ is not a label or goto instruction},
\]

\[
\psi''([l]) = (#k + 3)^{l-1} \# k - l + 1 \# (k + 3)^{k-l},
\]

\[
\psi''(u) = (#k + 3)^k \quad \text{if } u \text{ is not a label instruction}.
\]

In order to clarify the alternative projection, we explain how the intended effect of a goto instruction is obtained. If \(u_j\) is \([l]\), then \(\psi'(u_j)\) is \(#l + 2\). The effect of \(#l + 2\) is a jump to the \(l\)-th instruction in the defining thread extraction, and the easy to prove fact that \([P; #0] = |P|\).

**Proposition 2** For each PGA context \(C[\cdot]:\)

\[
|C[#n + 1; u_1; \ldots; u_n; #m]| = |C[#m + n + 1; u_1; \ldots; u_n; #m]|.
\]

**Proof** Contexts of the forms \(C[^o; Q\) and \(P; C[^o; Q\) do not need to be considered because of axiom PGA3. For eight of the remaining twelve forms, the equation to be proved follows immediately from the equations to be proved for the other forms, to wit \(_Q\); \(P\); \(_Q\); \(P\); \(_o^o\) and \(P\); \(_Q; P\); \(_o^o\); the axioms of PGA, the defining equations for thread extraction, and the easy to prove fact that \([P; #0] = |P|\).
In the case of the form \( \_ : Q \), the equation concerned is easily proved by induction on \( n \). In the case of the form \( P : \_ : Q \), only \( P \) in which the repetition operator does not occur need to be considered because of axiom PGA3. For such \( P \), the equation concerned is easily proved by induction on the length of \( P \), using the equation proved for the form \( \_ : Q \). In the case of the form \( P : \_ : \omega \), only \( P \) in which the repetition operator does not occur need to be considered because of axiom PGA3. For such \( P \), the equations for the approximating forms \( P : \_ : \omega^k \) are easily proved by induction on \( k \), using the equation proved for the form \( P : \_ : Q \). From these equations, the equation for the form \( P : \_ : \omega \) follows using AIP. In the case of the form \( P : (Q : \_ : \omega) \), the equation concerned is proved like in the case of the form \( P : \_ : \omega \).

\[ \square \]

The following theorem states rigorously that the projections \( \text{pgag2pga} \) and \( \text{pgag2pga}^k \) give rise to instruction sequences with the same behaviour.

**Theorem 2** For each closed PGA\(_g^k\) term \( P \), \(|\text{pgag2pga}(P)| = |\text{pgag2pga}^k(P)|\).

**Proof** Because \( \text{pgag2pga}(u_1; \ldots ; u_n) = \text{pgag2pga}(u_1; \ldots ; u_n; (\#[1])^{\omega}) \) and \( \text{pgag2pga}^k(u_1; \ldots ; u_n) = \text{pgag2pga}^k(u_1; \ldots ; u_n; (\#[1])^{\omega}) \), we only consider the case where the repetition operator occurs in \( P \).

We make use of an auxiliary function \(|\_ , \_|\). This function determines, for each natural number and closed PGA term in which the repetition operator occurs, a closed term of BTA with guarded recursion. The function \(|\_ , \_|\) is defined as follows:

\[ |i , u_1; \ldots ; u_n ; \omega| = |u_{i+1}; \ldots ; u_m ; \omega| \quad \text{if} \quad 1 \leq i \leq m, \]

\[ |i , u_1; \ldots ; u_n ; \omega| = D \quad \text{if} \quad m < i \leq m. \]

Let \( P = u_1; \ldots ; u_n; (u_{n+1}; \ldots ; u_m)^{\omega} \) be a closed PGA\(_g^k\) term, let \( P' = \text{pgag2pga}(P) \), and let \( P'' = \text{pgag2pga}^k(P) \). Moreover, let \( \rho : \mathbb{N} \rightarrow \mathbb{N} \) be such that \( f(i) = (k + 3) \cdot (i - 1) + 1 \). Then it follows easily from the definitions of \(|\_ , \_|\), \(|\_|\), \( \text{pgag2pga} \) and \( \text{pgag2pga}^k \), the axioms of PGA and Proposition 2 that for \( 1 \leq i \leq m \):

\[ |i , P'| = a \circ |i + 1, P'| \quad \text{if} \quad u_i = a, \]

\[ |i , P'| = |i + 1, P'| \leq a \geq |i + 2, P'| \quad \text{if} \quad u_i = +a, \]

\[ |i , P'| = |i + 2, P'| \leq a \geq |i + 1, P'| \quad \text{if} \quad u_i = -a, \]

\[ |i , P'| = |i + 1, P'| \quad \text{if} \quad u_i = [l], \]

\[ |i , P'| = |i + n, P'| \quad \text{if} \quad u_i = \#[l] \land tgt_i(l) = n, \]

\[ |i , P'| = S \quad \text{if} \quad u_i = !. \]
and

\[ |\rho(i), P''| = a \circ |\rho(i + 1), P''| \quad \text{if } u_i = a, \]

\[ |\rho(i), P''| = |\rho(i + 1), P''| \preceq a \succeq |\rho(i + 2), P''| \quad \text{if } u_i = +a, \]

\[ |\rho(i), P''| = |\rho(i + 2), P''| \preceq a \succeq |\rho(i + 1), P''| \quad \text{if } u_i = -a, \]

\[ |\rho(i), P''| = |\rho(i + 1), P''| \quad \text{if } u_i = [l], \]

\[ |\rho(i), P''| = |\rho(i + n), P''| \quad \text{if } u_i = \#[l] \land tgt_i(l) = n, \]

\[ |\rho(i), P''| = S \quad \text{if } u_i = ! \]

(where \( tgt_i \) is as in the definition of \( pgag2pga \)). Because \( |pgag2pga(P)| = |1, P'| \) and \( |pgag2pga_k(P)| = |\rho(1), P''| \), this means that \( |pgag2pga(P)| \) and \( |pgag2pga_k(P)| \) are solutions of the same guarded recursive specification. Because guarded recursive specifications have unique solutions, it follows that \( |pgag2pga(P)| = |pgag2pga_k(P)| \).

The projection \( pgag2pga_k(P) \) yields only closed PGA terms that do not contain jump instructions \( \#l \) with \( l > k + 3 \). Hence, we have the following corollary of Theorem 2:

**Corollary 2** For each closed \( \text{PGA}_k \) term \( P \), there exists a closed PGA term \( P' \) not containing jump instructions \( \#l \) with \( l > k + 3 \) such that \( |pgag2pga(P)| = |P'| \).

It follows from Corollary 2 that, if a regular thread cannot be denoted by a closed PGA term that does not contain jump instructions \( \#l \) with \( l > k + 3 \), it cannot be denoted by a closed \( \text{PGA}_k \) term. Moreover, it is known that, for each \( k \in \mathbb{N} \), there exists a closed PGA term for which there does not exist a closed PGA term not containing jump instructions \( \#l \) with \( l > k + 3 \) that denotes the same thread (see e.g. [23], Proposition 3). Hence, we also have the following corollary:

**Corollary 3** For each \( k \in \mathbb{N} \), there exists a closed PGA term \( P \) for which there does not exist a closed \( \text{PGA}_k \) term \( P' \) such that \( |P| = |pgag2pga(P')| \).

### 9 Conclusions

Program algebra is a setting suited for investigating single-pass instruction sequences. In this setting, we have shown that each behaviour that can be produced by a single-pass instruction sequence under execution can be produced by a single-pass instruction sequence without jump instructions if use can be made of Boolean register services. We consider this an interesting expressiveness result. An important variant of program algebra is obtained by leaving out jump instructions and adding labels and goto instructions. We have also shown that a bound to the number of labels restricts the expressiveness of this variant. Earlier expressiveness results on single-pass instruction sequences as considered in program algebra are collected in [23].
Program algebra does not provide a notation for programs that is intended for actual programming. However, to demonstrate that single-pass instruction sequences as considered in program algebra are suited for explaining programs in the form of assembly programs as well as programs in the form of structured programs, a hierarchy of program notations rooted in program algebra is introduced in [3]. One program notation belonging to this hierarchy, called PGLD\textsubscript{g}, is a simple program notation, close to existing assembly languages, with labels and goto instructions. We remark that a projection from the set of all PGLD\textsubscript{g} programs to the set of all closed PGA\textsubscript{g} terms can easily be devised.

The idea that programs are in essence single-pass instruction sequences underlies the choice for the name program algebra. The name seems to imply that program algebra is suited for investigating programs in general. We do not intend to claim this generality, which in any case does not matter when investigating single-pass instruction sequences. The name program algebra might as well be used as a collective name for algebras that are based on any viewpoint concerning programs. To our knowledge, it is not common to use the name as such.

Most closely related to our work on instruction sequences is work on Kleene algebras (see e.g. [15–18, 24]), but programs are considered at a higher level in that work. For instance, programming features like jump instructions have never been studied. In most work on computer architecture (see e.g. [1, 19–22]), instruction sequences are under discussion. However, the notion of instruction sequence is not subjected to systematic and precise analysis in the work concerned.

Acknowledgements This research was partly carried out in the framework of the Jacquard-project Symbiosis, which is funded by the Netherlands Organisation for Scientific Research (NWO). We thank Alban Ponse, colleague at the University of Amsterdam, and Stephan Schroevers, graduate student at the University of Amsterdam, for carefully reading a preliminary version of this paper and pointing out some flaws in it. Moreover, we thank an anonymous referee for suggesting improvements of the presentation of the paper.

Open Access This article is distributed under the terms of the Creative Commons Attribution Noncommercial License which permits any noncommercial use, distribution, and reproduction in any medium, provided the original author(s) and source are credited.

References