## Tree Parsing for Code Selection

Reinhard Wilhelm Universität des Saarlandes wilhelm@cs.uni-sb.de

3. Januar 2010

▲□▶ ▲□▶ ▲□▶ ▲□▶ □ □ の Q @

## Code Generation

Real machines instead of abstract machines:

- Register machines,
- Limited resources (registers, memory),

◆□▶ ◆□▶ ◆□▶ ◆□▶ □ のQ@

- Fixed word size,
- Storage hierarchy,
- Intraprocessor parallelism.

### Phases in code generation

code selection: selecting semantically equivalent sequences of machine instructions for programs,

register allocation: exploiting the registers for storing values of variables and temporaries,

instruction scheduling: reordering instruction sequences to exploit intraprocessor parallelism.

◆□▶ ◆□▶ ◆□▶ ◆□▶ □ のQ@

### Complexity

Many subproblems in the compiler backend are complex: Early results: Bruno&Sethi[1976]: generation of optimal code for straight-line programs and 1-register machine is NP-complete Garey&Johnson[1979]: Instruction scheduling, even for very simple target machines, is NP-hard. What makes the difference in code generation?

input: straight-line programs w/o common subexpressions machine model: register constraints, e.g., interchangeable registers or not, operations on register pairs or not

Common subexpressions need directed acyclic graphs (DAGs). Code generation for expression trees has efficient solutions.

## Phase Ordering Problem

Issues:

- Software Complexity
- Result Quality
- Order in Serialization

▲□▶ ▲□▶ ▲□▶ ▲□▶ □ □ の Q @

## Code Selection

Task: Select (best) instruction sequences for a program.

- Control statements translated as for abstract machines,
- Procedure organisation same as on abstract machines,
- Expressions, variable and data structure access many different translations.

Expressions (without common subexpressions) to be translated into (locally) optimal code according to some cost measure.

◆□▶ ◆□▶ ◆□▶ ◆□▶ □ のQ@

## An Example CISC Architecture, the Motorola 68000

- 8 Data registers,
- 8 Address registers,
- many addressing modes,
- 2-address machine, i.e., two operand locations in each instruction, one is also the result location,

#### ADD D1, D2

adds the contents of registers D1 and D2 and stores the result in D2.

most instructions are scalable to byte (.B), word (.W), double word (.L) operands.

## Addressing Modes

- $D_n$  Data register direct:  $cont(D_n)$ .
- $A_n$  Address register direct:  $cont(A_n)$ .
- $(A_n)$  Address register indirect:  $St(cont(A_n))$ .
- ►  $d(A_n)$  Address register indirect with address distance:  $St(cont(A_n) + d)$  with 16-Bit-constant d.
- ► d(A<sub>n</sub>, I<sub>x</sub>) Address register indirect with Index and Address distance: St(cont(A<sub>n</sub>) + cont(I<sub>x</sub>) + d) with A<sub>n</sub> used as base register, I<sub>x</sub> index register (either address or data register), 8-Bit-distance d.
- ► x Absolute short: *St*(*x*) with 16-Bit-constant *x*.
- x Absolute long: St(x) with 32-Bit-constant x.
- ► #x Immediate: x.

## **Execution Times**

|               | Addressing mode                                              | Byte, Word | Double Word |
|---------------|--------------------------------------------------------------|------------|-------------|
| Dn            | Data register direct                                         | 0          | 0           |
| An            | Address register direct                                      | 0          | 0           |
| $(A_n)$       | Address register indirect                                    | 4          | 8           |
| $d(A_n)$      | Address register indirect with<br>Address distance           | 8          | 12          |
| $d(A_n, I_x)$ | Address register indirect with Index<br>and Address distance | 10         | 14          |
| x             | Absolute short                                               | 8          | 12          |
| x             | Absolute long                                                | 12         | 16          |
| #x            | immediate                                                    | 4          | 8           |

## Alternative Code Sequences

Load a Byte into the lower quarter of data register D5, the address results from adding base register A1's content to the contents of the lower half of data register D1 and incrementing the result by 8. The execution time, 14 cycles, consists of the execution time for the operation proper, 4 cycles, and the execution time for the addressing, 10 cycles.

| MOVE.B 8(A1, D1.W), D5 | ADDA   | #8, A1        | costs: 16 | ADDA   | D1.W, A1      | costs: 8  |
|------------------------|--------|---------------|-----------|--------|---------------|-----------|
| total costs 8          | ADDA   | D1.W, A1      | costs: 8  | MOVE.B | 8(A1), D5     | costs: 12 |
|                        | MOVE.B | (A1), D5      | costs: 8  |        | total costs 2 | 0         |
|                        |        | total costs 3 | 32        |        |               |           |

・ロト ・ 日 ・ ・ ヨ ・ ・ 日 ・ うらつ

### Code Sequences for b := 2 + a[i]

b, i integer variables, a: array[1 ..10] of integer. a, b, i in the same frame addressed by address register A5, Relative addresses:  $b \mapsto 4$ ,  $i \mapsto 6$ ,  $a \mapsto 8$ . The code for addressing a[2] computes: A5 + 8 + value(i) \* 2

| A1 costs 4    |
|---------------|
| A1 costs 12   |
| ), D1 costs 8 |
| D1 costs 44   |
| A2 costs 4    |
| A2 costs 12   |
| A2 costs 8    |
| ), D2 costs 8 |
| D2 costs 4    |
| A3 costs 4    |
| A3 costs 12   |
| (A3) costs 8  |
| l costs 128   |
|               |

## An Example RISC Architecture, the MIPS

 RISC microprocessor architecture developed by John L. Hennessy at Stanford University in 1981

・ロト ・ 日 ・ ・ ヨ ・ ・ 日 ・ うらつ

- no interlocked pipeline stages
- Load/Store-Architecture (R3000)
- 32 registers
- $2^{30}$  memory words =  $2^{32}$  bytes
- ▶ Still used: Playstation Portable, PS2, etc.

### Instruction Set (MIPS R3000) Arithmetic<sup>.</sup> add \$1. \$2. \$3 sub \$1, \$2, \$3 addi \$1, \$2, CONST Data Transfer: Iw \$1, CONST(\$2) sw \$1, CONST(\$2) Cond. Branch: beg \$1, \$2, CONST **Unconditional Jumps:** i CONST ▶ jr \$1 ial CONST Logical operations: Bitwise Shift, etc. Pseudoinstructions: Translated into real instructions before assembly: bgtz Label (branch greater than), etc.

◆□▶ ◆□▶ ◆□▶ ◆□▶ □ のQ@

## Example Code

| if $(x <= 0)$ |      | bgtz \$1 el        |
|---------------|------|--------------------|
| y = x + 1;    |      | addi \$2, \$1, 1   |
| else          |      | j end              |
| x = y + x;    | el:  | addi \$1, \$2, \$1 |
|               | end: |                    |
| Assuming      |      |                    |

◆□▶ ◆□▶ ◆臣▶ ◆臣▶ 臣 のへぐ

x in \$1 and y in \$2

## Looking for a Description Mechanism

Several compilation subtasks

- can be formally described and
- ▶ their implementation can be automatically generated.

Examples:

| compilation<br>subtask                    | description<br>forma-<br>lism                                    | acceptor                                     | desired<br>output                                    | algorithmic<br>aspects                                                              | properties                                                                                                                     |  |
|-------------------------------------------|------------------------------------------------------------------|----------------------------------------------|------------------------------------------------------|-------------------------------------------------------------------------------------|--------------------------------------------------------------------------------------------------------------------------------|--|
| lexical<br>analysis<br>syntax<br>analysis | regular<br>expressi-<br>ons<br>context-<br>free<br>gram-<br>mars | finite au-<br>tomata<br>pushdown<br>automata | final<br>states<br>syntax<br>trees, de-<br>rivations | r.e. → nfa,<br>nfa → dfa,<br>minimizati-<br>on<br>(determ.)<br>parser<br>generation | equivalences,<br>closure<br>properties,<br>decidabilities<br>non-equiv. of<br>det. and non-<br>det. pda, un-<br>decidabilities |  |
|                                           |                                                                  |                                              |                                                      |                                                                                     |                                                                                                                                |  |

| compilation         | description                  | acceptor                | desired                           | algorithmic                                                                          | properties                                                                  |
|---------------------|------------------------------|-------------------------|-----------------------------------|--------------------------------------------------------------------------------------|-----------------------------------------------------------------------------|
| subtask             | formalism                    |                         | output                            | aspects                                                                              |                                                                             |
| lexical<br>analysis | regular ex-<br>pressions     | finite au-<br>tomata    | final<br>states                   | r.e. → nfa,<br>nfa → dfa,<br>minimizati-<br>on                                       | equivalences,<br>closure pro-<br>perties,<br>decidabili-                    |
| syntax<br>analysis  | context-<br>free<br>grammars | pushdown<br>automata    | syntax<br>trees, de-<br>rivations | (determ.)<br>parser<br>generation                                                    | ties<br>non-equiv.<br>of det. and<br>non-det.<br>pda, undeci-<br>dabilities |
| code<br>selection   | regular<br>tree<br>grammars  | finite tree<br>automata | derivations                       | $\begin{array}{rcc} rtg & \mapsto & fta, \\ fta & \mapsto & bu- \\ dfta \end{array}$ | closure pro-<br>perties, de-<br>cidabilities                                |

## Machine Description

- Input to Code Selector Generator,
- Regular Tree Grammar, terminals from the program representation, non-terminals represent machine resources,
- Often ambiguous,
- Each rule has associated costs,
- Factorization of addressing modes reduces size.



## Generated Code Selector

- Parses intermediate representations (IR) of programs,
- Computes derivations according to "machine grammar", each corresponding to one instruction sequence,
- Has to select cheapest derivation, corresponding to (locally) cheapest code sequence
- May compute costs in states or use dynamic programming.

◆□▶ ◆□▶ ◆□▶ ◆□▶ □ のQ@

### Tree Languages

Alphabet with arity is a finite set Σ of operators together with a function ρ : Σ → N<sub>0</sub>, arity.

$$\triangleright \Sigma_k = \{a \in \Sigma \mid \rho(a) = k\}.$$

The homogeneous tree language over Σ is the following inductively defined set T(Σ):

.

• 
$$a \in T(\Sigma)$$
 for all  $a \in \Sigma_0$ ;

• Are 
$$b_1, \ldots, b_k$$
 in  $T(\Sigma)$  and is  $f \in \Sigma_k$ , so is  $f(b_1, \ldots, b_k) \in T(\Sigma)$ .

Example:

$$\begin{split} \Sigma &= \{a, cons, nil\},\\ \rho(a) &= \rho(nil) = 0, \ \rho(cons) = 2.\\ \text{Some trees over } \Sigma:\\ a, \ cons(nil, \ nil), \ cons(cons(a, nil), nil) \end{split}$$

### Patterns, Substitutions

V infinite set of variables (arity 0).

- $p \in T(\Sigma \cup V)$  is called a **pattern** over  $\Sigma$ ,
- p is linear if no variable occurs twice in p.
- A Substitution  $\Theta$  maps variables to patterns,  $\Theta: V \to T(\Sigma \cup V).$

• 
$$\Theta$$
 extended to  $\Theta$ :  $T(\Sigma \cup V) \rightarrow T(\Sigma \cup V)$  by  
 $t\Theta = x\Theta$ , if  $t = x \in V$  and  
 $t\Theta = a(t_1\Theta, \dots, t_k\Theta)$ , if  $t = a(t_1, \dots, t_k)$ .  
Let  $V = \{X\}$ .  
 $X$ ,  $cons(nil, X)$ ,  $cons(X, nil)$  are patterns over  $\Sigma$ .

## **Regular Tree Grammars**

Regular Tree Grammar (RTG)  $G = (N, \Sigma, P, S)$  consists of

- N, finite set of non-terminals,
- Σ, finite alphabet (with arity) of terminals (operators labeling nodes)
- ► *P*, finite set of rules  $X \to s$  where  $X \in N$  and  $s \in T(\Sigma \cup N)$ ,
- $S \in N$ , the start symbol.

Notions:

- $p: X \rightarrow Y$  chain rule,
- ▶ p: X → s has type (X<sub>1</sub>,...,X<sub>k</sub>) → X, if j-th occurrence of a non-terminal in s (counted from the left) is X<sub>j</sub>.
- ▶  $\tilde{s}$  results from *s* by replacing non-terminal  $X_j$  by variable  $x_j$ .

Why "Regular"?

- Path words form a regular word language,
- Regular tree languages are closed under union, intersection, and complement,

Emptiness and therefore containment are decidable.

Example: Lists

• 
$$G_1 = (N_1, \Sigma, P_1, L)$$
  
•  $\Sigma = \{a, cons, nil\}$   
where  $\rho(a) = \rho(nil) = 0, \rho(cons) = 2$   
•  $N_1 = \{E, L\}$  and  
•  $P_1 = \{L \rightarrow nil, L \rightarrow cons(E, L), E \rightarrow a\}$ 

 $L(TG_1)$  is the language of linear lists of *a*'s including the empty list, i.e.  $L(G_1) = \{nil, cons(a, nil), cons(a, cons(a, nil)), \ldots\}$ .

## Example: Machine Grammar

• 
$$G_m = (N_m, \Sigma, P_m, REG);$$
  
•  $\Sigma = \{const, m, plus, REG\}$   
where  $\rho(const) = 0; \rho(m) = 1, \rho(plus) = 2,$   
•  $N_m = \{REG\}$   
•  $P_m = \{addmc : REG \rightarrow plus(m(const), REG), addm : REG \rightarrow plus(m(REG), REG), add : REG \rightarrow plus(REG, REG), ldmc : REG \rightarrow m(const), ldc : REG \rightarrow const, ld : REG \rightarrow REG\}$ 

▲□▶ ▲課▶ ▲理▶ ★理▶ = 目 - の��

 $G_m$  describes a subset of an instruction set of a simple processor, rules are marked with names of instructions.

The first three instructions add

- ▶ the contents of a memory cell, whose address is given by a constant,
- ▶ the contents of a memory cell, whose address is in a register, resp.,
- the contents of a register

to the contents of a register and put the result into a register.

The last three instructions load into a register:

▶ the contents of a memory cell whose address is given by a constant,

◆□▶ ◆□▶ ◆□▶ ◆□▶ □ のQ@

- a constant, and
- the contents of a register, resp.

## Example Derivations



◆□▶ ◆□▶ ◆臣▶ ◆臣▶ 三臣 - のへで

### **Derivation Tree**

An X-derivation tree for tree  $t \in T(\Sigma \cup N)$  according to tree grammar G is a tree  $\psi \in T(P \cup N)$ , such that

▶ Is 
$$\psi \in N$$
, then  $\psi = X = t$ .

▶ Is  $\psi \notin N$ , then  $\psi = p(\psi_1, ..., \psi_k)$  for a rule  $p: X \to s \in P$  of type  $(X_1, ..., X_k) \to X$ , such that  $t = \tilde{s}\{x_1/t_1, ..., x_k/t_k\}$  and  $\psi_i$  are  $X_i$ -derivation trees for the  $t_i$ .





The generated language

# $L(TG) = \{t \in T(\Sigma) \mid \exists \psi \in T(P \cup N) : \psi \text{ is } S \text{-derivation tree for } t\}.$

▲□▶ ▲□▶ ▲□▶ ▲□▶ ▲□ ● ● ●

## The Tree Analysis Problem

- ► An instance of the tree analysis problem consists of an RTG G and a tree t.
- A solution consists of the set of all derivation trees of t according to G,
- ► A **Tree Analyzer** for *G* solves the tree analysis problem for *G* and all its trees,

▲日 ▶ ▲ □ ▶ ▲ □ ▶ ▲ □ ▶ ● ● ● ● ●

 A Tree Analyzer Generator generates a tree analyzer for each RTG.

### Finite Tree Automata, Intuition

Generalization of finite word automata to trees,

- ► Transitions  $(q, a, q_1, ..., q_k)$ , where  $a \in \Sigma_k$ , q state at node *n* labeled *a*,  $q_1, ..., q_k$  state at children of *n*,
- Non-deterministic automaton "guesses" computations in any order (like a puzzle).

・ロト ・ 日 ・ ・ ヨ ・ ・ 日 ・ うらつ

#### Traversal strategies,







top down:



▲ロト ▲御ト ▲ヨト ▲ヨト 三ヨ - のへの

## Finite Tree Automaton (FTA)

- $A = (Q, \Sigma, \delta, Q_F)$ , where
  - Q, finite set of states,
  - $Q_F \subseteq Q$ , final states,
  - Σ, input alphabet (with arity),
  - $\delta \subseteq \bigcup_{j\geq 0} Q \times \Sigma_j \times Q^j$ , transition relation.
  - A is top down deterministic, if
    - exactly one final state, and
    - At most one transition (q, a, q<sub>1</sub>..., q<sub>k</sub>) ∈ δ for all a and q.
  - A is bottom up deterministic, if at most one transition (q, a, q<sub>1</sub>...q<sub>k</sub>) ∈ δ for all a and all q<sub>1</sub>,..., q<sub>k</sub>. In this case, we write δ as partial function:
     δ : ⋃<sub>j≥0</sub> Σ<sub>j</sub> × Q<sup>j</sup> → Q

### Computation

- A annotates the nodes with states; hence new alphabet Σ × Q = {⟨a, q⟩ | a ∈ Σ, q ∈ Q}, where ρ(⟨a, q⟩) = ρ(a).
- *q*-computation  $\phi$  of *A* on tree  $t = a(t_1, \ldots, t_m)$ : a tree  $\langle a, q \rangle (\phi_1, \ldots, \phi_m) \in T(\Sigma \times Q)$ , where  $\phi_j$  are  $q_j$ -computations for the  $t_j, j = 1, \ldots, m$ ,  $(q, a, q_1 \ldots q_m)$  is a transition.
- ▶ Is  $q \in Q_F$ , then  $\phi$  is accepting.
- The language L(TA) consists of the trees with accepting computations.
- A state resp. transition is superfluous if it does not occur in any accepting computation.

## Example Computation

DFTA 
$$A_b = (Q_b, \Sigma_b, \delta_b, Q_{F,b})$$
 with  
states  $Q_b = \{q_e, q_o\}$ ,  
alphabet  $\Sigma_{b,0} = \{c\}$  and  $\Sigma_{b,2} = \{a\}$ ,  
final states  $Q_{F,b} = \{q_e\}$   
transitions:  $\delta_b = \{(q_o, c) \\ (q_e, a, q_o, q_o) \\ (q_o, a, q_e, q_o) \\ (q_e, a, q_e, q_e) \\ (q_e, a, q_e, q_e)\}$   
Accepts trees with even number of  $c$ 's.

▲□▶ ▲圖▶ ▲圖▶ ▲圖▶ ▲□▶

Tree and  $q_e$ -computation



◆□▶ ◆□▶ ◆臣▶ ◆臣▶ 臣 のへぐ

## Determinism - Non-determinism

- Bottom up NFTAs and Top down NFTAs are equivalent,
- Bottom up DFTAs and Top down DFTAs are not equivalent; example language cannot be recognized by top down DFTA.
- NFTAs are equivalent to bottom up DFTAs (powerset construction).

(Bottom up) DFTA:

- At most one computation for each tree,
- At most one state at each node,
- $\delta$  extended to a partial function  $\delta : T(\Sigma) \to Q$  by:  $\delta(t) = \delta(a, \delta(t_1) \dots \delta(t_k))$ , if  $t = a(t_1, \dots, t_k)$ .
- $\delta(t) = q$  iff there is a q-computation for t.

## Generating Tree Parsers

The generation (and the explanation) process: Input: G

- 1. Generate NFTA  $A_G$ ,
- 2. Apply powerset construction to obtain DFTA P(G).

◆□▶ ◆□▶ ★□▶ ★□▶ □ のQ@

Later: Consider variant with costs.

## How $A_G$ Works

### $A_G$

 tries to cover the given tree with right sides of productions (like a puzzle),

・ロト ・ 日 ・ ・ 日 ・ ・ 日 ・

3

does reductions to check whether neighbouring rules fit.



## States and transitions of $A_G$

- States: "Tree Items", subtrees of right sides, interpretation: what has been analyzed so far.
- Transitions: analyze next generation of a right side,



What about "complete items", i.e. full right sides s? do reduction in the same step, i.e., new state is X, not s.

$$\begin{array}{l} A_G, \mbox{ Definition} \\ A_G = (Q_G, \Sigma, \delta_G, \{S\}), \mbox{ where} \\ \blacktriangleright \ Q_G = N \cup \{s' \mid \exists (X \rightarrow s) \in \\ P, \mbox{ where } s' \mbox{ is proper subtree of } s\}. \\ \blacktriangleright \ Transition \ relation \ \delta_G: \ transitions \ of \ the \ forms \\ \{(s, a, s_1 \dots s_k) \mid s = a(s_1, \dots, s_k) \in Q_G\} \ \mbox{and} \\ \{(X, a, s_1 \dots s_k) \mid \exists (X \rightarrow s) \in P \ \mbox{and} \\ s = a(s_1, \dots, s_k)\}. \end{array}$$

◆□▶ ◆□▶ ◆臣▶ ◆臣▶ 臣 の�?

Problem with chain rules:

A<sub>G</sub> would have to "step on the spot" doing chain reductions.
 However, A<sub>G</sub> has to consume at least one terminal per step,

▲日 ▶ ▲ □ ▶ ▲ □ ▶ ▲ □ ▶ ● ● ● ● ●

#### • Chain reductions are precomputed and integrated into $\delta$ .

$$\begin{array}{lll} \delta_{G} := & \{(s, a, s_{1} \dots s_{k}) \mid s = a(s_{1}, \dots, s_{k}) \in Q_{G}\} & \cup & (\text{proper transition}) \\ & \{(X, a, s_{1} \dots s_{k}) \mid \exists (X' \to s) \in P : & (\text{reduction}) \\ & \exists X \text{-derivation tree for } X' \text{ and } s = a(s_{1}, \dots, s_{k})\} & (\text{chain rules}) \end{array}$$

## Example $A_{G_m}$

 $A_{G_m} = (Q_{G_m}, \Sigma_{G_m}, \delta_{G_m}, Q_{F,G_m})$  for  $G_m$  has the state set  $Q_{G_m} = \{const, REG, m(const), m(REG)\}$ and the transitions  $\delta_{G_m} = \{ (const, const, \epsilon) \}$  $(REG, const, \epsilon)$  $(REG, REG, \epsilon)$ (m(const), m, const)(REG, m, const) (m(REG), m, REG)(*REG*, *plus*, *m*(*const*) *REG*) (REG, plus, m(REG) REG)(*REG*, *plus*, *REG REG*)}

・ロト ・ 日 ・ ・ ヨ ・ ・ 日 ・ うらつ

## Example Computation of $A_{G_m}$



## Properties

G RTG and t input tree.

There exists an X-derivation tree for t according to G iff there exists an X-computation for t in A<sub>G</sub>. In particular: L(G) = L(A<sub>G</sub>).

◆□▶ ◆□▶ ★□▶ ★□▶ □ のQ@

## Principle of the Powerset Construction

#### Finite Word Automata:

 $\exists \text{ old states } q_1, q_2 \text{ and word } w \text{ such that} \\ (q_0, w) \vdash_M^* (q_1, \varepsilon) \text{ and } (q_0, w) \vdash_M^* (q_2, \varepsilon) \\ \Longrightarrow \exists \text{ new state } Q \text{ such that } q_1, q_2 \in Q \text{ and} \\ \delta_d^*(q_d, w) = Q$ 

Finite Tree Automata:

$$\exists \text{ old states } q_1, q_2 \text{ and tree } t \text{ such that} \\ \exists q_1 - \text{ and } q_2 - \text{ computations for } t \\ \Longrightarrow \exists \text{ new state } B \text{ such that } q_1, q_2 \in B \text{ and} \\ \delta_p(t) = B$$

◆□▶ ◆圖▶ ★ 国▶ ★ 国▶ - 国 - のへで

## Word automata



new state Q such that  $\delta(P, a) = Q$ 

◆□▶ ◆□▶ ◆臣▶ ◆臣▶ 三臣 - のへで

Tree automata



◆□▶ ◆□▶ ◆臣▶ ◆臣▶ 臣 のへぐ

## Powerset Construction

Powerset automaton P(A) is built iteratively,  $Q_p^{(n)}$  and  $\delta_p^{(n)}$  occur in computations on trees of height  $\leq n - 1$ . Let  $A = (Q, \Sigma, \delta, Q_F)$  be a NFTA. Its powerset automaton is the DFTA  $P(A) = (Q_p, \Sigma, \delta_p, Q_{p,F})$ , where

► 
$$Q_p = 2^Q$$
,

$$\blacktriangleright \ Q_{p,F} := \{ B \in Q_p \mid B \cap Q_F \neq \emptyset \},\$$

states and transitions are computed in the iteration:

$$Q_p := \bigcup_{n \ge 0} Q_p^{(n)}$$
 and  $\delta_p := \bigcup_{n \ge 0} \delta_p^{(n)}$ , where

$$\blacktriangleright \quad Q_p^{(0)} = \emptyset;$$

► Be n > 0. For  $a \in \Sigma_k$  and  $B_1, \ldots, B_k \in Q_p^{(n-1)}$  let  $B := \{q \in Q \mid \exists q_1 \in B_1, \ldots, q_k \in B_k : (q, a, q_1 \ldots q_k) \in \delta\}.$ Is  $B \neq \emptyset$ , then  $B \in Q_p^{(n)}$  and  $(B, a, B_1 \ldots B_k) \in \delta_p^{(n)}.$ 

ション ふゆ くち くち くち くち

## The Powerset Construction on Tree Parsers



◆□▶ ◆□▶ ◆臣▶ ◆臣▶ 三臣 - のへで

### Example

The powerset automaton for  $A_{G_m}$  has state set  $Q_{G_m} = \{q_1, q_2, q_3, q_4\}$  where  $q_1 = \{REG\}$  $q_2 = \{const, REG\}$  $q_3 = \{m(REG)\}$  $q_4 = \{m(const), REG, m(REG)\}$ and transition function  $\delta_{G_m}$ : operator children state(s) state REG  $q_1$ ε const ε  $q_2$ q<sub>3</sub> m  $q_1$  $q_4$ m  $q_2$ plus  $q_1$  $q_1$  $q_1$ plus  $q_1$  $q_4$  $q_1$ plus  $q_1$  $q_3$  $q_1$ ・ロト ・ 日 ・ モー・ モー・ うへぐ

### Properties

1. For each  $t \in T(\Sigma)$ :

▶ Is  $\delta_p(t)$  defined, then  $\delta_p(t) = \{q \mid \exists q$ -computation on  $t\}$ .

- Is δ<sub>p</sub>(t) undefined, then there is no q ∈ Q with a q-computation of A for t.
- ►  $\delta_p(t) \cap N = \{X' \in N \mid \exists X' \text{-derivation tree for } t\}.$
- 2. L(A) = L(P(A)).
- 3. For each state  $B \in Q_r$  there exists a tree t, such that  $\delta_p(t) = B$ .

## Adding Costs

- ▶ Rules have cost functions, i.e. costs of the instruction,
- Translated into cost functions for the transitions of the NFTA,

◆□▶ ◆□▶ ★□▶ ★□▶ □ のQ@

 Deterministic bottom up automaton constructs cheapest derivations.

- ▶ Rule *p* of type  $(X_1, ..., X_k) \to X$  gets *k*-place function  $C(p) : \mathbb{N}_0^k \to \mathbb{N}_0$
- C extended to derivation trees  $\psi$ .  $\psi = X \in N$ , then  $C(\psi) := 0$ .  $\psi = p(\psi_1, \dots, \psi_k)$ , then  $C(\psi) := C(p)(C(\psi_1), \dots, C(\psi_k))$ .

・ロト ・ 日 ・ ・ ヨ ・ ・ 日 ・ うらつ

- C is monotone, if for all  $p \in P$ , C(p) is monotone,
- C is additive, if for all  $p \in P$ , C(p) has the form  $C(p) = c_p + x_1 + \cdots + x_k, c_p \in \mathbb{N}_0.$

From cost annotation C of grammar G to cost annotation  $C^*$  of automaton  $A_G$ .

- Assume an additive cost measure. Costs can be described by a constant, i.e. C is a function from  $P \rightarrow \mathbb{N}_0$ .
- Define C\* as
  - For  $\tau = (s, a, s_1 \dots s_k)$  where  $s = a(s_1, \dots, s_k)$ ,  $C^*(\tau) := 0$ .
  - For τ = (X, a, s<sub>1</sub>...s<sub>k</sub>), let C<sup>\*</sup>(τ) be the minimal costs of an X-derivation tree for a(s<sub>1</sub>...s<sub>k</sub>).

 Extend cost function of automaton to cost function for computations.

## Extracting Cheapest Derivations

Extract cheapest computations of A from computations of  $A_r$  as follows:

- 1. Tabulate for each node  $\langle a, B \rangle$  of a computation  $\phi$  of the powerset automaton  $P(A_r)$  the costs  $c_q$  and the transitions  $d_q$  for all  $q \in B$ .
- 2.  $c_q$  are the costs of a cheapest q-computation of a given tree t, and

(日) (中) (日) (日) (日) (日) (日)

 $d_a$  are the chosen transitions of A.

## Integrated Cost Computation

- Assume that the set of cheapest X-derivations has differences bounded by a constant (realistic).
- Integrate the (finitely many) cost differences into the states of the subset automaton.

・ロト ・ 日 ・ ・ ヨ ・ ・ ヨ ・ ・ つ へ つ

Computed cost for state q is the difference between the cheapest q-computation and the cheapest computation.