In this series of posts, we will create a very simple language that can be used to specify a state machine. Then we will write a compiler that will translate the description to a code that can simulate this state machine.
Though the titles say State Machine
, we are actually talking about a Turing
machine
, which is more powerful that finite state machines
Here is the agenda:
Part 1: What is a state machine? A (Rust) program to simulate state machine.
Part 2: Writing (Rust) macros to reduce repetition
Part 3: A small language and convert a description into a high level language(Rust) implementation
Proposed Part 4: Translating the description into a lower level
assembly like language(LLVM IR)
This is part 1.
Background
Today computers have replaced all kind of devices: Calculator, Music Players, Radio, Video streaming, Chip Design, Physics Simulation etc.
The general approach is to write a software(program) for each of these purposes and run them on a general purpose hardware.
Can a program be written for all kind of application ? A program for self driving cars, a music generation program, a program to discover new medicine given the cancer description? Is there even a problem/application for which no program is possible?
~100 years ago, the situation was even more bleak. There was no computer as we know today, rather a “computer” was a person, usually a woman, who will perform calculations using pen and paper and maybe take help from mechanical calculators.
Mathematicians were posing a more abstract but related problem: Is there an algorithm which can prove or disprove a mathematical statement? 1
Note: What is the relation between a music player program and a mathematical statement proving algorithm?
When working in mathematical logic, one needs to define the terms i.e. what is an algorithm?
Two people independently formalised the notion of algorithm: Alonzo church and Alan Turing.
We will focus on Turing’s approach.
TODO - Summarize turing’s approach in an approachable manner, without going into the technical details. See More
Simplified Version
The paper is concerned with an abstract and general problem requiring a lot of formalism. However we can tradeoff some formalism with intuitive reasoning.
Let’s see how a human computer solves a problem using an algorithm. The human has access to a paper and pencil, and her own mind.
The person will state the problem on paper using pencil by writing some symbols, then having remembered the algorithm description in her memory, manipulates the symbols as mandated to reach at the next set of symbols and so on, until in the end one has the expected result in terms of symbol.
This problem can be multiplying two numbers, calculating prime factors, or a complicated problem like building a music player, if one breaks it in terms of subroutines of bit manipulation.
What is a music player in terms of bits in and bits out? An encoded music file(mp3 etc) is a stream of bits. The output of a music player is the mixture of frequencies, where each frequency is a real number represented using bits.
What happens on hardware level to convert these frequencies into sound?
Turing gives an abstract version:
- A tape divided into squares: this can be used as paper. We are used to the notion of a two dimensional paper but that is just for our convenience.
- A head: the location on tape which can be read and written. The head can move towards any of the two ends of tape.
- Internal state logic: The logic of algorithm can be encoded in this device. This will control the state transition and the tape head movement. This is like having remembered the multiplication algorithm and then manipulating symbols by moving pencil here and there.
A number is said to be computable if one can construct a state machine that will output the number on the tape in finite number of steps. Since one can write any number in a bit format (i.e. in base 2), computing sequences of 0 and 1s is similar in spirit to computing numbers.
Are all rational numbers computable? What about irrational numbers?
Let’s take an example. Suppose we want to calculate a number of the form:
0101010101...
Here is the description of state(tape) machine that will print this sequence:
- Start with state a.
- If the state is a, irrespective of content of tape, write 0 on current head and change internal state to b.
- If the internal state is b and current head reads 0 then move the head to right and write 1 on tape. The internal state doesn’t change.
- If the internal state is b and current head reads 1 then move the head to right and change internal state to a.
- We are back at step 2.
Here is how this plays out:
Step 1:1
2
3
4
Internal state: a
Tape: | | | | | |...
Head: ^
Logic: Write 0 and transition to b
1
2
3
4
Internal state: b
Tape: |0| | | | |...
Head: ^
Logic: Move right and write 1
1
2
3
4
Internal state: b
Tape: |0|1| | | |...
Head: ^
Logic: Move right and transition to a
1
2
3
4
Internal state: a
Tape: |0|1| | | |...
Head: ^
Logic: Write 0 and transition to b
1
2
3
4
Internal state: b
Tape: |0|1|0| | |...
Head: ^
Logic: Move right and write 1
You get the idea. Here is more concise way of writing the same logic:
Current State | Required Current Tape Head Content | Action | Next State |
a | None | P(0) | b |
b | 0 | R-P(1) | b |
b | 1 | R | a |
Problem Motivation
The paper presents another intereseting sequence that one can calculate by this setup:
0 01 011 0111 0111 ...
Here is the table for that:
Current State | Tape Head Content | Action | Next State |
b | _(any) | P(e)-R-P(e)-R-P(0)-R-R-P(0)-L-L | o |
o | 1 | R-P(x)-L-L-L | o |
o | 0 | X | q |
q | 0 or 1 | R-R | q |
q | X(empty) | P(1)-L | p |
p | x | P(X)-R | q |
p | e | R | f |
p | X(empty) | L-L | p |
f | _(any) | R-R | f |
f | X(empty) | P(0)-L-L | o |
Now verifying the output of such a big table by hand is going to be cumbersome. How about we implementing a program to simulate this? The logic is simple: Depending on current state and current head content perform actions and change state. Keep repeating this for required number of steps and print the tape content in the end.
I implemented this logic in rust, since i’ve been trying to learn it recently:
Full Code
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
use std::fmt;
use std::io;
// Internal states
#[derive(Debug, PartialEq, Eq)]
enum TapeMachineState {
b,
o,
q,
p,
f,
}
// Allowed symbols
#[derive(Debug, PartialEq, Eq, Clone)]
enum TapeMachineSymbol {
Symbol0,
Symbol1,
Symbole,
Symbolx,
SymbolX,
}
// For printing purposes
impl TapeMachineSymbol {
fn as_str(&self) -> &'static str {
match self {
TapeMachineSymbol::Symbol0 => "0",
TapeMachineSymbol::Symbol1 => "1",
TapeMachineSymbol::Symbole => "e",
TapeMachineSymbol::Symbolx => "x",
TapeMachineSymbol::SymbolX => "X",
}
}
}
// Internal State + Tape + Head
struct TapeMachine<'a> {
state: &'a TapeMachineState,
result: &'a mut Vec<TapeMachineSymbol>,
index: usize,
}
impl<'a> TapeMachine<'a> {
pub fn new(state: &'a TapeMachineState, result: &'a mut Vec<TapeMachineSymbol>) -> Self {
Self {
state,
result,
index: 0,
}
}
// Allowed Actions
// Print
fn p(&mut self, symbol: TapeMachineSymbol) {
self.result[self.index] = symbol;
}
// Move right
fn r(&mut self) {
self.index += 1;
}
// Move left
fn l(&mut self) {
self.index -= 1;
}
}
fn main() {
println!("Enter the number of steps:");
let mut steps_input = String::new();
io::stdin().read_line(&mut steps_input).unwrap();
let steps: usize = steps_input.trim().parse().unwrap();
println!("Enter the total tape length:");
let mut tape_length_input = String::new();
io::stdin().read_line(&mut tape_length_input).unwrap();
let max_len: usize = tape_length_input.trim().parse().unwrap();
// Initialise tape machine
let mut result = vec![TapeMachineSymbol::SymbolX; max_len];
let mut tape_machine = TapeMachine::new(&TapeMachineState::b, &mut result);
// Simulate for required number of steps
for i in 0..steps {
println!(
"Step: {} State: {:?} Symbol: {:?}",
i,
tape_machine.state,
tape_machine.result[tape_machine.index]
);
match (tape_machine.state, &tape_machine.result[tape_machine.index]) {
(TapeMachineState::o, TapeMachineSymbol::Symbol1) => {
tape_machine.r();
tape_machine.p(TapeMachineSymbol::Symbolx);
tape_machine.l();
tape_machine.l();
tape_machine.l();
tape_machine.state = &TapeMachineState::o;
println!("Final State: {:?}", TapeMachineState::o);
}
(TapeMachineState::o, TapeMachineSymbol::Symbol0) => {
// X means do nothing
tape_machine.state = &TapeMachineState::q;
println!("Final State: {:?}", TapeMachineState::q);
}
(
TapeMachineState::q,
TapeMachineSymbol::Symbol0 | TapeMachineSymbol::Symbol1,
) => {
tape_machine.r();
tape_machine.r();
tape_machine.state = &TapeMachineState::q;
println!("Final State: {:?}", TapeMachineState::q);
}
(TapeMachineState::q, TapeMachineSymbol::SymbolX) => {
tape_machine.p(TapeMachineSymbol::Symbol1);
tape_machine.l();
tape_machine.state = &TapeMachineState::p;
println!("Final State: {:?}", TapeMachineState::p);
}
(TapeMachineState::p, TapeMachineSymbol::Symbolx) => {
tape_machine.p(TapeMachineSymbol::SymbolX);
tape_machine.r();
tape_machine.state = &TapeMachineState::q;
println!("Final State: {:?}", TapeMachineState::q);
}
(TapeMachineState::p, TapeMachineSymbol::Symbole) => {
tape_machine.r();
tape_machine.state = &TapeMachineState::f;
println!("Final State: {:?}", TapeMachineState::f);
}
(TapeMachineState::p, TapeMachineSymbol::SymbolX) => {
tape_machine.l();
tape_machine.l();
tape_machine.state = &TapeMachineState::p;
println!("Final State: {:?}", TapeMachineState::p);
}
(TapeMachineState::f, TapeMachineSymbol::SymbolX) => {
tape_machine.p(TapeMachineSymbol::Symbol0);
tape_machine.l();
tape_machine.l();
tape_machine.state = &TapeMachineState::o;
println!("Final State: {:?}", TapeMachineState::o);
}
(TapeMachineState::b, _) => {
tape_machine.p(TapeMachineSymbol::Symbole);
tape_machine.r();
tape_machine.p(TapeMachineSymbol::Symbole);
tape_machine.r();
tape_machine.p(TapeMachineSymbol::Symbol0);
tape_machine.r();
tape_machine.r();
tape_machine.p(TapeMachineSymbol::Symbol0);
tape_machine.l();
tape_machine.l();
tape_machine.state = &TapeMachineState::o;
println!("Final State: {:?}", TapeMachineState::o);
}
(TapeMachineState::f, _) => {
tape_machine.r();
tape_machine.r();
tape_machine.state = &TapeMachineState::f;
println!("Final State: {:?}", TapeMachineState::f);
}
(_, _) => {
println!(
"State: {:?} Index: {:?} Symbol: {:?}",
tape_machine.state,
tape_machine.index,
tape_machine.result[tape_machine.index]
);
let binary_result: String = tape_machine
.result
.iter()
.map(|x| x.as_str())
.collect();
println!("{}", binary_result);
panic!("Invalid state reached");
}
}
}
let binary_result: String = tape_machine.result.iter().map(|x| x.as_str()).collect();
println!("{}", binary_result);
let clean_result: String = tape_machine
.result
.iter()
.filter(|&x| x != &TapeMachineSymbol::SymbolX)
.map(|x| x.as_str())
.collect();
println!("=========\n");
println!("{}", clean_result);
}
The crux of implementation is this loop:
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
for i in 0..steps {
println!("Step: {} State: {:?} Symbol: {:?}",
i, tape_machine.state, tape_machine.result[tape_machine.index]);
// Depending on the current state and current tape head content
match (tape_machine.state, &tape_machine.result[tape_machine.index]) {
(TapeMachineState::o, TapeMachineSymbol::Symbol1) => {
// Perform actions
tape_machine.r();
tape_machine.p(TapeMachineSymbol::Symbolx);
tape_machine.l();
tape_machine.l();
tape_machine.l();
tape_machine.state = &TapeMachineState::o;
println!("Final State: {:?}", TapeMachineState::o);
}
(TapeMachineState::o, TapeMachineSymbol::Symbol0) => {
// X means do nothing
tape_machine.state = &TapeMachineState::q;
println!("Final State: {:?}", TapeMachineState::q);
}
...
If you run the above code, for 1 000 steps and tape length set to 10 000, then you will get following cleaned up content( after removing empty spaces)
e0010110111011110111110111111011111110111111110111111111
If we ignore the first e
, we are actually on track to get this sequence.
Instructions to run the code
This is interesting. However, what if we want to test another state machine, i.e. one with different states and rules. We will have to rewrite the code again, which is a time taking process. The table description is pretty succinct. Can we generate simulation code with minimal description of state machine?
Rust Macros can generate code by matching patterns. Can we use them to generate the code for us? We do it in the next post.