mlatu-6

mlatu-6 is a concatenative esoteric programming language designed by User:Zhil based on the theory of concatenative calculus. It is a subset of mlatu.

It was later found that this language is closely related to Underload.

Language Overview

mlatu-6 is a term rewriting language consisting of 6 primitive combinators and quotations. A quotation is a sequence of terms (a term is either a combinator or a quotation) wrapped in parentheses. When a primitive appears directly after the appropriate number of quotations, a reduction can be made. The primitives are:

Command	Description
`+`	Copies the preceding quotation
`-`	Removes the preceding quotation
`<`	Removes a level of nesting from the preceding quotation
`>`	Adds a level of nesting to the preceding quotation
`,`	Concatenates the contents of the two preceding quotations into a new quotation
`~`	Swaps the two preceding quotations

Uppercase letters of the Latin alphabet are often used to denote an opaque sequence of terms. These can't be reduced further, even under strong reduction, and come in handy when you are trying to work out the stack effect of a combinator.

Usually programs are presented without whitespace, but interpreters may add a reduction to remove whitespace, so that it can be used to style programs.
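As a rough illustration of the six primitives, here is a minimal Python sketch of a weak, left-to-right reducer. The names `parse`, `show`, `reduce_weak` and `run` are made up for this example and do not come from any existing mlatu-6 interpreter. The sketch assumes that uppercase letters are opaque terms and that whitespace is simply skipped.

```python
ARITY = {"+": 1, "-": 1, "<": 1, ">": 1, ",": 2, "~": 2}

def parse(src):
    """Turn source text into nested lists; each quotation becomes a list."""
    levels = [[]]
    for ch in src:
        if ch == "(":
            levels.append([])
        elif ch == ")":
            if len(levels) == 1:
                raise ValueError("unbalanced ')'")
            quote = levels.pop()
            levels[-1].append(quote)
        elif not ch.isspace():
            levels[-1].append(ch)
    if len(levels) != 1:
        raise ValueError("unbalanced '('")
    return levels[0]

def show(terms):
    """Render nested lists back into mlatu-6 source text."""
    return "".join("(" + show(t) + ")" if isinstance(t, list) else t for t in terms)

def reduce_weak(terms, max_steps=10_000):
    """Reduce left to right without ever looking inside quotations."""
    done = []                     # inert prefix, plays the role of the stack
    todo = list(reversed(terms))  # terms still to visit, next one at the end
    steps = 0
    while todo:
        term = todo.pop()
        n = ARITY.get(term) if isinstance(term, str) else None
        ready = n is not None and len(done) >= n and all(
            isinstance(t, list) for t in done[-n:])
        if not ready:
            done.append(term)     # quotation, opaque letter, or stuck primitive
            continue
        steps += 1
        if steps > max_steps:
            raise RuntimeError("step limit reached")
        if term == "+":
            done.append(done[-1])
        elif term == "-":
            done.pop()
        elif term == ">":
            done.append([done.pop()])
        elif term == "<":
            todo.extend(reversed(done.pop()))
        else:
            top, below = done.pop(), done.pop()
            done.extend([below + top] if term == "," else [top, below])
    return done

def run(src, max_steps=10_000):
    """Parse, reduce weakly, and print back as source text."""
    return show(reduce_weak(parse(src), max_steps))

print(run("(A)+"))     # (A)(A)
print(run("(B)(A),"))  # (BA)
print(run("(B)(A)~"))  # (A)(B)
```

A primitive that does not have enough quotations in front of it is simply left in place, which mirrors how an inert term stays put in the rewriting view.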

Concatenative Theory

In the concatenative calculus, the stack possesses only 6 core capabilities: we can copy the top element, remove it, wrap it in a quote, unwrap it from its quote, concatenate the top two quotes, and swap the top two quotes. In fact, only 5 of these are truly core: with great effort, swapping can be constructed from the other 5 capabilities together with the ability of quotations to nest. This was not known when the language was created, and dropping the swap results in much more of a tarpit. You will occasionally hear people refer to this tarpit language as mlatu-5. Some of the marvelous creations below are only possible in mlatu-6, however, since mlatu-5 is a more restricted environment.

If we make a base of 6 combinators that each do exactly one of these capabilities, in the simplest way possible, and give them simple 1-character names, we end up with exactly mlatu-6. The choice of these 6 combinators is unique. This makes it an excellent language for studying concatenative calculus.

Actually, nested quotations also possess the ability to swap quotes around, which is why we can construct ~ from the other 5 combinators with this equivalence:

~ = >(->)(-)(>,>,+,<->,<)+,<
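The equivalence above can be checked mechanically. The following Python sketch is only an illustration: it assumes weak left-to-right evaluation on opaque quotations, and the helper names (`parse`, `show`, `run`, `SWAP`) are hypothetical.

```python
ARITY = {"+": 1, "-": 1, "<": 1, ">": 1, ",": 2, "~": 2}

def parse(src):
    """Turn source text into nested lists; each quotation becomes a list."""
    levels = [[]]
    for ch in src:
        if ch == "(":
            levels.append([])
        elif ch == ")":
            quote = levels.pop()
            levels[-1].append(quote)
        elif not ch.isspace():
            levels[-1].append(ch)
    return levels[0]

def show(terms):
    """Render nested lists back into mlatu-6 source text."""
    return "".join("(" + show(t) + ")" if isinstance(t, list) else t for t in terms)

def run(src):
    """Weak left-to-right reduction; uppercase letters are opaque."""
    done, todo = [], list(reversed(parse(src)))
    while todo:
        term = todo.pop()
        n = ARITY.get(term) if isinstance(term, str) else None
        if n is None or len(done) < n or not all(
                isinstance(t, list) for t in done[-n:]):
            done.append(term)  # nothing to reduce here, keep the term as is
        elif term == "+":
            done.append(done[-1])
        elif term == "-":
            done.pop()
        elif term == ">":
            done.append([done.pop()])
        elif term == "<":
            todo.extend(reversed(done.pop()))
        else:
            top, below = done.pop(), done.pop()
            done.extend([below + top] if term == "," else [top, below])
    return show(done)

# The swap-free program quoted in the article.
SWAP = ">(->)(-)(>,>,+,<->,<)+,<"

print(run("(B)(A)" + SWAP))  # (A)(B)
print(run("(B)(A)~"))        # (A)(B)
```

Note that the program only uses `>`, `-`, `+`, `,` and `<`, plus nested quotations.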

Evaluation Order

mlatu-6, unlike mlatu, allows reduction inside of quotations (so-called transparent quotations), a strategy called strong reduction. By removing the pattern matching from mlatu, mlatu-6 seems to have the Church-Rosser property, meaning that while there might be multiple reduction paths, any normal form they reach is unique.

So, for any given problem, there might be multiple reduction paths. Some of these paths might be infinite, even though finite paths exist. All finite paths should reduce to the same normal form, however.

mlatu on the other hand uses weak reduction, meaning we don't reduce inside of quotations. Some mlatu-6 interpreters might only offer weak reduction as well (and every mlatu interpreter is, in fact, a weak mlatu-6 interpreter).

Normal Order Reduction

To avoid dealing with this issue, we can define normal order reduction, which avoids the infinite paths and guarantees that a normal form is reached if one exists (again, proof needed). To achieve this normal order, reductions cannot happen inside a quotation unless the environment outside that quotation is already inert (in normal form). On top of that, we perform reductions from left to right by convention, so that normal order resembles the equivalent stack machine evaluation. This convention does not alter the end result, so implementers of normal order interpreters should be free to evaluate in any order, provided they follow the first rule (proof needed).

This type of reduction is called leftmost-outermost reduction.
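The rule described above can be sketched in Python as a single-step function: reduce the leftmost redex at the current level, and only descend into quotations once that level is inert. This is an illustrative sketch, not a reference implementation; `step` and `trace` are hypothetical names, and stopping on a repeated state is a convenience added for this example.

```python
ARITY = {"+": 1, "-": 1, "<": 1, ">": 1, ",": 2, "~": 2}

def parse(src):
    """Turn source text into nested lists; each quotation becomes a list."""
    levels = [[]]
    for ch in src:
        if ch == "(":
            levels.append([])
        elif ch == ")":
            quote = levels.pop()
            levels[-1].append(quote)
        elif not ch.isspace():
            levels[-1].append(ch)
    return levels[0]

def show(terms):
    """Render nested lists back into mlatu-6 source text."""
    return "".join("(" + show(t) + ")" if isinstance(t, list) else t for t in terms)

def step(terms):
    """Apply one normal-order reduction; return None if nothing can reduce."""
    for i, term in enumerate(terms):
        n = ARITY.get(term) if isinstance(term, str) else None
        if n is None or i < n:
            continue
        args = terms[i - n:i]
        if not all(isinstance(a, list) for a in args):
            continue
        if term == "+":
            out = [args[0], args[0]]
        elif term == "-":
            out = []
        elif term == ">":
            out = [[args[0]]]
        elif term == "<":
            out = list(args[0])
        elif term == ",":
            out = [args[0] + args[1]]
        else:  # "~"
            out = [args[1], args[0]]
        return terms[:i - n] + out + terms[i + 1:]
    # The outer level is inert, so only now may we reduce inside quotations.
    for i, term in enumerate(terms):
        if isinstance(term, list):
            inner = step(term)
            if inner is not None:
                return terms[:i] + [inner] + terms[i + 1:]
    return None

def trace(src, max_steps=100):
    """List every state visited, stopping at a normal form or a repeated state."""
    terms = parse(src)
    states = [show(terms)]
    for _ in range(max_steps):
        terms = step(terms)
        if terms is None:
            break
        state = show(terms)
        repeated = state in states
        states.append(state)
        if repeated:
            break
    return states

print(trace("((+<)+<)-"))  # ['((+<)+<)-', '']
```

The last line shows why the outermost rule matters: the quotation contains a looping program, but removing it first reaches the empty normal form in a single step.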

A Simple Program

(+<)+<

Consider this simple program. The only operator that appears directly after a quotation is the second +, so that will be our first reduction:

(+<)(+<)<

We copied the quotation. Next, we can reduce the third <:

(+<)+<

We arrived back at the initial state! Since the language is deterministic, there is no point in reducing further: we have reached an infinite loop. A program is finished when it can no longer be reduced, though interpreters will often also stop after a set number of reductions.

Here is the same program, with a more compact display of the evaluation sequence:

(+<)+<
(+<)(+<)<
(+<)+<

The underlined term is the one which is to be evaluated at that step, with the bolded character being the combinator which will be applied to the preceding quotes. This styling also italicizes other valid evaluation paths, as can be seen in:

(+)(+-),<(~)+,-
(++-)<(~)+,-
++-(~)+,-
++-(~)(~),-
++-(~~)-
++-

We could have also taken the italicized path first to arrive at the same final expression, but taking the leftmost valid path is typical.

Counting Program Length

The infinite program from the previous section is the smallest non-halting program at 6 symbols. Any other non-halting programs are at least 7 characters long. You will also notice that loops always seem to resemble this basic loop. Let's talk a bit more about program length:

(A)+

The length of this program is 1. When counting program length, we don't count the inert quotations that tend to precede the program. We can also write down the stack effect of this program: (A) -- (A)(A). This syntax means that the input is on the left and the output is on the right. The "A" here acts like a variable: it could be any sequence of terms, and it is copied from the input to the output (twice, in this case). We don't always have to include the inert quotations:

~

By itself, this program can't be reduced further, but we can trivially see that it would reduce if we preceded it with two quotations, so we say that the program has the stack effect (B)(A) -- (A)(B) (~ is the swap operator, remember?).

In general, if we have a program which performs some stack effect when it's put after some amount of quotations, we can refer to it as its own combinator. They are often given a name, and help when constructing a larger program.
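Working out a stack effect by hand amounts to putting opaque quotations in front of the program and reducing. Here is a small Python sketch of that idea. It is only a heuristic under stated assumptions: it uses weak left-to-right reduction, tries 0, 1, 2, ... opaque inputs, and accepts the first count that leaves no stuck primitive at the top level. The name `stack_effect` is invented for this example.

```python
import string

ARITY = {"+": 1, "-": 1, "<": 1, ">": 1, ",": 2, "~": 2}

def parse(src):
    """Turn source text into nested lists; each quotation becomes a list."""
    levels = [[]]
    for ch in src:
        if ch == "(":
            levels.append([])
        elif ch == ")":
            quote = levels.pop()
            levels[-1].append(quote)
        elif not ch.isspace():
            levels[-1].append(ch)
    return levels[0]

def show(terms):
    """Render nested lists back into mlatu-6 source text."""
    return "".join("(" + show(t) + ")" if isinstance(t, list) else t for t in terms)

def reduce_weak(terms, max_steps=10_000):
    """Weak left-to-right reduction; stuck primitives stay in the output."""
    done, todo, steps = [], list(reversed(terms)), 0
    while todo:
        term = todo.pop()
        n = ARITY.get(term) if isinstance(term, str) else None
        if n is None or len(done) < n or not all(
                isinstance(t, list) for t in done[-n:]):
            done.append(term)
            continue
        steps += 1
        if steps > max_steps:
            raise RuntimeError("step limit reached")
        if term == "+":
            done.append(done[-1])
        elif term == "-":
            done.pop()
        elif term == ">":
            done.append([done.pop()])
        elif term == "<":
            todo.extend(reversed(done.pop()))
        else:
            top, below = done.pop(), done.pop()
            done.extend([below + top] if term == "," else [top, below])
    return done

def stack_effect(program, max_inputs=6):
    """Find the fewest opaque inputs that let every primitive fire."""
    for count in range(max_inputs + 1):
        letters = string.ascii_uppercase[:count]
        # (A) is the top of the stack, so it is written last.
        before = "".join(f"({c})" for c in reversed(letters))
        after = reduce_weak(parse(before + program))
        if not any(isinstance(t, str) and t in ARITY for t in after):
            return f"{before} -- {show(after)}".strip()
    return None  # no effect found within max_inputs

print(stack_effect("+"))    # (A) -- (A)(A)
print(stack_effect("~"))    # (B)(A) -- (A)(B)
print(stack_effect("~-<"))  # (B)(A) -- A
```

The heuristic gives up (returns `None`) on programs such as `<+`, where the unwrapped opaque contents block the following primitive no matter how many inputs are supplied.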

Lean Proofs

brightly-salty converted mlatu-6 to Lean, which allows us to write compact, but verified proofs about the language. To run these proofs, all you need is a Lean environment (the online playground works just fine) and to copy in the main file from the GitHub repository. Then at the bottom, you just paste in the proof you want to verify.

The whole definition is only ~300 lines of code, but that's still a bit much to showcase on this wiki.

Base Lengths

Now that we have a method of counting program length, we can take a concatenative combinator, figure out the smallest program with the same stack effect as the combinator, and consider the length of that program to be the length of the combinator. For example, k, which has the stack effect (B)(A) -- A, has a length of 3, since the smallest program with this stack effect is ~-<.

The combinator cake with stack effect (B)(A) -- ((B)A)(A(B)) seems to have a minimal length of 12 thanks to the program >~>>,+<~,~<, (including the trailing comma). In concatenative calculus, {k, cake} forms a complete base, just like our 6 primitive combinators form one (complete means any combinator can be composed from either of these bases). We can add up the lengths of k and cake and arrive at a "base length" of 15. The mlatu-6 combinator base trivially has a base length of 6. A base length of 5 is possible because of the ~ equivalence defined above.

Now let's do something clever. Let's invent three new combinators z, hop, baba as follows:

 def z/2 = -< .
 assert (b) (a) z = b.
 
 def hop/2 = ~> .
 assert (b) (a) hop = (a) ((b)).
 
 def baba/2 = ,+ .
 assert (b) (a) baba = (b a) (b a).
 
 -- Here's why these three new combinators are really cool:
 def i/1 = () z.
 assert/1 i = <.
 
 def swap/2 = hop i.
 assert/2 swap = ~.
 
 def zap/1 = () swap z.
 assert/1 zap = -.
 
 def unit/1 = () hop hop zap.
 assert/1 unit = >.
 
 def dup/1 = () baba.
 assert/1 dup = +.
 
 def cat/2 = baba zap.
 assert/2 cat = ,.

It takes a second to get used to the Lean syntax, but it should be clear what the code above does: we can still get all 6 combinators from the 3! This means that {z, hop, baba} is a complete base, still with length 6! Isn't that cool? It seems like this is actually the ONLY such 3-combinator complete proper base where each combinator has length 2. A proof has not yet been found, however.
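For readers without a Lean environment, the same reconstruction can be spot-checked with a few lines of Python. This sketch is not a proof: it merely compares each derived program with the primitive it should replace on one opaque stack, using weak left-to-right reduction. The helper names and the `DERIVED` table are assumptions of this example, obtained by expanding the definitions above into plain mlatu-6 text.

```python
ARITY = {"+": 1, "-": 1, "<": 1, ">": 1, ",": 2, "~": 2}

def parse(src):
    """Turn source text into nested lists; each quotation becomes a list."""
    levels = [[]]
    for ch in src:
        if ch == "(":
            levels.append([])
        elif ch == ")":
            quote = levels.pop()
            levels[-1].append(quote)
        elif not ch.isspace():
            levels[-1].append(ch)
    return levels[0]

def show(terms):
    """Render nested lists back into mlatu-6 source text."""
    return "".join("(" + show(t) + ")" if isinstance(t, list) else t for t in terms)

def run(src):
    """Weak left-to-right reduction; uppercase letters are opaque."""
    done, todo = [], list(reversed(parse(src)))
    while todo:
        term = todo.pop()
        n = ARITY.get(term) if isinstance(term, str) else None
        if n is None or len(done) < n or not all(
                isinstance(t, list) for t in done[-n:]):
            done.append(term)
        elif term == "+":
            done.append(done[-1])
        elif term == "-":
            done.pop()
        elif term == ">":
            done.append([done.pop()])
        elif term == "<":
            todo.extend(reversed(done.pop()))
        else:
            top, below = done.pop(), done.pop()
            done.extend([below + top] if term == "," else [top, below])
    return show(done)

# The three base combinators, each of length 2.
z, hop, baba = "-<", "~>", ",+"

# The definitions from the listing above, expanded into mlatu-6 text.
i = "()" + z
swap = hop + i
zap = "()" + swap + z
unit = "()" + hop + hop + zap
dup = "()" + baba
cat = baba + zap

DERIVED = {"<": i, "~": swap, "-": zap, ">": unit, "+": dup, ",": cat}

STACK = "(C)(B)(A)"
for primitive, program in DERIVED.items():
    same = run(STACK + program) == run(STACK + primitive)
    print(primitive, program, "ok" if same else "MISMATCH")
```

Agreement on a single opaque stack is of course weaker than the verified statements in the Lean development, but it catches transcription mistakes quickly.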

Can we do a 2-combinator base of length 6? We sure can! User:PkmnQ came up with this marvel:

 def z/2 = -< .
 assert (b) (a) z = b.
 
 def twee/2 = ~,>+ .
 assert (b)(a) twee = ((a b)) ((a b)).
 
 -- Reconstruct the 6 primitives:
 def i/1  = () z.
 assert/1 i = <.
 
 def unit/1 = () twee (()) twee z i z.
 assert/1 unit = >.
 
 def zap/1 = unit (()) twee z i z.
 assert/1 zap = -.
 
 def swap/2 = unit (unit) twee (()) twee z i z i i unit twee (()) twee z i z i i.
 assert/2 swap = ~.
 
 def dup/1 = () twee i swap i.
 assert/1 dup = +.
 
 def cat/2 = swap twee (()) twee z i z i.
 assert/2 cat = ,.

Meanwhile Moja came up with this one:

 def z/2 = -< .
 assert (b) (a) z = b.
 
 def cwud/3 = ,~>+ .
 assert (c)(b)(a) cwud = (b a)((c))((c)).
 
 -- Reconstruct the 6 primitives:
 def i/1  = () z.
 assert/1 i = <.
 
 def swap/2 = () cwud z.
 assert/2 swap = ~.
 
 def dup/1 = () () cwud cwud z z.
 assert/1 dup = +.
 
 def zap/1 = () swap z.
 assert/1 zap = -.
 
 def unit/1 = () () cwud zap swap zap.
 assert/1 unit = >.
 
 def cat/2 = (() swap) swap unit cwud z swap i cwud z zap.
 assert/2 cat = ,.

Here's the famous {k, cake} base mentioned earlier:

 def k/2 = ~-< .
 assert (b)(a) k = a.
 
 def cake/2 = >~>>,+<~,~<,.
 assert (b)(a) cake = ((b)a)(a(b)).
 
 -- Reconstruct the 6 primitives:
 def zap/1 = () k.
 assert/1 zap = -.
 
 def unit/1 = () cake zap.
 assert/1 unit = >.
 
 def swap/2 = unit cake k.
 assert/2 swap = ~.
 
 def i/1  = () swap k.
 assert/1 i = <.
 
 def dup/1 = () cake i swap i.
 assert/1 dup = +.
 
 def cat/2 = ((i) cake k i) cake () k cake () k.
 assert/2 cat = ,.

Can we go even further? Sure! We will construct a combinator that is able to construct z and one other combinator we'll call unknown.

double = ~>~>,~,< = (C)(B)(A) -- (B)(A)C
base = (a)(---b) double
z = -<

(base) base = ((a)(---b)double)(a)(---b)double
           = (a)(---b)(a)(---b)double
           = (a)(a)(---b)---b
           = b

if we set b = z, then
(base) base = z
(z) base    = (a)(---z)z
            = a

we pick unknown for a,
base = (unknown)(---z) double
     = (unknown)(----<)~>~>,~,<
     = ((unknown)(----<))~,<

(base) base = z
(z) base = unknown

So for twee and cwud, we get ((~,>+)(----<))~,< and ((,~>+)(----<))~,< respectively.

From the base combinator we can construct the 2-combinator base, and then from there the 6-combinator base.

When Moja discovered this base, she originally had base = (a)((((b)k)k)k) double. But Zhil discovered that this was equivalent to base = (a)(---b) double, which is much simpler.

Dense Gödel Encodings

A Gödel numbering assigns a unique number to each program in a language. A dense Gödel numbering makes sure that each number has a program assigned to it, so there are no numbers that don't correspond to a program.

Here is a base length 12 Dense Binary Gödel Encoding by User:Dadsdy:

 ((<-)(,+>));<
 
 X = ((<-)(,+>))
 X<X<< = (<-)(,)((,))
 (<-)(,)((,))<< = (<-,)
 (<-,)< = <-,
 X<-, = (<-),
 X(<-), = ((<-),)
 ((<-),)<-, = -,
 X-, = ,
 (<-,)(<-), = (<-,<-)
 (<-,<-)< = <-,<-
 XX<-,<- = -
 X<- = (<-)
 X X<< (<-),, = (-)
 X(-),(-), = ()
 ()()X<,< = ,+>
 (),+> = +>
 +> < = +
 ()+> , = (())
 (())(())X<,,, = (,+>)
 (())X(-)(-)(-), ,+> ,,, = ((---))
 +> +> ((---)) ,+> ,<,<- = >
 (,+>)(<-), = (,)
 (())(,+>), = (+>)
 (+>)(,),(<-), = (<)
 (+>)(<), = (+)
 (+>)(+>),((---))>,(,+>),(,),(<),(,),(<-), = (>)

Strictly Proper Universal Base

Strictly Proper Universal Bases, or SPUBs, are bases from which you can construct all mlatu-6 primitives without using parentheses. Each element of the base has to be a proper combinator with a stack effect. SPUBs are usually not Turing complete without parentheses, though: they only construct the base combinators, not their quoted forms.

Here is a base length 11 SPUB by User:Dadsdy:

 x = >~>>, +<<,   = (B)(A) -- ((A)((B)))(AB)
 z = -<           = (B)(A) -- B

 hop = x z
 k = hop z
 nest2 = hop hop
 deadcat = x k
 ~ = nest2 deadcat
 > = ~ hop
 < = hop deadcat
 - = nest2 z
 , = ~ x hop -
 + = ~ nest2 x deadcat k <
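The constructions above can be expanded mechanically into strings over x and z and compared against the primitives. The Python sketch below does this under a few assumptions: weak left-to-right reduction, a test stack of three opaque quotations (some of the derived programs need an extra quotation underneath to work with), and hypothetical helper names such as `expand`.

```python
ARITY = {"+": 1, "-": 1, "<": 1, ">": 1, ",": 2, "~": 2}

def parse(src):
    """Turn source text into nested lists; each quotation becomes a list."""
    levels = [[]]
    for ch in src:
        if ch == "(":
            levels.append([])
        elif ch == ")":
            quote = levels.pop()
            levels[-1].append(quote)
        elif not ch.isspace():
            levels[-1].append(ch)
    return levels[0]

def show(terms):
    """Render nested lists back into mlatu-6 source text."""
    return "".join("(" + show(t) + ")" if isinstance(t, list) else t for t in terms)

def run(src):
    """Weak left-to-right reduction; uppercase letters are opaque."""
    done, todo = [], list(reversed(parse(src)))
    while todo:
        term = todo.pop()
        n = ARITY.get(term) if isinstance(term, str) else None
        if n is None or len(done) < n or not all(
                isinstance(t, list) for t in done[-n:]):
            done.append(term)
        elif term == "+":
            done.append(done[-1])
        elif term == "-":
            done.pop()
        elif term == ">":
            done.append([done.pop()])
        elif term == "<":
            todo.extend(reversed(done.pop()))
        else:
            top, below = done.pop(), done.pop()
            done.extend([below + top] if term == "," else [top, below])
    return show(done)

# The two base elements, written with real primitives and no parentheses.
BASE = {"x": ">~>>,+<<,", "z": "-<"}

# The constructions from the listing above, as words over earlier names.
DEFS = {
    "hop": "x z", "k": "hop z", "nest2": "hop hop", "deadcat": "x k",
    "~": "nest2 deadcat", ">": "~ hop", "<": "hop deadcat",
    "-": "nest2 z", ",": "~ x hop -", "+": "~ nest2 x deadcat k <",
}

def expand(name):
    """Rewrite a name into a program made only of the base elements."""
    if name in BASE:
        return BASE[name]
    return "".join(expand(word) for word in DEFS[name].split())

STACK = "(C)(B)(A)"
for primitive in "~><-,+":
    same = run(STACK + expand(primitive)) == run(STACK + primitive)
    print(primitive, "ok" if same else "MISMATCH")
```

Inside `DEFS`, symbols such as `~` and `-` refer to the derived versions, so every expansion bottoms out in x and z only.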

Continuation-composing Universal Base

These are bases where juxtaposition is not just composition, but continuation composition, because they don't require parentheses to be complete. What this means is that a program made from these base elements can be cut in two at any point, and you get two valid programs. SPUBs also have this property, but can only construct the primitives that way, whereas these bases can recreate any mlatu-6 program. These bases are therefore also densely Gödel numbered, in a number base corresponding to the number of combinators in the base.

To create such a base, we start from a number of (improper) combinators and then construct all mlatu-6 primitives, as well as their simply quoted forms.

Here is such a base by User:Dadsdy:

 {(-);(<+);(,>);<}
 
 (-)< = -
 (<+)< = <+
 (,>)< = ,>
 ,> < = ,
 (-)(-),>(-), = ()
 () ,> = >
 > <+ = +
 (<+)(-), = (<)
 ()> (,>), = (>)
 (>)(<+), = (+)
 (,>)(<), = (,)

Here's another one by User:Dadsdy:

 (,>+>);(<<-);<
 (,>+>)< = ,>+>
 (<<-)< = <<-
 ,>+> <<- = ,>
 ,> < = ,
 (,>+>)(<-), = (,>)
 (<<-)(<<-) ,> (<<-)(<<-) ,> , (,>) ,> (<<-) , = (())
 (())<<- = -
 (()) < = ()
 () ,>+> ,<< = +
 () ,> = >
 
 (())> (<<-), = (-)
 ()>(,>), = (>)
 (,>+>)(,>),(<<-), = (,)
 ()>>(,>),(<<-), = (<)
 ()>(,>+>),(,),(<),(<), = (+)

Simplifying Reductions

Some constructs can always be replaced with others. For example, you can always replace +~ with + because the top two items are the same after a copy. Here's a list of reductions you can do (empty right hand side means you can just remove them altogether):

+- ==
>< ==
~~ ==
(), ==
+~ == +
>- == -
-- == ,-
~-- == --
~,- == ,-
+>~ == >+<
+>~> == >+
>+<> == >+
>,<< == ,<

Busy Beavers

A busy beaver is the halting program that runs for the most reductions, or produces the largest state before halting, among all programs of the same length. For mlatu-6, we study two kinds of busy beavers: reduction busy beavers count the number of reduction steps under normal order reduction, while size busy beavers count the output size after reduction.

Reduction Busy Beavers - Champions

Size	Reductions	Program
3	1	`()+`
4	2	`()+<`
5	3	`()+>~`
6	5	`(+)+<<`
7	7	`(+)+<<<`
8	12	`(+,+)+<<`
9	37	`(+,+)+<<<`
10	>=6182	`(+,+)+<<<<`
11	>3*2^2059	`(+,+)+<<<<<`

Size Busy Beavers - Champions

Size	Final size	Program
2	2	`()`
3	4	`()+`
4	8	`()>+`
5	12	`()>>+`
6	18	`()>>++`
7	24	`()>>+++`
8	66	`(+,+)+<<`
9	>=18416	`(+,+)+<<<`
10	>2^2060	`(+,+)+<<<<`
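The smaller entries in both tables can be reproduced with a short Python sketch that counts normal-order reduction steps and measures the size of the final state in characters. This is an illustration under assumptions: `step` implements the leftmost-outermost rule as described earlier in the article, and `measure` is a name made up for this example. The large champions are far out of reach of such a naive reducer.

```python
ARITY = {"+": 1, "-": 1, "<": 1, ">": 1, ",": 2, "~": 2}

def parse(src):
    """Turn source text into nested lists; each quotation becomes a list."""
    levels = [[]]
    for ch in src:
        if ch == "(":
            levels.append([])
        elif ch == ")":
            quote = levels.pop()
            levels[-1].append(quote)
        elif not ch.isspace():
            levels[-1].append(ch)
    return levels[0]

def show(terms):
    """Render nested lists back into mlatu-6 source text."""
    return "".join("(" + show(t) + ")" if isinstance(t, list) else t for t in terms)

def step(terms):
    """Apply one normal-order reduction; return None if nothing can reduce."""
    for i, term in enumerate(terms):
        n = ARITY.get(term) if isinstance(term, str) else None
        if n is None or i < n:
            continue
        args = terms[i - n:i]
        if not all(isinstance(a, list) for a in args):
            continue
        if term == "+":
            out = [args[0], args[0]]
        elif term == "-":
            out = []
        elif term == ">":
            out = [[args[0]]]
        elif term == "<":
            out = list(args[0])
        elif term == ",":
            out = [args[0] + args[1]]
        else:  # "~"
            out = [args[1], args[0]]
        return terms[:i - n] + out + terms[i + 1:]
    # Only reduce inside quotations once the outer level is inert.
    for i, term in enumerate(terms):
        if isinstance(term, list):
            inner = step(term)
            if inner is not None:
                return terms[:i] + [inner] + terms[i + 1:]
    return None

def measure(src, max_steps=100_000):
    """Return (number of reductions, size of the normal form in characters)."""
    terms = parse(src)
    for steps in range(max_steps + 1):
        reduced = step(terms)
        if reduced is None:
            return steps, len(show(terms))
        terms = reduced
    raise RuntimeError("no normal form within the step limit")

for program in ["()+", "()+<", "()+>~", "(+)+<<", "(+)+<<<", "(+,+)+<<"]:
    print(program, measure(program))
```

The first number of each pair corresponds to the "Reductions" column and the second to the "Final size" column, so `(+,+)+<<` appears in both tables.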

Kerby Combinators

Brent Kerby's page on concatenative calculus contains a list of combinators. Here are these combinators and the smallest programs we've found for them. There might be mistakes in this list and the programs listed might not yet have been confirmed to be minimal. We also included versions with and without parentheses for the longer ones.

 zap == (A) --
   1: -
 
 i == (A) -- A
   1: <
 
 unit == (A) -- ((A))
   1: >
 
 dup == (A) -- (A)(A)
   1: +
 
 cat == (B)(A) -- (BA)
   1: ,
 
 swap == (B)(A) -- (A)(B)
   1: ~
 
 m == (A) -- (A)A
   2: +<
 
 z == (B)(A) -- B
   2: -<
 
 t == (B)(A) -- (A)B
   2: ~<
 
 nip == (B)(A) -- (A)
   2: ~-
 
 swat == (B)(A) -- (AB)
   2: ~,
 
 tack == (B)(A) -- (B(A))
   2: >,
 
 sap == (B)(A) -- AB
   3: ~,<
 
 k == (B)(A) -- A
   3: ~-<
 
 rep == (A) -- AA
   3: +,<
 
 take == (B)(A) -- (A(B))
   3: ~>,
 
 run == (A) -- A(A)
   4: +>,<
 
 dip == (B)(A) -- A(B)
   4: ~>,<
 
 cons == (B)(A) -- ((B)A)
   4: ~>~,
 
 w == (B)(A) -- (B)(B)A
   6: (+)~,<
   7: ~>+,~,<
 
 c == (C)(B)(A) -- (B)(C)A
   6: (~)~,<
   11: >~>~,~>,<~<
 
 poke == (C)(B)(A) -- (A)(B)
   7: >~>,~-<
 
 peek == (B)(A) -- (B)(A)(B)
   8: >(+)~,<~
   9: >~>+,~,<~
 
 dig == (C)(B)(A) -- (B)(A)(C)
   8: (~)~>,<~
   9: >~>~,~>,<
 
 bury == (C)(B)(A) -- (A)(C)(B)
   8: ~(~)~>,<
   9: >~>,~>,<~
 
 flip == (C)(B)(A) -- (A)(B)(C)
   8: >~>,~>,<
   9: ~(~)~>,<~
 
 b == (C)(B)(A) -- ((C)B)A
   9: (~>~,)~,<
   13: >~>,~>,<>~,~<
 
 sip == (B)(A) -- (B)A(B)
   11: >(+)~,<~>,<
   12: >~>+,~,<~>,<
 
 cake == (B)(A) -- ((B)A)(A(B))
   12: >~>>,+<~,~<,   (by Garklein and Moja)
   19: >~>,+(>~,),~(>,),,< (by Forth Truffle)
 
 s == (C)(B)(A) -- ((C)B)(C)A
   15: ((+>)~>,<,~)~,< (by Moja and Forth Truffle)
   23: >~>,~>,<>>+,~>,<,>~,~,< (by Moja)
 
 s' == (D)(C)(B)(A) -- ((D)C)A(D)B
   23: (~,)~,>(~>+)~,(~,<),~,< (by PkmnQ)
   30: ~>,>~>,~>>+,~>,<~>,<~,>~,~,<~< (by sakito system)
 
 j == (D)(C)(B)(A) -- ((C)(D)A)(B)A
   27: +>(>~>,)~,(~>~,)~(,),>,<~,< (by Garklein and Zhil)
   29: >+>~,~>,<>~,>~,~>>,~>>,<,~,~< (by PkmnQ)
 
 j' == (E)(D)(C)(B)(A) -- ((D)A(E)B)(C)B
   35: ~(~(~>~,~>,)~>,<)~>,<+(~(,)~>,<)~,< (by Garklein)

Open Challenges

Finding some of the shortest programs with an element of chaos, i.e. programs that run for a while and don't just produce very simple, repetitive output.

External Resources
