Talk:Binary lambda calculus

From Esolang
Jump to navigation Jump to search

HTML MATH markup not working?

This question probably covers more than just this article, but it seems that the tags are not working. I tried some of the math expressions from this page in both the sandbox for this wiki and the wikipedia sandbox, and only wikipedia displayed correctly.

BTW: until this problem is fixed, the way to read this article is to edit the page, copy all the text, paste it into the wikipedia sandbox, and then preview.

--Nthern 16:38, 8 September 2011 (UTC)

You can also type TeX mathematics into PNG rendering using program I wrote: http://zzo38computer.cjb.net/texify/texify.htm I suggest to make cache so that if you type in a TeX code, it will render and store the rendered result in the wiki. The source code of my program is available you can copy it if you want to do so. --Zzo38 22:03, 8 September 2011 (UTC)

Tentative Explanation

Binary lambda calculus (BLC) is an extremely small Turing-complete language which can be represented as a series of bits or bytes. Unlike Binary combinatory logic, another binary language with a similar acronym, it is capable of input and output.

== De Bruijn Index ==

I suggest knowing what a De Bruijn index is before trying to understand this language. In case you don't want to do that, here's an explanation. The explanation also features lambda calculus code which has more parentheses than normal in order to better map onto BLC.

Lambda calculus can use names to denote the term corresponding to each lambda, like the following example.

λxλy.y

This function takes in two inputs x and y and outputs the final one it took in. In this, you know that the term y is referring to the second lambda, the one with y after it. But the term names are arbitrary, which is even more annoying when all you have is binary.

A De Bruijn index can be used to replace the term in lambda calculus without naming the term. The index is a number which counts from the final lambda in the scope until it finds the lambda which the term is referencing.

In BLC, this number starts from 1. So the De Bruijn index in BLC basically indicates how many lambdas the term has had to encounter before finding the lambda it's actually referencing, with 1 referring to the last lambda, 2 to the second-last, 3 to the third-last, etc.

The following code is equivalent to the code above, but using a De Bruijn index.

λλ.1

Note that this is in the scope. It is possible for there to be lambdas that are in a different scope, meaning that the lambdas.

Here are some examples of lambda calculus with named terms and lambda calculus with De Bruijn indices. I have deliberately put spaces between the numbers so it's easier to understand what's going on.

Command Description Notes
λxλy.(yx) λλ.(2 1) outputs in opposite order
λxλyλz.((xz)(yz)) λλλ.((3 1)(2 1)) S combinator
λx.(xλy.(yy)) λ.(1λ.(1 1)) takes in one input, outputs that input and this other lambda. Note the lambda doesn't need parentheses before it.
λxλy.((x λz.(zz))y) λλ.((2 λ.(1 1)) 1) Note that 1 replaces y and 2 replaces x. They can't be arguments of the inner lambda, so they treat it like it doesn't exist.
== BLC Syntax ==

The BLC program is a sequence of bits read left to right. The following commands are defined. Feel free to change how the commands are explained if you think it's too confusing.

  • 00x = Lambda function with body x
  • x0, where x is one or more "1" bits = the number of "1" bits serves as the De Bruijn index
    • For example 10 corresponds to a De Bruijn index of 1, 110 to 2, 1110 to 3, etc.
    • To be clear, this means 10 refers to the final lambda, 110 to the second-last, 1110 to the third-last, etc.
  • 01xy, where x and y are more code = x is applied to y.
    • By default, the output of a lambda is one term in this language. Using 01 allows the function to output both x and y, which is the same as applying x to y in lambda calculus because both of the outputs are also lambdas.
    • If you want to take in one input and output it twice, you would write 00011010 = 00 01 10 10.
    • If you want to take in two inputs and output the first one three times, you would write 00000101110110110 = 00 00 01 01 110 110 110.
    • If you want to output code directly starting with 00, it doesn't need to have 01 directly before it. If there is a term before the function, then 01 is needed before the term.

It is possible for the code to have padding at the right end, i.e. code which doesn't affect the result of the command. This fact is especially useful when trying to use bytes to represent this language.

A program is a lambda calculus term that transforms an input to an output. Standard input is represented as a list of boolean values, and standard output has the same format.

A set bit in BLC is 0000110 (True), and an unset bit is 000010 (False), which are the normal lambda calculus representations of these values.

0000110 = 00 00 110
*taking two inputs (with the two lambda functions represented by 00), return the second argument from the inside i.e. the first argument

000010 = 00 00 10 
*taking two inputs (with the two lambda functions represented by 00), return the innermost / last argument i.e. the second argument

The empty list, called nil, is 000010 (False).

A list with multiple elements is represented by the pairing or cons function 00010110xy, where x is the head of the list and y is the tail.

00010110xy = 00 01 01 10 x y
*taking one input, output that input, the head of the list, and the tail of the list

You might expect programs to consist of multiple bytes, considering all of these have been six bits or over. However, printing out input in this language is done through the code 0010, which takes one input (which is the innermost by default) and prints it out. Because padding is ignored and lambdas only output one term by default in this language, a program consisting of just a cat can be represented by any bytes between 32 (00100000) and 47 (00101111) because everything after 0010 is ignored (remember that you have to type 00011010 to output the input twice).

== SKI combinator calculus ==

The encoding of lambda term S is λxλyλz.((xz)(yz)), which is written as λλλ.((3 1)(2 1)) using De Bruijn indexes instead of names, and as 00 00 00 01 01 1110 10 01 110 10 in BLC.

The K combinator is written as λxλy.x or λλ.2 in a corresponding format, so it would be 00 00 110 in BLC.

The identity function I is the same as the cat: 00 10.

Therefore, you can implement SKI combinator calculus in BLC.

== BLC8 ==

BLC operates on a stream of bits (values of 0 and 1), while BLC8 is the same, but operates on a stream of bytes (values from 0 - 255) with the most significant bit in the smallest value (big-endian). In the following programs, BLC and BLC8 are distinguished by:

  1. the relevant name in parentheses
  2. BLC being represented in bits while BLC8 is represented in bytes
== Programs ==

self-interpreter (BLC)

  01010001
   10100000
    00010101
     10000000
      00011110
       00010111
        11100111
         10000101
          11001111
          000000111
         10000101101
        1011100111110
       000111110000101
      11101001 11010010
     11001110   00011011
    00001011     11100001
   11110000       11100110
  11110111         11001111
 01110110           00011001
00011010             00011010

prime number sieve (BLC)

000100011001100101000110100
 000000101100000100100010101
 11110111          101001000
 11010000          111001101
 000000000010110111001110011
 11111011110000000011111001
 10111000
 00010110
0000110110

Brainfuck interpreter (BLC8)

0000000          01a15144 02d55584               223070b7        00f032ff
0000020          7f85f9bf        956fe15e        c0ee7d7f 006854e5
0000040          fbfd5558        fd5745e0        b6f0fbeb 07d62ff0
0000060          d7736fe1 c0bc14f1               1f2eff0b        17666fa1
0000100          2fef5be8        ff13ffcf        2034cae1 0bd0c80a
0000120          e51fee99        6a5a7fff        ff0fff1f d0049d87
0000140          db0500ab 3bb74023               b0c0cc28 10740e6c
0000160

Universal Turing Machine (BLC)

0101000110100000000101011000000000011110000101111110011110 
0001011100111100000011110000101101101110011111000011111000 
0101111010011101001011001110000110110000101111100001111100 
0011100110111101111100111101110110000110010001101000011010

There is also a universal Turing machine written in BLC8 which is slightly longer.

== Computational Class ==

Lambda calculus is Turing-complete, and because binary lambda calculus is a version of lambda calculus, it is also Turing-complete. It can also implement SKI combinator calculus as seen above, so it is Turing-complete in that way. As a final indication of Turing-completeness, Brainfuck and a Universal Turing Machine have both been represented in this language.

== External Resources ==

Merely an encoding

I'd like to massively rephrase this article before adding an infobox. Specifically, I want to shift from saying that BLC is its own language to saying that BLC is an encoding of wikipedia:untyped lambda calculus. This is more in-line with how it's used by folks like myself ([1]) or John Tromp ([2]), as a measurement device for gauging the approximate size of lambda terms. It also would deflate some of the currently-wordy discussion around complexity classes. Thoughts? Corbin (talk) 20:44, 16 September 2024 (UTC)

Not just program text encoding but also IO convention (I don't mean that the fact is so important for distinguishing "true languages" and "just encodings", just remind of the fact). Blashyrkh (talk) 19:39, 16 January 2025 (UTC)
In a sense, almost all programming languages are encodings of some computational model – that's the primary difference between a computational model and a language. However, BLC is kind-of special in that the model is more commonly used than the language, rather than the other way round, so it makes sense to emphasise the model over the language in the article. --ais523 21:04, 16 January 2025 (UTC)