Sunday, July 23, 2017

For Loops are Functional

It has been known for quite some time within the functional programming community, that many programming constructs widely considered to be "imperative" in nature (e.g. goto statements, for loops, while loops, etc.) can be expressed in terms of function calls without side-effects.

I was explaining to someone the other day how programmers can write code that looks imperative, for example using a Python syntax, and that it can be transformed by a compiler into pure functional code.

In this article I will start from a simple Python example and demonstrate a series of simple transformations that could be done by a compiler to convert a traditional for loop into a pure function call.

These transformations do not remove all side-effects, only assignment to variables within the current stack frame.

Summing Numbers from the Fibonnaci Sequence

The working example sums five numbers in the Fibonnaci sequence using a for loop.

    acc = 0
    for x in [1, 2, 3, 5, 8]:
        acc += x
    print acc

Rewriting as a For Loop with an Index

In the first transformation we rewrite the for loop to use an explicit integer index, and we assign the array to a named variable. This makes it look more like a C-style for loop. It should also make it more obvious how the transformations can be generalized to arbitrary variable assignment on the stack.

    acc = 0
    xs = [1, 2, 3, 5, 8]
    n = len(xs)
    for i in range(n):
        acc += xs[i]
    print acc

Convert the For Loop into a While Loop

The for loop can be understood as syntactic sugar (i.e. a convenient syntax) for a while loop. So the first transformation is into a while loop.

    acc = 0
    xs = [1, 2, 3, 5, 8]
    n = len(xs)
    i = 0
    while i < n:
        acc += xs[i]
        i++
    print acc

Stack Frame to Tuple

All of the variables in the stack frame referenced from within the loop are next merged into a single tuple. This is the key observation in making local variable updates into something that can be expressed purely in terms of functional calls.

    f = { 5, [1,2,3,5,8], 0, 0 }
    while (f.i < f.n) 
        f = { f[0], f[1], f[2] + f[2][f[3]], f[3]+1 } 
    print acc;

Replace While loop with a While Function

Now presuming the presence of a while_fxn we can convert the while statement into a single function call.

    f = while_fxn({ 5, [1,2,3,5,8], 0, 0 }, 
        lambda f: { f[0], f[1], f[2] + f[2][f[3]], f[3]+1 },
        lambda f: f.i < f.n)
    print f[2];

Here is one possible implementation of the while function using the original while statement, which we can imagine being implemented by the compiler.

   def while_fxn(x, bodyFxn, terminationFxn):
       while (!termination(x)):
            x = body(x)
       return x

A Tail-Recursive While Function

It is also interesting to note that the compiler can omit support for a while function primitive construct if we implement it as follows:

   def while_fxn(x, bodyFxn, conditionFxn):
       return !condition(x) ? x : body(x)

A problem with this approach if literally implementing it, is that the compiler/interpreter needs to support tail-call optimization (TCO) to prevent stack-overflow when executed more than a few times.

Summary

The point of this article was to show how code with what looked like imperative code can be converted into pure functionally called through a series of transformations which could be performed by a compiler or interpreter. The key observation is that the reference variables of the stack frame can be modeled as a "tuple".

This technique can be useful when converting from imperative code to data-flow code, which is something I'll discuss in a later blog post.

Postscript: Using Itertools.Accumulate

A Python expert could argue that the example could have been rewritten using the itertools.accumulate function.

    import itertools as it;
    print it.accumulate([1,2,3,5,8])

This is true for a programmer, since they can easily identify that the loop in this case is effectively only varying a single value using the binary operation of addition. However, IMO this is not easily expressed as a single transform for a compiler, and does not generalize to arbitrary sets of variable assignments. I wanted to show how the compiler can progressively rewrite imperative code until it gets to something that can be expressed in terms of function calls.

Consider a more complex example like Ridged multi-fractal noise, for which the same techniques shown in the article can be applied.

References

A paper widely referenced on the correspondance between imperative structures and functional programming is "Lambda the Ultimate Imperative" by Guy Steele and Gerald Sussman 1976. You can find that article here along with other papers related on the subject.The Wikipedia article on tail recursion also currently provides a good overview of the subject.