blog

Nice Code Bro, Too Bad It’s Just a Wrapper Over Math: Solving Leetcode in Lambda Calculus

Background and motivation

2 semesters ago I took Logic in Computer Science under Prof. Jagat at BITS, and it was probably the best CS course I have taken so far. I really enjoyed the formal theory side of computer science and the approach the course took to build everything up from first principles. So when I came across Lambda Calculus which provided a nice mix of formalism and intuition, while also making the crazy ambitious claim: It is Turing Complete, so anything that can be done in ANY programming langugae can be done through the bare bones on functional mathematics, I knew the perfect way to spend the next 3 weeks: preparing for SI szn in the most inefficient way possible, by solving LeetCode in lambda calc.

Only a week in into the project did i realise that I was basically just writing glorified Haskell, but oh well :)

Like any other field of formal logic, the formal study of lambda calculus is littered, and for good reason, with involved formal definitions. These are very important, but also difficult to digest. This blog post is not an academic resource, hence, I opt for an intuitive natural language explanation. To maintain disambiguity and completeness I will link all the involved discussions, and definitions when I can. Forgive me for this math sacrilige.

The equivalent Python code is available here.

Introduction and definitions

\[\lambda x.x\]

here $\lambda$ is an operator that denotes a function in the language, the first $x$ denotes the input parameter and the second $x$ denotes the output as a ‘function’ of the input parameter. For example, a simple function like $f(x)=x+1 $ is written as $\lambda x.x+1$ or the polynomial $f(x)=x^2+x+1$ is $\lambda x.x^2+x+1$ which is to say that that the $\lambda$ operator allows you to abstract over $x$. To evalute the function:

\[\displaylines{ (\lambda x. x^2 + x + 1) \, 2 \\ \implies 2^2 + 2 +1 \\ \implies 7}\]

Notice that we substitute $2$ for all occurances of $x$ in the output. More formally:

\[(\lambda x.M)N \implies M[x := N]\]

There is no restriction on the type of $N$ because types dont exist in (untypes) lambda calculus. It couldve been a string, an int, a float, bool, etc. however, lambda calculus does not recognise anything other than a function, so in that sense $N$ is also a lambda. We will see soon how to encode things like integers into lambda functions. For now, take lite.

$M[x := N]$ denotes the substituion of $N$ for the bound occurances of $x$ in $M$. If you are familiar with the definition of free and bound variables in context of first order logic, then the same intuitive idea applies here: an occurrence of $x$ in $φ$ is free in $φ$ if and only if it is a leaf node in the parse tree of $φ$ such that there is no path upwards from that node $x$ to a node $∀x$ or $∃x$. But of course quanitifers do not exist in $\lambda$ calculus but the $\lambda$ operator is analogous to the logical quantifiers in that it binds the variable appearing right after it (the input parameter) in term $M$ (the output), which is what the quantifiers also do (hence the ‘no path upwards…’ clause in the definition). Dont worry if it doesn’t make sense, in my experience relying on a intuitive understanding of substitution works fine in most cases.

This is a good resource on the topic: Uni Leipzig

To maintain disambiguity in our work we will maintain a “hygeine condition”, i.e., we are only allowed to substitute into a lambda if the replacement term ($N$) does not contain the bound variable ($x$), otherwise we can simply rename our bound variable, becasue, notice that, the point of the lambda operator is to abstract away from the bound variable, i.e.:

\[(\lambda x.x^2)3\]

and

\[(\lambda t.t^2)3\]

are equivalent expressions (evaluate to the same quanitity for all substitutions).

This is called $\beta$-reduction and is central to lambda calculus

The cool thing about lambda calculus is that you could directly test it in code. For example, in python $\lambda x.x+1$ is written as:

and evalued as:

alt text

Unary but Higher Order

As you may have noticed by now: lambda functions are unary! Which seems like a problem because if the claim is that in can do everything that a programming langugage can then it must be able to encode functions like:

def sum(a, b):
    return a+b

which takes in two arguments. We can in fact do this, the key is that even though the functions are unary, they are higher order, i.e., they can (and in a langugae like lambda calculus which only has the notion of functions, MUST) return other lambdas. The lambda function for the above python code if:

\[\lambda x. \lambda y.x+y\]

Let’s test this by $\beta$-reducing it:

\[\displaylines{ (\lambda x. \lambda y.x+y)2 \space 3 \\ \implies (\lambda y.2+y) 3 \\ \implies 2+3 \\ \implies 5}\]

This is indeed correct and testable:

alt text

Again, for testing, the only constraints on the types of the inputs are that they must be summable in Python (they cant be int and a string). Again, in lambda calculus the notion of types is not inherent and the only thing we can work with are other lambdas.

alt text

Also notice that the type of the value returned by $ (\lambda x. \lambda y.x+y)2 \space 3 $ is an int, but the type returned by $ (\lambda x. \lambda y.x+y)2 $ is a function, i.e., the first lambda, which binds $x$, returns another lambda which binds $y$. In code this is analogous to:

function lambda1(x){
    return function lambda2(y){
        return x+y
    }
}

alt text

A note on grammar

I initially opted to skip this section and instead write it in the appendix, but as you will see soon ahead, it serves us very well to develop and define an unambiguous parsing grammar for $\lambda$ calculus.

The complete BNF grammar is:

\[\displaylines{ \begin{array}{rcl} \langle \lambda\text{exp} \rangle & ::= & \langle \text{var} \rangle \\ & \mid & (\langle \lambda\text{exp} \rangle) \\ & \mid & \lambda \langle \text{var} \rangle . \langle \lambda\text{exp} \rangle \\ & \mid & ( \langle \lambda\text{exp} \rangle )\langle \lambda\text{exp} \rangle \end{array} }\]

This recursively defines all the valid expressions in the language

1) Variable: The first condition denotes that a variable like $x$ is a valid expression. 2) if M is valid then (M) is also valid. 3) Lambda Abstraction: A $\lambda$ expression of the type $\lambda x.M$ is valid where $M$ is either a $\langle \text{var} \rangle$ as in $\lambda x.x^2$ or another lambda abstraction. Note ‘.’ sperates the input and the function. 4) Application or Substitution: The third rule corresponds to the substituion phase where the abstraction in the first brackets denotes to the function that needs to be evaluated and the second abstraction is the argument passed to it (note that the language only has functions, so the argument is also a function as opposed to a number like 1).

Note: also (by convention) application is left associative: ABC means (AB)C not A(BC), and application/substituion has higher precedence than abstraction: λx.AB means λx.(AB), not (λx.A)B.

But how do you program with this?

The first thing we will encode for is Booleans. Again, booleans as a data type don’t exist in our langugage yet so the best thing we can do is emulate their behavior.

The fundamental function of booleans in progrgamming is control flow as an if-else clause:

if(condition is true){do something}
else{do something else}

So, the encoding for True as a lambda function is:

\[\lambda x.\lambda y. x\]

and the eoncding for False is:

\[\lambda x.\lambda y. y\]

Notice, that True is basically a function that takes in two arguments and returns the first (corresponding to the first branch in an if-else statement) and the vice-versa for False.

To see how this emulates if-else consider a lambda function $F$ that returns either the True lambda function or the False lambda function, and depending on its truth value we want to return either $M$ or $N$, i.e.,

if(F){
    return M
}
else{
    return N
}

is the same as saying:

\[\lambda F. \lambda m.\lambda n. F \space m\space n\]

Consider the $\beta$-reduction when True is passed (F is True):

\[\displaylines{ (\lambda F. \lambda m.\lambda n. F \space m\space n)\space True \space M \space N \\ \implies (\lambda m.\lambda n.True \space m \space n)\space M \space N \\ \implies (\lambda n. True \space M \space n)\space N \\ \implies True \space M \space N \\ \implies (\lambda x. \lambda y. x )M \space N \\ \implies (\lambda y.M)N \\ \implies M}\]

and the $\beta$-reduction when the condition $F$ is False:

\[\displaylines{ (\lambda F. \lambda m.\lambda n. F \space m\space n)\space False \space M \space N \\ \implies (\lambda m.\lambda n.False \space m \space n)\space M \space N \\ \implies (\lambda n. False \space M \space n)\space N \\ \implies False \space M \space N \\ \implies (\lambda x. \lambda y. y )M \space N \\ \implies (\lambda y.y)N \\ \implies N }\]

alt text

Also note that we can name our lambda abstractions and use these names as aliases for the expression. We do this by replace $\lambda$ by the name of the function, i.e., a $\lambda$ function is an anonymous function, which is consistent with the use of lambda functions in languages like Java where it means an anonymous function.

Hence:

\[\displaylines{ True \space x. \lambda y. x \\ \text{and} \\ False \space x. \lambda y. y}\]

but in interest of readability I will use the following aliasing convention:

\[\displaylines{ True = \lambda x. \lambda y. x \\ \text{and} \\ False = \lambda x. \lambda y. y}\]

Now that we have defined the True and False, we can do a lot of interesting things with just these.

For example:

\[AND = \lambda x. \lambda y. \space x \space y \space False\]

The currying steps when $x$ is True and $y$ is False:

\[\displaylines{ (\lambda x. \lambda y. \space x \space y \space False) True \space False \\ \implies (\lambda y. True \space y \space False) False \\ \implies True \space False \space False \\ \implies (\lambda x. \lambda y. x) False \space False \\ ... \\ \implies False}\]

You could try to substitute $x$ and $y$ for all permutations of $True$ and $False$ and verify that the truth table is correct.

Intuitively, if $x$ is False, it will choose the second input to it which is False, and if it is True it will choose $y$.

If we are able to encode for $NOT$ we will have the full range of logical operators since {$ \land, \lnot $} make the universal set of logical operators.

\[NOT = \lambda x. x \space False \space True\]

Verification of the truth table is fairly simple in this case and left as an exercise :p but intuitvely, if $x$ is True it will select the first input which is False, and if not, then the second which is True.

Now all other logical operators can be defined in terms of $AND$ and $NOT$.

\[\displaylines{ OR \equiv \lor \equiv \lnot(\lnot A \land \lnot B) \\ OR = NOT(AND(NOT(A) \space \space NOT(B))) \\ OR = (\lambda x. x \space False \space True) AND(NOT(A) \space \space NOT(B)) \\ OR = (AND(NOT(A) \space \space NOT(B))) False \space True \\ OR = AND(\lambda x. x \space False \space True(A) \space \space \lambda x. x \space False \space True(B)) False \space True \\ OR = AND((A \space False \space True) \space \space (B\space False \space True)) False \space True \\ OR = (\lambda x. \lambda y. \space x \space y \space False ((A \space False \space True) \space \space (B\space False \space True))) False \space True \\ OR = (\lambda y. \space (A \space False \space True) \space y \space False (B\space False \space True)) False \space True \\ OR = ((A \space False \space True) \space (B\space False \space True) \space False) False \space True \\[0.3in] \text{Hence,} \\ OR = \lambda A. \lambda B. ((A \space False \space True) \space (B\space False \space True) \space False) False \space True}\]

You could try making the truth table for this expression to verify that this is indeed correct. However, we could think of a simpler expression for $OR$. When the first argument is True we want to return True, otherwise we want to return whatever the second input is:

\[OR = \lambda x. \lambda y. x \space True \space y\]

But at least we know for sure that it is possible to write any logical operator by some combination of AND and OR. The proof of why AND and OR make the universal set is something that I discovered the existence of in the appendix of my logic text book 10 minutes before the final paper of the course, and never bothered to understand.

Numerals

Numbers don’t exist. But we don’t need them. What does a number even do? It just tells you a quantity, and then you can do something quantity number of times. And you can do arithmetic on those quantity numbers to make new quantities. Big deal.

This is Zero: $\lambda f. \lambda x. x$ what is even going on here? Semantically: Zero takes in an operation $f$ and a starting value $x$ and applies $f$ to $x$ zero number of times, by just returning $x$.

This is One: $\lambda f. \lambda x. f x$

This is Two: $\lambda f. \lambda x. f(fx)$

Note that the encodings for Zero is the same as False, and that for One is the same as True, which is so surprising. It occurs naturally without us having intended it and corresponds to how booleans are dealt with in formal logic and most programming languages (0 for False and 1 for True).

You could go one defining all numbers by hand, and I think there is something to pretty and artisnal about it: defining the very basics of a language by hand. But we could also define a successor function, that takes in a number, a function, and a starting value and apply the the function one more time than the input number to the starting value.

\[succ = \lambda n. \lambda f. \lambda x. f(n\space f\space x)\]

Our first program:

We have done enough groundwork to build actual programs already. Most of these programs naturally Boolean functions, built by encoding for predicates and then using an if-else style control flow. First we will encode for the predicate isZero() which looks like this in cpp:

bool isZero(int n){
  if(n==0){
    return true;
  }
  else{
    return false;
  }
}

int main(){
  if(isZero(0)){
    printf("Statement is true");
  }
  else{
    printf("Statement is false");
  }
}

This translates to:

\[isZero = \lambda x. x \space (\lambda n. False)\space True\]

Why is this correct? It is so, because Zero is a lambda (like False) that returns it’s second argument so the second argument should be True for isZero, and the first should return False however many times it is applied to itself.

alt text

We could also write a function that tells if a number is odd or even:

\[isEven = \lambda x. x \space NOT \space True\]

If we $x$ is zero it returns the second argument which is True, otherwise if $x$ is n it applies the $NOT$ lambda $n$ times to True.

alt text

Using isEven we can build other functions like one which takes in two arguments and checks if their parities are equal.

bool parityEqual(int n, int m){
    if(!(isEven(n) ^ isEven(m))){ // ^ is XOR
        return true;
    }
    else{
        return false;
    }
}

So first let’s define the XOR lambda:

\[XOR = \lambda x. \lambda y. x(NOT \space y) \space y\]

i.e., if $x$ is false return whatever $y$ is, otherwise return $\lnot y$.

So, parityEqual:

\[\displaylines{ parityEqual = \lambda x. \lambda y. XOR(isEven(x) \space isEven(y)) \space False \space True \\[0.3in] \text{Substituting isEven(x) and isEven(y) in XOR}: \\[0.3in] \implies(\lambda x. \lambda y. x(NOT \space y) \space y)\space isEven(x) \space isEven(y) \\ \implies (\lambda y. isEven(x) (NOT \space y) \space y)\space isEven(y) \\ \implies isEven(x) (NOT \space isEven(y))\space isEven(y) \\[0.3in] Hence,\\ parityEqual = \lambda x. \lambda y. isEven(x) (NOT \space isEven(y))\space isEven(y) \space False \space True}\]

alt text

Another important function we should define is areEqual which takes in two numerals (their lambda expressions) are returns if they are equal or not. One way to do it is two subtract the two and pass the subtraction to the isZero function. Which means we need to encode for $+$, $-$ and other arithemtic operations.

Arithmetic

Addition

An addition lambda must take $m$ and $n$, two numerals as input and since, $m$ and $n$ correspond to applying a function $f$ (first input) to a starting value $x$ (second input) $m$, and $n$ number of times respectively, the output should apply $f$ to $x$ $\space$ $m+n$ number of times:

\[add = \lambda m. \lambda n. \lambda f. \lambda x. m(f)(n(f)(x))\]

Fundamentally add applies $f$ to itself with $x$ as the starting value $n$ times then applies $f$ to itself with $n(f)(x)$ as the sarting value $m$ times, so that $f$ is applied to itself with $x$ as the starting value $m+n$ times.

Multiplication

$m*n$ is the same as adding $n$ to itself $m$ times. So the multiplication lambda must apply $n$ to $f$ and $x$ $m $ times and that’s it.

\[mult = \lambda m. \lambda n. \lambda f. \lambda x. m(n(f))(x)\]

To define the subtraction function we need to define the predecessor function, which is a function, which applies $f$ to itself one less time, which is way trickier than succ.

Predecessor

The funny thing about the predecessor function is that it was given by god, no one knows how it works, men far beyond my stature have driven themselves insane trying to foolishly decipher this divine arrangemenr of symbols. Don’t try to understand it, just know that this exists, and quickly abstract upwards (If youre so smart go through the links at the end to try to understand it).

\[pred=λn.λf.λx.(n(λg.λh.h(gf))(λu.x))(λu.u)\]

A super helpful discussion on the semantics of pred is here: StackOverflow discussion, with the lower answers providing a better explanation, especially the ones by Dmitri Gekhtman and Cyker.

For the sake of completeness, here is dmitri’s answer:

For the sake of completeness, here is dmitri’s answer:

I’ll add my explanation to the above good ones, mostly for the sake of my own understanding. Here’s the definition of PRED again:

PRED := λnfx. (n (λg (λh.h (g f))) ) λu.x λu.u The stuff on the right side of the first dot is supposed to be the (n-1) fold composition of f applied to x: f^(n-1)(x). Let’s see why this is the case by incrementally grokking the expression. λu.x is the constant function valued at x. Let’s just denote it const_x.

λu.u is the identity function. Let’s call it id.

λg (λh.h (g f)) is a weird function that we need to understand. Let’s call it F.

Ok, so PRED tells us to evaluate the n-fold composition of F on the constant function and then to evaluate the result on the identity function.

PRED := λnfx. F^n const_x id Let’s take a closer look at F:

F:= λg (λh.h (g f)) F sends g to evaluation at g(f). Let’s denote evaluation at value y by ev_y. That is, ev_y := λh.h y

So

F = λg. ev_{g(f)} Now we figure out what F^n const_x is.

F const_x = ev_{const_x(f)} = ev_x and

F^2 const_x = F ev_x = ev_{ev_x(f)} = ev_{f(x)} Similarly,

F^3 const_x = F ev_{f(x)} = ev_{f^2(x)} and so on:

F^n const_x = ev_{f^(n-1)(x)}

Now,

PRED = λnfx. F^n const_x id
 = λnfx. ev_{f^(n-1)(x)} id

 = λnfx. id(f^(n-1)(x))

 = λnfx. f^(n-1)(x) which is what we wanted.
Super goofy. The idea is to turn doing something n times into doing f n-1 times. The solution is to apply F n times to const_x to obtain ev_{f^(n-1)(x)} and then to extract f^(n-1)(x) by evaluating at the identity function.

Ok! yay! That was something. Good thing is youll probably never need to come up with this on your own :D

Subtraction

\[sub = \lambda n. \lambda m. \lambda f. \lambda x. (m(pred)(n))f \space x\]

Hence, this corresponds to $n- m$.

alt text

Notice that the sub function is built using pred, which semantically means applying a function one less number of times to some value. But what happens when you pass zero to pred? You could try $\beta$-reducing it to find that it returns zero. Hence, the subtraction of a bigger number from a smaller number will equal 0.

alt text

Equality

We have now done enough groundwork to define a very important predicate: isEqual.

Notice that two numbers are only equal when their difference is 0, and we already have an isZero. But having a simple lambda like this:

\[\lambda m. \lambda n. (isZero(sub(m \space n))) \space True \space False\]

will not work because sub(m n) will be zero when n is greater than m. The key to overcoming this is to notice that if two numbers $m$, $n$ are equal, both $m-n$ and $n-m$ are Zero, and when they are not equal only one of the differences is Zero (corresponding to smaller - greater).

Hence the predicate we want to check for is:

\[isZero(m-n) \land isZero(n-m)\]

Translating to lambda expression:

\[isEqual = \lambda m. \lambda n. AND(isZero(sub(m \space n)) \space isZero(sub(n \space m)))\space True \space False\]

Loops Y Combinator

A combinator is just a lambda expression with no free variables.

Lambda calculus is a purely functional programming language which means it has no concept of iterations (which deal with state changes of variables), so we will need to simulate a while loop with recursion.

Consider this combinator:

\[Ω=(λx.x(x))(λx.x(x))\]

This is called the omega combinator and leads to an infinite (non-terminating) computation. $\omega = \lambda x.x(x) $ is the lambda expression which $Ω$ applies to itself.

\[Ω=(λx.xx)(λx.xx)→(λx.xx)(λx.xx)→…\]

i.e., it reproduces itself. This is a simple type of recursion (it is passed itself as an argument).

Another recursive combinator is the Y Combinator which looks similar to the $Ω$ combinator:

\[Y=λf.(λx.f(xx))(λx.f(xx))\]

Semantically: it takes in a function $f$, and finds its “fixed point” which is defined as a point which conincides with it’s image in $f$, i.e., $y$ is a fixed point of $f$ iff $f(y)=y$.

Let

\[g=λx.f(xx)\]

then

\[\displaylines{ \textbf{Y}f=gg \\ \implies \textbf{Y}f=(λx.f(xx))(λx.f(xx)) \\ \implies \textbf{Y}f=f((λx.f(xx))(λx.f(xx)))=f(gg) \\ \implies gg = f(gg)}\]

Hence, $gg$, i.e., $\textbf{Y}f$, the application of the Y Combinator the a function $f$ gives it’s fixed point. Why is this interesting? Because it allows us to define a function that is not recursive, and apply the Y Combinator to it which makes it recursively call itself with different values.

\[\displaylines{ gg = f(gg) = f(f(gg)) = f(f(f(gg)))... \\ \textbf{Y}f = f(\textbf{Y}f) = f(f(\textbf{Y}f)) ...}\]

FizzBuzz

To finally solve FizzBuzz in $\lambda$ calculus we will need to define 2 more predicates: isMod3Zero and isMod5Zero, which is easier than defining a more general mod3 and mod5 function or the even more general mod function. But I will leave these as black boxes for now, and return to them once we have seen how to deal with recursion.

Let’s define the FizzBuzz() function in cpp:

void fizzbuzz(int n){
    for(int i = 1; i <= n; i++){
        if(i % 3 == 0 && i % 5 == 0){
            cout << "FizzBuzz" << endl;
        } else if(i % 3 == 0){
            cout << "Fizz" << endl;
        } else if(i % 5 == 0){
            cout << "Buzz" << endl;
        } else {
            cout << i << endl;
        }
    }
}

But we can also define this function recursively which maps to what we are trying to do through lambda calc much better:

void fizzbuzz_rec(int n){
    if(n == 0){
        return;
    }
    fizzbuzz_rec(n-1);
    if(n % 3 == 0 && n % 5 == 0){
        cout << "FizzBuzz" << endl;
    } else if(n % 3 == 0){
        cout << "Fizz" << endl;
    } else if(n % 5 == 0){
        cout << "Buzz" << endl;
    } else {
        cout << n << endl;
    }
}

The recursive outline of the following code is like this:

Recursively call the function.
Enter into a (nested) if-else block.

The if-else part of the code, tho cumbersome to write, is straightforward using the boolean predicates we have seen so far.

To deal with the recursion part, we will follow a simple strategy. Notice that in simple untyped lambda calc functions don’t have names (all the named functions we have seen so far are aliases for convinience), which means we can’t recurse by calling the function by name inside of its definition because the name does not exist. So we will abstract away the recursive function call and wrap the whole definition inside another function. I will now use a CPP type pseudolanguage.

void almost_fizzbuzz(function f){
  return f(int n){
    if(n == 0){
        return;
    }
    f(n-1);
    if(n % 3 == 0 && n % 5 == 0){
        cout << "FizzBuzz" << endl;
    } else if(n % 3 == 0){
        cout << "Fizz" << endl;
    } else if(n % 5 == 0){
        cout << "Buzz" << endl;
    } else {
        cout << n << endl;
    }
  }
}

The strategy is to pass this almost function to the Y Combinator! To recap:

Write your function iteratively.
Change it to a recursive function.
Abstract the recursive call by changing the recursive function call name to $f$
Wrap the whole function inside another function that takes $f$ as an argument.
Pass the wrapper function to the Y Combinator.

Why this works is very well explained here: Mike Vanier’s explanation

The gist of it is that:

\[\displaylines{ fizzbuzz = \textbf{Y}(almost\_fizzbuzz) = almost\_fizzbuzz(\textbf{Y}(almost\_fizzbuzz)) \\ \implies fizzbuzz = almost\_fizzbuzz(fizzbuzz)}\]

Thus, we are able to pass the function to itself anonymously, and almost_fizzbuzz gets passed the correct function to evalute the next recursive argument.

So, finally let’s define the almost lambda function:

\[\displaylines{ almost\_fizzbuzz = \lambda f.\lambda n. (IsEqual(n)(Zero))(\lambda x. \text{"Function Over"})\\ (\lambda x. f (Pred(n)) (And(IsMod3Zero(n))(IsMod5Zero(n)))(λx. "FizzBuzz") \newline (IsMod3Zero(n))(λx. "Fizz") \\ (IsMod5Zero(n))(λx. "Buzz") \\ (\lambda x. n))}\]

Which is quite a mouthful, but just denotes a basic control flow on inspection.

Hence FizzBuzz is:

\[FizzBuzz = \textbf{Y}almost\_fizzbuzz\]

But, we still have to define the isMod functions. I will provide the recursive template, which needs to be made into its almost equivalent, and passed to the Y comb. I think this is now straightforward and left as an exercise :)

bool isMod3Zero(int n){
  if(n==0){
    return true;
  }
  else if(n<0){
    return false;
  }
  else{
    return isMod3Zero(n-3);
  }
}

A helpful tip is to draw the if-else flowchart and to notice that in lambda calc

\[Predicate \space Lambda(\text{Lambda if Predicate is True})(\text{Lambda if Predicate is False})\]

is equivalent to:

if(Predicate){
  Do smth
}
else{
  Do smth else
}

Closing

Notice that I didn’t implement the FizzBuzz lambda in Python. That’s because there is a discussion on lazy and strict evaluation in programming langugaes that needs to happen before. However, I am too tried right now. Maybe a part 2 :p