ZKP – Building Babylon

2 June 202516 June 2025

Unlucky folds & the correlated agreement theorem

An important step in the soundness analysis of the FRI-IOPP is bounding the probability of an “unlucky fold”: assuming that the prover is cheating (so that the initial oracle is $δ$ -far from the code), how likely is it that the verifier “unluckily” chooses a folding challenge $α \in F$ such that the $α$ -folding of the oracle is not $δ$ -far from its code? We describe here how to bound this probability using the Correlated Agreement Theorem from [PG-RSC].

Notation

Let $F$ be a finite field and suppose $Ω \subset F^{\times}$ is a multiplicative subgroup with even order, so that $ω \mapsto ω^{2}$ is a 2-to-1 map $Ω \to Ω^{2}$ . Write $F^{Ω}$ (resp. $F^{Ω^{2}}$ ) for the space of all $F$ -valued functions on $Ω$ (resp. $Ω^{2}$ ). Denote by $E_{Ω}$ the evaluation map $f \mapsto (f (ω))_{ω \in Ω} \in F^{Ω}$ , and for any $d > 0$ , write ${RS}_{d} [Ω] = E_{Ω} (F [X]^{< d})$ for the Reed-Solomon code consisting of evaluations of polynomials with degree less than $d$ on $Ω$ . Write $Δ$ for the relative Hamming distance.

Define $Θ = (Θ_{0}, Θ_{1}) : F^{Ω} \to F^{Ω^{2}} \times F^{Ω^{2}}$ by $(Θ_{0} u)_{ω^{2}} = \frac{u_{ω} + u_{- ω}}{2}, (Θ_{1} u)_{ω^{2}} = \frac{u_{ω} - u_{- ω}}{2 ω}, \forall ω \in Ω, \forall u \in F^{Ω} .$ The map $Θ$ , which operates in the evaluation domain, corresponds to the familiar even/odd decomposition of a polynomial in the coefficient domain. In particular: $u_{ω} = (Θ_{0} u)_{ω^{2}} + ω (Θ_{1} u)_{ω^{2}}, \forall ω \in Ω, \forall u \in F^{Ω},$ and so the folding $Ψ_{α} u$ of $u \in F^{Ω}$ using $α \in F$ can be defined using $Θ$ via: $Ψ_{α} : F^{Ω} \to F^{Ω^{2}}, u \mapsto (Θ_{0} u) + α (Θ_{1} u), \forall u \in F^{Ω} .$

The “Correlated Agreement Theorem” ${CAT}_{d, Ω}$

Let $ρ = d / | Ω |$ , and write ${CAT}_{d, Ω}$ for the following statement, proven in [PG-RSC] (as Theorem 4.1):

Suppose that $δ \in (0, \frac{1 - ρ}{2}]$ , and that $u^{(0)}, u^{(1)} \in F^{Ω}$ are such that $| {α \in F | Δ (u^{(0)} + α u^{(1)}, {RS}_{d} [Ω]) < δ} | \geq | Ω | .$ Then there exists $Λ \subset Ω$ with $\frac{| Λ |}{| Ω |} \geq 1 - δ$ and $f^{(0)}, f^{(1)} \in F [X]^{< d}$ such that $f^{(0)} (λ) = u_{λ}^{(0)}, f^{(1)} (λ) = u_{λ}^{(1)}, \forall λ \in Λ .$

The Correlated Agreement Theorem & Unlucky Folds

The contrapositive of ${CAT}_{d / 2, Ω^{2}}$ can be used to bound the probability of an unlucky fold as follows.

Assume that $u \in F^{Ω}$ is $δ$ -far from ${RS}_{d} [Ω]$ . Then $Δ (u, E_{Ω} f) \geq δ, \forall f \in F [X]^{< d} .$ Suppose (for contradiction) that there exist $f^{(0)}, f^{(1)} \in F [X]^{< d / 2}$ and $Λ \subset Ω^{2}$ with $| Λ | / | Ω^{2} | \geq 1 - δ$ and $f^{(0)} (λ) = (Θ_{0} u)_{λ}, f^{(1)} (λ) = (Θ_{1} u)_{λ}, \forall λ \in Λ .$ Set $f (X) = f^{(0)} (X^{2}) + f^{(1)} (X^{2})$ and write $\sqrt{Λ} = {ω \in Ω | ω^{2} \in Λ}$ . Then for any $λ \in Λ$ and either of its two square roots $ω \in \sqrt{Λ}$ : $f (ω) = f^{(0)} (λ) + ω f^{(1)} (λ) = (Θ_{0} u)_{λ} + ω (Θ_{1} u)_{λ} = u_{ω},$ and so $f \in F [X]^{< d}$ and $u \in F^{Ω}$ agree on a subset $\sqrt{Λ} \subset Ω$ of density $\frac{| \sqrt{Λ} |}{| Ω |} = \frac{| Λ |}{| Ω^{2} |} \geq 1 - δ .$ Thus we would have that $Δ (u, E_{Ω} f) < δ$ , which would contradict the $δ$ -farness of $u$ from the code. Thus no such $f^{(0)}$ , $f^{(1)}$ and $Λ$ exist. Therefore, by the contrapositive of ${CAT}_{d / 2, Ω^{2}}$ , we have that $| {α \in F | Δ (Θ_{0} u + α Θ_{1} u, {RS}_{d / 2} [Ω^{2}]) < δ} | < | Ω | .$ i.e. that the probability of the folding $Ψ_{α} u$ failing to be $δ$ -far from ${RS}_{d / 2} [Ω^{2}]$ (an “unlucky fold”) is strictly less than $ϵ = \frac{| Ω |}{| F |}$ .

Proving the Correlated Agreement Theorem

The Correlated Agreement Theorem, in the case where $δ$ is at most the unique decoding radius, can be proven by running the Berlekamp-Welch decoder over the function field $F (α)$ (where $α$ , transcendental over $F$ , stands in for the folding challenge). This works since the Berlekamp-Welch decoder is really just a result from linear algebra, and so works over any field. The other ingredient is the Polishchuk-Spielman Lemma. A proof of the theorem and this lemma are given in [PG-RSC].

References

[PG-RSC]: E. Ben-Sasson, D. Carmon, Y. Ishai, S. Kopparty and S. Saraf, “Proximity Gaps for Reed–Solomon Codes,”, 2020 (pdf).

7 March 202514 March 2025

Consequences real & imaginary of “How to Prove False Statements: Practical Attacks on Fiat-Shamir”

“How to Prove False Statements: Practical Attacks on Fiat-Shamir” (Khovratovich, Rothblum, Soukhanov; 2025) constructs families of circuits for which the GKR protocol, when made non-interactive by Fiat-Shamir heuristic, will prove false statements. It’s a great paper and is so well written that I won’t attempt to do better by paraphrasing their constructions. What I’d like to look at here is instead the consequences of the result for real-world applications of non-interactive proofs.

The Fiat-Shamir heuristic attempts to simulate, in the non-interactive setting, random challenges that the verifier would have issued in an interactive setting. It does so by replacing the information-theoretic assumption of the random oracle model with a cryptographic assumption about deterministic hash functions. Fiat-Shamir is crucial to real-world cryptographic applications (and blockchains, in particular) because it enables non-interactive proofs: a non-interactive proof can be checked by any number of verifiers, present and future. Moreover these verifiers don’t need to trust one another. They can check the validity of the proof for themselves.

How does Fiat-Shamir work? The prover (and any future verifiers), replace each random message from the verifier with the result of applying a cryptographic hash function to all of the messages from the prover up to that point. Intuitively, the cryptographic assumptions on the hash function should then prevent a computationally-bounded malicious prover from choosing their messages to obtain a desired (Fiat-Shamir generated) message from the verifier and thereby prove a false statement. This has never been proven, however, and indeed earlier work had demonstrated somewhat contrived protocols where it fails.

The current paper shows that the Fiat-Shamir heuristic fails for the very natural and useful GKR protocol. In my assessment, this paper is very impactful in a social sense, in that it reminds us all of the fallibility of cryptographic conjectures. It is a blow to the confidence of those building blockchain applications, and also to those who use blockchains as a store of wealth. However, this is not because this specific attack can be carried out in meaningful applications, but simply because it reminds us how we tend to build castles on sand. Moreover, the result “breaks the promise” of GKR in that it illustrates how GKR can, for certain maliciously constructed circuits, prove false statements. Importantly, however, the paper demonstrates failure of non-interactive GKR only for a contrived family of circuits, and so there is a strong sense in which this result has no practical consequences whatsoever. A verifying party is not simply checking “is this a valid proof of the satisfaction of some circuit?” but rather “is this a valid proof of satisfaction of the circuit XYZ known to me”. Any verifying party that doesn’t care which circuit is satisfied before it releases the funds (or performs some critical function) was already vulnerable to deception, irrespective of this result. Why would you trust a circuit that you hadn’t audited?

It has also been suggested that this paper signals the end of recursive verification of GKR circuits. The reason for this being that both this attack and recursive verification rely on being able to circuitize the hash function used in the transcript, and so facilitating one facilitates the other. This concern evaporates however when we again remember that the only circuits used in production are ones upon which the prover and verifying parties have agreed, and this includes circuits for recursive verification. Why would a verifying party accept the use of a recursive verifier circuit with a backdoor in it? They simply wouldn’t.

3 February 202522 May 2025

Building a polynomial commitment scheme from the FRI-IOPP

Here we cover how to build a polynomial commitment scheme (PCS) from the FRI interactive oracle proof of proximity (FRI-IOPP). Specifically, we explain how to reduce an evaluation proof to a proof of proximity. It is assumed that the reader already understands the FRI-IOPP. Everything here was derived from reading Transparent PCS with polylogarithmic communication complexity (Vlasov & Panarin, 2019), along with (to be honest) much reflection and discussion.

We assume the usual setup for FRI. Let $F$ denote a finite field, write $Ω \subset F$ for a subgroup of the group of units $F^{\times}$ with $| Ω |$ a power of two. Let $F^{Ω}$ denote the vector space of functions $Ω \to F$ , considered as vectors with entries indexed by some fixed enumeration of $Ω$ (we’ll call elements of $F^{Ω}$ “words”). Let $E : F [X] \to F^{Ω}$ denote the evaluation map, i.e. $(E f)_{ω} = f (ω) \forall f \in F [X] \forall ω \in Ω .$ Write $Δ$ for the relative Hamming distance on $F^{Ω}$ , and let ${RS}_{k} = E (F [X]^{< k}) \subset F^{Ω}$ denote the Reed-Solomon code with degree bound $k$ . Fix throughout some degree bound $1 < d < | Ω |$ . We will be concerned with two separate Reed-Solomon codes, namely ${RS}_{d}$ and ${RS}_{d - 1}$ (note that both use the same evaluation points $Ω$ ). Write $δ_{0}$ for the unique decoding radius of ${RS}_{d}$ , i.e. $δ_{0}$ is maximal such that, for any $f \in F [X]^{< d}$ and $u \in F^{Ω}$ , $Δ (u, E f) ⩽ δ_{0} ⟹ (\forall g \in F [X]^{< d} Δ (u, E g) ⩽ δ_{0} ⟹ f = g) .$

The data of a commitment in the FRI-PCS with degree bound $d$ is a word $u \in F^{Ω}$ (more precisely: an oracle to $u$ ). Assume for now that the prover is honest. In the simplest case, $u = E f$ for some $f \in F [X]^{< d}$ . Crucially, however, we require something weaker than this of a commitment $u$ : it is sufficient that $u$ be within the unique decoding radius of some codeword $E f$ , i.e. $\begin{matrix} (UDR) & Δ (u, E f) ⩽ δ_{0} \exists f \in F [X]^{< d} . \end{matrix}$ Any such $u$ is considered to be a valid commitment to its corresponding $f \in F [X]^{< d}$ . Why is this subtlety necessary? Because if the stricter condition $u = E f$ were required, then the (soon to be explained) reduction of an evaluation proof to an invocation of the FRI-IOPP wouldn’t be possible. The FRI-IOPP isn’t sufficiently sensitive to reliably distinguish between a codeword and nearby non-codewords (for an explicit demonstration, see here).

Assuming that the prover has sent a commitment $u \in F^{Ω}$ to the verifier, an evaluation proof proceeds as follows. The verifier chooses an evaluation point $r \in F ∖ Ω$ and sends this to the prover, and the prover responds with the purported evaluation $c$ of the polynomial committed to (in the case of an honest prover, $c = f (r)$ where $f$ is determined by $(UDR)$ ). The prover wants to establish $\begin{matrix} (TwoPart) & \exists f \in F [X]^{< d} s . t . Δ (u, E f) ⩽ δ_{0} and f (r) = c . \end{matrix}$
Since $f (r) = c$ if and only if $f - c$ is divisible by $X - r$ , and degree is additive under polynomial multiplication, this two part claim is equivalent to the following unified claim:
$\begin{matrix} (Unified) & \exists q \in F [X]^{< d - 1} s . t . Δ (u, E ((X - r) q + c)) ⩽ δ_{0} . \end{matrix}$ Thus we are interested in the proximity of the commitment $u$ to a specific subset of ${RS}_{d}$ , namely the subset consisting of codewords of the form $E ((X - r) q + c)$ . Note, however, that we can not immediately apply FRI-IOPP to establish this claim, since this subset is not itself a Reed-Solomon code (it isn’t even a linear subspace of $F^{Ω}$ ). The good news is that, with a small amount of work, it can be seen that the claim $(Unified)$ is equivalent to a claim that can be established using the FRI-IOPP (with degree bound $d - 1$ ). This claim involves a vector $D_{r, c} u \in F^{Ω}$ derived from $u$ , which we’ll define below. And thus the FRI-PCS evaluation proof will be established using by a proximity proof!

Firstly, a note on oracles. Importantly, the verifier does not need to receive or read all of the $u \in F^{Ω}$ that commits to $f \in F [X]^{< d}$ . Since an evaluation proof reduces to an invocation of the FRI-IOPP, it is sufficient for the verifier to be supplied with a mechanism to query entries $u_{ω}$ of $u$ at arbitrary indices $ω \in Ω$ . In an information theoretic presentation, this mechanism is abstracted as an oracle. In implementation, the oracle is typically instantiated with a Merkle commitment to the tree whose leaves are the $u_{ω}$ (enumerated in some agreed upon manner). The prover binds itself to $u$ by sending the Merkle root to the verifier, and the verifier queries an entry $u_{ω}$ by asking the prover for the Merkle path from the root to the corresponding leaf.

Let’s now derive the reduction of the evaluation proof to an instance of the FRI-IOPP. Given bijections $θ_{w} : F \to F$ for each $ω \in Ω$ , a bijection $θ : F^{Ω} \to F^{Ω}$ of the space of all words can be defined by $(θ v)_{ω} = θ_{ω} (v_{ω}) \forall v \in F^{Ω}, \forall ω \in Ω .$ Call such a map $θ$ a “component-wise bijection”; it is immediate for any such map that $\begin{matrix} (HDInv) & Δ (θ u, v) = Δ (u, θ^{- 1} v) \forall u, v \in F^{Ω} . \end{matrix}$ For any $r \in F ∖ Ω$ and $c \in F$ , define a component-wise bijection $D_{r, c} : F^{Ω} \to F^{Ω}$ by $(D_{r, c} u)_{ω} = \frac{u_{ω} - c}{ω - r} \forall u \in F^{Ω}, \forall ω \in Ω .$ Its inverse is given by $(D_{r, c}^{- 1} (v))_{ω} = (ω - r) v_{ω} + c \forall v \in F^{Ω}, \forall ω \in Ω,$ from which it follows immediately that $\begin{matrix} (DrcInverse) & (D_{r, c}^{- 1} \circ E) (q) = E ((X - r) q + c) \forall q \in F [X] . \end{matrix}$ Combining $(HDInv)$ and $(DrcInverse)$ , we obtain $\begin{matrix} (StepDown) & Δ (D_{r, c} u, E (q)) = Δ (u, E ((X - r) q + c)) \forall v \in F^{Ω} \forall q \in F [X], \end{matrix}$ and substituting this into the claim $(Unified)$ , we obtain the equivalent claim $\begin{matrix} (FRIable) & \exists q \in F [X]^{< d - 1} s . t . Δ (D_{r, c} u, E q) ⩽ δ_{0}, \end{matrix}$ which the prover and verifier can establish using the FRI-IOPP! $(FRIable)$ is nothing more than the statement that $D_{r, c} u$ is $δ_{0}$ -close to ${RS}_{d - 1}$ .

Cheating fails (w.h.p.)

It is instructive to now consider the two possible cheating strategies for the prover in this reduction to the FRI-IOPP. The first is where the first claim of $(TwoPart)$ doesn’t hold, i.e. for all $f \in F [X]^{< d}$ , $u$ is not within the unique decoding radius of some $E f$ . Put differently, this is just saying that $Δ (u, {RS}_{d}) > δ_{0}$ , i.e. “ $u$ is $δ_{0}$ -far from the code ${RS}_{d}$ ”. Thus, by $(StepDown)$ , for any $q \in F [X]^{< d - 1}$ , $Δ (D_{r, c} u, E q) = Δ (u, E ((X - r) q + c)) ⩾ Δ (u, {RS}_{d}) > δ_{0},$ and so claim $(FRIable)$ is false, and will be caught with the soundness bound of the FRI-IOPP. The only other cheating strategy for the prover is that where the $u$ is in the unique decoding radius of $E f$ for some $f \in F [X]^{< d}$ , but the claimed evaluation is false, i.e. $f (r) \neq c$ . Then for any $q \in F [X]^{< d - 1}$ , we have $f \neq (X - r) q + c$ (since otherwise the evaluations at $c$ would match), and so $E f$ and $E ((X - r) q + c)$ are distinct codewords. Thus, by $(StepDown)$ , $Δ (D_{r, c} u, E q) = Δ (u, E ((X - r) q + c)) > δ_{0},$ since $u$ can be within $δ_{0}$ (the unique decoding radius) of at most one codeword (which is $E f$ ).

How to perform the FRI-IOPP with degree $d - 1$ ?

The FRI-IOPP is defined for degree bounds $d$ that are powers of two. But if $d$ is a power of two, how to perform the second FRI-IOPP, which uses the degree bound $d - 1$ ? The short answer is simply to replace it with another invocation of the FRI-IOPP with degree bound $d$ . In detail:

The verifier will use the oracle to $u$ to simulate an oracle $D_{r, c} u$ to $(f - c) / (X - r)$ .
The prover wishes to establish that $D_{r, c} u$ is within the unique decoding radius (UDR) of ${RS}_{d - 1}$ . The UDR for ${RS}_{d - 1}$ is larger than the UDR $δ_{0}$ for ${RS}_{d}$ , so it is sufficient to show that $Δ (D_{r, c} u, {RS}_{d - 1}) < δ_{0}$ .
Prover and verifier use the FRI-IOPP to instead show that $Δ (D_{r, c} u, {RS}_{d}) < δ_{0}$ . This shows that there exists some $q \in F [X]^{< d}$ such that $Δ (D_{r, c} u, E q) < δ_{0}$ .
By equation $(StepDown)$ , it follows that $Δ (u, (X - r) q + c) < δ_{0}$ .
But we are within the UDR, so $(X - r) q + c = f$ , where $f$ is the polynomial from the first IOPP invocation. Thus $deg (q) = deg (f) - 1$ . The first FRI-IOPP showed that $deg (f) < d$ , and so $E q \in {RS}_{d - 1}$ . Thus $(Unified)$ is established, and we are done.

10 January 2025

FRI is a proof of proximity, not a low-degree test

(The following thought experiment was suggested to me by my colleague Ryan Cao; mistakes and invective are my own).

FRI is often described as a “low degree test”, which suggests that the verifier should reject with high probability if the degree is high. This is not the case, as the simple example below demonstrates. Indeed it is the polynomials of highest degree that have the highest chance of being erroneously accepted by the verifier. (What is true is that the FRI verifier rejects with high probability if the evaluations are far from the code: FRI is a proof of proximity).

Let $F$ be a field, $Ω \subset F$ with $n = | Ω |$ , and write
$ϵ_{Ω} : F [x]^{< n} \to F^{Ω}, f \mapsto (f (ω))_{ω \in Ω}$ for the evaluation map. For any $k \leq n$ , let $RS [Ω, k] := ϵ_{Ω} (F [x]^{< k})$ be the Reed-Solomon code of evaluations of polynomials of degree strictly less than $k$ on $Ω$ .

Henceforth we assume that $Ω$ is a subgroup of the multiplicative group $F^{\times}$ and that $n = | Ω |$ is a power of two. The degree bound $k = 2^{d}$ will also be a power of two. For $α \in F$ , let $Φ_{α} : F [x] \to F [x]$ be the FRI folding operation (in the coefficient domain) and write $Ψ_{α} = ϵ_{Ω} \circ Φ_{α} \circ ϵ_{Ω}^{- 1}$ for the corresponding operation in the evaluation domain. Then it’s not hard to check that for any $v \in F^{Ω}$ and $α \in F$ , we have $\begin{matrix} (Fold) & (Ψ_{α} (v))_{ω^{2}} = \frac{1}{2} (1 + \frac{α}{ω}) v_{ω} + \frac{1}{2} (1 - \frac{α}{ω}) v_{- ω} \forall ω \in Ω . \end{matrix}$

FRI operates by iterated folding, using challenges provided by the verifier. The claim of the proximity of $v^{(0)} := v \in F^{Ω}$ to the code $RS [Ω, 2^{d}]$ is reduced to the claim of the proximity of $v^{(1)} := Ψ_{α} v^{(0)}$ to the code $RS [Ω^{2}, 2^{d - 1}]$ for some $α \in F$ . This process is repeated until the claim is that $v^{(d)}$ is close to the code $RS [Ω^{2^{d}}, 1]$ , at which point the verifier queries $t$ entries from $v^{(d)}$ and checks if they all agree: if they do, it concludes that $v^{(d)}$ is (w.h.p.) close to its code, and hence that the original vector $v^{(0)}$ is (w.h.p.) close to the original code. Proving that FRI is indeed a good test of proximity is complicated. It is easy to show, however, that FRI is not (indeed never intended being) a reliable means of detecting polynomials of high degree, and that is what we’ll do here.

For any $ω \in Ω$ , let $L_{ω, Ω}$ denote the Lagrange interpolation polynomial, i.e. $L_{ω, Ω} = \prod_{\begin{matrix} ω^{'} \in Ω ω^{'} \neq ω \end{matrix}} \frac{x - ω^{'}}{ω - ω^{'}} \in F [x] .$ Then $deg L = n - 1$ , i.e. it has has maximal degree (given the size of the evaluation domain $Ω$ ). Its evaluations, however, are “one-hot” at $ω$ $v_{ω, Ω} := ϵ_{Ω} (L_{ω, Ω}) = (0, \dots, 0, 1, 0, \dots, 0)$ and in this sense are maximally close to the code, while not being in the code (having Hamming distance $1$ from the zero codeword). It’s easy to see from $(Fold)$ that $Ψ_{α} (v_{ω, Ω}) = \frac{1}{2} (1 + \frac{α}{ω}) v_{ω^{2}, Ω^{2}}$ i.e. that except with negligible probability (when $α = - ω$ ), the folding will also have Hamming distance 1 from its code $RS [Ω^{2}, 2^{d - 1}]$ . Note, however, that the relative Hamming distance has doubled, since the size of the evaluation domain has halved.

So let’s set $v^{(0)} := v_{ω, Ω}$ . Then after $d$ rounds of folding, $v^{(d)}$ is (except with negligible probability) non-zero in exactly one place, and its relative distance from the code $RS [Ω^{2^{d}}, 1]$ (which consists of constant vectors) is $2^{d} / 2^{n}$ i.e. the rate $ρ$ of the original code. The verifier thus accepts with probability $(1 - ρ)^{t}$ . Given that $ρ$ is typically close to zero (e.g. $2^{22 - 64}$ ) and $t$ is always small, this probability is close to $1$ . For this reason, it is misleading (if commonplace) to describe FRI as a “low-degree test”: the polynomials $L_{ω, Ω}$ , which have maximal degree are routinely accepted by the FRI verifier. Why? Because the evaluations of these polynomials are close (“proximal”) to the code. FRI is a proof of proximity, after all.

17 July 202430 July 2024

When is a polynomial determined by evaluations? Polynomial interpolation over commutative rings with unity.

A polynomial with coefficients in a field and of degree $< n$ is determined by its evaluations at any $n$ distinct points. A common way to see this is via Lagrange interpolation. But what happens in the more general case where the coefficients come from a commutative ring $R$ with $1$ ? It’s easy to see that the statement fails. Consider e.g. $R = Z / 8 Z$ , and let $f (X) = 4 X + 4 X^{2}$ . Then $f$ vanishes everywhere on R (easy to check), despite having degree two. In particular, there are multiple polynomials of degree $< 3$ (viz. $f$ and the zero polynomial) that vanish at three distinct points e.g. $1, 2, 3$ .

The correct generalization of the statement can be derived by considering the Vandermonde matrix. Recall that, given points $c_{1}, \dots, c_{n} \in R$ , the Vandermonde matrix $V$ is the $n$ x $n$ matrix consisting of powers of the $c_{i}$ . The matrix-vector product of $V$ and the vector of the coefficients of a polynomial $f$ then gives the vector of evaluations $f (c_{i})$ of $f$ at the points $c_{i}$ . Interpolation goes the other way, i.e. from evaluations to coefficients. So we’d like to be able to invert the Vandermonde matrix.

As it happens, a square matrix over a commutative ring $R$ with $1$ is invertible if and only if its determinant is invertible in $R$ (the construction of the inverse matrix in terms of the adjugate demonstrates this). The determinant of the Vandermonde can be shown (using only column operations and properties of the determinant) to be the product of the $c_{i} - c_{j}$ for $i \neq j$ . Thus we see that a polynomial $f \in R [X]$ of degree $< n$ is determined by its evaluations at $n$ distinct points if the differences of these evaluation points are invertible in $R$ .

In fact, we can do much better: the differences don’t need to have inverses in $R$ , they just need to be invertible in a larger ring: it in fact suffices that the differences are not zero divisors in $R$ . For suppose that this is the case. Let $S$ be the multiplicative closure of the set of pairwise differences. Then $S$ contains no zero divisors, and so $R$ can be considered as a subring of its localization $S^{- 1} R$ at the subset $S$ . Importantly, the pairwise differences have inverses in $S^{- 1} R$ . Hence, by the above argument, any polynomial of degree $< n$ with coefficients in $S^{- 1} R$ is determined by its evaluations at our points, and this of course continues to hold when the coefficients (and evaluations) lie in the subring $R$ .

To return to the problematic example above: for any three distinct points in $R = Z / 8 Z$ , either at least two of them are odd, or at least two of them are even, and in either case there will be a pair of distinct points whose difference is even and hence either zero or a zero divisor.

10 July 202411 July 2024

Efficient polynomial interpolation on $0, 1, 2, . .$ (the inverse Vandermonde on integer nodes)

The Vandermonde matrix computes evaluations of polynomials from their coefficients via a matrix-vector product. The Vandermonde matrices are nested, i.e. each Vandermonde matrix is the principal submatrix of any larger Vandermonde matrix that uses (an extension of) the same sequence of evaluation points. For example, here is the (square) Vandermonde matrix for the evaluation points $0, 1, 2, 3$ ; the Vandermonde matrix for $0, 1, 2$ is the $3$ x $3$ principal (i.e. top-left) submatrix:

If the inverse Vandermonde matrix exists, then the inverse computation (from evaluations to coefficients, i.e. polynomial interpolation), can also be performed as a matrix-vector product (for the inverse of the Vandermonde matrix to exist, it must be square and the evaluation points must be distinct). Unfortunately, the inverse Vandermonde matrices are no longer “nested” in the above sense. For example, here are the inverse Vandermonde matrices for evaluation points $0, 1, 2$ and $0, 1, 2, 3$ .

Who cares? Well, it would be nice if they were nested since then one could pre-compute a sufficiently large inverse Vandermonde matrix, and be able to interpolate any polynomial that came along. But it just isn’t so! However, for the particular case where the evaluation points are the non-negative integers (and indeed in many more general cases), the neatness can be restored using a certain triangular decomposition of the Vandermonde matrix and its inverse.

Let’s write $V_{n}$ for the Vandermonde matrix for the evaluation points $0, 1, . ., n$ . Then $V_{n}$ can be expressed as a product of an lower-triangular, diagonal and upper-triangular matrices $L_{n}$ , $D_{n}$ , and $U_{n}$ i.e. $V_{n} = L_{n} D_{n} U_{n}$ . Thus $V_{n}^{- 1} = U_{n}^{- 1} D_{n}^{- 1} L_{n}^{- 1}$ . It turns out that all of these factors form nested families in the sense above, e.g. $U_{n}$ is the principal $n$ x $n$ submatrix of any $U_{n + k}$ , while $L_{n}^{- 1}$ is the principal $n$ x $n$ submatrix of $L_{n + k}^{- 1}$ , and so on. Thus if we pre-compute $L_{n}^{- 1}, D_{n}^{- 1}, U_{n}^{- 1}$ then we’ll be able to interpolate any polynomial of degree at most $n$ (by selecting the appropriate principal submatrix of each factor, and then performing three matrix-vector multiplications).

Furthermore, the entries of $L^{- 1}, D^{- 1}, U^{- 1}$ (also of $L$ , $D$ , $U$ ) are given by pleasing and useful recursive formulae, and these can be used to increase the size of your precomputed matrices as required. For instance, $(L^{- 1})_{i, j} = (- 1)^{i - j} (\binom{i}{j})$ and the identity $(\binom{i}{j}) = (\binom{i - 1}{j - 1}) + (\binom{i - 1}{j})$ is easily adapted to a recurrence on the $(L^{- 1})_{i, j}$ that allows us to extend $L^{- 1}$ as needed. The diagonal entries of $D^{- 1}$ are given by $(D^{- 1})_{i, i} = \frac{1}{i!}$ (recurrence obvious) while the entries of $U^{- 1}$ are given by the (signed) Sterling numbers of the first kind via $(U^{- 1})_{i, j} = s (j, i)$ . These quantities satisfy the recurrence $s (j, i) = s (j - 1, i - 1) - (j - 1) s (j - 1, i)$ with boundary conditions $s (0, 0) = 1$ and $s (k, 0) = s (0, k) = 0$ for all $k > 0$ .

These formulae were first derived in Vandermonde matrices on integer nodes (Eisinberg, Franzé, Pugliese; 1998). Another useful (and freely available) reference is Symmetric functions and the Vandermonde matrix (Oruç & Akmaz; 2004) which deals with the case of $q$ -integer nodes (take $q = 0$ in their Theorem 4.1 to obtain the formulae above).

19 May 202426 May 2024

A construction of the finite fields (with exercises)

The following is intended as an introduction to finite fields for those with already some familiarity with algebraic constructions. It is based on a talk given at our local seminar.

A finite field is simply a field with a finite number of elements. An example of a finite field that should already be familiar is $Z / p Z$ , the integers modulo a prime $p$ , which in the context of field theory is more commonly denoted $F_{p}$ . But what other finite fields exist? In this post, we’ll construct a finite field $F_{p^{n}} = G F (p^{n})$ of size $p^{n}$ for any prime $p$ and positive integer $n$ , and additionally prove that, up to isomorphism, these are all the finite fields.

(Note that another common notation for $F_{p^{n}}$ is $G F (p^{n})$ – the “GF” stands for “Galois field”).

$F_{p}$ is a field

Firstly, let’s take a moment to show why $F_{p}$ is a field. It is clearly is a commutative ring with 1, so it remains to see why every non-zero element $a$ has an inverse. We need to find an element $x$ such that $a x \equiv 1 \mod p$ . The Extended Euclidean Algorithm provides a way to find such a $x$ . The algorithm takes two positive integers $a, b$ and returns integer coefficients that linearly combine $a$ and $b$ to yield their GCD, i.e. such that $a x + b y = gcd (a, b)$ . Take $b = p$ . Since $p$ is prime and $a$ is not zero, $gcd (a, p) = 1$ . The Euclidean algorithm therefore yields $x$ and $y$ such that $a x + p y = 1$ , which means $a x \equiv 1 \mod p$ . Hence, $x$ is the multiplicative inverse of $a$ .

This same argument will be recycled below in our construction of extension fields.

The characteristic of a field

The characteristic of a field $K$ is the smallest positive integer $p$ such that $p \cdot 1 := 1 + \dots + 1$ ( $p$ times) equals $0$ in $K$ . In other words, it is the order of the additive group generated by the element $1$ .

If $K$ is finite, then it is clear that such a $p$ must exist. Moreover, $p$ must be prime. For supposing that $p$ factorized, as say $p = r s$ with $1 < r, s < p$ , it would follow that $\begin{matrix} (ZD) & (r \cdot 1) (s \cdot 1) = 0, \end{matrix}$ while at the same time, by minimality of the characteristic, we’d have that neither of the multiplicands $r \cdot 1$ , $s \cdot 1$ were themselves zero. To arrive at a contradiction, either note that you’ve constructed zero divisors in a field, or instead use that fact that $r \cdot 1$ (being non-zero) has an inverse, multiply both sides of $(ZD)$ by that inverse and note that this would force $s \cdot 1 = 0$ , a contradiction.

(A similar argument shows that $Z / m Z$ is not a field if $m$ is not prime).

If no positive integer $p$ exists such that $p \cdot 1 = 0$ , the characteristic is defined to be zero (this is the case for $Q, R, C$ , for example).

The prime subfield

A subfield of a field is simply a subset which is itself a field (with the same $1$ and $0$ ). The prime subfield of a field $K$ is the subfield generated by $1$ and is the smallest subfield contained in $K$ . If the characteristic of $K$ is a prime number $p$ , then the prime subfield is (a copy of) the field $F_{p}$ . If the characteristic of $K$ is zero, then the prime subfield is isomorphic to the field of rational numbers $Q$ .

Of course, the prime subfield could be the entire field!

Any finite field has size a prime power, and that prime is its characteristic

Let $K$ be a finite field of characteristic $p$ , and identify $F_{p}$ with the prime subfield of $K$ . Now let’s forget some of the structure of $K$ and just consider $K$ as a vector space over the field $F_{p}$ . The vector space axioms are indeed satisfied, since elements of $K$ can be added together, and multiplied by scalars (i.e. elements of $F_{p}$ ) in a way that is distributive and associative – all of this just follows from the field axioms.

Now let $n \geq 1$ be the dimension of $K$ as a vector space over $F_{p}$ . If you chose a basis for $K$ , it would have length $n$ , and every element of $K$ would have a unique expression as a linear combination of the basis with coefficients in $F_{p}$ . Moreover, every such expression would be an element of $K$ . There are $p^{n}$ such expressions, so $| K | = p^{n}$ .

Example: there is precisely one field with four elements

While we will indeed construct $F_{p^{n}}$ for every prime $p$ and $n > 0$ , let’s first do the simplest possible example beyond the more familiar fields $F_{p}$ : let’s “manually” construct a field $F_{4}$ with four elements. Indeed, we’ll see that there is only one such field, up to isomorphism.

Firstly, note that $F_{4}$ has characteristic 2 (by the preceding section), and hence has $F_{2}$ as its prime subfield. So there are only two “new” field elements. Call them $A, B$ , so that $F_{4} = {0, 1, A, B}$ . Note that the four elements must all be pairwise non-equal, or the field is too small. Now, try to fill in the multiplication table for this new field, using the fact that the non-zero elements of a field (in our case: $1, A, B$ ) must form a group under multiplication. This implies that each element can appear at most once in each row and column. You’ll see that there is only one way to do this!

Similarly, try filling in the addition table, this time using the fact that the field is a group under addition, as well as $A + A = A \cdot (1 + 1) = A \cdot 0 = 0$ (similarly for $B$ ). There is only one possible addition table!

Below, we’ll construct this same finite field (and many others) but in a more sophisticated manner.

Polynomial prerequisites

Polynomial division

Given two polynomials $f, g \in K [x]$ , $f \neq 0$ , we can perform polynomial division to write $g (x) = q (x) f (x) + r (x)$ for some unique $q, r \in K [x]$ such that $deg (r) < deg (f)$ . Call $q$ the quotient and $r$ the remainder. This is analogous to the division algorithm for integers.

Roots correspond to linear factors

A polynomial $f (x)$ has a root $λ$ if and only if it is divisible by the linear polynomial $(x - λ)$ . This can be seen using polynomial division: for if $f (x)$ is divided by $(x - λ)$ , then the remainder is $f (λ)$ . Hence, $f (λ) = 0$ if and only if the remainder is zero, which means $f (x)$ is divisible by $(x - λ)$ .

Aside: a finite field is never algebraically closed

While this subsection has no relevance to the construction below, it is too nice to omit!
Recall that a field $K$ is said to be algebraically closed if every non-constant polynomial $f (x) \in K [x]$ has a root in $K$ . For example, $C$ is algebraically closed, while $R$ is not. Now if $K$ is a finite field, form the polynomial $f (x) = (\prod_{λ \in K} (x - λ)) + 1$ and notice that $f (λ) \neq 0$ for any $λ \in K$ . Thus $K$ can not be algebraically closed.

Irreducible polynomials

An irreducible polynomial over a field $K$ is a non-constant polynomial that cannot be factored into the product of two non-constant polynomials over $F$ . Irreducibles of degree three or lower are easy to find: any factorization must involve a linear factor, and these can be detected by evaluating the polynomial (as discussed above).

Exercise 1: Verify that, over $F_{2}$ , the polynomial $x^{2} + x + 1$ is the unique quadratic irreducible.

Exercise 2: (Again over $F_{2}$ ) show that $x^{3} + x + 1$ and $x^{3} + x^{2} + 1$ are the unique cubic irreducibles.

A stepping stone: constructing new fields from old

Let $K$ be any field (not necessarily finite) and let $f \in K [x]$ an irreducible polynomial of degree $n$ . Write $K_{(f)} = K [x] / f K [x]$ for the quotient of the ring $K [x]$ by the ideal generated by $f$ . Then $K_{(f)}$ is itself a ring with $1$ . Let $π : K [x] \to K_{(f)}$ be the surjection of rings that comes from the quotient construction, i.e. that maps any polynomial $g$ to its coset $g + f K [x]$ .

Just as the elements of $Z / p Z$ are enumerated by remainders after integer division by $p$ , the elements $g + f K [x]$ of $K_{(f)}$ can be enumerated by remainders $r (x)$ of polynomial division of $g (x)$ by $f (x)$ : if $g = q f + r$ , then $π (g) = π (r)$ . If $K$ is indeed finite, this immediately tell us that $| K_{(f)} | = | K |^{n}$ , since there are $| K |$ possibilities for each of the $n = deg (f)$ coefficients of $r (x)$ .

There is moreover an extended Euclidean algorithm for polynomials, and (analogous to our argument for $Z / p Z$ ) this can be used to demonstrate that every non-zero element of $K_{(f)}$ has an inverse. For if $a$ is such an element, than there exists a $g \in K [x]$ with $π (g) = a$ , and we have that $g$ is not divisible by $f$ , since $a \neq 0$ . Thus, the greatest common divisor of $f$ and $g$ (which is defined to be the monic polynomial of maximal degree dividing both $f$ and $g$ ), in view of the irreducibility of $f$ , must be $1$ . The extended Euclidean algorithm therefore yields polynomials $s, t \in K [x]$ such that $s f + t g = 1$ , and applying $π$ to both sides of this equation shows that $π (t) = π (g)^{- 1}$ , i.e. $π (t)$ is the inverse of $a = π (g)$ .

We’ve thus shown that $K_{(f)}$ is a field. Indeed, it has $K$ as a subfield, and so $K \subset K_{(f)}$ is a field extension. It is, in fact, quite a special field extension – the polynomial $f$ , which was irreducible over $K$ , has a root $K_{(f)}$ , namely $π (x)$ . To see this, first note that $π$ is a $K$ -linear map. Then:
$f (π (x)) = \sum_{i} f_{i} (π (x))^{i} = \sum_{i} f_{i} π (x^{i}) = π (\sum_{i} f_{i} x^{i}) = π (f (x)) = 0.$

In summary, given a field $K$ and an irreducible $f \in K [x]$ of degree $n$ , we’ve constructed an extension field of $K$ in which $f$ has a root!

Note that we’d have achieved our goal of constructing a field with $p^{n}$ elements if we knew that there was an irreducible polynomial of degree $n$ over $F_{p}$ . But we don’t know this at this stage. Nonetheless, the above construction is the crucial ingredient, as we’ll see below.

Exercise 3: Verify that the complex numbers $C$ can be constructed from the real numbers $R$ in this way, using the irreducible quadratic $f (x) = x^{2} + 1 \in R [x]$ . In particular, you should recover the familiar formulae for the real and complex parts of the multiplication of two complex numbers from multiplication in $K_{(f)}$ . (For a worked solution, see here).

Exercise 4: Carry out the above construction for $K = F_{2}$ and the irreducible $f (x) = x^{2} + x + 1 \in K [x]$ , and check that you obtain the field with four elements (which we constructed earlier in manual fashion).

Exercise 5: (continuing the example of the previous exercise) Show that both roots of $f$ are obtained ( $π (x)$ is one of them, which is the other?). Though we won’t use (or show) this here, it turns out that this is always true if $K$ is finite, then $f$ will factor completely into linear factors over the extension field $K_{(f)}$ . You can cycle through the roots by applying the Frobenius automorphism.

Existence of a splitting field

Suppose $K$ is a field (not necessarily finite) and $h \in K [x]$ a non-constant polynomial. A splitting field for $h$ is a field $L$ extending $K$ (so $K \subset L$ ) over which $h$ splits as a product of linear factors, and that is minimal with the property, i.e. if $L^{'}$ with $K \subset L^{'} \subset L$ is another such field, then $L^{'} = L$ .

We show here that splitting fields exist (a special case of which will be the last ingredient in our construction of the finite fields).

We proceed iteratively. $h$ has a unique expression as a product of irreducibles over $K$ . If this expression consists only of linear factors, then stop. If not, choose a non-linear (i.e. degree > 1) irreducible factor $f$ , and construct the field $K_{(f)}$ as above. Considering $h \in K_{(f)} [x]$ , we see that $h$ has at least one more linear factor than before. Repeat this process, each time replacing $K$ by $K_{(f)}$ where $f$ is one of the remaining non-linear irreducible factors of $h$ . Since polynomials have finite degree, this process which terminate with a field $\hat{L}$ over which $h$ factors linearly. Now take the smallest subfield $L \subset \hat{L}$ over which $h$ factors linearly (such a field is uniquely determined, since the intersection of any two subfields with this property will again be a subfield with this property). Then we have constructed a splitting field for $h$ .

Construction of a field with $p^{n}$ elements

Finally! Using the construction of the previous section, let $L$ be a splitting field of $h (x) = x^{p^{n}} - x \in F_{p} [x]$ . So $F_{p^{n}} \subset L$ . Now let $L^{'} = {λ \in L | h (λ) = 0}$ . It remains to show that $L^{'}$ is a field and $| L^{'} | = p^{n}$ .

To see that $L^{'}$ is a field, first note that $0, 1 \in L$ are both roots of $h$ , so $0, 1 \in L^{'}$ . Now simply show that $L^{'}$ is closed under addition, multiplication, and inversion. Only addition is not immediate: for this, you need to use that the binomial coefficients $(\binom{p^{n}}{k})$ vanish in characteristic $p$ whenever $0 < k < p^{n}$ (which follows from the definition of the binomial coefficient in terms of factorials, c.f. here). Thus $L^{'}$ is a field.

Finally, note that $| L^{'} |$ is equal to the number of distinct roots of $h$ . The polynomial $h$ has degree $p^{n}$ , but perhaps there are repeated roots? There are not. If a root $λ$ was repeated, then $(x - λ)^{2}$ would divide $h$ . But if this were the case, then $(x - λ)$ would divide its derivative $\frac{d h}{d x}$ (this follows immediately from the product rule for differentiation). But direct calculation shows that $\frac{d h}{d x} = - 1$ (in characteristic $p$ ), and so $h$ can have no repeated roots. Hence $| L^{'} | = p^{n}$ , and we have constructed a field with $p^{n}$ elements!

Extension: these are all the finite fields

In the previous section, we constructed a splitting field $L^{'}$ for the polynomial $h (x)$ and showed that it had $p^{n}$ elements. But could there be multiple, non-isomorphic fields of size $p^{n}$ ? There can not, as we see below. We need this uniqueness up to isomorphism in order to be able to sensibly speak of “the field $F_{p^{n}}$ with $p^{n}$ elements”!

Suppose that $K$ is some other field with $| K | = p^{n}$ , so $F_{p} \subset K$ . Then the set of all non-zero elements of $K$ is a multiplicative group of size $p^{n} - 1$ . Thus for any non-zero $λ \in K$ , we have that $λ^{p^{n} - 1} = 1$ , or, put differently, that $λ^{p^{n}} - λ = 0$ , i.e. $h (λ) = 0$ ! Note that this holds also for $λ = 0$ , so we’ve shown that every element of $K$ is a root of $h$ . Since $| K | = p^{n} = deg (h)$ , it follows that $h$ factors linearly over $K$ , and that $K$ is a minimal extension of $F_{p}$ with this property since $h$ has no repeated factors (as seen in the previous section). Thus $K$ is a splitting field for $h$ as well, i.e. all fields of size $p^{n}$ are splitting fields for $h$ .

Splitting fields are unique up to isomorphism in the sense detailed below. This statement is trivial if, as some authors do, you chose to consider only fields inside of a fixed algebraic closure of $F_{p}$ . If, like me, you would prefer not to do this, you might proceed as follows.

14 February 202417 February 2024

Understanding LogUp: A Royal Road

While there is famously no “royal road to geometry”, I believe that there is a royal road to understanding the wonderful logUp, a lookup argument from Starkware’s Shahar Papini and Polygon’s Ulrich Haböck. We’ll take this royal road here. This is significantly more direct than the approach taken in the two papers. The advantage of the exposition of the papers is that the thought processes that led to the final formulation are apparent (which is appreciated). The advantage of the exposition here is that it is formulated with the benefit of hindsight and ignores the historical development. Consequently (I hope!), you’ll get to the heart of the matter faster.

The setup for any lookup argument is a “table” $t$ of values that are permitted, and a “witness column” $w$ consisting of values to be checked. Both the table and witness column are multisets, typically represented as one-dimensional arrays of field elements, where repetition of an element in the array is used to represent multiplicity of that element in the multiset. The goal a lookup is to demonstrate (with high probability) that all of the witness values appear in the table, or equivalently, that considered as sets (i.e. ignoring multiplicities), the witness is a subset of the table, i.e. $\begin{matrix} (Subset) & \forall i \exists j : w_{i} = t_{j} . \end{matrix}$ The table and witness are typically different lengths, but we’ll assume for simplicity that they are both powers of two, say $w = (w_{i})_{i = 0, \dots, 2^{M} - 1} t = (t_{j})_{j = 0, \dots 2^{N} - 1},$ for some $M, N \geq 0$ .

What’s wrong with the naive approach to lookups?

To see what’s truly wonderful about logUp, it’s crucial to see what’s wrong with a “naive” lookup argument. A typical lookup argument (not logUp) would show $(Subset)$ by exhibiting, for each table entry $t_{j}$ , a non-negative integer $m_{j} \geq 0$ and then showing (via random evaluation) that the following polynomial equality holds $\begin{matrix} (Naive) & \prod_{i = 0}^{2^{M} - 1} (X - w_{i}) = \prod_{j = 0}^{2^{N} - 1} (X - t_{j})^{m_{j}} . \end{matrix}$ To see why this is problematic, consider how the exponents on the right hand side will be computed in circuit using addition and multiplication gates. Before anything else, $X$ is replaced with a random field element $α$ (in pursuit of Schwartz-Zippel). Then, for each $(α - t_{j})$ , all of the powers $(α - t_{j})^{2^{0}}, (α - t_{j})^{2^{1}}, (α - t_{j})^{2^{2}}, \dots, (α - t_{j})^{2^{M} - 1}$ need to be computed by repeated squaring. These powers are then combined to obtain $(α - t_{j})^{m_{j}}$ : $(α - t_{j})^{m_{j}} = \prod_{k = 0}^{M - 1} (b_{k}^{(j)} (α - t_{j})^{2^{k}} + (1 - b_{k}^{(j)})),$ where $m_{j} = \sum_{k = 0}^{M - 1} b_{k}^{(j)} 2^{k}$ is the binary decomposition of $m_{j}$ into bits $b_{k}^{(j)}$ , for $j = 0, \dots M - 1$ . And that’s the problem: not only do the multiplicities $m_{j}$ need to be provided to the circuit as inputs, but so do their binary decompositions! This is an order $M$ (=log of witness length) blow up in the number of circuit inputs. All inputs have to be committed to, and that’s expensive.

What’s so great about logUp?

LogUp demonstrates that $w \subset t$ by exhibiting, for each table entry $t_{j}$ , a field element $m_{j} \in F_{q}$ such that that the following “logUp identity” holds: $\begin{matrix} (LogUp) & \sum_{i = 0}^{2^{M} - 1} \frac{1}{X - w_{i}} = \sum_{j = 0}^{2^{N} - 1} \frac{m_{j}}{X - t_{j}} . \end{matrix}$ Setting aside for a moment the meaning of the inverse polynomial summands, we can see already why logUp is great. The multiplicities are not non-negative integers, but rather field elements, and using them in circuit involves just scalar multiplication! In particular, no binary decomposition of the multiplicities is required, resulting in significantly fewer inputs to be committed to (in contrast to the naive approach outlined above).

The logarithmic derivative

The logarithmic derivative of a function is just the derivative of its logarithm. If you apply this transformation to both sides of naive lookup equation $(Naive)$ , you’ll see you get the logUp equation $(LogUp)$ . To do this, you’ll need to work symbolically, treating polynomials as formal objects (not functions, see below). While this connection between the two equations is conceptually pleasing (and important for understanding where logUp came from), it is worth noting that proof of the soundness of the logUp approach doesn’t use the logarithmic derivative. See Lemma 5 (which relies on Lemma 4) of the 2022 logUp paper, or see below for an alternative proof.

The logUp identity is an equation in the field of fractions

Before attempting to show that the logUp identity is equivalent to the subset relation, let’s pause to think about where the logUp identity $(LogUp)$ “lives”.

Recall that polynomials are formal arithmetic combinations of field elements and an indeterminate (so e.g. $X^{2} \neq X$ in $F_{2} [X]$ , even though they coincide as functions $F_{2} \to F_{2}$ , because they are distinct formal sums; c.f. here). There is no danger in making this distinction. Any equality between two polynomials is also an equality between their corresponding polynomial functions (since evaluation at any point is a ring homomorphism).

The field of fractions $F (X)$ is similarly a formal object, consisting of pairs of polynomials $(p, q)$ , where $q \neq 0$ , that are considered up to an equivalence that mimics that of fractions, i.e. $(p, q) \sim (p^{'}, q^{'})$ if and only if $p q^{'} = p^{'} q$ . They can be formally added and multiplied in the way that seems natural if one writes $p / q$ for $(p, q)$ (if unfamiliar, have a play and convince yourself that all is okay).

The logUp identity $(LogUp)$ is an equality in the field of fractions $F (X)$ .

How to show $(LogUp)$ : Schwartz-Zippel for the field of fractions

Let $p (X), q (X)$ be the polynomials given by
$\frac{p (X)}{q (X)} = (\sum_{i = 0}^{2^{M} - 1} \frac{1}{X - w_{i}}) - (\sum_{j = 0}^{2^{N} - 1} \frac{m_{j}}{X - t_{j}}),$ where $q (X)$ is the obvious product of all the denominators (i.e. with repetition). To show that the logUp identity $(LogUp)$ holds, we need to show that $p (X) / q (X) = 0 / 1$ in $F (X)$ , i.e. that $p (X) = 0$ and $q (X) \neq 0$ . Random evaluation (a.k.a. Schwartz-Zippel) can be used to show both of these simultaneously (w.h.p.). There are two small caveats: (i) if you’re unlucky and you hit a root of $q (X)$ , you’ll need to resample, and (ii) you need to take the inverse of the evaluation of $q (X)$ as an input to the circuit to show that that evaluation is indeed non-zero in circuit.

For randomly sampled $α \in F$ , if $α \neq w_{i} \forall i, \land α \neq t_{j} \forall j, \land \sum_{i} \frac{1}{α - w_{i}} = \sum_{j} \frac{m_{j}}{α - t_{j}}$ then (since evaluation at any $α$ is a ring homomorphism) $p (α) = 0 \land q (α) \neq 0$ from which it follows (w.h.p.) that $\frac{p (X)}{q (X)} = 0$ , which implies $(LogUp)$ .

The logUp relation $(LogUp)$ is equivalent to the subset relation $(Subset)$

As mentioned, this is shown in Lemma 5 and 4 of the 2022 paper. We show it here in a different way.

One direction of implication is trivial: if $(Subset)$ , then $(LogUp)$ clearly holds. Note that this is irrespective of the characteristic of the field (not so for the converse, as we’ll see).

We prove the converse statement (i.e. $(LogUp)$ implies $(Subset)$ ) via the contrapositive, but for this we need the assumption $2^{M} < char (F)$ , i.e. that the witness length is bounded by the characteristic of the field. Suppose that $(Subset)$ does not hold. Then there exists some $i_{0}$ such that $w_{i_{0}} \neq t_{j}$ for all $j$ . Let $I$ denote the set of all indices $i$ such that $w_{i} = w_{i_{0}}$ , and write $K = | I |$ . Note that $K < 2^{M} < char (F)$ . Let $p (X), q (X)$ be as in the previous section. Since $q (X) \neq 0$ , to show that $(LogUp)$ is not satisfied, it suffices to show that $p (X) \neq 0$ . Straightforward calculation shows that $p (X)$ can be written in the form $p (X) = (X - w_{i_{0}})^{K - 1} φ (X) + (X - w_{i_{0}})^{K} ψ (X)$ where $φ (X) = K (\prod_{i \notin I} (X - w_{i})) (\prod_{j} (X - t_{j}))$ and $ψ (X)$ is a polynomial (which polynomial doesn’t matter). By $K - 1$ applications of the product rule for differentiation (as per usual, we differentiate polynomials symbolically), we see that $p^{(K - 1)} (w_{i_{0}}) = (K - 1)! φ (w_{i_{0}}) .$ Recalling that $K$ is bounded by the characteristic, we see by inspection that $φ (w_{i_{0}}) \neq 0$ and consequently (by the same fact) that $p^{(K - 1)} (w_{i_{0}}) \neq 0$ . Thus $p^{(K - 1)} (X) \neq 0$ , and so $p (X) \neq 0$ , and we’re done.

Fractional sumcheck via the GKR protocol

We saw above that, in order to show $(LogUp)$ w.h.p., we need to show that $\begin{matrix} (Eval) & \sum_{i} \frac{1}{α - w_{i}} = \sum_{j} \frac{m_{j}}{α - t_{j}} \end{matrix}$ for some random $α \in F$ . This is just a relationship in the field. To show it, the authors describe “fractional sumcheck”, which amounts to separately reducing each side to a single fraction, and then showing that these two fractions are equal.

The reduction of each side of the equation is expressed as a layered arithmetic circuit to which the GKR protocol can be applied. Imagine we want to reduce a sum $\sum_{b \in B^{N}} \frac{p (b)}{q (b)},$ where $B = {0, 1}$ and $p$ and $q$ are functions $B^{N} \to F$ . Note that e.g. the right hand side of $(Eval)$ can be written in this form by replacing the indices $j = 0, \dots, 2^{N} - 1$ of the multiplicities $m$ and the table $t$ with bitstrings $b \in B^{N}$ and defining the functions $p$ and $q$ to give the numerators and denominators of the summands. Now define $N + 1$ functions
$p_{k}, q_{k} : B^{k} \to F, 0 \leq k \leq N,$ by $p_{N} := p$ , $q_{N} := q$ and $p_{k} (b) := p_{k + 1} (0 b) q_{k + 1} (1 b) + p_{k + 1} (1 b) q_{k + 1} (0 b),$ $q_{k} (b) := q_{k + 1} (0 b) q_{k + 1} (1 b)$ for all $b \in B^{k}$ , where e.g. $1 b$ denotes the bitstring of length $k + 1$ obtained by prefixing $b$ with a $1$ . Then the desired single fraction is the ratio of field elements (i.e. functions on $B^{0}$ ) $p_{0} / q_{0}$ . Do the same for the other side of the equation, obtaining $p_{0}^{'} / q_{0}^{'}$ , and then check both sides are equal via $p q^{'} - q p^{'} = 0$ and $q \neq 0$ , $q^{'} \neq 0$ (these last two are shown using the inverses of $q$ and $q^{'}$ , taken as inputs). The above defines a layered arithmetic circuit with wiring that is regular in the sense of the GKR protocol. This allows the satisfaction of the circuit to be efficiently verified without needing to materialize (or commit to) any of the intermediate values. For more on the GKR protocol, check out Thaler’s book, or instead this blogpost by Remco Bloemen.

The case of batch witness columns

LogUp works just as well for a batch of witness columns. We haven’t made that explicit in the above (contrary to the presentation of the papers) because it suffices to simply concatenate the witness columns and sum up their multiplicities.

Other works using the logarithmic derivative for lookups

As described in the introduction to the 2022 logUp paper, there was both existing and concurrent work using the logarithmic derivative for lookup arguments (they are on the reading list!):

Thanks

Thank you to the exceptional team at Modulus Labs, Georg Wiese, Victor Sint Nicolaas, Hamish Ivey-Law and Ulrich Haböck for helpful discussions and suggestions (any errors are my own).

References

2023 paper

Improving logarithmic derivative lookups using GKR (Papini & Haböck)
Talk at PSE

Notation

The “Correlated Agreement Theorem” CATd,Ω

The Correlated Agreement Theorem & Unlucky Folds

Proving the Correlated Agreement Theorem

References

Cheating fails (w.h.p.)

How to perform the FRI-IOPP with degree d−1?

Fp is a field

The characteristic of a field

The prime subfield

Any finite field has size a prime power, and that prime is its characteristic

Example: there is precisely one field with four elements

Polynomial prerequisites

Polynomial division

Roots correspond to linear factors

Aside: a finite field is never algebraically closed

Irreducible polynomials

A stepping stone: constructing new fields from old

Existence of a splitting field

Construction of a field with pn elements

Extension: these are all the finite fields

What’s wrong with the naive approach to lookups?

What’s so great about logUp?

The logarithmic derivative

The logUp identity is an equation in the field of fractions

How to show (LogUp): Schwartz-Zippel for the field of fractions

The logUp relation (LogUp) is equivalent to the subset relation (Subset)

Fractional sumcheck via the GKR protocol

The case of batch witness columns

Other works using the logarithmic derivative for lookups

Thanks

References

2023 paper

2022 paper

The “Correlated Agreement Theorem” ${CAT}_{d, Ω}$

How to perform the FRI-IOPP with degree $d - 1$ ?

$F_{p}$ is a field

Construction of a field with $p^{n}$ elements

How to show $(LogUp)$ : Schwartz-Zippel for the field of fractions

The logUp relation $(LogUp)$ is equivalent to the subset relation $(Subset)$