Introducing the Type 1 Prover

Recorded: Feb. 8, 2024 Duration: 0:25:57

What's going on, everybody? This is Smoky behind the Polygon account. While we get this stage set, we're going to play a little bit of music.

All right, perfect, so why don't we get started. Again, this is Smoky behind the Polygon account and the community manager at Polygon Labs, and we are going to introduce the Type 1 prover. To start off, why don't we go around the room and give some brief introductions, starting with Brendan.

Hey everyone, I'm Brendan. I work on the Polygon Zero team at Polygon.

Perfect. All right, let's go on over to PG Paul.

Hi, I'm Paul. I'm part of the product team here; I work on product research, and I got lucky enough to work with Brendan and Daniel and team on getting this Type 1 prover out the door.

Perfect, thank you, Paul. All right, let's go on over to Daniel.
Hey, I'm Daniel, also from Polygon Zero. I work mainly on prover libraries and the zkEVM as well.

Thank you, Daniel. And then let's go on over to Robin.

Hi everyone, I'm Robin, working as a cryptographer at Toposware, and, well, happy to be here and happy to have collaborated with Zero on this zkEVM.

Thank you, Robin. And we do have one more person down in the audience: we have John. I did invite you up to speak, so if you just want to accept that invite. All right, perfect. Hey John, do you want to give us an introduction, please?

Hey, hey everybody. My name is John. I work at Polygon and I lead the dev tools team, so I'm mainly focused on running the software and getting it optimized and performant.
Perfect, thank you. All right, well, let's get into it. Why don't we start out with a high-level overview and the significance of the Type 1 prover. I'm going to direct this question to Brendan: can you tell us what a Type 1 prover is?

Sure. So for the Type 1 zkEVM prover, we use the naming from Vitalik's framework for classifying zkEVMs, but it's a really exciting development that allows any existing EVM chain to immediately upgrade to ZK. With the Type 1 prover we can generate proofs for any EVM chain, whether it's a sidechain like Polygon PoS, any optimistic rollup, or even the Ethereum mainnet itself. What we've shown today are benchmarks, and obviously open-source code, that we've been using to prove mainnet Ethereum blocks. The exciting thing about this is that if you asked most people how efficient and how practical Type 1 zkEVM provers were, they would say not very, and we've shown that we're able to generate proofs at an average cost per transaction of two to three tenths of a cent. And we're just getting started in terms of optimization and reducing this. This might be a little bit optimistic, but I think there's reason to believe that we will reduce that cost by 30 to 50x by the end of the year.

Wow, super impressive. Thanks, Brendan.
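To make those figures concrete, here is a rough back-of-the-envelope projection, a minimal sketch assuming only the numbers quoted above (0.2 to 0.3 cents per transaction today, a hoped-for 30 to 50x reduction); it is illustrative, not an official roadmap.

```python
# Rough projection based only on the figures quoted above; illustrative, not official.
current_cost_per_tx = (0.002, 0.003)   # dollars: "two to three tenths of a cent"
reduction_factor = (30, 50)            # hoped-for improvement by the end of the year

best_case = current_cost_per_tx[0] / reduction_factor[1]   # most optimistic combination
worst_case = current_cost_per_tx[1] / reduction_factor[0]  # least optimistic combination
print(f"projected cost per transaction: ${best_case:.5f} to ${worst_case:.5f}")
# projected cost per transaction: $0.00004 to $0.00010
```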
And is it that a Type 1 is better than a Type 2 prover, or what are some of the trade-offs between the two?

Yeah, so I guess to back up for context: the EVM is something that's notoriously ZK-unfriendly. It uses a lot of cryptographic primitives and data structures that are just not very conducive to being proven with a zero-knowledge proof. The way that we've gotten around that for the last year was by using a Type 2 zkEVM. A Type 2 has full compatibility with all existing Ethereum smart contracts, all wallets, all dev tools; it's a really great solution for a new chain, and it's just more efficient to generate zero-knowledge proofs for. But there are a lot of chains that already exist, and they've already been running, and they have state and they have users, and it's better to generate proofs for those chains as they are. The Type 1 allows us to do that. It doesn't require us to change the data structures or the primitives used by these chains; we can just immediately start generating zero-knowledge proofs for chains as they're running, without any changes on the client or the wallet or anything. So the Type 1 is really for upgrading existing EVM chains to ZK. Part of the reason we built this was because we want to onboard those chains into the Polygon ecosystem, and it's just a really powerful thing to be able to take any existing EVM chain and allow it to instantly join Polygon's AggLayer and share liquidity and state and user base with other Polygon chains, so they're able to just immediately move over and launch with the Type 1.
Perfect, thank you. And do you want to get into the difference between a Type 2?

Yeah, so a Type 2, like I said, just uses different data structures on the back end. It uses a different representation for the state trie, so to the user and to the developer it's an identical environment to Ethereum, but you can't use a Type 2 prover to generate proofs for existing EVM chains. That's the main difference.

That's perfect, thank you, Brendan.
And can you dive a little bit deeper into the applications of the Type 1 prover?

Yep. So it's really any existing EVM chain. We see this as being useful for basically one thing: there is a lot of user activity, liquidity, and state locked in EVM chains. They might be sidechains like the Polygon PoS chain, or they might be optimistic rollups, and these chains are making compromises, either in terms of not having full Ethereum security, like the Polygon PoS chain, or in terms of imposing real economic costs on their users, like optimistic rollups do with seven-day withdrawal delays. The Type 1 fundamentally unlocks these chains and allows them to upgrade to ZK, which means that optimistic rollups, for example, can get rid of the seven-day withdrawal delay. If you look at the real cost that optimistic rollups have to bear, or rather that users of optimistic rollups have to bear, it's that they have to pay third-party bridges to avoid having their funds locked for seven days. In aggregate, optimistic rollup users have paid tens of millions of dollars to third-party bridges to avoid this and to get around the capital inefficiency that's inherent in optimistic rollups, and ZK allows us to change that.
Perfect, thank you, Brendan. All right, so for this next question I'm going to direct it to Robin. It wasn't long ago that the notion of a zkEVM, an EVM-compatible ZK rollup, was considered years away, much less a zkEVM that was Ethereum-equivalent. So what are some of the breakthroughs in ZK R&D that account for this acceleration, and who are some of the people and teams not on this Spaces whose work contributed?

Well, there are different reasons why this became practical over the years, and part of the people who made it possible are the people speaking here right now, the Zero team. Basically there are two things. First, improvements in ZK research in general: proving systems that are better targeted at, and more optimized for, these kinds of unfriendly primitives. For example, we have seen considerable improvements on lookup arguments in the past few years, which are a big thing for proving stuff like the EVM, or really any kind of VM. And then there is something that's a bit of a friendly fight in the space, but it's field size. The usual SNARKs, I mean the ones with pairings and everything, require large fields, which are extremely wasteful for some specific operations, while Zero, with the Goldilocks field it introduced, is basically reducing this waste. And the push toward smaller and smaller fields is making this more and more possible: we see it with Plonky3 and its focus on 32-bit fields, and also in concurrent works like Ulvetanna with towers of binary fields. All of these improvements over the years are what make a Type 1 zkEVM possible today.
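As a concrete illustration of the field-size point (a minimal sketch, not code from Plonky2 itself): the Goldilocks prime used by Plonky2 fits every field element in a single 64-bit machine word, whereas pairing-based SNARKs typically work over fields of roughly 256 bits.

```python
# Minimal illustration of the "small field" point above; not Plonky2's actual code.
GOLDILOCKS_P = 2**64 - 2**32 + 1  # 0xffffffff00000001, the Goldilocks prime

def field_mul(a: int, b: int) -> int:
    """Multiply two field elements. A real prover uses fast word-level reduction
    exploiting the special shape of this prime; plain % is enough for a sketch."""
    return (a * b) % GOLDILOCKS_P

# Every element fits in one u64, so arithmetic maps to a handful of CPU instructions,
# whereas a ~256-bit pairing-friendly field spends a full large element even when the
# witness value is a single byte or bit.
assert GOLDILOCKS_P < 2**64
print(hex(field_mul(GOLDILOCKS_P - 1, GOLDILOCKS_P - 1)))  # (p-1)^2 mod p == 0x1
```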
Thank you, Robin. And for this next question: what were some of the main technical challenges from an engineering perspective, or, put another way, what part of the EVM do you hate the most?

Well, the EVM is not meant to be easy to prove in ZK, so that's definitely something. It really depends. As a cryptographer, what's really hard when you want to make statements about something in ZK is that you want to be sure you're not forgetting anything in terms of constraints in your circuit, otherwise people may be able to forge fake proofs. That makes it quite hard, because on the one hand you want to respect all the specs of the Ethereum Yellow Paper, and on the other hand you need to make something as efficient as possible. So typically, all the requirements around opcode specifications, the fact that you need to operate over large words, memory copies, stuff like that is not trivial to do efficiently in ZK. Those were probably not the most fun parts to work on. I wasn't involved in the initial design of the Keccak table, for example, for proving the Keccak permutations, but that must have been not super fun for the Zero team to work on and to make as efficient as it is now.

Yeah, maybe we could hear from Daniel about what that experience was like, because I know that Daniel drafted the original MPT and RLP logic in the Type 1, so I'm curious, Daniel, if your PTSD isn't too bad from that period, what that was like.
Yeah, that was a pretty rough few weeks when I was trying to get the initial MPT code working. I think the whole design of Ethereum's MPTs has a lot of legacy aspects to it, the way it uses an arity-16 trie, for example, which isn't really ideal, and also the use of Keccak as the hash function. But we knew we had to deal with those things, so we spent a lot of time on Keccak and we came up with a pretty good way to arithmetize it and to prove it fairly efficiently. One of our teammates, Angus, came up with some more insights and some clever tricks to arithmetize Keccak in a more ZK-friendly way, so I think right now we can prove a few hundred Keccak hashes per second, and when we switch to Plonky3 that should increase to a thousand or more.
Awesome, awesome. So a big thing, I think sort of the headline for this announcement, was the benchmarks. I personally was very optimistic about the Type 1, and even I didn't think that we could get to this level of performance this soon. I think what people might not realize is that it was a somewhat long road to get here, so maybe Paul, and then also John, could talk about where we're at in terms of performance and efficiency, what it was like to get here, and where we expect to be in the short term.

Yeah, sure, of course. John, I'll probably call on you to talk about some specifics, but at a high level I'll describe what the benchmarks we published today were and what sort of testing we've been doing. The main benchmarks that we showed today were for three main blocks, and it's worth noting that these are real benchmarks. These aren't benchmarks that were artificially constructed; these weren't benchmarks of just transfers in some particularly good case; these are benchmarks from real Ethereum blocks. The three that we have here today as our sample are interesting mostly because they were previously reported on by RISC Zero, so we have some other information to compare against. The main one there is block 17,354,424. That block's an interesting one: it had about 16 and a half million gas, so that's over the normal gas target, it had 182 transactions, and there's a variety of items in it. One of the things we were looking at when designing these benchmarks is what kind of workload we're benchmarking with, and what we really wanted to do is come up with cases that reflect real, average Ethereum usage. If you were to inspect this block, you'll see there are Uniswap trades, there are withdrawals on mainnet, there's random stuff, there are some small contract deploys, and so on. So this is a real Ethereum block. Really, what we did is go through the process of, first, just getting those proofs working, and then sticking John on the problem and saying, okay, let's come up with a way to decide what sort of machines run these well. John, if you could pop on, maybe you could walk through the types of machines we've tried and where we ended up.
Yeah, sure. I think the beginning of the process was just, okay, can we run it at all, and starting to understand the boundaries and knobs that we can tweak in terms of getting the prover to prove a real block. We started off with a hypothesis that we would need a lot of memory, a ton of memory, and that was going to be complicated. So the first thing we started working on was optimizing and getting it to complete properly on a machine with adequate memory. Then, over time, we started working within the GCP environment to figure out, from a systems perspective, how to reduce the memory so that we can run more and more processes in parallel. What we learned over the course of this is that by overdriving on parallelization we can really overdrive the prover and reduce cost per unit of gas. We were able to drive up our efficiency by still using machines with a decent amount of memory but overcommitting and running more and more provers in parallel, in order to reduce the overall cost you're paying per transaction or per proof. That's been the essence of the exercise, and it still feels like there's wiggle room there just from the systems perspective, let alone all the crazy improvements that Daniel was talking about.
Yeah, I'll just say this, Brendan: when we started trying in earnest to benchmark, we were seeing proving costs of around $300 per billion gas, and that was on pretty large machines with large amounts of memory, I think 700 gigs of memory or so is what we started on. Using more or less the same code base, we were able to drive that down to about $21 to $30 per billion gas. If you really think about that, that's sub-dollar proving for a normal-gas Ethereum block. I think that much improvement on the operational side alone is really impressive, but it's also a testament to how Plonky2 works and how the zkEVM works here, where it's really designed to be run on commodity machines. If we compare this to, say, RISC Zero, which uses GPUs as part of its proving, they're just much more constrained in the type of instances they can get; there's a lot less of a competitive market for those, and you can't really consider them consumer hardware from the perspective of running at scale. Whereas with these, I think on our latest runs we're running on basically bog-standard T2D instances with a couple hundred gigs of RAM. I don't know if anybody's checked recently, but the world is covered with those now; Google and Amazon have plastered the earth with these machines. They're abundant, they're available, they're really easy to scale across, and since the prover here is horizontally scalable, we can both hit these costs and increase throughput just by scaling.

Yeah, the other quick comment is that what we've seen is, yes, we can run on commodity hardware and basically scale that way, and on top of that, as we've experimented with bare metal or other higher-end servers, if a server costs twice as much to run, we'll often see just as much of a throughput improvement. So I think that's really positive in terms of where we can go even with faster hardware, but we can get a huge amount of scale just with commodity servers.
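To connect the per-gas figures above with the per-transaction numbers quoted earlier, here is a small back-of-the-envelope script; the 21,000-gas transfer cost is a standard Ethereum reference point rather than a number from the discussion, and the block figures are the ones mentioned above.

```python
# Back-of-the-envelope conversion of the quoted $/billion-gas range; illustrative only.
cost_per_gas_low, cost_per_gas_high = 21 / 1e9, 30 / 1e9  # "$21 to $30 per billion gas"

simple_transfer_gas = 21_000            # standard gas cost of a plain ETH transfer
block_gas, block_txs = 16_500_000, 182  # the benchmarked block 17,354,424 (from above)

for label, gas in [("simple transfer", simple_transfer_gas),
                   ("whole block", block_gas),
                   ("average tx in that block", block_gas / block_txs)]:
    print(f"{label}: ${gas * cost_per_gas_low:.4f} to ${gas * cost_per_gas_high:.4f}")
# simple transfer: $0.0004 to $0.0006
# whole block: $0.3465 to $0.4950
# average tx in that block: $0.0019 to $0.0027  (roughly the 0.2-0.3 cents quoted earlier)
```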
Yeah, I think that's a great point. One thing that might be helpful for the listeners: we really express all these benchmarks in terms of cost, because cost is fundamentally, I would say, our most important metric; it determines how much users have to pay. But I just want to be clear to the listeners that latency is something we actually have a lot of control over, and we can affect latency by parallelizing. We generate proofs at the per-transaction level for now, and then we're going to move to implementing a RISC Zero-style continuations mechanism in the future. I guess, Paul and John, maybe it's worth getting into the knobs that we can toggle for latency and what that might look like, because I think people might have the perception that it's going to take multiple hours to generate these proofs, and that's just not the case.
Yeah, it's a really good point. Like you said, we're currently parallelizing based on the transaction. What that means is that if we take a block of 186 transactions, you have a couple of different ways to approach it. One is that we could prove each transaction in serial in order to get through the whole block. Another is that you could spin up 186 machines, and each of those 186 machines individually generates a transaction proof, and then we aggregate those proofs together: you pairwise aggregate the proofs, you create a little tree, and by doing this recursive aggregation you get a final proof. What that ends up looking like is that the worst-case time for proving the whole block is the time of the slowest transaction to prove, plus the time it takes to aggregate all these proofs together. The aggregation is extremely fast, so you end up being bottlenecked primarily by the slowest single thing you can prove at a time. Right now that's one transaction, so if you had one transaction that uses 30 million gas, and these are extremely rare because they're extremely expensive, that would be the bottleneck. But what you actually see in Ethereum workloads is that you tend to have a lot of transactions that aren't really doing very much: a lot of transfers, a lot of smallish trades, and those sorts of things. So we're able to parallelize a lot on those while keeping the latency down to basically the speed of proving the slowest transaction in a block. As we move forward with a continuation style, what that fundamentally means is that we're able to take a block of transactions and split it up at arbitrary boundaries, so it doesn't necessarily need to be at the transaction level, which will give us a lot more control over where we split and how much we want to spread the work across a cluster of machines in order to reach the latency levels that we want. The interesting thing there is that you're still using approximately the same total runtime, just over more machines, so as you're able to scale up and down quickly, you can reach the same low latency.
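A small sketch of the scheduling math Paul describes; the per-transaction proving times and the aggregation time below are made-up illustrative numbers, not measurements from the actual prover. With one prover per transaction and a binary tree of pairwise aggregations, block latency is roughly the slowest transaction proof plus a logarithmic number of aggregation rounds.

```python
import math

# Illustrative numbers only; not measurements from the actual prover.
tx_proof_seconds = [12.0, 45.0, 30.0, 8.0, 27.0, 60.0]  # per-transaction proof times
agg_seconds = 5.0  # time for one pairwise aggregation step

def serial_latency(times):
    """Prove every transaction one after another on a single machine."""
    return sum(times)

def parallel_latency(times, agg_step):
    """One prover per transaction, then a binary tree of pairwise aggregations:
    latency ~= slowest leaf proof + log2(n) aggregation rounds."""
    rounds = math.ceil(math.log2(len(times))) if len(times) > 1 else 0
    return max(times) + rounds * agg_step

print(serial_latency(tx_proof_seconds))                 # 182.0 s proving in serial
print(parallel_latency(tx_proof_seconds, agg_seconds))  # 75.0 s: max(60) + 3 rounds * 5
```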