Launching Choreo on Quint Q&A

0:00:00 - 0:01:52

Thank you. Thank you. Hello, can you hear me?

0:01:52 - 0:01:59

I think it works because I see the animation moving.

0:01:59 - 0:02:05

Okay, I kind of heard you, Ocean.

0:02:05 - 0:03:08

I couldn't really hear what you said, but I think it works. Thank you. Good morning or afternoon everyone.

0:03:08 - 0:05:30

We're just waiting for our speakers here so get comfy and we'll let you know when we're Thank you. . Thank you. All right, I think we have everyone as speakers now, so I'll pass it off to Gabrielle and

0:05:30 - 0:05:31

Ethan.

0:05:31 - 0:05:33

Hi, everyone.

0:05:33 - 0:05:38

Thank you for joining our spaces.

0:05:38 - 0:05:41

We're excited to talk about Choreo.

0:05:41 - 0:05:44

Ethan, do you want to test your mic?

0:05:44 - 0:05:45

Hello, hello.

0:05:46 - 0:05:46

Hello.

0:05:47 - 0:05:48

How's it going?

0:05:48 - 0:05:49

I'm good.

0:05:49 - 0:05:49

How are you doing?

0:05:50 - 0:05:51

I'm good.

0:05:51 - 0:05:51

I'm very good.

0:05:51 - 0:05:52

Very excited.

0:05:53 - 0:05:54

First day of spring.

0:05:54 - 0:05:57

I guess it's fall for you guys, but I'm very happy.

0:05:57 - 0:05:58

Yeah, nice.

0:05:58 - 0:05:59

So, now, finally.

0:06:00 - 0:06:02

So, yeah, jumping into Choreo.

0:06:02 - 0:06:06

So, just a very brief overview.

0:06:06 - 0:06:10

Corel is this new framework that we launched for Quint.

0:06:10 - 0:06:13

One thing that people would say about Quint sometimes

0:06:13 - 0:06:15

is that it doesn't feel batteries included

0:06:15 - 0:06:17

because Quint is very general.

0:06:17 - 0:06:20

So you can specify basically anything you want in Quint.

0:06:20 - 0:06:23

I specify like some games, some...

0:06:23 - 0:06:27

I've even thought about like managing my cat's litter box in Quint.

0:06:27 - 0:06:31

So it's very general. You can think about any state machines in Quint.

0:06:32 - 0:06:35

But mostly what we are doing is just specifying distributed systems.

0:06:35 - 0:06:38

Sorry, what is the state machine for your cat's litter box?

0:06:40 - 0:06:47

So basically, if you have a cat, you kind of have to clean the litter box, at least

0:06:47 - 0:06:52

with some frequency in order to see if there is like some problems, you know, like if there's

0:06:52 - 0:06:56

blood in there, then you must get to the vet in like 24 hours.

0:06:56 - 0:06:59

Otherwise, it can be something very serious.

0:06:59 - 0:07:04

So in order to do that, then you have to have at least some frequency for checking the litter

0:07:04 - 0:07:05

box and cleaning it.

0:07:06 - 0:07:10

So I was thinking if we can have this property, you know, like in Quint and having the state machine of like,

0:07:10 - 0:07:17

okay, if you clean it at least like once a day, then for sure you'll get to the vet in enough time if something happens,

0:07:17 - 0:07:20

depending on the time of detection of diseases.

0:07:20 - 0:07:24

You're not worried about like race conditions with the cat or some kind of deadlock.

0:07:24 - 0:07:28

There's no distributed protocol. No, no, that's all fine. Okay. Yeah, that's all fine because I

0:07:28 - 0:07:34

only have one cat, you know, so no problems. I have two litter boxes. And Quint is still useful

0:07:34 - 0:07:42

even for sequential protocols. Yes, I think so. I've used it for some sequential thinking, I guess.

0:07:43 - 0:07:47

It's harder or it's rarer for it to be used

0:07:47 - 0:07:50

for sequential protocols because it's usually less complex.

0:07:50 - 0:07:52

So you only get used if it is actually complex

0:07:52 - 0:07:54

and you don't, you cannot think of it in your head.

0:07:54 - 0:07:56

I guess the litter box is something

0:07:56 - 0:07:57

that I can think of in my head.

0:07:57 - 0:08:00

So it's not a super good example, but it's a fun one.

0:08:00 - 0:08:03

But it still gives you a delightful form of language

0:08:03 - 0:08:11

to express your program and invariants and actually check them so that, you know, even if it's sequential, if it's hard to keep all the pieces in your head, Quinn's still useful.

0:08:13 - 0:08:15

Yes, yes. I have a good example for that as well.

0:08:16 - 0:08:26

So I was teaching formal methods at the university and we wanted to kind of like have seminars and people would choose the subject

0:08:26 - 0:08:28

they wanted to talk about.

0:08:28 - 0:08:32

And I wanted to kind of like raft the teams

0:08:32 - 0:08:34

between the students, you know,

0:08:34 - 0:08:35

refl I guess is the word, sorry.

0:08:35 - 0:08:39

So I wanted to do some sort of like random assignment,

0:08:39 - 0:08:42

but I wanted to follow their preferences.

0:08:42 - 0:08:46

So I modeled something saying like, there couldn't be, so if I chose,

0:08:46 - 0:08:49

so the seminars were on formal methods as well.

0:08:49 - 0:08:52

So if I chose to speak about cock and you,

0:08:52 - 0:08:55

and my second preferred one is Quint,

0:08:55 - 0:08:57

and your preferred one is Quint,

0:08:57 - 0:09:00

but your second favorite is Lean, right?

0:09:00 - 0:09:03

If I do an assignment where I have Quint

0:09:03 - 0:09:05

and you have Coq,

0:09:05 - 0:09:09

that's not good because I would prefer having Coq and you'd prefer having Quint.

0:09:09 - 0:09:13

If there was such a scenario, then that solution was not good.

0:09:13 - 0:09:17

I wrote this in Quint and had Appalachian,

0:09:17 - 0:09:20

the mother checker find me a solution where I could have

0:09:20 - 0:09:23

an assignment to each student that had the preferred team.

0:09:23 - 0:09:24

It's not a complex thing,

0:09:24 - 0:09:27

but it would take me a while to figure this out by hand.

0:09:27 - 0:09:32

So you were actually looking for a result that you were like, like an implementation

0:09:32 - 0:09:36

outcome and Quint just like, you know, found it quickly for you.

0:09:36 - 0:09:41

Do you think it would have been like, that's something that you could imagine someone else

0:09:40 - 0:09:46

imagine someone else doing a similar thing and writing it in Python. Do you think, you know,

0:09:41 - 0:09:44

doing a similar thing and writing it in Python.

0:09:47 - 0:09:52

Quint has a future as a scripting language like this for quickly figuring out real world things?

0:09:53 - 0:09:57

Yeah. Well, for me, it was because I didn't have to. So one thing, interesting thing about the

0:09:57 - 0:10:02

formal specifications is that you don't have to tell it exactly how to do something. For example,

0:10:02 - 0:10:05

you can define a sorted list as a list where

0:10:05 - 0:10:10

every consecutive element is greater than the one before. That's a sorted list. You don't have to

0:10:10 - 0:10:16

use quicksort, you know? So that was kind of what I did. I specified what was a correct solution,

0:10:16 - 0:10:22

and Quint gave me the results without me having to write a matching algorithm or whatever I would

0:10:22 - 0:10:25

ask for that. Very cool.

0:10:25 - 0:10:27

Okay, well, this has all been a tangent

0:10:27 - 0:10:28

from what we're trying to talk about today,

0:10:28 - 0:10:31

which is distributed protocols.

0:10:31 - 0:10:33

And Corey, why don't you continue

0:10:33 - 0:10:35

a little introduction that you started

0:10:35 - 0:10:37

before I started questioning you about cat litter.

0:10:39 - 0:10:42

Okay, yeah, so when people were using Quint

0:10:42 - 0:10:43

to write distributed protocols,

0:10:43 - 0:10:48

which I guess was pretty much most of our usage,

0:10:48 - 0:10:51

they would miss things.

0:10:51 - 0:10:52

Like, how do I do message passing?

0:10:53 - 0:10:55

Oh, you have to do it like this and that.

0:10:56 - 0:10:58

So we have to explain to these people how to do it every time,

0:10:58 - 0:11:00

or they would have to figure it out every time.

0:11:01 - 0:11:04

So if every single Quint spec that we are writing,

0:11:04 - 0:11:05

or most Quint specs that we are writing or most Quint specs that we

0:11:05 - 0:11:10

are writing need message passing, why it's not there? So people ask why is it not berries included?

0:11:12 - 0:11:18

And we don't want to change Quint. We want Quint to still... Not for this, right? We still want

0:11:18 - 0:11:32

Quint to be a very general tool that people can use for very different things because it's a general language. We don't want to make it like a DSL, right? But we could make a library or a framework. So

0:11:32 - 0:11:37

that's what we started doing. And there were some challenges because Quint is not a programming

0:11:37 - 0:11:46

language. It's not meant to be super extensible in this sense because also we don't want to have huge specs because that's not the goal.

0:11:46 - 0:11:49

It's like having very abstract scoped specs.

0:11:50 - 0:11:53

But yeah, so we started in this task

0:11:53 - 0:11:57

and we have Yassin who is here with us today as well.

0:11:57 - 0:11:59

He joined Informal as an intern

0:11:59 - 0:12:02

and he took on this task very seriously

0:12:02 - 0:12:04

and he ended up writing his master thesis about this.

0:12:04 - 0:12:05

So it was very interesting project. So that is Corio and how it was born. on this task very seriously and he ended up writing his master thesis about this so

0:12:05 - 0:12:09

uh it was very interesting project so that that is choreo and how it was born

0:12:12 - 0:12:12

amazing

0:12:15 - 0:12:23

so it needs so it ended up that choreo uh i guess it has two main advantages first first it's very

0:12:23 - 0:12:26

like easy to onboard and start using,

0:12:26 - 0:12:29

or start writing specs for distributed systems now.

0:12:29 - 0:12:31

I guess Quint already makes it much easier

0:12:31 - 0:12:34

than with some other tooling that is available

0:12:34 - 0:12:36

because we have a much more accessible language,

0:12:36 - 0:12:40

but Choreo makes it even easier if that's possible

0:12:40 - 0:12:46

because it adds this construct and this structure for you to start writing.

0:12:46 - 0:12:48

So you actually can download our template

0:12:48 - 0:12:50

and start from there.

0:12:50 - 0:12:53

You're not obligated to use the template,

0:12:53 - 0:12:55

but the template can help you already have some structure.

0:12:55 - 0:12:57

So instead of starting with a blank file,

0:12:57 - 0:13:00

you go there and there's like to do,

0:13:00 - 0:13:02

put your types here and then put the types there.

0:13:04 - 0:13:07

And then it takes care of, and then it helps to get started.

0:13:07 - 0:13:10

And the second advantage is that it will,

0:13:10 - 0:13:14

you will only, you not only write a distributed system spec,

0:13:14 - 0:13:16

but you're actually going to write a good one,

0:13:16 - 0:13:19

probably better than people that get started

0:13:19 - 0:13:20

with other languages,

0:13:20 - 0:13:23

because this already embeds some techniques,

0:13:23 - 0:13:25

a formal specification that people probably wouldn't

0:13:25 - 0:13:31

get learned so quickly uh without studying a little bit at least okay maybe maybe let's back

0:13:31 - 0:13:36

up a bit let's let's start with the name why is it called uh why is it called choreo oh that's

0:13:37 - 0:13:48

names are super hard right so we had some iterations um and i i liked course. So first of all, I was a dancer. I like to dance. So I

0:13:48 - 0:13:54

have a good excuse. But we were thinking about this, like orchestrating distributed systems

0:13:54 - 0:14:00

and how hard it is to think about many processes and put them all in one place and the non-determinism.

0:14:00 - 0:14:05

And then I was mostly talking to ChatGPT and Yassine was also talking to ChatGPT

0:14:05 - 0:14:07

and Yosef, we are using like,

0:14:07 - 0:14:09

can you please give us ideas for the names?

0:14:09 - 0:14:12

And one of the many names that we went to was Choreo

0:14:12 - 0:14:14

and I immediately liked it.

0:14:14 - 0:14:16

And I liked the hook,

0:14:16 - 0:14:17

Choreographing Distributed Systems

0:14:17 - 0:14:20

because that's what we are doing, right?

0:14:20 - 0:14:22

So when you're choreographing,

0:14:22 - 0:14:23

you have like this group of people,

0:14:24 - 0:14:25

everyone goes in a different direction. They have to go there and form a circle and the other one comes here and forms a line. doing, right? So when you're choreographing, you have like this group of people, everyone

0:14:25 - 0:14:28

goes in a different direction, they have to go there and form a circle and the other

0:14:28 - 0:14:34

one comes here and forms a line and you have to take care of all of that. And in the end,

0:14:34 - 0:14:38

it's something beautiful, but what every single dancer is doing in a choreograph, like you

0:14:38 - 0:14:42

have to worry about so many things. So I think it makes a lot of sense.

0:14:42 - 0:14:52

Right. So Quint is already, you know, even before Coreo, Quint was heavily used for specifying distributed systems, arguably one of the best tools on the planet for doing so.

0:14:54 - 0:15:05

So, you know, one might come to this and say, well, isn't Quint already, wasn't Quint already designed for specifying distributed systems? Why have you gone and had to, you know, create a

0:15:05 - 0:15:10

framework that you now say is for specifying distributed systems when you already had,

0:15:10 - 0:15:14

you know, a specification language for doing it? What, maybe you can talk a little bit about what

0:15:14 - 0:15:19

makes it, even when you're using Quint, what makes it difficult or challenging

0:15:19 - 0:15:25

and to specify distributed systems and, you know, if Quint is supposed to help you reason about them,

0:15:26 - 0:15:27

why wasn't Quint enough?

0:15:27 - 0:15:30

What sort of motivated the introduction of Choreo?

0:15:31 - 0:15:32

Yeah, good question.

0:15:33 - 0:15:37

So Quint is based on CLA+, right?

0:15:38 - 0:15:40

And the creator of CLA+, is Leslie Lampard,

0:15:41 - 0:15:43

which is a very influential person

0:15:43 - 0:15:44

in this distributed systems world.

0:15:44 - 0:15:45

And he created CLA person in this distributed systems world and he

0:15:45 - 0:15:51

created tla plus because of distributed systems he was writing papers on distributed systems and he

0:15:51 - 0:15:57

lacked a precise language to define and reason about these systems for those who don't know this

0:15:57 - 0:16:01

is the guy that coined the term byzantine uh byzantine general problem byzantine fault

0:16:01 - 0:16:10

tolerance all of us in blockchains are very much directly downstream of this guy. He invented the Paxos algorithm and he invented latex, right? And

0:16:10 - 0:16:15

TLA because latex wasn't torture enough. He needed to torture people in other ways when

0:16:15 - 0:16:20

they think about protocols, but it was supposed to be helpful. This TLA plus thing, right?

0:16:20 - 0:16:21

Sorry, go on.

0:16:21 - 0:16:25

Yes. Yes. So if you Google Byzantine fault tolerance, go to the paper,

0:16:25 - 0:16:31

you'll see that Leslie Lamport is the author. Yeah, so he created TLA plus for this,

0:16:32 - 0:16:39

to be precise when he talks about distributed systems to other people. And we created Quint

0:16:39 - 0:16:44

based on TLA plus, but TLA plus is very mathematical and it's meant to write papers.

0:16:44 - 0:16:46

And we don't want to write papers.

0:16:46 - 0:16:47

We want to write executable specs.

0:16:48 - 0:16:50

So Queens is a much better language for executable specifications.

0:16:51 - 0:16:51

Right.

0:16:51 - 0:16:54

Because TLA plus is not executable.

0:16:56 - 0:16:59

So it's model checkable and it's simulatable,

0:17:00 - 0:17:05

but it was not created to be anywhere near a computer.

0:17:07 - 0:17:07

You know, like,

0:17:08 - 0:17:09

Lambert didn't think

0:17:09 - 0:17:11

you could even write a model checker for TLC.

0:17:12 - 0:17:12

For TLA, sorry.

0:17:13 - 0:17:15

Later, the TLC was created

0:17:15 - 0:17:17

but, like, the amount of work

0:17:17 - 0:17:19

you have to do to even

0:17:19 - 0:17:20

parse the language

0:17:20 - 0:17:22

because it was not meant for that.

0:17:22 - 0:17:23

It is meant for that.

0:17:23 - 0:17:25

So TLA plus was very

0:17:25 - 0:17:30

much designed as, you know, a tool in the logicians toolbox to be something for thinking mathematically,

0:17:30 - 0:17:36

but it was very mathematician oriented. It wasn't really designed to be part of, let's say, the

0:17:36 - 0:17:41

software development lifestyle to be really accessible to software developers to be part of,

0:17:41 - 0:17:45

you know, to have tooling that, you know, that allows it to be quickly executable

0:17:45 - 0:17:47

and integrated into your flows and things like that.

0:17:47 - 0:17:51

It was really for like, as if you were going to sit down with pen and paper, but, you know,

0:17:51 - 0:17:53

do it in latex kind of thing.

0:17:54 - 0:17:55

Yes, yes.

0:17:55 - 0:17:57

And have a precise language.

0:17:57 - 0:18:01

Because in the papers, if you, Lampard said like, he was writing the papers in English,

0:18:00 - 0:18:04

writing the papers in English, right? So he writes like, oh, if this happens, then that should

0:18:01 - 0:18:01

right?

0:18:01 - 0:18:04

So he wrote like, oh, if this happens, then that should occur.

0:18:04 - 0:18:10

occur. And he noticed that even he had ambiguities in his papers. And he said, oh, if I,

0:18:11 - 0:18:16

who am very smart and a very good writer can be imprecise, imagine the others.

0:18:16 - 0:18:24

Yeah. And that's, so the main motivation is precision. And then the other stuff came later.

0:18:24 - 0:18:27

So today they do have initiatives to make TLA plus more accessible.

0:18:27 - 0:18:31

For example, with plus call, they do have a lot.

0:18:31 - 0:18:35

They try to give TLA plus for the engineers as well.

0:18:35 - 0:18:37

So today, definitely that's one of their goals.

0:18:37 - 0:18:38

They want more engineers to use TLA plus.

0:18:38 - 0:18:40

But it's an afterthought.

0:18:40 - 0:18:41

It wasn't designed for that.

0:18:42 - 0:18:45

And so it's lacking a kind of, you know, let's say,

0:18:51 - 0:18:54

aesthetic of modern development tools. Yeah, and it's super hard to do these tools as an afterthought. Right. Certainly. They have many difficulties. Right.

0:18:55 - 0:19:02

Okay. Yeah, please. Yeah. Zampart designed it for precise and precise way of talking about

0:19:02 - 0:19:05

distributed systems, and then Quint is based on that.

0:19:05 - 0:19:09

And so when we first wrote the readme for Quint,

0:19:09 - 0:19:12

we added like for distributed systems,

0:19:12 - 0:19:14

focus on distributed systems.

0:19:14 - 0:19:19

Cause I think the very like necessity of right,

0:19:19 - 0:19:22

sitting down writing as formal spec with the state machine

0:19:22 - 0:19:26

matches the distributed system setting a lot because the pain

0:19:26 - 0:19:31

point that we have in distributed systems where it's so hard to think about all the stuff that

0:19:31 - 0:19:38

can happen, the interleavings and everything, it's so hard. And that's why formal specs are a great

0:19:38 - 0:19:43

fit and Quint is a great fit. So you know how the relationship between Quint and distributed systems

0:19:43 - 0:19:49

is, it exists, but it's not like completely direct. It's because distributed systems are difficult

0:19:49 - 0:19:53

and Quint helps with difficult stuff. Does that make sense?

0:19:53 - 0:19:58

So it's a general tool for understanding that is really just a more delightful

0:20:00 - 0:20:10

wrapper around mathematics, essentially. That's really what Quint is under the hood, is just a way to express math in a more, you know, familiar and accessible language.

0:20:11 - 0:20:16

And of course, that can be used for modeling distributed systems protocols, because those

0:20:16 - 0:20:22

are just math. And those are the kinds of protocols that will benefit most from, you know,

0:20:22 - 0:20:28

sitting down and writing some math about them. But Quint is a generalized language.

0:20:30 - 0:20:37

It's math, and so it can be used for anything. It doesn't have built-in helpers to actually

0:20:37 - 0:20:43

model specific things about how... You still have to make a lot of decisions when you sit down to

0:20:43 - 0:20:49

use Quint on how you're going to model the distributed system. How do you model the environment? How do you model the

0:20:49 - 0:20:54

message passing? How do you model, you know, a Byzantine actor who's, you know, trying to mess

0:20:54 - 0:20:59

with your protocol? You have to make all these decisions on how you're going to model your

0:20:59 - 0:21:04

distributed system. And you have to do it each time you sit down to model a distributed system

0:21:04 - 0:21:05

in Quint, right?

0:21:05 - 0:21:09

And so that adds just, you know, it makes it harder, I guess, than it needs to be.

0:21:09 - 0:21:14

Whereas Choreo is making a lot of those decisions for you.

0:21:14 - 0:21:15

Is that right?

0:21:16 - 0:21:17

Yes, precisely.

0:21:17 - 0:21:18

Exactly that.

0:21:18 - 0:21:18

Yeah.

0:21:18 - 0:21:23

So for people that are not used to modeling distributed systems or doing it for the first time,

0:21:23 - 0:21:29

then they have to learn and think a lot if they're using just raw Quint. And for us who are used to it and they're doing

0:21:29 - 0:21:34

all the time, it's like boilerplate we have to write and then more things that we need to set

0:21:34 - 0:21:40

up. And Choreo solves those. Right. And so I imagine, you know, we've been, you and our team

0:21:40 - 0:21:45

and others now have been writing Quintpecs for distributed systems for a number

0:21:45 - 0:21:51

of years before choreo existed so i imagine there's a number of different ways out in the wild

0:21:51 - 0:21:57

that folks have gone about you know actually modeling these different components of distributed

0:21:57 - 0:22:05

systems and there's been some comparison between those some contrast some lessons learned can you speak to any of that

0:22:07 - 0:22:13

yes um i guess the most uh concrete example or instance of this that i can talk about

0:22:13 - 0:22:20

is about message passing so if you even go into the lampard's website and watch his lectures which

0:22:20 - 0:22:25

is how i got into this world right i that's how I learned this stuff. It's by Washington Lambert.

0:22:25 - 0:22:29

And he teaches you that when you are modeling message passing,

0:22:29 - 0:22:32

you should just have a set of elements.

0:22:32 - 0:22:34

So you have a state variable,

0:22:34 - 0:22:36

which works sort of like as a global variable

0:22:36 - 0:22:40

that changes over time in your state machine.

0:22:40 - 0:22:43

So you start with a perhaps empty set of messages.

0:22:43 - 0:22:45

And then as the processes start

0:22:45 - 0:22:51

to communicate, you just add new messages to the set. And then receiving a message means just

0:22:51 - 0:22:56

looking at the set and checking that the message is there. So if there is a message in the set of

0:22:56 - 0:23:01

all messages ever sent, then that means you can receive that message. You don't take it out of

0:23:01 - 0:23:08

the set. It's still there. And that's how we modeled message passing in most form of specs.

0:23:08 - 0:23:10

That's how Lampard taught us to do it.

0:23:10 - 0:23:15

And that's not the most intuitive way at all.

0:23:15 - 0:23:18

Maybe the most intuitive way is having like an array with a queue,

0:23:18 - 0:23:21

or maybe an internal buffer for each of the process

0:23:21 - 0:23:23

with the messages it has to receive.

0:23:23 - 0:23:30

And then it starts consuming them and removing them from this list in certain orders but the thing is that the having

0:23:30 - 0:23:36

a set is very general but having a set I mean does that prevent you from receiving the same message

0:23:36 - 0:23:41

twice or I guess the message persists in there so you could read it multiple times and that's

0:23:41 - 0:23:45

how you would model receiving the same message twice?

0:23:50 - 0:23:54

Yeah. So you, the set doesn't prevent that. So you can receive the same message multiple times, which is basically it's modeling this at least once delivery model, or maybe

0:23:54 - 0:24:01

since we are choosing from the set, which message you are going to receive in a non-deterministic

0:24:01 - 0:24:06

way, it can also mean that you never receive some message or receive some message that was sent later

0:24:06 - 0:24:08

before a message that was sent before.

0:24:08 - 0:24:09

So there's no ordering guarantees.

0:24:10 - 0:24:12

So this is like the most general modeling.

0:24:12 - 0:24:15

It takes care of all the problems

0:24:15 - 0:24:17

that can happen in real life in the network.

0:24:19 - 0:24:21

So it's very general and it's very efficient.

0:24:21 - 0:24:22

Right.

0:24:22 - 0:24:25

So is that technique built into Choreo then?

0:24:26 - 0:24:32

Yes, it is. Okay. So if you hadn't come across that, you know, these Lamport videos, or you

0:24:32 - 0:24:37

hadn't learned or thought about this somewhat unintuitive way of modeling messaging, you might

0:24:37 - 0:24:43

be trying to do something else that might complicate the specs in ways you didn't expect or,

0:24:40 - 0:24:42

complicate the specs in ways you

0:24:42 - 0:24:43

didn't expect or

0:24:43 - 0:24:45

wouldn't want.

0:24:47 - 0:24:48

Yeah, exactly.

0:24:48 - 0:24:50

I guess either you wouldn't know how to do it

0:24:50 - 0:24:52

at all and you would look it up

0:24:52 - 0:24:54

on the internet and maybe find out the proper

0:24:54 - 0:24:56

way or maybe you would try it yourself

0:24:56 - 0:24:58

and then you would probably end up with

0:24:58 - 0:25:00

something like the internal buffer arrays

0:25:00 - 0:25:01

that I mentioned.

0:25:02 - 0:25:04

And is this set what we call

0:25:04 - 0:25:05

the message soup

0:25:05 - 0:25:08

in our blog posts and other places?

0:25:08 - 0:25:09

Yes, yes.

0:25:09 - 0:25:09

Okay, great.

0:25:10 - 0:25:12

So even before we posted, we published Corio,

0:25:12 - 0:25:14

we're already exploring this stuff.

0:25:14 - 0:25:16

And we actually went and tried to validate this

0:25:16 - 0:25:20

because how we ended up writing this blog post, okay?

0:25:20 - 0:25:23

So we wanted Corio to be super simple to use.

0:25:24 - 0:25:26

And the simplest thing to use,

0:25:26 - 0:25:31

or if you're coming from like a pseudocode for a protocol,

0:25:31 - 0:25:34

is usually like you receive one message

0:25:34 - 0:25:35

and then you do something about it.

0:25:35 - 0:25:38

So like upon a proposal, you do something.

0:25:38 - 0:25:41

Upon a message of certain type, you react like this.

0:25:41 - 0:25:43

People usually write their pseudocode.

0:25:43 - 0:25:45

I mean, that's how you'd write the code, right? You'd write a handler for the message, you receive the message. You know, you run like this. People usually write their pseudocode. I mean, that's how you would write, that's how you'd write the code, right?

0:25:45 - 0:25:46

You'd write a handler for the message,

0:25:46 - 0:25:48

you receive the message, you know,

0:25:48 - 0:25:50

you run through your handler, you stick the message,

0:25:50 - 0:25:53

you update your state based on that, you're accumulating.

0:25:53 - 0:25:56

Usually you're trying to accumulate votes or signatures

0:25:56 - 0:25:58

or something until you hit some threshold, right?

0:25:58 - 0:26:00

And so you do it one message at a time, right?

0:26:00 - 0:26:02

That's what you're talking about?

0:26:02 - 0:26:03

Yes, yes, exactly.

0:26:03 - 0:26:04

Yeah, the code would be a better example, yeah.

0:26:04 - 0:26:06

So that's how you write code.

0:26:06 - 0:26:09

If you're doing something simpler than consensus,

0:26:09 - 0:26:12

if you're consuming from a PubSub channel, you get a message,

0:26:12 - 0:26:13

you do something about it.

0:26:13 - 0:26:16

If you have a queue of messages, you just get a job

0:26:16 - 0:26:17

and do something about it.

0:26:17 - 0:26:18

So it's natural that someone would think

0:26:18 - 0:26:20

to write their spec in the same way

0:26:20 - 0:26:22

as they write those handlers.

0:26:22 - 0:26:24

Yes.

0:26:24 - 0:26:26

And so we wanted to put this in Corio as well.

0:26:26 - 0:26:29

So we want to, like, oh, you don't think about, like, actions, transitions, state machines.

0:26:30 - 0:26:32

You just write these handlers for single messages, and then you're done.

0:26:33 - 0:26:35

And I was really pushing for this.

0:26:35 - 0:26:37

Like, I really wanted this to be super easy.

0:26:39 - 0:26:43

But this is not super compatible with the message soup technique.

0:26:43 - 0:26:49

It's still, like, I can still consume messages from the set like this, so it still works.

0:26:50 - 0:26:53

But it is not the most efficient way of writing it.

0:26:53 - 0:26:55

I'll explain why in a bit.

0:26:57 - 0:27:01

But we are trying very hard to have this the easiest way possible.

0:27:02 - 0:27:06

And we started doing benchmarks and running like estimating the state space

0:27:06 - 0:27:07

for different approaches,

0:27:08 - 0:27:09

MessageSoup without MessageSoup,

0:27:10 - 0:27:12

removing messages or not removing messages.

0:27:12 - 0:27:13

We tried a bunch of things.

0:27:13 - 0:27:16

That's also what's in the blog post.

0:27:17 - 0:27:21

And we ended up like showing with the data

0:27:21 - 0:27:23

how this MessageSoup technique is actually the best

0:27:23 - 0:27:26

and how you shouldn't receive,

0:27:26 - 0:27:30

you shouldn't think about handlers message by message.

0:27:30 - 0:27:36

So not thinking about receiving one single message and then you receive another one and then you receive another one.

0:27:37 - 0:27:39

So this is not the most effective way.

0:27:39 - 0:27:42

And Choreo, so you can write it like this with Choreo.

0:27:40 - 0:27:42

So you can write it like this with Choreo.

0:27:42 - 0:27:43

There is an option for that.

0:27:44 - 0:27:46

But the main option that Choreo offers you,

0:27:46 - 0:27:48

like the tutorials talk about,

0:27:49 - 0:27:52

it's not message by message.

0:27:52 - 0:27:55

It's more like you look at the message soup

0:27:55 - 0:27:56

and see what's in there.

0:27:57 - 0:27:59

So if you're thinking about Quorum, for example,

0:28:00 - 0:28:02

so you want to have, for example, in Tendermint,

0:28:02 - 0:28:05

you want to have three pre-votes before,

0:28:05 - 0:28:08

or three in a system with four nodes, right?

0:28:08 - 0:28:11

You want to have at least three pre-votes

0:28:11 - 0:28:13

before you proceed, right?

0:28:13 - 0:28:14

So you need to have quorum.

0:28:14 - 0:28:17

So what you can do if you're thinking about handlers

0:28:17 - 0:28:18

receiving a specific message,

0:28:18 - 0:28:20

and then another one, another one,

0:28:20 - 0:28:22

what you have is like you receive one pre-vote

0:28:22 - 0:28:24

and then you do something like increment

0:28:24 - 0:28:29

your internal counter or your internal registry of the votes you received. Then you

0:28:29 - 0:28:33

receive another vote and then you increment that again and then you receive another one and then

0:28:33 - 0:28:37

increment that again. Oh, now I have quorum and then you do something. So this is like a lot of

0:28:37 - 0:28:44

transitions in the state machine. While if you know that you have this message soup, right? So you

0:28:44 - 0:28:46

are aware that the message soup is there and exists.

0:28:46 - 0:28:49

You don't have to receive all of those messages.

0:28:49 - 0:28:52

You can just go and look at the message soup and say,

0:28:52 - 0:28:56

oh, there are three pre-votes here in the message soup.

0:28:56 - 0:28:59

So I can kind of like, you don't actually receive those.

0:28:59 - 0:29:00

You just know that it's there.

0:29:00 - 0:29:02

And since the messages are there,

0:29:02 - 0:29:04

then it means you can do your thing.

0:29:04 - 0:29:05

You have quality.

0:29:06 - 0:29:10

Interesting. So really, you know, the key point here is what we're saying is, look,

0:29:10 - 0:29:14

we want to be able to model the system effectively so that we can understand it.

0:29:14 - 0:29:19

And the way we implement it, we don't have a choice when we implement the system,

0:29:19 - 0:29:23

but to write a per message handler. We have to, you know, it's a device,

0:29:23 - 0:29:29

it's separate from the network. It has to receive the message, process it, you know, update the state, whatever, because it has

0:29:29 - 0:29:35

to accumulate, you know, something, some counter, you know, pay attention until it reaches the

0:29:35 - 0:29:43

threshold. But when we're modeling the system, what we actually care about is the state change

0:29:43 - 0:29:54

on the threshold. And if we were to, if we we model the system one message at a time, then we're treating each message as if it's its own state transition, right?

0:29:54 - 0:29:58

Which isn't actually true from the perspective of the protocol.

0:29:59 - 0:30:01

It's true from the perspective of the code.

0:30:01 - 0:30:09

We're going to have some state variable and we're going to add the effect of each message to that state variable, but nothing is going to happen even in the code at a protocol

0:30:09 - 0:30:16

level until we cross some threshold. So when we're modeling, it actually will be much more efficient

0:30:16 - 0:30:22

and it provides a better abstraction if we don't think about things one message at a time. And if

0:30:22 - 0:30:27

we don't, you know, have each message imply a state transition,

0:30:27 - 0:30:29

but rather we just allow the messages

0:30:29 - 0:30:34

to accumulate in the soup, and the only state transitions

0:30:34 - 0:30:36

we want to have in the model are the actual state

0:30:36 - 0:30:38

transitions in the protocol, which

0:30:38 - 0:30:41

are when we reach thresholds or meaningful events happen

0:30:41 - 0:30:43

that don't just happen one message at a time.

0:30:43 - 0:30:45

Is that the idea?

0:30:46 - 0:30:46

Yes, exactly.

0:30:48 - 0:30:48

So if you think of a state machine, right?

0:30:51 - 0:30:51

So in state machine, you have circles and arrows.

0:30:55 - 0:30:59

If you're going with a message soup and you start with a circle where you haven't reached threshold and you receive a message

0:30:59 - 0:31:02

and you are in a circle where you have the threshold or have quorum

0:31:02 - 0:31:06

and then you go to a circle where you do something based on quorum.

0:31:06 - 0:31:10

So it's very, you know, like there are three states in there in this, in this part.

0:31:10 - 0:31:14

While if you are receiving message by message, then you have a bunch of orderings in which

0:31:14 - 0:31:15

you can receive them.

0:31:15 - 0:31:17

So I'm thinking, I'm talking here about three messages.

0:31:17 - 0:31:22

You can receive first A, then B, then C, or you can receive first B and C, then A.

0:31:22 - 0:31:26

So all of these orderings, and when you're doing formal specifications

0:31:26 - 0:31:32

you are going to simulate or model check all of these possibilities so the more you have the the

0:31:32 - 0:31:37

like you're spending a lot of computing on this different orderings and for your protocol it

0:31:37 - 0:31:42

doesn't matter you receive the pre-votes right so it just explodes the state space and you get this

0:31:42 - 0:31:51

combinatorial explosion that makes it harder to actually do any reasoning or modeling or finding traces that matter, things like that.

0:31:52 - 0:31:56

Yes, yes. The finding traces that matter might be the highlight here.

0:31:57 - 0:32:05

That's what we talk about in the blog post as well. bunch of interesting states that we want to see that we try to find using simulation.

0:32:10 - 0:32:10

And we see how using this technique, we can find these traces much quicker.

0:32:17 - 0:32:21

And even in some cases, using the message-by-message approach, we don't find those scenarios. While using this message soup technique, we find them very quickly.

0:32:22 - 0:32:26

Yeah, that's super interesting and compelling. It sounds like, I mean, this thing

0:32:26 - 0:32:31

about maybe you want to just talk about traces for a minute and what's useful about traces and why,

0:32:31 - 0:32:36

you know, why should people care about traces and what do they do for them?

0:32:38 - 0:32:43

Yeah, I guess there's so much things to think about traces. Traces are my favorite thing, maybe.

0:32:44 - 0:32:48

Traces are an amazing way of understanding what's happening. I think even when we started Quint,

0:32:48 - 0:32:54

we underestimated a bit traces. Because we started doing the Quint simulator for

0:32:57 - 0:33:03

just because people asked for it, but we had a mix of intentions. But when we run the simulator,

0:33:03 - 0:33:06

and it's super fast, you can run the simulator for one single sample,

0:33:06 - 0:33:08

which it's like instant.

0:33:08 - 0:33:10

And then you get one trace of your application

0:33:10 - 0:33:13

and then you see what happened in this like abstract level.

0:33:13 - 0:33:15

Cause when you are working with Quint and Corio,

0:33:15 - 0:33:19

you have a very abstract level of understanding things.

0:33:19 - 0:33:22

Seeing a trace of Tendermint helps you understand it so well.

0:33:22 - 0:33:23

It's so good.

0:33:23 - 0:33:25

And then you can use the traces for

0:33:25 - 0:33:31

testing, for example, your application. So just to clarify, like when we write a spec,

0:33:32 - 0:33:39

it's an abstraction, it's kind of lifeless, it's just supposed to state what the protocol does and

0:33:39 - 0:33:49

how it behaves, but what we actually want isn't just to like reason in the abstract about the protocol we care about you know ultimately a real implementation and being able to debug it

0:33:49 - 0:33:57

and all of these things and so the traces are actual concrete runs through the the protocol

0:33:57 - 0:34:03

specified and quit right they're the actual set of states and state transitions that might occur

0:34:03 - 0:34:05

and of course for any given protocol spec,

0:34:11 - 0:34:17

there could be a zillion possible traces, right? Like the set of all possible traces is really the set of all possible states and state transitions that the protocol could execute through.

0:34:18 - 0:34:27

But the, you know, the Quint tooling has been somewhat specialized to provide traces and make it so that users can

0:34:27 - 0:34:33

actually reason about the protocol using the traces rather than just the abstract spec.

0:34:33 - 0:34:36

Because if you're stuck at the abstract spec, then you yourself would have to like go and think

0:34:36 - 0:34:40

through, okay, this happens and then this happens and then this happens. Well, what if this happened

0:34:40 - 0:34:46

and then would this happen, right? And you're like kind of walking through it yourself, whereas the traces kind of do that automatically.

0:34:46 - 0:34:48

So the simulator will actually, you know,

0:34:48 - 0:34:52

it'll randomly run through a bunch of possible states

0:34:52 - 0:34:54

and state transitions and give them to you as traces.

0:34:54 - 0:34:58

And then you can inspect them to actually understand

0:34:58 - 0:35:01

how the protocol logic flows in a real concrete run, right?

0:35:03 - 0:35:03

Yes.

0:35:03 - 0:35:08

And one of my favorite use cases for that

0:35:08 - 0:35:10

is this sort of like witnesses

0:35:10 - 0:35:12

where you can find interesting scenarios.

0:35:12 - 0:35:14

So if you're, so a queen spec per se

0:35:14 - 0:35:17

is very similar to like pseudocode in a way.

0:35:17 - 0:35:20

So if you're only looking at it, if you're not running it,

0:35:20 - 0:35:22

so it's very similar to pseudocode

0:35:22 - 0:35:24

where you have all the things that can happen

0:35:24 - 0:35:26

in a high level of abstraction.

0:35:27 - 0:35:34

And if you're like me, maybe you already looked at some pseudocode and we are talking about this like handler sort of perspective, right?

0:35:34 - 0:35:43

So like, oh, when you receive three proposals with value minus one, then, and you had a timeout and blah, blah, blah, then you do this.

0:35:40 - 0:35:42

and you had a timeout and blah, blah, blah,

0:35:42 - 0:35:44

then you do this.

0:35:44 - 0:35:45

So you look at the pseudocode

0:35:45 - 0:35:49

and you have this very specific reactions or handlers

0:35:49 - 0:35:52

and you're like, okay, but why do we need this?

0:35:52 - 0:35:53

When is this useful?

0:35:53 - 0:35:54

When is this going to happen?

0:35:55 - 0:35:55

Like there must be a way

0:35:55 - 0:35:57

because the people who wrote the paper are very smart

0:35:57 - 0:36:00

and there must be some scenario that needs this,

0:36:00 - 0:36:03

that requires these mechanics, this mechanism.

0:36:05 - 0:36:07

But I don't know when this is necessary.

0:36:07 - 0:36:08

How can I figure that out?

0:36:08 - 0:36:11

You can spend maybe hours on paper playing with scenarios

0:36:11 - 0:36:14

and trying to understand how you get to that.

0:36:14 - 0:36:17

But we, Quint you can just ask like, hey, Quint,

0:36:17 - 0:36:20

can you get me a scenario where this thing right here

0:36:20 - 0:36:23

is executed and then Quint will give you,

0:36:23 - 0:36:25

and then you can understand like, oh, okay, that's how. So you time out here and then quaint will give you and then you can understand like oh

0:36:25 - 0:36:31

okay that's how so you time out here and then two states later you receive this type of message oh

0:36:31 - 0:36:35

okay now right so there might be what you're saying is there might be some part of the spec

0:36:35 - 0:36:39

that you know that describes some part of the protocol that is maybe covering

0:36:40 - 0:36:44

like the unhappy path right where some complex sequence of things happens and you have to

0:36:44 - 0:36:45

account for it.

0:36:45 - 0:36:47

And it's not clear just by looking at the spec,

0:36:47 - 0:36:49

like what would lead you to that state

0:36:49 - 0:36:52

where like this part of the protocol kicks in, right?

0:36:52 - 0:36:53

But with Quint, you can say,

0:36:53 - 0:36:54

hey, give me a trace

0:36:54 - 0:36:55

where this part of the protocol kicks in.

0:36:56 - 0:36:58

And then Quint will just spit out the trace

0:36:58 - 0:36:59

and suddenly you have a, you know,

0:36:59 - 0:37:01

exact sequence of steps that would get you there.

0:37:01 - 0:37:02

You don't have to figure it out yourself

0:37:02 - 0:37:04

and you can just see, oh, it's obvious now.

0:37:04 - 0:37:05

If it weren't for that line in see, oh, it's obvious now.

0:37:05 - 0:37:06

If it weren't for that line in the protocol,

0:37:06 - 0:37:09

then it's possible that like this set of things could happen

0:37:09 - 0:37:11

and then the protocol would break because, you know,

0:37:11 - 0:37:14

some constraint wasn't there or something like that.

0:37:15 - 0:37:16

Yes.

0:37:16 - 0:37:17

And then you can use that for tests

0:37:17 - 0:37:19

on your production code.

0:37:19 - 0:37:21

So even if you're writing tests, you know,

0:37:21 - 0:37:27

for your implementation, I don't know i sometimes it's

0:37:27 - 0:37:31

hard you know to come up with scenarios so how can i set up my environment in the test to reproduce

0:37:31 - 0:37:36

these scenarios uh quint can also help with that you just ask quid for a trace that hits the scenario

0:37:36 - 0:37:50

quint will give it to you or if you don't want one trace but you want ten thousand also easy it can give you amazing okay so we'd started talking about um i sort of forget where

0:37:50 - 0:37:54

we were we got distracted by traces you were saying they're your favorite thing we all we got

0:37:54 - 0:37:59

all excited um i think we were trying to talk about yeah oh we talked about the soup right and

0:37:59 - 0:38:06

how yeah yeah and that's also something interesting that's related to traces. Cause when you're using the right techniques,

0:38:06 - 0:38:09

for example, there's like looking at the message soup,

0:38:09 - 0:38:11

then your traces are smaller.

0:38:11 - 0:38:12

Not only your state machine.

0:38:12 - 0:38:14

So if your state machine is smaller,

0:38:14 - 0:38:17

usually means that also the traces are smaller, right?

0:38:17 - 0:38:19

So when we were talking about like receiving a pre-vote,

0:38:19 - 0:38:21

then another pre-vote, then another pre-vote,

0:38:21 - 0:38:22

and then you actually do something interesting.

0:38:22 - 0:38:31

This means you have to scroll to a longer trace. longer trace well if you just oh here we had quorum and therefore we meant we did

0:38:31 - 0:38:36

that this is much easier to read on the trace right also makes that right so these abstractions

0:38:36 - 0:38:42

we're talking about with the message soup that are built into choreo they actually make working

0:38:42 - 0:38:45

with traces a lot easier and if traces are one of the most valuable things you get out of Quint

0:38:45 - 0:38:47

and that helps you understand the protocol,

0:38:47 - 0:38:50

having good traces is actually very important

0:38:50 - 0:38:54

because if your traces are thousands of steps long,

0:38:54 - 0:38:56

it's going to be way harder to make any sense of it

0:38:56 - 0:38:59

and the value of the trace is going to be sort of minimal.

0:38:59 - 0:39:09

It basically becomes as if you're scrolling through the logs of your implementation and you see, you know, the debug logs basically, you know, one message at a time.

0:39:10 - 0:39:12

You kind of can't make sense of things because it's too much, right?

0:39:12 - 0:39:17

And so you want your traces to actually be small and full of useful information.

0:39:17 - 0:39:20

And so this message soup technique actually allows that to happen.

0:39:20 - 0:39:28

allows that to happen is that right yes yes that's right and i guess we can continue on traces

0:39:21 - 0:39:21

Is that right?

0:39:22 - 0:39:23

Yes.

0:39:23 - 0:39:24

Yes, that's right.

0:39:28 - 0:39:37

because this will lead uh oh yes sorry i got distracted uh yes we let's try to talk for 10

0:39:37 - 0:39:42

more minutes and then open the q a to the audience and then uh if someone has questions just write

0:39:42 - 0:39:45

them down and we'll get them to them in 10 minutes.

0:39:46 - 0:39:51

I want just to continue a bit on traces to get to my favorite thing maybe about choreo.

0:39:51 - 0:39:51

I don't know.

0:39:51 - 0:39:57

I have lots of things that I like, but traces are really like, because why do I like traces?

0:39:57 - 0:40:02

Because when I try to understand tendermint, traces were the most helpful thing for me.

0:40:03 - 0:40:05

So it made all the difference.

0:40:06 - 0:40:11

And with Quint, one thing that we introduced that were, was not on TLA plus.

0:40:11 - 0:40:16

So Quint is very compatible with TLA plus, like everything on Quint transpires to TLA plus,

0:40:17 - 0:40:22

but we have one extra thing that's not present on the TLA plus and it's not part of your state

0:40:22 - 0:40:22

machine.

0:40:22 - 0:40:23

It's not part of your spec.

0:40:24 - 0:40:25

And we call them like tests. They are sort part of your spec. And we call them like

0:40:25 - 0:40:31

tests. They are sort of like quint tests, or we call them also run. So you can use the keyword

0:40:31 - 0:40:38

run to specify a specific trace in the quint code. So in your code, you also write this,

0:40:38 - 0:40:42

as you would write like unit tests for your implementation, you write this runs for quint,

0:40:42 - 0:40:46

if you want. They're optional. But those are a way to bring the traces

0:40:46 - 0:40:49

to be sort of like first-class citizens

0:40:49 - 0:40:50

in your files and your specs.

0:40:50 - 0:40:53

These are not something just like JSON files you run

0:40:53 - 0:40:54

or something you save somewhere.

0:40:54 - 0:40:57

This can be part of your specification as well.

0:40:57 - 0:41:02

And when I was talking to Josef about this some years ago,

0:41:02 - 0:41:06

Josef told me like, this is way more useful than I thought, because these runs,

0:41:07 - 0:41:09

they are ways to document some scenarios.

0:41:09 - 0:41:13

So we were just talking about this, like super rare scenarios that

0:41:13 - 0:41:15

activate some super specific mechanisms.

0:41:16 - 0:41:21

You can, you as a protocol designer can document that to your, to the engineers,

0:41:21 - 0:41:24

for example, that are implementing the protocol for you or for the team.

0:41:24 - 0:41:25

And, but even not as a protocol designer, even like someone like me trying to the engineers, for example, that are implementing the protocol for you or for the team.

0:41:30 - 0:41:37

But even not as a protocol designer, even like someone like me trying to understand a protocol that I didn't know, like Tendermint, writing these runs from traces was really insightful.

0:41:37 - 0:41:41

And with Choreo, the way you write these runs are amazing.

0:41:42 - 0:41:44

It checks more things than you'd normally check.

0:41:44 - 0:41:47

It drives you to write

0:41:47 - 0:41:55

something of really high quality. And this is what I'm calling the cue pattern. And this is a

0:41:55 - 0:42:00

bit dance-oriented as well. So as we had the choreo nomenclature, I came up with this cue.

0:42:00 - 0:42:05

So you listen to some cue and then you do some some step in your dance, you perform something, right?

0:42:05 - 0:42:07

So first you have to hear your cue.

0:42:07 - 0:42:09

You are not going to perform something out of the blue.

0:42:09 - 0:42:14

Just as in protocols, like your node doesn't spontaneously decide to pre-vote, right?

0:42:14 - 0:42:15

It reacts to something.

0:42:16 - 0:42:19

So the cue is the thing you react to and then you perform something.

0:42:20 - 0:42:25

And when you write both your specification and your traces,

0:42:26 - 0:42:27

your runs like this,

0:42:30 - 0:42:31

it makes so clear what the protocol is doing at a high level.

0:42:33 - 0:42:33

So this is something that came later in the choreo.

0:42:34 - 0:42:34

So we already had choreo.

0:42:37 - 0:42:38

And then I was trying to improve the UX.

0:42:39 - 0:42:39

And then I came up with this queue pattern.

0:42:39 - 0:42:39

It's optional.

0:42:42 - 0:42:43

You can use choreo without the queue pattern if you don't like it.

0:42:45 - 0:42:49

There is a separate documentation about the queue pattern. But using the queue pattern basically means that everything you define in

0:42:49 - 0:42:57

the protocol has to have a queue and a perform step, what you actually do. So you have to define

0:42:57 - 0:43:04

when is this going to happen, reacting to what, what is this reacting to, and then what do you do.

0:43:00 - 0:43:03

into what is this reacting to?

0:43:03 - 0:43:05

And then what do you do?

0:43:05 - 0:43:07

And then your main,

0:43:07 - 0:43:08

because you have what we call a main listeners,

0:43:08 - 0:43:11

kind of like a main function in your programming language.

0:43:12 - 0:43:16

And you have like, when this happens, then you do that.

0:43:16 - 0:43:18

And if you look at the Tandermint version,

0:43:18 - 0:43:19

it's just so pretty,

0:43:19 - 0:43:22

because you have like, oh, when there are enough pre-votes,

0:43:22 - 0:43:23

then you do pre-commit.

0:43:23 - 0:43:24

When there is this,

0:43:24 - 0:43:28

so you can understand the protocol reading like seven lines of code in a high level.

0:43:28 - 0:43:31

And of course, if you want to understand what's actually happening in detail,

0:43:31 - 0:43:33

then you go to definition and you see what's in there.

0:43:34 - 0:43:36

But just reading the names is so good.

0:43:36 - 0:43:39

And then writing the traces is very precise.

0:43:39 - 0:43:46

So at this point, you use this like reaction, this cue, to see that this message arrived

0:43:46 - 0:43:50

or this event arrived, and then you performed this action.

0:43:50 - 0:43:55

And then after that, there was another reaction that performed an action, and then after that

0:43:55 - 0:43:55

another reaction.

0:43:56 - 0:44:00

And so you can be very precise to what actually happened, and it's amazing to read and understand.

0:44:02 - 0:44:02

Amazing.

0:44:02 - 0:44:06

So it sounds like really Chore like really choreo is just the

0:44:06 - 0:44:11

accumulation and synthesis of good modeling techniques that we've learned experience

0:44:11 - 0:44:16

developed over the years from actually writing, you know, a number of distributed system specs,

0:44:16 - 0:44:21

working with them, implementing them, debugging the implementation, trying to understand the

0:44:21 - 0:44:25

protocol. And we've just wrapped all that into a nice package called Choreo

0:44:25 - 0:44:26

that makes it easy for others

0:44:26 - 0:44:29

to write distributed system specs

0:44:29 - 0:44:31

without having to go through all those learnings

0:44:31 - 0:44:34

from scratch and just inherit

0:44:34 - 0:44:35

the sort of experience we've accumulated.

0:44:36 - 0:44:38

Yes, yes, I really hope so.

0:44:38 - 0:44:41

We need to get some new people to use it

0:44:41 - 0:44:42

and tell us what they think.

0:44:43 - 0:44:46

But I'm super happy with all the specs that we rewrote

0:44:46 - 0:44:48

in the pattern. I think every single

0:44:48 - 0:44:49

one of them looks so much better.

0:44:50 - 0:44:52

And I'm excited about rewriting more specs in Corel.

0:44:52 - 0:44:54

I think I want some vacation time

0:44:54 - 0:44:54

just to do that.

0:44:55 - 0:44:57

Which specs did we

0:44:57 - 0:44:59

rewrite in Corel?

0:44:59 - 0:45:01

We have four

0:45:01 - 0:45:04

official examples. Two-phase commit,

0:45:04 - 0:45:06

Tendermint, the Appenglow,

0:45:06 - 0:45:09

which is the Solana consensus and Monad BFT.

0:45:09 - 0:45:10

Right.

0:45:10 - 0:45:12

And then there's also the Minimit spec.

0:45:12 - 0:45:15

So the Minimit spec was written externally

0:45:15 - 0:45:17

from some amazing people.

0:45:17 - 0:45:20

And I think Dennis wrote that with someone else

0:45:20 - 0:45:21

from CommaWare.

0:45:21 - 0:45:23

And we actually got,

0:45:23 - 0:45:25

we have sort of like work in progress version of that it's

0:45:25 - 0:45:31

not published yet but we have a version of that with choreo and it looks really great uh i actually

0:45:31 - 0:45:35

that i didn't tell anyone about this but i was playing with writing amalekite spec for with

0:45:35 - 0:45:42

choreo yesterday uh and yeah i i'm i don't know i really like it i hope a lot of people like it as

0:45:42 - 0:45:49

well especially the people who want to try queens but have had like trouble feeling like the lack of batteries included experience and

0:45:50 - 0:45:56

like got stuck maybe at the message passing i hope choreo can be useful for them right okay amazing

0:45:58 - 0:46:01

anything else you want to say or should we open it for questions

0:46:03 - 0:46:06

no we can open for questions i think i think we got all covered and

0:46:06 - 0:46:11

if i didn't cover anything there's everything's own documentation uh and you can also ask questions

0:46:11 - 0:46:16

we have the telegram channel uh and i'm always available there to answer questions i really like

0:46:16 - 0:46:22

interacting with you all there uh so you can join the telegram channel we have a slip channel as

0:46:22 - 0:46:25

well if you prefer that, or everywhere.

0:46:25 - 0:46:26

GitHub as well works.

0:46:26 - 0:46:29

We have GitHub discussions where you can post questions.

0:46:29 - 0:46:31

But we have time for questions now.

0:46:31 - 0:46:32

We have 15 minutes.

0:46:32 - 0:47:03

So if anyone has anything. Do we have anyone here that's written a distributed system spec,

0:47:03 - 0:47:05

either in Quint or in another language,

0:47:05 - 0:47:09

and has, you know, battle scars to share?

0:47:11 - 0:47:14

Or are we all just curious about Quint

0:47:14 - 0:47:16

and, you know, planning to in the future?

0:47:17 - 0:47:19

One question.

0:47:19 - 0:47:20

Oh, go ahead.

0:47:21 - 0:47:22

Thanks, Gabriela.

0:47:22 - 0:47:23

Thanks, Ethan.

0:47:23 - 0:47:28

So I have some experience with Quint. What would be the best set of examples to try out Choreo?

0:47:28 - 0:47:34

Of course, I can read through what you have already, but what would you recommend as beginning

0:47:34 - 0:47:37

with Choreo examples?

0:47:37 - 0:47:41

So you want to write one example that doesn't exist in Choreo?

0:47:41 - 0:47:50

Yes. Yeah. I would get some of the textbook examples. Like we can maybe do Raft,

0:47:51 - 0:47:57

Paxos, one of those. Because I did two-phase commit and I had so much fun. So I think those

0:47:57 - 0:48:02

text books... So those are... Most Paxos and Raft are more complicated than two-phase commit.

0:48:03 - 0:48:05

But I think those can be very interesting.

0:48:05 - 0:48:08

And you can already find TLA specs or Parpaxos.

0:48:08 - 0:48:12

We even have a Quint spec that have,

0:48:12 - 0:48:16

that use already like sort of the message to pattern.

0:48:16 - 0:48:18

So it's easy to transfer them into Corio.

0:48:19 - 0:48:21

Or you can start from papers as well.

0:48:23 - 0:48:27

For I guess consensus algorithms have been the ones that we tried the most.

0:48:27 - 0:48:33

So grab any consensus algorithm, grab the paper, try to write a core respect from that.

0:48:33 - 0:48:34

That's interesting.

0:48:34 - 0:48:35

Cool.

0:48:35 - 0:48:36

Thanks.

0:48:36 - 0:48:39

And if I may go with another one.

0:48:39 - 0:48:40

Yes.

0:48:40 - 0:48:42

Go ahead.

0:48:42 - 0:48:47

So what I was wondering is, you mentioned the message soup technique.

0:48:47 - 0:48:50

Is there a way to introduce some assumptions on the soup?

0:48:50 - 0:48:55

Like, for example, I don't want message reordering or something like that.

0:48:55 - 0:48:57

So I have some external assumptions that I would like to introduce there.

0:48:59 - 0:49:06

Yeah. At the beginning, we thought about having those sort of different setups.

0:49:06 - 0:49:11

We didn't find a strong need for them, so we actually ended up not adding them to Choreo.

0:49:11 - 0:49:13

But if the need arises, we can add them.

0:49:13 - 0:49:19

I guess right now you would have to change sort of like the Choreo structure to represent this differently.

0:49:20 - 0:49:27

Or maybe adding some timestamps to your messages also works,

0:49:27 - 0:49:30

but you basically would need a different version of Choreo.

0:49:30 - 0:49:34

So we already have some different constructs inside Choreo

0:49:34 - 0:49:36

that can help you with different setups.

0:49:36 - 0:49:39

But for the messages specifically, we haven't added this now,

0:49:39 - 0:49:44

but we could have Choreo with sequential or ordered messages.

0:49:49 - 0:49:49

So then you represent your messages with a list instead of a set and

0:49:51 - 0:49:52

everything else stays mostly the same.

0:49:53 - 0:49:54

Right.

0:49:54 - 0:49:56

Thanks.

0:50:05 - 0:50:05

Some other questions or even comments, thoughts,

0:50:38 - 0:50:45

that don't have to be able to speak soon.

0:50:46 - 0:50:47

Just give it a second.

0:50:49 - 0:50:51

This space is a bit unintuitive for us.

0:50:52 - 0:50:53

We need to tell people

0:50:53 - 0:50:55

how do you, what do you have to press

0:50:55 - 0:50:58

to want to speak.

0:51:01 - 0:51:02

I don't know

0:51:02 - 0:51:04

because I've already chosen to be

0:51:04 - 0:51:04

a speaker.

0:51:06 - 0:51:07

No, yeah.

0:51:07 - 0:51:09

If anyone is struggling with the interface,

0:51:09 - 0:51:12

there is a button in the upper,

0:51:13 - 0:51:17

or sorry, the left down part of the spaces square

0:51:17 - 0:51:20

where you can request to speak.

0:51:20 - 0:51:24

I think there's a mic in there, but I also don't recall because I'm already speaking.

0:51:20 - 0:51:22

I think there's a mic in there,

0:51:22 - 0:51:24

but I also don't recall because I'm already speaking.

0:51:28 - 0:51:34

I've accepted the request a couple of times and it's not coming through, but I'll keep inviting Daniel to speak.

0:51:35 - 0:51:36

We'll see.

0:51:37 - 0:51:38

We might need Elon to approve him.

0:51:39 - 0:51:39

Yeah.

0:51:41 - 0:51:43

You don't have Elon's number?

0:51:45 - 0:51:47

No, I don't.

0:51:48 - 0:51:49

Shame.

0:51:57 - 0:52:01

We do have in the call here, or in the spaces here,

0:52:01 - 0:52:04

Yasin and Yosef, who were our main part of this development.

0:52:04 - 0:52:05

I think I forgot. Oh, no, I did mention that Yasin and Josef, who were a main part of this development.

0:52:05 - 0:52:05

I think I forgot.

0:52:07 - 0:52:07

Oh, no, I did mention, right,

0:52:09 - 0:52:10

that Yasin wrote his master thesis about this,

0:52:11 - 0:52:11

about choreo.

0:52:14 - 0:52:15

So, and he was the one who developed most of it.

0:52:17 - 0:52:17

And so if you want to check out his thesis,

0:52:19 - 0:52:22

he makes very, like, stronger arguments. So I'm explaining things kind of informally here,

0:52:22 - 0:52:26

but his thesis has, like, the proofs and stuff for how this,

0:52:27 - 0:52:28

especially how the message technique,

0:52:28 - 0:52:31

the message technique is great,

0:52:31 - 0:52:34

and also how you can extend choreo

0:52:34 - 0:52:36

and use it in different ways.

0:52:36 - 0:52:38

It's out there, so you can check that out.

0:52:41 - 0:52:42

And then I was going to say,

0:52:42 - 0:52:48

if Yasin or Josef want to say something, also feel free, please.

0:52:49 - 0:52:51

Yeah, I'm jumping in. I hope you hear me.

0:52:51 - 0:52:57

Also, because I have a hidden channel with Daniel and we chatted in the meantime, so I might guess his question.

0:53:00 - 0:53:06

He was thinking about the specific scenario in Malachite,

0:53:06 - 0:53:14

and it's sort of hard to find with the message-for-message semantics.

0:53:14 - 0:53:18

So he would be interested to see, because you mentioned that you're on Malachite now,

0:53:19 - 0:53:28

how far we are in this spec and whether we can look at special a special special semantics and special scenarios so uh

0:53:28 - 0:53:35

just to make sure i understand correctly so it's a scenario where the message by message would

0:53:35 - 0:53:41

catch it and no it's too long to catch it uh it's too complicated it takes so long yeah yeah yeah

0:53:41 - 0:53:51

that's interesting uh yeah i i would be definitely be definitely be very excited to rewrite the Malachite spec in the Choreo framework.

0:53:51 - 0:54:00

So what I tried yesterday, to be honest, I was thinking about some Malachite stuff and then I was like, oh, I'll just leave this running in the background while I'll do other stuff.

0:54:00 - 0:54:06

And I asked Claude to do it. And it's very interesting to see how it does it.

0:54:06 - 0:54:09

And I think it got a lot of things correctly.

0:54:09 - 0:54:12

And looking at the spec just seems very organized

0:54:12 - 0:54:14

with the queue pattern and like when this happened

0:54:14 - 0:54:17

and that happens, it looks really nice.

0:54:17 - 0:54:22

So I can share that with you and Daniel as well,

0:54:22 - 0:54:24

if you want to, oh, Daniel can speak, I guess.

0:54:24 - 0:54:25

He's speaker. Yeah, Daniel can speak, I guess. He's the speaker.

0:54:25 - 0:54:34

Yeah, I hope I can. Hey guys, how are you? Yeah, I don't know much about consensus protocol

0:54:34 - 0:54:41

itself, but I was wondering, asking, one thing that's hard to know when you have BFT protocols

0:54:41 - 0:54:50

is the behavior of Byzantine agents. The typical modeling that we do essentially produce all the message possible by Byzantine,

0:54:50 - 0:54:57

like every round, every value, etc., to try to emulate the attacks that you can try.

0:54:57 - 0:55:00

But this increases a lot the space, right?

0:55:00 - 0:55:02

Do you have some solution for that?

0:55:02 - 0:55:09

Because, I mean, you are around one, you have four messages generated by legit process,

0:55:09 - 0:55:11

and then you have 200 messages generated by Byzantines.

0:55:13 - 0:55:14

Yes, that's a very good question,

0:55:14 - 0:55:18

because this is where the message soup technique shines the most.

0:55:19 - 0:55:22

So I really recommend that you take a look at the message soup blog

0:55:22 - 0:55:24

that we posted on the Quint blog,

0:55:24 - 0:55:25

because we talk about

0:55:25 - 0:55:31

this there but uh so if you're not thinking about byzantine messages if you just have the the

0:55:31 - 0:55:36

regular ones then what we were talking about here with etan and having the messages in different

0:55:36 - 0:55:43

orders it has some impact when you add byzantine messages which as i said is usually such a larger

0:55:43 - 0:55:48

set than the good messages because there's so so many possibilities that what people can do wrong,

0:55:48 - 0:55:54

right? What, what the malicious, uh, wrong nodes can do. Uh, then the, the

0:55:54 - 0:55:59

difference that the message soup makes is huge. So think like you're in one node

0:55:59 - 0:56:03

and you have to consider like receiving all of the individual messages. If you're

0:56:03 - 0:56:06

doing the message by message style,

0:56:06 - 0:56:09

then you have to consider receiving

0:56:09 - 0:56:11

all of these different messages.

0:56:11 - 0:56:15

So three of the correct ones, 200 of the Byzantine ones.

0:56:17 - 0:56:19

So the chances that you pick a good messages

0:56:19 - 0:56:22

in random simulation, for example, are very low.

0:56:22 - 0:56:25

While if you do write them using the, like,

0:56:26 - 0:56:27

looking at the message soup and

0:56:27 - 0:56:29

observing if you have threshold,

0:56:30 - 0:56:31

you see how this problem

0:56:31 - 0:56:33

mostly goes away. Because you're

0:56:33 - 0:56:35

looking at the message soup and you're saying,

0:56:36 - 0:56:38

like, do I have quorum of pre-votes for something?

0:56:38 - 0:56:40

And then the Byzantine influence

0:56:40 - 0:56:41

there, like the,

0:56:41 - 0:56:43

how much it influences your state space is much

0:56:43 - 0:56:45

lower because the interesting

0:56:45 - 0:56:50

things that you do are much more on the correct messages than on the Byzantine messages, because

0:56:50 - 0:56:51

you have a bunch of validations, right?

0:56:51 - 0:56:55

So you're not going to receive any Byzantine message and do something about it.

0:56:55 - 0:57:01

Most of the Byzantine messages won't cause any reaction in the system unless you somehow

0:57:01 - 0:57:02

have quarrel.

0:57:03 - 0:57:06

So it does help so much with that factor as well.

0:57:08 - 0:57:10

Also, let me add that,

0:57:10 - 0:57:12

I think you mentioned it before,

0:57:12 - 0:57:14

Yassin has written a master thesis on this,

0:57:15 - 0:57:17

and he's called it now,

0:57:17 - 0:57:18

but he's actually on well-deserved vacations

0:57:18 - 0:57:20

after he's finished his thesis.

0:57:20 - 0:57:22

And he has done a lot of experiments,

0:57:22 - 0:57:27

performance experiments on Quint and also for Byzantine notes

0:57:27 - 0:57:31

in his thesis to take a look there. And pretty much shows the

0:57:31 - 0:57:35

graphs that he has there show that there's basically roughly no

0:57:35 - 0:57:38

influence or constant influence when you have Byzantine

0:57:38 - 0:57:43

messages in the message soup. While if you have this message by message

0:57:43 - 0:57:45

semantics, it

0:57:45 - 0:57:51

makes the traces very, very long. So it's super interesting to see the graphs. So yeah,

0:57:51 - 0:57:56

go there and check out this thesis also. A lot of data in there.

0:57:56 - 0:58:00

And with Choreo, it's super easy to interchange both as well, if you need to have both. Because

0:58:00 - 0:58:04

I know from Malachi, there are some things where you want to consider message by message.

0:58:04 - 0:58:06

So with Choreo, it should be super easy to have, some things where you want to consider message by message. So with Choreo it should be super easy to have,

0:58:06 - 0:58:08

like now I want to run it with message by message semantics.

0:58:08 - 0:58:11

Oh, now I want to run with message sub semantics.

0:58:11 - 0:58:13

So it should be easy to have both.

0:58:15 - 0:58:16

Yeah, I remember.

0:58:16 - 0:58:19

There is another thing that I think some you have mentioned

0:58:19 - 0:58:22

in the presentations that regarding message loss.

0:58:23 - 0:58:28

I had a look on the code and apparently the message has a sender, right?

0:58:28 - 0:58:30

But it doesn't have a destination.

0:58:30 - 0:58:35

So when you have a message loss, the message is lost for everyone.

0:58:35 - 0:58:38

Am I correct here or I interpreted it wrong?

0:58:39 - 0:58:44

No, because so the message is never taken out of the set in that sense.

0:58:40 - 0:58:44

No, because so the message is never taken out of the set in that sense.

0:58:44 - 0:58:45

Yep.

0:58:45 - 0:58:52

What can happen is that a certain node never receives that message or never

0:58:52 - 0:58:53

receives that message.

0:58:53 - 0:58:57

So when you are looking also at the set of all messages that the node sees,

0:58:57 - 0:58:59

you can not look at the entire set.

0:58:59 - 0:59:01

You can look at a subset of that.

0:59:02 - 0:59:06

So for node one, you'll see all messages, but for node two, when you're going to

0:59:06 - 0:59:11

like do the message sub technique, you're going to only see a subset of the messages. You can do it

0:59:11 - 0:59:18

like that. So you can program as a, okay, a message can be seen by some subset of the processes,

0:59:18 - 0:59:23

because again, when we are talking about Byzantine attacks, this is super important, right? The

0:59:23 - 0:59:27

ability, for example, of sending a message to some guys,

0:59:27 - 0:59:29

another different message to others, right?

0:59:31 - 0:59:32

Yes.

0:59:32 - 0:59:33

Yeah, I'm a bit too short.

0:59:33 - 0:59:37

So what I just told you, it's not the best,

0:59:37 - 0:59:38

like the most performatic thing,

0:59:38 - 0:59:42

considering all subsets and stuff, but we do have,

0:59:42 - 0:59:44

so if you look at the Tendermint spec in Corio,

0:59:44 - 0:59:47

we do have a run for disagreement where this happens.

0:59:47 - 0:59:49

This was just said, like the,

0:59:49 - 0:59:51

you see some messages from one Byzantine

0:59:51 - 0:59:54

and the other processes receive another message

0:59:54 - 0:59:55

from the business.

0:59:55 - 0:59:58

So these are the receiving part of the code.

0:59:58 - 0:59:59

It's not on the sending.

0:59:59 - 1:00:01

The sending is done for all,

1:00:01 - 1:00:03

and the message is always there,

1:00:03 - 1:00:07

but you can non-deterministically not receive certain message.

1:00:12 - 1:00:13

Okay, great.

1:00:13 - 1:00:15

Yeah, I think it's something that we,

1:00:15 - 1:00:19

yeah, we have tried to do a generic network.

1:00:19 - 1:00:21

I think I even discussed it with you

1:00:21 - 1:00:23

because this is looking super important to me

1:00:23 - 1:00:30

to have a generic network layer to say like that, right? Because for every protocol, you have to write this down

1:00:30 - 1:00:36

again. And I think this is very useful. I tried to read the masterpieces, but it's very mathematical.

1:00:36 - 1:00:43

It will take some time to understand all the formulas, but very nice work. And yeah, I hope

1:00:43 - 1:00:47

it will be useful for you and also for other people that are designing

1:00:47 - 1:00:48

products.

1:00:49 - 1:00:50

Thank you, Daniel.

1:00:50 - 1:00:51

Thank you for your feedback.

1:00:51 - 1:00:55

Yeah, and the master thesis is more like if you're going to go deep into the mathematical

1:00:55 - 1:00:59

part, but if you just want to use Choreo, I recommend you read the documentation that

1:00:59 - 1:01:00

we have on the website.

1:01:00 - 1:01:06

So on the Quint website, we have Choreo tab, and you can check out the more like practical stuff in there.

1:01:10 - 1:01:12

Okay, we are about time,

1:01:12 - 1:01:15

but if anyone has one more question, we can do it.

1:01:15 - 1:01:16

Otherwise,

1:01:21 - 1:01:25

I'll give you 10 seconds to ask for to speak.

1:01:26 - 1:01:27

Then Ocean can let me know.

1:01:32 - 1:01:35

Maybe just a closing question, Gabriela.

1:01:35 - 1:01:41

You said traces and runs are probably your favorite features of Quint.

1:01:42 - 1:01:44

What's your favorite spec that you've written?

1:01:47 - 1:01:49

That I've written. Or not, that you've reviewed.

1:01:49 - 1:01:58

Oh, it's tricky. I think I fell in love with the Tendermint one for a while. Because I

1:01:58 - 1:02:03

have written I think so many versions of it. Like we had a very initial version that is

1:02:03 - 1:02:06

very close to what we had in TLA

1:02:06 - 1:02:08

because we had a Tendermint spec in TLA and then we just

1:02:08 - 1:02:10

made like a kind of like direct

1:02:10 - 1:02:12

translation to Quint and then that spec

1:02:12 - 1:02:13

kind of like evolved and I've written a different

1:02:13 - 1:02:16

version. That's how like the first

1:02:16 - 1:02:18

experiments on choreo was also done with

1:02:18 - 1:02:19

Tendermint from my side

1:02:19 - 1:02:22

and now we have this choreo

1:02:22 - 1:02:23

spec that I really like it.

1:02:24 - 1:02:25

I want to show it to everyone.

1:02:25 - 1:02:27

So I think that that is my favorite.

1:02:27 - 1:02:27

Very cool.

1:02:27 - 1:02:30

And for fun, I really like the tic-tac-toe example.

1:02:31 - 1:02:31

Okay, great.

1:02:33 - 1:02:34

Awesome.

1:02:34 - 1:02:36

Well, if there's nothing else, this was great.

1:02:36 - 1:02:38

Thanks, everyone, for tuning in.

1:02:38 - 1:02:40

And check out Quint and check out Choreo.

1:02:41 - 1:02:43

Yes, thank you, everyone.

1:02:43 - 1:02:44

Thank you for listening.

1:02:45 - 1:02:47

Bye, all. Thank you. Take care.

1:02:48 - 1:02:49

Bye-bye.

Short Summary

Full Transcription

Host

Speaker