Sui Security Discussions: The AI Cybersecurity Arms Race

0:00:00 - 0:01:19

Thank you. Hello everyone and welcome. So we're here, we're going to be talking about security and we're

0:01:19 - 0:01:24

going to be talking specifically about the AI cybersecurity arms race. You know, unless you've

0:01:24 - 0:01:25

been living under a rock,

0:01:26 - 0:01:28

you notice that around November of last year,

0:01:28 - 0:01:30

coding agents really started to change in terms of their effectiveness.

0:01:31 - 0:01:33

Cloud Code Codex, these things have been around for a while,

0:01:33 - 0:01:37

but there's a step change in the work they can do autonomously,

0:01:37 - 0:01:39

the work they can do reliably, and what they're able to do.

0:01:39 - 0:01:41

And of course, coding agents are also security agents.

0:01:41 - 0:01:44

If you can write code, then you can write static analyzers,

0:01:44 - 0:01:51

you can read code, you can reason about it, you can write exploits, you can debug. These are what the AI

0:01:51 - 0:01:54

people like to call dual-use capabilities. They help defenders, they help attackers as well.

0:01:55 - 0:01:58

So it's a very interesting situation for, I think, everyone in the software industry. But I think

0:01:58 - 0:02:03

in crypto and with smart contract platforms like SWE and the other ones that you all work on,

0:02:04 - 0:02:27

we're on the front lines of this because we write code that manages money. It's radically open. There are rich bug bounty programs. So a lot of this is getting a lot of attention and there's increased action. you know, just inform people both in the crypto industry and outside who are going to, who are not on the front lines, but are going to be affected just as much what they, what

0:02:27 - 0:02:31

they need to be thinking about and doing to keep themselves safe.

0:02:31 - 0:02:35

So I wanted to start off by a, well, if everyone go around and do a brief intro and just tell

0:02:35 - 0:02:40

me about a jaw dropping moment you had with a coding agent related to security, where

0:02:40 - 0:02:42

it just did something where you're like, wow, I did not think that I could do that.

0:02:42 - 0:02:44

I didn't expect that that could work.

0:02:44 - 0:02:48

Or, you know, this is just like your, this is just your moment of things changed since November.

0:02:48 - 0:02:50

Maybe we could start with you, Ben.

0:02:51 - 0:02:51

Sure.

0:02:52 - 0:02:58

I'd say the jump from Sonnet 3.7 to Opus 4.5,

0:02:58 - 0:03:02

that transition period was when it really seemed like

0:03:02 - 0:03:04

we could actually code things with agents

0:03:04 - 0:03:05

and have them abstract about the code base and actually do tool calling. period was when it really seemed like we could actually code things with agents

0:03:11 - 0:03:16

and have them abstract about the code base and actually do tool calling because it's hard to remember but like a couple months ago models hated calling tools they would never call tools and they

0:03:16 - 0:03:22

would never use them properly and to finally see that work correctly was absolutely incredible

0:03:23 - 0:03:26

what was the thing you tried to do where you saw the capability jump concretely?

0:03:27 - 0:03:28

So what we were trying to do

0:03:28 - 0:03:31

is we were trying to mix Slither's static analysis

0:03:31 - 0:03:34

with LLM-based semantic analysis

0:03:34 - 0:03:37

because there's a lot of limitations with static analysis.

0:03:37 - 0:03:39

You can't figure out like what's an actor,

0:03:39 - 0:03:41

like what's the semantic meaning of this code.

0:03:41 - 0:03:43

And so we tried to plug it into an LLM

0:03:43 - 0:03:46

and have the LLM infer certain things

0:03:46 - 0:03:51

about the code and then pass them back into Slither using tool calls. And it just wouldn't do it.

0:03:52 - 0:03:57

When it would call the tool, it would be malformatted. It would have issues. It wouldn't

0:03:57 - 0:04:02

know what to do. Sometimes it would call it when it shouldn't call it. Sometimes it would

0:04:02 - 0:04:07

do the exact opposite. And I'd say it was really the jump after Sonnet 3.7

0:04:07 - 0:04:10

where we really started to see that get resolved.

0:04:11 - 0:04:12

That's super cool.

0:04:13 - 0:04:14

And I should have said mine too.

0:04:14 - 0:04:16

I'm the co-founder and CTO of Mistin Labs

0:04:16 - 0:04:18

and creator of the Moot programming language.

0:04:18 - 0:04:20

But mine also had to do with static analysis and triaging

0:04:20 - 0:04:23

where I did a lot of work on static analysis at Facebook

0:04:23 - 0:04:23

in an open source tool.

0:04:24 - 0:04:28

And so I said, okay, you know, Cloud Code code, please take my quandary taint analyzer that's

0:04:28 - 0:04:32

written in OCaml for Java and port it to move and run it on the full corpus of all the move

0:04:32 - 0:04:38

programs. And it just did it. And that was like, okay, add entry points as sources and add, you

0:04:38 - 0:04:41

know, usage of money as a sink. And then it's like, cool, here are the results. And then I was like,

0:04:41 - 0:04:46

triage the results and show me what looks like a phone. And then when I saw the results, I was like, whoa,

0:04:46 - 0:04:49

this is a new world that we're in

0:04:49 - 0:04:50

in terms of like the speed and the capabilities.

0:04:50 - 0:04:51

Whereas like the triage results

0:04:51 - 0:04:53

that was completely manual before

0:04:53 - 0:04:55

and the coding would have taken a long, long time.

0:04:55 - 0:04:57

Yeah, absolutely.

0:04:57 - 0:04:58

Building on that, there was something that we did

0:04:58 - 0:05:00

very recently around dimensional analysis.

0:05:00 - 0:05:02

You mentioned OCaml.

0:05:02 - 0:05:04

I used to be really into F-sharp

0:05:04 - 0:05:06

and F-sh sharp has these like

0:05:06 - 0:05:11

units of measure in its language typing system. We're like, what if we took that and we tried to

0:05:11 - 0:05:16

apply it to Solidity or like a smart contract language and have an LLM infer what the unit

0:05:16 - 0:05:21

should be and then do the arithmetic to see if all the units work out. And it gave us really,

0:05:21 - 0:05:25

really good results. We published the plugin to our skills repo if anyone's interested.

0:05:25 - 0:05:30

But being able to take these traditional techniques

0:05:30 - 0:05:32

and blend them with the semantic analysis of LLMs,

0:05:32 - 0:05:35

it's absolutely incredible where it's going.

0:05:35 - 0:05:36

Awesome.

0:05:36 - 0:05:38

Seth, what about for you?

0:05:38 - 0:05:39

Please introduce yourself and tell us

0:05:39 - 0:05:42

about your jaw-dropping moment.

0:05:42 - 0:05:42

Sure.

0:05:42 - 0:05:43

So my name is Seth.

0:05:43 - 0:05:46

I'm the CEO of Satora and Satora is a security company

0:05:46 - 0:05:49

and we work across all different chains,

0:05:49 - 0:05:51

all different aspects of security in Web3.

0:05:51 - 0:05:54

And I think I can take that personally

0:05:54 - 0:05:54

or professionally.

0:05:55 - 0:05:56

And what I mean by that is in my role,

0:05:56 - 0:05:57

I'm not coding day-to-day

0:05:57 - 0:05:59

and I'm not auditing day-to-day.

0:05:59 - 0:06:01

So I've had my own personal

0:06:01 - 0:06:03

recent jaw-dropping moments,

0:06:03 - 0:06:05

but, and they're kind of interesting.

0:06:06 - 0:06:13

So for me, recently, and this was a while ago, I wanted to extract a list of all of our contacts from Salesforce.com.

0:06:13 - 0:06:15

This is like a stupid, annoying task.

0:06:16 - 0:06:21

And I asked Claude to do it, and I just said, Claude, write me a script that extracts all my contacts from Salesforce.com.

0:06:21 - 0:06:22

It does it right.

0:06:23 - 0:06:25

But what's interesting is it does so many things

0:06:25 - 0:06:29

wrong from a security perspective, but they're subtle. So it stored the files in a place that,

0:06:29 - 0:06:32

first of all, I don't have access to. And it's like, where did it even get that from?

0:06:33 - 0:06:38

It's some arbitrary directory on disk that includes a path that has nothing to do with

0:06:38 - 0:06:42

anything I could imagine. I thought, it's fascinating to know that it pulled this

0:06:42 - 0:06:45

information from somewhere and injected it into my code.

0:06:45 - 0:06:47

And I don't know how it got here.

0:06:47 - 0:06:53

And then just the usage of environment to store safety, critical secrets like my Salesforce

0:06:53 - 0:06:58

login key that really should be kept secret in a more meaningful way.

0:06:58 - 0:07:01

That gave me very personally both sides of the story.

0:07:01 - 0:07:04

More professionally, it's like every time I talk to our researchers, there's something

0:07:04 - 0:07:07

amazing. Just a couple of days ago, I was talking to one of our researchers

0:07:07 - 0:07:12

and he was saying that a client of his who he worked with before he came to Sartora,

0:07:12 - 0:07:17

who had a really complex protocol, and he'd found seven critical vulnerabilities in that project.

0:07:17 - 0:07:24

And this client came to him with a list of 200 reports from an AI tool. And painstakingly,

0:07:24 - 0:07:26

the client had gone through the first 160 and decided

0:07:26 - 0:07:31

that they weren't real, but there were 40 left that were just painfully difficult to evaluate.

0:07:31 - 0:07:36

So difficult that our researcher wasn't sure that he was clear on all of them. And he asked the LLM

0:07:36 - 0:07:41

itself to figure out which of those were valid. And it did. And it came back to him with a list

0:07:41 - 0:07:48

that when he went back through that sort of self-prioritized list, he found seven real issues out of that 200.

0:07:48 - 0:07:57

And what's most frightening about that story is that he said three of those intersected with his own findings, but four did not.

0:07:57 - 0:08:02

And, you know, his sort of seven findings were all steel funds kinds of findings.

0:08:00 - 0:08:02

were all steel funds kinds of findings.

0:08:02 - 0:08:04

And the four non-intersecting LLM findings

0:08:04 - 0:08:07

were all steel funds kinds of findings

0:08:07 - 0:08:10

embedded in that original list of 200.

0:08:10 - 0:08:12

So, you know, beyond the interesting things

0:08:12 - 0:08:14

we're doing with tools and, you know,

0:08:14 - 0:08:15

LLMs evaluating results,

0:08:15 - 0:08:18

which we absolutely do with the Prover all the time,

0:08:18 - 0:08:19

looking at formal verification results

0:08:19 - 0:08:22

that are really subtle and hard to understand.

0:08:22 - 0:08:24

We have our new violation analyzer

0:08:24 - 0:08:26

that now uses LLMs to do all sorts of interesting stuff.

0:08:27 - 0:08:29

There's just that very disturbing knowledge

0:08:29 - 0:08:32

that LLMs can find things that humans can

0:08:32 - 0:08:33

and vice versa.

0:08:34 - 0:08:34

So what do you do?

0:08:35 - 0:08:37

So that's my moment.

0:08:37 - 0:08:39

Yeah, I love both those stories.

0:08:39 - 0:08:42

And I feel like pretty soon we'll feel that I found four

0:08:42 - 0:08:43

and the LLM found seven.

0:08:43 - 0:08:50

It's going to be a pretty good score for the human so he should take yeah uh Robert what what about you uh tell me about

0:08:50 - 0:08:56

your background and uh your your jaw-dropping moment yes um I'm Robert I'm the founder of

0:08:56 - 0:09:01

PoderSec we've been working with Steve for for a long time now I guess yeah probably like three

0:09:01 - 0:09:06

years I feel like time sort of flies in this industry.

0:09:09 - 0:09:09

I think for me, there wasn't one particular moment.

0:09:11 - 0:09:12

I guess it was more like the trend.

0:09:14 - 0:09:16

Like, I don't think there was one instance where we ran the tool and we found a bunch of bugs and we're like, oh, this is amazing.

0:09:16 - 0:09:21

I think for me, probably it's been, I mean, we worked on the EVM bench post with OpenAI

0:09:21 - 0:09:22

and Paradigm.

0:09:22 - 0:09:26

And I remember, I don't know if I sent you the chart actually, but I remember there was

0:09:26 - 0:09:32

this one chart which was like GPT-5, I think had a 20% on the benchmark and then GPT-5.3

0:09:32 - 0:09:33

had like a 40% on the benchmark.

0:09:33 - 0:09:37

And, you know, I think when we saw that, we actually put the data down.

0:09:37 - 0:09:40

I mean, that trend was super, right.

0:09:40 - 0:09:49

And I mean, even if the trend doesn't continue entirely, right? Like you can imagine GPD 5.4 or 5.5 has like 50% or 60%,

0:09:49 - 0:09:52

that by its head doubled over the course of a few months.

0:09:52 - 0:09:57

I think that was to me a sign of what's to come, right?

0:09:57 - 0:10:00

And how we all need to be using AI more seriously

0:10:00 - 0:10:01

in our workflows.

0:10:03 - 0:10:04

Definitely.

0:10:04 - 0:10:05

EVM Bench is super cool work,

0:10:05 - 0:10:08

and we're gonna be digging into that a little bit more

0:10:08 - 0:10:09

with our first question.

0:10:09 - 0:10:11

But first, Klaas, let me go to you

0:10:11 - 0:10:12

to hear about your background, your company,

0:10:12 - 0:10:15

and what really impressed you

0:10:15 - 0:10:17

with coding and security agents.

0:10:18 - 0:10:21

Yeah, so we're a tiny company focusing on SUI,

0:10:21 - 0:10:23

so we're doing auditing and formal verification.

0:10:23 - 0:10:30

Actually, we're doing almost only formal verification lately for SUI, so we're doing auditing and formal verification, actually doing mostly almost only formal verification lately for SUI. We've been essentially

0:10:30 - 0:10:36

working on the, to make the SUI Prover work well for the past almost year and a half.

0:10:36 - 0:10:44

And yeah, essentially our focus very much over the past few months is making formal

0:10:44 - 0:10:45

verification scale, so we've been working with many of the top DeFi projects on SUI. And yeah, essentially our focus very much over the past few months is making formal education scale.

0:10:45 - 0:10:49

So we've been working with many of the top DeFi projects on Sui.

0:10:51 - 0:10:53

Now, in terms of story, well, yeah.

0:10:54 - 0:10:58

Well, it's actually a story from my co-founder, Andrei Steppanescu.

0:10:59 - 0:11:05

So he was working on modeling the borrow mute for dynamic fields for the prover.

0:11:06 - 0:11:10

And yeah, and basically the agent struggled with it quite a bit

0:11:10 - 0:11:11

because it was in the loop with Cloud Code.

0:11:12 - 0:11:18

And eventually the agent actually decided to go upstream in the SWE prover

0:11:18 - 0:11:21

for everyone is actually based on the SWE compiler, right?

0:11:22 - 0:11:24

And the agent decided to go upstream in the compiler

0:11:24 - 0:11:29

and essentially start putting print lines there and starting to understand the structure so that it

0:11:29 - 0:11:35

can propagate ownership information down and it then made the changes to propagate ownership

0:11:35 - 0:11:45

information and essentially solve the its task which is considering the complexity of all of this is very impressive.

0:11:49 - 0:11:55

Now on the other side, I mean, I'll probably have more examples of failures probably also,

0:11:55 - 0:12:00

but I think the main thing, kind of a summary of the failures is that,

0:12:00 - 0:12:04

I think our main struggle is with making the agents reliable,

0:12:06 - 0:12:07

both for auditing and for modification.

0:12:07 - 0:12:15

So they can be intermittently brilliant, but at this point, we cannot rely on them for

0:12:15 - 0:12:16

either.

0:12:16 - 0:12:19

They're just making us way, way faster.

0:12:19 - 0:12:20

Nice.

0:12:20 - 0:12:44

Yeah, the story of it needs to do something, but it sort of doesn't have enough information in this part of the code, but it exists elsewhere and sort of figuring out where it'd be and like pulling the plumbing through is like a super, super impressive thing. And, you know, I can't count how many times I've done that as a programmer. And like the fact that you don't have to do that anymore, that someone else can do it, especially the plumbing part. The discovery part is really the impressive part, but the plumbing is like the time consuming is pretty amazing.

0:12:40 - 0:12:44

is really the impressive part, but the plumbing is like the time consuming is pretty amazing.

0:12:45 - 0:12:50

So thanks guys for the intros and the stories. Now we're going to get into the questions. So for

0:12:50 - 0:12:53

the audience, what I did is I sent out a survey with eight questions, eight questions that are

0:12:53 - 0:12:57

supposed to be hard, where I thought there would, you know, sort of be different opinions. And on

0:12:57 - 0:13:00

six out of eight, we managed to split the room and get a very different opinions. And so I'm going to

0:13:00 - 0:13:04

go through the questions and call on people on different sides of it. These are yes or no questions

0:13:04 - 0:13:07

and sort of get some interesting discussion and debate going.

0:13:08 - 0:13:12

So one question that we'll start with is the question is, for some value of X,

0:13:13 - 0:13:17

releasing a model and skills that can reliably find X percent of critical vulnerabilities

0:13:17 - 0:13:21

in some substantial open source benchmark set is a violation of responsible disclosure.

0:13:22 - 0:13:23

This is an ethics question.

0:13:23 - 0:13:25

So Robert, you talked about EVM Bench. You said, yes, this would be a violation of responsible disclosure. This is an ethics question. So Robert, you talked

0:13:25 - 0:13:29

about EVM Bench. You said yes, this would be a violation of responsible disclosure, but you

0:13:29 - 0:13:34

clearly don't think x equals 40 because you released the code and the skills. So what's the

0:13:34 - 0:13:41

x for which this is true and how do you think about this question? Yeah, I mean, I think there's

0:13:41 - 0:13:52

a bit of a tension here, right, in the sense that if you don't release anything ever, people won't know that security tools are increasing at such a rapid pace.

0:13:52 - 0:13:54

And, you know, you might actually do worse for the security ecosystem.

0:13:55 - 0:14:04

But I think the obvious example that comes to mind is like if X is 100 and you release it and immediately everyone gets hacked, then clearly that's also not valid.

0:14:04 - 0:14:06

Right. And that can't possibly be correct either.

0:14:07 - 0:14:10

And I think this is one of the questions that we were debating, right,

0:14:10 - 0:14:11

when we worked on this project too, is like, hey,

0:14:12 - 0:14:14

if we have this tool that can actually reliably find bugs

0:14:14 - 0:14:17

in a large amount of these smart contracts,

0:14:17 - 0:14:22

how do we best get it to developers so they can run it first

0:14:22 - 0:14:26

as opposed to hackers.

0:14:26 - 0:14:30

And I think one thing that we did for Ethereum Venture example is that we worked on a front-end

0:14:30 - 0:14:36

where we sponsored or I guess paradigm sponsored the credits and this allowed people to run their

0:14:36 - 0:14:43

contracts essentially for free with the framework that we used. And honestly yeah I agree with you

0:14:43 - 0:14:45

like I think 40% is probably not the right.

0:14:45 - 0:14:51

I mean, 40% on our specific eval set, which means that, like, in the real world, maybe it's slightly less.

0:14:54 - 0:15:02

But if it was, like, 90% or, like, 85%, I feel like I would be much more inclined to say, like, hey, you should first run this on all the major projects.

0:15:03 - 0:15:04

See if there's any findings.

0:15:04 - 0:15:06

And for us, we actually did some of that.

0:15:06 - 0:15:08

Like, we did run this tooling against big projects

0:15:08 - 0:15:14

and made sure that there weren't a bunch of critical bugs laying around.

0:15:17 - 0:15:18

Yeah, that's the sort of thing I was wondering,

0:15:18 - 0:15:20

where, like, you guys in the EVM bench,

0:15:20 - 0:15:21

you very carefully selected the benchmarks,

0:15:21 - 0:15:23

so it's all vulnerabilities that have been fixed or code

0:15:23 - 0:15:25

where it's sort of, like, not at risk anymore. But, you know, at some point, the it's all vulnerabilities that have been fixed or code where it's sort of like not at risk anymore.

0:15:25 - 0:15:27

But, you know, at some point, the X is high enough

0:15:27 - 0:15:28

that you're like, well, someone can also take this

0:15:28 - 0:15:30

and run it on any open source code.

0:15:30 - 0:15:32

So like, what's my ethical obligation to run that,

0:15:33 - 0:15:35

to report it, to sort of give people a leg up?

0:15:35 - 0:15:37

I think those are the sort of really interesting questions

0:15:37 - 0:15:39

that arise around this.

0:15:40 - 0:15:42

So Ben, I wanted to go to zero no on this question.

0:15:42 - 0:15:44

You can sort of see it in the way Trail of Bits operates.

0:15:44 - 0:15:47

Like you guys are really proactive in open sourcing these skills,

0:15:47 - 0:15:49

open sourcing tools, like sort of making auditing.

0:15:50 - 0:15:53

An AI powered security auditor, security researcher available.

0:15:53 - 0:15:54

How do you think about this question?

0:15:55 - 0:15:57

I think this is a really good question.

0:15:58 - 0:16:01

I think the way you get to think about it from a framing perspective

0:16:01 - 0:16:11

is how hard is it to create a tool that is going to be able to exploit something so like a clod skill has a very low cost to create you could

0:16:11 - 0:16:15

literally open up clod and be like hey i want to create a skill that does blah blah blah blah

0:16:15 - 0:16:22

and it'll produce that um for something like that it's really easy for an attacker to also get

0:16:22 - 0:16:25

access to that capability they just have to stumble upon it.

0:16:32 - 0:16:34

So let's say in some way we built a skill that finds 80% of critical vulnerabilities.

0:16:38 - 0:16:43

In a case like that, if we actually want to triage it and be able to like responsibly disclose this across the industry, like it'd be a process that probably take a couple of months, we have to run

0:16:43 - 0:16:48

it against absolutely everything we could. And just going through the disclosure, I think would be like

0:16:48 - 0:16:52

three or four months because just not everyone has a bug bounty. And the people

0:16:52 - 0:16:56

who don't have a bug bounty are really hard to contact. And there's TVL at risk on

0:16:56 - 0:17:00

those protocols. On the flip side, if we

0:17:00 - 0:17:04

were to just publish it and not wait those

0:17:04 - 0:17:06

three months, then you have to think about, okay, who's going to be publish this, you know, and not wait those three months, then you have to think about,

0:17:06 - 0:17:10

okay, who's going to be running this? Well, you have white hats and you have black hats.

0:17:10 - 0:17:15

And if anyone in this room was to publish something that claimed to find 80% of the

0:17:15 - 0:17:20

critical issues, I'm pretty sure we would all be running it on as much stuff as we could

0:17:20 - 0:17:24

immediately. And so now you get to ask yourself, how many white hats are there in the industry?

0:17:24 - 0:17:30

And how many black hats are there? And what are their relative token budgets? And I think the

0:17:30 - 0:17:35

white hats would be operating on orders of magnitude, larger token budgets than the black

0:17:35 - 0:17:41

hats. So by publishing this tool quickly, the white hats are the ones who are going to have

0:17:41 - 0:17:48

the advantage, not the black hats. Whereas if we keep it private, now there's three months where maybe a black hat is going to stumble on this and start exploiting

0:17:48 - 0:17:54

it. That's kind of the approach there. But let's say if it was something more expensive,

0:17:54 - 0:17:59

like a full model, like we're training a model. Training a model costs billions of dollars.

0:18:00 - 0:18:06

If we were to build a model that was able to find those bugs, it's kind of okay to wait three months

0:18:06 - 0:18:08

because North Korea is not going to be training

0:18:08 - 0:18:11

a billion-dollar model in a couple of months.

0:18:11 - 0:18:14

So it really depends on the circumstances

0:18:14 - 0:18:16

of how you've been able to create this tool.

0:18:17 - 0:18:19

Yeah, I think that's a really interesting answer.

0:18:19 - 0:18:23

I like your, like, the white hat token budget

0:18:23 - 0:18:24

versus the black hat token budget.

0:18:24 - 0:18:25

It definitely echoes the like open source,

0:18:25 - 0:18:26

like more eyeballs,

0:18:26 - 0:18:28

like more good eyeballs than evil eyeballs,

0:18:28 - 0:18:30

you know, makes for safer code,

0:18:30 - 0:18:31

which I think is a good segue

0:18:31 - 0:18:32

into the next question,

0:18:33 - 0:18:35

which is in 2026,

0:18:35 - 0:18:36

as of March, 2026,

0:18:36 - 0:18:38

it is now safer for your smart contract code

0:18:38 - 0:18:39

to be open source

0:18:39 - 0:18:40

than it is to be closed source.

0:18:41 - 0:18:42

So this one split the room.

0:18:42 - 0:18:43

It was 50-50.

0:18:43 - 0:18:45

Seth, you were on the record as saying you think it's safer to be open source. How this one split the room, it was 50-50. Seth, you were on the record as saying

0:18:45 - 0:18:47

you think it's safer to be open source.

0:18:47 - 0:18:48

How do you think about this?

0:18:49 - 0:18:51

Yes, and I still stick with that.

0:18:51 - 0:18:53

I think that closed source,

0:18:53 - 0:18:56

particularly in a situation where you deploy

0:18:56 - 0:18:58

and people can reverse engineer the binary

0:18:58 - 0:19:01

and work from there anyway,

0:19:01 - 0:19:03

is more of a mirage than anything.

0:19:03 - 0:19:06

And putting it out open source, you know, creates

0:19:06 - 0:19:10

a responsibility on your side to make sure that you've done your diligence with security without

0:19:10 - 0:19:15

the false sense of security you get from this secret that's not really a secret. So, I mean,

0:19:15 - 0:19:20

I think as with many things, AI related and otherwise, it's about the mentality that goes

0:19:20 - 0:19:25

into it. If you know it's open to the world and anybody can run AI tools against it,

0:19:25 - 0:19:27

you can rightfully say to yourself,

0:19:28 - 0:19:29

I would be irresponsible

0:19:29 - 0:19:30

if I didn't run the best AI tools

0:19:30 - 0:19:31

against my own code.

0:19:32 - 0:19:33

But if you have this illusion

0:19:33 - 0:19:34

that because it's closed source,

0:19:34 - 0:19:35

somehow it's safe and protected,

0:19:36 - 0:19:38

then you might think that you could get away

0:19:38 - 0:19:40

with not running the best tools out there

0:19:40 - 0:19:41

and you would be wrong.

0:19:41 - 0:19:43

That's really not going to help you at all in the end.

0:19:44 - 0:19:50

So I think open sourcing in this context just raises the security bar for all those involved,

0:19:50 - 0:19:55

and it sets the right mindset from the beginning, and that's why it's absolutely the right way to go.

0:19:56 - 0:19:59

And I think to the previous question of responsible disclosure, I would say

0:19:59 - 0:20:06

one way that I always look at it is, you know, AI tools have made it really easy to execute social engineering attacks,

0:20:06 - 0:20:09

but we don't say that they should have been closed source and a warning should have been put out

0:20:09 - 0:20:15

that everybody should be prepared for far more deep fake social engineering attacks because they're coming.

0:20:16 - 0:20:22

It's really just a matter of if you're in this industry and you take it seriously and you're in an arms race,

0:20:22 - 0:20:23

you need to play in the arms race.

0:20:23 - 0:20:26

And so you have to look for the next thing that comes out

0:20:26 - 0:20:27

and be there first,

0:20:27 - 0:20:29

because that's part of your responsibility

0:20:29 - 0:20:30

as a protocol runner.

0:20:31 - 0:20:33

That said, if I were to come up with,

0:20:33 - 0:20:34

kind of agreeing with Ben here,

0:20:34 - 0:20:36

if I were to come up with a new brilliant tool

0:20:36 - 0:20:40

that could find 100% of the vulnerabilities out there,

0:20:40 - 0:20:42

that's different because the barrier to entry

0:20:42 - 0:20:43

is very, very high.

0:20:43 - 0:20:47

And now I need to start thinking about responsible disclosure before I put it out in the world.

0:20:49 - 0:20:50

Yeah.

0:20:50 - 0:20:51

Yeah, that's super interesting.

0:20:53 - 0:20:55

And I think, yeah, I totally agree with you.

0:20:55 - 0:20:59

Well, I don't want to tip it too hard, but on this one, I think I agree.

0:20:59 - 0:21:01

I think reversing capabilities are superhuman.

0:21:01 - 0:21:05

So, yeah, it's like if you're open source, it's almost kind of more of a flex nowadays.

0:21:05 - 0:21:07

So it's like, hey, you know, hit me, I've done everything,

0:21:07 - 0:21:09

you know, and especially if you have some,

0:21:09 - 0:21:10

if you have your skills in there,

0:21:10 - 0:21:13

if you have verification, like so much more so.

0:21:13 - 0:21:15

I think one other factor I think about is that

0:21:15 - 0:21:17

the safest place to be is if you're open source

0:21:17 - 0:21:19

and you are a dependency of one of the foundation labs,

0:21:19 - 0:21:21

because you know what they're doing

0:21:21 - 0:21:22

before they release the models.

0:21:22 - 0:21:24

Now this is a very actionable

0:21:24 - 0:21:25

for smart contract developers, but I think it definitely matters. And I think if you're open source and you're prominent, you know, a're doing before they release the models. Now this is a very actionable for smart contract developers,

0:21:25 - 0:21:26

but I think it definitely matters.

0:21:26 - 0:21:28

I think if you're open source and you're prominent,

0:21:28 - 0:21:30

a lot of them think pretty seriously

0:21:30 - 0:21:30

about these questions too,

0:21:30 - 0:21:32

and about their responsibility

0:21:32 - 0:21:33

of what models can and can't do.

0:21:33 - 0:21:36

So if you're a prominent project in closed source,

0:21:36 - 0:21:37

I think you're definitely less safe

0:21:37 - 0:21:39

than a prominent project that's open source

0:21:39 - 0:21:41

because the angels might be running the early model

0:21:41 - 0:21:42

and disclosing things to you.

0:21:42 - 0:21:43

There's also this leak the other day

0:21:43 - 0:21:47

about how there's like secret anthropic employee mode for patching bugs

0:21:47 - 0:21:48

and stuff like that.

0:21:48 - 0:21:51

But Kaz, you were on the other side of this question.

0:21:51 - 0:22:07

I want to hear your point of view. KASZAKOV KASZAKOV KASZAKOV KASZAKOV KASZAKOV KASZAKOV KASZAKOV KASZAKOV KASZAKOV KASZAKOV KASZAKOV KASZAKOV KASZAKOV KASZAKOV KASZAKOV KASZAKOV KASZAKOV KASZAKOV KASZAKOV KASZAKOV KASZAKOV KASZAKOV KASZAKOV KASZAKOV KASZAKOV KASZAKOV KASZAKOV KASZAKOV KASZAKOV KASZAKOV KASZAKOV KASZAKOV KASZAKOV KASZAKOV KASZAKOV KASZAKOV KASZAKOV KASZAKOV KASZAKOV KASZAKOV KASZAKOV KASZAKOV KASZAKOV KASZAKOV KASZAKOV KASZAKOV KASZAKOV KASZAKOV KASZAKOV KASZAKOV KASZAKOV KASZAKOV KASZAKOV KASZAKOV KASZAKOV KASZAKOV KASZAKOV KASZAKOV KASZAKOV KASZAKOV KASZAKOV KASZAKOV I agree with Seth and it doesn't matter. So models can now basically engineer everything.

0:22:07 - 0:22:08

It doesn't matter.

0:22:08 - 0:22:11

So you might as well just publish it in that case.

0:22:11 - 0:22:13

And I think it's a good thing to publish this outcome.

0:22:14 - 0:22:18

Now, I think the more interesting part is whether to publish for us.

0:22:18 - 0:22:22

And that's actually, that was an actual question we asked ourselves.

0:22:22 - 0:22:23

It's whether to publish specs.

0:22:24 - 0:22:27

So basically formal verification.

0:22:27 - 0:22:30

And a few months ago, we were definitely in the campaigns

0:22:30 - 0:22:33

of, hey, we're going to publish everything.

0:22:33 - 0:22:34

We're going to make it easy for people

0:22:34 - 0:22:37

to trust this by replicating on their system.

0:22:37 - 0:22:41

And we were thinking, oh, we're going to put this somewhere

0:22:41 - 0:22:45

so that they can actually see all the traces everything

0:22:45 - 0:22:51

executed and all of that so this was our position a few months ago um and that's no longer our

0:22:51 - 0:22:57

position uh and the thing is basically we've seen agents do really well reasoning about specs

0:22:57 - 0:23:03

so if we publish them we are essentially for each of our clients we are say we're giving

0:23:03 - 0:23:07

attackers a way to build upon our work

0:23:07 - 0:23:11

and see these are the edges that are not yet super well verified.

0:23:12 - 0:23:16

Let me try to build a bit more there and see whether there's anything I can exploit.

0:23:17 - 0:23:20

And that's a risk.

0:23:20 - 0:23:21

And that's a risk that's avoidable, right?

0:23:22 - 0:23:23

So code is out there.

0:23:23 - 0:23:26

You cannot really stop the code from being out there.

0:23:26 - 0:23:27

On the other hand, the specs,

0:23:27 - 0:23:29

you can stop them from being out there.

0:23:29 - 0:23:33

And I think it is responsible towards our,

0:23:33 - 0:23:36

responsible to keep them private, unfortunately.

0:23:37 - 0:23:38

That's a very interesting trade-off.

0:23:38 - 0:23:41

Like when we're saying the machines are superhuman reversing,

0:23:41 - 0:23:44

so it's easy to go from binary to source code.

0:23:44 - 0:23:47

I think there's implicit assumption you're making that they are not superhuman at going from source code

0:23:47 - 0:23:51

to the correct spec, which is certainly my experience, especially if you want a concise spec.

0:23:51 - 0:23:55

How long that capability gap will exist is an interesting question, but at least while it does,

0:23:55 - 0:24:00

I think I'd buy your argument like that's maybe something you'd want to keep closed. And similarly,

0:24:00 - 0:24:04

like if you have text files or supporting context about where bodies are buried or previous bug

0:24:04 - 0:24:10

vulnerability reports or, you know, areas that your team is nervous about, like maybe those you could keep, maybe those you could keep private.

0:24:11 - 0:24:14

But I think eventually, like, you know, you're going to want all of those things out there.

0:24:16 - 0:24:25

So the next question is about verification, which we've been sort of touching on at a high level. So the yes, no statement is in 2026,

0:24:25 - 0:24:29

there is no defensible reason to launch a new smart contract without specifying and formally

0:24:29 - 0:24:33

verifying all financial and access control related properties. This is not common practice in the

0:24:33 - 0:24:37

industry, but we got three out of four yeses, which is super exciting. And I think I'm eager

0:24:37 - 0:24:42

to hear why folks think this because it's definitely a step change and implies that we're

0:24:42 - 0:24:48

going to be seeing more in the future. Maybe let me start with the no. Robert, you were the lone dissenter on this question.

0:24:49 - 0:24:50

What do you think?

0:24:50 - 0:24:56

Yeah, I mean, it feels like a super strong statement. I guess maybe I come from a slightly

0:24:56 - 0:25:04

different perspective where I feel like form verification depends a lot on the specs you

0:25:04 - 0:25:06

write and how good the specs are.

0:25:06 - 0:25:14

And I feel like just saying a blanket of like, hey, you have to launch with these formal properties might sound good in theory.

0:25:15 - 0:25:17

But in practice, people probably aren't going to do your job with it.

0:25:18 - 0:25:21

I also think that there's a lot of other considerations with this.

0:25:21 - 0:25:25

Like, for example, certain protocols might not need

0:25:25 - 0:25:28

formal verification or formal verification might not be super suited to them, right?

0:25:28 - 0:25:34

Or for example, maybe fuzzing is better for certain types of protocols compared to formal

0:25:34 - 0:25:39

verification if there's like a lot of math involved, for example. And I think this sort

0:25:39 - 0:25:44

of blanket statement doesn't feel right to me. I could totally see narrowing it, right?

0:25:44 - 0:25:45

Like certain kinds of protocols,

0:25:45 - 0:25:47

like maybe multi-sig protocols or vault protocols,

0:25:47 - 0:25:50

need formal verification of all the financial properties.

0:25:50 - 0:25:52

But this sort of blanket statement

0:25:52 - 0:25:53

just doesn't sit right with me.

0:25:54 - 0:25:56

Yeah, that's totally fair, I think.

0:25:56 - 0:25:58

And you have to make very strong statements

0:25:58 - 0:26:00

if you wanna get folks to disagree.

0:26:00 - 0:26:02

So I appreciate you taking the other side of that argument.

0:26:02 - 0:26:03

And I think I biased some parts of it.

0:26:03 - 0:26:04

There's some things like access control,

0:26:04 - 0:26:05

you should just specify and verify it.

0:26:05 - 0:26:06

Like, every protocol has it.

0:26:07 - 0:26:08

You know, there's no, it's not really hard, actually.

0:26:08 - 0:26:09

There's no reason not to do it.

0:26:10 - 0:26:11

There might be some other things where it's, like,

0:26:12 - 0:26:15

maybe should I specify and verify the way my events, you know,

0:26:16 - 0:26:17

correspond to my UI?

0:26:17 - 0:26:19

But maybe, you know, there could be bugs there, maybe not.

0:26:19 - 0:26:21

But, you know, there's more sort of more gray areas.

0:26:21 - 0:26:23

On the yes side, Seth, I wanted to ask you,

0:26:23 - 0:26:25

because, you know, Sertoro is like maybe the most successful

0:26:25 - 0:26:27

formal verification company in history.

0:26:27 - 0:26:29

Like you guys are obviously gonna have a strong opinion

0:26:29 - 0:26:30

on this and a lot of perspective.

0:26:30 - 0:26:31

And of course you have personal background,

0:26:31 - 0:26:33

you know, with CoVarity and working in formal methods

0:26:33 - 0:26:35

besides verification and beyond.

0:26:35 - 0:26:37

So I'm really, really interested to see what you think.

0:26:37 - 0:26:39

Well, actually, let me start by saying,

0:26:39 - 0:26:40

would you have had the same answer in 25

0:26:40 - 0:26:42

or did your answer change in 26?

0:26:42 - 0:26:45

And how do you think about it in both of these? Actually, my answer didn't change. I would have said the same in 25 or did your answer change in 26 and how do you think about it in both of these?

0:26:50 - 0:26:55

Actually my answer didn't change. I would have said the same in 25 and I think fundamentally I'm not even, let's put aside the limitations of verification because there are always limits

0:26:55 - 0:27:00

and I understand that we can, you know, that in those cases what are you going to do? You can't

0:27:00 - 0:27:08

formally verify, the tools don't work, the mathematics is too complex, et cetera. There's always the limit. But if we put aside the limit, I just focus on the concept.

0:27:09 - 0:27:13

We're in an arms race. The person on the other side is getting smarter and smarter at a rate

0:27:13 - 0:27:18

which we cannot possibly keep up with as humans. The only thing you can try to do is go for

0:27:18 - 0:27:25

the absence of bugs. And formal verification is the best known method for proving the absence of

0:27:25 - 0:27:31

certain classes of bugs. And yes, you can't prove everything, but you've got to try because tomorrow

0:27:31 - 0:27:38

the next agent update is going to come. And no matter how many hours of well-guided fuzzing you

0:27:38 - 0:27:43

did, something is missing and you don't know if the model is going to find it. And it just comes

0:27:43 - 0:27:49

down to this. We're in an industry where we are, you know, putting out contracts that have no real defense

0:27:49 - 0:27:54

in depth. Yes, you can code it in, but there are no multiple layers of protection like you have in

0:27:54 - 0:27:59

every other industry that monitors and manages this kind of money. It's like putting the visa

0:27:59 - 0:28:05

transaction system with the code out there to be exploited, no firewalls, no nothing,

0:28:06 - 0:28:11

you know, here you go. So the level to which we have to try to prove these contracts correct

0:28:11 - 0:28:18

is drastically different, especially in a world where, you know, we don't have visibility

0:28:18 - 0:28:23

into how good the model is going to be tomorrow. So to me, it's just a matter of responsibility.

0:28:23 - 0:28:25

Is the world perfect? Can we all

0:28:25 - 0:28:30

write great specs? No. Agents are getting better and better at it, although I agree with everyone

0:28:30 - 0:28:34

who says they're not that good yet. But there's many things we can do to make them better, and we

0:28:34 - 0:28:40

should. But as a matter of responsibility, you have to try and get as close as you can to verifying

0:28:40 - 0:28:47

everything that you can. Yeah, I definitely buy that. And I think, you know, this industry or smart contracts

0:28:47 - 0:28:49

have always had the most adversarial threat model

0:28:49 - 0:28:50

of maybe any software out there.

0:28:50 - 0:28:51

You know, it's open.

0:28:52 - 0:28:53

Anyone can link to it and call it.

0:28:53 - 0:28:54

And that's sort of the point.

0:28:54 - 0:28:55

It manages money, all these things.

0:28:55 - 0:28:58

So the fact that your answer didn't change from 25 to 26,

0:28:58 - 0:28:59

I think makes sense.

0:28:59 - 0:29:00

It's just like now the threat model just gets worse

0:29:00 - 0:29:05

because the adversaries are good and getting better faster.

0:29:06 - 0:29:07

You know, the defenses are too.

0:29:08 - 0:29:12

So I wanted to move into talking about sort of the consequences of all of this.

0:29:12 - 0:29:15

So I'll start with a little bit of personal perspective on what we're seeing.

0:29:15 - 0:29:20

You know, we've run a bug bounty program for SWE since the very beginning.

0:29:20 - 0:29:21

It's on hack and proof.

0:29:22 - 0:29:24

You know, all these stats I'm talking about, most of these stats I'm talking about are public.

0:29:24 - 0:29:25

You can see, like, it's paid out quite a bit over the, all these stats I'm talking about, most of these stats I'm talking about are public, you can see like it's paid out quite a bit

0:29:25 - 0:29:30

over the years, like 2.3 million over three years that we've had a lot of great relationships

0:29:30 - 0:29:33

with the White Hats, a lot of great reports, it's an excellent resource for us. Something

0:29:33 - 0:29:38

we've been seeing recently is definitely since LLMs came out, there have been more reports,

0:29:38 - 0:29:41

it's early on, it's mostly slop, I would say actually, you know, almost all slop, it's

0:29:41 - 0:29:44

just elevated volume. But the number we've been looking at recently that is interesting

0:29:44 - 0:29:46

is over the lifetime of the program,

0:29:46 - 0:29:49

we've only had three duplicate reports over three years.

0:29:49 - 0:29:53

And then we had a month where there were eight in one month.

0:29:53 - 0:29:55

Now, why does a duplicate report happen?

0:29:55 - 0:29:57

Duplicate report happens because the bug bounty program

0:29:57 - 0:30:01

scope and the model are put, like you take the bug

0:30:01 - 0:30:02

bounty program scope and the code,

0:30:02 - 0:30:04

and you put it into the model, and you get roughly the same thing.

0:30:04 - 0:30:05

Or maybe you have a fancier harness.

0:30:05 - 0:30:06

You know, this is our thesis,

0:30:06 - 0:30:08

and I think that this is probably true.

0:30:08 - 0:30:10

And so it's like clearly, like, people are taking stuff,

0:30:10 - 0:30:12

and I mean, valid duplicate reports,

0:30:12 - 0:30:14

and finding more things.

0:30:14 - 0:30:16

So it's like, hmm, that's sort of been

0:30:16 - 0:30:17

one of the wake-up calls to us

0:30:17 - 0:30:20

that people are really doing this,

0:30:20 - 0:30:22

and it's working a lot better than it did before.

0:30:23 - 0:30:27

So, you know, the question I have related to this then is,

0:30:27 - 0:30:29

in 2026, crypto bug bounty programs

0:30:29 - 0:30:32

are the best ethical ways to convert subsidized tokens,

0:30:32 - 0:30:35

e.g. your cloud code and your codex max plans into dollars.

0:30:35 - 0:30:37

So the room is split 50-50 on this.

0:30:37 - 0:30:39

Ben, you said yes, what do you think?

0:30:41 - 0:30:51

So I was kind of interested in this question because it begs another question, like what's the best way to use an LLM right now?

0:30:52 - 0:31:11

I think a lot of people have been using LLMs to find bugs on programs like Immunify using like one shot or a couple shot prompts that basically say find the bugs or they give a list of vulnerability types and then like find a vulnerability type that matches one of these um and that's effective to some extent

0:31:11 - 0:31:16

and i think that's a big reason why we're seeing so many duplicate bugs because these are the these

0:31:16 - 0:31:23

are the bugs that the model has been rled for to be able to find um then the question is how do you

0:31:23 - 0:31:26

find the bugs that the model has not been RL'd for?

0:31:27 - 0:31:31

Because there are, so you said, mentioned that there's a couple of critical bugs that was found.

0:31:31 - 0:31:36

I guarantee there's probably more critical bugs, but the LLMs aren't going to find it with a single

0:31:36 - 0:31:47

shot. And I think for a really talented security researcher, they're able to take the model and push it into these areas of its exploration space

0:31:47 - 0:31:51

that the other people aren't going to be able to do with a single shot.

0:31:51 - 0:31:53

And we've seen people do that really effectively.

0:31:54 - 0:31:57

We've been starting to push people more internally to do bug bounties,

0:31:57 - 0:31:59

you know, for fun and stuff.

0:32:00 - 0:32:04

And the level of effectiveness that people have at being able to push these models

0:32:04 - 0:32:09

in directions where there isn't duplicate findings is really incredible.

0:32:09 - 0:32:14

But you have to really push beyond that initial, like, I just want to ask it to find the bugs.

0:32:16 - 0:32:18

What tips do you have for getting it out of that local maximum?

0:32:19 - 0:32:21

Don't assume that it's a thinking system.

0:32:20 - 0:32:25

assume that it's a thinking system. I think that it's very easy to make the mistake of thinking

0:32:25 - 0:32:32

that the system is able to think and reason and do things like that. It's best as a summarization

0:32:32 - 0:32:37

machine. So let's say you have a giant code base like the SWE node. There's probably like a million

0:32:37 - 0:32:42

lines of code in there. No single human could review all of that code in any reasonable amount

0:32:42 - 0:32:45

of time. But using a summarization machine like an

0:32:45 - 0:32:51

llm it might be able to point you at the highest risk most important areas to verify first and then

0:32:51 - 0:32:56

that's when you dive deep but if you started from the very beginning and you're like hey find all

0:32:56 - 0:33:01

the high severity bugs in the code base you don't have that extra guidance from the engineer saying

0:33:01 - 0:33:07

hey this is what i think is really important I know there's been like a bunch of results by Anthropic and them saying like,

0:33:07 - 0:33:13

when you do LLMs with like hybrid human LLM, like the performance isn't as good as just an LLM.

0:33:13 - 0:33:16

We haven't been able to corroborate that. So far for us,

0:33:16 - 0:33:20

human plus LLM is vastly, vastly better than either one alone.

0:33:22 - 0:33:24

Good for our job security in this room.

0:33:25 - 0:33:26

Klaus, you're on our job security in this room. Klaus,

0:33:26 - 0:33:27

you're on the other side of this question.

0:33:28 - 0:33:29

So what do you think is the best ethical

0:33:29 - 0:33:32

way to convert subsidized tokens into dollars

0:33:32 - 0:33:33

if it's not crypto bug bounty programs?

0:33:36 - 0:33:37

Oh, yeah.

0:33:37 - 0:33:39

Yeah, I think actually, yeah,

0:33:40 - 0:33:42

regarding this question, my take is more

0:33:42 - 0:33:43

like, you basically have

0:33:43 - 0:33:47

very, so I agree that the harness is important

0:33:47 - 0:33:50

and you essentially have, I know, basically,

0:33:50 - 0:33:52

both white hat teams and black hat teams

0:33:52 - 0:33:56

still trying to do the same task with different models, right?

0:33:56 - 0:34:01

And I guess, I think on both sides

0:34:01 - 0:34:03

you can also use all of this.

0:34:03 - 0:34:05

So basically that's just the fact that you have

0:34:05 - 0:34:13

these discounted prices for models is just kind of the substrat of everything that we do, right?

0:34:14 - 0:34:23

And I think that our jobs as security researchers are just to be much, much better than

0:34:20 - 0:34:25

to be much, much better than kind of the other side

0:34:26 - 0:34:27

that using the models.

0:34:27 - 0:34:31

And what's the magic of getting much better there?

0:34:31 - 0:34:33

Yes, that's a kind of open question

0:34:33 - 0:34:35

and we're all experimenting, I guess, internally.

0:34:37 - 0:34:39

But the crux is to be so much better

0:34:39 - 0:34:44

than that we are hopefully one model generation ahead

0:34:49 - 0:34:56

in terms of finding bugs versus let's say an off-the-shelf cloud code because if we are not then when a new model lands we will essentially

0:34:56 - 0:35:03

have a significant problem so basically i'm kind of that's how i see our job as a company be

0:35:03 - 0:35:05

be able to be one generation ahead

0:35:05 - 0:35:07

and then be able to react very quickly

0:35:07 - 0:35:09

to changes in your environment

0:35:09 - 0:35:12

in terms of both the models themselves

0:35:12 - 0:35:13

and the prices of the models.

0:35:15 - 0:35:15

Yeah.

0:35:16 - 0:35:19

I want to dig into the part about the new model releases

0:35:19 - 0:35:20

since there's a question about this.

0:35:20 - 0:35:21

I think it's very interesting.

0:35:21 - 0:35:22

The yes-no question I asked,

0:35:22 - 0:35:24

and this one actually didn't split the room,

0:35:24 - 0:35:27

but I think it's still a good question to discuss is,

0:35:27 - 0:35:29

in 2026, a team with a great 48 hour

0:35:29 - 0:35:32

new model release playbook and no pre-launch audit

0:35:32 - 0:35:34

is safer than a team with a top tier audit and no playbook.

0:35:34 - 0:35:37

And so what I mean by a 48 hour new model release playbook

0:35:37 - 0:35:38

is sort of like the new model drops,

0:35:38 - 0:35:40

like what is your team doing right now?

0:35:40 - 0:35:42

If you're not in the model pre-release program,

0:35:42 - 0:35:44

which obviously it's preferable to be in there

0:35:44 - 0:35:47

and do this before, you know, certainly every team should try to do that, but it's just

0:35:47 - 0:35:51

not going to be possible for everybody. I think for layer ones, you definitely can and should be

0:35:51 - 0:35:55

if you're not already. But anyway, so yeah, so what is the playbook that you start running then?

0:35:56 - 0:36:00

And I think like the, everyone said no, everyone said no, like, you know, you want the audit instead

0:36:00 - 0:36:06

of the playbook, you want the audit and no playbook instead of like good playbook, but no audit,

0:36:06 - 0:36:08

which it makes sense as auditors.

0:36:08 - 0:36:11

But I think my, so my framing would be,

0:36:11 - 0:36:13

yeah, sorry, go ahead.

0:36:13 - 0:36:15

Actually I would say you actually need both.

0:36:15 - 0:36:17

Yeah, of course, of course.

0:36:17 - 0:36:19

I have to try to split the room

0:36:19 - 0:36:22

so I make it a question without nuance.

0:36:22 - 0:36:25

But so the way I would think about it is,

0:36:29 - 0:36:30

are you the auditor with today's model better than the totality of the white hats with tomorrow's model?

0:36:31 - 0:36:32

And that's another way to think about the question.

0:36:33 - 0:36:33

And there it's like,

0:36:34 - 0:36:36

the answer seems like the totality of white hats with tomorrow's model are

0:36:36 - 0:36:39

better. So then, yeah,

0:36:39 - 0:36:42

maybe you actually care more about the playbook than the audit.

0:36:42 - 0:36:43

But of course the right answer is both.

0:36:45 - 0:36:49

So anyway, I won't, I won't call on anyone for that one since everyone has...

0:36:49 - 0:36:49

I think that one.

0:36:50 - 0:36:50

Please, please.

0:36:50 - 0:36:52

Okay, tell us then.

0:36:52 - 0:36:52

Tell us.

0:36:53 - 0:36:57

I think in the blockchain industry, and I'm sure everyone here will agree,

0:36:57 - 0:36:59

there's this perception that if you get an audit,

0:37:00 - 0:37:02

that means that there's no bugs in your code base.

0:37:03 - 0:37:07

And we constantly have to spend time educating clients saying,

0:37:08 - 0:37:11

listen, if we found three or four critical bugs during your audit,

0:37:11 - 0:37:14

there's probably a lot more and it's not safe to deploy.

0:37:15 - 0:37:18

And there's also qualitative guidance.

0:37:19 - 0:37:22

We might tell them, hey, your test coverage isn't great.

0:37:22 - 0:37:23

Your complexity management isn't great.

0:37:23 - 0:37:26

You need a private key management strategy. And if,

0:37:26 - 0:37:29

if the client comes through, they just fix their bugs.

0:37:29 - 0:37:33

They don't follow any of that guidance. I think they're totally toast.

0:37:33 - 0:37:35

It doesn't matter who gets, who audits them.

0:37:35 - 0:37:37

If they're not taking that guidance to heart, it's not going to matter.

0:37:38 - 0:37:43

The job of an audit isn't to be better than the white hats with that next

0:37:43 - 0:37:44

level of model.

0:37:44 - 0:37:48

It's to make sure that the development process is going to produce a code base

0:37:48 - 0:37:52

that is secure against the white hats plus the next generation model.

0:37:52 - 0:37:56

And if your audit can't produce that, then that's when you run into issues.

0:37:57 - 0:37:59

I would second that emphatically.

0:37:59 - 0:38:02

We are not in the business of finding all bugs.

0:38:02 - 0:38:04

And if we think we are, we are wrong.

0:38:04 - 0:38:06

We're in the business of finding all bugs. And if we think we are, we are wrong. We're in the business, I mean, an audit

0:38:06 - 0:38:09

in the traditional sense in the financial world

0:38:09 - 0:38:11

is an audit of process and numbers,

0:38:11 - 0:38:13

but it's not just the numbers.

0:38:13 - 0:38:17

And it's like, we should be heavily invested

0:38:17 - 0:38:20

in ensuring that our teams think end to end about security.

0:38:20 - 0:38:23

And they view it as a whole company initiative

0:38:23 - 0:38:25

that requires security as a first principle

0:38:25 - 0:38:27

and compromise as unacceptable

0:38:27 - 0:38:30

when it comes to the safety of their systems.

0:38:30 - 0:38:35

And that has to be resilient to the next model and beyond.

0:38:35 - 0:38:39

So it's like, it can't be just about the bugs.

0:38:39 - 0:38:40

And I think that's the point you're making,

0:38:40 - 0:38:42

but I absolutely agree.

0:38:43 - 0:38:45

Totally, I buy that too.

0:38:45 - 0:38:50

And let me connect this to a different question, which is one that people also all agreed on,

0:38:50 - 0:38:51

which is interesting.

0:38:52 - 0:38:56

In 2026, the role of a useful audit shifts from inspecting the code towards building

0:38:56 - 0:38:59

the invariants and scaffolding that will be used in red teaming in the future.

0:38:59 - 0:39:03

Now, to put it sort of crassly, maybe even the past people would have thought that an

0:39:03 - 0:39:06

audit is like the deliverable of an audit is a PDF with some bugs or something like that.

0:39:07 - 0:39:09

I'm sure none of you would say that, but I think teams maybe think of it that way.

0:39:09 - 0:39:18

But everyone here said like, yes, the role of an auditor is shifting from that old model or some better version of that old model to this new thing that's about scaffolding and invariance.

0:39:18 - 0:39:25

So Robert, tell me about the shift. How's the work that OtterSec does in 26 different than, you you know say you were doing in 2024 if you're

0:39:25 - 0:39:30

thinking about invariance and scaffolding as the main output instead of a pdf yeah yeah i guess the

0:39:30 - 0:39:35

way i took this question maybe was a bit more broadly um so actually i'm a little bit tired

0:39:35 - 0:39:40

today one reason for that is because there was a really big hack two days ago um just got hacked

0:39:40 - 0:39:49

for 50 million dollars um like quarter billion dollars quarter billion dollars. So that took my day, and I stayed up all night helping them.

0:39:50 - 0:39:53

And I think the really interesting thing about the hack is, as we all know, it was not a

0:39:53 - 0:39:54

smart contact hack, right?

0:39:55 - 0:40:00

Even though Drift had a relatively complicated code base, the part that hit them in the end

0:40:00 - 0:40:02

was a multi-state compromise.

0:40:02 - 0:40:05

And I think that's a trend that we're seeing as ecosystems

0:40:05 - 0:40:11

mature, right? Which is like, as I mean, I think this particular hack was inspired a lot by Bybit,

0:40:12 - 0:40:17

right? But as the code base gets better, hackers look for alternative venues, right? And at least

0:40:17 - 0:40:21

the way I took this question is more like, hey, there's a lot of different ways that hacks could

0:40:21 - 0:40:25

happen. And if we think about the totality of the risk,

0:40:25 - 0:40:29

AI might be really good at finding smart contract bugs, but if that is the case, then maybe the

0:40:29 - 0:40:33

weakest link moves somewhere else, right? And hackers want to go for the weakest link,

0:40:33 - 0:40:40

they don't need to break everything. So whether that's writing invariants to secure the code,

0:40:40 - 0:40:46

or whether that's working on op-sec practices with the team. I think there's a lot more that humans, you know,

0:40:46 - 0:40:49

hopefully can still help with to make teams more secure.

0:40:53 - 0:40:56

Colin, since you guys do mostly verification,

0:40:56 - 0:40:58

I hear the wisdom of what Robert's saying.

0:40:58 - 0:41:02

Do you work with teams on verifying OPSEC properties

0:41:02 - 0:41:03

outside of the smart contracts

0:41:03 - 0:41:05

as well

0:41:05 - 0:41:07

as the, you know, the core contract invariants?

0:41:07 - 0:41:11

Like how do you think about that where you're securing one part of the risk, but you know,

0:41:11 - 0:41:12

it might be moving around?

0:41:12 - 0:41:18

Yeah, I mean, for now we are focusing strictly on the mobile also on the smart contract itself.

0:41:18 - 0:41:25

But I think there's a lot you can do also in the smart contract itself including including to kind of put circuit breakers in

0:41:25 - 0:41:32

case of multi-sig failure or human multi-sig failure and you can you can there is a lot that

0:41:32 - 0:41:39

can be done in code um and i'm actually okay so maybe we're very small and naive and we're here

0:41:39 - 0:41:44

but it's uh i'm kind of thinking that our so we we cannot really promise our customer that

0:41:40 - 0:41:49

But I'm kind of thinking that we cannot really promise our customer that perfection in terms of security, at least from a legal perspective.

0:41:49 - 0:42:00

But I do think that we are for smart contracts on SUI specifically, with SUI being easier to verify than others.

0:42:00 - 0:42:07

others. I think we are approaching and maybe we're a few months away even from a world where we can

0:42:07 - 0:42:15

make at least the code of the smart contract completely resilient against, and provably resilient,

0:42:15 - 0:42:22

against catastrophic failure. So yes of course you can always have nuances like oh does the reward

0:42:22 - 0:42:25

model allow a bit too much to be drained

0:42:25 - 0:42:27

out of a DeFi contract or whatever, right?

0:42:27 - 0:42:34

So not, there's nuance, but at least I think we are probably a few months away from having

0:42:34 - 0:42:38

proofs of lack of catastrophic failure.

0:42:38 - 0:42:42

And yeah, and I think we should do it.

0:42:42 - 0:42:45

Yeah, it can't come soon enough.

0:42:45 - 0:42:48

I want to close by going around the horn on a question for everyone.

0:42:48 - 0:42:51

This is something where everyone said no, but I think the answer, the specifics of the

0:42:51 - 0:42:53

answer will be interesting.

0:42:53 - 0:42:57

And so the question is, in one year coding agents will advance to a point where find

0:42:57 - 0:43:00

all the critical vulnerabilities in this code, you know, the sort of trivial-ish prompts

0:43:00 - 0:43:04

with a sufficient token budget will work and it will be indistinguishable for a more sophisticated

0:43:04 - 0:43:08

harness. It's definitely not true today, but even now, like, you know, if you watch,

0:43:08 - 0:43:12

say, this viral Nicholas Carlini talk, Security Research and Anthropica, where he talks about what

0:43:12 - 0:43:17

they do, it's sort of like the file by file harness where you say, file one, find the bugs in this,

0:43:18 - 0:43:21

find the bugs that are here. File two, find the bugs that are here. So my argument would be,

0:43:22 - 0:43:28

for any bug, there's one or more files that contain the code for that bug. As you get smarter, you'll eventually be able to work backward from a

0:43:28 - 0:43:33

suspicious location to the other things that are relevant and find all the bugs. It doesn't work

0:43:33 - 0:43:37

today, but what stops it from working tomorrow? Or does everyone think that it will work in my

0:43:37 - 0:43:42

one year was just too aggressive of a prediction? Maybe I'll start with you, Seth.

0:43:40 - 0:43:41

of a prediction.

0:43:41 - 0:43:42

Maybe I'll start with you, Seth.

0:43:46 - 0:43:48

Maybe it is that you're one year too aggressive.

0:43:48 - 0:43:50

I mean, I think as models get smarter and smarter,

0:43:50 - 0:43:52

of course, finding all the bugs

0:43:52 - 0:43:53

is something you could ask them to do

0:43:53 - 0:43:55

and maybe they'll be able to do it.

0:43:55 - 0:43:57

But also have just a general belief

0:43:57 - 0:44:00

that we think we know more than we really know.

0:44:00 - 0:44:01

And that's always been the case in security

0:44:01 - 0:44:04

and it's been proven time and time again.

0:44:04 - 0:44:06

And it's like every attempt to ever map out

0:44:06 - 0:44:08

the comprehensive set of all possible hacks

0:44:08 - 0:44:10

against any system has always failed

0:44:10 - 0:44:14

when someone came up with the next system in the next way.

0:44:14 - 0:44:17

And so it's sort of embedded in that is an assumption

0:44:17 - 0:44:20

that it's a matter of the LLMs getting good

0:44:20 - 0:44:23

at figuring out how to get to the bottom of everything

0:44:23 - 0:44:24

that we know to be there.

0:44:24 - 0:44:26

But I think there are vulnerabilities we don't know are really there.

0:44:27 - 0:44:28

And so we're just shifting the surface.

0:44:28 - 0:44:33

Yeah, all the smart contract bugs might be found given the scope of knowledge

0:44:33 - 0:44:35

that we have now about how smart contracts can fail.

0:44:35 - 0:44:39

But people will come up with a new way to exploit as long as it's profitable to do so.

0:44:40 - 0:44:43

And it could be a type of bug that we never considered before.

0:44:43 - 0:44:51

And I have yet to, you know, and do I believe in a year that an LLM can properly explore the full space of everything we've never considered before?

0:44:51 - 0:44:53

I think that's where it breaks down for me.

0:44:54 - 0:44:55

Yeah, I think that's pretty interesting.

0:44:55 - 0:45:01

You know, the problem of find all bugs, given that you know the sort of bug types that you're looking for,

0:45:01 - 0:45:05

like find all buffer overflows, given that buffer overflows exist, and you know sort of know how they work,

0:45:05 - 0:45:06

is a different kind of thing than saying,

0:45:06 - 0:45:08

invent the buffer overflow, which had to be invented.

0:45:08 - 0:45:10

You know, it's not like somebody, you know,

0:45:10 - 0:45:12

it's not like a, right,

0:45:12 - 0:45:14

there's a security paper that created that,

0:45:14 - 0:45:15

and said there may be many other things like this,

0:45:15 - 0:45:16

and that of course are more complicated,

0:45:16 - 0:45:17

that aren't just software,

0:45:17 - 0:45:19

that are interacting processes,

0:45:19 - 0:45:22

oracles, code, and all of that.

0:45:22 - 0:45:24

Ben, what's your perspective?

0:45:24 - 0:45:25

Are we going to get to find all the bugs with

0:45:25 - 0:45:31

a trivial prompt? Maybe for like an arbitrarily small program, like something that's so simple

0:45:31 - 0:45:38

that you could formally verify the whole thing. So what's the point anyways? I think once you get

0:45:38 - 0:45:44

to like a non-trivial program, the problem's really coverage. Like these vulnerabilities

0:45:44 - 0:45:48

scanning, like the code agents and all this stuff, they're

0:45:48 - 0:45:50

really good at finding vulnerabilities.

0:45:50 - 0:45:51

They can find critical vulnerabilities.

0:45:52 - 0:45:55

I'm sure this next one that's going to come out is going to be way better at finding critical

0:45:55 - 0:45:57

vulnerabilities, but it always comes back to coverage.

0:45:58 - 0:45:58

What's your coverage?

0:45:58 - 0:45:59

What do you actually look at?

0:45:59 - 0:46:01

What vulnerabilities did you look at?

0:46:01 - 0:46:05

And do you know what your unknowns are?

0:46:05 - 0:46:07

And are there any unknown unknowns remaining?

0:46:07 - 0:46:10

And like, you can't even get that out of a human.

0:46:10 - 0:46:12

I don't think there's any reason to believe

0:46:12 - 0:46:13

that we'll get it out of an LLM.

0:46:15 - 0:46:18

Yeah, coverage is hard.

0:46:18 - 0:46:19

Koss, I want to hear a perspective

0:46:19 - 0:46:20

and then we'll close with Robert.

0:46:22 - 0:46:24

Yeah, well, I think that basically

0:46:24 - 0:46:27

there are two aspects here. One is agents looking for bugs,

0:46:27 - 0:46:33

right? And in that case, it's more of a stochastic process. And I think it's very hard,

0:46:33 - 0:46:37

especially for real sized systems, for the language models to actually find everything,

0:46:37 - 0:46:42

even in the future generations. I do think that we're going to push the boundary of formal

0:46:42 - 0:46:45

verification from small projects to

0:46:45 - 0:46:52

actually reasonably sized projects and even things maybe five years from now on the size of the Linux

0:46:52 - 0:46:58

kernel. So basically I think that this boundary will keep being pushed because of agents. And

0:46:59 - 0:47:05

then for those levels we have essentially in way, agents proving perfect security or finding all the bugs.

0:47:06 - 0:47:12

But yeah, it's not in their bug-finding mode, but more in their formal verification mode.

0:47:13 - 0:47:20

I think thinking of the, like, formally verify all the important, specify and formally verify all the important properties of this code is an interesting tool to think about.

0:47:20 - 0:47:21

Like, is that reached first?

0:47:21 - 0:47:22

You know, it gives you sort of more exhaustiveness.

0:47:22 - 0:47:26

And there's definitely exciting advances and like sort of jaw dropping moments there.

0:47:26 - 0:47:29

Like Leo DeMora started writing these blog posts

0:47:29 - 0:47:30

that are unbelievable about Lean

0:47:30 - 0:47:33

and his vision for verifying all the things.

0:47:33 - 0:47:35

And some of these results like, oh, Zlib,

0:47:35 - 0:47:37

like verify that zip is the inverse of unzip

0:47:37 - 0:47:38

and stuff like that.

0:47:38 - 0:47:39

Like that's really, really cool stuff.

0:47:39 - 0:47:41

And yeah, I'm excited to see it scale up

0:47:41 - 0:47:45

to Linux kernel size efforts to other large programs. But I'm not convinced, Sam scale up to Linux kernel size efforts or to other large programs.

0:47:49 - 0:47:49

But I'm not convinced, Sam, just to break the order here,

0:47:53 - 0:47:57

that this solves spec completeness, which is always a huge, huge problem. It's like, are you sure that even if you tell the LLM to create the perfect spec,

0:47:57 - 0:47:58

it really is the perfect spec?

0:47:59 - 0:47:59

How do you know?

0:48:00 - 0:48:03

The way you know is that somebody tomorrow finds a way to break your code.

0:48:03 - 0:48:07

And so it shifts the race, but the race is still on.

0:48:08 - 0:48:11

Spec completeness is really difficult to prove.

0:48:11 - 0:48:15

And I don't think that anybody has figured out how to prove that a spec is actually complete.

0:48:16 - 0:48:16

Yeah.

0:48:17 - 0:48:20

Even for the trivial examples, when I'm talking to people about formal verification,

0:48:20 - 0:48:24

I'll always ask, give me the spec for sorts example.

0:48:24 - 0:48:25

And they'll always get

0:48:25 - 0:48:28

the part about like, oh, the elements in the right order, correct. But they'll always miss the, oh,

0:48:29 - 0:48:32

and it's the input is a permutation of the output part. And then it's like, oh yeah,

0:48:32 - 0:48:35

this specification thing is hard and sort of counterintuitive. And of course, it only

0:48:35 - 0:48:40

gets trickier when you go for like more complex programs and properties. So yeah,

0:48:40 - 0:48:45

given the correct specs, we may be able to verify all the things with agents, the correct specs,

0:48:45 - 0:48:47

well, maybe humans will still have a role for some things.

0:48:49 - 0:48:50

Robert, what do you think?

0:48:50 - 0:48:51

Are we going to be able to find all the bugs

0:48:51 - 0:48:54

just by saying something pretty trivial?

0:48:55 - 0:48:58

Yeah, maybe I have a slightly different perspective here.

0:48:58 - 0:49:02

I mean, I think it's fair to say that LMs

0:49:02 - 0:49:08

are unlikely to find all the bugs, right? Because they might have the right context, or they might not understand the code base.

0:49:08 - 0:49:11

But I also feel like the same could be said about humans.

0:49:12 - 0:49:21

So at least the way I took your question, I took it like, will it be the case that LLMs are roughly equivalent, or maybe even arguably superior than humans?

0:49:22 - 0:49:25

And I mean, with the rate that AI is increasing,

0:49:25 - 0:49:31

like maybe, right? And I think it's definitely possible that it could find the bugs. And

0:49:31 - 0:49:37

I think where I disagree with this a bit is what a bug actually means. And I think, you

0:49:37 - 0:49:43

know, this is like an example on my mind, it's a recent, right? But like the drift thing

0:49:43 - 0:49:49

was examples that was not really a bug, right, like they sort of accepted that this admin multi-sig could control markets,

0:49:49 - 0:49:53

and, you know, with normal operations, assuming the admin multi-sig wasn't compromised,

0:49:54 - 0:49:58

that was totally safe, right, but you could argue also that like the fact that this multi-sig

0:49:58 - 0:50:03

existed was a bug, and I think that's something where oftentimes developers don't even know

0:50:03 - 0:50:07

themselves, right, like what is correct or what counts as a bug.

0:50:08 - 0:50:13

And I think my perspective is like for all the common bug classes or for anything that's like

0:50:13 - 0:50:17

a trivial loss of funds or anything that is clearly wrong, I think AI will actually within

0:50:20 - 0:50:26

a year or two years or however long it is, but I think it will be really good at finding those.

0:50:26 - 0:50:28

I think the points are to you and still have a chance.

0:50:28 - 0:50:34

I wonder if we should build a bit more effort, is it actually a bug?

0:50:34 - 0:50:37

Or what are your operational security practices?

0:50:37 - 0:50:47

Is this something that, like if you have this superconable hostage, is your object secure enough or are your people secure enough to actually enable that?

0:50:47 - 0:50:51

I guess I'll say sorry about my phone.

0:50:51 - 0:50:55

Robert's hotel Wi-Fi has turned into an agent.

0:50:55 - 0:50:58

But I think that's a good place to close.

0:50:58 - 0:51:01

And I like the, you know, the drift thing is an interesting one, I mean, because it's

0:51:01 - 0:51:04

top of mind, but it sort of reminds me of something Seth was saying earlier.

0:51:04 - 0:51:05

It's like sometimes something's a bug,

0:51:05 - 0:51:07

and sometimes something's just maybe unacceptable,

0:51:08 - 0:51:09

lack of defense in depth.

0:51:09 - 0:51:10

And, you know, maybe that's sort of where we get to,

0:51:11 - 0:51:12

is like we get better at finding the bugs

0:51:12 - 0:51:13

and more talking about, like, you know,

0:51:13 - 0:51:14

given something that could go wrong,

0:51:15 - 0:51:16

like what are the layers of defense that we have?

0:51:16 - 0:51:19

And that also seems like an infinite regress

0:51:19 - 0:51:21

that, you know, we'll spend a lot of time climbing.

0:51:21 - 0:51:23

Guys, thank you so much for this substantial conversation.

0:51:23 - 0:51:25

I really appreciate you engaging with the questions and sharing with your perspective and expertise. It was a lot of fun for. Guys, thank you so much for this substantial conversation. I really appreciate you engaging with the questions

0:51:25 - 0:51:27

and sharing with your perspective and expertise.

0:51:27 - 0:51:28

It was a lot of fun for me.

0:51:28 - 0:51:29

I hope you enjoyed it too.

0:51:30 - 0:51:31

Thanks for having us.

0:51:33 - 0:51:34

Catch you later.

Full Transcription