Why the Future of AI Runs on Data

Recorded: Sept. 9, 2025 Duration: 1:01:39
Space Recording

Short Summary

In a dynamic discussion on the intersection of AI and blockchain, experts highlight the critical role of decentralized data infrastructure in shaping the future of AI, emphasizing growth opportunities for Filecoin and the need for verifiable data solutions.

Full Transcription

Thank you. Thank you. Thank you. Hello, hello.
Test, test, test.
One, two, three.
I can hear that.
Excellent.
Am I coming in clearly?
Hi, Vuk. Can you hear us okay?
Yeah, yeah. I think we are all here.
And Aaron, I'm going to hand it over to you, as you are the best moderator here, so we can start now.
Okay, cool.
Okay, thank you.
You want to give a couple minutes for folks to file in or should we just start now?
I think we can start now.
Start now?
All right.
Yeah, well, I suppose we'll take a few minutes to introduce our subject, introduce our panelists, etc.
And then as folks file in, we can dive into the conversation.
Cool. Well, I appreciate the opportunity to be here, and thanks to everyone for showing up, in whatever part of the world you're in, whether it's morning or evening. But yeah, so today's space is, and I apologize, I'm having a bit of a respiratory issue, kind of a nasty cough. It's peak dry season where I am right now, and if I have a weird coughing outburst or something, hopefully I'll be okay. But I apologize in advance. But yeah, so today's space is called
Why the Future of AI Runs on Data, right?
And this is a really exciting topic
that is very relevant to the Filecoin ecosystem.
It's something we've been talking about,
well, for many years, I guess.
And I think it's one of these areas where,
you know, folks in Filecoin were talking about this
long before all the normie people were talking
about it, right? This is something that we've been talking about since well before ChatGPT
came along and everyone started paying attention to this. So we're going to kind of dive into some
of the intersections of what Filecoin is doing and also why this is relevant for AI and talk about
some of the other use cases, talk about some of the other developments where there's overlap, and perhaps why decentralized data infrastructure
is such an important concept when we're talking about the future of AI. So as I mentioned,
I'm Aaron Stanley. I'll be hosting the conversation today. I do appreciate the invitation to be here.
And maybe, I mean, maybe we can just start off with kind of a quick round of intros.
So our panelists here today, we have Vuk Vukoye of, I'm sorry, Vuk, I can never pronounce your name like 100% correctly. But Vuk is, well, everyone knows Vuk, right? You can go by Vuk. So yeah, Vuk of Ramo, Web3Mine,
which is really focused on bringing together people, hardware, and capital to build a more open internet.
VOOC's been kind of a major player
in the Filecoin world for a long time
in a variety of different roles,
which maybe you can give us some context on that,
kind of your journey in Filecoin land.
And then we also have Carson Farmer of Recall Labs, which was formerly Textile.
Some of you probably recognize the name Textile.
They've also been kind of mainstays of the Filecoin ecosystem for quite some time.
And Recall Labs is the team behind RecallNet, which is a decentralized network for agentic
intelligence.
So maybe, Vuk, maybe to kick us off, why don't you give us an introduction of yourself and
talk about your current project with Ramo, Web3Mine, and maybe give us some more context on just what you've been doing in the Filecoin world and maybe beyond over the last couple of years.
So I'll try to be super concise. All right, so I did start doing a lot of mining back in 2013.
This was before Ethereum, so mined Litecoin a bunch.
Made a company around that, sold that company.
Then in 2017, I started building dev tools in the Ethereum ecosystem,
so I built a lot of EVM stuff. Then I led smart contracts for Cardano, and then ultimately I joined PL, where I led a bunch of the stuff around miners and basic infrastructure. Today,
what we do with Ramo is we build a protocol that basically abstracts away the complexity of doing Filecoin, whether that is in the context of locking liquidity or in the context of storing data on Filecoin.
We abstract this complexity with different products,
and we basically allow users to either store their massive amounts of data, like hundreds of petabytes,
or stake liquidity without having
to actually do the mining, or basically provide infrastructure without having to do the sealing
and so on.
Ultimately, we are trying to reduce the complexity of Filecoin and allow, like, more people to
do Filecoin.
And then, Carson, maybe we'll turn it over to you.
You've got a super interesting background. You've got an academic background as well. So I'd love for you to tell us a bit about yourself and what you guys are working on. Give us some background maybe on textile and Recall Labs as well, if you would.
Carson. I am a co-founder and CTO at Recall Labs. And so Recall is the on-chain arena for
evaluating, ranking, and rewarding agents. That's our tagline. I can go into what all that actually
means later. So yeah, in a past life, I was a university professor. I was at a couple different
places, but more recently, the University of Colorado Boulder.
And I left that way back in 2017, around when we joined the Filecoin and IPFS ecosystem.
And back then we were doing on-device machine learning stuff way before it was cool. And nowadays I lead a lot of the like R&D and engineering at Recall where we're sort of
designing and building the core systems to try and make AI evaluation systems more transparent and
rigorous and community powered. So we spend a lot of time thinking about how do we measure and govern AI systems and how do we capture and create audit trails to make sure that humans and AI are staying aligned.
We spent a lot of time thinking about that.
In a separate past life, yes, I was with Textile, or still am with Textile. We just rebranded a bit. And so we've been in the Filecoin and IPFS ecosystem, or the PL network, for a long time, since just before the Filecoin ICO. And we've been building various different developer infrastructure. You may remember things like Powergate or the Filecoin Bridge or Bidbot and all sorts of different developer tooling that eventually kind of ends up getting rolled into Filecoin in some way, shape, or form. And then most recently, we were engaging a fair bit with IPC and some of the Filecoin scaling technology.
And so, yeah, happy to be here chatting about data.
It's always kind of a nice topic and we've got a lot to talk about.
So looking forward to it.
Cool. Thanks for that.
Isabella, did you want to introduce yourself and the Filecoin TLDR team at all? Since you're hosting this, I always want to give you the chance to plug yourself if you so desire.
Sure, I'll just share a bit of context about what Filecoin is doing and about Filecoin TLDR. So as everybody here knows, Filecoin is doing storage for all the layers for AI and DePIN. And also, after we launched the FVM, we can bridge cross-chain with multiple other ecosystems, so that's super great. As for the Filecoin TLDR channel, everyone can just follow Filecoin TLDR to get all the news and ecosystem updates from Filecoin. You can also take it as an investor view: you can read all the data metrics from the Filecoin chain and ecosystem, plus all the third-party research reports, which you can use from your investor perspective to make your decisions, and to get customer insights across the different Filecoin ecosystems. That's all, but hi, everyone. I'm glad to join today's AI panel space, and I'm looking forward to Carson and Aaron and Vuk sharing all your insights about AI with Filecoin.
Thank you. Cool. Yeah, thanks, Isabella. Thank you for that information and thank you for the
opportunity to be here and for setting this up. And yeah, definitely check out the Filecoin TLDR handle and website.
They've got a lot of really good information. They kind of like dive a bit deeper into Filecoin,
some of the kind of more complex problems. And they do a really good job of kind of explaining
some of this stuff in plain English, let's say. Filecoin sometimes can be a little bit complicated
and maybe intimidating for folks who aren't super technical,
but they do a pretty good job of breaking all that down.
And then maybe I'll introduce myself really quickly last,
but probably least,
because I'm probably much less knowledgeable
on the subject than you guys.
But so my name is Aaron Stanley.
I am currently editorial director at
Filecoin Foundation, where I host our DWeb Decoded podcast, which is a podcast that really tries to
explore not just Filecoin, but we also talk about a lot of things that are kind of what we call
Filecoin adjacent, things that are not Filecoin specifically, but are very relevant to Filecoin
or where there's overlap with Filecoin. And I think this subject today is definitely one of those areas for sure.
Before this, I worked at CoinDesk for five years.
I was a reporter, editor.
I produced the consensus conference for a few years.
And then I made the jump over to Filecoin a couple of years ago.
Filecoin Foundation, I should say.
And then before that, I was doing mainstream media,
political reporting in Washington, DC. So I've sort of moved on from that into crypto, which is
maybe... Anyway, it's been an interesting transition, I guess we'll say.
But anyway, on to the subject at hand. I think the subject that we're trying to tackle
today is really a core challenge in AI, which is data.
And we all know, or I'm assuming all of you, most of you know that by now that AI models
and agents are really only as good as the data they're trained on, right?
And right now there are a lot of questions around accessibility and verifiability of data, for example data that's kind of locked up in some of these big tech silos, creating risks around things like transparency, reliability, bias, barriers to entry, all this kind of stuff. And I think in the Filecoin world, we like to envision a future where these AI systems do involve decentralized data infrastructure in some capacity, where things like storage, provenance, and access are more open, more verifiable, and more resilient due to the decentralized nature.
And Filecoin network obviously offers this decentralized and verifiable data storage infrastructure.
So maybe let's kind of dive in.
Maybe I'll turn it over to Vuk to kick off. Maybe just talk from a high level, from your vantage point and from the Ramo team's vantage point: how do you guys
see kind of this overlap between, like, why is there a need for something like what Filecoin is
building in the context of AI? Let's just maybe we'll frame it like that. Like, why is this
decentralized data infrastructure, storage infrastructure, such an important component
of the future of AI moving forward? We'll start at kind of a high level, and maybe we'll drill down a bit.
Sure. I would say, generally, very few organizations, networks, or communities have had the chance of getting to exabyte scale, let alone tens of exabytes, which has been shown in the context of Filecoin in a provable way, right?
Like you have PowerApp that is basically showing us
there is like 10, 20 exabytes of storage
that was allocated towards Filecoin. Now, historically, Filecoin has had difficulties leveraging this capacity.
So a lot of this capacity is basically, let's say, more inactive
because it's just committed capacity.
But ultimately, the point is that we were able to really show
that it's possible to aggregate tens of exabytes of storage.
And that has not actually been such an easy task
in the centralized context.
So if you think about a particular data center,
for example, data centers that the big AI labs are building today,
even those data centers will struggle to actually do 10 exabytes of capacity.
So in a way, what we're seeing is that scaling in a centralized context is becoming impractical.
And this is mainly caused by the fact that it's really impractical to get massive amounts of energy in one particular place, let alone for storage, which has its own physical constraints: it takes a lot of space, and it's super heavy. Hard drives are really heavy; if you put them in racks, these things weigh tons, and ultimately you need to put them somewhere. It's really hard to stack them on multiple floors, so very often you need a very wide area where these racks are spread around. So that's one thing: we basically showed that it's possible to create incentives that allow communities to bring together a lot of storage.
Well, on the other hand, we are seeing that putting more compute towards training or reinforcement is not actually linearly benefiting the quality of the new LLMs that are being created.
And ultimately, the next thing is going to be like,
how do we actually find more high-quality data
that we can actually use for training better and better models?
Or how do we actually get higher density of data, in the sense that you could have a video that is 480p and the same video that is 4K, and there is so much more information in the 4K video? How do we get to a point where, instead of always having 480p, we always have 4K? And how do we actually make sure that we can store more of this? If you look at all the data generated today, it's massive, but most of this data gets trashed. This is everything from logs of programs that run, to security cameras; most of the security cameras, after a while, basically delete the videos. And many others. So most of the content is actually getting deleted.
And what we've seen with Filecoin is that with the incentives that it has created, we
were able to reduce the cost of storage by an order of magnitude, which basically allows
anyone to just keep storing data without thinking as hard
as they had to think before,
like if this was on AWS or something like that.
But yeah, ultimately we showed that it's possible
to scale storage in a very horizontal way.
And on the other hand, there is a big need for actually storing more data than is being stored today, because we've kind of used up all the data there is, and now we need to figure out how to change the way we collect data, possibly collecting data that is not being stored today. I'll just
pause there.
Yeah, well, a quick follow-up there. I mean, it's interesting that you mentioned a point about how just adding more compute, just throwing more GPUs at these LLMs, doesn't necessarily improve the outcome, if I understood the point you were making correctly. I mean, I guess you're hitting diminishing returns in that regard?
Yes, yes, yes.
So it's not just, so if I'm
kind of reading between the lines of your point here,
and I'd like for you to maybe elaborate on this,
but like the solution here isn't just like throwing more GPUs at the problem,
but the solution here is like, how do you get more data?
How do you get better quality data ready or available
and ready to be actually used in these models?
Is that kind of the point you were making or is that what you were implying?
So there are basically two functions.
One is the GPU compute,
which has two parts.
One is the interactive one,
which is basically just in time.
So that's the reasoning part
that needs to be done for every query.
And then you have the other part,
which is basically the training, which is kind of shared by all the users, because ultimately one model gets used
many times. Now, what we're seeing is that the training, we're kind of hitting a point where
we are definitely not linearly getting more benefits. What's happening now is that we
are throwing more compute on the reasoning side of things just to increase the quality, but that
doesn't scale super well because you need to put more reasoning into each basically query. So like
you need to do that for each user. At the same time, on the data side, we have yet to really scratch the surface, and there is definitely a linear benefit. And it's also easier, I mean, not really easier, but it's something that has more impact if we focus on it, instead of just throwing more GPU at the problem and hoping for the best.
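The two compute functions Vuk describes can be sketched with hypothetical numbers (the FLOP counts below are made up; only the shape of the argument matters): training compute is amortized across every query a model ever serves, while inference-time "reasoning" compute is paid again on each query, so it scales with usage.

```python
# Illustrative cost sketch with made-up FLOP counts: training compute is
# amortized over all queries, reasoning compute is paid per query.

def cost_per_query(training_flops: float, reasoning_flops_per_query: float,
                   total_queries: int) -> float:
    """Average compute per query: amortized training plus per-query reasoning."""
    return training_flops / total_queries + reasoning_flops_per_query

# As usage grows, the amortized training term shrinks toward zero...
few_queries = cost_per_query(1e24, 1e12, total_queries=10**6)
many_queries = cost_per_query(1e24, 1e12, total_queries=10**12)
assert many_queries < few_queries

# ...but spending 10x more on per-query reasoning raises the cost for
# every single user, which is why that approach "doesn't scale super well".
more_reasoning = cost_per_query(1e24, 1e13, total_queries=10**12)
assert more_reasoning > many_queries
```

The asymmetry is the point: one model's training bill is shared by everyone, but extra reasoning has to be bought per query, per user.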
Got it. Maybe Carson, let's turn it over to you. Would love maybe your thoughts on the kind of the
big picture question I posed earlier, just kind of what do you see as the overlap between maybe
Filecoin, Filecoin's mission, and then building with recall. And then if you want to react to
anything that Vuk was saying,
we'd love some of your thoughts on that as well.
Yeah, cool.
Thanks for the prompt.
I mean, I think I can come at it from a slightly different perspective,
which is useful in the context of a discussion,
which is our team and our research is focused less on sort of like the raw data that goes to like actually drive
and build up these foundation models and more on the sort of like other side of the equation,
which is dealing with all of the outputs from these models, whether it's like reasoning outputs
or like actual like the raw text and tool calls and engagements and things that agentic systems are producing.
Because you can think about it like in general,
these foundation models are the models that are,
that's like a broad class of LLMs and multimodal models that are trained on just general corpora. This is like your GPTs and your Claudes and your Llamas. And these ones are designed to be
a sort of like raw intelligence of the system.
And as Vuk mentioned, like we're getting to the point
where we're starting to see diminishing returns on compute.
And in a lot of senses,
those foundation models are like effectively
starting to commoditize.
They're competing on price and they're competing on just general usability and utility.
And so then a lot of the interesting innovations need to start happening elsewhere
because, like Vuk mentioned, there are just sort of diminishing returns. Data collection is a hard problem. It's always been a hard problem. Data quality is just hard, because it's easier to do a general crawl and get as much data as possible and then dump it in and hope for the best. It's a lot harder to curate and then even manage that curated data set.
So the foundations, because of that, are sort of arguably commoditizing. So from our perspective, the interesting thing starts to be: okay, well, then why does a system like Filecoin need to exist in a world where we want to think about capturing the outputs of these models?
Well, part of the reason for that is, as Vuk also mentioned, we want to capture more of the data
that we're creating. And by and large, you know, from this point forward, most of the data that is being created, at least in terms of human and computer interactions, is being created by these models and the agentic systems built on top of them.
And so harnessing that is really useful,
but it's not just useful in terms of capturing the data
and being like, oh good, we got that.
Let's think about what we can do with that later.
Systems like Filecoin are useful
and other similar systems
because not only do we have the volume
to capture all of the data,
but we can also do things like record the provenance
and the structure and things like that of the data.
So we can actually verifiably say,
okay, this model was run at this time with this prompt
and this set of tool calls and blah, blah, blah, and it produced this output. And whilst that
particular piece of information maybe isn't useful in the context of that one interaction
with the underlying LLM, in the future, it's going to become increasingly useful. And I think a big part of it is like, you know, yesterday's AI kind of produced just raw text.
Tomorrow's AI systems are producing transactions and predictions and making decisions that flow through real markets and supply chains and governance systems and all these things.
And if we want to be able to sort of like track these decisions, we need some way in order to
like actually capture in a verifiable way those outputs. And that's been the perspective that
we've been taking a lot is like, look, my team isn't going to have a direct impact on foundation models.
A lot of this stuff is happening in labs.
A lot of the data is proprietary because data quality is so important.
We can try to have an influence on things, and we can work with decentralized training systems.
Prime Intellect is an awesome example. There's a lot of like DAOs and protocols and
teams that are working to collect, you know, high quality data in a way that's, excuse me,
verifiable and for and of the community. But by and large, the big foundation models are happening
in fairly siloed systems. But once we unleash those models, it's very useful
to be able to capture their outputs and do something useful with them. And so that's the
kind of perspective we're coming from. It's like most of the data of the future will be produced
by these models. And so building infrastructure and systems to capture that in a verifiable way is useful.
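Carson's "this model was run at this time with this prompt and this set of tool calls, and it produced this output" can be sketched as a hash-committed record. This is an editorial illustration with hypothetical field names, not Recall's actual schema or any on-chain format:

```python
# A minimal sketch of a verifiable provenance record for one model
# invocation. Field names are hypothetical illustrations.
import hashlib
import json
import time

def provenance_record(model: str, prompt: str, tool_calls: list,
                      output: str, prev_hash: str = "") -> dict:
    """Build a hash-committed record of a single model run."""
    body = {
        "model": model,
        "timestamp": time.time(),
        "prompt_sha256": hashlib.sha256(prompt.encode()).hexdigest(),
        "tool_calls": tool_calls,
        "output_sha256": hashlib.sha256(output.encode()).hexdigest(),
        # Chaining each record to the previous one yields a tamper-evident
        # audit trail over a whole agent session.
        "prev": prev_hash,
    }
    body["record_sha256"] = hashlib.sha256(
        json.dumps(body, sort_keys=True).encode()
    ).hexdigest()
    return body

rec = provenance_record("example-llm", "What should I order for dinner?",
                        ["search_restaurants"], "Pizza.")
# Anyone who later holds the raw output can recompute its hash and check
# it against the stored record.
assert rec["output_sha256"] == hashlib.sha256(b"Pizza.").hexdigest()
```

Stored on a network like Filecoin, records like this are what make an individual interaction, useless on its own, auditable later at scale.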
We see this in other contexts as well.
The outputs are more realistic to capture than inputs.
Our team works a lot with agentic AI builders.
In particular, we've been working with a slew of developers who build
trading bots or trading agents that are actually trying to optimize
P&L profit and loss and things like that over different timescales.
In a lot of cases,
their inputs are pretty proprietary information because they're trying to
build a business around it and they don't necessarily want you to have access
to either the training data that they're using
to fine-tune their own models
or the actual price feeds and data feeds and data sources
that they then actually run through the models at test time.
And so that sort of information they keep close to the chest.
They may be leveraging decentralized storage
as a backup in an encrypted way.
But in terms of open network storage,
they're not leveraging it too much in practice.
But the outputs of those systems are often either on-chain actions, so it's right there, easy to capture and see, or interactions with clients and things like that, which is instantly out of their hands.
And so they don't have any, you know, they're not pretending to have control necessarily over those outputs.
So capturing that and leveraging that in a useful way is helpful.
And then furthermore, capturing that and storing it
and then leveraging it later to help with evals and fine-tuning
and even in some cases in-context learning is actually super useful.
And so just to finalize that: evals, evaluations, a simplified explanation is that this is a way to do unit testing on model outputs, because LLMs are probabilistic systems. Unlike more traditional code, where ideally for a given input you get a given output, with LLM-based models it's probabilistic: a given input will produce similar output, but not necessarily the same. And so we have to change our testing framework a little bit, and the testing framework that we use for that is called evals.
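A toy eval in this "unit tests for probabilistic outputs" sense might look like the sketch below. Because a model can answer the same prompt differently on every run, we assert that a property holds at a high enough rate across many samples rather than demanding one exact output. `fake_model` is a stand-in for a real LLM call, not any actual API:

```python
# Evals as probabilistic unit tests: sample many outputs, assert a
# property holds at a sufficient rate. `fake_model` is a stand-in.
import random

EM_DASH = "\u2014"

def fake_model(prompt: str) -> str:
    # Stand-in for a model: mostly well-behaved, occasionally dash-happy.
    outputs = ["A clean, dash-free answer."] * 9
    outputs.append(f"An answer {EM_DASH} with {EM_DASH} too many dashes.")
    return random.choice(outputs)

def eval_no_dash_overuse(model, prompt: str, runs: int = 200,
                         threshold: float = 0.8) -> bool:
    """Pass if at least `threshold` of the sampled outputs avoid em dashes."""
    passes = sum(EM_DASH not in model(prompt) for _ in range(runs))
    return passes / runs >= threshold

random.seed(0)
print(eval_no_dash_overuse(fake_model, "Write me some tweet copy."))
```

The key difference from a traditional unit test is the `threshold`: the assertion is about a distribution of outputs, not a single deterministic one.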
No, thanks for that. There's a lot of really interesting information in what you just mentioned
there. And I was trying to take notes, but there's like too much good stuff. And I kind of
ran out, I kind of lost track here. But one point that you made that was really interesting to me, and maybe it's kind of an overall point you're making, is that your focus is really on the outputs, on making sure that we can track these model outputs, and that the provenance of these outputs, et cetera, is going to be very important in the future.
And it's also much more kind of realistic to track these rather than trying to track the inputs into the models just because most of these things are being used under proprietary systems, right?
So people don't want to necessarily say how they're training their models, et cetera, right?
Maybe in some instances they do, but if it's a proprietary business, they probably don't, right?
But we can track the output.
It's a lofty goal, and it's a good goal at both ends, right? Like, really what I want is an LLM trained on open data that I can know and in theory inspect. And that is the ideal. But from a practical perspective, it's not always the case that it's easy on the input side.
And then, but I want to double click on the provenance question because I thought that
was really interesting what you raised there in that, you know, right now, like, yeah,
we're using AI systems for, you know, it's like we ask ChatGPT, like, okay, what should
I order for dinner tonight?
Or like, make me a cat photo, whatever.
Or like, we have these kind of, these like, you know, agentic reply bots on Twitter or
whatever. These things that aren't like necessarily of, you know,
really great consequence.
But in the future, as you were saying, you know,
if these things are gonna be doing transactions,
if there's going to be major decisions that are being made based up by these
agents, there needs to be some way of really like tracking these outputs with,
with consistency, with verifiability, just for obviously future learning, but also like having,
you know, if something goes wrong, we actually can kind of look back and know like what went
wrong there, right? Maybe I'll punt it over to Vuk, but I'd love your thoughts on this.
Love your reaction to like, I guess, anything that stood out from Carson's remarks there,
but also, you know, how are you guys thinking about this question of provenance and why this is so important?
So basically, Filecoin by default provides a way to uniquely identify a particular piece
of content.
So like basically even the proofs that are being sent on chain for a particular sector,
every 24 hours are basically saying, okay, this piece of data is actually here.
There is another piece which is basically connecting the dots between a particular data set and the set of sectors that are being onboarded to the network with a particular CID.
And yeah, ultimately we are really trying to rely a lot on all these content-addressed pieces, because what we're seeing often is that our clients are basically asking, first, to just get a sense of whether the data is actually there, but also, in some contexts, to prove to their users that they used a particular piece of data, and that that piece of data was, for example, in a particular jurisdiction, and so on.
But yeah, basically by default, the Filecoin network provides this abstraction, and we are definitely relying on it.
Although, to be honest, we are focusing a lot on just making the use cases work first, and then adding the benefits of provenance and other ones, instead of making that the main feature that we're offering on our infrastructure.
So yeah, TLDR is like,
we're focused more on trying to push it
to tens or hundreds of petabytes.
And then we'll enrich these features
that allow our customers to take more advantage of the features
that the network provides.
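The content-addressing idea Vuk leans on can be sketched simply. Real Filecoin and IPFS CIDs use multihash/multicodec encodings and chunked DAG structures, so the bare SHA-256 digest below is only a stand-in, but it shows why a content address lets a client verify exactly what a storage provider returned:

```python
# Simplified content addressing: the identifier is derived from the bytes
# themselves, so retrieval can be verified by recomputation. (A stand-in
# for real CIDs, which use multihash/multicodec and DAG chunking.)
import hashlib

def content_address(data: bytes) -> str:
    # The address is a function of the content, not of its location.
    return hashlib.sha256(data).hexdigest()

def verify_retrieval(expected_cid: str, retrieved: bytes) -> bool:
    # Anyone can recompute the address and check the provider returned
    # the committed data, bit for bit.
    return content_address(retrieved) == expected_cid

dataset = b"training shard: sensor logs, 2025-09"
cid = content_address(dataset)
assert verify_retrieval(cid, dataset)            # honest retrieval passes
assert not verify_retrieval(cid, b"tampered!")   # any alteration is detected
```

This is the primitive behind both "is the data actually there?" and "prove you used this particular piece of data": the claim is checkable by anyone holding the CID.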
Cool, cool.
And then maybe it's a good point just to mention
that we will have some time for Q&A at the end.
So if folks have questions or comments
or anything that they want to raise,
feel free to have a think about those at the moment.
And then we'll open it up maybe in like 10 to 15 minutes
or so for questions from the audience.
So folks, so have a think.
Maybe Carson, let's turn it back to you.
And I'd love to kind of loop all this
of what you're talking about back in
with what you guys are building with recall.
And you kind of gave the high level elevator pitch of recall in your intro remarks, but
it'd be great if you could maybe tell us a bit more about like, what is this sort of
agentic intelligence concept mean?
How are you guys deploying that?
And then maybe kind of looping that in with some of the other topics we've been discussing
Yeah, yeah, sure. So, I mean, I'll try to keep it focused on the AI-and-data category, but by and large, Recall is an agent arena where we test a lot of capabilities on agents, and we have a couple of ways of doing that. But the broad-strokes framework is: imagine a world in which people could decide, okay, great, a new foundation model has come out, but I want to know if it's actually good at not overusing em dashes, right? Like, we've all tried it: you try to get it to write you some copy for a tweet, and it just sticks all these dashes in there.
So I don't want that to happen ever again. And I have tests.
I have ways to test whether this model's outputs are overusing dashes. But I can't possibly write and sort of control enough of it myself.
So I want to leverage the ecosystem and communities to help me build AI systems that never overuse em dashes. And so I want to deploy this particular test, or set of tests, to the network and start running competitions against all sorts of different LLMs and agents, testing them on inputs and outputs, producing those outputs, and then evaluating whether they're overusing em dashes.
By the way, I'm using this em-dash example
because it's a bit silly, but it's easy to think about.
It's not exactly the most important test in the world,
but it's one you can wrap your head around.
So we deploy these tests,
we run them against lots of inputs and outputs.
Users and AIs evaluate those models and they determine,
yep, it wasn't overusing or no, it was overusing.
We build up a scoring system and
a ranking system that actually ranks these different systems.
It turns out that the latest model, GPT-5, is actually not that great. It kind of overuses em dashes. But you know what model doesn't overuse em dashes?
A recent coding model, which kind of makes sense, because it's optimized for coding, so it doesn't have a lot of examples of em dashes
in its training set probably.
So if you ask it to write prose,
it actually does a pretty good job because it's
trained on technical content and code and data.
So you can start to build up
this intuition over time of which models are really
good at particular things by deploying it to a network of
users and having them engage with
it and rate it and rank it. The most successful of these that we've done so far is trading P&L.
And there's a couple of really important reasons why we started there. One, Web3 people kind of
dig it, so it's like a fun example to think about. But two, one of the problems with a lot of the foundation models is that they are trained against these static benchmarks.
And the benchmarks are awesome, right?
Like we need these benchmarks to help us understand how good is this model at a particular concept or is it good at abstract math?
Is it good at multi-step reasoning?
We can craft these benchmarks that we run
against these models to try and test,
okay, this one is fractionally better than
this one at that particular skill that we're measuring.
That's great. The problem is,
these are static benchmarks.
What happens is you start to test these,
and the models end up being trained on the benchmarks themselves.
It would be silly if they didn't do this, right? A benchmark is a bunch of data. And we were
talking about how data is actually the hard problem, right? Curated data is the hard problem. And
benchmarks are like perfectly curated data. So obviously the models are going to be trained
against them. And so this is really similar to a teacher basically saying,
OK, students, I'm going to test you later,
but here are all of the answers and questions ahead of time.
Go ahead and study those and then let's see how well you do.
And so obviously the students are going to do a lot better
if they've got all of the questions and answers ahead of time.
So what we really need is we need dynamic benchmarks, benchmarks that change all the time.
And so we started with a really intuitive and simple one, which is trading P&L.
So how good are these agents at determining stops and losses or trading on an open market?
This is obviously dynamic because it's very difficult to predict market movements.
And frankly, if they could predict market movements ahead of time,
they would just quietly stop competing and go off and start making a swag ton of money.
So, you know, it's pretty easy to be certain that they're not going to be able to optimize ahead of time
for a particular trading scenario. So this is a good way to test. It's very dynamic. It's very objective in terms
of how we measure it. And so it's a great example of a dynamic benchmark that we can start to build
up and test and then score and create rankings for. And so that's what we've been putting a lot
of effort into, and that's what Recall is building up.
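The trading P&L idea can be sketched in a few lines. This is an illustrative toy, not the actual benchmark: the price path stands in for live, unpredictable market data, which is what makes the benchmark dynamic:

```python
def pnl(trades, prices):
    """Mark-to-market P&L for a list of position decisions.

    trades[i] is the position (+1 long, -1 short, 0 flat) held from
    prices[i] to prices[i + 1]; P&L is the sum of position * price change.
    """
    return sum(pos * (prices[i + 1] - prices[i]) for i, pos in enumerate(trades))

def rank_agents(decisions, prices):
    """Rank agents by realized P&L on the same (previously unseen) price path."""
    scored = {name: pnl(t, prices) for name, t in decisions.items()}
    return sorted(scored.items(), key=lambda kv: kv[1], reverse=True)

prices = [100.0, 101.0, 99.5, 100.5]  # realized market path, known only afterward
decisions = {
    "agent-long": [1, 1, 1],          # hypothetical agent: always long
    "agent-flat": [0, 0, 0],          # hypothetical agent: never trades
}
print(rank_agents(decisions, prices))  # [('agent-long', 0.5), ('agent-flat', 0.0)]
```

Because the price path only exists after the decisions are made, an agent can't be pre-trained on the answers the way it can with a static benchmark.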
Now, this is pretty useful, because we now have a ranking and scoring system that's fairly objective, which we can use to track how good particular agents or LLMs are at a bunch of things. Trading is one we've done, but we've also done the em dash one, and we've done how well these models deliver bad news in an empathetic way. We've done lots of different subjective and objective tests, and you start to build up this ranking.
That ranking system is also something you want to be available to other systems,
because it's useful to be able to explore: oh, if I'm building an agentic coding system,
what are the best models right now for focusing on, I don't know, Solidity?
Or if I'm building an application that helps doctors be more empathetic,
what underlying model should I leverage to help doctors come up with ways to deliver bad news?
What about a model that's good at delivering good news? And all of these rankings help these
system builders better understand which tools are good at the things that actual people care about.
Because I like to see which models are good at abstract math, but it doesn't really
help me in my day-to-day usage of these models. What really helps me in my day-to-day usage is: does this model produce textual output that doesn't sound like a robot?
Can it handle multi-step tasks for writing code, and much more practical things?
So yeah, we want to be able to build up benchmarks
that are testing very practical things,
and that's what we're doing with Recall.
And so the inputs and outputs of all of these systems
are really important sources and sinks of data.
And you can do further training on that data.
So recent research has shown that even if a model is already trained,
if you actually feed it feedback in the form of like ranking or scoring,
it can actually produce better responses based purely on its in-context data.
And so, for people who aren't familiar with in-context learning:
So basically you train a model,
and you have this large context window
within which you can feed it input tokens.
So input text, right?
Prompts, as everybody knows it.
And we also know that if you do a good job of prompting a model,
you will get a better output.
And it turns out sometimes it's very hard
to come up with a really good prompt.
But what you can do is you kind of train in that context,
in that prompt window.
And you can ask it the same question multiple times,
rank its responses, and then ask it one more time
and you'll get a better answer.
And you ask it another time,
you'll get a slightly better answer.
And so you can do a lot of this sort of training stuff in context,
and then you build up a data set (this is called test-time training)
that you can then actually leverage in real live systems,
so you don't have to wait for the foundation models to be updated.
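The ask-rank-ask-again loop described here can be sketched roughly as follows. Everything in this snippet is an assumption for illustration: `model_complete` is a stand-in for any chat-style LLM API, and the length-based ranking is a naive stand-in for human or AI feedback:

```python
import random

def refine_in_context(model_complete, question, rounds=3, samples=3):
    """Ask the same question several times, rank the answers, and feed
    the ranking back as extra context before asking again."""
    context = [{"role": "user", "content": question}]
    best = None
    for _ in range(rounds):
        answers = [model_complete(context) for _ in range(samples)]
        ranked = sorted(answers, key=len, reverse=True)  # toy ranking: longest wins
        best = ranked[0]
        feedback = "Ranked answers, best first:\n" + "\n".join(ranked)
        context.append({"role": "user",
                        "content": feedback + "\nImprove on the best answer."})
    return best

def toy_model(messages):
    # Toy stand-in for an LLM: richer context nudges out longer answers.
    return "answer " + "detail " * (len(messages) + random.randint(0, 1))

random.seed(0)
print(refine_in_context(toy_model, "Explain em dashes.", rounds=2, samples=2))
```

With a real model, the ranking signal would come from users or a judge model rather than answer length, but the loop structure, refine within the context window without retraining, is the same.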
So there's lots of ways we can actually rank, test, and capture the data these systems are producing in the real world to improve our agentic systems, like trading and writing and all of these things.
That was a bit rambly, but the take-home point is there's a lot we can learn and do from capturing the outputs of these models, scoring and ranking them, and understanding if they're actually solving the tasks we need them to solve.
I really like this mission, because you're really focused. If I'm hearing you correctly,
it seems like the focus is really on just trying to make these things more
reliable and usable and useful for average, real-world use cases, right?
And I think we've all probably experienced ChatGPT hallucinations, or
hallucinations from other AI models. Just last weekend I was going
out to dinner with my wife, and I asked ChatGPT what's a good restaurant we can go to. It suggested a place.
We went there and it was closed; the place didn't even exist anymore.
You know, you kind of have to train yourself to double-check these things, but in that moment I was being
kind of lazy. I was like, okay, it's probably correct, let's just go. And the place didn't exist.
And obviously, we're not to the point where we're outsourcing major decisions to AIs, right?
We're talking about, you know, help me with this line of thinking, help me create a prompt,
help me create the text for an article, help me create some code for this app.
It's like a sandbox we're playing around with, right?
So I really like how you're thinking about this in terms of how can you make
the outputs of this more easily trackable, and which models are better
at different things, more reliable at different use cases,
different types of prompting, et cetera.
So I think that's a really, really key thing.
So, you know, thank you for explaining all that.
Vuk, I'd love your reaction to any of that,
if there's anything that stuck out, anything you want to chime in on.
And how does that fit in at all with what you're building with Ramo?
Yeah, I mean, ultimately, we are always trying to think about it from the fundamentals.
And yeah, there are a few stages that you always want to go through.
One is the one that we mentioned, which is basically just collecting the data, like in whatever shape or form.
And then you have a few steps where you would clean this data, like you would label it in a particular way.
And then like you would input that into like either training or reinforcement for a particular model.
So, yeah, we are thinking about how do we actually use the tech that both the Filecoin ecosystem has enabled for us, but also the ZK innovations that we've seen in
the past couple of years, to allow use cases that normally would have been in large
data centers, all centralized in one particular location. How do we allow this to happen in a more decentralized context,
maybe between multiple data centers?
Initially, maybe these data centers would need to be connected in the same area,
for example, if this is Northern Virginia, the data centers there.
But how do we get to a point where you can scale all of these to multiple data centers
and allow larger models to be created, or more data to be harnessed, for actually improving these models?
Cool, cool. Now I want to touch on an earlier point that we addressed, which is kind of on the input, the training side.
And I think when I was first starting to research this whole subject a couple of years ago, researching the overlap between Filecoin and AI, LLM models, et cetera,
I was really interested in this idea of, wow, I think
Filecoin really has maybe a product-market fit here, because in the future you're
going to want to make sure that you're training your models off of very pristine data.
Like, if I'm training a model, I don't want this to just be junk off the internet.
I want to know that this is verifiable, real data that hasn't been tampered with,
that it's not just synthetic junk data or something. And Filecoin gives you the ability to basically
guarantee that, okay, this data being stored in this place is cryptographically
sealed. It's secure. It hasn't been tampered with, hasn't moved, hasn't been
altered in any way.
And so my initial thinking was, wow, that could be a very valuable thing for folks looking to train models in ways where they can basically be sure that, okay, this is going to give me the results I'm intending.
It sounds like that path, from what we were discussing with Carson earlier,
has maybe been a bit more difficult; it sounds better in theory
than it works in practice, perhaps. And given the dominance of the big
players in this space, and kind of the big silos and whatnot, it feels like this is probably
not a utopia that's going to come to pass anytime soon.
But I do feel like at some point, in some way, shape, or form,
there's a future where there are going to be clients out there, folks out there,
who will want this level of pristine data, this kind of guarantee that the
data is pristine. And I'm just wondering: what would it take for us to get to a point where
there's a premium on data that's stored on Filecoin,
because it's been sealed, we know it's true, we know it's verifiable, et cetera?
I'd love to maybe pose that hypothetical.
Maybe I'll punt it to Carson, and then Vuk, if you want to chime in as well.
Well, I don't know.
I don't know when we will be able to say like, yep, cool.
We did it.
We're there now uh i i have a feeling the answer is
somewhat political and fairly far and it's going to be a pretty contentious debate and discussion
you know it seems clear that we're going to have like sovereign AI and we're going
to have like open source AI and we're going to have closed source hyper optimized AI and we're
going to have these systems and there's going to be fights and arguments between groups on
how it needs to be done. You know, and actually Filecoin stands to benefit from either side of these discussions.
If it can provide verifiable storage of the underlying data within specific geographic
or socioeconomic regions, that's going to be useful for sovereign AI.
If it can store it in open, verifiable ways, that's
going to be useful for open-source AI. And if corporate entities can ensure that their
data is compliant and secure and backed up in many jurisdictions or whatever, then that's
going to be useful as well.
So I think if the Filecoin ecosystem can prove that it's providing the real specific value that each approach to underlying data in the LLM or the models is trying to get to, then I think it wins.
I think it's probably not really going to win until we get to a scenario where we can co-locate compute with the data.
Because if you're doing a model training run on exabytes of data,
and you have to move that data anywhere,
that is a major cost in terms of literal infrastructure cost and money,
but also in terms of time, right? Every meter further that the data has to transfer over
the wire translates into some amount of time, because physics. So I think the
winning combination is being able to verifiably address different jurisdictional requirements for these different models, and then being able to co-locate the data with the compute.
But maybe that's my hot take for today's discussion.
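To put the data-movement cost in perspective, a quick back-of-envelope calculation; the link speed here is an illustrative assumption, not a claim about any real deployment:

```python
# Back-of-envelope: moving one exabyte of training data over a fast link.
exabyte_bits = 1e18 * 8      # 1 EB expressed in bits
link_bps = 100e9             # a dedicated 100 Gbit/s link (illustrative)
seconds = exabyte_bits / link_bps
print(f"{seconds / 86400:.0f} days")  # roughly 926 days of pure wire time
```

Even ignoring protocol overhead, that's years of transfer time, which is why co-locating compute with the data matters at this scale.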
Vuk, do you want to chime in there?
Yeah, I mean, I feel that we need to focus first on the base primitives
and scale them to the point where they become relevant in the context of Web2 and where Big Tech is today.
And then we want to basically make sure
that we build the tooling to enable a particular set
of use cases.
I do think we are still in the first phase
where we're basically trying to make it work well
for, like, 10, 100, maybe one exabyte scales. From there, amplifying the
value that we've created with tooling is an easier problem to solve.
But yeah, that's something that needs to happen now.
It's worth mentioning that there is a gap generally in the AI space right now,
which is basically everything around data prepping. We've seen acquisitions from Meta
of, like, Scale AI and a few others, which basically centralized the data-prepping side of things, and that kind of left a vacuum of particular companies that are not able to use,
of course, tools from competitors.
So yeah, I do think there is space and we are well positioned to actually attack some of those.
But there is also a lot of work to do on the infrastructural side, which is basically
making the tech work at the scale that is relevant for these use cases.
Well, maybe, yeah, we'll open it up for questions if anybody has them.
And then while we wait for folks to raise their hands,
maybe I'd just like to turn it back to both of you
if you have any final thoughts
or like other points you'd like to make
that are germane, relevant to this conversation.
I guess we've got a couple minutes left.
So maybe, Vuk, we'll turn it back to you
if there's any final thoughts you wanted to add.
Yeah, I'm super optimistic.
I feel that we are in a very rare time in history
when things are changing very fast.
It kind of reminds me of the times
when Hadoop was invented,
and a few things that Google open-sourced
back in the day, during what
I think back then
was called Big Data
or something like that.
But yeah, ultimately, these are rare moments in history when like we can actually define
like how the tech gets shaped.
So I'm super excited about that.
I think there is a big like potential for Filecoin to solve a big part of these problems.
Yeah, I would say likewise.
I think I actually recently had an internal discussion with my team,
and we were kind of saying it can be a little scary in the post-AI world,
but honestly, it's never been a better time to be a builder.
The leverage that you get right now
is just so much greater than I think ever before.
And so we're at a pretty interesting point in time
where you can have a greater impact
on the future of software and digital experiences
and all this stuff with a much smaller team
with access to the right tools.
And for sure, data is going to be the new,
you know, sort of like coordination layer for all of this.
Models are going to come and go.
But, you know, if we can help build the systems
that ensure sort of open, fair, transparent,
and persistent access to data,
that's definitely going to define the next decade of AI.
And so we're at a pretty interesting crossroads or like intersection here
where we've got like real incentive alignment mechanisms that we can leverage.
We've got like real data and real access to large volumes of data and capture tools for that data.
And then we have the same access to the same models
as just about everybody else.
So that's a pretty powerful combination
and I'm pretty excited to see what we unlock with that.
Very cool.
Well, if there's no questions from the audience,
maybe this is a good place to wrap it up.
We're about two minutes shy of the hour here.
But yeah, I just want to thank everybody for spending the hour
with us here.
It's a really interesting conversation.
I mean, definitely just scratching the surface, I think, of what the implications are.
And it's really great to hear from two folks who are really kind of building at the frontier of this whole kind of data meets AI decentralization realm here.
So we want to give a shout out to Isabella
and the Filecoin TLDR team for hosting this and arranging this.
So big thanks for that and for the invitations.
And thanks everyone for listening to us here today.
And be sure to give us a follow on X if you wouldn't mind;
that would be appreciated and helpful.
But yeah, maybe Isabella, I'll hand it back to you if you want to close us out.
Okay, thank you, Aaron.
And thank you, Carson and Vuk, for today's AI space. It was a very awesome space,
with a lot of insights shared from you guys.
That's super awesome.
And I think we can wrap up for today,
and we can look forward to the next episode
of our Filecoin Beyond Storage series
of spaces.
Thank you, everyone.
Thanks, Aaron. Bye. Thank you everyone. Thanks everyone.
Bye everyone. Thank you.