AI transcript
0:00:16 Hello, and welcome to the NVIDIA AI podcast. I’m your host, Noah Kravitz. My guests today
0:00:21 are Max Yatterberg and Sergei Yaknin. Max is the chief AI officer at Isomorphic Labs,
0:00:26 and Sergei is the chief technology officer. Isomorphic Labs is the company building a
0:00:32 world-leading AI drug design engine to transform drug discovery and usher in a new era of biomedical
0:00:38 breakthroughs. I’m here with Max and Sergei Live at GTC 25 in San Jose. Gentlemen, thanks for taking
0:00:42 time out of the week to join the podcast. No, thank you. It’s great to be here. Yeah, it’s a pleasure.
0:00:47 So why don’t we start with a little bit, if you would, about yourselves, your background, and
0:00:54 how you wound up at Isomorphic Labs. You want to go, Max? Yeah, sure. So I’ve actually been in the
0:01:01 field of AI for about 15 years now. This is long before it was cool. Back then, I was working a lot
0:01:07 on computer vision, so applying early days of AlexNet, how we use this for ImageNet, how do we scale up for
0:01:14 object recognition, text recognition. During my PhD, we had the best image recognition models in the
0:01:20 world. Actually, we had a company in this space that ultimately got acquired by Google to join DeepMind,
0:01:26 and that’s how I ended up starting to work with Demis, Demis Asabes. Spent a long time at DeepMind
0:01:31 working on early days of generative models and then caught the reinforcement learning bug.
0:01:38 Worked a lot on these challenge domains of AI for games, trying to beat the top professionals at
0:01:43 StarCraft and Go, all this sort of stuff. But really, at the core, Deep Learning was the thing I
0:01:48 loved. It’s this amazing technology, and I want to see it have real fundamental impact on the world
0:01:54 and positive impact on humanity. And so as we started to see AlphaFold unfolding, there was
0:02:00 this incredible opportunity to create this new company, Isomorphic Labs. And this was just such
0:02:06 a brilliant opportunity to apply all of this amazing deep learning and machine learning toolbox that we’ve
0:02:12 been developing over time to really try and transform the drug design space. And that’s what brought me
0:02:15 over to Isomorphic to really head up AI in this space.
0:02:20 Right. So to hit that cross of the technology you love so much and continuing to push the frontiers,
0:02:22 but applying it to something that’s meaningful to you.
0:02:26 Exactly. Exactly. And I’ve always loved the application of AI as well as the fundamentals.
0:02:31 And just to see it have that really positive impact, that’s really key for me.
0:02:32 Absolutely. That’s great. Sergey?
0:02:39 Yeah. I’ve also been at this since way before it was cool, I want to say, to borrow from Max’s words.
0:02:46 I was living in Toronto, attending U of T. We had this guy, Jeff Hinton, kind of godfather of AI.
0:02:47 I’ve heard of him, yeah.
0:02:52 Some may have heard of him. It was really great. I mean, he used to teach courses at U of T,
0:02:57 and we all learned about neural networks. And back then, these were not deep neural networks. These
0:03:04 were very shallow, a couple of layers, and everybody was trying to solve handwritten digit recognition
0:03:09 problems for a very long time. I went into the industry and worked in a number of different
0:03:16 technology companies, working in industries like fintech or telecommunications. Ended up working at
0:03:20 Amazon for a little while, building their Canadian software engineering organization.
0:03:27 After about 10 or 12 years working in the industry, I sort of got really interested in healthcare.
0:03:36 This was around 2012, 2013. People started generating DNA sequencing data as a routine.
0:03:42 And I realized that folks were really struggling to actually analyze these data sets because they were
0:03:49 quite large. And so I took the opportunity to join a cancer research institute to help them with
0:03:56 this really ambitious global project of unifying the world’s cancer DNA sequencing data and
0:04:01 processing it uniformly on the cloud and making it available to researchers worldwide so that
0:04:08 everybody could make use of this data. And, you know, to me, this kind of opened the door to how technology
0:04:14 actually enables healthcare. And this set me down this whole path where after working on this project for a little
0:04:21 while I decided to do a PhD in the field, came to Europe, to Germany, to do my PhD at the European Molecular
0:04:26 Biology Laboratory, I kind of thought, you know, as a computer scientist who doesn’t really understand
0:04:33 biology, I should just immerse myself in an environment where I knew the least about what was going on. And so,
0:04:39 you know, it was this incredible eye-opening sort of journey through this PhD learning about molecular biology.
0:04:44 Coming out of the other side of this, I actually decided to jump right back into the industry.
0:04:51 I just felt like this is where velocity was highest. And so I joined a company called Sophia Genetics where
0:04:58 we did molecular diagnostics, which is sequencing DNA of cancer patients to try to better help diagnose the
0:05:05 types of cancer that they had. And we had a great time building out this company. But I noticed right around
0:05:07 sort of 2021,
0:05:16 2022, there started to be this real uptick in the interest of using AI in drug discovery. And it first started bubbling up with
0:05:24 different pharma companies and different Silicon Valley startups sort of starting to speak about using this technology.
0:05:30 technology. And then I, I heard from Dennis’s team who were looking to start isomorphic labs. And I was just
0:05:35 completely enthralled by the idea of building this company from the ground up. This was, you know, before
0:05:42 isolabs was even announced. And so it was an opportunity to be part of this founding team of this organization
0:05:47 together with Max and a couple of other folks. And, you know, as soon as I heard about all of the different
0:05:53 factors that were going to make this company possible, it just became a really easy decision to say, yeah,
0:05:59 sure, I want to work on this. I want to really spend my time to give this a shot and, you know, never look back.
0:06:05 Amazing. So let’s talk about the company a little bit. The isomorphic labs mission is to reimagine drug discovery
0:06:11 from first principles with an AI first approach. What fundamental assumptions about traditional drug discovery
0:06:18 are you looking to challenge? I think one of the key things that you see in traditional drug discovery is
0:06:23 you work really, really hard on a particular drug design problem. So you’re going after a particular disease,
0:06:29 particular indication. That means probably a particular protein, the target that you want to try and modulate
0:06:35 with your drug. Right. And teams work really, really hard on solving that one problem. And, you know,
0:06:40 that design process can take years. Sure. It’s really difficult. Then you have to go into the clinic.
0:06:46 But after you’ve done that piece of design work, none of that transfers to the next drug design problem.
0:06:47 Right. Okay.
0:06:53 So you’ve got the next disease, the next target, the next protein. And this is where we can think about things
0:07:00 very, very differently with machine learning, with AI, because we’re building models, models like AlphaFold,
0:07:07 that generalize across the whole of the proteome, the whole of the protein universe, the whole of chemistry,
0:07:14 which means that we have techniques and we can think of ways of doing drug design against these models
0:07:20 that work for, you know, one target, one protein, one disease over here. And exactly the same models,
0:07:25 the same techniques can work for another one over here. And so we get a very generalizable drug design
0:07:29 engine. And so then that changes even the DNA of the company.
0:07:35 Instead of just thinking about a single disease, even a single disease area, we really think about
0:07:39 how can we be solving all disease and how can we build a company around that?
0:07:48 Maybe one more assumption to talk about is this assumption that you need to continuously do wet lab
0:07:56 experiments to ground yourself in some measurable scientific reality. You know, all of current
0:08:03 conventional drug design relies on these design-make-test cycles, where as a chemist, you’re going to
0:08:11 hypothesize that, well, if I take this particular molecule, it might bind to a protein target, it might
0:08:16 have these properties, and then I’m going to make it and test that assumption. And most of the time,
0:08:20 I will learn that actually it doesn’t have those properties. You know, if we were really great at
0:08:27 predicting properties like that, well, this would be a solved problem right now. And so what we can do
0:08:35 when we have these general models, as Max says, that are as good as experiment, we can actually run many of
0:08:43 them in combination, and we can just take the output as truth, which is an incredibly liberating way to do
0:08:47 drug design. You know, our drug designers often talk about the fact that they can make bigger and
0:08:53 bigger and bolder changes to these molecules, which is just not how you operate conventionally. Since you
0:08:58 don’t have a lot of confidence in the predictions you’re making, conventionally, you make small changes
0:09:03 because you don’t want to destroy what you’ve already built, and you’re trying to eke out little incremental
0:09:12 changes one by one. Whereas when you can trust these models’ predictions, you just design freely. And this
0:09:19 has been really great and kind of leads us to our North Star as a company where we’re aiming to get our
0:09:25 models to the stage where we can run an entire drug design program in this single design round where we
0:09:32 basically continue designing in silico and only validate once at the very end using experiment.
0:09:37 Right. But here’s the crazy thing. So even if we create these amazing models that are perfectly
0:09:43 predictive of this experimental technique, so we have this perfect model, and we’ve had some
0:09:49 significant breakthroughs in this. Still, the space of potential drugs is something like 10 to the 60.
0:09:55 That’s such a huge space that even if you have this perfectly predictive model, you can’t exhaustively
0:10:02 search every single design. So you need something more even than just these predictive models. You need
0:10:10 actual generative models or ways to search agents that can actually explore chemical space and come up with
0:10:16 these designs to be able to navigate this massive 10 to the 60 space. And that’s completely different to the
0:10:22 way that traditional biotech or pharma would do this early hit discovery, where there what you’re doing is you’re
0:10:27 taking a library of molecules, you know, maybe it’s a million, maybe it’s 10 million. If you’re lucky, it’s even to the
0:10:33 billion scale, and you’re screening them experimentally. That barely chips away at the surface of that 10 to the 60 space.
0:10:38 And that’s really the difference in potential, that exploration of the full chemical space that we can start to do.
0:10:44 Can you describe or kind of explain what it means when you talk about biology as an information processing system?
0:10:49 Kind of unpack that, but then relative to your approach to model building?
0:10:58 Yeah. When we think about cells, you know, cells exist in this environment that throws information
0:11:05 via different chemical signals at them, throws different challenges, and the cell needs to be able to
0:11:10 process that and do something with it. You know, the cell is trying to survive and is trying to proliferate.
0:11:18 And so we see a lot of parallels between how cells act, and they use the tools that they have at their
0:11:24 disposal. The arsenal that the cell has at its disposal is really encoded in this genome. So anything a cell can
0:11:30 do is encoded in this genome, which then the cell makes proteins to actually do what it needs to do.
0:11:39 And so we see this as an ultimate opportunity for us to really model those cellular processes, to really
0:11:45 understand how the cell deals with its environment and be able to replicate some of this in a machine
0:11:51 learning model that would allow us to do that in silico in this AI analog of a cell.
0:11:57 Right. So when this is all realized, when the vision kind of, you know, down the road comes
0:12:03 to full fruition, what does it look like? What does our world look like in this new paradigm of drug
0:12:08 discovery that you’ve just been detailing? How does it affect the way that, you know, we take care of
0:12:14 ourselves, medicine, healthcare, preventative care, everything? What does the vision look like?
0:12:21 Well, let me try to paint one vision. And in my mind, it’s actually, there’s going to be stages of
0:12:27 this in some sense. And we’re at an early stage where we’re still trying to kind of shed some of the
0:12:32 older generation tools, you know, these non-scalable processes, as Rax just talked about,
0:12:39 and build a real technology approach. Technology is scalable. This is what it’s done to disrupt so many
0:12:44 industries. And so this first step is about how can we build these models that will bring scalability
0:12:51 to this that will bring these general approaches where I don’t have to have a whole army of
0:12:57 disease-specific scientists to focus on a particular disease, instead I have a general model that allows
0:13:03 me to do this. And so to me, you know, stage one is just how do we do this in a tech-forward way?
0:13:11 But further stages actually get us much closer to how can we do this in a way that will start convincing
0:13:19 regulatory bodies that these models are predicting things with such a high degree of accuracy that maybe
0:13:24 we don’t need to spend five or seven years in clinical trials in the future, because we can prove
0:13:32 mathematically that the molecule we design is going to work. And so that’s a really important next
0:13:38 stage, because I think it’s going to really change how the entire industry works. When we think about
0:13:46 this industry now, the FDA approves about 50 drugs per year on average. And so it’s been that number for
0:13:53 probably three decades. And so there’s a real limitation to how many drugs, even if we had, you know, incredibly
0:14:00 powerful models that would allow us to design many, many more drugs, we currently have these limitations.
0:14:05 And so being able to overcome that together as humanity is going to be a key, you know, a key
0:14:13 accomplishment. But what this opens up is this future of a precision medicine that we talk about often, but
0:14:21 actually are nowhere near today, where we should be able to very, very precisely diagnose what is going on with,
0:14:27 with somebody, you know, imagine they have a particular kind of cancer, there’s a million different types
0:14:32 of cancer, depending on the molecular signature, depending on the mutations within the genome of
0:14:38 the patient. And we should be able to then design a very bespoke combination of compounds that would be
0:14:44 best for that patient or for that very small group of patients. And so we need to follow kind of the stages
0:14:50 to get there. But to me, you know, that’s the next waypoint on the road. And then ideally,
0:14:57 the last waypoint is, rather than waiting for somebody to get sick so that we can cure them
0:15:03 better, we should be getting ahead of that disease. And so we should be actually trying to detect when a
0:15:09 certain biomarker in a healthy individual starts going in the wrong direction so that we can design
0:15:13 interventions that will prevent them from getting sick in the first place. And I think that will
0:15:14 completely change this industry.
0:15:20 Yeah, absolutely. The approach that you’re taking, I’m wondering kind of how the old guard,
0:15:24 you know, looks at you and kind of responds to this approach. And then also wondering how
0:15:30 it shapes recruiting and team building. And if when you’re, you know, bringing people onto the team,
0:15:36 how much of an open mindset, a willingness to look through the glass the other way, what have you,
0:15:41 how much does that go into your approach to building out your team?
0:15:47 Yeah. So AI for drug design, we’ve actually seen maybe a first wave of companies and pharma companies
0:15:52 actually start to dip their toes into this space for maybe five, six, seven years now.
0:15:58 But I think that’s a slightly different wave to what we’ve been building. This first wave of AI for
0:16:03 drug design has really been, you know, how do we just use some of these machine learning tools
0:16:10 in the traditional drug design processes? And that often, you know, comes out as building these local
0:16:14 models where you’re, you know, you’re doing drug design around a particular target, you’ve got some
0:16:20 data and you’re fitting small local models to that to help inform you on that next experimental design,
0:16:27 make tests around. And that’s a very, very different paradigm to what we have been building, which is,
0:16:32 let’s create very general models, models that we actually can apply to any different part of space.
0:16:38 And, you know, a lot of people, I think, at least when we were starting out, didn’t think that was
0:16:44 probably the right way to go. But what we’ve just seen again and again is we can actually build these
0:16:49 general models, not just the alpha folds of this world, but many other sort of predictive capabilities,
0:16:55 generative capabilities. We see the potential to apply these on literally the hardest problems in
0:17:00 the industry. So, for example, our collaboration with Novartis, I think it’s no secret that they’ve
0:17:05 thrown some of the hardest targets in the industry to us. These are sort of drug design problems that
0:17:08 people have spent 10 years there. Can you give an example?
0:17:13 Can you give an example? We can’t give an exact example at the moment, but these are genuinely targets
0:17:17 they’ve been working on for over 10 years, not making progress. So they’re targets that chemists
0:17:23 come up to our leads and say, “This is impossible. Don’t try it.” And at the same time, we see that
0:17:30 in just months we’re able to make traction on this, creating novel chemical matter, finding novel ways to
0:17:36 modulate these targets, which are literally blowing the minds of these chemists. So I think there’s still
0:17:42 a long road for the whole of the industry to really fully understand what’s happening here.
0:17:50 This understanding is happening. The way I see it, in five years’ time, AI for drug design is going to be
0:17:55 across the entire industry. Doing drug design without AI is going to be like trying to do any type of
0:18:02 science without maths. It’s just going to be a fundamental part of science, and particularly this science.
0:18:09 In listening to the two of you talk about it, it sounds so simple in a way, but in a way that brings all
0:18:13 the power that you’re talking about behind it. But I would imagine if you’ve been doing it the other
0:18:19 way for, you know, a 50-year career or what have you, I don’t know, just the mental resistance to it might
0:18:25 be an issue. Yeah. And don’t get me wrong. It sounds really easy to say these things. In reality, these are
0:18:29 really, really hard modeling problems. Oh, 100%. Yes. I can’t even imagine.
0:18:33 These are holy grail modeling problems, things that people have been working on for decades.
0:18:38 Right. But I mean, I’m imagining just the resistance that people have to change new ways of doing things.
0:18:43 And if you’ve been, you know, you and your team have collectively been pounding on these problems,
0:18:47 and then there’s this new technology that solves it in such a short time.
0:18:55 There’s a healthy degree of skepticism going all around, I want to say. I think actually it can be
0:19:02 really challenging if you’re inside one of these pharma companies, because they have coalesced on a
0:19:08 particular structure as a result of how you would normally do drug design. So they will organize themselves
0:19:14 often by disease. You’ll have a whole area that is just focusing on oncology and another that is
0:19:21 focusing on ophthalmology. And so you have these whole structures and the types of models that we’re
0:19:28 creating, they span across all of this. And so imagine trying to nucleate this kind of initiative inside a
0:19:34 company that is organized by these disease areas. Where would you put them? And if you put them outside,
0:19:40 how would they actually permeate that structure and be able to institute change? So I think there’s
0:19:48 significant barriers within big pharma itself, which, you know, in a way is great for ISO. But what’s been
0:19:55 really heartening actually is seeing some of the transformation in the eyes of some of the chemists
0:20:00 and biologists that have joined ISO itself, many of who have also joined with a healthy degree of
0:20:09 skepticism. But over time, we’ve really convinced with data, with proof of the working of these models,
0:20:15 to really embrace these. And we have something that we call the ISO way, which is essentially,
0:20:22 how do you do drug design in this AI first approach? And all of our chemists are part of
0:20:29 this wave of how do we actually invent this ISO way together with all of the scientists and engineers
0:20:34 that are building the technology. And this has been really transformative. You know, in some sense,
0:20:41 I have this kind of old school Amazon principle of working from your customer backwards, where you want
0:20:46 to really understand who your customer is, and then you want to work backwards from that. And that will guide
0:20:53 you in your product design. And we have this dream situation here where actually the customer are our
0:21:01 peers, our buddies that are working together with us at Isomorphic Labs. And so at ISO, by contrast,
0:21:06 by construction, we have actually meshed all of our teams. You know, all of our teams are sitting
0:21:13 together in common spaces. It’s not like, you know, a technology team sits over here, a chemistry team sits
0:21:18 over here. We’re sitting interspersed, and we’re working very collaboratively on these projects.
0:21:25 And that cuts both ways in the sense that we’re getting incredible value and insight from these,
0:21:30 you know, long-term career domain experts that are helping us make sure that our models are really
0:21:37 grounded in the knowledge of their trade. But similarly, when we’re doing drug discovery projects,
0:21:41 we are doing that in this tech forward way. And so I feel like this melting pot
0:21:47 has been really amazing in transforming how chemists do their jobs and are, you know,
0:21:54 helping evolve this next generation of chemistry that is doing drug design in a completely new way.
0:21:56 And it’s being invented at ISO on a daily basis.
0:22:03 Our guests are Max Yatterberg and Sergei Yaknin from Isomorphic Labs. And we’re talking about their
0:22:09 really revolutionary approach to drug design, building world models and kind of taking a tech forward
0:22:13 approach. But Sergei, as you were saying, the collaboration, you know, really at the core of it.
0:22:19 But I want to ask you about the technology and ask you about AlphaFold. AlphaFold 3 is the current
0:22:25 version. And in my understanding, it was a big breakthrough in predicting biomolecular structures.
0:22:31 Can you talk about and even go back to the beginning and kind of maybe just briefly explain
0:22:36 what AlphaFold is and then talk about how we got to AlphaFold 3 and how important it is?
0:22:42 Yes. So maybe going back in time, AlphaFold actually started as a hackathon project in DeepMind.
0:22:46 You know, this was like a two week hackathon project. Can we throw a conf net on protein structure
0:22:52 prediction? And crazily, there was like signs of life there. And that snowballed into AlphaFold,
0:22:58 the project. AlphaFold 1 was a big step up in terms of accuracy, but AlphaFold 2, this was in 2020,
0:23:04 was that first moment that people started to see experimental level accuracy of protein structure
0:23:10 prediction from a neural network. And ultimately, AlphaFold 2 went on to win the Nobel Prize just last
0:23:16 year. But AlphaFold 2 just predicts the structure of proteins and proteins coming into contact with
0:23:22 other proteins. But there’s lots of other different types of biomolecules in addition to proteins that are
0:23:27 particularly very important when we think about designing drugs. So proteins are part of these molecular
0:23:34 machines that work by interacting with other proteins, but also things like DNA, RNA, small molecules.
0:23:38 And these small molecules could be things like, you know, something like caffeine that you consume,
0:23:43 or it could be drugs that we consume as well. And so really, we want to create a drug. And what
0:23:49 is a drug? A drug is something that comes in and modulates these molecular machines. And so we want
0:23:55 to actually design that really rationally, we want to be able to understand the structure of this protein
0:24:00 with these small molecules, maybe also together with DNA as these molecular machines form. And so that meant
0:24:06 we needed a completely new capability beyond AlphaFold 2. And that led us to the creation, you know,
0:24:13 this was a piece of work with Isomorphic Labs and Google DeepMind of AlphaFold 3. And this was a big
0:24:19 breakthrough for us. This came out last year. And this was the first time that we could predict the
0:24:25 structure of all of these molecules coming together at unprecedented accuracy. And this now is the thing
0:24:32 that, together with other models, allows our chemists to make changes to these molecule designs,
0:24:37 and literally, in a second, see the result of that. That’s a completely different way of working,
0:24:42 where traditionally, if you made a change to a molecule design, and you wanted to see how that changed
0:24:46 the structure, it would take literally months to get that structure back at best.
0:24:47 Yeah.
0:24:50 So that completely changes the game for our chemists.
0:24:54 Absolutely. Sure. And I didn’t want to interrupt you, but I don’t want to undersell the fact that
0:24:57 AlphaFold, it was AlphaFold 3 that won the Nobel Prize?
0:24:59 AlphaFold 2 won the Nobel Prize.
0:25:03 Two, Mike, excuse me. Yes. Okay. I didn’t want to cut in, but I didn’t want to let that go by.
0:25:10 So are you now building additional models? Are you tuning AlphaFold? What’s the process like from here?
0:25:17 Yeah. We go quite a bit beyond the capabilities of AlphaFold 3. If one thinks about the overall
0:25:23 drug design problem, you need to solve quite a lot of challenges on the way to making a molecule that’s
0:25:31 going to be in the pill that you’re going to buy from the drugstore. And this has to do with ascertaining
0:25:37 not only how is the molecule going to interact with its intended target, but actually how it’s going to
0:25:42 behave in the body. You know, we want these molecules to both make their way to where they
0:25:49 need to. We want them to stay around for as long as they need to, and we want them to safely break down
0:25:51 and exit the body. Right.
0:25:58 And so when we think about drug design, we need to solve this whole series of challenges and solve
0:26:03 them simultaneously. And so when we think about the different models that we need to build, we start
0:26:09 from this incredible structure prediction problem, and then we go on to solving other problems, such as
0:26:17 predicting if my potential drug binds to this target, how strong will it bind? I want it to bind strongly,
0:26:24 but not too strongly, potentially not forever. Right. And then I want it to be able to go inside a cell,
0:26:31 and I want it to be able to exit when it needs. And so all of our models are basically solving this whole
0:26:37 wide variety of problems. And then a key challenge for us is to actually how to make all of them work
0:26:43 in concept altogether so that we can have this holistic drug design engine that allows us to do
0:26:49 this end to end. How big of a part does infrastructure and compute for that matter play in this whole
0:26:56 process? And, you know, along the way, how is your access to the right kinds of infrastructure and compute,
0:27:01 you know, help you advance, I’m assuming faster than you would have otherwise, but you know,
0:27:05 help the whole process evolve? Like, can you talk a little bit about the importance of that infrastructure?
0:27:10 Yeah, it makes a huge difference, to be honest with you. And in some sense,
0:27:16 this really sets the stage for how ambitious you can be as a company. You know, in the current world
0:27:25 of AI being everywhere, there’s huge contention for resources, for graphics accelerators. And if
0:27:31 you’re not able to really have access to or be effective at commandeering these large machine
0:27:38 learning fleets, you’re at best going to be able to make use of models somebody else created. And you
0:27:44 may be able to adjust these models to your particular use case, but you’re not going to be able to do
0:27:52 foundational research. And what Isomorphic Labs really prides itself on is actually really moving
0:27:58 the needle on this foundational AI research for drug discovery. We’re able to create completely new
0:28:05 methods that are general and that are scalable. And this is predicated on having access to massive amounts
0:28:11 of infrastructure. And so we’re very lucky to be partnering with Google Cloud on this. And we work
0:28:17 very closely with them on sort of hosting our entire platform, our machine learning research and kind of
0:28:24 production inference platform. And so this is a huge part of what we do. You know, how do we make sure that
0:28:30 this kind of massive machine learning research and engineering team can have the least amount of
0:28:36 resistance, the highest research velocity possible so that they can try lots and lots of ideas because,
0:28:41 you know, ML research is a very empirical science. We end up trying and training models over and over
0:28:46 again. And so access to this hardware and the ability to do it efficiently, it’s really, really important.
0:28:46 Of course.
0:28:52 And one of the really exciting things is to start seeing that actually we’re seeing, you know, really
0:28:58 analogous scaling laws in our model development. The sort of things you might see in the LLM space we’re
0:29:04 starting to see in modeling biomolecular systems and understanding this biochemical world. And that’s
0:29:10 really, really exciting. Both actually on the, you know, training side is just how large our models
0:29:15 should be, but also really exciting on the inference side as well, how we can scale with inference time
0:29:19 compute. So there’s a huge amount of opportunity to actually scale the accuracy and the efficacy of
0:29:24 these models with the amount of compute. Right. Right. And what about data? Data wrangling,
0:29:30 synthetic data? What’s, what’s your approach to data? And I don’t know, best practices maybe?
0:29:36 Yeah, gosh, we could just spend an entire podcast talking about data, but let’s make a start at least.
0:29:43 Data is very important for, for any machine learning problem. It is especially important in science and it
0:29:49 is especially difficult to work with in science for the simple reason that you can’t just eyeball it.
0:29:54 and verify yourself as a human that it makes sense, that it’s any good. Right.
0:30:01 You know, we’re very used to data in the traditional natural language processing or imaging space where
0:30:07 I can look at an image that, you know, is generated by a model and be like, no, that’s garbage. Right.
0:30:13 Oh yeah, that’s really great. You cannot look at a structure unless it’s really terrible and be
0:30:18 able to say, oh yeah, that’s a, that’s a great structure. And so we need to be very careful with
0:30:24 how we work with this type of data. And well, there’s never enough of it. Right.
0:30:29 Truth be told there’s some great resources, of course, that are publicly available protein data
0:30:34 bank made alpha fold possible. You know, the amazing thing about that is that’s a resource that’s been
0:30:40 available to scientists for decades. And, you know, there’s a real proof point than that, that actually
0:30:47 with the right algorithmic advancements, you can create incredibly new, powerful systems using data.
0:30:52 Everybody has had access to forever. And so that’s really, really important. But when we think about
0:30:58 the entire drug design landscape and the complexity of the systems that we need to model, if you think
0:31:04 about, you know, there’s this kind of stack of different problems, and we’ve been talking so far
0:31:10 a lot about the molecular modeling, the chemistry side of this, but diseases experienced by humans.
0:31:15 And so when we go up the stack, we reason not just about molecules, but about cells and not just
0:31:23 about tissues, but organs and then humans, and then even how humans interact with the environment around
0:31:28 them. And so if we want to solve this problem holistically, we need to be thinking about data
0:31:35 at each layer of the stack. And those data sets simply don’t, don’t exist today. And so a big problem
0:31:41 that we need to contend with is how can we create the right data sets and be very, very principled about
0:31:48 it. There is, you know, a bit of a meme out there around the fact that maybe pharma has the data and
0:31:55 haven’t shared, you know, enough. And we will always welcome more sharing from our partners or from other
0:32:02 organizations. But the truth is a lot of that data is not machine learning ready in the sense that it has
0:32:09 been generated as part of past projects that were drug discovery projects or other types of scientific
0:32:15 projects. And that data does not have the necessary diversity to actually train the types of general
0:32:21 models that we’re interested in creating. And so while that data might be interesting for other purposes,
0:32:26 it might be interesting in improving performance on a particular subspace of the problem. Most of the
0:32:33 time it’s not going to be helpful or as helpful as one might hope for these general models. And so
0:32:39 something that we contend with at Isomorphic Labs is how do we generate these data sets in a principled
0:32:45 manner that will enable continued progress of the models. The great thing about it is we can lean
0:32:51 on our current generation models to help advise us where to look in this massive data space,
0:32:58 where to generate new data sets. And so we have this really virtuous cycle. If you have already an excellent
0:33:03 state-of-the-art model, you’re going to get great state-of-the-art advice on where to focus next.
0:33:03 Right, right, right.
0:33:09 And so this helps us continue actually improving on the edge that we have by continuing to generate data,
0:33:13 train new versions of the model. And this is a really virtuous cycle.
0:33:17 Yeah. You’ve talked about this a little bit in the conversation, but kind of to put a point on it,
0:33:25 the future of drug discovery. Is it AI end to end? Is it, what is it? I shouldn’t be positing things,
0:33:30 I should ask you guys. What do you see down the road and how will it ultimately, and you talked
0:33:34 about this a little bit, I think, Sergey, you were talking about preventative care and precision
0:33:41 medicine. What’s the, I don’t know, what’s the end game for humanity? What is it that this new paradigm
0:33:46 of drug discovery could ultimately help humans become? When I think about where we’re going,
0:33:52 you know, it really is this North Star of being able to do all of drug design on a computer, what we
0:33:54 call in silico. Yeah.
0:33:59 And to actually remove that experimental bottleneck that currently exists. Anytime you have to go out
0:34:05 into the real world, actually make some molecules and then test it out, this just adds so much overhead,
0:34:14 so much time. And ultimately we do see a track to genuinely solving a lot of these problems,
0:34:19 getting to experimental accuracy for some of these models, creating agents and generative models that
0:34:26 can explore this space. Where this leads is, okay, we, you know, we can get any target coming our way,
0:34:32 any protein that we want to start modulating, and we can very, very quickly come up with chemical
0:34:38 matter, that if we did go to a lab, we’d see positive results there. And, you know, near term,
0:34:44 as we go along that path, what that does is it means that it opens up new targets. So these sort
0:34:50 of age old targets or industry, you know, really, really challenging things that people, that people
0:34:54 use the term intractable, we try and not use that term, just call them challenging.
0:34:54 Challenging.
0:34:59 But, um, yeah, you know, opens up disease areas, opens up targets that previously people
0:35:01 couldn’t approach, but now we can.
0:35:02 Yeah.
0:35:06 And we start seeing that. When you’re doing more and more of this work in silico, on a computer,
0:35:11 without going into the lab, and you’re reducing those design-make test cycles, you’re doing things
0:35:12 faster. Yes.
0:35:17 And so that means, you know, just the time it takes from, hey, we want to go after this, or we really see
0:35:21 that there’s this patient need to actually starting to get into patients starts to reduce.
0:35:22 Right.
0:35:27 And then also, as we understand much more about how these molecules work, how they interact with these
0:35:32 targets, how these targets themselves interact with the whole network, all of these pathways
0:35:36 that constitute disease, how these molecules interact with the body, where they get taken up,
0:35:40 how they get broken down. We know so much more about how this works.
0:35:40 Yeah.
0:35:42 The safety profiles of this.
0:35:42 Right.
0:35:46 That when we do go into people and start testing these molecules out, they’re much safer.
0:35:47 Sure.
0:35:48 You know, they have higher efficacy.
0:35:49 Mm-hmm.
0:35:54 And that actually changes quite radically the economics, as we talked about a little bit
0:35:55 before. Yeah.
0:35:59 You know, if you’re, you know, go safer molecules, they have a higher chance of having efficacy,
0:36:03 then your failure rates are going down. And the ultimate, like, amount of you have to invest
0:36:03 as a company.
0:36:04 Yeah, right, right.
0:36:05 Drastically reduces.
0:36:05 Yeah.
0:36:09 And then that changes the economics of producing these molecules and R&D in this space.
0:36:15 So you can think about getting molecules to patients from, you know, potentially much lower
0:36:20 cost. That can even open up new indications, things that traditionally wouldn’t be commercially
0:36:24 viable to go after, where the patient populations are too small or the end market is too small
0:36:26 for traditional pharma to go after. Right.
0:36:26 Right.
0:36:29 But with this new technology, we actually can open up these markets.
0:36:31 So that’s a really exciting spot to get to.
0:36:32 That’s amazing. Yeah, yeah.
0:36:39 Maybe just to add, I feel like it’s really important for us to be as ambitious as possible
0:36:45 on this endeavor. And something that’s really unique about ISO labs is this really huge ambition
0:36:51 for what we could get to as an end game. But it’s also important for us to not get ahead of our skis
0:36:59 and sort of be completely disconnected from reality. Where we stand today is there’s no currently AI design
0:37:06 drug that has been approved. And so I think there’s a lot more to prove. And one of the things I appreciate
0:37:12 the most is we’re not just there to create these technologies in isolation. We’re there to make the
0:37:20 drugs and to prove stage by stage that this technology is working and that it’s making a difference. And so to me,
0:37:26 this balance between having huge ambition that points us directionally, you know, towards the future,
0:37:33 but actually being very execution focused so that we can improve on every program, the key parameters
0:37:37 is going to be really necessary to get us there in the long term.
0:37:42 Right, right. That’s amazing work you guys are doing. For folks listening who want to find out more,
0:37:48 about isomorphic labs, I assume the website, is there a technical blog, social channels,
0:37:53 where can we direct people to go online? I think the website is a great starting spot. We’ve got some,
0:37:58 you know, great articles on there, including some of our releases, interviews with people as well.
0:38:03 On our socials, you know, we’ll have podcasts like this and yeah, other material coming out.
0:38:06 Right. If you want to really geek out, read the AlphaFold3 paper.
0:38:08 Right. Including the appendix.
0:38:12 Exactly. That’s where all the secrets are, is in the appendix.
0:38:13 That’s where all the GC goods are.
0:38:18 Excellent. Max, Sergey, thank you so much for taking time out of GTC to talk with us.
0:38:22 Just incredible work you guys are doing and really kind of a pleasure for me to listen
0:38:25 to you talk about it for the audience, I’m sure. And it goes without saying,
0:38:29 but all the best of luck and fortune as you go forward with Isomorphic Labs.
0:38:31 Thank you, Nolan. It’s been a real pleasure.
0:38:42 Yeah, thank you so much.
0:38:54 Thank you.
0:38:55 Thank you.
0:39:25 Thank you.
0:00:21 are Max Yatterberg and Sergei Yaknin. Max is the chief AI officer at Isomorphic Labs,
0:00:26 and Sergei is the chief technology officer. Isomorphic Labs is the company building a
0:00:32 world-leading AI drug design engine to transform drug discovery and usher in a new era of biomedical
0:00:38 breakthroughs. I’m here with Max and Sergei Live at GTC 25 in San Jose. Gentlemen, thanks for taking
0:00:42 time out of the week to join the podcast. No, thank you. It’s great to be here. Yeah, it’s a pleasure.
0:00:47 So why don’t we start with a little bit, if you would, about yourselves, your background, and
0:00:54 how you wound up at Isomorphic Labs. You want to go, Max? Yeah, sure. So I’ve actually been in the
0:01:01 field of AI for about 15 years now. This is long before it was cool. Back then, I was working a lot
0:01:07 on computer vision, so applying early days of AlexNet, how we use this for ImageNet, how do we scale up for
0:01:14 object recognition, text recognition. During my PhD, we had the best image recognition models in the
0:01:20 world. Actually, we had a company in this space that ultimately got acquired by Google to join DeepMind,
0:01:26 and that’s how I ended up starting to work with Demis, Demis Asabes. Spent a long time at DeepMind
0:01:31 working on early days of generative models and then caught the reinforcement learning bug.
0:01:38 Worked a lot on these challenge domains of AI for games, trying to beat the top professionals at
0:01:43 StarCraft and Go, all this sort of stuff. But really, at the core, Deep Learning was the thing I
0:01:48 loved. It’s this amazing technology, and I want to see it have real fundamental impact on the world
0:01:54 and positive impact on humanity. And so as we started to see AlphaFold unfolding, there was
0:02:00 this incredible opportunity to create this new company, Isomorphic Labs. And this was just such
0:02:06 a brilliant opportunity to apply all of this amazing deep learning and machine learning toolbox that we’ve
0:02:12 been developing over time to really try and transform the drug design space. And that’s what brought me
0:02:15 over to Isomorphic to really head up AI in this space.
0:02:20 Right. So to hit that cross of the technology you love so much and continuing to push the frontiers,
0:02:22 but applying it to something that’s meaningful to you.
0:02:26 Exactly. Exactly. And I’ve always loved the application of AI as well as the fundamentals.
0:02:31 And just to see it have that really positive impact, that’s really key for me.
0:02:32 Absolutely. That’s great. Sergey?
0:02:39 Yeah. I’ve also been at this since way before it was cool, I want to say, to borrow from Max’s words.
0:02:46 I was living in Toronto, attending U of T. We had this guy, Jeff Hinton, kind of godfather of AI.
0:02:47 I’ve heard of him, yeah.
0:02:52 Some may have heard of him. It was really great. I mean, he used to teach courses at U of T,
0:02:57 and we all learned about neural networks. And back then, these were not deep neural networks. These
0:03:04 were very shallow, a couple of layers, and everybody was trying to solve handwritten digit recognition
0:03:09 problems for a very long time. I went into the industry and worked in a number of different
0:03:16 technology companies, working in industries like fintech or telecommunications. Ended up working at
0:03:20 Amazon for a little while, building their Canadian software engineering organization.
0:03:27 After about 10 or 12 years working in the industry, I sort of got really interested in healthcare.
0:03:36 This was around 2012, 2013. People started generating DNA sequencing data as a routine.
0:03:42 And I realized that folks were really struggling to actually analyze these data sets because they were
0:03:49 quite large. And so I took the opportunity to join a cancer research institute to help them with
0:03:56 this really ambitious global project of unifying the world’s cancer DNA sequencing data and
0:04:01 processing it uniformly on the cloud and making it available to researchers worldwide so that
0:04:08 everybody could make use of this data. And, you know, to me, this kind of opened the door to how technology
0:04:14 actually enables healthcare. And this set me down this whole path where after working on this project for a little
0:04:21 while I decided to do a PhD in the field, came to Europe, to Germany, to do my PhD at the European Molecular
0:04:26 Biology Laboratory, I kind of thought, you know, as a computer scientist who doesn’t really understand
0:04:33 biology, I should just immerse myself in an environment where I knew the least about what was going on. And so,
0:04:39 you know, it was this incredible eye-opening sort of journey through this PhD learning about molecular biology.
0:04:44 Coming out of the other side of this, I actually decided to jump right back into the industry.
0:04:51 I just felt like this is where velocity was highest. And so I joined a company called Sophia Genetics where
0:04:58 we did molecular diagnostics, which is sequencing DNA of cancer patients to try to better help diagnose the
0:05:05 types of cancer that they had. And we had a great time building out this company. But I noticed right around
0:05:07 sort of 2021,
0:05:16 2022, there started to be this real uptick in the interest of using AI in drug discovery. And it first started bubbling up with
0:05:24 different pharma companies and different Silicon Valley startups sort of starting to speak about using this technology.
0:05:30 technology. And then I, I heard from Dennis’s team who were looking to start isomorphic labs. And I was just
0:05:35 completely enthralled by the idea of building this company from the ground up. This was, you know, before
0:05:42 isolabs was even announced. And so it was an opportunity to be part of this founding team of this organization
0:05:47 together with Max and a couple of other folks. And, you know, as soon as I heard about all of the different
0:05:53 factors that were going to make this company possible, it just became a really easy decision to say, yeah,
0:05:59 sure, I want to work on this. I want to really spend my time to give this a shot and, you know, never look back.
0:06:05 Amazing. So let’s talk about the company a little bit. The isomorphic labs mission is to reimagine drug discovery
0:06:11 from first principles with an AI first approach. What fundamental assumptions about traditional drug discovery
0:06:18 are you looking to challenge? I think one of the key things that you see in traditional drug discovery is
0:06:23 you work really, really hard on a particular drug design problem. So you’re going after a particular disease,
0:06:29 particular indication. That means probably a particular protein, the target that you want to try and modulate
0:06:35 with your drug. Right. And teams work really, really hard on solving that one problem. And, you know,
0:06:40 that design process can take years. Sure. It’s really difficult. Then you have to go into the clinic.
0:06:46 But after you’ve done that piece of design work, none of that transfers to the next drug design problem.
0:06:47 Right. Okay.
0:06:53 So you’ve got the next disease, the next target, the next protein. And this is where we can think about things
0:07:00 very, very differently with machine learning, with AI, because we’re building models, models like AlphaFold,
0:07:07 that generalize across the whole of the proteome, the whole of the protein universe, the whole of chemistry,
0:07:14 which means that we have techniques and we can think of ways of doing drug design against these models
0:07:20 that work for, you know, one target, one protein, one disease over here. And exactly the same models,
0:07:25 the same techniques can work for another one over here. And so we get a very generalizable drug design
0:07:29 engine. And so then that changes even the DNA of the company.
0:07:35 Instead of just thinking about a single disease, even a single disease area, we really think about
0:07:39 how can we be solving all disease and how can we build a company around that?
0:07:48 Maybe one more assumption to talk about is this assumption that you need to continuously do wet lab
0:07:56 experiments to ground yourself in some measurable scientific reality. You know, all of current
0:08:03 conventional drug design relies on these design-make-test cycles, where as a chemist, you’re going to
0:08:11 hypothesize that, well, if I take this particular molecule, it might bind to a protein target, it might
0:08:16 have these properties, and then I’m going to make it and test that assumption. And most of the time,
0:08:20 I will learn that actually it doesn’t have those properties. You know, if we were really great at
0:08:27 predicting properties like that, well, this would be a solved problem right now. And so what we can do
0:08:35 when we have these general models, as Max says, that are as good as experiment, we can actually run many of
0:08:43 them in combination, and we can just take the output as truth, which is an incredibly liberating way to do
0:08:47 drug design. You know, our drug designers often talk about the fact that they can make bigger and
0:08:53 bigger and bolder changes to these molecules, which is just not how you operate conventionally. Since you
0:08:58 don’t have a lot of confidence in the predictions you’re making, conventionally, you make small changes
0:09:03 because you don’t want to destroy what you’ve already built, and you’re trying to eke out little incremental
0:09:12 changes one by one. Whereas when you can trust these models’ predictions, you just design freely. And this
0:09:19 has been really great and kind of leads us to our North Star as a company where we’re aiming to get our
0:09:25 models to the stage where we can run an entire drug design program in this single design round where we
0:09:32 basically continue designing in silico and only validate once at the very end using experiment.
0:09:37 Right. But here’s the crazy thing. So even if we create these amazing models that are perfectly
0:09:43 predictive of this experimental technique, so we have this perfect model, and we’ve had some
0:09:49 significant breakthroughs in this. Still, the space of potential drugs is something like 10 to the 60.
0:09:55 That’s such a huge space that even if you have this perfectly predictive model, you can’t exhaustively
0:10:02 search every single design. So you need something more even than just these predictive models. You need
0:10:10 actual generative models or ways to search agents that can actually explore chemical space and come up with
0:10:16 these designs to be able to navigate this massive 10 to the 60 space. And that’s completely different to the
0:10:22 way that traditional biotech or pharma would do this early hit discovery, where there what you’re doing is you’re
0:10:27 taking a library of molecules, you know, maybe it’s a million, maybe it’s 10 million. If you’re lucky, it’s even to the
0:10:33 billion scale, and you’re screening them experimentally. That barely chips away at the surface of that 10 to the 60 space.
0:10:38 And that’s really the difference in potential, that exploration of the full chemical space that we can start to do.
0:10:44 Can you describe or kind of explain what it means when you talk about biology as an information processing system?
0:10:49 Kind of unpack that, but then relative to your approach to model building?
0:10:58 Yeah. When we think about cells, you know, cells exist in this environment that throws information
0:11:05 via different chemical signals at them, throws different challenges, and the cell needs to be able to
0:11:10 process that and do something with it. You know, the cell is trying to survive and is trying to proliferate.
0:11:18 And so we see a lot of parallels between how cells act, and they use the tools that they have at their
0:11:24 disposal. The arsenal that the cell has at its disposal is really encoded in this genome. So anything a cell can
0:11:30 do is encoded in this genome, which then the cell makes proteins to actually do what it needs to do.
0:11:39 And so we see this as an ultimate opportunity for us to really model those cellular processes, to really
0:11:45 understand how the cell deals with its environment and be able to replicate some of this in a machine
0:11:51 learning model that would allow us to do that in silico in this AI analog of a cell.
0:11:57 Right. So when this is all realized, when the vision kind of, you know, down the road comes
0:12:03 to full fruition, what does it look like? What does our world look like in this new paradigm of drug
0:12:08 discovery that you’ve just been detailing? How does it affect the way that, you know, we take care of
0:12:14 ourselves, medicine, healthcare, preventative care, everything? What does the vision look like?
0:12:21 Well, let me try to paint one vision. And in my mind, it’s actually, there’s going to be stages of
0:12:27 this in some sense. And we’re at an early stage where we’re still trying to kind of shed some of the
0:12:32 older generation tools, you know, these non-scalable processes, as Rax just talked about,
0:12:39 and build a real technology approach. Technology is scalable. This is what it’s done to disrupt so many
0:12:44 industries. And so this first step is about how can we build these models that will bring scalability
0:12:51 to this that will bring these general approaches where I don’t have to have a whole army of
0:12:57 disease-specific scientists to focus on a particular disease, instead I have a general model that allows
0:13:03 me to do this. And so to me, you know, stage one is just how do we do this in a tech-forward way?
0:13:11 But further stages actually get us much closer to how can we do this in a way that will start convincing
0:13:19 regulatory bodies that these models are predicting things with such a high degree of accuracy that maybe
0:13:24 we don’t need to spend five or seven years in clinical trials in the future, because we can prove
0:13:32 mathematically that the molecule we design is going to work. And so that’s a really important next
0:13:38 stage, because I think it’s going to really change how the entire industry works. When we think about
0:13:46 this industry now, the FDA approves about 50 drugs per year on average. And so it’s been that number for
0:13:53 probably three decades. And so there’s a real limitation to how many drugs, even if we had, you know, incredibly
0:14:00 powerful models that would allow us to design many, many more drugs, we currently have these limitations.
0:14:05 And so being able to overcome that together as humanity is going to be a key, you know, a key
0:14:13 accomplishment. But what this opens up is this future of a precision medicine that we talk about often, but
0:14:21 actually are nowhere near today, where we should be able to very, very precisely diagnose what is going on with,
0:14:27 with somebody, you know, imagine they have a particular kind of cancer, there’s a million different types
0:14:32 of cancer, depending on the molecular signature, depending on the mutations within the genome of
0:14:38 the patient. And we should be able to then design a very bespoke combination of compounds that would be
0:14:44 best for that patient or for that very small group of patients. And so we need to follow kind of the stages
0:14:50 to get there. But to me, you know, that’s the next waypoint on the road. And then ideally,
0:14:57 the last waypoint is, rather than waiting for somebody to get sick so that we can cure them
0:15:03 better, we should be getting ahead of that disease. And so we should be actually trying to detect when a
0:15:09 certain biomarker in a healthy individual starts going in the wrong direction so that we can design
0:15:13 interventions that will prevent them from getting sick in the first place. And I think that will
0:15:14 completely change this industry.
0:15:20 Yeah, absolutely. The approach that you’re taking, I’m wondering kind of how the old guard,
0:15:24 you know, looks at you and kind of responds to this approach. And then also wondering how
0:15:30 it shapes recruiting and team building. And if when you’re, you know, bringing people onto the team,
0:15:36 how much of an open mindset, a willingness to look through the glass the other way, what have you,
0:15:41 how much does that go into your approach to building out your team?
0:15:47 Yeah. So AI for drug design, we’ve actually seen maybe a first wave of companies and pharma companies
0:15:52 actually start to dip their toes into this space for maybe five, six, seven years now.
0:15:58 But I think that’s a slightly different wave to what we’ve been building. This first wave of AI for
0:16:03 drug design has really been, you know, how do we just use some of these machine learning tools
0:16:10 in the traditional drug design processes? And that often, you know, comes out as building these local
0:16:14 models where you’re, you know, you’re doing drug design around a particular target, you’ve got some
0:16:20 data and you’re fitting small local models to that to help inform you on that next experimental design,
0:16:27 make tests around. And that’s a very, very different paradigm to what we have been building, which is,
0:16:32 let’s create very general models, models that we actually can apply to any different part of space.
0:16:38 And, you know, a lot of people, I think, at least when we were starting out, didn’t think that was
0:16:44 probably the right way to go. But what we’ve just seen again and again is we can actually build these
0:16:49 general models, not just the alpha folds of this world, but many other sort of predictive capabilities,
0:16:55 generative capabilities. We see the potential to apply these on literally the hardest problems in
0:17:00 the industry. So, for example, our collaboration with Novartis, I think it’s no secret that they’ve
0:17:05 thrown some of the hardest targets in the industry to us. These are sort of drug design problems that
0:17:08 people have spent 10 years there. Can you give an example?
0:17:13 Can you give an example? We can’t give an exact example at the moment, but these are genuinely targets
0:17:17 they’ve been working on for over 10 years, not making progress. So they’re targets that chemists
0:17:23 come up to our leads and say, “This is impossible. Don’t try it.” And at the same time, we see that
0:17:30 in just months we’re able to make traction on this, creating novel chemical matter, finding novel ways to
0:17:36 modulate these targets, which are literally blowing the minds of these chemists. So I think there’s still
0:17:42 a long road for the whole of the industry to really fully understand what’s happening here.
0:17:50 This understanding is happening. The way I see it, in five years’ time, AI for drug design is going to be
0:17:55 across the entire industry. Doing drug design without AI is going to be like trying to do any type of
0:18:02 science without maths. It’s just going to be a fundamental part of science, and particularly this science.
0:18:09 In listening to the two of you talk about it, it sounds so simple in a way, but in a way that brings all
0:18:13 the power that you’re talking about behind it. But I would imagine if you’ve been doing it the other
0:18:19 way for, you know, a 50-year career or what have you, I don’t know, just the mental resistance to it might
0:18:25 be an issue. Yeah. And don’t get me wrong. It sounds really easy to say these things. In reality, these are
0:18:29 really, really hard modeling problems. Oh, 100%. Yes. I can’t even imagine.
0:18:33 These are holy grail modeling problems, things that people have been working on for decades.
0:18:38 Right. But I mean, I’m imagining just the resistance that people have to change new ways of doing things.
0:18:43 And if you’ve been, you know, you and your team have collectively been pounding on these problems,
0:18:47 and then there’s this new technology that solves it in such a short time.
0:18:55 There’s a healthy degree of skepticism going all around, I want to say. I think actually it can be
0:19:02 really challenging if you’re inside one of these pharma companies, because they have coalesced on a
0:19:08 particular structure as a result of how you would normally do drug design. So they will organize themselves
0:19:14 often by disease. You’ll have a whole area that is just focusing on oncology and another that is
0:19:21 focusing on ophthalmology. And so you have these whole structures and the types of models that we’re
0:19:28 creating, they span across all of this. And so imagine trying to nucleate this kind of initiative inside a
0:19:34 company that is organized by these disease areas. Where would you put them? And if you put them outside,
0:19:40 how would they actually permeate that structure and be able to institute change? So I think there’s
0:19:48 significant barriers within big pharma itself, which, you know, in a way is great for ISO. But what’s been
0:19:55 really heartening actually is seeing some of the transformation in the eyes of some of the chemists
0:20:00 and biologists that have joined ISO itself, many of who have also joined with a healthy degree of
0:20:09 skepticism. But over time, we’ve really convinced with data, with proof of the working of these models,
0:20:15 to really embrace these. And we have something that we call the ISO way, which is essentially,
0:20:22 how do you do drug design in this AI first approach? And all of our chemists are part of
0:20:29 this wave of how do we actually invent this ISO way together with all of the scientists and engineers
0:20:34 that are building the technology. And this has been really transformative. You know, in some sense,
0:20:41 I have this kind of old school Amazon principle of working from your customer backwards, where you want
0:20:46 to really understand who your customer is, and then you want to work backwards from that. And that will guide
0:20:53 you in your product design. And we have this dream situation here where actually the customer are our
0:21:01 peers, our buddies that are working together with us at Isomorphic Labs. And so at ISO, by contrast,
0:21:06 by construction, we have actually meshed all of our teams. You know, all of our teams are sitting
0:21:13 together in common spaces. It’s not like, you know, a technology team sits over here, a chemistry team sits
0:21:18 over here. We’re sitting interspersed, and we’re working very collaboratively on these projects.
0:21:25 And that cuts both ways in the sense that we’re getting incredible value and insight from these,
0:21:30 you know, long-term career domain experts that are helping us make sure that our models are really
0:21:37 grounded in the knowledge of their trade. But similarly, when we’re doing drug discovery projects,
0:21:41 we are doing that in this tech forward way. And so I feel like this melting pot
0:21:47 has been really amazing in transforming how chemists do their jobs and are, you know,
0:21:54 helping evolve this next generation of chemistry that is doing drug design in a completely new way.
0:21:56 And it’s being invented at ISO on a daily basis.
0:22:03 Our guests are Max Yatterberg and Sergei Yaknin from Isomorphic Labs. And we’re talking about their
0:22:09 really revolutionary approach to drug design, building world models and kind of taking a tech forward
0:22:13 approach. But Sergei, as you were saying, the collaboration, you know, really at the core of it.
0:22:19 But I want to ask you about the technology and ask you about AlphaFold. AlphaFold 3 is the current
0:22:25 version. And in my understanding, it was a big breakthrough in predicting biomolecular structures.
0:22:31 Can you talk about and even go back to the beginning and kind of maybe just briefly explain
0:22:36 what AlphaFold is and then talk about how we got to AlphaFold 3 and how important it is?
0:22:42 Yes. So maybe going back in time, AlphaFold actually started as a hackathon project in DeepMind.
0:22:46 You know, this was like a two week hackathon project. Can we throw a conf net on protein structure
0:22:52 prediction? And crazily, there was like signs of life there. And that snowballed into AlphaFold,
0:22:58 the project. AlphaFold 1 was a big step up in terms of accuracy, but AlphaFold 2, this was in 2020,
0:23:04 was that first moment that people started to see experimental level accuracy of protein structure
0:23:10 prediction from a neural network. And ultimately, AlphaFold 2 went on to win the Nobel Prize just last
0:23:16 year. But AlphaFold 2 just predicts the structure of proteins and proteins coming into contact with
0:23:22 other proteins. But there’s lots of other different types of biomolecules in addition to proteins that are
0:23:27 particularly very important when we think about designing drugs. So proteins are part of these molecular
0:23:34 machines that work by interacting with other proteins, but also things like DNA, RNA, small molecules.
0:23:38 And these small molecules could be things like, you know, something like caffeine that you consume,
0:23:43 or it could be drugs that we consume as well. And so really, we want to create a drug. And what
0:23:49 is a drug? A drug is something that comes in and modulates these molecular machines. And so we want
0:23:55 to actually design that really rationally, we want to be able to understand the structure of this protein
0:24:00 with these small molecules, maybe also together with DNA as these molecular machines form. And so that meant
0:24:06 we needed a completely new capability beyond AlphaFold 2. And that led us to the creation, you know,
0:24:13 this was a piece of work with Isomorphic Labs and Google DeepMind of AlphaFold 3. And this was a big
0:24:19 breakthrough for us. This came out last year. And this was the first time that we could predict the
0:24:25 structure of all of these molecules coming together at unprecedented accuracy. And this now is the thing
0:24:32 that, together with other models, allows our chemists to make changes to these molecule designs,
0:24:37 and literally, in a second, see the result of that. That’s a completely different way of working,
0:24:42 where traditionally, if you made a change to a molecule design, and you wanted to see how that changed
0:24:46 the structure, it would take literally months to get that structure back at best.
0:24:47 Yeah.
0:24:50 So that completely changes the game for our chemists.
0:24:54 Absolutely. Sure. And I didn’t want to interrupt you, but I don’t want to undersell the fact that
0:24:57 AlphaFold, it was AlphaFold 3 that won the Nobel Prize?
0:24:59 AlphaFold 2 won the Nobel Prize.
0:25:03 Two, Mike, excuse me. Yes. Okay. I didn’t want to cut in, but I didn’t want to let that go by.
0:25:10 So are you now building additional models? Are you tuning AlphaFold? What’s the process like from here?
0:25:17 Yeah. We go quite a bit beyond the capabilities of AlphaFold 3. If one thinks about the overall
0:25:23 drug design problem, you need to solve quite a lot of challenges on the way to making a molecule that’s
0:25:31 going to be in the pill that you’re going to buy from the drugstore. And this has to do with ascertaining
0:25:37 not only how is the molecule going to interact with its intended target, but actually how it’s going to
0:25:42 behave in the body. You know, we want these molecules to both make their way to where they
0:25:49 need to. We want them to stay around for as long as they need to, and we want them to safely break down
0:25:51 and exit the body. Right.
0:25:58 And so when we think about drug design, we need to solve this whole series of challenges and solve
0:26:03 them simultaneously. And so when we think about the different models that we need to build, we start
0:26:09 from this incredible structure prediction problem, and then we go on to solving other problems, such as
0:26:17 predicting if my potential drug binds to this target, how strong will it bind? I want it to bind strongly,
0:26:24 but not too strongly, potentially not forever. Right. And then I want it to be able to go inside a cell,
0:26:31 and I want it to be able to exit when it needs. And so all of our models are basically solving this whole
0:26:37 wide variety of problems. And then a key challenge for us is to actually how to make all of them work
0:26:43 in concept altogether so that we can have this holistic drug design engine that allows us to do
0:26:49 this end to end. How big of a part does infrastructure and compute for that matter play in this whole
0:26:56 process? And, you know, along the way, how is your access to the right kinds of infrastructure and compute,
0:27:01 you know, help you advance, I’m assuming faster than you would have otherwise, but you know,
0:27:05 help the whole process evolve? Like, can you talk a little bit about the importance of that infrastructure?
0:27:10 Yeah, it makes a huge difference, to be honest with you. And in some sense,
0:27:16 this really sets the stage for how ambitious you can be as a company. You know, in the current world
0:27:25 of AI being everywhere, there’s huge contention for resources, for graphics accelerators. And if
0:27:31 you’re not able to really have access to or be effective at commandeering these large machine
0:27:38 learning fleets, you’re at best going to be able to make use of models somebody else created. And you
0:27:44 may be able to adjust these models to your particular use case, but you’re not going to be able to do
0:27:52 foundational research. And what Isomorphic Labs really prides itself on is actually really moving
0:27:58 the needle on this foundational AI research for drug discovery. We’re able to create completely new
0:28:05 methods that are general and that are scalable. And this is predicated on having access to massive amounts
0:28:11 of infrastructure. And so we’re very lucky to be partnering with Google Cloud on this. And we work
0:28:17 very closely with them on sort of hosting our entire platform, our machine learning research and kind of
0:28:24 production inference platform. And so this is a huge part of what we do. You know, how do we make sure that
0:28:30 this kind of massive machine learning research and engineering team can have the least amount of
0:28:36 resistance, the highest research velocity possible so that they can try lots and lots of ideas because,
0:28:41 you know, ML research is a very empirical science. We end up trying and training models over and over
0:28:46 again. And so access to this hardware and the ability to do it efficiently, it’s really, really important.
0:28:46 Of course.
0:28:52 And one of the really exciting things is to start seeing that actually we’re seeing, you know, really
0:28:58 analogous scaling laws in our model development. The sort of things you might see in the LLM space we’re
0:29:04 starting to see in modeling biomolecular systems and understanding this biochemical world. And that’s
0:29:10 really, really exciting. Both actually on the, you know, training side is just how large our models
0:29:15 should be, but also really exciting on the inference side as well, how we can scale with inference time
0:29:19 compute. So there’s a huge amount of opportunity to actually scale the accuracy and the efficacy of
0:29:24 these models with the amount of compute. Right. Right. And what about data? Data wrangling,
0:29:30 synthetic data? What’s, what’s your approach to data? And I don’t know, best practices maybe?
0:29:36 Yeah, gosh, we could just spend an entire podcast talking about data, but let’s make a start at least.
0:29:43 Data is very important for, for any machine learning problem. It is especially important in science and it
0:29:49 is especially difficult to work with in science for the simple reason that you can’t just eyeball it.
0:29:54 and verify yourself as a human that it makes sense, that it’s any good. Right.
0:30:01 You know, we’re very used to data in the traditional natural language processing or imaging space where
0:30:07 I can look at an image that, you know, is generated by a model and be like, no, that’s garbage. Right.
0:30:13 Oh yeah, that’s really great. You cannot look at a structure unless it’s really terrible and be
0:30:18 able to say, oh yeah, that’s a, that’s a great structure. And so we need to be very careful with
0:30:24 how we work with this type of data. And well, there’s never enough of it. Right.
0:30:29 Truth be told there’s some great resources, of course, that are publicly available protein data
0:30:34 bank made alpha fold possible. You know, the amazing thing about that is that’s a resource that’s been
0:30:40 available to scientists for decades. And, you know, there’s a real proof point than that, that actually
0:30:47 with the right algorithmic advancements, you can create incredibly new, powerful systems using data.
0:30:52 Everybody has had access to forever. And so that’s really, really important. But when we think about
0:30:58 the entire drug design landscape and the complexity of the systems that we need to model, if you think
0:31:04 about, you know, there’s this kind of stack of different problems, and we’ve been talking so far
0:31:10 a lot about the molecular modeling, the chemistry side of this, but diseases experienced by humans.
0:31:15 And so when we go up the stack, we reason not just about molecules, but about cells and not just
0:31:23 about tissues, but organs and then humans, and then even how humans interact with the environment around
0:31:28 them. And so if we want to solve this problem holistically, we need to be thinking about data
0:31:35 at each layer of the stack. And those data sets simply don’t, don’t exist today. And so a big problem
0:31:41 that we need to contend with is how can we create the right data sets and be very, very principled about
0:31:48 it. There is, you know, a bit of a meme out there around the fact that maybe pharma has the data and
0:31:55 haven’t shared, you know, enough. And we will always welcome more sharing from our partners or from other
0:32:02 organizations. But the truth is a lot of that data is not machine learning ready in the sense that it has
0:32:09 been generated as part of past projects that were drug discovery projects or other types of scientific
0:32:15 projects. And that data does not have the necessary diversity to actually train the types of general
0:32:21 models that we’re interested in creating. And so while that data might be interesting for other purposes,
0:32:26 it might be interesting in improving performance on a particular subspace of the problem. Most of the
0:32:33 time it’s not going to be helpful or as helpful as one might hope for these general models. And so
0:32:39 something that we contend with at Isomorphic Labs is how do we generate these data sets in a principled
0:32:45 manner that will enable continued progress of the models. The great thing about it is we can lean
0:32:51 on our current generation models to help advise us where to look in this massive data space,
0:32:58 where to generate new data sets. And so we have this really virtuous cycle. If you have already an excellent
0:33:03 state-of-the-art model, you’re going to get great state-of-the-art advice on where to focus next.
0:33:03 Right, right, right.
0:33:09 And so this helps us continue actually improving on the edge that we have by continuing to generate data,
0:33:13 train new versions of the model. And this is a really virtuous cycle.
0:33:17 Yeah. You’ve talked about this a little bit in the conversation, but kind of to put a point on it,
0:33:25 the future of drug discovery. Is it AI end to end? Is it, what is it? I shouldn’t be positing things,
0:33:30 I should ask you guys. What do you see down the road and how will it ultimately, and you talked
0:33:34 about this a little bit, I think, Sergey, you were talking about preventative care and precision
0:33:41 medicine. What’s the, I don’t know, what’s the end game for humanity? What is it that this new paradigm
0:33:46 of drug discovery could ultimately help humans become? When I think about where we’re going,
0:33:52 you know, it really is this North Star of being able to do all of drug design on a computer, what we
0:33:54 call in silico. Yeah.
0:33:59 And to actually remove that experimental bottleneck that currently exists. Anytime you have to go out
0:34:05 into the real world, actually make some molecules and then test it out, this just adds so much overhead,
0:34:14 so much time. And ultimately we do see a track to genuinely solving a lot of these problems,
0:34:19 getting to experimental accuracy for some of these models, creating agents and generative models that
0:34:26 can explore this space. Where this leads is, okay, we, you know, we can get any target coming our way,
0:34:32 any protein that we want to start modulating, and we can very, very quickly come up with chemical
0:34:38 matter, that if we did go to a lab, we’d see positive results there. And, you know, near term,
0:34:44 as we go along that path, what that does is it means that it opens up new targets. So these sort
0:34:50 of age old targets or industry, you know, really, really challenging things that people, that people
0:34:54 use the term intractable, we try and not use that term, just call them challenging.
0:34:54 Challenging.
0:34:59 But, um, yeah, you know, opens up disease areas, opens up targets that previously people
0:35:01 couldn’t approach, but now we can.
0:35:02 Yeah.
0:35:06 And we start seeing that. When you’re doing more and more of this work in silico, on a computer,
0:35:11 without going into the lab, and you’re reducing those design-make test cycles, you’re doing things
0:35:12 faster. Yes.
0:35:17 And so that means, you know, just the time it takes from, hey, we want to go after this, or we really see
0:35:21 that there’s this patient need to actually starting to get into patients starts to reduce.
0:35:22 Right.
0:35:27 And then also, as we understand much more about how these molecules work, how they interact with these
0:35:32 targets, how these targets themselves interact with the whole network, all of these pathways
0:35:36 that constitute disease, how these molecules interact with the body, where they get taken up,
0:35:40 how they get broken down. We know so much more about how this works.
0:35:40 Yeah.
0:35:42 The safety profiles of this.
0:35:42 Right.
0:35:46 That when we do go into people and start testing these molecules out, they’re much safer.
0:35:47 Sure.
0:35:48 You know, they have higher efficacy.
0:35:49 Mm-hmm.
0:35:54 And that actually changes quite radically the economics, as we talked about a little bit
0:35:55 before. Yeah.
0:35:59 You know, if you’re, you know, go safer molecules, they have a higher chance of having efficacy,
0:36:03 then your failure rates are going down. And the ultimate, like, amount of you have to invest
0:36:03 as a company.
0:36:04 Yeah, right, right.
0:36:05 Drastically reduces.
0:36:05 Yeah.
0:36:09 And then that changes the economics of producing these molecules and R&D in this space.
0:36:15 So you can think about getting molecules to patients from, you know, potentially much lower
0:36:20 cost. That can even open up new indications, things that traditionally wouldn’t be commercially
0:36:24 viable to go after, where the patient populations are too small or the end market is too small
0:36:26 for traditional pharma to go after. Right.
0:36:26 Right.
0:36:29 But with this new technology, we actually can open up these markets.
0:36:31 So that’s a really exciting spot to get to.
0:36:32 That’s amazing. Yeah, yeah.
0:36:39 Maybe just to add, I feel like it’s really important for us to be as ambitious as possible
0:36:45 on this endeavor. And something that’s really unique about ISO labs is this really huge ambition
0:36:51 for what we could get to as an end game. But it’s also important for us to not get ahead of our skis
0:36:59 and sort of be completely disconnected from reality. Where we stand today is there’s no currently AI design
0:37:06 drug that has been approved. And so I think there’s a lot more to prove. And one of the things I appreciate
0:37:12 the most is we’re not just there to create these technologies in isolation. We’re there to make the
0:37:20 drugs and to prove stage by stage that this technology is working and that it’s making a difference. And so to me,
0:37:26 this balance between having huge ambition that points us directionally, you know, towards the future,
0:37:33 but actually being very execution focused so that we can improve on every program, the key parameters
0:37:37 is going to be really necessary to get us there in the long term.
0:37:42 Right, right. That’s amazing work you guys are doing. For folks listening who want to find out more,
0:37:48 about isomorphic labs, I assume the website, is there a technical blog, social channels,
0:37:53 where can we direct people to go online? I think the website is a great starting spot. We’ve got some,
0:37:58 you know, great articles on there, including some of our releases, interviews with people as well.
0:38:03 On our socials, you know, we’ll have podcasts like this and yeah, other material coming out.
0:38:06 Right. If you want to really geek out, read the AlphaFold3 paper.
0:38:08 Right. Including the appendix.
0:38:12 Exactly. That’s where all the secrets are, is in the appendix.
0:38:13 That’s where all the GC goods are.
0:38:18 Excellent. Max, Sergey, thank you so much for taking time out of GTC to talk with us.
0:38:22 Just incredible work you guys are doing and really kind of a pleasure for me to listen
0:38:25 to you talk about it for the audience, I’m sure. And it goes without saying,
0:38:29 but all the best of luck and fortune as you go forward with Isomorphic Labs.
0:38:31 Thank you, Nolan. It’s been a real pleasure.
0:38:42 Yeah, thank you so much.
0:38:54 Thank you.
0:38:55 Thank you.
0:39:25 Thank you.
Max Jaderberg and Sergei Yakneen from Isomorphic Labs discuss how AI is enhancing drug discovery by treating biology as an information processing system. They share how advanced AI models like AlphaFold 3 are accelerating the pipeline and paving the way for a future of precision medicine.



Leave a Reply
You must be logged in to post a comment.