ElevenLabs CEO: Why Voice is the Next AI Interface

AI transcript
0:00:04 We don’t want to become same as previous generation of the editing suites.
0:00:08 So instead, let’s solve it on the research level where it will know based on the voice
0:00:09 exactly how it should speak with the speed.
0:00:13 To be able to cater to all those different use cases, you need such a big array of different voices,
0:00:17 different languages, different accents, different styles.
0:00:22 So we launched the voice marketplace where you could create your voice and then share it.
0:00:25 And when the voice is shared, you earn money in the return.
0:00:27 Today we have almost 10,000 voices.
0:00:30 We paid $10 million back to the people in the community.
0:00:32 There are some crazy stories from the voices.
0:00:36 Just speaking through exactly the technology, showing the examples,
0:00:40 and kind of avoiding this initial knee-jerk reaction that AI is bad has been made.
0:00:43 Today, we’ll hear from David Sachs, Mark Andreessen, and Ben Horowitz,
0:00:46 how what it takes for America to stay ahead for human computer interaction.
0:00:52 Today, you’ll hear from Maddie Stanishevski, co-founder and CEO of 11 labs of building at lightning speed,
0:00:57 and how the U.S. can lead on energy, chips, and only licensed AI using to real-time voice regulation.
0:01:02 As well as how small autonomous teams and global hiring power the company’s product velocity.
0:01:07 We discuss the ethics of AI audio, the voice marketplace paying creators over $10 million,
0:01:11 and the shift from creator brand to enterprise platform.
0:01:14 Plus, why speed is the moat in the race to define the future of sound.
0:01:16 Let’s get into it.
0:01:23 I’m excited to welcome our first speaker, Maddie, co-founder and CEO of 11 Labs.
0:01:35 Alright, so good to have you here, Maddie.
0:01:39 Thanks so much for having me here. Great to see everyone, and good morning.
0:01:42 That was the walk-home music generated by 11 Labs, was it?
0:01:47 It was. We expand continuously across the audio space, so we started with voices,
0:01:53 then created orchestration of how to build voice agents, and now also create a fully licensed music model.
0:01:56 So, can produce amazing music to go alongside of it.
0:01:58 Awesome. Well, talk all about that.
0:02:03 I’ve had the opportunity, and also the luck to get to know from the very early days,
0:02:07 when 11 Labs got started and got to partner over the last three years
0:02:15 to just see your execution everywhere from product launches to shipping new lines and models,
0:02:19 like you just mentioned, everything from text-to-speech models, speech-to-text,
0:02:24 and then we started doing music, sound effects, and now the AI agent platform.
0:02:25 I’m very curious.
0:02:29 First, I’m still in awe of the shipping speed after all the three years,
0:02:34 but I want to ask, how do you actually maintain both the speed and quality
0:02:36 when you have such an expensive product roadmap?
0:02:40 So, first of all, we partnered almost three years ago,
0:02:43 and so it’s great to hear all the kind notes,
0:02:47 but also, they didn’t realize when we partnered, the infrastructure team was three people,
0:02:50 and of course, now I’m 11 Labs founder, we love number 11,
0:02:53 and the company infra team is 11 people.
0:02:56 So, we’ve seen the growth of the other side as well,
0:03:01 and I hear that the companies here raised $66 billion in the total fundraising,
0:03:03 so the number 11 is everywhere here.
0:03:06 But I think that, to start off, I think, first piece,
0:03:09 I think the smartest person I got to know as my co-founder, Piotr,
0:03:12 who has been the research brain for creating a lot of the models
0:03:16 and then being able to assemble what we think are the most incredible researchers in the voice space
0:03:19 to really create the first text-to-speech model
0:03:21 that could understand the context in a better way
0:03:24 and turn that into the emotion, intonation,
0:03:27 then find a way to capture the characteristics of the voice
0:03:30 so you have the voice sound with the right style,
0:03:35 with the right age, with the right gender, dialect, everything in one.
0:03:39 And then the researchers across, of course, now expanded that to speech-to-text, music, and other work.
0:03:40 So, that’s our foundation.
0:03:43 And then the way we structure it to be able to ship quickly,
0:03:45 especially with so many things happening in AI space,
0:03:47 it’s a lot of small teams.
0:03:51 So, today we have roughly 20 product teams, each of 5 to 10 people’s size,
0:03:56 which with full independence can go ahead and ship products.
0:04:00 Of course, that carries some of the sometimes issues of duplicative work
0:04:03 or sometimes people going at different speeds,
0:04:07 but at the positive end, the ownership of each of the teams is extremely high.
0:04:10 So, people know that this is down to them to really deliver and ship.
0:04:13 And it allows us to move extremely quickly.
0:04:15 We bucket our work into creative space,
0:04:18 so creative platform where we help with narrations, voiceovers,
0:04:22 dubs for creatives and creatives in the media entertainment space.
0:04:26 And then on the agents side, where we help people recreate voice agent experience,
0:04:28 conversational agent experience, across customer experience,
0:04:31 all the way through to immersive media.
0:04:31 Great.
0:04:35 Eleven Labs has labs in the name, very similar to many of the other big labs,
0:04:39 which means you’re doing your first-party R&D and model development,
0:04:41 but also building all these 20 products.
0:04:44 How do you think about balancing both?
0:04:47 Like, keep progressing on the model research,
0:04:49 but at the same time not delaying sort of the product launches?
0:04:51 It’s very tricky.
0:04:53 I’m sure many of you have the same thing.
0:04:56 Like, do you build a product when you don’t know
0:05:00 if the research innovation will displace the product you just built?
0:05:01 We had this in the early days, too.
0:05:05 So one of the simple examples was we had a model at work,
0:05:06 and one of the most common requests was,
0:05:08 could we do a different speed for voices?
0:05:12 So could you have additional slider to modify the speed
0:05:14 of how audio gets generated and how quickly it speaks?
0:05:17 And we are very against this idea of,
0:05:19 no, we don’t want to do any sliders, any toggles.
0:05:23 We don’t want to become same as previous generation of the editing suites.
0:05:25 So instead, let’s solve it on the research level
0:05:27 where it will know, based on the voice,
0:05:29 exactly how it should speak with the speed.
0:05:32 And we resisted this for, I think, a good amount of nine months,
0:05:34 and we couldn’t solve it on the research side.
0:05:38 And then the product was super simple solved that got all the users across.
0:05:40 And now the approach we take in looking at this is,
0:05:43 if we think the research work will take more than three months,
0:05:46 then the product can do anything they want
0:05:49 to start adding other models, adding some of the extensions.
0:05:51 Of course, sometimes the timeline is tricky to predict,
0:05:54 but roughly the guidance we have from our internal research team,
0:05:57 what are the initiatives we hope to ship this quarter,
0:05:58 what are long-term initiatives,
0:06:00 and then for anything long-term,
0:06:04 you can use any other work to close that gap and make it better.
0:06:08 I guess, first, you kind of have to figure out if the research commitment
0:06:14 is going to meet the timeline first and then go on to align with the product teams.
0:06:15 That makes a lot of sense.
0:06:22 As everyone is moving to San Francisco and building in person and locked in in the same space,
0:06:27 Eleven has always been building globally and having people more distributed.
0:06:32 But you now have centers, I guess, in different locations from London, Warsaw,
0:06:34 San Francisco to New York and other places.
0:06:40 How do you think about building this global expansion and finding talent globally
0:06:44 versus, I guess, the trade-offs of building in the same place?
0:06:48 Yeah, so me and my co-founder of Polish, we started between Warsaw and London at the time,
0:06:52 and I think Eleven Labs wouldn’t have existed if we weren’t starting from Europe.
0:06:56 It’s a very peculiar thing, but in Poland, if you watch a movie in Polish language,
0:07:01 like a foreign movie in Polish language, all the voices, whether that’s a male voice or a female voice,
0:07:03 get narrated with one single character.
0:07:05 No emotions, no intonation.
0:07:07 As you can imagine, it’s pretty terrible,
0:07:10 and it’s still happening today for most of the content out there.
0:07:17 I’ve had a similar experience growing up in China that we have a lot of Western movies dubbed in Chinese monotone.
0:07:19 So bad, so bad.
0:07:23 And it’s like in Poland, of course, post-communist country, it’s a cheaper way to do it.
0:07:24 You don’t have to hire as many people.
0:07:27 You have one monotone audiobook reading of a movie.
0:07:30 And that was kind of where the company started.
0:07:36 And we started initially in Europe, and we realized that if we wanted the best people
0:07:40 to solve what was a research problem at the time, we need to hire wherever they are.
0:07:45 And we couldn’t lock ourselves to just San Francisco or look at the West Coast.
0:07:50 We knew that we need to find them across Europe, across Asia, and bring them into the company.
0:07:52 So we started fully remote and started looking at those people.
0:07:58 And then on engineering, we also were very against this traditional hiring method of looking at LinkedIn,
0:08:01 looking at traditional background, and trying to figure out,
0:08:04 could we go and figure out a different method to hire people?
0:08:06 That led to some very interesting hires.
0:08:10 So we hired a person that had incredible open source text-to-speech model
0:08:16 and was working in the call center at the same time as a recipient of the calls to make money.
0:08:19 And he’s now on the team, one of the most brilliant researchers we have
0:08:20 doing all the data processing.
0:08:22 But the same pattern kind of followed.
0:08:24 And of course, the early team was very distributed.
0:08:28 And then as we started scaling, so beyond 30 people,
0:08:30 we realized that the new people joining,
0:08:34 there’s benefit of them having a space to be next to others,
0:08:36 to get deeper into the culture,
0:08:38 understand what are all the projects that are happening in the company.
0:08:42 So we started the hubs where you can go into London and Warsaw and San Francisco,
0:08:44 where you can work with others in person.
0:08:46 And that’s how we try to marry those two.
0:08:48 If you are early in your career,
0:08:51 we try to hire you in the hub so you can immerse yourself in the company.
0:08:53 If you are used to remote work, completely fine.
0:08:56 But then if you want, you can always come and join us in the hub.
0:08:57 And that worked really well.
0:09:02 Currently, we continue hiring very untraditional backgrounds
0:09:03 in some of the places of the company.
0:09:06 And then fusing that with very traditional backgrounds,
0:09:07 which can teach the others.
0:09:10 And in sales, for example, we’ve done some of those experiments too,
0:09:12 where that combination worked really well.
0:09:15 The lesson is you can really find talent everywhere.
0:09:18 It’s just how hard and how you look for them.
0:09:21 And I think in Europe also, this was an interesting one.
0:09:24 In the US, people are very keen and excited to work.
0:09:28 And if you go for any social event, you want to talk about work.
0:09:30 And in Europe, I didn’t have this feeling where it’s like
0:09:31 most people don’t want to do that.
0:09:33 It’s like the cultural piece is different.
0:09:36 But then you do have the pockets of people that actually strive it too.
0:09:38 They just don’t have the companies where they could do that then.
0:09:44 So I feel like our team from Europe is the most motivated and passionate set of people
0:09:45 that we are lucky to have.
0:09:48 Yeah, I can attest to that, given I’ve met some of them.
0:09:51 Very hardcore, very good work ethic, for sure.
0:09:54 And you have also maintained a pretty flat work structure
0:09:58 and have people own quite laterally a lot of responsibilities.
0:10:01 Can you talk about the rationale behind that?
0:10:03 And I guess there was also no title policy.
0:10:06 Yeah, so we removed titles a year ago.
0:10:07 And it’s going well.
0:10:08 It still works.
0:10:12 And I do think that, but I thought a lot of AI companies kind of do it too already
0:10:15 with member of technical staff being like the usual piece you have for engineering.
0:10:18 And then in a lot of the go-to-market, you are just go-to-market,
0:10:20 not VP of sales or other roles.
0:10:23 So I think it’s actually a pretty common pattern.
0:10:29 But in our case, we had a small team approach where you have extremely small amount of people,
0:10:30 usually the five to 10.
0:10:38 And we wanted to make it very clear that every team, we create those teams.
0:10:39 You have six months to prove it.
0:10:42 If it’s proven, the team will stay and continue working.
0:10:46 But it really is that the moment you join, you can have any impact on the company.
0:10:48 So you can have any role in that team.
0:10:51 The tenure will not define your position in the hierarchy.
0:10:56 If you are smart and quick and passionate, you can elevate yourself very quickly,
0:10:58 which this really helped.
0:11:03 And also, it’s a common layer to the external world,
0:11:06 where everybody looking at 11 Labs knows that we are the go-to-market team,
0:11:07 is go-to-market team.
0:11:11 There’s no positioning to the same extent.
0:11:15 What this allows us to do is I think when we speak with a lot of our partners,
0:11:20 with a lot of our customers, they also know that they are getting the best people
0:11:25 always and we can also send people to different conferences,
0:11:27 different events, regardless of that positioning.
0:11:31 I think the tricky thing in the flat structure is not only positives.
0:11:37 In the way we currently have, it’s a set of leads effectively for the subdivisions.
0:11:44 So the research, creative work, agents work, go-to-market, self-serve, and sales-led, and of course, Ops.
0:11:48 Only that’s the layer of leads.
0:11:52 And then under that, there’s pretty flat, small-team approach across the world.
0:11:57 But then you really want the leads to be able to carry the complexity around the team.
0:12:02 So suggest things between one team to another, if they see that there’s something valuable between them happening.
0:12:07 So I think picking those people that can context switch between is super important,
0:12:10 and then letting the team fully focus on that.
0:12:18 And then having, which was interesting learning, where if you put a person into all the Slack channels
0:12:21 and give them transparency, they actually get frequently distracted
0:12:22 because then they read all the messages.
0:12:26 You can still choose not to read them, but they still do.
0:12:30 So you kind of need to cut the access to a lot of those pieces to force the attention,
0:12:31 and that kind of works.
0:12:33 All those small things work really well.
0:12:36 Maybe we can borrow some of that lesson too.
0:12:39 Let’s switch in gear a little bit.
0:12:43 You’re on the front line seeing a lot of the creative work,
0:12:48 whether it’s from art, music, or advertising that are starting to adopt.
0:12:49 AI tools.
0:12:51 And in the beginning, that was not the case.
0:12:52 There was a lot of resistance.
0:12:59 And now we’re just seeing the adaptation and the welcoming of using more of the generative AI tools,
0:13:00 including AI audio.
0:13:06 And you have done some really smart things from the marketplace payouts
0:13:09 to working with these creative industries since day one, actually.
0:13:13 I remember how much you stress we have to find a way to work with them
0:13:17 and observing market shift over time.
0:13:22 So the question is, how do you actually adapt to these changes
0:13:27 and find the ways to work with the industry in the infancy, in the beginning?
0:13:31 And how did you navigate some of the challenges in that?
0:13:38 So I think the first piece is actually spending time with the industry
0:13:41 and trying to understand what are their priorities, their incentives.
0:13:43 Of course, it’s sometimes tricky.
0:13:45 Sometimes you then end up being starstruck.
0:13:50 In fact, we had an honor and pleasure to work with Jared on some of his incredible work
0:13:57 and learn from him on what is important and which parts of the production process you can actually use AI,
0:14:01 which ones you want to keep, where is it actually helpful.
0:14:07 And so I think that’s the super important thesis across all the partnerships in the space.
0:14:12 In our case, we tried to figure out how to do that on the voice space,
0:14:14 which is, of course, with that technology,
0:14:19 A, how will the voice acting space look like in the future?
0:14:23 And then two, of course, to be able to cater to all those different use cases,
0:14:29 you need such a big array of different voices, different languages, different accents, different styles.
0:14:36 So we launched the voice marketplace where you could create your voice and then share it.
0:14:39 And when the voice is shared, you earn money in the return.
0:14:40 Today, we have almost 10,000 voices.
0:14:43 We paid $10 million back to the people in the community.
0:14:45 There are some crazy stories from the voices.
0:14:50 One of our first voices will say a deep Spanish voice.
0:14:55 And the magic of the technology is that the same voice now is available on all different languages in the same way.
0:14:58 So it’s 30 different languages at the time.
0:15:00 Now it’s 70, but 30 languages at the time.
0:15:01 And we had the Spanish voice join us.
0:15:04 And it wasn’t picking up on the Spain.
0:15:05 Nobody really liked it as much.
0:15:10 And then it picked up in an English-speaking country, that same voice, that deepness.
0:15:13 And now it’s our top three voice for all the use cases.
0:15:20 So hidden messages, you can all register to our voice marketplace and maybe earn some money too.
0:15:23 So that’s, I think, the second important thing.
0:15:30 It’s like figuring out how we can be part, how we can bring the industry together to disrupt together rather than just to disrupt.
0:15:34 And with labels, I think I’m still learning how to interact.
0:15:43 So we worked with labels, the Merlin and Cobalt, so fourth majors, to bring their music into the music model so we can do it in a licensed way.
0:15:47 So you can generate that and give commercial rights so you’re fully protected.
0:15:50 That was a hard process.
0:15:53 It took us 18 months to figure out the agreement that works.
0:16:05 And in the end, I think the main thing was adding a set of forcing functions or forcing timings to find effectively a trigger of like,
0:16:06 okay, this is when we do it.
0:16:09 And we either do it together or we do it separately.
0:16:13 And those forcing functions really help add urgency.
0:16:20 Then we needed to move that forcing function a few times, but it still worked to a large extent to go after that.
0:16:26 And then two is, of course, finding the compromise wasn’t easy.
0:16:35 But then in our case, working with the labels there was kind of protecting what they are caring about.
0:16:42 And they, of course, also care about how they continue doing well by their members, by their artists that they work with.
0:16:50 So we would spend a lot of time working with their members, speaking about how we think about technology, what’s going to happen in the next couple of years.
0:16:51 And that really helped.
0:17:01 So just speaking through exactly the technology, showing the examples, and kind of avoiding this initial knee-jerk reaction that AI is bad has been tremendous.
0:17:14 And maybe tying back to the earlier question, as you are navigating this landscape, how do you think about bringing the right talent that can lead some of these functions?
0:17:22 And these are mostly unknown territories of how to navigate it, like where have you been seeing success in bringing the right people?
0:17:38 So here, for the spaces that are kind of completely new to us, so this and like legalists and another example, we would always kind of bring at least one or two people that were in that space that kind of have interacted with the same parties full-time in the past.
0:17:47 But then we would actually adjust that with a lot of consulting people that would help us in a specific conversation.
0:17:54 So in this case, in music, we had music lawyers that worked very closely with us that consult across a few of them.
0:18:03 And the good thing is that they know all the players and they effectively were this bridging gap between both of us so we could speak the same language.
0:18:06 And then that was really helpful.
0:18:27 Yeah, and you have had a very specific taste for people that are risk tolerant enough and also understand the commercial business opportunities to, you know, help guide the right chain of actions in each of those domains.
0:18:28 I found that very fascinating.
0:18:29 100%.
0:18:35 I mean, legal, I don’t know how many of you are trying to find a first legal counsel or have a number of those.
0:18:45 For us, this was, I think, one of the trickiest roles to hire for because you are hiring into the space you don’t know, you know very little about.
0:18:54 And then we had the first couple of legal people that were clearly not fit, so we separated paths.
0:19:03 Then we hired a third person and that person came from like a number of Fortune 500 companies and they never worked in startup space, never worked in venture.
0:19:08 And what resulted is like everything, every conversation was pointing out the risks that we see.
0:19:13 Oh, like anything we wanted to do was like the number of risks that this could carry.
0:19:21 And it was really tricky to work because we, it’s like you kind of get risks, but you’ve gotten the risk advice of like, okay, and this is where we should draw the line.
0:19:23 But everything was back to the decision.
0:19:29 And now we hired a person working previously in a number of companies as a council and don’t poach them.
0:19:45 And they understand the risk equation a lot better where they are not only like a counterpart to figuring out what the risks are, but also like, okay, this is what other companies do.
0:19:50 This is what we should potentially do and then they are like a true fault partner and a tremendous change.
0:19:51 For sure.
0:20:02 Eleven Labs started as more of a creator brand, everywhere from the individual creators to the creators that are building businesses.
0:20:12 But now you have been having a lot of success moving into enterprise, not just started from the AI agent platform, but, you know, even with the tech to speech, speech to text models.
0:20:15 How have you been navigating that transition?
0:20:26 Because that’s one of the very common place where, you know, a lot of really great consumer creator brands fell down, but you have had so far a pretty smooth transition.
0:20:34 So when we, when we launched, we had a lot of early inbound when, when, when we started the kind of the classic PLG, a lot of inbound from enterprise.
0:20:42 And I remember speaking with A60Z team when, when, when they joined us, where our initial take was, of course, we want to be engineering company.
0:20:43 We don’t want salespeople.
0:20:47 We would like to reinvent that and have like engineers do the sales.
0:20:54 We, we, we, we did hire one traditional salesperson and one non-traditional salesperson, like an engineer.
0:20:55 And we told them like, do sales now.
0:20:59 And that really, as you can imagine, didn’t work out in this specific case.
0:21:04 But we learned our lesson and we, we now do invest in, in a combination of that.
0:21:06 It’s 80% sales, 20% engineering.
0:21:08 So still a little bit of that.
0:21:17 But it was like super important level of understanding who are the customers, what they care about and working deeply with them to, to bring it back.
0:21:23 And then that kind of working with them was kind of opening of what we need to actually do on the product and research side.
0:21:27 Munjal from Hippocratic is, is, is here.
0:21:40 He was one of the, the earliest incredible use cases in the healthcare space where they would create effectively agents that would take inbound calls that are calling the hospitals to take and schedule appointments.
0:21:49 And beyond that, they would do all the other parts of outbounding to the patients to remind them about taking medicine or reminding them about the appointment that’s happening.
0:22:01 And, and to be able to do that, that’s suddenly shifts from using a one foundational model into combining the speech to text, the LM, the text to speech to orchestrate them together.
0:22:04 Then the integrations you need to build, then you actually need to deploy.
0:22:14 And they were one of the areas that was 2023, but then we’ve seen this repeated pattern across, across number of other customers and customer experience space and, and many others.
0:22:19 And, and, and we decided to invest more into helping with the entire orchestration.
0:22:28 So instead of just doing text to speech, we can help combining our research to make this whole, whole combination of that’s more fluid.
0:22:34 But then if you are thinking about enterprise, you do need to build the combination of knowledge base inside a system.
0:22:41 You need to help deploy that with telephony providers, whether it’s Twilio, whether it’s SIP tracking, like how do you do that in a templatized and easier, easier way.
0:22:48 And then of course the, the biggest gap that’s the most common, it’s easy to do a demo, but how do you actually build it to production?
0:22:53 How do you test, how your version control, how you evaluate, monitor over time, fine tune over time based on the results.
0:22:59 And, and all of that is, has been a big, big part and underlying all of that.
0:23:11 And we spoke a little bit with, with Matt before coming here, the, the foundation needs to be there, which is the security, the compliance, serving, serving the, the customers across that will rely on that infrastructure.
0:23:17 That’s something that we want to shine through at C11 Labs, where if you are using the software, it’s going to always be reliable.
0:23:23 And always the four nines or five nines, hopefully one day we’ll be, we’ll be there, which is tricky in AI space.
0:23:26 But the, the, that’s the, that’s the, that’s the goal.
0:23:36 Of course the, the difference between the, the, one obvious difference between PLG and sales is the, the cycle to work through and identify the right customers is much longer.
0:23:48 And, and, and I think that’s where eagerness from our internal team was, was, was, was, was interesting to observe where you had a lot of people that didn’t work in an enterprise setting.
0:23:57 And then you had other side of the company that did, and the side that didn’t was very skeptic about going enterprise and like kind of waiting the six months or 12 months to results.
0:24:04 And in the early days, we needed to shield them from that information and like, trust us, we’ll do this and it will work.
0:24:05 But they were very skeptic.
0:24:08 And then of course, after 12 months, it, it worked out.
0:24:16 But that was probably the hardest culturally of how you kind of still keep everyone jumping on the same, on the same train.
0:24:18 That’s exactly right.
0:24:36 A lot of companies actually, at least I observed, sort of slowed down after start adopting more of the enterprise sort of product launching and like building for the customers request that started to, thank you so much, to delay sort of the, the product launches.
0:24:45 Is that something you’re seeing or is there still like a good balance of like, we still want to be able to put out demos and POCs and early teasers quickly.
0:24:50 But at the same time, we’ll get to, you know, deliver a very robust and reliable product.
0:24:52 So there are two parts.
0:24:55 The first part is, so we have like a difference on the team structure.
0:24:58 And then we have a difference on the kind of external product structure.
0:25:03 On the external product structure, we want to ship very quickly.
0:25:08 But of course, if you’re shipping to enterprise, you want to make sure that it’s stable and reliable.
0:25:11 So we delineate very clearly what’s alpha, what’s not alpha.
0:25:15 And then we go for that transition for that period.
0:25:21 And then as we work with the customers, they can, and then our partners, they can decide whether, whether they want the access to alpha in the first place.
0:25:25 And when they do, they, that’s clearly shown that this is an alpha product.
0:25:26 It might not be as stable.
0:25:27 And so they get a choice.
0:25:30 And I think that choice has been the most important lever.
0:25:33 Like, do you, do you, do you want it or not?
0:25:41 And some are, are, are, are incredible on doing that, that innovation and, and, and, and showing some of the work or experimenting with that work.
0:25:48 Like, uh, Deutsche Telekom with John here is, is, is, is, is, is creating some of the incredible new podcast experiences.
0:26:04 And that came from like testing early models of turning a, a text, a text into like a more notebook LM style of a podcast with incredible voices that you can select for German speaking voices, English speaking voices that, um, that, that sound good.
0:26:07 Um, and then there’s a second, which is team structure piece.
0:26:18 And that’s something that we didn’t do until, until later when we had, uh, more than a hundred of us is that we delineate inside a company products that are pre-product market fit and post-product market fit.
0:26:23 Um, on the post-product market fit, you are working for the long term.
0:26:29 You test and evaluate a lot before you, um, you only deploy when, when that’s, that’s, that’s, that’s truly ready.
0:26:35 Um, the pre-product market fit, your mission is, is to ship until you think we’ve hit the product market fit.
0:26:38 And usually we give the six months period of like proving it out.
0:26:42 If not, we kill the product and we’ve killed product in the past, uh, this way.
0:26:49 But that’s like the, the main important piece of like, okay, until we know there is a big potential user base, we, we, we, we, we will continue iterating.
0:26:58 I have, uh, being able to observe some of those, uh, I guess hard decisions in the moment, but it’s the right decision later on to, to let go of some of the products.
0:27:01 Um, this is one of my favorite questions.
0:27:05 Um, my partner, Martin Casato, always say companies go through three phases.
0:27:08 There is the product phase, there’s sales phase, and there’s a scaling phase.
0:27:14 And given you have been through some of those phases, what has been the hardest transition for you as a CEO?
0:27:18 There is a lot of, a lot of many ones.
0:27:23 Of course, I have my, my co-founder next to me across each of those, which is the, I know him for 15 years.
0:27:25 He’s my best friend since high school.
0:27:33 So I have like the, the most luck, uh, to, to, to, to have, uh, that combination, of course, uh, you, you, Jennifer and all the, all the partners to help us through those transitions, which is, which has been incredible.
0:27:42 Um, but I think the, the, the, the recent, or like a recent realization was when we, we are now 350 people company.
0:27:49 And, uh, and of course that means our go-to-market team and the incentive structure around that has evolved pretty, pretty strongly.
0:27:58 And what wasn’t clear to me, and now in hindsight is obvious, is that the, um, in early days, everybody would just operate on a passion basis.
0:28:00 They would just operate what they think is best for the company.
0:28:08 As our go-to-market team enlarged, we realized that the incentive structure really matters if you are building that machine.
0:28:18 And, um, and that transition where you shift from, from, from, from a lot of, a lot of the people that are helping create that machine are part of that machine.
0:28:26 Those incentive structures will eventually drive the behaviors, which might be slightly different to what you had in mind if you don’t make it extremely clear.
0:28:34 And in some ways, the, the quota, the commissions are a, effectively a lagging indicator of strategy.
0:28:41 And then, um, and then strategy, um, uh, is kind of leading of what will happen in the future.
0:28:49 So you need to find a way to resolve those two together, where you want to make sure the quota and commissions and the strategy that you want to drive are closer together.
0:28:52 And they, and the, the kind of the disparity as close as possible.
0:29:01 And, uh, so, so here, the, for me, the biggest realization was that we are becoming a bigger company because there are clear behaviors that happen based on the commissions.
0:29:14 And then two, to actually resolve those, we need to be very upfront in terms of, um, of, uh, of making it explicit that sometimes, even if commissions are just this, and you think it’s the wrong thing, come back to us.
0:29:16 Let’s speak about it and let just score.
0:29:27 So now we are explicit of all our sales teams that if they are seeing a deal that let’s say might be competitive in nature, and our pricing table would suggest that they can go very low and earn higher commission, but they think it’s wrong.
0:29:28 It’s better to come to us.
0:29:35 We are happy to still grant commission, but kill the deal and, um, and, and go out for us.
0:29:43 We had this case recently where one of our foundational level competitor came to us wanting to license our models for demos.
0:29:49 And, um, and of course the incentive would suggest that you should sell to them, but luckily, luckily we didn’t.
0:29:49 Yeah.
0:29:51 You granted commission though.
0:29:54 In the early days, you can definitely allow it.
0:29:56 And I trusted that now it’s in the policy.
0:29:59 I cannot sell to the foundational model companies.
0:30:04 So it’s clear, clear to, to all the, um, that was incredible, Maddie.
0:30:07 Thank you so much for, for sharing all the lessons and learnings with us.
0:30:09 Let’s give a round of applause to, to Maddie.
0:30:10 Thank you.
0:30:28 As a reminder, the content here is for informational purposes only should not be taken as legal business tax or investment advice, or be used to evaluate any investment or security and is not directed at any investors or potential investors in any a 16 Z fund.
0:30:33 Please note that a 16 Z and its affiliates may also maintain investments in the companies discussed in this podcast.
0:30:40 For more details, including a link to our investments, please see a 16 Z dot com forward slash disclosures.

ElevenLabs CEO and co‑founder Mati Staniszewski joins Jennifer Li to explain how the team ships research‑grade AI at lightning speed—from text‑to‑speech and fully licensed AI music to real‑time voice agents—and why voice is the next interface for human‑computer interaction. He shares the small, autonomous team model, global hiring approach, and how the Voice Marketplace has paid creators over $10M while evolving into an enterprise platform.

 

Resources:

Follow Mati on X: https://x.com/matistanis

Follow Jennifer on X: https://x.com/JenniferHli

 

Stay Updated: 

If you enjoyed this episode, be sure to like, subscribe, and share with your friends!

Find a16z on X: https://x.com/a16z

Find a16z on LinkedIn: https://www.linkedin.com/company/a16z

Listen to the a16z Podcast on Spotify: https://open.spotify.com/show/5bC65RDvs3oxnLyqqvkUYX

Listen to the a16z Podcast on Apple Podcasts: https://podcasts.apple.com/us/podcast/a16z-podcast/id842818711

Follow our host: https://x.com/eriktorenberg

Please note that the content here is for informational purposes only; should NOT be taken as legal, business, tax, or investment advice or be used to evaluate any investment or security; and is not directed at any investors or potential investors in any a16z fund. a16z and its affiliates may maintain investments in the companies discussed. For more details please see a16z.com/disclosures.

 

Stay Updated:

Find a16z on X

Find a16z on LinkedIn

Listen to the a16z Podcast on Spotify

Listen to the a16z Podcast on Apple Podcasts

Follow our host: https://twitter.com/eriktorenberg

 

Please note that the content here is for informational purposes only; should NOT be taken as legal, business, tax, or investment advice or be used to evaluate any investment or security; and is not directed at any investors or potential investors in any a16z fund. a16z and its affiliates may maintain investments in the companies discussed. For more details please see a16z.com/disclosures.

Hosted by Simplecast, an AdsWizz company. See pcm.adswizz.com for information about our collection and use of personal data for advertising.

a16z Podcasta16z Podcast
0
Let's Evolve Together
Logo