a16z Podcast
Inferact is a new AI infrastructure company founded by the creators and core maintainers of vLLM. Its mission is to build a universal, open-source inference layer that makes large AI models faster, cheaper, and more reliable to run across any hardware, model architecture, or deployment environment. Together, they broke down how modern AI models are actually run in production, why “inference” has quietly become one of the hardest problems in AI infrastructure, and how the open-source project vLLM emerged to solve it. The conversation also looked at why the vLLM team started Inferact and their vision for a universal inference layer that can run any model, on any chip, efficiently.
Follow Matt Bornstein on X: https://twitter.com/BornsteinMatt
Follow Simon Mo on X: https://twitter.com/simon_mo_
Follow Woosuk Kwon on X: https://twitter.com/woosuk_k
Follow vLLM on X: https://twitter.com/vllm_project
Stay Updated:
Find a16z on X
Find a16z on LinkedIn
Listen to the a16z Show on Spotify
Listen to the a16z Show on Apple Podcasts
Follow our host: https://twitter.com/eriktorenberg
Please note that the content here is for informational purposes only; should NOT be taken as legal, business, tax, or investment advice or be used to evaluate any investment or security; and is not directed at any investors or potential investors in any a16z fund. a16z and its affiliates may maintain investments in the companies discussed. For more details please see a16z.com/disclosures.
Hosted by Simplecast, an AdsWizz company. See pcm.adswizz.com for information about our collection and use of personal data for advertising.

-
Why Speed, Not Size, Will Define the Next War
As global tensions rise, AI and autonomy are transforming how nations prepare for conflict. In this episode, Horacio Rozanski, CEO of Booz Allen Hamilton and Gary Shield, CEO of Shield AI join Erik Torenberg to…
-
Beyond Chatbots: Marc Andreessen and Ben Horowitz on AI’s Future
In this closing keynote from a16z’s Runtime conference, General Partner Erik Torenberg speaks with our firm’s cofounders, Marc Andreessen and Ben Horowitz on highlights from throughout the conference, the current state of LLM capabilities, and…
-
“Is there an AI bubble?” Gavin Baker and David George
In this conversation from a16z’s Runtime conference, Gavin Baker, Managing Partner and CIO of Atreides Management, joins David George, General Partner at a16z, to unpack the macro view of AI: the trillion-dollar data center buildout,…
-
Building the Real-World Infrastructure for AI, with Google, Cisco & a16z
AI isn’t just changing software, it’s causing the biggest buildout of physical infrastructure in modern history. In this episode, Raghu Raghuram (a16z) speaks with Amin Vahdat, VP and GM of AI and Infrastructure at Google,…
-
Google DeepMind Developers: How Nano Banana Was Made
Google DeepMind’s new image model Nano Banana took the internet by storm. In this episode, we sit down with Principal Scientist Oliver Wang and Group Product Manager Nicole Brichtova to discuss how Nano Banana was…
-
Raghu Raghuram: AI, Robotics, and the Rebirth of Infrastructure
From Netscape to VMware, Raghu Raghuram has been at the center of nearly every major inflection point in enterprise technology. In this episode, Raghu joins Ben Horowitz, Martin Casado and David George to reflect on…
-
Marc Andreessen: How Movies Explain America
In this episode of Monitoring the Situation, Marc Andreessen, Katherine Boyle, and Erik Torenberg dive into the movies that best explain America, from Once Upon a Time in Hollywood to Tropic Thunder to Fight Club.…
-
Marc Andreessen and Amjad Masad: English As the New Programming Language
Amjad Masad, founder and CEO of Replit, joins a16z’s Marc Andreessen and Erik Torenberg to discuss the new world of AI agents, the future of programming, and how software itself is beginning to build software.…
-
Why Creativity Will Matter More Than Code
In this episode, a16z’s Anish Acharya joins Kevin Rose for an in-depth, fast-paced conversation on the rebirth of consumer technology, and how AI is reshaping what it means to build, invest, and create. They talk…
-
How Kong Was Born: APIs, Hustle, and the Future of AI Infrastructure
Augusto Marietti, CEO and cofounder of Kong, has one of the most remarkable founder stories in Silicon Valley history. In this conversation with Martin Casado, Aghi shares how he went from a garage in Milan…
