The Godmother of AI on jobs, robots & why world models are next | Dr. Fei-Fei Li

Lenny's Podcast

Dr. Fei-Fei Li is known as the “godmother of AI.” She’s been at the center of AI’s biggest breakthroughs for over two decades. She spearheaded ImageNet, the dataset that sparked the deep-learning revolution we’re living right now, served as Google Cloud’s Chief AI Scientist, directed Stanford’s Artificial Intelligence Lab, and co-founded Stanford’s Institute for Human-Centered AI. In this conversation, Fei-Fei shares the rarely told history of how we got here—including the wild fact that just nine years ago, calling yourself an AI company was basically a death sentence. *We discuss:* 1. How ImageNet helped spark the AI explosion we’re living through 2. Why world models and spatial intelligence represent the next frontier in AI, beyond large language models 3. Why Fei-Fei believes AI won’t replace humans but will require us to take responsibility for ourselves 4. The surprising applications of Marble, from movie production to psychological research 5. Why robotics faces unique challenges compared with language models and what’s needed to overcome them 6. How to participate in AI regardless of your role *Brought to you by:* Figma Make—A prompt-to-code tool for making ideas real: https://www.figma.com/lenny/ Justworks—The all-in-one HR solution for managing your small business with confidence: https://www.justworks.com/ Sinch—Build messaging, email, and calling into your product: https://sinch.com/lenny *Transcript:* https://www.lennysnewsletter.com/p/the-godmother-of-ai *My biggest takeaways (for paid newsletter subscribers):* https://www.lennysnewsletter.com/i/178223233/my-biggest-takeaways-from-this-conversation *Where to find Dr. Fei-Fei Li:* • X: https://x.com/drfeifei • LinkedIn: https://www.linkedin.com/in/fei-fei-li-4541247 • World Labs: https://www.worldlabs.ai *Where to find Lenny:* • Newsletter: https://www.lennysnewsletter.com • X: https://twitter.com/lennysan • LinkedIn: https://www.linkedin.com/in/lennyrachitsky/ *In this episode, we cover:* (00:00) Introduction to Dr. Fei-Fei Li (05:31) The evolution of AI (09:37) The birth of ImageNet (17:25) The rise of deep learning (23:53) The future of AI and AGI (29:51) Introduction to world models (40:45) The bitter lesson in AI and robotics (48:02) Introducing Marble, a revolutionary product (51:00) Applications and use cases of Marble (01:01:01) The founder’s journey and insights (01:10:05) Human-centered AI at Stanford (01:14:24) The role of AI in various professions (01:18:16) Conclusion and final thoughts *Referenced:* • From Words to Worlds: Spatial Intelligence Is AI’s Next Frontier: https://drfeifei.substack.com/p/from-words-to-worlds-spatial-intelligence • World Lab’s Marble GA blog post: https://www.worldlabs.ai/blog/marble-world-model • Fei-Fei’s quote about AI on X: https://x.com/drfeifei/status/963564896225918976 • ImageNet: https://www.image-net.org • Alan Turing: https://en.wikipedia.org/wiki/Alan_Turing • Dartmouth workshop: https://en.wikipedia.org/wiki/Dartmouth_workshop • John McCarthy: https://en.wikipedia.org/wiki/John_McCarthy_(computer_scientist) • WordNet: https://wordnet.princeton.edu • Game-Changer: How the World’s First GPU Leveled Up Gaming and Ignited the AI Era: https://blogs.nvidia.com/blog/first-gpu-gaming-ai • Geoffrey Hinton on X: https://x.com/geoffreyhinton • Amazon Mechanical Turk: https://www.mturk.com • Why experts writing AI evals is creating the fastest-growing companies in history | Brendan Foody (CEO of Mercor): https://www.lennysnewsletter.com/p/experts-writing-ai-evals-brendan-foody • Surge AI: https://surgehq.ai • First interview with Scale AI’s CEO: $14B Meta deal, what’s working in enterprise AI, and what frontier labs are building next | Jason Droege: https://www.lennysnewsletter.com/p/first-interview-with-scale-ais-ceo-jason-droege • Alexandr Wang on LinkedIn: https://www.linkedin.com/in/alexandrwang • Even the ‘godmother of AI’ has no idea what AGI is: https://techcrunch.com/2024/10/03/even-the-godmother-of-ai-has-no-idea-what-agi-is • AlexNet: https://en.wikipedia.org/wiki/AlexNet • Demis Hassabis interview: https://deepmind.google/discover/the-podcast/demis-hassabis-the-interview • Elon Musk on X: https://x.com/elonmusk • Jensen Huang on LinkedIn: https://www.linkedin.com/in/jenhsunhuang • Stanford Institute for Human-Centered AI: https://hai.stanford.edu • Percy Liang on X: https://x.com/percyliang • Christopher Manning on X: https://x.com/chrmanning • With spatial intelligence, AI will understand the real world: https://www.ted.com/talks/fei_fei_li_with_spatial_intelligence_ai_will_understand_the_real_world • Rosalind Franklin: https://en.wikipedia.org/wiki/Rosalind_Franklin ...References continued at: https://www.lennysnewsletter.com/p/the-godmother-of-ai _Production and marketing by https://penname.co/._ _For inquiries about sponsoring the podcast, email podcast@lennyrachitsky.com._ Lenny may be an investor in the companies discussed.

Hosts: Dr. Fei-Fei Li, Lenny

📺Watch on YouTube

📅November 16, 2025

⏱️01:19:34

🌐English

🤍0 likes

Disclaimer: The transcript on this page is for the YouTube video titled "The Godmother of AI on jobs, robots & why world models are next | Dr. Fei-Fei Li" from "Lenny's Podcast". All rights to the original content belong to their respective owners. This transcript is provided for educational, research, and informational purposes only. This website is not affiliated with or endorsed by the original content creators or platforms.

Watch the original video here: https://www.youtube.com/watch?v=Ctjiatnd6Xk

00:00:00Lenny

A lot of people call you the godmother of AI. The work you did actually was the spark that brought us out of AI winter.

🤍0 likes💬 0 comments

00:00:06Dr. Fei-Fei Li

In the middle of 2015, middle of 2016, some tech companies avoid using the word AI because they were not sure if AI was a dirty word. 2017-ish was the beginning of companies calling themselves AI companies.

🤍0 likes💬 0 comments

00:00:22Lenny

There's this line, I think this was when you were presenting to Congress, "There's nothing artificial about AI. It's inspired by people. It's created by people. And most importantly, it impacts people."

🤍0 likes💬 0 comments

00:00:30Dr. Fei-Fei Li

It's not like I think AI will have no impact on jobs or people. In fact, I believe that whatever AI does currently or in the future is up to us. It's up to the people. I do believe technology is a net positive for humanity, but I think every technology is a double-edged sword. If we're not doing the right thing as a society, as individuals, we can screw this up as well.

🤍0 likes💬 0 comments

00:00:55Lenny

You had this breakthrough insight of just, okay, we can train machines to think like humans, but it's just missing the data that humans have to learn as a child.

🤍0 likes💬 0 comments

00:01:03Dr. Fei-Fei Li

I chose to look at artificial intelligence through the lens of visual intelligence because humans are deeply visual animals. We need to train machines with as much information as possible on images of objects, but objects are very, very difficult to learn. A single object can have infinite possibilities that is shown on an image. In order to train computers with tens and thousands of object concepts, you really need to show it millions of examples.

🤍0 likes💬 0 comments

00:01:36Lenny

Today, my guest is Dr. Fei-Fei Li, who's known as the godmother of AI. Fei-Fei has been responsible for and at the center of many of the biggest breakthroughs that sparked the AI revolution that we are currently living through. She spearheaded the creation of ImageNet, which was basically her realizing that AI needed a ton of clean labeled data to get smarter. And that data set became the breakthrough that led to the current approach to building and scaling AI models.

🤍0 likes💬 0 comments

00:02:01Lenny

She was chief AI scientist at Google Cloud, which is where some of the biggest early technology breakthroughs emerged from. She was director at SAIL, Stanford's artificial intelligence lab, where many of the biggest AI minds came out of. She's also co-creator of Stanford's human-centered AI institute, which is playing a vital role in the direction that AI is taking. She's also been on the board of Twitter. She was named one of Time's 100 most influential people in AI. She's also on the United Nations Advisory Board. I could go on.

🤍0 likes💬 0 comments

00:02:29Lenny

In our conversation, Fei shares a brief history of how we got to today in the world of AI, including this mind-blowing reminder that 9 to 10 years ago, calling yourself an AI company was basically a death knell for your brand because no one believed that AI was actually going to work. Today, it's completely different. Every company is an AI company.

🤍0 likes💬 0 comments

00:02:48Lenny

We also chat about her take on how she sees AI impacting humanity in the future, how far current technologies will take us, why she's so passionate about building a world model, and what exactly world models are. And most exciting of all, the launch of the world's first large world model, Marble, which just came out as this podcast comes out. Anyone can go play with this at marble.worldlabs.ai. It's insane. Definitely check it out.

🤍0 likes💬 0 comments

00:03:14Lenny

Fei is incredible and way too under the radar for the impact that she's had on the world. So, I am really excited to have her on and to spread her wisdom with more people. A huge thank you to Ben Horowitz and Condoleezza Rice for suggesting topics for this conversation. If you enjoy this podcast, don't forget to subscribe and follow it in your favorite podcasting app or YouTube. With that, I bring you Dr. Fei-Fei Li after a short word from our sponsors.

🤍0 likes💬 0 comments

00:03:37Lenny

This episode is brought to you by Figma, makers of Figma Make. When I was a PM at Airbnb, I still remember when Figma came out and how much it improved how we operated as a team. Suddenly, I could involve my whole team in the design process, give feedback on design concepts really quickly, and it just made the whole product development process so much more fun.

🤍0 likes💬 0 comments

00:03:57Lenny

But Figma never felt like it was for me. It was great for giving feedback on designs, but as a builder, I wanted to make stuff. That's why Figma built Figma Make. With just a few prompts, you can make any idea or design into a fully functional prototype or app that anyone can iterate on and validate with customers. Figma make is a different kind of vibe coding tool. Because it's all in Figma, you can use your team's existing design building blocks, making it easy to create outputs that look good and feel real and are connected to how your team builds. Stop spending so much time telling people about your product vision and instead show it to them. Make code-backed prototypes and apps fast with Figma Make. Check it out at figma.com/lenny.

🤍0 likes💬 0 comments

00:04:40Lenny

Did you know that I have a whole team that helps me with my podcast and with my newsletter? I want everyone on that team to be super happy and thrive in their roles. JustWorks knows that your employees are more than just your employees. They're your people. My team is spread out across Colorado, Australia, Nepal, West Africa, and San Francisco. My life would be so incredibly complicated to hire people internationally, to pay people on time and in their local currencies, and to answer their HR questions 24/7. But with JustWorks, it's super easy. Whether you're setting up your own automated payroll, offering premium benefits, or hiring internationally, JustWorks offers simple software and 24/7 human support from small business experts for you and your people. They do your human resources right so that you can do right by your people. JustWorks for your people.

🤍0 likes💬 0 comments

00:05:31Lenny

Fei-Fei, thank you so much for being here and welcome to the podcast.

🤍0 likes💬 0 comments

00:05:34Dr. Fei-Fei Li

I'm excited to be here, Lenny.

🤍0 likes💬 0 comments

00:05:36Lenny

I'm even more excited to have you here. It is such a treat to get to chat with you. There's so much that I want to talk about. You've been at the center of this AI explosion that we're seeing right now for so long. We're going to talk about a bunch of the history that I think a lot of people don't even know about how this whole thing started.

🤍0 likes💬 0 comments

00:05:53Lenny

But let me first read a quote from Wired about you just so people get a sense, and in the intro I'll share all of the other epic things you've done, but I think this is a good way to just set context. "Fei is one of a tiny group of scientists, a group perhaps small enough to fit around a kitchen table, who are responsible for AI's recent remarkable advances."

🤍0 likes💬 0 comments

00:06:11Lenny

A lot of people call you the godmother of AI. And unlike a lot of AI leaders, you're an AI optimist. You don't think AI is going to replace us. You don't think it's going to take all our jobs. You don't think it's going to kill us. So, I thought it'd be fun to start there. Just what's your perspective on how AI is going to impact humanity over time?

🤍0 likes💬 0 comments

00:06:29Dr. Fei-Fei Li

Yeah. Okay. So, Lenny, let me be very clear. I'm not a utopian. So, it's not like I think AI will have no impact on jobs or people. In fact, I'm a humanist. I believe that whatever AI does currently or in the future is up to us. It's up to the people. So I do believe technology is a net positive for humanity. If you look at the long course of civilization, I think we are fundamentally, we're an innovative species. That, you know, if you look at from, you know, written record thousands of years ago to now, humans just kept innovating ourselves and innovating our tools. And with that, we make lives better, we make work better, we build civilization, and I do believe AI is part of that. So, that's where the optimism comes from. But I think every technology is a double-edged sword. And if we're not doing the right thing as a species, as a society, as communities, as individuals, we can screw this up as well.

🤍0 likes💬 0 comments

00:07:47Lenny

There's this line, I think this was when you were presenting to Congress. "There's nothing artificial about AI. It's inspired by people. It's created by people, and most importantly, it impacts people." I don't have a question there, but what a what a great line.

🤍0 likes💬 0 comments

00:07:59Dr. Fei-Fei Li

Yeah, I feel pretty deeply. You know, I started working in AI two and a half decades ago, and I've been having students for the past two decades. And almost every student who graduates, I remind them, you know, when they graduate from my lab, that your field is called artificial intelligence, but there's nothing artificial about it.

🤍0 likes💬 0 comments

00:08:23Lenny

Coming back to the point you just made about how it's kind of up to us about where this all goes. What is it you think we need to get right? How do we set things on a path? I know this is a very difficult question to answer, but just what should... what's your advice?

🤍0 likes💬 0 comments

00:08:36Dr. Fei-Fei Li

Yeah. How many hours do we have?

🤍0 likes💬 0 comments

00:08:38Lenny

How do we align AI? There we go. Let's solve it.

🤍0 likes💬 0 comments

00:08:41Dr. Fei-Fei Li

Also, I think people should be responsible individuals, no matter what we do. This is what we teach our children, and this is what we need to do as grown-ups as well. No matter which part of the AI development or AI deployment or AI application you are participating in, and most likely many of us, especially as technologists, we're in multiple points, we should act like responsible individuals and care about this, actually care a lot about this. I think everybody today should care about AI because it is going to impact your individual life. It is going to impact your community. It's going to impact the society and the future generation. And caring about it as a responsible person is the first, but also the most important step.

🤍0 likes💬 0 comments

00:09:37Lenny

Okay. So, let me let me actually take a step back and kind of go to the beginning of AI. Most people started hearing and caring about AI as what it's called today, just like, I don't know, a few years ago when ChatGPT came out. Maybe it was like three years ago.

🤍0 likes💬 0 comments

00:09:51Dr. Fei-Fei Li

Three years ago. Almost one more month, three years ago.

🤍0 likes💬 0 comments

00:09:54Lenny

Wow. Okay. That was ChatGPT coming out. Is that the milestone that you have in mind? Okay. Cool. That's exactly how I saw it. But very few people know there was a long, long history of people working on, it was called machine learning back then and there's other terms and now it's just everything's AI. And there was kind of like a long period of just a lot of people working on it, and then there's this what people refer to as the AI winter, where people just gave up, almost. People did and just okay, this this idea isn't going anywhere. And then the work you did actually was essentially the spark that brought us out of AI winter and is directly responsible for the world we're in now of just AI is all we talk about. As you just said, it's going to impact everything we do. So, I thought it'd be really interesting to hear from you just kind of like the brief history of what the world was like before ImageNet, then just the work you did to create ImageNet, why that was so important, and then just what happened after.

🤍0 likes💬 0 comments

00:10:44Dr. Fei-Fei Li

It is for me hard to keep in mind that AI is so new for everybody. When I lived my entire professional life in AI, it's there's a part of me that is just, it's so satisfying to see a personal curiosity that I started barely out of teenagehood and now has become a transformative force of our civilization. It generally is a civilizational level technology. So, so that journey is about about 30 years or 20 something, 20 plus years, and it's it's just very satisfying. So, where did it all start?

🤍0 likes💬 0 comments

00:11:28Dr. Fei-Fei Li

Well, I'm not even the first generation AI researcher. The first generation really date back to the 50s and 60s. And, you know, Alan Turing was ahead of his time by in the 40s by asking, daring humanity with the question, "Can we... is there thinking machines?" Right? And of course, he has a specific way of testing this concept of thinking machine, which is a conversational chatbot, which to his standard, we now have a thinking machine, but that was just a more anecdotal inspiration.

🤍0 likes💬 0 comments

00:12:09Dr. Fei-Fei Li

The field really began in the 50s when computer scientists came together and look at how we can use computer programs and algorithms to build these programs that can do things that have been only capable by human cognition. And that was the beginning. And the founding fathers, the Dartmouth workshop in the 1956, we have Professor John McCarthy who later came to Stanford who coined the term artificial intelligence.

🤍0 likes💬 0 comments

00:12:45Dr. Fei-Fei Li

And between the 50s, 60s, 70s, and 80s, it was the early days of AI exploration, and we had logic systems, we had expert systems. We also had early exploration of neural network, and then it came to around the late 80s, the 90s, and the very beginning of the 21st century. That stretch, about 20 years, is actually the beginning of machine learning. It's the marriage between computer programming and statistical learning.

🤍0 likes💬 0 comments

00:13:24Dr. Fei-Fei Li

And that marriage brought a very, very critical concept into AI, which is that purely rule-based program is not going to account for the vast amount of cognitive capabilities that we imagine computers can do. So we have to use machines to learn the patterns. Once the machines can learn the patterns, it has a hope to do more things. For example, if you give it three cats, the hope is not just for the machines to recognize these three cats. The hope is the machines can recognize the fourth cat, the fifth cat, the sixth cat, and all the other cats. And that's a learning ability that is fundamental to humans and many animals, and we as a field realized we need machine learning.

🤍0 likes💬 0 comments

00:14:22Dr. Fei-Fei Li

So that was up till the beginning of the 21st century. I entered the field of AI literally in the year of 2000. That's when my PhD began at Caltech. And so I was one of the first generation machine learning researchers, and we were already studying this concept of machine learning, especially neural network. I remember that was one of my first courses in at Caltech is called neural network, but it was very painful. It was still smack in the middle of the so-called AI winter, meaning the public didn't look at this too much. There wasn't that much funding, but there was also a lot of ideas flowing around.

🤍0 likes💬 0 comments

00:15:01Dr. Fei-Fei Li

And I think two things happened to myself that brought my own career so close to the birth of modern AI is that I chose to look at artificial intelligence through the lens of visual intelligence because humans are deeply visual animals. We can talk a little more later, but so much of our intelligence is built upon visual, perceptual, spatial understanding, not just language per se. I think they're complementary. So I chose to look at visual intelligence, and my PhD and my early professor years, my students and I are very committed to a northstar problem, which is solving the problem of object recognition because it's a building block for the perceptual world. Right? We go around the world interpreting, reasoning, and interacting with it more or less at the object level. We don't interact with the world at the molecular level. We don't interact with the world as... we sometimes do, but we rarely, for example, if you want to lift a teapot, you don't say, okay, the teapot is made of 100 pieces of porcelain, and let me work on this 100 pieces. You look at this as one object and interact with it. So object is really important.

🤍1 like💬 0 comments

00:16:29Dr. Fei-Fei Li

So I was among the first researchers to identify this as a northstar problem. But I think what happened is that as a student of AI and then a researcher of AI, I was working on all kinds of mathematical models, including neural network, including Bayesian network, including many, many models. And there was one singular pain point is that these models don't have data to be trained on. And as a field, we were so focusing on these models, but it dawned on me that human learning as well as evolution is actually a big data learning process. Humans learn with so much experience, you know, constantly. And evolution, if you look at time, animals evolve with just experiencing the world.

🤍0 likes💬 0 comments

00:17:27Dr. Fei-Fei Li

So I think my students and I conjectured that a very critically overlooked ingredient of bringing AI to life is big data. And then we began this ImageNet project in 2006, 2007. We were very ambitious. We want to get the entire internet's image data on objects. Now, granted, internet was a lot smaller than today. So I felt like that ambition was at least not too crazy. Now it's totally delusional to think a couple of graduate student and a professor can do this. But and that's what we did. We curated very carefully 15 million images on the internet, created a taxonomy of 22,000 concepts, borrowing other researchers' work like a linguist work on WordNet, and it's a particular way of dictionarying words. And we combine that into ImageNet, and we open source that to the research community. We held an annual ImageNet challenge to encourage everybody to participate in this.

🤍0 likes💬 0 comments

00:18:42Dr. Fei-Fei Li

We continue to do our own research, but 2012 was the moment that many people think was the beginning of the deep learning or birth of modern AI because a group of Toronto researchers led by Professor Jeff Hinton participated in ImageNet challenge, used the ImageNet big data and two GPUs from NVIDIA, and created successfully the first neural network algorithm that can, it didn't, fundamentally, it didn't totally solved, but made a huge progress towards solving the problem of object recognition.

🤍0 likes💬 0 comments

00:19:22Dr. Fei-Fei Li

And that combination of the trio technology, big data, neural network, and GPU, was kind of the golden recipe for modern AI. And then fast forward, the public moment of AI, which is the ChatGPT moment, if you look at the ingredients of what brought ChatGPT to the world, technically still use these three ingredients. Now it's internet-scale data, mostly texts. It's a much more complex neural network architecture than 2012, but it's still neural network and a lot more GPUs, but it's still GPUs. So these three ingredients are still at the core of modern AI.

🤍0 likes💬 0 comments

00:20:16Lenny

Incredible. I have never heard that full story before. I love that it was two GPUs was the first... I love, and now it's, I don't know, hundreds of thousands, right? That are orders of magnitudes more powerful. And those two GPUs were, they just bought, they were like gaming GPUs? They just went to like the game store, right? That people use for playing games?

🤍0 likes💬 0 comments

00:20:37Lenny

As you said, this continues to be in a large way the way models get smarter. Some of the fastest growing companies in the world right now, I've had them all mostly on the podcast, Remitly, Surge, and Scale, like, they do this. They continue to do this for labs, just give them more and more labeled data of the things they're most excited about.

🤍0 likes💬 0 comments

00:20:52Dr. Fei-Fei Li

Yeah, I remember Alex Wang from Scale very early days. I probably still has his emails when he was starting Scale. He was very kind. He keeps sending me emails about how ImageNet inspired Scale. I was very pleased to see that.

🤍0 likes💬 0 comments

00:21:09Lenny

One of my other favorite takeaways from what you just shared is just such an example of high agency and just doing things. That's kind of a meme on Twitter. Just you can just do things. You're just like, okay, this is probably necessary to move AI... And it was called machine learning back then, right? Was that the term most people used?

🤍0 likes💬 0 comments

00:21:25Dr. Fei-Fei Li

I think it was interchangeably. It's true. Like I do remember the companies, the tech companies, I'm not going to name names, but I was in a conversation in one of the early days, I think is in the middle of 2015, middle of 2016, some tech companies avoids using the word AI because they were not sure if AI was a dirty word. And I remember I was actually encouraging everybody to use the word AI because to me that is one of the most audacious question humanity has ever asked in our quest for science and technology, and I feel very proud of this term. But yes, at the beginning, some people were not sure.

🤍0 likes💬 0 comments

00:22:12Lenny

What year was that roughly when AI was developed? 2016?

🤍0 likes💬 0 comments

00:22:17Dr. Fei-Fei Li

Less than 10 years ago. That was the changing, like some people start calling it AI, but I think if you look at the Silicon Valley tech companies, if you trace their marketing term, I think 2017-ish was the beginning of companies calling themselves AI companies.

🤍0 likes💬 0 comments

00:22:40Lenny

That's incredible, just how the world has changed. Now you can't not call yourself an AI company.

🤍0 likes💬 0 comments

00:22:46Dr. Fei-Fei Li

I know.

🤍0 likes💬 0 comments

00:22:46Lenny

Just nine-ish years later.

🤍0 likes💬 0 comments

00:22:48Dr. Fei-Fei Li

Yeah.

🤍0 likes💬 0 comments

00:22:49Lenny

Oh man. Okay. Is there anything else around the history, that early history that you think people don't know that you think is important before we chat about where things are going in the work that you're doing?

🤍0 likes💬 0 comments

00:23:01Dr. Fei-Fei Li

I think as all histories, you know, I'm keenly aware that I am recognized for being part of the history, but there are so many heroes and so many researchers. We're talking about generations of researchers there. You know, in my own world, there are so many people who have inspired me, which I talked about in my book. But I do feel our culture, especially Silicon Valley tends to assign achievements to a single person. While I think it has value, but it's just to be remembered, AI is a field of at this point 70 years old, and we have gone through many generations. Nobody, no one could have gotten here by themselves.

🤍0 likes💬 0 comments

00:23:54Lenny

Okay. So let me ask you this question. It feels like we're always on this precipice of AGI. This kind of vague term people throw around. AGI is coming. Is it going to take over everything? What's your take on how far you think we might be from AGI? Do you think we're going to get there on the current trajectory we're on? Do you think we need more breakthroughs? Do you think the current approach will get us there?

🤍0 likes💬 0 comments

00:24:13Dr. Fei-Fei Li

Yeah, this is a very interesting term, Lenny. I don't know if anyone has ever defined AGI. You know, there are many different definitions, including, you know, some kind of superpower for machines all the way to can a machines can become economically viable agent in the society. In other words, making salaries to live. Is that the definition of AGI?

🤍1 like💬 0 comments

00:24:46Dr. Fei-Fei Li

As a scientist, I take science very seriously, and I enter the field because I was inspired by this audacious question of can machines think and do things in the way that humans can do. For me, that's always the northstar of AI. And from that point of view, I don't know what's the difference between AI and AGI. I think we've done very well in achieving parts of the goal, including conversational AI, but I don't think we have completely conquered all the goals of of AI. And I think our founding fathers, Alan Turing, I wonder if Alan Turing is around today and you ask him to contrast AI versus AGI, he might just shrug and said, "Well, I asked the same question back in 1940s." So, so I don't want to get onto a rabbit hole of defining AI versus AGI. I feel AGI is more a marketing term than a scientific term. As a scientist and technologist, AI is my northstar is my field's northstar, and I'm happy people call it whatever name they want to call it.

🤍0 likes💬 0 comments

00:26:05Lenny

So let me ask you maybe this way, like you described, there's kind of these components that from ImageNet and AlexNet kind of took us to where we're today. GPUs, essentially data, labeled data, just like the algorithm of the model. There's also just the transformer feels like an important step in that trajectory. Do you feel like those are the same components that'll get us to, I don't know, 10 times smarter model, something that's like life-changing for the entire world? Or do you think we need more breakthroughs? I know we're going to talk about world models, which I think is a component of this, but is there anything else that you think is like, oh, this will plateau, or okay, this will take us, just need more data, more compute, more GPUs.

🤍0 likes💬 0 comments

00:26:44Dr. Fei-Fei Li

Oh, no, I definitely think we need more innovations. I think scaling laws of more data, more GPUs, and bigger current model architecture is, there's still a lot to be done there, but I absolutely think we need to innovate more. There is not a single deeply scientific discipline in human history that has arrived at a place that says we're done. We're done innovating. And AI is one of the, if not the youngest discipline in human civilization in terms of science and technology. We're still scratching the surface. For example, like I said, we're going to segue into world models today.

🤍0 likes💬 0 comments

00:27:29Dr. Fei-Fei Li

You take a model and run it through a video of a couple of office rooms and ask the model to count the number of chairs. And this is something a toddler could do, or maybe, maybe a elementary school kid could do, and AI could not do that, right? So there's just so much AI today could not do, then let alone thinking about how did, you know, someone like Isaac Newton look at the movements of the celestial bodies and and derive an equation or a set of equations that governs the movement of all bodies. That level of creativity, extrapolation, abstraction, we have no way of enabling AI to do that today.

🤍0 likes💬 0 comments

00:28:24Dr. Fei-Fei Li

And then let's look at emotional intelligence. If you look at a student coming into a teacher's office and have a conversation about motivation, passion, what to learn, what's the problem that's really bothering you. That conversation, as powerful as as today's conversational bots are, you don't get that level of emotional cognitive intelligence from today's AI. So there's a lot we can do better. And I do not believe we're done innovating.

🤍0 likes💬 0 comments

00:29:00Lenny

Demis had this really interesting interview recently from DeepMind, Google, where someone asked him just like, what do you think, how far are we from AGI? What does it look like when it's through there? He had a really interesting way of approaching it is if we were to give the most cutting edge model all the information until the end of the 20th century, see if it could come up with all the breakthroughs Einstein had. And so far, we're nowhere near that, but they can...

🤍0 likes💬 0 comments

00:29:22Dr. Fei-Fei Li

No, we're not. In fact, it's even worse. Let's give AI all the data, including modern instruments' data of celestial bodies, which Newton did not have, and give it to that and just ask AI to create the 17th century set of equations on the laws of bodily movements. Today's AI cannot do that.

🤍0 likes💬 0 comments

00:29:48Lenny

All right, we're a ways away is what I'm hearing.

🤍0 likes💬 0 comments

00:29:50Dr. Fei-Fei Li

Yeah.

🤍0 likes💬 0 comments

00:29:51Lenny

Okay, so let's talk about world models. This is, to me, this is just another really amazing example of you being ahead of where people end up. So you were way ahead on, okay, we just need a lot of clean data for AI and neural networks to learn. You've been talking about this idea of world models for a long time. You started a company to build, essentially, there's language models, this is a different thing, this is a world model. We'll talk about what that is. And now, as I was preparing for this, Elon's like talking about world models. Jensen's talking about world models. I know Google's working on this stuff. You've been at this for a long time. And you're actually just launched something that's going to, we're going to talk about right before this podcast airs. Talk about what is a world model? Why is it so important?

🤍0 likes💬 0 comments

00:30:33Dr. Fei-Fei Li

I'm very excited to see that more and more people are talking about world models like Elon, like Jensen. I have been thinking about really how to push AI forward all my life, right? And the large language models that came out of the research world and then OpenAI and all this for the past few years were extremely inspiring even for a researcher like me. I remembered when GPT-2 came out, and that was in, I think, late 2020. I was co-director, I still am, but I was at that time full-time co-director of Stanford's Human-Centered AI Institute, and I remember it was, you know, the public was not aware of the power of the large language model yet, but as researchers, we were seeing it, we're seeing the future.

🤍0 likes💬 0 comments

00:31:37Dr. Fei-Fei Li

And I had pretty long conversations with my natural language processing colleagues like Percy Liang and Chris Manning, we were talking about how critical this technology is going to be. And Stanford AI institute, Human-Centered AI Institute, HAI, was the first one to establish a full research center on foundation model. We were... Percy Liang and many researchers led the first academic paper on foundation model. So, so it was just very inspiring for me.

🤍0 likes💬 0 comments

00:32:10Dr. Fei-Fei Li

So, of course, I come from the world of visual intelligence, and I was just thinking, there's so much we can push forward on beyond language because humans have used our sense of spatial intelligence and world understanding to do so many things, and they are beyond language. Think about a very chaotic first responder scene, whether it's fire or some traffic accident or some natural disaster. And it's, if you immerse yourself in those scene and think about how people organize themselves to rescue people, to stop further disasters, to put down fires, a lot of that is movements, is spontaneous understanding of objects, worlds, situational awareness. Language is part of that. But a lot of those situations, language cannot get you to put down the fire.

🤍0 likes💬 0 comments

00:33:24Dr. Fei-Fei Li

So that is what is that? I was thinking a lot, and in the meantime, I was doing a lot of robotics research, and it dawned on me that the lynchpin of connecting the additional intelligence in addition to language and connecting embodied AI, which are robotics, connecting visual intelligence is this sense of spatial intelligence about understanding the world. And that's when I think I, it was 2024, I gave a TED talk about spatial intelligence and world models, and I start formulating this idea back in 2022, based on my robotics and computer vision research. And then one thing that is really clear to me is that I really want to work with the brightest technologists and move as fast as possible to bring this technology to life. And that's when we founded this company called World Labs. And you can see the word "world" is in the title of our company because we believe so much in world modeling and spatial intelligence.

🤍0 likes💬 0 comments

00:34:41Lenny

People are so used to just chatbots, and that's a large language model. So the simple way to understand a world model is you basically describe a scene and it generates an infinitely explorable world. We'll link to the thing you launch, which we'll talk about, but just is that a simple way to understand it?

🤍0 likes💬 0 comments

00:34:56Dr. Fei-Fei Li

That's part of it, Lenny. I think a simple way to understand a world model is that this model can allow anyone to create any worlds in their mind's eye by prompting, whether it's an image or a sentence, and also be able to interact in this world, whether you're browsing and walking or picking objects up or changing things, as well as to reason within this world. For example, if the person consuming, if the agent consuming this output of the world model is a robot, it should be able to plan its path and help to, you know, tidy the kitchen, for example. So, so world model is a foundation that you can use to reason, to interact, and to create worlds.

🤍1 like💬 0 comments

00:36:00Lenny

Great. Yeah. So, robots feels like that's potentially the next big focus for AI researchers and just like the impact on the world. And what you're saying here is this is a key missing piece of making robots actually work in the real world, understanding how the world works.

🤍0 likes💬 0 comments

00:36:17Dr. Fei-Fei Li

Yeah. Well, first of all, I do think there's more than robots that's exciting. But I agree with everything you just said. I think world modeling and spatial intelligence is a key missing piece of embodied AI. I also think let's not underestimate that humans are embodied agents and humans can be augmented by AI's intelligence, just like today humans are language animals, but we're very much augmented by AI when helping us to, you know, do language tasks, including software engineering. I think that we shouldn't underestimate or maybe it's we tend not to talk about how humans as an embodied agents can actually benefit so much from world models and spatial intelligent models as well as robots can.

🤍0 likes💬 0 comments

00:37:15Lenny

So the big unlocks here, robots, which a huge deal. If this works out, imagine each of us has robots doing a bunch of stuff for us. Goes into, you know, they help us with disasters, things like that. Games obviously is a really cool example. Just like infinitely playable games that you just invent out of your head. And then creativity feels like just like being fun, having fun, being creative, thinking of wild new worlds and environments.

🤍0 likes💬 0 comments

00:37:39Dr. Fei-Fei Li

And also design. Humans design from machines to buildings to homes, and also scientific discovery. Right? There is so much... I like to use the example of the discovery of the structure of DNA. If you look at one of the most important piece in DNA's discovery history is the X-ray diffraction photo that was captured by Rosalind Franklin. And it was a flat 2D photo of a structure that looks like, it looks like a cross with diffractions. You can, you can Google those photos. But with that 2D flat photo, humans, especially two important humans, James Watson and Francis Crick, in addition to their other information, was able to reason in 3D space and deduce a highly three-dimensional double helix structure of the DNA. And that structure cannot possibly be 2D. You cannot think in 2D and deduce that structure. You have to think in 3D spatial, use the human spatial intelligence. So I think even in scientific discovery, spatial intelligence or AI-assisted spatial intelligence is critical.

🤍0 likes💬 0 comments

00:39:07Lenny

This is such an example of, I think it was Chris Dixon that had this line that "the next big thing is going to start off feeling like a toy." When ChatGPT just came out, if like, I remember Sam Altman just tweeted as like, "Here's a cool thing we're playing with. Check it out." Now it's the fastest growing product, all of history changed the world.

🤍0 likes💬 0 comments

00:39:24Dr. Fei-Fei Li

Yeah.

🤍0 likes💬 0 comments

00:39:24Lenny

And it's oftentimes the things that just look like, okay, this is cool, this is fun to play with, that end up changing the world most.

🤍0 likes💬 0 comments

00:39:32Dr. Fei-Fei Li

Yeah.

🤍0 likes💬 0 comments

00:39:32Lenny

This episode is brought to you by Cinch, the customer communications cloud. Here's the thing about digital customer communications. Whether you're sending marketing campaigns, verification codes, or account alerts, you need them to reach users reliably. That's where Cinch comes in. Over 150,000 businesses, including eight of the top 10 largest tech companies globally, use Cinch's API to build messaging, email, and calling into their products. And there's something big happening in messaging that product teams need to know about. Rich Communication Services, or RCS. Think of RCS as SMS 2.0. Instead of getting text from a random number, your users will see your verified company name and logo without needing to download anything new. It's a more secure and branded experience. Plus, you get features like interactive carousels and suggested replies. And here's why this matters. US carriers are starting to adopt RCS. Cinch is already helping major brands send RCS messages around the world, and they're helping Lenny's podcast listeners get registered first before the rush hits the US market. Learn more and get started at cinch.com/lenny. That's s-i-n-c-h.com/lenny.

🤍0 likes💬 1 comment

00:40:45Lenny

I reached out to Ben Horowitz who loves what you're doing. A big fan of yours. They're investors, I believe.

🤍0 likes💬 0 comments

00:40:51Dr. Fei-Fei Li

Yeah, we've known each other for for many years, but yes, right now they are investors of World Labs.

🤍0 likes💬 0 comments

00:40:57Lenny

Amazing. Okay. So I asked him what I should ask you about and he suggested ask you why is the bitter, why is the bitter lesson alone not likely to work for robots. So first of all just explain what the bitter lesson was in the history of AI and then just why that won't get us to where we want to be with robots.

🤍0 likes💬 0 comments

00:41:16Dr. Fei-Fei Li

So, well, first of all, there are many bitter lessons, but the bitter lessons everybody refers to is a is a paper written by Richard Sutton, who won the Turing Award recently. And he does a lot of reinforcement learning. And Richard has said, right, if you look at the history, especially the algorithmic development of AI, it turns out simpler model with a ton of data always win at the end of the day instead of the more complex model with less data. I mean, that was actually, this paper came years after ImageNet. That to me was not bitter, it was a sweet lesson. That's why I built ImageNet because I believe that big data plays that role.

🤍0 likes💬 0 comments

00:42:07Dr. Fei-Fei Li

So why can't bitter lesson work in robotics alone? Well, first of all, I think we need to give credit to where we are today. Robotics is very much in the early days of experimentation. It's not the, the research is not nearly as mature as say language models. So many people are still experimenting with different algorithms, and some of those algorithms are driven by big data. So I do think big data will continue to play a role in robotics.

🤍1 like💬 0 comments

00:42:46Dr. Fei-Fei Li

But what is hard for robotics, there are a couple of things. One is that it's harder to get data. It's a lot harder to get data. You can say, well, there is web data. This is where the latest robotics research is using web videos, and I think web videos do play a role. But if you think about what made language model work, a very... as someone who does computer vision and spatial intelligence and robotics, I'm very jealous of my colleagues in language because they had this perfect setup where their training data are in words, eventually tokens, and then they produce a model that outputs words. So you have this perfect alignment between what you hope to get, which we call objective function, and what your training data looks like.

🤍0 likes💬 0 comments

00:43:48Dr. Fei-Fei Li

But robotics is different. Even spatial intelligence is different. You hope to get actions out of robots. But your training data lacks actions in 3D worlds. And that's what robots have to do, right? Actions in 3D worlds. So, you have to find different ways to fit a, what do they call, a square in a round hole that what we have is tons of web videos. So then we have to start talking about adding, supplementing data such as teleoperation data or synthetic data so that the robots are trained with this hypothesis of bitter lesson, which is large amount of data. I think there's still hope because even what we are doing in world modeling will really unlock a lot of this information for robots. But I think we have to be careful because we're at the early days of this and bitter lesson is still to be tested because we haven't fully figured out the data.

🤍0 likes💬 0 comments

00:45:07Dr. Fei-Fei Li

For another part of the bitter lesson of robotics, I think we should be so so realistic about is again, compared to language models or even spatial models, robots are physical systems. So robots are closer to self-driving cars than a large language model. And that's very important to recognize. That means that in order for robots to work, we not only need brains, we also need the physical body, we also need application scenarios. And if you look at the history of self-driving car, my colleague Sebastian Thrun took Stanford's car to win the first DARPA challenge in 2006 or 2005. It's 20 years since that prototype of a self-driving car being able to drive 130 miles in the Nevada desert to today's Waymo on the street of San Francisco, and we're not even done yet. There's still a lot. So that's a 20-year journey.

🤍0 likes💬 0 comments

00:46:22Dr. Fei-Fei Li

And self-driving cars are much simpler robots. They're just metal boxes running on 2D surfaces. And the goal is not to touch anything. Robot is 3D things running in 3D world, and the goal is to touch things. So the journey is going to be, you know, there's many aspects, elements. And of course, one could say, well, the self-driving car early algorithm were pre-deep learning era. So deep learning is accelerating the brains, and I think that's true. That's why I'm in robotics. That's why I'm in spatial intelligence, and I'm excited by it. But in the meantime, the car industry is very mature, and productizing also involves the mature use cases, supply chains, the hardware. So I think it's a very interesting time to work in these problems. But it's true, Ben is right. We might still be subject to a number of bitter lessons doing this work.

🤍0 likes💬 0 comments

00:47:28Lenny

Do you ever just feel awe for the way the brain works and is able to do all of this for us? Just the complexity, just to get a machine to just walk around and not hit things and fall. Does it just give you more spec for what we've already got?

🤍0 likes💬 0 comments

00:47:44Dr. Fei-Fei Li

Totally. We operate on about 20 watts. That's dimmer than any light bulb in the room I'm in right now. And yet we can do so much. So I think actually the more I work in AI, the more I respect humans.

🤍1 like💬 0 comments

00:48:03Lenny

Let's talk about this product you just launched. It's called Marble. A very cute name. Talk about what this is, why this important. I've been playing with it. It's incredible. We'll link to it and for folks to check it out. What is Marble?

🤍0 likes💬 0 comments

00:48:14Dr. Fei-Fei Li

Yeah, I'm very excited. So first of all, Marble is one of the first product that World Labs has rolled out. World Labs is a foundation frontier model company. We are founded by four co-founders who have deep technical history. My co-founders Justin Johnson, Kristoff Lassner, and Ben Mildenhall. We all come from the research field of AI, computer graphics, computer vision. And we believe that spatial intelligence and world modeling is as important if not more to language models and complementary to language models.

🤍0 likes💬 0 comments

00:48:54Dr. Fei-Fei Li

So we wanted to seize this opportunity to create deep tech research lab that can connect the dots between frontier models with products. So, Marble is an app that's built upon our frontier models. We've spent a year and plus building the world's first generative model that can output genuinely 3D worlds. That's a very, very hard problem. And I, it was a very hard process. We have a team of incredible founding team of incredible technologists from, you know, incredible teams.

🤍0 likes💬 0 comments

00:49:46Dr. Fei-Fei Li

And then around just a month or two ago, we saw the first time that we can just prompt with a sentence and an image and multiple images and create worlds that we can just navigate in. If you put it on goggle, which we have an option to let you do that, you can even walk around, right? So it was, even though we've been building this for for quite a while, it was still just awe-inspiring, and we wanted to get into the hands of people who need it.

🤍0 likes💬 0 comments

00:50:21Dr. Fei-Fei Li

And then we know that so many creators, designers, people who are thinking about robotic simulation, people who are thinking about different use cases of navigable, interactable, immersive worlds, game developers will find this useful. So we developed Marble as a first step. It's, it's again still very early, but it's the world's first model doing this, and it's the world's first product that allows people to just prompt, we call it "prompt to worlds."

🤍1 like💬 0 comments

00:51:00Lenny

Well, I've been playing around with it. It is insane. Like you could just have a little Hobbit world where you just infinitely walk around Middle-earth basically and there's no, there's no one there yet, but it's insane. You just go anywhere. There's like dystopian world. I'm just looking at all these examples.

🤍0 likes💬 0 comments

00:51:14Dr. Fei-Fei Li

Yes.

🤍0 likes💬 0 comments

00:51:14Lenny

And my favorite part actually, I don't know, I don't know if this is a feature or bug, you can see like the dots of the world before it actually renders with all the textures. And I just love to like you get a glimpse into what is going on with this model basically create.

🤍0 likes💬 0 comments

00:51:27Dr. Fei-Fei Li

That's so cool to hear because this is where as a researcher I'm learning because the dots that lead you into the world was an intentional feature visualization. It is not part of the model. It's the model actually just generates the world. We were trying to find a way to guide people into the world, and a number of engineers worked on different versions, but we converged on the dot. And so many people, you're not the only one, told us how delightful that experience is. And it was really satisfying for us to hear that this intentional visualization feature that's not just the big hardcore model actually has delighted our users.

🤍0 likes💬 0 comments

00:52:19Lenny

Wow. So, you add that to make it more like to have humans understand what's going on more, get more delightful. Wow, that is hilarious. It makes me think about LMs and the way they, it's not the same thing, but they talk about what they're thinking and what they're doing.

🤍0 likes💬 0 comments

00:52:32Dr. Fei-Fei Li

Yes, it is. It is.

🤍0 likes💬 0 comments

00:52:34Lenny

It also makes me think about just the Matrix. Like, it's exactly the Matrix experience. I don't know if that was your inspiration.

🤍0 likes💬 0 comments

00:52:41Dr. Fei-Fei Li

Um, well, like I said, a number of engineers worked on that. It could be their inspiration. It's in their, it's in their, it's in their subconscious.

🤍0 likes💬 0 comments

00:52:50Lenny

Yeah. Okay. So, just for folks that may want to play around with this, maybe use it. What's like, what are some applications today that folks can start using today? What's what's your goal with this launch?

🤍0 likes💬 0 comments

00:52:59Dr. Fei-Fei Li

Yeah. So, we do believe that world modeling is very horizontal, but we're already seeing some really exciting use cases. Virtual production for movies, because what they need are 3D worlds that they can align with the camera so when the actors are acting on it, they can, you know, they can position the camera and shoot the segments really well. And we're already seeing incredible use. In fact, I don't know if you have seen our launch video showing Marble. It was produced by a virtual production company. We collaborated with Sony and they use Marble scenes to shoot those videos. So we were collaborating with those technical artists and directors and they were saying this has cut our production time by 40x.

🤍0 likes💬 0 comments

00:53:57Lenny

40x.

🤍0 likes💬 0 comments

00:53:59Dr. Fei-Fei Li

Yes. In fact, I had to because we only had one month to work on this project and there were so many things they were trying to shoot. So, so using Marble really, really significantly accelerated the production of virtual, virtual production for VFX and movies. That's one use cases.

🤍0 likes💬 1 comment

00:54:21Dr. Fei-Fei Li

We are already seeing our users putting, taking our Marble scene and taking the mesh export and putting games, you know, whether it's games on VR or games, just, just, just fun games that they have developed.

🤍0 likes💬 0 comments

00:54:36Dr. Fei-Fei Li

We have had, we were showing an example of robotic simulation because when I was, I mean, I'm still am a researcher doing robotic training, one of the biggest pain point is to create synthetic data for training robots. And these synthetic data needs to be very diverse. They need to come from different environments with different objects to manipulate. And one path to it is is to ask computers to simulate. Otherwise, humans have to, you know, build every single asset for robots. That that's just going to take a lot longer. So we already have researchers reaching out and wanting to use Marble to create those synthetic environments.

🤍0 likes💬 0 comments

00:55:26Dr. Fei-Fei Li

We also have unexpected user outreach in terms of how they want to use Marble. For example, a psychologist team called us to use Marble to do psychology research. It turned out some of the psychiatric patients they study, they need to understand how their brain respond to different immersive scenes of different features. For example, messy scenes or clean scenes or whatever you name it. And it's very hard for researchers to get their hands on these kind of immersive scenes. And it will take them too long and too much budget to create. And Marble is a really almost instantaneous way of getting so many of these experimental environments into their hands. So, we're seeing, we're seeing multiple use cases at this point, but the the VFX, the game developers, the simulation developers as well as designers are very excited.

🤍0 likes💬 0 comments

00:56:39Lenny

This is very much the way things work in AI. I've had other AI leaders on the podcast and it's always like, put things out there early as soon as you can to discover where the big use cases are. The head of ChatGPT told me how when they first put out ChatGPT, he was just scanning TikTok to see how people were using it and all the things they were talking about and that's what convinced them where to lean in and and help them see how people actually want to use it.

🤍0 likes💬 0 comments

00:57:01Lenny

I love this last use case of like for therapy. I'm just imagining like heights, people seeing dealing with heights or snakes or spiders, which...

🤍0 likes💬 0 comments

00:57:11Dr. Fei-Fei Li

It's amazing. A friend of mine last night literally called me and talked about his height scare and asked me if Marble should be used. That's amazing. You went straight there.

🤍0 likes💬 0 comments

00:57:23Lenny

That's, you know, because I'm imagining all the like the exposure therapy stuff. Like this could be so good for that. That is so cool.

🤍0 likes💬 0 comments

00:57:29Lenny

Okay, so let me, I should have asked you this before, but I think there's a, there's going to be a question of just how does this differ from things like Sora and other video generation models? It's pretty clear to me, but I think it might be helpful just to explain how this different from all the video AI tools people have seen.

🤍0 likes💬 0 comments

00:57:46Dr. Fei-Fei Li

World Lab's thesis is that spatial intelligence is fundamentally very important. And spatial intelligence is not just, it's not just about videos. In fact, the world is not passively watching videos passing by, right? I love Plato has the allegory of the cave analogy to describe vision. He said that imagine a prisoner tied on his chair, not very humane, but in a cave watching a full live theater on the in front of him. But the actual live theater that actors are acting is behind his back. It was just lit so that the projection of the action is on a wall of the cave. And then the goal, the task of this prisoner is to figure out what's going on. It's a pretty extreme example, but it really shows it describes what vision is about is that to make sense of the 3D world or 4D world out of 2D.

🤍0 likes💬 0 comments

00:59:07Dr. Fei-Fei Li

So spatial intelligence to me is deeper than only creating that flat 2D world. Spatial intelligence to me is the ability to create, reason, interact, make sense of deeply spatial world, whether it's 2D or 3D or 4D, including dynamics and all that. So, so World Lab is focusing on that. And of course, the ability to create videos per se, could be part of this. And in fact, just a couple of weeks ago, we rolled out the world's first realtime, demoable realtime video generation on a single H100 GPU. So we, part of our technology includes that. But I think Marble is very different because we really want creators, designers, developers to have in their hands a model that can give them worlds with 3D structure so they can use it for their work. And that's where, that's why Marble is so different.

🤍0 likes💬 0 comments

01:00:21Lenny

The way I see it is it's a platform for a ton of opportunity to do stuff. As you described, videos are just like, here's a one-off video that's very fun and cool and you could, and that's it, and you move on.

🤍0 likes💬 0 comments

01:00:33Dr. Fei-Fei Li

By the way, we could in Marble, we could allow people to export in video form. So you could actually, like you said, you go into a world. So let's say it's a hobbit cave, you can actually, especially as a creator, you have such a specific way of moving the camera in a trajectory in the director's mind, right? And then you can export that from Marble into a video.

🤍0 likes💬 0 comments

01:01:02Lenny

What does it take to create something like this? Just like how big is the team? How many, how many GPUs you working? Like anything you can share there? I don't know how much of this is private information, but just what does it take to create something like this that you've launched here?

🤍0 likes💬 0 comments

01:01:12Dr. Fei-Fei Li

It takes a lot of brain power. So, we just talk about 20 watts per brain. It's, so from that point of view, it's a small number, but it's actually an incredible, you know, it's a half billion years of evolution to give us those power. We have a team of 30-ish people now and we are predominantly researchers and research engineers. And but we also have designers and product. We actually really believe that we want to create a company that's anchored in the deep tech of spatial intelligence but we are actually building serious products. So so we have this integration of R&D and productization. And of course, we use, you know, a ton of GPUs. That's a that's the technical...

🤍0 likes💬 0 comments

01:02:16Lenny

I'm so happy to hear.

🤍0 likes💬 0 comments

01:02:20Dr. Fei-Fei Li

Well, congrats on the launch. I know this is a huge milestone. I know this took a ton of work. So, I just want to say congrats to you and your team.

🤍0 likes💬 0 comments

01:02:26Lenny

Let me talk about your founder journey for a moment. So, you're a founder of this company. You started how many years ago? Couple years ago, two, three years ago?

🤍0 likes💬 0 comments

01:02:33Dr. Fei-Fei Li

Oh, a year ago. A year ago.

🤍0 likes💬 0 comments

01:02:36Lenny

A year. Okay.

🤍0 likes💬 0 comments

01:02:37Dr. Fei-Fei Li

18 months. Yeah.

🤍0 likes💬 0 comments

01:02:38Lenny

Okay. What's something you wish you knew before you started this that you wish you could like whisper into the ear of Fei of 18 months ago?

🤍0 likes💬 0 comments

01:02:46Dr. Fei-Fei Li

Well, I continue to wish I know the future of technology. I think actually that's one of our founding advantage is that we see the future earlier in general than than most people. But still, man, this is so exciting and so amazing that what's unknown and what's coming. But I know the reason you're asking me this question is not about the future of technology. You're probably more, you know, look, I did not start a company of this scale at 20 year old. So, you know, I started a dry cleaner when I was 19, but that's a little smaller scale.

🤍0 likes💬 0 comments

01:03:28Lenny

We got to talk about that.

🤍0 likes💬 0 comments

01:03:31Dr. Fei-Fei Li

And then I, you know, founded Google Cloud AI and then I founded an institute at Stanford, but those are different beasts. I did feel I was a little more prepared as a founder of the grinding journey that that I compared to maybe, maybe the 20-year-old founders. But I still, I'm surprised and and it puts me into paranoia sometimes that how intensely competitive AI landscape is from the model, the technology itself, as well as talents. And you know, when I founded the company, we did not have these incredible stories of how much certain talents would cost, you know? So these are things that continue to surprise me and I have to be very alert about.

🤍0 likes💬 0 comments

01:04:39Lenny

So the competition you're talking about is yeah, the competition for talent, the speed at which things are moving.

🤍0 likes💬 0 comments

01:04:45Dr. Fei-Fei Li

Yeah.

🤍0 likes💬 0 comments

01:04:46Lenny

Yeah. You mentioned this point that I want to come back to that you, if you just look over the course of your career, you were like at all of the major collections of humans that led to so many of the breakthroughs that are happening today. Obviously, we talked about ImageNet, also just SAIL at Stanford is where a lot of the work happened, at Google Cloud, which a lot of the breakthroughs happened. What brought you to those places? For people looking for how to advance in their career, be at the center of the future, just like is there a through line there of just what pulled you from place to place and pulled you into those groups that might be helpful for people to hear?

🤍0 likes💬 0 comments

01:05:25Dr. Fei-Fei Li

Yeah, this is actually a great question, Lenny, because I do think about it. And obviously we talked about it, curiosity and passion that brought me to AI. That is more a scientific northstar, right? I did not care if AI was a thing or not. So, so that was one part. But how did I end up choosing in the particular places I work in, including starting World Labs, is I think I'm very grateful to myself or maybe to my parents' genes. I'm an intellectually very fearless person, and I have to say when I hire young people, I look for that because I think that's a very important quality if one wants to make a difference is that when you want to make a difference, you have to accept that you're creating something new or you're diving into something new. People haven't done that. And if you have that self-awareness, you almost have to allow yourself to be fearless and to be courageous.

🤍0 likes💬 0 comments

01:06:44Dr. Fei-Fei Li

So when I, for example, came to Stanford, you know, in the world of academia, I was very close to this thing called tenure, which is, you know, have the job forever at Princeton. But I chose to come to Stanford because I love Princeton. It's my alma mater. It's just at that moment, there are people who are so amazing at Stanford and the Silicon Valley ecosystem was so amazing that I was okay to take a risk of restarting my tenure clock.

🤍0 likes💬 0 comments

01:07:25Dr. Fei-Fei Li

Um going to becoming the first female director of SAIL, I was actually relatively speaking a very young faculty at that time and I wanted to do that because I care about that community. I didn't spend too much time thinking about all the failure cases. Obviously, I was very lucky that the more senior faculty supported me, but I just wanted to make a difference. And then going to Google was similar. I wanted to work with people like Jeff Dean, Jeff Hinton, and all these incredible, Dennis, the incredible people. So the same with World Labs. I have this passion and I also believe that people with the same mission can do incredible things. So that's how it guided me through life. I don't overthink of all possible things that can go wrong because that's too many.

🤍0 likes💬 0 comments

01:08:33Lenny

I feel like that's an important element of this is not focusing on the downside, focusing more on the people, the mission, what gets you excited?

🤍0 likes💬 0 comments

01:08:41Dr. Fei-Fei Li

I do, yeah, I do want to say one thing to all the young talents in AI, the engineers, the researchers out there, because some of you apply to World Labs. I feel very privileged you considered World Labs. I do find many of the young people today think about every single aspect of a equation when they decide on jobs at some point. Maybe, you know, maybe that's the way they want to do it. But sometimes I do want to encourage young people to focus on what's important because I find myself constantly in mentoring mode when I talk to job job candidates. Not necessarily recruiting or not recruiting, but just in mentoring mode when I see an incredible young talent who is overfocusing on every minute dimension and aspect of considering a job when maybe the most important thing is, where's your passion? Do you align with the mission? Do you believe and have faith in this team? And just focus on the impact and you can make and the kind of work and team you can you can work with.

🤍1 like💬 0 comments

01:10:05Lenny

Yeah, it's tough. It's tough for people in the AI space now. There's so much, so much at them, so much news, so much happening, so much FOMO.

🤍0 likes💬 0 comments

01:10:11Dr. Fei-Fei Li

That's true.

🤍0 likes💬 0 comments

01:10:12Lenny

I could see the stress. And so, I think that advice is really important. Just like what will actually make you feel fulfilled in what you're doing, not just where's the fastest growing company? Where's the, who's going to win? I don't know. I want to make sure I ask you about the work you're doing today at Stanford at the HAI, Human-Centered AI institute. What are you, what are you doing there? I know this is a thing you do on the site still.

🤍0 likes💬 0 comments

01:10:36Dr. Fei-Fei Li

So yes, I, HAI, Human-Centered AI institute, was co-founded by me and a group of faculty like Professor John Etchemendy, Professor James Landay, Professor Chris Manning back in 2018. I was actually finishing my last sabbatical at Google. And it was a very, very important decision for me because I could have stayed in industry, but my time at Google taught me one thing is AI is going to be a civilizational technology, and it dawned on me how important this is to humanity to the point that I actually wrote a piece in New York Times that year, 2018, to talk about the need for a guiding framework to develop and to apply AI. And that framework has to be anchored in human benevolence, is human-centeredness.

🤍0 likes💬 0 comments

01:11:42Dr. Fei-Fei Li

And I felt that Stanford, one of the world's top university in the heart of Silicon Valley that gave birth to important companies from NVIDIA to Google, should be a thought leader to create this human-centered AI framework and to to actually embody that in our research, education, and policy and in ecosystem work. So I founded HAI. It, you know, after fast forward after six, seven years, it has become the world's largest AI institute that does human-centered research, education, ecosystem outreach, and policy in impact.

🤍0 likes💬 0 comments

01:12:35Dr. Fei-Fei Li

It involves hundreds of faculty across all eight schools at Stanford from medicine to education to sustainability to business to engineering to humanities to law. And we support researchers, especially at the interdisciplinary area from digital economy to legal studies to political science to discovery of new drugs to to new algorithms to that's beyond transformers.

🤍0 likes💬 0 comments

01:13:11Dr. Fei-Fei Li

We also actually put a very strong focus on policy because when we started HAI, I realized that Silicon Valley did not talk to Washington D.C. and or Brussels or other parts of the world. And it's really, given how important this technology is, we need to bring everybody on board. So we created multiple programs from congressional bootcamp to AI index report to policy briefing. And we especially participated in policymaking including advocating for a National AI Research Cloud bill that was passed in the first Trump administration and participating in state level regulatory AI discussions. So there's a lot we did and I continue to be one of the leaders, even though I'm much less involved operationally, because I care not only we create this technology, but we use it in the right way.

🤍0 likes💬 0 comments

01:14:24Lenny

Wow. I was not aware of all that other work you were doing. As you were talking, I was reminded Charlie Munger had this quote, "Take a simple idea and take it very seriously." I feel like you've done that in so many different ways and and stayed with it and it's unbelievable the impact that you've had in so many ways over the years.

🤍0 likes💬 0 comments

01:14:45Lenny

I'm going to skip the lightning round and I'm just going to ask you one last question. Is there anything else that you wanted to share? Anything else you want to leave listeners with?

🤍0 likes💬 0 comments

01:14:52Dr. Fei-Fei Li

I'm very excited by AI, Lenny. I want to answer one question that I, when I travel around the world, everybody asks me is that if I'm a musician, if I'm a teacher, middle school teacher, if I'm a nurse, if I'm an accountant, if I'm a farmer, do I have a role in AI or is AI just going to take over my life or my work? And I think this is the most important question of AI. And I find that in Silicon Valley, we tend not to speak heart-to-heart with people, with people like us and not like us in Silicon Valley, but like all of us, we tend to just toss around words like infinite productivity or infinite leisure time or or you know, infinite power or whatever.

🤍0 likes💬 0 comments

01:15:54Dr. Fei-Fei Li

But at the end of the day, AI is about people. And when people ask me that question, it's a resounding yes. Everybody has a role in AI. It depends on what what you do and what you want. But no technology should take away human dignity, and the human dignity and agency should be at the heart of the development, the deployment as well as the governance of every technology.

🤍0 likes💬 0 comments

01:16:23Dr. Fei-Fei Li

So if you are a young artist and your passion is storytelling, embrace AI as a tool. In fact, embrace Marble. I hope it becomes a tool for you. Because the way you tell your story is unique and the world still needs it. But how you tell your story, how do you use the most incredible tool to tell your story in the most unique way is important and that that voice needs to be heard.

🤍0 likes💬 0 comments

01:16:58Dr. Fei-Fei Li

If you're a farmer near retirement, AI still matters because you're a citizen. You can participate in your community. You should have a voice in how AI is used, how AI is applied. You work with people that you can, you know, encourage all of all of you to use AI to make life easier for you.

🤍0 likes💬 0 comments

01:17:25Dr. Fei-Fei Li

If you're a nurse, I hope you know that at least in my career, I have worked so much in healthcare research because I feel our healthcare workers should be greatly augmented and helped by AI technology, whether it's smart cameras to feed more information or robotic assistance, because our nurses are overworked, over fatigued. And as our society ages, we need more help for for people to be taken care of. So AI can play that role. So I just want to say that it's so important that even a technologist like me are sincere about that everybody has a role in AI.

🤍0 likes💬 0 comments

01:18:16Lenny

What a beautiful way to end it. Such a tie back to where we started about how it's up to us and take individual responsibility for what AI will do in our lives. Final question, where can folks find Marble? Where can they go? Maybe try to join World Labs if they want to. What's the website? Where do people go?

🤍0 likes💬 0 comments

01:18:34Dr. Fei-Fei Li

Well, World Labs website is www.worldlabs.ai. And you can find our research progress there. We have technical blogs. You can find Marble, the product there. You can sign in there. You can find our job posts link there. You can, you know, we're in San Francisco. We love to work with the world's best talents.

🤍0 likes💬 0 comments

01:19:02Lenny

Amazing. Fei, thank you so much for being here.

🤍0 likes💬 0 comments

01:19:04Dr. Fei-Fei Li

Thank you, Lenny.

🤍0 likes💬 0 comments

01:19:06Lenny

Bye, everyone.

🤍0 likes💬 0 comments

01:19:09Lenny

Thank you so much for listening. If you found this valuable, you can subscribe to the show on Apple Podcasts, Spotify, or your favorite podcast app. Also, please consider giving us a rating or leaving a review as that really helps other listeners find the podcast. You can find all past episodes or learn more about the show at lennyspodcast.com. See you in the next episode.

🤍0 likes💬 0 comments

Video Player