READ MORE: Andrew Ng: Unbiggen AI > The AI pioneer says it’s time for smart-sized, “data-centric” solutions to big issues (IEEE Specturm)
The current way to train successful AI models is to throw massive data sets at it, but that hits a snag with video. The processing power and bandwidth required to crunch video at sufficient volumes in current neural networks is holding back developments in computer vision.
That could change if smaller, higher quality “data-centric AI” were employed, allowing it to scale much more quickly than today’s current rate.
Data scientist and businessman Andrew Ng says that “small data” solutions can solve big issues in AI, including model efficiency, accuracy, and bias.
“Data-centric AI is the discipline of systematically engineering the data needed to successfully build an AI system,” he explains in an interview with IEEE Spectrum.
Ng has form, which is why IEEE is interested in what he has to say. He pioneered the use of graphics processing units (GPUs) to train deep learning models in the late 2000s; he cofounded Google Brain in 2011; and then served for three years as chief scientist for Baidu, where he helped build the Chinese tech giant’s AI group.
EXPLORING ARTIFICIAL INTELLIGENCE:
With nearly half of all media and media tech companies incorporating Artificial Intelligence into their operations or product lines, AI and machine learning tools are rapidly transforming content creation, delivery and consumption. Find out what you need to know with these essential insights curated from the NAB Amplify archives:
- This Will Be Your 2032: Quantum Sensors, AI With Feeling, and Life Beyond Glass
- Learn How Data, AI and Automation Will Shape Your Future
- Where Are We With AI and ML in M&E?
- How Creativity and Data Are a Match Made in Hollywood/Heaven
- How to Process the Difference Between AI and Machine Learning
“I’m excited about the potential of building foundation models in computer vision,” he says. “I think there’s lots of signal to still be exploited in video: We have not been able to build foundation models yet for video because of compute bandwidth and the cost of processing video, as opposed to tokenized text.”
The compute power needed to process the large volume of images for video is significant, which is why foundation models have emerged first in audio and text contexts like Neural Language Processing. Ng is confident that advances in the power of semiconductors could see foundation models developed in computer vision.
“Architectures built for hundreds of millions of images don’t work with only 50 images,” he says. “But it turns out, if you have 50 really good examples, you can build something valuable. In many industries where giant data sets simply don’t exist, I think the focus has to shift from big data to good data. Having 50 thoughtfully engineered examples can be sufficient to explain to the neural network what you want it to learn.”
He says the difficulty in being able to scale AI models is a problem in just about every industry. Using health care as an example, he says, “Every hospital has its own slightly different format for electronic health records. How can every hospital train its own custom AI model? Expecting every hospital’s IT personnel to invent new neural-network architectures is unrealistic.
“The only way out of this dilemma is to build tools that empower the customers to build their own models by giving them tools to engineer the data and express their domain knowledge.”
BOOST YOUR BOTTOM LINE WITH INTELLIGENT CONTENT:
What is intelligent content? Designed to be used for multiple purposes across multiple platforms and technologies, intelligent content is media that is rich with data and metadata, enabling us to find, reuse and endlessly adapt it. Learn how intelligent content can boost your bottom line with these expert insights curated from the NAB Amplify archives:
- Your Business in 2025 is Data-Driven, Right? (Right?)
- Learn How Data, AI and Automation Will Shape Your Future
- Intelligent Content Creation Inspires, Accelerates, Energizes — and Cuts Costs
- Watch This: How Intelligent Content Advances Creativity
- Intelligent Content Kickstarts Your Creativity and Challenges You to Be Better at Your Job
That’s what Ng’s new company, Landing AI, is executing in computer vision.
“In the last decade, the biggest shift in AI was a shift to deep learning. I think it’s quite possible that in this decade the biggest shift will be to data-centric AI. With the maturity of today’s neural network architectures, I think for a lot of the practical applications the bottleneck will be whether we can efficiently get the data we need to develop systems that work well. The data-centric AI movement has tremendous energy and momentum across the whole community. I hope more researchers and developers will jump in and work on it.”
Want more? In a TED Talk, Jeff Dean, head of Google’s AI efforts, discusses the underlying technology that allows AI to do some of the things it can, like understanding languages and diagnosing diseases: