The roots of many of NVIDIA’s landmark innovations, the foundational technology that powers AI, accelerated computing, real-time ray tracing and seamlessly connected data centers, can be found in the company’s research organization, a global team of around 400 experts in fields including computer architecture, generative AI, graphics and robotics.
Established in 2006 and led since 2009 by Bill Dally, former chair of Stanford University’s computer science department, NVIDIA Research is unique among corporate research organizations: it was set up with a mission to pursue complex technological challenges while having a profound impact on the company and the world.
“We make a deliberate effort to do great research while being relevant to the company,” said Dally, chief scientist and senior vice president of NVIDIA Research. “It’s easy to do one or the other. It’s hard to do both.”
Dally is among the NVIDIA Research leaders sharing the group’s innovations at NVIDIA GTC, the premier developer conference at the heart of AI, taking place this week in San Jose, California.
While many research organizations may describe their mission as pursuing projects with a longer time horizon than those of a product team, NVIDIA researchers seek out projects with a larger “risk horizon” and a huge potential payoff if they succeed.
“Our mission is to do the right thing for the company. It’s not about building a trophy case of best paper awards or a museum of famous researchers,” said David Luebke, vice president of graphics research and NVIDIA’s first researcher. “We are a small group of people who are privileged to be able to work on ideas that could fail. And so it is incumbent upon us to not waste that opportunity and to do our best on projects that, if they succeed, will make a big difference.”
Innovating as One Team
One of NVIDIA’s core values is “one team,” a deep commitment to collaboration that helps researchers work closely with product teams and industry stakeholders to transform their ideas into real-world impact.
“Everybody at NVIDIA is incentivized to figure out how to work together because the accelerated computing work that NVIDIA does requires full-stack optimization,” said Bryan Catanzaro, vice president of applied deep learning research at NVIDIA. “You can’t do that if each piece of technology exists in isolation and everybody’s staying in silos. You have to work together as one team to achieve acceleration.”
When evaluating potential projects, NVIDIA researchers consider whether the challenge is a better fit for a research or product team, whether the work merits publication at a top conference, and whether there’s a clear potential benefit to NVIDIA. If they decide to pursue the project, they do so while engaging with key stakeholders.
“We work with people to make something real, and often, in the process, we discover that the great ideas we had in the lab don’t actually work in the real world,” Catanzaro said. “It’s a close collaboration where the research team needs to be humble enough to learn from the rest of the company what they need to do to make their ideas work.”
The team shares much of its work through papers, technical conferences and open-source platforms like GitHub and Hugging Face. But its focus remains on industry impact.
“We think of publishing as a really important side effect of what we do, but it’s not the point of what we do,” Luebke said.
NVIDIA Research’s first effort was focused on ray tracing, which after a decade of sustained work led directly to the launch of NVIDIA RTX and redefined real-time computer graphics. The organization now includes teams specializing in chip design, networking, programming systems, large language models, physics-based simulation, climate science, humanoid robotics and self-driving cars, and it continues expanding to tackle additional areas of study and tap expertise across the globe.
Transforming NVIDIA and the Industry
NVIDIA Research didn’t just lay the groundwork for some of the company’s most well-known products. Its innovations have propelled and enabled today’s era of AI and accelerated computing.
It began with CUDA, a parallel computing software platform and programming model that enables researchers to tap GPU acceleration for myriad applications. Launched in 2006, CUDA made it easy for developers to harness the parallel processing power of GPUs to speed up scientific simulations, gaming applications and the creation of AI models.
“Developing CUDA was the single most transformative thing for NVIDIA,” Luebke said. “It happened before we had a formal research group, but it happened because we hired top researchers and had them work with top architects.”
Making Ray Tracing a Reality
Once NVIDIA Research was founded, its members began working on GPU-accelerated ray tracing, spending years developing the algorithms and the hardware to make it possible. In 2009, the project, led by the late Steven Parker, a real-time ray tracing pioneer who was vice president of professional graphics at NVIDIA, reached the product stage with the NVIDIA OptiX application framework, detailed in a 2010 SIGGRAPH paper.
The researchers’ work expanded and, in collaboration with NVIDIA’s architecture group, eventually led to the development of NVIDIA RTX ray-tracing technology, including RT Cores that enabled real-time ray tracing for gamers and professional creators.
Unveiled in 2018, NVIDIA RTX also marked the launch of another NVIDIA Research innovation: NVIDIA DLSS, or Deep Learning Super Sampling. With DLSS, the graphics pipeline no longer needs to draw all the pixels in a video. Instead, it draws a fraction of the pixels and gives an AI pipeline the information needed to create the image in crisp, high resolution.
Accelerating AI for Virtually Any Application
NVIDIA’s research contributions in AI software kicked off with the NVIDIA cuDNN library for GPU-accelerated neural networks, which was developed as a research project when the deep learning field was still in its initial stages, then released as a product in 2014.
As deep learning soared in popularity and evolved into generative AI, NVIDIA Research was at the forefront, exemplified by NVIDIA StyleGAN, a groundbreaking visual generative AI model that demonstrated how neural networks could rapidly generate photorealistic imagery.
While generative adversarial networks, or GANs, were first introduced in 2014, “StyleGAN was the first model to generate visuals that could completely pass muster as a photograph,” Luebke said. “It was a watershed moment.”

NVIDIA researchers introduced a slew of popular GAN models such as the AI painting tool GauGAN, which later developed into the NVIDIA Canvas application. And with the rise of diffusion models, neural radiance fields and Gaussian splatting, they’re still advancing visual generative AI, including in 3D with recent models like Edify 3D and 3DGUT.

In the field of large language models, Megatron-LM was an applied research initiative that enabled the efficient training and inference of massive LLMs for language-based tasks such as content generation, translation and conversational AI. It’s integrated into the NVIDIA NeMo platform for developing custom generative AI, which also features speech recognition and speech synthesis models that originated in NVIDIA Research.
Achieving Breakthroughs in Chip Design, Networking, Quantum and More
AI and graphics are only some of the fields NVIDIA Research tackles. Several teams are achieving breakthroughs in chip architecture, electronic design automation, programming systems, quantum computing and more.
In 2012, Dally submitted a research proposal to the U.S. Department of Energy for a project that would become NVIDIA NVLink and NVSwitch, the high-speed interconnect that enables rapid communication between GPU and CPU processors in accelerated computing systems.

In 2013, the circuit research team published work on chip-to-chip links that introduced a signaling system co-designed with the interconnect to enable a high-speed, low-area and low-power link between dies. The project eventually became the link between the NVIDIA Grace CPU and NVIDIA Hopper GPU.
In 2021, the ASIC and VLSI Research group developed a software-hardware codesign technique for AI accelerators called VS-Quant that enabled many machine learning models to run with 4-bit weights and 4-bit activations at high accuracy. Their work influenced the development of FP4 precision support in the NVIDIA Blackwell architecture.
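To give a rough sense of why fine-grained scaling helps at 4 bits, here is a minimal Python sketch of per-vector scaled quantization, the general idea the VS-Quant name refers to. It is an illustration under stated assumptions, not NVIDIA’s implementation: the function names, the vector size of 16 and the symmetric integer range are all hypothetical choices made for this example, and the hardware side of the codesign is not modeled.

```python
# Illustrative sketch only: per-vector scaled 4-bit quantization.
# All names and parameters here are assumptions for this example.

def quantize_per_vector(weights, vector_size=16, bits=4):
    """Quantize a flat list of floats, using one scale factor per vector."""
    qmax = 2 ** (bits - 1) - 1  # 7 for signed 4-bit codes in [-7, 7]
    codes, scales = [], []
    for start in range(0, len(weights), vector_size):
        vec = weights[start:start + vector_size]
        max_abs = max(abs(w) for w in vec)
        # One scale per vector, chosen so the largest value maps to qmax.
        scale = max_abs / qmax if max_abs > 0 else 1.0
        scales.append(scale)
        codes.extend(max(-qmax, min(qmax, round(w / scale))) for w in vec)
    return codes, scales


def dequantize_per_vector(codes, scales, vector_size=16):
    """Map integer codes back to floats using each vector's scale."""
    return [c * scales[i // vector_size] for i, c in enumerate(codes)]


def max_abs_error(a, b):
    """Largest elementwise absolute difference between two lists."""
    return max(abs(x - y) for x, y in zip(a, b))


# Hypothetical data: 16 small weights, then a vector containing one outlier.
weights = [0.01 * i for i in range(-8, 8)] + [4.0] + [0.02] * 15

# Per-vector scaling: the outlier only affects its own 16-element vector.
codes_v, scales_v = quantize_per_vector(weights, vector_size=16)
err_vector = max_abs_error(weights, dequantize_per_vector(codes_v, scales_v, 16))

# A single scale for the whole tensor: the outlier sets the scale for everything.
n = len(weights)
codes_t, scales_t = quantize_per_vector(weights, vector_size=n)
err_tensor = max_abs_error(weights, dequantize_per_vector(codes_t, scales_t, n))

print("max error, per-vector scale:", err_vector)
print("max error, single scale:    ", err_tensor)
```

In this toy data, a single large weight forces a coarse scale on every value when only one scale is used, while per-vector scaling confines that effect to the outlier’s own vector.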
And unveiled this year at the CES trade show was NVIDIA Cosmos, a platform created by NVIDIA Research to accelerate the development of physical AI for next-generation robots and autonomous vehicles. Read the research paper and check out the AI Podcast episode on Cosmos for details.
Learn more about NVIDIA Research at GTC, and watch the keynote by NVIDIA founder and CEO Jensen Huang.
See notice regarding software product information.