TensorRT company logo - ML Infrastructure company based in United States
enterprise

TensorRT

Santa Clara, United States
Founded 2017

About

NVIDIA TensorRT is an SDK for optimizing and deploying deep learning models across a range of hardware, from data centers to edge devices. Utilizing techniques like quantization and kernel tuning, TensorRT significantly reduces inference latency and increases throughput compared to CPU-only deployments. The platform specifically targets developers working with performance-critical applications and large language models requiring efficient GPU acceleration.

Technology Focus

ml infrastructure llm developer tools

Quick Stats

Founded 2017
Status private
Type enterprise