Holzman, Burt, et al. Optimizing High-Throughput Inference on Graph Neural Networks at Shared Computing Facilities with the NVIDIA Triton Inference Server.
APA
Holzman, B., Ulmer, K., Perloff, A., Savard, C., Stenson, K., Pedro, K., Gray, L., & Manganelli, N. Optimizing High-Throughput Inference on Graph Neural Networks at Shared Computing Facilities with the NVIDIA Triton Inference Server.
Chicago
Holzman, Burt, Keith A. Ulmer, Alexx Perloff, Claire Savard, Kevin Stenson, Kevin Pedro, Lindsey Gray et al. Optimizing High-Throughput Inference on Graph Neural Networks at Shared Computing Facilities with the NVIDIA Triton Inference Server.