Oct 2025 - AI - Scaling KV Caches for LLMs: How LMCache + NIXL Handle Network and Storage... - J. Jiang & M. Khazraee
Oct 2025 - AI - Serving PyTorch LLMs at Scale: Disaggregated Inference With Kubernetes and llm-d - M. Ayoub & C. Liu
Oct 2025 - AI - Sponsor Session: Low-Precision Inference without Quality Loss... - Pankaj Gupta & Philip Kiely
Oct 2025 - AI - The Building Blocks of Agentic AI - Joe Spisak, Product Director, Meta Superintelligence Labs
Oct 2025 - AI - The Future Is Tiled: Using CuTile & TileIR To Write Portable, High-performance GPU... - Jared Roesch
Oct 2025 - AI - The OpenMDW License Agreement: Simple, Permissive Terms for AI Model Materials - Steve Winslow
Oct 2025 - AI - Thunder: Distribute and Optimize Your PyTorch Models With Zero... - Luca Antiga & Thomas Viehmann
Oct 2025 - AI - Transformers: Standardizing Model Definitions Across the PyTorch Ecosystem - L. Debut & A. Zucker
Oct 2025 - AI - Unlocking Performance: Harnessing LLMs To Streamline GPU Kernel Development in... - Jiannan Wang
Oct 2025 - AI - Verl: A Flexible and Efficient RL Framework for LLMs - Hongpeng Guo & Ziheng Jiang, ByteDance Seed
Oct 2025 - AI - A Practical Field Guide to Optimizing Cost, Speed & Accuracy of LLMs - Niels Bantilan, Union.ai