Oct 2025AI Rethinking the Transformer: Toward Native Multimodal ArchitecturesBowen Peng, Nous Research
Oct 2025AI Scaling Inference of O(10K)-length Sequence Recommendation Models Using...S. Joshi & K. Rajesh
Oct 2025AI Scaling KV Caches for LLMs: How LMCache + NIXL Handle Network and Storage...- J. Jiang & M. Khazraee
Oct 2025AI Serving PyTorch LLMs at Scale: Disaggregated Inference With Kubernetes and Llm-dM. Ayoub & C. Liu
Oct 2025AI Sponsor Session: Low-Precision Inference without Quality Loss...Pankaj Gupta & Philip Kiely
Oct 2025AI The Building Blocks of Agentic AlJoe Spisak, Product Director, Meta Superintelligence Labs
Oct 2025AI The Future Is Tiled: Using CuTile & TileIR To Write Portable, High-performance GPU...- Jared Roesch
Oct 2025AI The OpenMDW License Agreement: Simple, Permissive Terms for AI Model MaterialsSteve Winslow
Oct 2025AI Thunder: Distribute and Optimize Your PyTorch Models With Zero...Luca Antiga & Thomas Viehmann
Oct 2025AI Transformers: Standardizing Model Definitions Across the PyTorch EcosystemL. Debut & A. Zucker
Oct 2025AI Unlocking Performance: Harnessing LLMs To Streamline GPU Kernel Development in...Jiannan Wang
Oct 2025AI Verl: A Flexible and Efficient RL Framework for LLMsHongpeng Guo & Ziheng Jiang, ByteDance Seed