Oct 2025 · AI · Designing and Building Custom Reinforcement Learning Environments for Fine-tuning LLMs (N. Bantilan)
Oct 2025 · AI · Enabling vLLM V1 on AMD GPUs With Triton (Thomas Parnell, IBM Research & Aleksandr Malyshev, AMD)
Oct 2025 · AI · Scaling KV Caches for LLMs: How LMCache + NIXL Handle Network and Storage... (J. Jiang & M. Khazraee)
Oct 2025 · AI · Serving PyTorch LLMs at Scale: Disaggregated Inference With Kubernetes and llm-d (M. Ayoub & C. Liu)
Oct 2025 · AI · Unlocking Performance: Harnessing LLMs To Streamline GPU Kernel Development in... (Jiannan Wang)
Oct 2025 · AI · Verl: A Flexible and Efficient RL Framework for LLMs (Hongpeng Guo & Ziheng Jiang, ByteDance Seed)
Oct 2025 · AI · A Practical Field Guide to Optimizing Cost, Speed & Accuracy of LLMs (Niels Bantilan, Union.ai)
Oct 2025 · AI · LLM Inference: A Comparative Guide to Modern Open-Source Runtimes (Aleksandr Shirokov, Wildberries)
Oct 2025 · AI · Vibe Coding Your First LLM End-to-End Application (Greg Loughnane & Chris Alexiuk, AI Makerspace)
Jun 2025 · DevOps · From AI agents to LLMs as judges: Reshaping observability in the era of generative AI (Diana Todea)
Jun 2025 · AI · From LLM-as-a-Judge To Human-in-the-Loop: Rethinking Evaluat... (Eric Pugh & Fernando Rejon Barrera)
Jun 2025 · AI · From Traces To Action: Auto-Instrumenting LLMs for Observability... (Aditya Soni & Anshika Tiwari)
Jun 2025 · AI · Optimizing LLM Performance With Caching Strategies in OpenSearch (Uri Rosenberg & Sherin Chandy)