MXFP8 training for MoEs on GB200s enables a 1.3x speedup with equivalent convergence versus BF16:
π https://pytorch.org/blog/mxfp8-training-for-moes-1-3x-training-speedup-vs-bf16-for-llama4-scout-on-gb200-cluster-using-torchao-and-torchtitan/
MXFP8 training for MoEs on GB200s enables a 1.3x speedup with equivalent convergence versus BF16:
π https://pytorch.org/blog/mxfp8-training-for-moes-1-3x-training-speedup-vs-bf16-for-llama4-scout-on-gb200-cluster-using-torchao-and-torchtitan/
PyTorch Foundation is attending Optimized AI Conference in Atlanta, April 14-16. Join 100+ experts to discuss #LLM operations, #RAG, and #InferenceOptimization.
Get 20% off with code: OAIC-20.
Details: oaiconference.com.
#PyTorch #AIInfrastructure #OpenSourceAI
DeepNVMe just got faster and more flexible:
β
Gen5 NVMe support
β
20X faster model checkpointing
β
Cost-efficient SGLang inference via ZeRO-Inference
β
CPU-only pinned memory support
π pytorch.org/blog/deepnvm...
#PyTorch #DeepSpeed #AIInfrastructure
The #PyTorchFoundation newsletter is your go-to source for the latest updates, events, and community insights to build and innovate with #PyTorchβall in support of accelerating #OpenSourceAI.
π¬ Subscribe: pytorch.org/newsletter/
π June: pytorch.org/newsletter/j...
Update from the PyTorch ecosystem: The latest NVIDIA
DALI release adds DALI Proxyβmaking it easier to accelerate parts of your PyTorch DataLoader pipeline without a full refactor.
Learn more
π developer.nvidia.com/blog/unlock-...
#PyTorch #OpenSourceAI #DataPipelines #DeepLearning
π§ Responsible AI is a design decisionβand a strategic edge.
This new guide shows how to build a Yellow Teaming assistant using PyTorch and AWS Graviton4 to surface risks early and build more accountable systems.
π pytorch.org/blog/build-r...
#ResponsibleAI #LLM #PyTorch #builtonArm
β³ Just a few days left to apply for the PyTorch Ambassador Program.
If you're making an impact with PyTorch through research, code, education, or community work, nowβs your chance to join a global network of ML leaders.
π
Deadline: June 7
π pytorch.org/programs/amb...
#PyTorch #AICommunity
Join us at #GTC25Paris25 for the session β10x Your GPU Power with #Python: Python for Programming the GPUβ
Learn how Python now matches the performance and control of C++ #CUDA.
Explore #PyTorch, CuPy, RAPIDS, cuda.parallel, numba.cuda, cuTile, etc.
π www.nvidia.com/en-eu/gtc/se...
Mixture-of-Experts (MoE) is a popular #LLM architecture that reduces computation by activating fewer parameters per token. But it brings memory, communication, & control challenges.
π‘We introduce MetaShuffling, enabling efficient Llama 4 model inference in production. π pytorch.org/blog/metashu...
The PyTorch Foundation is a Gold Sponsor of #MLSys2025 this week in Santa Clara.
Visit the booth and explore talks from Soumith Chintala, Ion Stoica, and Exec Dir Matt White on open source AI and scalable ML systems.
π pytorch.org/blog/pytorch...
#PyTorch #OpenSourceAI #AIInfrastructure
ποΈ: pytorch.org/event/toward...
PyTorch Foundation has expanded into an umbrella foundation.
vLLM and DeepSpeed have been accepted as hosted projects, advancing community-driven AI across the full lifecycle.
Quotes from AMD, AWS, Arm, Huawei, HuggingFace, IBM, Intel, LightningAI, Meta.
Read more: pytorch.org/blog/press-r...
Can language model systems autonomously complete entire tasks end-to-end?
In our next Expert Exchange webinar, Ofir Press explores autonomous LM systems for software engineering, featuring SWE-bench & SWE-agentβused by OpenAI, Meta, & more.
π pytorch.org/autonomous-l...
#PyTorch #AI #OpenSource
TODAY: Join PyTorch Core Maintainers Piotr Bialecki (NVIDIA) and Nikita Shulga (Meta) for a live Q&A session on the #PyTorch 2.7 release at 12 PM PST.
Have questions? Drop them below, & we'll share them during the webinar.
π More info: pytorch.org/pt-27-releas...
#MachineLearning #OpenSourceAI
Update from the PyTorch maintainers: 2.7 is out now.
πΉ Support for NVIDIA Blackwell (CUDA 12.8)
πΉ Mega Cache
πΉ torch.compile for Function Modes
πΉ FlexAttention updates
πΉ Intel GPU perf boost
π Blog: hubs.la/Q03jBPSL0
π Release notes: hubs.la/Q03jBPlW0
#PyTorch #OpenSourceAI
The PyTorch Day France 2025 schedule is now live:
Explore the full agenda of talks and sessions
βοΈ pytorchdayfrance2025.sched.com
Co-located with #GOSIMAI2025
ποΈ Use code PYTORCHFRIEND for 25% off registration
π Or enter the Lucky Draw: paris2025.gosim.org
#PyTorch #PyTorchDayFrance
π PyTorch's updated Sphinx theme is now in the main branch on docs.pytorch.org (coming to stable in v2.8)!
This update features dark mode, page ratings, expandable nav & more.
Try it out and share feedback via our survey: forms.gle/VJCypjGdZ1Ty.... #PyTorch #Documentation
Enter GOSIM Foundation's Lucky Draw for 70β90% off PyTorch Day France ticketsβco-located with GOSIM AI Paris 2025.
π Look for banner at paris2025.gosim.org
Schedule: paris2025.gosim.org/schedule-day...
Info: events.linuxfoundation.org/pytorch-day-...
#PyTorch #GOSIMAIParis #PyTorchDayFrance
Curious about whatβs coming in PyTorch 2.7?
Core Maintainers Piotr Bialecki (NVIDIA) and Nikita Shulga (Meta) will take them live during a Q&A on April 28 at 12 PM PST.
Hear directly from the folks behind CUDA, CI, and releases.
π pytorch.org/pt-27-releas...
#PyTorch #PyTorch27 #OpenSourceAI #ML
π The EU AI Act is hereβthe worldβs first comprehensive AI regulation. While it recognizes open source AIβs value, its exemptions arenβt unlimited.
πΉ Whoβs affected
πΉ Open source exemptions
πΉ GPAI provider obligations
linuxfoundation.eu/newsroom/ai-...
#AIAct #OpenSourceAI
Jim Zemlin on how open source PyTorch powers DeepSeek's AI breakthroughs and expands access to innovation: lnkd.in/earz5jQa
DeepSeek is building an F1 racerβfast and specialized. PyTorch is an all-terrain vehicleβmodular and open for anyone to customize their ML stack.
#OpenSource #AI #PyTorch
Please fill out the linked form to participate in our documentation survey to help the PyTorch documentation team know which areas to focus on to improve your docs experience: forms.gle/KZ4xGL65VRMY...
PyTorch documentation is the cornerstone for how developers get the information they need about PyTorch!
As such, the PyTorch documentation team is looking towards improving this overall experience and would love your feedback on how we can improve! β¬
Explore how PyTorch and DINOv2 power multi-label plant species classification in our upcoming webinar with Intel's Murilo Gustineli on March 27 at 12 PM PST.
π Register today: pytorch.org/pt-dinov2-mu...
#pytorch #machinelearning #optimization
We're sponsoring TODAY's SemiAnalysis GPU Hackathon in San Jose ahead of GTC ποΈ Speakers: Mark Saroufim, Vijay Thakkar, Horace He, Philippe Tillet & Tri Dao π Prizes include hundreds of GPU compute credits for top participants. More: semianalysis.com/hackathon-20... We can't wait to see you there!
Join us in San Francisco Oct 22-23 to showcase your expertise at #PyTorchConf 2025! Share insights with the global #AI community at this industry-leading #OpenSource #ML framework event. Submit proposals for sessions, lightning talks & more by June 1: hubs.ly/Q03bpZ310
Explore the integration of a custom #triton kernel, Liger Kernel w/ torch.compile to enhance the performance of fine-tuning #LLMs using #torchtune.
π‘ Results show a 47% reduction in peak GPU memory allocation at batch size 256 with meta-llama/Llama-3.2-1B
π Read more: pytorch.org/blog/peak-pe...