[ad_1]
Ted Hisokawa
Jan 31, 2025 06:38
NVIDIA’s NCCL 2.23 release introduces a new scaling algorithm, accelerated initialization, and a profiler plugin API, optimizing inter-GPU and multinode communication for AI and HPC applications.
The latest release of the NVIDIA Collective Communications Library (NCCL) 2.23 introduces a suite of enhancements aimed at optimizing inter-GPU and multinode communication, essential for artificial intelligence (AI) and high-performance computing (HPC) applications. According to NVIDIA, these improvements are designed to boost the efficiency and scalability of parallel computing.
The NCCL 2.23 release is marked by several key innovations:
ncclCommInitRankScalable API.
The PAT algorithm, inspired by the Bruck algorithm, enables efficient communication across various network sizes by minimizing buffering needs. This enhancement is particularly beneficial for large language model training, where pipeline and tensor parallelism are critical.
The ncclCommInitRankScalable API facilitates scalable initialization by allowing multiple unique IDs, thus mitigating the bottleneck associated with all-to-one communication patterns in large-scale operations.
NCCL 2.23 supports intranode user buffer registration, optimizing data transfer over NvLink and PCIe. This feature reduces overhead and enhances performance by leveraging registered user buffers, which are automatically registered during CUDA Graph capture.
The new profiler plugin API addresses the growing need for domain-specific monitoring tools in expansive GPU clusters. By enabling the profiling of NCCL events, this API aids in detecting performance anomalies and optimizing resource allocation.
With the introduction of these advanced features, NVIDIA’s NCCL 2.23 promises to significantly enhance the performance and scalability of GPU communications, reinforcing its utility in AI and HPC domains. For a deeper understanding of these updates, visit the official NVIDIA blog.
Image source: Shutterstock
[ad_2]
Source link
[ad_1] भारतीय शेयर बाजारों में शुक्रवार (11 अप्रैल) को जबरदस्त तेजी देखने को मिली। सेंसेक्स…
[ad_1] Joerg Hiller Dec 13, 2025 13:56 BTC price prediction suggests…
[ad_1] Mutual Fund March 2025 Data: शेयर बाजार में जारी उतार-चढ़ाव और ट्रंप टैरिफ (Trump…
[ad_1] Lawrence Jengar Dec 10, 2025 12:37 Glassnode releases The Bitcoin…
[ad_1] जेफरीज के अनुसार 2026 में देखने योग्य शीर्ष उपभोक्ता वित्त स्टॉक्स [ad_2] Source link
[ad_1] Felix Pinkston Dec 10, 2025 12:39 ARB price prediction shows…
This website uses cookies.