Revolutionizing Data Analytics: GPU-Native Velox and NVIDIA cuDF Integration

[ad_1]



Rongchai Wang
Oct 06, 2025 06:01

NVIDIA and IBM collaborate to integrate GPU-native Velox with NVIDIA cuDF, enhancing data analytics performance on platforms like Presto and Apache Spark.





As data-driven demands grow, NVIDIA and IBM have partnered to enhance data analytics capabilities by integrating GPU-native Velox with NVIDIA cuDF. This collaboration aims to deliver significant performance improvements over traditional CPU-based systems by leveraging the high memory bandwidth and thread count of GPUs, according to NVIDIA. These enhancements are particularly beneficial for compute-heavy workloads involving multiple joins, complex aggregations, and string processing.

Velox and cuDF: A Powerful Combination

The integration of NVIDIA cuDF into the Velox execution engine allows for GPU-native query execution on widely-used platforms like Presto and Apache Spark. This open project aims to address performance bottlenecks, enabling real-time insights from massive datasets. Velox acts as an intermediary, translating query plans from systems like Presto and Spark into executable GPU pipelines powered by cuDF.

Accelerating Presto with GPU Power

By moving the entire Presto query plan to GPU, the integration aims to boost execution speed significantly. Enhancements to GPU operators such as TableScan, HashJoin, and HashAggregations in Velox enable end-to-end GPU execution in Presto. Initial benchmarks show impressive runtime reductions, with Presto on NVIDIA GPUs achieving runtimes significantly lower than CPU counterparts.

Multi-GPU Execution for Enhanced Performance

The collaboration introduces a UCX-based Exchange operator, which supports the entire execution pipeline on GPUs, leveraging high bandwidth NVLink and RoCE or InfiniBand for connectivity. This setup allows for substantial performance gains, with Presto on GPU showcasing more than a sixfold speedup in data exchange processes.

Hybrid Execution in Apache Spark

In Apache Spark, the integration with Apache Gluten and cuDF focuses on offloading compute-intensive query stages to GPUs, optimizing resource use in hybrid clusters. This strategy allows for efficient use of GPU resources while maintaining CPU availability for other tasks, resulting in significant performance improvements.

Community Involvement and Future Prospects

The open-source nature of this project encourages community involvement, aiming to drive further innovations across the data processing ecosystem. By implementing reusable GPU operators in Velox, the collaboration seeks to reduce duplication and simplify maintenance while accelerating various systems.

Image source: Shutterstock


[ad_2]

Source link

Santosh

Share
Published by
Santosh

Recent Posts

Stocks Vs Crypto vs Forex what to do?

Source Download video - Download Video

2 weeks ago

7 Most Time Management Tips | by Him eesh Madaan

Discover 7 magical time management techniques for 100% success. Do you want to achieve more…

2 weeks ago

THIS CHAKRA THAT SUMMONS ME IS IT MADARA’S

Source Download video - Download Video

2 weeks ago

2026 में Crypto Market में वापसी की जोरदार उम्मीद! | Bitcoin News

2026 में Crypto Market में वापसी की जोरदार उम्मीद! | Bitcoin News 2025 में क्रिप्टो…

2 weeks ago

Caffeinated Cowboys: A History of Coffee in the Old Wild West…

Coffee played an essential role in shaping the American frontier during the Old West. For…

2 weeks ago

Financial Education in Hindi Financial literacy

Financial Education in Hindi Financial Literacy Follow me here Qj1GXxO16XXOpVIuAYUNm7 youtube channelhttps://www.youtube.com/channel/UCZt6GXD3VnY4rsvXqLX8IQw Source Download video…

3 weeks ago

This website uses cookies.