PlasmaENGINE® and Oracle Cloud Infrastructure Team Up to Produce Incredible Results
Oracle Cloud Infrastructure’s next-generation cloud technology was designed to handle performance-intensive workloads, including high-performance computing (HPC) workloads, and has given enterprises an incredible new set of tools for digital innovation.
PlasmaENGINE® from FASTDATA.io is the world’s first GPU-native, real-time stream processing software. Using the thousands of cores on a GPU, PlasmaENGINE® processes structured or unstructured streaming data up to three orders of magnitude faster than CPU-based alternatives such as Apache Spark.
We at FASTDATA.io built PlasmaENGINE® to provide CTOs, product managers, software developers, data engineers, scientists, and architects with access to highly efficient, real-time data analytics for business intelligence. The goal of PlasmaENGINE® is to give companies the ability to “wow” their customers, accelerate operations, and extract value and revenue opportunities from their data that they previously deemed impossible.
In order for every industry to take advantage of the NVIDIA GPU platform, an incredibly efficient software had to be written for it. We developed exactly that: an equally performant and efficient software technology to transform data processing from “collect > store > process” to “collect > process > store.”
To test the limits of PlasmaENGINE®, Oracle enabled FASTDATA.io to access its new bare metal BM.GPU3.8 compute instances, which contain 8x NVIDIA V100 top-of-the-line GPUs. Bare metal GPU servers, without the hypervisor overhead, deliver uncompromising and predictable performance. We tested it by using the haversine benchmark, a SparkSQL query used to calculate and compare the distance between two GPS points on Earth. The benchmark takes gigabytes of CDR rows, each containing the location of both ends of a cellphone call, and then filters those calls based on distance as calculated by the haversine function. This benchmark simulates pipelines common in large telecom companies.
The PlasmaENGINE® test on the Oracle Cloud BM.GPU3.8 instance was conducted with the following specifications:
- GPU: 8x V100
- Interconnect: NVLINK
- CPU: 52 cores
- Memory: 768 GB
- Network: 2 x 25 Gbps
- Storage: Up to 1 PB of remote block storage volumes
The benchmark measures how many records can be processed per second. The following images show the results for Spark running on a CPU compared with PlasmaENGINE® running on GPUs. PlasmaENGINE® processed 1.45 billion rows per second on BM.GPU3.8, compared to Apache Spark’s 12.84 million rows.
Oracle Cloud Infrastructure harnesses the capabilities of its NVIDIA GPUs—in this case, 8x V100s—and enables PlasmaENGINE® to process data with unparalleled performance. The resulting performance was over two orders of magnitude greater than any CPU-bound stream-processing software on the market today, with the potential to produce even greater results in the future. That type of speed and efficiency allows businesses to save time and money, and to create unforeseen revenue opportunities.
Both solutions working together have the potential to achieve incredible results for your business.