Rockset Unveils Real-Time SQL Analytics for Raw Events from Apache Kafka
September 30, 2019

Rockset announced the capability to analyze raw events from Apache Kafka in real time.

Kafka, backed by Confluent, is one of the most popular distributed streaming platforms and capable of handling trillions of events a day. Rockset takes an entirely new approach to ingesting, analyzing and serving data so that developers and business stakeholders can run powerful SQL analytics, including joins, on raw event data from Kafka. With this release, Rockset is also announcing a partnership with Confluent, with Rockset’s Kafka Connect Plugin listed as a Verified Gold Connector in Confluent Hub.

Increasingly, businesses are capturing real-time data to drive intelligent actions on the fly. However, traditional databases are not built to handle semi-structured data, making it difficult to operationalize event data like this in real time. In an effort to solve this issue and unlock analytics, considerable data engineering effort goes into building complex data pipelines that schematize and load NoSQL data from Kafka event streams into SQL-based systems. These pipelines are difficult to build, expensive to maintain and hours behind in terms of insights into events - making “real-time” operational analytics on event data next to impossible.

Rockset complements Kafka’s KSQL stream processing capabilities by serving as the “sink” that ingests the processed stream. With Rockset, new event data from Kafka is automatically represented as a dynamic SQL table and available for querying in seconds. Rockset uses Converged Indexing™ and a Distributed SQL Processing Engine under the hood to enable customers to filter, aggregate and join across different datasets from different sources in milliseconds, without upfront schema definitions.

“When you embrace modern real-time technologies like Kafka, you discover that NoSQL databases do not support the type of powerful analytics you need, and that’s when you turn to SQL databases. But it will take you hours to extract-transform-load these events into a traditional SQL database and that is just not fast enough for real-time use cases,” said Venkat Venkataramani, co-founder and CEO of Rockset. “Our goal is to give Kafka users the speed and simplicity they need for deriving maximum value from their event streams in seconds.”

With this release, Rockset supports the ability to:

- Visualize event data in leading real-time SQL dashboards with JDBC support, including Tableau, Apache Superset, Redash and Grafana.

- Create developer APIs for building microservices and applications for the Internet of Things (IoT), e-commerce, operational monitoring and more.

- Join Kafka event streams with business data in Amazon DynamoDB, Amazon Kinesis, Amazon S3, Google Cloud Storage and more.

Share this

Industry News

April 08, 2020

JFrog is launching the FrogCare program for companies and organizations who are actively researching and fighting COVID-19.

April 08, 2020

Split Software announced a pre-built integration with mParticle, a customer data platform for enterprise B2C brands.

April 08, 2020

SmartBear announced the acquisition of Test Management for Jira (TM4J), an user-rated QA and test management app in Jira for enterprise teams, from London-based Adaptavist.

April 07, 2020

Docker has open sourced the Compose Specification into a standalone organization on GitHub with open governance.

April 07, 2020

AppGyver, a Finnish software company, is unveiling its new Composer Pro product to the public after four years of quiet development.

April 07, 2020

Red Hat named Paul Cormier as President and CEO of Red Hat.

April 06, 2020

Alcide announced that the Alcide Kubernetes Security Platform now supports HIPAA compliance scans.

April 06, 2020

Copado announced the immediate availability of free access to its platform for anyone working on applications to fight COVID-19.

April 06, 2020

JourneyApps will open its low-code app development platform at no charge to state governments, healthcare agencies and NGOs fighting the rapidly-spreading COVID-19 pandemic.

April 02, 2020

VMware announced the general availability of VMware vSphere 7, the biggest evolution of vSphere in over a decade.

April 02, 2020

Grafana Labs announced that Cortex v1.0 is generally available for production use.

April 02, 2020

IT Revolution announced new dates, extended pricing and its first round of confirmed speakers for DevOps Enterprise Summit Las Vegas 2020. Hosted at The Cosmopolitan of Las Vegas, DevOps Enterprise Summit will now take place November 9-11, 2020.

April 01, 2020

Compuware Corporation announced new capabilities that enable application development teams to automate performance tests early in the development lifecycle, helping large enterprises speed time to market and improve application performance—while decreasing the significant and unnecessary cost of wasted time.

April 01, 2020

PlanetScale released the newest version of PlanetScaleDB, a multi-cloud database.

April 01, 2020

Datawire announced the newest release of Ambassador Edge Stack that is designed to speed up the inner development loop.