Rockset Unveils Real-Time SQL Analytics for Raw Events from Apache Kafka
September 30, 2019

Rockset announced the capability to analyze raw events from Apache Kafka in real time.

Kafka, backed by Confluent, is one of the most popular distributed streaming platforms and capable of handling trillions of events a day. Rockset takes an entirely new approach to ingesting, analyzing and serving data so that developers and business stakeholders can run powerful SQL analytics, including joins, on raw event data from Kafka. With this release, Rockset is also announcing a partnership with Confluent, with Rockset’s Kafka Connect Plugin listed as a Verified Gold Connector in Confluent Hub.

Increasingly, businesses are capturing real-time data to drive intelligent actions on the fly. However, traditional databases are not built to handle semi-structured data, making it difficult to operationalize event data like this in real time. In an effort to solve this issue and unlock analytics, considerable data engineering effort goes into building complex data pipelines that schematize and load NoSQL data from Kafka event streams into SQL-based systems. These pipelines are difficult to build, expensive to maintain and hours behind in terms of insights into events - making “real-time” operational analytics on event data next to impossible.

Rockset complements Kafka’s KSQL stream processing capabilities by serving as the “sink” that ingests the processed stream. With Rockset, new event data from Kafka is automatically represented as a dynamic SQL table and available for querying in seconds. Rockset uses Converged Indexing™ and a Distributed SQL Processing Engine under the hood to enable customers to filter, aggregate and join across different datasets from different sources in milliseconds, without upfront schema definitions.

“When you embrace modern real-time technologies like Kafka, you discover that NoSQL databases do not support the type of powerful analytics you need, and that’s when you turn to SQL databases. But it will take you hours to extract-transform-load these events into a traditional SQL database and that is just not fast enough for real-time use cases,” said Venkat Venkataramani, co-founder and CEO of Rockset. “Our goal is to give Kafka users the speed and simplicity they need for deriving maximum value from their event streams in seconds.”

With this release, Rockset supports the ability to:

- Visualize event data in leading real-time SQL dashboards with JDBC support, including Tableau, Apache Superset, Redash and Grafana.

- Create developer APIs for building microservices and applications for the Internet of Things (IoT), e-commerce, operational monitoring and more.

- Join Kafka event streams with business data in Amazon DynamoDB, Amazon Kinesis, Amazon S3, Google Cloud Storage and more.

Share this

Industry News

March 04, 2021

GrammaTech announced a technology partnership with GitLab, the single application for the DevOps lifecycle.

March 04, 2021

Exadel announced that Sun Capital Partners, a private investment firm, has completed an acquisition of the company.

March 04, 2021

Palo Alto Networks completed its acquisition of Bridgecrew, a developer-first cloud security company.

March 03, 2021

Red Hat announced the latest release of Red Hat Process Automation, which delivers new developer tooling, extended support for eventing and streaming for event-driven architectures (EDA) through integration with Apache Kafka, and new monitoring capabilities through heatmap dashboards.

March 03, 2021

Leaders of the software development industry announced the formation of the Value Stream Management Consortium (VSMC).

March 03, 2021

Delphix and GenRocket announced a technology alliance designed to fulfill the needs of enterprise customers who desire a comprehensive test data solution that improves software quality.

March 02, 2021

JFrog announced that its DevOps Platform tools – JFrog Artifactory and JFrog Xray – are available with native deployment templates for customers using AWS GovCloud (US) and Azure Government clouds.

March 02, 2021

Spectro Cloud announced support for existing Kubernetes environments, including clusters on public cloud services such as Amazon EKS, Azure AKS and Google GKE, has been added to the Spectro Cloud Kubernetes management platform.

March 02, 2021

Idera announced the acquisition of PreEmptive Solutions, LLC, a provider of application protection and security.

March 01, 2021

CloudBolt Software announced the launch of OneFuse Community Edition, a free version of its codeless integration platform for automating, integrating, and extending private and hybrid cloud infrastructures.

March 01, 2021

DBmaestro launched support for Snowflake, the Data Cloud company.

March 01, 2021

Platform9 closed Series-D funding with an additional $12.5 million for a total of $37.5 million.

February 25, 2021

Red Hat announced Red Hat OpenShift 4.7, the latest version of the company’s enterprise Kubernetes platform.

February 25, 2021

Granulate announced the release of its open-source platform, the G-Profiler, a production profiling solution that measures the performance of code in production applications to facilitate compute optimization.

February 25, 2021

Checkmarx announced the launch of KICS (Keeping Infrastructure as Code Secure), an open source static analysis solution that enables developers to write more secure infrastructure as code (IaC).