Rockset Unveils Real-Time SQL Analytics for Raw Events from Apache Kafka
September 30, 2019

Rockset announced the capability to analyze raw events from Apache Kafka in real time.

Kafka, backed by Confluent, is one of the most popular distributed streaming platforms and capable of handling trillions of events a day. Rockset takes an entirely new approach to ingesting, analyzing and serving data so that developers and business stakeholders can run powerful SQL analytics, including joins, on raw event data from Kafka. With this release, Rockset is also announcing a partnership with Confluent, with Rockset’s Kafka Connect Plugin listed as a Verified Gold Connector in Confluent Hub.

Increasingly, businesses are capturing real-time data to drive intelligent actions on the fly. However, traditional databases are not built to handle semi-structured data, making it difficult to operationalize event data like this in real time. In an effort to solve this issue and unlock analytics, considerable data engineering effort goes into building complex data pipelines that schematize and load NoSQL data from Kafka event streams into SQL-based systems. These pipelines are difficult to build, expensive to maintain and hours behind in terms of insights into events - making “real-time” operational analytics on event data next to impossible.

Rockset complements Kafka’s KSQL stream processing capabilities by serving as the “sink” that ingests the processed stream. With Rockset, new event data from Kafka is automatically represented as a dynamic SQL table and available for querying in seconds. Rockset uses Converged Indexing™ and a Distributed SQL Processing Engine under the hood to enable customers to filter, aggregate and join across different datasets from different sources in milliseconds, without upfront schema definitions.

“When you embrace modern real-time technologies like Kafka, you discover that NoSQL databases do not support the type of powerful analytics you need, and that’s when you turn to SQL databases. But it will take you hours to extract-transform-load these events into a traditional SQL database and that is just not fast enough for real-time use cases,” said Venkat Venkataramani, co-founder and CEO of Rockset. “Our goal is to give Kafka users the speed and simplicity they need for deriving maximum value from their event streams in seconds.”

With this release, Rockset supports the ability to:

- Visualize event data in leading real-time SQL dashboards with JDBC support, including Tableau, Apache Superset, Redash and Grafana.

- Create developer APIs for building microservices and applications for the Internet of Things (IoT), e-commerce, operational monitoring and more.

- Join Kafka event streams with business data in Amazon DynamoDB, Amazon Kinesis, Amazon S3, Google Cloud Storage and more.

Share this

Industry News

November 30, 2020

Shipa is open sourcing Ketch, Shipa's deployment engine, under Apache License Version 2.0.

November 30, 2020

Portworx by Pure Storage announced its qualification and support of Portworx Enterprise for Google Cloud's Anthos on bare metal.

November 30, 2020

SnapLogic now supports SaaS contracts in AWS Marketplace, a digital catalog with thousands of software listings from independent software vendors that make it easy to find, test, buy, and deploy software that runs on Amazon Web Services (AWS).

November 24, 2020

Red Hat announced new capabilities and features for Red Hat OpenShift, the company's enterprise Kubernetes platform.

November 24, 2020

Sectigo released Chef, Jenkins, JetStack Cert-Manager, Puppet, and SaltStack integrations for its certificate management platform.

November 24, 2020

DataStax released K8ssandra, an open-source distribution of Apache Cassandra on Kubernetes.

November 23, 2020

Spectro Cloud has released a new, self-hosted version of its flagship product, Spectro Cloud.

November 23, 2020

GitLab completed integration of Peach Tech, a security software firm specializing in protocol fuzz testing and dynamic application security testing (DAST) API testing, and Fuzzit, a continuous fuzz testing solution providing coverage-guided testing.

November 23, 2020

Fugue announced the availability of its SaaS product in AWS Marketplace, further simplifying the process for Amazon Web Services customers to use Fugue to bring their environments into compliance quickly, demonstrate compliance at any time, and Shift Left on cloud security.

November 19, 2020

Rollbar announced AI-assisted workflows powered by its new automation-grade grouping engine.

November 19, 2020

Buildkite expanded its integration with GitHub and introduced a new onboarding experience.

November 19, 2020

Rancher Labs launched a new Partner Program for the OEM and embedded community.

November 18, 2020

Puppet announced its evolution to an integrated automation platform to enable key business initiatives such as scaling DevOps, risk reduction, policy as code, and evolving cloud strategies.

November 18, 2020

Adaptavist has joined the GitLab partner program as a Select partner.

November 18, 2020

Postman launched the beta version of public workspaces, a hub that makes it possible for both API producers and consumers to seamlessly communicate and collaborate in real time without team or organizational boundaries.