Workshop: Apache Kafka and KSQL in Action: Let’s Build a Streaming Data Pipeline

Location: Marquis C, 9th fl.

Duration: 9:00am - 4:00pm

Day of week: Friday

Level: Beginner

Key Takeaways

  • Brief introduction to Apache Kafka, and streaming concepts

  • Integrating systems with Kafka using Kafka Connect

  • Exploring data with KSQL

  • Using KSQL to filter, join, and aggregate data

Prerequisites

Participants must have their own laptop and complete the prerequisite steps detailed here: https://github.com/confluentinc/demo-scene/blob/master/ksql-workshop/pre-requisites.adoc

For reasons of compatibility we strongly recommend the use of Mac or Linux for the workshop. Knowledge of SQL is assumed.

Participants will benefit from existing knowledge of the concepts of Kafka (topics, streaming, etc).

Have you ever thought that you needed to be a programmer to do stream processing and build streaming data pipelines? Think again! Apache Kafka is a distributed, scalable, and fault-tolerant streaming platform, providing low-latency pub-sub messaging coupled with native storage and stream processing capabilities. Integrating Kafka with RDBMS, NoSQL, and object stores is simple with Kafka Connect, which is part of Apache Kafka. KSQL is the open-source SQL streaming engine for Apache Kafka, and makes it possible to build stream processing applications at scale, written using a familiar SQL interface.   

In this workshop you will learn the architectural reasoning for Apache Kafka and the benefits of real-time integration, and then build a streaming data pipeline using nothing but your bare hands, Kafka Connect, and KSQL.   

Gasp as we filter events in real time! Be amazed at how we can enrich streams of data with data from RDBMS! Be astonished at the power of streaming aggregates for anomaly detection!

Speaker: Tim Berglund

Senior Director of Developer Experience @Confluent

Tim is a teacher, author, and technology leader with Confluent, where he serves as the Senior Director of Developer Experience. He can frequently be found at speaking at conferences in the United States and all over the world. He is the co-presenter of various O’Reilly training videos on topics ranging from Git to Distributed Systems, and is the author of Gradle Beyond the Basics. He tweets as @tlberglund, blogs very occasionally at http://timberglund.com, and lives in Littleton, CO, USA with the wife of his youth and their youngest child, the other two having mostly grown up.

Find Tim Berglund at

Other Workshops:

Tracks

Monday, 24 June

Tuesday, 25 June

Wednesday, 26 June

  • Architecting For Failure

    More than just building software, building deployable production ready software in the face of guaranteed failure.

  • 21st Century Languages

    Lessons learned from building languages like Rust, Go-lang, Swift, Kotlin, and more.

  • Building High-Performing Teams

    What “high-performing team” means and how to build one effectively depends on context. This track will share different experiences of building high-performing teams in order to highlight how different contexts lead to different solutions but also what typically stays the same because we’re still dealing with humans trying to work together. How do different forces affect the building of high-performing teams.

  • Software Defined Infrastructure: Kubernetes, Service Meshes, & Beyond

    Deploying, scaling, managing your services is undifferentiated heavy lifting. Hear stories, learn techniques, and dive deep into software infrastructure.

  • High-Performance Computing: Lessons from FinTech & AdTech

    Killing latency and getting the most out of your hardware.