Tutorial: Capturing and processing streaming data with Apache Kafka

This half-day tutorial will describe how to use Apache Kafka to store and process streaming data -- anything from user activity and app metrics to device instrumentation and logs. Learn about Kafka's core abstractions and how to interact with Kafka using the Kafka clients, called producers and consumers. With these tools, it's easy to design applications as a set of loosely coupled services exchanging data via Kafka.

The introduction will describe Kafka's unifying abstraction, a partitioned and replicated low-latency commit log, and how it can be applied to several types of applications. The majority of the tutorial will focus on building an end-to-end application that performs simple anomaly detection on user activity data. A front-end application will be instrumented with a Kafka producer to report activity data to a Kafka topic. We'll discuss how to use schemas, and Avro specifically, to ensure downstream consumers agree on the data format and to allow safe, robust evolution of that format. Then, to detect anomalies, we'll build a distributed, fault-tolerant service that uses Kafka's consumer group abstraction to process the data.
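To make the partitioned log and consumer group ideas concrete, here is a minimal in-memory sketch in plain Python. It does not use Kafka or any Kafka client; the names `PartitionedLog`, `assign_partitions`, and `detect_anomalies`, the round-robin assignment, and the count-over-threshold rule are all assumptions made for illustration, not the tutorial's actual application.

```python
import zlib
from collections import defaultdict


class PartitionedLog:
    """Toy stand-in for a topic: a fixed number of append-only partitions."""

    def __init__(self, num_partitions):
        self.partitions = [[] for _ in range(num_partitions)]

    def append(self, key, value):
        # A stable hash sends every record with the same key to the same partition.
        partition = zlib.crc32(key.encode("utf-8")) % len(self.partitions)
        self.partitions[partition].append((key, value))
        # Return the record's position: (partition, offset within that partition).
        return partition, len(self.partitions[partition]) - 1

    def read(self, partition, offset=0):
        # Reading never removes records; a consumer just tracks its own offset.
        return self.partitions[partition][offset:]


def assign_partitions(num_partitions, members):
    """Spread partitions over group members round-robin (a hypothetical policy)."""
    assignment = {member: [] for member in members}
    for partition in range(num_partitions):
        assignment[members[partition % len(members)]].append(partition)
    return assignment


def detect_anomalies(log, partitions, threshold):
    """Flag keys with more than `threshold` events in the given partitions."""
    counts = defaultdict(int)
    for partition in partitions:
        for key, _value in log.read(partition):
            counts[key] += 1
    return {key for key, count in counts.items() if count > threshold}


# "Producer" side: a front end reports user activity, keyed by user.
log = PartitionedLog(num_partitions=4)
for _ in range(3):
    log.append("alice", "click")
for _ in range(50):
    log.append("mallory", "login_failed")

# "Consumer group" side: each worker processes only the partitions it owns.
assignment = assign_partitions(4, ["worker-a", "worker-b"])
flagged = set()
for worker, owned in assignment.items():
    flagged |= detect_anomalies(log, owned, threshold=10)
```

Because all events for a given key land in one partition, whichever worker owns that partition sees the full history for that user, so `flagged` ends up as `{"mallory"}` no matter how the partitions are split. A real deployment would also need replication, committed offsets, and rebalancing when workers join or leave, none of which this sketch models.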

Co-founder and CTO, GridGain

Nikita Ivanov

How Cloud Database Enables Efficient Real-Time Analytics

Co-founder & CTO at Clusterpoint

Jurgis Orups

Designing for Distributed, Unstructured Data: Availability vs. Analytics in NoSQL

Senior Developer Advocate, Basho

Matthew Brender

Using Luigi to build data pipelines that won’t wake you at 3am

DevOps Evangelist @ Datadog

Matthew Williams

Building Cognitive Applications with IBM Watson

Senior Product Manager, IBM Watson Group

Salil Ahuja

Consensus Systems for the Skeptical Architect

CTO at Rent the Runway

Camille Fournier

Financial Modeling with Apache Spark: Calculating Value at Risk

Data Scientist at Cloudera

Sandy Ryza

Making Distributed Data Persistent Services Elastic (Without Losing All Your Data)

Founder, Principal Consultant at Big Data Open Source Security LLC

Joe Stein

Pulsar: Realtime Analytics at Scale

Director of Engineering, eBay

Tony Ng

Tracks

Wednesday Jun 10

Applied Data Science and Machine Learning

Putting your data to use. The latest production methods for deriving novel insights

Engineer Your Culture

Building and scaling a compelling engineering culture

Modern Advances in Java Technology

Tips, techniques and technologies at the cutting edge of modern Java

Monoliths to Microservices

How to evolve beyond a monolithic system -- successful migration and implementation stories

The Art of Software Design

Software architecture as a craft: scenario-based examples and general guidance

Sponsored Solutions Track I

Thursday Jun 11

Emerging Technologies in Front-end Development

The state of the art in client-side web development

Fraud Detection and Hack Prevention

Businesses are built around trust in systems and data. This track covers securing those systems and fighting fraud across the data they hold.

Reactive Architecture Tactics

The how of the Reactive movement: Release It! techniques, Rx, Failure Concepts, Throughput, Availability

Architecting for Failure

War stories and lessons learned from building highly robust and resilient systems

High Performance Streaming Data

Scalable architectures and high-performance frameworks for immediate data over persistent connections

Sponsored Solutions Track II

Friday Jun 12

Architectures You've Always Wondered About

Learn from the architectures powering some of the most popular applications and sites

Continuously Deploying Containers in Production

Production-ready patterns for growing containerization in your environment

Mobile and IoT at Scale

Users, Usage and Microservices

Modern Computer Science in the Real World

How modern CS tackles problems in the real world

Optimizing Yourself

Maximizing your impact as an engineer, as a leader, and as a person

Sponsored Solutions Track III

Presenter: Ewen Cheslack-Postava