Presented at OSACon 2024

Apache Flink has steadily established itself as the leader in stream processing technologies. With thousands of users implementing simple to advanced streaming use cases, the future of the flink community looks bright.

While Apache Flink runs on JVM, for non-JVM users Flink has a well defined pyflink port which helps python developers build sophisticated stream processing jobs. Today, most of the data engineers, data scientists and data analysts prefer using python as their main programming language of choice to build complex use cases.

In this session, I will explore Flink APIs wearing the non-JVM hat and will deep dive into pyflink Table APIs and UDFs. Pyflink appeals to python developers since complex stream processing techniques like windowing, event time semantics could be written in simple python DSLs,.

I will also look at how pyflink Table API and Flink SQL could work hand-in-hand in developing streaming pipelines.

The session will also have a short demo showcasing how pyflink ingests fast moving data from Kafka and runs pyflink Table API DSLs to process such streams.