25.7.12
This website uses cookies to ensure you get the best experience on our website. Learn more

Streaming Data Architectures: Processing Streaming Data with Spark

Skillsoft issued completion badges are earned based on viewing the percentage required or receiving a passing score when assessment is required. Process streaming data with Spark, the analytic engine built on Hadoop. In this course, you will discover how to develop applications in Spark to work with streaming data and generate output. Topics include the following: Configure a streaming data source; Use Netcat and write applications to process the data stream; Learn the effects of using the Update mode on your stream processing application's output; Write a monitoring application that listens for new files added to a directory; Compare the append output with the update mode; Develop applications to limit files processed in each trigger; Use Spark's Complete mode for output; Perform aggregation operations on streaming data with the DataFrame API; Process streaming data with Spark SQL queries.