Spark Streaming is an extension of the core Spark API that allows
enables scalable, high-throughput, fault-tolerant stream processing of
live data streams. Data can be ingested from many sources like Kafka,
Flume, Twitter, ZeroMQ, Kinesis or plain old TCP sockets and be
processed using complex algorithms expressed with high-level functions
like map, reduce, join and window. Finally, processed data can be pushed
out to filesystems, databases, and live dashboards. In fact, you can
apply Spark’s machine learning algorithms, ......