Who uses Spark?
Sorting
Uber
Uber – the online taxi company gathers terabytes of event data from its mobile
users every day.
- By using Kafka, Spark Streaming, and HDFS, to build a continuous ETL pipeline
- Convert raw unstructured event data into structured data as it is collected
- Uses it further for more complex analytics and optimization of operations
Pinterest
- Uses a Spark ETL pipeline
- Leverages Spark Streaming to gain immediate insight into how users all over the
world are engaging with Pins in real time.
- Can make more relevant recommendations as people navigate the site
- Recommends related Pins
- Determine which products to buy, or destinations to visit
Conviva
- 4 million video feeds per month
- This streaming video company is second only to YouTube.
- Uses Spark to reduce customer churn by optimizing video streams and managing
live video traffic
- Maintains a consistently smooth, high quality viewing experience.
Capital One
- is using Spark and data science algorithms to understand customers
in a better way.
- Developing next generation of financial products and services
- Find attributes and patterns of increased probability for fraud
Netflix
- leveraging Spark for insights of user viewing habits and then
recommends movies to them.
- User data is also used for content creation