An open-source distributed computing system designed for processing large-scale data sets, providing high-level APIs in languages like Java, Scala, and Python.
"The data science team used Spark for big data processing and analysis."
— @openai · February 25, 2024