Apache Spark 3.0: Next-Generation Data Processing and Analytics Workloads

Since its initial release in 2010, Spark has grown into one of the most active open-source projects. On the TPC-DS 30TB benchmark, Spark 3.0 is roughly two times faster than Spark 2.4, a gain enabled by adaptive query execution, dynamic partition pruning, and other optimisations.

Read more: https://rajeshkotian.medium.com/apache-spark-3-0-a-next-generation-data-processing-and-analytics-workloads-7895b29fd834