Loading...
「ツール」は右上に移動しました。
利用したサーバー: wtserver1
3いいね 92 views回再生

The Secrets of Influencing Spark Partitions during reads - Spark Partitioning (Part 8)

In today's episode we will deep dive into the influence factors on partitions and how we choose a good partition based on it. The behaviours include:
Max Partitions Bytes
Open Cost in Bytes
Num of cores
File size
Num of files

Feel free to comment or challenge my explanations as always. Happy to learn also myself more by the community.

Link to the code can be found here: https://github.com/datanikkthegreek/S...

コメント