Skip to content

Latest commit

 

History

History
17 lines (9 loc) · 466 Bytes

spark-rdd-Partitioner.adoc

File metadata and controls

17 lines (9 loc) · 466 Bytes

Partitioner

Caution
FIXME

Partitioner captures data distribution at the output. A scheduler can optimize future operations based on this.

val partitioner: Option[Partitioner] specifies how the RDD is partitioned.

The contract of partitioner ensures that records for a given key have to reside on a single partition.

numPartitions Method

Caution
FIXME

getPartition Method

Caution
FIXME