Producer: Partitioning Strategy


/**
 * The default partitioning strategy:
 * <ul>
 * <li>If a partition is specified in the record, use it
 * <li>If no partition is specified but a key is present choose a partition based on a hash of the key
 *     (that is, the hash of the key modulo the number of partitions)
 * <li>If no partition or key is present choose the sticky partition that changes when the batch is full.
 * 
 * See KIP-480 for details about sticky partitioning.
 */
public class DefaultPartitioner implements Partitioner {


    /**
     * Compute the partition for the given record.
     *
     * @param topic The topic name
     * @param key The key to partition on (or null if no key)
     * @param keyBytes serialized key to partition on (or null if no key)
     * @param value The value to partition on or null
     * @param valueBytes serialized value to partition on or null
     * @param cluster The current cluster metadata
     */
    public int partition(String topic, Object key, byte[] keyBytes, Object value, byte[] valueBytes, Cluster cluster) {
        return partition(topic, key, keyBytes, value, valueBytes, cluster, cluster.partitionsForTopic(topic).size());
    }
    /**
     * Compute the partition for the given record.
     *
     * @param topic The topic name
     * @param numPartitions The number of partitions of the given {@code topic}
     * @param key The key to partition on (or null if no key)
     * @param keyBytes serialized key to partition on (or null if no key)
     * @param value The value to partition on or null
     * @param valueBytes serialized value to partition on or null
     * @param cluster The current cluster metadata
     */
    public int partition(String topic, Object key, byte[] keyBytes, Object value, byte[] valueBytes, Cluster cluster,
                         int numPartitions) {
        if (keyBytes == null) {
            // no key: fall back to the sticky partition for this topic
            return stickyPartitionCache.partition(topic, cluster);
        }
        // hash the keyBytes to choose a partition
        return Utils.toPositive(Utils.murmur2(keyBytes)) % numPartitions;
    }
//......
}

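Partitioning rule 1: when a partition is specified, that value is used directly as the target partition. For example, with partition=1 the record goes to partition 1.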


public ProducerRecord(String topic, Integer partition, Long timestamp, K key, V value, Iterable<Header> headers) {
    ......
}
public ProducerRecord(String topic, Integer partition, Long timestamp, K key, V value) {
    this(topic, partition, timestamp, key, value, null);
}

public ProducerRecord(String topic, Integer partition, K key, V value, Iterable<Header> headers) {
    this(topic, partition, null, key, value, headers);
}
public ProducerRecord(String topic, Integer partition, K key, V value) {
    this(topic, partition, null, key, value, null);
}
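
The precedence described by rule 1 can be sketched as follows, assuming the producer checks the record's partition field before it consults any partitioner. The names `ExplicitPartitionSketch` and `choosePartition` are hypothetical and not part of the Kafka API; the partitioner is represented by a plain `IntSupplier`.

```java
import java.util.function.IntSupplier;

// Hypothetical helper, not part of the Kafka API: it only illustrates that an
// explicitly specified partition takes precedence over the partitioner.
class ExplicitPartitionSketch {

    // recordPartition mirrors the nullable Integer partition argument of the
    // ProducerRecord constructors; fallback stands in for the partitioner.
    static int choosePartition(Integer recordPartition, IntSupplier fallback) {
        if (recordPartition != null) {
            return recordPartition; // rule 1: use the specified partition as-is
        }
        return fallback.getAsInt(); // otherwise defer to the partitioner
    }

    public static void main(String[] args) {
        // The fallback always answers 0 here so the two cases are easy to tell apart.
        System.out.println(choosePartition(1, () -> 0));    // explicit partition is used
        System.out.println(choosePartition(null, () -> 0)); // partitioner decides
    }
}
```

Passing null for the partition, as the shorter constructors do, is what hands the decision over to rules 2 and 3.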

Partitioning rule 2: when no partition is specified but a key is present, the target partition number is partition=Utils.toPositive(Utils.murmur2(keyBytes)) % numPartitions


public ProducerRecord(String topic, K key, V value) {
    this(topic, null, null, key, value, null);
}
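
To make the formula in rule 2 concrete, here is a standalone sketch that needs no Kafka dependency. The class `KeyPartitionSketch`, the method `partitionFor`, and the key "user-42" are hypothetical; `murmur2` and `toPositive` are re-implemented here for illustration and are assumed to mirror the Kafka `Utils` helpers, so real code should call the Kafka classes rather than this copy.

```java
import java.nio.charset.StandardCharsets;

// Standalone sketch of rule 2. murmur2 and toPositive are re-implemented for
// illustration only and are assumed to mirror Kafka's Utils helpers.
class KeyPartitionSketch {

    // 32-bit MurmurHash2 over a byte array (assumed to match Utils.murmur2).
    static int murmur2(byte[] data) {
        int length = data.length;
        int seed = 0x9747b28c;
        final int m = 0x5bd1e995;
        final int r = 24;
        int h = seed ^ length;
        int length4 = length / 4;
        // Mix the input four bytes at a time.
        for (int i = 0; i < length4; i++) {
            int i4 = i * 4;
            int k = (data[i4] & 0xff) + ((data[i4 + 1] & 0xff) << 8)
                    + ((data[i4 + 2] & 0xff) << 16) + ((data[i4 + 3] & 0xff) << 24);
            k *= m;
            k ^= k >>> r;
            k *= m;
            h *= m;
            h ^= k;
        }
        int tail = length & ~3; // index of the first byte not covered by the loop
        switch (length % 4) {
            case 3:
                h ^= (data[tail + 2] & 0xff) << 16; // falls through
            case 2:
                h ^= (data[tail + 1] & 0xff) << 8;  // falls through
            case 1:
                h ^= data[tail] & 0xff;
                h *= m;
                break;
            default:
                break;
        }
        h ^= h >>> 13;
        h *= m;
        h ^= h >>> 15;
        return h;
    }

    // Clears the sign bit so that the result of % is never negative.
    static int toPositive(int number) {
        return number & 0x7fffffff;
    }

    // Rule 2: positive hash of the serialized key modulo the partition count.
    static int partitionFor(byte[] keyBytes, int numPartitions) {
        return toPositive(murmur2(keyBytes)) % numPartitions;
    }

    public static void main(String[] args) {
        byte[] key = "user-42".getBytes(StandardCharsets.UTF_8); // hypothetical key
        System.out.println(partitionFor(key, 3));
    }
}
```

Two consequences follow from the formula: records with the same key land in the same partition as long as the partition count does not change, and the sign bit is masked instead of calling Math.abs because Math.abs(Integer.MIN_VALUE) is still negative.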

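Partitioning rule 3: when neither a partition nor a key is given, Kafka uses the sticky partitioner via stickyPartitionCache.partition(topic, cluster). It picks a partition at random and keeps using it for as long as possible; once that partition's batch is full or the linger.ms time has elapsed, it picks another partition at random (normally different from the previous one).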


public ProducerRecord(String topic, V value) {
    this(topic, null, null, null, value, null);
}
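
The sticky behaviour in rule 3 can be illustrated with a deliberately simplified cache. This is a sketch under stated assumptions, not Kafka's actual StickyPartitionCache: the class `StickyPartitionSketch` and its methods are hypothetical, the partition count is passed in directly instead of being read from cluster metadata, and partition availability is ignored.

```java
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ConcurrentMap;
import java.util.concurrent.ThreadLocalRandom;

// Simplified, hypothetical stand-in for a sticky partition cache. It ignores
// partition availability and cluster metadata.
class StickyPartitionSketch {
    // Current sticky partition per topic.
    private final ConcurrentMap<String, Integer> current = new ConcurrentHashMap<>();

    // Returns the sticky partition for the topic, picking a random one on first use.
    int partition(String topic, int numPartitions) {
        return current.computeIfAbsent(topic,
                t -> ThreadLocalRandom.current().nextInt(numPartitions));
    }

    // Meant to be called when the current batch is finished (full, or sent after
    // linger.ms): picks a new random partition, different from the previous one
    // whenever the topic has more than one partition.
    int nextPartition(String topic, int numPartitions) {
        Integer previous = current.get(topic);
        int next = ThreadLocalRandom.current().nextInt(numPartitions);
        while (numPartitions > 1 && previous != null && next == previous) {
            next = ThreadLocalRandom.current().nextInt(numPartitions);
        }
        current.put(topic, next);
        return next;
    }

    public static void main(String[] args) {
        StickyPartitionSketch cache = new StickyPartitionSketch();
        String topic = "orders"; // hypothetical topic name
        // Records without a key keep going to the same partition...
        System.out.println(cache.partition(topic, 4));
        System.out.println(cache.partition(topic, 4));
        // ...until the batch is finished and a new sticky partition is chosen.
        System.out.println(cache.nextPartition(topic, 4));
        System.out.println(cache.partition(topic, 4));
    }
}
```

Sticking to one partition per batch lets keyless records fill a batch quickly, instead of spreading them thinly across one small batch per partition.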

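Knowledge check

1. Which of the following statements about Kafka's partitioning strategy is correct?

A When a partition is specified, that value is used directly as the target partition. For example, with partition=1 the record goes to partition 1.

B When no partition is specified but a key is present, the target partition number is partition=Utils.toPositive(Utils.murmur2(keyBytes)) % numPartitions.

C When neither a partition nor a key is given, Kafka uses the sticky partitioner: it picks a partition at random and keeps using it for as long as possible; once that partition's batch is full or the linger.ms time has elapsed, it picks another partition at random (normally different from the previous one).

D All three statements above are correct.

Answer:

1=>D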
