-
Notifications
You must be signed in to change notification settings - Fork 601
REST APIs
For all the requests, make sure that you interact with endpoints using UUID or cookies, explicitly. Cruise Control requests have been redesigned as async calls to avoid blocking. Hence, if you don't use UUID nor cookies, you won't be able to see the server response if it takes longer than a predefined time (default: 10 seconds
). You can retrieve the response within a predefined time(default: 6 hours
) using the UUID returned in initial response; or you can reuse the session to get response if the session has not expired (default: 1 minutes
). If you do not specify UUID or cookie in subsequent requests, such requests will each create a new session and excessive number of ongoing requests will make CC unable to create a new session due to hitting the maximum number of active user task limit. GET
requests that are sent via a web browser typically use cookies by default; hence, you will preserve the session upon multiple calls to the same endpoint via a web browser.
- Here is a quick recap of how to use UUID with requests using
cURL
:
- Create a cookie associated with a new request
curl -vv -X POST -c /tmp/mycookie-jar.txt "http://CRUISE_CONTROL_HOST:2540/kafkacruisecontrol/remove_broker?brokerid=1234&dryrun=false"
- Record the User-Task-ID in response, e.g.
User-Task-ID: 5ce7c299-53b3-48b6-b72e-6623e25bd9a8
- Specifying the User-Task-ID in request that has not completed
curl -vv -X POST -H "User-Task-ID: 5ce7c299-53b3-48b6-b72e-6623e25bd9a8" "http://CRUISE_CONTROL_HOST:2540/kafkacruisecontrol/remove_broker?brokerid=1234&dryrun=false"
- Here is a quick recap of how to use cookies with requests using
cURL
:
- Create a cookie associated with a new request
curl -X POST -c /tmp/mycookie-jar.txt "http://CRUISE_CONTROL_HOST:2540/kafkacruisecontrol/remove_broker?brokerid=1234&dryrun=false"
- Use an existing cookie from the created file for a request that has not completed
curl -X POST -b /tmp/mycookie-jar.txt "http://CRUISE_CONTROL_HOST:2540/kafkacruisecontrol/remove_broker?brokerid=1234&dryrun=false"
The GET requests in Kafka Cruise Control REST API are for read only operations, i.e. the operations that do not have any external impacts. The GET requests include the following operations:
- Query the state of Cruise Control
- Query the current cluster load
- Query partition resource utilization
- Query partition and replica state
- Get optimization proposals
- Query the user request result
User can query the state of Kafka Cruise Control at any time by issuing a HTTP GET request.
GET /kafkacruisecontrol/state
Supported parameters are:
PARAMETER | TYPE | DESCPRIPTION | DEFAULT | OPTIONAL |
---|---|---|---|---|
substates | list | list of components to return their state, available components are analyzer , monitor , executor and anomaly_detector
|
all the components | yes |
json | boolean | return in JSON format or not | false | yes |
verbose | boolean | return detailed state information | false | yes |
super_verbose | boolean | return more detailed state information | false | yes |
The returned state contains the following information:
- Monitor State:
- State: NOT_STARTED / RUNNING / SAMPLING / PAUSED / BOOTSTRAPPING / TRAINING / LOADING,
- Bootstrapping progress (If state is BOOTSTRAPPING)
- Number of valid monitored windows / number of total monitored windows
- Number of valid partitions out of the total number of partitions
- Percentage of the partitions that are valid
- Time and reason of recently sampling task pause/resume
- Executor State:
- State: NO_TASK_IN_PROGRESS / EXECUTION_STARTED / INTER_BROKER_REPLICA_MOVEMENT_IN_PROGRESS / INTRA_BROKER_REPLICA_MOVEMENT_IN_PROGRESS / LEADER_MOVEMENT_IN_PROGRESS
- Inter-broker replica movement progress (if state is INTER_BROKER_REPLICA_MOVEMENT_IN_PROGRESS)
- Intra-broker replica movement progress (if state is INTRA_BROKER_REPLICA_MOVEMENT_IN_PROGRESS)
- Leadership movement progress (if state is LEADERSHIP_MOVEMENT)
- movement concurrency
- UUID triggers the execution
- Analyzer State:
- isProposalReady: Is there a proposal cached
- ReadyGoals: A list of goals that are ready for running
- Anomaly Detector State:
- selfHealingEnabled: Anomaly type for which self healing is enabled
- selfHealingDisabled: Anomaly type for which self healing is disabled
- recentGoalViolations: Recently detected goal violations
- recentBrokerFailures: Recently detected broker failures
- recentDiskFailures: Recently detected disk failures
- recentMetricAnomalies: Recently detected metric anomalies
If verbose
is set to true
, the details about monitored windows and goals will be displayed.
If super_verbose
is set to true
, the details about extrapolation made on metric samples will be displayed.
If substates
is not set, the full state will be displayed; if it is set to the specific substate(s), only state(s) of interest will be displayed and response will be returned faster.
Once Cruise Control Load Monitor shows it is in the RUNNING state, Users can use the following HTTP GET to get the cluster load:
GET /kafkacruisecontrol/load
Supported parameters are:
PARAMETER | TYPE | DESCPRIPTION | DEFAULT | OPTIONAL |
---|---|---|---|---|
start | long | start time of the cluster load | time of earliest valid window | yes |
end | long | end time of the cluster load | current system time | yes |
time | long | end time of the cluster load | current system time | yes |
allow_capacity_estimation | boolean | whether allow broker capacity to be estimated from other broker in the cluster | true | yes |
populate_disk_info | boolean | whether show the load of each disk of broker | false | yes |
json | boolean | return in JSON format or not | false | yes |
verbose | boolean | return detailed state information | false | yes |
If the number of workload snapshots for the given timestamp is not sufficient to generate a good load model, an exception will be returned.
Timestamp for start
/end
/time
is in milliseconds since the epoch; what System.currentTimeMillis()
returns. The time zone is the time zone of the Cruise Control server.
If allow_capacity_estimation
is set to true
, for brokers missing capacity information Cruise Control will make estimations based on other brokers in the cluster; otherwise an IllegalStateException
will be thrown and shown in response.
The response contains both load-per-broker and load-per-host information. This is specifically useful when multiple brokers are hosted by the same machine.
NOTE: The load shown is only for the load from the valid partitions. i.e the partitions with enough metric samples. So please always check the LoadMonitor
's state(via State
endpoint) to decide whether the workload is representative enough.
The following GET request gives the partition load sorted by the utilization of a given resource:
GET /kafkacruisecontrol/partition_load
Supported parameters are:
PARAMETER | TYPE | DESCPRIPTION | DEFAULT | OPTIONAL |
---|---|---|---|---|
resource | string | resource type to sort partition load, available resource are DISK /CPU /NW_IN /NW_OUT
|
DISK |
yes |
start | long | start time of the partition load | time of earliest valid window | yes |
end | long | end time of the partition load | current system time | yes |
entries | integer | number of partition load entries to report in response | MAX_INT |
yes |
json | boolean | return in JSON format or not | false | yes |
allow_capacity_estimation | boolean | whether allow broker capacity be estimated from other broker in the cluster | true | yes |
max_load | boolean | whether report the max load for partition in windows | false | yes |
avg_load | boolean | whether report the average load for partition in windows | false | yes |
topic | regex | regular expression to filter partition load to report based on partition's topic | null | yes |
partition | integer/range | partition number(e.g. 10) range(e.g. 1-10) to filter partition load to report | null | yes |
min_valid_partition_ratio | double | minimal valid partition ratio requirement for cluster model | null | yes |
brokerid | int | broker id to to filter partition load to report | null | yes |
The returned result would be a partition list sorted by the utilization of the specified resource in the time range specified by start
and end
. The resource can be CPU
, NW_IN
, NW_OUT
and DISK
. By default the start
is the earliest monitored time, the end
is current wall clock time, resource
is DISK
, and entries
is the all partitions in the cluster.
By specifying topic
,partition
and/or brokerid
parameter, client can filter returned partition entries.
The min_valid_partition_ratio
specifies minimal monitored valid partition percentage needed to calculate the partition load. If this parameter is not set in request, the config value min.valid.partition.ratio
will be used.
The max_load
parameter specifies whether report the maximal historical value or not. The avg_load
parameter specifies whether report the average historical value or not. If both are not specified or specified as false
, for DISK
resource, latest value will be reported; for NW_IN
/NW_OUT
/CPU
resource, average value will be reported.
The following GET request gives partition healthiness on the cluster:
GET /kafkacruisecontrol/kafka_cluster_state
Supported parameters are:
PARAMETER | TYPE | DESCPRIPTION | DEFAULT | OPTIONAL |
---|---|---|---|---|
topic | regex | regular expression to filter partition state to report based on partition's topic | null | yes |
json | boolean | return in JSON format or not | false | yes |
verbose | boolean | return detailed state information | false | yes |
The returned result contains the following information
-
For each broker
- Distribution of leader/follower/out-of-sync/offline replica information
- Online/offline disks
-
For each partition
- Distribution of leader/follower/in-sync/out-of-sync/offline replica information
The following GET request returns the optimization proposals generated based on the workload model of the given timestamp. The workload summary before and after the optimization will also be returned.
GET /kafkacruisecontrol/proposals
Supported parameters are:
PARAMETER | TYPE | DESCPRIPTION | DEFAULT | OPTIONAL |
---|---|---|---|---|
ignore_proposal_cache | boolean | whether ignore the cached proposal or not | false | yes |
data_from | string | whether calculating proposal from available valid partitions or valid windows | VALID_WINDOWS |
yes |
goals | list | list of goals used to generate proposal | all goals | yes |
kafka_assigner | boolean | whether use Kafka assigner mode to general proposal | false | yes |
allow_capacity_estimation | boolean | whether allow broker capacity to be estimated | true | yes |
excluded_topics | regex | regular expression to specify topic not to be considered for replica movement | null | yes |
use_ready_default_goals | boolean | whether only using ready goals to generate proposal | false | yes |
exclude_recently_demoted_brokers | boolean | whether allow leader replicas to be moved to recently demoted broker | false | yes |
exclude_recently_removed_brokers | boolean | whether allow replicas to be moved to recently removed broker | false | yes |
destination_broker_ids | boolean | specify brokers to move replicas to | null | yes |
rebalance_disk | boolean | whether to balance load between brokers or between disks within broker | false | yes |
json | boolean | return in JSON format or not | false | yes |
verbose | boolean | return detailed state information | false | yes |
Kafka cruise control tries to precompute the optimization proposal in the background and caches the best proposal to serve when user queries. If users want to have a fresh proposal without reading it from the proposal cache, set the ignore_proposal_cache
flag to true. The precomputing always uses available valid partitions to generate the proposals.
By default the proposal will be returned from the cache where all the pre-defined goals are used. Detailed information about the reliability of the proposals will also be returned. If users want to run with a different set of goals, they can specify the goals
parameter with the goal names (simple class name).
If verbose
is turned on, Cruise Control will return all the generated proposals. Otherwise a summary of the proposals will be returned.
If kafka_assigner
is turned on, the proposals will be generated in Kafka Assigner mode,i.e. Cruise Control behaves like kafka assigner tool. This mode performs optimizations using the goals specific to Kafka Assigner -- i.e. goals with name KafkaAssigner*
.
Users can specify excluded_topics
to prevent certain topics' replicas from moving in the generated proposals.
If use_ready_default_goals
is turned on, Cruise Control will use whatever ready goals(based on available metric data) to calculate the proposals.
The following get request allows user to get a full list of all the active/completed(and not recycled) tasks inside Cruise Control, with their initial request detail(request time/IP address/request URL and parameter) and UUID information. User can then use the returned UUID and URL to fetch the original final result of the specific request.
GET /kafkacruisecontrol/user_tasks
Supported parameters are:
PARAMETER | TYPE | DESCPRIPTION | DEFAULT | OPTIONAL |
---|---|---|---|---|
user_task_ids | list | comma separated UUIDs to filter the task results Cruise Control report | all tasks | yes |
client_ids | list | comma separated IP addresses to filter the task results Cruise Control report | all users | yes |
entries | integer | number of partition load entries to report in response | MAX_INT |
yes |
endpoints | list | comma separated endpoints to filter the task results Cruise Control report | all endpoints | yes |
types | string | comma separated HTTP request types to filter the task results Cruise Control report | all request types | yes |
json | boolean | return in JSON format or not | false | yes |
fetch_completed_task | boolean | whether return the original request's final response | false | yes |
User can use user_task_ids
/client_ids
/endpoints
/types
make Cruise Control only return requests they are interested. By default all the requests get returned.
If fetch_completed_task
is set to true
, the original response of each requests will be returned.
The post requests of Kafka Cruise Control REST API are operations that will have impact on the Kafka cluster. The post operations include:
- Trigger a workload balance
- Add a list of new brokers to Kafka Cluster
- Decommission a list of brokers from the Kafka cluster
- Fix offline replicas in Kafka cluster
- Demoting a list of brokers from the Kafka cluster
- Stop the current proposal execution task
- Pause metrics load sampling
- Resume metrics load sampling
- Change Kafka topic configuration
- Change Cruise Control configuration
Most of the POST actions has a dry-run mode, which only generate the proposals and estimated result but not really execute the proposals. To avoid accidentally triggering of data movement, by default all the POST actions are in dry run mode. To let Kafka Cruise Control actually move data, users need to explicitly set dryrun=false
.
The following POST request will let Kafka Cruise Control rebalance a Kafka cluster
POST /kafkacruisecontrol/rebalance
Supported parameters are:
Supported parameters are:
PARAMETER | TYPE | DESCPRIPTION | DEFAULT | OPTIONAL |
---|---|---|---|---|
dryrun | Boolean | Whether dry-run the request or not | true | yes |
data_from | String | Specify either generate proposal from available valid partitions or valid windows | Null | yes |
goals | List | List of goals used to generate proposal | empty list | yes |
kafka_assigner | Boolean | Whether use Kafka assigner mode to general proposal | false | yes |
allow_capacity_estimation | Boolean | Whether allow broker capacity to be estimated | true | yes |
concurrent_partition_movements_per_broker | Integer | Maximal count of ongoing replica movements into/out of a broker | true | yes |
concurrent_intra_partition_movements | Integer | Maximal count of ongoing replica movements between disks within a broker | Null | yes |
concurrent_leader_movements | Integer | Maximal count of ongoing leadership movements | true | yes |
skip_hard_goal_check | Boolean | Whether allow hard goal be skipped in proposal generation | true | yes |
excluded_topics | Regex | Regular expression to specify topic not to be considered for replica movement | Null | yes |
use_ready_default_goals | Boolean | Whether only use ready goals to generate proposal | false | yes |
exclude_recently_demoted_brokers | Boolean | Whether allow leader replicas to be moved to recently demoted broker | false | yes |
exclude_recently_removed_brokers | Boolean | Whether allow replicas to be moved to recently removed broker | false | yes |
replica_movement_strategies | String | Replica movement strategy to use | Null | yes |
ignore_proposal_cache | boolean | Whether ignore the cached proposal or not | false | yes |
replication_throttle | Long | Upper bound on the bandwidth used to move replicas | false | yes |
destination_broker_ids | boolean | specify brokers to move replicas to | Null | yes |
rebalance_disk | boolean | Whether generate intra-broker replica movement to rebalance disk load with in a broker or inter-broker replica movement to rebalance broker load | false | yes |
json | Boolean | Return in JSON format or not | false | yes |
verbose | Boolean | Return detailed state information | false | yes |
Similar to the GET interface for getting proposals, the rebalance can also be based on available valid windows or available valid partitions.
valid_windows: rebalance the cluster based on the information in the available valid snapshot windows. A valid snapshot window is a windows whose valid monitored partitions coverage meets the requirements of all the goals. (This is the default behavior)
valid_partitions: rebalance the cluster based on all the available valid partitions. All the snapshot windows will be included in this case.
Users can only specify either valid_windows
or valid_partitions
, but not both.
goals: a list of goals to use for rebalance. When goals is provided, the cached proposals will be ignored.
When rebalancing a cluster, all the brokers in the cluster are eligible to give / receive replicas. All the brokers will be throttled during the partition movement.
**Note by default the rebalance will be in DryRun mode. Please explicitly set dryrun
to false to execute the proposals. **
The following POST request adds the given brokers to the Kafka cluster
POST /kafkacruisecontrol/add_broker?brokerid=[id1,id2...]
Supported parameters are:
PARAMETER | TYPE | DESCPRIPTION | DEFAULT | OPTIONAL |
---|---|---|---|---|
brokerid | List | List of ids of new broker added to the cluster | true | no |
dryrun | Boolean | Whether dry-run the request or not | true | yes |
data_from | String | Specify either generate proposal from available valid partitions or valid windows | Null | yes |
goals | List | List of goals used to generate proposal | empty list | yes |
kafka_assigner | Boolean | Whether use Kafka assigner mode to general proposal | false | yes |
allow_capacity_estimation | Boolean | Whether allow broker capacity to be estimated | true | yes |
concurrent_partition_movements_per_broker | Integer | Maximal count of ongoing replica movements into/out of a broker | true | yes |
concurrent_leader_movements | Integer | Maximal count of ongoing leadership movements | true | yes |
skip_hard_goal_check | Boolean | Whether allow hard goal be skipped in proposal generation | true | yes |
excluded_topics | Regex | Regular expression to specify topic not to be considered for replica movement | Null | yes |
use_ready_default_goals | Boolean | Whether only use ready goals to generate proposal | false | yes |
exclude_recently_demoted_brokers | Boolean | Whether allow leader replicas to be moved to recently demoted broker | false | yes |
exclude_recently_removed_brokers | Boolean | Whether allow replicas to be moved to recently removed broker | false | yes |
replica_movement_strategies | String | Replica movement strategy to use | Null | yes |
replication_throttle | Long | Upper bound on the bandwidth used to move replicas | false | yes |
throttle_added_broker | Boolean | Whether throttle replica movement to new broker or not | false | yes |
json | Boolean | Return in JSON format or not | false | yes |
verbose | Boolean | Return detailed state information | false | yes |
When adding new brokers to a Kafka cluster, Cruise Control makes sure that the replicas will only be moved from the existing brokers to the provided new broker, but not moved among existing brokers.
Users can choose whether to throttle replica movement to the newly added broker via throttle_added_broker
, in either case, the replica movement out of current broker are throttled, and the throttling can be set in two ways.
- By throttling number of partitions concurrently moving into a broker. It is gated by the request parameter
concurrent_partition_movements_per_broker
or config valuenum.concurrent.partition.movements.per.broker
(if request parameter is not set). Similarly, the number of concurrent partition leadership changes is gated by request parameterconcurrent_leader_movements
or config valuenum.concurrent.leader.movements
. - By throttling the bandwidth used to move replicas into a broker. It is gated by the request parameter
replication_throttle
or config valuedefault.replication.throttle
(if request parameter is not set).
Set skip_hard_goal_check
to true enforcing a sanity check that all the hard goals are included in the goals
parameter, otherwise an exception will be thrown.
The following POST request removes a list of brokers from the Kafka cluster:
POST /kafkacruisecontrol/remove_broker?brokerid=[id1,id2...]
Supported parameters are:
PARAMETER | TYPE | DESCPRIPTION | DEFAULT | OPTIONAL |
---|---|---|---|---|
brokerid | List | List of ids of broker to be removed from the cluster | true | no |
dryrun | Boolean | Whether dry-run the request or not | true | yes |
data_from | String | Specify either generate proposal from available valid partitions or valid windows | Null | yes |
goals | List | List of goals used to generate proposal | empty list | yes |
kafka_assigner | Boolean | Whether use Kafka assigner mode to general proposal | false | yes |
allow_capacity_estimation | Boolean | Whether allow broker capacity to be estimated | true | yes |
concurrent_partition_movements_per_broker | Integer | Maximal count of ongoing replica movements into/out of a broker | true | yes |
concurrent_leader_movements | Integer | Maximal count of ongoing leadership movements | true | yes |
skip_hard_goal_check | Boolean | Whether allow hard goal be skipped in proposal generation | true | yes |
excluded_topics | Regex | Regular expression to specify topic not to be considered for replica movement | Null | yes |
use_ready_default_goals | Boolean | Whether only use ready goals to generate proposal | false | yes |
exclude_recently_demoted_brokers | Boolean | Whether allow leader replicas to be moved to recently demoted broker | false | yes |
exclude_recently_removed_brokers | Boolean | Whether allow replicas to be moved to recently removed broker | false | yes |
replica_movement_strategies | String | Replica movement strategy to use | Null | yes |
replication_throttle | Long | Upper bound on the bandwidth used to move replicas | false | yes |
throttle_removed_broker | Boolean | Wether throttle replica movement out of the removed broker or not | false | yes |
destination_broker_ids | boolean | specify brokers to move replicas to | Null | yes |
json | Boolean | Return in JSON format or not | false | yes |
verbose | Boolean | Return detailed state information | false | yes |
Similar to adding brokers to a cluster, removing brokers from a cluster will only move partitions from the brokers to be removed to the other existing brokers. There won't be partition movements among remaining brokers. And user can specify the destination broker for these replica movement via destination_broker_ids
parameter.
Like adding brokers, users can choose whether to throttle the removed broker during the partition movement. If the removed brokers are throttled, the number of partitions concurrently moving out of a broker, number of concurrent partition leadership changes and bandwidth used for replica movement are gated in the same way as add_broker above.
Note if the topics specified in excluded_topics
has replicas on the removed broker, the replicas will still get moved off the broker.
The following POST request moves all the offline replica from dead disks/brokers. Note this endpoint only exist in migrate_to_kafka_2_0
branch.
POST /kafkacruisecontrol/fix_offline_replica
Supported parameters are:
PARAMETER | TYPE | DESCPRIPTION | DEFAULT | OPTIONAL |
---|---|---|---|---|
dryrun | Boolean | Whether dry-run the request or not | true | yes |
data_from | String | Specify either generate proposal from available valid partitions or valid windows | Null | yes |
goals | List | List of goals used to generate proposal | empty list | yes |
kafka_assigner | Boolean | Whether use Kafka assigner mode to general proposal | false | yes |
allow_capacity_estimation | Boolean | Whether allow broker capacity to be estimated | true | yes |
concurrent_partition_movements_per_broker | Integer | Maximal count of ongoing replica movements into/out of a broker | true | yes |
concurrent_leader_movements | Integer | Maximal count of ongoing leadership movements | true | yes |
skip_hard_goal_check | Boolean | Whether allow hard goal be skipped in proposal generation | true | yes |
excluded_topics | Regex | Regular expression to specify topic not to be considered for replica movement | Null | yes |
use_ready_default_goals | Boolean | Whether only use ready goals to generate proposal | false | yes |
exclude_recently_demoted_brokers | Boolean | Whether allow leader replicas to be moved to recently demoted broker | false | yes |
exclude_recently_removed_brokers | Boolean | Whether allow replicas to be moved to recently removed broker | false | yes |
replica_movement_strategies | String | Replica movement strategy to use | Null | yes |
replication_throttle | Long | Upper bound on the bandwidth used to move replicas | false | yes |
json | Boolean | Return in JSON format or not | false | yes |
verbose | Boolean | Return detailed state information | false | yes |
Likewise, users can throttle partition movement via either replica count or bandwidth.
Note if the topics specified in excluded_topics
has offline replicas, the replicas will still get moved to healthy brokers.
The following POST request moves all the leader replicas away from a list of brokers.
POST /kafkacruisecontrol/demote_broker?brokerid=[id1, id2...]
In migrate_to_kafka_2_0
branch, user can also request to move all the leader replicas away from the a list of disks via
POST /kafkacruisecontrol/demote_broker?brokerid_and_logdirs=[id1-logdir1, id2-logdir2...]
Supported parameters are:
PARAMETER | TYPE | DESCPRIPTION | DEFAULT | OPTIONAL |
---|---|---|---|---|
brokerid | List | List of ids of broker to be demoted in the cluster | true | no if brokerid_and_logdirs is not set |
brokerid_and_logdirs | List | List of broker id and logdir pair to be demoted in the cluster | true | no if brokerids is not set |
dryrun | Boolean | Whether dry-run the request or not | true | yes |
allow_capacity_estimation | Boolean | Whether allow broker capacity to be estimated | true | yes |
concurrent_leader_movements | Integer | Maximal count of ongoing leadership movements | true | yes |
skip_urp_demotion | Boolean | Whether skip demoting leader replica of under replicated partition | false | yes |
exclude_follower_demotion | Boolean | Whether skip demoting follower replicas on the broker to be demoted | false | yes |
exclude_recently_demoted_brokers | Boolean | Whether allow leader replicas to be moved to recently demoted broker | false | yes |
replica_movement_strategies | String | Replica movement strategy to use | Null | yes |
replication_throttle | Long | Upper bound on the bandwidth used to move replicas | false | yes |
json | Boolean | Return in JSON format or not | false | yes |
verbose | Boolean | Return detailed state information | false | yes |
Demoting a broker/disk is consist of tow steps.
- Make all the replicas on given broker/disk the least preferred replicas for leadership election within their corresponding partitions
- Trigger a preferred leader election on the partitions to migrate the leader replicas off the broker/disk
Set skip_urp_demotion
to true will skip the operations on partitions which is currently under replicated; Set exclude_follower_demotion
will skip operations on the partitions which only have follower replicas on the brokers/disks to be demoted. The purpose of these two parameters is to avoid the URP recovery process blocking demoting execution.
The following POST request will let Kafka Cruise Control stop an ongoing rebalance
, add_broker
, remove_broker
, fix_offline_replica
, topic_configuration
or demote_broker
operation:
POST /kafkacruisecontrol/stop_proposal_execution
Supported parameters are:
PARAMETER | TYPE | DESCPRIPTION | DEFAULT | OPTIONAL |
---|---|---|---|---|
json | Boolean | Return in JSON format or not | false | yes |
Note that Cruise Control does not wait for the ongoing batch to finish when it stops execution, i.e. the in-progress batch may still be running after Cruise Control stops the execution.
The following POST request will let Kafka Cruise Control pause an ongoing metrics sampling process:
POST /kafkacruisecontrol/pause_sampling
Supported parameters are:
PARAMETER | TYPE | DESCPRIPTION | DEFAULT | OPTIONAL |
---|---|---|---|---|
reason | String | Reason to pause sampling | Null | yes |
json | Boolean | Return in JSON format or not | false | yes |
The reason to pause sampling will be recorded and shows up in state
endpoint(under LoadMonitor
sub state).
The following POST request will let Kafka Cruise Control resume a paused metrics sampling process:
POST /kafkacruisecontrol/resume_sampling
PARAMETER | TYPE | DESCPRIPTION | DEFAULT | OPTIONAL |
---|---|---|---|---|
reason | String | Reason to resume sampling | Null | yes |
json | Boolean | Return in JSON format or not | false | yes |
The reason to resume sampling will be recorded and shows up in state
endpoint(under LoadMonitor
sub state).
Currently Cruise Control only supports changing topic's replication factor via topic_configuration
endpoint. Ultimately we want make Cruise Control the central place to change any topic configurations.
The following POST request can change topic's replication factor. POST /kafkacruisecontrol/topic_configuration?topic=[topic_regex]&replication_factor=[target_replication_factor]
Supported parameters are:
PARAMETER | TYPE | DESCPRIPTION | DEFAULT | OPTIONAL |
---|---|---|---|---|
dryrun | Boolean | Whether dry-run the request or not | true | yes |
topic | Regex | regular express to specify subject topics | true | no |
replication_factor | Integer | target replication factor | N/A | no |
data_from | String | Specify either generate proposal from available valid partitions or valid windows | Null | yes |
goals | List | List of goals used to generate proposal | empty list | yes |
allow_capacity_estimation | Boolean | Whether allow broker capacity to be estimated | true | yes |
concurrent_partition_movements_per_broker | Integer | Maximal count of ongoing replica movements into/out of a broker | true | yes |
concurrent_leader_movements | Integer | Maximal count of ongoing leadership movements | true | yes |
skip_hard_goal_check | Boolean | Whether allow hard goal be skipped in proposal generation | true | yes |
exclude_recently_demoted_brokers | Boolean | Whether allow leader replicas to be moved to recently demoted broker | false | yes |
exclude_recently_removed_brokers | Boolean | Whether allow replicas to be moved to recently removed broker | false | yes |
replica_movement_strategies | String | Replica movement strategy to use | Null | yes |
ignore_proposal_cache | boolean | Whether ignore the cached proposal or not | false | yes |
replication_throttle | Long | Upper bound on the bandwidth used to move replicas | false | yes |
json | Boolean | Return in JSON format or not | false | yes |
verbose | Boolean | Return detailed state information | false | yes |
Changing topic's replication factor will not move any existing replicas. goals
are used to determine which replica to be deleted(to decrease topic's replication factor) and which broker to assign newly replica (to increase topic's replication factor).
Note sometimes the topic regex can be too long to fix in POST request head, in this case user can specify topic regex and target replication factor pair in POST request body. For details, check Change-topic-replication-factor-through-Cruise-Control wiki page.
Some of Cruise Control's config can be changed dynamically via admin
endpoint, which includes
- enable/disable self-healing
- increase/decrease execution concurrency
- drop recently removed/demoted brokers Supported parameters are:
PARAMETER | TYPE | DESCPRIPTION | DEFAULT | OPTIONAL |
---|---|---|---|---|
disable_self_healing_for | List | list of anomaly types to disable self-healing | Null | yes |
enable_self_healing_for | List | list of anomaly types to enable self-healing | Null | yes |
concurrent_partition_movements_per_broker | Integer | Maximal count of ongoing replica movements into/out of a broker | true | yes |
concurrent_intra_partition_movements | Integer | Maximal count of ongoing replica movements between disks within a broker | Null | yes |
drop_recently_removed_brokers | List | List of id of recently removed brokers to be dropped | Null | yes |
concurrent_leader_movements | Integer | List of id of recently demoted brokers to be dropped | Null | yes |
Contents
- Cruise Control Wiki
- Overview
- Troubleshooting
- User Guide
- Python Client
- Developer Guide