src/main/asciidoc/_chapters/configuration.adoc

   1 ////
   2 /**
   3  *
   4  * Licensed to the Apache Software Foundation (ASF) under one
   5  * or more contributor license agreements.  See the NOTICE file
   6  * distributed with this work for additional information
   7  * regarding copyright ownership.  The ASF licenses this file
   8  * to you under the Apache License, Version 2.0 (the
   9  * "License"); you may not use this file except in compliance
  10  * with the License.  You may obtain a copy of the License at
  11  *
  12  *     http://www.apache.org/licenses/LICENSE-2.0
  13  *
  14  * Unless required by applicable law or agreed to in writing, software
  15  * distributed under the License is distributed on an "AS IS" BASIS,
  16  * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
  17  * See the License for the specific language governing permissions and
  18  * limitations under the License.
  19  */
  20 ////
  21
  22 [[configuration]]
  23 = Apache HBase Configuration
  24 :doctype: book
  25 :numbered:
  26 :toc: left
  27 :icons: font
  28 :experimental:
  29
  30 This chapter expands upon the <<getting_started>> chapter to further explain configuration of
  31 Apache HBase. Please read this chapter carefully, especially the
  32 <<basic.prerequisites,Basic Prerequisites>> to ensure that your HBase testing and deployment goes
  33 smoothly. Familiarize yourself with <<hbase_supported_tested_definitions>> as well.
  34
  35 == Configuration Files
  36 Apache HBase uses the same configuration system as Apache Hadoop. All configuration files are
  37 located in the _conf/_ directory, which needs to be kept in sync for each node on your cluster.
  38
  39 .HBase Configuration File Descriptions
  40 _backup-masters_::
  41   Not present by default. A plain-text file which lists hosts on which the Master should start a
  42   backup Master process, one host per line.
  43
  44 _hadoop-metrics2-hbase.properties_::
  45   Used to connect HBase Hadoop's Metrics2 framework.
  46   See the link:https://cwiki.apache.org/confluence/display/HADOOP2/HADOOP-6728-MetricsV2[Hadoop Wiki entry]
  47   for more information on Metrics2. Contains only commented-out examples by default.
  48
  49 _hbase-env.cmd_ and _hbase-env.sh_::
  50   Script for Windows and Linux / Unix environments to set up the working environment for HBase,
  51   including the location of Java, Java options, and other environment variables. The file contains
  52   many commented-out examples to provide guidance.
  53
  54 _hbase-policy.xml_::
  55   The default policy configuration file used by RPC servers to make authorization decisions on
  56   client requests. Only used if HBase <<security,security>> is enabled.
  57
  58 _hbase-site.xml_::
  59   The main HBase configuration file.
  60   This file specifies configuration options which override HBase's default configuration.
  61   You can view (but do not edit) the default configuration file at _hbase-common/src/main/resources/hbase-default.xml_.
  62   You can also view the entire effective configuration for your cluster (defaults and overrides) in
  63   the [label]#HBase Configuration# tab of the HBase Web UI.
  64
  65 _log4j2.xml_::
  66   Configuration file for HBase logging via `log4j2`.
  67
  68 _regionservers_::
  69   A plain-text file containing a list of hosts which should run a RegionServer in your HBase cluster.
  70   By default, this file contains the single entry `localhost`.
  71   It should contain a list of hostnames or IP addresses, one per line, and should only contain
  72   `localhost` if each node in your cluster will run a RegionServer on its `localhost` interface.
  73
  74 .Checking XML Validity
  75 [TIP]
  76 ====
  77 When you edit XML, it is a good idea to use an XML-aware editor to be sure that your syntax is
  78 correct and your XML is well-formed. You can also use the `xmllint` utility to check that your XML
  79 is well-formed. By default, `xmllint` re-flows and prints the XML to standard output. To check for
  80 well-formedness and only print output if errors exist, use the command `xmllint -noout filename.xml`.
  81 ====
  82 .Keep Configuration In Sync Across the Cluster
  83 [WARNING]
  84 ====
  85 When running in distributed mode, after you make an edit to an HBase configuration, make sure you
  86 copy the contents of the _conf/_ directory to all nodes of the cluster. HBase will not do this for
  87 you. Use a configuration management tool for managing and copying the configuration files to your
  88 nodes. For most configurations, a restart is needed for servers to pick up changes. Dynamic
  89 configuration is an exception to this, to be described later below.
  90 ====
  91
  92 [[basic.prerequisites]]
  93 == Basic Prerequisites
  94
  95 This section lists required services and some required system configuration.
  96
  97 [[java]]
  98 .Java
  99
 100 HBase runs on the Java Virtual Machine, thus all HBase deployments require a JVM runtime.
 101
 102 The following table summarizes the recommendations of the HBase community with respect to running
 103 on various Java versions. The icon:check-circle[role="green"] symbol indicates a base level of
 104 testing and willingness to help diagnose and address issues you might run into; these are the
 105 expected deployment combinations. An entry of icon:exclamation-circle[role="yellow"]
 106 means that there may be challenges with this combination, and you should look for more information
 107 before deciding to pursue this as your deployment strategy. The icon:times-circle[role="red"] means
 108 this combination does not work; either an older Java version is considered deprecated by the HBase
 109 community, or this combination is known to not work. For combinations of newer JDK with older HBase
 110 releases, it's likely there are known compatibility issues that cannot be addressed under our
 111 compatibility guarantees, making the combination impossible. In some cases, specific guidance on
 112 limitations (e.g. whether compiling / unit tests work, specific operational issues, etc) are also
 113 noted. Assume any combination not listed here is considered icon:times-circle[role="red"].
 114
 115 .Long-Term Support JDKs are Recommended
 116 [WARNING]
 117 ====
 118 HBase recommends downstream users rely only on JDK releases that are marked as Long-Term Supported
 119 (LTS), either from the OpenJDK project or vendors. At the time of this writing, the following JDK
 120 releases are NOT LTS releases and are NOT tested or advocated for use by the Apache HBase
 121 community: JDK9, JDK10, JDK12, JDK13, and JDK14. Community discussion around this decision is
 122 recorded on link:https://issues.apache.org/jira/browse/HBASE-20264[HBASE-20264].
 123 ====
 124
 125 .HotSpot vs. OpenJ9
 126 [TIP]
 127 ====
 128 At this time, all testing performed by the Apache HBase project runs on the HotSpot variant of the
 129 JVM. When selecting your JDK distribution, please take this into consideration.
 130 ====
 131
 132 .Java support by release line
 133 [cols="3*^.^", options="header"]
 134 |===
 135 |Java Version
 136 |HBase 1.4+
 137 |HBase 2.3+
 138
 139 |JDK7
 140 |icon:check-circle[role="green"]
 141 |icon:times-circle[role="red"]
 142
 143 |JDK8
 144 |icon:check-circle[role="green"]
 145 |icon:check-circle[role="green"]
 146
 147 |JDK11
 148 |icon:times-circle[role="red"]
 149 |icon:exclamation-circle[role="yellow"]*
 150
 151 |===
 152
 153 .A Note on JDK11 icon:exclamation-circle[role="yellow"]*
 154 [WARNING]
 155 ====
 156 Preliminary support for JDK11 is introduced with HBase 2.3.0. This support is limited to
 157 compilation and running the full test suite. There are open questions regarding the runtime
 158 compatibility of JDK11 with Apache ZooKeeper and Apache Hadoop
 159 (link:https://issues.apache.org/jira/browse/HADOOP-15338[HADOOP-15338]). Significantly, neither
 160 project has yet released a version with explicit runtime support for JDK11. The remaining known
 161 issues in HBase are catalogued in
 162 link:https://issues.apache.org/jira/browse/HBASE-22972[HBASE-22972].
 163 ====
 164
 165 NOTE: You must set `JAVA_HOME` on each node of your cluster. _hbase-env.sh_ provides a handy
 166 mechanism to do this.
 167
 168 [[os]]
 169 .Operating System Utilities
 170 ssh::
 171   HBase uses the Secure Shell (ssh) command and utilities extensively to communicate between
 172 cluster nodes. Each server in the cluster must be running `ssh` so that the Hadoop and HBase
 173 daemons can be managed. You must be able to connect to all nodes via SSH, including the local
 174 node, from the Master as well as any backup Master, using a shared key rather than a password.
 175 You can see the basic methodology for such a set-up in Linux or Unix systems at
 176 "<<passwordless.ssh.quickstart>>". If your cluster nodes use OS X, see the section,
 177 link:https://cwiki.apache.org/confluence/pages/viewpage.action?pageId=120730246#RunningHadoopOnOSX10.564-bit(Single-NodeCluster)-SSH:SettingupRemoteDesktopandEnablingSelf-Login[SSH: Setting up Remote Desktop and Enabling Self-Login]
 178 on the Hadoop wiki.
 179
 180 DNS::
 181   HBase uses the local hostname to self-report its IP address.
 182
 183 NTP::
 184   The clocks on cluster nodes should be synchronized. A small amount of variation is acceptable,
 185 but larger amounts of skew can cause erratic and unexpected behavior. Time synchronization is one
 186 of the first things to check if you see unexplained problems in your cluster. It is recommended
 187 that you run a Network Time Protocol (NTP) service, or another time-synchronization mechanism on
 188 your cluster and that all nodes look to the same service for time synchronization. See the
 189 link:http://www.tldp.org/LDP/sag/html/basic-ntp-config.html[Basic NTP Configuration] at
 190 [citetitle]_The Linux Documentation Project (TLDP)_ to set up NTP.
 191
 192 [[ulimit]]
 193 Limits on Number of Files and Processes (ulimit)::
 194   Apache HBase is a database. It requires the ability to open a large number of files at once. Many
 195 Linux distributions limit the number of files a single user is allowed to open to `1024` (or `256`
 196 on older versions of OS X). You can check this limit on your servers by running the command
 197 `ulimit -n` when logged in as the user which runs HBase. See
 198 <<trouble.rs.runtime.filehandles,the Troubleshooting section>> for some of the problems you may
 199 experience if the limit is too low. You may also notice errors such as the following:
 200 +
 201 ----
 202 2010-04-06 03:04:37,542 INFO org.apache.hadoop.hdfs.DFSClient: Exception increateBlockOutputStream java.io.EOFException
 203 2010-04-06 03:04:37,542 INFO org.apache.hadoop.hdfs.DFSClient: Abandoning block blk_-6935524980745310745_1391901
 204 ----
 205 +
 206 It is recommended to raise the ulimit to at least 10,000, but more likely 10,240, because the value
 207 is usually expressed in multiples of 1024. Each ColumnFamily has at least one StoreFile, and
 208 possibly more than six StoreFiles if the region is under load. The number of open files required
 209 depends upon the number of ColumnFamilies and the number of regions. The following is a rough
 210 formula for calculating the potential number of open files on a RegionServer.
 211 +
 212 .Calculate the Potential Number of Open Files
 213 ----
 214 (StoreFiles per ColumnFamily) x (regions per RegionServer)
 215 ----
 216 +
 217 For example, assuming that a schema had 3 ColumnFamilies per region with an average of 3 StoreFiles
 218 per ColumnFamily, and there are 100 regions per RegionServer, the JVM will open `3 * 3 * 100 = 900`
 219 file descriptors, not counting open JAR files, configuration files, and others. Opening a file does
 220 not take many resources, and the risk of allowing a user to open too many files is minimal.
 221 +
 222 Another related setting is the number of processes a user is allowed to run at once. In Linux and
 223 Unix, the number of processes is set using the `ulimit -u` command. This should not be confused
 224 with the `nproc` command, which controls the number of CPUs available to a given user. Under load,
 225 a `ulimit -u` that is too low can cause OutOfMemoryError exceptions.
 226 +
 227 Configuring the maximum number of file descriptors and processes for the user who is running the
 228 HBase process is an operating system configuration, rather than an HBase configuration. It is also
 229 important to be sure that the settings are changed for the user that actually runs HBase. To see
 230 which user started HBase, and that user's ulimit configuration, look at the first line of the
 231 HBase log for that instance.
 232 +
 233 .`ulimit` Settings on Ubuntu
 234 ====
 235 To configure ulimit settings on Ubuntu, edit _/etc/security/limits.conf_, which is a
 236 space-delimited file with four columns. Refer to the man page for _limits.conf_ for details about
 237 the format of this file. In the following example, the first line sets both soft and hard limits
 238 for the number of open files (nofile) to 32768 for the operating system user with the username
 239 hadoop. The second line sets the number of processes to 32000 for the same user.
 240 ----
 241 hadoop  -       nofile  32768
 242 hadoop  -       nproc   32000
 243 ----
 244 The settings are only applied if the Pluggable Authentication Module (PAM) environment is directed
 245 to use them. To configure PAM to use these limits, be sure that the _/etc/pam.d/common-session_
 246 file contains the following line:
 247 ----
 248 session required  pam_limits.so
 249 ----
 250 ====
 251
 252 Linux Shell::
 253   All of the shell scripts that come with HBase rely on the
 254 link:http://www.gnu.org/software/bash[GNU Bash] shell.
 255
 256 Windows::
 257   Running production systems on Windows machines is not recommended.
 258
 259 [[hadoop]]
 260 === link:https://hadoop.apache.org[Hadoop](((Hadoop)))
 261
 262 The following table summarizes the versions of Hadoop supported with each version of HBase. Older
 263 versions not appearing in this table are considered unsupported and likely missing necessary
 264 features, while newer versions are untested but may be suitable.
 265
 266 Based on the version of HBase, you should select the most appropriate version of Hadoop. You can
 267 use Apache Hadoop, or a vendor's distribution of Hadoop. No distinction is made here. See
 268 link:https://cwiki.apache.org/confluence/display/HADOOP2/Distributions+and+Commercial+Support[the Hadoop wiki]
 269 for information about vendors of Hadoop.
 270
 271 .Hadoop 2.x is recommended.
 272 [TIP]
 273 ====
 274 Hadoop 2.x is faster and includes features, such as short-circuit reads (see
 275 <<perf.hdfs.configs.localread>>), which will help improve your HBase random read profile. Hadoop
 276 2.x also includes important bug fixes that will improve your overall HBase experience. HBase does
 277 not support running with earlier versions of Hadoop. See the table below for requirements specific
 278 to different HBase versions.
 279
 280 Hadoop 3.x is still in early access releases and has not yet been sufficiently tested by the HBase community for production use cases.
 281 ====
 282
 283 Use the following legend to interpret this table:
 284
 285 .Hadoop version support matrix
 286
 287 * icon:check-circle[role="green"] = Tested to be fully-functional
 288 * icon:times-circle[role="red"] = Known to not be fully-functional, or there are
 289 link:https://hadoop.apache.org/cve_list.html[CVEs] so we drop the support in newer minor releases
 290 * icon:exclamation-circle[role="yellow"] = Not tested, may/may-not function
 291
 292 [cols="1,3*^.^", options="header"]
 293 |===
 294 | | HBase-1.7.x | HBase-2.3.x | HBase-2.4.x
 295 |Hadoop-2.10.x | icon:check-circle[role="green"] | icon:check-circle[role="green"] | icon:check-circle[role="green"]
 296 |Hadoop-3.1.0 | icon:times-circle[role="red"] | icon:times-circle[role="red"] | icon:times-circle[role="red"]
 297 |Hadoop-3.1.1+ | icon:times-circle[role="red"] | icon:check-circle[role="green"] | icon:check-circle[role="green"]
 298 |Hadoop-3.2.x | icon:times-circle[role="red"] | icon:check-circle[role="green"] | icon:check-circle[role="green"]
 299 |Hadoop-3.3.x | icon:times-circle[role="red"] | icon:check-circle[role="green"] | icon:check-circle[role="green"]
 300 |===
 301
 302 .Hadoop 2.y.0 Releases
 303 [TIP]
 304 ====
 305 Starting around the time of Hadoop version 2.7.0, the Hadoop PMC got into the habit of calling out
 306 new minor releases on their major version 2 release line as not stable / production ready. As such,
 307 HBase expressly advises downstream users to avoid running on top of these releases. Note that
 308 additionally the 2.8.1 release was given the same caveat by the Hadoop PMC. For reference, see the
 309 release announcements for link:https://s.apache.org/hadoop-2.7.0-announcement[Apache Hadoop 2.7.0],
 310 link:https://s.apache.org/hadoop-2.8.0-announcement[Apache Hadoop 2.8.0],
 311 link:https://s.apache.org/hadoop-2.8.1-announcement[Apache Hadoop 2.8.1], and
 312 link:https://s.apache.org/hadoop-2.9.0-announcement[Apache Hadoop 2.9.0].
 313 ====
 314
 315 .Hadoop 3.1.0 Release
 316 [TIP]
 317 ====
 318 The Hadoop PMC called out the 3.1.0 release as not stable / production ready. As such, HBase
 319 expressly advises downstream users to avoid running on top of this release. For reference, see
 320 the link:https://s.apache.org/hadoop-3.1.0-announcement[release announcement for Hadoop 3.1.0].
 321 ====
 322
 323 .Replace the Hadoop Bundled With HBase!
 324 [NOTE]
 325 ====
 326 Because HBase depends on Hadoop, it bundles Hadoop jars under its _lib_ directory. The bundled jars
 327 are ONLY for use in stand-alone mode. In distributed mode, it is _critical_ that the version of
 328 Hadoop that is out on your cluster match what is under HBase. Replace the hadoop jars found in the
 329 HBase lib directory with the equivalent hadoop jars from the version you are running on your
 330 cluster to avoid version mismatch issues. Make sure you replace the jars under HBase across your
 331 whole cluster. Hadoop version mismatch issues have various manifestations. Check for mismatch if
 332 HBase appears hung.
 333 ====
 334
 335 [[dfs.datanode.max.transfer.threads]]
 336 ==== `dfs.datanode.max.transfer.threads` (((dfs.datanode.max.transfer.threads)))
 337
 338 An HDFS DataNode has an upper bound on the number of files that it will serve at any one time.
 339 Before doing any loading, make sure you have configured Hadoop's _conf/hdfs-site.xml_, setting the
 340 `dfs.datanode.max.transfer.threads` value to at least the following:
 341
 342 [source,xml]
 343 ----
 344
 345 <property>
 346   <name>dfs.datanode.max.transfer.threads</name>
 347   <value>4096</value>
 348 </property>
 349 ----
 350
 351 Be sure to restart your HDFS after making the above configuration.
 352
 353 Not having this configuration in place makes for strange-looking failures.
 354 One manifestation is a complaint about missing blocks.
 355 For example:
 356
 357 ----
 358 10/12/08 20:10:31 INFO hdfs.DFSClient: Could not obtain block
 359           blk_XXXXXXXXXXXXXXXXXXXXXX_YYYYYYYY from any node: java.io.IOException: No live nodes
 360           contain current block. Will get new block locations from namenode and retry...
 361 ----
 362
 363 See also <<casestudies.max.transfer.threads,casestudies.max.transfer.threads>> and note that this
 364 property was previously known as `dfs.datanode.max.xcievers` (e.g.
 365 link:http://ccgtech.blogspot.com/2010/02/hadoop-hdfs-deceived-by-xciever.html[Hadoop HDFS: Deceived by Xciever]).
 366
 367 [[zookeeper.requirements]]
 368 === ZooKeeper Requirements
 369
 370 An Apache ZooKeeper quorum is required. The exact version depends on your version of HBase, though
 371 the minimum ZooKeeper version is 3.4.x due to the `useMulti` feature made default in 1.0.0
 372 (see https://issues.apache.org/jira/browse/HBASE-16598[HBASE-16598]).
 373
 374 [[standalone_dist]]
 375 == HBase run modes: Standalone and Distributed
 376
 377 HBase has two run modes: <<standalone,standalone>> and <<distributed,distributed>>.
 378 Out of the box, HBase runs in standalone mode.
 379 Whatever your mode, you will need to configure HBase by editing files in the HBase _conf_ directory.
 380 At a minimum, you must edit [code]+conf/hbase-env.sh+ to tell HBase which +java+ to use.
 381 In this file you set HBase environment variables such as the heapsize and other options for the
 382 `JVM`, the preferred location for log files, etc. Set [var]+JAVA_HOME+ to point at the root of
 383 your +java+ install.
 384
 385 [[standalone]]
 386 === Standalone HBase
 387
 388 This is the default mode.
 389 Standalone mode is what is described in the <<quickstart,quickstart>> section.
 390 In standalone mode, HBase does not use HDFS -- it uses the local filesystem instead -- and it runs
 391 all HBase daemons and a local ZooKeeper all up in the same JVM. ZooKeeper binds to a well-known
 392 port so clients may talk to HBase.
 393
 394 [[standalone.over.hdfs]]
 395 ==== Standalone HBase over HDFS
 396 A sometimes useful variation on standalone hbase has all daemons running inside the
 397 one JVM but rather than persist to the local filesystem, instead
 398 they persist to an HDFS instance.
 399
 400 You might consider this profile when you are intent on
 401 a simple deploy profile, the loading is light, but the
 402 data must persist across node comings and goings. Writing to
 403 HDFS where data is replicated ensures the latter.
 404
 405 To configure this standalone variant, edit your _hbase-site.xml_
 406 setting _hbase.rootdir_  to point at a directory in your
 407 HDFS instance but then set _hbase.cluster.distributed_
 408 to _false_. For example:
 409
 410 [source,xml]
 411 ----
 412 <configuration>
 413   <property>
 414     <name>hbase.rootdir</name>
 415     <value>hdfs://namenode.example.org:8020/hbase</value>
 416   </property>
 417   <property>
 418     <name>hbase.cluster.distributed</name>
 419     <value>false</value>
 420   </property>
 421 </configuration>
 422 ----
 423
 424 [[distributed]]
 425 === Distributed
 426
 427 Distributed mode can be subdivided into distributed but all daemons run on a single node -- a.k.a.
 428 _pseudo-distributed_ -- and _fully-distributed_ where the daemons are spread across all nodes in
 429 the cluster. The _pseudo-distributed_ vs. _fully-distributed_ nomenclature comes from Hadoop.
 430
 431 Pseudo-distributed mode can run against the local filesystem or it can run against an instance of
 432 the _Hadoop Distributed File System_ (HDFS). Fully-distributed mode can ONLY run on HDFS.
 433 See the Hadoop link:https://hadoop.apache.org/docs/current/[documentation] for how to set up HDFS.
 434 A good walk-through for setting up HDFS on Hadoop 2 can be found at
 435 http://www.alexjf.net/blog/distributed-systems/hadoop-yarn-installation-definitive-guide.
 436
 437 [[pseudo]]
 438 ==== Pseudo-distributed
 439
 440 .Pseudo-Distributed Quickstart
 441 [NOTE]
 442 ====
 443 A quickstart has been added to the <<quickstart,quickstart>> chapter.
 444 See <<quickstart_pseudo,quickstart-pseudo>>.
 445 Some of the information that was originally in this section has been moved there.
 446 ====
 447
 448 A pseudo-distributed mode is simply a fully-distributed mode run on a single host.
 449 Use this HBase configuration for testing and prototyping purposes only.
 450 Do not use this configuration for production or for performance evaluation.
 451
 452 [[fully_dist]]
 453 === Fully-distributed
 454
 455 By default, HBase runs in stand-alone mode. Both stand-alone mode and pseudo-distributed mode are
 456 provided for the purposes of small-scale testing. For a production environment, distributed mode
 457 is advised. In distributed mode, multiple instances of HBase daemons run on multiple servers in the
 458 cluster.
 459
 460 Just as in pseudo-distributed mode, a fully distributed configuration requires that you set the
 461 `hbase.cluster.distributed` property to `true`. Typically, the `hbase.rootdir` is configured to
 462 point to a highly-available HDFS filesystem.
 463
 464 In addition, the cluster is configured so that multiple cluster nodes enlist as RegionServers,
 465 ZooKeeper QuorumPeers, and backup HMaster servers. These configuration basics are all demonstrated
 466 in <<quickstart_fully_distributed,quickstart-fully-distributed>>.
 467
 468 .Distributed RegionServers
 469 Typically, your cluster will contain multiple RegionServers all running on different servers, as
 470 well as primary and backup Master and ZooKeeper daemons. The _conf/regionservers_ file on the
 471 master server contains a list of hosts whose RegionServers are associated with this cluster.
 472 Each host is on a separate line. All hosts listed in this file will have their RegionServer
 473 processes started and stopped when the
 474 master server starts or stops.
 475
 476 .ZooKeeper and HBase
 477 See the <<zookeeper,ZooKeeper>> section for ZooKeeper setup instructions for HBase.
 478
 479 .Example Distributed HBase Cluster
 480 ====
 481 This is a bare-bones _conf/hbase-site.xml_ for a distributed HBase cluster.
 482 A cluster that is used for real-world work would contain more custom configuration parameters.
 483 Most HBase configuration directives have default values, which are used unless the value is
 484 overridden in the _hbase-site.xml_. See "<<config.files,Configuration Files>>" for more information.
 485
 486 [source,xml]
 487 ----
 488
 489 <configuration>
 490   <property>
 491     <name>hbase.rootdir</name>
 492     <value>hdfs://namenode.example.org:8020/hbase</value>
 493   </property>
 494   <property>
 495     <name>hbase.cluster.distributed</name>
 496     <value>true</value>
 497   </property>
 498   <property>
 499     <name>hbase.zookeeper.quorum</name>
 500     <value>node-a.example.com,node-b.example.com,node-c.example.com</value>
 501   </property>
 502 </configuration>
 503 ----
 504
 505 This is an example _conf/regionservers_ file, which contains a list of nodes that should run a
 506 RegionServer in the cluster. These nodes need HBase installed and they need to use the same
 507 contents of the _conf/_ directory as the Master server.
 508
 509 [source]
 510 ----
 511
 512 node-a.example.com
 513 node-b.example.com
 514 node-c.example.com
 515 ----
 516
 517 This is an example _conf/backup-masters_ file, which contains a list of each node that should run
 518 a backup Master instance. The backup Master instances will sit idle unless the main Master becomes
 519 unavailable.
 520
 521 [source]
 522 ----
 523
 524 node-b.example.com
 525 node-c.example.com
 526 ----
 527 ====
 528
 529 .Distributed HBase Quickstart
 530 See <<quickstart_fully_distributed,quickstart-fully-distributed>> for a walk-through of a simple
 531 three-node cluster configuration with multiple ZooKeeper, backup HMaster, and RegionServer
 532 instances.
 533
 534 .Procedure: HDFS Client Configuration
 535 . Of note, if you have made HDFS client configuration changes on your Hadoop cluster, such as
 536 configuration directives for HDFS clients, as opposed to server-side configurations, you must use
 537 one of the following methods to enable HBase to see and use these configuration changes:
 538 +
 539 a. Add a pointer to your `HADOOP_CONF_DIR` to the `HBASE_CLASSPATH` environment variable in
 540 _hbase-env.sh_.
 541 b. Add a copy of _hdfs-site.xml_ (or _hadoop-site.xml_) or, better, symlinks, under
 542 _${HBASE_HOME}/conf_, or
 543 c. if only a small set of HDFS client configurations, add them to _hbase-site.xml_.
 544
 545
 546 An example of such an HDFS client configuration is `dfs.replication`.
 547 If for example, you want to run with a replication factor of 5, HBase will create files with the
 548 default of 3 unless you do the above to make the configuration available to HBase.
 549
 550 [[confirm]]
 551 == Running and Confirming Your Installation
 552
 553 Make sure HDFS is running first.
 554 Start and stop the Hadoop HDFS daemons by running _bin/start-hdfs.sh_ over in the `HADOOP_HOME`
 555 directory. You can ensure it started properly by testing the `put` and `get` of files into the
 556 Hadoop filesystem. HBase does not normally use the MapReduce or YARN daemons. These do not need to
 557 be started.
 558
 559 _If_ you are managing your own ZooKeeper, start it and confirm it's running, else HBase will start
 560 up ZooKeeper for you as part of its start process.
 561
 562 Start HBase with the following command:
 563
 564 ----
 565 bin/start-hbase.sh
 566 ----
 567
 568 Run the above from the `HBASE_HOME` directory.
 569
 570 You should now have a running HBase instance.
 571 HBase logs can be found in the _logs_ subdirectory.
 572 Check them out especially if HBase had trouble starting.
 573
 574 HBase also puts up a UI listing vital attributes.
 575 By default it's deployed on the Master host at port 16010 (HBase RegionServers listen on port 16020
 576 by default and put up an informational HTTP server at port 16030). If the Master is running on a
 577 host named `master.example.org` on the default port, point your browser at
 578 pass:[http://master.example.org:16010] to see the web interface.
 579
 580 Once HBase has started, see the <<shell_exercises,shell exercises>> section for how to create
 581 tables, add data, scan your insertions, and finally disable and drop your tables.
 582
 583 To stop HBase after exiting the HBase shell enter
 584
 585 ----
 586 $ ./bin/stop-hbase.sh
 587 stopping hbase...............
 588 ----
 589
 590 Shutdown can take a moment to complete.
 591 It can take longer if your cluster is comprised of many machines.
 592 If you are running a distributed operation, be sure to wait until HBase has shut down completely
 593 before stopping the Hadoop daemons.
 594
 595 [[config.files]]
 596 == Default Configuration
 597
 598 [[hbase.site]]
 599 === _hbase-site.xml_ and _hbase-default.xml_
 600
 601 Just as in Hadoop where you add site-specific HDFS configuration to the _hdfs-site.xml_ file, for
 602 HBase, site specific customizations go into the file _conf/hbase-site.xml_. For the list of
 603 configurable properties, see <<hbase_default_configurations,hbase default configurations>> below
 604 or view the raw _hbase-default.xml_ source file in the HBase source code at _src/main/resources_.
 605
 606 Not all configuration options make it out to _hbase-default.xml_.
 607 Some configurations would only appear in source code; the only way to identify these changes are
 608 through code review.
 609
 610 Currently, changes here will require a cluster restart for HBase to notice the change.
 611 // hbase/src/main/asciidoc
 612 //
 613 include::{docdir}/../../../target/asciidoc/hbase-default.adoc[]
 614
 615
 616 [[hbase.env.sh]]
 617 === _hbase-env.sh_
 618
 619 Set HBase environment variables in this file. Examples include options to pass the JVM on start of
 620 an HBase daemon such as heap size and garbage collector configs.
 621 You can also set configurations for HBase configuration, log directories, niceness, ssh options,
 622 where to locate process pid files, etc. Open the file at _conf/hbase-env.sh_ and peruse its content.
 623 Each option is fairly well documented. Add your own environment variables here if you want them
 624 read by HBase daemons on startup.
 625
 626 Changes here will require a cluster restart for HBase to notice the change.
 627
 628 [[log4j]]
 629 === _log4j2.xml_
 630
 631 Since version 3.0.0, HBase has upgraded to Log4j2, so the configuration file name and format has changed. Read more in link:https://logging.apache.org/log4j/2.x/index.html[Apache Log4j2].
 632
 633 Edit this file to change rate at which HBase files are rolled and to change the level at which
 634 HBase logs messages.
 635
 636 Changes here will require a cluster restart for HBase to notice the change though log levels can
 637 be changed for particular daemons via the HBase UI.
 638
 639 [[client_dependencies]]
 640 === Client configuration and dependencies connecting to an HBase cluster
 641
 642 If you are running HBase in standalone mode, you don't need to configure anything for your client
 643 to work provided that they are all on the same machine.
 644
 645 Starting release 3.0.0, the default connection registry has been switched to a master based
 646 implementation. Refer to <<client.masterregistry>> for more details about what a connection
 647 registry is and implications of this change. Depending on your HBase version, following is the
 648 expected minimal client configuration.
 649
 650 ==== Up until 2.x.y releases
 651 In 2.x.y releases, the default connection registry was based on ZooKeeper as the source of truth.
 652 This means that the clients always looked up ZooKeeper znodes to fetch the required metadata. For
 653 example, if an active master crashed and the a new master is elected, clients looked up the master
 654 znode to fetch the active master address (similarly for meta locations). This meant that the
 655 clients needed to have access to ZooKeeper and need to know the ZooKeeper ensemble information
 656 before they can do anything. This can be configured in the client configuration xml as follows:
 657
 658 [source,xml]
 659 ----
 660 <?xml version="1.0"?>
 661 <?xml-stylesheet type="text/xsl" href="configuration.xsl"?>
 662 <configuration>
 663   <property>
 664     <name>hbase.zookeeper.quorum</name>
 665     <value>example1,example2,example3</value>
 666     <description> Zookeeper ensemble information</description>
 667   </property>
 668 </configuration>
 669 ----
 670
 671 ==== Starting 3.0.0 release
 672
 673 The default implementation was switched to a master based connection registry. With this
 674 implementation, clients always contact the active or stand-by master RPC end points to fetch the
 675 connection registry information. This means that the clients should have access to the list of
 676 active and master end points before they can do anything. This can be configured in the client
 677 configuration xml as follows:
 678
 679 [source,xml]
 680 ----
 681 <?xml version="1.0"?>
 682 <?xml-stylesheet type="text/xsl" href="configuration.xsl"?>
 683 <configuration>
 684   <property>
 685     <name>hbase.masters</name>
 686     <value>example1,example2,example3</value>
 687     <description>List of master rpc end points for the hbase cluster.</description>
 688   </property>
 689 </configuration>
 690 ----
 691
 692 The configuration value for _hbase.masters_ is a comma separated list of _host:port_ values. If no
 693 port value is specified, the default of _16000_ is assumed.
 694
 695 Usually this configuration is kept out in the _hbase-site.xml_ and is picked up by the client from
 696 the `CLASSPATH`.
 697
 698 If you are configuring an IDE to run an HBase client, you should include the _conf/_ directory on
 699 your classpath so _hbase-site.xml_ settings can be found (or add _src/test/resources_ to pick up
 700 the hbase-site.xml used by tests).
 701
 702 For Java applications using Maven, including the hbase-shaded-client module is the recommended
 703 dependency when connecting to a cluster:
 704 [source,xml]
 705 ----
 706 <dependency>
 707   <groupId>org.apache.hbase</groupId>
 708   <artifactId>hbase-shaded-client</artifactId>
 709   <version>2.0.0</version>
 710 </dependency>
 711 ----
 712
 713 [[java.client.config]]
 714 ==== Java client configuration
 715
 716 The configuration used by a Java client is kept in an
 717 link:https://hbase.apache.org/apidocs/org/apache/hadoop/hbase/HBaseConfiguration[HBaseConfiguration]
 718 instance.
 719
 720 The factory method on HBaseConfiguration, `HBaseConfiguration.create();`, on invocation, will read
 721 in the content of the first _hbase-site.xml_ found on the client's `CLASSPATH`, if one is present
 722 (Invocation will also factor in any _hbase-default.xml_ found; an _hbase-default.xml_ ships inside
 723 the _hbase.X.X.X.jar_). It is also possible to specify configuration directly without having to
 724 read from a _hbase-site.xml_.
 725
 726 For example, to set the ZooKeeper ensemble for the cluster programmatically do as follows:
 727
 728 [source,java]
 729 ----
 730 Configuration config = HBaseConfiguration.create();
 731 config.set("hbase.zookeeper.quorum", "localhost");  // Until 2.x.y versions
 732 // ---- or ----
 733 config.set("hbase.masters", "localhost:1234"); // Starting 3.0.0 version
 734 ----
 735
 736 [[config_timeouts]]
 737 === Timeout settings
 738
 739 HBase provides a wide variety of timeout settings to limit the execution time of various remote
 740 operations.
 741
 742 * hbase.rpc.timeout
 743 * hbase.rpc.read.timeout
 744 * hbase.rpc.write.timeout
 745 * hbase.client.operation.timeout
 746 * hbase.client.meta.operation.timeout
 747 * hbase.client.scanner.timeout.period
 748
 749 The `hbase.rpc.timeout` property limits how long a single RPC call can run before timing out.
 750 To fine tune read or write related RPC timeouts set `hbase.rpc.read.timeout` and
 751 `hbase.rpc.write.timeout` configuration properties. In the absence of these properties
 752 `hbase.rpc.timeout` will be used.
 753
 754 A higher-level timeout is `hbase.client.operation.timeout` which is valid for each client call.
 755 When an RPC call fails for instance for a timeout due to `hbase.rpc.timeout` it will be retried
 756 until `hbase.client.operation.timeout` is reached. Client operation timeout for system tables can
 757 be fine tuned by setting `hbase.client.meta.operation.timeout` configuration value.
 758 When this is not set its value will use `hbase.client.operation.timeout`.
 759
 760 Timeout for scan operations is controlled differently. Use `hbase.client.scanner.timeout.period`
 761 property to set this timeout.
 762
 763 [[example_config]]
 764 == Example Configurations
 765
 766 === Basic Distributed HBase Install
 767
 768 Here is a basic configuration example for a distributed ten node cluster:
 769 * The nodes are named `example0`, `example1`, etc., through node `example9` in this example.
 770 * The HBase Master and the HDFS NameNode are running on the node `example0`.
 771 * RegionServers run on nodes `example1`-`example9`.
 772 * A 3-node ZooKeeper ensemble runs on `example1`, `example2`, and `example3` on the default ports.
 773 * ZooKeeper data is persisted to the directory _/export/zookeeper_.
 774
 775 Below we show what the main configuration files -- _hbase-site.xml_, _regionservers_, and
 776 _hbase-env.sh_ -- found in the HBase _conf_ directory might look like.
 777
 778 [[hbase_site]]
 779 ==== _hbase-site.xml_
 780
 781 [source,xml]
 782 ----
 783 <?xml version="1.0"?>
 784 <?xml-stylesheet type="text/xsl" href="configuration.xsl"?>
 785 <configuration>
 786   <property>
 787     <name>hbase.zookeeper.quorum</name>
 788     <value>example1,example2,example3</value>
 789     <description>The directory shared by RegionServers.
 790     </description>
 791   </property>
 792   <property>
 793     <name>hbase.zookeeper.property.dataDir</name>
 794     <value>/export/zookeeper</value>
 795     <description>Property from ZooKeeper config zoo.cfg.
 796     The directory where the snapshot is stored.
 797     </description>
 798   </property>
 799   <property>
 800     <name>hbase.rootdir</name>
 801     <value>hdfs://example0:8020/hbase</value>
 802     <description>The directory shared by RegionServers.
 803     </description>
 804   </property>
 805   <property>
 806     <name>hbase.cluster.distributed</name>
 807     <value>true</value>
 808     <description>The mode the cluster will be in. Possible values are
 809       false: standalone and pseudo-distributed setups with managed ZooKeeper
 810       true: fully-distributed with unmanaged ZooKeeper Quorum (see hbase-env.sh)
 811     </description>
 812   </property>
 813 </configuration>
 814 ----
 815
 816 [[regionservers]]
 817 ==== _regionservers_
 818
 819 In this file you list the nodes that will run RegionServers.
 820 In our case, these nodes are `example1`-`example9`.
 821
 822 [source]
 823 ----
 824 example1
 825 example2
 826 example3
 827 example4
 828 example5
 829 example6
 830 example7
 831 example8
 832 example9
 833 ----
 834
 835 [[hbase_env]]
 836 ==== _hbase-env.sh_
 837
 838 The following lines in the _hbase-env.sh_ file show how to set the `JAVA_HOME` environment variable
 839 (required for HBase) and set the heap to 4 GB (rather than the default value of 1 GB). If you copy
 840 and paste this example, be sure to adjust the `JAVA_HOME` to suit your environment.
 841
 842 ----
 843 # The java implementation to use.
 844 export JAVA_HOME=/usr/java/jdk1.8.0/
 845
 846 # The maximum amount of heap to use. Default is left to JVM default.
 847 export HBASE_HEAPSIZE=4G
 848 ----
 849
 850 Use +rsync+ to copy the content of the _conf_ directory to all nodes of the cluster.
 851
 852 [[important_configurations]]
 853 == The Important Configurations
 854
 855 Below we list some _important_ configurations.
 856 We've divided this section into required configuration and worth-a-look recommended configs.
 857
 858 [[required_configuration]]
 859 === Required Configurations
 860
 861 Review the <<os,os>> and <<hadoop,hadoop>> sections.
 862
 863 [[big.cluster.config]]
 864 ==== Big Cluster Configurations
 865
 866 If you have a cluster with a lot of regions, it is possible that a Regionserver checks in briefly
 867 after the Master starts while all the remaining RegionServers lag behind. This first server to
 868 check in will be assigned all regions which is not optimal. To prevent the above scenario from
 869 happening, up the `hbase.master.wait.on.regionservers.mintostart` property from its default value
 870 of 1. See link:https://issues.apache.org/jira/browse/HBASE-6389[HBASE-6389 Modify the
 871             conditions to ensure that Master waits for sufficient number of Region Servers before
 872             starting region assignments] for more detail.
 873
 874 [[recommended_configurations]]
 875 === Recommended Configurations
 876
 877 [[recommended_configurations.zk]]
 878 ==== ZooKeeper Configuration
 879
 880 [[sect.zookeeper.session.timeout]]
 881 ===== `zookeeper.session.timeout`
 882
 883 The default timeout is 90 seconds (specified in milliseconds). This means that if a server crashes,
 884 it will be 90 seconds before the Master notices the crash and starts recovery. You might need to
 885 tune the timeout down to a minute or even less so the Master notices failures sooner. Before
 886 changing this value, be sure you have your JVM garbage collection configuration under control,
 887 otherwise, a long garbage collection that lasts beyond the ZooKeeper session timeout will take out
 888 your RegionServer. (You might be fine with this -- you probably want recovery to start on the
 889 server if a RegionServer has been in GC for a long period of time).
 890
 891 To change this configuration, edit _hbase-site.xml_, copy the changed file across the cluster and
 892 restart.
 893
 894 We set this value high to save our having to field questions up on the mailing lists asking why a
 895 RegionServer went down during a massive import. The usual cause is that their JVM is untuned and
 896 they are running into long GC pauses. Our thinking is that while users are getting familiar with
 897 HBase, we'd save them having to know all of its intricacies. Later when they've built some
 898 confidence, then they can play with configuration such as this.
 899
 900 [[zookeeper.instances]]
 901 ===== Number of ZooKeeper Instances
 902
 903 See <<zookeeper,zookeeper>>.
 904
 905 [[recommended.configurations.hdfs]]
 906 ==== HDFS Configurations
 907
 908 [[dfs.datanode.failed.volumes.tolerated]]
 909 ===== `dfs.datanode.failed.volumes.tolerated`
 910
 911 This is the "...number of volumes that are allowed to fail before a DataNode stops offering
 912 service. By default, any volume failure will cause a datanode to shutdown" from the
 913 _hdfs-default.xml_ description. You might want to set this to about half the amount of your
 914 available disks.
 915
 916 [[hbase.regionserver.handler.count]]
 917 ===== `hbase.regionserver.handler.count`
 918
 919 This setting defines the number of threads that are kept open to answer incoming requests to user
 920 tables. The rule of thumb is to keep this number low when the payload per request approaches the MB
 921 (big puts, scans using a large cache) and high when the payload is small (gets, small puts, ICVs,
 922 deletes). The total size of the queries in progress is limited by the setting
 923 `hbase.ipc.server.max.callqueue.size`.
 924
 925 It is safe to set that number to the maximum number of incoming clients if their payload is small,
 926 the typical example being a cluster that serves a website since puts aren't typically buffered and
 927 most of the operations are gets.
 928
 929 The reason why it is dangerous to keep this setting high is that the aggregate size of all the puts
 930 that are currently happening in a region server may impose too much pressure on its memory, or even
 931 trigger an OutOfMemoryError. A RegionServer running on low memory will trigger its JVM's garbage
 932 collector to run more frequently up to a point where GC pauses become noticeable (the reason being
 933 that all the memory used to keep all the requests' payloads cannot be trashed, no matter how hard
 934 the garbage collector tries). After some time, the overall cluster throughput is affected since
 935 every request that hits that RegionServer will take longer, which exacerbates the problem even more.
 936
 937 You can get a sense of whether you have too little or too many handlers by
 938 <<rpc.logging,rpc.logging>> on an individual RegionServer then tailing its logs (Queued requests
 939 consume memory).
 940
 941 [[big_memory]]
 942 ==== Configuration for large memory machines
 943
 944 HBase ships with a reasonable, conservative configuration that will work on nearly all machine
 945 types that people might want to test with. If you have larger machines -- HBase has 8G and larger
 946 heap -- you might find the following configuration options helpful.
 947 TODO.
 948
 949 [[config.compression]]
 950 ==== Compression
 951
 952 You should consider enabling ColumnFamily compression.
 953 There are several options that are near-frictionless and in most all cases boost performance by
 954 reducing the size of StoreFiles and thus reducing I/O.
 955
 956 See <<compression,compression>> for more information.
 957
 958 [[config.wals]]
 959 ==== Configuring the size and number of WAL files
 960
 961 HBase uses <<wal,wal>> to recover the memstore data that has not been flushed to disk in case of
 962 an RS failure. These WAL files should be configured to be slightly smaller than HDFS block (by
 963 default a HDFS block is 64Mb and a WAL file is ~60Mb).
 964
 965 HBase also has a limit on the number of WAL files, designed to ensure there's never too much data
 966 that needs to be replayed during recovery. This limit needs to be set according to memstore
 967 configuration, so that all the necessary data would fit. It is recommended to allocate enough WAL
 968 files to store at least that much data (when all memstores are close to full). For example, with
 969 16Gb RS heap, default memstore settings (0.4), and default WAL file size (~60Mb), 16Gb*0.4/60, the
 970 starting point for WAL file count is ~109. However, as all memstores are not expected to be full
 971 all the time, less WAL files can be allocated.
 972
 973 [[disable.splitting]]
 974 ==== Managed Splitting
 975
 976 HBase generally handles splitting of your regions based upon the settings in your
 977 _hbase-default.xml_ and _hbase-site.xml_ configuration files. Important settings include
 978 `hbase.regionserver.region.split.policy`, `hbase.hregion.max.filesize`,
 979 `hbase.regionserver.regionSplitLimit`. A simplistic view of splitting is that when a region grows
 980 to `hbase.hregion.max.filesize`, it is split. For most usage patterns, you should use automatic
 981 splitting. See <<manual_region_splitting_decisions,manual region splitting decisions>> for more
 982 information about manual region splitting.
 983
 984 Instead of allowing HBase to split your regions automatically, you can choose to manage the
 985 splitting yourself. Manually managing splits works if you know your keyspace well, otherwise let
 986 HBase figure where to split for you. Manual splitting can mitigate region creation and movement
 987 under load. It also makes it so region boundaries are known and invariant (if you disable region
 988 splitting). If you use manual splits, it is easier doing staggered, time-based major compactions
 989 to spread out your network IO load.
 990
 991 .Disable Automatic Splitting
 992 To disable automatic splitting, you can set region split policy in either cluster configuration
 993 or table configuration to be `org.apache.hadoop.hbase.regionserver.DisabledRegionSplitPolicy`
 994
 995 .Automatic Splitting Is Recommended
 996 [NOTE]
 997 ====
 998 If you disable automatic splits to diagnose a problem or during a period of fast data growth, it
 999 is recommended to re-enable them when your situation becomes more stable. The potential benefits
1000 of managing region splits yourself are not undisputed.
1001 ====
1002
1003 .Determine the Optimal Number of Pre-Split Regions
1004 The optimal number of pre-split regions depends on your application and environment. A good rule of
1005 thumb is to start with 10 pre-split regions per server and watch as data grows over time. It is
1006 better to err on the side of too few regions and perform rolling splits later. The optimal number
1007 of regions depends upon the largest StoreFile in your region. The size of the largest StoreFile
1008 will increase with time if the amount of data grows. The goal is for the largest region to be just
1009 large enough that the compaction selection algorithm only compacts it during a timed major
1010 compaction. Otherwise, the cluster can be prone to compaction storms with a large number of regions
1011 under compaction at the same time. It is important to understand that the data growth causes
1012 compaction storms and not the manual split decision.
1013
1014 If the regions are split into too many large regions, you can increase the major compaction
1015 interval by configuring `HConstants.MAJOR_COMPACTION_PERIOD`. The
1016 `org.apache.hadoop.hbase.util.RegionSplitter` utility also provides a network-IO-safe rolling
1017 split of all regions.
1018
1019 [[managed.compactions]]
1020 ==== Managed Compactions
1021
1022 By default, major compactions are scheduled to run once in a 7-day period.
1023
1024 If you need to control exactly when and how often major compaction runs, you can disable managed
1025 major compactions. See the entry for `hbase.hregion.majorcompaction` in the
1026 <<compaction.parameters,compaction.parameters>> table for details.
1027
1028 .Do Not Disable Major Compactions
1029 [WARNING]
1030 ====
1031 Major compactions are absolutely necessary for StoreFile clean-up. Do not disable them altogether.
1032 You can run major compactions manually via the HBase shell or via the
1033 link:https://hbase.apache.org/apidocs/org/apache/hadoop/hbase/client/Admin.html#majorCompact-org.apache.hadoop.hbase.TableName-[Admin API].
1034 ====
1035
1036 For more information about compactions and the compaction file selection process, see
1037 <<compaction,compaction>>
1038
1039 [[spec.ex]]
1040 ==== Speculative Execution
1041
1042 Speculative Execution of MapReduce tasks is on by default, and for HBase clusters it is generally
1043 advised to turn off Speculative Execution at a system-level unless you need it for a specific case,
1044 where it can be configured per-job. Set the properties `mapreduce.map.speculative` and
1045 `mapreduce.reduce.speculative` to false.
1046
1047 [[other_configuration]]
1048 === Other Configurations
1049
1050 [[balancer_config]]
1051 ==== Balancer
1052
1053 The balancer is a periodic operation which is run on the master to redistribute regions on the
1054 cluster. It is configured via `hbase.balancer.period` and defaults to 300000 (5 minutes).
1055
1056 See <<master.processes.loadbalancer,master.processes.loadbalancer>> for more information on the
1057 LoadBalancer.
1058
1059 [[disabling.blockcache]]
1060 ==== Disabling Blockcache
1061
1062 Do not turn off block cache (You'd do it by setting `hfile.block.cache.size` to zero). Currently,
1063 we do not do well if you do this because the RegionServer will spend all its time loading HFile
1064 indices over and over again. If your working set is such that block cache does you no good, at
1065 least size the block cache such that HFile indices will stay up in the cache (you can get a rough
1066 idea on the size you need by surveying RegionServer UIs; you'll see index block size accounted near
1067 the top of the webpage).
1068
1069 [[nagles]]
1070 ==== link:http://en.wikipedia.org/wiki/Nagle's_algorithm[Nagle's] or the small package problem
1071
1072 If a big 40ms or so occasional delay is seen in operations against HBase, try the Nagles' setting.
1073 For example, see the user mailing list thread,
1074 link:https://lists.apache.org/thread.html/3d7ceb41c04a955b1b1c80480cdba95208ca3e97bf6895a40e0c1bbb%401346186127%40%3Cuser.hbase.apache.org%3E[Inconsistent scan performance with caching set to 1]
1075 and the issue cited therein where setting `notcpdelay` improved scan speeds. You might also see the
1076 graphs on the tail of
1077 link:https://issues.apache.org/jira/browse/HBASE-7008[HBASE-7008 Set scanner caching to a better default]
1078 where our Lars Hofhansl tries various data sizes w/ Nagle's on and off measuring the effect.
1079
1080 [[mttr]]
1081 ==== Better Mean Time to Recover (MTTR)
1082
1083 This section is about configurations that will make servers come back faster after a fail. See the
1084 Deveraj Das and Nicolas Liochon blog post
1085 link:http://hortonworks.com/blog/introduction-to-hbase-mean-time-to-recover-mttr/[Introduction to HBase Mean Time to Recover (MTTR)]
1086 for a brief introduction.
1087
1088 The issue
1089 link:https://issues.apache.org/jira/browse/HBASE-8389[HBASE-8354 forces Namenode into loop with lease recovery requests]
1090 is messy but has a bunch of good discussion toward the end on low timeouts and how to cause faster
1091 recovery including citation of fixes added to HDFS. Read the Varun Sharma comments. The below
1092 suggested configurations are Varun's suggestions distilled and tested. Make sure you are running
1093 on a late-version HDFS so you have the fixes he refers to and himself adds to HDFS that help HBase
1094 MTTR (e.g. HDFS-3703, HDFS-3712, and HDFS-4791 -- Hadoop 2 for sure has them and late Hadoop 1 has
1095 some). Set the following in the RegionServer.
1096
1097 [source,xml]
1098 ----
1099 <property>
1100   <name>hbase.lease.recovery.dfs.timeout</name>
1101   <value>23000</value>
1102   <description>How much time we allow elapse between calls to recover lease.
1103   Should be larger than the dfs timeout.</description>
1104 </property>
1105 <property>
1106   <name>dfs.client.socket-timeout</name>
1107   <value>10000</value>
1108   <description>Down the DFS timeout from 60 to 10 seconds.</description>
1109 </property>
1110 ----
1111
1112 And on the NameNode/DataNode side, set the following to enable 'staleness' introduced in HDFS-3703,
1113 HDFS-3912.
1114
1115 [source,xml]
1116 ----
1117 <property>
1118   <name>dfs.client.socket-timeout</name>
1119   <value>10000</value>
1120   <description>Down the DFS timeout from 60 to 10 seconds.</description>
1121 </property>
1122 <property>
1123   <name>dfs.datanode.socket.write.timeout</name>
1124   <value>10000</value>
1125   <description>Down the DFS timeout from 8 * 60 to 10 seconds.</description>
1126 </property>
1127 <property>
1128   <name>ipc.client.connect.timeout</name>
1129   <value>3000</value>
1130   <description>Down from 60 seconds to 3.</description>
1131 </property>
1132 <property>
1133   <name>ipc.client.connect.max.retries.on.timeouts</name>
1134   <value>2</value>
1135   <description>Down from 45 seconds to 3 (2 == 3 retries).</description>
1136 </property>
1137 <property>
1138   <name>dfs.namenode.avoid.read.stale.datanode</name>
1139   <value>true</value>
1140   <description>Enable stale state in hdfs</description>
1141 </property>
1142 <property>
1143   <name>dfs.namenode.stale.datanode.interval</name>
1144   <value>20000</value>
1145   <description>Down from default 30 seconds</description>
1146 </property>
1147 <property>
1148   <name>dfs.namenode.avoid.write.stale.datanode</name>
1149   <value>true</value>
1150   <description>Enable stale state in hdfs</description>
1151 </property>
1152 ----
1153
1154 [[jmx_config]]
1155 ==== JMX
1156
1157 JMX (Java Management Extensions) provides built-in instrumentation that enables you to monitor and
1158 manage the Java VM. To enable monitoring and management from remote systems, you need to set system
1159 property `com.sun.management.jmxremote.port` (the port number through which you want to enable JMX
1160 RMI connections) when you start the Java VM. See the
1161 link:http://docs.oracle.com/javase/8/docs/technotes/guides/management/agent.html[official documentation]
1162 for more information. Historically, besides above port mentioned, JMX opens two additional random
1163 TCP listening ports, which could lead to port conflict problem. (See
1164 link:https://issues.apache.org/jira/browse/HBASE-10289[HBASE-10289] for details)
1165
1166 As an alternative, you can use the coprocessor-based JMX implementation provided by HBase. To
1167 enable it, add below property in _hbase-site.xml_:
1168
1169 [source,xml]
1170 ----
1171 <property>
1172   <name>hbase.coprocessor.regionserver.classes</name>
1173   <value>org.apache.hadoop.hbase.JMXListener</value>
1174 </property>
1175 ----
1176
1177 NOTE: DO NOT set `com.sun.management.jmxremote.port` for Java VM at the same time.
1178
1179 Currently it supports Master and RegionServer Java VM.
1180 By default, the JMX listens on TCP port 10102, you can further configure the port using below
1181 properties:
1182
1183 [source,xml]
1184 ----
1185 <property>
1186   <name>regionserver.rmi.registry.port</name>
1187   <value>61130</value>
1188 </property>
1189 <property>
1190   <name>regionserver.rmi.connector.port</name>
1191   <value>61140</value>
1192 </property>
1193 ----
1194
1195 The registry port can be shared with connector port in most cases, so you only need to configure
1196 `regionserver.rmi.registry.port`. However, if you want to use SSL communication, the 2 ports must
1197 be configured to different values.
1198
1199 By default the password authentication and SSL communication is disabled.
1200 To enable password authentication, you need to update _hbase-env.sh_ like below:
1201 [source,bash]
1202 ----
1203 export HBASE_JMX_BASE="-Dcom.sun.management.jmxremote.authenticate=true                  \
1204                        -Dcom.sun.management.jmxremote.password.file=your_password_file   \
1205                        -Dcom.sun.management.jmxremote.access.file=your_access_file"
1206
1207 export HBASE_MASTER_OPTS="$HBASE_MASTER_OPTS $HBASE_JMX_BASE "
1208 export HBASE_REGIONSERVER_OPTS="$HBASE_REGIONSERVER_OPTS $HBASE_JMX_BASE "
1209 ----
1210
1211 See example password/access file under _$JRE_HOME/lib/management_.
1212
1213 To enable SSL communication with password authentication, follow below steps:
1214
1215 [source,bash]
1216 ----
1217 #1. generate a key pair, stored in myKeyStore
1218 keytool -genkey -alias jconsole -keystore myKeyStore
1219
1220 #2. export it to file jconsole.cert
1221 keytool -export -alias jconsole -keystore myKeyStore -file jconsole.cert
1222
1223 #3. copy jconsole.cert to jconsole client machine, import it to jconsoleKeyStore
1224 keytool -import -alias jconsole -keystore jconsoleKeyStore -file jconsole.cert
1225 ----
1226
1227 And then update _hbase-env.sh_ like below:
1228
1229 [source,bash]
1230 ----
1231 export HBASE_JMX_BASE="-Dcom.sun.management.jmxremote.ssl=true                         \
1232                        -Djavax.net.ssl.keyStore=/home/tianq/myKeyStore                 \
1233                        -Djavax.net.ssl.keyStorePassword=your_password_in_step_1        \
1234                        -Dcom.sun.management.jmxremote.authenticate=true                \
1235                        -Dcom.sun.management.jmxremote.password.file=your_password file \
1236                        -Dcom.sun.management.jmxremote.access.file=your_access_file"
1237
1238 export HBASE_MASTER_OPTS="$HBASE_MASTER_OPTS $HBASE_JMX_BASE "
1239 export HBASE_REGIONSERVER_OPTS="$HBASE_REGIONSERVER_OPTS $HBASE_JMX_BASE "
1240 ----
1241
1242 Finally start `jconsole` on the client using the key store:
1243
1244 [source,bash]
1245 ----
1246 jconsole -J-Djavax.net.ssl.trustStore=/home/tianq/jconsoleKeyStore
1247 ----
1248
1249 NOTE: To enable the HBase JMX implementation on Master, you also need to add below property in
1250 _hbase-site.xml_:
1251
1252 [source,xml]
1253 ----
1254 <property>
1255   <name>hbase.coprocessor.master.classes</name>
1256   <value>org.apache.hadoop.hbase.JMXListener</value>
1257 </property>
1258 ----
1259
1260 The corresponding properties for port configuration are `master.rmi.registry.port` (by default
1261 10101) and `master.rmi.connector.port` (by default the same as registry.port)
1262
1263 [[dyn_config]]
1264 == Dynamic Configuration
1265
1266 It is possible to change a subset of the configuration without requiring a server restart. In the
1267 HBase shell, the operations `update_config`, `update_all_config` and `update_rsgroup_config`
1268 will prompt a server, all servers or all servers in the RSGroup to reload configuration.
1269
1270 Only a subset of all configurations can currently be changed in the running server.
1271 Here are those configurations:
1272
1273 .Configurations support dynamically change
1274 [cols="1",options="header"]
1275 |===
1276 | Key
1277 | hbase.ipc.server.fallback-to-simple-auth-allowed
1278 | hbase.cleaner.scan.dir.concurrent.size
1279 | hbase.coprocessor.master.classes
1280 | hbase.coprocessor.region.classes
1281 | hbase.coprocessor.regionserver.classes
1282 | hbase.coprocessor.user.region.classes
1283 | hbase.regionserver.thread.compaction.large
1284 | hbase.regionserver.thread.compaction.small
1285 | hbase.regionserver.thread.split
1286 | hbase.regionserver.throughput.controller
1287 | hbase.regionserver.thread.hfilecleaner.throttle
1288 | hbase.regionserver.hfilecleaner.large.queue.size
1289 | hbase.regionserver.hfilecleaner.small.queue.size
1290 | hbase.regionserver.hfilecleaner.large.thread.count
1291 | hbase.regionserver.hfilecleaner.small.thread.count
1292 | hbase.regionserver.hfilecleaner.thread.timeout.msec
1293 | hbase.regionserver.hfilecleaner.thread.check.interval.msec
1294 | hbase.regionserver.flush.throughput.controller
1295 | hbase.hstore.compaction.max.size
1296 | hbase.hstore.compaction.max.size.offpeak
1297 | hbase.hstore.compaction.min.size
1298 | hbase.hstore.compaction.min
1299 | hbase.hstore.compaction.max
1300 | hbase.hstore.compaction.ratio
1301 | hbase.hstore.compaction.ratio.offpeak
1302 | hbase.regionserver.thread.compaction.throttle
1303 | hbase.hregion.majorcompaction
1304 | hbase.hregion.majorcompaction.jitter
1305 | hbase.hstore.min.locality.to.skip.major.compact
1306 | hbase.hstore.compaction.date.tiered.max.storefile.age.millis
1307 | hbase.hstore.compaction.date.tiered.incoming.window.min
1308 | hbase.hstore.compaction.date.tiered.window.policy.class
1309 | hbase.hstore.compaction.date.tiered.single.output.for.minor.compaction
1310 | hbase.hstore.compaction.date.tiered.window.factory.class
1311 | hbase.offpeak.start.hour
1312 | hbase.offpeak.end.hour
1313 | hbase.oldwals.cleaner.thread.size
1314 | hbase.oldwals.cleaner.thread.timeout.msec
1315 | hbase.oldwals.cleaner.thread.check.interval.msec
1316 | hbase.procedure.worker.keep.alive.time.msec
1317 | hbase.procedure.worker.add.stuck.percentage
1318 | hbase.procedure.worker.monitor.interval.msec
1319 | hbase.procedure.worker.stuck.threshold.msec
1320 | hbase.regions.slop
1321 | hbase.regions.overallSlop
1322 | hbase.balancer.tablesOnMaster
1323 | hbase.balancer.tablesOnMaster.systemTablesOnly
1324 | hbase.util.ip.to.rack.determiner
1325 | hbase.ipc.server.max.callqueue.length
1326 | hbase.ipc.server.priority.max.callqueue.length
1327 | hbase.ipc.server.callqueue.type
1328 | hbase.ipc.server.callqueue.codel.target.delay
1329 | hbase.ipc.server.callqueue.codel.interval
1330 | hbase.ipc.server.callqueue.codel.lifo.threshold
1331 | hbase.master.balancer.stochastic.maxSteps
1332 | hbase.master.balancer.stochastic.stepsPerRegion
1333 | hbase.master.balancer.stochastic.maxRunningTime
1334 | hbase.master.balancer.stochastic.runMaxSteps
1335 | hbase.master.balancer.stochastic.numRegionLoadsToRemember
1336 | hbase.master.loadbalance.bytable
1337 | hbase.master.balancer.stochastic.minCostNeedBalance
1338 | hbase.master.balancer.stochastic.localityCost
1339 | hbase.master.balancer.stochastic.rackLocalityCost
1340 | hbase.master.balancer.stochastic.readRequestCost
1341 | hbase.master.balancer.stochastic.writeRequestCost
1342 | hbase.master.balancer.stochastic.memstoreSizeCost
1343 | hbase.master.balancer.stochastic.storefileSizeCost
1344 | hbase.master.balancer.stochastic.regionReplicaHostCostKey
1345 | hbase.master.balancer.stochastic.regionReplicaRackCostKey
1346 | hbase.master.balancer.stochastic.regionCountCost
1347 | hbase.master.balancer.stochastic.primaryRegionCountCost
1348 | hbase.master.balancer.stochastic.moveCost
1349 | hbase.master.balancer.stochastic.moveCost.offpeak
1350 | hbase.master.balancer.stochastic.maxMovePercent
1351 | hbase.master.balancer.stochastic.tableSkewCost
1352 | hbase.master.regions.recovery.check.interval
1353 | hbase.regions.recovery.store.file.ref.count
1354 | hbase.rsgroup.fallback.enable
1355 |===
1356
1357 ifdef::backend-docbook[]
1358 [index]
1359 == Index
1360 // Generated automatically by the DocBook toolchain.
1361 endif::backend-docbook[]