   1 # RELEASENOTES
   2
   3 <!---
   4 # Licensed to the Apache Software Foundation (ASF) under one
   5 # or more contributor license agreements.  See the NOTICE file
   6 # distributed with this work for additional information
   7 # regarding copyright ownership.  The ASF licenses this file
   8 # to you under the Apache License, Version 2.0 (the
   9 # "License"); you may not use this file except in compliance
  10 # with the License.  You may obtain a copy of the License at
  11 #
  12 #     http://www.apache.org/licenses/LICENSE-2.0
  13 #
  14 # Unless required by applicable law or agreed to in writing, software
  15 # distributed under the License is distributed on an "AS IS" BASIS,
  16 # WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
  17 # See the License for the specific language governing permissions and
  18 # limitations under the License.
  19
  20 # Be careful doing manual edits in this file. Do not change format
  21 # of release header or remove the below marker. This file is generated.
  22 # DO NOT REMOVE THIS MARKER; FOR INTERPOLATING CHANGES!-->
  23 # HBASE  2.4.6 Release Notes
  24
  25 These release notes cover new developer and user-facing incompatibilities, important issues, features, and major improvements.
  26
  27
  28 ---
  29
  30 * [HBASE-26204](https://issues.apache.org/jira/browse/HBASE-26204) | *Major* | **VerifyReplication should obtain token for peerQuorumAddress too**
  31
  32 VerifyReplication now obtains tokens even when the peer quorum parameter is used, so VerifyReplication with a peer quorum can also be used against secure clusters.
  33
  34
  35 ---
  36
  37 * [HBASE-24652](https://issues.apache.org/jira/browse/HBASE-24652) | *Minor* | **master-status UI make date type fields sortable**
  38
  39 Makes RegionServer 'Start time' sortable in the Master UI
  40
  41
  42 ---
  43
  44 * [HBASE-26200](https://issues.apache.org/jira/browse/HBASE-26200) | *Major* | **Undo 'HBASE-25165 Change 'State time' in UI so sorts (#2508)' in favor of HBASE-24652**
  45
  46 Reverts showing the RegionServer 'Start time' in ISO-8601 format.
  47
  48
  49 ---
  50
  51 * [HBASE-6908](https://issues.apache.org/jira/browse/HBASE-6908) | *Major* | **Pluggable Call BlockingQueue for HBaseServer**
  52
  53 A fully qualified class name (FQCN) can now be passed in to load as the call queue implementation.
  54
  55 The standardized constructor arguments are the max queue length, the PriorityFunction, and the Configuration.
  56
  57 The PluggableBlockingQueue abstract class is provided to help guide the correct constructor signature.
  58
  59 Hard fails with PluggableRpcQueueNotFound if the class fails to load as a BlockingQueue\<CallRunner\>.
  60
  61 Upstreamed on behalf of HubSpot: we want to define our own custom RPC queue without necessarily having to upstream internal requirements and iterations.
  62
  63
  64 ---
  65
  66 * [HBASE-26196](https://issues.apache.org/jira/browse/HBASE-26196) | *Major* | **Support configuration override for remote cluster of HFileOutputFormat locality sensitive**
  67
  68 Allows arbitrary configuration overrides for the remote cluster in HFileOutputFormat2. This is useful when connecting to the remote cluster requires configuration that differs from the job's configuration, for instance non-secure vs. secure.
  69
  70
  71 ---
  72
  73 * [HBASE-26160](https://issues.apache.org/jira/browse/HBASE-26160) | *Minor* | **Configurable disallowlist for live editing of loglevels**
  74
  75 Adds a new hbase.ui.logLevels.readonly.loggers config which takes a comma-separated list of logger names. Similar to log4j configurations, the logger names can be prefixes or a full logger name. The log level of read only loggers cannot be changed via the logLevel UI or setlevel CLI. This is useful for securing sensitive loggers, such as the SecurityLogger used for audit logs.
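
The prefix matching described above can be illustrated with a minimal Python sketch. It assumes a plain string-prefix comparison against each comma-separated entry; the function name and the sample configuration value are hypothetical and are not taken from the HBase code.

```python
def is_read_only(logger_name: str, readonly_loggers: str) -> bool:
    """Return True if logger_name starts with any entry of the comma-separated list.

    Assumption: an entry matches when it is a plain string prefix of the logger name.
    """
    entries = [entry.strip() for entry in readonly_loggers.split(",") if entry.strip()]
    return any(logger_name.startswith(entry) for entry in entries)


# Hypothetical value for hbase.ui.logLevels.readonly.loggers
configured = "SecurityLogger, org.apache.hadoop.hbase.security"

# A logger whose name begins with a configured entry would be treated as read-only.
print(is_read_only("SecurityLogger.org.apache.hadoop.hbase.Server", configured))
```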
  76
  77
  78 ---
  79
  80 * [HBASE-26154](https://issues.apache.org/jira/browse/HBASE-26154) | *Minor* | **Provide exception metric for quota exceeded and throttling**
  81
  82 Adds "exceptions.quotaExceeded" and "exceptions.rpcThrottling" to HBase server and Thrift server metrics.
  83
  84
  85 ---
  86
  87 * [HBASE-26146](https://issues.apache.org/jira/browse/HBASE-26146) | *Minor* | **Allow custom opts for hbck in hbase bin**
  88
  89 Adds HBASE\_HBCK\_OPTS environment variable to bin/hbase for passing extra options to hbck/hbck2. Defaults to HBASE\_SERVER\_JAAS\_OPTS if specified, or HBASE\_REGIONSERVER\_OPTS.
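
The fallback order described above can be sketched as follows. This is an illustration of the precedence only, written in Python rather than the shell used by bin/hbase; the function name is hypothetical.

```python
def resolve_hbck_opts(env: dict) -> str:
    """Pick the options passed to hbck/hbck2, following the order in the note.

    Assumption: a variable counts as "specified" when it is set and non-empty.
    """
    for name in ("HBASE_HBCK_OPTS", "HBASE_SERVER_JAAS_OPTS", "HBASE_REGIONSERVER_OPTS"):
        value = env.get(name)
        if value:
            return value
    return ""


# An explicit HBASE_HBCK_OPTS takes priority over the fallbacks.
print(resolve_hbck_opts({"HBASE_HBCK_OPTS": "-Xmx2g", "HBASE_REGIONSERVER_OPTS": "-Xmx8g"}))
```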
  90
  91
  92
  93 # HBASE  2.4.5 Release Notes
  94
  95 These release notes cover new developer and user-facing incompatibilities, important issues, features, and major improvements.
  96
  97
  98 ---
  99
 100 * [HBASE-26088](https://issues.apache.org/jira/browse/HBASE-26088) | *Critical* | **conn.getBufferedMutator(tableName) leaks thread executors and other problems**
 101
 102 The API doc for Connection#getBufferedMutator(TableName) and Connection#getBufferedMutator(BufferedMutatorParams) stated that when the user doesn't pass a ThreadPool, we use the ThreadPool in the Connection. In reality, we were creating a new ThreadPool in such cases.
 103
 104 The code's behaviour is unchanged, but the Javadoc has been corrected, and a bug where this new pool was not closed when closing the BufferedMutator has been fixed.
 105
 106
 107 ---
 108
 109 * [HBASE-25986](https://issues.apache.org/jira/browse/HBASE-25986) | *Minor* | **Expose the NORMALIZARION\_ENABLED table descriptor through a property in hbase-site**
 110
 111 New config: hbase.table.normalization.enabled
 112
 113 Default value: false
 114
 115 Description: This config sets the default behaviour of the normalizer at the table level. To override it for a specific table, set NORMALIZATION\_ENABLED in the table descriptor, and that property will be honored. Note that the table-level property only takes effect if the normalizer is enabled at the cluster level using the "normalizer\_switch true" command.
 116
 117
 118 ---
 119
 120 * [HBASE-22923](https://issues.apache.org/jira/browse/HBASE-22923) | *Major* | **hbase:meta is assigned to localhost when we downgrade the hbase version**
 121
 122 Introduced new config: hbase.min.version.move.system.tables
 123
 124 When the operator uses this configuration option, any version between
 125 the current cluster version and the value of "hbase.min.version.move.system.tables"
 126 does not trigger any auto-region movement. Auto-region movement here
 127 refers to auto-migration of system table regions to newer server versions.
 128 It is assumed that the configured range of versions does not require special
 129 handling of moving system table regions to higher versioned RegionServer.
 130 This auto-migration is done by AssignmentManager#checkIfShouldMoveSystemRegionAsync().
 131 Example: Let's assume the cluster is on version 1.4.0 and we have
 132 set "hbase.min.version.move.system.tables" as "2.0.0". Now if we upgrade
 133 one RegionServer on 1.4.0 cluster to 1.6.0 (\< 2.0.0), then AssignmentManager will
 134 not move hbase:meta, hbase:namespace and other system table regions
 135 to newly brought up RegionServer 1.6.0 as part of auto-migration.
 136 However, if we upgrade one RegionServer on 1.4.0 cluster to 2.2.0 (\> 2.0.0),
 137 then AssignmentManager will move all system table regions to newly brought
 138 up RegionServer 2.2.0 as part of auto-migration done by
 139 AssignmentManager#checkIfShouldMoveSystemRegionAsync().
 140
 141 Overall, assuming we have system RSGroup where we keep HBase system tables, if we use
 142 config "hbase.min.version.move.system.tables" with value x.y.z then while upgrading cluster to
 143 version greater than or equal to x.y.z, the first RegionServer that we upgrade must
 144 belong to system RSGroup only.
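
The threshold in the example above can be sketched in a few lines of Python. This only models the comparison between a newly upgraded RegionServer's version and the configured value; the function names are hypothetical, plain numeric x.y.z versions are assumed, and treating a version equal to the configured value as triggering a move is an assumption based on the "greater than or equal to" wording above.

```python
def parse_version(version: str) -> tuple:
    """Turn 'x.y.z' into a tuple of ints so versions compare numerically."""
    return tuple(int(part) for part in version.split("."))


def should_move_system_regions(new_server_version: str, min_version_to_move: str) -> bool:
    """Return True if system table regions would be auto-migrated to the new server.

    min_version_to_move stands for the value of hbase.min.version.move.system.tables.
    """
    return parse_version(new_server_version) >= parse_version(min_version_to_move)


# The two cases from the example: a 1.4.0 cluster with the config set to "2.0.0".
print(should_move_system_regions("1.6.0", "2.0.0"))
print(should_move_system_regions("2.2.0", "2.0.0"))
```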
 145
 146
 147 ---
 148
 149 * [HBASE-25902](https://issues.apache.org/jira/browse/HBASE-25902) | *Critical* | **Add missing CFs in meta during HBase 1 to 2.3+ Upgrade**
 150
 151 While upgrading a cluster from 1.x to 2.3+ versions, after the active master is done setting its status as 'Initialized', it attempts to add the 'table' and 'repl\_barrier' CFs in meta. Once the CFs are added successfully, the master is aborted with PleaseRestartMasterException because it has missed certain initialization events (e.g. ClusterSchemaService is not initialized and tableStateManager fails to migrate table states from ZK to meta due to the missing CFs). Subsequent active master initialization is expected to be smooth.
 152 With multiple masters, when one of them becomes active for the first time after upgrading to HBase 2.3+, it is aborted after fixing the CFs in meta, and one of the backup masters takes over and becomes active soon after. Hence, the upgrade is expected to be smooth overall if backup masters are configured. If not, the operator is expected to restart the same master manually.
 153
 154
 155 ---
 156
 157 * [HBASE-25877](https://issues.apache.org/jira/browse/HBASE-25877) | *Major* | **Add access  check for compactionSwitch**
 158
 159 Calling RSRpcServices.compactionSwitch, i.e. Admin.compactionSwitch on the client side, now requires ADMIN permission.
 160 This is an incompatible change, but it also fixes a bug: we should not allow arbitrary users to disable compaction on a regionserver. The change is therefore applied to all active branches.
 161
 162
 163 ---
 164
 165 * [HBASE-25984](https://issues.apache.org/jira/browse/HBASE-25984) | *Critical* | **FSHLog WAL lockup with sync future reuse [RS deadlock]**
 166
 167 Fixes a WAL lockup caused by premature reuse of the sync futures by the WAL consumers. The lockup causes the WAL system to hang, blocking appends and syncs and thus preventing the RPC handlers from progressing. Without this fix, the only workaround is to force-abort the region server.
 168
 169
 170 ---
 171
 172 * [HBASE-25993](https://issues.apache.org/jira/browse/HBASE-25993) | *Major* | **Make excluded SSL cipher suites configurable for all Web UIs**
 173
 174 Adds the "ssl.server.exclude.cipher.list" configuration to exclude cipher suites for the HTTP server started by the InfoServer.
 175
 176
 177 ---
 178
 179 * [HBASE-25969](https://issues.apache.org/jira/browse/HBASE-25969) | *Major* | **Cleanup netty-all transitive includes**
 180
 181 Our produced artifacts include an (old) netty-all, pulled in transitively from hadoop. It is needed by MiniMRCluster, which is referenced from a few MR tests in hbase. This commit adds netty-all excludes everywhere except where tests would fail unless the transitive dependency is allowed through. TODO: move MR and/or MR tests out of hbase core.
 182
 183
 184
 185 # HBASE  2.4.4 Release Notes
 186
 187 These release notes cover new developer and user-facing incompatibilities, important issues, features, and major improvements.
 188
 189
 190 ---
 191
 192 * [HBASE-25963](https://issues.apache.org/jira/browse/HBASE-25963) | *Major* | **HBaseCluster should be marked as IA.Public**
 193
 194 Change HBaseCluster to IA.Public as its sub class MiniHBaseCluster is IA.Public.
 195
 196
 197
 198 # HBASE  2.4.3 Release Notes
 199
 200 These release notes cover new developer and user-facing incompatibilities, important issues, features, and major improvements.
 201
 202
 203 ---
 204
 205 * [HBASE-25766](https://issues.apache.org/jira/browse/HBASE-25766) | *Major* | **Introduce RegionSplitRestriction that restricts the pattern of the split point**
 206
 207 After HBASE-25766, we can specify a split restriction, "KeyPrefix" or "DelimitedKeyPrefix", to a table with the "hbase.regionserver.region.split\_restriction.type" property. The "KeyPrefix" split restriction groups rows by a prefix of the row-key. And the "DelimitedKeyPrefix" split restriction groups rows by a prefix of the row-key with a delimiter.
 208
 209 For example:
```
# Create a table with a "KeyPrefix" split restriction, where the prefix length is 2 bytes
hbase> create 'tbl1', 'fam', {CONFIGURATION => {'hbase.regionserver.region.split_restriction.type' => 'KeyPrefix', 'hbase.regionserver.region.split_restriction.prefix_length' => '2'}}

# Create a table with a "DelimitedKeyPrefix" split restriction, where the delimiter is a comma (,)
hbase> create 'tbl2', 'fam', {CONFIGURATION => {'hbase.regionserver.region.split_restriction.type' => 'DelimitedKeyPrefix', 'hbase.regionserver.region.split_restriction.delimiter' => ','}}
```
 217
 218 Instead of specifying a split restriction to a table directly, we can also set the properties in hbase-site.xml. In this case, the specified split restriction is applied for all the tables.
 219
 220 Note that the split restriction is also applied to a user-specified split point so that we don't allow users to break the restriction, which is different behavior from the existing KeyPrefixRegionSplitPolicy and DelimitedKeyPrefixRegionSplitPolicy.
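
To make the effect of the two restrictions concrete, here is a small Python sketch of how a candidate split point could be truncated so that rows sharing a prefix stay in the same region. It is an illustration under assumptions, not the HBase implementation: the function names are hypothetical, and returning the split point unchanged when no delimiter is present is an assumed behaviour.

```python
def key_prefix_split_point(split_point: bytes, prefix_length: int) -> bytes:
    """Truncate the split point to the row-key prefix ("KeyPrefix" style)."""
    return split_point[:prefix_length]


def delimited_key_prefix_split_point(split_point: bytes, delimiter: bytes) -> bytes:
    """Truncate the split point at the first delimiter ("DelimitedKeyPrefix" style)."""
    index = split_point.find(delimiter)
    if index < 0:
        # Assumption: without a delimiter the split point is left as it is.
        return split_point
    return split_point[:index]


# With a 2-byte prefix, every row starting with b"ab" lands on the same side of the split.
print(key_prefix_split_point(b"ab1234", 2))
print(delimited_key_prefix_split_point(b"user1,2021-08-01", b","))
```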
 221
 222
 223 ---
 224
 225 * [HBASE-25775](https://issues.apache.org/jira/browse/HBASE-25775) | *Major* | **Use a special balancer to deal with maintenance mode**
 226
 227 Introduced a MaintenanceLoadBalancer to be used only under maintenance mode. Typically you should not use it as your balancer implementation.
 228
 229
 230 ---
 231
 232 * [HBASE-25767](https://issues.apache.org/jira/browse/HBASE-25767) | *Major* | **CandidateGenerator.getRandomIterationOrder is too slow on large cluster**
 233
 234 In the implementation classes of CandidateGenerator, we now randomly select a start point and then iterate sequentially. The old approach created a big array holding all the integers in [0, num\_regions\_in\_cluster), shuffled the array, and then iterated over it.
 235 The new implementation is 'random' enough because we select only one candidate each time. The problem with the old implementation was that it created an array every time we wanted a candidate: with tens of thousands of regions, it allocated an array of tens of thousands of elements each time, which caused heavy GC pressure and slowed down balancer execution.
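
The difference between the two strategies can be sketched in Python. The generator below picks a random start index and walks sequentially without allocating an array of all indices. Wrapping around to cover every index is an assumption made for the illustration, and the function name is hypothetical.

```python
import random


def iteration_order(num_items: int, rng=random):
    """Yield each index in [0, num_items) once, starting from a random point.

    Unlike shuffling a full list of indices, this needs no per-call allocation.
    """
    if num_items <= 0:
        return
    start = rng.randrange(num_items)
    for offset in range(num_items):
        # Assumption: wrap around so that every index is still visited.
        yield (start + offset) % num_items


# Usually only the first candidate is consumed, so the rest is never materialised.
first_candidate = next(iteration_order(50000))
print(0 <= first_candidate < 50000)
```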
 236
 237
 238 ---
 239
 240 * [HBASE-25734](https://issues.apache.org/jira/browse/HBASE-25734) | *Minor* | **Backport HBASE-24305 to branch-2.4**
 241
 242 The following method was added to ServerName
 243
 244 - #valueOf(Address, long)
 245
 246
 247 ---
 248
 249 * [HBASE-25199](https://issues.apache.org/jira/browse/HBASE-25199) | *Minor* | **Remove HStore#getStoreHomedir**
 250
 251 Moved the following methods from HStore to HRegionFileSystem
 252
 253 - #getStoreHomedir(Path, RegionInfo, byte[])
 254 - #getStoreHomedir(Path, String, byte[])
 255
 256
 257 ---
 258
 259 * [HBASE-25685](https://issues.apache.org/jira/browse/HBASE-25685) | *Major* | **asyncprofiler2.0 no longer supports svg; wants html**
 260
 261 With asyncprofiler 1.x on hbase-2.3.x or hbase-2.4.x, everything works as before. With asyncprofiler 2.x on hbase-2.3.x or hbase-2.4.x, add '?output=html' to get flamegraphs from the profiler.
 262
 263 On hbase-2.5+ with asyncprofiler 2.x, everything works. With asyncprofiler 1.x on hbase-2.5+, you may have to add '?output=svg' to the query.
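
The four combinations above can be summarised as a small decision function. This Python sketch only encodes the table from the note; the function name and the tuple representation of the HBase version are hypothetical.

```python
def profiler_output_query(hbase_version: tuple, profiler_major: int) -> str:
    """Return the query suffix suggested by the note, or '' if none is needed.

    hbase_version is (major, minor), e.g. (2, 4); profiler_major is 1 or 2.
    """
    if hbase_version < (2, 3):
        raise ValueError("the release note only covers hbase-2.3.x and later")
    if hbase_version < (2, 5):
        # hbase-2.3.x / 2.4.x: only asyncprofiler 2.x needs an explicit format.
        return "?output=html" if profiler_major >= 2 else ""
    # hbase-2.5+: asyncprofiler 1.x may need svg to be requested explicitly.
    return "?output=svg" if profiler_major == 1 else ""


print(profiler_output_query((2, 4), 2))
```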
 264
 265
 266 ---
 267
 268 * [HBASE-25518](https://issues.apache.org/jira/browse/HBASE-25518) | *Major* | **Support separate child regions to different region servers**
 269
 270 Adds a config key to enable or disable automatically placing child regions on different region servers during a region split. One child is kept on the server hosting the parent region, and the other child is assigned to a random server.
 271
 272 hbase.master.auto.separate.child.regions.after.split.enabled
 273
 274 Default setting is false/off.
 275
 276
 277 ---
 278
 279 * [HBASE-25374](https://issues.apache.org/jira/browse/HBASE-25374) | *Minor* | **Make REST Client connection and socket time out configurable**
 280
 281 Adds configuration parameters to set the REST client connection and socket timeouts:
 282
 283 "hbase.rest.client.conn.timeout" Default is 2 \* 1000
 284
 285 "hbase.rest.client.socket.timeout" Default is 30 \* 1000
 286
 287
 288 ---
 289
 290 * [HBASE-25587](https://issues.apache.org/jira/browse/HBASE-25587) | *Major* | **[hbck2] Schedule SCP for all unknown servers**
 291
 292 Adds scheduleSCPsForUnknownServers to Hbck Service.
 293
 294
 295 ---
 296
 297 * [HBASE-25636](https://issues.apache.org/jira/browse/HBASE-25636) | *Minor* | **Expose HBCK report as metrics**
 298
 299 Exposes HBCK report results as metrics, including: "orphanRegionsOnRS", "orphanRegionsOnFS", "inconsistentRegions", "holes", "overlaps", "unknownServerRegions" and "emptyRegionInfoRegions".
 300
 301
 302 ---
 303
 304 * [HBASE-24305](https://issues.apache.org/jira/browse/HBASE-24305) | *Minor* | **Handle deprecations in ServerName**
 305
 306 The following methods were removed or made private from ServerName (due to HBASE-17624):
 307
 308 - getHostNameMinusDomain(String): Was made private without a replacement.
 309 - parseHostname(String): Use #valueOf(String) instead.
 310 - parsePort(String): Use #valueOf(String) instead.
 311 - parseStartcode(String): Use #valueOf(String) instead.
 312 - getServerName(String, int, long): Was made private. Use #valueOf(String, int, long) instead.
 313 - getServerName(String, long): Use #valueOf(String, long) instead.
 314 - getHostAndPort(): Use #getAddress() instead.
 315 - getServerStartcodeFromServerName(String): Use an instance of ServerName to pull out the start code.
 316 - getServerNameLessStartCode(String): Use #getAddress() instead.
 317
 318
 319
 320 # HBASE  2.4.2 Release Notes
 321
 322 These release notes cover new developer and user-facing incompatibilities, important issues, features, and major improvements.
 323
 324
 325 ---
 326
 327 * [HBASE-25492](https://issues.apache.org/jira/browse/HBASE-25492) | *Major* | **Create table with rsgroup info in branch-2**
 328
 329 HBASE-25492 added a new interface in TableDescriptor which allows user to define RSGroup name while creating or modifying a table.
 330
 331
 332 ---
 333
 334 * [HBASE-25460](https://issues.apache.org/jira/browse/HBASE-25460) | *Major* | **Expose drainingServers as cluster metric**
 335
 336 Exposed new jmx metrics: "draininigRegionServers" and "numDrainingRegionServers" to provide "comma separated names for regionservers that are put in draining mode" and "num of such regionservers" respectively.
 337
 338
 339 ---
 340
 341 * [HBASE-25615](https://issues.apache.org/jira/browse/HBASE-25615) | *Major* | **Upgrade java version in pre commit docker file**
 342
 343 jdk8u232-b09 -\> jdk8u282-b08
 344 jdk-11.0.6\_10 -\> jdk-11.0.10\_9
 345
 346
 347 ---
 348
 349 * [HBASE-23887](https://issues.apache.org/jira/browse/HBASE-23887) | *Major* | **New L1 cache : AdaptiveLRU**
 350
 351 Introduced new L1 cache: AdaptiveLRU. This is supposed to provide better performance than default LRU cache.
 352 Set config key "hfile.block.cache.policy" to "AdaptiveLRU" in hbase-site in order to start using this new cache.
 353
 354
 355
 356 # HBASE  2.4.1 Release Notes
 357
 358 These release notes cover new developer and user-facing incompatibilities, important issues, features, and major improvements.
 359
 360
 361 ---
 362
 363 * [HBASE-25449](https://issues.apache.org/jira/browse/HBASE-25449) | *Major* | **'dfs.client.read.shortcircuit' should not be set in hbase-default.xml**
 364
 365 The presence of HDFS short-circuit read configuration properties in hbase-default.xml inadvertently causes short-circuit reads to not happen inside of RegionServers, despite short-circuit reads being enabled in hdfs-site.xml.
 366
 367
 368 ---
 369
 370 * [HBASE-25333](https://issues.apache.org/jira/browse/HBASE-25333) | *Major* | **Add maven enforcer rule to ban VisibleForTesting imports**
 371
 372 Bans imports of guava's VisibleForTesting, which means you should not use this annotation in HBase any more.
 373 For IA.Public and IA.LimitedPrivate classes, you should typically not expose any test-related fields/methods, and if you want to hide something, use IA.Private on the specific fields/methods.
 374 For IA.Private classes, if you want to expose something only for tests, use the RestrictedApi annotation from error prone, which can cause a compilation error if someone breaks the rule in the future.
 375
 376
 377 ---
 378
 379 * [HBASE-25441](https://issues.apache.org/jira/browse/HBASE-25441) | *Critical* | **add security check for some APIs in RSRpcServices**
 380
 381 RSRpcServices APIs that can now be accessed only with Admin rights:
 382 - stopServer
 383 - updateFavoredNodes
 384 - updateConfiguration
 385 - clearRegionBlockCache
 386 - clearSlowLogsResponses
 387
 388
 389 ---
 390
 391 * [HBASE-25432](https://issues.apache.org/jira/browse/HBASE-25432) | *Blocker* | **we should add security checks for setTableStateInMeta and fixMeta**
 392
 393 setTableStateInMeta and fixMeta can be accessed only through Admin rights
 394
 395
 396 ---
 397
 398 * [HBASE-25318](https://issues.apache.org/jira/browse/HBASE-25318) | *Minor* | **Configure where IntegrationTestImportTsv generates HFiles**
 399
 400 Added IntegrationTestImportTsv.generatedHFileFolder configuration property to override the default location in IntegrationTestImportTsv. Useful for running the integration test when HDFS Transparent Encryption is enabled.
 401
 402
 403 ---
 404
 405 * [HBASE-25456](https://issues.apache.org/jira/browse/HBASE-25456) | *Critical* | **setRegionStateInMeta need security check**
 406
 407 setRegionStateInMeta can be accessed only through Admin rights
 408
 409
 410
 411 # HBASE  2.4.0 Release Notes
 412
 413 These release notes cover new developer and user-facing incompatibilities, important issues, features, and major improvements.
 414
 415
 416 ---
 417
 418 * [HBASE-25127](https://issues.apache.org/jira/browse/HBASE-25127) | *Major* | **Enhance PerformanceEvaluation to profile meta replica performance.**
 419
 420 Three new commands are added to PE:
 421
 422 metaWrite, metaRandomRead and cleanMeta.
 423
 424 Usage example:
 425 hbase pe  --rows=100000 metaWrite  1
 426 hbase pe  --nomapreduce --rows=100000 metaRandomRead  32
 427 hbase pe  --rows=100000 cleanMeta 1
 428
 429 metaWrite and cleanMeta should be run with only 1 thread and the same number of rows so all the rows inserted will be cleaned up properly.
 430
 431 metaRandomRead can be run with multiple threads. The rows option should be set within the range of rows inserted by metaWrite.
 432
 433
 434 ---
 435
 436 * [HBASE-25237](https://issues.apache.org/jira/browse/HBASE-25237) | *Major* | **'hbase master stop' shuts down the cluster, not the master only**
 437
 438 \`hbase master stop\` should shutdown only master by default.
 439 1. Help added to \`hbase master stop\`:
 440 To stop cluster, use \`stop-hbase.sh\` or \`hbase master stop --shutDownCluster\`
 441
 442 2. Help added to \`stop-hbase.sh\`:
 443 stop-hbase.sh can only be used for shutting down entire cluster. To shut down (HMaster\|HRegionServer) use hbase-daemon.sh stop (master\|regionserver)
 444
 445
 446 ---
 447
 448 * [HBASE-25242](https://issues.apache.org/jira/browse/HBASE-25242) | *Critical* | **Add Increment/Append support to RowMutations**
 449
 450 After HBASE-25242, we can add Increment/Append operations to RowMutations and perform those operations atomically in a single row.
 451 HBASE-25242 includes an API change where the mutateRow() API returns a Result object to get the result of the Increment/Append operations.
 452
 453
 454 ---
 455
 456 * [HBASE-25263](https://issues.apache.org/jira/browse/HBASE-25263) | *Major* | **Change encryption key generation algorithm used in the HBase shell**
 457
 458 Since the backward-compatible change introduced in HBASE-25263, the HBase shell uses the more secure PBKDF2WithHmacSHA384 key generation algorithm (instead of PBKDF2WithHmacSHA1) to generate a secret key for HFile / WalFile encryption when the user defines a string encryption key.
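
For readers unfamiliar with PBKDF2, the following Python sketch shows what deriving a key from a string with HMAC-SHA384 looks like, using the standard library's `hashlib.pbkdf2_hmac`. The salt, iteration count, and key length here are placeholders chosen for the illustration; they are assumptions and not the values used by the HBase shell.

```python
import hashlib


def derive_key(passphrase: str, salt: bytes, iterations: int = 10000, key_bytes: int = 16) -> bytes:
    """Derive a fixed-length secret key from a passphrase with PBKDF2-HMAC-SHA384.

    All parameter defaults are hypothetical and only serve the example.
    """
    return hashlib.pbkdf2_hmac(
        "sha384", passphrase.encode("utf-8"), salt, iterations, dklen=key_bytes
    )


# The same passphrase and parameters always yield the same key.
key = derive_key("my-string-key", salt=b"\x00" * 16)
print(len(key))
```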
 459
 460
 461 ---
 462
 463 * [HBASE-24268](https://issues.apache.org/jira/browse/HBASE-24268) | *Minor* | **REST and Thrift server do not handle the "doAs" parameter case insensitively**
 464
 465 This change allows the REST and Thrift servers to handle the "doAs" parameter case-insensitively, which is deemed as correct per the "specification" provided by the Hadoop community.
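
A case-insensitive lookup of this kind can be sketched as follows. This is a generic Python illustration, not the REST or Thrift server code; the function name is hypothetical.

```python
def get_do_as_user(query_params: dict):
    """Return the value of the doAs parameter regardless of how its name is cased."""
    for name, value in query_params.items():
        if name.lower() == "doas":
            return value
    return None


# "doAs", "doas" and "DOAS" are all treated as the same parameter.
print(get_do_as_user({"DOAS": "alice"}))
```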
 466
 467
 468 ---
 469
 470 * [HBASE-25278](https://issues.apache.org/jira/browse/HBASE-25278) | *Minor* | **Add option to toggle CACHE\_BLOCKS in count.rb**
 471
 472 A new option, CACHE\_BLOCKS, was added to the \`count\` shell command, which will force the data for a table to be loaded into the block cache. By default, the \`count\` command will not cache any blocks. This option can serve as a means to force a table's data to be loaded into the block cache on demand. See the help message on the count shell command for usage details.
 473
 474
 475 ---
 476
 477 * [HBASE-18070](https://issues.apache.org/jira/browse/HBASE-18070) | *Critical* | **Enable memstore replication for meta replica**
 478
 479 "Async WAL Replication" [1] was added by HBASE-11183 "Timeline Consistent region replicas - Phase 2 design" but only for user-space tables. This feature adds "Async WAL Replication" for the hbase:meta table.  It also adds a client 'LoadBalance' mode that has reads go to replicas first and to the primary only on fail so as to shed read load from the primary to alleviate \*hotspotting\* on the hbase:meta Region.
 480
 481 Configuration is as it was for the user-space 'Async WAL Replication'. See [2] and [3] for details on how to enable.
 482
 483 1. http://hbase.apache.org/book.html#async.wal.replication
 484 2. http://hbase.apache.org/book.html#async.wal.replication.meta
 485 3. http://hbase.apache.org/book.html#\_async\_wal\_replication\_for\_meta\_table\_as\_of\_hbase\_2\_4\_0
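
The read path of the 'LoadBalance' mode described above (replicas first, primary only on failure) can be sketched in Python. This is a simplified illustration: picking a single random replica, the callables standing in for regions, and the function name are all assumptions.

```python
import random


def read_meta(key, primary, replicas, rng=random):
    """Try one randomly chosen replica first; fall back to the primary on failure.

    primary and each replica are callables that take a key and return a result.
    """
    if replicas:
        replica = rng.choice(replicas)
        try:
            return replica(key)
        except Exception:
            pass  # the replica read failed, so fall through to the primary
    return primary(key)


def primary_region(key):
    return ("primary", key)


def healthy_replica(key):
    return ("replica", key)


print(read_meta("row1", primary_region, [healthy_replica]))
```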
 486
 487
 488 ---
 489
 490 * [HBASE-25126](https://issues.apache.org/jira/browse/HBASE-25126) | *Major* | **Add load balance logic in hbase-client to distribute read load over meta replica regions.**
 491
 492 See parent issue, HBASE-18070, release notes for how to enable.
 493
 494
 495 ---
 496
 497 * [HBASE-25026](https://issues.apache.org/jira/browse/HBASE-25026) | *Minor* | **Create a metric to track full region scans RPCs**
 498
 499 Adds a new metric where we collect the number of full region scan requests at the RPC layer. This will be collected under "name" : "Hadoop:service=HBase,name=RegionServer,sub=Server"
 500
 501
 502 ---
 503
 504 * [HBASE-25253](https://issues.apache.org/jira/browse/HBASE-25253) | *Major* | **Deprecated master carrys regions related methods and configs**
 505
 506 As of 2.4.0, all methods (in LoadBalancer, BaseLoadBalancer, ZNodeClearer) and configs (hbase.balancer.tablesOnMaster, hbase.balancer.tablesOnMaster.systemTablesOnly) related to the master carrying regions are deprecated. They will be removed in 3.0.0.
 507
 508
 509 ---
 510
 511 * [HBASE-20598](https://issues.apache.org/jira/browse/HBASE-20598) | *Major* | **Upgrade to JRuby 9.2**
 512
 513 <!-- markdown -->
 514 The HBase shell now relies on JRuby 9.2. This is a new major version change for JRuby. The most significant change is Ruby compatibility changed from Ruby 2.3 to Ruby 2.5. For more detailed changes please see [the JRuby release announcement for the start of the 9.2 series](https://www.jruby.org/2018/05/24/jruby-9-2-0-0.html) as well as the [general release announcement page for updates since that version](https://www.jruby.org/news).
 515
 516 The runtime dependency versions present on the server side classpath for the Joni (now 2.1.31) and JCodings (now 1.0.55) libraries have also been updated to match those found in the JRuby version shipped with HBase. These version changes are maintenance releases and should be backwards compatible when updated in tandem.
 517
 518
 519 ---
 520
 521 * [HBASE-25181](https://issues.apache.org/jira/browse/HBASE-25181) | *Major* | **Add options for disabling column family encryption and choosing hash algorithm for wrapped encryption keys.**
 522
 523 <!-- markdown -->
 524 This change adds options for disabling column family encryption and choosing hash algorithm for wrapped encryption keys. Changes are done such that defaults will keep the same behavior prior to this issue.
 525
 526 Prior to this change, HBase always used the MD5 hash algorithm to store a hash for encryption keys. This hash is needed to verify the secret key of the subject (e.g. making sure that the same secret key is used during encrypted HFile read and write). The MD5 algorithm is considered weak and cannot be used in some (e.g. FIPS compliant) clusters. Having a configurable hash enables us to use newer and more secure hash algorithms like SHA-384 or SHA-512 (which are FIPS compliant).
 527
 528 The hash is set via the configuration option `hbase.crypto.key.hash.algorithm`. It should be set to a JDK `MessageDigest` algorithm like "MD5", "SHA-256" or "SHA-384". The default is "MD5" for backward compatibility.
 529
 530 Alternatively, clusters which rely on an encryption at rest mechanism outside of HBase (e.g. those offered by HDFS) and wish to ensure HBase's encryption at rest system is inactive can set `hbase.crypto.enabled` to `false`.
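
The role of the key hash can be illustrated with a short Python sketch: a digest of the key is stored alongside the data, and a reader recomputes it to check that it holds the same key. The function names and the mapping from JDK algorithm names to `hashlib` names are assumptions made for the example; this is not the HBase code.

```python
import hashlib
import hmac

# JDK MessageDigest names mapped to the names hashlib expects.
JDK_TO_HASHLIB = {"MD5": "md5", "SHA-256": "sha256", "SHA-384": "sha384", "SHA-512": "sha512"}


def key_hash(key_bytes: bytes, algorithm: str = "MD5") -> bytes:
    """Hash an encryption key with the configured algorithm (default mirrors "MD5")."""
    return hashlib.new(JDK_TO_HASHLIB[algorithm], key_bytes).digest()


def verify_key(key_bytes: bytes, stored_hash: bytes, algorithm: str = "MD5") -> bool:
    """Check that key_bytes matches the hash recorded when the data was written."""
    return hmac.compare_digest(key_hash(key_bytes, algorithm), stored_hash)


stored = key_hash(b"secret-key", "SHA-384")
print(verify_key(b"secret-key", stored, "SHA-384"))
```

Note that writer and reader must agree on the algorithm: a hash stored with one algorithm will not verify under another.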
 531
 532
 533 ---
 534
 535 * [HBASE-25238](https://issues.apache.org/jira/browse/HBASE-25238) | *Critical* | **Upgrading HBase from 2.2.0 to 2.3.x fails because of “Message missing required fields: state”**
 536
 537 Fixes master procedure store migration issues going from 2.0.x to 2.2.x and/or 2.3.x. Also fixes failed heartbeat parse during rolling upgrade from 2.0.x to 2.3.x.
 538
 539
 540 ---
 541
 542 * [HBASE-25234](https://issues.apache.org/jira/browse/HBASE-25234) | *Major* | **[Upgrade]Incompatibility in reading RS report from 2.1 RS when Master is upgraded to a version containing HBASE-21406**
 543
 544 Fixes auto-migration of the master procedure store so that it works again going from 2.0.x =\> 2.2+. Also makes heartbeats work during a rolling upgrade from 2.0.x =\> 2.3+.
 545
 546
 547 ---
 548
 549 * [HBASE-25212](https://issues.apache.org/jira/browse/HBASE-25212) | *Major* | **Optionally abort requests in progress after deciding a region should close**
 550
 551 If hbase.regionserver.close.wait.abort is set to true, interrupt RPC handler threads holding the region close lock.
 552
 553 Until requests in progress can be aborted, wait on the region close lock for a configurable interval (specified by hbase.regionserver.close.wait.time.ms, default 60000 (1 minute)). If we have failed to acquire the close lock after this interval elapses, if allowed (also specified by hbase.regionserver.close.wait.abort), abort the regionserver.
 554
 555 We will attempt to interrupt any running handlers every hbase.regionserver.close.wait.interval.ms (default 10000 (10 seconds)) until either the close lock is acquired or we reach the maximum wait time.
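
The interplay of the three settings can be sketched as a wait loop. This Python illustration uses a `threading.Lock` as a stand-in for the region close lock and a callback as a stand-in for interrupting handlers; the function name is hypothetical and the defaults simply mirror the values quoted above.

```python
import threading
import time


def wait_for_close_lock(lock, interrupt_handlers, wait_time_ms=60000, interval_ms=10000):
    """Try to take the close lock, interrupting handlers after each failed interval.

    Returns True once the lock is acquired, or False when wait_time_ms has elapsed.
    A False result is the point at which the caller would abort the regionserver.
    """
    deadline = time.monotonic() + wait_time_ms / 1000.0
    while True:
        remaining = deadline - time.monotonic()
        if remaining <= 0:
            return False
        # Wait at most one interval (or whatever time is left) for the lock.
        if lock.acquire(timeout=min(interval_ms / 1000.0, remaining)):
            return True
        interrupt_handlers()


close_lock = threading.Lock()
print(wait_for_close_lock(close_lock, lambda: None, wait_time_ms=50, interval_ms=10))
```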
 556
 557
 558 ---
 559
 560 * [HBASE-25167](https://issues.apache.org/jira/browse/HBASE-25167) | *Major* | **Normalizer support for hot config reloading**
 561
 562 <!-- markdown -->
 563 This patch adds [dynamic configuration](https://hbase.apache.org/book.html#dyn_config) support for the following configuration keys related to the normalizer:
 564 * hbase.normalizer.throughput.max_bytes_per_sec
 565 * hbase.normalizer.split.enabled
 566 * hbase.normalizer.merge.enabled
 567 * hbase.normalizer.min.region.count
 568 * hbase.normalizer.merge.min_region_age.days
 569 * hbase.normalizer.merge.min_region_size.mb
 570
 571
 572 ---
 573
 574 * [HBASE-25224](https://issues.apache.org/jira/browse/HBASE-25224) | *Major* | **Maximize sleep for checking meta and namespace regions availability**
 575
 576 Changed the max sleep time during meta and namespace regions availability check to be 60 sec. Previously there was no such cap
 577
 578
 579 ---
 580
 581 * [HBASE-24628](https://issues.apache.org/jira/browse/HBASE-24628) | *Major* | **Region normalizer now respects a rate limit**
 582
 583 <!-- markdown -->
Introduces a new configuration, `hbase.normalizer.throughput.max_bytes_per_sec`, for specifying a limit on the throughput of actions executed by the normalizer. Note that while this configuration value is in bytes, the minimum honored value is `1,000,000`, or `1m`. Supports values configured using the human-readable suffixes honored by [`Configuration.getLongBytes`](https://hadoop.apache.org/docs/current/api/org/apache/hadoop/conf/Configuration.html#getLongBytes-java.lang.String-long-).
 585
 586
 587 ---
 588
 589 * [HBASE-14067](https://issues.apache.org/jira/browse/HBASE-14067) | *Major* | **bundle ruby files for hbase shell into a jar.**
 590
 591 <!-- markdown -->
 592 The `hbase-shell` artifact now contains the ruby files that implement the hbase shell. There should be no downstream impact for users of the shell that rely on the `hbase shell` command.
 593
 594 Folks that wish to include the HBase ruby classes defined for the shell in their own JRuby scripts should add the `hbase-shell.jar` file to their classpath rather than add `${HBASE_HOME}/lib/ruby` to their load paths.
 595
 596
 597 ---
 598
 599 * [HBASE-24875](https://issues.apache.org/jira/browse/HBASE-24875) | *Major* | **Remove the force param for unassign since it dose not take effect any more**
 600
 601 <!-- markdown -->
 602 The "force" flag to various unassign commands (java api, shell, etc) has been ignored since HBase 2. As of this change the methods that take it are now deprecated. Downstream users should stop passing/using this flag.
 603
 604 The Admin and AsyncAdmin Java APIs will have the deprecated version of the unassign method with a force flag removed in HBase 4. Callers can safely continue to use the deprecated API until then; the internal implementation just calls the new method.
 605
 606 The MasterObserver coprocessor API deprecates the `preUnassign` and `postUnassign` methods that include the force parameter and replaces them with versions that omit this parameter. The deprecated methods will be removed from the API in HBase 3. Until then downstream coprocessor implementations can safely continue to *just* implement the deprecated method if they wish; the replacement methods provide a default implementation that calls the deprecated method with force set to `false`.
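
The delegation from the replacement method to the deprecated one can be sketched with a simplified stand-in interface. The interface, class names, and the `String` region argument below are hypothetical; the real `MasterObserver` methods take different parameters. The point is only that an implementation overriding just the deprecated hook still gets called through the new one, with force set to `false`.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical, simplified stand-in for the MasterObserver unassign hooks. */
interface UnassignObserverSketch {
  /** Old hook that still carries the ignored force flag. */
  @Deprecated
  default void preUnassign(String regionName, boolean force) {
    // no-op by default
  }

  /** Replacement hook; by default it forwards to the deprecated one with force=false. */
  default void preUnassign(String regionName) {
    preUnassign(regionName, false);
  }
}

/** A downstream coprocessor that only implements the deprecated hook. */
class LegacyObserverSketch implements UnassignObserverSketch {
  final List<String> seen = new ArrayList<>();

  @Override
  @Deprecated
  public void preUnassign(String regionName, boolean force) {
    seen.add(regionName + ":" + force);
  }
}
```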
 607
 608
 609 ---
 610
 611 * [HBASE-25099](https://issues.apache.org/jira/browse/HBASE-25099) | *Major* | **Change meta replica count by altering meta table descriptor**
 612
Now you can change the region replication config for the meta table by altering the meta table.
The old "hbase.meta.replica.count" is deprecated and will be removed in 4.0.0. If it is set, we still honor it: when the master restarts and finds that the value of 'hbase.meta.replica.count' differs from the region replication config of the meta table, it schedules an alter table operation to change the region replication config to the value you configured for 'hbase.meta.replica.count'.
 615
 616
 617 ---
 618
 619 * [HBASE-23834](https://issues.apache.org/jira/browse/HBASE-23834) | *Major* | **HBase fails to run on Hadoop 3.3.0/3.2.2/3.1.4 due to jetty version mismatch**
 620
 621 Use shaded json and jersey in HBase.
 622 Ban the imports of unshaded json and jersey in code.
 623
 624
 625 ---
 626
 627 * [HBASE-25163](https://issues.apache.org/jira/browse/HBASE-25163) | *Major* | **Increase the timeout value for nightly jobs**
 628
Increase the timeout value for nightly jobs to 16 hours. The new build machines are dedicated to the hbase project, so we are allowed to use them all the time.
 630
 631
 632 ---
 633
 634 * [HBASE-22976](https://issues.apache.org/jira/browse/HBASE-22976) | *Major* | **[HBCK2] Add RecoveredEditsPlayer**
 635
 636 WALPlayer can replay the content of recovered.edits directories.
 637
A side effect is that the WAL filename timestamp is now factored in when setting start/end times for WALInputFormat; i.e. wal.start.time and wal.end.time values on a job context. Previously we looked at wal.end.time only. Now we consider wal.start.time too. If a file has a name outside of wal.start.time\<-\>wal.end.time, it'll be by-passed. This change in behavior makes it easier for operators crafting timestamp filters when processing WALs.
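
The filename-based filter can be sketched as below. This is not the WALInputFormat code: the class and method names are hypothetical, and the sketch assumes the timestamp is the last dot-separated numeric component of the WAL file name and that names without such a component are not skipped.

```java
/** Illustrative only; the parsing rule and the handling of unparsable names are assumptions. */
class WalNameTimeFilterSketch {

  /** Returns the trailing numeric component of the name, or -1 if there is none. */
  static long timestampOf(String walName) {
    String tail = walName.substring(walName.lastIndexOf('.') + 1);
    try {
      return Long.parseLong(tail);
    } catch (NumberFormatException e) {
      return -1L;
    }
  }

  /** Accepts a file only when its name timestamp falls inside [startTime, endTime]. */
  static boolean accept(String walName, long startTime, long endTime) {
    long ts = timestampOf(walName);
    if (ts < 0) {
      return true; // assumption: a file we cannot date is read rather than by-passed
    }
    return ts >= startTime && ts <= endTime;
  }
}
```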
 639
 640
 641 ---
 642
 643 * [HBASE-25165](https://issues.apache.org/jira/browse/HBASE-25165) | *Minor* | **Change 'State time' in UI so sorts**
 644
 645 Start time on the Master UI is now displayed using ISO8601 format instead of java Date#toString().
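
A fixed-width ISO8601 rendering sorts correctly as plain text, which Date#toString() output does not. The sketch below shows this with a seconds-resolution UTC pattern; the exact pattern used by the Master UI is not stated in this note, so the formatter here is an assumption.

```java
import java.time.Instant;
import java.time.ZoneOffset;
import java.time.format.DateTimeFormatter;
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

/** Illustrative only; the pattern is an assumption, not the one used by the UI. */
class Iso8601SortSketch {
  private static final DateTimeFormatter ISO_SECONDS =
      DateTimeFormatter.ofPattern("yyyy-MM-dd'T'HH:mm:ss'Z'").withZone(ZoneOffset.UTC);

  static String format(long epochMillis) {
    return ISO_SECONDS.format(Instant.ofEpochMilli(epochMillis));
  }

  /** Formats first, then sorts the resulting strings lexicographically. */
  static List<String> formatAndSortAsText(List<Long> epochMillis) {
    List<String> out = new ArrayList<>();
    for (long t : epochMillis) {
      out.add(format(t));
    }
    Collections.sort(out);
    return out;
  }
}
```

Because every field has a fixed width and the fields run from most to least significant, the text order matches the chronological order.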
 646
 647
 648 ---
 649
 650 * [HBASE-25124](https://issues.apache.org/jira/browse/HBASE-25124) | *Major* | **Support changing region replica count without disabling table**
 651
 652 Now you do not need to disable a table before changing its 'region replication' property.
 653 If you are decreasing the replica count, the excess region replicas will be closed before reopening other replicas.
 654 If you are increasing the replica count, the new region replicas will be opened after reopening the existing replicas.
 655
 656
 657 ---
 658
 659 * [HBASE-25154](https://issues.apache.org/jira/browse/HBASE-25154) | *Major* | **Set java.io.tmpdir to project build directory to avoid writing std\*deferred files to /tmp**
 660
 661 Change the java.io.tmpdir to project.build.directory in surefire-maven-plugin, to avoid writing std\*deferred files to /tmp which may blow up the /tmp disk on our jenkins build node.
 662
 663
 664 ---
 665
 666 * [HBASE-25055](https://issues.apache.org/jira/browse/HBASE-25055) | *Major* | **Add ReplicationSource for meta WALs; add enable/disable when hbase:meta assigned to RS**
 667
Set hbase.region.replica.replication.catalog.enabled to enable async WAL Replication for hbase:meta region replicas. It's off by default.
 669
 670 Defaults to the RegionReadReplicaEndpoint.class shipping edits -- set hbase.region.replica.catalog.replication to target a different endpoint implementation.
 671
 672
 673 ---
 674
 675 * [HBASE-25109](https://issues.apache.org/jira/browse/HBASE-25109) | *Major* | **Add MR Counters to WALPlayer; currently hard to tell if it is doing anything**
 676
Adds WALPlayer counters to the MR Counter output:
 678
 679         org.apache.hadoop.hbase.mapreduce.WALPlayer$Counter
 680                 CELLS\_READ=89574
 681                 CELLS\_WRITTEN=89572
 682                 DELETES=64
 683                 PUTS=5305
 684                 WALEDITS=4375
 685
 686
 687 ---
 688
 689 * [HBASE-24896](https://issues.apache.org/jira/browse/HBASE-24896) | *Major* | **'Stuck' in static initialization creating RegionInfo instance**
 690
 691 1. Untangle RegionInfo, RegionInfoBuilder, and MutableRegionInfo static
 692 initializations.
 693 2. Undo static initializing references from RegionInfo to RegionInfoBuilder.
 694 3. Mark RegionInfo#UNDEFINED IA.Private and deprecated;
 695 it is for internal use only and likely to be removed in HBase4. (sub-task HBASE-24918)
 696 4. Move MutableRegionInfo from inner-class of
 697 RegionInfoBuilder to be (package private) standalone. (sub-task HBASE-24918)
 698
 699
 700 ---
 701
 702 * [HBASE-24956](https://issues.apache.org/jira/browse/HBASE-24956) | *Major* | **ConnectionManager#locateRegionInMeta waits for user region lock indefinitely.**
 703
 704 <!-- markdown -->
 705
Without this fix there are situations in which locateRegionInMeta() on a client is not bound by a timeout. This happens because of a global lock whose acquisition was not under any timeout scope. This affects client-facing API calls that rely on this method to locate a table region in meta. This fix brings the lock acquisition under the scope of "hbase.client.meta.operation.timeout", which guarantees a bounded wait time.
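
The general pattern of bounding a lock acquisition by a timeout can be sketched as follows. The class and method names are hypothetical, and the sketch throws `IllegalStateException` only to stay within the standard library; it does not reproduce the exception type or code path used by the HBase client.

```java
import java.util.concurrent.TimeUnit;
import java.util.concurrent.locks.Lock;
import java.util.function.Supplier;

/** Illustrative only; not the HBase client implementation. */
class BoundedLockSketch {

  /**
   * Runs action while holding lock. Gives up with an IllegalStateException when the lock
   * cannot be obtained within timeoutMs instead of waiting indefinitely.
   */
  static <T> T callWithLock(Lock lock, long timeoutMs, Supplier<T> action) {
    boolean locked;
    try {
      locked = lock.tryLock(timeoutMs, TimeUnit.MILLISECONDS);
    } catch (InterruptedException e) {
      Thread.currentThread().interrupt();
      throw new IllegalStateException("interrupted while waiting for the lock", e);
    }
    if (!locked) {
      throw new IllegalStateException("failed to get the lock in " + timeoutMs + " ms");
    }
    try {
      return action.get();
    } finally {
      lock.unlock();
    }
  }
}
```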
 707
 708
 709 ---
 710
 711 * [HBASE-24764](https://issues.apache.org/jira/browse/HBASE-24764) | *Minor* | **Add support of adding base peer configs via hbase-site.xml for all replication peers.**
 712
 713 <!-- markdown -->
 714
Adds a new configuration parameter "hbase.replication.peer.base.config" which accepts semicolon-separated key=CSV pairs (example: k1=v1;k2=v2_1,v3...). When this configuration is set on the server side, these kv pairs are added to every peer configuration if not already set. Peer-specific configuration overrides take precedence over the above default configuration. This is useful when some configuration has to be set for all peers by default and one does not want to add it to every peer definition.
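
The "add if not already set" behavior can be sketched as below. This is an illustration of the merge rule described above, not the HBase parsing code: the class and method names are hypothetical, and the handling of malformed entries is an assumption.

```java
import java.util.LinkedHashMap;
import java.util.Map;

/** Illustrative only; names and error handling are assumptions. */
class PeerBaseConfigSketch {

  /** Parses "k1=v1;k2=v2_1,v3" into an ordered map; each value is kept as its raw CSV string. */
  static Map<String, String> parse(String baseConfig) {
    Map<String, String> parsed = new LinkedHashMap<>();
    if (baseConfig == null) {
      return parsed;
    }
    for (String pair : baseConfig.split(";")) {
      int eq = pair.indexOf('=');
      if (eq <= 0) {
        continue; // assumption: skip empty or malformed entries
      }
      parsed.put(pair.substring(0, eq).trim(), pair.substring(eq + 1).trim());
    }
    return parsed;
  }

  /** Adds base entries to a peer configuration only where the peer has no value of its own. */
  static Map<String, String> applyDefaults(Map<String, String> peerConfig, String baseConfig) {
    Map<String, String> merged = new LinkedHashMap<>(peerConfig);
    parse(baseConfig).forEach(merged::putIfAbsent);
    return merged;
  }
}
```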
 716
 717
 718 ---
 719
 720 * [HBASE-24994](https://issues.apache.org/jira/browse/HBASE-24994) | *Minor* | **Add hedgedReadOpsInCurThread metric**
 721
 722 Expose Hadoop hedgedReadOpsInCurThread metric to HBase.
 723 This metric counts the number of times the hedged reads service executor rejected a read task, falling back to the current thread.
 724 This will help determine the proper size of the thread pool (dfs.client.hedged.read.threadpool.size).
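
What the counter measures can be sketched with a plain `ThreadPoolExecutor`: when the pool has no free thread, the rejected task runs in the submitting thread and a counter is incremented. The class name and the counter field below are hypothetical stand-ins, and the pool settings are assumptions rather than the HDFS client's actual configuration.

```java
import java.util.concurrent.SynchronousQueue;
import java.util.concurrent.ThreadPoolExecutor;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.atomic.AtomicLong;

/** Illustrative only; not the HDFS client implementation. */
class HedgedReadFallbackSketch {
  /** Stand-in for the hedgedReadOpsInCurThread counter. */
  static final AtomicLong opsInCurrentThread = new AtomicLong();

  /** Builds a pool that runs rejected tasks in the caller's thread and counts each fallback. */
  static ThreadPoolExecutor newPool(int size) {
    return new ThreadPoolExecutor(size, size, 60, TimeUnit.SECONDS, new SynchronousQueue<>(),
        new ThreadPoolExecutor.CallerRunsPolicy() {
          @Override
          public void rejectedExecution(Runnable task, ThreadPoolExecutor pool) {
            opsInCurrentThread.incrementAndGet();
            super.rejectedExecution(task, pool);
          }
        });
  }
}
```

A counter that keeps growing suggests the pool is too small for the read load, which is the sizing signal the note refers to.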
 725
 726
 727 ---
 728
 729 * [HBASE-24776](https://issues.apache.org/jira/browse/HBASE-24776) | *Major* | **[hbtop] Support Batch mode**
 730
 731 HBASE-24776 added the following command line parameters to hbtop:
 732 \| Argument \| Description \|
 733 \|---\|---\|
 734 \| -n,--numberOfIterations \<arg\> \| The number of iterations \|
 735 \| -O,--outputFieldNames \| Print each of the available field names on a separate line, then quit \|
 736 \| -f,--fields \<arg\> \| Show only the given fields. Specify comma separated fields to show multiple fields \|
 737 \| -s,--sortField \<arg\> \| The initial sort field. You can prepend a \`+' or \`-' to the field name to also override the sort direction. A leading \`+' will force sorting high to low, whereas a \`-' will ensure a low to high ordering \|
 738 \| -i,--filters \<arg\> \| The initial filters. Specify comma separated filters to set multiple filters \|
 739 \| -b,--batchMode \| Starts hbtop in Batch mode, which could be useful for sending output from hbtop to other programs or to a file. In this mode, hbtop will not accept input and runs until the iterations limit you've set with the \`-n' command-line option or until killed \|
 740
 741
 742 ---
 743
 744 * [HBASE-24602](https://issues.apache.org/jira/browse/HBASE-24602) | *Major* | **Add Increment and Append support to CheckAndMutate**
 745
 746 Summary of the change of HBASE-24602:
 747 - Add \`build(Increment)\` and \`build(Append)\` methods to the \`Builder\` class of the \`CheckAndMutate\` class. After this change, we can perform checkAndIncrement/Append operations as follows:
 748 \`\`\`
// Build a CheckAndMutate object with an Increment object
 750 CheckAndMutate checkAndMutate = CheckAndMutate.newBuilder(row)
 751   .ifEquals(family, qualifier, value)
 752   .build(increment);
 753
 754 // Perform a CheckAndIncrement operation
 755 CheckAndMutateResult checkAndMutateResult = table.checkAndMutate(checkAndMutate);
 756
 757 // Get whether or not the CheckAndIncrement operation is successful
 758 boolean success = checkAndMutateResult.isSuccess();
 759
 760 // Get the result of the increment operation
 761 Result result = checkAndMutateResult.getResult();
 762 \`\`\`
 763 - After this change, \`HRegion.batchMutate()\` is used for increment/append operations.
 764 - As the side effect of the above change, the following coprocessor methods of RegionObserver are called when increment/append operations are performed:
 765   - preBatchMutate()
 766   - postBatchMutate()
 767   - postBatchMutateIndispensably()
 768
 769
 770 ---
 771
 772 * [HBASE-24694](https://issues.apache.org/jira/browse/HBASE-24694) | *Major* | **Support flush a single column family of table**
 773
 774 Adds option for the flush command to flush all stores from the specified column family only, among all regions of the given table (stores from other column families on this table would not get flushed).
 775
 776
 777 ---
 778
 779 * [HBASE-24625](https://issues.apache.org/jira/browse/HBASE-24625) | *Critical* | **AsyncFSWAL.getLogFileSizeIfBeingWritten does not return the expected synced file length.**
 780
We add a method getSyncedLength to the WALProvider.WriterBase interface for the WALFileLengthProvider used by replication. With AsyncFSWAL we write to 3 DNs concurrently and, according to the visibility guarantee of HDFS, the data is available immediately when it arrives at a DN, since every DN is considered the last one in the pipeline. This means replication may read uncommitted data and replicate it to the remote cluster, causing data inconsistency. The method WriterBase#getLength may return a length that is only in the HDFS client buffer and has not been successfully synced to HDFS, so we use WriterBase#getSyncedLength to return the length successfully synced to HDFS; the replication thread can only read a WAL file that is being written up to this length.
See also HBASE-14004 and this document for more details:
https://docs.google.com/document/d/11AyWtGhItQs6vsLRIx32PwTxmBY3libXwGXI25obVEY/edit#

Before this patch, replication may read uncommitted data and replicate it to the slave cluster, causing data inconsistency between the master and slave clusters. Without this patch applied, FSHLog can be used instead of AsyncFSWAL to reduce the probability of inconsistency.
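
The difference between the two lengths can be sketched with a toy writer. The class below is hypothetical and only models the bookkeeping: bytes that have been appended but not yet synced count toward the length, while a reader is limited to the synced length.

```java
import java.util.concurrent.atomic.AtomicLong;

/** Illustrative only; a toy model of length versus synced length, not a WAL writer. */
class SyncedLengthSketch {
  private final AtomicLong length = new AtomicLong();       // bytes handed to the writer
  private final AtomicLong syncedLength = new AtomicLong(); // bytes acknowledged as synced

  void append(int bytes) {
    length.addAndGet(bytes);
  }

  void sync() {
    syncedLength.set(length.get());
  }

  long getLength() {
    return length.get();
  }

  long getSyncedLength() {
    return syncedLength.get();
  }

  /** Bytes a replication reader at readerPos may consume; never past the synced length. */
  long readableFrom(long readerPos) {
    return Math.max(0L, getSyncedLength() - readerPos);
  }
}
```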
 787
 788
 789 ---
 790
 791 * [HBASE-24779](https://issues.apache.org/jira/browse/HBASE-24779) | *Minor* | **Improve insight into replication WAL readers hung on checkQuota**
 792
New metrics are exposed, on the global source, for replication which indicate the usage of the "WAL entry buffer" that was introduced in HBASE-15995. When this usage reaches the limit, that RegionServer will cease to read more data for the sake of trying to replicate it. This usage (and limit) is local to each RegionServer and is shared across all peers being handled by that RegionServer.
 794
 795
 796 ---
 797
 798 * [HBASE-24404](https://issues.apache.org/jira/browse/HBASE-24404) | *Major* | **Support flush a single column family of region**
 799
 800 This adds an extra "flush" command option that allows for specifying an individual family to have its store flushed.
 801
 802 Usage:
 803 flush 'REGIONNAME','FAMILYNAME'
 804 flush 'ENCODED\_REGIONNAME','FAMILYNAME'
 805
 806
 807 ---
 808
 809 * [HBASE-24805](https://issues.apache.org/jira/browse/HBASE-24805) | *Major* | **HBaseTestingUtility.getConnection should be threadsafe**
 810
 811 <!-- markdown -->
 812 Users of `HBaseTestingUtility` can now safely call the `getConnection` method from multiple threads.
 813
 814 As a consequence of refactoring to improve the thread safety of the HBase testing classes, the protected `conf` member of the  `HBaseCommonTestingUtility` class has been marked final. Downstream users who extend from the class hierarchy rooted at this class will need to pass the Configuration instance they want used to their super constructor rather than overwriting the instance variable.
 815
 816
 817 ---
 818
 819 * [HBASE-24767](https://issues.apache.org/jira/browse/HBASE-24767) | *Major* | **Change default to false for HBASE-15519 per-user metrics**
 820
 821 Disables per-user metrics. They were enabled by default for the first time in hbase-2.3.0 but they need some work before they can be on all the time (See HBASE-15519)
 822
 823
 824 ---
 825
 826 * [HBASE-24704](https://issues.apache.org/jira/browse/HBASE-24704) | *Major* | **Make the Table Schema easier to view even there are multiple families**
 827
 828 Improve the layout of column family from vertical to horizontal in table UI.
 829
 830
 831 ---
 832
 833 * [HBASE-11686](https://issues.apache.org/jira/browse/HBASE-11686) | *Minor* | **Shell code should create a binding / irb workspace instead of polluting the root namespace**
 834
In shell, all HBase constants and commands have been moved out of the top-level and into an IRB Workspace. Piped stdin and scripts passed by name to the shell will be evaluated within this workspace. If you absolutely need the top-level definitions, use the new compatibility flag, i.e. hbase shell --top-level-defs or hbase shell --top-level-defs script2run.rb.
 836
 837
 838 ---
 839
 840 * [HBASE-24632](https://issues.apache.org/jira/browse/HBASE-24632) | *Major* | **Enable procedure-based log splitting as default in hbase3**
 841
 842 Enables procedure-based distributed WAL splitting as default (HBASE-20610). To use 'classic' zk-coordinated splitting instead, set 'hbase.split.wal.zk.coordinated' to 'true'.
 843
 844
 845 ---
 846
 847 * [HBASE-24698](https://issues.apache.org/jira/browse/HBASE-24698) | *Major* | **Turn OFF Canary WebUI as default**
 848
Flips the default for 'HBASE-23994 Add WebUI to Canary'. The UI defaulted to on at port 16050. This JIRA changes it so the new UI is off by default.
 850
 851 To enable the UI, set property 'hbase.canary.info.port' to the port you want the UI to use.
 852
 853
 854 ---
 855
 856 * [HBASE-24650](https://issues.apache.org/jira/browse/HBASE-24650) | *Major* | **Change the return types of the new checkAndMutate methods introduced in HBASE-8458**
 857
 858 HBASE-24650 introduced CheckAndMutateResult class and changed the return type of checkAndMutate methods to this class in order to support CheckAndMutate with Increment/Append. CheckAndMutateResult class has two fields, one is \*success\* that indicates whether the operation is successful or not, and the other one is \*result\* that's the result of the operation and is used for  CheckAndMutate with Increment/Append.
 859
 860 The new APIs for the Table interface:
 861 \`\`\`
 862 /\*\*
 863  \* checkAndMutate that atomically checks if a row matches the specified condition. If it does,
 864  \* it performs the specified action.
 865  \*
 866  \* @param checkAndMutate The CheckAndMutate object.
 867  \* @return A CheckAndMutateResult object that represents the result for the CheckAndMutate.
 868  \* @throws IOException if a remote or network exception occurs.
 869  \*/
 870 default CheckAndMutateResult checkAndMutate(CheckAndMutate checkAndMutate) throws IOException {
 871   return checkAndMutate(Collections.singletonList(checkAndMutate)).get(0);
 872 }
 873
 874 /\*\*
 875  \* Batch version of checkAndMutate. The specified CheckAndMutates are batched only in the sense
 876  \* that they are sent to a RS in one RPC, but each CheckAndMutate operation is still executed
 877  \* atomically (and thus, each may fail independently of others).
 878  \*
 879  \* @param checkAndMutates The list of CheckAndMutate.
 880  \* @return A list of CheckAndMutateResult objects that represents the result for each
 881  \*   CheckAndMutate.
 882  \* @throws IOException if a remote or network exception occurs.
 883  \*/
 884 default List\<CheckAndMutateResult\> checkAndMutate(List\<CheckAndMutate\> checkAndMutates)
 885   throws IOException {
 886   throw new NotImplementedException("Add an implementation!");
 887 }
\`\`\`
 889
 890 The new APIs for the AsyncTable interface:
\`\`\`
 892 /\*\*
 893  \* checkAndMutate that atomically checks if a row matches the specified condition. If it does,
 894  \* it performs the specified action.
 895  \*
 896  \* @param checkAndMutate The CheckAndMutate object.
 \* @return A {@link CompletableFuture} that represents the result for the CheckAndMutate.
 898  \*/
 899 CompletableFuture\<CheckAndMutateResult\> checkAndMutate(CheckAndMutate checkAndMutate);
 900
 901 /\*\*
 902  \* Batch version of checkAndMutate. The specified CheckAndMutates are batched only in the sense
 903  \* that they are sent to a RS in one RPC, but each CheckAndMutate operation is still executed
 904  \* atomically (and thus, each may fail independently of others).
 905  \*
 906  \* @param checkAndMutates The list of CheckAndMutate.
 907  \* @return A list of {@link CompletableFuture}s that represent the result for each
 908  \*   CheckAndMutate.
 909  \*/
 910 List\<CompletableFuture\<CheckAndMutateResult\>\> checkAndMutate(
 911   List\<CheckAndMutate\> checkAndMutates);
 912
 913 /\*\*
 914  \* A simple version of batch checkAndMutate. It will fail if there are any failures.
 915  \*
 916  \* @param checkAndMutates The list of rows to apply.
 \* @return A {@link CompletableFuture} that wraps the result list.
 918  \*/
 919 default CompletableFuture\<List\<CheckAndMutateResult\>\> checkAndMutateAll(
 920   List\<CheckAndMutate\> checkAndMutates) {
 921   return allOf(checkAndMutate(checkAndMutates));
 922 }
 923 \`\`\`
 924
 925
 926 ---
 927
 928 * [HBASE-24671](https://issues.apache.org/jira/browse/HBASE-24671) | *Major* | **Add excludefile and designatedfile options to graceful\_stop.sh**
 929
 930 Add excludefile and designatedfile options to graceful\_stop.sh.
 931
The designated file should have \<hostname:port\> per line; these hosts are used as unload targets.
 933
 934 Exclude file should have \<hostname:port\> per line. We do not unload regions to hostnames given in exclude file.
 935
 936 Here is a simple example using graceful\_stop.sh with designatedfile option:
 937 ./bin/graceful\_stop.sh --maxthreads 4 --designatedfile /path/designatedfile hostname
 938 The usage of the excludefile option is the same as the above.
 939
 940
 941 ---
 942
 943 * [HBASE-24560](https://issues.apache.org/jira/browse/HBASE-24560) | *Major* | **Add a new option of designatedfile in RegionMover**
 944
 945 Add a new option "designatedfile" in RegionMover.
 946
 947 If designated file is present with some contents, we will unload regions to hostnames provided in designated file.
 948
 949 Designated file should have 'host:port' per line.
 950
 951
 952 ---
 953
 954 * [HBASE-24289](https://issues.apache.org/jira/browse/HBASE-24289) | *Major* | **Heterogeneous Storage for Date Tiered Compaction**
 955
Enhance DateTieredCompaction to support HDFS storage policy within one column family.

First, enable DTCP. To turn on Date Tiered Compaction (it is not recommended to turn it on for the whole cluster, because that will put the meta table on it too and random gets on the meta table will be impacted):
hbase.hstore.compaction.compaction.policy=org.apache.hadoop.hbase.regionserver.compactions.DateTieredCompactionPolicy

Parameters for Date Tiered Compaction:
hbase.hstore.compaction.date.tiered.max.storefile.age.millis: Files with max-timestamp smaller than this will no longer be compacted. Default at Long.MAX\_VALUE.
hbase.hstore.compaction.date.tiered.base.window.millis: base window size in milliseconds. Default at 6 hours.
hbase.hstore.compaction.date.tiered.windows.per.tier: number of windows per tier. Default at 4.
hbase.hstore.compaction.date.tiered.incoming.window.min: minimal number of files to compact in the incoming window. Set it to the expected number of files in the window to avoid wasteful compaction. Default at 6.

Then enable HDTCP (Heterogeneous Date Tiered Compaction), for example with the following configuration:
hbase.hstore.compaction.date.tiered.storage.policy.enable=true
hbase.hstore.compaction.date.tiered.hot.window.age.millis=3600000
hbase.hstore.compaction.date.tiered.hot.window.storage.policy=ALL\_SSD
hbase.hstore.compaction.date.tiered.warm.window.age.millis=20600000
hbase.hstore.compaction.date.tiered.warm.window.storage.policy=ONE\_SSD
hbase.hstore.compaction.date.tiered.cold.window.storage.policy=HOT

It is better to also set the WAL and flushed HFile storage policies when using HDTCP. You can tune the following settings as well:
hbase.wal.storage.policy=ALL\_SSD
create 'table',{NAME=\>'f1',CONFIGURATION=\>{'hbase.hstore.block.storage.policy'=\>'ALL\_SSD'}}

To disable HDTCP:
hbase.hstore.compaction.date.tiered.storage.policy.enable=false
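
How the hot, warm, and cold settings might map a compaction window to a storage policy can be sketched as below. The thresholds and policy names are the example values from the configuration above; the class name, the method, and the exact comparison rule (a window younger than the hot age is hot, younger than the warm age is warm, otherwise cold) are assumptions rather than the DateTieredCompactionPolicy code.

```java
/** Illustrative only; the selection rule is an assumption based on the settings above. */
class TieredStoragePolicySketch {
  // Example values copied from the sample configuration in this note.
  static final long HOT_WINDOW_AGE_MILLIS = 3_600_000L;
  static final long WARM_WINDOW_AGE_MILLIS = 20_600_000L;
  static final String HOT_WINDOW_POLICY = "ALL_SSD";
  static final String WARM_WINDOW_POLICY = "ONE_SSD";
  static final String COLD_WINDOW_POLICY = "HOT";

  /** Picks the storage policy for a window of the given age in milliseconds. */
  static String policyForWindowAge(long windowAgeMillis) {
    if (windowAgeMillis < HOT_WINDOW_AGE_MILLIS) {
      return HOT_WINDOW_POLICY;
    }
    if (windowAgeMillis < WARM_WINDOW_AGE_MILLIS) {
      return WARM_WINDOW_POLICY;
    }
    return COLD_WINDOW_POLICY;
  }
}
```

Note that the cold window policy in the example is the HDFS policy named HOT, which is the HDFS default.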
 979
 980
 981 ---
 982
 983 * [HBASE-24648](https://issues.apache.org/jira/browse/HBASE-24648) | *Major* | **Remove the legacy 'forceSplit' related code at region server side**
 984
Add a canSplit method to RegionSplitPolicy to determine whether we can split a region. Usually this does not depend on the RegionSplitPolicy, so the default implementation tests whether the region is available and does not have reference files; DisabledRegionSplitPolicy always returns false.
 986
 987
 988 ---
 989
 990 * [HBASE-24382](https://issues.apache.org/jira/browse/HBASE-24382) | *Major* | **Flush partial stores of region filtered by seqId when archive wal due to too many wals**
 991
Change the flush level from region to store when there are too many WALs. This reduces unnecessary flush tasks and small hfiles.
 993
 994
 995 ---
 996
 997 * [HBASE-24038](https://issues.apache.org/jira/browse/HBASE-24038) | *Major* | **Add a metric to show the locality of ssd in table.jsp**
 998
 999 Add a metric to show the locality of ssd in table.jsp, and move the locality related metrics to a new tab named localities.
1000
1001
1002 ---
1003
1004 * [HBASE-8458](https://issues.apache.org/jira/browse/HBASE-8458) | *Major* | **Support for batch version of checkAndMutate()**
1005
HBASE-8458 introduced the CheckAndMutate class, which is used to perform CheckAndMutate operations. Use the builder class to instantiate a CheckAndMutate object. The builder offers a fluent-style API; for example:
1007 \`\`\`
// A CheckAndMutate operation that performs the specified action if the column (specified by the
// family and the qualifier) of the row equals the specified value
CheckAndMutate checkAndMutate = CheckAndMutate.newBuilder(row)
  .ifEquals(family, qualifier, value)
  .build(put);

// A CheckAndMutate operation that performs the specified action if the column (specified by the
// family and the qualifier) of the row doesn't exist
CheckAndMutate checkAndMutate = CheckAndMutate.newBuilder(row)
  .ifNotExists(family, qualifier)
  .build(put);

// A CheckAndMutate operation that performs the specified action if the row matches the filter
1021 CheckAndMutate checkAndMutate = CheckAndMutate.newBuilder(row)
1022   .ifMatches(filter)
1023   .build(delete);
1024 \`\`\`
1025
This change also added new checkAndMutate APIs to the Table and AsyncTable interfaces, and deprecated the old checkAndMutate APIs. Example code for the new APIs:
1027 \`\`\`
1028 Table table = ...;
1029
1030 CheckAndMutate checkAndMutate = ...;
1031
1032 // Perform the checkAndMutate operation
1033 boolean success = table.checkAndMutate(checkAndMutate);
1034
1035 CheckAndMutate checkAndMutate1 = ...;
1036 CheckAndMutate checkAndMutate2 = ...;
1037
1038 // Batch version
1039 List\<Boolean\> successList = table.checkAndMutate(Arrays.asList(checkAndMutate1, checkAndMutate2));
1040 \`\`\`
1041
This also has Protocol Buffers level changes. Old clients without this patch will work against new servers with this patch. However, new clients will break against old servers without this patch for checkAndMutate with RM and mutateRow. So, for a rolling upgrade, upgrade the servers first, and then roll out the new clients.
1043
1044
1045 ---
1046
1047 * [HBASE-24471](https://issues.apache.org/jira/browse/HBASE-24471) | *Major* | **The way we bootstrap meta table is confusing**
1048
1049 Move all the meta initialization code in MasterFileSystem and HRegionServer to InitMetaProcedure. Add a new step for InitMetaProcedure called INIT\_META\_WRITE\_FS\_LAYOUT to place the moved code.
1050
This is an incompatible change, but should not have much impact. InitMetaProcedure will only be executed once when bootstrapping a fresh new cluster, so typically this will not affect rolling upgrades. Even if you hit this problem, as long as InitMetaProcedure has not finished, we can be sure that there is no user data in the cluster; you can just clean up the cluster and try again. There will be no data loss.
1052
1053
1054 ---
1055
1056 * [HBASE-24017](https://issues.apache.org/jira/browse/HBASE-24017) | *Major* | **Turn down flakey rerun rate on all but hot branches**
1057
1058 Changed master, branch-2, and branch-2.1 to twice a day.
1059 Left branch-2.3, branch-2.2, and branch-1 at every 4 hours.
1060 Changed branch-1.4 and branch-1.3 to @daily (1.3 was running every hour).
1061
1062
1063
1064 # HBASE  2.3.0 Release Notes
1065
1066 These release notes cover new developer and user-facing incompatibilities, important issues, features, and major improvements.
1067
1068
1069 ---
1070
1071 * [HBASE-24603](https://issues.apache.org/jira/browse/HBASE-24603) | *Critical* | **Zookeeper sync() call is async**
1072
1073 <!-- markdown -->
1074
Fixes a couple of bugs in ZooKeeper interaction. First, the zk sync() call that is used to sync lagging followers with the leader, so that the client sees a consistent snapshot state, was actually asynchronous under the hood. We make it synchronous for correctness. Second, zookeeper events are now processed in a separate thread rather than in the thread context of the zookeeper client connection. This decoupling frees up the client connection quickly and avoids deadlocks.
1076
1077
1078 ---
1079
1080 * [HBASE-24631](https://issues.apache.org/jira/browse/HBASE-24631) | *Major* | **Loosen Dockerfile pinned package versions of the "debian-revision"**
1081
1082 <!-- markdown -->
Update our package version numbers throughout the Dockerfiles to be pinned to their epoch:upstream-version components only. Previously we'd specify the full debian package version number, including the debian-revision. This led to instability as debian packaging details changed.
1084 See also [man deb-version](http://manpages.ubuntu.com/manpages/xenial/en/man5/deb-version.5.html)
1085
1086
1087 ---
1088
1089 * [HBASE-24205](https://issues.apache.org/jira/browse/HBASE-24205) | *Major* | **Create metric to know the number of reads that happens from memstore**
1090
Adds new metrics that count read requests (tracked per row) by whether the row was fetched completely from the memstore or was pulled from files and the memstore.
The metrics are collected under the mbean for Tables and under the mbean for Regions.
Under the table mbean, i.e.
"name": "Hadoop:service=HBase,name=RegionServer,sub=Tables"
the new metrics are listed as
{code}
"Namespace\_default\_table\_t3\_columnfamily\_f1\_metric\_memstoreOnlyRowReadsCount": 5,
"Namespace\_default\_table\_t3\_columnfamily\_f1\_metric\_mixedRowReadsCount": 1,
{code}
where the format is
{code}
Namespace\_\<namespacename\>\_table\_\<tableName\>\_columnfamily\_\<columnfamilyname\>\_metric\_memstoreOnlyRowReadsCount
Namespace\_\<namespacename\>\_table\_\<tableName\>\_columnfamily\_\<columnfamilyname\>\_metric\_mixedRowReadsCount
{code}

The same metrics under the region mbean, i.e.
"name": "Hadoop:service=HBase,name=RegionServer,sub=Regions"
are listed as
{code}
"Namespace\_default\_table\_t3\_region\_75a7846f4ac4a2805071a855f7d0dbdc\_store\_f1\_metric\_memstoreOnlyRowReadsCount": 5,
"Namespace\_default\_table\_t3\_region\_75a7846f4ac4a2805071a855f7d0dbdc\_store\_f1\_metric\_mixedRowReadsCount": 1,
{code}
where the format is
{code}
Namespace\_\<namespacename\>\_table\_\<tableName\>\_region\_\<regionName\>\_store\_\<storeName\>\_metric\_memstoreOnlyRowReadsCount
Namespace\_\<namespacename\>\_table\_\<tableName\>\_region\_\<regionName\>\_store\_\<storeName\>\_metric\_mixedRowReadsCount
{code}
This is an aggregate, for every store, of the number of reads that were served purely from the memstore and of mixed reads that were served from both the memstore and files.
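
For anyone scraping these metrics, the name formats above can be assembled with a small helper. The class and method names are hypothetical; only the name layout comes from this note.

```java
/** Illustrative helper; only the name layout is taken from the release note. */
class MemstoreReadMetricNamesSketch {

  /** Name of a per-column-family metric under the Tables mbean. */
  static String tableMetric(String namespace, String table, String family, String metric) {
    return "Namespace_" + namespace + "_table_" + table
        + "_columnfamily_" + family + "_metric_" + metric;
  }

  /** Name of a per-store metric under the Regions mbean. */
  static String regionMetric(String namespace, String table, String region, String store,
      String metric) {
    return "Namespace_" + namespace + "_table_" + table
        + "_region_" + region + "_store_" + store + "_metric_" + metric;
  }
}
```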
1115
1116
1117 ---
1118
1119 * [HBASE-21773](https://issues.apache.org/jira/browse/HBASE-21773) | *Critical* | **rowcounter utility should respond to pleas for help**
1120
1121 This adds [-h\|-help] options to rowcounter. Passing either -h or -help will print rowcounter guide as below:
1122
1123 $hbase rowcounter -h
1124
1125 usage: hbase rowcounter \<tablename\> [options] [\<column1\> \<column2\>...]
1126 Options:
1127     --starttime=\<arg\>       starting time filter to start counting rows from.
1128     --endtime=\<arg\>         end time filter limit, to only count rows up to this timestamp.
1129     --range=\<arg\>           [startKey],[endKey][;[startKey],[endKey]...]]
1130     --expectedCount=\<arg\>   expected number of rows to be count.
1131 For performance, consider the following configuration properties:
1132 -Dhbase.client.scanner.caching=100
1133 -Dmapreduce.map.speculative=false
1134
1135
1136 ---
1137
1138 * [HBASE-24217](https://issues.apache.org/jira/browse/HBASE-24217) | *Major* | **Add hadoop 3.2.x support**
1139
1140 CI coverage has been extended to include Hadoop 3.2.x for HBase 2.2+.
1141
1142
1143 ---
1144
1145 * [HBASE-23055](https://issues.apache.org/jira/browse/HBASE-23055) | *Major* | **Alter hbase:meta**
1146
Adds the ability to edit the hbase:meta table schema. For example:
1148
1149 hbase(main):006:0\> alter 'hbase:meta', {NAME =\> 'info', DATA\_BLOCK\_ENCODING =\> 'ROW\_INDEX\_V1'}
1150 Updating all regions with the new schema...
1151 All regions updated.
1152 Done.
1153 Took 1.2138 seconds
1154
You can even add column families. However, you cannot delete any of the core hbase:meta column families such as 'info' and 'table'.
1156
1157
1158 ---
1159
1160 * [HBASE-15161](https://issues.apache.org/jira/browse/HBASE-15161) | *Major* | **Umbrella: Miscellaneous improvements from production usage**
1161
1162 This ticket summarizes significant improvements and expansion to the metrics surface area. Interested users should review the individual sub-tasks.
1163
1164
1165 ---
1166
1167 * [HBASE-24545](https://issues.apache.org/jira/browse/HBASE-24545) | *Major* | **Add backoff to SCP check on WAL split completion**
1168
Adds backoff to the ServerCrashProcedure (SCP) wait for WAL splitting to complete when there is a large backlog of files to split. (It is possible to avoid SCP blocking while waiting on WALs to split by using procedure-based splitting: set 'hbase.split.wal.zk.coordinated' to false to enable procedure-based WAL splitting.)
1170
1171
1172 ---
1173
1174 * [HBASE-24524](https://issues.apache.org/jira/browse/HBASE-24524) | *Minor* | **SyncTable logging improvements**
1175
Note that this changes the log level for mismatching row keys: originally they were logged at INFO level, now they are logged at DEBUG level. This is consistent with the logging of mismatching cells. Also, for missing row keys, row key values are now logged in human-readable format, making the output more meaningful for operators troubleshooting mismatches.
1177
1178
1179 ---
1180
1181 * [HBASE-24359](https://issues.apache.org/jira/browse/HBASE-24359) | *Major* | **Optionally ignore edits for deleted CFs for replication.**
1182
Introduces a new config, hbase.replication.drop.on.deleted.columnfamily, which defaults to false. When set to true, replication drops the edits for a column family that has been deleted from both the replication source and target.
1184
1185
1186 ---
1187
1188 * [HBASE-24418](https://issues.apache.org/jira/browse/HBASE-24418) | *Major* | **Consolidate Normalizer implementations**
1189
1190 <!-- markdown -->
1191 This change extends the Normalizer with a handful of new configurations. The configuration points supported are:
1192 * `hbase.normalizer.split.enabled` Whether to split a region as part of normalization. Default: `true`.
* `hbase.normalizer.merge.enabled` Whether to merge a region as part of normalization. Default: `true`.
1194 * `hbase.normalizer.min.region.count` The minimum number of regions in a table to consider it for merge normalization. Default: 3.
1195 * `hbase.normalizer.merge.min_region_age.days` The minimum age for a region to be considered for a merge, in days. Default: 3.
1196 * `hbase.normalizer.merge.min_region_size.mb` The minimum size for a region to be considered for a merge, in whole MBs. Default: 1.
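
One way to picture how the three merge-related minimums combine is the Python sketch below. It is not the Normalizer's code: the class, the attribute names, and the function are hypothetical, and treating every minimum as inclusive (`>=`) is an assumption that the note does not confirm.

```python
from dataclasses import dataclass


@dataclass
class NormalizerSettings:
    """Defaults as listed in this note; attribute names are illustrative."""
    split_enabled: bool = True            # hbase.normalizer.split.enabled
    merge_enabled: bool = True            # hbase.normalizer.merge.enabled
    min_region_count: int = 3             # hbase.normalizer.min.region.count
    merge_min_region_age_days: int = 3    # hbase.normalizer.merge.min_region_age.days
    merge_min_region_size_mb: int = 1     # hbase.normalizer.merge.min_region_size.mb


def is_merge_candidate(settings, table_region_count, region_age_days, region_size_mb):
    """Assumed reading of the settings: every minimum is inclusive."""
    return (
        settings.merge_enabled
        and table_region_count >= settings.min_region_count
        and region_age_days >= settings.merge_min_region_age_days
        and region_size_mb >= settings.merge_min_region_size_mb
    )


defaults = NormalizerSettings()
print(is_merge_candidate(defaults, table_region_count=5, region_age_days=10, region_size_mb=4))
print(is_merge_candidate(defaults, table_region_count=2, region_age_days=10, region_size_mb=4))
```

Under these assumptions the first call reports a candidate, while the second does not because the table has fewer than three regions.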
1197
1198
1199 ---
1200
1201 * [HBASE-24309](https://issues.apache.org/jira/browse/HBASE-24309) | *Major* | **Avoid introducing log4j and slf4j-log4j dependencies for modules other than hbase-assembly**
1202
Adds an hbase-logging module and puts the log4j-related code in this module only, so other modules do not need to depend on log4j at compile scope. See the comments of Log4jUtils and InternalLog4jUtils for more details.

Adds a log4j.properties to the test jar of the hbase-logging module, so other sub-modules only need to depend on that test jar at test scope to output logs to the console, without placing a log4j.properties in their own test resources (almost all of which would have the same content). This test jar is not included in the assembly tarball, so it will not mess up the binary distribution.

Bans direct commons-logging dependencies, and bans commons-logging and log4j imports in non-test code, to avoid messing up downstream users' logging frameworks. The hbase-logging module does need to use log4j classes; the trick there is to use fully qualified class names.

Adds jcl-over-slf4j and jul-to-slf4j dependencies: some of our dependencies use jcl or jul as their logging framework, so their log messages should also be redirected to slf4j.
1210
1211
1212 ---
1213
1214 * [HBASE-21406](https://issues.apache.org/jira/browse/HBASE-21406) | *Minor* | **"status 'replication'" should not show SINK if the cluster does not act as sink**
1215
Added a new metric to differentiate sink startup time from the time the last OP was applied.

The original behaviour was to always set the startup time to TimestampsOfLastAppliedOp and always show it in the "status 'replication'" command, regardless of whether the sink had ever applied any OP.

This was confusing, especially where the cluster was acting only as a source: the output could lead to wrong interpretations about the sink not applying edits or replication being stuck.

With the new metric, we now compare the two metric values and assume that, if both are the same, no OP has ever been shipped to the given sink. The output then reflects this more clearly, for example:
1223
1224 SINK: TimeStampStarted=Thu Dec 06 23:59:47 GMT 2018, Waiting for OPs...
1225
1226
1227 ---
1228
1229 * [HBASE-24132](https://issues.apache.org/jira/browse/HBASE-24132) | *Major* | **Upgrade to Apache ZooKeeper 3.5.7**
1230
1231 <!-- markdown -->
HBase now ships ZooKeeper 3.5.x; previously it shipped the EOL'd 3.4.x. A 3.5.x client can talk to a 3.4.x ensemble.
1233
1234 The ZooKeeper project has built a [FAQ](https://cwiki.apache.org/confluence/display/ZOOKEEPER/Upgrade+FAQ) that documents known issues and work-arounds when upgrading existing deployments.
1235
1236
1237 ---
1238
1239 * [HBASE-22287](https://issues.apache.org/jira/browse/HBASE-22287) | *Major* | **inifinite retries on failed server in RSProcedureDispatcher**
1240
1241 Add backoff. Avoid retrying every 100ms.
1242
1243
1244 ---
1245
1246 * [HBASE-24425](https://issues.apache.org/jira/browse/HBASE-24425) | *Major* | **Run hbck\_chore\_run and catalogjanitor\_run on draw of 'HBCK Report' page**
1247
1248 Runs 'catalogjanitor\_run' and 'hbck\_chore\_run' inline with the loading of the 'HBCK Report' page.
1249
Pass '?cache=true' to skip the inline invocation of 'catalogjanitor\_run' and 'hbck\_chore\_run' when drawing the page.
1251
1252
1253 ---
1254
1255 * [HBASE-24408](https://issues.apache.org/jira/browse/HBASE-24408) | *Blocker* | **Introduce a general 'local region' to store data on master**
1256
1257 Introduced a general 'local region' at master side to store the procedure data, etc.
1258
The hfiles of this region are stored on the root fs, while the WAL is stored on the wal fs. This issue supersedes part of the code for HBASE-23326, as the data is now stored in the 'MasterData' directory instead of 'MasterProcs'.
1260
1261 The old hfiles will be moved to the global hfile archived directory with the suffix $-masterlocalhfile-$. The wal files will be moved to the global old wal directory with the suffix $masterlocalwal$. The TimeToLiveMasterLocalStoreHFileCleaner and TimeToLiveMasterLocalStoreWALCleaner are configured by default for cleaning the old hfiles and wal files, and the default TTLs are both 7 days.
1262
1263
1264 ---
1265
1266 * [HBASE-24115](https://issues.apache.org/jira/browse/HBASE-24115) | *Major* | **Relocate test-only REST "client" from src/ to test/ and mark Private**
1267
Relocates the test-only REST RemoteHTable and RemoteAdmin from src/ to test/ and marks them as InterfaceAudience.Private.
1269
1270
1271 ---
1272
1273 * [HBASE-23938](https://issues.apache.org/jira/browse/HBASE-23938) | *Major* | **Replicate slow/large RPC calls to HDFS**
1274
1275 Config key: hbase.regionserver.slowlog.systable.enabled
1276 Default value: false
1277
This config can be enabled only when hbase.regionserver.slowlog.buffer.enabled is already enabled. While hbase.regionserver.slowlog.buffer.enabled ensures that slow/large RPC logs are written with complete details to the ring buffer on each RegionServer, hbase.regionserver.slowlog.systable.enabled ensures that all such logs are also persisted in the new system table hbase:slowlog.
Operators can scan hbase:slowlog with filters to retrieve records matching specific attributes. The table is useful for capturing the history of slow RPC calls for detailed analysis.

hbase:slowlog consists of a single ColumnFamily, info, which holds multiple qualifiers similar to the attributes that can be queried through the Admin API: get\_slowlog\_responses.
1282
1283 One example of a row from hbase:slowlog scan result (Attached a sample screenshot in the Jira) :
1284
1285  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:call\_details, timestamp=2020-05-16T14:59:58.764Z, value=Scan(org.apache.hadoop.hbase.shaded.protobuf.generated.ClientProtos$ScanRequest)
1286  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:client\_address, timestamp=2020-05-16T14:59:58.764Z, value=172.20.10.2:57348
1287  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:method\_name, timestamp=2020-05-16T14:59:58.764Z, value=Scan
1288  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:param, timestamp=2020-05-16T14:59:58.764Z, value=region { type: REGION\_NAME value: "cluster\_test,cccccccc,1589635796466.aa45e1571d533f5ed0bb31cdccaaf9cf." } scan { a
1289                                                              ttribute { name: "\_isolationlevel\_" value: "\\x5C000" } start\_row: "cccccccc" time\_range { from: 0 to: 9223372036854775807 } max\_versions: 1 cache\_blocks: true max\_result\_size: 2
1290                                                              097152 caching: 2147483647 include\_stop\_row: false } number\_of\_rows: 2147483647 close\_scanner: false client\_handles\_partials: true client\_handles\_heartbeats: true track\_scan\_met
1291                                                              rics: false
1292  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:processing\_time, timestamp=2020-05-16T14:59:58.764Z, value=24
1293  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:queue\_time, timestamp=2020-05-16T14:59:58.764Z, value=0
1294  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:region\_name, timestamp=2020-05-16T14:59:58.764Z, value=cluster\_test,cccccccc,1589635796466.aa45e1571d533f5ed0bb31cdccaaf9cf.
1295  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:response\_size, timestamp=2020-05-16T14:59:58.764Z, value=211227
1296  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:server\_class, timestamp=2020-05-16T14:59:58.764Z, value=HRegionServer
1297  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:start\_time, timestamp=2020-05-16T14:59:58.764Z, value=1589640743932
1298  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:type, timestamp=2020-05-16T14:59:58.764Z, value=ALL
1299  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:username, timestamp=2020-05-16T14:59:58.764Z, value=vjasani
1300
1301
1302 ---
1303
1304 * [HBASE-24271](https://issues.apache.org/jira/browse/HBASE-24271) | *Major* | **Set values in \`conf/hbase-site.xml\` that enable running on \`LocalFileSystem\` out of the box**
1305
1306 <!-- markdown -->
HBASE-24271 changes the default `conf/hbase-site.xml` such that `bin/hbase` will run directly out of the binary tarball or a compiled source tree, without any configuration modifications, against Hadoop 2.8+. This changes our long-standing history of shipping no configured values in `conf/hbase-site.xml`, so existing processes that assume this file is empty of configuration properties may require attention.
1308
1309
1310 ---
1311
1312 * [HBASE-24310](https://issues.apache.org/jira/browse/HBASE-24310) | *Major* | **Use Slf4jRequestLog for hbase-http**
1313
1314 Use Slf4jRequestLog instead of the log4j HttpRequestLogAppender in HttpServer.
1315
1316 The request log is disabled by default in conf/log4j.properties by the following lines:
1317
    # Disable request log by default, you can enable this by changing the appender
    log4j.category.http.requests=INFO,NullAppender
    log4j.additivity.http.requests=false
1321
To enable the request log, change 'NullAppender' to the appender of your choice.

Note that the logger name for the master status http server is 'http.requests.master', and for the region server it is 'http.requests.regionserver'.
1325
1326
1327 ---
1328
1329 * [HBASE-24335](https://issues.apache.org/jira/browse/HBASE-24335) | *Major* | **Support deleteall with ts but without column in shell mode**
1330
Use an empty string to represent no column specified for deleteall in shell mode.
Usage:
1333 deleteall 'test','r1','',12345
1334 deleteall 'test', {ROWPREFIXFILTER =\> 'prefix'}, '', 12345
1335
1336
1337 ---
1338
1339 * [HBASE-24304](https://issues.apache.org/jira/browse/HBASE-24304) | *Major* | **Separate a hbase-asyncfs module**
1340
1341 Added a new hbase-asyncfs module to hold the asynchronous dfs output stream implementation for implementing WAL.
1342
1343
1344 ---
1345
1346 * [HBASE-22710](https://issues.apache.org/jira/browse/HBASE-22710) | *Major* | **Wrong result in one case of scan that use  raw and versions and filter together**
1347
Makes the version-selection logic for raw scans more reasonable, to avoid losing results when a filter is used.
1349
1350
1351 ---
1352
1353 * [HBASE-24285](https://issues.apache.org/jira/browse/HBASE-24285) | *Major* | **Move to hbase-thirdparty-3.3.0**
1354
1355 Moved to hbase-thirdparty 3.3.0.
1356
1357
1358 ---
1359
1360 * [HBASE-24252](https://issues.apache.org/jira/browse/HBASE-24252) | *Major* | **Implement proxyuser/doAs mechanism for hbase-http**
1361
This feature enables the HBase Web UIs to accept a 'proxyuser' via the HTTP request's query string. When the parameter \`hbase.security.authentication.spnego.kerberos.proxyuser.enable\` is set to \`true\` in hbase-site.xml (default is \`false\`), the HBase UI will attempt to impersonate the user specified by the query parameter "doAs". This query parameter is checked case-insensitively. When this option is not enabled, the user who executed the request is the "real" user and there is no ability to execute impersonation against the WebUI.
1363
1364 For example, if the user "bob" with Kerberos credentials executes a request against the WebUI with this feature enabled and a query string which includes \`doAs=alice\`, the HBase UI will treat this request as executed as \`alice\`, not \`bob\`.
1365
1366 The standard Hadoop proxyuser configuration properties to limit users who may impersonate others apply to this change (e.g. to enable \`bob\` to impersonate \`alice\`). See the Hadoop documentation for more information on how to configure these proxyuser rules.
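
The selection rule described in this note can be sketched in Python as follows. This is only a model of the documented behaviour, not HBase's implementation: the function name, host name, and port in the URL are placeholders, and the Hadoop proxyuser rules that decide whether the real user may impersonate the target are deliberately left out.

```python
from urllib.parse import urlsplit, parse_qsl


def effective_user(request_url, real_user, proxyuser_enabled):
    """Return the user a request would be treated as, per this note.

    Simplification: the Hadoop proxyuser authorization rules are NOT
    modelled; a real server would check them before impersonating.
    """
    if not proxyuser_enabled:
        return real_user
    for key, value in parse_qsl(urlsplit(request_url).query):
        # The parameter name is matched case-insensitively.
        if key.lower() == "doas" and value:
            return value
    return real_user


url = "http://master.example.com:16010/master-status?doAs=alice"
print(effective_user(url, "bob", proxyuser_enabled=True))
print(effective_user(url, "bob", proxyuser_enabled=False))
```

With the feature enabled the request is attributed to `alice`; with it disabled the request stays with `bob`, whatever the query string says.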
1367
1368
1369 ---
1370
1371 * [HBASE-24143](https://issues.apache.org/jira/browse/HBASE-24143) | *Major* | **[JDK11] Switch default garbage collector from CMS**
1372
1373 <!-- markdown -->
1374 `bin/hbase` will now dynamically select a Garbage Collector implementation based on the detected JVM version. JDKs 8,9,10 use `-XX:+UseConcMarkSweepGC`, while JDK11+ use `-XX:+UseG1GC`.
1375
1376 Notice a slight compatibility change. Previously, the garbage collector choice would always be appended to a user-provided value for `HBASE_OPTS`. As of this change, this setting will only be applied when `HBASE_OPTS` is unset. That means that operators who provide a value for this variable will now need to also specify the collector. This is especially important for those on JDK8, where the vm default GC is not the recommended ConcMarkSweep.
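
The selection rule can be summarised with a small Python model. It restates the behaviour described in this note and is not the logic of `bin/hbase` itself; the function name is hypothetical, and treating an empty `HBASE_OPTS` the same as an unset one is an assumption.

```python
def default_gc_flag(jdk_major, hbase_opts=None):
    """Return the collector flag that would be added, or None if none is."""
    if hbase_opts:
        # The operator supplied HBASE_OPTS, so no collector is injected;
        # the operator is expected to name one explicitly.
        return None
    if jdk_major >= 11:
        return "-XX:+UseG1GC"
    return "-XX:+UseConcMarkSweepGC"  # JDK 8, 9 and 10


print(default_gc_flag(8))
print(default_gc_flag(11))
print(default_gc_flag(8, hbase_opts="-Xmx8g"))
```

The last call shows the compatibility change: once `HBASE_OPTS` carries any value, no collector is chosen automatically.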
1377
1378
1379 ---
1380
1381 * [HBASE-24024](https://issues.apache.org/jira/browse/HBASE-24024) | *Major* | **Optionally reject multi() requests with very high no of rows**
1382
1383 New Config: hbase.rpc.rows.size.threshold.reject
1384 -----------------------------------------------------------------------
1385
1386 Default value: false
1387 Description:
1388 If value is true, RegionServer will abort batch requests of Put/Delete with number of rows in a batch operation exceeding threshold defined by value of config: hbase.rpc.rows.warning.threshold.
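
As a minimal sketch of the rule, assuming "exceeding" means strictly greater than the threshold: the function below is hypothetical, and the threshold of 5000 used in the calls is only an example value, not a statement about the configured default.

```python
def should_reject_batch(row_count, rows_warning_threshold, reject_enabled=False):
    """Model of the rule described in this note.

    reject_enabled         -> hbase.rpc.rows.size.threshold.reject
    rows_warning_threshold -> hbase.rpc.rows.warning.threshold
    """
    return reject_enabled and row_count > rows_warning_threshold


print(should_reject_batch(5001, 5000, reject_enabled=True))
print(should_reject_batch(5001, 5000))  # reject flag left at its default
```

With the flag at its default of false, an oversized batch is not rejected by this rule.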
1389
1390
1391 ---
1392
1393 * [HBASE-24139](https://issues.apache.org/jira/browse/HBASE-24139) | *Critical* | **Balancer should avoid leaving idle region servers**
1394
1395 StochasticLoadBalancer functional improvement:
1396
StochasticLoadBalancer now rebalances the cluster if there are any idle RegionServers (RegionServers hosting no regions) while other RegionServers host at least one region.
1398
1399
1400 ---
1401
1402 * [HBASE-24196](https://issues.apache.org/jira/browse/HBASE-24196) | *Major* | **[Shell] Add rename rsgroup command in hbase shell**
1403
Users or admins can now use
hbase shell \> rename\_rsgroup 'oldname', 'newname'
to rename an rsgroup.
1407
1408
1409 ---
1410
1411 * [HBASE-24218](https://issues.apache.org/jira/browse/HBASE-24218) | *Major* | **Add hadoop 3.2.x in hadoop check**
1412
Adds hadoop-3.2.0 and hadoop-3.2.1 to the hadoop check; with '--quick-hadoopcheck', only hadoop-3.2.1 is checked.

Note that, to keep the personality scripts aligned across all active branches, the patch is committed to all active branches, but the hadoop-3.2.x support in hadoopcheck only applies to branch-2.2+.
1416
1417
1418 ---
1419
1420 * [HBASE-23829](https://issues.apache.org/jira/browse/HBASE-23829) | *Major* | **Get \`-PrunSmallTests\` passing on JDK11**
1421
\`-PrunSmallTests\` now passes on JDK11 when using \`-Phadoop.profile=3.0\`.
1423
1424
1425 ---
1426
1427 * [HBASE-24185](https://issues.apache.org/jira/browse/HBASE-24185) | *Major* | **Junit tests do not behave well with System.exit or Runtime.halt or JVM exits in general.**
1428
1429 Tests that fail because a process -- RegionServer or Master -- called System.exit, will now instead throw an exception.
1430
1431
1432 ---
1433
1434 * [HBASE-24072](https://issues.apache.org/jira/browse/HBASE-24072) | *Major* | **Nightlies reporting OutOfMemoryError: unable to create new native thread**
1435
1436 Hadoop hosts have had their ulimit -u raised from 10000 to 30000 (per user, by INFRA). The Docker build container has had its limit raised from 10000 to 12500.
1437
1438
1439 ---
1440
1441 * [HBASE-24112](https://issues.apache.org/jira/browse/HBASE-24112) | *Major* | **[RSGroup] Support renaming rsgroup**
1442
1443 Support RSGroup renaming in core codebase. New API Admin#renameRSGroup(String, String) is introduced in 3.0.0.
1444
1445
1446 ---
1447
1448 * [HBASE-23994](https://issues.apache.org/jira/browse/HBASE-23994) | *Trivial* | ** Add WebUI to Canary**
1449
1450 <!-- markdown -->
1451 The Canary tool now offers a WebUI when run in `region` mode (the default mode). It is enabled by default, and by default, it binds to `0.0.0.0:16050`. This can be overridden by setting `hbase.canary.info.bindAddress` and `hbase.canary.info.port`. To disable entirely, set the port to `-1`.
1452
1453
1454 ---
1455
1456 * [HBASE-23779](https://issues.apache.org/jira/browse/HBASE-23779) | *Major* | **Up the default fork count to make builds complete faster; make count relative to CPU count**
1457
Pass --threads=2 when building on jenkins. It shortens nightly build times by about 25%.

It works by running module builds/tests in parallel when dependencies allow. Upping the forkcount beyond the pom default of 0.25C would have us breach our CPU budget on jenkins when two modules are running in parallel (2 modules at 0.25C each makes 0.5C, and on jenkins, hadoop nodes run two jenkins executors per host). Higher forkcounts also seem to threaten build stability.

To run tests faster locally, up the fork count.
1463
1464 $ x="0.5C"  ;  mvn --threads=2  -Dsurefire.firstPartForkCount=$x -Dsurefire.secondPartForkCount=$x test -PrunAllTests
1465
1466 You could up the x from 0.5C to 1.0C but YMMV (On overcommitted hardware, tests start bombing out pretty soon after startup). You could try upping thread count but on occasion are likely to overcommit hardware.
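
The arithmetic behind these settings can be sketched in Python. A fork count with a `C` suffix is multiplied by the number of CPU cores; the truncation to an integer with a floor of one fork is an assumption made for this sketch, and the function names and the 16-core host are illustrative only.

```python
def forks_for(fork_count, cpu_cores):
    """Resolve a fork count such as '0.25C' or '3' to a number of JVMs."""
    spec = fork_count.strip()
    if spec.upper().endswith("C"):
        # 'C' suffix: multiply by the number of CPU cores.
        # Assumption: truncate to an integer, with a floor of one fork.
        return max(1, int(float(spec[:-1]) * cpu_cores))
    return int(spec)


def peak_forks(fork_count, cpu_cores, maven_threads):
    """Upper bound on concurrent test JVMs when modules build in parallel."""
    return forks_for(fork_count, cpu_cores) * maven_threads


cores = 16  # example host size
print(peak_forks("0.25C", cores, maven_threads=2))  # pom default, two modules
print(peak_forks("0.5C", cores, maven_threads=2))   # the local suggestion
```

On this example host the pom default with `--threads=2` peaks at half the cores, while `0.5C` with two threads can already occupy all of them, which is why going higher risks overcommitting the hardware.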
1467
1468
1469 ---
1470
1471 * [HBASE-24126](https://issues.apache.org/jira/browse/HBASE-24126) | *Major* | **Up the container nproc uplimit from 10000 to 12500**
1472
1473 Start docker with upped ulimit for nproc passing '--ulimit nproc=12500'. It was 10000, the default, but made it 12500. Then, set PROC\_LIMIT in hbase-personality so when yetus runs, it is w/ the new 12500 value.
1474
1475
1476 ---
1477
1478 * [HBASE-24150](https://issues.apache.org/jira/browse/HBASE-24150) | *Major* | **Allow module tests run in parallel**
1479
Pass -T2 to mvn. This builds two modules at a time, dependencies permitting, which helps speed up build and testing. It doubles the resource usage while modules are running in parallel.
1481
1482
1483 ---
1484
1485 * [HBASE-24121](https://issues.apache.org/jira/browse/HBASE-24121) | *Major* | **[Authorization] ServiceAuthorizationManager isn't dynamically updatable. And it should be.**
1486
Master and RegionServer now support refreshing the authorization policy defined in hbase-policy.xml without restarting the service. To refresh the policy, run the hbase shell command update\_config or update\_config\_all after the policy file has been updated and synced to all nodes.
1488
1489
1490 ---
1491
1492 * [HBASE-24099](https://issues.apache.org/jira/browse/HBASE-24099) | *Major* | **Use a fair ReentrantReadWriteLock for the region close lock**
1493
1494 This change modifies the default acquisition policy for the region's close lock in order to prevent observed starvation of close requests. The new boolean configuration parameter 'hbase.regionserver.fair.region.close.lock' controls the lock acquisition policy: if true, the lock is created in fair mode (default); if false, the lock is created in nonfair mode (the old default).
1495
1496
1497 ---
1498
1499 * [HBASE-23153](https://issues.apache.org/jira/browse/HBASE-23153) | *Major* | **PrimaryRegionCountSkewCostFunction SLB function should implement CostFunction#isNeeded**
1500
1501 <!-- markdown -->
1502 The `PrimaryRegionCountSkewCostFunction` for the `StochasticLoadBalancer` is only needed when the read replicas feature is enabled. With this change, that function now properly indicates that it is not needed when the read replica feature is off.
1503
1504 If this improvement is not available, operators with clusters that are not using the read replica feature should manually disable it by setting `hbase.master.balancer.stochastic.primaryRegionCountCost` to `0.0` in hbase-site.xml for all HBase Masters.
1505
1506
1507 ---
1508
1509 * [HBASE-24055](https://issues.apache.org/jira/browse/HBASE-24055) | *Major* | **Make AsyncFSWAL can run on EC cluster**
1510
AsyncFSWAL can now also be used against a directory that has EC enabled. Make sure you also use the hadoop 3.x client, as the option is only available in hadoop 3.x.
1512
1513
1514 ---
1515
1516 * [HBASE-24113](https://issues.apache.org/jira/browse/HBASE-24113) | *Major* | **Upgrade the maven we use from 3.5.4 to 3.6.3 in nightlies**
1517
Branches-2.3+ use maven 3.6.3 for building. Older branches still use 3.5.4.
1519
1520
1521 ---
1522
1523 * [HBASE-24122](https://issues.apache.org/jira/browse/HBASE-24122) | *Major* | **Change machine ulimit-l to ulimit-a so dumps full ulimit rather than just 'max locked memory'**
1524
1525 Our 'Build Artifacts' have a machine directory under which we emit vitals on the host the build was run on. We used to emit the result of 'ulimit -l' as a file named 'ulimit-l'. This has been hijacked to instead emit result of running 'ulimit -a' which includes stat on ulimit -l.
1526
1527
1528 ---
1529
1530 * [HBASE-23678](https://issues.apache.org/jira/browse/HBASE-23678) | *Major* | **Literate builder API for version management in schema**
1531
1532 ColumnFamilyDescriptor new builder API:
1533
1534     /\*\*
1535      \* Retain all versions for a given TTL(retentionInterval), and then only a specific number
1536      \* of versions(versionAfterInterval) after that interval elapses.
1537      \*
1538      \* @param retentionInterval Retain all versions for this interval
1539      \* @param versionAfterInterval Retain no of versions to retain after retentionInterval
1540      \*/
1541     public ModifyableColumnFamilyDescriptor setVersionsWithTimeToLive(
1542         final int retentionInterval, final int versionAfterInterval)
1543
1544
1545 ---
1546
1547 * [HBASE-24050](https://issues.apache.org/jira/browse/HBASE-24050) | *Major* | **Deprecated PBType on all 2.x branches**
1548
org.apache.hadoop.hbase.types.PBType is marked as deprecated without any replacement. It will be moved to the hbase-example module and marked as IA.Private in 3.0.0. Exposing it was a mistake, as it should not be part of our public API. Users who depend on this class should copy the code into their own code base.
1550
1551
1552 ---
1553
1554 * [HBASE-8868](https://issues.apache.org/jira/browse/HBASE-8868) | *Minor* | **add metric to report client shortcircuit reads**
1555
1556 Expose file system level read metrics for RegionServer.
1557
1558 If the HBase RS runs on top of HDFS, calculate the aggregation of
1559 ReadStatistics of each HdfsFileInputStream. These metrics include:
1560 (1) total number of bytes read from HDFS.
1561 (2) total number of bytes read from local DataNode.
1562 (3) total number of bytes read locally through short-circuit read.
1563 (4) total number of bytes read locally through zero-copy read.
1564
1565 Because HDFS ReadStatistics is calculated per input stream, it is not
1566 feasible to update the aggregated number in real time. Instead, the
1567 metrics are updated when an input stream is closed.
1568
1569
1570 ---
1571
1572 * [HBASE-24032](https://issues.apache.org/jira/browse/HBASE-24032) | *Major* | **[RSGroup] Assign created tables to respective rsgroup automatically instead of manual operations**
1573
Admins can determine which tables go to which rsgroup with a script on the Master side (set hbase.rsgroup.table.mapping.script to a local filesystem path), which aims to lighten the burden of admin operations. Note that since HBase 3+, the rsgroup can also be specified in the TableDescriptor; if a client specifies it, the master skips the determination by script.
1575
1576 Here is a simple example of script:
1577 {code}
#!/bin/bash
# Input consists of two strings: the 1st is the namespace of the table, the 2nd is the table name
1580 namespace=$1
1581 tablename=$2
1582 if [[ $namespace == test ]]; then
1583   echo test
1584 elif [[ $tablename == \*foo\* ]]; then
1585   echo other
1586 else
1587   echo default
1588 fi
1589 {code}
1590
1591
1592 ---
1593
1594 * [HBASE-23993](https://issues.apache.org/jira/browse/HBASE-23993) | *Major* | **Use loopback for zk standalone server in minizkcluster**
1595
1596 MiniZKCluster now puts up its standalone node listening on loopback/127.0.0.1 rather than "localhost".
1597
1598
1599 ---
1600
1601 * [HBASE-23986](https://issues.apache.org/jira/browse/HBASE-23986) | *Major* | **Bump hadoop-two.version to 2.10.0 on master and branch-2**
1602
1603 Bumped hadoop-two.version to 2.10.0, which means we will drop the support for hadoop-2.8.x and hadoop-2.9.x.
1604
1605
1606 ---
1607
1608 * [HBASE-23930](https://issues.apache.org/jira/browse/HBASE-23930) | *Minor* | **Shell should attempt to format \`timestamp\` attributes as ISO-8601**
1609
Changes the timestamp display to ISO-8601 in Cell#toString and in shell output.

Users used to see:
1613
1614   column=table:state, timestamp=1583967620343 .....
1615
1616 ... but now sees:
1617
1618   column=table:state, timestamp=2020-03-11T23:00:20.343Z ....
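
For scripts that still work with epoch milliseconds, the two representations can be converted with a few lines of Python. This is a standalone sketch using only the standard library; the function names are hypothetical, and it handles only non-negative timestamps in the millisecond-precision, `Z`-suffixed form shown in this note.

```python
from datetime import datetime, timezone


def to_iso8601(epoch_millis):
    """Format epoch milliseconds as an ISO-8601 UTC string with a 'Z' suffix."""
    seconds, millis = divmod(epoch_millis, 1000)
    moment = datetime.fromtimestamp(seconds, tz=timezone.utc)
    moment = moment.replace(microsecond=millis * 1000)
    return moment.isoformat(timespec="milliseconds").replace("+00:00", "Z")


def from_iso8601(text):
    """Convert a string in the form shown in this note back to epoch milliseconds."""
    moment = datetime.strptime(text, "%Y-%m-%dT%H:%M:%S.%fZ")
    moment = moment.replace(tzinfo=timezone.utc)
    return int(moment.timestamp()) * 1000 + moment.microsecond // 1000


print(to_iso8601(1583967620343))                   # 2020-03-11T23:00:20.343Z
print(from_iso8601("2020-03-11T23:00:20.343Z"))    # 1583967620343
```

The two values are the before and after timestamps from the example in this note, so the round trip can be checked against it directly.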
1619
1620
1621 ---
1622
1623 * [HBASE-22827](https://issues.apache.org/jira/browse/HBASE-22827) | *Major* | **Expose multi-region merge in shell and Admin API**
1624
The merge\_region shell command can now be used to merge more than 2 regions. It takes a list of regions, either as comma-separated values or as an array of regions, rather than just 2 regions. Both full region names and encoded region names continue to be accepted.
1626
1627
1628 ---
1629
1630 * [HBASE-23767](https://issues.apache.org/jira/browse/HBASE-23767) | *Major* | **Add JDK11 compilation and unit test support to Github precommit**
1631
1632 Rebuild our Dockerfile with support for multiple JDK versions. Use multiple stages in the Jenkinsfile instead of yetus's multijdk because of YETUS-953. Run those multiple stages in parallel to speed up results.
1633
1634 Note that multiple stages means multiple Yetus invocations means multiple comments on the PreCommit. This should become more obvious to users once we can make use of GitHub Checks API, HBASE-23902.
1635
1636
1637 ---
1638
1639 * [HBASE-22978](https://issues.apache.org/jira/browse/HBASE-22978) | *Minor* | **Online slow response log**
1640
1641 get\_slowlog\_responses and clear\_slowlog\_responses are used to retrieve and clear slow RPC logs from RingBuffer maintained by RegionServers.
1642
1643 New Admin APIs:
1644 1.   List\<SlowLogRecord\> getSlowLogResponses(final Set\<ServerName\> serverNames,
1645       final SlowLogQueryFilter slowLogQueryFilter) throws IOException;
1646
1647 2.   List\<Boolean\> clearSlowLogResponses(final Set\<ServerName\> serverNames)
1648       throws IOException;
1649
1650 Configs:
1651
1652 1. hbase.regionserver.slowlog.ringbuffer.size:
1653 Default size of ringbuffer to be maintained by each RegionServer in order to store online slowlog responses. This is an in-memory ring buffer of requests that were judged to be too slow in addition to the responseTooSlow logging. The in-memory representation would be complete. For more details, please look into Doc Section: Get Slow Response Log from shell
1654
1655 Default
1656 256
1657
1658 2. hbase.regionserver.slowlog.buffer.enabled:
Indicates whether RegionServers have a ring buffer running for storing online slow logs in FIFO manner with limited entries. The size of the ring buffer is set by the config hbase.regionserver.slowlog.ringbuffer.size. The default value is false; turn this on to get the latest slowlog responses with complete data.
1660
1661 Default
1662 false
1663
1664
1665 For more details, please look into "Get Slow Response Log from shell" section from HBase book.
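
The "FIFO manner with limited entries" behaviour can be pictured with a bounded queue. The Python sketch below models only that eviction behaviour and is not how HBase implements its ring buffer; the record layout and the count of 300 calls are made up for illustration.

```python
from collections import deque

RING_BUFFER_SIZE = 256  # default of hbase.regionserver.slowlog.ringbuffer.size

# A deque with maxlen drops the oldest entry once it is full.
slow_log = deque(maxlen=RING_BUFFER_SIZE)
for call_id in range(300):  # 300 hypothetical slow calls, oldest first
    slow_log.append({"call_id": call_id})

print(len(slow_log))            # 256
print(slow_log[0]["call_id"])   # 44: the 44 oldest records were evicted
print(slow_log[-1]["call_id"])  # 299: the newest record is always kept
```

Only the most recent records survive in the buffer, which is why persisting to hbase:slowlog (HBASE-23938) is the option for keeping history.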
1666
1667
1668 ---
1669
1670 * [HBASE-23926](https://issues.apache.org/jira/browse/HBASE-23926) | *Major* | **[Flakey Tests] Down the flakies re-run ferocity; it makes for too many fails.**
1671
Down the flakey re-run fork count from 1.0C (i.e. a fork per CPU) to 0.25C. On a recent run, the machine had 16 cores, so 0.25C is 4 forks. We'd hardcoded the fork count at 3 prior to the changes made by the parent.
1673
1674
1675 ---
1676
1677 * [HBASE-23146](https://issues.apache.org/jira/browse/HBASE-23146) | *Major* | **Support CheckAndMutate with multiple conditions**
1678
1679 Add a checkAndMutate(row, filter) method in the AsyncTable interface and the Table interface.
1680
1681 This method atomically checks if the row matches the specified filter. If it does, it adds the Put/Delete/RowMutations.
1682
1683 This is a fluent-style API. The code looks like:
1684
1685 For Table interface:
1686 {code}
1687 table.checkAndMutate(row, filter).thenPut(put);
1688 {code}
1689
1690 For AsyncTable interface:
1691 {code}
1692 table.checkAndMutate(row, filter).thenPut(put)
1693     .thenAccept(succ -\> {
1694       if (succ) {
1695         System.out.println("Check and put succeeded");
1696       } else {
1697         System.out.println("Check and put failed");
1698       }
1699     });
1700 {code}
1701
1702
1703 ---
1704
1705 * [HBASE-23874](https://issues.apache.org/jira/browse/HBASE-23874) | *Minor* | **Move Jira-attached file precommit definition from script in Jenkins config to dev-support**
1706
1707 The Jira Precommit job (https://builds.apache.org/job/PreCommit-HBASE-Build/) will now look for a file within the source tree (dev-support/jenkins\_precommit\_jira\_yetus.sh) instead of depending on a script section embedded in the job.
1708
1709
1710 ---
1711
1712 * [HBASE-23865](https://issues.apache.org/jira/browse/HBASE-23865) | *Major* | **Up flakey history from 5 to 10**
1713
1714 Changed flakey list reporting to show 10 rather than 5 items. Also changed the first and second part fork counts to be 1C rather than hardcoded 3.
1715
1716
1717 ---
1718
1719 * [HBASE-23554](https://issues.apache.org/jira/browse/HBASE-23554) | *Major* | **Encoded regionname to regionname utility**
1720
1721     Adds shell command regioninfo:
1722
1723       hbase(main):001:0\>  regioninfo '0e6aa5c19ae2b2627649dc7708ce27d0'
1724       {ENCODED =\> 0e6aa5c19ae2b2627649dc7708ce27d0, NAME =\> 'TestTable,,1575941375972.0e6aa5c19ae2b2627649dc7708ce27d0.', STARTKEY =\> '', ENDKEY =\> '00000000000000000000299441'}
1725       Took 0.4737 seconds
1726
1727
1728 ---
1729
1730 * [HBASE-23350](https://issues.apache.org/jira/browse/HBASE-23350) | *Major* | **Make compaction files cacheonWrite configurable based on threshold**
1731
1732 This JIRA adds a new configuration - \`hbase.rs.cachecompactedblocksonwrite.threshold\`. This configuration is the maximum total size (in bytes) of the compacted files below which the configuration \`hbase.rs.cachecompactedblocksonwrite\` is honoured. If the total size of the compacted files exceeds this threshold, even when \`hbase.rs.cachecompactedblocksonwrite\` is enabled, the data blocks are not cached. Caching index and bloom blocks is not affected by this configuration (user configuration is always honoured).
1733
1734 Default value of this configuration is Long.MAX\_VALUE. This means that the data blocks will be cached whatever the total size of the compacted files.
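
The interaction between the switch and the threshold can be summarised in a short sketch. The class and method names are hypothetical and this is not the HBase code path; it only restates the rule above, treating a total size equal to the threshold as still cacheable (an assumption).

```java
// Hypothetical illustration of the rule described above; not HBase's implementation.
final class CompactionCacheSketch {
  // Data blocks are cached on write only when the switch is on and the
  // total compacted size does not exceed the threshold.
  static boolean shouldCacheDataBlocks(boolean cacheCompactedBlocksOnWrite,
      long totalCompactedBytes, long thresholdBytes) {
    return cacheCompactedBlocksOnWrite && totalCompactedBytes <= thresholdBytes;
  }

  public static void main(String[] args) {
    long oneGiB = 1024L * 1024L * 1024L;
    // With the default threshold (Long.MAX_VALUE) the switch alone decides.
    System.out.println(shouldCacheDataBlocks(true, 50 * oneGiB, Long.MAX_VALUE)); // true
    // With a 1 GiB threshold a 50 GiB compaction is not cached.
    System.out.println(shouldCacheDataBlocks(true, 50 * oneGiB, oneGiB)); // false
  }
}
```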
1735
1736
1737 ---
1738
1739 * [HBASE-17115](https://issues.apache.org/jira/browse/HBASE-17115) | *Major* | **HMaster/HRegion Info Server does not honour admin.acl**
1740
1741 Implements authorization for the HBase Web UI by limiting access to certain endpoints which could be used to extract sensitive information from HBase.
1742
1743 Access to these restricted endpoints can be limited to a group of administrators, identified either by a list of users (hbase.security.authentication.spnego.admin.users) or by a list of groups
1744 (hbase.security.authentication.spnego.admin.groups).  By default, neither of these values is set, which preserves backwards compatibility (allowing all authenticated users to access all endpoints).
1745
1746 Further, users who have sensitive information in the HBase service configuration can set hbase.security.authentication.ui.config.protected to true which will treat the configuration endpoint as a protected, admin-only resource. By default, all authenticated users may access the configuration endpoint.
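
The access rule for restricted endpoints can be sketched as follows. This is a minimal illustration under stated assumptions, not the HBase authorization code: the class `AdminAclSketch` and its method are hypothetical, and the two sets stand in for the values of hbase.security.authentication.spnego.admin.users and hbase.security.authentication.spnego.admin.groups.

```java
import java.util.Arrays;
import java.util.Collections;
import java.util.HashSet;
import java.util.Set;

// Hypothetical illustration of the admin check for an already authenticated user.
final class AdminAclSketch {
  static boolean mayAccessRestrictedEndpoint(String user, Set<String> userGroups,
      Set<String> adminUsers, Set<String> adminGroups) {
    // Neither list configured: backwards-compatible behaviour, everyone authenticated gets in.
    if (adminUsers.isEmpty() && adminGroups.isEmpty()) {
      return true;
    }
    if (adminUsers.contains(user)) {
      return true;
    }
    for (String group : userGroups) {
      if (adminGroups.contains(group)) {
        return true;
      }
    }
    return false;
  }

  public static void main(String[] args) {
    Set<String> none = Collections.emptySet();
    Set<String> admins = new HashSet<>(Arrays.asList("alice"));
    Set<String> ops = new HashSet<>(Arrays.asList("ops"));
    System.out.println(mayAccessRestrictedEndpoint("bob", none, none, none));   // true
    System.out.println(mayAccessRestrictedEndpoint("bob", none, admins, none)); // false
    System.out.println(mayAccessRestrictedEndpoint("bob", ops, admins, ops));   // true
  }
}
```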
1747
1748
1749 ---
1750
1751 * [HBASE-23647](https://issues.apache.org/jira/browse/HBASE-23647) | *Major* | **Make MasterRegistry the default registry impl**
1752
1753 <!-- markdown -->
1754 Enables master based registry as the default registry used by clients to fetch connection metadata.
1755 Refer to the section "Master Registry" in the client documentation for more details and advantages
1756 of this implementation over the default Zookeeper based registry.
1757
1758 Configuration parameter that controls the registry in use: `hbase.client.registry.impl`
1759
1760 Where to set this: HBase client configuration (hbase-site.xml)
1761
1762 Possible values:
1763 - `org.apache.hadoop.hbase.client.ZKConnectionRegistry` (For ZK based registry implementation)
1764 - `org.apache.hadoop.hbase.client.MasterRegistry` (New, for master based registry implementation)
1765
1766 Notes on defaults:
1767
1768 - For v3.0.0 and later, MasterRegistry is the default registry
1769 - For all releases in 2.x line, ZK based registry is the default.
1770
1771 This feature has been back ported to 2.3.0 and later releases. MasterRegistry can be enabled by setting the following client configuration.
1772
1773 ```
1774 <property>
1775   <name>hbase.client.registry.impl</name>
1776   <value>org.apache.hadoop.hbase.client.MasterRegistry</value>
1777 </property>
1778 ```
1779
1780
1781 ---
1782
1783 * [HBASE-23069](https://issues.apache.org/jira/browse/HBASE-23069) | *Critical* | **periodic dependency bump for Sep 2019**
1784
1785 caffeine: 2.6.2 =\> 2.8.1
1786 commons-codec: 1.10 =\> 1.13
1787 commons-io: 2.5 =\> 2.6
1788 disruptor: 3.3.6 =\> 3.4.2
1789 httpcore: 4.4.6 =\> 4.4.13
1790 jackson: 2.9.10 =\> 2.10.1
1791 jackson.databind: 2.9.10.1 =\> 2.10.1
1792 jetty: 9.3.27.v20190418 =\> 9.3.28.v20191105
1793 protobuf.plugin: 0.5.0 =\> 0.6.1
1794 zookeeper: 3.4.10 =\> 3.4.14
1795 slf4j: 1.7.25 =\> 1.7.30
1796 rat: 0.12 =\> 0.13
1797 asciidoctor: 1.5.5 =\> 1.5.8
1798 asciidoctor.pdf: 1.5.0-alpha.15 =\> 1.5.0-rc.2
1799 error-prone: 2.3.3 =\> 2.3.4
1800
1801
1802 ---
1803
1804 * [HBASE-23686](https://issues.apache.org/jira/browse/HBASE-23686) | *Major* | **Revert binary incompatible change and remove reflection**
1805
1806 - Reverts a binary incompatible change for ByteRangeUtils
1807 - Usage of reflection inside CommonFSUtils removed
1808
1809
1810 ---
1811
1812 * [HBASE-23347](https://issues.apache.org/jira/browse/HBASE-23347) | *Major* | **Pluggable RPC authentication**
1813
1814 This change introduces an internal abstraction layer which allows for new SASL-based authentication mechanisms to be used inside HBase services. All existing SASL-based authentication mechanisms were ported to the new abstraction, making no external change in runtime semantics, client API, or RPC serialization format.
1815
1816 Developers familiar with extending HBase can implement authentication mechanisms beyond simple Kerberos and DelegationTokens which authenticate HBase users against some other user database. HBase service authentication (Master to/from RegionServer) continues to operate solely over Kerberos.
1817
1818
1819 ---
1820
1821 * [HBASE-23156](https://issues.apache.org/jira/browse/HBASE-23156) | *Major* | **start-hbase.sh failed with ClassNotFoundException when build with hadoop3**
1822
1823 Introduces a new hbase-assembly/src/main/assembly/hadoop-three-compat.xml for builds with Hadoop 3.x.
1824
1825
1826 ---
1827
1828 * [HBASE-23680](https://issues.apache.org/jira/browse/HBASE-23680) | *Major* | **RegionProcedureStore missing cleaning of hfile archive**
1829
1830 Add a new config to hbase-default.xml
1831
1832   \<property\>
1833     \<name\>hbase.procedure.store.region.hfilecleaner.plugins\</name\>
1834     \<value\>org.apache.hadoop.hbase.master.cleaner.TimeToLiveHFileCleaner\</value\>
1835     \<description\>A comma-separated list of BaseHFileCleanerDelegate invoked by
1836     the RegionProcedureStore HFileCleaner service. These HFiles cleaners are
1837     called in order, so put the cleaner that prunes the most files in front. To
1838     implement your own BaseHFileCleanerDelegate, just put it in HBase's classpath
1839     and add the fully qualified class name here. Always add the above
1840     default hfile cleaners in the list as they will be overwritten in
1841     hbase-site.xml.\</description\>
1842   \</property\>
1843
1844 It will share the same TTL with other HFileCleaners. And you can also implement your own cleaner and change this property to enable it.
1845
1846
1847 ---
1848
1849 * [HBASE-23675](https://issues.apache.org/jira/browse/HBASE-23675) | *Minor* | **Move to Apache parent POM version 22**
1850
1851 Updated parent pom to Apache version 22.
1852
1853
1854 ---
1855
1856 * [HBASE-23679](https://issues.apache.org/jira/browse/HBASE-23679) | *Critical* | **FileSystem instance leaks due to bulk loads with Kerberos enabled**
1857
1858 This issue fixes a problem with Bulk Loading on installations with Kerberos enabled and more than a single RegionServer. When multiple RegionServers are involved in hosting a table's regions which are being bulk-loaded into, all but the RegionServer hosting the table's first Region will "leak" one DistributedFileSystem object onto the heap, never freeing that memory. Eventually, with enough bulk loads, this will create a situation for RegionServers where they have no free heap space and will either spend all time in JVM GC, lose their ZK session, or crash with an OutOfMemoryError.
1859
1860 The only mitigation for this issue is to periodically restart RegionServers. All earlier versions of HBase 2.x are subject to this issue (2.0.x, \<=2.1.8, \<=2.2.3)
1861
1862
1863 ---
1864
1865 * [HBASE-23286](https://issues.apache.org/jira/browse/HBASE-23286) | *Major* | **Improve MTTR: Split WAL to HFile**
1866
1867 Add a new feature to improve MTTR, which has 3 steps to failover:
1868 1. Read WAL and write HFile to region’s column family’s recovered.hfiles directory.
1869 2. Open region.
1870 3. Bulkload the recovered.hfiles for every column family.
1871
1872 Compared to DLS(distributed log split), this feature will reduce region open time significantly.
1873
1874 Set hbase.wal.split.to.hfile to true to enable this feature.
1875
1876
1877 ---
1878
1879 * [HBASE-23619](https://issues.apache.org/jira/browse/HBASE-23619) | *Trivial* | **Use built-in formatting for logging in hbase-zookeeper**
1880
1881 Changed the logging in hbase-zookeeper to use built-in formatting
1882
1883
1884 ---
1885
1886 * [HBASE-23628](https://issues.apache.org/jira/browse/HBASE-23628) | *Minor* | **Replace Apache Commons Digest Base64 with JDK8 Base64**
1887
1888 From the PR:
1889
1890 "Yes. The two create the same output... I just wrote a small test suite to increase my confidence on that. I generated many tens of millions of random byte patterns and compared the output of the two algorithms. They came back identical every time.
1891
1892 "Just in case any inquiring minds would like to know, there is no longer an encoding required when generating the strings. The JDK implementation specifically specifies that strings returned are StandardCharsets.ISO\_8859\_1. This does not change anything because UTF8 and ISO\_8859 overlap for the limited character set (64 characters) the encoding uses."
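
The point about character sets can be checked with the JDK alone. The following sketch (the class name `Base64CharsetSketch` is only illustrative) encodes a few arbitrary bytes with `java.util.Base64` and decodes the resulting ASCII bytes as both ISO-8859-1 and UTF-8; because the Base64 alphabet is pure ASCII, the two strings are equal.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;
import java.util.Base64;

// Illustrative check that Base64 output reads the same under ISO-8859-1 and UTF-8.
final class Base64CharsetSketch {
  // Encodes the data with the JDK encoder and interprets the encoded bytes in the given charset.
  static String encodeAs(byte[] data, Charset charset) {
    byte[] encoded = Base64.getEncoder().encode(data);
    return new String(encoded, charset);
  }

  public static void main(String[] args) {
    byte[] data = {0, (byte) 0xFF, 0x10, (byte) 0x80, 0x7F};
    String latin1 = encodeAs(data, StandardCharsets.ISO_8859_1);
    String utf8 = encodeAs(data, StandardCharsets.UTF_8);
    System.out.println(latin1.equals(utf8)); // true
  }
}
```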
1893
1894
1895 ---
1896
1897 * [HBASE-23651](https://issues.apache.org/jira/browse/HBASE-23651) | *Major* | **Region balance throttling can be disabled**
1898
1899 Setting hbase.balancer.max.balancing to an int value \<= 0 will disable region balance throttling.
1900
1901
1902 ---
1903
1904 * [HBASE-23588](https://issues.apache.org/jira/browse/HBASE-23588) | *Major* | **Cache index blocks and bloom blocks on write if CacheCompactedBlocksOnWrite is enabled**
1905
1906 If cacheOnWrite is enabled during flush or compaction, index and bloom blocks(with data blocks) would be automatically cached during write.
1907
1908
1909 ---
1910
1911 * [HBASE-23369](https://issues.apache.org/jira/browse/HBASE-23369) | *Major* | **Auto-close 'unknown' Regions reported as OPEN on RegionServers**
1912
1913 If a RegionServer reports a Region as OPEN in disagreement with Master's status on the Region, the Master now tells the RegionServer to silently close the Region.
1914
1915
1916 ---
1917
1918 * [HBASE-23596](https://issues.apache.org/jira/browse/HBASE-23596) | *Major* | **HBCKServerCrashProcedure can double assign**
1919
1920 Makes it so the recently added HBCKServerCrashProcedure -- the SCP that gets invoked when an operator schedules an SCP via hbck2 scheduleRecoveries command -- now works the same as SCP EXCEPT if master knows nothing of the scheduled servername. In this latter case, HBCKSCP will do a full scan of hbase:meta looking for instances of the passed servername. If any are found, it will attempt cleanup of hbase:meta references by reassigning any found OPEN or OPENING and by closing any in CLOSING state.
1921
1922 Used to fix instances of what the 'HBCK Report' page shows as 'Unknown Servers'.
1923
1924
1925 ---
1926
1927 * [HBASE-23624](https://issues.apache.org/jira/browse/HBASE-23624) | *Major* | **Add a tool to dump the procedure info in HFile**
1928
1929 Use ./hbase org.apache.hadoop.hbase.procedure2.store.region.HFileProcedurePrettyPrinter to run the tool.
1930
1931
1932 ---
1933
1934 * [HBASE-23590](https://issues.apache.org/jira/browse/HBASE-23590) | *Major* | **Update maxStoreFileRefCount to maxCompactedStoreFileRefCount**
1935
1936 RegionsRecoveryChore introduced as part of HBASE-22460 tries to reopen regions based on config: hbase.regions.recovery.store.file.ref.count.
1937 Region reopen needs to take into consideration all compacted away store files that belong to the region and not store files(non-compacted).
1938
1939 Fixed this bug as part of this Jira.
1940 Updated description for corresponding configs:
1941
1942 1. hbase.master.regions.recovery.check.interval :
1943
1944 Regions Recovery Chore interval in milliseconds. This chore keeps running at this interval to find all regions with configurable max store file ref count and reopens them. Defaults to 20 mins
1945
1946 2. hbase.regions.recovery.store.file.ref.count :
1947
1948 A very large ref count on a compacted store file indicates a ref leak on that object (the compacted store file). Such files cannot be removed after they are invalidated via compaction. The only way to recover in such a scenario is to reopen the region, which can release all resources, like the refcount, leases, etc. This config represents the store files ref count threshold value considered for reopening regions. Any region with a compacted store files ref count \> this value would be eligible for reopening by master. Here, we get the max refCount among all refCounts on all compacted away store files that belong to a particular region. Default value -1 indicates this feature is turned off. Only a positive integer value should be provided to enable this feature.
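
The eligibility rule described for hbase.regions.recovery.store.file.ref.count can be sketched like this. The class and method are hypothetical and are not taken from RegionsRecoveryChore; treating every non-positive threshold as "feature off" is an assumption based on the note that only positive values enable it.

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical illustration of the reopen decision; not the RegionsRecoveryChore code.
final class RegionReopenSketch {
  // A region qualifies when the largest ref count among its compacted-away
  // store files is strictly greater than a positive threshold.
  static boolean eligibleForReopen(List<Integer> compactedStoreFileRefCounts, int threshold) {
    if (threshold <= 0) {
      return false; // assumption: -1 (the default) and other non-positive values disable the check
    }
    int max = 0;
    for (int refCount : compactedStoreFileRefCounts) {
      max = Math.max(max, refCount);
    }
    return max > threshold;
  }

  public static void main(String[] args) {
    List<Integer> refCounts = Arrays.asList(2, 310, 7);
    System.out.println(eligibleForReopen(refCounts, -1));  // false
    System.out.println(eligibleForReopen(refCounts, 256)); // true
    System.out.println(eligibleForReopen(refCounts, 310)); // false
  }
}
```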
1949
1950
1951 ---
1952
1953 * [HBASE-23618](https://issues.apache.org/jira/browse/HBASE-23618) | *Major* | **Add a tool to dump procedure info in the WAL file**
1954
1955 Use ./hbase org.apache.hadoop.hbase.procedure2.store.region.WALProcedurePrettyPrinter to run the tool.
1956
1957
1958 ---
1959
1960 * [HBASE-23617](https://issues.apache.org/jira/browse/HBASE-23617) | *Major* | **Add a stress test tool for region based procedure store**
1961
1962 Use ./hbase org.apache.hadoop.hbase.procedure2.store.region.RegionProcedureStorePerformanceEvaluation to run the tool.
1963
1964
1965 ---
1966
1967 * [HBASE-23326](https://issues.apache.org/jira/browse/HBASE-23326) | *Critical* | **Implement a ProcedureStore which stores procedures in a HRegion**
1968
1969 Use a region based procedure store to replace the old customized WAL based procedure store. The procedure data migration is done automatically during the upgrade. After upgrading, the MasterProcWALs directory will be deleted and a new MasterProc directory will be created. Note that a region still writes a WAL, so we still have WAL files, and they will be moved to the oldWALs directory. The file name is mostly like a normal WAL file; the only difference is that it ends with "$masterproc$".
1970
1971
1972 ---
1973
1974 * [HBASE-23320](https://issues.apache.org/jira/browse/HBASE-23320) | *Major* | **Upgrade surefire plugin to 3.0.0-M4**
1975
1976 Bumped surefire plugin to 3.0.0-M4
1977
1978
1979 ---
1980
1981 * [HBASE-20461](https://issues.apache.org/jira/browse/HBASE-20461) | *Major* | **Implement fsync for AsyncFSWAL**
1982
1983 Now AsyncFSWAL also supports Durability.FSYNC\_WAL.
1984
1985
1986 ---
1987
1988 * [HBASE-23066](https://issues.apache.org/jira/browse/HBASE-23066) | *Minor* | **Create a config that forces to cache blocks on compaction**
1989
1990 The configuration 'hbase.rs.cacheblocksonwrite' was used to enable caching the blocks on write. But purposefully we were not caching the blocks when we do compaction (since it may be very aggressive) as the caching happens as and when the writer completes a block.
1991 In cloud environments since they have bigger sized caches - though they try to enable 'hbase.rs.prefetchblocksonopen' (non - aggressive way of caching the blocks proactively on reader creation) it does not help them because it takes time to cache the compacted blocks.
1992 This feature creates a new configuration 'hbase.rs.cachecompactedblocksonwrite' which when set to 'true' will enable caching of the blocks created out of compaction.
1993 Remember that since it is aggressive caching the user should be having enough cache space - if not it may lead to other active blocks getting evicted.
1994 From the shell this can be enabled by using the option per Column Family also by using the below format
1995 {code}
1996 create 't1', 'f1', {NUMREGIONS =\> 15, SPLITALGO =\> 'HexStringSplit', CONFIGURATION =\> {'hbase.rs.cachecompactedblocksonwrite' =\> 'true'}}
1997 {code}
1998
1999
2000 ---
2001
2002 * [HBASE-23239](https://issues.apache.org/jira/browse/HBASE-23239) | *Major* | **Reporting on status of backing MOB files from client-facing cells**
2003
2004 <!-- markdown -->
2005
2006 Users of the MOB feature can now use the `mobrefs` utility to get statistics about data in the MOB system and verify the health of backing files on HDFS.
2007
2008 ```
2009 HADOOP_CLASSPATH=/etc/hbase/conf:$(hbase mapredcp) yarn jar \
2010     /some/path/to/hbase-shaded-mapreduce.jar mobrefs mobrefs-report-output some_table foo
2011 ```
2012
2013 See javadocs of the class `MobRefReporter` for more details.
2014
2015 The reference guide has added some information about MOB internals and troubleshooting.
2016
2017
2018 ---
2019
2020 * [HBASE-23549](https://issues.apache.org/jira/browse/HBASE-23549) | *Minor* | **Document steps to disable MOB for a column family**
2021
2022 The reference guide now includes a walk through of disabling the MOB feature if needed while maintaining availability.
2023
2024
2025 ---
2026
2027 * [HBASE-23582](https://issues.apache.org/jira/browse/HBASE-23582) | *Minor* | **Unbalanced braces in string representation of table descriptor**
2028
2029 Fixed unbalanced braces in string representation within HBase shell
2030
2031
2032 ---
2033
2034 * [HBASE-23293](https://issues.apache.org/jira/browse/HBASE-23293) | *Minor* | **[REPLICATION] make ship edits timeout configurable**
2035
2036 The default rpc timeout for ReplicationSourceShipper#shipEdits is 60s; when bulkload replication is enabled, a timeout exception may occur.
2037 The timeout value can now be configured through replication.source.shipedits.timeout, and it is adaptive.
2038
2039
2040 ---
2041
2042 * [HBASE-23312](https://issues.apache.org/jira/browse/HBASE-23312) | *Major* | **HBase Thrift SPNEGO configs (HBASE-19852) should be backwards compatible**
2043
2044 The newer HBase Thrift SPNEGO configs should not be required. The hbase.thrift.spnego.keytab.file and hbase.thrift.spnego.principal configs will fall back to the original hbase.thrift.keytab.file and hbase.thrift.kerberos.principal configs. The older configs will log a deprecation warning. It is preferred to use the newer SPNEGO configurations.
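
The fallback order can be illustrated with a plain map standing in for the configuration. This is a sketch, not the Thrift server code: `SpnegoConfigFallbackSketch` and `resolve` are hypothetical names, and the warning text is made up for the example; only the four property names come from the note above.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical illustration of "prefer the new key, fall back to the old one with a warning".
final class SpnegoConfigFallbackSketch {
  static String resolve(Map<String, String> conf, String newKey, String oldKey) {
    String value = conf.get(newKey);
    if (value != null) {
      return value;
    }
    String legacy = conf.get(oldKey);
    if (legacy != null) {
      // Illustrative message only; the real log line may differ.
      System.err.println("WARN: " + oldKey + " is deprecated, prefer " + newKey);
    }
    return legacy;
  }

  public static void main(String[] args) {
    Map<String, String> conf = new HashMap<>();
    conf.put("hbase.thrift.keytab.file", "/etc/security/thrift.keytab");
    conf.put("hbase.thrift.spnego.principal", "HTTP/_HOST@EXAMPLE.COM");
    conf.put("hbase.thrift.kerberos.principal", "thrift/_HOST@EXAMPLE.COM");
    // Only the old keytab key is set, so it is used (with a warning).
    System.out.println(resolve(conf, "hbase.thrift.spnego.keytab.file", "hbase.thrift.keytab.file"));
    // Both principal keys are set, so the newer SPNEGO key wins.
    System.out.println(resolve(conf, "hbase.thrift.spnego.principal", "hbase.thrift.kerberos.principal"));
  }
}
```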
2045
2046
2047 ---
2048
2049 * [HBASE-22969](https://issues.apache.org/jira/browse/HBASE-22969) | *Minor* | **A new binary component comparator(BinaryComponentComparator) to perform comparison of arbitrary length and position**
2050
2051 With BinaryComponentComparator, applications will be able to design a diverse and powerful set of filters for rows and columns. See https://issues.apache.org/jira/browse/HBASE-22969 for an example. In general, the comparator can be used with any filter taking ByteArrayComparable. As of now, the following filters take ByteArrayComparable:
2052
2053 1. RowFilter
2054 2. ValueFilter
2055 3. QualifierFilter
2056 4. FamilyFilter
2057 5. ColumnValueFilter
2058
2059
2060 ---
2061
2062 * [HBASE-23234](https://issues.apache.org/jira/browse/HBASE-23234) | *Major* | **Provide .editorconfig based on checkstyle configuration**
2063
2064 Adds a .editorconfig file with configurations populated by IntelliJ, based on our checkstyle configuration. There's lots of IntelliJ-specific configs in here that I assume are not replicated to Eclipse or Netbeans users. Any devs using those tools should push whatever updates they see fit, but please start with the checkstyle configs as the origin of truth.
2065
2066
2067 ---
2068
2069 * [HBASE-23322](https://issues.apache.org/jira/browse/HBASE-23322) | *Minor* | **[hbck2] Simplification on HBCKSCP scheduling**
2070
2071 An hbck2 scheduleRecoveries will run a subclass of ServerCrashProcedure which asks Master what Regions were on the dead Server but it will also do a hbase:meta table scan to see if any vestiges of the old Server remain (for the case where an SCP failed mid-point leaving references in place or where Master and hbase:meta deviated in accounting).
2072
2073
2074 ---
2075
2076 * [HBASE-23321](https://issues.apache.org/jira/browse/HBASE-23321) | *Minor* | **[hbck2] fixHoles of fixMeta doesn't update in-memory state**
2077
2078 If there are holes in hbase:meta, hbck2 fixMeta will now update Master in-memory state so you do not need to restart master just so you can assign the new hole-bridging regions.
2079
2080
2081 ---
2082
2083 * [HBASE-23282](https://issues.apache.org/jira/browse/HBASE-23282) | *Major* | **HBCKServerCrashProcedure for 'Unknown Servers'**
2084
2085 hbck2 scheduleRecoveries will now run a SCP that also looks in hbase:meta for any references to the scheduled server -- not just consult Master in-memory state -- just in case vestiges of the server are leftover in hbase:meta
2086
2087
2088 ---
2089
2090 * [HBASE-19450](https://issues.apache.org/jira/browse/HBASE-19450) | *Minor* | **Add log about average execution time for ScheduledChore**
2091
2092 <!-- markdown -->
2093 HBase internal chores now log a moving average of how long execution of each chore takes at `INFO` level for the logger `org.apache.hadoop.hbase.ScheduledChore`.
2094
2095 Such messages will happen at most once per five minutes.
2096
2097
2098 ---
2099
2100 * [HBASE-23250](https://issues.apache.org/jira/browse/HBASE-23250) | *Minor* | **Log message about CleanerChore delegate initialization should be at INFO**
2101
2102 CleanerChore delegate initialization is now logged at INFO level instead of DEBUG
2103
2104
2105 ---
2106
2107 * [HBASE-23243](https://issues.apache.org/jira/browse/HBASE-23243) | *Major* | **[pv2] Filter out SUCCESS procedures; on decent-sized cluster, plethora overwhelms problems**
2108
2109 The 'Procedures & Locks' tab in Master UI only displays problematic Procedures now (RUNNABLE, WAITING-TIMEOUT, etc.). It no longer notes procedures whose state is SUCCESS.
2110
2111
2112 ---
2113
2114 * [HBASE-23227](https://issues.apache.org/jira/browse/HBASE-23227) | *Blocker* | **Upgrade jackson-databind to 2.9.10.1 to avoid recent CVEs**
2115
2116 <!-- markdown -->
2117
2118 The Apache HBase REST Proxy now uses Jackson Databind version 2.9.10.1 to address the following CVEs
2119
2120   - CVE-2019-16942
2121   - CVE-2019-16943
2122
2123 Users of prior releases with Jackson Databind 2.9.10 are advised to either upgrade to this release or to upgrade their local Jackson Databind jar directly.
2124
2125
2126 ---
2127
2128 * [HBASE-23222](https://issues.apache.org/jira/browse/HBASE-23222) | *Critical* | **Better logging and mitigation for MOB compaction failures**
2129
2130 <!-- markdown -->
2131
2132 The MOB compaction process in the HBase Master now logs more about its activity.
2133
2134 In the event that you run into the problems described in HBASE-22075, there is a new HFileCleanerDelegate that will stop all removal of MOB hfiles from the archive area. It can be configured by adding `org.apache.hadoop.hbase.mob.ManualMobMaintHFileCleaner` to the list configured for `hbase.master.hfilecleaner.plugins`. This new cleaner delegate will cause your archive area to grow unbounded; you will have to manually prune files which may be prohibitively complex. Consider if your use case will allow you to mitigate by disabling mob compactions instead.
2135
2136 Caveats:
2137 * Be sure the list of cleaner delegates still includes the default cleaners you will likely need: ttl, snapshot, and hlink.
2138 * Be mindful that if you enable this cleaner delegate then there will be *no* automated process for removing these mob hfiles. You should see a single region per table in `%hbase_root%/archive` that accumulates files over time. You will have to determine which of these files are safe or not to remove.
2139 * You should list this cleaner delegate after the snapshot and hlink delegates so that you can enable sufficient logging to determine when an archived mob hfile is needed by those subsystems. When set to `TRACE` logging, the CleanerChore logger will include archive retention decision justifications.
2140 * If your use case creates a large number of uniquely named tables, this new delegate will cause memory pressure on the master.
2141
2142
2143 ---
2144
2145 * [HBASE-15519](https://issues.apache.org/jira/browse/HBASE-15519) | *Major* | **Add per-user metrics**
2146
2147 Adds per-user metrics for reads/writes to each RegionServer. These metrics are exported by default. hbase.regionserver.user.metrics.enabled can be used to disable the feature if desired for any reason.
2148
2149
2150 ---
2151
2152 * [HBASE-22460](https://issues.apache.org/jira/browse/HBASE-22460) | *Minor* | **Reopen a region if store reader references may have leaked**
2153
2154 Leaked store files cannot be removed even after they are invalidated via compaction. A reasonable mitigation for a reader reference leak would be a fast reopen of the region on the same server.
2155
2156 Configs:
2157
2158 1. hbase.master.regions.recovery.check.interval :
2159
2160 Regions Recovery Chore interval in milliseconds. This chore keeps running at this interval to find all regions with configurable max store file ref count and reopens them. Defaults to 20 mins
2161
2162 2. hbase.regions.recovery.store.file.ref.count :
2163
2164 This config represents Store files Ref Count threshold value considered for reopening regions. Any region with store files ref count \> this value would be eligible for reopening by master. Default value -1 indicates this feature is turned off. Only positive integer value should be provided to enable this feature.
2165
2166
2167 ---
2168
2169 * [HBASE-23172](https://issues.apache.org/jira/browse/HBASE-23172) | *Minor* | **HBase Canary region success count metrics reflect column family successes, not region successes**
2170
2171 Added a comment to make clear that read/write success counts are tallying column family success counts, not region success counts.
2172
2173 Additionally, the region read and write latencies previously only stored the latencies of the last column family of the region reads/writes. This has been fixed by using a map of each region to a list of read and write latency values.
2174
2175
2176 ---
2177
2178 * [HBASE-23177](https://issues.apache.org/jira/browse/HBASE-23177) | *Major* | **If fail to open reference because FNFE, make it plain it is a Reference**
2179
2180 Changes the message on the FNFE exception thrown when the file a Reference points to is missing; the message now includes detail on the Reference as well as the pointed-to file so one can connect how the FNFE relates to region open.
2181
2182
2183 ---
2184
2185 * [HBASE-20626](https://issues.apache.org/jira/browse/HBASE-20626) | *Major* | **Change the value of "Requests Per Second" on WEBUI**
2186
2187 Use 'totalRowActionRequestCount' to calculate QPS on web UI.
2188
2189
2190 ---
2191
2192 * [HBASE-22874](https://issues.apache.org/jira/browse/HBASE-22874) | *Critical* | **Define a public interface for Canary and move existing implementation to LimitedPrivate**
2193
2194 <!-- markdown -->
2195 Downstream users who wish to programmatically check the health of their HBase cluster may now rely on a public interface derived from the previously private implementation of the canary cli tool. The interface is named `Canary` and can be found in the user facing javadocs.
2196
2197 Downstream users who previously relied on invoking the canary via the Java classname (either on the command line or programmatically) will need to change how they do so because the non-public implementation has moved.
2198
2199
2200 ---
2201
2202 * [HBASE-23035](https://issues.apache.org/jira/browse/HBASE-23035) | *Major* | **Retain region to the last RegionServer make the failover slower**
2203
2204 Since 2.0.0, when a regionserver crashes and comes back online, AssignmentManager retains the region locations and tries to assign the regions to this regionserver (same host:port as the crashed one) again. In 1.x.x, however, the regions belonging to the crashed regionserver are assigned round-robin. This jira changes the "retain" assignment to round-robin assignment, the same as in 1.x.x. This change makes failover faster and improves availability.
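
Round-robin assignment, as opposed to retaining the previous location, can be pictured with a small sketch. It is not the AssignmentManager or load balancer code: the class `RoundRobinAssignSketch`, its method, and the region and server names are hypothetical and only show regions being dealt out across the live servers in turn.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical illustration of round-robin assignment over the currently live servers.
final class RoundRobinAssignSketch {
  static Map<String, List<String>> assign(List<String> regions, List<String> liveServers) {
    if (liveServers.isEmpty()) {
      throw new IllegalArgumentException("no live servers to assign to");
    }
    Map<String, List<String>> plan = new LinkedHashMap<>();
    for (String server : liveServers) {
      plan.put(server, new ArrayList<String>());
    }
    // Deal the regions out one at a time, cycling through the servers.
    for (int i = 0; i < regions.size(); i++) {
      String server = liveServers.get(i % liveServers.size());
      plan.get(server).add(regions.get(i));
    }
    return plan;
  }

  public static void main(String[] args) {
    List<String> regions = Arrays.asList("r1", "r2", "r3", "r4", "r5");
    List<String> servers = Arrays.asList("rs1", "rs2");
    System.out.println(assign(regions, servers)); // {rs1=[r1, r3, r5], rs2=[r2, r4]}
  }
}
```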
2205
2206
2207 ---
2208
2209 * [HBASE-23046](https://issues.apache.org/jira/browse/HBASE-23046) | *Minor* | **Remove compatibility case from truncate command**
2210
2211 Remove backward compatibility from \`truncate\` and \`truncate\_preserve\` shell commands. This means that these commands from HBase Clients are not compatible with pre-0.99 HBase clusters.
2212
2213
2214 ---
2215
2216 * [HBASE-23040](https://issues.apache.org/jira/browse/HBASE-23040) | *Minor* | **region mover gives NullPointerException instead of saying a host isn't in the cluster**
2217
2218 Giving the region mover "unload" command a region server name that isn't recognized by the cluster results in an "I don't know about that host" message instead of an NPE.
2219
2220 Set log level to DEBUG if you'd like the region mover to log the set of region server names it got back from the cluster.
2221
2222
2223 ---
2224
2225 * [HBASE-21874](https://issues.apache.org/jira/browse/HBASE-21874) | *Major* | **Bucket cache on Persistent memory**
2226
2227 Added a new IOEngine type for Bucket cache, i.e. persistent memory. In order to use BC over pmem, configure IOEngine as:
2228   \<property\>
2229     \<name\>hbase.bucketcache.ioengine\</name\>
2230     \<value\> pmem:///path in persistent memory \</value\>
2231   \</property\>
2232
2233
2234 ---
2235
2236 * [HBASE-22760](https://issues.apache.org/jira/browse/HBASE-22760) | *Major* | **Stop/Resume Snapshot Auto-Cleanup activity with shell command**
2237
2238 By default, snapshot auto cleanup based on TTL would be enabled for any new cluster. At any point in time, if snapshot cleanup is supposed to be stopped due to some snapshot restore activity or any other reason, it is advisable to disable it using shell command:
2239 hbase\> snapshot\_cleanup\_switch false
2240
2241 We can re-enable it using:
2242 hbase\> snapshot\_cleanup\_switch true
2243
2244 We can query whether snapshot auto cleanup is enabled for cluster using:
2245 hbase\> snapshot\_cleanup\_enabled
2246
2247
2248 ---
2249
2250 * [HBASE-22796](https://issues.apache.org/jira/browse/HBASE-22796) | *Major* | **[HBCK2] Add fix of overlaps to fixMeta hbck Service**
2251
Adds fix of overlaps to the fixMeta hbck service method. Uses the bulk-merge facility. Merges a maximum of 10 at a time. Set hbase.master.metafixer.max.merge.count to a higher value if you want to do more than 10 in one go.
2253
2254
2255 ---
2256
2257 * [HBASE-21745](https://issues.apache.org/jira/browse/HBASE-21745) | *Critical* | **Make HBCK2 be able to fix issues other than region assignment**
2258
2259 This issue adds via its subtasks:
2260
2261  \* An 'HBCK Report' page to the Master UI added by HBASE-22527+HBASE-22709+HBASE-22723+ (since 2.1.6, 2.2.1, 2.3.0). Lists consistency or anomalies found via new hbase:meta consistency checking extensions added to CatalogJanitor (holes, overlaps, bad servers) and by a new 'HBCK chore' that runs at a lesser periodicity that will note filesystem orphans and overlaps as well as the following conditions:
2262  \*\* Master thought this region opened, but no regionserver reported it.
2263  \*\* Master thought this region opened on Server1, but regionserver reported Server2
 \*\* More than one regionserver reported this region as opened
2265  Both chores can be triggered from the shell to regenerate ‘new’ reports.
2266  \* Means of scheduling a ServerCrashProcedure (HBASE-21393).
2267  \* An ‘offline’ hbase:meta rebuild (HBASE-22680).
2268  \* Offline replace of hbase.version and hbase.id
2269  \* Documentation on how to use completebulkload tool to ‘adopt’ orphaned data found by new HBCK2 ‘filesystem’ check (see below) and ‘HBCK chore’ (HBASE-22859)
2270  \* A ‘holes’ and ‘overlaps’ fix that runs in the master that uses new bulk-merge facility to collapse many overlaps in the one go.
2271  \* hbase-operator-tools HBCK2 client tool got a bunch of additions:
2272  \*\* A specialized 'fix' for the case where operators ran old hbck 'offlinemeta' repair and destroyed their hbase:meta; it ties together holes in meta with orphaned data in the fs (HBASE-22567)
2273  \*\* A ‘filesystem’ command that reports on orphan data as well as bad references and hlinks with a ‘fix’ for the latter two options (based on hbck1 facility updated).
2274  \*\* Adds back the ‘replication’ fix facility from hbck1 (HBASE-22717)
2275
2276 The compound result is that hbck2 is now in excess of hbck1 abilities. The provided functionality is disaggregated as per the hbck2 philosophy of providing 'plumbing' rather than 'porcelain' so there is work to do still adding fix-it playbooks, scripting across outages, and automation.
2277
2278
2279 ---
2280
2281 * [HBASE-22802](https://issues.apache.org/jira/browse/HBASE-22802) | *Major* | **Avoid temp ByteBuffer allocation in FileIOEngine#read**
2282
HBASE-21879 introduced a utility class (org.apache.hadoop.hbase.io.ByteBuffAllocator) for allocating ByteBuffers from, and freeing them back to, an NIO ByteBuffer pool. When BucketCache is enabled with the file or mmap engine, reads now use this ByteBuffer pool, which avoids many temporary ByteBuffer allocations.
2284
2285
2286 ---
2287
2288 * [HBASE-11062](https://issues.apache.org/jira/browse/HBASE-11062) | *Major* | **hbtop**
2289
Introduces hbtop, a real-time monitoring tool for HBase modeled on Unix's top command. See the ref guide for details: https://hbase.apache.org/book.html#hbtop
2291
2292
2293 ---
2294
2295 * [HBASE-21879](https://issues.apache.org/jira/browse/HBASE-21879) | *Major* | **Read HFile's block to ByteBuffer directly instead of to byte for reducing young gc purpose**
2296
Before this issue, the read path was 100% offheap when the block was in the BucketCache. On a cache miss, however, the RS had to read the block via an on-heap API, which caused high young-GC pressure.

This issue adds reading the block offheap even when it is read directly from the filesystem. Offheap reads require hadoop version \>=2.9.3; with older hadoop versions everything still works, but blocks continue to be read onheap. It also requires HBASE-21946, which is not yet in place as of this writing/hbase-2.3.0.
2300
2301 We have written a careful doc about the implementation, performance and practice here: https://docs.google.com/document/d/1xSy9axGxafoH-Qc17zbD2Bd--rWjjI00xTWQZ8ZwI\_E/edit#heading=h.nch5d72p27ex
2302
2303
2304 ---
2305
2306 * [HBASE-22618](https://issues.apache.org/jira/browse/HBASE-22618) | *Major* | **added the possibility to load custom cost functions**
2307
2308 <!-- markdown -->
Extends `StochasticLoadBalancer` to support user-provided cost functions. These are loaded in addition to the default set of cost functions. Custom function implementations must extend `StochasticLoadBalancer$CostFunction`. Enable any additional functions by placing them on the master class path and configuring `hbase.master.balancer.stochastic.additionalCostFunctions` with a comma-separated list of fully-qualified class names.
2310
2311
2312 ---
2313
2314 * [HBASE-22867](https://issues.apache.org/jira/browse/HBASE-22867) | *Critical* | **The ForkJoinPool in CleanerChore will spawn thousands of threads in our cluster with thousands table**
2315
Replaces the ForkJoinPool in CleanerChore with a ThreadPoolExecutor, which limits the number of spawned threads and avoids frequent master GC. The replacement is internal to CleanerChore, so no config key changes; users can upgrade the HBase master without any other change.
2317
2318
2319 ---
2320
2321 * [HBASE-22810](https://issues.apache.org/jira/browse/HBASE-22810) | *Major* | **Initialize an separate ThreadPoolExecutor for taking/restoring snapshot**
2322
Introduced a new config key for snapshot taking/restoring operations on the master side: hbase.master.executor.snapshot.threads. Its default value is 3, which means up to 3 snapshot operations can run at the same time.
2324
2325
2326 ---
2327
2328 * [HBASE-22863](https://issues.apache.org/jira/browse/HBASE-22863) | *Major* | **Avoid Jackson versions and dependencies with known CVEs**
2329
1. Stopped exposing vulnerable Jackson1 dependencies so that downstream projects do not pull them in from HBase.
2. However, since Hadoop requires some Jackson1 dependencies, the vulnerable Jackson mapper is kept at test scope in some HBase modules; hence, the HBase tarball created by hbase-assembly contains the Jackson1 mapper jar in lib. Still, downstream applications can't pull in Jackson1 from HBase.
2332
2333
2334 ---
2335
2336 * [HBASE-22841](https://issues.apache.org/jira/browse/HBASE-22841) | *Major* | **TimeRange's factory functions do not support ranges, only \`allTime\` and \`at\`**
2337
Add several APIs to the TimeRange class to avoid using the deprecated TimeRange constructor:
2339 \* TimeRange#from: Represents the time interval [minStamp, Long.MAX\_VALUE)
2340 \* TimeRange#until: Represents the time interval [0, maxStamp)
2341 \* TimeRange#between: Represents the time interval [minStamp, maxStamp)
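
The half-open interval semantics listed above can be sketched with a small stand-in class. `HalfOpenRangeSketch` and its `contains` method are hypothetical names used only for illustration; this is not the `TimeRange` class itself, and its validation rules are an assumption.

```java
// Hypothetical stand-in that mirrors the interval semantics listed above;
// it is not the org.apache.hadoop.hbase.io.TimeRange class.
final class HalfOpenRangeSketch {
  private final long min; // inclusive lower bound
  private final long max; // exclusive upper bound

  private HalfOpenRangeSketch(long min, long max) {
    // Assumed validation for this sketch only.
    if (min < 0 || max < min) {
      throw new IllegalArgumentException("invalid range [" + min + ", " + max + ")");
    }
    this.min = min;
    this.max = max;
  }

  // [minStamp, Long.MAX_VALUE)
  static HalfOpenRangeSketch from(long minStamp) {
    return new HalfOpenRangeSketch(minStamp, Long.MAX_VALUE);
  }

  // [0, maxStamp)
  static HalfOpenRangeSketch until(long maxStamp) {
    return new HalfOpenRangeSketch(0L, maxStamp);
  }

  // [minStamp, maxStamp)
  static HalfOpenRangeSketch between(long minStamp, long maxStamp) {
    return new HalfOpenRangeSketch(minStamp, maxStamp);
  }

  // The lower bound is included and the upper bound is excluded.
  boolean contains(long timestamp) {
    return timestamp >= min && timestamp < max;
  }
}
```

For example, `HalfOpenRangeSketch.between(10, 20)` contains timestamp 10 but not timestamp 20.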
2342
2343
2344 ---
2345
2346 * [HBASE-22833](https://issues.apache.org/jira/browse/HBASE-22833) | *Minor* | **MultiRowRangeFilter should provide a method for creating a filter which is functionally equivalent to multiple prefix filters**
2347
Provides a public constructor in the MultiRowRangeFilter class for filtering with multiple row prefixes. MultiRowRangeFilter expands the row prefixes into multiple rowkey ranges, which is more efficient.
2349 {code}
2350 public MultiRowRangeFilter(byte[][] rowKeyPrefixes);
2351 {code}
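
One way to picture the expansion of a row prefix into a rowkey range is to compute the exclusive stop row for a given prefix. The helper below is a hypothetical sketch of that idea under the usual unsigned-byte ordering of row keys; it is not the code used inside MultiRowRangeFilter.

```java
import java.util.Arrays;

// Hypothetical helper; it illustrates the idea only and is not HBase code.
final class PrefixRangeSketch {
  // Returns the exclusive stop row for the range of rows that start with the prefix.
  // An empty array means "no upper bound" (empty prefix, or every byte is 0xFF).
  static byte[] stopRowForPrefix(byte[] prefix) {
    for (int i = prefix.length - 1; i >= 0; i--) {
      if (prefix[i] != (byte) 0xFF) {
        // Drop the trailing 0xFF bytes and increment the last remaining byte.
        byte[] stop = Arrays.copyOf(prefix, i + 1);
        stop[i]++;
        return stop;
      }
    }
    return new byte[0];
  }
}
```

Each prefix then corresponds to the range [prefix, stopRowForPrefix(prefix)), so a set of prefixes becomes a set of row ranges.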
2352
2353
2354 ---
2355
2356 * [HBASE-22856](https://issues.apache.org/jira/browse/HBASE-22856) | *Major* | **HBASE-Find-Flaky-Tests fails with pip error**
2357
2358 Update the base docker image to ubuntu 18.04 for the find flaky tests jenkins job.
2359
2360
2361 ---
2362
2363 * [HBASE-22771](https://issues.apache.org/jira/browse/HBASE-22771) | *Major* | **[HBCK2] fixMeta method and server-side support**
2364
2365 Adds a fixMeta method to hbck Service. Fixes holes in hbase:meta. Follow-up to fix overlaps. See HBASE-22567 also.
2366
2367 Follow-on is adding a client-side to hbase-operator-tools that can exploit this new addition (HBASE-22825)
2368
2369
2370 ---
2371
2372 * [HBASE-22777](https://issues.apache.org/jira/browse/HBASE-22777) | *Major* | **Add a multi-region merge (for fixing overlaps, etc.)**
2373
2374 Changes merge so you can merge more than two regions at a time.  Currently only available inside HBase. HBASE-22827, a follow-on, is about exposing the facility in the Admin API (and then via the shell).
2375
2376
2377 ---
2378
2379 * [HBASE-15666](https://issues.apache.org/jira/browse/HBASE-15666) | *Critical* | **shaded dependencies for hbase-testing-util**
2380
2381 New shaded artifact for testing: hbase-shaded-testing-util.
2382
2383
2384 ---
2385
2386 * [HBASE-22776](https://issues.apache.org/jira/browse/HBASE-22776) | *Major* | **Rename config names in user scan snapshot feature**
2387
After HBASE-22776, the steps to configure the user scan snapshot feature are as follows:
1. Check HDFS configuration
2. Add master coprocessor:
    hbase.coprocessor.master.classes="org.apache.hadoop.hbase.security.access.AccessController,org.apache.hadoop.hbase.security.access.SnapshotScannerHDFSAclController"
3. Enable this feature:
    hbase.acl.sync.to.hdfs.enable=true
4. Modify table schema to enable this feature for a table:
    alter 't1', CONFIGURATION =\> {'hbase.acl.sync.to.hdfs.enable' =\> 'true'}
2398
2399
2400 ---
2401
2402 * [HBASE-22539](https://issues.apache.org/jira/browse/HBASE-22539) | *Blocker* | **WAL corruption due to early DBBs re-use when Durability.ASYNC\_WAL is used**
2403
We found a critical bug which can lead to WAL corruption when Durability.ASYNC\_WAL is used. The reason is that we release a ByteBuffer before actually persisting its content into the WAL file.

The problem may lead to several errors, for example, an ArrayIndexOutOfBoundsException when replaying the WAL. This happens because the ByteBuffer has been reused by others.
2407
2408 ERROR org.apache.hadoop.hbase.executor.EventHandler: Caught throwable while processing event RS\_LOG\_REPLAY
2409 java.lang.ArrayIndexOutOfBoundsException: 18056
2410         at org.apache.hadoop.hbase.KeyValue.getFamilyLength(KeyValue.java:1365)
2411         at org.apache.hadoop.hbase.KeyValue.getFamilyLength(KeyValue.java:1358)
2412         at org.apache.hadoop.hbase.PrivateCellUtil.matchingFamily(PrivateCellUtil.java:735)
2413         at org.apache.hadoop.hbase.CellUtil.matchingFamily(CellUtil.java:816)
2414         at org.apache.hadoop.hbase.wal.WALEdit.isMetaEditFamily(WALEdit.java:143)
2415         at org.apache.hadoop.hbase.wal.WALEdit.isMetaEdit(WALEdit.java:148)
2416         at org.apache.hadoop.hbase.wal.WALSplitter.splitLogFile(WALSplitter.java:297)
2417         at org.apache.hadoop.hbase.wal.WALSplitter.splitLogFile(WALSplitter.java:195)
2418         at org.apache.hadoop.hbase.regionserver.SplitLogWorker$1.exec(SplitLogWorker.java:100)
2419
It may even cause a segmentation fault and crash the JVM directly. You will see a hs\_err\_pidXXX.log file, and usually the problem is SIGSEGV. This is usually because the ByteBuffer has already been returned to the OS and used for another purpose.

The problem has been reported several times in the past, and this time Wellington Ramos Chevreuil provided the full logs and analyzed them deeply so we could find the root cause. Lijin Bin figured out that the problem may only happen when Durability.ASYNC\_WAL is used. Thanks to them.

The problem only affects the 2.x releases. All users are highly recommended to upgrade to a release which includes this fix, especially if you use Durability.ASYNC\_WAL.
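
The class of bug described in this note, releasing a buffer before a deferred write has consumed it, can be reproduced in a few lines of plain Java. The sketch below is a simplified illustration with hypothetical names; it does not use the HBase WAL code path.

```java
import java.nio.ByteBuffer;
import java.nio.charset.StandardCharsets;

// Simplified illustration of early buffer reuse; not the HBase WAL implementation.
final class EarlyBufferReuseSketch {
  // Returns what a deferred writer reads when the pooled buffer is recycled
  // before the pending write has been performed.
  static String readAfterReuse(String original, String overwrite) {
    ByteBuffer pooled = ByteBuffer.allocate(16);
    pooled.put(original.getBytes(StandardCharsets.US_ASCII));
    pooled.flip();

    // The pending (asynchronous) write only holds a view that shares the same memory.
    ByteBuffer pendingWrite = pooled.duplicate();

    // The buffer is released too early and reused for another request.
    pooled.clear();
    pooled.put(overwrite.getBytes(StandardCharsets.US_ASCII));

    // The deferred write now sees the other request's bytes.
    byte[] out = new byte[pendingWrite.remaining()];
    pendingWrite.get(out);
    return new String(out, StandardCharsets.US_ASCII);
  }
}
```

Calling `readAfterReuse("edit-A", "edit-B")` returns the overwritten content rather than the original edit, which is the kind of silent corruption that later shows up as decoding errors during replay.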
2425
2426
2427 ---
2428
2429 * [HBASE-22737](https://issues.apache.org/jira/browse/HBASE-22737) | *Major* | **Add a new admin method and shell cmd to trigger the hbck chore to run**
2430
2431 Add a new method runHbckChore in Hbck interface and a new shell cmd hbck\_chore\_run to request HBCK chore to run at master side.
2432
2433
2434 ---
2435
2436 * [HBASE-22741](https://issues.apache.org/jira/browse/HBASE-22741) | *Major* | **Show catalogjanitor consistency complaints in new 'HBCK Report' page**
2437
2438 Adds a "CatalogJanitor hbase:meta Consistency Issues" section to the new 'HBCK Report' page added by HBASE-22709. This section is empty unless the most recent CatalogJanitor scan turned up problems. If so, will show table of issues found.
2439
2440
2441 ---
2442
2443 * [HBASE-22723](https://issues.apache.org/jira/browse/HBASE-22723) | *Major* | **Have CatalogJanitor report holes and overlaps; i.e. problems it sees when doing its regular scan of hbase:meta**
2444
2445 When CatalogJanitor runs, it now checks for holes, overlaps, empty info:regioninfo columns and bad servers. Dumps findings into log. Follow-up adds report to new 'HBCK Report' linked off the Master UI.
2446
2447 NOTE: All features but the badserver check made it into branch-2.1 and branch-2.0 backports.
2448
2449
2450 ---
2451
2452 * [HBASE-22714](https://issues.apache.org/jira/browse/HBASE-22714) | *Trivial* | **BuffferedMutatorParams opertationTimeOut() is misspelt**
2453
2454 The misspelled BufferedMutatorParams.opertationTimeout method has been marked as deprecated, and will be removed in 4.0.0. Please use the BufferedMutatorParams.operationTimeout method instead.
2455
2456
2457 ---
2458
2459 * [HBASE-22580](https://issues.apache.org/jira/browse/HBASE-22580) | *Major* | **Add a table attribute to make user scan snapshot feature configurable for table**
2460
If users scan snapshots of a table, please configure the following table schema attribute so that granted users' ACLs are added to hfiles:
2462 alter 't1', CONFIGURATION =\> {'hbase.user.scan.snapshot.enable' =\> 'true'}
2463
2464
2465 ---
2466
2467 * [HBASE-22709](https://issues.apache.org/jira/browse/HBASE-22709) | *Major* | **Add a chore thread in master to do hbck checking and display results in 'HBCK Report' page**
2468
2469 1. Add a new chore thread in master to do hbck checking
2470 2. Add a new web ui "HBCK Report" page to display checking results.
2471
This feature is enabled by default, and the hbck chore runs every 60 minutes by default. Set "hbase.master.hbck.checker.interval" to a value less than or equal to 0 to disable the chore.
2473
2474 Notice: the config "hbase.master.hbck.checker.interval" was renamed to "hbase.master.hbck.chore.interval" in HBASE-22737.
2475
2476
2477 ---
2478
2479 * [HBASE-22578](https://issues.apache.org/jira/browse/HBASE-22578) | *Major* | **HFileCleaner should not delete empty ns/table directories used for user san snapshot feature**
2480
The HFileCleaner cleans the empty directories under archive, but when the user scan snapshot feature is enabled, the user ACLs are set on these directories. Please configure the following cleaner so that directories with user ACLs are not cleaned:
2482 hbase.master.hfilecleaner.plugins=org.apache.hadoop.hbase.security.access.SnapshotScannerHDFSAclCleaner
2483
2484
2485 ---
2486
2487 * [HBASE-22722](https://issues.apache.org/jira/browse/HBASE-22722) | *Blocker* | **Upgrade jackson databind dependencies to 2.9.9.1**
2488
2489 Upgrade jackson databind dependency to 2.9.9.1 due to CVEs
2490
2491 https://nvd.nist.gov/vuln/detail/CVE-2019-12814
2492
2493 https://nvd.nist.gov/vuln/detail/CVE-2019-12384
2494
2495
2496 ---
2497
2498 * [HBASE-22527](https://issues.apache.org/jira/browse/HBASE-22527) | *Major* | **[hbck2] Add a master web ui to show the problematic regions**
2499
Add a new master web UI to show the potentially problematic opened regions. There are three cases:
1. Master thought this region opened, but no regionserver reported it.
2. Master thought this region opened on Server1, but regionserver reported Server2
3. More than one regionserver reported this region as opened
2504
2505
2506 ---
2507
2508 * [HBASE-22648](https://issues.apache.org/jira/browse/HBASE-22648) | *Minor* | **Snapshot TTL**
2509
2510 Feature: Take a Snapshot With TTL for auto-cleanup
2511
2512 Attribute:
2513 1. TTL
2514      - Specify TTL in sec while creating snapshot. e.g. snapshot 'mytable', 'snapshot1234', {TTL =\> 86400}  (snapshot to be auto-cleaned after 24 hr)
2515
2516 Configs:
2517 1. Default Snapshot TTL:
2518      - FOREVER by default
2519      - User specified Default TTL(sec) with config: hbase.master.snapshot.ttl
2520
2521 2. If Snapshot cleanup is supposed to be stopped due to some snapshot restore activity, disable it with config:
2522      - hbase.master.cleaner.snapshot.disable: "true"
2523     With this config, HMaster needs restart just like any other hbase-site config.
2524
2525
2526 For more details, see the section "Take a Snapshot With TTL" in the HBase Reference Guide.
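
The TTL check itself can be pictured as a simple time comparison. The sketch below is hypothetical: the class name, the treatment of a non-positive TTL as FOREVER, and the exact boundary condition are assumptions made for illustration, not the master's cleanup code.

```java
import java.util.concurrent.TimeUnit;

// Hypothetical sketch of a TTL expiry check; not the HBase snapshot cleaner.
final class SnapshotTtlSketch {
  // Assumption for this sketch: a TTL of 0 (or less) means the snapshot never expires.
  static final long FOREVER = 0L;

  static boolean isExpired(long creationTimeMs, long ttlSeconds, long nowMs) {
    if (ttlSeconds <= FOREVER) {
      return false;
    }
    // Expired once the snapshot's age exceeds its TTL.
    return nowMs - creationTimeMs > TimeUnit.SECONDS.toMillis(ttlSeconds);
  }
}
```

With `{TTL => 86400}` as in the example above, a snapshot created at time 0 is reported as expired two days later, while a snapshot with the FOREVER TTL is never reported as expired.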
2527
2528
2529 ---
2530
2531 * [HBASE-22610](https://issues.apache.org/jira/browse/HBASE-22610) | *Trivial* | **[BucketCache] Rename "hbase.offheapcache.minblocksize"**
2532
2533 The config point "hbase.offheapcache.minblocksize" was wrong and is now deprecated. The new config point is "hbase.blockcache.minblocksize".
2534
2535
2536 ---
2537
2538 * [HBASE-22690](https://issues.apache.org/jira/browse/HBASE-22690) | *Major* | **Deprecate / Remove OfflineMetaRepair in hbase-2+**
2539
2540 OfflineMetaRepair is no longer supported in HBase-2+. Please refer to https://hbase.apache.org/book.html#HBCK2
2541
2542 This tool is deprecated in 2.x and will be removed in 3.0.
2543
2544
2545 ---
2546
2547 * [HBASE-22673](https://issues.apache.org/jira/browse/HBASE-22673) | *Major* | **Avoid to expose protobuf stuff in Hbck interface**
2548
2549 Mark the Hbck#scheduleServerCrashProcedure(List\<HBaseProtos.ServerName\> serverNames) as deprecated. Use Hbck#scheduleServerCrashProcedures(List\<ServerName\> serverNames) instead.
2550
2551
2552 ---
2553
2554 * [HBASE-22617](https://issues.apache.org/jira/browse/HBASE-22617) | *Blocker* | **Recovered WAL directories not getting cleaned up**
2555
2556 In HBASE-20734 we moved the recovered.edits onto the wal file system but when constructing the directory we missed the BASE\_NAMESPACE\_DIR('data'). So when using the default config, you will find that there are lots of new directories at the same level with the 'data' directory.
2557
2558 In this issue, we add the BASE\_NAMESPACE\_DIR back, and also try our best to clean up the wrong directories. But we can only clean up the region level directories, so if you want a clean fs layout on HDFS you still need to manually delete the empty directories at the same level with 'data'.
2559
The affected versions are 2.2.0, 2.1.[1-5], 1.4.[8-10], 1.3.[3-5].
2561
2562
2563 ---
2564
2565 * [HBASE-21995](https://issues.apache.org/jira/browse/HBASE-21995) | *Major* | **Add a coprocessor to set HDFS ACL for hbase granted user**
2566
Add a coprocessor to set HDFS ACLs so that HBase users granted READ permission have access to scan snapshots.
2568 To use this feature, please make sure the HDFS config is set:
2569 dfs.namenode.acls.enabled=true
2570 fs.permissions.umask-mode=027
2571
2572 and set the HBase config:
2573 hbase.coprocessor.master.classes="org.apache.hadoop.hbase.security.access.AccessController,org.apache.hadoop.hbase.security.access.SnapshotScannerHDFSAclController"
2574 hbase.user.scan.snapshot.enable=true
2575
2576
2577 ---
2578
2579 * [HBASE-22596](https://issues.apache.org/jira/browse/HBASE-22596) | *Minor* | **[Chore] Separate the execution period between CompactionChecker and PeriodicMemStoreFlusher**
2580
2581 hbase.regionserver.compaction.check.period is used for controlling how often the compaction checker runs. If unset, will use hbase.server.thread.wakefrequency as default value.
2582
hbase.regionserver.flush.check.period is used for controlling how often the flush checker runs. If unset, will use hbase.server.thread.wakefrequency as default value.
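
The fallback described above can be sketched with plain `java.util.Properties`. The class name, the `resolvePeriod` helper, and the 10000 ms value used when hbase.server.thread.wakefrequency is itself unset are assumptions for illustration; HBase reads these keys through its own Configuration class.

```java
import java.util.Properties;

// Hypothetical sketch of the "specific key, else wake frequency" lookup.
final class CheckPeriodSketch {
  static final String WAKE_FREQUENCY = "hbase.server.thread.wakefrequency";
  static final String COMPACTION_CHECK = "hbase.regionserver.compaction.check.period";
  static final String FLUSH_CHECK = "hbase.regionserver.flush.check.period";
  // Assumed value for this sketch when the wake frequency is not configured.
  static final int ASSUMED_WAKE_FREQUENCY_MS = 10_000;

  // Returns the period for the specific key, falling back to the wake frequency.
  static int resolvePeriod(Properties conf, String specificKey) {
    int wakeFrequency = Integer.parseInt(
        conf.getProperty(WAKE_FREQUENCY, String.valueOf(ASSUMED_WAKE_FREQUENCY_MS)));
    String specific = conf.getProperty(specificKey);
    return specific == null ? wakeFrequency : Integer.parseInt(specific);
  }
}
```

Setting only the compaction check period therefore leaves the flush checker on the wake frequency, so the two checkers can run at different rates.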
2584
2585
2586 ---
2587
2588 * [HBASE-22588](https://issues.apache.org/jira/browse/HBASE-22588) | *Major* | **Upgrade jaxws-ri dependency to 2.3.2**
2589
2590 <!-- markdown -->
2591
When run with JDK11, HBase now uses a more recent version of the jaxws reference implementation (v2.3.2).
2593
2594
2595 ---
2596
2597 * [HBASE-21536](https://issues.apache.org/jira/browse/HBASE-21536) | *Trivial* | **Fix completebulkload usage instructions**
2598
2599 Added completebulkload short name for BulkLoadHFilesTool to bin/hbase.
2600
2601
2602 ---
2603
2604 * [HBASE-22500](https://issues.apache.org/jira/browse/HBASE-22500) | *Blocker* | **Modify pom and jenkins jobs for hadoop versions**
2605
Change the default hadoop-3 version to 3.1.2. Drop the support for the releases which are affected by CVE-2018-8029, see this email https://lists.apache.org/thread.html/3d6831c3893cd27b6850aea2feff7d536888286d588e703c6ffd2e82@%3Cuser.hadoop.apache.org%3E
2607
2608
2609 ---
2610
2611 * [HBASE-22459](https://issues.apache.org/jira/browse/HBASE-22459) | *Minor* | **Expose store reader reference count**
2612
2613 This change exposes the aggregate count of store reader references for a given store as 'storeRefCount' in region metrics and ClusterStatus.
2614
2615
2616 ---
2617
2618 * [HBASE-22469](https://issues.apache.org/jira/browse/HBASE-22469) | *Minor* | **replace md5 checksum in saveVersion script with sha512 for hbase version information**
2619
2620 The HBase "source checksum" now uses SHA512 instead of MD5.
2621
2622
2623 ---
2624
2625 * [HBASE-22148](https://issues.apache.org/jira/browse/HBASE-22148) | *Blocker* | **Provide an alternative to CellUtil.setTimestamp**
2626
2627 <!-- markdown -->
2628
2629 The `CellUtil.setTimestamp` method changes to be an API with audience `LimitedPrivate(COPROC)` in HBase 3.0. With that designation the API should remain stable within a given minor release line, but may change between minor releases.
2630
2631 Previously, this method was deprecated in HBase 2.0 for removal in HBase 3.0. Deprecation messages in HBase 2.y releases have been updated to indicate the expected API audience change.
2632
2633
2634 ---
2635
2636 * [HBASE-20782](https://issues.apache.org/jira/browse/HBASE-20782) | *Minor* | **Fix duplication of TestServletFilter.access**
2637
The access method was moved to the HttpServerFunctionalTest class as a common place.
2639
2640
2641 ---
2642
2643 * [HBASE-21991](https://issues.apache.org/jira/browse/HBASE-21991) | *Major* | **Fix MetaMetrics issues - [Race condition, Faulty remove logic], few improvements**
2644
2645 The class LossyCounting was unintentionally marked Public but was never intended to be part of our public API. This oversight has been corrected and LossyCounting is now marked as Private and going forward may be subject to additional breaking changes or removal without notice. If you have taken a dependency on this class we recommend cloning it locally into your project before upgrading to this release.
2646
2647
2648 ---
2649
2650 * [HBASE-22226](https://issues.apache.org/jira/browse/HBASE-22226) | *Trivial* | **Incorrect level for headings in asciidoc**
2651
2652 Warnings for level headings are corrected in the book for the HBase Incompatibilities section.
2653
2654
2655 ---
2656
2657 * [HBASE-20970](https://issues.apache.org/jira/browse/HBASE-20970) | *Major* | **Update hadoop check versions for hadoop3 in hbase-personality**
2658
Add hadoop 3.0.3, 3.1.1, and 3.1.2 to our hadoop check jobs.
2660
2661
2662 ---
2663
2664 * [HBASE-21784](https://issues.apache.org/jira/browse/HBASE-21784) | *Major* | **Dump replication queue should show list of wal files ordered chronologically**
2665
2666 The DumpReplicationQueues tool will now list replication queues sorted in chronological order.
2667
2668
2669 ---
2670
2671 * [HBASE-21048](https://issues.apache.org/jira/browse/HBASE-21048) | *Major* | **Get LogLevel is not working from console in secure environment**
2672
2673 Support get\|set LogLevel in secure(kerberized) environment.
2674
2675
2676 ---
2677
2678 * [HBASE-22384](https://issues.apache.org/jira/browse/HBASE-22384) | *Minor* | **Formatting issues in administration section of book**
2679
Fixes a formatting issue in the administration section of the book, where listing indentation was a little bit off.
2681
2682
2683 ---
2684
2685 * [HBASE-22377](https://issues.apache.org/jira/browse/HBASE-22377) | *Major* | **Provide API to check the existence of a namespace which does not require ADMIN permissions**
2686
2687 This change adds the new method listNamespaces to the Admin interface, which can be used to retrieve a list of the namespaces present in the schema as an unprivileged operation. Formerly the only available method for accomplishing this was listNamespaceDescriptors, which requires GLOBAL CREATE or ADMIN permissions.
2688
2689
2690 ---
2691
2692 * [HBASE-22399](https://issues.apache.org/jira/browse/HBASE-22399) | *Major* | **Change default hadoop-two.version to 2.8.x and remove the 2.7.x hadoop checks**
2693
Now the default hadoop-two.version has been changed to 2.8.5, and all hadoop versions before 2.8.2 (exclusive) are no longer supported.
2695
2696
2697 ---
2698
2699 * [HBASE-22392](https://issues.apache.org/jira/browse/HBASE-22392) | *Trivial* | **Remove extra/useless +**
2700
2701 Removed extra + in HRegion, HStore and LoadIncrementalHFiles for branch-2 and HRegion and HStore for branch-1.
2702
2703
2704 ---
2705
2706 * [HBASE-20494](https://issues.apache.org/jira/browse/HBASE-20494) | *Major* | **Upgrade com.yammer.metrics dependency**
2707
2708 Updated metrics core from 3.2.1 to 3.2.6.
2709
2710
2711 ---
2712
2713 * [HBASE-22358](https://issues.apache.org/jira/browse/HBASE-22358) | *Minor* | **Change rubocop configuration for method length**
2714
2715 The rubocop definition for the maximum method length was set to 75.
2716
2717
2718 ---
2719
2720 * [HBASE-22379](https://issues.apache.org/jira/browse/HBASE-22379) | *Minor* | **Fix Markdown for "Voting on Release Candidates" in book**
2721
2722 Fixes the formatting of the "Voting on Release Candidates" to actually show the quote and code formatting of the RAT check.
2723
2724
2725 ---
2726
2727 * [HBASE-20851](https://issues.apache.org/jira/browse/HBASE-20851) | *Minor* | **Change rubocop config for max line length of 100**
2728
2729 The rubocop configuration in the hbase-shell module now allows a line length with 100 characters, instead of 80 as before. For everything before 2.1.5 this change introduces rubocop itself.
2730
2731
2732 ---
2733
2734 * [HBASE-22301](https://issues.apache.org/jira/browse/HBASE-22301) | *Minor* | **Consider rolling the WAL if the HDFS write pipeline is slow**
2735
2736 This change adds new conditions for rolling the WAL for when syncs on the HDFS writer pipeline are perceived to be slow.
2737
2738 As before the configuration parameter hbase.regionserver.wal.slowsync.ms sets the slow sync warning threshold.
2739
2740 If we encounter hbase.regionserver.wal.slowsync.roll.threshold number of slow syncs (default 100) within the interval defined by hbase.regionserver.wal.slowsync.roll.interval.ms (default 1 minute), we will request a WAL roll.
2741
2742 Or, if the time for any sync exceeds the threshold set by hbase.regionserver.wal.roll.on.sync.ms (default 10 seconds) we will request a WAL roll immediately.
2743
2744 Operators can monitor how often these new thresholds result in a WAL roll by looking at newly added metrics to the WAL related metric group:
2745 \* slowSyncRollRequest - How many times a roll was requested due to sync too slow on the write pipeline.
2746
2747 Additionally, as a part of this change there are also additional metrics for existing reasons for a WAL roll:
2748 \* errorRollRequest - How many times a roll was requested due to I/O or other errors.
2749 \* sizeRollRequest - How many times a roll was requested due to file size roll threshold.
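
The two roll conditions described above can be sketched as a small decision class. All names here are hypothetical, and the sketch uses a simple tumbling window to count slow syncs, which is an assumption; the actual WAL implementation may track the interval differently.

```java
// Hypothetical sketch of the slow-sync roll decision; not the HBase WAL code.
final class SlowSyncRollSketch {
  private final long slowSyncMs;     // cf. hbase.regionserver.wal.slowsync.ms
  private final int rollThreshold;   // cf. hbase.regionserver.wal.slowsync.roll.threshold
  private final long rollIntervalMs; // cf. hbase.regionserver.wal.slowsync.roll.interval.ms
  private final long rollOnSyncMs;   // cf. hbase.regionserver.wal.roll.on.sync.ms

  private long windowStartMs = 0L;
  private int slowSyncCount = 0;

  SlowSyncRollSketch(long slowSyncMs, int rollThreshold, long rollIntervalMs, long rollOnSyncMs) {
    this.slowSyncMs = slowSyncMs;
    this.rollThreshold = rollThreshold;
    this.rollIntervalMs = rollIntervalMs;
    this.rollOnSyncMs = rollOnSyncMs;
  }

  // Records one sync and returns true when a WAL roll should be requested.
  boolean onSync(long nowMs, long syncDurationMs) {
    // A single very slow sync requests a roll immediately.
    if (syncDurationMs > rollOnSyncMs) {
      resetWindow(nowMs);
      return true;
    }
    // Start a fresh counting window once the interval has elapsed.
    if (nowMs - windowStartMs > rollIntervalMs) {
      resetWindow(nowMs);
    }
    // Enough slow syncs within the current window also request a roll.
    if (syncDurationMs > slowSyncMs && ++slowSyncCount >= rollThreshold) {
      resetWindow(nowMs);
      return true;
    }
    return false;
  }

  private void resetWindow(long nowMs) {
    windowStartMs = nowMs;
    slowSyncCount = 0;
  }
}
```

With the defaults mentioned in this note (threshold 100, interval 1 minute, immediate roll above 10 seconds), the hundredth slow sync inside one minute or any single sync longer than 10 seconds would return true.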
2750
2751
2752 ---
2753
2754 * [HBASE-21883](https://issues.apache.org/jira/browse/HBASE-21883) | *Minor* | **Enhancements to Major Compaction tool**
2755
The MajorCompactorTTL tool allows compacting all regions in a table that have been TTLed out. This saves space on DFS and is useful for tables which are similar to time series data. It is typically scheduled to run frequently (say via cron) to clean up old data on an ongoing basis.
2757
2758 RSGroupMajorCompactionTTL tool is similar to MajorCompactorTTL but runs at a region server group level. If multiple tables in an rsgroup are similar to time-series data, then it runs a single command to clean them up. As more tables are added/removed from rsgroup, it's easy to have a single command to take care of all of them.
2759
2760
2761 ---
2762
2763 * [HBASE-22054](https://issues.apache.org/jira/browse/HBASE-22054) | *Minor* | **Space Quota: Compaction is not working for super user in case of NO\_WRITES\_COMPACTIONS**
2764
2765 This change allows the system and superusers to initiate compactions, even when a space quota violation policy disallows compactions from happening. The original intent behind disallowing of compactions was to prevent end-user compactions from creating undue I/O load, not disallowing \*any\* compaction in the system.
2766
2767
2768 ---
2769
2770 * [HBASE-22083](https://issues.apache.org/jira/browse/HBASE-22083) | *Minor* | **move eclipse specific configs into a profile**
2771
2772 <!-- markdown -->
2773 Maven project integration for Eclipse has been isolated into a maven profile to ensure it only is active when in an Eclipse project.
2774
2775 Things should continue to behave the same for Eclipse users. If something should go wrong folks should manually activate the `eclipse-specific` profile.
2776
2777
2778 ---
2779
2780 * [HBASE-22307](https://issues.apache.org/jira/browse/HBASE-22307) | *Major* | **Deprecated Preemptive Fail Fast**
2781
Deprecated the Preemptive Fail Fast related constants in HConstants. Support for this feature will be removed in 3.0.0, so using these constants will have no effect in 3.0.0+ releases. The constants will be kept until 4.0.0.
2783
2784 Users can use 'hbase.client.perserver.requests.threshold' to control the number of concurrent requests to the same region server. Please see the release note of HBASE-16388 for more details.
2785
2786
2787 ---
2788
2789 * [HBASE-22292](https://issues.apache.org/jira/browse/HBASE-22292) | *Blocker* | **PreemptiveFastFailInterceptor clean repeatedFailuresMap issue**
2790
2791 Adds new configuration hbase.client.failure.map.cleanup.interval which defaults to ten minutes.
2792
2793
2794 ---
2795
2796 * [HBASE-19222](https://issues.apache.org/jira/browse/HBASE-19222) | *Major* | **update jruby to 9.1.17.0**
2797
2798 <!-- markdown -->
2799
2800 The default version of JRuby shipped with HBase has been updated to the JRuby 9.1.17.0 release.
2801
2802 For details on changes see [the release notes for JRuby 9.1.17.0](https://www.jruby.org/2018/04/23/jruby-9-1-17-0)
2803
2804
2805 ---
2806
2807 * [HBASE-22279](https://issues.apache.org/jira/browse/HBASE-22279) | *Major* | **Add a getRegionLocator method in Table/AsyncTable interface**
2808
2809 Add below method in Table interface:
2810
2811 RegionLocator getRegionLocator() throws IOException;
2812
2813 Add below methods in AsyncTable interface:
2814
2815 AsyncTableRegionLocator getRegionLocator();
2816 CompletableFuture\<TableDescriptor\> getDescriptor();
2817
2818
2819 ---
2820
2821 * [HBASE-15560](https://issues.apache.org/jira/browse/HBASE-15560) | *Major* | **TinyLFU-based BlockCache**
2822
LruBlockCache uses the Segmented LRU (SLRU) policy to capture frequency and recency of the working set. It achieves concurrency by using an O(n) background thread to prioritize the entries and evict. Accessing an entry is O(1) by a hash table lookup, recording its logical access time, and setting a frequency flag. A write is performed in O(1) time by updating the hash table and triggering an async eviction thread. This provides ideal concurrency and minimizes the latencies by penalizing the thread instead of the caller. However, the policy does not age the frequencies and may not be resilient to various workload patterns.

This change introduces a new L1 policy, TinyLfuBlockCache, which records the frequency in a counting sketch, ages periodically by halving the counters, and orders entries by SLRU. An entry is discarded by comparing the frequency of the new arrival to the SLRU's victim, and keeping the one with the higher frequency. This allows the operations to be performed in O(1) time and, through the use of a compact sketch, a much larger history is retained beyond the current working set. In a variety of real world traces the policy had near optimal hit rates.
2826
2827 New configuration variable hfile.block.cache.policy sets the eviction policy for the L1 block cache. The default is "LRU" (LruBlockCache). Set to "TinyLFU" to use TinyLfuBlockCache instead.
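
The admission step described above can be sketched in a few lines of Java. This is only an illustration of the idea (a counting sketch, periodic halving, and a frequency comparison between the new arrival and the victim), not the TinyLfuBlockCache implementation. The class name, the number of rows, the hash mixing, and the sample period are all assumptions made for the example.

```java
// Illustrative sketch only; this is not HBase's TinyLfuBlockCache.
final class FrequencySketchDemo {
  private static final int ROWS = 4;   // assumed number of hash rows
  private final int[][] counters;
  private final int width;             // assumed number of counters per row
  private final int samplePeriod;      // assumed: halve counters after this many increments
  private int increments;

  FrequencySketchDemo(int width, int samplePeriod) {
    this.width = width;
    this.samplePeriod = samplePeriod;
    this.counters = new int[ROWS][width];
  }

  // Derives a per-row slot from the key's hash code (ad hoc mixing, for illustration).
  private int index(Object key, int row) {
    int h = key.hashCode() * (31 + 2 * row) + row * 0x9E3779B9;
    h ^= (h >>> 16);
    return Math.floorMod(h, width);
  }

  /** Records one access and ages the whole sketch once the sample period is reached. */
  void increment(Object key) {
    for (int r = 0; r < ROWS; r++) {
      counters[r][index(key, r)]++;
    }
    if (++increments >= samplePeriod) {
      for (int[] row : counters) {
        for (int i = 0; i < row.length; i++) {
          row[i] >>>= 1;               // halving ages out old history
        }
      }
      increments = 0;
    }
  }

  /** Estimated frequency: the minimum counter across all rows. */
  int frequency(Object key) {
    int min = Integer.MAX_VALUE;
    for (int r = 0; r < ROWS; r++) {
      min = Math.min(min, counters[r][index(key, r)]);
    }
    return min;
  }

  /** Keeps the candidate only if it is estimated to be more frequent than the victim. */
  boolean admit(Object candidate, Object victim) {
    return frequency(candidate) > frequency(victim);
  }

  public static void main(String[] args) {
    FrequencySketchDemo sketch = new FrequencySketchDemo(64, 1000);
    for (int i = 0; i < 5; i++) {
      sketch.increment("hot-block");
    }
    sketch.increment("scan-block");
    System.out.println("admit hot over scan victim: " + sketch.admit("hot-block", "scan-block"));
  }
}
```

Because the estimate is a minimum over several rows, hash collisions can only inflate a count, never lower it, which is why a compact table can stand in for a much longer access history.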
2828
2829
2830 ---
2831
2832 * [HBASE-22178](https://issues.apache.org/jira/browse/HBASE-22178) | *Major* | **Introduce a createTableAsync with TableDescriptor method in Admin**
2833
Introduces the following method in the Admin interface:
2835
2836 Future\<Void\> createTableAsync(TableDescriptor);
2837
2838
2839 ---
2840
2841 * [HBASE-22108](https://issues.apache.org/jira/browse/HBASE-22108) | *Major* | **Avoid passing null in Admin methods**
2842
Introduced these methods:

* void move(byte[]);
* void move(byte[], ServerName);
* Future\<Void\> splitRegionAsync(byte[]);

These methods are deprecated:

* void move(byte[], byte[])
2850
2851
2852 ---
2853
2854 * [HBASE-22152](https://issues.apache.org/jira/browse/HBASE-22152) | *Major* | **Create a jenkins file for yetus to processing GitHub PR**
2855
Adds a new jenkins file for running the pre-commit check on GitHub PRs.
2857
2858
2859 ---
2860
2861 * [HBASE-22007](https://issues.apache.org/jira/browse/HBASE-22007) | *Major* | **Add restoreSnapshot and cloneSnapshot with acl methods in AsyncAdmin**
2862
2863 Add cloneSnapshot/restoreSnapshot with acl methods in AsyncAdmin.
2864
2865
2866 ---
2867
2868 * [HBASE-22123](https://issues.apache.org/jira/browse/HBASE-22123) | *Minor* | **REST gateway reports Insufficient permissions exceptions as 404 Not Found**
2869
When a request is rejected because of insufficient permissions, you now get:
2871
2872 HTTP/1.1 403 Forbidden
2873
2874 on the HTTP side, and in the message
2875
2876 Forbidden
org.apache.hadoop.hbase.security.AccessDeniedException: org.apache.hadoop.hbase.security.AccessDeniedException: Insufficient permissions for user 'myuser',action: get, tableName:mytable, family:cf.
2878 at org.apache.ranger.authorization.hbase.RangerAuthorizationCoprocessor.authorizeAccess(RangerAuthorizationCoprocessor.java:547)
2879 and the rest of the ADE stack
2880
2881
2882 ---
2883
2884 * [HBASE-22100](https://issues.apache.org/jira/browse/HBASE-22100) | *Minor* | **False positive for error prone warnings in pre commit job**
2885
The pre-commit job now sorts the javac WARNING/ERROR lines before generating the diff, so that the error-prone output is stable. The downside is that the output is sorted lexicographically, so line numbers are also ordered lexicographically, which can look a bit strange to a human reader.
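
The ordering caveat is easy to reproduce with plain string sorting. The sketch below uses made-up warning lines in a hypothetical `file:line: message` shape; the real javac output format in the pre-commit job may differ.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Comparator;
import java.util.List;

final class LexicographicSortDemo {
  /** Sorts lines as plain strings, the way a lexicographic sort would. */
  static List<String> sortLexicographically(List<String> lines) {
    List<String> sorted = new ArrayList<>(lines);
    sorted.sort(Comparator.naturalOrder());
    return sorted;
  }

  public static void main(String[] args) {
    // Hypothetical warning lines; only the embedded line numbers matter here.
    List<String> warnings = Arrays.asList(
        "Foo.java:9: warning",
        "Foo.java:10: warning",
        "Foo.java:100: warning");
    sortLexicographically(warnings).forEach(System.out::println);
  }
}
```

With these inputs, line 100 sorts before line 10, and both sort before line 9, because the comparison is character by character rather than numeric.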
2887
2888
2889 ---
2890
2891 * [HBASE-22057](https://issues.apache.org/jira/browse/HBASE-22057) | *Major* | **Impose upper-bound on size of ZK ops sent in a single multi()**
2892
2893 Exposes a new configuration property "zookeeper.multi.max.size" which dictates the maximum size of deletes that HBase will make to ZooKeeper in a single RPC. This property defaults to 1MB, which should fall beneath the default ZooKeeper limit of 2MB, controlled by "jute.maxbuffer".
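
As a rough illustration of what a size bound on a single multi() implies, the sketch below groups operations into batches whose total size stays within a limit. It is not the HBase code: the class and method names are hypothetical, operation sizes are passed in as plain byte counts, and the choice to send an oversized operation alone in its own batch is an assumption.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

final class MultiBatchSketch {
  /** Groups op sizes (in bytes) into batches whose totals stay within maxBytes. */
  static List<List<Integer>> partition(List<Integer> opSizes, int maxBytes) {
    List<List<Integer>> batches = new ArrayList<>();
    List<Integer> current = new ArrayList<>();
    int currentBytes = 0;
    for (int size : opSizes) {
      // Start a new batch when adding this op would cross the limit.
      if (!current.isEmpty() && currentBytes + size > maxBytes) {
        batches.add(current);
        current = new ArrayList<>();
        currentBytes = 0;
      }
      // Assumption: an op larger than maxBytes still goes out, alone in its batch.
      current.add(size);
      currentBytes += size;
    }
    if (!current.isEmpty()) {
      batches.add(current);
    }
    return batches;
  }

  public static void main(String[] args) {
    int oneMegabyte = 1024 * 1024;  // the documented default of zookeeper.multi.max.size
    List<Integer> sizes = Arrays.asList(600_000, 600_000, 200_000);
    System.out.println(partition(sizes, oneMegabyte));
  }
}
```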
2894
2895
2896 ---
2897
2898 * [HBASE-22052](https://issues.apache.org/jira/browse/HBASE-22052) | *Major* | **pom cleaning; filter out jersey-core in hadoop2 to match hadoop3 and remove redunant version specifications**
2899
2900 <!-- markdown -->
2901 Fixed awkward dependency issue that prevented site building.
2902
2903 #### note specific to HBase 2.1.4
2904 HBase 2.1.4 shipped with an early version of this fix that incorrectly altered the libraries included in our binary assembly for using Apache Hadoop 2.7 (the current build default Hadoop version for 2.1.z). For folks running out of the box against a Hadoop 2.7 cluster (or folks who skip the installation step of [replacing the bundled Hadoop libraries](http://hbase.apache.org/book.html#hadoop)) this will result in a failure at Region Server startup due to a missing class definition. e.g.:
2905 ```
2906 2019-03-27 09:02:05,779 ERROR [main] regionserver.HRegionServer: Failed construction RegionServer
2907 java.lang.NoClassDefFoundError: org/apache/htrace/SamplerBuilder
2908         at org.apache.hadoop.hdfs.DFSClient.<init>(DFSClient.java:644)
2909         at org.apache.hadoop.hdfs.DFSClient.<init>(DFSClient.java:628)
2910         at org.apache.hadoop.hdfs.DistributedFileSystem.initialize(DistributedFileSystem.java:149)
2911         at org.apache.hadoop.fs.FileSystem.createFileSystem(FileSystem.java:2667)
2912         at org.apache.hadoop.fs.FileSystem.access$200(FileSystem.java:93)
2913         at org.apache.hadoop.fs.FileSystem$Cache.getInternal(FileSystem.java:2701)
2914         at org.apache.hadoop.fs.FileSystem$Cache.get(FileSystem.java:2683)
2915         at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:372)
2916         at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:171)
2917         at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:356)
2918         at org.apache.hadoop.fs.Path.getFileSystem(Path.java:295)
2919         at org.apache.hadoop.hbase.util.CommonFSUtils.getRootDir(CommonFSUtils.java:362)
2920         at org.apache.hadoop.hbase.util.CommonFSUtils.isValidWALRootDir(CommonFSUtils.java:411)
2921         at org.apache.hadoop.hbase.util.CommonFSUtils.getWALRootDir(CommonFSUtils.java:387)
2922         at org.apache.hadoop.hbase.regionserver.HRegionServer.initializeFileSystem(HRegionServer.java:704)
2923         at org.apache.hadoop.hbase.regionserver.HRegionServer.<init>(HRegionServer.java:613)
2924         at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
2925         at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:62)
2926         at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)
2927         at java.lang.reflect.Constructor.newInstance(Constructor.java:423)
2928         at org.apache.hadoop.hbase.regionserver.HRegionServer.constructRegionServer(HRegionServer.java:3029)
2929         at org.apache.hadoop.hbase.regionserver.HRegionServerCommandLine.start(HRegionServerCommandLine.java:63)
2930         at org.apache.hadoop.hbase.regionserver.HRegionServerCommandLine.run(HRegionServerCommandLine.java:87)
2931         at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
2932         at org.apache.hadoop.hbase.util.ServerCommandLine.doMain(ServerCommandLine.java:149)
2933         at org.apache.hadoop.hbase.regionserver.HRegionServer.main(HRegionServer.java:3047)
2934 Caused by: java.lang.ClassNotFoundException: org.apache.htrace.SamplerBuilder
2935         at java.net.URLClassLoader.findClass(URLClassLoader.java:381)
2936         at java.lang.ClassLoader.loadClass(ClassLoader.java:424)
2937         at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:349)
2938         at java.lang.ClassLoader.loadClass(ClassLoader.java:357)
2939         ... 26 more
2940
2941 ```
2942
2943 Workaround via any _one_ of the following:
* If you are running against a Hadoop cluster that is 2.8+, ensure you replace the Hadoop libraries in the default binary assembly with those for your version.
2945 * If you are running against a Hadoop cluster that is 2.8+, build the binary assembly from the source release while specifying your Hadoop version.
2946 * If you are running against a Hadoop cluster that is a supported 2.7 release, ensure the `hadoop` executable is in the `PATH` seen at Region Server startup and that you are not using the `HBASE_DISABLE_HADOOP_CLASSPATH_LOOKUP` bypass.
2947 * For any supported Hadoop version, manually make the Apache HTrace artifact `htrace-core-3.1.0-incubating.jar` available to all Region Servers via the HBASE_CLASSPATH environment variable.
2948 * For any supported Hadoop version, manually make the Apache HTrace artifact `htrace-core-3.1.0-incubating.jar` available to all Region Servers by copying it into the directory `${HBASE_HOME}/lib/client-facing-thirdparty/`.
2949
2950
2951 ---
2952
2953 * [HBASE-22065](https://issues.apache.org/jira/browse/HBASE-22065) | *Major* | **Add listTableDescriptors(List\<TableName\>) method in AsyncAdmin**
2954
2955 Add a listTableDescriptors(List\<TableName\>) method in the AsyncAdmin interface, to align with the Admin interface.
2956
2957
2958 ---
2959
2960 * [HBASE-22063](https://issues.apache.org/jira/browse/HBASE-22063) | *Major* | **Deprecated Admin.deleteSnapshot(byte[])**
2961
Deprecates Admin.deleteSnapshot(byte[]); please use the String version instead.
2963
2964
2965 ---
2966
2967 * [HBASE-22040](https://issues.apache.org/jira/browse/HBASE-22040) | *Major* | **Add mergeRegionsAsync with a List of region names method in AsyncAdmin**
2968
2969 Add a mergeRegionsAsync(byte[][], boolean) method in the AsyncAdmin interface.
2970
On the client side, attempting to merge fewer than 2 regions now throws IllegalArgumentException instead of relying on an assert. On the master side, attempting to merge more than 2 regions now throws DoNotRetryIOException instead of relying on an assert, since only merging two regions at once is supported for now.
2972
2973
2974 ---
2975
2976 * [HBASE-22039](https://issues.apache.org/jira/browse/HBASE-22039) | *Major* | **Should add the synchronous parameter for the XXXSwitch method in AsyncAdmin**
2977
Add a drainXXX parameter to the balancerSwitch/splitSwitch/mergeSwitch methods in the AsyncAdmin interface, which has the same meaning as the synchronous parameter for these methods in the Admin interface.
2979
2980
2981 ---
2982
2983 * [HBASE-22044](https://issues.apache.org/jira/browse/HBASE-22044) | *Major* | **ByteBufferUtils should not be IA.Public API**
2984
2985 <!-- markdown -->
2986
2987 As of HBase 3.0, the ByteBufferUtils class is now marked as a Private API for internal project use only. Downstream users are advised that it no longer has any compatibility promises across releases.
2988
In earlier HBase release lines, the class is now marked as deprecated to call attention to this planned transition.
2990
2991
2992 ---
2993
2994 * [HBASE-21810](https://issues.apache.org/jira/browse/HBASE-21810) | *Major* | **bulkload  support set hfile compression on client**
2995
Bulkload (HFileOutputFormat2) now supports configuring compression on the client side. Set the job configuration "hbase.mapreduce.hfileoutputformat.compression" to override the auto-detection of the target table's compression.
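
The override rule can be pictured as a simple precedence check. The sketch below is a stand-in, not the HFileOutputFormat2 code: it uses `java.util.Properties` in place of the job configuration, the class and method names are hypothetical, and the compression names are example values only.

```java
import java.util.Properties;

final class CompressionOverrideSketch {
  static final String COMPRESSION_OVERRIDE_KEY =
      "hbase.mapreduce.hfileoutputformat.compression";

  /** Returns the job-level override when set, else the compression detected from the table. */
  static String effectiveCompression(Properties jobConf, String detectedFromTable) {
    String override = jobConf.getProperty(COMPRESSION_OVERRIDE_KEY);
    return (override == null || override.isEmpty()) ? detectedFromTable : override;
  }

  public static void main(String[] args) {
    Properties jobConf = new Properties();
    System.out.println(effectiveCompression(jobConf, "SNAPPY"));
    jobConf.setProperty(COMPRESSION_OVERRIDE_KEY, "GZ");
    System.out.println(effectiveCompression(jobConf, "SNAPPY"));
  }
}
```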
2997
2998
2999 ---
3000
3001 * [HBASE-22001](https://issues.apache.org/jira/browse/HBASE-22001) | *Major* | **Polish the Admin interface**
3002
* Add a cloneSnapshotAsync method with a restoreAcl parameter.
* Deprecate the restoreSnapshotAsync method, as it just ignores the failsafe configuration.
* Make the snapshotAsync method return a Future\<Void\>.
* Deprecate the snapshot related methods which take a 'byte[]' as the snapshot name.
* Use default methods to reduce the code base for implementation classes.
3008
3009
3010 ---
3011
3012 * [HBASE-22000](https://issues.apache.org/jira/browse/HBASE-22000) | *Major* | **Deprecated isTableAvailable with splitKeys**
3013
3014 Deprecated AsyncTable.isTableAvailable(TableName, byte[][]).
3015
3016
3017 ---
3018
3019 * [HBASE-21871](https://issues.apache.org/jira/browse/HBASE-21871) | *Major* | **Support to specify a peer table name in VerifyReplication tool**
3020
After HBASE-21871, we can specify a peer table name with --peerTableName in the VerifyReplication tool, as follows:

hbase org.apache.hadoop.hbase.mapreduce.replication.VerifyReplication --peerTableName=peerTable 5 TestTable

In addition, we can compare any two tables in any remote clusters by specifying both peerId and --peerTableName.

For example:

hbase org.apache.hadoop.hbase.mapreduce.replication.VerifyReplication --peerTableName=peerTable zk1,zk2,zk3:2181/hbase TestTable
3028
3029
3030 ---
3031
3032 * [HBASE-15728](https://issues.apache.org/jira/browse/HBASE-15728) | *Major* | **Add remaining per-table region / store / flush / compaction related metrics**
3033
Adds the following split, flush, and compaction metrics:

    // split related metrics
    private MutableFastCounter splitRequest;
    private MutableFastCounter splitSuccess;
    private MetricHistogram splitTimeHisto;

    // flush related metrics
    private MetricHistogram flushTimeHisto;
    private MetricHistogram flushMemstoreSizeHisto;
    private MetricHistogram flushOutputSizeHisto;
    private MutableFastCounter flushedMemstoreBytes;
    private MutableFastCounter flushedOutputBytes;

    // compaction related metrics
    private MetricHistogram compactionTimeHisto;
    private MetricHistogram compactionInputFileCountHisto;
    private MetricHistogram compactionInputSizeHisto;
    private MetricHistogram compactionOutputFileCountHisto;
    private MetricHistogram compactionOutputSizeHisto;
    private MutableFastCounter compactedInputBytes;
    private MutableFastCounter compactedOutputBytes;

    private MetricHistogram majorCompactionTimeHisto;
    private MetricHistogram majorCompactionInputFileCountHisto;
    private MetricHistogram majorCompactionInputSizeHisto;
    private MetricHistogram majorCompactionOutputFileCountHisto;
    private MetricHistogram majorCompactionOutputSizeHisto;
    private MutableFastCounter majorCompactedInputBytes;
    private MutableFastCounter majorCompactedOutputBytes;
3064
3065
3066 ---
3067
3068 * [HBASE-21481](https://issues.apache.org/jira/browse/HBASE-21481) | *Major* | **[acl] Superuser's permissions should not be granted or revoked by any non-su global admin**
3069
HBASE-21481 improves the quality of access control by strengthening the protection of super users' privileges: a superuser's permissions can no longer be granted or revoked by a global admin who is not a superuser.
3071
3072
3073 ---
3074
3075 * [HBASE-21082](https://issues.apache.org/jira/browse/HBASE-21082) | *Critical* | **Reimplement assign/unassign related procedure metrics**
3076
Now we have four types of RIT procedure metrics: assign, unassign, move, and reopen. The meaning of assign/unassign has changed, as moving a region no longer increases the unassign metric and then the assign metric.

Also introduced two new procedure metrics, open and close, which track the open/close region calls to the region server. We may send open/close multiple times to finish a RIT, since we may retry multiple times.
3079
3080
3081 ---
3082
3083 * [HBASE-20724](https://issues.apache.org/jira/browse/HBASE-20724) | *Critical* | **Sometimes some compacted storefiles are still opened after region failover**
3084
Problem: This is an old problem dating back to HBASE-2231. The compaction event marker was only written to the WAL. But after a flush, the WAL may be archived, which means a useful compaction event marker may be deleted, too. So the compacted store files cannot be archived when the region opens and replays the WAL.

Solution: After this jira, the compaction event tracker is written to the HFile. When a region opens and loads its store files, it reads the compaction event tracker from the HFile and archives the compacted store files which still exist.
3088
3089
3090 ---
3091
3092 * [HBASE-21820](https://issues.apache.org/jira/browse/HBASE-21820) | *Major* | **Implement CLUSTER quota scope**
3093
HBase contains two quota scopes: MACHINE and CLUSTER. Before this patch, set quota operations did not expose the scope option to the client API and always used MACHINE as the default, so CLUSTER scope could not be set or used.
Shell commands were as follows:

set\_quota TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec'

This issue implements CLUSTER scope in a simple way: for user, namespace, and user over namespace quotas, [ClusterLimit / RSNum] is used as the machine limit. For table and user over table quotas, [ClusterLimit / TotalTableRegionNum \* MachineTableRegionNum] is used as the machine limit.
After this patch, users can set a CLUSTER scope quota, but MACHINE is still the default if the scope is omitted.
Shell commands are as follows:

set\_quota TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec'

set\_quota TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec', SCOPE =\> MACHINE

set\_quota TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec', SCOPE =\> CLUSTER
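
The two machine-limit formulas above work out as follows. This is a minimal sketch of the arithmetic only: the class and method names are hypothetical, and the use of floating-point division (rather than whatever rounding the real implementation applies) is an assumption.

```java
final class ClusterQuotaSketch {
  /** User, namespace, and user over namespace quotas: ClusterLimit / RSNum. */
  static double machineLimit(double clusterLimit, int regionServerCount) {
    return clusterLimit / regionServerCount;
  }

  /**
   * Table and user over table quotas:
   * ClusterLimit / TotalTableRegionNum * MachineTableRegionNum.
   */
  static double machineTableLimit(double clusterLimit, int totalTableRegions,
      int tableRegionsOnThisServer) {
    return clusterLimit / totalTableRegions * tableRegionsOnThisServer;
  }

  public static void main(String[] args) {
    // A 100 req/sec cluster limit spread over 4 region servers.
    System.out.println(machineLimit(100, 4));
    // A 100 req/sec table limit, 10 regions in total, 3 of them on this server.
    System.out.println(machineTableLimit(100, 10, 3));
  }
}
```

In the second case a server's share follows the number of the table's regions it hosts, so a server hosting 3 of 10 regions gets 30 of the 100 req/sec.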
3104
3105
3106 ---
3107
3108 * [HBASE-21057](https://issues.apache.org/jira/browse/HBASE-21057) | *Minor* | **upgrade to latest spotbugs**
3109
3110 Change spotbugs version to 3.1.11.
3111
3112
3113 ---
3114
3115 * [HBASE-21505](https://issues.apache.org/jira/browse/HBASE-21505) | *Major* | **Several inconsistencies on information reported for Replication Sources by hbase shell status 'replication' command.**
3116
This modifies the "status 'replication'" output, fixing inconsistencies in the reported times and ages of last shipped edits, as well as the incorrect calculation of replication lag.

It also introduces additional info for each recovery queue, which was not accounted for by this command before.

The new output of the "status 'replication'" command is explained in detail below:
a) Source started, target stopped, no edits arrived on source yet:
...
       SOURCE: PeerID=1
         Normal Queue: 1
           No Ops shipped since last restart, SizeOfLogQueue=1, No edits for this source since it started, Replication Lag=0
...

b) Source started, target stopped, add edit on source:
...
         Normal Queue: 1
           No Ops shipped since last restart, SizeOfLogQueue=1, TimeStampOfLastArrivedInSource=Wed Nov 21 07:21:00 GMT 2018, Replication Lag=2459
...

c) Source started, target stopped, edit added on source, restart source:
...
       SOURCE: PeerID=1
         Normal Queue: 1
           No Ops shipped since last restart, SizeOfLogQueue=1, No edits for this source since it started, Replication Lag=0
         Recovered Queue: 1-hbase01.home,16020,1542784524057
           No Ops shipped since last restart, SizeOfLogQueue=1, TimeStampOfLastArrivedInSource=Wed Nov 21 07:23:00 GMT 2018, Replication Lag=201495
...

d) Source started, target stopped, add edit on source, restart source, add another edit on source:
...
       SOURCE: PeerID=1
         Normal Queue: 1
           No Ops shipped since last restart, SizeOfLogQueue=1, TimeStampOfLastArrivedInSource=Wed Nov 21 07:02:28 GMT 2018, Replication Lag=6349
         Recovered Queue: 1-hbase01.home,16020,1542782758742
           No Ops shipped since last restart, SizeOfLogQueue=0, TimeStampOfLastArrivedInSource=Wed Nov 21 06:53:05 GMT 2018, Replication Lag=569394
...

e) Source started, target stopped, add edit on source, restart source, add another edit on source, start target:
...
       SOURCE: PeerID=1
         Normal Queue: 1
           AgeOfLastShippedOp=30000, TimeStampOfLastShippedOp=Wed Nov 21 07:07:58 GMT 2018, SizeOfLogQueue=1, TimeStampOfLastArrivedInSource=Wed Nov 21 07:02:28 GMT 2018, Replication Lag=0
...

f) Source started, target stopped, add edit on source, restart source, restart target:
...
       SOURCE: PeerID=1
         Normal Queue: 1
           No Ops shipped since last restart, SizeOfLogQueue=1, No edits for this source since it started, Replication Lag=0
...
3161
3162
3163 ---
3164
3165 * [HBASE-21922](https://issues.apache.org/jira/browse/HBASE-21922) | *Major* | **BloomContext#sanityCheck may failed when use ROWPREFIX\_DELIMITED bloom filter**
3166
Removes the bloom filter type ROWPREFIX\_DELIMITED. It may be added back once a better solution is found.
3168
3169
3170 ---
3171
3172 * [HBASE-21783](https://issues.apache.org/jira/browse/HBASE-21783) | *Major* | **Support exceed user/table/ns throttle quota if region server has available quota**
3173
Supports enabling or disabling exceed throttle quota. Exceed throttle quota means that a user can over-consume a user/namespace/table quota if the region server has additional available quota because other users are not consuming it at the same time.
Use the following shell commands to enable/disable exceed throttle quota:

enable\_exceed\_throttle\_quota

disable\_exceed\_throttle\_quota

There are two limits when enabling exceed throttle quota:
1. At least one read and one write region server throttle quota must be set;
2. All region server throttle quotas must use the seconds time unit. Once previous requests exceed their quota and consume region server quota, a quota in another time unit may take a long time to refill, which may affect later requests.
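
One way to picture the behaviour is as a two-level check, sketched below. This is a simplified model rather than the HBase quota code: the class and method names are hypothetical, quotas are reduced to single "available" numbers, and treating the region server quota as a hard cap in both modes is an assumption.

```java
final class ExceedThrottleSketch {
  /**
   * Decides whether a request of the given cost may proceed.
   * userAvailable stands for the remaining user/namespace/table quota,
   * regionServerAvailable for the remaining region server quota.
   */
  static boolean allow(long cost, long userAvailable, long regionServerAvailable,
      boolean exceedEnabled) {
    if (cost > regionServerAvailable) {
      return false;              // assumption: the region server quota is never exceeded
    }
    if (cost <= userAvailable) {
      return true;               // within the user's own quota
    }
    return exceedEnabled;        // over the user's quota: only allowed when exceed is on
  }

  public static void main(String[] args) {
    System.out.println(allow(10, 5, 100, false));
    System.out.println(allow(10, 5, 100, true));
  }
}
```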
3180
3181
3182 ---
3183
3184 * [HBASE-20587](https://issues.apache.org/jira/browse/HBASE-20587) | *Major* | **Replace Jackson with shaded thirdparty gson**
3185
Removes jackson dependencies from most hbase modules except hbase-rest, and uses shaded gson instead. The output json will be a bit different, since jackson can use getters/setters but gson always uses the fields.
3187
3188
3189 ---
3190
3191 * [HBASE-21928](https://issues.apache.org/jira/browse/HBASE-21928) | *Major* | **Deprecated HConstants.META\_QOS**
3192
Marks HConstants.META\_QOS as deprecated. It is for internal use only and is the highest priority. You should not try to set a priority greater than or equal to this value; doing so does no harm, but it is also useless.
3194
3195
3196 ---
3197
3198 * [HBASE-17942](https://issues.apache.org/jira/browse/HBASE-17942) | *Major* | **Disable region splits and merges per table**
3199
3200 This patch adds the ability to disable split and/or merge for a table (By default, split and merge are enabled for a table).
3201
3202
3203 ---
3204
3205 * [HBASE-21636](https://issues.apache.org/jira/browse/HBASE-21636) | *Major* | **Enhance the shell scan command to support missing scanner specifications like ReadType, IsolationLevel etc.**
3206
Allows the shell to set Scan options that were previously not exposed. See the additions in the scan help by typing the following in the hbase shell:
3208
3209 hbase\> help 'scan'
3210
3211
3212 ---
3213
3214 * [HBASE-21201](https://issues.apache.org/jira/browse/HBASE-21201) | *Major* | **Support to run VerifyReplication MR tool without peerid**
3215
We can specify peerQuorumAddress instead of peerId in the VerifyReplication tool, so it no longer requires a peerId to be set up when using this tool.
3217
3218 For example:
3219 hbase org.apache.hadoop.hbase.mapreduce.replication.VerifyReplication zk1,zk2,zk3:2181/hbase testTable
3220
3221
3222 ---
3223
3224 * [HBASE-21838](https://issues.apache.org/jira/browse/HBASE-21838) | *Major* | **Create a special ReplicationEndpoint just for verifying the WAL entries are fine**
3225
Introduces a VerifyWALEntriesReplicationEndpoint which replicates nothing and only verifies that all the cells are valid.
It can be used to catch bugs in writing the WAL, as most of the time we do not read the WALs again after writing them unless a region server crashes.
3228
3229
3230 ---
3231
3232 * [HBASE-21764](https://issues.apache.org/jira/browse/HBASE-21764) | *Major* | **Size of in-memory compaction thread pool should be configurable**
3233
Introduces a new config key, hbase.regionserver.inmemory.compaction.pool.size, with a default value of 10. It sets the size of the in-memory compaction thread pool. Note that all memstores in one region server share the same pool, so if you have many regions on one region server, you may need to set this larger so that compactions run faster, for better read performance.
3235
3236
3237 ---
3238
3239 * [HBASE-21684](https://issues.apache.org/jira/browse/HBASE-21684) | *Major* | **Throw DNRIOE when connection or rpc client is closed**
3240
3241 Make StoppedRpcClientException extend DoNotRetryIOException.
3242
3243
3244 ---
3245
3246 * [HBASE-21739](https://issues.apache.org/jira/browse/HBASE-21739) | *Major* | **Move grant/revoke from regionserver to master**
3247
To implement user permission control in Procedure V2, the grant and revoke methods are first moved from AccessController to the master.
AccessController#grant and AccessController#revoke are marked as deprecated; please use Admin#grant and Admin#revoke instead.
3250
3251
3252 ---
3253
3254 * [HBASE-21791](https://issues.apache.org/jira/browse/HBASE-21791) | *Blocker* | **Upgrade thrift dependency to 0.12.0**
3255
IMPORTANT: Due to security issues, all users who use hbase thrift should avoid using releases which do not have this fix.

The affected releases are:

* 2.1.x: 2.1.2 and below
* 2.0.x: 2.0.4 and below
* 1.x: 1.4.x and below

If you are using one of the affected releases above, please consider upgrading to a newer release ASAP.
3264
3265
3266 ---
3267
3268 * [HBASE-20894](https://issues.apache.org/jira/browse/HBASE-20894) | *Major* | **Move BucketCache from java serialization to protobuf**
3269
3270 For users who have configured hbase.bucketcache.ioengine with either the file:, files:, or mmap: prefix, and configured it to be persistent via the hbase.bucketcache.persistent.path property, the serialization format of the bucket cache has changed between versions. The old state will not be read during startup, and there is currently no migration path. The impact is expected to be minimal, however, since the cache will rebuild over time as access patterns dictate.
3271
3272
3273
3274 # HBASE  2.3.0 Release Notes
3275
3276 These release notes cover new developer and user-facing incompatibilities, important issues, features, and major improvements.
3277
3278
3279 ---
3280
3281 * [HBASE-24603](https://issues.apache.org/jira/browse/HBASE-24603) | *Critical* | **Zookeeper sync() call is async**
3282
3283 <!-- markdown -->
3284
Fixes a couple of bugs in ZooKeeper interaction. First, the zk sync() call, which is used to sync lagging followers with the leader so that the client sees a consistent snapshot state, was actually asynchronous under the hood. We make it synchronous for correctness. Second, zookeeper events are now processed in a separate thread rather than in the thread context of the zookeeper client connection. This decoupling frees up the client connection quickly and avoids deadlocks.
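
The first fix follows a common pattern: wrap a callback-based call so that the caller blocks until the callback fires. The sketch below shows that pattern in isolation. `AsyncClient` is a hypothetical stand-in and not the ZooKeeper client API, and the result-code convention and timeout handling are assumptions.

```java
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.TimeoutException;
import java.util.function.IntConsumer;

final class BlockingSyncSketch {
  /** Hypothetical client whose sync() returns at once and reports its result via a callback. */
  interface AsyncClient {
    void sync(String path, IntConsumer callback);
  }

  /** Blocks until the callback has fired, so the caller only proceeds after the sync is done. */
  static int syncAndWait(AsyncClient client, String path, long timeoutMs) {
    CompletableFuture<Integer> done = new CompletableFuture<>();
    client.sync(path, rc -> done.complete(rc));
    try {
      return done.get(timeoutMs, TimeUnit.MILLISECONDS);
    } catch (InterruptedException e) {
      Thread.currentThread().interrupt();
      throw new IllegalStateException("interrupted while waiting for sync", e);
    } catch (ExecutionException | TimeoutException e) {
      throw new IllegalStateException("sync did not complete", e);
    }
  }

  public static void main(String[] args) {
    // A fake client that completes the sync on another thread with result code 0.
    AsyncClient fake = (path, cb) -> CompletableFuture.runAsync(() -> cb.accept(0));
    System.out.println("result code: " + syncAndWait(fake, "/hbase", 1000));
  }
}
```

Running the callback on a thread other than the caller's is what makes blocking safe here; blocking on the same thread that must deliver the callback would deadlock, which is the kind of coupling the second fix removes.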
3286
3287
3288 ---
3289
3290 * [HBASE-24631](https://issues.apache.org/jira/browse/HBASE-24631) | *Major* | **Loosen Dockerfile pinned package versions of the "debian-revision"**
3291
3292 <!-- markdown -->
Update our package version numbers throughout the Dockerfiles to be pinned to their epoch:upstream-version components only. Previously we'd specify the full debian package version number, including the debian-revision. This led to instability as debian packaging details changed.
3294 See also [man deb-version](http://manpages.ubuntu.com/manpages/xenial/en/man5/deb-version.5.html)
3295
3296
3297 ---
3298
3299 * [HBASE-24205](https://issues.apache.org/jira/browse/HBASE-24205) | *Major* | **Create metric to know the number of reads that happens from memstore**
3300
Adds new metrics that count read requests (tracked per row) by whether the row was fetched completely from the memstore or was pulled from both files and the memstore.
The metrics are collected under the mbean for tables and under the mbean for regions.

Under the table mbean, i.e.

"name": "Hadoop:service=HBase,name=RegionServer,sub=Tables"

the new metrics are listed as

"Namespace\_default\_table\_t3\_columnfamily\_f1\_metric\_memstoreOnlyRowReadsCount": 5,

"Namespace\_default\_table\_t3\_columnfamily\_f1\_metric\_mixedRowReadsCount": 1,

where the format is

Namespace\_\<namespacename\>\_table\_\<tableName\>\_columnfamily\_\<columnfamilyname\>\_metric\_memstoreOnlyRowReadsCount

Namespace\_\<namespacename\>\_table\_\<tableName\>\_columnfamily\_\<columnfamilyname\>\_metric\_mixedRowReadsCount

The same metrics under the region mbean, i.e.

"name": "Hadoop:service=HBase,name=RegionServer,sub=Regions"

are listed as

"Namespace\_default\_table\_t3\_region\_75a7846f4ac4a2805071a855f7d0dbdc\_store\_f1\_metric\_memstoreOnlyRowReadsCount": 5,

"Namespace\_default\_table\_t3\_region\_75a7846f4ac4a2805071a855f7d0dbdc\_store\_f1\_metric\_mixedRowReadsCount": 1,

where the format is

Namespace\_\<namespacename\>\_table\_\<tableName\>\_region\_\<regionName\>\_store\_\<storeName\>\_metric\_memstoreOnlyRowReadsCount

Namespace\_\<namespacename\>\_table\_\<tableName\>\_region\_\<regionName\>\_store\_\<storeName\>\_metric\_mixedRowReadsCount

This is also an aggregate, for every store, of the number of reads that were served purely from the memstore and of the mixed reads that were served from both the memstore and files.
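
For anyone scraping these metrics, the name patterns above can be assembled with plain string concatenation. The helper below is a hypothetical convenience for building the expected key, not part of HBase; the example values are the ones shown in the sample output.

```java
final class MemstoreReadMetricNames {
  /** Builds a per-table, per-column-family metric name following the documented pattern. */
  static String tableMetric(String namespace, String table, String columnFamily, String metric) {
    return "Namespace_" + namespace + "_table_" + table
        + "_columnfamily_" + columnFamily + "_metric_" + metric;
  }

  /** Builds a per-region, per-store metric name following the documented pattern. */
  static String regionMetric(String namespace, String table, String region, String store,
      String metric) {
    return "Namespace_" + namespace + "_table_" + table
        + "_region_" + region + "_store_" + store + "_metric_" + metric;
  }

  public static void main(String[] args) {
    System.out.println(tableMetric("default", "t3", "f1", "memstoreOnlyRowReadsCount"));
    System.out.println(regionMetric("default", "t3", "75a7846f4ac4a2805071a855f7d0dbdc",
        "f1", "mixedRowReadsCount"));
  }
}
```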
3325
3326
3327 ---
3328
3329 * [HBASE-21773](https://issues.apache.org/jira/browse/HBASE-21773) | *Critical* | **rowcounter utility should respond to pleas for help**
3330
This adds [-h\|-help] options to rowcounter. Passing either -h or -help prints the rowcounter usage guide, as below:
3332
3333 $hbase rowcounter -h
3334
3335 usage: hbase rowcounter \<tablename\> [options] [\<column1\> \<column2\>...]
3336 Options:
3337     --starttime=\<arg\>       starting time filter to start counting rows from.
3338     --endtime=\<arg\>         end time filter limit, to only count rows up to this timestamp.
3339     --range=\<arg\>           [startKey],[endKey][;[startKey],[endKey]...]]
3340     --expectedCount=\<arg\>   expected number of rows to be count.
3341 For performance, consider the following configuration properties:
3342 -Dhbase.client.scanner.caching=100
3343 -Dmapreduce.map.speculative=false
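
The --range value packs several key ranges into one argument: ranges are separated by ';' and the start and end keys of each range by ','. The sketch below shows one way such a string could be split. It is not the rowcounter parser: the class and method names are hypothetical, and treating an omitted key as an empty string is an assumption.

```java
import java.util.ArrayList;
import java.util.List;

final class RowCounterRangeSketch {
  /** Splits "start,end;start,end" into [start, end] pairs; an omitted key becomes "". */
  static List<String[]> parseRanges(String arg) {
    List<String[]> ranges = new ArrayList<>();
    for (String part : arg.split(";")) {
      String[] keys = part.split(",", -1);   // -1 keeps a trailing empty end key
      if (keys.length != 2) {
        throw new IllegalArgumentException("expected [startKey],[endKey] but got: " + part);
      }
      ranges.add(keys);
    }
    return ranges;
  }

  public static void main(String[] args) {
    for (String[] range : parseRanges("a,c;,z;m,")) {
      System.out.println("start='" + range[0] + "' end='" + range[1] + "'");
    }
  }
}
```

Keys that themselves contain ',' or ';' are not handled by this sketch.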
3344
3345
3346 ---
3347
3348 * [HBASE-24217](https://issues.apache.org/jira/browse/HBASE-24217) | *Major* | **Add hadoop 3.2.x support**
3349
3350 CI coverage has been extended to include Hadoop 3.2.x for HBase 2.2+.
3351
3352
3353 ---
3354
3355 * [HBASE-23055](https://issues.apache.org/jira/browse/HBASE-23055) | *Major* | **Alter hbase:meta**
3356
Adds the ability to edit the hbase:meta table schema. For example,
3358
3359 hbase(main):006:0\> alter 'hbase:meta', {NAME =\> 'info', DATA\_BLOCK\_ENCODING =\> 'ROW\_INDEX\_V1'}
3360 Updating all regions with the new schema...
3361 All regions updated.
3362 Done.
3363 Took 1.2138 seconds
3364
You can even add column families. However, you cannot delete any of the core hbase:meta column families such as 'info' and 'table'.
3366
3367
3368 ---
3369
3370 * [HBASE-15161](https://issues.apache.org/jira/browse/HBASE-15161) | *Major* | **Umbrella: Miscellaneous improvements from production usage**
3371
3372 This ticket summarizes significant improvements and expansion to the metrics surface area. Interested users should review the individual sub-tasks.
3373
3374
3375 ---
3376
3377 * [HBASE-24545](https://issues.apache.org/jira/browse/HBASE-24545) | *Major* | **Add backoff to SCP check on WAL split completion**
3378
Adds backoff to the ServerCrashProcedure wait on WAL split completion when there is a large backlog of files to split. (It is possible to avoid SCP blocking while waiting on WALs to split by using procedure-based splitting: set 'hbase.split.wal.zk.coordinated' to false to enable procedure-based WAL splitting.)
3380
3381
3382 ---
3383
3384 * [HBASE-24524](https://issues.apache.org/jira/browse/HBASE-24524) | *Minor* | **SyncTable logging improvements**
3385
Note that this changes the log level for mismatching row keys: originally these were logged at INFO level, and now they are logged at DEBUG level. This is consistent with the logging of mismatching cells. Also, for missing row keys, row key values are now logged in human readable format, making them more meaningful for operators troubleshooting mismatches.
3387
3388
3389 ---
3390
3391 * [HBASE-24359](https://issues.apache.org/jira/browse/HBASE-24359) | *Major* | **Optionally ignore edits for deleted CFs for replication.**
3392
Introduces a new config, hbase.replication.drop.on.deleted.columnfamily, which defaults to false. When set to true, replication drops the edits for a column family that has been deleted from both the replication source and target.
3394
3395
3396 ---
3397
3398 * [HBASE-24418](https://issues.apache.org/jira/browse/HBASE-24418) | *Major* | **Consolidate Normalizer implementations**
3399
3400 <!-- markdown -->
3401 This change extends the Normalizer with a handful of new configurations. The configuration points supported are:
3402 * `hbase.normalizer.split.enabled` Whether to split a region as part of normalization. Default: `true`.
* `hbase.normalizer.merge.enabled` Whether to merge a region as part of normalization. Default: `true`.
3404 * `hbase.normalizer.min.region.count` The minimum number of regions in a table to consider it for merge normalization. Default: 3.
3405 * `hbase.normalizer.merge.min_region_age.days` The minimum age for a region to be considered for a merge, in days. Default: 3.
3406 * `hbase.normalizer.merge.min_region_size.mb` The minimum size for a region to be considered for a merge, in whole MBs. Default: 1.
3407
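As an illustration of how the three merge thresholds combine, here is a minimal Java sketch. It is not the Normalizer's code: the class `MergeEligibility`, its method, and the inclusive comparisons are assumptions made for this example. Only the configuration names and default values come from the list above.

```java
import java.time.Duration;

/** Hypothetical helper for illustration; not part of HBase. */
final class MergeEligibility {
  // Default values as listed in the release note.
  static final int MIN_REGION_COUNT = 3;     // hbase.normalizer.min.region.count
  static final int MIN_REGION_AGE_DAYS = 3;  // hbase.normalizer.merge.min_region_age.days
  static final long MIN_REGION_SIZE_MB = 1;  // hbase.normalizer.merge.min_region_size.mb

  /**
   * Returns true when a region passes every merge threshold.
   * Treating each threshold as an inclusive lower bound is an assumption.
   */
  static boolean isMergeCandidate(boolean mergeEnabled, int tableRegionCount,
      Duration regionAge, long regionSizeMb) {
    return mergeEnabled
        && tableRegionCount >= MIN_REGION_COUNT
        && regionAge.toDays() >= MIN_REGION_AGE_DAYS
        && regionSizeMb >= MIN_REGION_SIZE_MB;
  }

  public static void main(String[] args) {
    // A 4-day-old, 10 MB region in a 5-region table passes all thresholds.
    System.out.println(isMergeCandidate(true, 5, Duration.ofDays(4), 10));
    // The same region in a 2-region table does not: the table is too small.
    System.out.println(isMergeCandidate(true, 2, Duration.ofDays(4), 10));
  }
}
```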
3408
3409 ---
3410
3411 * [HBASE-24309](https://issues.apache.org/jira/browse/HBASE-24309) | *Major* | **Avoid introducing log4j and slf4j-log4j dependencies for modules other than hbase-assembly**
3412
3413 Add a hbase-logging module, put the log4j related code in this module only so other modules do not need to depend on log4j at compile scope. See the comments of Log4jUtils and InternalLog4jUtils for more details.
3414
3415 Add a log4j.properties to the test jar of hbase-logging module, so for other sub modules we just need to depend on the test jar of hbase-logging module at test scope to output the log to console, without placing a log4j.properties in the test resources as they all (almost) have the same content. And this test module will not be included in the assembly tarball so it will not mess up the binary distribution.
3416
Ban direct commons-logging dependency, and ban commons-logging and log4j imports in non-test code, to avoid messing up downstream users' logging frameworks. In the hbase-logging module we do need to use log4j classes, and the trick is to use fully qualified class names.

Add jcl-over-slf4j and jul-to-slf4j dependencies: some of our dependencies use jcl or jul as their logging framework, so we should also redirect their log messages to slf4j.
3420
3421
3422 ---
3423
3424 * [HBASE-21406](https://issues.apache.org/jira/browse/HBASE-21406) | *Minor* | **"status 'replication'" should not show SINK if the cluster does not act as sink**
3425
3426 Added new metric to differentiate sink startup time from last OP applied time.
3427
The original behaviour was to always set the startup time to TimestampsOfLastAppliedOp, and to always show it in the "status 'replication'" command, regardless of whether the sink had ever applied any OP.

This was confusing, especially in scenarios where the cluster was acting only as a source: the output could lead to wrong interpretations about the sink not applying edits or replication being stuck.

With the new metric, we now compare the two metric values, assuming that if both are the same, no OP has ever been shipped to the given sink. The output then reflects this more clearly, for example:
3433
3434 SINK: TimeStampStarted=Thu Dec 06 23:59:47 GMT 2018, Waiting for OPs...
3435
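The comparison described above can be sketched in a few lines of Java. This is only an illustration: the class `SinkStatus`, its methods, the raw-millisecond formatting, and the wording of the second branch are assumptions for this example, not the shell's actual code or output.

```java
/** Hypothetical illustration of the sink status check; not HBase code. */
final class SinkStatus {
  /** Equal timestamps are taken to mean that no OP has been applied since startup. */
  static boolean neverAppliedOp(long startedTs, long lastAppliedOpTs) {
    return startedTs == lastAppliedOpTs;
  }

  static String describe(long startedTs, long lastAppliedOpTs) {
    if (neverAppliedOp(startedTs, lastAppliedOpTs)) {
      // The real output prints a formatted date; raw millis keep the sketch short.
      return "SINK: TimeStampStarted=" + startedTs + ", Waiting for OPs...";
    }
    // The wording of this branch is made up for the sketch.
    return "SINK: TimeStampStarted=" + startedTs + ", LastAppliedOp=" + lastAppliedOpTs;
  }

  public static void main(String[] args) {
    System.out.println(describe(1544140787000L, 1544140787000L));
    System.out.println(describe(1544140787000L, 1544140999000L));
  }
}
```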
3436
3437 ---
3438
3439 * [HBASE-24132](https://issues.apache.org/jira/browse/HBASE-24132) | *Major* | **Upgrade to Apache ZooKeeper 3.5.7**
3440
3441 <!-- markdown -->
HBase now ships ZooKeeper 3.5.x; it previously shipped the EOL'd 3.4.x. A 3.5.x client can talk to a 3.4.x ensemble.
3443
3444 The ZooKeeper project has built a [FAQ](https://cwiki.apache.org/confluence/display/ZOOKEEPER/Upgrade+FAQ) that documents known issues and work-arounds when upgrading existing deployments.
3445
3446
3447 ---
3448
3449 * [HBASE-22287](https://issues.apache.org/jira/browse/HBASE-22287) | *Major* | **inifinite retries on failed server in RSProcedureDispatcher**
3450
3451 Add backoff. Avoid retrying every 100ms.
3452
3453
3454 ---
3455
3456 * [HBASE-24425](https://issues.apache.org/jira/browse/HBASE-24425) | *Major* | **Run hbck\_chore\_run and catalogjanitor\_run on draw of 'HBCK Report' page**
3457
3458 Runs 'catalogjanitor\_run' and 'hbck\_chore\_run' inline with the loading of the 'HBCK Report' page.
3459
Pass '?cache=true' to skip inline invocation of 'catalogjanitor\_run' and 'hbck\_chore\_run' when drawing the page.
3461
3462
3463 ---
3464
3465 * [HBASE-24408](https://issues.apache.org/jira/browse/HBASE-24408) | *Blocker* | **Introduce a general 'local region' to store data on master**
3466
3467 Introduced a general 'local region' at master side to store the procedure data, etc.
3468
The hfiles of this region will be stored on the root fs while the wal will be stored on the wal fs. This issue supersedes part of the code for HBASE-23326, as now we store the data in the 'MasterData' directory instead of 'MasterProcs'.
3470
3471 The old hfiles will be moved to the global hfile archived directory with the suffix $-masterlocalhfile-$. The wal files will be moved to the global old wal directory with the suffix $masterlocalwal$. The TimeToLiveMasterLocalStoreHFileCleaner and TimeToLiveMasterLocalStoreWALCleaner are configured by default for cleaning the old hfiles and wal files, and the default TTLs are both 7 days.
3472
3473
3474 ---
3475
3476 * [HBASE-24115](https://issues.apache.org/jira/browse/HBASE-24115) | *Major* | **Relocate test-only REST "client" from src/ to test/ and mark Private**
3477
Relocate the test-only REST RemoteHTable and RemoteAdmin from src/ to test/, and mark them as InterfaceAudience.Private.
3479
3480
3481 ---
3482
3483 * [HBASE-23938](https://issues.apache.org/jira/browse/HBASE-23938) | *Major* | **Replicate slow/large RPC calls to HDFS**
3484
3485 Config key: hbase.regionserver.slowlog.systable.enabled
3486 Default value: false
3487
3488 This config can be enabled if hbase.regionserver.slowlog.buffer.enabled is already enabled. While hbase.regionserver.slowlog.buffer.enabled ensures that any slow/large RPC logs with complete details are written to ring buffer available at each RegionServer, hbase.regionserver.slowlog.systable.enabled would ensure that all such logs are also persisted in new system table hbase:slowlog.
3489 Operator can scan hbase:slowlog with filters to retrieve specific attribute matching records and this table would be useful to capture historical performance of slowness of RPC calls with detailed analysis.
3490
3491 hbase:slowlog consists of single ColumnFamily info. info consists of multiple qualifiers similar to the attributes available to query as part of Admin API: get\_slowlog\_responses.
3492
3493 One example of a row from hbase:slowlog scan result (Attached a sample screenshot in the Jira) :
3494
3495  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:call\_details, timestamp=2020-05-16T14:59:58.764Z, value=Scan(org.apache.hadoop.hbase.shaded.protobuf.generated.ClientProtos$ScanRequest)
3496  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:client\_address, timestamp=2020-05-16T14:59:58.764Z, value=172.20.10.2:57348
3497  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:method\_name, timestamp=2020-05-16T14:59:58.764Z, value=Scan
3498  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:param, timestamp=2020-05-16T14:59:58.764Z, value=region { type: REGION\_NAME value: "cluster\_test,cccccccc,1589635796466.aa45e1571d533f5ed0bb31cdccaaf9cf." } scan { a
3499                                                              ttribute { name: "\_isolationlevel\_" value: "\\x5C000" } start\_row: "cccccccc" time\_range { from: 0 to: 9223372036854775807 } max\_versions: 1 cache\_blocks: true max\_result\_size: 2
3500                                                              097152 caching: 2147483647 include\_stop\_row: false } number\_of\_rows: 2147483647 close\_scanner: false client\_handles\_partials: true client\_handles\_heartbeats: true track\_scan\_met
3501                                                              rics: false
3502  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:processing\_time, timestamp=2020-05-16T14:59:58.764Z, value=24
3503  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:queue\_time, timestamp=2020-05-16T14:59:58.764Z, value=0
3504  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:region\_name, timestamp=2020-05-16T14:59:58.764Z, value=cluster\_test,cccccccc,1589635796466.aa45e1571d533f5ed0bb31cdccaaf9cf.
3505  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:response\_size, timestamp=2020-05-16T14:59:58.764Z, value=211227
3506  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:server\_class, timestamp=2020-05-16T14:59:58.764Z, value=HRegionServer
3507  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:start\_time, timestamp=2020-05-16T14:59:58.764Z, value=1589640743932
3508  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:type, timestamp=2020-05-16T14:59:58.764Z, value=ALL
3509  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:username, timestamp=2020-05-16T14:59:58.764Z, value=vjasani
3510
3511
3512 ---
3513
3514 * [HBASE-24271](https://issues.apache.org/jira/browse/HBASE-24271) | *Major* | **Set values in \`conf/hbase-site.xml\` that enable running on \`LocalFileSystem\` out of the box**
3515
3516 <!-- markdown -->
HBASE-24271 makes changes to the default `conf/hbase-site.xml` such that `bin/hbase` will run directly out of the binary tarball or a compiled source tree, without any configuration modifications, against Hadoop 2.8+. This changes our long-standing history of shipping no configured values in `conf/hbase-site.xml`, so existing processes that assume this file is empty of configuration properties may require attention.
3518
3519
3520 ---
3521
3522 * [HBASE-24310](https://issues.apache.org/jira/browse/HBASE-24310) | *Major* | **Use Slf4jRequestLog for hbase-http**
3523
3524 Use Slf4jRequestLog instead of the log4j HttpRequestLogAppender in HttpServer.
3525
3526 The request log is disabled by default in conf/log4j.properties by the following lines:
3527
    # Disable request log by default, you can enable this by changing the appender
    log4j.category.http.requests=INFO,NullAppender
    log4j.additivity.http.requests=false
3531
Change 'NullAppender' to the appender of your choice to enable the request log.

Note that the logger name for the master status HTTP server is 'http.requests.master', and for the region server it is 'http.requests.regionserver'.
3535
3536
3537 ---
3538
3539 * [HBASE-24335](https://issues.apache.org/jira/browse/HBASE-24335) | *Major* | **Support deleteall with ts but without column in shell mode**
3540
Use an empty string to represent no column specified for deleteall in shell mode.
Usage:
3543 deleteall 'test','r1','',12345
3544 deleteall 'test', {ROWPREFIXFILTER =\> 'prefix'}, '', 12345
3545
3546
3547 ---
3548
3549 * [HBASE-24304](https://issues.apache.org/jira/browse/HBASE-24304) | *Major* | **Separate a hbase-asyncfs module**
3550
3551 Added a new hbase-asyncfs module to hold the asynchronous dfs output stream implementation for implementing WAL.
3552
3553
3554 ---
3555
3556 * [HBASE-22710](https://issues.apache.org/jira/browse/HBASE-22710) | *Major* | **Wrong result in one case of scan that use  raw and versions and filter together**
3557
Make the version-selection logic for raw scans more reasonable, to avoid losing results when a filter is used.
3559
3560
3561 ---
3562
3563 * [HBASE-24285](https://issues.apache.org/jira/browse/HBASE-24285) | *Major* | **Move to hbase-thirdparty-3.3.0**
3564
3565 Moved to hbase-thirdparty 3.3.0.
3566
3567
3568 ---
3569
3570 * [HBASE-24252](https://issues.apache.org/jira/browse/HBASE-24252) | *Major* | **Implement proxyuser/doAs mechanism for hbase-http**
3571
This feature enables the HBase Web UIs to accept a 'proxyuser' via the HTTP Request's query string. When the parameter \`hbase.security.authentication.spnego.kerberos.proxyuser.enable\` is set to \`true\` in hbase-site.xml (default is \`false\`), the HBase UI will attempt to impersonate the user specified by the query parameter "doAs". This query parameter is checked case-insensitively. When this option is not provided, the user who executed the request is the "real" user and there is no ability to execute impersonation against the WebUI.
3573
3574 For example, if the user "bob" with Kerberos credentials executes a request against the WebUI with this feature enabled and a query string which includes \`doAs=alice\`, the HBase UI will treat this request as executed as \`alice\`, not \`bob\`.
3575
3576 The standard Hadoop proxyuser configuration properties to limit users who may impersonate others apply to this change (e.g. to enable \`bob\` to impersonate \`alice\`). See the Hadoop documentation for more information on how to configure these proxyuser rules.
3577
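To make the case-insensitive "doAs" lookup concrete, here is a small self-contained Java sketch. It is not the hbase-http implementation: the class `DoAsParam` and its method are hypothetical, and the sketch skips URL-decoding and the proxyuser authorization check for brevity.

```java
import java.util.Optional;

/** Hypothetical illustration of a case-insensitive query parameter lookup. */
final class DoAsParam {
  /** Returns the value of the first "doAs" parameter, matching the name case-insensitively. */
  static Optional<String> doAsUser(String rawQuery) {
    if (rawQuery == null || rawQuery.isEmpty()) {
      return Optional.empty();
    }
    for (String pair : rawQuery.split("&")) {
      int eq = pair.indexOf('=');
      if (eq <= 0) {
        continue; // no name, or no '=' at all
      }
      String name = pair.substring(0, eq);
      String value = pair.substring(eq + 1);
      if (name.equalsIgnoreCase("doAs") && !value.isEmpty()) {
        return Optional.of(value);
      }
    }
    return Optional.empty();
  }

  public static void main(String[] args) {
    // "bob" sends the request; the UI would treat it as executed by "alice".
    System.out.println(doAsUser("filter=general&DOAS=alice").orElse("(real user)"));
    System.out.println(doAsUser("filter=general").orElse("(real user)"));
  }
}
```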
3578
3579 ---
3580
3581 * [HBASE-24143](https://issues.apache.org/jira/browse/HBASE-24143) | *Major* | **[JDK11] Switch default garbage collector from CMS**
3582
3583 <!-- markdown -->
3584 `bin/hbase` will now dynamically select a Garbage Collector implementation based on the detected JVM version. JDKs 8,9,10 use `-XX:+UseConcMarkSweepGC`, while JDK11+ use `-XX:+UseG1GC`.
3585
3586 Notice a slight compatibility change. Previously, the garbage collector choice would always be appended to a user-provided value for `HBASE_OPTS`. As of this change, this setting will only be applied when `HBASE_OPTS` is unset. That means that operators who provide a value for this variable will now need to also specify the collector. This is especially important for those on JDK8, where the vm default GC is not the recommended ConcMarkSweep.
3587
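The selection rule and the compatibility change can be summarized in a short Java sketch. `bin/hbase` is a shell script, so this is only a model of the behaviour described above: the class `GcSelector`, its methods, and the use of `null` to stand for an unset `HBASE_OPTS` are assumptions for illustration.

```java
/** Hypothetical model of the collector selection described above; not bin/hbase itself. */
final class GcSelector {
  /** JDK 8, 9 and 10 get CMS; JDK 11 and later get G1. Versions below 8 are out of scope here. */
  static String defaultGcFlag(int jdkMajorVersion) {
    return jdkMajorVersion >= 11 ? "-XX:+UseG1GC" : "-XX:+UseConcMarkSweepGC";
  }

  /**
   * The collector flag is applied only when HBASE_OPTS is unset (modelled as null).
   * A user-provided value is used as-is, with no collector flag appended.
   */
  static String effectiveOpts(String hbaseOpts, int jdkMajorVersion) {
    return hbaseOpts == null ? defaultGcFlag(jdkMajorVersion) : hbaseOpts;
  }

  public static void main(String[] args) {
    System.out.println(effectiveOpts(null, 8));
    System.out.println(effectiveOpts(null, 11));
    // The operator set HBASE_OPTS, so the collector must now be named explicitly.
    System.out.println(effectiveOpts("-Xmx8g -XX:+UseConcMarkSweepGC", 8));
  }
}
```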
3588
3589 ---
3590
3591 * [HBASE-24024](https://issues.apache.org/jira/browse/HBASE-24024) | *Major* | **Optionally reject multi() requests with very high no of rows**
3592
3593 New Config: hbase.rpc.rows.size.threshold.reject
3594 -----------------------------------------------------------------------
3595
3596 Default value: false
3597 Description:
3598 If value is true, RegionServer will abort batch requests of Put/Delete with number of rows in a batch operation exceeding threshold defined by value of config: hbase.rpc.rows.warning.threshold.
3599
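The interaction between the two configs can be sketched as a small decision function. This is an illustration only: the class `BatchSizeCheck`, the `Outcome` enum, and the threshold value used in `main` are assumptions, and the `WARN` outcome is inferred from the name of `hbase.rpc.rows.warning.threshold` rather than stated in this note.

```java
/** Hypothetical decision function for oversized batch requests; not RegionServer code. */
final class BatchSizeCheck {
  enum Outcome { ACCEPT, WARN, REJECT }

  /**
   * @param rowsInBatch      number of rows in the Put/Delete batch
   * @param warningThreshold value of hbase.rpc.rows.warning.threshold
   * @param rejectEnabled    value of hbase.rpc.rows.size.threshold.reject
   */
  static Outcome check(int rowsInBatch, int warningThreshold, boolean rejectEnabled) {
    if (rowsInBatch <= warningThreshold) {
      return Outcome.ACCEPT;
    }
    // Above the threshold: reject only when the new config is switched on.
    return rejectEnabled ? Outcome.REJECT : Outcome.WARN;
  }

  public static void main(String[] args) {
    int threshold = 5000; // illustrative value, chosen for this example
    System.out.println(check(100, threshold, true));
    System.out.println(check(6000, threshold, false));
    System.out.println(check(6000, threshold, true));
  }
}
```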
3600
3601 ---
3602
3603 * [HBASE-24139](https://issues.apache.org/jira/browse/HBASE-24139) | *Critical* | **Balancer should avoid leaving idle region servers**
3604
3605 StochasticLoadBalancer functional improvement:
3606
StochasticLoadBalancer now rebalances the cluster if there are any idle RegionServers (RegionServers hosting no regions) while other RegionServers host at least 1 region.
3608
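A minimal sketch of the trigger condition, following the wording of this note literally. The class `IdleServerCheck` and its method are hypothetical, and the real balancer may apply additional conditions that are not described here.

```java
/** Hypothetical check for idle RegionServers; not StochasticLoadBalancer code. */
final class IdleServerCheck {
  /**
   * @param regionCounts number of regions hosted by each RegionServer
   * @return true if at least one server hosts no regions while another hosts one or more
   */
  static boolean hasIdleServerWhileOthersLoaded(int[] regionCounts) {
    boolean anyIdle = false;
    boolean anyLoaded = false;
    for (int count : regionCounts) {
      if (count == 0) {
        anyIdle = true;
      } else {
        anyLoaded = true;
      }
    }
    return anyIdle && anyLoaded;
  }

  public static void main(String[] args) {
    System.out.println(hasIdleServerWhileOthersLoaded(new int[] {0, 3, 2})); // one idle server
    System.out.println(hasIdleServerWhileOthersLoaded(new int[] {1, 3, 2})); // every server busy
  }
}
```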
3609
3610 ---
3611
3612 * [HBASE-24196](https://issues.apache.org/jira/browse/HBASE-24196) | *Major* | **[Shell] Add rename rsgroup command in hbase shell**
3613
3614 user or admin can now use
3615 hbase shell \> rename\_rsgroup 'oldname', 'newname'
3616 to rename rsgroup.
3617
3618
3619 ---
3620
3621 * [HBASE-24218](https://issues.apache.org/jira/browse/HBASE-24218) | *Major* | **Add hadoop 3.2.x in hadoop check**
3622
Add hadoop-3.2.0 and hadoop-3.2.1 to the hadoop check; when '--quick-hadoopcheck' is passed, we only check hadoop-3.2.1.

Note that, to align the personality scripts across all the active branches, we will commit the patch to all active branches, but the hadoop-3.2.x support in hadoopcheck is only applied to branch-2.2+.
3626
3627
3628 ---
3629
3630 * [HBASE-23829](https://issues.apache.org/jira/browse/HBASE-23829) | *Major* | **Get \`-PrunSmallTests\` passing on JDK11**
3631
\`-PrunSmallTests\` now passes on JDK11 when using \`-Phadoop.profile=3.0\`.
3633
3634
3635 ---
3636
3637 * [HBASE-24185](https://issues.apache.org/jira/browse/HBASE-24185) | *Major* | **Junit tests do not behave well with System.exit or Runtime.halt or JVM exits in general.**
3638
3639 Tests that fail because a process -- RegionServer or Master -- called System.exit, will now instead throw an exception.
3640
3641
3642 ---
3643
3644 * [HBASE-24072](https://issues.apache.org/jira/browse/HBASE-24072) | *Major* | **Nightlies reporting OutOfMemoryError: unable to create new native thread**
3645
3646 Hadoop hosts have had their ulimit -u raised from 10000 to 30000 (per user, by INFRA). The Docker build container has had its limit raised from 10000 to 12500.
3647
3648
3649 ---
3650
3651 * [HBASE-24112](https://issues.apache.org/jira/browse/HBASE-24112) | *Major* | **[RSGroup] Support renaming rsgroup**
3652
3653 Support RSGroup renaming in core codebase. New API Admin#renameRSGroup(String, String) is introduced in 3.0.0.
3654
3655
3656 ---
3657
3658 * [HBASE-23994](https://issues.apache.org/jira/browse/HBASE-23994) | *Trivial* | ** Add WebUI to Canary**
3659
3660 <!-- markdown -->
3661 The Canary tool now offers a WebUI when run in `region` mode (the default mode). It is enabled by default, and by default, it binds to `0.0.0.0:16050`. This can be overridden by setting `hbase.canary.info.bindAddress` and `hbase.canary.info.port`. To disable entirely, set the port to `-1`.
3662
3663
3664 ---
3665
3666 * [HBASE-23779](https://issues.apache.org/jira/browse/HBASE-23779) | *Major* | **Up the default fork count to make builds complete faster; make count relative to CPU count**
3667
Pass --threads=2 when building on jenkins. It shortens nightly build times by about 25%.

It works by running module build/test in parallel when dependencies allow. Upping the forkcount beyond the pom default of 0.25C would have us exceed our CPU budget on jenkins when two modules are running in parallel (2 modules at 0.25C each makes 0.5C, and on jenkins, hadoop nodes run two jenkins executors per host). Higher forkcounts also seem to threaten build stability.
3671
3672 For running tests locally, to go faster, up fork count.
3673
3674 $ x="0.5C"  ;  mvn --threads=2  -Dsurefire.firstPartForkCount=$x -Dsurefire.secondPartForkCount=$x test -PrunAllTests
3675
3676 You could up the x from 0.5C to 1.0C but YMMV (On overcommitted hardware, tests start bombing out pretty soon after startup). You could try upping thread count but on occasion are likely to overcommit hardware.
3677
3678
3679 ---
3680
3681 * [HBASE-24126](https://issues.apache.org/jira/browse/HBASE-24126) | *Major* | **Up the container nproc uplimit from 10000 to 12500**
3682
3683 Start docker with upped ulimit for nproc passing '--ulimit nproc=12500'. It was 10000, the default, but made it 12500. Then, set PROC\_LIMIT in hbase-personality so when yetus runs, it is w/ the new 12500 value.
3684
3685
3686 ---
3687
3688 * [HBASE-24150](https://issues.apache.org/jira/browse/HBASE-24150) | *Major* | **Allow module tests run in parallel**
3689
Pass -T2 to mvn. Makes it so we build two modules at a time, dependencies permitting. Helps speed up build and testing, but doubles the resource usage when running modules in parallel.
3691
3692
3693 ---
3694
3695 * [HBASE-24121](https://issues.apache.org/jira/browse/HBASE-24121) | *Major* | **[Authorization] ServiceAuthorizationManager isn't dynamically updatable. And it should be.**
3696
Master and RegionServer now support refreshing the authorization policy defined in hbase-policy.xml without restarting the service. To refresh the policy, execute the hbase shell command update\_config or update\_config\_all after the policy file has been updated and synced on all nodes.
3698
3699
3700 ---
3701
3702 * [HBASE-24099](https://issues.apache.org/jira/browse/HBASE-24099) | *Major* | **Use a fair ReentrantReadWriteLock for the region close lock**
3703
3704 This change modifies the default acquisition policy for the region's close lock in order to prevent observed starvation of close requests. The new boolean configuration parameter 'hbase.regionserver.fair.region.close.lock' controls the lock acquisition policy: if true, the lock is created in fair mode (default); if false, the lock is created in nonfair mode (the old default).
3705
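The two acquisition policies map onto the `fair` constructor argument of `java.util.concurrent.locks.ReentrantReadWriteLock`. The sketch below shows only that mapping; the class `RegionCloseLockSketch` and its factory method are hypothetical and are not the region's actual code.

```java
import java.util.concurrent.locks.ReentrantReadWriteLock;

/** Hypothetical illustration of fair vs. nonfair lock creation. */
final class RegionCloseLockSketch {
  /**
   * @param fair value of 'hbase.regionserver.fair.region.close.lock'
   *             (true is the new default, false the old behaviour)
   */
  static ReentrantReadWriteLock newCloseLock(boolean fair) {
    // In fair mode, a new reader blocks if a writer is already waiting, so a
    // close request (writer) cannot be starved by a steady stream of readers.
    return new ReentrantReadWriteLock(fair);
  }

  public static void main(String[] args) {
    ReentrantReadWriteLock lock = newCloseLock(true);
    lock.readLock().lock(); // regular operations take the read lock
    try {
      System.out.println("fair=" + lock.isFair());
    } finally {
      lock.readLock().unlock();
    }
  }
}
```

Fair mode trades some throughput for predictable ordering, which is presumably why the nonfair policy remains available through the configuration parameter.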
3706
3707 ---
3708
3709 * [HBASE-23153](https://issues.apache.org/jira/browse/HBASE-23153) | *Major* | **PrimaryRegionCountSkewCostFunction SLB function should implement CostFunction#isNeeded**
3710
3711 <!-- markdown -->
3712 The `PrimaryRegionCountSkewCostFunction` for the `StochasticLoadBalancer` is only needed when the read replicas feature is enabled. With this change, that function now properly indicates that it is not needed when the read replica feature is off.
3713
3714 If this improvement is not available, operators with clusters that are not using the read replica feature should manually disable it by setting `hbase.master.balancer.stochastic.primaryRegionCountCost` to `0.0` in hbase-site.xml for all HBase Masters.
3715
3716
3717 ---
3718
3719 * [HBASE-24055](https://issues.apache.org/jira/browse/HBASE-24055) | *Major* | **Make AsyncFSWAL can run on EC cluster**
3720
AsyncFSWAL can now also be used against a directory that has EC (erasure coding) enabled. Make sure you also use the hadoop 3.x client, as the option is only available in hadoop 3.x.
3722
3723
3724 ---
3725
3726 * [HBASE-24113](https://issues.apache.org/jira/browse/HBASE-24113) | *Major* | **Upgrade the maven we use from 3.5.4 to 3.6.3 in nightlies**
3727
Branches-2.3+ use maven 3.6.3 for building. Older branches still use 3.5.4.
3729
3730
3731 ---
3732
3733 * [HBASE-24122](https://issues.apache.org/jira/browse/HBASE-24122) | *Major* | **Change machine ulimit-l to ulimit-a so dumps full ulimit rather than just 'max locked memory'**
3734
Our 'Build Artifacts' have a machine directory under which we emit vitals on the host the build was run on. We used to emit the result of 'ulimit -l' as a file named 'ulimit-l'. This has been changed to instead emit the result of running 'ulimit -a', which includes the value for 'ulimit -l'.
3736
3737
3738 ---
3739
3740 * [HBASE-23678](https://issues.apache.org/jira/browse/HBASE-23678) | *Major* | **Literate builder API for version management in schema**
3741
3742 ColumnFamilyDescriptor new builder API:
3743
3744     /\*\*
3745      \* Retain all versions for a given TTL(retentionInterval), and then only a specific number
3746      \* of versions(versionAfterInterval) after that interval elapses.
3747      \*
3748      \* @param retentionInterval Retain all versions for this interval
3749      \* @param versionAfterInterval Retain no of versions to retain after retentionInterval
3750      \*/
3751     public ModifyableColumnFamilyDescriptor setVersionsWithTimeToLive(
3752         final int retentionInterval, final int versionAfterInterval)
3753
3754
3755 ---
3756
3757 * [HBASE-24050](https://issues.apache.org/jira/browse/HBASE-24050) | *Major* | **Deprecated PBType on all 2.x branches**
3758
org.apache.hadoop.hbase.types.PBType is marked as deprecated without any replacement. It will be moved to the hbase-example module and marked as IA.Private in 3.0.0. Exposing it was a mistake, as it should not be part of our public API. Users who depend on this class should copy the code into their own code base.
3760
3761
3762 ---
3763
3764 * [HBASE-8868](https://issues.apache.org/jira/browse/HBASE-8868) | *Minor* | **add metric to report client shortcircuit reads**
3765
3766 Expose file system level read metrics for RegionServer.
3767
3768 If the HBase RS runs on top of HDFS, calculate the aggregation of
3769 ReadStatistics of each HdfsFileInputStream. These metrics include:
3770 (1) total number of bytes read from HDFS.
3771 (2) total number of bytes read from local DataNode.
3772 (3) total number of bytes read locally through short-circuit read.
3773 (4) total number of bytes read locally through zero-copy read.
3774
3775 Because HDFS ReadStatistics is calculated per input stream, it is not
3776 feasible to update the aggregated number in real time. Instead, the
3777 metrics are updated when an input stream is closed.
3778
3779
3780 ---
3781
3782 * [HBASE-24032](https://issues.apache.org/jira/browse/HBASE-24032) | *Major* | **[RSGroup] Assign created tables to respective rsgroup automatically instead of manual operations**
3783
An admin can determine which tables go to which rsgroup using a script on the Master side (set hbase.rsgroup.table.mapping.script to a local filesystem path), which aims to lighten the burden of admin operations. Note that since HBase 3+, the rsgroup can also be specified in the TableDescriptor; if clients specify it, the master skips the script-based determination.
3785
3786 Here is a simple example of script:
3787 {code}
#!/bin/bash
# Input consists of two strings: the 1st is the namespace of the table, the 2nd is the table name
3790 namespace=$1
3791 tablename=$2
3792 if [[ $namespace == test ]]; then
3793   echo test
3794 elif [[ $tablename == \*foo\* ]]; then
3795   echo other
3796 else
3797   echo default
3798 fi
3799 {code}
3800
3801
3802 ---
3803
3804 * [HBASE-23993](https://issues.apache.org/jira/browse/HBASE-23993) | *Major* | **Use loopback for zk standalone server in minizkcluster**
3805
3806 MiniZKCluster now puts up its standalone node listening on loopback/127.0.0.1 rather than "localhost".
3807
3808
3809 ---
3810
3811 * [HBASE-23986](https://issues.apache.org/jira/browse/HBASE-23986) | *Major* | **Bump hadoop-two.version to 2.10.0 on master and branch-2**
3812
3813 Bumped hadoop-two.version to 2.10.0, which means we will drop the support for hadoop-2.8.x and hadoop-2.9.x.
3814
3815
3816 ---
3817
3818 * [HBASE-23930](https://issues.apache.org/jira/browse/HBASE-23930) | *Minor* | **Shell should attempt to format \`timestamp\` attributes as ISO-8601**
3819
Changes the timestamp display to ISO-8601 when calling toString on a Cell and when outputting in the shell.

Users used to see:
3823
3824   column=table:state, timestamp=1583967620343 .....
3825
3826 ... but now sees:
3827
3828   column=table:state, timestamp=2020-03-11T23:00:20.343Z ....
3829
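The conversion between the two forms shown above can be reproduced with `java.time`. This is a minimal sketch of the conversion itself, not the code the shell uses; the class `TimestampFormat` and its method are hypothetical names.

```java
import java.time.Instant;

/** Hypothetical illustration of epoch-millis to ISO-8601 conversion. */
final class TimestampFormat {
  /** Formats epoch milliseconds as an ISO-8601 instant in UTC. */
  static String toIso8601(long epochMillis) {
    // Instant#toString omits the fractional part when the millisecond value is zero.
    return Instant.ofEpochMilli(epochMillis).toString();
  }

  public static void main(String[] args) {
    // The old-style value from the example above.
    System.out.println(toIso8601(1583967620343L)); // prints 2020-03-11T23:00:20.343Z
  }
}
```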
3830
3831 ---
3832
3833 * [HBASE-22827](https://issues.apache.org/jira/browse/HBASE-22827) | *Major* | **Expose multi-region merge in shell and Admin API**
3834
The merge\_region shell command can now be used to merge more than 2 regions as well. It takes a list of regions as comma-separated values or as an array of regions, and not just 2 regions. Both full region names and encoded region names continue to be accepted.
3836
3837
3838 ---
3839
3840 * [HBASE-23767](https://issues.apache.org/jira/browse/HBASE-23767) | *Major* | **Add JDK11 compilation and unit test support to Github precommit**
3841
3842 Rebuild our Dockerfile with support for multiple JDK versions. Use multiple stages in the Jenkinsfile instead of yetus's multijdk because of YETUS-953. Run those multiple stages in parallel to speed up results.
3843
3844 Note that multiple stages means multiple Yetus invocations means multiple comments on the PreCommit. This should become more obvious to users once we can make use of GitHub Checks API, HBASE-23902.
3845
3846
3847 ---
3848
3849 * [HBASE-22978](https://issues.apache.org/jira/browse/HBASE-22978) | *Minor* | **Online slow response log**
3850
3851 get\_slowlog\_responses and clear\_slowlog\_responses are used to retrieve and clear slow RPC logs from RingBuffer maintained by RegionServers.
3852
3853 New Admin APIs:
3854 1.   List\<SlowLogRecord\> getSlowLogResponses(final Set\<ServerName\> serverNames,
3855       final SlowLogQueryFilter slowLogQueryFilter) throws IOException;
3856
3857 2.   List\<Boolean\> clearSlowLogResponses(final Set\<ServerName\> serverNames)
3858       throws IOException;
3859
3860 Configs:
3861
3862 1. hbase.regionserver.slowlog.ringbuffer.size:
3863 Default size of ringbuffer to be maintained by each RegionServer in order to store online slowlog responses. This is an in-memory ring buffer of requests that were judged to be too slow in addition to the responseTooSlow logging. The in-memory representation would be complete. For more details, please look into Doc Section: Get Slow Response Log from shell
3864
3865 Default
3866 256
3867
3868 2. hbase.regionserver.slowlog.buffer.enabled:
Indicates whether RegionServers have a ring buffer running for storing online slow logs in FIFO manner with limited entries. The size of the ring buffer is indicated by config: hbase.regionserver.slowlog.ringbuffer.size. The default value is false; turn this on to get the latest slowlog responses with complete data.
3870
3871 Default
3872 false
3873
3874
3875 For more details, please look into "Get Slow Response Log from shell" section from HBase book.
3876
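The "FIFO manner with limited entries" behaviour can be illustrated with a small bounded buffer. This is a sketch of the general technique only, under the assumption that the oldest entry is evicted once the buffer is full; the class `BoundedFifo` is hypothetical and is not the RegionServer's ring buffer implementation.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.List;

/** Hypothetical bounded FIFO buffer: once full, the oldest entry is dropped. */
final class BoundedFifo<T> {
  private final int capacity;
  private final ArrayDeque<T> entries = new ArrayDeque<>();

  BoundedFifo(int capacity) {
    if (capacity <= 0) {
      throw new IllegalArgumentException("capacity must be positive");
    }
    this.capacity = capacity;
  }

  synchronized void add(T entry) {
    if (entries.size() == capacity) {
      entries.removeFirst(); // evict the oldest record to make room
    }
    entries.addLast(entry);
  }

  /** Returns a copy of the current contents, oldest first. */
  synchronized List<T> snapshot() {
    return new ArrayList<>(entries);
  }

  synchronized void clear() {
    entries.clear();
  }

  public static void main(String[] args) {
    // 256 is the default of hbase.regionserver.slowlog.ringbuffer.size; 3 keeps the demo short.
    BoundedFifo<String> buffer = new BoundedFifo<>(3);
    for (int i = 1; i <= 5; i++) {
      buffer.add("slow-rpc-" + i);
    }
    System.out.println(buffer.snapshot()); // only the three most recent entries remain
  }
}
```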
3877
3878 ---
3879
3880 * [HBASE-23926](https://issues.apache.org/jira/browse/HBASE-23926) | *Major* | **[Flakey Tests] Down the flakies re-run ferocity; it makes for too many fails.**
3881
Down the flakey re-run fork count from 1.0C -- i.e. a fork per CPU -- to 0.25C. On a recent run, the machine had 16 cores; 0.25C is then 4 forks. We'd hardcoded the fork count at 3 previous to changes made by the parent.
3883
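The arithmetic behind a value such as 0.25C can be sketched as follows. The class `ForkCount` is hypothetical, and truncating the product and enforcing a minimum of one fork are assumptions for this example rather than a statement of how the build plugin rounds.

```java
/** Hypothetical helper that turns a fork count setting into a number of forks. */
final class ForkCount {
  /**
   * @param forkCount either a plain integer ("3") or a per-core multiplier ("0.25C")
   * @param cores     number of CPU cores on the build host
   */
  static int forks(String forkCount, int cores) {
    String s = forkCount.trim();
    if (s.endsWith("C") || s.endsWith("c")) {
      double multiplier = Double.parseDouble(s.substring(0, s.length() - 1));
      // Truncation and the floor of 1 are assumptions made for this sketch.
      return Math.max(1, (int) (multiplier * cores));
    }
    return Integer.parseInt(s);
  }

  public static void main(String[] args) {
    System.out.println(forks("1.0C", 16));  // a fork per CPU: 16
    System.out.println(forks("0.25C", 16)); // a quarter of the CPUs: 4
    System.out.println(forks("3", 16));     // the old hardcoded value: 3
  }
}
```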
3884
3885 ---
3886
3887 * [HBASE-23146](https://issues.apache.org/jira/browse/HBASE-23146) | *Major* | **Support CheckAndMutate with multiple conditions**
3888
3889 Add a checkAndMutate(row, filter) method in the AsyncTable interface and the Table interface.
3890
3891 This method atomically checks if the row matches the specified filter. If it does, it adds the Put/Delete/RowMutations.
3892
3893 This is a fluent style API, the code is like:
3894
3895 For Table interface:
3896 {code}
3897 table.checkAndMutate(row, filter).thenPut(put);
3898 {code}
3899
3900 For AsyncTable interface:
3901 {code}
3902 table.checkAndMutate(row, filter).thenPut(put)
3903     .thenAccept(succ -\> {
3904       if (succ) {
3905         System.out.println("Check and put succeeded");
3906       } else {
3907         System.out.println("Check and put failed");
3908       }
3909     });
3910 {code}
3911
3912
3913 ---
3914
3915 * [HBASE-23874](https://issues.apache.org/jira/browse/HBASE-23874) | *Minor* | **Move Jira-attached file precommit definition from script in Jenkins config to dev-support**
3916
3917 The Jira Precommit job (https://builds.apache.org/job/PreCommit-HBASE-Build/) will now look for a file within the source tree (dev-support/jenkins\_precommit\_jira\_yetus.sh) instead of depending on a script section embedded in the job.
3918
3919
3920 ---
3921
3922 * [HBASE-23865](https://issues.apache.org/jira/browse/HBASE-23865) | *Major* | **Up flakey history from 5 to 10**
3923
Changed flakey list reporting to show 10 rather than 5 items. Also changed the second and first part fork counts to be 1C rather than the hardcoded 3.
3925
3926
3927 ---
3928
3929 * [HBASE-23554](https://issues.apache.org/jira/browse/HBASE-23554) | *Major* | **Encoded regionname to regionname utility**
3930
3931     Adds shell command regioninfo:
3932
3933       hbase(main):001:0\>  regioninfo '0e6aa5c19ae2b2627649dc7708ce27d0'
3934       {ENCODED =\> 0e6aa5c19ae2b2627649dc7708ce27d0, NAME =\> 'TestTable,,1575941375972.0e6aa5c19ae2b2627649dc7708ce27d0.', STARTKEY =\> '', ENDKEY =\> '00000000000000000000299441'}
3935       Took 0.4737 seconds
3936
3937
3938 ---
3939
3940 * [HBASE-23350](https://issues.apache.org/jira/browse/HBASE-23350) | *Major* | **Make compaction files cacheonWrite configurable based on threshold**
3941
3942 This JIRA adds a new configuration - \`hbase.rs.cachecompactedblocksonwrite.threshold\`. This configuration is the maximum total size (in bytes) of the compacted files below which the configuration \`hbase.rs.cachecompactedblocksonwrite\` is honoured. If the total size of the compacted files exceeds this threshold, even when \`hbase.rs.cachecompactedblocksonwrite\` is enabled, the data blocks are not cached. Caching index and bloom blocks is not affected by this configuration (user configuration is always honoured).
3943
3944 Default value of this configuration is Long.MAX\_VALUE. This means that, whatever the total size of the compacted files, the data blocks will be cached.
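
The threshold rule above can be modelled as a small predicate. This is only an illustrative sketch, not the HBase implementation: the class and method names are hypothetical, and treating a total size exactly equal to the threshold as "still cached" is an assumption the note does not settle.

```java
// Illustrative model only; class and method names are hypothetical.
final class CompactionCacheOnWriteSketch {
  /** Default described in the note: Long.MAX_VALUE, so no size ever exceeds it. */
  static final long DEFAULT_THRESHOLD = Long.MAX_VALUE;

  /**
   * Returns true when data blocks written by a compaction would be cached.
   * cacheCompactedBlocksOnWrite models hbase.rs.cachecompactedblocksonwrite;
   * thresholdBytes models hbase.rs.cachecompactedblocksonwrite.threshold.
   */
  static boolean shouldCacheDataBlocks(boolean cacheCompactedBlocksOnWrite,
      long totalCompactedBytes, long thresholdBytes) {
    // Assumption: a size equal to the threshold does not "exceed" it.
    return cacheCompactedBlocksOnWrite && totalCompactedBytes <= thresholdBytes;
  }

  public static void main(String[] args) {
    long oneGiB = 1024L * 1024L * 1024L;
    // Below the threshold: the setting is honoured.
    System.out.println(shouldCacheDataBlocks(true, oneGiB / 2, oneGiB));
    // Above the threshold: data blocks are not cached even though caching is enabled.
    System.out.println(shouldCacheDataBlocks(true, 2 * oneGiB, oneGiB));
    // With the default threshold, any total size is cached.
    System.out.println(shouldCacheDataBlocks(true, 2 * oneGiB, DEFAULT_THRESHOLD));
  }
}
```

Index and bloom blocks are deliberately left out of the sketch, since the note says the threshold does not affect them.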
3945
3946
3947 ---
3948
3949 * [HBASE-17115](https://issues.apache.org/jira/browse/HBASE-17115) | *Major* | **HMaster/HRegion Info Server does not honour admin.acl**
3950
3951 Implements authorization for the HBase Web UI by limiting access to certain endpoints which could be used to extract sensitive information from HBase.
3952
3953 Access to these restricted endpoints can be limited to a group of administrators, identified either by a list of users (hbase.security.authentication.spnego.admin.users) or by a list of groups
3954 (hbase.security.authentication.spnego.admin.groups).  By default, neither of these values is set, which preserves backwards compatibility (allowing all authenticated users to access all endpoints).
3955
3956 Further, users who have sensitive information in the HBase service configuration can set hbase.security.authentication.ui.config.protected to true which will treat the configuration endpoint as a protected, admin-only resource. By default, all authenticated users may access the configuration endpoint.
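
The access rules described above can be summarised in a short model. This is a hedged sketch, not HBase's servlet filter code: the class, method, and parameter names are hypothetical, and empty sets stand in for "property not set".

```java
import java.util.Set;

// Illustrative model only; names are hypothetical, not HBase classes.
final class AdminAclSketch {
  /**
   * adminUsers models hbase.security.authentication.spnego.admin.users and
   * adminGroups models hbase.security.authentication.spnego.admin.groups.
   */
  static boolean mayAccessRestrictedEndpoint(String user, Set<String> userGroups,
      Set<String> adminUsers, Set<String> adminGroups) {
    // Neither list set: backwards-compatible behaviour, every authenticated user passes.
    if (adminUsers.isEmpty() && adminGroups.isEmpty()) {
      return true;
    }
    if (adminUsers.contains(user)) {
      return true;
    }
    // Otherwise the user needs membership in at least one admin group.
    for (String group : userGroups) {
      if (adminGroups.contains(group)) {
        return true;
      }
    }
    return false;
  }

  /** configProtected models hbase.security.authentication.ui.config.protected. */
  static boolean mayAccessConfigEndpoint(boolean configProtected, boolean isAdmin) {
    // Unprotected (the default): any authenticated user may read the configuration.
    return !configProtected || isAdmin;
  }
}
```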
3957
3958
3959 ---
3960
3961 * [HBASE-23647](https://issues.apache.org/jira/browse/HBASE-23647) | *Major* | **Make MasterRegistry the default registry impl**
3962
3963 <!-- markdown -->
3964 Enables master based registry as the default registry used by clients to fetch connection metadata.
3965 Refer to the section "Master Registry" in the client documentation for more details and advantages
3966 of this implementation over the default Zookeeper based registry.
3967
3968 Configuration parameter that controls the registry in use: `hbase.client.registry.impl`
3969
3970 Where to set this: HBase client configuration (hbase-site.xml)
3971
3972 Possible values:
3973 - `org.apache.hadoop.hbase.client.ZKConnectionRegistry` (For ZK based registry implementation)
3974 - `org.apache.hadoop.hbase.client.MasterRegistry` (New, for master based registry implementation)
3975
3976 Notes on defaults:
3977
3978 - For v3.0.0 and later, MasterRegistry is the default registry
3979 - For all releases in 2.x line, ZK based registry is the default.
3980
3981 This feature has been backported to 2.3.0 and later releases. MasterRegistry can be enabled by setting the following client configuration.
3982
3983 ```
3984 <property>
3985   <name>hbase.client.registry.impl</name>
3986   <value>org.apache.hadoop.hbase.client.MasterRegistry</value>
3987 </property>
3988 ```
3989
3990
3991 ---
3992
3993 * [HBASE-23069](https://issues.apache.org/jira/browse/HBASE-23069) | *Critical* | **periodic dependency bump for Sep 2019**
3994
3995 caffeine: 2.6.2 =\> 2.8.1
3996 commons-codec: 1.10 =\> 1.13
3997 commons-io: 2.5 =\> 2.6
3998 disruptor: 3.3.6 =\> 3.4.2
3999 httpcore: 4.4.6 =\> 4.4.13
4000 jackson: 2.9.10 =\> 2.10.1
4001 jackson.databind: 2.9.10.1 =\> 2.10.1
4002 jetty: 9.3.27.v20190418 =\> 9.3.28.v20191105
4003 protobuf.plugin: 0.5.0 =\> 0.6.1
4004 zookeeper: 3.4.10 =\> 3.4.14
4005 slf4j: 1.7.25 =\> 1.7.30
4006 rat: 0.12 =\> 0.13
4007 asciidoctor: 1.5.5 =\> 1.5.8
4008 asciidoctor.pdf: 1.5.0-alpha.15 =\> 1.5.0-rc.2
4009 error-prone: 2.3.3 =\> 2.3.4
4010
4011
4012 ---
4013
4014 * [HBASE-23686](https://issues.apache.org/jira/browse/HBASE-23686) | *Major* | **Revert binary incompatible change and remove reflection**
4015
4016 - Reverts a binary incompatible change for ByteRangeUtils
4017 - Usage of reflection inside CommonFSUtils removed
4018
4019
4020 ---
4021
4022 * [HBASE-23347](https://issues.apache.org/jira/browse/HBASE-23347) | *Major* | **Pluggable RPC authentication**
4023
4024 This change introduces an internal abstraction layer which allows for new SASL-based authentication mechanisms to be used inside HBase services. All existing SASL-based authentication mechanisms were ported to the new abstraction, making no external change in runtime semantics, client API, or RPC serialization format.
4025
4026 Developers familiar with extending HBase can implement authentication mechanisms beyond simple Kerberos and DelegationTokens which authenticate HBase users against some other user database. HBase service authentication (Master to/from RegionServer) continues to operate solely over Kerberos.
4027
4028
4029 ---
4030
4031 * [HBASE-23156](https://issues.apache.org/jira/browse/HBASE-23156) | *Major* | **start-hbase.sh failed with ClassNotFoundException when build with hadoop3**
4032
4033 Introduces a new hbase-assembly/src/main/assembly/hadoop-three-compat.xml for builds with Hadoop 3.x.
4034
4035
4036 ---
4037
4038 * [HBASE-23680](https://issues.apache.org/jira/browse/HBASE-23680) | *Major* | **RegionProcedureStore missing cleaning of hfile archive**
4039
4040 Add a new config to hbase-default.xml
4041
4042   \<property\>
4043     \<name\>hbase.procedure.store.region.hfilecleaner.plugins\</name\>
4044     \<value\>org.apache.hadoop.hbase.master.cleaner.TimeToLiveHFileCleaner\</value\>
4045     \<description\>A comma-separated list of BaseHFileCleanerDelegate invoked by
4046     the RegionProcedureStore HFileCleaner service. These HFiles cleaners are
4047     called in order, so put the cleaner that prunes the most files in front. To
4048     implement your own BaseHFileCleanerDelegate, just put it in HBase's classpath
4049     and add the fully qualified class name here. Always add the above
4050     default hfile cleaners in the list as they will be overwritten in
4051     hbase-site.xml.\</description\>
4052   \</property\>
4053
4054 It shares the same TTL as the other HFileCleaners. You can also implement your own cleaner and change this property to enable it.
4055
4056
4057 ---
4058
4059 * [HBASE-23675](https://issues.apache.org/jira/browse/HBASE-23675) | *Minor* | **Move to Apache parent POM version 22**
4060
4061 Updated parent pom to Apache version 22.
4062
4063
4064 ---
4065
4066 * [HBASE-23679](https://issues.apache.org/jira/browse/HBASE-23679) | *Critical* | **FileSystem instance leaks due to bulk loads with Kerberos enabled**
4067
4068 This issue fixes a problem with bulk loading on installations with Kerberos enabled and more than a single RegionServer. When multiple RegionServers are involved in hosting a table's regions which are being bulk-loaded into, all but the RegionServer hosting the table's first Region will "leak" one DistributedFileSystem object onto the heap, never freeing that memory. Eventually, with enough bulk loads, RegionServers will run out of free heap space and will either spend all their time in JVM GC, lose their ZK session, or crash with an OutOfMemoryError.
4069
4070 The only mitigation for this issue is to periodically restart RegionServers. All earlier versions of HBase 2.x are subject to this issue (2.0.x, \<=2.1.8, \<=2.2.3)
4071
4072
4073 ---
4074
4075 * [HBASE-23286](https://issues.apache.org/jira/browse/HBASE-23286) | *Major* | **Improve MTTR: Split WAL to HFile**
4076
4077 Adds a new feature to improve MTTR. Failover now has 3 steps:
4078 1. Read WAL and write HFile to region’s column family’s recovered.hfiles directory.
4079 2. Open region.
4080 3. Bulkload the recovered.hfiles for every column family.
4081
4082 Compared to DLS (distributed log splitting), this feature will reduce region open time significantly.
4083
4084 Set hbase.wal.split.to.hfile to true to enable this feature.
4085
4086
4087 ---
4088
4089 * [HBASE-23619](https://issues.apache.org/jira/browse/HBASE-23619) | *Trivial* | **Use built-in formatting for logging in hbase-zookeeper**
4090
4091 Changed the logging in hbase-zookeeper to use built-in formatting
4092
4093
4094 ---
4095
4096 * [HBASE-23628](https://issues.apache.org/jira/browse/HBASE-23628) | *Minor* | **Replace Apache Commons Digest Base64 with JDK8 Base64**
4097
4098 From the PR:
4099
4100 "Yes. The two create the same output... I just wrote a small test suite to increase my confidence on that. I generated many tens of millions of random byte patterns and compared the output of the two algorithms. They came back identical every time.
4101
4102 "Just in case any inquiring minds would like to know, there is no longer an encoding required when generating the strings. The JDK implementation specifically specifies that strings returned are StandardCharsets.ISO\_8859\_1. This does not change anything because UTF8 and ISO\_8859 overlap for the limited character set (64 characters) the encoding uses."
4103
4104
4105 ---
4106
4107 * [HBASE-23651](https://issues.apache.org/jira/browse/HBASE-23651) | *Major* | **Region balance throttling can be disabled**
4108
4109 Setting hbase.balancer.max.balancing to an int value \<=0 disables region balance throttling.
4110
4111
4112 ---
4113
4114 * [HBASE-23588](https://issues.apache.org/jira/browse/HBASE-23588) | *Major* | **Cache index blocks and bloom blocks on write if CacheCompactedBlocksOnWrite is enabled**
4115
4116 If cacheOnWrite is enabled during flush or compaction, index and bloom blocks (along with data blocks) are automatically cached during the write.
4117
4118
4119 ---
4120
4121 * [HBASE-23369](https://issues.apache.org/jira/browse/HBASE-23369) | *Major* | **Auto-close 'unknown' Regions reported as OPEN on RegionServers**
4122
4123 If a RegionServer reports a Region as OPEN in disagreement with Master's status on the Region, the Master now tells the RegionServer to silently close the Region.
4124
4125
4126 ---
4127
4128 * [HBASE-23596](https://issues.apache.org/jira/browse/HBASE-23596) | *Major* | **HBCKServerCrashProcedure can double assign**
4129
4130 Makes the recently added HBCKServerCrashProcedure -- the SCP that gets invoked when an operator schedules an SCP via the hbck2 scheduleRecoveries command -- work the same as SCP EXCEPT when the Master knows nothing of the scheduled servername. In this latter case, HBCKSCP does a full scan of hbase:meta looking for instances of the passed servername. If any are found, it attempts cleanup of hbase:meta references by reassigning any Regions found in OPEN or OPENING state and by closing any in CLOSING state.
4131
4132 Used to fix instances of what the 'HBCK Report' page shows as 'Unknown Servers'.
4133
4134
4135 ---
4136
4137 * [HBASE-23624](https://issues.apache.org/jira/browse/HBASE-23624) | *Major* | **Add a tool to dump the procedure info in HFile**
4138
4139 Use ./hbase org.apache.hadoop.hbase.procedure2.store.region.HFileProcedurePrettyPrinter to run the tool.
4140
4141
4142 ---
4143
4144 * [HBASE-23590](https://issues.apache.org/jira/browse/HBASE-23590) | *Major* | **Update maxStoreFileRefCount to maxCompactedStoreFileRefCount**
4145
4146 RegionsRecoveryChore, introduced as part of HBASE-22460, tries to reopen regions based on the config hbase.regions.recovery.store.file.ref.count.
4147 Region reopen needs to take into consideration all compacted-away store files that belong to the region, not the (non-compacted) store files.
4148
4149 Fixed this bug as part of this Jira.
4150 Updated description for corresponding configs:
4151
4152 1. hbase.master.regions.recovery.check.interval :
4153
4154 Regions Recovery Chore interval in milliseconds. This chore keeps running at this interval to find all regions with configurable max store file ref count and reopens them. Defaults to 20 mins
4155
4156 2. hbase.regions.recovery.store.file.ref.count :
4157
4158 A very large ref count on a compacted store file indicates a ref leak on that object (the compacted store file). Such files cannot be removed after they are invalidated via compaction. The only way to recover in such a scenario is to reopen the region, which releases all resources, such as the refcount, leases, etc. This config represents the store file ref count threshold considered for reopening regions. Any region with a compacted store file ref count \> this value is eligible for reopening by the master. Here, we take the max refCount among all refCounts on all compacted-away store files that belong to a particular region. The default value of -1 indicates this feature is turned off. Only a positive integer value should be provided to enable this feature.
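
The eligibility rule just described can be sketched as follows. This is an illustrative model rather than the RegionsRecoveryChore code: the class and method names are hypothetical, and treating 0 the same as the default -1 (feature off) is an assumption based on "only positive integer value".

```java
import java.util.List;

// Illustrative model only; names are hypothetical, not HBase classes.
final class RegionReopenSketch {
  /**
   * compactedStoreFileRefCounts holds the ref counts of one region's
   * compacted-away store files; threshold models
   * hbase.regions.recovery.store.file.ref.count.
   */
  static boolean eligibleForReopen(List<Integer> compactedStoreFileRefCounts, int threshold) {
    // Assumption: any non-positive threshold disables the feature (default is -1).
    if (threshold <= 0) {
      return false;
    }
    // Take the max ref count across the region's compacted-away store files.
    int maxRefCount = 0;
    for (int refCount : compactedStoreFileRefCounts) {
      maxRefCount = Math.max(maxRefCount, refCount);
    }
    // Strictly greater than the threshold, as the description states.
    return maxRefCount > threshold;
  }
}
```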
4159
4160
4161 ---
4162
4163 * [HBASE-23618](https://issues.apache.org/jira/browse/HBASE-23618) | *Major* | **Add a tool to dump procedure info in the WAL file**
4164
4165 Use ./hbase org.apache.hadoop.hbase.procedure2.store.region.WALProcedurePrettyPrinter to run the tool.
4166
4167
4168 ---
4169
4170 * [HBASE-23617](https://issues.apache.org/jira/browse/HBASE-23617) | *Major* | **Add a stress test tool for region based procedure store**
4171
4172 Use ./hbase org.apache.hadoop.hbase.procedure2.store.region.RegionProcedureStorePerformanceEvaluation to run the tool.
4173
4174
4175 ---
4176
4177 * [HBASE-23326](https://issues.apache.org/jira/browse/HBASE-23326) | *Critical* | **Implement a ProcedureStore which stores procedures in a HRegion**
4178
4179 Uses a region-based procedure store to replace the old customized WAL-based procedure store. The procedure data migration is done automatically during upgrade. After upgrading, the MasterProcWALs directory will be deleted and a new MasterProc directory will be created. Note that a region still writes a WAL, so there are still WAL files, and they will be moved to the oldWALs directory. The file name looks like a normal WAL file name; the only difference is that it ends with "$masterproc$".
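
When inspecting the oldWALs directory, the suffix mentioned above is enough to tell procedure-store WALs apart from ordinary ones. A minimal sketch, assuming only the suffix rule from this note; the class name and the sample file names are hypothetical.

```java
// Illustrative helper only; not part of HBase.
final class MasterProcWalNameSketch {
  /** Suffix that the note says marks WAL files written by the procedure store region. */
  static final String SUFFIX = "$masterproc$";

  static boolean isProcedureStoreWal(String fileName) {
    return fileName.endsWith(SUFFIX);
  }

  public static void main(String[] args) {
    // Both file names below are made up for illustration.
    System.out.println(isProcedureStoreWal("example-wal.1575941375972$masterproc$"));
    System.out.println(isProcedureStoreWal("example-wal.1575941375972"));
  }
}
```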
4180
4181
4182 ---
4183
4184 * [HBASE-23320](https://issues.apache.org/jira/browse/HBASE-23320) | *Major* | **Upgrade surefire plugin to 3.0.0-M4**
4185
4186 Bumped surefire plugin to 3.0.0-M4
4187
4188
4189 ---
4190
4191 * [HBASE-20461](https://issues.apache.org/jira/browse/HBASE-20461) | *Major* | **Implement fsync for AsyncFSWAL**
4192
4193 Now AsyncFSWAL also supports Durability.FSYNC\_WAL.
4194
4195
4196 ---
4197
4198 * [HBASE-23066](https://issues.apache.org/jira/browse/HBASE-23066) | *Minor* | **Create a config that forces to cache blocks on compaction**
4199
4200 The configuration 'hbase.rs.cacheblocksonwrite' was used to enable caching blocks on write. But we purposefully did not cache blocks during compaction, since this may be very aggressive: the caching happens as and when the writer completes a block.
4201 Cloud environments tend to have bigger caches, but enabling 'hbase.rs.prefetchblocksonopen' there (a non-aggressive way of proactively caching blocks on reader creation) does not help, because it takes time to cache the compacted blocks.
4202 This feature creates a new configuration, 'hbase.rs.cachecompactedblocksonwrite', which when set to 'true' enables caching of the blocks created by compaction.
4203 Remember that since this is aggressive caching, the user should have enough cache space; if not, it may lead to other active blocks getting evicted.
4204 From the shell, this can also be enabled per Column Family using the format below:
4205 {code}
4206 create 't1', 'f1', {NUMREGIONS =\> 15, SPLITALGO =\> 'HexStringSplit', CONFIGURATION =\> {'hbase.rs.cachecompactedblocksonwrite' =\> 'true'}}
4207 {code}
4208
4209
4210 ---
4211
4212 * [HBASE-23239](https://issues.apache.org/jira/browse/HBASE-23239) | *Major* | **Reporting on status of backing MOB files from client-facing cells**
4213
4214 <!-- markdown -->
4215
4216 Users of the MOB feature can now use the `mobrefs` utility to get statistics about data in the MOB system and verify the health of backing files on HDFS.
4217
4218 ```
4219 HADOOP_CLASSPATH=/etc/hbase/conf:$(hbase mapredcp) yarn jar \
4220     /some/path/to/hbase-shaded-mapreduce.jar mobrefs mobrefs-report-output some_table foo
4221 ```
4222
4223 See javadocs of the class `MobRefReporter` for more details.
4224
4225 The reference guide now includes some information about MOB internals and troubleshooting.
4226
4227
4228 ---
4229
4230 * [HBASE-23549](https://issues.apache.org/jira/browse/HBASE-23549) | *Minor* | **Document steps to disable MOB for a column family**
4231
4232 The reference guide now includes a walkthrough of disabling the MOB feature, if needed, while maintaining availability.
4233
4234
4235 ---
4236
4237 * [HBASE-23582](https://issues.apache.org/jira/browse/HBASE-23582) | *Minor* | **Unbalanced braces in string representation of table descriptor**
4238
4239 Fixed unbalanced braces in string representation within HBase shell
4240
4241
4242 ---
4243
4244 * [HBASE-23293](https://issues.apache.org/jira/browse/HBASE-23293) | *Minor* | **[REPLICATION] make ship edits timeout configurable**
4245
4246 The default RPC timeout for ReplicationSourceShipper#shipEdits is 60s; when bulkload replication is enabled, a timeout exception may occur.
4247 The timeout value can now be configured through replication.source.shipedits.timeout, and it’s adaptive.
4248
4249
4250 ---
4251
4252 * [HBASE-23312](https://issues.apache.org/jira/browse/HBASE-23312) | *Major* | **HBase Thrift SPNEGO configs (HBASE-19852) should be backwards compatible**
4253
4254 The newer HBase Thrift SPNEGO configs should not be required. The hbase.thrift.spnego.keytab.file and hbase.thrift.spnego.principal configs will fall back to the hbase.thrift.keytab.file and hbase.thrift.kerberos.principal original configs. The older configs will log a deprecation warning. It is preferred to use the newer SPNEGO configurations.
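
The fallback behaviour can be pictured as a two-step lookup. This is a hedged sketch over a plain map, not the Thrift server's code: the class and method names are hypothetical, and the wording of the warning is invented for illustration.

```java
import java.util.Map;

// Illustrative model only; names and the warning text are hypothetical.
final class ThriftSpnegoConfigSketch {
  /** Returns the value of newKey, else the value of oldKey, else null. */
  static String resolve(Map<String, String> conf, String newKey, String oldKey) {
    String value = conf.get(newKey);
    if (value != null) {
      return value; // the newer SPNEGO config wins when present
    }
    String legacy = conf.get(oldKey);
    if (legacy != null) {
      // The note says the older configs log a deprecation warning.
      System.err.println("WARN: " + oldKey + " is deprecated; use " + newKey);
    }
    return legacy;
  }

  public static void main(String[] args) {
    Map<String, String> conf = Map.of("hbase.thrift.keytab.file", "/path/to/thrift.keytab");
    System.out.println(resolve(conf,
        "hbase.thrift.spnego.keytab.file", "hbase.thrift.keytab.file"));
  }
}
```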
4255
4256
4257 ---
4258
4259 * [HBASE-22969](https://issues.apache.org/jira/browse/HBASE-22969) | *Minor* | **A new binary component comparator(BinaryComponentComparator) to perform comparison of arbitrary length and position**
4260
4261 With BinaryComponentComparator, applications will be able to design a diverse and powerful set of filters for rows and columns. See https://issues.apache.org/jira/browse/HBASE-22969 for an example. In general, the comparator can be used with any filter taking ByteArrayComparable. As of now, the following filters take ByteArrayComparable:
4262
4263 1. RowFilter
4264 2. ValueFilter
4265 3. QualifierFilter
4266 4. FamilyFilter
4267 5. ColumnValueFilter
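
To give a feel for "comparison of arbitrary length and position", the sketch below checks whether a byte sequence occurs at a given offset inside a row key. It is a self-contained model and does not use the HBase class; the names are hypothetical, it only covers the equality case, and treating a too-short input as a non-match is an assumption.

```java
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

// Illustrative model only; not the HBase BinaryComponentComparator class.
final class ComponentMatchSketch {
  /** True when input[offset .. offset + component.length) equals component. */
  static boolean matchesAt(byte[] input, byte[] component, int offset) {
    // Assumption: an input too short to contain the component never matches.
    if (offset < 0 || offset + component.length > input.length) {
      return false;
    }
    return Arrays.equals(input, offset, offset + component.length,
        component, 0, component.length);
  }

  public static void main(String[] args) {
    byte[] row = "user0042".getBytes(StandardCharsets.UTF_8);
    byte[] id = "0042".getBytes(StandardCharsets.UTF_8);
    System.out.println(matchesAt(row, id, 4)); // bytes 4..7 of the row are "0042"
    System.out.println(matchesAt(row, id, 0)); // bytes 0..3 of the row are "user"
  }
}
```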
4268
4269
4270 ---
4271
4272 * [HBASE-23234](https://issues.apache.org/jira/browse/HBASE-23234) | *Major* | **Provide .editorconfig based on checkstyle configuration**
4273
4274 Adds a .editorconfig file with configurations populated by IntelliJ, based on our checkstyle configuration. There's lots of IntelliJ-specific configs in here that I assume are not replicated to Eclipse or Netbeans users. Any devs using those tools should push whatever updates they see fit, but please start with the checkstyle configs as the origin of truth.
4275
4276
4277 ---
4278
4279 * [HBASE-23322](https://issues.apache.org/jira/browse/HBASE-23322) | *Minor* | **[hbck2] Simplification on HBCKSCP scheduling**
4280
4281 An hbck2 scheduleRecoveries will run a subclass of ServerCrashProcedure which asks Master what Regions were on the dead Server but it will also do a hbase:meta table scan to see if any vestiges of the old Server remain (for the case where an SCP failed mid-point leaving references in place or where Master and hbase:meta deviated in accounting).
4282
4283
4284 ---
4285
4286 * [HBASE-23321](https://issues.apache.org/jira/browse/HBASE-23321) | *Minor* | **[hbck2] fixHoles of fixMeta doesn't update in-memory state**
4287
4288 If there are holes in hbase:meta, hbck2 fixMeta now updates Master in-memory state, so you do not need to restart the Master just to assign the new hole-bridging regions.
4289
4290
4291 ---
4292
4293 * [HBASE-23282](https://issues.apache.org/jira/browse/HBASE-23282) | *Major* | **HBCKServerCrashProcedure for 'Unknown Servers'**
4294
4295 hbck2 scheduleRecoveries will now run an SCP that also looks in hbase:meta for any references to the scheduled server -- rather than only consulting Master in-memory state -- just in case vestiges of the server are left over in hbase:meta.
4296
4297
4298 ---
4299
4300 * [HBASE-19450](https://issues.apache.org/jira/browse/HBASE-19450) | *Minor* | **Add log about average execution time for ScheduledChore**
4301
4302 <!-- markdown -->
4303 HBase internal chores now log a moving average of how long execution of each chore takes at `INFO` level for the logger `org.apache.hadoop.hbase.ScheduledChore`.
4304
4305 Such messages will happen at most once per five minutes.
4306
4307
4308 ---
4309
4310 * [HBASE-23250](https://issues.apache.org/jira/browse/HBASE-23250) | *Minor* | **Log message about CleanerChore delegate initialization should be at INFO**
4311
4312 CleanerChore delegate initialization is now logged at INFO level instead of DEBUG
4313
4314
4315 ---
4316
4317 * [HBASE-23243](https://issues.apache.org/jira/browse/HBASE-23243) | *Major* | **[pv2] Filter out SUCCESS procedures; on decent-sized cluster, plethora overwhelms problems**
4318
4319 The 'Procedures & Locks' tab in Master UI only displays problematic Procedures now (RUNNABLE, WAITING-TIMEOUT, etc.). It no longer notes procedures whose state is SUCCESS.
4320
4321
4322 ---
4323
4324 * [HBASE-23227](https://issues.apache.org/jira/browse/HBASE-23227) | *Blocker* | **Upgrade jackson-databind to 2.9.10.1 to avoid recent CVEs**
4325
4326 <!-- markdown -->
4327
4328 The Apache HBase REST Proxy now uses Jackson Databind version 2.9.10.1 to address the following CVEs:
4329
4330   - CVE-2019-16942
4331   - CVE-2019-16943
4332
4333 Users of prior releases with Jackson Databind 2.9.10 are advised to either upgrade to this release or to upgrade their local Jackson Databind jar directly.
4334
4335
4336 ---
4337
4338 * [HBASE-23222](https://issues.apache.org/jira/browse/HBASE-23222) | *Critical* | **Better logging and mitigation for MOB compaction failures**
4339
4340 <!-- markdown -->
4341
4342 The MOB compaction process in the HBase Master now logs more about its activity.
4343
4344 In the event that you run into the problems described in HBASE-22075, there is a new HFileCleanerDelegate that will stop all removal of MOB hfiles from the archive area. It can be configured by adding `org.apache.hadoop.hbase.mob.ManualMobMaintHFileCleaner` to the list configured for `hbase.master.hfilecleaner.plugins`. This new cleaner delegate will cause your archive area to grow unbounded; you will have to manually prune files which may be prohibitively complex. Consider if your use case will allow you to mitigate by disabling mob compactions instead.
4345
4346 Caveats:
4347 * Be sure the list of cleaner delegates still includes the default cleaners you will likely need: ttl, snapshot, and hlink.
4348 * Be mindful that if you enable this cleaner delegate then there will be *no* automated process for removing these mob hfiles. You should see a single region per table in `%hbase_root%/archive` that accumulates files over time. You will have to determine which of these files are safe or not to remove.
4349 * You should list this cleaner delegate after the snapshot and hlink delegates so that you can enable sufficient logging to determine when an archived mob hfile is needed by those subsystems. When set to `TRACE` logging, the CleanerChore logger will include archive retention decision justifications.
4350 * If your use case creates a large number of uniquely named tables, this new delegate will cause memory pressure on the master.
4351
4352
4353 ---
4354
4355 * [HBASE-15519](https://issues.apache.org/jira/browse/HBASE-15519) | *Major* | **Add per-user metrics**
4356
4357 Adds per-user metrics for reads/writes to each RegionServer. These metrics are exported by default. hbase.regionserver.user.metrics.enabled can be used to disable the feature if desired for any reason.
4358
4359
4360 ---
4361
4362 * [HBASE-22460](https://issues.apache.org/jira/browse/HBASE-22460) | *Minor* | **Reopen a region if store reader references may have leaked**
4363
4364 Leaked store files cannot be removed even after they are invalidated via compaction. A reasonable mitigation for a reader reference leak would be a fast reopen of the region on the same server.
4365
4366 Configs:
4367
4368 1. hbase.master.regions.recovery.check.interval :
4369
4370 Regions Recovery Chore interval in milliseconds. This chore keeps running at this interval to find all regions with configurable max store file ref count and reopens them. Defaults to 20 mins
4371
4372 2. hbase.regions.recovery.store.file.ref.count :
4373
4374 This config represents Store files Ref Count threshold value considered for reopening regions. Any region with store files ref count \> this value would be eligible for reopening by master. Default value -1 indicates this feature is turned off. Only positive integer value should be provided to enable this feature.
4375
4376
4377 ---
4378
4379 * [HBASE-23172](https://issues.apache.org/jira/browse/HBASE-23172) | *Minor* | **HBase Canary region success count metrics reflect column family successes, not region successes**
4380
4381 Added a comment to make clear that read/write success counts are tallying column family success counts, not region success counts.
4382
4383 Additionally, the region read and write latencies previously only stored the latencies of the last column family of the region reads/writes. This has been fixed by using a map of each region to a list of read and write latency values.
4384
4385
4386 ---
4387
4388 * [HBASE-23177](https://issues.apache.org/jira/browse/HBASE-23177) | *Major* | **If fail to open reference because FNFE, make it plain it is a Reference**
4389
4390 Changes the message on the FNFE exception thrown when the file a Reference points to is missing; the message now includes detail on the Reference as well as the pointed-to file, so you can connect how the FNFE relates to region open.
4391
4392
4393 ---
4394
4395 * [HBASE-20626](https://issues.apache.org/jira/browse/HBASE-20626) | *Major* | **Change the value of "Requests Per Second" on WEBUI**
4396
4397 Use 'totalRowActionRequestCount' to calculate QPS on web UI.
4398
4399
4400 ---
4401
4402 * [HBASE-22874](https://issues.apache.org/jira/browse/HBASE-22874) | *Critical* | **Define a public interface for Canary and move existing implementation to LimitedPrivate**
4403
4404 <!-- markdown -->
4405 Downstream users who wish to programmatically check the health of their HBase cluster may now rely on a public interface derived from the previously private implementation of the canary cli tool. The interface is named `Canary` and can be found in the user facing javadocs.
4406
4407 Downstream users who previously relied on invoking the canary via the Java classname (either on the command line or programmatically) will need to change how they do so because the non-public implementation has moved.
4408
4409
4410 ---
4411
4412 * [HBASE-23035](https://issues.apache.org/jira/browse/HBASE-23035) | *Major* | **Retain region to the last RegionServer make the failover slower**
4413
4414 Since 2.0.0, when a regionserver crashed and came back online, AssignmentManager would retain the region locations and try to assign the regions to this regionserver (same host:port as the crashed one) again. But in 1.x.x, the behavior was round-robin assignment of the regions belonging to the crashed regionserver. This jira changes the "retain" assignment to round-robin assignment, the same as in 1.x.x. This change makes failover faster and improves availability.
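
For readers unfamiliar with the term, round-robin assignment simply deals regions out to the live servers in turn. The sketch below is a generic illustration of that idea, not AssignmentManager's implementation; the class name, method name, and server/region names are hypothetical.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Generic illustration of round-robin distribution; not HBase code.
final class RoundRobinAssignSketch {
  /** Deals regions to servers in turn and returns server -> assigned regions. */
  static Map<String, List<String>> assign(List<String> regions, List<String> servers) {
    Map<String, List<String>> plan = new LinkedHashMap<>();
    for (String server : servers) {
      plan.put(server, new ArrayList<>());
    }
    if (servers.isEmpty()) {
      return plan; // nothing to assign to
    }
    int next = 0;
    for (String region : regions) {
      plan.get(servers.get(next % servers.size())).add(region);
      next++;
    }
    return plan;
  }

  public static void main(String[] args) {
    // Regions of a crashed server spread over the two remaining servers.
    System.out.println(assign(List.of("r1", "r2", "r3", "r4", "r5"), List.of("rs-a", "rs-b")));
  }
}
```

Spreading the crashed server's regions this way avoids waiting on, or piling everything back onto, a single host.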
4415
4416
4417 ---
4418
4419 * [HBASE-23046](https://issues.apache.org/jira/browse/HBASE-23046) | *Minor* | **Remove compatibility case from truncate command**
4420
4421 Remove backward compatibility from \`truncate\` and \`truncate\_preserve\` shell commands. This means that these commands from HBase Clients are not compatible with pre-0.99 HBase clusters.
4422
4423
4424 ---
4425
4426 * [HBASE-23040](https://issues.apache.org/jira/browse/HBASE-23040) | *Minor* | **region mover gives NullPointerException instead of saying a host isn't in the cluster**
4427
4428 Giving the region mover "unload" command a region server name that isn't recognized by the cluster results in an "I don't know about that host" message instead of an NPE.
4429
4430 Set the log level to DEBUG if you'd like the region mover to log the set of region server names it got back from the cluster.
4431
4432
4433 ---
4434
4435 * [HBASE-21874](https://issues.apache.org/jira/browse/HBASE-21874) | *Major* | **Bucket cache on Persistent memory**
4436
4437 Added a new IOEngine type for the bucket cache: persistent memory. To use the bucket cache over pmem, configure the IOEngine as
4438   \<property\>
4439     \<name\>hbase.bucketcache.ioengine\</name\>
4440     \<value\> pmem:///path in persistent memory \</value\>
4441   \</property\>
4442
4443
4444 ---
4445
4446 * [HBASE-22760](https://issues.apache.org/jira/browse/HBASE-22760) | *Major* | **Stop/Resume Snapshot Auto-Cleanup activity with shell command**
4447
4448 By default, snapshot auto cleanup based on TTL is enabled for any new cluster. If snapshot cleanup needs to be stopped at any point, because of snapshot restore activity or for any other reason, it is advisable to disable it using the shell command:
4449 hbase\> snapshot\_cleanup\_switch false
4450
4451 We can re-enable it using:
4452 hbase\> snapshot\_cleanup\_switch true
4453
4454 We can query whether snapshot auto cleanup is enabled for cluster using:
4455 hbase\> snapshot\_cleanup\_enabled
4456
4457
4458 ---
4459
4460 * [HBASE-22796](https://issues.apache.org/jira/browse/HBASE-22796) | *Major* | **[HBCK2] Add fix of overlaps to fixMeta hbck Service**
4461
4462 Adds fix of overlaps to the fixMeta hbck service method. Uses the bulk-merge facility. Merges a max of 10 at a time. Set hbase.master.metafixer.max.merge.count to a higher value if you want to merge more than 10 in one go.
4463
4464
4465 ---
4466
4467 * [HBASE-21745](https://issues.apache.org/jira/browse/HBASE-21745) | *Critical* | **Make HBCK2 be able to fix issues other than region assignment**
4468
4469 This issue adds via its subtasks:
4470
4471  \* An 'HBCK Report' page to the Master UI added by HBASE-22527+HBASE-22709+HBASE-22723+ (since 2.1.6, 2.2.1, 2.3.0). Lists consistency or anomalies found via new hbase:meta consistency checking extensions added to CatalogJanitor (holes, overlaps, bad servers) and by a new 'HBCK chore' that runs at a lesser periodicity that will note filesystem orphans and overlaps as well as the following conditions:
4472  \*\* Master thought this region opened, but no regionserver reported it.
4473  \*\* Master thought this region opened on Server1, but regionserver reported Server2
4474  \*\* More than one regionserver reported this region as opened
4475  Both chores can be triggered from the shell to regenerate ‘new’ reports.
4476  \* Means of scheduling a ServerCrashProcedure (HBASE-21393).
4477  \* An ‘offline’ hbase:meta rebuild (HBASE-22680).
4478  \* Offline replace of hbase.version and hbase.id
4479  \* Documentation on how to use completebulkload tool to ‘adopt’ orphaned data found by new HBCK2 ‘filesystem’ check (see below) and ‘HBCK chore’ (HBASE-22859)
4480  \* A ‘holes’ and ‘overlaps’ fix that runs in the master that uses new bulk-merge facility to collapse many overlaps in the one go.
4481  \* hbase-operator-tools HBCK2 client tool got a bunch of additions:
4482  \*\* A specialized 'fix' for the case where operators ran old hbck 'offlinemeta' repair and destroyed their hbase:meta; it ties together holes in meta with orphaned data in the fs (HBASE-22567)
4483  \*\* A ‘filesystem’ command that reports on orphan data as well as bad references and hlinks with a ‘fix’ for the latter two options (based on hbck1 facility updated).
4484  \*\* Adds back the ‘replication’ fix facility from hbck1 (HBASE-22717)
4485
4486 The compound result is that hbck2 is now in excess of hbck1 abilities. The provided functionality is disaggregated as per the hbck2 philosophy of providing 'plumbing' rather than 'porcelain' so there is work to do still adding fix-it playbooks, scripting across outages, and automation.
4487
4488
4489 ---
4490
4491 * [HBASE-22802](https://issues.apache.org/jira/browse/HBASE-22802) | *Major* | **Avoid temp ByteBuffer allocation in FileIOEngine#read**
4492
HBASE-21879 introduces a utility class (org.apache.hadoop.hbase.io.ByteBuffAllocator) for allocating ByteBuffers from, and freeing them back to, an NIO ByteBuffer pool. When BucketCache is enabled with the file or mmap engine, we now use this pool to avoid frequent temporary ByteBuffer allocations.
4494
4495
4496 ---
4497
4498 * [HBASE-11062](https://issues.apache.org/jira/browse/HBASE-11062) | *Major* | **hbtop**
4499
Introduces hbtop, a real-time monitoring tool for HBase modeled on the Unix top command. See the ref guide for details: https://hbase.apache.org/book.html#hbtop
4501
4502
4503 ---
4504
4505 * [HBASE-21879](https://issues.apache.org/jira/browse/HBASE-21879) | *Major* | **Read HFile's block to ByteBuffer directly instead of to byte for reducing young gc purpose**
4506
Before this issue, the read path was 100% offheap when the block was in the BucketCache. On a cache miss, however, the RS had to read the block via an on-heap API, which caused high young-GC pressure.

This issue adds reading the block offheap even when it is read directly from the filesystem. It requires hadoop version \>=2.9.3 but also works with older hadoop versions (everything works, but blocks continue to be read onheap). It also requires HBASE-21946, which is not yet in place as of this writing/hbase-2.3.0.
4510
A detailed doc on the implementation, performance and practice is available here: https://docs.google.com/document/d/1xSy9axGxafoH-Qc17zbD2Bd--rWjjI00xTWQZ8ZwI\_E/edit#heading=h.nch5d72p27ex
4512
4513
4514 ---
4515
4516 * [HBASE-22618](https://issues.apache.org/jira/browse/HBASE-22618) | *Major* | **added the possibility to load custom cost functions**
4517
4518 <!-- markdown -->
Extends `StochasticLoadBalancer` to support user-provided cost functions. These are loaded in addition to the default set of cost functions. Custom function implementations must extend `StochasticLoadBalancer$CostFunction`. Enable any additional functions by placing them on the master class path and configuring `hbase.master.balancer.stochastic.additionalCostFunctions` with a comma-separated list of fully-qualified class names.
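
The loading step described above can be pictured with a small, self-contained sketch. It does not use HBase classes: `CostFunctionSketch`, `TableSkewCost` and `AdditionalCostFunctionLoader` are hypothetical stand-ins, and the real `StochasticLoadBalancer$CostFunction` may have a different constructor and API. It only shows the general idea of splitting a comma-separated list of fully-qualified class names and instantiating each class reflectively.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in for StochasticLoadBalancer$CostFunction (not the HBase class).
abstract class CostFunctionSketch {
  abstract double cost();
}

// Hypothetical user-provided cost function.
class TableSkewCost extends CostFunctionSketch {
  @Override
  double cost() {
    return 0.25;
  }
}

final class AdditionalCostFunctionLoader {
  // Parses a comma-separated list of class names and instantiates each class
  // through its no-argument constructor.
  static List<CostFunctionSketch> load(String commaSeparatedClassNames) {
    List<CostFunctionSketch> functions = new ArrayList<>();
    for (String name : commaSeparatedClassNames.split(",")) {
      String trimmed = name.trim();
      if (trimmed.isEmpty()) {
        continue; // tolerate stray commas and whitespace
      }
      try {
        Class<? extends CostFunctionSketch> clazz =
            Class.forName(trimmed).asSubclass(CostFunctionSketch.class);
        functions.add(clazz.getDeclaredConstructor().newInstance());
      } catch (ReflectiveOperationException | ClassCastException e) {
        throw new IllegalArgumentException("Cannot load cost function " + trimmed, e);
      }
    }
    return functions;
  }

  public static void main(String[] args) {
    // In HBase the list would come from
    // hbase.master.balancer.stochastic.additionalCostFunctions.
    List<CostFunctionSketch> loaded = load(TableSkewCost.class.getName());
    System.out.println(loaded.size() + " custom cost function(s) loaded");
  }
}
```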
4520
4521
4522 ---
4523
4524 * [HBASE-22867](https://issues.apache.org/jira/browse/HBASE-22867) | *Critical* | **The ForkJoinPool in CleanerChore will spawn thousands of threads in our cluster with thousands table**
4525
Replaces the ForkJoinPool in CleanerChore with a ThreadPoolExecutor, which limits the number of threads spawned and avoids frequent GC on the master. The replacement is internal to CleanerChore, so no config keys change; users can upgrade the hbase master without any other change.
4527
4528
4529 ---
4530
4531 * [HBASE-22810](https://issues.apache.org/jira/browse/HBASE-22810) | *Major* | **Initialize an separate ThreadPoolExecutor for taking/restoring snapshot**
4532
Introduces a new config key for snapshot taking/restoring operations on the master side: hbase.master.executor.snapshot.threads. Its default value is 3, which means 3 snapshot operations can run at the same time.
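
As a rough illustration of what a thread count of 3 means, the sketch below submits more tasks than workers to a fixed-size pool and records the peak concurrency. `SnapshotPoolSketch` is a hypothetical name, the sleeping task is a stand-in for real snapshot work, and the use of a plain fixed thread pool is an assumption rather than a description of the master's actual executor.

```java
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.atomic.AtomicInteger;

final class SnapshotPoolSketch {
  // Runs `tasks` simulated snapshot operations on a pool of `threads` workers
  // and returns the highest number of operations observed running at once.
  static int maxConcurrent(int threads, int tasks) {
    ExecutorService pool = Executors.newFixedThreadPool(threads);
    AtomicInteger running = new AtomicInteger();
    AtomicInteger peak = new AtomicInteger();
    for (int i = 0; i < tasks; i++) {
      pool.execute(() -> {
        int now = running.incrementAndGet();
        peak.accumulateAndGet(now, Math::max);
        try {
          Thread.sleep(20); // stand-in for the actual snapshot work
        } catch (InterruptedException e) {
          Thread.currentThread().interrupt();
        } finally {
          running.decrementAndGet();
        }
      });
    }
    pool.shutdown();
    try {
      if (!pool.awaitTermination(10, TimeUnit.SECONDS)) {
        throw new IllegalStateException("simulated snapshots did not finish in time");
      }
    } catch (InterruptedException e) {
      Thread.currentThread().interrupt();
      throw new IllegalStateException(e);
    }
    return peak.get();
  }

  public static void main(String[] args) {
    // 3 mirrors the default of hbase.master.executor.snapshot.threads.
    System.out.println("peak concurrency: " + maxConcurrent(3, 9));
  }
}
```

Extra requests are not rejected in this sketch; they wait in the pool's queue until a worker is free.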
4534
4535
4536 ---
4537
4538 * [HBASE-22863](https://issues.apache.org/jira/browse/HBASE-22863) | *Major* | **Avoid Jackson versions and dependencies with known CVEs**
4539
1. Stopped exposing vulnerable Jackson1 dependencies so that downstream projects do not pull them in from HBase.
2. However, since Hadoop requires some Jackson1 dependencies, the vulnerable Jackson mapper is kept at test scope in some HBase modules; hence the HBase tarball created by hbase-assembly contains the Jackson1 mapper jar in lib. Still, downstream applications can't pull in Jackson1 from HBase.
4542
4543
4544 ---
4545
4546 * [HBASE-22841](https://issues.apache.org/jira/browse/HBASE-22841) | *Major* | **TimeRange's factory functions do not support ranges, only \`allTime\` and \`at\`**
4547
Adds several APIs to the TimeRange class to avoid using the deprecated TimeRange constructor:
4549 \* TimeRange#from: Represents the time interval [minStamp, Long.MAX\_VALUE)
4550 \* TimeRange#until: Represents the time interval [0, maxStamp)
4551 \* TimeRange#between: Represents the time interval [minStamp, maxStamp)
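
The half-open interval semantics listed above can be checked with a tiny model. `TimeRangeSketch` is a hypothetical class written only for this illustration; it mirrors the method names from the list but is not HBase's `TimeRange`, whose validation and return types may differ.

```java
// Hypothetical model of a half-open time interval [min, max); not HBase's TimeRange.
final class TimeRangeSketch {
  final long min; // inclusive lower bound
  final long max; // exclusive upper bound

  private TimeRangeSketch(long min, long max) {
    if (min < 0 || max < min) {
      throw new IllegalArgumentException("invalid range [" + min + ", " + max + ")");
    }
    this.min = min;
    this.max = max;
  }

  // [minStamp, Long.MAX_VALUE)
  static TimeRangeSketch from(long minStamp) {
    return new TimeRangeSketch(minStamp, Long.MAX_VALUE);
  }

  // [0, maxStamp)
  static TimeRangeSketch until(long maxStamp) {
    return new TimeRangeSketch(0, maxStamp);
  }

  // [minStamp, maxStamp)
  static TimeRangeSketch between(long minStamp, long maxStamp) {
    return new TimeRangeSketch(minStamp, maxStamp);
  }

  boolean contains(long timestamp) {
    return timestamp >= min && timestamp < max;
  }

  public static void main(String[] args) {
    TimeRangeSketch range = between(5, 10);
    System.out.println(range.contains(5));  // lower bound is included
    System.out.println(range.contains(10)); // upper bound is excluded
  }
}
```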
4552
4553
4554 ---
4555
4556 * [HBASE-22833](https://issues.apache.org/jira/browse/HBASE-22833) | *Minor* | **MultiRowRangeFilter should provide a method for creating a filter which is functionally equivalent to multiple prefix filters**
4557
Provides a public constructor in the MultiRowRangeFilter class for filtering on multiple row prefixes. It expands the row prefixes into multiple rowkey ranges, which is more efficient than using multiple prefix filters.
4559 {code}
4560 public MultiRowRangeFilter(byte[][] rowKeyPrefixes);
4561 {code}
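
To make the prefix-to-range expansion concrete, here is a minimal sketch of one common way to derive the exclusive stop row for a prefix: drop trailing 0xFF bytes and increment the last remaining byte. `PrefixRangeSketch` and `stopRowForPrefix` are hypothetical names, and this is an assumption about the general technique, not a copy of what MultiRowRangeFilter does internally.

```java
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

final class PrefixRangeSketch {
  // Returns the smallest row key that sorts after every key starting with
  // `prefix` (the exclusive stop row), or an empty array when no such key exists.
  static byte[] stopRowForPrefix(byte[] prefix) {
    int end = prefix.length;
    // Trailing 0xFF bytes cannot be incremented, so drop them.
    while (end > 0 && prefix[end - 1] == (byte) 0xFF) {
      end--;
    }
    if (end == 0) {
      return new byte[0]; // empty or all-0xFF prefix: the range has no upper bound
    }
    byte[] stop = Arrays.copyOf(prefix, end);
    stop[end - 1]++; // next value of the last remaining byte
    return stop;
  }

  public static void main(String[] args) {
    byte[] start = "user1".getBytes(StandardCharsets.UTF_8);
    byte[] stop = stopRowForPrefix(start);
    // Rows with prefix "user1" fall in the range [start, stop).
    System.out.println(new String(start, StandardCharsets.UTF_8) + " .. "
        + new String(stop, StandardCharsets.UTF_8));
  }
}
```

Each prefix then corresponds to one range [prefix, stopRow), so a scan can seek directly to the start of the next range instead of testing every row against every prefix.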
4562
4563
4564 ---
4565
4566 * [HBASE-22856](https://issues.apache.org/jira/browse/HBASE-22856) | *Major* | **HBASE-Find-Flaky-Tests fails with pip error**
4567
4568 Update the base docker image to ubuntu 18.04 for the find flaky tests jenkins job.
4569
4570
4571 ---
4572
4573 * [HBASE-22771](https://issues.apache.org/jira/browse/HBASE-22771) | *Major* | **[HBCK2] fixMeta method and server-side support**
4574
4575 Adds a fixMeta method to hbck Service. Fixes holes in hbase:meta. Follow-up to fix overlaps. See HBASE-22567 also.
4576
4577 Follow-on is adding a client-side to hbase-operator-tools that can exploit this new addition (HBASE-22825)
4578
4579
4580 ---
4581
4582 * [HBASE-22777](https://issues.apache.org/jira/browse/HBASE-22777) | *Major* | **Add a multi-region merge (for fixing overlaps, etc.)**
4583
4584 Changes merge so you can merge more than two regions at a time.  Currently only available inside HBase. HBASE-22827, a follow-on, is about exposing the facility in the Admin API (and then via the shell).
4585
4586
4587 ---
4588
4589 * [HBASE-15666](https://issues.apache.org/jira/browse/HBASE-15666) | *Critical* | **shaded dependencies for hbase-testing-util**
4590
4591 New shaded artifact for testing: hbase-shaded-testing-util.
4592
4593
4594 ---
4595
4596 * [HBASE-22776](https://issues.apache.org/jira/browse/HBASE-22776) | *Major* | **Rename config names in user scan snapshot feature**
4597
After HBASE-22776, the steps to configure the user scan snapshot feature are as follows:
1. Check HDFS configuration
2. Add master coprocessor:
    hbase.coprocessor.master.classes="org.apache.hadoop.hbase.security.access.AccessController,org.apache.hadoop.hbase.security.access.SnapshotScannerHDFSAclController"
3. Enable this feature:
    hbase.acl.sync.to.hdfs.enable=true
4. Modify table schema to enable this feature for a table:
    alter 't1', CONFIGURATION =\> {'hbase.acl.sync.to.hdfs.enable' =\> 'true'}
4608
4609
4610 ---
4611
4612 * [HBASE-22539](https://issues.apache.org/jira/browse/HBASE-22539) | *Blocker* | **WAL corruption due to early DBBs re-use when Durability.ASYNC\_WAL is used**
4613
We found a critical bug which can lead to WAL corruption when Durability.ASYNC\_WAL is used. The reason is that we release a ByteBuffer before actually persisting its content to the WAL file.

The problem may lead to several errors, for example, ArrayIndexOutOfBoundsException when replaying the WAL, because the ByteBuffer has been reused by others.
4617
4618 ERROR org.apache.hadoop.hbase.executor.EventHandler: Caught throwable while processing event RS\_LOG\_REPLAY
4619 java.lang.ArrayIndexOutOfBoundsException: 18056
4620         at org.apache.hadoop.hbase.KeyValue.getFamilyLength(KeyValue.java:1365)
4621         at org.apache.hadoop.hbase.KeyValue.getFamilyLength(KeyValue.java:1358)
4622         at org.apache.hadoop.hbase.PrivateCellUtil.matchingFamily(PrivateCellUtil.java:735)
4623         at org.apache.hadoop.hbase.CellUtil.matchingFamily(CellUtil.java:816)
4624         at org.apache.hadoop.hbase.wal.WALEdit.isMetaEditFamily(WALEdit.java:143)
4625         at org.apache.hadoop.hbase.wal.WALEdit.isMetaEdit(WALEdit.java:148)
4626         at org.apache.hadoop.hbase.wal.WALSplitter.splitLogFile(WALSplitter.java:297)
4627         at org.apache.hadoop.hbase.wal.WALSplitter.splitLogFile(WALSplitter.java:195)
4628         at org.apache.hadoop.hbase.regionserver.SplitLogWorker$1.exec(SplitLogWorker.java:100)
4629
It may even cause a segmentation fault and crash the JVM directly. You will see a hs\_err\_pidXXX.log file, and the problem is usually SIGSEGV. This is usually because the ByteBuffer has already been returned to the OS and used for another purpose.

The problem has been reported several times in the past. This time Wellington Ramos Chevreuil provided the full logs and analyzed them in depth so that we could find the root cause, and Lijin Bin figured out that the problem may only happen when Durability.ASYNC\_WAL is used. Thanks to them.

The problem only affects the 2.x releases. All users are highly recommended to upgrade to a release which includes this fix, especially if you use Durability.ASYNC\_WAL.
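
The failure mode can be reproduced in miniature without HBase. The sketch below is a single-threaded simulation under stated assumptions: `EarlyReuseSketch` is a hypothetical class, an 8-byte heap buffer stands in for a pooled buffer, and the "deferred write" stands in for the asynchronous WAL append. It only demonstrates why releasing a buffer before its content is persisted lets another request overwrite the pending data.

```java
import java.nio.ByteBuffer;
import java.nio.charset.StandardCharsets;

final class EarlyReuseSketch {
  // Simulates an append whose write is deferred. When `releaseBeforeWrite` is
  // true, the pooled buffer is recycled for another request before the deferred
  // write happens. Returns the bytes that finally reach the simulated WAL.
  static String appendThenPersist(boolean releaseBeforeWrite) {
    ByteBuffer pooled = ByteBuffer.allocate(8); // stand-in for a pooled buffer
    pooled.put("edit-001".getBytes(StandardCharsets.US_ASCII)).flip();

    // The pending WAL entry references the same memory; nothing is copied.
    ByteBuffer pendingWrite = pooled.duplicate();

    if (releaseBeforeWrite) {
      // The buffer goes back to the pool and is handed to another request.
      pooled.clear();
      pooled.put("OTHERREQ".getBytes(StandardCharsets.US_ASCII)).flip();
    }

    // Deferred persist: reads whatever the shared memory holds at this point.
    byte[] persisted = new byte[pendingWrite.remaining()];
    pendingWrite.get(persisted);
    return new String(persisted, StandardCharsets.US_ASCII);
  }

  public static void main(String[] args) {
    System.out.println("release after write:  " + appendThenPersist(false));
    System.out.println("release before write: " + appendThenPersist(true));
  }
}
```

In this toy case the overwritten content is still well-formed text; in a real WAL the overwritten bytes are parsed as cell lengths and offsets, which is how out-of-bounds indexes like the one in the stack trace above can arise.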
4635
4636
4637 ---
4638
4639 * [HBASE-22737](https://issues.apache.org/jira/browse/HBASE-22737) | *Major* | **Add a new admin method and shell cmd to trigger the hbck chore to run**
4640
4641 Add a new method runHbckChore in Hbck interface and a new shell cmd hbck\_chore\_run to request HBCK chore to run at master side.
4642
4643
4644 ---
4645
4646 * [HBASE-22741](https://issues.apache.org/jira/browse/HBASE-22741) | *Major* | **Show catalogjanitor consistency complaints in new 'HBCK Report' page**
4647
Adds a "CatalogJanitor hbase:meta Consistency Issues" section to the new 'HBCK Report' page added by HBASE-22709. This section is empty unless the most recent CatalogJanitor scan turned up problems, in which case it shows a table of the issues found.
4649
4650
4651 ---
4652
4653 * [HBASE-22723](https://issues.apache.org/jira/browse/HBASE-22723) | *Major* | **Have CatalogJanitor report holes and overlaps; i.e. problems it sees when doing its regular scan of hbase:meta**
4654
4655 When CatalogJanitor runs, it now checks for holes, overlaps, empty info:regioninfo columns and bad servers. Dumps findings into log. Follow-up adds report to new 'HBCK Report' linked off the Master UI.
4656
4657 NOTE: All features but the badserver check made it into branch-2.1 and branch-2.0 backports.
4658
4659
4660 ---
4661
4662 * [HBASE-22714](https://issues.apache.org/jira/browse/HBASE-22714) | *Trivial* | **BuffferedMutatorParams opertationTimeOut() is misspelt**
4663
4664 The misspelled BufferedMutatorParams.opertationTimeout method has been marked as deprecated, and will be removed in 4.0.0. Please use the BufferedMutatorParams.operationTimeout method instead.
4665
4666
4667 ---
4668
4669 * [HBASE-22580](https://issues.apache.org/jira/browse/HBASE-22580) | *Major* | **Add a table attribute to make user scan snapshot feature configurable for table**
4670
If users need to scan snapshots of a table, configure the following table schema attribute so that granted users' ACLs are added to its hfiles:
4672 alter 't1', CONFIGURATION =\> {'hbase.user.scan.snapshot.enable' =\> 'true'}
4673
4674
4675 ---
4676
4677 * [HBASE-22709](https://issues.apache.org/jira/browse/HBASE-22709) | *Major* | **Add a chore thread in master to do hbck checking and display results in 'HBCK Report' page**
4678
4679 1. Add a new chore thread in master to do hbck checking
4680 2. Add a new web ui "HBCK Report" page to display checking results.
4681
This feature is enabled by default, and the hbck chore runs every 60 minutes by default. Set "hbase.master.hbck.checker.interval" to a value less than or equal to 0 to disable the chore.
4683
4684 Notice: the config "hbase.master.hbck.checker.interval" was renamed to "hbase.master.hbck.chore.interval" in HBASE-22737.
4685
4686
4687 ---
4688
4689 * [HBASE-22578](https://issues.apache.org/jira/browse/HBASE-22578) | *Major* | **HFileCleaner should not delete empty ns/table directories used for user san snapshot feature**
4690
The HFileCleaner cleans empty directories under archive, but when the user scan snapshot feature is enabled, user ACLs are set on these directories. Configure the following cleaner so that directories with user ACLs are not cleaned:
4692 hbase.master.hfilecleaner.plugins=org.apache.hadoop.hbase.security.access.SnapshotScannerHDFSAclCleaner
4693
4694
4695 ---
4696
4697 * [HBASE-22722](https://issues.apache.org/jira/browse/HBASE-22722) | *Blocker* | **Upgrade jackson databind dependencies to 2.9.9.1**
4698
4699 Upgrade jackson databind dependency to 2.9.9.1 due to CVEs
4700
4701 https://nvd.nist.gov/vuln/detail/CVE-2019-12814
4702
4703 https://nvd.nist.gov/vuln/detail/CVE-2019-12384
4704
4705
4706 ---
4707
4708 * [HBASE-22527](https://issues.apache.org/jira/browse/HBASE-22527) | *Major* | **[hbck2] Add a master web ui to show the problematic regions**
4709
Add a new master web UI to show the potentially problematic opened regions. There are three cases:
1. Master thought this region opened, but no regionserver reported it.
2. Master thought this region opened on Server1, but regionserver reported Server2
3. More than one regionserver reported this region as opened.
4714
4715
4716 ---
4717
4718 * [HBASE-22648](https://issues.apache.org/jira/browse/HBASE-22648) | *Minor* | **Snapshot TTL**
4719
4720 Feature: Take a Snapshot With TTL for auto-cleanup
4721
4722 Attribute:
4723 1. TTL
4724      - Specify TTL in sec while creating snapshot. e.g. snapshot 'mytable', 'snapshot1234', {TTL =\> 86400}  (snapshot to be auto-cleaned after 24 hr)
4725
4726 Configs:
4727 1. Default Snapshot TTL:
4728      - FOREVER by default
4729      - User specified Default TTL(sec) with config: hbase.master.snapshot.ttl
4730
4731 2. If Snapshot cleanup is supposed to be stopped due to some snapshot restore activity, disable it with config:
4732      - hbase.master.cleaner.snapshot.disable: "true"
4733     With this config, HMaster needs restart just like any other hbase-site config.
4734
4735
4736 For more details, see the section "Take a Snapshot With TTL" in the HBase Reference Guide.
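
The expiry rule implied by a per-snapshot TTL can be sketched as follows. `SnapshotTtlSketch` and `isExpired` are hypothetical names, and treating a TTL of 0 or less as FOREVER is an assumption made for this illustration; the Reference Guide is the authority on the actual encoding.

```java
import java.util.concurrent.TimeUnit;

final class SnapshotTtlSketch {
  // Returns true when a snapshot created at `creationTimeMs` with the given TTL
  // (in seconds) is due for auto-cleanup at `nowMs`.
  // Assumption: a TTL of 0 or less stands for FOREVER.
  static boolean isExpired(long creationTimeMs, long ttlSeconds, long nowMs) {
    if (ttlSeconds <= 0) {
      return false; // FOREVER: never cleaned up automatically
    }
    long ttlMs = TimeUnit.SECONDS.toMillis(ttlSeconds); // saturates instead of overflowing
    return nowMs - creationTimeMs > ttlMs;
  }

  public static void main(String[] args) {
    long created = 0L;
    long oneDaySeconds = 86400L; // matches {TTL => 86400} in the example above
    System.out.println(isExpired(created, oneDaySeconds, TimeUnit.HOURS.toMillis(23)));
    System.out.println(isExpired(created, oneDaySeconds, TimeUnit.HOURS.toMillis(25)));
  }
}
```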
4737
4738
4739 ---
4740
4741 * [HBASE-22610](https://issues.apache.org/jira/browse/HBASE-22610) | *Trivial* | **[BucketCache] Rename "hbase.offheapcache.minblocksize"**
4742
4743 The config point "hbase.offheapcache.minblocksize" was wrong and is now deprecated. The new config point is "hbase.blockcache.minblocksize".
4744
4745
4746 ---
4747
4748 * [HBASE-22690](https://issues.apache.org/jira/browse/HBASE-22690) | *Major* | **Deprecate / Remove OfflineMetaRepair in hbase-2+**
4749
4750 OfflineMetaRepair is no longer supported in HBase-2+. Please refer to https://hbase.apache.org/book.html#HBCK2
4751
4752 This tool is deprecated in 2.x and will be removed in 3.0.
4753
4754
4755 ---
4756
4757 * [HBASE-22673](https://issues.apache.org/jira/browse/HBASE-22673) | *Major* | **Avoid to expose protobuf stuff in Hbck interface**
4758
4759 Mark the Hbck#scheduleServerCrashProcedure(List\<HBaseProtos.ServerName\> serverNames) as deprecated. Use Hbck#scheduleServerCrashProcedures(List\<ServerName\> serverNames) instead.
4760
4761
4762 ---
4763
4764 * [HBASE-22617](https://issues.apache.org/jira/browse/HBASE-22617) | *Blocker* | **Recovered WAL directories not getting cleaned up**
4765
In HBASE-20734 we moved the recovered.edits onto the wal file system, but when constructing the directory we missed the BASE\_NAMESPACE\_DIR ('data'). So with the default config, you will find lots of new directories at the same level as the 'data' directory.

In this issue, we add the BASE\_NAMESPACE\_DIR back and also try our best to clean up the wrong directories. But we can only clean up the region-level directories, so if you want a clean fs layout on HDFS you still need to manually delete the empty directories at the same level as 'data'.

The affected versions are 2.2.0, 2.1.[1-5], 1.4.[8-10], 1.3.[3-5].
4771
4772
4773 ---
4774
4775 * [HBASE-21995](https://issues.apache.org/jira/browse/HBASE-21995) | *Major* | **Add a coprocessor to set HDFS ACL for hbase granted user**
4776
Adds a coprocessor that sets HDFS ACLs so that HBase users granted READ permission can scan snapshots.
To use this feature, make sure the following HDFS config is set:
4779 dfs.namenode.acls.enabled=true
4780 fs.permissions.umask-mode=027
4781
4782 and set the HBase config:
4783 hbase.coprocessor.master.classes="org.apache.hadoop.hbase.security.access.AccessController,org.apache.hadoop.hbase.security.access.SnapshotScannerHDFSAclController"
4784 hbase.user.scan.snapshot.enable=true
4785
4786
4787 ---
4788
4789 * [HBASE-22596](https://issues.apache.org/jira/browse/HBASE-22596) | *Minor* | **[Chore] Separate the execution period between CompactionChecker and PeriodicMemStoreFlusher**
4790
hbase.regionserver.compaction.check.period controls how often the compaction checker runs. If unset, hbase.server.thread.wakefrequency is used as the default value.

hbase.regionserver.flush.check.period controls how often the flush checker runs. If unset, hbase.server.thread.wakefrequency is used as the default value.
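
The fallback between these keys can be sketched with plain `java.util.Properties` standing in for the HBase configuration. `CheckPeriodSketch` is a hypothetical helper, and `wakeFrequencyDefaultMs` is a caller-supplied placeholder used only when the wake frequency itself is unset; no claim is made here about HBase's built-in default.

```java
import java.util.Properties;

final class CheckPeriodSketch {
  static final String WAKE_FREQUENCY = "hbase.server.thread.wakefrequency";
  static final String COMPACTION_CHECK = "hbase.regionserver.compaction.check.period";
  static final String FLUSH_CHECK = "hbase.regionserver.flush.check.period";

  // Returns the value of `key`, falling back to the wake frequency when `key`
  // is unset. `wakeFrequencyDefaultMs` applies only if the wake frequency is
  // unset as well (a placeholder chosen by the caller).
  static int period(Properties conf, String key, int wakeFrequencyDefaultMs) {
    int wakeFrequency = Integer.parseInt(
        conf.getProperty(WAKE_FREQUENCY, String.valueOf(wakeFrequencyDefaultMs)));
    return Integer.parseInt(conf.getProperty(key, String.valueOf(wakeFrequency)));
  }

  public static void main(String[] args) {
    Properties conf = new Properties();
    conf.setProperty(WAKE_FREQUENCY, "5000");
    conf.setProperty(FLUSH_CHECK, "1000");
    // Compaction check is unset, so it follows the wake frequency.
    System.out.println("compaction check: " + period(conf, COMPACTION_CHECK, 10000));
    System.out.println("flush check:      " + period(conf, FLUSH_CHECK, 10000));
  }
}
```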
4794
4795
4796 ---
4797
4798 * [HBASE-22588](https://issues.apache.org/jira/browse/HBASE-22588) | *Major* | **Upgrade jaxws-ri dependency to 2.3.2**
4799
4800 <!-- markdown -->
4801
When run with JDK11, HBase now uses a more recent version of the jaxws reference implementation (v2.3.2).
4803
4804
4805 ---
4806
4807 * [HBASE-21536](https://issues.apache.org/jira/browse/HBASE-21536) | *Trivial* | **Fix completebulkload usage instructions**
4808
4809 Added completebulkload short name for BulkLoadHFilesTool to bin/hbase.
4810
4811
4812 ---
4813
4814 * [HBASE-22500](https://issues.apache.org/jira/browse/HBASE-22500) | *Blocker* | **Modify pom and jenkins jobs for hadoop versions**
4815
Change the default hadoop-3 version to 3.1.2. Drop support for the releases affected by CVE-2018-8029; see this email: https://lists.apache.org/thread.html/3d6831c3893cd27b6850aea2feff7d536888286d588e703c6ffd2e82@%3Cuser.hadoop.apache.org%3E
4817
4818
4819 ---
4820
4821 * [HBASE-22459](https://issues.apache.org/jira/browse/HBASE-22459) | *Minor* | **Expose store reader reference count**
4822
4823 This change exposes the aggregate count of store reader references for a given store as 'storeRefCount' in region metrics and ClusterStatus.
4824
4825
4826 ---
4827
4828 * [HBASE-22469](https://issues.apache.org/jira/browse/HBASE-22469) | *Minor* | **replace md5 checksum in saveVersion script with sha512 for hbase version information**
4829
4830 The HBase "source checksum" now uses SHA512 instead of MD5.
4831
4832
4833 ---
4834
4835 * [HBASE-22148](https://issues.apache.org/jira/browse/HBASE-22148) | *Blocker* | **Provide an alternative to CellUtil.setTimestamp**
4836
4837 <!-- markdown -->
4838
4839 The `CellUtil.setTimestamp` method changes to be an API with audience `LimitedPrivate(COPROC)` in HBase 3.0. With that designation the API should remain stable within a given minor release line, but may change between minor releases.
4840
4841 Previously, this method was deprecated in HBase 2.0 for removal in HBase 3.0. Deprecation messages in HBase 2.y releases have been updated to indicate the expected API audience change.
4842
4843
4844 ---
4845
4846 * [HBASE-20782](https://issues.apache.org/jira/browse/HBASE-20782) | *Minor* | **Fix duplication of TestServletFilter.access**
4847
The access method was moved to the HttpServerFunctionalTest class as a common place.
4849
4850
4851 ---
4852
4853 * [HBASE-21991](https://issues.apache.org/jira/browse/HBASE-21991) | *Major* | **Fix MetaMetrics issues - [Race condition, Faulty remove logic], few improvements**
4854
4855 The class LossyCounting was unintentionally marked Public but was never intended to be part of our public API. This oversight has been corrected and LossyCounting is now marked as Private and going forward may be subject to additional breaking changes or removal without notice. If you have taken a dependency on this class we recommend cloning it locally into your project before upgrading to this release.
4856
4857
4858 ---
4859
4860 * [HBASE-22226](https://issues.apache.org/jira/browse/HBASE-22226) | *Trivial* | **Incorrect level for headings in asciidoc**
4861
4862 Warnings for level headings are corrected in the book for the HBase Incompatibilities section.
4863
4864
4865 ---
4866
4867 * [HBASE-20970](https://issues.apache.org/jira/browse/HBASE-20970) | *Major* | **Update hadoop check versions for hadoop3 in hbase-personality**
4868
Add hadoop 3.0.3, 3.1.1 and 3.1.2 to our hadoop check jobs.
4870
4871
4872 ---
4873
4874 * [HBASE-21784](https://issues.apache.org/jira/browse/HBASE-21784) | *Major* | **Dump replication queue should show list of wal files ordered chronologically**
4875
4876 The DumpReplicationQueues tool will now list replication queues sorted in chronological order.
4877
4878
4879 ---
4880
4881 * [HBASE-21048](https://issues.apache.org/jira/browse/HBASE-21048) | *Major* | **Get LogLevel is not working from console in secure environment**
4882
4883 Support get\|set LogLevel in secure(kerberized) environment.
4884
4885
4886 ---
4887
4888 * [HBASE-22384](https://issues.apache.org/jira/browse/HBASE-22384) | *Minor* | **Formatting issues in administration section of book**
4889
Fixes a formatting issue in the administration section of the book, where listing indentation was slightly off.
4891
4892
4893 ---
4894
4895 * [HBASE-22377](https://issues.apache.org/jira/browse/HBASE-22377) | *Major* | **Provide API to check the existence of a namespace which does not require ADMIN permissions**
4896
4897 This change adds the new method listNamespaces to the Admin interface, which can be used to retrieve a list of the namespaces present in the schema as an unprivileged operation. Formerly the only available method for accomplishing this was listNamespaceDescriptors, which requires GLOBAL CREATE or ADMIN permissions.
4898
4899
4900 ---
4901
4902 * [HBASE-22399](https://issues.apache.org/jira/browse/HBASE-22399) | *Major* | **Change default hadoop-two.version to 2.8.x and remove the 2.7.x hadoop checks**
4903
The default hadoop-two.version has been changed to 2.8.5, and hadoop versions before 2.8.2 (exclusive) are no longer supported.
4905
4906
4907 ---
4908
4909 * [HBASE-22392](https://issues.apache.org/jira/browse/HBASE-22392) | *Trivial* | **Remove extra/useless +**
4910
4911 Removed extra + in HRegion, HStore and LoadIncrementalHFiles for branch-2 and HRegion and HStore for branch-1.
4912
4913
4914 ---
4915
4916 * [HBASE-20494](https://issues.apache.org/jira/browse/HBASE-20494) | *Major* | **Upgrade com.yammer.metrics dependency**
4917
4918 Updated metrics core from 3.2.1 to 3.2.6.
4919
4920
4921 ---
4922
4923 * [HBASE-22358](https://issues.apache.org/jira/browse/HBASE-22358) | *Minor* | **Change rubocop configuration for method length**
4924
4925 The rubocop definition for the maximum method length was set to 75.
4926
4927
4928 ---
4929
4930 * [HBASE-22379](https://issues.apache.org/jira/browse/HBASE-22379) | *Minor* | **Fix Markdown for "Voting on Release Candidates" in book**
4931
4932 Fixes the formatting of the "Voting on Release Candidates" to actually show the quote and code formatting of the RAT check.
4933
4934
4935 ---
4936
4937 * [HBASE-20851](https://issues.apache.org/jira/browse/HBASE-20851) | *Minor* | **Change rubocop config for max line length of 100**
4938
The rubocop configuration in the hbase-shell module now allows a line length of 100 characters, instead of 80 as before. For everything before 2.1.5 this change introduces rubocop itself.
4940
4941
4942 ---
4943
4944 * [HBASE-22301](https://issues.apache.org/jira/browse/HBASE-22301) | *Minor* | **Consider rolling the WAL if the HDFS write pipeline is slow**
4945
4946 This change adds new conditions for rolling the WAL for when syncs on the HDFS writer pipeline are perceived to be slow.
4947
4948 As before the configuration parameter hbase.regionserver.wal.slowsync.ms sets the slow sync warning threshold.
4949
4950 If we encounter hbase.regionserver.wal.slowsync.roll.threshold number of slow syncs (default 100) within the interval defined by hbase.regionserver.wal.slowsync.roll.interval.ms (default 1 minute), we will request a WAL roll.
4951
4952 Or, if the time for any sync exceeds the threshold set by hbase.regionserver.wal.roll.on.sync.ms (default 10 seconds) we will request a WAL roll immediately.
4953
4954 Operators can monitor how often these new thresholds result in a WAL roll by looking at newly added metrics to the WAL related metric group:
4955 \* slowSyncRollRequest - How many times a roll was requested due to sync too slow on the write pipeline.
4956
4957 Additionally, as a part of this change there are also additional metrics for existing reasons for a WAL roll:
4958 \* errorRollRequest - How many times a roll was requested due to I/O or other errors.
4959 \* sizeRollRequest - How many times a roll was requested due to file size roll threshold.
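
The two roll conditions described above can be expressed as a small decision routine. This is a simplified sketch, not the WAL implementation: `SlowSyncRollSketch` is a hypothetical class, the sliding window kept in a deque is an assumption about how "within the interval" could be tracked, and the constructor arguments merely correspond to the four configuration keys named in the comments.

```java
import java.util.ArrayDeque;
import java.util.Deque;

final class SlowSyncRollSketch {
  private final long slowSyncMs;     // hbase.regionserver.wal.slowsync.ms
  private final int rollThreshold;   // hbase.regionserver.wal.slowsync.roll.threshold
  private final long rollIntervalMs; // hbase.regionserver.wal.slowsync.roll.interval.ms
  private final long rollOnSyncMs;   // hbase.regionserver.wal.roll.on.sync.ms
  private final Deque<Long> slowSyncTimes = new ArrayDeque<>();

  SlowSyncRollSketch(long slowSyncMs, int rollThreshold, long rollIntervalMs, long rollOnSyncMs) {
    this.slowSyncMs = slowSyncMs;
    this.rollThreshold = rollThreshold;
    this.rollIntervalMs = rollIntervalMs;
    this.rollOnSyncMs = rollOnSyncMs;
  }

  // Records one sync that finished at `nowMs` after taking `syncDurationMs`
  // and reports whether a WAL roll should be requested.
  boolean shouldRequestRoll(long nowMs, long syncDurationMs) {
    if (syncDurationMs > rollOnSyncMs) {
      slowSyncTimes.clear();
      return true; // a single very slow sync requests a roll immediately
    }
    if (syncDurationMs > slowSyncMs) {
      slowSyncTimes.addLast(nowMs);
    }
    // Forget slow syncs that fall outside the interval.
    while (!slowSyncTimes.isEmpty() && nowMs - slowSyncTimes.peekFirst() > rollIntervalMs) {
      slowSyncTimes.removeFirst();
    }
    if (slowSyncTimes.size() >= rollThreshold) {
      slowSyncTimes.clear(); // start counting afresh after requesting a roll
      return true;
    }
    return false;
  }

  public static void main(String[] args) {
    // Threshold, interval and roll-on-sync use the defaults quoted above;
    // the 100 ms slow-sync warning threshold is an illustrative value.
    SlowSyncRollSketch policy = new SlowSyncRollSketch(100, 100, 60_000, 10_000);
    System.out.println(policy.shouldRequestRoll(0, 50));     // fast sync
    System.out.println(policy.shouldRequestRoll(1, 12_000)); // exceeds roll.on.sync.ms
  }
}
```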
4960
4961
4962 ---
4963
4964 * [HBASE-21883](https://issues.apache.org/jira/browse/HBASE-21883) | *Minor* | **Enhancements to Major Compaction tool**
4965
The MajorCompactorTTL tool compacts all regions in a table whose data has been TTLed out. This saves space on DFS and is useful for tables which are similar to time series data. It is typically scheduled to run frequently (say via cron) to clean up old data on an ongoing basis.
4967
4968 RSGroupMajorCompactionTTL tool is similar to MajorCompactorTTL but runs at a region server group level. If multiple tables in an rsgroup are similar to time-series data, then it runs a single command to clean them up. As more tables are added/removed from rsgroup, it's easy to have a single command to take care of all of them.
4969
4970
4971 ---
4972
4973 * [HBASE-22054](https://issues.apache.org/jira/browse/HBASE-22054) | *Minor* | **Space Quota: Compaction is not working for super user in case of NO\_WRITES\_COMPACTIONS**
4974
4975 This change allows the system and superusers to initiate compactions, even when a space quota violation policy disallows compactions from happening. The original intent behind disallowing of compactions was to prevent end-user compactions from creating undue I/O load, not disallowing \*any\* compaction in the system.
4976
4977
4978 ---
4979
4980 * [HBASE-22083](https://issues.apache.org/jira/browse/HBASE-22083) | *Minor* | **move eclipse specific configs into a profile**
4981
4982 <!-- markdown -->
4983 Maven project integration for Eclipse has been isolated into a maven profile to ensure it only is active when in an Eclipse project.
4984
4985 Things should continue to behave the same for Eclipse users. If something should go wrong folks should manually activate the `eclipse-specific` profile.
4986
4987
4988 ---
4989
4990 * [HBASE-22307](https://issues.apache.org/jira/browse/HBASE-22307) | *Major* | **Deprecated Preemptive Fail Fast**
4991
Deprecated the Preemptive Fail Fast related constants in HConstants. Support for this feature will be removed in 3.0.0, so using these constants will have no effect in 3.0.0+ releases. The constants will be kept until 4.0.0.
4993
4994 Users can use 'hbase.client.perserver.requests.threshold' to control the number of concurrent requests to the same region server. Please see the release note of HBASE-16388 for more details.
4995
4996
4997 ---
4998
4999 * [HBASE-22292](https://issues.apache.org/jira/browse/HBASE-22292) | *Blocker* | **PreemptiveFastFailInterceptor clean repeatedFailuresMap issue**
5000
5001 Adds new configuration hbase.client.failure.map.cleanup.interval which defaults to ten minutes.
5002
5003
5004 ---
5005
5006 * [HBASE-19222](https://issues.apache.org/jira/browse/HBASE-19222) | *Major* | **update jruby to 9.1.17.0**
5007
5008 <!-- markdown -->
5009
5010 The default version of JRuby shipped with HBase has been updated to the JRuby 9.1.17.0 release.
5011
5012 For details on changes see [the release notes for JRuby 9.1.17.0](https://www.jruby.org/2018/04/23/jruby-9-1-17-0)
5013
5014
5015 ---
5016
5017 * [HBASE-22279](https://issues.apache.org/jira/browse/HBASE-22279) | *Major* | **Add a getRegionLocator method in Table/AsyncTable interface**
5018
5019 Add below method in Table interface:
5020
5021 RegionLocator getRegionLocator() throws IOException;
5022
5023 Add below methods in AsyncTable interface:
5024
5025 AsyncTableRegionLocator getRegionLocator();
5026 CompletableFuture\<TableDescriptor\> getDescriptor();
5027
5028
5029 ---
5030
5031 * [HBASE-15560](https://issues.apache.org/jira/browse/HBASE-15560) | *Major* | **TinyLFU-based BlockCache**
5032
5033 LruBlockCache uses the Segmented LRU (SLRU) policy to capture frequency and recency of the working set. It achieves concurrency by using an O(n) background thread to prioritize the entries and evict. Accessing an entry is O(1) by a hash table lookup, recording its logical access time, and setting a frequency flag. A write is performed in O(1) time by updating the hash table and triggering an async eviction thread. This provides ideal concurrency and minimizes the latencies by penalizing the thread instead of the caller. However the policy does not age the frequencies and may not be resilient to various workload patterns.
5034
This change introduces a new L1 policy, TinyLfuBlockCache, which records the frequency in a counting sketch, ages periodically by halving the counters, and orders entries by SLRU. An entry is discarded by comparing the frequency of the new arrival to the SLRU's victim, and keeping the one with the higher frequency. This allows the operations to be performed in O(1) time and, through the use of a compact sketch, a much larger history is retained beyond the current working set. In a variety of real world traces the policy had near optimal hit rates.
5036
5037 New configuration variable hfile.block.cache.policy sets the eviction policy for the L1 block cache. The default is "LRU" (LruBlockCache). Set to "TinyLFU" to use TinyLfuBlockCache instead.
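
The admission rule and the aging step described above can be illustrated with a deliberately simplified model. `FrequencyAdmissionSketch` is a hypothetical class; it swaps the compact counting sketch for an exact `HashMap` of counters (so it does not have the memory profile of the real policy) and omits the SLRU ordering entirely. It only shows "halve the counters periodically" and "keep whichever of candidate and victim is more frequent".

```java
import java.util.HashMap;
import java.util.Map;

final class FrequencyAdmissionSketch {
  // Exact counters stand in for the compact counting sketch.
  private final Map<String, Integer> counts = new HashMap<>();
  private final int sampleSize; // recorded accesses between aging passes
  private int recorded;

  FrequencyAdmissionSketch(int sampleSize) {
    this.sampleSize = sampleSize;
  }

  void recordAccess(String key) {
    counts.merge(key, 1, Integer::sum);
    if (++recorded >= sampleSize) {
      age();
    }
  }

  int frequency(String key) {
    return counts.getOrDefault(key, 0);
  }

  // The new arrival is admitted only if it is more frequent than the victim.
  boolean admit(String candidate, String victim) {
    return frequency(candidate) > frequency(victim);
  }

  // Aging: halve every counter so that old popularity fades over time.
  private void age() {
    counts.replaceAll((key, count) -> count / 2);
    counts.values().removeIf(count -> count == 0);
    recorded = 0;
  }

  public static void main(String[] args) {
    FrequencyAdmissionSketch policy = new FrequencyAdmissionSketch(1000);
    for (int i = 0; i < 3; i++) {
      policy.recordAccess("hot-block");
    }
    policy.recordAccess("one-off-scan-block");
    System.out.println(policy.admit("hot-block", "one-off-scan-block"));
    System.out.println(policy.admit("one-off-scan-block", "hot-block"));
  }
}
```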
5038
5039
5040 ---
5041
5042 * [HBASE-22178](https://issues.apache.org/jira/browse/HBASE-22178) | *Major* | **Introduce a createTableAsync with TableDescriptor method in Admin**
5043
5044 Introduced
5045
5046 Future\<Void\> createTableAsync(TableDescriptor);
5047
5048
5049 ---
5050
5051 * [HBASE-22108](https://issues.apache.org/jira/browse/HBASE-22108) | *Major* | **Avoid passing null in Admin methods**
5052
5053 Introduced these methods:
5054 void move(byte[]);
5055 void move(byte[], ServerName);
5056 Future\<Void\> splitRegionAsync(byte[]);
5057
5058 These methods are deprecated:
5059 void move(byte[], byte[])
5060
5061
5062 ---
5063
5064 * [HBASE-22152](https://issues.apache.org/jira/browse/HBASE-22152) | *Major* | **Create a jenkins file for yetus to processing GitHub PR**
5065
5066 Add a new jenkins file for running pre commit check for GitHub PR.
5067
5068
5069 ---
5070
5071 * [HBASE-22007](https://issues.apache.org/jira/browse/HBASE-22007) | *Major* | **Add restoreSnapshot and cloneSnapshot with acl methods in AsyncAdmin**
5072
5073 Add cloneSnapshot/restoreSnapshot with acl methods in AsyncAdmin.
5074
5075
5076 ---
5077
5078 * [HBASE-22123](https://issues.apache.org/jira/browse/HBASE-22123) | *Minor* | **REST gateway reports Insufficient permissions exceptions as 404 Not Found**
5079
When permissions are insufficient, you now get:
5081
5082 HTTP/1.1 403 Forbidden
5083
5084 on the HTTP side, and in the message
5085
5086 Forbidden
5087 org.apache.hadoop.hbase.security.AccessDeniedException: org.apache.hadoop.hbase.security.AccessDeniedException: Insufficient permissions for user ‘myuser',action: get, tableName:mytable, family:cf.
5088 at org.apache.ranger.authorization.hbase.RangerAuthorizationCoprocessor.authorizeAccess(RangerAuthorizationCoprocessor.java:547)
5089 and the rest of the ADE stack
5090
5091
5092 ---
5093
5094 * [HBASE-22100](https://issues.apache.org/jira/browse/HBASE-22100) | *Minor* | **False positive for error prone warnings in pre commit job**
5095
5096 The javac WARNING/ERROR lines are now sorted before the diff is generated in pre-commit, which gives stable output for Error Prone. The downside is that the output is sorted lexicographically, so line numbers are also ordered lexicographically, which looks a bit strange to a human reader.
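
A small Java sketch of what lexicographic ordering does to line numbers. The warning strings are made up for illustration and are not real pre-commit output; the class name `LexicographicSortDemo` is hypothetical.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

final class LexicographicSortDemo {
  /** Returns a copy sorted by plain String ordering, character by character. */
  static List<String> sortWarnings(List<String> warnings) {
    List<String> sorted = new ArrayList<>(warnings);
    Collections.sort(sorted);
    return sorted;
  }
}
```

Sorting `Foo.java:9: warning`, `Foo.java:10: warning` and `Foo.java:100: warning` this way places line 100 first, then line 10, then line 9, because the comparison is per character rather than numeric. The order is stable between runs, which is what the diff needs.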
5097
5098
5099 ---
5100
5101 * [HBASE-22057](https://issues.apache.org/jira/browse/HBASE-22057) | *Major* | **Impose upper-bound on size of ZK ops sent in a single multi()**
5102
5103 Exposes a new configuration property "zookeeper.multi.max.size" which dictates the maximum size of deletes that HBase will make to ZooKeeper in a single RPC. This property defaults to 1MB, which should fall beneath the default ZooKeeper limit of 2MB, controlled by "jute.maxbuffer".
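
The idea of bounding each multi() by size can be sketched as a simple greedy partition. This is an illustration only, not the HBase code: `SizeBoundedBatcher` and `partition` are hypothetical names, ops are represented just by their serialized size in bytes, and an op larger than the limit is assumed to go into a batch of its own.

```java
import java.util.ArrayList;
import java.util.List;

final class SizeBoundedBatcher {
  /**
   * Groups op sizes (in bytes) into batches whose totals stay within maxBytes.
   * An op that is larger than maxBytes on its own is placed in a batch by itself.
   */
  static List<List<Integer>> partition(List<Integer> opSizes, int maxBytes) {
    List<List<Integer>> batches = new ArrayList<>();
    List<Integer> current = new ArrayList<>();
    int currentBytes = 0;
    for (int size : opSizes) {
      // Close the current batch if adding this op would exceed the limit.
      if (!current.isEmpty() && currentBytes + size > maxBytes) {
        batches.add(current);
        current = new ArrayList<>();
        currentBytes = 0;
      }
      current.add(size);
      currentBytes += size;
    }
    if (!current.isEmpty()) {
      batches.add(current);
    }
    return batches;
  }
}
```

Each resulting batch would then be sent as one multi() call, so a large delete is split across several RPCs instead of one oversized request.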
5104
5105
5106 ---
5107
5108 * [HBASE-22052](https://issues.apache.org/jira/browse/HBASE-22052) | *Major* | **pom cleaning; filter out jersey-core in hadoop2 to match hadoop3 and remove redunant version specifications**
5109
5110 <!-- markdown -->
5111 Fixed awkward dependency issue that prevented site building.
5112
5113 #### note specific to HBase 2.1.4
5114 HBase 2.1.4 shipped with an early version of this fix that incorrectly altered the libraries included in our binary assembly for using Apache Hadoop 2.7 (the current build default Hadoop version for 2.1.z). For folks running out of the box against a Hadoop 2.7 cluster (or folks who skip the installation step of [replacing the bundled Hadoop libraries](http://hbase.apache.org/book.html#hadoop)) this will result in a failure at Region Server startup due to a missing class definition. e.g.:
5115 ```
5116 2019-03-27 09:02:05,779 ERROR [main] regionserver.HRegionServer: Failed construction RegionServer
5117 java.lang.NoClassDefFoundError: org/apache/htrace/SamplerBuilder
5118         at org.apache.hadoop.hdfs.DFSClient.<init>(DFSClient.java:644)
5119         at org.apache.hadoop.hdfs.DFSClient.<init>(DFSClient.java:628)
5120         at org.apache.hadoop.hdfs.DistributedFileSystem.initialize(DistributedFileSystem.java:149)
5121         at org.apache.hadoop.fs.FileSystem.createFileSystem(FileSystem.java:2667)
5122         at org.apache.hadoop.fs.FileSystem.access$200(FileSystem.java:93)
5123         at org.apache.hadoop.fs.FileSystem$Cache.getInternal(FileSystem.java:2701)
5124         at org.apache.hadoop.fs.FileSystem$Cache.get(FileSystem.java:2683)
5125         at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:372)
5126         at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:171)
5127         at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:356)
5128         at org.apache.hadoop.fs.Path.getFileSystem(Path.java:295)
5129         at org.apache.hadoop.hbase.util.CommonFSUtils.getRootDir(CommonFSUtils.java:362)
5130         at org.apache.hadoop.hbase.util.CommonFSUtils.isValidWALRootDir(CommonFSUtils.java:411)
5131         at org.apache.hadoop.hbase.util.CommonFSUtils.getWALRootDir(CommonFSUtils.java:387)
5132         at org.apache.hadoop.hbase.regionserver.HRegionServer.initializeFileSystem(HRegionServer.java:704)
5133         at org.apache.hadoop.hbase.regionserver.HRegionServer.<init>(HRegionServer.java:613)
5134         at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
5135         at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:62)
5136         at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)
5137         at java.lang.reflect.Constructor.newInstance(Constructor.java:423)
5138         at org.apache.hadoop.hbase.regionserver.HRegionServer.constructRegionServer(HRegionServer.java:3029)
5139         at org.apache.hadoop.hbase.regionserver.HRegionServerCommandLine.start(HRegionServerCommandLine.java:63)
5140         at org.apache.hadoop.hbase.regionserver.HRegionServerCommandLine.run(HRegionServerCommandLine.java:87)
5141         at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
5142         at org.apache.hadoop.hbase.util.ServerCommandLine.doMain(ServerCommandLine.java:149)
5143         at org.apache.hadoop.hbase.regionserver.HRegionServer.main(HRegionServer.java:3047)
5144 Caused by: java.lang.ClassNotFoundException: org.apache.htrace.SamplerBuilder
5145         at java.net.URLClassLoader.findClass(URLClassLoader.java:381)
5146         at java.lang.ClassLoader.loadClass(ClassLoader.java:424)
5147         at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:349)
5148         at java.lang.ClassLoader.loadClass(ClassLoader.java:357)
5149         ... 26 more
5150
5151 ```
5152
5153 Workaround via any _one_ of the following:
5154 * If you are running against a Hadoop cluster that is 2.8+, ensure you replace the Hadoop libraries in the default binary assembly with those for your version.
5155 * If you are running against a Hadoop cluster that is 2.8+, build the binary assembly from the source release while specifying your Hadoop version.
5156 * If you are running against a Hadoop cluster that is a supported 2.7 release, ensure the `hadoop` executable is in the `PATH` seen at Region Server startup and that you are not using the `HBASE_DISABLE_HADOOP_CLASSPATH_LOOKUP` bypass.
5157 * For any supported Hadoop version, manually make the Apache HTrace artifact `htrace-core-3.1.0-incubating.jar` available to all Region Servers via the HBASE_CLASSPATH environment variable.
5158 * For any supported Hadoop version, manually make the Apache HTrace artifact `htrace-core-3.1.0-incubating.jar` available to all Region Servers by copying it into the directory `${HBASE_HOME}/lib/client-facing-thirdparty/`.
5159
5160
5161 ---
5162
5163 * [HBASE-22065](https://issues.apache.org/jira/browse/HBASE-22065) | *Major* | **Add listTableDescriptors(List\<TableName\>) method in AsyncAdmin**
5164
5165 Add a listTableDescriptors(List\<TableName\>) method in the AsyncAdmin interface, to align with the Admin interface.
5166
5167
5168 ---
5169
5170 * [HBASE-22063](https://issues.apache.org/jira/browse/HBASE-22063) | *Major* | **Deprecated Admin.deleteSnapshot(byte[])**
5171
5172 Deprecates Admin.deleteSnapshot(byte[]); please use the String version instead.
5173
5174
5175 ---
5176
5177 * [HBASE-22040](https://issues.apache.org/jira/browse/HBASE-22040) | *Major* | **Add mergeRegionsAsync with a List of region names method in AsyncAdmin**
5178
5179 Add a mergeRegionsAsync(byte[][], boolean) method in the AsyncAdmin interface.
5180
5181 Instead of using assert, the client side now throws IllegalArgumentException when you try to merge fewer than 2 regions. Likewise, the master side now throws DoNotRetryIOException instead of using assert when you try to merge more than 2 regions, since only merging two regions at once is supported for now.
5182
5183
5184 ---
5185
5186 * [HBASE-22039](https://issues.apache.org/jira/browse/HBASE-22039) | *Major* | **Should add the synchronous parameter for the XXXSwitch method in AsyncAdmin**
5187
5188 Add a drainXXX parameter for the balancerSwitch/splitSwitch/mergeSwitch methods in the AsyncAdmin interface, which has the same meaning as the synchronous parameter for these methods in the Admin interface.
5189
5190
5191 ---
5192
5193 * [HBASE-22044](https://issues.apache.org/jira/browse/HBASE-22044) | *Major* | **ByteBufferUtils should not be IA.Public API**
5194
5195 <!-- markdown -->
5196
5197 As of HBase 3.0, the ByteBufferUtils class is now marked as a Private API for internal project use only. Downstream users are advised that it no longer has any compatibility promises across releases.
5198
5199 In earlier HBase release lines, the class is now marked as deprecated to call attention to this planned transition.
5200
5201
5202 ---
5203
5204 * [HBASE-21810](https://issues.apache.org/jira/browse/HBASE-21810) | *Major* | **bulkload  support set hfile compression on client**
5205
5206 Bulkload (HFileOutputFormat2) now supports configuring the compression on the client. Set the job configuration "hbase.mapreduce.hfileoutputformat.compression" to override the auto-detection of the target table's compression.
5207
5208
5209 ---
5210
5211 * [HBASE-22001](https://issues.apache.org/jira/browse/HBASE-22001) | *Major* | **Polish the Admin interface**
5212
5213 Add a cloneSnapshotAsync method with restoreAcl parameter.
5214 Deprecated restoreSnapshotAsync method as it just ignores the failsafe configuration.
5215 Make snapshotAsync method return a Future\<Void\>.
5216 Deprecated the snapshot related methods which take a 'byte[]' as the snapshot name.
5217 Use default methods to reduce the code base for implementation classes.
5218
5219
5220 ---
5221
5222 * [HBASE-22000](https://issues.apache.org/jira/browse/HBASE-22000) | *Major* | **Deprecated isTableAvailable with splitKeys**
5223
5224 Deprecated AsyncTable.isTableAvailable(TableName, byte[][]).
5225
5226
5227 ---
5228
5229 * [HBASE-21871](https://issues.apache.org/jira/browse/HBASE-21871) | *Major* | **Support to specify a peer table name in VerifyReplication tool**
5230
5231 After HBASE-21871, we can specify a peer table name with --peerTableName in VerifyReplication tool like the following:
5232 hbase org.apache.hadoop.hbase.mapreduce.replication.VerifyReplication --peerTableName=peerTable 5 TestTable
5233
5234 In addition, we can compare any 2 tables in any remote clusters by specifying both peerId and --peerTableName.
5235
5236 For example:
5237 hbase org.apache.hadoop.hbase.mapreduce.replication.VerifyReplication --peerTableName=peerTable zk1,zk2,zk3:2181/hbase TestTable
5238
5239
5240 ---
5241
5242 * [HBASE-15728](https://issues.apache.org/jira/browse/HBASE-15728) | *Major* | **Add remaining per-table region / store / flush / compaction related metrics**
5243
5244 Adds the flush, split, and compaction metrics below:
5245
5246     // split related metrics
5247     private MutableFastCounter splitRequest;
5248     private MutableFastCounter splitSuccess;
5249     private MetricHistogram splitTimeHisto;
5250
5251     // flush related metrics
5252     private MetricHistogram flushTimeHisto;
5253     private MetricHistogram flushMemstoreSizeHisto;
5254     private MetricHistogram flushOutputSizeHisto;
5255     private MutableFastCounter flushedMemstoreBytes;
5256     private MutableFastCounter flushedOutputBytes;
5257
5258     // compaction related metrics
5259     private MetricHistogram compactionTimeHisto;
5260     private MetricHistogram compactionInputFileCountHisto;
5261     private MetricHistogram compactionInputSizeHisto;
5262     private MetricHistogram compactionOutputFileCountHisto;
5263     private MetricHistogram compactionOutputSizeHisto;
5264     private MutableFastCounter compactedInputBytes;
5265     private MutableFastCounter compactedOutputBytes;
5266
5267     private MetricHistogram majorCompactionTimeHisto;
5268     private MetricHistogram majorCompactionInputFileCountHisto;
5269     private MetricHistogram majorCompactionInputSizeHisto;
5270     private MetricHistogram majorCompactionOutputFileCountHisto;
5271     private MetricHistogram majorCompactionOutputSizeHisto;
5272     private MutableFastCounter majorCompactedInputBytes;
5273     private MutableFastCounter majorCompactedOutputBytes;
5274
5275
5276 ---
5277
5278 * [HBASE-21481](https://issues.apache.org/jira/browse/HBASE-21481) | *Major* | **[acl] Superuser's permissions should not be granted or revoked by any non-su global admin**
5279
5280 HBASE-21481 improves the quality of access control by strengthening the protection of superusers' privileges.
5281
5282
5283 ---
5284
5285 * [HBASE-21082](https://issues.apache.org/jira/browse/HBASE-21082) | *Critical* | **Reimplement assign/unassign related procedure metrics**
5286
5287 There are now four types of RIT procedure metrics: assign, unassign, move, and reopen. The meaning of assign/unassign has changed, as moving a region no longer increases the unassign metric and then the assign metric.
5288 This also introduces two new procedure metrics, open and close, which track the open/close region calls to the region server. Open/close may be sent multiple times to finish a RIT because of retries.
5289
5290
5291 ---
5292
5293 * [HBASE-20724](https://issues.apache.org/jira/browse/HBASE-20724) | *Critical* | **Sometimes some compacted storefiles are still opened after region failover**
5294
5295 Problem: This is an old problem dating back to HBASE-2231. The compaction event marker was only written to the WAL. But after a flush, the WAL may be archived, which means a useful compaction event marker may be deleted, too. So the compacted store files cannot be archived when the region opens and replays the WAL.
5296
5297 Solution: After this jira, the compaction event tracker is written to the HFile. When a region opens and loads its store files, it reads the compaction event tracker from the HFile and archives the compacted store files that still exist.
5298
5299
5300 ---
5301
5302 * [HBASE-21820](https://issues.apache.org/jira/browse/HBASE-21820) | *Major* | **Implement CLUSTER quota scope**
5303
5304 HBase contains two quota scopes: MACHINE and CLUSTER. Before this patch, set quota operations did not expose the scope option to the client API and used MACHINE as the default, so CLUSTER scope could not be set or used.
5305 Shell commands were as follows:
5306 set\_quota TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec'
5307
5308 This issue implements CLUSTER scope in a simple way: for user, namespace, and user over namespace quotas, [ClusterLimit / RSNum] is used as the machine limit. For table and user over table quotas, [ClusterLimit / TotalTableRegionNum \* MachineTableRegionNum] is used as the machine limit.
5309 After this patch, users can set a CLUSTER scope quota, but MACHINE is still the default if the scope is omitted.
5310 Shell commands are as follows:
5311 set\_quota TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec'
5312 set\_quota TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec', SCOPE =\> MACHINE
5313 set\_quota TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec', SCOPE =\> CLUSTER
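
The two formulas above can be worked through with a short Java sketch. The class and method names are hypothetical and only mirror the bracketed expressions in this note; they are not the HBase quota classes, and the counts are assumed to be greater than zero.

```java
final class ClusterQuotaMath {
  /** User, namespace, and user-over-namespace quotas: ClusterLimit / RSNum. */
  static double perServerLimit(double clusterLimit, int regionServerCount) {
    return clusterLimit / regionServerCount;
  }

  /**
   * Table and user-over-table quotas:
   * ClusterLimit / TotalTableRegionNum * MachineTableRegionNum.
   */
  static double perServerTableLimit(double clusterLimit, int totalTableRegions,
      int regionsOnThisServer) {
    return clusterLimit / totalTableRegions * regionsOnThisServer;
  }
}
```

For example, a CLUSTER limit of 100 requests per second on 4 region servers gives each server 25. For a table with 10 regions, a server hosting 3 of them gets a limit of 30.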
5314
5315
5316 ---
5317
5318 * [HBASE-21057](https://issues.apache.org/jira/browse/HBASE-21057) | *Minor* | **upgrade to latest spotbugs**
5319
5320 Change spotbugs version to 3.1.11.
5321
5322
5323 ---
5324
5325 * [HBASE-21505](https://issues.apache.org/jira/browse/HBASE-21505) | *Major* | **Several inconsistencies on information reported for Replication Sources by hbase shell status 'replication' command.**
5326
5327 This modifies "status 'replication'" output, fixing inconsistencies on the reporting times and ages of last shipped edits, as well as wrong calculation of replication lags.
5328
5329 It also introduces additional info for each recovery queue, which was not accounted for by this command before.
5330
5331 The new output for "status 'replication'" command is explained in detail below:
5332 a) Source started, target stopped, no edits arrived on source yet:
5333 ...
5334  SOURCE: PeerID=1
5335          Normal Queue: 1
5336            No Ops shipped since last restart, SizeOfLogQueue=1, No edits for this source since it started, Replication Lag=0
5337 ...
5338 b) Source started, target stopped, add edit on source:
5339 ...
5340 Normal Queue: 1
5341            No Ops shipped since last restart, SizeOfLogQueue=1, TimeStampOfLastArrivedInSource=Wed Nov 21 07:21:00 GMT 2018, Replication Lag=2459
5342 ...
5343 c) Source started, target stopped, edit added on source, restart source:
5344 ...
5345 SOURCE: PeerID=1
5346          Normal Queue: 1
5347            No Ops shipped since last restart, SizeOfLogQueue=1, No edits for this source since it started, Replication Lag=0
5348          Recovered Queue: 1-hbase01.home,16020,1542784524057
5349            No Ops shipped since last restart, SizeOfLogQueue=1, TimeStampOfLastArrivedInSource=Wed Nov 21 07:23:00 GMT 2018, Replication Lag=201495
5350 ...
5351 d) Source started, target stopped, add edit on source, restart source, add another edit on source:
5352 ...
5353 SOURCE: PeerID=1
5354          Normal Queue: 1
5355            No Ops shipped since last restart, SizeOfLogQueue=1, TimeStampOfLastArrivedInSource=Wed Nov 21 07:02:28 GMT 2018, Replication Lag=6349
5356          Recovered Queue: 1-hbase01.home,16020,1542782758742
5357            No Ops shipped since last restart, SizeOfLogQueue=0, TimeStampOfLastArrivedInSource=Wed Nov 21 06:53:05 GMT 2018, Replication Lag=569394
5358 ...
5359 e) Source started, target stopped, add edit on source, restart source, add another edit on source, start target:
5360 ...
5361        SOURCE: PeerID=1
5362          Normal Queue: 1
5363            AgeOfLastShippedOp=30000, TimeStampOfLastShippedOp=Wed Nov 21 07:07:58 GMT 2018, SizeOfLogQueue=1, TimeStampOfLastArrivedInSource=Wed Nov 21 07:02:28 GMT 2018, Replication Lag=0
5364 ...
5365 f) Source started, target stopped, add edit on source, restart source, restart target:
5366 ...
5367 SOURCE: PeerID=1
5368          Normal Queue: 1
5369            No Ops shipped since last restart, SizeOfLogQueue=1, No edits for this source since it started, Replication Lag=0
5370 ...
5371
5372
5373 ---
5374
5375 * [HBASE-21922](https://issues.apache.org/jira/browse/HBASE-21922) | *Major* | **BloomContext#sanityCheck may failed when use ROWPREFIX\_DELIMITED bloom filter**
5376
5377 Remove bloom filter type ROWPREFIX\_DELIMITED. It may be added back once a better solution is found.
5378
5379
5380 ---
5381
5382 * [HBASE-21783](https://issues.apache.org/jira/browse/HBASE-21783) | *Major* | **Support exceed user/table/ns throttle quota if region server has available quota**
5383
5384 Adds support for enabling or disabling exceed throttle quota. Exceed throttle quota means that a user can consume more than the user/namespace/table quota if the region server has additional available quota because other users are not consuming it at the same time.
5385 Use the following shell commands to enable/disable exceed throttle quota: enable\_exceed\_throttle\_quota
5386 disable\_exceed\_throttle\_quota
5387 There are two limits when enabling exceed throttle quota:
5388 1. At least one read and one write region server throttle quota must be set;
5389 2. All region server throttle quotas must use the seconds time unit. Once previous requests exceed their quota and consume region server quota, a quota in another time unit may take a long time to refill, which may affect later requests.
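
The two preconditions can be expressed as a small check. This Java sketch is illustrative only: `ExceedThrottlePrecheck` and `RsQuota` are hypothetical types that reduce a region server throttle quota to whether it limits reads, whether it limits writes, and its time unit. They are not part of the HBase API.

```java
import java.util.List;
import java.util.concurrent.TimeUnit;

final class ExceedThrottlePrecheck {
  /** Hypothetical, simplified view of one region server throttle quota. */
  static final class RsQuota {
    final boolean limitsReads;
    final boolean limitsWrites;
    final TimeUnit unit;

    RsQuota(boolean limitsReads, boolean limitsWrites, TimeUnit unit) {
      this.limitsReads = limitsReads;
      this.limitsWrites = limitsWrites;
      this.unit = unit;
    }
  }

  /** True when both limits from the note hold for the given quotas. */
  static boolean canEnable(List<RsQuota> regionServerQuotas) {
    boolean hasRead = false;
    boolean hasWrite = false;
    for (RsQuota quota : regionServerQuotas) {
      if (quota.unit != TimeUnit.SECONDS) {
        return false; // limit 2: every quota must use the seconds time unit
      }
      hasRead |= quota.limitsReads;
      hasWrite |= quota.limitsWrites;
    }
    return hasRead && hasWrite; // limit 1: at least one read and one write quota
  }
}
```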
5390
5391
5392 ---
5393
5394 * [HBASE-20587](https://issues.apache.org/jira/browse/HBASE-20587) | *Major* | **Replace Jackson with shaded thirdparty gson**
5395
5396 Remove jackson dependencies from most hbase modules except hbase-rest, use shaded gson instead. The output json will be a bit different since jackson can use getter/setter, but gson will always use the fields.
5397
5398
5399 ---
5400
5401 * [HBASE-21928](https://issues.apache.org/jira/browse/HBASE-21928) | *Major* | **Deprecated HConstants.META\_QOS**
5402
5403 Mark HConstants.META\_QOS as deprecated. It is for internal use only and is the highest priority. You should not try to set a priority greater than or equal to this value; doing so is harmless but also useless.
5404
5405
5406 ---
5407
5408 * [HBASE-17942](https://issues.apache.org/jira/browse/HBASE-17942) | *Major* | **Disable region splits and merges per table**
5409
5410 This patch adds the ability to disable split and/or merge for a table (By default, split and merge are enabled for a table).
5411
5412
5413 ---
5414
5415 * [HBASE-21636](https://issues.apache.org/jira/browse/HBASE-21636) | *Major* | **Enhance the shell scan command to support missing scanner specifications like ReadType, IsolationLevel etc.**
5416
5417 Allows the shell to set Scan options that were previously not exposed. See the additions in the scan help by typing the following in the hbase shell:
5418
5419 hbase\> help 'scan'
5420
5421
5422 ---
5423
5424 * [HBASE-21201](https://issues.apache.org/jira/browse/HBASE-21201) | *Major* | **Support to run VerifyReplication MR tool without peerid**
5425
5426 We can specify peerQuorumAddress instead of peerId in the VerifyReplication tool, so the tool no longer requires a peerId to be set up.
5427
5428 For example:
5429 hbase org.apache.hadoop.hbase.mapreduce.replication.VerifyReplication zk1,zk2,zk3:2181/hbase testTable
5430
5431
5432 ---
5433
5434 * [HBASE-21838](https://issues.apache.org/jira/browse/HBASE-21838) | *Major* | **Create a special ReplicationEndpoint just for verifying the WAL entries are fine**
5435
5436 Introduce a VerifyWALEntriesReplicationEndpoint which replicates nothing but only verifies if all the cells are valid.
5437 It can be used to catch bugs in WAL writing, as most of the time the WALs are not read again after being written unless a region server crashes.
5438
5439
5440 ---
5441
5442 * [HBASE-21764](https://issues.apache.org/jira/browse/HBASE-21764) | *Major* | **Size of in-memory compaction thread pool should be configurable**
5443
5444 Introduces a new config key: hbase.regionserver.inmemory.compaction.pool.size. The default value is 10. Use it to set the size of the in-memory compaction pool. Note that all memstores in one region server share the same pool, so if you have many regions in one region server, set this larger to compact faster for better read performance.
5445
5446
5447 ---
5448
5449 * [HBASE-21684](https://issues.apache.org/jira/browse/HBASE-21684) | *Major* | **Throw DNRIOE when connection or rpc client is closed**
5450
5451 Make StoppedRpcClientException extend DoNotRetryIOException.
5452
5453
5454 ---
5455
5456 * [HBASE-21739](https://issues.apache.org/jira/browse/HBASE-21739) | *Major* | **Move grant/revoke from regionserver to master**
5457
5458 To implement user permission control in Procedure V2, first move the grant and revoke methods from AccessController to the master.
5459 AccessController#grant and AccessController#revoke are marked as deprecated; please use Admin#grant and Admin#revoke instead.
5460
5461
5462 ---
5463
5464 * [HBASE-21791](https://issues.apache.org/jira/browse/HBASE-21791) | *Blocker* | **Upgrade thrift dependency to 0.12.0**
5465
5466 IMPORTANT: Due to security issues, all users who use hbase thrift should avoid using releases which do not have this fix.
5467
5468 The affected releases are:
5469 2.1.x: 2.1.2 and below
5470 2.0.x: 2.0.4 and below
5471 1.x: 1.4.x and below
5472
5473 If you are using one of the affected releases above, please consider upgrading to a newer release ASAP.
5474
5475
5476 ---
5477
5478 * [HBASE-20894](https://issues.apache.org/jira/browse/HBASE-20894) | *Major* | **Move BucketCache from java serialization to protobuf**
5479
5480 For users who have configured hbase.bucketcache.ioengine with either the file:, files:, or mmap: prefix, and configured it to be persistent via the hbase.bucketcache.persistent.path property, the serialization format of the bucket cache has changed between versions. The old state will not be read during startup, and there is currently no migration path. The impact is expected to be minimal, however, since the cache will rebuild over time as access patterns dictate.
5481
5482
5483
5484 # HBASE  2.3.0 Release Notes
5485
5486 These release notes cover new developer and user-facing incompatibilities, important issues, features, and major improvements.
5487
5488
5489 ---
5490
5491 * [HBASE-24631](https://issues.apache.org/jira/browse/HBASE-24631) | *Major* | **Loosen Dockerfile pinned package versions of the "debian-revision"**
5492
5493 <!-- markdown -->
5494 Update our package version numbers throughout the Dockerfiles to be pinned to their epoch:upstream-version components only. Previously we'd specify the full debian package version number, including the debian-revision. This led to instability as debian packaging details changed.
5495 See also [man deb-version](http://manpages.ubuntu.com/manpages/xenial/en/man5/deb-version.5.html)
5496
5497
5498 ---
5499
5500 * [HBASE-24205](https://issues.apache.org/jira/browse/HBASE-24205) | *Major* | **Create metric to know the number of reads that happens from memstore**
5501
5502 Adds a new metric that counts read requests (tracked per row) according to whether the row was fetched completely from the memstore or pulled from both files and the memstore.
5503 The metric is collected under the mbean for Tables and under the mbean for Regions.
5504 Under the table mbean, i.e.
5505 "name": "Hadoop:service=HBase,name=RegionServer,sub=Tables",
5506 the new metrics are listed as
5507
5508     "Namespace\_default\_table\_t3\_columnfamily\_f1\_metric\_memstoreOnlyRowReadsCount": 5,
5509     "Namespace\_default\_table\_t3\_columnfamily\_f1\_metric\_mixedRowReadsCount": 1,
5510
5511 where the format is Namespace\_\<namespacename\>\_table\_\<tableName\>\_columnfamily\_\<columnfamilyname\>\_metric\_memstoreOnlyRowReadsCount
5512 Namespace\_\<namespacename\>\_table\_\<tableName\>\_columnfamily\_\<columnfamilyname\>\_metric\_mixedRowReadsCount
5513
5514
5515 The same metric under the region mbean, i.e.
5516 "name": "Hadoop:service=HBase,name=RegionServer,sub=Regions",
5517 comes as
5518
5519     "Namespace\_default\_table\_t3\_region\_75a7846f4ac4a2805071a855f7d0dbdc\_store\_f1\_metric\_memstoreOnlyRowReadsCount": 5,
5520     "Namespace\_default\_table\_t3\_region\_75a7846f4ac4a2805071a855f7d0dbdc\_store\_f1\_metric\_mixedRowReadsCount": 1,
5521
5522 where the format is
5523 Namespace\_\<namespacename\>\_table\_\<tableName\>\_region\_\<regionName\>\_store\_\<storeName\>\_metric\_memstoreOnlyRowReadsCount
5524 Namespace\_\<namespacename\>\_table\_\<tableName\>\_region\_\<regionName\>\_store\_\<storeName\>\_metric\_mixedRowReadsCount
5525 This is aggregated per store: the number of reads served purely from the memstore, and the number of mixed reads served from both the memstore and files.
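
For tooling that scrapes these metrics, the two name formats can be assembled as in the Java sketch below. The helper class `MemstoreReadMetricNames` is hypothetical and simply concatenates the pieces in the order given by the formats in this note.

```java
final class MemstoreReadMetricNames {
  /** Name of a per-table, per-column-family metric under sub=Tables. */
  static String tableMetric(String namespace, String table, String family, String metric) {
    return "Namespace_" + namespace + "_table_" + table
        + "_columnfamily_" + family + "_metric_" + metric;
  }

  /** Name of a per-region, per-store metric under sub=Regions. */
  static String regionMetric(String namespace, String table, String region, String store,
      String metric) {
    return "Namespace_" + namespace + "_table_" + table
        + "_region_" + region + "_store_" + store + "_metric_" + metric;
  }
}
```

Calling `tableMetric("default", "t3", "f1", "mixedRowReadsCount")` produces the second table-level key shown in the example output in this note.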
5526
5527
5528 ---
5529
5530 * [HBASE-21773](https://issues.apache.org/jira/browse/HBASE-21773) | *Critical* | **rowcounter utility should respond to pleas for help**
5531
5532 This adds [-h\|-help] options to rowcounter. Passing either -h or -help prints the rowcounter usage guide, as shown below:
5533
5534 $hbase rowcounter -h
5535
5536 usage: hbase rowcounter \<tablename\> [options] [\<column1\> \<column2\>...]
5537 Options:
5538     --starttime=\<arg\>       starting time filter to start counting rows from.
5539     --endtime=\<arg\>         end time filter limit, to only count rows up to this timestamp.
5540     --range=\<arg\>           [startKey],[endKey][;[startKey],[endKey]...]]
5541     --expectedCount=\<arg\>   expected number of rows to be count.
5542 For performance, consider the following configuration properties:
5543 -Dhbase.client.scanner.caching=100
5544 -Dmapreduce.map.speculative=false
5545
5546
5547 ---
5548
5549 * [HBASE-24217](https://issues.apache.org/jira/browse/HBASE-24217) | *Major* | **Add hadoop 3.2.x support**
5550
5551 CI coverage has been extended to include Hadoop 3.2.x for HBase 2.2+.
5552
5553
5554 ---
5555
5556 * [HBASE-23055](https://issues.apache.org/jira/browse/HBASE-23055) | *Major* | **Alter hbase:meta**
5557
5558 Adds the ability to edit the hbase:meta table schema. For example,
5559
5560 hbase(main):006:0\> alter 'hbase:meta', {NAME =\> 'info', DATA\_BLOCK\_ENCODING =\> 'ROW\_INDEX\_V1'}
5561 Updating all regions with the new schema...
5562 All regions updated.
5563 Done.
5564 Took 1.2138 seconds
5565
5566 You can even add column families. However, you cannot delete any of the core hbase:meta column families such as 'info' and 'table'.
5567
5568
5569 ---
5570
5571 * [HBASE-15161](https://issues.apache.org/jira/browse/HBASE-15161) | *Major* | **Umbrella: Miscellaneous improvements from production usage**
5572
5573 This ticket summarizes significant improvements and expansion to the metrics surface area. Interested users should review the individual sub-tasks.
5574
5575
5576 ---
5577
5578 * [HBASE-24545](https://issues.apache.org/jira/browse/HBASE-24545) | *Major* | **Add backoff to SCP check on WAL split completion**
5579
5580 Adds backoff to the ServerCrashProcedure wait for WAL splitting to complete when there is a large backlog of files to split. (It's possible to avoid SCP blocking while waiting on WALs to split by using procedure-based splitting: set 'hbase.split.wal.zk.coordinated' to false to enable procedure-based WAL splitting.)
5581
5582
5583 ---
5584
5585 * [HBASE-24524](https://issues.apache.org/jira/browse/HBASE-24524) | *Minor* | **SyncTable logging improvements**
5586
5587 Note that this changes the log level for mismatching row keys: they were originally logged at INFO level and are now logged at DEBUG level. This is consistent with the logging of mismatching cells. Also, for missing row keys, row key values are now logged in human-readable format, making them more meaningful for operators troubleshooting mismatches.
5588
5589
5590 ---
5591
5592 * [HBASE-24359](https://issues.apache.org/jira/browse/HBASE-24359) | *Major* | **Optionally ignore edits for deleted CFs for replication.**
5593
5594 Introduces a new config, hbase.replication.drop.on.deleted.columnfamily, which defaults to false. When set to true, replication drops the edits for a column family that has been deleted from both the replication source and target.
5595
5596
5597 ---
5598
5599 * [HBASE-24418](https://issues.apache.org/jira/browse/HBASE-24418) | *Major* | **Consolidate Normalizer implementations**
5600
5601 <!-- markdown -->
5602 This change extends the Normalizer with a handful of new configurations. The configuration points supported are:
5603 * `hbase.normalizer.split.enabled` Whether to split a region as part of normalization. Default: `true`.
5604 * `hbase.normalizer.merge.enabled` Whether to merge a region as part of normalization. Default: `true`.
5605 * `hbase.normalizer.min.region.count` The minimum number of regions in a table to consider it for merge normalization. Default: 3.
5606 * `hbase.normalizer.merge.min_region_age.days` The minimum age for a region to be considered for a merge, in days. Default: 3.
5607 * `hbase.normalizer.merge.min_region_size.mb` The minimum size for a region to be considered for a merge, in whole MBs. Default: 1.
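
One way to read the three merge thresholds is as a pair of eligibility checks, sketched below in Java. This is an interpretation of the descriptions above, not the Normalizer code: the class and method names are hypothetical, and treating each "minimum" as an inclusive lower bound is an assumption.

```java
final class MergeCandidateCheck {
  /** hbase.normalizer.min.region.count: the table needs at least this many regions. */
  static boolean tableEligible(int regionCount, int minRegionCount) {
    return regionCount >= minRegionCount;
  }

  /**
   * hbase.normalizer.merge.min_region_age.days and
   * hbase.normalizer.merge.min_region_size.mb: the region must be old enough and large enough.
   */
  static boolean regionEligible(long ageDays, long sizeMb, int minAgeDays, int minSizeMb) {
    return ageDays >= minAgeDays && sizeMb >= minSizeMb;
  }
}
```

With the defaults listed above (3 regions, 3 days, 1 MB), a two-day-old region would not be considered for a merge under this reading, regardless of its size.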
5608
5609
5610 ---
5611
5612 * [HBASE-24309](https://issues.apache.org/jira/browse/HBASE-24309) | *Major* | **Avoid introducing log4j and slf4j-log4j dependencies for modules other than hbase-assembly**
5613
5614 Add a hbase-logging module, put the log4j related code in this module only so other modules do not need to depend on log4j at compile scope. See the comments of Log4jUtils and InternalLog4jUtils for more details.
5615
5616 Add a log4j.properties to the test jar of hbase-logging module, so for other sub modules we just need to depend on the test jar of hbase-logging module at test scope to output the log to console, without placing a log4j.properties in the test resources as they all (almost) have the same content. And this test module will not be included in the assembly tarball so it will not mess up the binary distribution.
5617
5618 Ban direct commons-logging dependency, and ban commons-logging and log4j imports in non-test code, to avoid messing up downstream users' logging frameworks. In the hbase-logging module we do need to use log4j classes, and the trick is to use the full class name.
5619
5620 Add jcl-over-slf4j and jul-to-slf4j dependencies. Some of our dependencies use jcl or jul as their logging framework, so their log messages should also be redirected to slf4j.
5621
5622
5623 ---
5624
5625 * [HBASE-21406](https://issues.apache.org/jira/browse/HBASE-21406) | *Minor* | **"status 'replication'" should not show SINK if the cluster does not act as sink**
5626
5627 Added new metric to differentiate sink startup time from last OP applied time.
5628
5629 The original behaviour was to always set the startup time to TimestampsOfLastAppliedOp, and always show it in the "status 'replication'" command, regardless of whether the sink ever applied any OP.
5630
5631 This was confusing, especially in scenarios where the cluster was just acting as a source: the output could lead to wrong interpretations about the sink not applying edits or replication being stuck.
5632
5633 With the new metric, we now compare the two metric values, assuming that if both are the same, no OP has ever been shipped to the given sink. The output then reflects this more clearly, for example:
5634
5635 SINK: TimeStampStarted=Thu Dec 06 23:59:47 GMT 2018, Waiting for OPs...
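
The comparison described above can be sketched roughly as follows. This is a minimal illustration, not the HBase implementation: the class and method names are hypothetical, and the wording of the branch for a sink that has applied OPs is an assumption.

```java
import java.util.Date;

// Hypothetical helper for illustration only; not an HBase class.
final class SinkStatusSketch {
  /** True when the sink start time and the last-applied-OP time are still identical. */
  static boolean neverAppliedOp(long startedTs, long lastAppliedOpTs) {
    return startedTs == lastAppliedOpTs;
  }

  /** Builds a status line in the spirit of the "status 'replication'" output. */
  static String describe(long startedTs, long lastAppliedOpTs) {
    String prefix = "SINK: TimeStampStarted=" + new Date(startedTs);
    if (neverAppliedOp(startedTs, lastAppliedOpTs)) {
      return prefix + ", Waiting for OPs...";
    }
    // Assumed wording for the non-idle case; the real output format may differ.
    return prefix + ", LastAppliedOp=" + new Date(lastAppliedOpTs);
  }
}
```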
5636
5637
5638 ---
5639
5640 * [HBASE-24132](https://issues.apache.org/jira/browse/HBASE-24132) | *Major* | **Upgrade to Apache ZooKeeper 3.5.7**
5641
5642 <!-- markdown -->
HBase now ships ZooKeeper 3.5.x; previously it shipped the EOL'd 3.4.x. A 3.5.x client can talk to a 3.4.x ensemble.
5644
5645 The ZooKeeper project has built a [FAQ](https://cwiki.apache.org/confluence/display/ZOOKEEPER/Upgrade+FAQ) that documents known issues and work-arounds when upgrading existing deployments.
5646
5647
5648 ---
5649
5650 * [HBASE-22287](https://issues.apache.org/jira/browse/HBASE-22287) | *Major* | **inifinite retries on failed server in RSProcedureDispatcher**
5651
5652 Add backoff. Avoid retrying every 100ms.
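
The note does not say which backoff policy is used. As one possible shape, a capped exponential backoff could look like the sketch below; the doubling policy, the cap, and all names are assumptions made for illustration.

```java
// Hypothetical sketch of a capped exponential backoff; not the RSProcedureDispatcher code.
final class RetryBackoffSketch {
  /**
   * Returns the delay before retry number {@code attempt} (0-based):
   * baseMillis doubled once per attempt, never exceeding maxMillis.
   */
  static long backoffMillis(int attempt, long baseMillis, long maxMillis) {
    if (attempt < 0 || baseMillis <= 0 || maxMillis <= 0) {
      throw new IllegalArgumentException("attempt must be >= 0 and delays must be > 0");
    }
    long delay = baseMillis;
    // Stop doubling once the cap is reached.
    for (int i = 0; i < attempt && delay < maxMillis; i++) {
      delay *= 2;
    }
    return Math.min(delay, maxMillis);
  }
}
```

With a 100ms base and a 10s cap, successive retries would wait 100ms, 200ms, 400ms, and so on, instead of a fixed 100ms.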
5653
5654
5655 ---
5656
5657 * [HBASE-24425](https://issues.apache.org/jira/browse/HBASE-24425) | *Major* | **Run hbck\_chore\_run and catalogjanitor\_run on draw of 'HBCK Report' page**
5658
5659 Runs 'catalogjanitor\_run' and 'hbck\_chore\_run' inline with the loading of the 'HBCK Report' page.
5660
Pass '?cache=true' to skip the inline invocation of 'catalogjanitor\_run' and 'hbck\_chore\_run' when drawing the page.
5662
5663
5664 ---
5665
5666 * [HBASE-24408](https://issues.apache.org/jira/browse/HBASE-24408) | *Blocker* | **Introduce a general 'local region' to store data on master**
5667
Introduced a general 'local region' on the master side to store procedure data, etc.

The hfiles of this region are stored on the root fs, while the WALs are stored on the wal fs. This issue supersedes part of the code for HBASE-23326, as the data is now stored in the 'MasterData' directory instead of 'MasterProcs'.

The old hfiles are moved to the global hfile archive directory with the suffix $-masterlocalhfile-$. The WAL files are moved to the global old WAL directory with the suffix $masterlocalwal$. The TimeToLiveMasterLocalStoreHFileCleaner and TimeToLiveMasterLocalStoreWALCleaner are configured by default for cleaning the old hfiles and WAL files, and the default TTL for both is 7 days.
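
The TTL rule can be pictured with the sketch below. It only illustrates the idea of a 7-day time-to-live check; the class name, the method, and the strict "older than" comparison are assumptions, not the cleaners' actual code.

```java
import java.time.Duration;

// Hypothetical illustration of a TTL check; not the TimeToLiveMasterLocalStore* cleaners.
final class MasterLocalTtlSketch {
  /** Default TTL stated in the release note. */
  static final Duration DEFAULT_TTL = Duration.ofDays(7);

  /** A file is eligible for deletion once it is older than the TTL. */
  static boolean isExpired(long archivedAtMillis, long nowMillis, Duration ttl) {
    long age = nowMillis - archivedAtMillis;
    return age > ttl.toMillis();
  }
}
```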
5673
5674
5675 ---
5676
5677 * [HBASE-24115](https://issues.apache.org/jira/browse/HBASE-24115) | *Major* | **Relocate test-only REST "client" from src/ to test/ and mark Private**
5678
Relocate the test-only REST RemoteHTable and RemoteAdmin from src/ to test/, and mark them as InterfaceAudience.Private.
5680
5681
5682 ---
5683
5684 * [HBASE-23938](https://issues.apache.org/jira/browse/HBASE-23938) | *Major* | **Replicate slow/large RPC calls to HDFS**
5685
5686 Config key: hbase.regionserver.slowlog.systable.enabled
5687 Default value: false
5688
This config takes effect only if hbase.regionserver.slowlog.buffer.enabled is also enabled. While hbase.regionserver.slowlog.buffer.enabled ensures that slow/large RPC logs with complete details are written to the ring buffer available at each RegionServer, hbase.regionserver.slowlog.systable.enabled ensures that all such logs are also persisted in the new system table hbase:slowlog.
Operators can scan hbase:slowlog with filters to retrieve records matching specific attributes. The table is useful for capturing the history of slow RPC calls for detailed analysis.

hbase:slowlog consists of a single ColumnFamily, info. info contains multiple qualifiers, similar to the attributes available to query as part of the Admin API: get\_slowlog\_responses.

An example row from a scan of hbase:slowlog (a sample screenshot is attached to the Jira):
5695
5696  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:call\_details, timestamp=2020-05-16T14:59:58.764Z, value=Scan(org.apache.hadoop.hbase.shaded.protobuf.generated.ClientProtos$ScanRequest)
5697  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:client\_address, timestamp=2020-05-16T14:59:58.764Z, value=172.20.10.2:57348
5698  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:method\_name, timestamp=2020-05-16T14:59:58.764Z, value=Scan
5699  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:param, timestamp=2020-05-16T14:59:58.764Z, value=region { type: REGION\_NAME value: "cluster\_test,cccccccc,1589635796466.aa45e1571d533f5ed0bb31cdccaaf9cf." } scan { a
5700                                                              ttribute { name: "\_isolationlevel\_" value: "\\x5C000" } start\_row: "cccccccc" time\_range { from: 0 to: 9223372036854775807 } max\_versions: 1 cache\_blocks: true max\_result\_size: 2
5701                                                              097152 caching: 2147483647 include\_stop\_row: false } number\_of\_rows: 2147483647 close\_scanner: false client\_handles\_partials: true client\_handles\_heartbeats: true track\_scan\_met
5702                                                              rics: false
5703  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:processing\_time, timestamp=2020-05-16T14:59:58.764Z, value=24
5704  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:queue\_time, timestamp=2020-05-16T14:59:58.764Z, value=0
5705  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:region\_name, timestamp=2020-05-16T14:59:58.764Z, value=cluster\_test,cccccccc,1589635796466.aa45e1571d533f5ed0bb31cdccaaf9cf.
5706  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:response\_size, timestamp=2020-05-16T14:59:58.764Z, value=211227
5707  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:server\_class, timestamp=2020-05-16T14:59:58.764Z, value=HRegionServer
5708  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:start\_time, timestamp=2020-05-16T14:59:58.764Z, value=1589640743932
5709  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:type, timestamp=2020-05-16T14:59:58.764Z, value=ALL
5710  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:username, timestamp=2020-05-16T14:59:58.764Z, value=vjasani
5711
5712
5713 ---
5714
5715 * [HBASE-24271](https://issues.apache.org/jira/browse/HBASE-24271) | *Major* | **Set values in \`conf/hbase-site.xml\` that enable running on \`LocalFileSystem\` out of the box**
5716
5717 <!-- markdown -->
HBASE-24271 changes the default `conf/hbase-site.xml` so that `bin/hbase` runs directly out of the binary tarball or a compiled source tree, without any configuration modifications, against Hadoop 2.8+. This changes our long-standing history of shipping no configured values in `conf/hbase-site.xml`, so existing processes that assume this file is empty of configuration properties may require attention.
5719
5720
5721 ---
5722
5723 * [HBASE-24310](https://issues.apache.org/jira/browse/HBASE-24310) | *Major* | **Use Slf4jRequestLog for hbase-http**
5724
5725 Use Slf4jRequestLog instead of the log4j HttpRequestLogAppender in HttpServer.
5726
The request log is disabled by default in conf/log4j.properties by the following lines:

    # Disable request log by default, you can enable this by changing the appender
    log4j.category.http.requests=INFO,NullAppender
    log4j.additivity.http.requests=false

To enable the request log, change 'NullAppender' to the appender of your choice.

Note that the logger name for the master status http server is 'http.requests.master', and for the region server it is 'http.requests.regionserver'.
5736
5737
5738 ---
5739
5740 * [HBASE-24335](https://issues.apache.org/jira/browse/HBASE-24335) | *Major* | **Support deleteall with ts but without column in shell mode**
5741
Use an empty string to indicate that no column is specified for deleteall in shell mode.

Usage:

    deleteall 'test','r1','',12345
    deleteall 'test', {ROWPREFIXFILTER => 'prefix'}, '', 12345
5746
5747
5748 ---
5749
5750 * [HBASE-24304](https://issues.apache.org/jira/browse/HBASE-24304) | *Major* | **Separate a hbase-asyncfs module**
5751
5752 Added a new hbase-asyncfs module to hold the asynchronous dfs output stream implementation for implementing WAL.
5753
5754
5755 ---
5756
5757 * [HBASE-22710](https://issues.apache.org/jira/browse/HBASE-22710) | *Major* | **Wrong result in one case of scan that use  raw and versions and filter together**
5758
Make the version-selection logic for raw scans more reasonable, to avoid losing results when a filter is used.
5760
5761
5762 ---
5763
5764 * [HBASE-24285](https://issues.apache.org/jira/browse/HBASE-24285) | *Major* | **Move to hbase-thirdparty-3.3.0**
5765
5766 Moved to hbase-thirdparty 3.3.0.
5767
5768
5769 ---
5770
5771 * [HBASE-24252](https://issues.apache.org/jira/browse/HBASE-24252) | *Major* | **Implement proxyuser/doAs mechanism for hbase-http**
5772
This feature enables the HBase Web UIs to accept a 'proxyuser' via the HTTP request's query string. When the parameter \`hbase.security.authentication.spnego.kerberos.proxyuser.enable\` is set to \`true\` in hbase-site.xml (default is \`false\`), the HBase UI will attempt to impersonate the user specified by the query parameter "doAs". This query parameter is checked case-insensitively. When this option is not provided, the user who executed the request is the "real" user and there is no way to impersonate another user against the WebUI.
5774
5775 For example, if the user "bob" with Kerberos credentials executes a request against the WebUI with this feature enabled and a query string which includes \`doAs=alice\`, the HBase UI will treat this request as executed as \`alice\`, not \`bob\`.
5776
5777 The standard Hadoop proxyuser configuration properties to limit users who may impersonate others apply to this change (e.g. to enable \`bob\` to impersonate \`alice\`). See the Hadoop documentation for more information on how to configure these proxyuser rules.
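
As a rough sketch of the lookup described above, the following resolves the effective user from a query-parameter map, matching "doAs" case-insensitively. The class and method names are hypothetical, and the Hadoop proxyuser authorization check is deliberately left out.

```java
import java.util.Map;

// Hypothetical illustration; does NOT perform the Hadoop proxyuser authorization check.
final class ProxyUserSketch {
  /**
   * Returns the user the request should run as: the value of the "doAs" query
   * parameter (matched case-insensitively) when the feature is enabled,
   * otherwise the authenticated ("real") user.
   */
  static String effectiveUser(String realUser, Map<String, String> queryParams,
      boolean proxyUserEnabled) {
    if (!proxyUserEnabled) {
      return realUser;
    }
    for (Map.Entry<String, String> e : queryParams.entrySet()) {
      if ("doAs".equalsIgnoreCase(e.getKey())) {
        return e.getValue();
      }
    }
    return realUser;
  }
}
```

In the example from the text, a request authenticated as "bob" with `doAs=alice` resolves to "alice" only when the feature is enabled; otherwise it stays "bob".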
5778
5779
5780 ---
5781
5782 * [HBASE-24143](https://issues.apache.org/jira/browse/HBASE-24143) | *Major* | **[JDK11] Switch default garbage collector from CMS**
5783
5784 <!-- markdown -->
`bin/hbase` will now dynamically select a garbage collector implementation based on the detected JVM version. JDKs 8, 9, and 10 use `-XX:+UseConcMarkSweepGC`, while JDK 11+ uses `-XX:+UseG1GC`.

Note a slight compatibility change. Previously, the garbage collector choice would always be appended to a user-provided value for `HBASE_OPTS`. As of this change, this setting is only applied when `HBASE_OPTS` is unset. That means operators who provide a value for this variable now also need to specify the collector. This is especially important for those on JDK 8, where the JVM default GC is not the recommended ConcMarkSweep.
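
The selection rule can be summarised in a few lines. The real logic lives in the `bin/hbase` shell script; the Java below is only a hypothetical restatement of it, and treating an empty `HBASE_OPTS` the same as an unset one is an assumption.

```java
// Hypothetical restatement of the selection rule; the real logic is in the bin/hbase script.
final class GcSelectionSketch {
  /** Collector flag chosen by JDK major version, per the release note. */
  static String defaultGcFlag(int jdkMajorVersion) {
    return jdkMajorVersion >= 11 ? "-XX:+UseG1GC" : "-XX:+UseConcMarkSweepGC";
  }

  /**
   * The default collector is applied only when HBASE_OPTS is unset
   * (assumption: empty counts as unset). A user-provided value is used verbatim.
   */
  static String effectiveOpts(String hbaseOpts, int jdkMajorVersion) {
    if (hbaseOpts == null || hbaseOpts.trim().isEmpty()) {
      return defaultGcFlag(jdkMajorVersion);
    }
    return hbaseOpts;
  }
}
```

Under this rule, an operator on JDK 8 who sets `HBASE_OPTS` to `-Xmx4g` gets no collector flag at all unless they add one themselves.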
5788
5789
5790 ---
5791
5792 * [HBASE-24024](https://issues.apache.org/jira/browse/HBASE-24024) | *Major* | **Optionally reject multi() requests with very high no of rows**
5793
5794 New Config: hbase.rpc.rows.size.threshold.reject
5795 -----------------------------------------------------------------------
5796
5797 Default value: false
5798 Description:
If true, the RegionServer rejects Put/Delete batch requests whose number of rows exceeds the threshold defined by the config hbase.rpc.rows.warning.threshold.
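
A minimal sketch of the decision, assuming a strict "greater than" comparison against the warning threshold; the class and parameter names are hypothetical and the threshold values in the test are examples only.

```java
// Hypothetical illustration of the reject decision; not the RegionServer RPC code.
final class BatchRowLimitSketch {
  /**
   * @param rowsInBatch      number of rows in the Put/Delete batch
   * @param warningThreshold value of hbase.rpc.rows.warning.threshold
   * @param rejectEnabled    value of hbase.rpc.rows.size.threshold.reject
   */
  static boolean shouldReject(int rowsInBatch, int warningThreshold, boolean rejectEnabled) {
    return rejectEnabled && rowsInBatch > warningThreshold;
  }
}
```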
5800
5801
5802 ---
5803
5804 * [HBASE-24139](https://issues.apache.org/jira/browse/HBASE-24139) | *Critical* | **Balancer should avoid leaving idle region servers**
5805
5806 StochasticLoadBalancer functional improvement:
5807
StochasticLoadBalancer now rebalances the cluster if any RegionServer is idle (hosting no regions) while other RegionServers host at least one region.
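
The trigger condition can be sketched as a check over per-server region counts. This is an illustration only; the class name and the map-based input are assumptions, and the real balancer works on its own cluster model.

```java
import java.util.Map;

// Hypothetical illustration of the idle-server trigger; not StochasticLoadBalancer code.
final class IdleServerSketch {
  /**
   * True when at least one server hosts zero regions while at least one
   * other server hosts one or more regions.
   */
  static boolean hasIdleServer(Map<String, Integer> regionsPerServer) {
    boolean anyIdle = false;
    boolean anyLoaded = false;
    for (int count : regionsPerServer.values()) {
      if (count == 0) {
        anyIdle = true;
      } else {
        anyLoaded = true;
      }
    }
    return anyIdle && anyLoaded;
  }
}
```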
5809
5810
5811 ---
5812
5813 * [HBASE-24196](https://issues.apache.org/jira/browse/HBASE-24196) | *Major* | **[Shell] Add rename rsgroup command in hbase shell**
5814
A user or admin can now rename an rsgroup from the hbase shell:

    hbase shell > rename_rsgroup 'oldname', 'newname'
5818
5819
5820 ---
5821
5822 * [HBASE-24218](https://issues.apache.org/jira/browse/HBASE-24218) | *Major* | **Add hadoop 3.2.x in hadoop check**
5823
Add hadoop-3.2.0 and hadoop-3.2.1 to the hadoop check; with '--quick-hadoopcheck', only hadoop-3.2.1 is checked.

Note that, to keep the personality scripts aligned across all active branches, the patch is committed to all of them, but the hadoop-3.2.x support in hadoopcheck only applies to branch-2.2+.
5827
5828
5829 ---
5830
5831 * [HBASE-23829](https://issues.apache.org/jira/browse/HBASE-23829) | *Major* | **Get \`-PrunSmallTests\` passing on JDK11**
5832
\`-PrunSmallTests\` now passes on JDK11 when using \`-Phadoop.profile=3.0\`.
5834
5835
5836 ---
5837
5838 * [HBASE-24185](https://issues.apache.org/jira/browse/HBASE-24185) | *Major* | **Junit tests do not behave well with System.exit or Runtime.halt or JVM exits in general.**
5839
5840 Tests that fail because a process -- RegionServer or Master -- called System.exit, will now instead throw an exception.
5841
5842
5843 ---
5844
5845 * [HBASE-24072](https://issues.apache.org/jira/browse/HBASE-24072) | *Major* | **Nightlies reporting OutOfMemoryError: unable to create new native thread**
5846
5847 Hadoop hosts have had their ulimit -u raised from 10000 to 30000 (per user, by INFRA). The Docker build container has had its limit raised from 10000 to 12500.
5848
5849
5850 ---
5851
5852 * [HBASE-24112](https://issues.apache.org/jira/browse/HBASE-24112) | *Major* | **[RSGroup] Support renaming rsgroup**
5853
5854 Support RSGroup renaming in core codebase. New API Admin#renameRSGroup(String, String) is introduced in 3.0.0.
5855
5856
5857 ---
5858
5859 * [HBASE-23994](https://issues.apache.org/jira/browse/HBASE-23994) | *Trivial* | ** Add WebUI to Canary**
5860
5861 <!-- markdown -->
5862 The Canary tool now offers a WebUI when run in `region` mode (the default mode). It is enabled by default, and by default, it binds to `0.0.0.0:16050`. This can be overridden by setting `hbase.canary.info.bindAddress` and `hbase.canary.info.port`. To disable entirely, set the port to `-1`.
5863
5864
5865 ---
5866
5867 * [HBASE-23779](https://issues.apache.org/jira/browse/HBASE-23779) | *Major* | **Up the default fork count to make builds complete faster; make count relative to CPU count**
5868
Pass --threads=2 when building on jenkins. It shortens nightly build times by about 25%.

It works by running module builds/tests in parallel when dependencies allow. Upping the forkcount beyond the pom default of 0.25C would have us exceed our CPU budget on jenkins when two modules are running in parallel (2 modules at 0.25C each makes 0.5C, and on jenkins, hadoop nodes run two jenkins executors per host). Higher forkcounts also seem to threaten build stability.

To run tests faster locally, up the fork count:

    $ x="0.5C"  ;  mvn --threads=2  -Dsurefire.firstPartForkCount=$x -Dsurefire.secondPartForkCount=$x test -PrunAllTests

You could up x from 0.5C to 1.0C, but YMMV (on overcommitted hardware, tests start bombing out pretty soon after startup). You could also try upping the thread count, but on occasion you are likely to overcommit the hardware.
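
The arithmetic behind the CPU budget can be worked through with the sketch below. It is a hypothetical helper, not Surefire's code: the rounding rule (truncate, minimum of one fork) is an assumption.

```java
// Hypothetical arithmetic helper; the rounding rule is an assumption, not Surefire's code.
final class ForkCountSketch {
  /** Resolves a fork count such as "0.25C" (a multiple of the core count) or "3" (absolute). */
  static int forks(String forkCount, int cores) {
    if (forkCount.endsWith("C")) {
      double multiplier = Double.parseDouble(forkCount.substring(0, forkCount.length() - 1));
      return Math.max(1, (int) (multiplier * cores));
    }
    return Integer.parseInt(forkCount);
  }

  /** Upper bound on concurrent forks when maven builds several modules in parallel. */
  static int totalForks(String forkCount, int cores, int mavenThreads) {
    return forks(forkCount, cores) * mavenThreads;
  }
}
```

On a 16-core host, 0.25C resolves to 4 forks per module, so two modules in parallel use up to 8 forks, which is the 0.5C figure mentioned above.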
5878
5879
5880 ---
5881
5882 * [HBASE-24126](https://issues.apache.org/jira/browse/HBASE-24126) | *Major* | **Up the container nproc uplimit from 10000 to 12500**
5883
Start docker with a raised ulimit for nproc by passing '--ulimit nproc=12500'. The limit was 10000, the default. Also set PROC\_LIMIT in hbase-personality so that yetus runs with the new 12500 value.
5885
5886
5887 ---
5888
5889 * [HBASE-24150](https://issues.apache.org/jira/browse/HBASE-24150) | *Major* | **Allow module tests run in parallel**
5890
Pass -T2 to mvn so that two modules are built at a time, dependencies permitting. This speeds up build and testing, but doubles resource usage while modules run in parallel.
5892
5893
5894 ---
5895
5896 * [HBASE-24121](https://issues.apache.org/jira/browse/HBASE-24121) | *Major* | **[Authorization] ServiceAuthorizationManager isn't dynamically updatable. And it should be.**
5897
Master and RegionServer now support refreshing the authorization policy defined in hbase-policy.xml without restarting the service. To refresh the policy, execute the hbase shell command update\_config or update\_config\_all after the policy file has been updated and synced to all nodes.
5899
5900
5901 ---
5902
5903 * [HBASE-24099](https://issues.apache.org/jira/browse/HBASE-24099) | *Major* | **Use a fair ReentrantReadWriteLock for the region close lock**
5904
5905 This change modifies the default acquisition policy for the region's close lock in order to prevent observed starvation of close requests. The new boolean configuration parameter 'hbase.regionserver.fair.region.close.lock' controls the lock acquisition policy: if true, the lock is created in fair mode (default); if false, the lock is created in nonfair mode (the old default).
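
The fair and nonfair modes correspond to the boolean constructor argument of `java.util.concurrent.locks.ReentrantReadWriteLock`. The sketch below shows how such a flag could be read and applied; `java.util.Properties` stands in for HBase's configuration object, and the class and method names are hypothetical.

```java
import java.util.Properties;
import java.util.concurrent.locks.ReentrantReadWriteLock;

// Hypothetical illustration; Properties stands in for the HBase configuration.
final class RegionCloseLockSketch {
  static final String FAIR_LOCK_KEY = "hbase.regionserver.fair.region.close.lock";

  /** Creates the lock in fair mode unless the property is explicitly set to false. */
  static ReentrantReadWriteLock newCloseLock(Properties conf) {
    boolean fair = Boolean.parseBoolean(conf.getProperty(FAIR_LOCK_KEY, "true"));
    return new ReentrantReadWriteLock(fair);
  }
}
```

In fair mode, waiting threads acquire the lock roughly in arrival order, so a waiting writer is not starved by a steady stream of readers, at some cost in throughput.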
5906
5907
5908 ---
5909
5910 * [HBASE-23153](https://issues.apache.org/jira/browse/HBASE-23153) | *Major* | **PrimaryRegionCountSkewCostFunction SLB function should implement CostFunction#isNeeded**
5911
5912 <!-- markdown -->
5913 The `PrimaryRegionCountSkewCostFunction` for the `StochasticLoadBalancer` is only needed when the read replicas feature is enabled. With this change, that function now properly indicates that it is not needed when the read replica feature is off.
5914
5915 If this improvement is not available, operators with clusters that are not using the read replica feature should manually disable it by setting `hbase.master.balancer.stochastic.primaryRegionCountCost` to `0.0` in hbase-site.xml for all HBase Masters.
5916
5917
5918 ---
5919
5920 * [HBASE-24055](https://issues.apache.org/jira/browse/HBASE-24055) | *Major* | **Make AsyncFSWAL can run on EC cluster**
5921
AsyncFSWAL can now also be used against a directory that has EC enabled. Make sure you also use the hadoop 3.x client, as the option is only available in hadoop 3.x.
5923
5924
5925 ---
5926
5927 * [HBASE-24113](https://issues.apache.org/jira/browse/HBASE-24113) | *Major* | **Upgrade the maven we use from 3.5.4 to 3.6.3 in nightlies**
5928
Branches-2.3+ use maven 3.6.3 for building. Older branches still use 3.5.4.
5930
5931
5932 ---
5933
5934 * [HBASE-24122](https://issues.apache.org/jira/browse/HBASE-24122) | *Major* | **Change machine ulimit-l to ulimit-a so dumps full ulimit rather than just 'max locked memory'**
5935
5936 Our 'Build Artifacts' have a machine directory under which we emit vitals on the host the build was run on. We used to emit the result of 'ulimit -l' as a file named 'ulimit-l'. This has been hijacked to instead emit result of running 'ulimit -a' which includes stat on ulimit -l.
5937
5938
5939 ---
5940
5941 * [HBASE-23678](https://issues.apache.org/jira/browse/HBASE-23678) | *Major* | **Literate builder API for version management in schema**
5942
5943 ColumnFamilyDescriptor new builder API:
5944
    /**
     * Retain all versions for a given TTL (retentionInterval), and then only a specific number
     * of versions (versionAfterInterval) after that interval elapses.
     *
     * @param retentionInterval Retain all versions for this interval
     * @param versionAfterInterval Number of versions to retain after retentionInterval elapses
     */
    public ModifyableColumnFamilyDescriptor setVersionsWithTimeToLive(
        final int retentionInterval, final int versionAfterInterval)
5954
5955
5956 ---
5957
5958 * [HBASE-24050](https://issues.apache.org/jira/browse/HBASE-24050) | *Major* | **Deprecated PBType on all 2.x branches**
5959
org.apache.hadoop.hbase.types.PBType is marked as deprecated without any replacement. It will be moved to the hbase-example module and marked as IA.Private in 3.0.0. Exposing it was a mistake, as it should not be part of our public API. Users who depend on this class should copy the code into their own code base.
5961
5962
5963 ---
5964
5965 * [HBASE-8868](https://issues.apache.org/jira/browse/HBASE-8868) | *Minor* | **add metric to report client shortcircuit reads**
5966
5967 Expose file system level read metrics for RegionServer.
5968
5969 If the HBase RS runs on top of HDFS, calculate the aggregation of
5970 ReadStatistics of each HdfsFileInputStream. These metrics include:
5971 (1) total number of bytes read from HDFS.
5972 (2) total number of bytes read from local DataNode.
5973 (3) total number of bytes read locally through short-circuit read.
5974 (4) total number of bytes read locally through zero-copy read.
5975
5976 Because HDFS ReadStatistics is calculated per input stream, it is not
5977 feasible to update the aggregated number in real time. Instead, the
5978 metrics are updated when an input stream is closed.
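
The "aggregate on close" approach can be sketched as below. The classes are hypothetical stand-ins: the real code reads the HDFS ReadStatistics of each input stream, while this sketch tracks two simple counters to show when the aggregate changes.

```java
import java.util.concurrent.atomic.AtomicLong;

// Hypothetical illustration of aggregating per-stream statistics when a stream is closed.
final class ReadMetricsSketch {
  private final AtomicLong totalBytesRead = new AtomicLong();
  private final AtomicLong localBytesRead = new AtomicLong();

  long getTotalBytesRead() { return totalBytesRead.get(); }
  long getLocalBytesRead() { return localBytesRead.get(); }

  TrackedStream open() { return new TrackedStream(); }

  /** Keeps its own statistics and publishes them to the aggregate exactly once, on close. */
  final class TrackedStream implements AutoCloseable {
    private long total;
    private long local;
    private boolean closed;

    void recordRead(long bytes, boolean fromLocalDataNode) {
      total += bytes;
      if (fromLocalDataNode) {
        local += bytes;
      }
    }

    @Override
    public void close() {
      if (closed) {
        return; // guard against double counting
      }
      closed = true;
      totalBytesRead.addAndGet(total);
      localBytesRead.addAndGet(local);
    }
  }
}
```

A consequence of this design is that the aggregate lags behind reality while long-lived streams remain open.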
5979
5980
5981 ---
5982
5983 * [HBASE-24032](https://issues.apache.org/jira/browse/HBASE-24032) | *Major* | **[RSGroup] Assign created tables to respective rsgroup automatically instead of manual operations**
5984
An admin can determine which tables go to which rsgroup with a script on the Master side (by setting hbase.rsgroup.table.mapping.script to a local filesystem path), which aims to lighten the burden of admin operations. Note that since HBase 3+, the rsgroup can also be specified in the TableDescriptor; if clients specify it, the master skips the determination by script.
5986
Here is a simple example script:

    #!/bin/bash
    # Input consists of two strings: the first is the namespace of the table, the second is the table name.
    namespace=$1
    tablename=$2
    if [[ $namespace == test ]]; then
      echo test
    elif [[ $tablename == *foo* ]]; then
      echo other
    else
      echo default
    fi
6001
6002
6003 ---
6004
6005 * [HBASE-23993](https://issues.apache.org/jira/browse/HBASE-23993) | *Major* | **Use loopback for zk standalone server in minizkcluster**
6006
6007 MiniZKCluster now puts up its standalone node listening on loopback/127.0.0.1 rather than "localhost".
6008
6009
6010 ---
6011
6012 * [HBASE-23986](https://issues.apache.org/jira/browse/HBASE-23986) | *Major* | **Bump hadoop-two.version to 2.10.0 on master and branch-2**
6013
6014 Bumped hadoop-two.version to 2.10.0, which means we will drop the support for hadoop-2.8.x and hadoop-2.9.x.
6015
6016
6017 ---
6018
6019 * [HBASE-23930](https://issues.apache.org/jira/browse/HBASE-23930) | *Minor* | **Shell should attempt to format \`timestamp\` attributes as ISO-8601**
6020
6021 Change timestamp display to be ISO8601 when toString on Cell and outputting in shell....
6022
6023 User used to see....
6024
6025   column=table:state, timestamp=1583967620343 .....
6026
6027 ... but now sees:
6028
6029   column=table:state, timestamp=2020-03-11T23:00:20.343Z ....
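
For reference, the conversion between the two forms shown above can be reproduced with `java.time.Instant`, whose `toString()` yields an ISO-8601 string in UTC. This is a sketch of the conversion only; the class name is hypothetical and HBase's own formatting code may differ.

```java
import java.time.Instant;

// Hypothetical illustration of epoch-millis to ISO-8601 conversion.
final class Iso8601Sketch {
  /** Formats milliseconds since the epoch as an ISO-8601 instant in UTC. */
  static String format(long epochMillis) {
    return Instant.ofEpochMilli(epochMillis).toString();
  }

  /** Inverse direction, useful when a tool still needs the raw long value. */
  static long parse(String iso8601) {
    return Instant.parse(iso8601).toEpochMilli();
  }
}
```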
6030
6031
6032 ---
6033
6034 * [HBASE-22827](https://issues.apache.org/jira/browse/HBASE-22827) | *Major* | **Expose multi-region merge in shell and Admin API**
6035
The merge\_region shell command can now be used to merge more than 2 regions. It takes a list of regions, either as comma-separated values or as an array of regions, rather than just 2 regions. Both full region names and encoded region names continue to be accepted.
6037
6038
6039 ---
6040
6041 * [HBASE-23767](https://issues.apache.org/jira/browse/HBASE-23767) | *Major* | **Add JDK11 compilation and unit test support to Github precommit**
6042
6043 Rebuild our Dockerfile with support for multiple JDK versions. Use multiple stages in the Jenkinsfile instead of yetus's multijdk because of YETUS-953. Run those multiple stages in parallel to speed up results.
6044
6045 Note that multiple stages means multiple Yetus invocations means multiple comments on the PreCommit. This should become more obvious to users once we can make use of GitHub Checks API, HBASE-23902.
6046
6047
6048 ---
6049
6050 * [HBASE-22978](https://issues.apache.org/jira/browse/HBASE-22978) | *Minor* | **Online slow response log**
6051
6052 get\_slowlog\_responses and clear\_slowlog\_responses are used to retrieve and clear slow RPC logs from RingBuffer maintained by RegionServers.
6053
6054 New Admin APIs:
6055 1.   List\<SlowLogRecord\> getSlowLogResponses(final Set\<ServerName\> serverNames,
6056       final SlowLogQueryFilter slowLogQueryFilter) throws IOException;
6057
6058 2.   List\<Boolean\> clearSlowLogResponses(final Set\<ServerName\> serverNames)
6059       throws IOException;
6060
6061 Configs:
6062
1. hbase.regionserver.slowlog.ringbuffer.size:
Size of the ring buffer maintained by each RegionServer to store online slowlog responses. This is an in-memory ring buffer of requests that were judged to be too slow, in addition to the responseTooSlow logging. The in-memory representation is complete. For more details, see the doc section "Get Slow Response Log from shell".

Default
256

2. hbase.regionserver.slowlog.buffer.enabled:
Indicates whether RegionServers run a ring buffer for storing online slow logs in FIFO manner with limited entries. The size of the ring buffer is set by the config hbase.regionserver.slowlog.ringbuffer.size. The default value is false; turn this on to get the latest slowlog responses with complete data.

Default
false
6074
6075
6076 For more details, please look into "Get Slow Response Log from shell" section from HBase book.
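
To illustrate what "FIFO manner with limited entries" means, here is a small bounded buffer that evicts the oldest record when full. It is a simplified stand-in built on `java.util.ArrayDeque`; the class name is hypothetical and the RegionServer's actual ring buffer implementation may differ.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.List;

// Hypothetical, simplified stand-in for a bounded FIFO buffer of slow log records.
final class SlowLogRingBufferSketch<T> {
  private final int capacity;
  private final ArrayDeque<T> entries;

  SlowLogRingBufferSketch(int capacity) {
    if (capacity <= 0) {
      throw new IllegalArgumentException("capacity must be positive");
    }
    this.capacity = capacity;
    this.entries = new ArrayDeque<>(capacity);
  }

  /** Adds a record, dropping the oldest one first if the buffer is full. */
  synchronized void add(T record) {
    if (entries.size() == capacity) {
      entries.pollFirst();
    }
    entries.addLast(record);
  }

  /** Returns the retained records, oldest first. */
  synchronized List<T> snapshot() {
    return new ArrayList<>(entries);
  }

  /** Drops all retained records, in the spirit of clear_slowlog_responses. */
  synchronized void clear() {
    entries.clear();
  }
}
```

With the default size of 256, only the 256 most recent slow responses per RegionServer would be retained.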
6077
6078
6079 ---
6080
6081 * [HBASE-23926](https://issues.apache.org/jira/browse/HBASE-23926) | *Major* | **[Flakey Tests] Down the flakies re-run ferocity; it makes for too many fails.**
6082
Down the flakey re-run fork count from 1.0C -- i.e. a fork per CPU -- to 0.25C. On a recent run, the machine had 16 cores, so 0.25C is 4 forks. We'd hardcoded the fork count at 3 prior to the changes made by the parent issue.
6084
6085
6086 ---
6087
6088 * [HBASE-23146](https://issues.apache.org/jira/browse/HBASE-23146) | *Major* | **Support CheckAndMutate with multiple conditions**
6089
6090 Add a checkAndMutate(row, filter) method in the AsyncTable interface and the Table interface.
6091
6092 This method atomically checks if the row matches the specified filter. If it does, it adds the Put/Delete/RowMutations.
6093
This is a fluent-style API. Example usage:

For the Table interface:

    table.checkAndMutate(row, filter).thenPut(put);

For the AsyncTable interface:

    table.checkAndMutate(row, filter).thenPut(put)
        .thenAccept(succ -> {
          if (succ) {
            System.out.println("Check and put succeeded");
          } else {
            System.out.println("Check and put failed");
          }
        });
6112
6113
6114 ---
6115
6116 * [HBASE-23874](https://issues.apache.org/jira/browse/HBASE-23874) | *Minor* | **Move Jira-attached file precommit definition from script in Jenkins config to dev-support**
6117
6118 The Jira Precommit job (https://builds.apache.org/job/PreCommit-HBASE-Build/) will now look for a file within the source tree (dev-support/jenkins\_precommit\_jira\_yetus.sh) instead of depending on a script section embedded in the job.
6119
6120
6121 ---
6122
6123 * [HBASE-23865](https://issues.apache.org/jira/browse/HBASE-23865) | *Major* | **Up flakey history from 5 to 10**
6124
Changed flakey list reporting to show 10 rather than 5 items. Also changed the first and second part fork counts to be 1C rather than a hardcoded 3.
6126
6127
6128 ---
6129
6130 * [HBASE-23554](https://issues.apache.org/jira/browse/HBASE-23554) | *Major* | **Encoded regionname to regionname utility**
6131
Adds the shell command regioninfo:

    hbase(main):001:0>  regioninfo '0e6aa5c19ae2b2627649dc7708ce27d0'
    {ENCODED => 0e6aa5c19ae2b2627649dc7708ce27d0, NAME => 'TestTable,,1575941375972.0e6aa5c19ae2b2627649dc7708ce27d0.', STARTKEY => '', ENDKEY => '00000000000000000000299441'}
    Took 0.4737 seconds
6137
6138
6139 ---
6140
6141 * [HBASE-23350](https://issues.apache.org/jira/browse/HBASE-23350) | *Major* | **Make compaction files cacheonWrite configurable based on threshold**
6142
This JIRA adds a new configuration, \`hbase.rs.cachecompactedblocksonwrite.threshold\`. This configuration is the maximum total size (in bytes) of the compacted files below which the configuration \`hbase.rs.cachecompactedblocksonwrite\` is honoured. If the total size of the compacted files exceeds this threshold, the data blocks are not cached, even when \`hbase.rs.cachecompactedblocksonwrite\` is enabled. Caching of index and bloom blocks is not affected by this configuration (the user configuration is always honoured).

The default value of this configuration is Long.MAX\_VALUE. This means the data blocks will be cached whatever the total size of the compacted files.
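
The interaction of the two settings can be sketched as a single predicate. The class and parameter names are hypothetical, and treating a size exactly equal to the threshold as cacheable is an assumption.

```java
// Hypothetical illustration of the data-block caching decision during compaction.
final class CompactionCacheSketch {
  /** Default for hbase.rs.cachecompactedblocksonwrite.threshold per the release note. */
  static final long DEFAULT_THRESHOLD = Long.MAX_VALUE;

  /**
   * @param cacheCompactedBlocksOnWrite value of hbase.rs.cachecompactedblocksonwrite
   * @param totalCompactedBytes         total size of the files being compacted
   * @param thresholdBytes              value of hbase.rs.cachecompactedblocksonwrite.threshold
   */
  static boolean cacheDataBlocksOnWrite(boolean cacheCompactedBlocksOnWrite,
      long totalCompactedBytes, long thresholdBytes) {
    return cacheCompactedBlocksOnWrite && totalCompactedBytes <= thresholdBytes;
  }
}
```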
6146
6147
6148 ---
6149
6150 * [HBASE-17115](https://issues.apache.org/jira/browse/HBASE-17115) | *Major* | **HMaster/HRegion Info Server does not honour admin.acl**
6151
6152 Implements authorization for the HBase Web UI by limiting access to certain endpoints which could be used to extract sensitive information from HBase.
6153
Access to these restricted endpoints can be limited to a group of administrators, identified either by a list of users (hbase.security.authentication.spnego.admin.users) or by a list of groups (hbase.security.authentication.spnego.admin.groups). By default, neither of these values is set, which preserves backwards compatibility (all authenticated users may access all endpoints).
6156
6157 Further, users who have sensitive information in the HBase service configuration can set hbase.security.authentication.ui.config.protected to true which will treat the configuration endpoint as a protected, admin-only resource. By default, all authenticated users may access the configuration endpoint.
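
The access rule can be sketched as follows. This is an illustration under stated assumptions: the class and method names are hypothetical, and users and groups are passed in as plain sets rather than resolved from configuration or a group mapping service.

```java
import java.util.Collections;
import java.util.Set;

// Hypothetical illustration of the admin check for restricted Web UI endpoints.
final class UiAdminAclSketch {
  /**
   * @param adminUsers  parsed value of hbase.security.authentication.spnego.admin.users
   * @param adminGroups parsed value of hbase.security.authentication.spnego.admin.groups
   */
  static boolean mayAccessRestricted(String user, Set<String> userGroups,
      Set<String> adminUsers, Set<String> adminGroups) {
    // Neither list configured: keep the old behaviour and allow every authenticated user.
    if (adminUsers.isEmpty() && adminGroups.isEmpty()) {
      return true;
    }
    return adminUsers.contains(user) || !Collections.disjoint(userGroups, adminGroups);
  }
}
```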
6158
6159
6160 ---
6161
6162 * [HBASE-23647](https://issues.apache.org/jira/browse/HBASE-23647) | *Major* | **Make MasterRegistry the default registry impl**
6163
6164 <!-- markdown -->
6165 Enables master based registry as the default registry used by clients to fetch connection metadata.
6166 Refer to the section "Master Registry" in the client documentation for more details and advantages
6167 of this implementation over the default Zookeeper based registry.
6168
6169 Configuration parameter that controls the registry in use: `hbase.client.registry.impl`
6170
6171 Where to set this: HBase client configuration (hbase-site.xml)
6172
6173 Possible values:
6174 - `org.apache.hadoop.hbase.client.ZKConnectionRegistry` (For ZK based registry implementation)
6175 - `org.apache.hadoop.hbase.client.MasterRegistry` (New, for master based registry implementation)
6176
6177 Notes on defaults:
6178
6179 - For v3.0.0 and later, MasterRegistry is the default registry
6180 - For all releases in 2.x line, ZK based registry is the default.
6181
6182 This feature has been back ported to 2.3.0 and later releases. MasterRegistry can be enabled by setting the following client configuration.
6183
6184 ```
6185 <property>
6186   <name>hbase.client.registry.impl</name>
6187   <value>org.apache.hadoop.hbase.client.MasterRegistry</value>
6188 </property>
6189 ```
6190
6191
6192 ---
6193
6194 * [HBASE-23069](https://issues.apache.org/jira/browse/HBASE-23069) | *Critical* | **periodic dependency bump for Sep 2019**
6195
- caffeine: 2.6.2 =\> 2.8.1
- commons-codec: 1.10 =\> 1.13
- commons-io: 2.5 =\> 2.6
- disruptor: 3.3.6 =\> 3.4.2
- httpcore: 4.4.6 =\> 4.4.13
- jackson: 2.9.10 =\> 2.10.1
- jackson.databind: 2.9.10.1 =\> 2.10.1
- jetty: 9.3.27.v20190418 =\> 9.3.28.v20191105
- protobuf.plugin: 0.5.0 =\> 0.6.1
- zookeeper: 3.4.10 =\> 3.4.14
- slf4j: 1.7.25 =\> 1.7.30
- rat: 0.12 =\> 0.13
- asciidoctor: 1.5.5 =\> 1.5.8
- asciidoctor.pdf: 1.5.0-alpha.15 =\> 1.5.0-rc.2
- error-prone: 2.3.3 =\> 2.3.4
6211
6212
6213 ---
6214
6215 * [HBASE-23686](https://issues.apache.org/jira/browse/HBASE-23686) | *Major* | **Revert binary incompatible change and remove reflection**
6216
- Reverts a binary incompatible change for ByteRangeUtils
- Removes usage of reflection inside CommonFSUtils
6219
6220
6221 ---
6222
6223 * [HBASE-23347](https://issues.apache.org/jira/browse/HBASE-23347) | *Major* | **Pluggable RPC authentication**
6224
This change introduces an internal abstraction layer which allows new SASL-based authentication mechanisms to be used inside HBase services. All existing SASL-based authentication mechanisms were ported to the new abstraction, making no external change in runtime semantics, client API, or RPC serialization format.

Developers familiar with extending HBase can implement authentication mechanisms beyond simple Kerberos and DelegationTokens which authenticate HBase users against some other user database. HBase service authentication (Master to/from RegionServer) continues to operate solely over Kerberos.
6228
6229
6230 ---
6231
6232 * [HBASE-23156](https://issues.apache.org/jira/browse/HBASE-23156) | *Major* | **start-hbase.sh failed with ClassNotFoundException when build with hadoop3**
6233
Introduces a new hbase-assembly/src/main/assembly/hadoop-three-compat.xml for building with Hadoop 3.x.
6235
6236
6237 ---
6238
6239 * [HBASE-23680](https://issues.apache.org/jira/browse/HBASE-23680) | *Major* | **RegionProcedureStore missing cleaning of hfile archive**
6240
6241 Add a new config to hbase-default.xml
6242
6243   \<property\>
6244     \<name\>hbase.procedure.store.region.hfilecleaner.plugins\</name\>
6245     \<value\>org.apache.hadoop.hbase.master.cleaner.TimeToLiveHFileCleaner\</value\>
6246     \<description\>A comma-separated list of BaseHFileCleanerDelegate invoked by
6247     the RegionProcedureStore HFileCleaner service. These HFiles cleaners are
6248     called in order, so put the cleaner that prunes the most files in front. To
6249     implement your own BaseHFileCleanerDelegate, just put it in HBase's classpath
6250     and add the fully qualified class name here. Always add the above
6251     default hfile cleaners in the list as they will be overwritten in
6252     hbase-site.xml.\</description\>
6253   \</property\>
6254
This cleaner shares the same TTL as the other HFileCleaners. You can also implement your own cleaner and change this property to enable it.
6256
6257
6258 ---
6259
6260 * [HBASE-23675](https://issues.apache.org/jira/browse/HBASE-23675) | *Minor* | **Move to Apache parent POM version 22**
6261
6262 Updated parent pom to Apache version 22.
6263
6264
6265 ---
6266
6267 * [HBASE-23679](https://issues.apache.org/jira/browse/HBASE-23679) | *Critical* | **FileSystem instance leaks due to bulk loads with Kerberos enabled**
6268
This change fixes an issue with Bulk Loading on installations with Kerberos enabled and more than a single RegionServer. When multiple RegionServers host the regions of a table that is being bulk-loaded into, all but the RegionServer hosting the table's first Region will "leak" one DistributedFileSystem object onto the heap, never freeing that memory. Eventually, with enough bulk loads, RegionServers run out of free heap space and will either spend all their time in JVM GC, lose their ZK session, or crash with an OutOfMemoryError.
6270
6271 The only mitigation for this issue is to periodically restart RegionServers. All earlier versions of HBase 2.x are subject to this issue (2.0.x, \<=2.1.8, \<=2.2.3)
6272
6273
6274 ---
6275
6276 * [HBASE-23286](https://issues.apache.org/jira/browse/HBASE-23286) | *Major* | **Improve MTTR: Split WAL to HFile**
6277
Adds a new feature to improve MTTR. With this feature, failover takes three steps:
1. Read the WAL and write HFiles to the recovered.hfiles directory of each column family of the region.
2. Open the region.
3. Bulk load the recovered.hfiles for every column family.

Compared to DLS (distributed log splitting), this feature significantly reduces region open time.

Set hbase.wal.split.to.hfile to true to enable this feature.
6286
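A minimal sketch of enabling the WAL-to-HFile split path, assuming the property is set in the hbase-site.xml used by the cluster services (the note names the property and value but not where to set it):

```xml
<!-- Sketch: split WALs directly into HFiles during recovery. -->
<property>
  <name>hbase.wal.split.to.hfile</name>
  <value>true</value>
</property>
```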
6287
6288 ---
6289
6290 * [HBASE-23619](https://issues.apache.org/jira/browse/HBASE-23619) | *Trivial* | **Use built-in formatting for logging in hbase-zookeeper**
6291
6292 Changed the logging in hbase-zookeeper to use built-in formatting
6293
6294
6295 ---
6296
6297 * [HBASE-23628](https://issues.apache.org/jira/browse/HBASE-23628) | *Minor* | **Replace Apache Commons Digest Base64 with JDK8 Base64**
6298
6299 From the PR:
6300
6301 "Yes. The two create the same output... I just wrote a small test suite to increase my confidence on that. I generated many tens of millions of random byte patterns and compared the output of the two algorithms. They came back identical every time.
6302
6303 "Just in case any inquiring minds would like to know, there is no longer an encoding required when generating the strings. The JDK implementation specifically specifies that strings returned are StandardCharsets.ISO\_8859\_1. This does not change anything because UTF8 and ISO\_8859 overlap for the limited character set (64 characters) the encoding uses."
6304
6305
6306 ---
6307
6308 * [HBASE-23651](https://issues.apache.org/jira/browse/HBASE-23651) | *Major* | **Region balance throttling can be disabled**
6309
Setting hbase.balancer.max.balancing to an integer value \<= 0 disables region balance throttling.
6311
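For illustration, a hedged hbase-site.xml sketch that disables throttling. The value `0` is just one choice of a value \<= 0, and setting it on the Master is an assumption not stated in this note.

```xml
<!-- Sketch: any integer value <= 0 disables region balance throttling. -->
<property>
  <name>hbase.balancer.max.balancing</name>
  <value>0</value>
</property>
```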
6312
6313 ---
6314
6315 * [HBASE-23588](https://issues.apache.org/jira/browse/HBASE-23588) | *Major* | **Cache index blocks and bloom blocks on write if CacheCompactedBlocksOnWrite is enabled**
6316
If cacheOnWrite is enabled during flush or compaction, index and bloom blocks (along with data blocks) are automatically cached on write.
6318
6319
6320 ---
6321
6322 * [HBASE-23369](https://issues.apache.org/jira/browse/HBASE-23369) | *Major* | **Auto-close 'unknown' Regions reported as OPEN on RegionServers**
6323
6324 If a RegionServer reports a Region as OPEN in disagreement with Master's status on the Region, the Master now tells the RegionServer to silently close the Region.
6325
6326
6327 ---
6328
6329 * [HBASE-23596](https://issues.apache.org/jira/browse/HBASE-23596) | *Major* | **HBCKServerCrashProcedure can double assign**
6330
The recently added HBCKServerCrashProcedure (the SCP that gets invoked when an operator schedules an SCP via the hbck2 scheduleRecoveries command) now works the same as SCP, except when the master knows nothing of the scheduled servername. In that case, HBCKSCP will do a full scan of hbase:meta looking for instances of the passed servername. If any are found, it will attempt cleanup of hbase:meta references by reassigning any regions found in OPEN or OPENING state and by closing any in CLOSING state.
6332
6333 Used to fix instances of what the 'HBCK Report' page shows as 'Unknown Servers'.
6334
6335
6336 ---
6337
6338 * [HBASE-23624](https://issues.apache.org/jira/browse/HBASE-23624) | *Major* | **Add a tool to dump the procedure info in HFile**
6339
6340 Use ./hbase org.apache.hadoop.hbase.procedure2.store.region.HFileProcedurePrettyPrinter to run the tool.
6341
6342
6343 ---
6344
6345 * [HBASE-23590](https://issues.apache.org/jira/browse/HBASE-23590) | *Major* | **Update maxStoreFileRefCount to maxCompactedStoreFileRefCount**
6346
RegionsRecoveryChore, introduced as part of HBASE-22460, tries to reopen regions based on the config hbase.regions.recovery.store.file.ref.count.
Region reopen needs to take into consideration all compacted away store files that belong to the region, and not the regular (non-compacted) store files.
6349
6350 Fixed this bug as part of this Jira.
6351 Updated description for corresponding configs:
6352
6353 1. hbase.master.regions.recovery.check.interval :
6354
6355 Regions Recovery Chore interval in milliseconds. This chore keeps running at this interval to find all regions with configurable max store file ref count and reopens them. Defaults to 20 mins
6356
6357 2. hbase.regions.recovery.store.file.ref.count :
6358
A very large ref count on a compacted store file indicates a ref leak on that object (the compacted store file). Such files cannot be removed after they are invalidated via compaction. The only way to recover in such a scenario is to reopen the region, which releases all resources such as the refcount, leases, etc. This config represents the store file ref count threshold considered for reopening regions. Any region with a compacted store file ref count \> this value is eligible for reopening by the master. The value compared is the max refCount among all refCounts on all compacted away store files that belong to a particular region. The default value of -1 indicates this feature is turned off. Only a positive integer value should be provided to enable this feature.
6360
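The two settings above might be combined as in the following sketch. The interval of 1200000 ms is simply the stated 20 minute default written out in milliseconds; the threshold of 3 is a hypothetical value chosen for illustration, not a recommendation from this note.

```xml
<!-- Sketch: run the Regions Recovery Chore every 20 minutes (value in milliseconds). -->
<property>
  <name>hbase.master.regions.recovery.check.interval</name>
  <value>1200000</value>
</property>
<!-- Sketch: reopen regions whose max compacted store file ref count exceeds 3.
     The threshold 3 is hypothetical; the default -1 leaves the feature off. -->
<property>
  <name>hbase.regions.recovery.store.file.ref.count</name>
  <value>3</value>
</property>
```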
6361
6362 ---
6363
6364 * [HBASE-23618](https://issues.apache.org/jira/browse/HBASE-23618) | *Major* | **Add a tool to dump procedure info in the WAL file**
6365
6366 Use ./hbase org.apache.hadoop.hbase.procedure2.store.region.WALProcedurePrettyPrinter to run the tool.
6367
6368
6369 ---
6370
6371 * [HBASE-23617](https://issues.apache.org/jira/browse/HBASE-23617) | *Major* | **Add a stress test tool for region based procedure store**
6372
6373 Use ./hbase org.apache.hadoop.hbase.procedure2.store.region.RegionProcedureStorePerformanceEvaluation to run the tool.
6374
6375
6376 ---
6377
6378 * [HBASE-23326](https://issues.apache.org/jira/browse/HBASE-23326) | *Critical* | **Implement a ProcedureStore which stores procedures in a HRegion**
6379
Uses a region based procedure store to replace the old customized WAL based procedure store. The procedure data migration is done automatically during upgrade. After upgrading, the MasterProcWALs directory will be deleted and a new MasterProc directory will be created. Note that a region still writes a WAL, so there are still WAL files, and they will be moved to the oldWALs directory. The file name is mostly like a normal WAL file name; the only difference is that it ends with "$masterproc$".
6381
6382
6383 ---
6384
6385 * [HBASE-23320](https://issues.apache.org/jira/browse/HBASE-23320) | *Major* | **Upgrade surefire plugin to 3.0.0-M4**
6386
6387 Bumped surefire plugin to 3.0.0-M4
6388
6389
6390 ---
6391
6392 * [HBASE-20461](https://issues.apache.org/jira/browse/HBASE-20461) | *Major* | **Implement fsync for AsyncFSWAL**
6393
6394 Now AsyncFSWAL also supports Durability.FSYNC\_WAL.
6395
6396
6397 ---
6398
6399 * [HBASE-23066](https://issues.apache.org/jira/browse/HBASE-23066) | *Minor* | **Create a config that forces to cache blocks on compaction**
6400
The configuration 'hbase.rs.cacheblocksonwrite' was used to enable caching blocks on write. But blocks were purposely not cached during compaction (since that may be very aggressive), as the caching happens as and when the writer completes a block.
Cloud environments tend to have larger caches. Enabling 'hbase.rs.prefetchblocksonopen' there (a non-aggressive way of caching blocks proactively on reader creation) does not help, because it takes time to cache the compacted blocks.
This feature creates a new configuration 'hbase.rs.cachecompactedblocksonwrite' which, when set to 'true', enables caching of the blocks created by compaction.
Remember that since this is aggressive caching, the user should have enough cache space; if not, it may lead to other active blocks getting evicted.
From the shell, this can also be enabled per column family using the format below:

`create 't1', 'f1', {NUMREGIONS => 15, SPLITALGO => 'HexStringSplit', CONFIGURATION => {'hbase.rs.cachecompactedblocksonwrite' => 'true'}}`
6409
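A hedged sketch of the server-wide form of the same setting, assuming it goes in the RegionServers' hbase-site.xml (this note only names the property and the value 'true'):

```xml
<!-- Sketch: cache blocks written by compactions; make sure the block cache is large enough. -->
<property>
  <name>hbase.rs.cachecompactedblocksonwrite</name>
  <value>true</value>
</property>
```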
6410
6411 ---
6412
6413 * [HBASE-23239](https://issues.apache.org/jira/browse/HBASE-23239) | *Major* | **Reporting on status of backing MOB files from client-facing cells**
6414
6415 <!-- markdown -->
6416
6417 Users of the MOB feature can now use the `mobrefs` utility to get statistics about data in the MOB system and verify the health of backing files on HDFS.
6418
6419 ```
6420 HADOOP_CLASSPATH=/etc/hbase/conf:$(hbase mapredcp) yarn jar \
6421     /some/path/to/hbase-shaded-mapreduce.jar mobrefs mobrefs-report-output some_table foo
6422 ```
6423
6424 See javadocs of the class `MobRefReporter` for more details.
6425
The reference guide now includes information about MOB internals and troubleshooting.
6427
6428
6429 ---
6430
6431 * [HBASE-23549](https://issues.apache.org/jira/browse/HBASE-23549) | *Minor* | **Document steps to disable MOB for a column family**
6432
The reference guide now includes a walkthrough of disabling the MOB feature, if needed, while maintaining availability.
6434
6435
6436 ---
6437
6438 * [HBASE-23582](https://issues.apache.org/jira/browse/HBASE-23582) | *Minor* | **Unbalanced braces in string representation of table descriptor**
6439
6440 Fixed unbalanced braces in string representation within HBase shell
6441
6442
6443 ---
6444
6445 * [HBASE-23293](https://issues.apache.org/jira/browse/HBASE-23293) | *Minor* | **[REPLICATION] make ship edits timeout configurable**
6446
The default RPC timeout for ReplicationSourceShipper#shipEdits is 60s; when bulkload replication is enabled, timeout exceptions may occur.
The timeout can now be configured through replication.source.shipedits.timeout, and it is adaptive.
6449
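As an illustrative sketch, raising the ship-edits timeout might look like this. The value 120000 is hypothetical, and the unit is assumed to be milliseconds (so the 60s default would be 60000); confirm the unit against hbase-default.xml for your release.

```xml
<!-- Sketch: double the ship-edits RPC timeout.
     ASSUMPTION: the value is in milliseconds; 120000 is a hypothetical choice. -->
<property>
  <name>replication.source.shipedits.timeout</name>
  <value>120000</value>
</property>
```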
6450
6451 ---
6452
6453 * [HBASE-23312](https://issues.apache.org/jira/browse/HBASE-23312) | *Major* | **HBase Thrift SPNEGO configs (HBASE-19852) should be backwards compatible**
6454
The newer HBase Thrift SPNEGO configs should not be required. The hbase.thrift.spnego.keytab.file and hbase.thrift.spnego.principal configs will fall back to the original hbase.thrift.keytab.file and hbase.thrift.kerberos.principal configs. Using the older configs will log a deprecation warning. It is preferred to use the newer SPNEGO configurations.
6456
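A minimal sketch of the preferred, newer SPNEGO settings. The property names come from this note; the keytab path and principal are hypothetical placeholders that must be replaced with values from your own Kerberos setup.

```xml
<!-- Sketch: newer Thrift SPNEGO configs. Path and principal below are hypothetical. -->
<property>
  <name>hbase.thrift.spnego.keytab.file</name>
  <value>/etc/security/keytabs/spnego.service.keytab</value>
</property>
<property>
  <name>hbase.thrift.spnego.principal</name>
  <value>HTTP/thrift-host.example.com@EXAMPLE.COM</value>
</property>
```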
6457
6458 ---
6459
6460 * [HBASE-22969](https://issues.apache.org/jira/browse/HBASE-22969) | *Minor* | **A new binary component comparator(BinaryComponentComparator) to perform comparison of arbitrary length and position**
6461
With BinaryComponentComparator, applications will be able to design a diverse and powerful set of filters for rows and columns. See https://issues.apache.org/jira/browse/HBASE-22969 for an example. In general, the comparator can be used with any filter taking ByteArrayComparable. As of now, the following filters take ByteArrayComparable:
6463
6464 1. RowFilter
6465 2. ValueFilter
6466 3. QualifierFilter
6467 4. FamilyFilter
6468 5. ColumnValueFilter
6469
6470
6471 ---
6472
6473 * [HBASE-23234](https://issues.apache.org/jira/browse/HBASE-23234) | *Major* | **Provide .editorconfig based on checkstyle configuration**
6474
6475 Adds a .editorconfig file with configurations populated by IntelliJ, based on our checkstyle configuration. There's lots of IntelliJ-specific configs in here that I assume are not replicated to Eclipse or Netbeans users. Any devs using those tools should push whatever updates they see fit, but please start with the checkstyle configs as the origin of truth.
6476
6477
6478 ---
6479
6480 * [HBASE-23322](https://issues.apache.org/jira/browse/HBASE-23322) | *Minor* | **[hbck2] Simplification on HBCKSCP scheduling**
6481
6482 An hbck2 scheduleRecoveries will run a subclass of ServerCrashProcedure which asks Master what Regions were on the dead Server but it will also do a hbase:meta table scan to see if any vestiges of the old Server remain (for the case where an SCP failed mid-point leaving references in place or where Master and hbase:meta deviated in accounting).
6483
6484
6485 ---
6486
6487 * [HBASE-23321](https://issues.apache.org/jira/browse/HBASE-23321) | *Minor* | **[hbck2] fixHoles of fixMeta doesn't update in-memory state**
6488
If there are holes in hbase:meta, hbck2 fixMeta will now update the Master in-memory state, so you do not need to restart the master just to assign the new hole-bridging regions.
6490
6491
6492 ---
6493
6494 * [HBASE-23282](https://issues.apache.org/jira/browse/HBASE-23282) | *Major* | **HBCKServerCrashProcedure for 'Unknown Servers'**
6495
6496 hbck2 scheduleRecoveries will now run a SCP that also looks in hbase:meta for any references to the scheduled server -- not just consult Master in-memory state -- just in case vestiges of the server are leftover in hbase:meta
6497
6498
6499 ---
6500
6501 * [HBASE-19450](https://issues.apache.org/jira/browse/HBASE-19450) | *Minor* | **Add log about average execution time for ScheduledChore**
6502
6503 <!-- markdown -->
6504 HBase internal chores now log a moving average of how long execution of each chore takes at `INFO` level for the logger `org.apache.hadoop.hbase.ScheduledChore`.
6505
6506 Such messages will happen at most once per five minutes.
6507
6508
6509 ---
6510
6511 * [HBASE-23250](https://issues.apache.org/jira/browse/HBASE-23250) | *Minor* | **Log message about CleanerChore delegate initialization should be at INFO**
6512
6513 CleanerChore delegate initialization is now logged at INFO level instead of DEBUG
6514
6515
6516 ---
6517
6518 * [HBASE-23243](https://issues.apache.org/jira/browse/HBASE-23243) | *Major* | **[pv2] Filter out SUCCESS procedures; on decent-sized cluster, plethora overwhelms problems**
6519
6520 The 'Procedures & Locks' tab in Master UI only displays problematic Procedures now (RUNNABLE, WAITING-TIMEOUT, etc.). It no longer notes procedures whose state is SUCCESS.
6521
6522
6523 ---
6524
6525 * [HBASE-23227](https://issues.apache.org/jira/browse/HBASE-23227) | *Blocker* | **Upgrade jackson-databind to 2.9.10.1 to avoid recent CVEs**
6526
6527 <!-- markdown -->
6528
The Apache HBase REST Proxy now uses Jackson Databind version 2.9.10.1 to address the following CVEs:
6530
6531   - CVE-2019-16942
6532   - CVE-2019-16943
6533
6534 Users of prior releases with Jackson Databind 2.9.10 are advised to either upgrade to this release or to upgrade their local Jackson Databind jar directly.
6535
6536
6537 ---
6538
6539 * [HBASE-23222](https://issues.apache.org/jira/browse/HBASE-23222) | *Critical* | **Better logging and mitigation for MOB compaction failures**
6540
6541 <!-- markdown -->
6542
6543 The MOB compaction process in the HBase Master now logs more about its activity.
6544
6545 In the event that you run into the problems described in HBASE-22075, there is a new HFileCleanerDelegate that will stop all removal of MOB hfiles from the archive area. It can be configured by adding `org.apache.hadoop.hbase.mob.ManualMobMaintHFileCleaner` to the list configured for `hbase.master.hfilecleaner.plugins`. This new cleaner delegate will cause your archive area to grow unbounded; you will have to manually prune files which may be prohibitively complex. Consider if your use case will allow you to mitigate by disabling mob compactions instead.
6546
6547 Caveats:
6548 * Be sure the list of cleaner delegates still includes the default cleaners you will likely need: ttl, snapshot, and hlink.
6549 * Be mindful that if you enable this cleaner delegate then there will be *no* automated process for removing these mob hfiles. You should see a single region per table in `%hbase_root%/archive` that accumulates files over time. You will have to determine which of these files are safe or not to remove.
6550 * You should list this cleaner delegate after the snapshot and hlink delegates so that you can enable sufficient logging to determine when an archived mob hfile is needed by those subsystems. When set to `TRACE` logging, the CleanerChore logger will include archive retention decision justifications.
6551 * If your use case creates a large number of uniquely named tables, this new delegate will cause memory pressure on the master.
6552
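The caveats above could translate into a cleaner list like the following sketch. `TimeToLiveHFileCleaner` and `ManualMobMaintHFileCleaner` are named in these release notes; the fully qualified names used here for the snapshot and hlink cleaners are assumptions that should be checked against your HBase version before use.

```xml
<!-- Sketch: keep the ttl, snapshot, and hlink cleaners, and list the manual MOB
     cleaner after them. ASSUMPTION: the SnapshotHFileCleaner and HFileLinkCleaner
     class names below are not taken from this note; verify them for your release. -->
<property>
  <name>hbase.master.hfilecleaner.plugins</name>
  <value>org.apache.hadoop.hbase.master.cleaner.TimeToLiveHFileCleaner,org.apache.hadoop.hbase.master.snapshot.SnapshotHFileCleaner,org.apache.hadoop.hbase.master.cleaner.HFileLinkCleaner,org.apache.hadoop.hbase.mob.ManualMobMaintHFileCleaner</value>
</property>
```

With this in place the archive area grows until files are pruned manually, as described above.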
6553
6554 ---
6555
6556 * [HBASE-15519](https://issues.apache.org/jira/browse/HBASE-15519) | *Major* | **Add per-user metrics**
6557
6558 Adds per-user metrics for reads/writes to each RegionServer. These metrics are exported by default. hbase.regionserver.user.metrics.enabled can be used to disable the feature if desired for any reason.
6559
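For illustration, turning the per-user metrics off might look like the sketch below, assuming the property is read from the RegionServers' hbase-site.xml and that `false` is the disabling value (the note says the metrics are exported by default):

```xml
<!-- Sketch: disable per-user read/write metrics on RegionServers. -->
<property>
  <name>hbase.regionserver.user.metrics.enabled</name>
  <value>false</value>
</property>
```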
6560
6561 ---
6562
6563 * [HBASE-22460](https://issues.apache.org/jira/browse/HBASE-22460) | *Minor* | **Reopen a region if store reader references may have leaked**
6564
Leaked store files cannot be removed even after they are invalidated via compaction. A reasonable mitigation for a reader reference leak would be a fast reopen of the region on the same server.
6566
6567 Configs:
6568
6569 1. hbase.master.regions.recovery.check.interval :
6570
6571 Regions Recovery Chore interval in milliseconds. This chore keeps running at this interval to find all regions with configurable max store file ref count and reopens them. Defaults to 20 mins
6572
6573 2. hbase.regions.recovery.store.file.ref.count :
6574
6575 This config represents Store files Ref Count threshold value considered for reopening regions. Any region with store files ref count \> this value would be eligible for reopening by master. Default value -1 indicates this feature is turned off. Only positive integer value should be provided to enable this feature.
6576
6577
6578 ---
6579
6580 * [HBASE-23172](https://issues.apache.org/jira/browse/HBASE-23172) | *Minor* | **HBase Canary region success count metrics reflect column family successes, not region successes**
6581
6582 Added a comment to make clear that read/write success counts are tallying column family success counts, not region success counts.
6583
6584 Additionally, the region read and write latencies previously only stored the latencies of the last column family of the region reads/writes. This has been fixed by using a map of each region to a list of read and write latency values.
6585
6586
6587 ---
6588
6589 * [HBASE-23177](https://issues.apache.org/jira/browse/HBASE-23177) | *Major* | **If fail to open reference because FNFE, make it plain it is a Reference**
6590
Changes the message on the FNFE exception thrown when the file a Reference points to is missing; the message now includes detail on the Reference as well as the pointed-to file, so that one can see how the FNFE relates to the region open.
6592
6593
6594 ---
6595
6596 * [HBASE-20626](https://issues.apache.org/jira/browse/HBASE-20626) | *Major* | **Change the value of "Requests Per Second" on WEBUI**
6597
6598 Use 'totalRowActionRequestCount' to calculate QPS on web UI.
6599
6600
6601 ---
6602
6603 * [HBASE-22874](https://issues.apache.org/jira/browse/HBASE-22874) | *Critical* | **Define a public interface for Canary and move existing implementation to LimitedPrivate**
6604
6605 <!-- markdown -->
6606 Downstream users who wish to programmatically check the health of their HBase cluster may now rely on a public interface derived from the previously private implementation of the canary cli tool. The interface is named `Canary` and can be found in the user facing javadocs.
6607
Downstream users who previously relied on invoking the canary via the Java classname (either on the command line or programmatically) will need to change how they do so because the non-public implementation has moved.
6609
6610
6611 ---
6612
6613 * [HBASE-23035](https://issues.apache.org/jira/browse/HBASE-23035) | *Major* | **Retain region to the last RegionServer make the failover slower**
6614
Since 2.0.0, when a regionserver crashed and came back online, the AssignmentManager would retain the region locations and try to assign the regions to that regionserver (same host:port as the crashed one) again. In 1.x.x, by contrast, the regions belonging to the crashed regionserver were assigned round-robin. This change replaces the "retain" assignment with round-robin assignment, matching the 1.x.x behavior. It makes failover faster and improves availability.
6616
6617
6618 ---
6619
6620 * [HBASE-23046](https://issues.apache.org/jira/browse/HBASE-23046) | *Minor* | **Remove compatibility case from truncate command**
6621
6622 Remove backward compatibility from \`truncate\` and \`truncate\_preserve\` shell commands. This means that these commands from HBase Clients are not compatible with pre-0.99 HBase clusters.
6623
6624
6625 ---
6626
6627 * [HBASE-23040](https://issues.apache.org/jira/browse/HBASE-23040) | *Minor* | **region mover gives NullPointerException instead of saying a host isn't in the cluster**
6628
Giving the region mover "unload" command a region server name that isn't recognized by the cluster results in an "I don't know about that host" message instead of an NPE.

Set the log level to DEBUG if you'd like the region mover to log the set of region server names it got back from the cluster.
6632
6633
6634 ---
6635
6636 * [HBASE-21874](https://issues.apache.org/jira/browse/HBASE-21874) | *Major* | **Bucket cache on Persistent memory**
6637
Added a new IOEngine type for BucketCache: persistent memory (pmem). To use BucketCache over pmem, configure the IOEngine as:

  \<property\>
    \<name\>hbase.bucketcache.ioengine\</name\>
    \<value\> pmem:///path in persistent memory \</value\>
  \</property\>
6643
6644
6645 ---
6646
6647 * [HBASE-22760](https://issues.apache.org/jira/browse/HBASE-22760) | *Major* | **Stop/Resume Snapshot Auto-Cleanup activity with shell command**
6648
By default, snapshot auto cleanup based on TTL is enabled for any new cluster. If snapshot cleanup needs to be stopped at any point, due to some snapshot restore activity or any other reason, it is advisable to disable it using the shell command:

hbase\> snapshot\_cleanup\_switch false

We can re-enable it using:

hbase\> snapshot\_cleanup\_switch true

We can query whether snapshot auto cleanup is enabled for the cluster using:

hbase\> snapshot\_cleanup\_enabled
6657
6658
6659 ---
6660
6661 * [HBASE-22796](https://issues.apache.org/jira/browse/HBASE-22796) | *Major* | **[HBCK2] Add fix of overlaps to fixMeta hbck Service**
6662
Adds fix of overlaps to the fixMeta hbck service method. Uses the bulk-merge facility. Merges a max of 10 at a time. Set hbase.master.metafixer.max.merge.count to a higher value if you want to merge more than 10 in one go.
6664
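A hedged sketch of raising the merge limit. The value 25 is hypothetical, and placing the property in the Master's hbase-site.xml is an assumption based on the `hbase.master.` prefix.

```xml
<!-- Sketch: allow fixMeta to merge up to 25 overlapping regions in one go.
     25 is a hypothetical value; the note states the limit is 10 by default. -->
<property>
  <name>hbase.master.metafixer.max.merge.count</name>
  <value>25</value>
</property>
```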
6665
6666 ---
6667
6668 * [HBASE-21745](https://issues.apache.org/jira/browse/HBASE-21745) | *Critical* | **Make HBCK2 be able to fix issues other than region assignment**
6669
6670 This issue adds via its subtasks:
6671
- An 'HBCK Report' page in the Master UI, added by HBASE-22527+HBASE-22709+HBASE-22723+ (since 2.1.6, 2.2.1, 2.3.0). It lists consistency issues or anomalies found via new hbase:meta consistency checking extensions added to CatalogJanitor (holes, overlaps, bad servers) and by a new 'HBCK chore' that runs less frequently and notes filesystem orphans and overlaps as well as the following conditions:
  - Master thought this region opened, but no regionserver reported it.
  - Master thought this region opened on Server1, but regionserver reported Server2.
  - More than one regionserver reported this region as opened.

  Both chores can be triggered from the shell to regenerate 'new' reports.
- Means of scheduling a ServerCrashProcedure (HBASE-21393).
- An 'offline' hbase:meta rebuild (HBASE-22680).
- Offline replace of hbase.version and hbase.id.
- Documentation on how to use the completebulkload tool to 'adopt' orphaned data found by the new HBCK2 'filesystem' check (see below) and 'HBCK chore' (HBASE-22859).
- A 'holes' and 'overlaps' fix that runs in the master and uses the new bulk-merge facility to collapse many overlaps in one go.
- The hbase-operator-tools HBCK2 client tool got a bunch of additions:
  - A specialized 'fix' for the case where operators ran the old hbck 'offlinemeta' repair and destroyed their hbase:meta; it ties together holes in meta with orphaned data in the fs (HBASE-22567).
  - A 'filesystem' command that reports on orphan data as well as bad references and hlinks, with a 'fix' for the latter two options (based on the hbck1 facility, updated).
  - Adds back the 'replication' fix facility from hbck1 (HBASE-22717).
6686
6687 The compound result is that hbck2 is now in excess of hbck1 abilities. The provided functionality is disaggregated as per the hbck2 philosophy of providing 'plumbing' rather than 'porcelain' so there is work to do still adding fix-it playbooks, scripting across outages, and automation.
6688
6689
6690 ---
6691
6692 * [HBASE-22802](https://issues.apache.org/jira/browse/HBASE-22802) | *Major* | **Avoid temp ByteBuffer allocation in FileIOEngine#read**
6693
HBASE-21879 introduced a utility class (org.apache.hadoop.hbase.io.ByteBuffAllocator) for allocating ByteBuffers from, and freeing them back to, an NIO ByteBuffer pool. When BucketCache is enabled with the file or mmap engine, this ByteBuffer pool is now used to avoid most temporary ByteBuffer allocations.
6695
6696
6697 ---
6698
6699 * [HBASE-11062](https://issues.apache.org/jira/browse/HBASE-11062) | *Major* | **hbtop**
6700
Introduces hbtop, a real-time monitoring tool for HBase modeled on Unix's top command. See the ref guide for details: https://hbase.apache.org/book.html#hbtop
6702
6703
6704 ---
6705
6706 * [HBASE-21879](https://issues.apache.org/jira/browse/HBASE-21879) | *Major* | **Read HFile's block to ByteBuffer directly instead of to byte for reducing young gc purpose**
6707
Before this issue, the read path was 100% offheap when the block was in the BucketCache. But on a cache miss, the RS needed to read the block via an on-heap API, which caused high young-GC pressure.

This issue adds reading the block offheap even when reading the block directly from the filesystem. It requires Hadoop version \>=2.9.3, but also works with older Hadoop versions (everything works, but blocks continue to be read onheap). It also requires HBASE-21946, which is not yet in place as of this writing/hbase-2.3.0.
6711
6712 We have written a careful doc about the implementation, performance and practice here: https://docs.google.com/document/d/1xSy9axGxafoH-Qc17zbD2Bd--rWjjI00xTWQZ8ZwI\_E/edit#heading=h.nch5d72p27ex
6713
6714
6715 ---
6716
6717 * [HBASE-22618](https://issues.apache.org/jira/browse/HBASE-22618) | *Major* | **added the possibility to load custom cost functions**
6718
6719 <!-- markdown -->
Extends `StochasticLoadBalancer` to support user-provided cost functions. These are loaded in addition to the default set of cost functions. Custom function implementations must extend `StochasticLoadBalancer$CostFunction`. Enable any additional functions by placing them on the master class path and configuring `hbase.master.balancer.stochastic.additionalCostFunctions` with a comma-separated list of fully-qualified class names.
6721
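A minimal configuration sketch for registering extra cost functions. The two class names are hypothetical placeholders for your own `StochasticLoadBalancer$CostFunction` subclasses, which would need to be on the master class path.

```xml
<!-- Sketch: load two custom cost functions in addition to the defaults.
     com.example.balancer.* are hypothetical class names used for illustration. -->
<property>
  <name>hbase.master.balancer.stochastic.additionalCostFunctions</name>
  <value>com.example.balancer.RackAffinityCostFunction,com.example.balancer.TenantSpreadCostFunction</value>
</property>
```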
6722
6723 ---
6724
6725 * [HBASE-22867](https://issues.apache.org/jira/browse/HBASE-22867) | *Critical* | **The ForkJoinPool in CleanerChore will spawn thousands of threads in our cluster with thousands table**
6726
Replaces the ForkJoinPool in CleanerChore with a ThreadPoolExecutor, which limits the number of spawned threads and avoids frequent GC on the master. The replacement is internal to CleanerChore, so no config keys change; users can upgrade the HBase master without any other change.
6728
6729
6730 ---
6731
6732 * [HBASE-22810](https://issues.apache.org/jira/browse/HBASE-22810) | *Major* | **Initialize an separate ThreadPoolExecutor for taking/restoring snapshot**
6733
Introduced a new config key for snapshot take/restore operations on the master side: hbase.master.executor.snapshot.threads. Its default value is 3, which means that 3 snapshot operations can run at the same time.
6735
6736
6737 ---
6738
6739 * [HBASE-22863](https://issues.apache.org/jira/browse/HBASE-22863) | *Major* | **Avoid Jackson versions and dependencies with known CVEs**
6740
1. Stopped exposing vulnerable Jackson1 dependencies so that downstream users do not pull them in from HBase.
2. However, since Hadoop requires some Jackson1 dependencies, the vulnerable Jackson mapper is kept at test scope in some HBase modules. As a result, the HBase tarball created by hbase-assembly still contains the Jackson1 mapper jar in lib, but downstream applications cannot pull in Jackson1 from HBase.
6743
6744
6745 ---
6746
6747 * [HBASE-22841](https://issues.apache.org/jira/browse/HBASE-22841) | *Major* | **TimeRange's factory functions do not support ranges, only \`allTime\` and \`at\`**
6748
Adds several APIs to the TimeRange class to avoid using the deprecated TimeRange constructor:
6750 \* TimeRange#from: Represents the time interval [minStamp, Long.MAX\_VALUE)
6751 \* TimeRange#until: Represents the time interval [0, maxStamp)
6752 \* TimeRange#between: Represents the time interval [minStamp, maxStamp)
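
The half-open intervals listed above can be sketched as follows. `HalfOpenRange` and its `contains` method are hypothetical stand-ins written for this example, not the HBase `TimeRange` class; only the factory names and interval bounds come from the note.

```java
// Hypothetical stand-in for illustration only; this is not the HBase TimeRange class.
final class HalfOpenRange {
    final long min; // inclusive lower bound
    final long max; // exclusive upper bound

    private HalfOpenRange(long min, long max) {
        if (min < 0 || max < min) {
            throw new IllegalArgumentException("invalid range [" + min + ", " + max + ")");
        }
        this.min = min;
        this.max = max;
    }

    // [minStamp, Long.MAX_VALUE)
    static HalfOpenRange from(long minStamp) {
        return new HalfOpenRange(minStamp, Long.MAX_VALUE);
    }

    // [0, maxStamp)
    static HalfOpenRange until(long maxStamp) {
        return new HalfOpenRange(0L, maxStamp);
    }

    // [minStamp, maxStamp)
    static HalfOpenRange between(long minStamp, long maxStamp) {
        return new HalfOpenRange(minStamp, maxStamp);
    }

    boolean contains(long timestamp) {
        return timestamp >= min && timestamp < max;
    }

    public static void main(String[] args) {
        HalfOpenRange range = between(100L, 200L);
        System.out.println(range.contains(100L)); // lower bound is included
        System.out.println(range.contains(200L)); // upper bound is excluded
    }
}
```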
6753
6754
6755 ---
6756
6757 * [HBASE-22833](https://issues.apache.org/jira/browse/HBASE-22833) | *Minor* | **MultiRowRangeFilter should provide a method for creating a filter which is functionally equivalent to multiple prefix filters**
6758
Provides a public constructor in the MultiRowRangeFilter class for filtering by multiple row prefixes. It expands the row prefixes into multiple rowkey ranges handled by MultiRowRangeFilter, which is more efficient than using multiple prefix filters.
```
public MultiRowRangeFilter(byte[][] rowKeyPrefixes);
```
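
One common way to turn a row prefix into a rowkey range is to use the prefix as the inclusive start row and the "next" prefix as the exclusive stop row. The sketch below shows that idea only; `PrefixRangeSketch` and `exclusiveStopRow` are hypothetical names, and the actual expansion inside MultiRowRangeFilter may differ in detail.

```java
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

// Hypothetical sketch of prefix-to-range expansion; not the HBase implementation.
final class PrefixRangeSketch {
    /**
     * Returns the smallest row key that sorts after every key starting with
     * the given prefix, or an empty array if no such key exists.
     */
    static byte[] exclusiveStopRow(byte[] prefix) {
        int end = prefix.length;
        // Trailing 0xFF bytes cannot be incremented, so drop them.
        while (end > 0 && prefix[end - 1] == (byte) 0xFF) {
            end--;
        }
        if (end == 0) {
            // Empty or all-0xFF prefix: the range is open-ended.
            return new byte[0];
        }
        byte[] stop = Arrays.copyOf(prefix, end);
        stop[end - 1]++;
        return stop;
    }

    public static void main(String[] args) {
        byte[] stop = exclusiveStopRow("ab".getBytes(StandardCharsets.US_ASCII));
        // Rows with prefix "ab" fall in the range ["ab", "ac").
        System.out.println(new String(stop, StandardCharsets.US_ASCII));
    }
}
```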
6763
6764
6765 ---
6766
6767 * [HBASE-22856](https://issues.apache.org/jira/browse/HBASE-22856) | *Major* | **HBASE-Find-Flaky-Tests fails with pip error**
6768
6769 Update the base docker image to ubuntu 18.04 for the find flaky tests jenkins job.
6770
6771
6772 ---
6773
6774 * [HBASE-22771](https://issues.apache.org/jira/browse/HBASE-22771) | *Major* | **[HBCK2] fixMeta method and server-side support**
6775
6776 Adds a fixMeta method to hbck Service. Fixes holes in hbase:meta. Follow-up to fix overlaps. See HBASE-22567 also.
6777
6778 Follow-on is adding a client-side to hbase-operator-tools that can exploit this new addition (HBASE-22825)
6779
6780
6781 ---
6782
6783 * [HBASE-22777](https://issues.apache.org/jira/browse/HBASE-22777) | *Major* | **Add a multi-region merge (for fixing overlaps, etc.)**
6784
6785 Changes merge so you can merge more than two regions at a time.  Currently only available inside HBase. HBASE-22827, a follow-on, is about exposing the facility in the Admin API (and then via the shell).
6786
6787
6788 ---
6789
6790 * [HBASE-15666](https://issues.apache.org/jira/browse/HBASE-15666) | *Critical* | **shaded dependencies for hbase-testing-util**
6791
6792 New shaded artifact for testing: hbase-shaded-testing-util.
6793
6794
6795 ---
6796
6797 * [HBASE-22776](https://issues.apache.org/jira/browse/HBASE-22776) | *Major* | **Rename config names in user scan snapshot feature**
6798
After HBASE-22776, the steps to configure the user scan snapshot feature are as follows:
1. Check HDFS configuration
2. Add master coprocessor:
    hbase.coprocessor.master.classes="org.apache.hadoop.hbase.security.access.AccessController,org.apache.hadoop.hbase.security.access.SnapshotScannerHDFSAclController"
3. Enable this feature:
    hbase.acl.sync.to.hdfs.enable=true
4. Modify table schema to enable this feature for a table:
    alter 't1', CONFIGURATION =\> {'hbase.acl.sync.to.hdfs.enable' =\> 'true'}
6809
6810
6811 ---
6812
6813 * [HBASE-22539](https://issues.apache.org/jira/browse/HBASE-22539) | *Blocker* | **WAL corruption due to early DBBs re-use when Durability.ASYNC\_WAL is used**
6814
We found a critical bug which can lead to WAL corruption when Durability.ASYNC\_WAL is used. The reason is that we release a ByteBuffer before actually persisting its content into the WAL file.

The problem may lead to several errors, for example, ArrayIndexOutOfBoundsException when replaying the WAL. This is because the ByteBuffer is reused by others.
6818
```
ERROR org.apache.hadoop.hbase.executor.EventHandler: Caught throwable while processing event RS_LOG_REPLAY
java.lang.ArrayIndexOutOfBoundsException: 18056
        at org.apache.hadoop.hbase.KeyValue.getFamilyLength(KeyValue.java:1365)
        at org.apache.hadoop.hbase.KeyValue.getFamilyLength(KeyValue.java:1358)
        at org.apache.hadoop.hbase.PrivateCellUtil.matchingFamily(PrivateCellUtil.java:735)
        at org.apache.hadoop.hbase.CellUtil.matchingFamily(CellUtil.java:816)
        at org.apache.hadoop.hbase.wal.WALEdit.isMetaEditFamily(WALEdit.java:143)
        at org.apache.hadoop.hbase.wal.WALEdit.isMetaEdit(WALEdit.java:148)
        at org.apache.hadoop.hbase.wal.WALSplitter.splitLogFile(WALSplitter.java:297)
        at org.apache.hadoop.hbase.wal.WALSplitter.splitLogFile(WALSplitter.java:195)
        at org.apache.hadoop.hbase.regionserver.SplitLogWorker$1.exec(SplitLogWorker.java:100)
```
6830
It may even cause a segmentation fault and crash the JVM directly. You will see a hs\_err\_pidXXX.log file, and usually the problem is SIGSEGV. This is usually because the ByteBuffer has already been returned to the OS and used for another purpose.

The problem has been reported several times in the past. This time Wellington Ramos Chevreuil provided the full logs and analyzed them deeply so that we could find the root cause, and Lijin Bin figured out that the problem may only happen when Durability.ASYNC\_WAL is used. Thanks to them.

The problem only affects the 2.x releases. All users are highly recommended to upgrade to a release that includes this fix, especially if you use Durability.ASYNC\_WAL.
6836
6837
6838 ---
6839
6840 * [HBASE-22737](https://issues.apache.org/jira/browse/HBASE-22737) | *Major* | **Add a new admin method and shell cmd to trigger the hbck chore to run**
6841
6842 Add a new method runHbckChore in Hbck interface and a new shell cmd hbck\_chore\_run to request HBCK chore to run at master side.
6843
6844
6845 ---
6846
6847 * [HBASE-22741](https://issues.apache.org/jira/browse/HBASE-22741) | *Major* | **Show catalogjanitor consistency complaints in new 'HBCK Report' page**
6848
6849 Adds a "CatalogJanitor hbase:meta Consistency Issues" section to the new 'HBCK Report' page added by HBASE-22709. This section is empty unless the most recent CatalogJanitor scan turned up problems. If so, will show table of issues found.
6850
6851
6852 ---
6853
6854 * [HBASE-22723](https://issues.apache.org/jira/browse/HBASE-22723) | *Major* | **Have CatalogJanitor report holes and overlaps; i.e. problems it sees when doing its regular scan of hbase:meta**
6855
6856 When CatalogJanitor runs, it now checks for holes, overlaps, empty info:regioninfo columns and bad servers. Dumps findings into log. Follow-up adds report to new 'HBCK Report' linked off the Master UI.
6857
6858 NOTE: All features but the badserver check made it into branch-2.1 and branch-2.0 backports.
6859
6860
6861 ---
6862
6863 * [HBASE-22714](https://issues.apache.org/jira/browse/HBASE-22714) | *Trivial* | **BuffferedMutatorParams opertationTimeOut() is misspelt**
6864
6865 The misspelled BufferedMutatorParams.opertationTimeout method has been marked as deprecated, and will be removed in 4.0.0. Please use the BufferedMutatorParams.operationTimeout method instead.
6866
6867
6868 ---
6869
6870 * [HBASE-22580](https://issues.apache.org/jira/browse/HBASE-22580) | *Major* | **Add a table attribute to make user scan snapshot feature configurable for table**
6871
If users need to scan snapshots of a table, configure the following table schema attribute so that granted users' ACLs are added to its HFiles:
6873 alter 't1', CONFIGURATION =\> {'hbase.user.scan.snapshot.enable' =\> 'true'}
6874
6875
6876 ---
6877
6878 * [HBASE-22709](https://issues.apache.org/jira/browse/HBASE-22709) | *Major* | **Add a chore thread in master to do hbck checking and display results in 'HBCK Report' page**
6879
6880 1. Add a new chore thread in master to do hbck checking
6881 2. Add a new web ui "HBCK Report" page to display checking results.
6882
This feature is enabled by default, and the hbck chore runs every 60 minutes by default. To disable the chore, set "hbase.master.hbck.checker.interval" to a value less than or equal to 0.
6884
6885 Notice: the config "hbase.master.hbck.checker.interval" was renamed to "hbase.master.hbck.chore.interval" in HBASE-22737.
6886
6887
6888 ---
6889
6890 * [HBASE-22578](https://issues.apache.org/jira/browse/HBASE-22578) | *Major* | **HFileCleaner should not delete empty ns/table directories used for user san snapshot feature**
6891
The HFileCleaner cleans empty directories under archive, but when the user scan snapshot feature is enabled, the user ACLs are set on these directories. Configure the following cleaner so that directories with user ACLs are not cleaned:
6893 hbase.master.hfilecleaner.plugins=org.apache.hadoop.hbase.security.access.SnapshotScannerHDFSAclCleaner
6894
6895
6896 ---
6897
6898 * [HBASE-22722](https://issues.apache.org/jira/browse/HBASE-22722) | *Blocker* | **Upgrade jackson databind dependencies to 2.9.9.1**
6899
6900 Upgrade jackson databind dependency to 2.9.9.1 due to CVEs
6901
6902 https://nvd.nist.gov/vuln/detail/CVE-2019-12814
6903
6904 https://nvd.nist.gov/vuln/detail/CVE-2019-12384
6905
6906
6907 ---
6908
6909 * [HBASE-22527](https://issues.apache.org/jira/browse/HBASE-22527) | *Major* | **[hbck2] Add a master web ui to show the problematic regions**
6910
Add a new master web UI to show potentially problematic opened regions. There are three cases:
1. The master thinks the region is open, but no regionserver reported it.
2. The master thinks the region is open on Server1, but a regionserver reported it on Server2.
3. More than one regionserver reported the region as open.
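
The three cases above can be sketched as a small classification over the master's view and the set of regionservers that reported the region. All names here (`RegionReportSketch`, `Problem`, `classify`) are hypothetical and exist only for this example; the master's real bookkeeping is not shown in this note.

```java
import java.util.Arrays;
import java.util.Collections;
import java.util.HashSet;
import java.util.Set;

// Hypothetical sketch of the three cases; not the HMaster implementation.
final class RegionReportSketch {
    enum Problem { NONE, NOT_REPORTED, WRONG_SERVER, MULTIPLE_SERVERS }

    /**
     * @param masterView server on which the master believes the region is open
     * @param reportedBy servers that reported the region as open
     */
    static Problem classify(String masterView, Set<String> reportedBy) {
        if (reportedBy.isEmpty()) {
            return Problem.NOT_REPORTED;      // case 1
        }
        if (reportedBy.size() > 1) {
            return Problem.MULTIPLE_SERVERS;  // case 3
        }
        return reportedBy.contains(masterView)
            ? Problem.NONE
            : Problem.WRONG_SERVER;           // case 2
    }

    public static void main(String[] args) {
        Set<String> both = new HashSet<>(Arrays.asList("Server1", "Server2"));
        System.out.println(classify("Server1", Collections.<String>emptySet()));
        System.out.println(classify("Server1", Collections.singleton("Server2")));
        System.out.println(classify("Server1", both));
    }
}
```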
6915
6916
6917 ---
6918
6919 * [HBASE-22648](https://issues.apache.org/jira/browse/HBASE-22648) | *Minor* | **Snapshot TTL**
6920
6921 Feature: Take a Snapshot With TTL for auto-cleanup
6922
6923 Attribute:
6924 1. TTL
6925      - Specify TTL in sec while creating snapshot. e.g. snapshot 'mytable', 'snapshot1234', {TTL =\> 86400}  (snapshot to be auto-cleaned after 24 hr)
6926
6927 Configs:
6928 1. Default Snapshot TTL:
6929      - FOREVER by default
6930      - User specified Default TTL(sec) with config: hbase.master.snapshot.ttl
6931
6932 2. If Snapshot cleanup is supposed to be stopped due to some snapshot restore activity, disable it with config:
6933      - hbase.master.cleaner.snapshot.disable: "true"
6934     With this config, HMaster needs restart just like any other hbase-site config.
6935
6936
6937 For more details, see the section "Take a Snapshot With TTL" in the HBase Reference Guide.
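
The expiry rule behind a TTL can be sketched as a simple time comparison. `SnapshotTtlSketch` and `isExpired` are hypothetical names, and treating a non-positive TTL as FOREVER is an assumption made for this example; the master's actual cleanup logic is described in the Reference Guide rather than here.

```java
import java.util.concurrent.TimeUnit;

// Hypothetical sketch of a TTL expiry check; not the HBase snapshot cleaner.
final class SnapshotTtlSketch {
    static boolean isExpired(long creationTimeMs, long ttlSeconds, long nowMs) {
        if (ttlSeconds <= 0) {
            return false; // assumption: a non-positive TTL means FOREVER
        }
        return nowMs - creationTimeMs > TimeUnit.SECONDS.toMillis(ttlSeconds);
    }

    public static void main(String[] args) {
        long created = 0L;
        long oneDayMs = TimeUnit.HOURS.toMillis(24);
        // With TTL => 86400, the snapshot becomes eligible just after 24 hours.
        System.out.println(isExpired(created, 86400L, oneDayMs));
        System.out.println(isExpired(created, 86400L, oneDayMs + 1));
    }
}
```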
6938
6939
6940 ---
6941
6942 * [HBASE-22610](https://issues.apache.org/jira/browse/HBASE-22610) | *Trivial* | **[BucketCache] Rename "hbase.offheapcache.minblocksize"**
6943
6944 The config point "hbase.offheapcache.minblocksize" was wrong and is now deprecated. The new config point is "hbase.blockcache.minblocksize".
6945
6946
6947 ---
6948
6949 * [HBASE-22690](https://issues.apache.org/jira/browse/HBASE-22690) | *Major* | **Deprecate / Remove OfflineMetaRepair in hbase-2+**
6950
6951 OfflineMetaRepair is no longer supported in HBase-2+. Please refer to https://hbase.apache.org/book.html#HBCK2
6952
6953 This tool is deprecated in 2.x and will be removed in 3.0.
6954
6955
6956 ---
6957
6958 * [HBASE-22673](https://issues.apache.org/jira/browse/HBASE-22673) | *Major* | **Avoid to expose protobuf stuff in Hbck interface**
6959
6960 Mark the Hbck#scheduleServerCrashProcedure(List\<HBaseProtos.ServerName\> serverNames) as deprecated. Use Hbck#scheduleServerCrashProcedures(List\<ServerName\> serverNames) instead.
6961
6962
6963 ---
6964
6965 * [HBASE-22617](https://issues.apache.org/jira/browse/HBASE-22617) | *Blocker* | **Recovered WAL directories not getting cleaned up**
6966
6967 In HBASE-20734 we moved the recovered.edits onto the wal file system but when constructing the directory we missed the BASE\_NAMESPACE\_DIR('data'). So when using the default config, you will find that there are lots of new directories at the same level with the 'data' directory.
6968
6969 In this issue, we add the BASE\_NAMESPACE\_DIR back, and also try our best to clean up the wrong directories. But we can only clean up the region level directories, so if you want a clean fs layout on HDFS you still need to manually delete the empty directories at the same level with 'data'.
6970
The affected versions are 2.2.0, 2.1.[1-5], 1.4.[8-10], 1.3.[3-5].
6972
6973
6974 ---
6975
6976 * [HBASE-21995](https://issues.apache.org/jira/browse/HBASE-21995) | *Major* | **Add a coprocessor to set HDFS ACL for hbase granted user**
6977
Adds a coprocessor that sets HDFS ACLs so that HBase users granted READ permission can scan snapshots.
6979 To use this feature, please make sure the HDFS config is set:
6980 dfs.namenode.acls.enabled=true
6981 fs.permissions.umask-mode=027
6982
6983 and set the HBase config:
6984 hbase.coprocessor.master.classes="org.apache.hadoop.hbase.security.access.AccessController,org.apache.hadoop.hbase.security.access.SnapshotScannerHDFSAclController"
6985 hbase.user.scan.snapshot.enable=true
6986
6987
6988 ---
6989
6990 * [HBASE-22596](https://issues.apache.org/jira/browse/HBASE-22596) | *Minor* | **[Chore] Separate the execution period between CompactionChecker and PeriodicMemStoreFlusher**
6991
6992 hbase.regionserver.compaction.check.period is used for controlling how often the compaction checker runs. If unset, will use hbase.server.thread.wakefrequency as default value.
6993
hbase.regionserver.flush.check.period is used for controlling how often the flush checker runs. If unset, will use hbase.server.thread.wakefrequency as default value.
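
The fallback described above can be sketched with plain `java.util.Properties`. `CheckPeriodSketch`, `resolvePeriod`, and the `fallbackDefaultMs` parameter are hypothetical names for this example; only the three configuration keys come from the note.

```java
import java.util.Properties;

// Hypothetical sketch of the fallback lookup; not the HRegionServer code.
final class CheckPeriodSketch {
    static long resolvePeriod(Properties conf, String specificKey, long fallbackDefaultMs) {
        // Start from the shared wake frequency, or the caller's default if it is unset.
        String wake = conf.getProperty("hbase.server.thread.wakefrequency");
        long fallback = wake != null ? Long.parseLong(wake) : fallbackDefaultMs;
        // The checker-specific key wins whenever it is set.
        String specific = conf.getProperty(specificKey);
        return specific != null ? Long.parseLong(specific) : fallback;
    }

    public static void main(String[] args) {
        Properties conf = new Properties();
        conf.setProperty("hbase.server.thread.wakefrequency", "5000");
        conf.setProperty("hbase.regionserver.compaction.check.period", "2000");
        System.out.println(resolvePeriod(conf, "hbase.regionserver.compaction.check.period", 10000L));
        System.out.println(resolvePeriod(conf, "hbase.regionserver.flush.check.period", 10000L));
    }
}
```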
6995
6996
6997 ---
6998
6999 * [HBASE-22588](https://issues.apache.org/jira/browse/HBASE-22588) | *Major* | **Upgrade jaxws-ri dependency to 2.3.2**
7000
7001 <!-- markdown -->
7002
When run with JDK11, HBase now uses a more recent version of the jaxws reference implementation (v2.3.2).
7004
7005
7006 ---
7007
7008 * [HBASE-21536](https://issues.apache.org/jira/browse/HBASE-21536) | *Trivial* | **Fix completebulkload usage instructions**
7009
7010 Added completebulkload short name for BulkLoadHFilesTool to bin/hbase.
7011
7012
7013 ---
7014
7015 * [HBASE-22500](https://issues.apache.org/jira/browse/HBASE-22500) | *Blocker* | **Modify pom and jenkins jobs for hadoop versions**
7016
Change the default hadoop-3 version to 3.1.2. Drop the support for the releases which are affected by CVE-2018-8029, see this email https://lists.apache.org/thread.html/3d6831c3893cd27b6850aea2feff7d536888286d588e703c6ffd2e82@%3Cuser.hadoop.apache.org%3E
7018
7019
7020 ---
7021
7022 * [HBASE-22459](https://issues.apache.org/jira/browse/HBASE-22459) | *Minor* | **Expose store reader reference count**
7023
7024 This change exposes the aggregate count of store reader references for a given store as 'storeRefCount' in region metrics and ClusterStatus.
7025
7026
7027 ---
7028
7029 * [HBASE-22469](https://issues.apache.org/jira/browse/HBASE-22469) | *Minor* | **replace md5 checksum in saveVersion script with sha512 for hbase version information**
7030
7031 The HBase "source checksum" now uses SHA512 instead of MD5.
7032
7033
7034 ---
7035
7036 * [HBASE-22148](https://issues.apache.org/jira/browse/HBASE-22148) | *Blocker* | **Provide an alternative to CellUtil.setTimestamp**
7037
7038 <!-- markdown -->
7039
7040 The `CellUtil.setTimestamp` method changes to be an API with audience `LimitedPrivate(COPROC)` in HBase 3.0. With that designation the API should remain stable within a given minor release line, but may change between minor releases.
7041
7042 Previously, this method was deprecated in HBase 2.0 for removal in HBase 3.0. Deprecation messages in HBase 2.y releases have been updated to indicate the expected API audience change.
7043
7044
7045 ---
7046
7047 * [HBASE-20782](https://issues.apache.org/jira/browse/HBASE-20782) | *Minor* | **Fix duplication of TestServletFilter.access**
7048
The access method was moved to the HttpServerFunctionalTest class as a common place.
7050
7051
7052 ---
7053
7054 * [HBASE-21991](https://issues.apache.org/jira/browse/HBASE-21991) | *Major* | **Fix MetaMetrics issues - [Race condition, Faulty remove logic], few improvements**
7055
7056 The class LossyCounting was unintentionally marked Public but was never intended to be part of our public API. This oversight has been corrected and LossyCounting is now marked as Private and going forward may be subject to additional breaking changes or removal without notice. If you have taken a dependency on this class we recommend cloning it locally into your project before upgrading to this release.
7057
7058
7059 ---
7060
7061 * [HBASE-22226](https://issues.apache.org/jira/browse/HBASE-22226) | *Trivial* | **Incorrect level for headings in asciidoc**
7062
7063 Warnings for level headings are corrected in the book for the HBase Incompatibilities section.
7064
7065
7066 ---
7067
7068 * [HBASE-20970](https://issues.apache.org/jira/browse/HBASE-20970) | *Major* | **Update hadoop check versions for hadoop3 in hbase-personality**
7069
Add hadoop 3.0.3, 3.1.1, 3.1.2 in our hadoop check jobs.
7071
7072
7073 ---
7074
7075 * [HBASE-21784](https://issues.apache.org/jira/browse/HBASE-21784) | *Major* | **Dump replication queue should show list of wal files ordered chronologically**
7076
7077 The DumpReplicationQueues tool will now list replication queues sorted in chronological order.
7078
7079
7080 ---
7081
7082 * [HBASE-21048](https://issues.apache.org/jira/browse/HBASE-21048) | *Major* | **Get LogLevel is not working from console in secure environment**
7083
7084 Support get\|set LogLevel in secure(kerberized) environment.
7085
7086
7087 ---
7088
7089 * [HBASE-22384](https://issues.apache.org/jira/browse/HBASE-22384) | *Minor* | **Formatting issues in administration section of book**
7090
Fixes a formatting issue in the administration section of the book, where listing indentation was a little bit off.
7092
7093
7094 ---
7095
7096 * [HBASE-22377](https://issues.apache.org/jira/browse/HBASE-22377) | *Major* | **Provide API to check the existence of a namespace which does not require ADMIN permissions**
7097
7098 This change adds the new method listNamespaces to the Admin interface, which can be used to retrieve a list of the namespaces present in the schema as an unprivileged operation. Formerly the only available method for accomplishing this was listNamespaceDescriptors, which requires GLOBAL CREATE or ADMIN permissions.
7099
7100
7101 ---
7102
7103 * [HBASE-22399](https://issues.apache.org/jira/browse/HBASE-22399) | *Major* | **Change default hadoop-two.version to 2.8.x and remove the 2.7.x hadoop checks**
7104
The default hadoop-two.version has been changed to 2.8.5, and Hadoop versions earlier than 2.8.2 are no longer supported.
7106
7107
7108 ---
7109
7110 * [HBASE-22392](https://issues.apache.org/jira/browse/HBASE-22392) | *Trivial* | **Remove extra/useless +**
7111
7112 Removed extra + in HRegion, HStore and LoadIncrementalHFiles for branch-2 and HRegion and HStore for branch-1.
7113
7114
7115 ---
7116
7117 * [HBASE-20494](https://issues.apache.org/jira/browse/HBASE-20494) | *Major* | **Upgrade com.yammer.metrics dependency**
7118
7119 Updated metrics core from 3.2.1 to 3.2.6.
7120
7121
7122 ---
7123
7124 * [HBASE-22358](https://issues.apache.org/jira/browse/HBASE-22358) | *Minor* | **Change rubocop configuration for method length**
7125
7126 The rubocop definition for the maximum method length was set to 75.
7127
7128
7129 ---
7130
7131 * [HBASE-22379](https://issues.apache.org/jira/browse/HBASE-22379) | *Minor* | **Fix Markdown for "Voting on Release Candidates" in book**
7132
7133 Fixes the formatting of the "Voting on Release Candidates" to actually show the quote and code formatting of the RAT check.
7134
7135
7136 ---
7137
7138 * [HBASE-20851](https://issues.apache.org/jira/browse/HBASE-20851) | *Minor* | **Change rubocop config for max line length of 100**
7139
7140 The rubocop configuration in the hbase-shell module now allows a line length with 100 characters, instead of 80 as before. For everything before 2.1.5 this change introduces rubocop itself.
7141
7142
7143 ---
7144
7145 * [HBASE-22301](https://issues.apache.org/jira/browse/HBASE-22301) | *Minor* | **Consider rolling the WAL if the HDFS write pipeline is slow**
7146
7147 This change adds new conditions for rolling the WAL for when syncs on the HDFS writer pipeline are perceived to be slow.
7148
7149 As before the configuration parameter hbase.regionserver.wal.slowsync.ms sets the slow sync warning threshold.
7150
7151 If we encounter hbase.regionserver.wal.slowsync.roll.threshold number of slow syncs (default 100) within the interval defined by hbase.regionserver.wal.slowsync.roll.interval.ms (default 1 minute), we will request a WAL roll.
7152
7153 Or, if the time for any sync exceeds the threshold set by hbase.regionserver.wal.roll.on.sync.ms (default 10 seconds) we will request a WAL roll immediately.
7154
7155 Operators can monitor how often these new thresholds result in a WAL roll by looking at newly added metrics to the WAL related metric group:
7156 \* slowSyncRollRequest - How many times a roll was requested due to sync too slow on the write pipeline.
7157
7158 Additionally, as a part of this change there are also additional metrics for existing reasons for a WAL roll:
7159 \* errorRollRequest - How many times a roll was requested due to I/O or other errors.
7160 \* sizeRollRequest - How many times a roll was requested due to file size roll threshold.
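
The two roll conditions described above can be sketched as a small decision helper. `SlowSyncRollSketch` and `onSync` are hypothetical names, the window handling is a simplification, and the threshold values passed in `main` are illustrative; this is not the actual WAL implementation.

```java
// Hypothetical sketch of the slow-sync roll decision; not the HBase WAL code.
final class SlowSyncRollSketch {
    private final long slowSyncMs;     // cf. hbase.regionserver.wal.slowsync.ms
    private final int rollThreshold;   // cf. hbase.regionserver.wal.slowsync.roll.threshold
    private final long rollIntervalMs; // cf. hbase.regionserver.wal.slowsync.roll.interval.ms
    private final long rollOnSyncMs;   // cf. hbase.regionserver.wal.roll.on.sync.ms

    private long windowStartMs;
    private int slowSyncsInWindow;

    SlowSyncRollSketch(long slowSyncMs, int rollThreshold, long rollIntervalMs, long rollOnSyncMs) {
        this.slowSyncMs = slowSyncMs;
        this.rollThreshold = rollThreshold;
        this.rollIntervalMs = rollIntervalMs;
        this.rollOnSyncMs = rollOnSyncMs;
    }

    /** Records one sync and returns true if a WAL roll should be requested. */
    boolean onSync(long nowMs, long syncDurationMs) {
        if (syncDurationMs > rollOnSyncMs) {
            return true; // a single very slow sync requests a roll immediately
        }
        if (nowMs - windowStartMs > rollIntervalMs) {
            // The interval elapsed: start counting slow syncs afresh.
            windowStartMs = nowMs;
            slowSyncsInWindow = 0;
        }
        if (syncDurationMs > slowSyncMs) {
            slowSyncsInWindow++;
        }
        if (slowSyncsInWindow >= rollThreshold) {
            windowStartMs = nowMs;
            slowSyncsInWindow = 0;
            return true;
        }
        return false;
    }

    public static void main(String[] args) {
        // Illustrative values: 100 ms slow-sync mark, 3 slow syncs per minute, 10 s hard limit.
        SlowSyncRollSketch wal = new SlowSyncRollSketch(100L, 3, 60_000L, 10_000L);
        System.out.println(wal.onSync(1_000L, 150L));
        System.out.println(wal.onSync(2_000L, 150L));
        System.out.println(wal.onSync(3_000L, 150L));
    }
}
```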
7161
7162
7163 ---
7164
7165 * [HBASE-21883](https://issues.apache.org/jira/browse/HBASE-21883) | *Minor* | **Enhancements to Major Compaction tool**
7166
The MajorCompactorTTL tool compacts all regions in a table that have been TTLed out. This saves space on DFS and is useful for tables that hold time-series-like data. It is typically scheduled to run frequently (say, via cron) to clean up old data on an ongoing basis.
7168
7169 RSGroupMajorCompactionTTL tool is similar to MajorCompactorTTL but runs at a region server group level. If multiple tables in an rsgroup are similar to time-series data, then it runs a single command to clean them up. As more tables are added/removed from rsgroup, it's easy to have a single command to take care of all of them.
7170
7171
7172 ---
7173
7174 * [HBASE-22054](https://issues.apache.org/jira/browse/HBASE-22054) | *Minor* | **Space Quota: Compaction is not working for super user in case of NO\_WRITES\_COMPACTIONS**
7175
7176 This change allows the system and superusers to initiate compactions, even when a space quota violation policy disallows compactions from happening. The original intent behind disallowing of compactions was to prevent end-user compactions from creating undue I/O load, not disallowing \*any\* compaction in the system.
7177
7178
7179 ---
7180
7181 * [HBASE-22083](https://issues.apache.org/jira/browse/HBASE-22083) | *Minor* | **move eclipse specific configs into a profile**
7182
7183 <!-- markdown -->
7184 Maven project integration for Eclipse has been isolated into a maven profile to ensure it only is active when in an Eclipse project.
7185
7186 Things should continue to behave the same for Eclipse users. If something should go wrong folks should manually activate the `eclipse-specific` profile.
7187
7188
7189 ---
7190
7191 * [HBASE-22307](https://issues.apache.org/jira/browse/HBASE-22307) | *Major* | **Deprecated Preemptive Fail Fast**
7192
Deprecated the Preemptive Fail Fast related constants in HConstants. Support for this feature will be removed in 3.0.0, so using these constants will have no effect in 3.0.0+ releases. The constants will be kept until 4.0.0.
7194
7195 Users can use 'hbase.client.perserver.requests.threshold' to control the number of concurrent requests to the same region server. Please see the release note of HBASE-16388 for more details.
7196
7197
7198 ---
7199
7200 * [HBASE-22292](https://issues.apache.org/jira/browse/HBASE-22292) | *Blocker* | **PreemptiveFastFailInterceptor clean repeatedFailuresMap issue**
7201
7202 Adds new configuration hbase.client.failure.map.cleanup.interval which defaults to ten minutes.
7203
7204
7205 ---
7206
7207 * [HBASE-19222](https://issues.apache.org/jira/browse/HBASE-19222) | *Major* | **update jruby to 9.1.17.0**
7208
7209 <!-- markdown -->
7210
7211 The default version of JRuby shipped with HBase has been updated to the JRuby 9.1.17.0 release.
7212
7213 For details on changes see [the release notes for JRuby 9.1.17.0](https://www.jruby.org/2018/04/23/jruby-9-1-17-0)
7214
7215
7216 ---
7217
7218 * [HBASE-22279](https://issues.apache.org/jira/browse/HBASE-22279) | *Major* | **Add a getRegionLocator method in Table/AsyncTable interface**
7219
7220 Add below method in Table interface:
7221
7222 RegionLocator getRegionLocator() throws IOException;
7223
7224 Add below methods in AsyncTable interface:
7225
7226 AsyncTableRegionLocator getRegionLocator();
7227 CompletableFuture\<TableDescriptor\> getDescriptor();
7228
7229
7230 ---
7231
7232 * [HBASE-15560](https://issues.apache.org/jira/browse/HBASE-15560) | *Major* | **TinyLFU-based BlockCache**
7233
7234 LruBlockCache uses the Segmented LRU (SLRU) policy to capture frequency and recency of the working set. It achieves concurrency by using an O(n) background thread to prioritize the entries and evict. Accessing an entry is O(1) by a hash table lookup, recording its logical access time, and setting a frequency flag. A write is performed in O(1) time by updating the hash table and triggering an async eviction thread. This provides ideal concurrency and minimizes the latencies by penalizing the thread instead of the caller. However the policy does not age the frequencies and may not be resilient to various workload patterns.
7235
This change introduces a new L1 policy, TinyLfuBlockCache, which records the frequency in a counting sketch, ages periodically by halving the counters, and orders entries by SLRU. An entry is discarded by comparing the frequency of the new arrival to the SLRU's victim, and keeping the one with the highest frequency. This allows the operations to be performed in O(1) time and, through the use of a compact sketch, a much larger history is retained beyond the current working set. In a variety of real world traces the policy had near optimal hit rates.
7237
7238 New configuration variable hfile.block.cache.policy sets the eviction policy for the L1 block cache. The default is "LRU" (LruBlockCache). Set to "TinyLFU" to use TinyLfuBlockCache instead.
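
The admission idea described above (count frequencies, age by halving, keep whichever of the new arrival and the victim is more frequent) can be sketched in a few lines. This toy uses one counter per hash slot rather than a real counting sketch, and `TinyLfuAdmissionSketch` with its methods and the tie-breaking rule are assumptions for illustration; it is not TinyLfuBlockCache.

```java
// Hypothetical toy model of TinyLFU admission; not TinyLfuBlockCache.
final class TinyLfuAdmissionSketch {
    private final int[] counters;
    private final int sampleSize;
    private int recorded;

    TinyLfuAdmissionSketch(int counterCount, int sampleSize) {
        this.counters = new int[counterCount];
        this.sampleSize = sampleSize;
    }

    private int slot(Object key) {
        // Simplification: a single hash slot per key, so collisions inflate counts.
        return Math.floorMod(key.hashCode(), counters.length);
    }

    void recordAccess(Object key) {
        counters[slot(key)]++;
        if (++recorded >= sampleSize) {
            // Age the history by halving every counter.
            for (int i = 0; i < counters.length; i++) {
                counters[i] >>>= 1;
            }
            recorded = 0;
        }
    }

    int frequency(Object key) {
        return counters[slot(key)];
    }

    /** Admits the candidate only if it was seen more often than the victim (ties keep the victim). */
    boolean admit(Object candidate, Object victim) {
        return frequency(candidate) > frequency(victim);
    }

    public static void main(String[] args) {
        TinyLfuAdmissionSketch sketch = new TinyLfuAdmissionSketch(64, 1000);
        for (int i = 0; i < 3; i++) {
            sketch.recordAccess(1); // block 1 is accessed three times
        }
        sketch.recordAccess(2);     // block 2 is accessed once
        System.out.println(sketch.admit(1, 2));
        System.out.println(sketch.admit(2, 1));
    }
}
```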
7239
7240
7241 ---
7242
7243 * [HBASE-22178](https://issues.apache.org/jira/browse/HBASE-22178) | *Major* | **Introduce a createTableAsync with TableDescriptor method in Admin**
7244
7245 Introduced
7246
7247 Future\<Void\> createTableAsync(TableDescriptor);
7248
7249
7250 ---
7251
7252 * [HBASE-22108](https://issues.apache.org/jira/browse/HBASE-22108) | *Major* | **Avoid passing null in Admin methods**
7253
7254 Introduced these methods:
7255 void move(byte[]);
7256 void move(byte[], ServerName);
7257 Future\<Void\> splitRegionAsync(byte[]);
7258
7259 These methods are deprecated:
7260 void move(byte[], byte[])
7261
7262
7263 ---
7264
7265 * [HBASE-22152](https://issues.apache.org/jira/browse/HBASE-22152) | *Major* | **Create a jenkins file for yetus to processing GitHub PR**
7266
7267 Add a new jenkins file for running pre commit check for GitHub PR.
7268
7269
7270 ---
7271
7272 * [HBASE-22007](https://issues.apache.org/jira/browse/HBASE-22007) | *Major* | **Add restoreSnapshot and cloneSnapshot with acl methods in AsyncAdmin**
7273
7274 Add cloneSnapshot/restoreSnapshot with acl methods in AsyncAdmin.
7275
7276
7277 ---
7278
7279 * [HBASE-22123](https://issues.apache.org/jira/browse/HBASE-22123) | *Minor* | **REST gateway reports Insufficient permissions exceptions as 404 Not Found**
7280
When permissions are insufficient, you now get:
7282
7283 HTTP/1.1 403 Forbidden
7284
7285 on the HTTP side, and in the message
7286
7287 Forbidden
org.apache.hadoop.hbase.security.AccessDeniedException: org.apache.hadoop.hbase.security.AccessDeniedException: Insufficient permissions for user 'myuser',action: get, tableName:mytable, family:cf.
7289 at org.apache.ranger.authorization.hbase.RangerAuthorizationCoprocessor.authorizeAccess(RangerAuthorizationCoprocessor.java:547)
7290 and the rest of the ADE stack
7291
7292
7293 ---
7294
7295 * [HBASE-22100](https://issues.apache.org/jira/browse/HBASE-22100) | *Minor* | **False positive for error prone warnings in pre commit job**
7296
Now we sort the javac WARNING/ERROR lines before generating the diff in pre-commit so that we get stable output for error prone. The downside is that the output is sorted lexicographically, so line numbers are also sorted lexicographically, which looks a bit strange to humans.
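
A small illustration of why lexicographic ordering of line numbers looks odd: the strings below are made-up warning lines (the `Foo.java:N: warning` format is an assumption for this example, not real javac output), sorted as plain strings.

```java
import java.util.Arrays;

// Illustrative only; the warning line format is hypothetical.
final class LexicographicSortSketch {
    static String[] sorted(String... lines) {
        String[] copy = lines.clone();
        Arrays.sort(copy); // plain String ordering, character by character
        return copy;
    }

    public static void main(String[] args) {
        String[] result = sorted(
            "Foo.java:9: warning",
            "Foo.java:10: warning",
            "Foo.java:100: warning");
        for (String line : result) {
            System.out.println(line);
        }
    }
}
```

Because the comparison is character by character, line 9 ends up after lines 10 and 100, which is stable across runs but not numeric order.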
7298
7299
7300 ---
7301
7302 * [HBASE-22057](https://issues.apache.org/jira/browse/HBASE-22057) | *Major* | **Impose upper-bound on size of ZK ops sent in a single multi()**
7303
7304 Exposes a new configuration property "zookeeper.multi.max.size" which dictates the maximum size of deletes that HBase will make to ZooKeeper in a single RPC. This property defaults to 1MB, which should fall beneath the default ZooKeeper limit of 2MB, controlled by "jute.maxbuffer".
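
The effect of a size cap can be sketched as splitting a list of operations into batches whose estimated size stays under the limit. `ZkMultiBatchSketch` and `partition` are hypothetical names, and using the UTF-8 length of each path as the size estimate is an assumption for this example; how HBase actually estimates the size of each operation is not described in this note.

```java
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical sketch of size-capped batching; not the HBase ZooKeeper utility code.
final class ZkMultiBatchSketch {
    static List<List<String>> partition(List<String> paths, int maxBytes) {
        List<List<String>> batches = new ArrayList<>();
        List<String> current = new ArrayList<>();
        int currentBytes = 0;
        for (String path : paths) {
            // Assumption: the path length stands in for the size of the operation.
            int size = path.getBytes(StandardCharsets.UTF_8).length;
            if (!current.isEmpty() && currentBytes + size > maxBytes) {
                batches.add(current);
                current = new ArrayList<>();
                currentBytes = 0;
            }
            // An operation larger than the cap still gets a batch of its own.
            current.add(path);
            currentBytes += size;
        }
        if (!current.isEmpty()) {
            batches.add(current);
        }
        return batches;
    }

    public static void main(String[] args) {
        System.out.println(partition(Arrays.asList("/a", "/b", "/c"), 4));
    }
}
```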
7305
7306
7307 ---
7308
7309 * [HBASE-22052](https://issues.apache.org/jira/browse/HBASE-22052) | *Major* | **pom cleaning; filter out jersey-core in hadoop2 to match hadoop3 and remove redunant version specifications**
7310
7311 <!-- markdown -->
7312 Fixed awkward dependency issue that prevented site building.
7313
7314 #### note specific to HBase 2.1.4
7315 HBase 2.1.4 shipped with an early version of this fix that incorrectly altered the libraries included in our binary assembly for using Apache Hadoop 2.7 (the current build default Hadoop version for 2.1.z). For folks running out of the box against a Hadoop 2.7 cluster (or folks who skip the installation step of [replacing the bundled Hadoop libraries](http://hbase.apache.org/book.html#hadoop)) this will result in a failure at Region Server startup due to a missing class definition. e.g.:
7316 ```
7317 2019-03-27 09:02:05,779 ERROR [main] regionserver.HRegionServer: Failed construction RegionServer
7318 java.lang.NoClassDefFoundError: org/apache/htrace/SamplerBuilder
7319         at org.apache.hadoop.hdfs.DFSClient.<init>(DFSClient.java:644)
7320         at org.apache.hadoop.hdfs.DFSClient.<init>(DFSClient.java:628)
7321         at org.apache.hadoop.hdfs.DistributedFileSystem.initialize(DistributedFileSystem.java:149)
7322         at org.apache.hadoop.fs.FileSystem.createFileSystem(FileSystem.java:2667)
7323         at org.apache.hadoop.fs.FileSystem.access$200(FileSystem.java:93)
7324         at org.apache.hadoop.fs.FileSystem$Cache.getInternal(FileSystem.java:2701)
7325         at org.apache.hadoop.fs.FileSystem$Cache.get(FileSystem.java:2683)
7326         at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:372)
7327         at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:171)
7328         at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:356)
7329         at org.apache.hadoop.fs.Path.getFileSystem(Path.java:295)
7330         at org.apache.hadoop.hbase.util.CommonFSUtils.getRootDir(CommonFSUtils.java:362)
7331         at org.apache.hadoop.hbase.util.CommonFSUtils.isValidWALRootDir(CommonFSUtils.java:411)
7332         at org.apache.hadoop.hbase.util.CommonFSUtils.getWALRootDir(CommonFSUtils.java:387)
7333         at org.apache.hadoop.hbase.regionserver.HRegionServer.initializeFileSystem(HRegionServer.java:704)
7334         at org.apache.hadoop.hbase.regionserver.HRegionServer.<init>(HRegionServer.java:613)
7335         at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
7336         at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:62)
7337         at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)
7338         at java.lang.reflect.Constructor.newInstance(Constructor.java:423)
7339         at org.apache.hadoop.hbase.regionserver.HRegionServer.constructRegionServer(HRegionServer.java:3029)
7340         at org.apache.hadoop.hbase.regionserver.HRegionServerCommandLine.start(HRegionServerCommandLine.java:63)
7341         at org.apache.hadoop.hbase.regionserver.HRegionServerCommandLine.run(HRegionServerCommandLine.java:87)
7342         at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
7343         at org.apache.hadoop.hbase.util.ServerCommandLine.doMain(ServerCommandLine.java:149)
7344         at org.apache.hadoop.hbase.regionserver.HRegionServer.main(HRegionServer.java:3047)
7345 Caused by: java.lang.ClassNotFoundException: org.apache.htrace.SamplerBuilder
7346         at java.net.URLClassLoader.findClass(URLClassLoader.java:381)
7347         at java.lang.ClassLoader.loadClass(ClassLoader.java:424)
7348         at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:349)
7349         at java.lang.ClassLoader.loadClass(ClassLoader.java:357)
7350         ... 26 more
7351
7352 ```
7353
7354 Workaround via any _one_ of the following:
7355 * If you are running against a Hadoop cluster that is 2.8+, ensure you replace the Hadoop libraries in the default binary assembly with those for your version.
7356 * If you are running against a Hadoop cluster that is 2.8+, build the binary assembly from the source release while specifying your Hadoop version.
7357 * If you are running against a Hadoop cluster that is a supported 2.7 release, ensure the `hadoop` executable is in the `PATH` seen at Region Server startup and that you are not using the `HBASE_DISABLE_HADOOP_CLASSPATH_LOOKUP` bypass.
7358 * For any supported Hadoop version, manually make the Apache HTrace artifact `htrace-core-3.1.0-incubating.jar` available to all Region Servers via the HBASE_CLASSPATH environment variable.
7359 * For any supported Hadoop version, manually make the Apache HTrace artifact `htrace-core-3.1.0-incubating.jar` available to all Region Servers by copying it into the directory `${HBASE_HOME}/lib/client-facing-thirdparty/`.
7360
7361
7362 ---
7363
7364 * [HBASE-22065](https://issues.apache.org/jira/browse/HBASE-22065) | *Major* | **Add listTableDescriptors(List\<TableName\>) method in AsyncAdmin**
7365
7366 Add a listTableDescriptors(List\<TableName\>) method in the AsyncAdmin interface, to align with the Admin interface.
7367
7368
7369 ---
7370
7371 * [HBASE-22063](https://issues.apache.org/jira/browse/HBASE-22063) | *Major* | **Deprecated Admin.deleteSnapshot(byte[])**
7372
7373 Deprecate Admin.deleteSnapshot(byte[]); please use the String version instead.
7374
7375
7376 ---
7377
7378 * [HBASE-22040](https://issues.apache.org/jira/browse/HBASE-22040) | *Major* | **Add mergeRegionsAsync with a List of region names method in AsyncAdmin**
7379
7380 Add a mergeRegionsAsync(byte[][], boolean) method in the AsyncAdmin interface.
7381
7382 Instead of using assert, the client side now throws IllegalArgumentException when you try to merge fewer than 2 regions. Likewise, instead of using assert, the master side now throws DoNotRetryIOException when you try to merge more than 2 regions, since only merging two regions at once is supported for now.
7383
7384
7385 ---
7386
7387 * [HBASE-22039](https://issues.apache.org/jira/browse/HBASE-22039) | *Major* | **Should add the synchronous parameter for the XXXSwitch method in AsyncAdmin**
7388
7389 Add a drainXXX parameter to the balancerSwitch/splitSwitch/mergeSwitch methods in the AsyncAdmin interface, which has the same meaning as the synchronous parameter for these methods in the Admin interface.
7390
7391
7392 ---
7393
7394 * [HBASE-22044](https://issues.apache.org/jira/browse/HBASE-22044) | *Major* | **ByteBufferUtils should not be IA.Public API**
7395
7396 <!-- markdown -->
7397
7398 As of HBase 3.0, the ByteBufferUtils class is now marked as a Private API for internal project use only. Downstream users are advised that it no longer has any compatibility promises across releases.
7399
7400 In earlier HBase release lines, the class is marked as deprecated to call attention to this planned transition.
7401
7402
7403 ---
7404
7405 * [HBASE-21810](https://issues.apache.org/jira/browse/HBASE-21810) | *Major* | **bulkload  support set hfile compression on client**
7406
7407 Bulk load (HFileOutputFormat2) now supports configuring the compression on the client. Set the job configuration "hbase.mapreduce.hfileoutputformat.compression" to override the auto-detection of the target table's compression.
7408
7409
7410 ---
7411
7412 * [HBASE-22001](https://issues.apache.org/jira/browse/HBASE-22001) | *Major* | **Polish the Admin interface**
7413
7414 Add a cloneSnapshotAsync method with a restoreAcl parameter.
7415 Deprecate the restoreSnapshotAsync method, as it just ignores the failsafe configuration.
7416 Make the snapshotAsync method return a Future\<Void\>.
7417 Deprecate the snapshot-related methods which take a 'byte[]' as the snapshot name.
7418 Use default methods to reduce the code base for implementation classes.
7419
7420
7421 ---
7422
7423 * [HBASE-22000](https://issues.apache.org/jira/browse/HBASE-22000) | *Major* | **Deprecated isTableAvailable with splitKeys**
7424
7425 Deprecated AsyncTable.isTableAvailable(TableName, byte[][]).
7426
7427
7428 ---
7429
7430 * [HBASE-21871](https://issues.apache.org/jira/browse/HBASE-21871) | *Major* | **Support to specify a peer table name in VerifyReplication tool**
7431
7432 After HBASE-21871, we can specify a peer table name with --peerTableName in VerifyReplication tool like the following:
7433 hbase org.apache.hadoop.hbase.mapreduce.replication.VerifyReplication --peerTableName=peerTable 5 TestTable
7434
7435 In addition, we can compare any 2 tables in any remote clusters by specifying both peerId and --peerTableName.
7436
7437 For example:
7438 hbase org.apache.hadoop.hbase.mapreduce.replication.VerifyReplication --peerTableName=peerTable zk1,zk2,zk3:2181/hbase TestTable
7439
7440
7441 ---
7442
7443 * [HBASE-15728](https://issues.apache.org/jira/browse/HBASE-15728) | *Major* | **Add remaining per-table region / store / flush / compaction related metrics**
7444
7445 Adds the following flush, split, and compaction metrics:
7446
7447  +  // split related metrics
7448  +  private MutableFastCounter splitRequest;
7449  +  private MutableFastCounter splitSuccess;
7450  +  private MetricHistogram splitTimeHisto;
7451  +
7452  +  // flush related metrics
7453  +  private MetricHistogram flushTimeHisto;
7454  +  private MetricHistogram flushMemstoreSizeHisto;
7455  +  private MetricHistogram flushOutputSizeHisto;
7456  +  private MutableFastCounter flushedMemstoreBytes;
7457  +  private MutableFastCounter flushedOutputBytes;
7458  +
7459  +  // compaction related metrics
7460  +  private MetricHistogram compactionTimeHisto;
7461  +  private MetricHistogram compactionInputFileCountHisto;
7462  +  private MetricHistogram compactionInputSizeHisto;
7463  +  private MetricHistogram compactionOutputFileCountHisto;
7464  +  private MetricHistogram compactionOutputSizeHisto;
7465  +  private MutableFastCounter compactedInputBytes;
7466  +  private MutableFastCounter compactedOutputBytes;
7467  +
7468  +  private MetricHistogram majorCompactionTimeHisto;
7469  +  private MetricHistogram majorCompactionInputFileCountHisto;
7470  +  private MetricHistogram majorCompactionInputSizeHisto;
7471  +  private MetricHistogram majorCompactionOutputFileCountHisto;
7472  +  private MetricHistogram majorCompactionOutputSizeHisto;
7473  +  private MutableFastCounter majorCompactedInputBytes;
7474  +  private MutableFastCounter majorCompactedOutputBytes;
7475
7476
7477 ---
7478
7479 * [HBASE-21481](https://issues.apache.org/jira/browse/HBASE-21481) | *Major* | **[acl] Superuser's permissions should not be granted or revoked by any non-su global admin**
7480
7481 HBASE-21481 improves the quality of access control by strengthening the protection of super users' privileges.
7482
7483
7484 ---
7485
7486 * [HBASE-21082](https://issues.apache.org/jira/browse/HBASE-21082) | *Critical* | **Reimplement assign/unassign related procedure metrics**
7487
7488 There are now four types of RIT procedure metrics: assign, unassign, move, and reopen. The meaning of assign/unassign has changed, as moving a region no longer increases the unassign metric and then the assign metric.
7489 Two new procedure metrics, open and close, are also introduced to track the open/close region calls to the region server. Open/close may be sent multiple times to finish a RIT, since the calls may be retried.
7490
7491
7492 ---
7493
7494 * [HBASE-20724](https://issues.apache.org/jira/browse/HBASE-20724) | *Critical* | **Sometimes some compacted storefiles are still opened after region failover**
7495
7496 Problem: This is an old problem dating back to HBASE-2231. The compaction event marker was only written to the WAL. But after a flush, the WAL may be archived, which means a useful compaction event marker may be deleted, too. So the compacted store files cannot be archived when the region opens and replays the WAL.
7497
7498 Solution: After this jira, the compaction event tracker is written to the HFile. When a region opens and loads store files, it reads the compaction event tracker from the HFile and archives the compacted store files which still exist.
7499
7500
7501 ---
7502
7503 * [HBASE-21820](https://issues.apache.org/jira/browse/HBASE-21820) | *Major* | **Implement CLUSTER quota scope**
7504
7505 HBase contains two quota scopes: MACHINE and CLUSTER. Before this patch, set quota operations did not expose the scope option to the client API and used MACHINE as the default, so CLUSTER scope could not be set or used.
7506 Shell commands are as follows:
7507 set\_quota TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec'
7508
7509 This issue implements CLUSTER scope in a simple way: for user, namespace, and user over namespace quotas, use [ClusterLimit / RSNum] as the machine limit. For table and user over table quotas, use [ClusterLimit / TotalTableRegionNum \* MachineTableRegionNum] as the machine limit.
7510 After this patch, users can set a CLUSTER scope quota, but MACHINE is still the default if the scope is omitted.
7511 Shell commands are as follows:
7512 set\_quota TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec'
7513 set\_quota TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec', SCOPE =\> MACHINE
7514 set\_quota TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec', SCOPE =\> CLUSTER
7515
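As a worked illustration of the two formulas for deriving a machine limit from a CLUSTER scope limit, consider the sketch below. The function and parameter names are hypothetical and chosen only to mirror the bracketed expressions in this note; this is not the HBase implementation.

```python
def machine_limit_for_user_or_namespace(cluster_limit, region_server_count):
    """[ClusterLimit / RSNum]: split the cluster limit evenly across region servers."""
    return cluster_limit / region_server_count


def machine_limit_for_table(cluster_limit, total_table_regions, machine_table_regions):
    """[ClusterLimit / TotalTableRegionNum * MachineTableRegionNum]:
    share the cluster limit in proportion to the table regions hosted locally."""
    return cluster_limit / total_table_regions * machine_table_regions
```

For example, a cluster limit of 100 req/sec on a 4 region server cluster gives each server 25 req/sec for a user or namespace quota, while a server hosting 3 of a table's 10 regions gets 30 req/sec of a 100 req/sec table quota.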
7516
7517 ---
7518
7519 * [HBASE-21057](https://issues.apache.org/jira/browse/HBASE-21057) | *Minor* | **upgrade to latest spotbugs**
7520
7521 Change spotbugs version to 3.1.11.
7522
7523
7524 ---
7525
7526 * [HBASE-21505](https://issues.apache.org/jira/browse/HBASE-21505) | *Major* | **Several inconsistencies on information reported for Replication Sources by hbase shell status 'replication' command.**
7527
7528 This modifies "status 'replication'" output, fixing inconsistencies in the reported times and ages of last shipped edits, as well as the wrong calculation of replication lags.
7529
7530 It also introduces additional info for each recovery queue, which was not accounted for by this command before.
7531
7532 The new output for the "status 'replication'" command is explained in detail below:
7533 a) Source started, target stopped, no edits arrived on source yet:
7534 ...
7535  SOURCE: PeerID=1
7536          Normal Queue: 1
7537            No Ops shipped since last restart, SizeOfLogQueue=1, No edits for this source since it started, Replication Lag=0
7538 ...
7539 b) Source started, target stopped, add edit on source:
7540 ...
7541 Normal Queue: 1
7542            No Ops shipped since last restart, SizeOfLogQueue=1, TimeStampOfLastArrivedInSource=Wed Nov 21 07:21:00 GMT 2018, Replication Lag=2459
7543 ...
7544 c) Source started, target stopped, edit added on source, restart source:
7545 ...
7546 SOURCE: PeerID=1
7547          Normal Queue: 1
7548            No Ops shipped since last restart, SizeOfLogQueue=1, No edits for this source since it started, Replication Lag=0
7549          Recovered Queue: 1-hbase01.home,16020,1542784524057
7550            No Ops shipped since last restart, SizeOfLogQueue=1, TimeStampOfLastArrivedInSource=Wed Nov 21 07:23:00 GMT 2018, Replication Lag=201495
7551 ...
7552 d) Source started, target stopped, add edit on source, restart source, add another edit on source:
7553 ...
7554 SOURCE: PeerID=1
7555          Normal Queue: 1
7556            No Ops shipped since last restart, SizeOfLogQueue=1, TimeStampOfLastArrivedInSource=Wed Nov 21 07:02:28 GMT 2018, Replication Lag=6349
7557          Recovered Queue: 1-hbase01.home,16020,1542782758742
7558            No Ops shipped since last restart, SizeOfLogQueue=0, TimeStampOfLastArrivedInSource=Wed Nov 21 06:53:05 GMT 2018, Replication Lag=569394
7559 ...
7560 e) Source started, target stopped, add edit on source, restart source, add another edit on source, start target:
7561 ...
7562        SOURCE: PeerID=1
7563          Normal Queue: 1
7564            AgeOfLastShippedOp=30000, TimeStampOfLastShippedOp=Wed Nov 21 07:07:58 GMT 2018, SizeOfLogQueue=1, TimeStampOfLastArrivedInSource=Wed Nov 21 07:02:28 GMT 2018, Replication Lag=0
7565 ...
7566 f) Source started, target stopped, add edit on source, restart source, restart target:
7567 ...
7568 SOURCE: PeerID=1
7569          Normal Queue: 1
7570            No Ops shipped since last restart, SizeOfLogQueue=1, No edits for this source since it started, Replication Lag=0
7571 ...
7572
7573
7574 ---
7575
7576 * [HBASE-21922](https://issues.apache.org/jira/browse/HBASE-21922) | *Major* | **BloomContext#sanityCheck may failed when use ROWPREFIX\_DELIMITED bloom filter**
7577
7578 Remove bloom filter type ROWPREFIX\_DELIMITED. It may be added back when a better solution is found.
7579
7580
7581 ---
7582
7583 * [HBASE-21783](https://issues.apache.org/jira/browse/HBASE-21783) | *Major* | **Support exceed user/table/ns throttle quota if region server has available quota**
7584
7585 Supports enabling or disabling exceed throttle quota. Exceed throttle quota means that a user can over-consume the user/namespace/table quota if the region server has additional available quota because other users are not consuming it at the same time.
7586 Use the following shell commands to enable/disable exceed throttle quota: enable\_exceed\_throttle\_quota
7587 disable\_exceed\_throttle\_quota
7588 There are two restrictions when enabling exceed throttle quota:
7589 1. Must set at least one read and one write region server throttle quota;
7590 2. All region server throttle quotas must use the seconds time unit. Once previous requests exceed their quota and consume region server quota, a quota in another time unit may take a long time to refill, which may affect later requests.
7591
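The two restrictions can be read as a precondition check. The sketch below is a hypothetical illustration only: the dict-based quota model, the `"sec"` unit label, and the function name are assumptions, not the HBase API.

```python
def can_enable_exceed_throttle_quota(region_server_quotas):
    """Check the two restrictions on a list of region server throttle quotas.

    Each quota is a dict such as {"kind": "read", "time_unit": "sec"}.
    This data model is hypothetical and used only for illustration.
    """
    kinds = {quota["kind"] for quota in region_server_quotas}
    # Restriction 1: at least one read and one write region server quota.
    has_read_and_write = {"read", "write"} <= kinds
    # Restriction 2: every region server quota uses the seconds time unit.
    all_in_seconds = all(q["time_unit"] == "sec" for q in region_server_quotas)
    return has_read_and_write and all_in_seconds
```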
7592
7593 ---
7594
7595 * [HBASE-20587](https://issues.apache.org/jira/browse/HBASE-20587) | *Major* | **Replace Jackson with shaded thirdparty gson**
7596
7597 Remove jackson dependencies from most hbase modules except hbase-rest, use shaded gson instead. The output json will be a bit different since jackson can use getter/setter, but gson will always use the fields.
7598
7599
7600 ---
7601
7602 * [HBASE-21928](https://issues.apache.org/jira/browse/HBASE-21928) | *Major* | **Deprecated HConstants.META\_QOS**
7603
7604 Mark HConstants.META\_QOS as deprecated. It is the highest priority and is for internal use only. You should not try to set a priority greater than or equal to this value; doing so is harmless but also useless.
7605
7606
7607 ---
7608
7609 * [HBASE-17942](https://issues.apache.org/jira/browse/HBASE-17942) | *Major* | **Disable region splits and merges per table**
7610
7611 This patch adds the ability to disable split and/or merge for a table (By default, split and merge are enabled for a table).
7612
7613
7614 ---
7615
7616 * [HBASE-21636](https://issues.apache.org/jira/browse/HBASE-21636) | *Major* | **Enhance the shell scan command to support missing scanner specifications like ReadType, IsolationLevel etc.**
7617
7618 Allows the shell to set Scan options previously not exposed. See the additions in the scan help by typing the following in the hbase shell:
7619
7620 hbase\> help 'scan'
7621
7622
7623 ---
7624
7625 * [HBASE-21201](https://issues.apache.org/jira/browse/HBASE-21201) | *Major* | **Support to run VerifyReplication MR tool without peerid**
7626
7627 We can specify peerQuorumAddress instead of peerId in the VerifyReplication tool, so it no longer requires a peerId to be set up when using this tool.
7628
7629 For example:
7630 hbase org.apache.hadoop.hbase.mapreduce.replication.VerifyReplication zk1,zk2,zk3:2181/hbase testTable
7631
7632
7633 ---
7634
7635 * [HBASE-21838](https://issues.apache.org/jira/browse/HBASE-21838) | *Major* | **Create a special ReplicationEndpoint just for verifying the WAL entries are fine**
7636
7637 Introduce a VerifyWALEntriesReplicationEndpoint which replicates nothing but only verifies if all the cells are valid.
7638 It can be used to catch bugs in writing WALs, as most of the time we will not read the WALs again after writing them if there are no region server crashes.
7639
7640
7641 ---
7642
7643 * [HBASE-21764](https://issues.apache.org/jira/browse/HBASE-21764) | *Major* | **Size of in-memory compaction thread pool should be configurable**
7644
7645 Introduces a new config key: hbase.regionserver.inmemory.compaction.pool.size. The default value is 10. Use it to set the size of the in-memory compaction thread pool. Note that all memstores in one region server share the same pool, so if you have many regions in one region server, you need to set this larger to compact faster for better read performance.
7646
7647
7648 ---
7649
7650 * [HBASE-21684](https://issues.apache.org/jira/browse/HBASE-21684) | *Major* | **Throw DNRIOE when connection or rpc client is closed**
7651
7652 Make StoppedRpcClientException extend DoNotRetryIOException.
7653
7654
7655 ---
7656
7657 * [HBASE-21739](https://issues.apache.org/jira/browse/HBASE-21739) | *Major* | **Move grant/revoke from regionserver to master**
7658
7659 To implement user permission control in Procedure V2, first move the grant and revoke methods from AccessController to the master.
7660 AccessController#grant and AccessController#revoke are marked as deprecated; please use Admin#grant and Admin#revoke instead.
7661
7662
7663 ---
7664
7665 * [HBASE-21791](https://issues.apache.org/jira/browse/HBASE-21791) | *Blocker* | **Upgrade thrift dependency to 0.12.0**
7666
7667 IMPORTANT: Due to security issues, all users who use hbase thrift should avoid using releases which do not have this fix.
7668
7669 The affected releases are:
7670 2.1.x: 2.1.2 and below
7671 2.0.x: 2.0.4 and below
7672 1.x: 1.4.x and below
7673
7674 If you are using the affected releases above, please consider upgrading to a newer release ASAP.
7675
7676
7677 ---
7678
7679 * [HBASE-20894](https://issues.apache.org/jira/browse/HBASE-20894) | *Major* | **Move BucketCache from java serialization to protobuf**
7680
7681 For users who have configured hbase.bucketcache.ioengine with either the file:, files:, or mmap: prefix, and configured it to be persistent via the hbase.bucketcache.persistent.path property, the serialization format of the bucket cache has changed between versions. The old state will not be read during startup, and there is currently no migration path. The impact is expected to be minimal, however, since the cache will rebuild over time as access patterns dictate.
7682
7683
7684
7685 # HBASE  2.3.0 Release Notes
7686
7687 These release notes cover new developer and user-facing incompatibilities, important issues, features, and major improvements.
7688
7689
7690 ---
7691
7692 * [HBASE-24545](https://issues.apache.org/jira/browse/HBASE-24545) | *Major* | **Add backoff to SCP check on WAL split completion**
7693
7694 Adds backoff to the ServerCrashProcedure wait for WAL splitting to complete when there is a large backlog of files to split. (It's possible to avoid SCP blocking while waiting on WALs to split if you use procedure-based splitting: set 'hbase.split.wal.zk.coordinated' to false to enable procedure-based WAL splitting.)
7695
7696
7697 ---
7698
7699 * [HBASE-24524](https://issues.apache.org/jira/browse/HBASE-24524) | *Minor* | **SyncTable logging improvements**
7700
7701 Note that this changes the log level for mismatching row keys: originally these were logged at INFO level, and now they are logged at DEBUG level. This is consistent with the logging of mismatching cells. Also, for missing row keys, it now logs row key values in human-readable format, making it more meaningful for operators troubleshooting mismatches.
7702
7703
7704 ---
7705
7706 * [HBASE-24359](https://issues.apache.org/jira/browse/HBASE-24359) | *Major* | **Optionally ignore edits for deleted CFs for replication.**
7707
7708 Introduces a new config, hbase.replication.drop.on.deleted.columnfamily, which defaults to false. When set to true, replication drops the edits for a column family that has been deleted from both the replication source and target.
7709
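The behaviour described by this config can be sketched as a filter over pending edits. This is a minimal illustration under assumptions, not the replication code: the dict-based edit model and the function name `edits_to_replicate` are hypothetical.

```python
def edits_to_replicate(edits, source_families, target_families, drop_on_deleted=False):
    """Return the edits that would still be shipped.

    Each edit is a dict with a "family" key (hypothetical model). With
    drop_on_deleted=True, edits whose column family exists in neither the
    source nor the target are dropped; otherwise nothing is filtered.
    """
    if not drop_on_deleted:
        return list(edits)
    existing = set(source_families) | set(target_families)
    return [edit for edit in edits if edit["family"] in existing]
```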
7710
7711 ---
7712
7713 * [HBASE-24418](https://issues.apache.org/jira/browse/HBASE-24418) | *Major* | **Consolidate Normalizer implementations**
7714
7715 <!-- markdown -->
7716 This change extends the Normalizer with a handful of new configurations. The configuration points supported are:
7717 * `hbase.normalizer.split.enabled` Whether to split a region as part of normalization. Default: `true`.
7718 * `hbase.normalizer.merge.enabled` Whether to merge a region as part of normalization. Default: `true`.
7719 * `hbase.normalizer.min.region.count` The minimum number of regions in a table to consider it for merge normalization. Default: 3.
7720 * `hbase.normalizer.merge.min_region_age.days` The minimum age for a region to be considered for a merge, in days. Default: 3.
7721 * `hbase.normalizer.merge.min_region_size.mb` The minimum size for a region to be considered for a merge, in whole MBs. Default: 1.
7722
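One way to read the three merge-related thresholds together is as an eligibility filter. The sketch below is illustrative only: the region model, the function name, and treating each minimum as inclusive are assumptions, and the Normalizer's actual logic may differ.

```python
def merge_candidates(regions, min_region_count=3, min_age_days=3, min_size_mb=1):
    """Return names of regions that pass the merge thresholds.

    Each region is a dict with "name", "age_days" and "size_mb" keys
    (hypothetical model). Defaults mirror the documented defaults; treating
    each minimum as inclusive is an assumption.
    """
    # A table with too few regions is not considered for merge normalization.
    if len(regions) < min_region_count:
        return []
    return [
        region["name"]
        for region in regions
        if region["age_days"] >= min_age_days and region["size_mb"] >= min_size_mb
    ]
```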
7723
7724 ---
7725
7726 * [HBASE-24309](https://issues.apache.org/jira/browse/HBASE-24309) | *Major* | **Avoid introducing log4j and slf4j-log4j dependencies for modules other than hbase-assembly**
7727
7728 Add an hbase-logging module and put the log4j-related code in this module only, so other modules do not need to depend on log4j at compile scope. See the comments of Log4jUtils and InternalLog4jUtils for more details.
7729
7730 Add a log4j.properties to the test jar of the hbase-logging module, so other submodules only need to depend on the test jar of the hbase-logging module at test scope to output logs to the console, without placing a log4j.properties in their test resources, since they (almost) all have the same content. This test module will not be included in the assembly tarball, so it will not mess up the binary distribution.
7731
7732 Ban direct commons-logging dependencies, and ban commons-logging and log4j imports in non-test code, to avoid messing up downstream users' logging frameworks. In the hbase-logging module we do need to use log4j classes, and the trick is to use the full class name.
7733
7734 Add jcl-over-slf4j and jul-to-slf4j dependencies. Some of our dependencies use jcl or jul as their logging framework, so we should also redirect their log messages to slf4j.
7735
7736
7737 ---
7738
7739 * [HBASE-21406](https://issues.apache.org/jira/browse/HBASE-21406) | *Minor* | **"status 'replication'" should not show SINK if the cluster does not act as sink**
7740
7741 Added new metric to differentiate sink startup time from last OP applied time.
7742
7743 The original behaviour was to always set the startup time to TimestampsOfLastAppliedOp, and always show it in the "status 'replication'" command output, regardless of whether the sink ever applied any OP.
7744
7745 This was confusing, especially for scenarios where the cluster was just acting as a source: the output could lead to wrong interpretations about the sink not applying edits or replication being stuck.
7746
7747 With the new metric, we now compare the two metric values, assuming that if both are the same, no OP has ever been shipped to the given sink. The output then reflects this more clearly, for example:
7748
7749 SINK: TimeStampStarted=Thu Dec 06 23:59:47 GMT 2018, Waiting for OPs...
7750
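The comparison described in this note can be sketched as follows. This is a simplified illustration, not the shell's implementation: the function name is hypothetical, and timestamps are passed as already-formatted strings for brevity.

```python
def sink_waiting_line(started, last_applied):
    """Return the 'Waiting for OPs...' line when no OP has been applied, else None.

    Assumption: equal values for the startup time and the last-applied-OP
    time mean the sink has never applied an OP, as described in this note.
    """
    if started == last_applied:
        return f"SINK: TimeStampStarted={started}, Waiting for OPs..."
    return None
```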
7751
7752 ---
7753
7754 * [HBASE-24132](https://issues.apache.org/jira/browse/HBASE-24132) | *Major* | **Upgrade to Apache ZooKeeper 3.5.7**
7755
7756 <!-- markdown -->
7757 HBase now ships ZooKeeper 3.5.x; previously it shipped the EOL'd 3.4.x. A 3.5.x client can talk to a 3.4.x ensemble.
7758
7759 The ZooKeeper project has built a [FAQ](https://cwiki.apache.org/confluence/display/ZOOKEEPER/Upgrade+FAQ) that documents known issues and work-arounds when upgrading existing deployments.
7760
7761
7762 ---
7763
7764 * [HBASE-22287](https://issues.apache.org/jira/browse/HBASE-22287) | *Major* | **inifinite retries on failed server in RSProcedureDispatcher**
7765
7766 Add backoff. Avoid retrying every 100ms.
7767
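This note does not state the backoff policy used. Purely as an illustration of replacing a fixed 100ms retry interval with growing delays, the sketch below assumes exponential doubling with a cap; the function name, the doubling factor, and the cap value are all assumptions.

```python
def backoff_delays_ms(attempts, base_ms=100, max_ms=10_000):
    """Return the wait before each retry: doubling from base_ms, capped at max_ms.

    Exponential doubling and the 10 second cap are assumptions for illustration.
    """
    return [min(base_ms * (2 ** attempt), max_ms) for attempt in range(attempts)]
```

With these assumed values, five retries wait 100, 200, 400, 800, and 1600 ms instead of 100 ms each time.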
7768
7769 ---
7770
7771 * [HBASE-24425](https://issues.apache.org/jira/browse/HBASE-24425) | *Major* | **Run hbck\_chore\_run and catalogjanitor\_run on draw of 'HBCK Report' page**
7772
7773 Runs 'catalogjanitor\_run' and 'hbck\_chore\_run' inline with the loading of the 'HBCK Report' page.
7774
7775 Pass '?cache=true' to skip the inline invocation of 'catalogjanitor\_run' and 'hbck\_chore\_run' when drawing the page.
7776
7777
7778 ---
7779
7780 * [HBASE-24408](https://issues.apache.org/jira/browse/HBASE-24408) | *Blocker* | **Introduce a general 'local region' to store data on master**
7781
7782 Introduced a general 'local region' at master side to store the procedure data, etc.
7783
7784 The hfile of this region will be stored on the root fs while the wal will be stored on the wal fs. This issue supersedes part of the code for HBASE-23326, as now we store the data in 'MasterData' directory instead of 'MasterProcs'.
7785
7786 The old hfiles will be moved to the global hfile archived directory with the suffix $-masterlocalhfile-$. The wal files will be moved to the global old wal directory with the suffix $masterlocalwal$. The TimeToLiveMasterLocalStoreHFileCleaner and TimeToLiveMasterLocalStoreWALCleaner are configured by default for cleaning the old hfiles and wal files, and the default TTLs are both 7 days.
7787
7788
7789 ---
7790
7791 * [HBASE-24115](https://issues.apache.org/jira/browse/HBASE-24115) | *Major* | **Relocate test-only REST "client" from src/ to test/ and mark Private**
7792
7793 Relocate test-only REST RemoteHTable and RemoteAdmin from src/ to test/. And mark them as InterfaceAudience.Private.
7794
7795
7796 ---
7797
7798 * [HBASE-23938](https://issues.apache.org/jira/browse/HBASE-23938) | *Major* | **Replicate slow/large RPC calls to HDFS**
7799
7800 Config key: hbase.regionserver.slowlog.systable.enabled
7801 Default value: false
7802
7803 This config can be enabled if hbase.regionserver.slowlog.buffer.enabled is already enabled. While hbase.regionserver.slowlog.buffer.enabled ensures that any slow/large RPC logs with complete details are written to the ring buffer available at each RegionServer, hbase.regionserver.slowlog.systable.enabled ensures that all such logs are also persisted in the new system table hbase:slowlog.
7804 Operators can scan hbase:slowlog with filters to retrieve records matching specific attributes. This table is useful for capturing the historical performance of slow RPC calls for detailed analysis.
7805
7806 hbase:slowlog consists of a single ColumnFamily, info. info consists of multiple qualifiers similar to the attributes available to query as part of the Admin API: get\_slowlog\_responses.
7807
7808 One example of a row from an hbase:slowlog scan result (a sample screenshot is attached to the Jira):
7809
7810  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:call\_details, timestamp=2020-05-16T14:59:58.764Z, value=Scan(org.apache.hadoop.hbase.shaded.protobuf.generated.ClientProtos$ScanRequest)
7811  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:client\_address, timestamp=2020-05-16T14:59:58.764Z, value=172.20.10.2:57348
7812  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:method\_name, timestamp=2020-05-16T14:59:58.764Z, value=Scan
7813  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:param, timestamp=2020-05-16T14:59:58.764Z, value=region { type: REGION\_NAME value: "cluster\_test,cccccccc,1589635796466.aa45e1571d533f5ed0bb31cdccaaf9cf." } scan { a
7814                                                              ttribute { name: "\_isolationlevel\_" value: "\\x5C000" } start\_row: "cccccccc" time\_range { from: 0 to: 9223372036854775807 } max\_versions: 1 cache\_blocks: true max\_result\_size: 2
7815                                                              097152 caching: 2147483647 include\_stop\_row: false } number\_of\_rows: 2147483647 close\_scanner: false client\_handles\_partials: true client\_handles\_heartbeats: true track\_scan\_met
7816                                                              rics: false
7817  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:processing\_time, timestamp=2020-05-16T14:59:58.764Z, value=24
7818  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:queue\_time, timestamp=2020-05-16T14:59:58.764Z, value=0
7819  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:region\_name, timestamp=2020-05-16T14:59:58.764Z, value=cluster\_test,cccccccc,1589635796466.aa45e1571d533f5ed0bb31cdccaaf9cf.
7820  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:response\_size, timestamp=2020-05-16T14:59:58.764Z, value=211227
7821  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:server\_class, timestamp=2020-05-16T14:59:58.764Z, value=HRegionServer
7822  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:start\_time, timestamp=2020-05-16T14:59:58.764Z, value=1589640743932
7823  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:type, timestamp=2020-05-16T14:59:58.764Z, value=ALL
7824  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:username, timestamp=2020-05-16T14:59:58.764Z, value=vjasani
7825
7826
7827 ---
7828
7829 * [HBASE-24271](https://issues.apache.org/jira/browse/HBASE-24271) | *Major* | **Set values in \`conf/hbase-site.xml\` that enable running on \`LocalFileSystem\` out of the box**
7830
7831 <!-- markdown -->
7832 HBASE-24271 makes changes to the default `conf/hbase-site.xml` such that `bin/hbase` will run directly out of the binary tarball or a compiled source tree without any configuration modifications vs. Hadoop 2.8+. This changes our long-standing history of shipping no configured values in `conf/hbase-site.xml`, so existing processes that assume this file is empty of configuration properties may require attention.
7833
7834
7835 ---
7836
7837 * [HBASE-24310](https://issues.apache.org/jira/browse/HBASE-24310) | *Major* | **Use Slf4jRequestLog for hbase-http**
7838
7839 Use Slf4jRequestLog instead of the log4j HttpRequestLogAppender in HttpServer.
7840
7841 The request log is disabled by default in conf/log4j.properties by the following lines:
7842
    # Disable request log by default, you can enable this by changing the appender
    log4j.category.http.requests=INFO,NullAppender
    log4j.additivity.http.requests=false
7846
To enable the request log, change 'NullAppender' to the appender of your choice.

Note that the logger name for the master status HTTP server is 'http.requests.master', and for the region server it is 'http.requests.regionserver'.
7850
7851
7852 ---
7853
7854 * [HBASE-24335](https://issues.apache.org/jira/browse/HBASE-24335) | *Major* | **Support deleteall with ts but without column in shell mode**
7855
Use an empty string to represent no column specified for deleteall in shell mode.
usage:
7858 deleteall 'test','r1','',12345
7859 deleteall 'test', {ROWPREFIXFILTER =\> 'prefix'}, '', 12345
7860
7861
7862 ---
7863
7864 * [HBASE-24304](https://issues.apache.org/jira/browse/HBASE-24304) | *Major* | **Separate a hbase-asyncfs module**
7865
7866 Added a new hbase-asyncfs module to hold the asynchronous dfs output stream implementation for implementing WAL.
7867
7868
7869 ---
7870
7871 * [HBASE-22710](https://issues.apache.org/jira/browse/HBASE-22710) | *Major* | **Wrong result in one case of scan that use  raw and versions and filter together**
7872
Makes the version-selection logic for raw scans more reasonable, to avoid losing results when a filter is used.
7874
7875
7876 ---
7877
7878 * [HBASE-24285](https://issues.apache.org/jira/browse/HBASE-24285) | *Major* | **Move to hbase-thirdparty-3.3.0**
7879
7880 Moved to hbase-thirdparty 3.3.0.
7881
7882
7883 ---
7884
7885 * [HBASE-24252](https://issues.apache.org/jira/browse/HBASE-24252) | *Major* | **Implement proxyuser/doAs mechanism for hbase-http**
7886
This feature enables the HBase Web UIs to accept a 'proxyuser' via the HTTP Request's query string. When the parameter \`hbase.security.authentication.spnego.kerberos.proxyuser.enable\` is set to \`true\` in hbase-site.xml (default is \`false\`), the HBase UI will attempt to impersonate the user specified by the query parameter "doAs". This query parameter is checked case-insensitively. When this option is not provided, the user who executed the request is the "real" user and there is no ability to execute impersonation against the WebUI.
7888
7889 For example, if the user "bob" with Kerberos credentials executes a request against the WebUI with this feature enabled and a query string which includes \`doAs=alice\`, the HBase UI will treat this request as executed as \`alice\`, not \`bob\`.
7890
7891 The standard Hadoop proxyuser configuration properties to limit users who may impersonate others apply to this change (e.g. to enable \`bob\` to impersonate \`alice\`). See the Hadoop documentation for more information on how to configure these proxyuser rules.
7892
7893
7894 ---
7895
7896 * [HBASE-24143](https://issues.apache.org/jira/browse/HBASE-24143) | *Major* | **[JDK11] Switch default garbage collector from CMS**
7897
7898 <!-- markdown -->
7899 `bin/hbase` will now dynamically select a Garbage Collector implementation based on the detected JVM version. JDKs 8,9,10 use `-XX:+UseConcMarkSweepGC`, while JDK11+ use `-XX:+UseG1GC`.
7900
7901 Notice a slight compatibility change. Previously, the garbage collector choice would always be appended to a user-provided value for `HBASE_OPTS`. As of this change, this setting will only be applied when `HBASE_OPTS` is unset. That means that operators who provide a value for this variable will now need to also specify the collector. This is especially important for those on JDK8, where the vm default GC is not the recommended ConcMarkSweep.
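
A minimal Java sketch of the selection rule described above, for illustration only. The real logic lives in the `bin/hbase` shell script; the class and method names here are hypothetical, and an unset `HBASE_OPTS` is modeled as `null`.

```java
// Illustrative sketch only; not the bin/hbase implementation.
class GcFlagSketch {
  /** Returns the collector flag named in the note for a given Java major version. */
  static String gcFlagFor(int javaMajorVersion) {
    return javaMajorVersion >= 11 ? "-XX:+UseG1GC" : "-XX:+UseConcMarkSweepGC";
  }

  /** The default collector is applied only when HBASE_OPTS is unset (null here). */
  static String effectiveOpts(String hbaseOpts, int javaMajorVersion) {
    if (hbaseOpts == null) {
      return gcFlagFor(javaMajorVersion);
    }
    // A user-provided value is used as-is; no collector flag is appended.
    return hbaseOpts;
  }

  public static void main(String[] args) {
    System.out.println(effectiveOpts(null, 8));
    System.out.println(effectiveOpts(null, 11));
    System.out.println(effectiveOpts("-Xmx4g", 8));
  }
}
```

The last call corresponds to operators who set `HBASE_OPTS` themselves: no collector flag is added, so it has to be part of the value they provide.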
7902
7903
7904 ---
7905
7906 * [HBASE-24024](https://issues.apache.org/jira/browse/HBASE-24024) | *Major* | **Optionally reject multi() requests with very high no of rows**
7907
7908 New Config: hbase.rpc.rows.size.threshold.reject
7909 -----------------------------------------------------------------------
7910
7911 Default value: false
7912 Description:
If true, the RegionServer rejects Put/Delete batch requests whose number of rows in a batch operation exceeds the threshold defined by the config hbase.rpc.rows.warning.threshold.
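
The decision can be sketched roughly as follows. This is an illustration under assumptions, not the RegionServer code: the names are hypothetical, the threshold value is an example, and it assumes that an oversized batch is only logged as a warning when the new config is false, as the name of hbase.rpc.rows.warning.threshold suggests.

```java
// Illustrative sketch only; names and the WARN fallback are assumptions.
class RowThresholdSketch {
  enum Outcome { ACCEPT, WARN, REJECT }

  /** Classifies a batch by its row count against the warning threshold. */
  static Outcome check(int rowsInBatch, int warningThreshold, boolean rejectEnabled) {
    if (rowsInBatch <= warningThreshold) {
      return Outcome.ACCEPT;
    }
    // Over the threshold: reject only if hbase.rpc.rows.size.threshold.reject is true.
    return rejectEnabled ? Outcome.REJECT : Outcome.WARN;
  }

  public static void main(String[] args) {
    int threshold = 5000; // example value only
    System.out.println(check(100, threshold, true));
    System.out.println(check(6000, threshold, false));
    System.out.println(check(6000, threshold, true));
  }
}
```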
7914
7915
7916 ---
7917
7918 * [HBASE-24139](https://issues.apache.org/jira/browse/HBASE-24139) | *Critical* | **Balancer should avoid leaving idle region servers**
7919
7920 StochasticLoadBalancer functional improvement:
7921
StochasticLoadBalancer now rebalances the cluster if there are any idle RegionServers in the cluster (RegionServers hosting no regions) while other RegionServers host at least 1 region.
7923
7924
7925 ---
7926
7927 * [HBASE-24196](https://issues.apache.org/jira/browse/HBASE-24196) | *Major* | **[Shell] Add rename rsgroup command in hbase shell**
7928
A user or admin can now use
hbase shell \> rename\_rsgroup 'oldname', 'newname'
to rename an rsgroup.
7932
7933
7934 ---
7935
7936 * [HBASE-24218](https://issues.apache.org/jira/browse/HBASE-24218) | *Major* | **Add hadoop 3.2.x in hadoop check**
7937
Adds hadoop-3.2.0 and hadoop-3.2.1 to the hadoop check; with '--quick-hadoopcheck', only hadoop-3.2.1 is checked.

Note that, to keep the personality scripts aligned across all the active branches, the patch is committed to all active branches, but the hadoop-3.2.x support in hadoopcheck is only applied to branch-2.2+.
7941
7942
7943 ---
7944
7945 * [HBASE-23829](https://issues.apache.org/jira/browse/HBASE-23829) | *Major* | **Get \`-PrunSmallTests\` passing on JDK11**
7946
\`-PrunSmallTests\` now passes on JDK11 when using \`-Phadoop.profile=3.0\`.
7948
7949
7950 ---
7951
7952 * [HBASE-24185](https://issues.apache.org/jira/browse/HBASE-24185) | *Major* | **Junit tests do not behave well with System.exit or Runtime.halt or JVM exits in general.**
7953
7954 Tests that fail because a process -- RegionServer or Master -- called System.exit, will now instead throw an exception.
7955
7956
7957 ---
7958
7959 * [HBASE-24072](https://issues.apache.org/jira/browse/HBASE-24072) | *Major* | **Nightlies reporting OutOfMemoryError: unable to create new native thread**
7960
7961 Hadoop hosts have had their ulimit -u raised from 10000 to 30000 (per user, by INFRA). The Docker build container has had its limit raised from 10000 to 12500.
7962
7963
7964 ---
7965
7966 * [HBASE-24112](https://issues.apache.org/jira/browse/HBASE-24112) | *Major* | **[RSGroup] Support renaming rsgroup**
7967
7968 Support RSGroup renaming in core codebase. New API Admin#renameRSGroup(String, String) is introduced in 3.0.0.
7969
7970
7971 ---
7972
7973 * [HBASE-23994](https://issues.apache.org/jira/browse/HBASE-23994) | *Trivial* | ** Add WebUI to Canary**
7974
7975 <!-- markdown -->
7976 The Canary tool now offers a WebUI when run in `region` mode (the default mode). It is enabled by default, and by default, it binds to `0.0.0.0:16050`. This can be overridden by setting `hbase.canary.info.bindAddress` and `hbase.canary.info.port`. To disable entirely, set the port to `-1`.
7977
7978
7979 ---
7980
7981 * [HBASE-23779](https://issues.apache.org/jira/browse/HBASE-23779) | *Major* | **Up the default fork count to make builds complete faster; make count relative to CPU count**
7982
Pass --threads=2 building on jenkins. It shortens nightly build times by about 25%.

It works by running module build/test in parallel when dependencies allow. Upping the forkcount beyond the pom default of 0.25C would have us breach our CPU budget on jenkins when two modules are running in parallel (2 modules at 0.25C each makes 0.5C and on jenkins, hadoop nodes run two jenkins executors per host).  Higher forkcounts also seem to threaten build stability.
7986
7987 For running tests locally, to go faster, up fork count.
7988
7989 $ x="0.5C"  ;  mvn --threads=2  -Dsurefire.firstPartForkCount=$x -Dsurefire.secondPartForkCount=$x test -PrunAllTests
7990
7991 You could up the x from 0.5C to 1.0C but YMMV (On overcommitted hardware, tests start bombing out pretty soon after startup). You could try upping thread count but on occasion are likely to overcommit hardware.
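
The CPU budget arithmetic above can be sketched in Java as follows. This is only an illustration: the class and method names are hypothetical, and truncating the product to a whole number of forks (with a floor of one) is an assumption of the sketch, not a statement about how Surefire rounds.

```java
// Illustrative sketch only; the rounding rule is an assumption.
class ForkBudgetSketch {
  /** Forks for one module, given an "xC" multiplier and the number of cores. */
  static int forksPerModule(double multiplier, int cores) {
    return Math.max(1, (int) (multiplier * cores));
  }

  /** Total forks when several modules build in parallel (mvn --threads=N). */
  static int totalForks(double multiplier, int cores, int parallelModules) {
    return forksPerModule(multiplier, cores) * parallelModules;
  }

  public static void main(String[] args) {
    int cores = 16; // example host
    // Pom default 0.25C with two modules in parallel: half the cores are busy.
    System.out.println(totalForks(0.25, cores, 2));
    // 0.5C with two modules in parallel: every core is busy.
    System.out.println(totalForks(0.5, cores, 2));
  }
}
```

On a 16-core host this gives 8 forks for 0.25C and 16 forks for 0.5C with two modules in parallel, which is why going past 0.5C locally tends to overcommit the hardware.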
7992
7993
7994 ---
7995
7996 * [HBASE-24126](https://issues.apache.org/jira/browse/HBASE-24126) | *Major* | **Up the container nproc uplimit from 10000 to 12500**
7997
7998 Start docker with upped ulimit for nproc passing '--ulimit nproc=12500'. It was 10000, the default, but made it 12500. Then, set PROC\_LIMIT in hbase-personality so when yetus runs, it is w/ the new 12500 value.
7999
8000
8001 ---
8002
8003 * [HBASE-24150](https://issues.apache.org/jira/browse/HBASE-24150) | *Major* | **Allow module tests run in parallel**
8004
8005 Pass -T2 to mvn. Makes it so we do two modules-at-a-time dependencies willing. Helps speed build and testing. Doubles the resource usage when running modules in parallel.
8006
8007
8008 ---
8009
8010 * [HBASE-24121](https://issues.apache.org/jira/browse/HBASE-24121) | *Major* | **[Authorization] ServiceAuthorizationManager isn't dynamically updatable. And it should be.**
8011
Master and RegionServer now support refreshing the authorization policy defined in hbase-policy.xml without restarting the service. To refresh the policy, execute the hbase shell command update\_config or update\_config\_all after the policy file has been updated and synced on all nodes.
8013
8014
8015 ---
8016
8017 * [HBASE-24099](https://issues.apache.org/jira/browse/HBASE-24099) | *Major* | **Use a fair ReentrantReadWriteLock for the region close lock**
8018
8019 This change modifies the default acquisition policy for the region's close lock in order to prevent observed starvation of close requests. The new boolean configuration parameter 'hbase.regionserver.fair.region.close.lock' controls the lock acquisition policy: if true, the lock is created in fair mode (default); if false, the lock is created in nonfair mode (the old default).
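
For readers unfamiliar with the two modes, here is a minimal sketch using the JDK's `ReentrantReadWriteLock`, whose constructor takes the fairness flag. The class and method names are hypothetical and the code only shows how such a flag maps onto the lock; it is not the region close lock code itself.

```java
import java.util.concurrent.locks.ReentrantReadWriteLock;

// Illustrative sketch only; not the HRegion implementation.
class CloseLockSketch {
  /** Creates a read/write lock in fair or nonfair mode, as a boolean config would. */
  static ReentrantReadWriteLock newCloseLock(boolean fair) {
    return new ReentrantReadWriteLock(fair);
  }

  public static void main(String[] args) {
    // true models the new default; false models the old nonfair behavior.
    System.out.println(newCloseLock(true).isFair());
    System.out.println(newCloseLock(false).isFair());
  }
}
```

In fair mode, waiting threads acquire the lock in approximately arrival order, so a waiting writer (such as a close request) is not starved by a steady stream of readers.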
8020
8021
8022 ---
8023
8024 * [HBASE-23153](https://issues.apache.org/jira/browse/HBASE-23153) | *Major* | **PrimaryRegionCountSkewCostFunction SLB function should implement CostFunction#isNeeded**
8025
8026 <!-- markdown -->
8027 The `PrimaryRegionCountSkewCostFunction` for the `StochasticLoadBalancer` is only needed when the read replicas feature is enabled. With this change, that function now properly indicates that it is not needed when the read replica feature is off.
8028
8029 If this improvement is not available, operators with clusters that are not using the read replica feature should manually disable it by setting `hbase.master.balancer.stochastic.primaryRegionCountCost` to `0.0` in hbase-site.xml for all HBase Masters.
8030
8031
8032 ---
8033
8034 * [HBASE-24055](https://issues.apache.org/jira/browse/HBASE-24055) | *Major* | **Make AsyncFSWAL can run on EC cluster**
8035
AsyncFSWAL can now also be used against a directory which has EC enabled. Make sure you also use the hadoop 3.x client, as the option is only available in hadoop 3.x.
8037
8038
8039 ---
8040
8041 * [HBASE-24113](https://issues.apache.org/jira/browse/HBASE-24113) | *Major* | **Upgrade the maven we use from 3.5.4 to 3.6.3 in nightlies**
8042
Branches-2.3+ use maven 3.6.3 building. Older branches use 3.5.4 still.
8044
8045
8046 ---
8047
8048 * [HBASE-24122](https://issues.apache.org/jira/browse/HBASE-24122) | *Major* | **Change machine ulimit-l to ulimit-a so dumps full ulimit rather than just 'max locked memory'**
8049
8050 Our 'Build Artifacts' have a machine directory under which we emit vitals on the host the build was run on. We used to emit the result of 'ulimit -l' as a file named 'ulimit-l'. This has been hijacked to instead emit result of running 'ulimit -a' which includes stat on ulimit -l.
8051
8052
8053 ---
8054
8055 * [HBASE-23678](https://issues.apache.org/jira/browse/HBASE-23678) | *Major* | **Literate builder API for version management in schema**
8056
8057 ColumnFamilyDescriptor new builder API:
8058
8059     /\*\*
8060      \* Retain all versions for a given TTL(retentionInterval), and then only a specific number
8061      \* of versions(versionAfterInterval) after that interval elapses.
8062      \*
8063      \* @param retentionInterval Retain all versions for this interval
8064      \* @param versionAfterInterval Retain no of versions to retain after retentionInterval
8065      \*/
8066     public ModifyableColumnFamilyDescriptor setVersionsWithTimeToLive(
8067         final int retentionInterval, final int versionAfterInterval)
8068
8069
8070 ---
8071
8072 * [HBASE-24050](https://issues.apache.org/jira/browse/HBASE-24050) | *Major* | **Deprecated PBType on all 2.x branches**
8073
org.apache.hadoop.hbase.types.PBType is marked as deprecated without any replacement. It will be moved to hbase-example module and marked as IA.Private in 3.0.0. Exposing it was a mistake, as it should not be part of our public API. Users who depend on this class should just copy the code into their own code base.
8075
8076
8077 ---
8078
8079 * [HBASE-8868](https://issues.apache.org/jira/browse/HBASE-8868) | *Minor* | **add metric to report client shortcircuit reads**
8080
8081 Expose file system level read metrics for RegionServer.
8082
8083 If the HBase RS runs on top of HDFS, calculate the aggregation of
8084 ReadStatistics of each HdfsFileInputStream. These metrics include:
8085 (1) total number of bytes read from HDFS.
8086 (2) total number of bytes read from local DataNode.
8087 (3) total number of bytes read locally through short-circuit read.
8088 (4) total number of bytes read locally through zero-copy read.
8089
8090 Because HDFS ReadStatistics is calculated per input stream, it is not
8091 feasible to update the aggregated number in real time. Instead, the
8092 metrics are updated when an input stream is closed.
8093
8094
8095 ---
8096
8097 * [HBASE-24032](https://issues.apache.org/jira/browse/HBASE-24032) | *Major* | **[RSGroup] Assign created tables to respective rsgroup automatically instead of manual operations**
8098
Admins can determine which tables go to which rsgroup with a script on the Master side (set hbase.rsgroup.table.mapping.script to a local filesystem path), which aims to lighten the burden of admin operations.  Note that since HBase 3+, the rsgroup can be specified in the TableDescriptor as well; if clients specify this, the master will skip the determination from the script.

Here is a simple example script:
{code}
#!/bin/bash
# Input consists of two strings: the 1st is the namespace of the table, the 2nd is the table name
namespace=$1
tablename=$2
if [[ $namespace == test ]]; then
  echo test
elif [[ $tablename == \*foo\* ]]; then
  echo other
else
  echo default
fi
{code}
8115
8116
8117 ---
8118
8119 * [HBASE-23993](https://issues.apache.org/jira/browse/HBASE-23993) | *Major* | **Use loopback for zk standalone server in minizkcluster**
8120
8121 MiniZKCluster now puts up its standalone node listening on loopback/127.0.0.1 rather than "localhost".
8122
8123
8124 ---
8125
8126 * [HBASE-23986](https://issues.apache.org/jira/browse/HBASE-23986) | *Major* | **Bump hadoop-two.version to 2.10.0 on master and branch-2**
8127
8128 Bumped hadoop-two.version to 2.10.0, which means we will drop the support for hadoop-2.8.x and hadoop-2.9.x.
8129
8130
8131 ---
8132
8133 * [HBASE-23930](https://issues.apache.org/jira/browse/HBASE-23930) | *Minor* | **Shell should attempt to format \`timestamp\` attributes as ISO-8601**
8134
8135 Change timestamp display to be ISO8601 when toString on Cell and outputting in shell....
8136
8137 User used to see....
8138
8139   column=table:state, timestamp=1583967620343 .....
8140
8141 ... but now sees:
8142
8143   column=table:state, timestamp=2020-03-11T23:00:20.343Z ....
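
The conversion from the old epoch-millisecond value to the new display can be reproduced with the JDK's `java.time.Instant`. This is a sketch for checking values by hand, not necessarily how Cell#toString or the shell format the timestamp; the class and method names are hypothetical.

```java
import java.time.Instant;

// Illustrative sketch only; not the HBase formatting code.
class TimestampFormatSketch {
  /** Formats epoch milliseconds as an ISO-8601 instant in UTC. */
  static String toIso8601(long epochMillis) {
    return Instant.ofEpochMilli(epochMillis).toString();
  }

  public static void main(String[] args) {
    // The timestamp from the example above.
    System.out.println(toIso8601(1583967620343L));
  }
}
```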
8144
8145
8146 ---
8147
8148 * [HBASE-22827](https://issues.apache.org/jira/browse/HBASE-22827) | *Major* | **Expose multi-region merge in shell and Admin API**
8149
merge\_region shell command can now be used to merge more than 2 regions as well. It takes a list of regions as comma separated values or as an array of regions, and not just 2 regions. Both full regionnames and encoded regionnames continue to be accepted.
8151
8152
8153 ---
8154
8155 * [HBASE-23767](https://issues.apache.org/jira/browse/HBASE-23767) | *Major* | **Add JDK11 compilation and unit test support to Github precommit**
8156
8157 Rebuild our Dockerfile with support for multiple JDK versions. Use multiple stages in the Jenkinsfile instead of yetus's multijdk because of YETUS-953. Run those multiple stages in parallel to speed up results.
8158
8159 Note that multiple stages means multiple Yetus invocations means multiple comments on the PreCommit. This should become more obvious to users once we can make use of GitHub Checks API, HBASE-23902.
8160
8161
8162 ---
8163
8164 * [HBASE-22978](https://issues.apache.org/jira/browse/HBASE-22978) | *Minor* | **Online slow response log**
8165
8166 get\_slowlog\_responses and clear\_slowlog\_responses are used to retrieve and clear slow RPC logs from RingBuffer maintained by RegionServers.
8167
8168 New Admin APIs:
8169 1.   List\<SlowLogRecord\> getSlowLogResponses(final Set\<ServerName\> serverNames,
8170       final SlowLogQueryFilter slowLogQueryFilter) throws IOException;
8171
8172 2.   List\<Boolean\> clearSlowLogResponses(final Set\<ServerName\> serverNames)
8173       throws IOException;
8174
8175 Configs:
8176
8177 1. hbase.regionserver.slowlog.ringbuffer.size:
8178 Default size of ringbuffer to be maintained by each RegionServer in order to store online slowlog responses. This is an in-memory ring buffer of requests that were judged to be too slow in addition to the responseTooSlow logging. The in-memory representation would be complete. For more details, please look into Doc Section: Get Slow Response Log from shell
8179
8180 Default
8181 256
8182
8183 2. hbase.regionserver.slowlog.buffer.enabled:
Indicates whether RegionServers have a ring buffer running for storing online slow logs in FIFO manner with limited entries. The size of the ring buffer is indicated by config: hbase.regionserver.slowlog.ringbuffer.size. The default value is false; turn this on to get the latest slowlog responses with complete data.
8185
8186 Default
8187 false
8188
8189
8190 For more details, please look into "Get Slow Response Log from shell" section from HBase book.
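
A minimal sketch of a bounded FIFO buffer like the one described above, assuming the oldest entry is evicted once the configured size is reached. It is not the RegionServer implementation; the class name, the `String` payload and the small capacity are made up for the example.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.List;

// Illustrative sketch only; not the RegionServer slow log buffer.
class SlowLogBufferSketch<T> {
  private final int capacity;
  private final ArrayDeque<T> entries = new ArrayDeque<>();

  SlowLogBufferSketch(int capacity) {
    if (capacity <= 0) {
      throw new IllegalArgumentException("capacity must be positive");
    }
    this.capacity = capacity;
  }

  /** Adds an entry, dropping the oldest one when the buffer is full. */
  synchronized void add(T entry) {
    if (entries.size() == capacity) {
      entries.pollFirst();
    }
    entries.addLast(entry);
  }

  /** Returns a copy of the current entries, oldest first. */
  synchronized List<T> snapshot() {
    return new ArrayList<>(entries);
  }

  /** Empties the buffer, similar in spirit to clear_slowlog_responses. */
  synchronized void clear() {
    entries.clear();
  }

  public static void main(String[] args) {
    SlowLogBufferSketch<String> buffer = new SlowLogBufferSketch<>(3);
    for (int i = 1; i <= 5; i++) {
      buffer.add("rpc-" + i);
    }
    System.out.println(buffer.snapshot()); // only the three newest entries remain
  }
}
```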
8191
8192
8193 ---
8194
8195 * [HBASE-23926](https://issues.apache.org/jira/browse/HBASE-23926) | *Major* | **[Flakey Tests] Down the flakies re-run ferocity; it makes for too many fails.**
8196
Down the flakey re-run fork count from 1.0C -- i.e. a fork per CPU -- to 0.25C. On a recent run, the machine had 16 cores. At 0.25C that is 4 forks. We'd hardcoded fork count at 3 previous to changes made by parent.
8198
8199
8200 ---
8201
8202 * [HBASE-23146](https://issues.apache.org/jira/browse/HBASE-23146) | *Major* | **Support CheckAndMutate with multiple conditions**
8203
8204 Add a checkAndMutate(row, filter) method in the AsyncTable interface and the Table interface.
8205
8206 This method atomically checks if the row matches the specified filter. If it does, it adds the Put/Delete/RowMutations.
8207
This is a fluent-style API; usage looks like:
8209
8210 For Table interface:
8211 {code}
8212 table.checkAndMutate(row, filter).thenPut(put);
8213 {code}
8214
8215 For AsyncTable interface:
8216 {code}
8217 table.checkAndMutate(row, filter).thenPut(put)
8218     .thenAccept(succ -\> {
8219       if (succ) {
8220         System.out.println("Check and put succeeded");
8221       } else {
8222         System.out.println("Check and put failed");
8223       }
8224     });
8225 {code}
8226
8227
8228 ---
8229
8230 * [HBASE-23874](https://issues.apache.org/jira/browse/HBASE-23874) | *Minor* | **Move Jira-attached file precommit definition from script in Jenkins config to dev-support**
8231
8232 The Jira Precommit job (https://builds.apache.org/job/PreCommit-HBASE-Build/) will now look for a file within the source tree (dev-support/jenkins\_precommit\_jira\_yetus.sh) instead of depending on a script section embedded in the job.
8233
8234
8235 ---
8236
8237 * [HBASE-23865](https://issues.apache.org/jira/browse/HBASE-23865) | *Major* | **Up flakey history from 5 to 10**
8238
Changed flakey list reporting to show 10 rather than 5 items. Also changed the second and first part fork counts to be 1C rather than hardcoded 3.
8240
8241
8242 ---
8243
8244 * [HBASE-23554](https://issues.apache.org/jira/browse/HBASE-23554) | *Major* | **Encoded regionname to regionname utility**
8245
Adds shell command regioninfo:
8247
8248       hbase(main):001:0\>  regioninfo '0e6aa5c19ae2b2627649dc7708ce27d0'
8249       {ENCODED =\> 0e6aa5c19ae2b2627649dc7708ce27d0, NAME =\> 'TestTable,,1575941375972.0e6aa5c19ae2b2627649dc7708ce27d0.', STARTKEY =\> '', ENDKEY =\> '00000000000000000000299441'}
8250       Took 0.4737 seconds
8251
8252
8253 ---
8254
8255 * [HBASE-23350](https://issues.apache.org/jira/browse/HBASE-23350) | *Major* | **Make compaction files cacheonWrite configurable based on threshold**
8256
This JIRA adds a new configuration - \`hbase.rs.cachecompactedblocksonwrite.threshold\`. This configuration is the maximum total size (in bytes) of the compacted files below which the configuration \`hbase.rs.cachecompactedblocksonwrite\` is honoured. If the total size of the compacted files exceeds this threshold, even when \`hbase.rs.cachecompactedblocksonwrite\` is enabled, the data blocks are not cached. Caching index and bloom blocks is not affected by this configuration (user configuration is always honoured).

Default value of this configuration is Long.MAX\_VALUE. This means the data blocks are cached whatever the total size of the compacted files.
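
The rule for data blocks can be sketched as a single predicate. This is an illustration with hypothetical names, not the compaction code; it treats a total size exactly equal to the threshold as still cacheable, which is an assumption based on the wording "exceeds this threshold".

```java
// Illustrative sketch only; not the HBase compaction code.
class CompactionCacheSketch {
  /** Data blocks are cached only if the feature is on and the total size is within the threshold. */
  static boolean shouldCacheDataBlocks(
      boolean cacheCompactedBlocksOnWrite, long totalCompactedBytes, long thresholdBytes) {
    return cacheCompactedBlocksOnWrite && totalCompactedBytes <= thresholdBytes;
  }

  public static void main(String[] args) {
    long oneGiB = 1024L * 1024L * 1024L;
    // With the default threshold, size never prevents caching.
    System.out.println(shouldCacheDataBlocks(true, 10 * oneGiB, Long.MAX_VALUE));
    // With a 1 GiB threshold, a 10 GiB compaction is not cached.
    System.out.println(shouldCacheDataBlocks(true, 10 * oneGiB, oneGiB));
    // If the feature is off, the threshold is irrelevant.
    System.out.println(shouldCacheDataBlocks(false, 1, oneGiB));
  }
}
```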
8260
8261
8262 ---
8263
8264 * [HBASE-17115](https://issues.apache.org/jira/browse/HBASE-17115) | *Major* | **HMaster/HRegion Info Server does not honour admin.acl**
8265
8266 Implements authorization for the HBase Web UI by limiting access to certain endpoints which could be used to extract sensitive information from HBase.
8267
Access to these restricted endpoints can be limited to a group of administrators, identified either by a list of users (hbase.security.authentication.spnego.admin.users) or by a list of groups (hbase.security.authentication.spnego.admin.groups).  By default, neither of these values is set, which preserves backwards compatibility (allowing all authenticated users to access all endpoints).
8270
8271 Further, users who have sensitive information in the HBase service configuration can set hbase.security.authentication.ui.config.protected to true which will treat the configuration endpoint as a protected, admin-only resource. By default, all authenticated users may access the configuration endpoint.
8272
8273
8274 ---
8275
8276 * [HBASE-23647](https://issues.apache.org/jira/browse/HBASE-23647) | *Major* | **Make MasterRegistry the default registry impl**
8277
8278 <!-- markdown -->
8279 Enables master based registry as the default registry used by clients to fetch connection metadata.
8280 Refer to the section "Master Registry" in the client documentation for more details and advantages
8281 of this implementation over the default Zookeeper based registry.
8282
8283 Configuration parameter that controls the registry in use: `hbase.client.registry.impl`
8284
8285 Where to set this: HBase client configuration (hbase-site.xml)
8286
8287 Possible values:
8288 - `org.apache.hadoop.hbase.client.ZKConnectionRegistry` (For ZK based registry implementation)
8289 - `org.apache.hadoop.hbase.client.MasterRegistry` (New, for master based registry implementation)
8290
8291 Notes on defaults:
8292
8293 - For v3.0.0 and later, MasterRegistry is the default registry
8294 - For all releases in 2.x line, ZK based registry is the default.
8295
8296 This feature has been back ported to 2.3.0 and later releases. MasterRegistry can be enabled by setting the following client configuration.
8297
8298 ```
8299 <property>
8300   <name>hbase.client.registry.impl</name>
8301   <value>org.apache.hadoop.hbase.client.MasterRegistry</value>
8302 </property>
8303 ```
8304
8305
8306 ---
8307
8308 * [HBASE-23069](https://issues.apache.org/jira/browse/HBASE-23069) | *Critical* | **periodic dependency bump for Sep 2019**
8309
8310 caffeine: 2.6.2 =\> 2.8.1
8311 commons-codec: 1.10 =\> 1.13
8312 commons-io: 2.5 =\> 2.6
disruptor: 3.3.6 =\> 3.4.2
8314 httpcore: 4.4.6 =\> 4.4.13
8315 jackson: 2.9.10 =\> 2.10.1
8316 jackson.databind: 2.9.10.1 =\> 2.10.1
8317 jetty: 9.3.27.v20190418 =\> 9.3.28.v20191105
8318 protobuf.plugin: 0.5.0 =\> 0.6.1
8319 zookeeper: 3.4.10 =\> 3.4.14
8320 slf4j: 1.7.25 =\> 1.7.30
8321 rat: 0.12 =\> 0.13
8322 asciidoctor: 1.5.5 =\> 1.5.8
8323 asciidoctor.pdf: 1.5.0-alpha.15 =\> 1.5.0-rc.2
8324 error-prone: 2.3.3 =\> 2.3.4
8325
8326
8327 ---
8328
8329 * [HBASE-23686](https://issues.apache.org/jira/browse/HBASE-23686) | *Major* | **Revert binary incompatible change and remove reflection**
8330
- Reverts a binary incompatible change for ByteRangeUtils
8332 - Usage of reflection inside CommonFSUtils removed
8333
8334
8335 ---
8336
8337 * [HBASE-23055](https://issues.apache.org/jira/browse/HBASE-23055) | *Major* | **Alter hbase:meta**
8338
Adds the ability to edit the hbase:meta table schema. For example,
8340
8341 hbase(main):006:0\> alter 'hbase:meta', {NAME =\> 'info', DATA\_BLOCK\_ENCODING =\> 'ROW\_INDEX\_V1'}
8342 Updating all regions with the new schema...
8343 All regions updated.
8344 Done.
8345 Took 1.2138 seconds
8346
You can even add column families. However, you cannot delete any of the core hbase:meta column families such as 'info' and 'table'.
8348
8349
8350 ---
8351
8352 * [HBASE-23347](https://issues.apache.org/jira/browse/HBASE-23347) | *Major* | **Pluggable RPC authentication**
8353
This change introduces an internal abstraction layer which allows for new SASL-based authentication mechanisms to be used inside HBase services. All existing SASL-based authentication mechanisms were ported to the new abstraction, making no external change in runtime semantics, client API, or RPC serialization format.

Developers familiar with extending HBase can implement authentication mechanisms beyond simple Kerberos and DelegationTokens which authenticate HBase users against some other user database. HBase service authentication (Master to/from RegionServer) continues to operate solely over Kerberos.
8357
8358
8359 ---
8360
8361 * [HBASE-23156](https://issues.apache.org/jira/browse/HBASE-23156) | *Major* | **start-hbase.sh failed with ClassNotFoundException when build with hadoop3**
8362
8363 Introduce a new hbase-assembly/src/main/assembly/hadoop-three-compat.xml for build with hadoop 3.x.
8364
8365
8366 ---
8367
8368 * [HBASE-23680](https://issues.apache.org/jira/browse/HBASE-23680) | *Major* | **RegionProcedureStore missing cleaning of hfile archive**
8369
8370 Add a new config to hbase-default.xml
8371
8372   \<property\>
8373     \<name\>hbase.procedure.store.region.hfilecleaner.plugins\</name\>
8374     \<value\>org.apache.hadoop.hbase.master.cleaner.TimeToLiveHFileCleaner\</value\>
8375     \<description\>A comma-separated list of BaseHFileCleanerDelegate invoked by
8376     the RegionProcedureStore HFileCleaner service. These HFiles cleaners are
8377     called in order, so put the cleaner that prunes the most files in front. To
8378     implement your own BaseHFileCleanerDelegate, just put it in HBase's classpath
8379     and add the fully qualified class name here. Always add the above
8380     default hfile cleaners in the list as they will be overwritten in
8381     hbase-site.xml.\</description\>
8382   \</property\>
8383
8384 It will share the same TTL with other HFileCleaners. And you can also implement your own cleaner and change this property to enable it.
8385
8386
8387 ---
8388
8389 * [HBASE-23675](https://issues.apache.org/jira/browse/HBASE-23675) | *Minor* | **Move to Apache parent POM version 22**
8390
8391 Updated parent pom to Apache version 22.
8392
8393
8394 ---
8395
8396 * [HBASE-23679](https://issues.apache.org/jira/browse/HBASE-23679) | *Critical* | **FileSystem instance leaks due to bulk loads with Kerberos enabled**
8397
This change fixes an issue with Bulk Loading on installations with Kerberos enabled and more than a single RegionServer. When multiple RegionServers are involved in hosting a table's regions which are being bulk-loaded into, all but the RegionServer hosting the table's first Region will "leak" one DistributedFileSystem object onto the heap, never freeing that memory. Eventually, with enough bulk loads, this will create a situation for RegionServers where they have no free heap space and will either spend all time in JVM GC, lose their ZK session, or crash with an OutOfMemoryError.
8399
8400 The only mitigation for this issue is to periodically restart RegionServers. All earlier versions of HBase 2.x are subject to this issue (2.0.x, \<=2.1.8, \<=2.2.3)
8401
8402
8403 ---
8404
8405 * [HBASE-23286](https://issues.apache.org/jira/browse/HBASE-23286) | *Major* | **Improve MTTR: Split WAL to HFile**
8406
Adds a new feature to improve MTTR. Failover has 3 steps:
8408 1. Read WAL and write HFile to region’s column family’s recovered.hfiles directory.
8409 2. Open region.
8410 3. Bulkload the recovered.hfiles for every column family.
8411
8412 Compared to DLS(distributed log split), this feature will reduce region open time significantly.
8413
Set hbase.wal.split.to.hfile to true to enable this feature.
8415
8416
8417 ---
8418
8419 * [HBASE-23619](https://issues.apache.org/jira/browse/HBASE-23619) | *Trivial* | **Use built-in formatting for logging in hbase-zookeeper**
8420
8421 Changed the logging in hbase-zookeeper to use built-in formatting
8422
8423
8424 ---
8425
8426 * [HBASE-23628](https://issues.apache.org/jira/browse/HBASE-23628) | *Minor* | **Replace Apache Commons Digest Base64 with JDK8 Base64**
8427
8428 From the PR:
8429
8430 "Yes. The two create the same output... I just wrote a small test suite to increase my confidence on that. I generated many tens of millions of random byte patterns and compared the output of the two algorithms. They came back identical every time.
8431
8432 "Just in case any inquiring minds would like to know, there is no longer an encoding required when generating the strings. The JDK implementation specifically specifies that strings returned are StandardCharsets.ISO\_8859\_1. This does not change anything because UTF8 and ISO\_8859 overlap for the limited character set (64 characters) the encoding uses."
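
The charset point in the quote can be checked with a few lines against the JDK's `java.util.Base64`. This is a small sketch with hypothetical names, not the test suite mentioned in the PR: it encodes a string and confirms that the encoded text yields the same bytes under ISO-8859-1 and UTF-8.

```java
import java.nio.charset.StandardCharsets;
import java.util.Arrays;
import java.util.Base64;

// Illustrative sketch only; not the test suite from the PR.
class Base64CharsetSketch {
  /** Base64-encodes the UTF-8 bytes of the given text using the JDK encoder. */
  static String encodeText(String text) {
    return Base64.getEncoder().encodeToString(text.getBytes(StandardCharsets.UTF_8));
  }

  /** True if the encoded string has identical bytes in ISO-8859-1 and UTF-8. */
  static boolean sameBytesInBothCharsets(String encoded) {
    return Arrays.equals(
        encoded.getBytes(StandardCharsets.ISO_8859_1),
        encoded.getBytes(StandardCharsets.UTF_8));
  }

  public static void main(String[] args) {
    String encoded = encodeText("hbase");
    System.out.println(encoded);
    System.out.println(sameBytesInBothCharsets(encoded));
  }
}
```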
8433
8434
8435 ---
8436
8437 * [HBASE-23651](https://issues.apache.org/jira/browse/HBASE-23651) | *Major* | **Region balance throttling can be disabled**
8438
Setting hbase.balancer.max.balancing to an int value \<=0 disables region balance throttling.
8440
8441
8442 ---
8443
8444 * [HBASE-23588](https://issues.apache.org/jira/browse/HBASE-23588) | *Major* | **Cache index blocks and bloom blocks on write if CacheCompactedBlocksOnWrite is enabled**
8445
If cacheOnWrite is enabled during flush or compaction, index and bloom blocks (along with data blocks) are automatically cached during write.
8447
8448
8449 ---
8450
8451 * [HBASE-23369](https://issues.apache.org/jira/browse/HBASE-23369) | *Major* | **Auto-close 'unknown' Regions reported as OPEN on RegionServers**
8452
8453 If a RegionServer reports a Region as OPEN in disagreement with Master's status on the Region, the Master now tells the RegionServer to silently close the Region.
8454
8455
8456 ---
8457
8458 * [HBASE-23596](https://issues.apache.org/jira/browse/HBASE-23596) | *Major* | **HBCKServerCrashProcedure can double assign**
8459
Makes the recently added HBCKServerCrashProcedure (the SCP that is invoked when an operator schedules an SCP via the hbck2 scheduleRecoveries command) work the same as SCP, except when the Master knows nothing of the scheduled servername. In that case, HBCKSCP does a full scan of hbase:meta looking for instances of the passed servername. If any are found, it attempts to clean up the hbase:meta references by reassigning any Regions found in OPEN or OPENING state and by closing any in CLOSING state.
8461
8462 Used to fix instances of what the 'HBCK Report' page shows as 'Unknown Servers'.
8463
8464
8465 ---
8466
8467 * [HBASE-23624](https://issues.apache.org/jira/browse/HBASE-23624) | *Major* | **Add a tool to dump the procedure info in HFile**
8468
8469 Use ./hbase org.apache.hadoop.hbase.procedure2.store.region.HFileProcedurePrettyPrinter to run the tool.
8470
8471
8472 ---
8473
8474 * [HBASE-23590](https://issues.apache.org/jira/browse/HBASE-23590) | *Major* | **Update maxStoreFileRefCount to maxCompactedStoreFileRefCount**
8475
RegionsRecoveryChore, introduced as part of HBASE-22460, tries to reopen regions based on the config hbase.regions.recovery.store.file.ref.count.
The decision to reopen a region needs to consider all compacted-away store files that belong to the region, not the live (non-compacted) store files.
8478
8479 Fixed this bug as part of this Jira.
8480 Updated description for corresponding configs:
8481
8482 1. hbase.master.regions.recovery.check.interval :
8483
8484 Regions Recovery Chore interval in milliseconds. This chore keeps running at this interval to find all regions with configurable max store file ref count and reopens them. Defaults to 20 mins
8485
8486 2. hbase.regions.recovery.store.file.ref.count :
8487
A very large ref count on a compacted store file indicates a ref leak on that object (the compacted store file). Such files cannot be removed after they are invalidated via compaction. The only way to recover in this scenario is to reopen the region, which releases all resources such as the refcount, leases, etc. This config is the store file ref count threshold considered for reopening regions. Any region with a compacted store file ref count \> this value is eligible for reopening by the master. The value compared is the max refCount among all refCounts on all compacted-away store files that belong to a particular region. The default value of -1 turns this feature off. Provide a positive integer to enable it.
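
The eligibility rule described above can be sketched in a few lines. This is an illustrative model only: `ReopenCheck` and `shouldReopen` are hypothetical names, and the way the real chore collects ref counts is not reproduced here.

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical model of the threshold check; not HBase code.
final class ReopenCheck {

  // Returns true if the region would be eligible for reopening.
  // A threshold <= 0 (the default is -1) means the feature is off.
  static boolean shouldReopen(List<Integer> compactedFileRefCounts, int threshold) {
    if (threshold <= 0) {
      return false;
    }
    int max = 0;
    for (int refCount : compactedFileRefCounts) {
      max = Math.max(max, refCount);
    }
    // Strictly greater than the threshold, as in the config description.
    return max > threshold;
  }

  public static void main(String[] args) {
    List<Integer> refCounts = Arrays.asList(1, 5, 3);
    System.out.println(shouldReopen(refCounts, 4));
    System.out.println(shouldReopen(refCounts, -1));
  }
}
```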
8489
8490
8491 ---
8492
8493 * [HBASE-23618](https://issues.apache.org/jira/browse/HBASE-23618) | *Major* | **Add a tool to dump procedure info in the WAL file**
8494
8495 Use ./hbase org.apache.hadoop.hbase.procedure2.store.region.WALProcedurePrettyPrinter to run the tool.
8496
8497
8498 ---
8499
8500 * [HBASE-23617](https://issues.apache.org/jira/browse/HBASE-23617) | *Major* | **Add a stress test tool for region based procedure store**
8501
8502 Use ./hbase org.apache.hadoop.hbase.procedure2.store.region.RegionProcedureStorePerformanceEvaluation to run the tool.
8503
8504
8505 ---
8506
8507 * [HBASE-23326](https://issues.apache.org/jira/browse/HBASE-23326) | *Critical* | **Implement a ProcedureStore which stores procedures in a HRegion**
8508
Use a region based procedure store to replace the old customized WAL based procedure store. The procedure data migration is done automatically during upgrade. After upgrading, the MasterProcWALs directory will be deleted and a new MasterProc directory will be created. Note that a region still writes a WAL, so there are still WAL files, and they will be moved to the oldWALs directory. The file name looks like a normal WAL file name; the only difference is that it ends with "$masterproc$".
8510
8511
8512 ---
8513
8514 * [HBASE-23320](https://issues.apache.org/jira/browse/HBASE-23320) | *Major* | **Upgrade surefire plugin to 3.0.0-M4**
8515
8516 Bumped surefire plugin to 3.0.0-M4
8517
8518
8519 ---
8520
8521 * [HBASE-20461](https://issues.apache.org/jira/browse/HBASE-20461) | *Major* | **Implement fsync for AsyncFSWAL**
8522
8523 Now AsyncFSWAL also supports Durability.FSYNC\_WAL.
8524
8525
8526 ---
8527
8528 * [HBASE-23066](https://issues.apache.org/jira/browse/HBASE-23066) | *Minor* | **Create a config that forces to cache blocks on compaction**
8529
The configuration 'hbase.rs.cacheblocksonwrite' enables caching blocks on write. Blocks written during compaction were deliberately not cached, since caching happens as soon as the writer completes a block and can be very aggressive.
Cloud environments tend to have larger caches. Enabling 'hbase.rs.prefetchblocksonopen' (a non-aggressive way of proactively caching blocks on reader creation) does not help there, because it takes time to cache the compacted blocks.
This feature adds a new configuration, 'hbase.rs.cachecompactedblocksonwrite', which when set to 'true' enables caching of the blocks created by compaction.
Because this is aggressive caching, make sure there is enough cache space; otherwise other active blocks may be evicted.
It can also be enabled per Column Family from the shell, using the format below:
{code}
create 't1', 'f1', {NUMREGIONS =\> 15, SPLITALGO =\> 'HexStringSplit', CONFIGURATION =\> {'hbase.rs.cachecompactedblocksonwrite' =\> 'true'}}
{code}
8538
8539
8540 ---
8541
8542 * [HBASE-23239](https://issues.apache.org/jira/browse/HBASE-23239) | *Major* | **Reporting on status of backing MOB files from client-facing cells**
8543
8544 <!-- markdown -->
8545
8546 Users of the MOB feature can now use the `mobrefs` utility to get statistics about data in the MOB system and verify the health of backing files on HDFS.
8547
```
HADOOP_CLASSPATH=/etc/hbase/conf:$(hbase mapredcp) yarn jar \
    /some/path/to/hbase-shaded-mapreduce.jar mobrefs mobrefs-report-output some_table foo
```
8552
8553 See javadocs of the class `MobRefReporter` for more details.
8554
The reference guide now includes information about MOB internals and troubleshooting.
8556
8557
8558 ---
8559
8560 * [HBASE-23549](https://issues.apache.org/jira/browse/HBASE-23549) | *Minor* | **Document steps to disable MOB for a column family**
8561
The reference guide now includes a walkthrough of disabling the MOB feature, if needed, while maintaining availability.
8563
8564
8565 ---
8566
8567 * [HBASE-23582](https://issues.apache.org/jira/browse/HBASE-23582) | *Minor* | **Unbalanced braces in string representation of table descriptor**
8568
8569 Fixed unbalanced braces in string representation within HBase shell
8570
8571
8572 ---
8573
8574 * [HBASE-23293](https://issues.apache.org/jira/browse/HBASE-23293) | *Minor* | **[REPLICATION] make ship edits timeout configurable**
8575
The default rpc timeout for ReplicationSourceShipper#shipEdits is 60s. When bulkload replication is enabled, a timeout exception may occur.
The timeout value can now be configured through replication.source.shipedits.timeout, and it is adaptive.
8578
8579
8580 ---
8581
8582 * [HBASE-23312](https://issues.apache.org/jira/browse/HBASE-23312) | *Major* | **HBase Thrift SPNEGO configs (HBASE-19852) should be backwards compatible**
8583
The newer HBase Thrift SPNEGO configs should not be required. The hbase.thrift.spnego.keytab.file and hbase.thrift.spnego.principal configs will fall back to the original hbase.thrift.keytab.file and hbase.thrift.kerberos.principal configs. Using the older configs logs a deprecation warning. The newer SPNEGO configurations are preferred.
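
The fallback behaviour can be pictured with the following sketch. It is a simplified model that uses a plain `Map` in place of the real configuration object; `SpnegoConfigFallback` and `resolve` are hypothetical names, and the warning text is invented for the example.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical model of "new key falls back to old key"; not HBase code.
final class SpnegoConfigFallback {

  // Returns the value of newKey if set, otherwise the value of oldKey (or null).
  static String resolve(Map<String, String> conf, String newKey, String oldKey) {
    String value = conf.get(newKey);
    if (value != null) {
      return value;
    }
    String legacy = conf.get(oldKey);
    if (legacy != null) {
      // Illustrative warning only; the real log message may differ.
      System.err.println(oldKey + " is deprecated, please use " + newKey);
    }
    return legacy;
  }

  public static void main(String[] args) {
    Map<String, String> conf = new HashMap<>();
    conf.put("hbase.thrift.keytab.file", "/path/to/old.keytab");
    System.out.println(resolve(conf,
        "hbase.thrift.spnego.keytab.file", "hbase.thrift.keytab.file"));
  }
}
```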
8585
8586
8587 ---
8588
8589 * [HBASE-22969](https://issues.apache.org/jira/browse/HBASE-22969) | *Minor* | **A new binary component comparator(BinaryComponentComparator) to perform comparison of arbitrary length and position**
8590
With BinaryComponentComparator, applications can design a diverse and powerful set of filters for rows and columns. See https://issues.apache.org/jira/browse/HBASE-22969 for an example. In general, the comparator can be used with any filter that takes a ByteArrayComparable. As of now, the following filters take a ByteArrayComparable:
8592
8593 1. RowFilter
8594 2. ValueFilter
8595 3. QualifierFilter
8596 4. FamilyFilter
8597 5. ColumnValueFilter
8598
8599
8600 ---
8601
8602 * [HBASE-23234](https://issues.apache.org/jira/browse/HBASE-23234) | *Major* | **Provide .editorconfig based on checkstyle configuration**
8603
Adds a .editorconfig file with configurations populated by IntelliJ, based on our checkstyle configuration. There are many IntelliJ-specific configs in here that I assume do not carry over to Eclipse or NetBeans users. Any devs using those tools should push whatever updates they see fit, but please start with the checkstyle configs as the source of truth.
8605
8606
8607 ---
8608
8609 * [HBASE-23322](https://issues.apache.org/jira/browse/HBASE-23322) | *Minor* | **[hbck2] Simplification on HBCKSCP scheduling**
8610
8611 An hbck2 scheduleRecoveries will run a subclass of ServerCrashProcedure which asks Master what Regions were on the dead Server but it will also do a hbase:meta table scan to see if any vestiges of the old Server remain (for the case where an SCP failed mid-point leaving references in place or where Master and hbase:meta deviated in accounting).
8612
8613
8614 ---
8615
8616 * [HBASE-23321](https://issues.apache.org/jira/browse/HBASE-23321) | *Minor* | **[hbck2] fixHoles of fixMeta doesn't update in-memory state**
8617
If there are holes in hbase:meta, hbck2 fixMeta now updates Master in-memory state, so you do not need to restart the Master just to assign the new hole-bridging regions.
8619
8620
8621 ---
8622
8623 * [HBASE-23282](https://issues.apache.org/jira/browse/HBASE-23282) | *Major* | **HBCKServerCrashProcedure for 'Unknown Servers'**
8624
hbck2 scheduleRecoveries now runs an SCP that also looks in hbase:meta for any references to the scheduled server, rather than only consulting Master in-memory state, in case vestiges of the server are left over in hbase:meta.
8626
8627
8628 ---
8629
8630 * [HBASE-19450](https://issues.apache.org/jira/browse/HBASE-19450) | *Minor* | **Add log about average execution time for ScheduledChore**
8631
8632 <!-- markdown -->
8633 HBase internal chores now log a moving average of how long execution of each chore takes at `INFO` level for the logger `org.apache.hadoop.hbase.ScheduledChore`.
8634
8635 Such messages will happen at most once per five minutes.
8636
8637
8638 ---
8639
8640 * [HBASE-23250](https://issues.apache.org/jira/browse/HBASE-23250) | *Minor* | **Log message about CleanerChore delegate initialization should be at INFO**
8641
8642 CleanerChore delegate initialization is now logged at INFO level instead of DEBUG
8643
8644
8645 ---
8646
8647 * [HBASE-23243](https://issues.apache.org/jira/browse/HBASE-23243) | *Major* | **[pv2] Filter out SUCCESS procedures; on decent-sized cluster, plethora overwhelms problems**
8648
8649 The 'Procedures & Locks' tab in Master UI only displays problematic Procedures now (RUNNABLE, WAITING-TIMEOUT, etc.). It no longer notes procedures whose state is SUCCESS.
8650
8651
8652 ---
8653
8654 * [HBASE-23227](https://issues.apache.org/jira/browse/HBASE-23227) | *Blocker* | **Upgrade jackson-databind to 2.9.10.1 to avoid recent CVEs**
8655
8656 <!-- markdown -->
8657
The Apache HBase REST Proxy now uses Jackson Databind version 2.9.10.1 to address the following CVEs:
8659
8660   - CVE-2019-16942
8661   - CVE-2019-16943
8662
8663 Users of prior releases with Jackson Databind 2.9.10 are advised to either upgrade to this release or to upgrade their local Jackson Databind jar directly.
8664
8665
8666 ---
8667
8668 * [HBASE-23222](https://issues.apache.org/jira/browse/HBASE-23222) | *Critical* | **Better logging and mitigation for MOB compaction failures**
8669
8670 <!-- markdown -->
8671
8672 The MOB compaction process in the HBase Master now logs more about its activity.
8673
8674 In the event that you run into the problems described in HBASE-22075, there is a new HFileCleanerDelegate that will stop all removal of MOB hfiles from the archive area. It can be configured by adding `org.apache.hadoop.hbase.mob.ManualMobMaintHFileCleaner` to the list configured for `hbase.master.hfilecleaner.plugins`. This new cleaner delegate will cause your archive area to grow unbounded; you will have to manually prune files which may be prohibitively complex. Consider if your use case will allow you to mitigate by disabling mob compactions instead.
8675
8676 Caveats:
8677 * Be sure the list of cleaner delegates still includes the default cleaners you will likely need: ttl, snapshot, and hlink.
8678 * Be mindful that if you enable this cleaner delegate then there will be *no* automated process for removing these mob hfiles. You should see a single region per table in `%hbase_root%/archive` that accumulates files over time. You will have to determine which of these files are safe or not to remove.
8679 * You should list this cleaner delegate after the snapshot and hlink delegates so that you can enable sufficient logging to determine when an archived mob hfile is needed by those subsystems. When set to `TRACE` logging, the CleanerChore logger will include archive retention decision justifications.
8680 * If your use case creates a large number of uniquely named tables, this new delegate will cause memory pressure on the master.
8681
8682
8683 ---
8684
8685 * [HBASE-15519](https://issues.apache.org/jira/browse/HBASE-15519) | *Major* | **Add per-user metrics**
8686
8687 Adds per-user metrics for reads/writes to each RegionServer. These metrics are exported by default. hbase.regionserver.user.metrics.enabled can be used to disable the feature if desired for any reason.
8688
8689
8690 ---
8691
8692 * [HBASE-22460](https://issues.apache.org/jira/browse/HBASE-22460) | *Minor* | **Reopen a region if store reader references may have leaked**
8693
Leaked store files cannot be removed even after they are invalidated via compaction. A reasonable mitigation for a reader reference leak is a fast reopen of the region on the same server.
8695
8696 Configs:
8697
8698 1. hbase.master.regions.recovery.check.interval :
8699
8700 Regions Recovery Chore interval in milliseconds. This chore keeps running at this interval to find all regions with configurable max store file ref count and reopens them. Defaults to 20 mins
8701
8702 2. hbase.regions.recovery.store.file.ref.count :
8703
8704 This config represents Store files Ref Count threshold value considered for reopening regions. Any region with store files ref count \> this value would be eligible for reopening by master. Default value -1 indicates this feature is turned off. Only positive integer value should be provided to enable this feature.
8705
8706
8707 ---
8708
8709 * [HBASE-23172](https://issues.apache.org/jira/browse/HBASE-23172) | *Minor* | **HBase Canary region success count metrics reflect column family successes, not region successes**
8710
8711 Added a comment to make clear that read/write success counts are tallying column family success counts, not region success counts.
8712
8713 Additionally, the region read and write latencies previously only stored the latencies of the last column family of the region reads/writes. This has been fixed by using a map of each region to a list of read and write latency values.
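
As a rough illustration of the data structure mentioned above, the sketch below keeps a list of read latencies per region instead of a single overwritten value. The class and method names are hypothetical and do not mirror the Canary implementation.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical model of per-region latency tracking; not the Canary code.
final class RegionLatencyDemo {

  private final Map<String, List<Long>> readLatencies = new HashMap<>();

  // Appends a latency sample (one per column family read) for the region.
  void recordRead(String region, long latencyMs) {
    readLatencies.computeIfAbsent(region, k -> new ArrayList<>()).add(latencyMs);
  }

  // Returns all samples recorded for the region, in insertion order.
  List<Long> readsFor(String region) {
    return readLatencies.getOrDefault(region, Collections.<Long>emptyList());
  }

  public static void main(String[] args) {
    RegionLatencyDemo demo = new RegionLatencyDemo();
    demo.recordRead("region-a", 5L); // column family 1
    demo.recordRead("region-a", 7L); // column family 2
    System.out.println(demo.readsFor("region-a"));
  }
}
```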
8714
8715
8716 ---
8717
8718 * [HBASE-23177](https://issues.apache.org/jira/browse/HBASE-23177) | *Major* | **If fail to open reference because FNFE, make it plain it is a Reference**
8719
Changes the message on the FNFE thrown when the file a Reference points to is missing. The message now includes detail on the Reference as well as the pointed-to file, so you can see how the FNFE relates to the region open.
8721
8722
8723 ---
8724
8725 * [HBASE-20626](https://issues.apache.org/jira/browse/HBASE-20626) | *Major* | **Change the value of "Requests Per Second" on WEBUI**
8726
8727 Use 'totalRowActionRequestCount' to calculate QPS on web UI.
8728
8729
8730 ---
8731
8732 * [HBASE-22874](https://issues.apache.org/jira/browse/HBASE-22874) | *Critical* | **Define a public interface for Canary and move existing implementation to LimitedPrivate**
8733
8734 <!-- markdown -->
8735 Downstream users who wish to programmatically check the health of their HBase cluster may now rely on a public interface derived from the previously private implementation of the canary cli tool. The interface is named `Canary` and can be found in the user facing javadocs.
8736
Downstream users who previously relied on invoking the canary via the Java classname (either on the command line or programmatically) will need to change how they do so because the non-public implementation has moved.
8738
8739
8740 ---
8741
8742 * [HBASE-23035](https://issues.apache.org/jira/browse/HBASE-23035) | *Major* | **Retain region to the last RegionServer make the failover slower**
8743
Since 2.0.0, when a regionserver crashes and comes back online, the AssignmentManager retains the region locations and tries to assign the regions to that regionserver (same host:port as the crashed one) again. In 1.x.x, the regions that belonged to the crashed regionserver were assigned round-robin. This jira changes the "retain" assignment to round-robin assignment, the same as in 1.x.x. This change makes failover faster and improves availability.
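
For readers unfamiliar with the term, round-robin assignment simply deals regions out to the available servers in turn. The sketch below is a generic illustration with hypothetical names; it does not reproduce the AssignmentManager or balancer code.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Generic round-robin illustration; not the HBase AssignmentManager.
final class RoundRobinAssignDemo {

  // Deals the regions out to the servers in order, wrapping around.
  static Map<String, List<String>> assign(List<String> regions, List<String> servers) {
    if (servers.isEmpty()) {
      throw new IllegalArgumentException("no servers available");
    }
    Map<String, List<String>> plan = new LinkedHashMap<>();
    for (String server : servers) {
      plan.put(server, new ArrayList<String>());
    }
    for (int i = 0; i < regions.size(); i++) {
      String server = servers.get(i % servers.size());
      plan.get(server).add(regions.get(i));
    }
    return plan;
  }

  public static void main(String[] args) {
    List<String> regions = Arrays.asList("r1", "r2", "r3", "r4", "r5");
    List<String> servers = Arrays.asList("rs-a", "rs-b");
    System.out.println(assign(regions, servers));
  }
}
```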
8745
8746
8747 ---
8748
8749 * [HBASE-23046](https://issues.apache.org/jira/browse/HBASE-23046) | *Minor* | **Remove compatibility case from truncate command**
8750
8751 Remove backward compatibility from \`truncate\` and \`truncate\_preserve\` shell commands. This means that these commands from HBase Clients are not compatible with pre-0.99 HBase clusters.
8752
8753
8754 ---
8755
8756 * [HBASE-23040](https://issues.apache.org/jira/browse/HBASE-23040) | *Minor* | **region mover gives NullPointerException instead of saying a host isn't in the cluster**
8757
Giving the region mover "unload" command a region server name that isn't recognized by the cluster now results in an "I don't know about that host" message instead of an NPE.

Set the log level to DEBUG if you'd like the region mover to log the set of region server names it got back from the cluster.
8761
8762
8763 ---
8764
8765 * [HBASE-21874](https://issues.apache.org/jira/browse/HBASE-21874) | *Major* | **Bucket cache on Persistent memory**
8766
Added a new IOEngine type for the bucket cache: persistent memory. To use the bucket cache over pmem, configure the IOEngine as
\<property\>
  \<name\>hbase.bucketcache.ioengine\</name\>
  \<value\> pmem:///path in persistent memory \</value\>
\</property\>
8772
8773
8774 ---
8775
8776 * [HBASE-22760](https://issues.apache.org/jira/browse/HBASE-22760) | *Major* | **Stop/Resume Snapshot Auto-Cleanup activity with shell command**
8777
By default, snapshot auto cleanup based on TTL is enabled for any new cluster. If snapshot cleanup needs to be stopped at any point, for example during snapshot restore activity, disable it using the shell command:
hbase\> snapshot\_cleanup\_switch false

Re-enable it using:
hbase\> snapshot\_cleanup\_switch true

Query whether snapshot auto cleanup is enabled for the cluster using:
hbase\> snapshot\_cleanup\_enabled
8786
8787
8788 ---
8789
8790 * [HBASE-22796](https://issues.apache.org/jira/browse/HBASE-22796) | *Major* | **[HBCK2] Add fix of overlaps to fixMeta hbck Service**
8791
Adds fix of overlaps to the fixMeta hbck service method. Uses the bulk-merge facility. Merges a max of 10 at a time. Set hbase.master.metafixer.max.merge.count to a higher value if you want to merge more than 10 in one go.
8793
8794
8795 ---
8796
8797 * [HBASE-21745](https://issues.apache.org/jira/browse/HBASE-21745) | *Critical* | **Make HBCK2 be able to fix issues other than region assignment**
8798
8799 This issue adds via its subtasks:
8800
 \* An 'HBCK Report' page to the Master UI, added by HBASE-22527+HBASE-22709+HBASE-22723+ (since 2.1.6, 2.2.1, 2.3.0). Lists inconsistencies or anomalies found via new hbase:meta consistency checking extensions added to CatalogJanitor (holes, overlaps, bad servers) and by a new 'HBCK chore' that runs less frequently and notes filesystem orphans and overlaps as well as the following conditions:
 \*\* Master thought this region opened, but no regionserver reported it.
 \*\* Master thought this region opened on Server1, but a regionserver reported Server2.
 \*\* More than one regionserver reported this region as opened.
 Both chores can be triggered from the shell to regenerate ‘new’ reports.
8806  \* Means of scheduling a ServerCrashProcedure (HBASE-21393).
8807  \* An ‘offline’ hbase:meta rebuild (HBASE-22680).
8808  \* Offline replace of hbase.version and hbase.id
8809  \* Documentation on how to use completebulkload tool to ‘adopt’ orphaned data found by new HBCK2 ‘filesystem’ check (see below) and ‘HBCK chore’ (HBASE-22859)
8810  \* A ‘holes’ and ‘overlaps’ fix that runs in the master that uses new bulk-merge facility to collapse many overlaps in the one go.
8811  \* hbase-operator-tools HBCK2 client tool got a bunch of additions:
8812  \*\* A specialized 'fix' for the case where operators ran old hbck 'offlinemeta' repair and destroyed their hbase:meta; it ties together holes in meta with orphaned data in the fs (HBASE-22567)
8813  \*\* A ‘filesystem’ command that reports on orphan data as well as bad references and hlinks with a ‘fix’ for the latter two options (based on hbck1 facility updated).
8814  \*\* Adds back the ‘replication’ fix facility from hbck1 (HBASE-22717)
8815
The compound result is that hbck2 now exceeds hbck1's abilities. The provided functionality is disaggregated, per the hbck2 philosophy of providing 'plumbing' rather than 'porcelain', so there is still work to do adding fix-it playbooks, scripting across outages, and automation.
8817
8818
8819 ---
8820
8821 * [HBASE-22802](https://issues.apache.org/jira/browse/HBASE-22802) | *Major* | **Avoid temp ByteBuffer allocation in FileIOEngine#read**
8822
HBASE-21879 introduces a utility class (org.apache.hadoop.hbase.io.ByteBuffAllocator) for allocating ByteBuffers from, and freeing them to, an NIO ByteBuffer pool. When BucketCache is enabled with the file or mmap engine, this ByteBuffer pool is now used to avoid most temporary ByteBuffer allocations.
8824
8825
8826 ---
8827
8828 * [HBASE-11062](https://issues.apache.org/jira/browse/HBASE-11062) | *Major* | **hbtop**
8829
Introduces hbtop, a real-time monitoring tool for HBase modeled on Unix's top command. See the ref guide for details: https://hbase.apache.org/book.html#hbtop
8831
8832
8833 ---
8834
8835 * [HBASE-21879](https://issues.apache.org/jira/browse/HBASE-21879) | *Major* | **Read HFile's block to ByteBuffer directly instead of to byte for reducing young gc purpose**
8836
Before this issue, the read path was 100% offheap when blocks hit the BucketCache 100% of the time, but on a cache miss the RS had to read the block through the on-heap API, which caused high young GC pressure.
With this issue, blocks are read offheap even when read directly from the filesystem. This requires Hadoop version \>=2.9.3; older Hadoop versions still work, but blocks are read on-heap. A careful doc covering the implementation, performance, and practice is available here: https://docs.google.com/document/d/1xSy9axGxafoH-Qc17zbD2Bd--rWjjI00xTWQZ8ZwI\_E/edit#heading=h.nch5d72p27ex. Please read it for more details.
8839
8840
8841 ---
8842
8843 * [HBASE-22618](https://issues.apache.org/jira/browse/HBASE-22618) | *Major* | **added the possibility to load custom cost functions**
8844
8845 <!-- markdown -->
Extends `StochasticLoadBalancer` to support user-provided cost functions. These are loaded in addition to the default set of cost functions. Custom function implementations must extend `StochasticLoadBalancer$CostFunction`. Enable any additional functions by placing them on the master class path and configuring `hbase.master.balancer.stochastic.additionalCostFunctions` with a comma-separated list of fully-qualified class names.
8847
8848
8849 ---
8850
8851 * [HBASE-22867](https://issues.apache.org/jira/browse/HBASE-22867) | *Critical* | **The ForkJoinPool in CleanerChore will spawn thousands of threads in our cluster with thousands table**
8852
Replaces the ForkJoinPool in CleanerChore with a ThreadPoolExecutor, which can limit the number of spawned threads and avoid frequent master GC. The replacement is internal to CleanerChore, so no config keys change; users can upgrade the HBase master without any other change.
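
The thread-limiting property of a fixed-size `ThreadPoolExecutor` can be seen in a small standalone sketch. The class name, pool size, and task counts below are arbitrary assumptions for the demo and are not taken from CleanerChore.

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.LinkedBlockingQueue;
import java.util.concurrent.ThreadPoolExecutor;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.atomic.AtomicInteger;

// Standalone demo of a bounded pool; not the CleanerChore implementation.
final class BoundedPoolDemo {

  // Runs `tasks` short jobs on a pool capped at `poolSize` threads and
  // returns the highest number of jobs observed running at the same time.
  static int peakConcurrency(int poolSize, int tasks) {
    ThreadPoolExecutor pool = new ThreadPoolExecutor(
        poolSize, poolSize, 60L, TimeUnit.SECONDS, new LinkedBlockingQueue<Runnable>());
    AtomicInteger running = new AtomicInteger();
    AtomicInteger peak = new AtomicInteger();
    CountDownLatch done = new CountDownLatch(tasks);
    for (int i = 0; i < tasks; i++) {
      pool.execute(() -> {
        int now = running.incrementAndGet();
        peak.accumulateAndGet(now, Math::max);
        try {
          Thread.sleep(2); // simulate a little work
        } catch (InterruptedException e) {
          Thread.currentThread().interrupt();
        } finally {
          running.decrementAndGet();
          done.countDown();
        }
      });
    }
    try {
      done.await();
    } catch (InterruptedException e) {
      Thread.currentThread().interrupt();
      throw new IllegalStateException("interrupted while waiting for tasks", e);
    } finally {
      pool.shutdown();
    }
    return peak.get();
  }

  public static void main(String[] args) {
    // Extra tasks wait in the queue instead of spawning more threads.
    System.out.println("peak concurrency: " + peakConcurrency(3, 50));
  }
}
```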
8854
8855
8856 ---
8857
8858 * [HBASE-22810](https://issues.apache.org/jira/browse/HBASE-22810) | *Major* | **Initialize an separate ThreadPoolExecutor for taking/restoring snapshot**
8859
Introduced a new config key for snapshot taking/restoring operations on the master side: hbase.master.executor.snapshot.threads. Its default value is 3, which means 3 snapshot operations can run at the same time.
8861
8862
8863 ---
8864
8865 * [HBASE-22863](https://issues.apache.org/jira/browse/HBASE-22863) | *Major* | **Avoid Jackson versions and dependencies with known CVEs**
8866
1. Stopped exposing vulnerable Jackson1 dependencies so that downstream users do not pull them in from HBase.
2. However, since Hadoop requires some Jackson1 dependencies, the vulnerable Jackson mapper is kept at test scope in some HBase modules, and hence the HBase tarball created by hbase-assembly contains the Jackson1 mapper jar in lib. Still, downstream applications can't pull in Jackson1 from HBase.
8869
8870
8871 ---
8872
8873 * [HBASE-22841](https://issues.apache.org/jira/browse/HBASE-22841) | *Major* | **TimeRange's factory functions do not support ranges, only \`allTime\` and \`at\`**
8874
Adds several APIs to the TimeRange class to avoid using the deprecated TimeRange constructor:
8876 \* TimeRange#from: Represents the time interval [minStamp, Long.MAX\_VALUE)
8877 \* TimeRange#until: Represents the time interval [0, maxStamp)
8878 \* TimeRange#between: Represents the time interval [minStamp, maxStamp)
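
The three intervals listed above are all half-open: the lower bound is included and the upper bound is excluded. The following plain-Java model shows that semantics. `HalfOpenRange` is a hypothetical stand-in written for this note, not HBase's `TimeRange`, and its validation rules are assumptions.

```java
// Hypothetical model of half-open [min, max) ranges; not HBase's TimeRange.
final class HalfOpenRange {

  private final long min;
  private final long max;

  private HalfOpenRange(long min, long max) {
    if (min < 0 || max < min) {
      throw new IllegalArgumentException("invalid range: [" + min + ", " + max + ")");
    }
    this.min = min;
    this.max = max;
  }

  // [minStamp, Long.MAX_VALUE)
  static HalfOpenRange from(long minStamp) {
    return new HalfOpenRange(minStamp, Long.MAX_VALUE);
  }

  // [0, maxStamp)
  static HalfOpenRange until(long maxStamp) {
    return new HalfOpenRange(0L, maxStamp);
  }

  // [minStamp, maxStamp)
  static HalfOpenRange between(long minStamp, long maxStamp) {
    return new HalfOpenRange(minStamp, maxStamp);
  }

  // Lower bound inclusive, upper bound exclusive.
  boolean contains(long timestamp) {
    return timestamp >= min && timestamp < max;
  }

  public static void main(String[] args) {
    HalfOpenRange range = between(10L, 20L);
    System.out.println(range.contains(10L)); // lower bound is included
    System.out.println(range.contains(20L)); // upper bound is excluded
  }
}
```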
8879
8880
8881 ---
8882
8883 * [HBASE-22833](https://issues.apache.org/jira/browse/HBASE-22833) | *Minor* | **MultiRowRangeFilter should provide a method for creating a filter which is functionally equivalent to multiple prefix filters**
8884
Provides a public constructor in the MultiRowRangeFilter class for filtering with multiple row prefixes. It expands the row prefixes into multiple rowkey ranges handled by MultiRowRangeFilter, which is more efficient than using multiple prefix filters.
{code}
public MultiRowRangeFilter(byte[][] rowKeyPrefixes);
{code}
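
One common way to turn a row prefix into a [start, stop) rowkey range is to use the prefix as the start row and the prefix with its last incrementable byte bumped by one as the exclusive stop row. The sketch below illustrates that idea only; `PrefixRangeDemo` is a hypothetical helper and is not taken from the MultiRowRangeFilter source.

```java
import java.util.Arrays;

// Illustration of prefix-to-range expansion; not the HBase implementation.
final class PrefixRangeDemo {

  // Returns the exclusive stop row for all rows starting with `prefix`.
  // An empty result means "no upper bound" (empty or all-0xFF prefix).
  static byte[] stopRowForPrefix(byte[] prefix) {
    int i = prefix.length - 1;
    // Trailing 0xFF bytes cannot be incremented, so drop them.
    while (i >= 0 && prefix[i] == (byte) 0xFF) {
      i--;
    }
    if (i < 0) {
      return new byte[0];
    }
    byte[] stop = Arrays.copyOf(prefix, i + 1);
    stop[i]++; // unsigned increment of the last remaining byte
    return stop;
  }

  public static void main(String[] args) {
    byte[] prefix = {'a', 'b', 'c'};
    // Rows with this prefix fall in [prefix, stopRowForPrefix(prefix)).
    System.out.println(Arrays.toString(stopRowForPrefix(prefix)));
  }
}
```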
8889
8890
8891 ---
8892
8893 * [HBASE-22856](https://issues.apache.org/jira/browse/HBASE-22856) | *Major* | **HBASE-Find-Flaky-Tests fails with pip error**
8894
8895 Update the base docker image to ubuntu 18.04 for the find flaky tests jenkins job.
8896
8897
8898 ---
8899
8900 * [HBASE-22771](https://issues.apache.org/jira/browse/HBASE-22771) | *Major* | **[HBCK2] fixMeta method and server-side support**
8901
8902 Adds a fixMeta method to hbck Service. Fixes holes in hbase:meta. Follow-up to fix overlaps. See HBASE-22567 also.
8903
8904 Follow-on is adding a client-side to hbase-operator-tools that can exploit this new addition (HBASE-22825)
8905
8906
8907 ---
8908
8909 * [HBASE-22777](https://issues.apache.org/jira/browse/HBASE-22777) | *Major* | **Add a multi-region merge (for fixing overlaps, etc.)**
8910
8911 Changes merge so you can merge more than two regions at a time.  Currently only available inside HBase. HBASE-22827, a follow-on, is about exposing the facility in the Admin API (and then via the shell).
8912
8913
8914 ---
8915
8916 * [HBASE-15666](https://issues.apache.org/jira/browse/HBASE-15666) | *Critical* | **shaded dependencies for hbase-testing-util**
8917
8918 New shaded artifact for testing: hbase-shaded-testing-util.
8919
8920
8921 ---
8922
8923 * [HBASE-22776](https://issues.apache.org/jira/browse/HBASE-22776) | *Major* | **Rename config names in user scan snapshot feature**
8924
After HBASE-22776, the steps to configure the user scan snapshot feature are as follows:
1. Check HDFS configuration
2. Add master coprocessor:
    hbase.coprocessor.master.classes=
    “org.apache.hadoop.hbase.security.access.AccessController,
    org.apache.hadoop.hbase.security.access.SnapshotScannerHDFSAclController”
3. Enable this feature:
    hbase.acl.sync.to.hdfs.enable=true
4. Modify table schema to enable this feature for a table:
    alter 't1', CONFIGURATION =\> {'hbase.acl.sync.to.hdfs.enable' =\> 'true'}
8935
8936
8937 ---
8938
8939 * [HBASE-22539](https://issues.apache.org/jira/browse/HBASE-22539) | *Blocker* | **WAL corruption due to early DBBs re-use when Durability.ASYNC\_WAL is used**
8940
We found a critical bug which can lead to WAL corruption when Durability.ASYNC\_WAL is used. The reason is that we release a ByteBuffer before actually persisting the content into the WAL file.

The problem may lead to several errors, for example, ArrayIndexOutOfBoundsException when replaying the WAL. This is because the ByteBuffer is reused by others.
8944
8945 ERROR org.apache.hadoop.hbase.executor.EventHandler: Caught throwable while processing event RS\_LOG\_REPLAY
8946 java.lang.ArrayIndexOutOfBoundsException: 18056
8947         at org.apache.hadoop.hbase.KeyValue.getFamilyLength(KeyValue.java:1365)
8948         at org.apache.hadoop.hbase.KeyValue.getFamilyLength(KeyValue.java:1358)
8949         at org.apache.hadoop.hbase.PrivateCellUtil.matchingFamily(PrivateCellUtil.java:735)
8950         at org.apache.hadoop.hbase.CellUtil.matchingFamily(CellUtil.java:816)
8951         at org.apache.hadoop.hbase.wal.WALEdit.isMetaEditFamily(WALEdit.java:143)
8952         at org.apache.hadoop.hbase.wal.WALEdit.isMetaEdit(WALEdit.java:148)
8953         at org.apache.hadoop.hbase.wal.WALSplitter.splitLogFile(WALSplitter.java:297)
8954         at org.apache.hadoop.hbase.wal.WALSplitter.splitLogFile(WALSplitter.java:195)
8955         at org.apache.hadoop.hbase.regionserver.SplitLogWorker$1.exec(SplitLogWorker.java:100)
8956
It may even cause a segmentation fault and crash the JVM directly. You will see a hs\_err\_pidXXX.log file, and usually the problem is SIGSEGV. This is usually because the ByteBuffer has already been returned to the OS and used for another purpose.
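
The general hazard, reusing a buffer before a pending write has consumed it, can be shown with a deliberately simplified analogy. The code below is not the WAL code path: `EarlyReuseDemo` and its methods are hypothetical, and a heap `ByteBuffer` is used, so the worst outcome here is wrong data rather than a JVM crash.

```java
import java.nio.ByteBuffer;

// Simplified analogy of the early-reuse hazard; not the HBase WAL code.
final class EarlyReuseDemo {

  // Stands in for a writer that reads the queued buffer at some later time.
  static long persistLater(ByteBuffer pending) {
    return pending.getLong(0);
  }

  // Buggy ordering: the buffer is reused before the writer has read it.
  static long unsafe() {
    ByteBuffer buf = ByteBuffer.allocate(Long.BYTES);
    buf.putLong(0, 42L);         // edit queued for the async writer
    ByteBuffer queued = buf;     // the writer holds a reference, not a copy
    buf.putLong(0, 99L);         // buffer released and reused too early
    return persistLater(queued); // writer now sees the reused content
  }

  // Correct ordering: persist first, then reuse the buffer.
  static long safe() {
    ByteBuffer buf = ByteBuffer.allocate(Long.BYTES);
    buf.putLong(0, 42L);
    long persisted = persistLater(buf);
    buf.putLong(0, 99L);
    return persisted;
  }

  public static void main(String[] args) {
    System.out.println("unsafe persisted: " + unsafe());
    System.out.println("safe persisted: " + safe());
  }
}
```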
8958
8959 The problem has been reported several times in the past and this time Wellington Ramos Chevreuil provided the full logs and deeply analyzed the logs so we can find the root cause. And Lijin Bin figured out that the problem may only happen when Durability.ASYNC\_WAL is used. Thanks to them.
8960
The problem only affects the 2.x releases. All users are highly recommended to upgrade to a release which includes this fix, especially if you use Durability.ASYNC\_WAL.
8962
8963
8964 ---
8965
8966 * [HBASE-22737](https://issues.apache.org/jira/browse/HBASE-22737) | *Major* | **Add a new admin method and shell cmd to trigger the hbck chore to run**
8967
8968 Add a new method runHbckChore in Hbck interface and a new shell cmd hbck\_chore\_run to request HBCK chore to run at master side.
8969
8970
8971 ---
8972
8973 * [HBASE-22741](https://issues.apache.org/jira/browse/HBASE-22741) | *Major* | **Show catalogjanitor consistency complaints in new 'HBCK Report' page**
8974
Adds a "CatalogJanitor hbase:meta Consistency Issues" section to the new 'HBCK Report' page added by HBASE-22709. This section is empty unless the most recent CatalogJanitor scan turned up problems. If so, it shows a table of the issues found.
8976
8977
8978 ---
8979
8980 * [HBASE-22723](https://issues.apache.org/jira/browse/HBASE-22723) | *Major* | **Have CatalogJanitor report holes and overlaps; i.e. problems it sees when doing its regular scan of hbase:meta**
8981
When CatalogJanitor runs, it now checks for holes, overlaps, empty info:regioninfo columns, and bad servers, and dumps its findings into the log. A follow-up adds the report to the new 'HBCK Report' page linked off the Master UI.
8983
8984 NOTE: All features but the badserver check made it into branch-2.1 and branch-2.0 backports.
8985
8986
8987 ---
8988
8989 * [HBASE-22714](https://issues.apache.org/jira/browse/HBASE-22714) | *Trivial* | **BuffferedMutatorParams opertationTimeOut() is misspelt**
8990
8991 The misspelled BufferedMutatorParams.opertationTimeout method has been marked as deprecated, and will be removed in 4.0.0. Please use the BufferedMutatorParams.operationTimeout method instead.
8992
8993
8994 ---
8995
8996 * [HBASE-22580](https://issues.apache.org/jira/browse/HBASE-22580) | *Major* | **Add a table attribute to make user scan snapshot feature configurable for table**
8997
If users need to scan snapshots of a table, configure the following table schema attribute so that the granted users' ACLs are added to the HFiles:
8999 alter 't1', CONFIGURATION =\> {'hbase.user.scan.snapshot.enable' =\> 'true'}
9000
9001
9002 ---
9003
9004 * [HBASE-22709](https://issues.apache.org/jira/browse/HBASE-22709) | *Major* | **Add a chore thread in master to do hbck checking and display results in 'HBCK Report' page**
9005
1. Add a new chore thread in the master to do hbck checking.
2. Add a new web UI "HBCK Report" page to display the checking results.

This feature is enabled by default, and the hbck chore runs every 60 minutes by default. To disable the chore, set "hbase.master.hbck.checker.interval" to a value less than or equal to 0.
9010
9011 Notice: the config "hbase.master.hbck.checker.interval" was renamed to "hbase.master.hbck.chore.interval" in HBASE-22737.
9012
9013
9014 ---
9015
9016 * [HBASE-21773](https://issues.apache.org/jira/browse/HBASE-21773) | *Critical* | **rowcounter utility should respond to pleas for help**
9017
This adds [-h\|-help] options to rowcounter. Passing either -h or -help prints the rowcounter usage guide, as shown below:
9019
9020 $hbase rowcounter -h
9021
9022 usage: hbase rowcounter \<tablename\> [options] [\<column1\> \<column2\>...]
9023 Options:
9024     --starttime=\<arg\>       starting time filter to start counting rows from.
9025     --endtime=\<arg\>         end time filter limit, to only count rows up to this timestamp.
9026     --range=\<arg\>           [startKey],[endKey][;[startKey],[endKey]...]]
9027     --expectedCount=\<arg\>   expected number of rows to be count.
9028 For performance, consider the following configuration properties:
9029 -Dhbase.client.scanner.caching=100
9030 -Dmapreduce.map.speculative=false
9031
9032
9033 ---
9034
9035 * [HBASE-22578](https://issues.apache.org/jira/browse/HBASE-22578) | *Major* | **HFileCleaner should not delete empty ns/table directories used for user san snapshot feature**
9036
The HFileCleaner cleans the empty directories under the archive directory. If the user scan snapshot feature is enabled, the user ACLs are set on these directories, so configure the following cleaner to keep the directories that carry user ACLs from being cleaned:
9038 hbase.master.hfilecleaner.plugins=org.apache.hadoop.hbase.security.access.SnapshotScannerHDFSAclCleaner
9039
9040
9041 ---
9042
9043 * [HBASE-22722](https://issues.apache.org/jira/browse/HBASE-22722) | *Blocker* | **Upgrade jackson databind dependencies to 2.9.9.1**
9044
9045 Upgrade jackson databind dependency to 2.9.9.1 due to CVEs
9046
9047 https://nvd.nist.gov/vuln/detail/CVE-2019-12814
9048
9049 https://nvd.nist.gov/vuln/detail/CVE-2019-12384
9050
9051
9052 ---
9053
9054 * [HBASE-22527](https://issues.apache.org/jira/browse/HBASE-22527) | *Major* | **[hbck2] Add a master web ui to show the problematic regions**
9055
Add a new master web UI to show potentially problematic open regions. There are three cases:
1. The master thinks the region is open, but no RegionServer reported it.
2. The master thinks the region is open on Server1, but a RegionServer reported it on Server2.
3. More than one RegionServer reported the region as open.
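
The three cases can be pictured as a comparison between the master's view and the set of servers that report a region as open. The following is only a minimal sketch: the class, enum, and method names are hypothetical and are not part of HBase.

```java
import java.util.Arrays;
import java.util.Collections;
import java.util.HashSet;
import java.util.Set;

/** Hypothetical sketch of the three cases; not the HBase implementation. */
class RegionStateCheckSketch {
  enum Finding {
    CONSISTENT,
    NOT_REPORTED,
    REPORTED_ON_OTHER_SERVER,
    REPORTED_BY_MULTIPLE_SERVERS
  }

  /**
   * @param masterLocation server on which the master believes the region is open
   * @param reportingServers servers that reported the region as open
   */
  static Finding classify(String masterLocation, Set<String> reportingServers) {
    if (reportingServers.isEmpty()) {
      return Finding.NOT_REPORTED; // case 1
    }
    if (reportingServers.size() > 1) {
      return Finding.REPORTED_BY_MULTIPLE_SERVERS; // case 3
    }
    String reporter = reportingServers.iterator().next();
    return reporter.equals(masterLocation)
        ? Finding.CONSISTENT
        : Finding.REPORTED_ON_OTHER_SERVER; // case 2
  }

  public static void main(String[] args) {
    Set<String> none = Collections.emptySet();
    Set<String> other = Collections.singleton("Server2");
    Set<String> both = new HashSet<>(Arrays.asList("Server1", "Server2"));
    System.out.println(classify("Server1", none));
    System.out.println(classify("Server1", other));
    System.out.println(classify("Server1", both));
  }
}
```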
9060
9061
9062 ---
9063
9064 * [HBASE-22648](https://issues.apache.org/jira/browse/HBASE-22648) | *Minor* | **Snapshot TTL**
9065
9066 Feature: Take a Snapshot With TTL for auto-cleanup
9067
9068 Attribute:
9069 1. TTL
9070      - Specify TTL in sec while creating snapshot. e.g. snapshot 'mytable', 'snapshot1234', {TTL =\> 86400}  (snapshot to be auto-cleaned after 24 hr)
9071
9072 Configs:
9073 1. Default Snapshot TTL:
9074      - FOREVER by default
9075      - User specified Default TTL(sec) with config: hbase.master.snapshot.ttl
9076
9077 2. If Snapshot cleanup is supposed to be stopped due to some snapshot restore activity, disable it with config:
9078      - hbase.master.cleaner.snapshot.disable: "true"
    As with any other hbase-site config, HMaster must be restarted for this setting to take effect.
9080
9081
9082 For more details, see the section "Take a Snapshot With TTL" in the HBase Reference Guide.
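
As a rough illustration of what a TTL-based cleanup has to decide, the sketch below checks whether a snapshot has outlived its TTL. It is not the HBase cleaner code: the class and method names are hypothetical, and treating a TTL of 0 or less as "FOREVER" is an assumption made for this example.

```java
import java.util.concurrent.TimeUnit;

/** Hypothetical sketch of a snapshot TTL expiry check; not the HBase implementation. */
class SnapshotTtlSketch {

  /**
   * @param creationTimeMs snapshot creation time in epoch milliseconds
   * @param ttlSeconds TTL in seconds; 0 or less means "keep forever" (assumption of this sketch)
   * @param nowMs current time in epoch milliseconds
   * @return true if the snapshot is past its TTL and may be cleaned up
   */
  static boolean isExpired(long creationTimeMs, long ttlSeconds, long nowMs) {
    if (ttlSeconds <= 0) {
      return false; // FOREVER: never expires
    }
    long expiresAtMs = creationTimeMs + TimeUnit.SECONDS.toMillis(ttlSeconds);
    return nowMs > expiresAtMs;
  }

  public static void main(String[] args) {
    long created = 0L;
    long oneDayMs = TimeUnit.DAYS.toMillis(1);
    System.out.println(isExpired(created, 86400, oneDayMs));     // false: exactly at the TTL
    System.out.println(isExpired(created, 86400, oneDayMs + 1)); // true: one millisecond past it
  }
}
```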
9083
9084
9085 ---
9086
9087 * [HBASE-22610](https://issues.apache.org/jira/browse/HBASE-22610) | *Trivial* | **[BucketCache] Rename "hbase.offheapcache.minblocksize"**
9088
9089 The config point "hbase.offheapcache.minblocksize" was wrong and is now deprecated. The new config point is "hbase.blockcache.minblocksize".
9090
9091
9092 ---
9093
9094 * [HBASE-22690](https://issues.apache.org/jira/browse/HBASE-22690) | *Major* | **Deprecate / Remove OfflineMetaRepair in hbase-2+**
9095
9096 OfflineMetaRepair is no longer supported in HBase-2+. Please refer to https://hbase.apache.org/book.html#HBCK2
9097
9098 This tool is deprecated in 2.x and will be removed in 3.0.
9099
9100
9101 ---
9102
9103 * [HBASE-22673](https://issues.apache.org/jira/browse/HBASE-22673) | *Major* | **Avoid to expose protobuf stuff in Hbck interface**
9104
9105 Mark the Hbck#scheduleServerCrashProcedure(List\<HBaseProtos.ServerName\> serverNames) as deprecated. Use Hbck#scheduleServerCrashProcedures(List\<ServerName\> serverNames) instead.
9106
9107
9108 ---
9109
9110 * [HBASE-22617](https://issues.apache.org/jira/browse/HBASE-22617) | *Blocker* | **Recovered WAL directories not getting cleaned up**
9111
9112 In HBASE-20734 we moved the recovered.edits onto the wal file system but when constructing the directory we missed the BASE\_NAMESPACE\_DIR('data'). So when using the default config, you will find that there are lots of new directories at the same level with the 'data' directory.
9113
In this issue, we add the BASE\_NAMESPACE\_DIR back and also try our best to clean up the wrong directories. However, we can only clean up the region-level directories, so if you want a clean filesystem layout on HDFS, you still need to manually delete the empty directories at the same level as 'data'.

The affected versions are 2.2.0, 2.1.[1-5], 1.4.[8-10], 1.3.[3-5].
9117
9118
9119 ---
9120
9121 * [HBASE-21995](https://issues.apache.org/jira/browse/HBASE-21995) | *Major* | **Add a coprocessor to set HDFS ACL for hbase granted user**
9122
Adds a coprocessor that sets HDFS ACLs so that HBase users granted READ permission can scan snapshots.
9124 To use this feature, please make sure the HDFS config is set:
9125 dfs.namenode.acls.enabled=true
9126 fs.permissions.umask-mode=027
9127
9128 and set the HBase config:
9129 hbase.coprocessor.master.classes="org.apache.hadoop.hbase.security.access.AccessController,org.apache.hadoop.hbase.security.access.SnapshotScannerHDFSAclController"
9130 hbase.user.scan.snapshot.enable=true
9131
9132
9133 ---
9134
9135 * [HBASE-22596](https://issues.apache.org/jira/browse/HBASE-22596) | *Minor* | **[Chore] Separate the execution period between CompactionChecker and PeriodicMemStoreFlusher**
9136
hbase.regionserver.compaction.check.period controls how often the compaction checker runs. If unset, it defaults to the value of hbase.server.thread.wakefrequency.

hbase.regionserver.flush.check.period controls how often the flush checker runs. If unset, it defaults to the value of hbase.server.thread.wakefrequency.
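
The fallback described above can be sketched with a plain map standing in for the configuration. This is an illustration only: the class and helper names are hypothetical, and the 10000 ms value used when hbase.server.thread.wakefrequency itself is unset is an assumption, so check the default for your HBase version.

```java
import java.util.HashMap;
import java.util.Map;

/** Hypothetical sketch of the period fallback; not the HBase implementation. */
class CheckPeriodSketch {
  static final String WAKE_FREQUENCY = "hbase.server.thread.wakefrequency";
  static final String COMPACTION_CHECK_PERIOD = "hbase.regionserver.compaction.check.period";
  static final String FLUSH_CHECK_PERIOD = "hbase.regionserver.flush.check.period";
  // Assumed value used when the wake frequency is not configured either.
  static final int ASSUMED_WAKE_FREQUENCY_MS = 10_000;

  /** Returns the configured period, or the wake frequency when the key is unset. */
  static int resolve(Map<String, String> conf, String key) {
    int wakeFrequency = parseOrDefault(conf.get(WAKE_FREQUENCY), ASSUMED_WAKE_FREQUENCY_MS);
    return parseOrDefault(conf.get(key), wakeFrequency);
  }

  private static int parseOrDefault(String value, int defaultValue) {
    return value == null ? defaultValue : Integer.parseInt(value.trim());
  }

  public static void main(String[] args) {
    Map<String, String> conf = new HashMap<>();
    conf.put(WAKE_FREQUENCY, "5000");
    conf.put(COMPACTION_CHECK_PERIOD, "1000");
    System.out.println(resolve(conf, COMPACTION_CHECK_PERIOD)); // 1000: explicitly set
    System.out.println(resolve(conf, FLUSH_CHECK_PERIOD));      // 5000: falls back to wake frequency
  }
}
```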
9140
9141
9142 ---
9143
9144 * [HBASE-22588](https://issues.apache.org/jira/browse/HBASE-22588) | *Major* | **Upgrade jaxws-ri dependency to 2.3.2**
9145
9146 <!-- markdown -->
9147
When run with JDK 11, HBase now uses a more recent version of the jaxws reference implementation (v2.3.2).
9149
9150
9151 ---
9152
9153 * [HBASE-21536](https://issues.apache.org/jira/browse/HBASE-21536) | *Trivial* | **Fix completebulkload usage instructions**
9154
9155 Added completebulkload short name for BulkLoadHFilesTool to bin/hbase.
9156
9157
9158 ---
9159
9160 * [HBASE-22500](https://issues.apache.org/jira/browse/HBASE-22500) | *Blocker* | **Modify pom and jenkins jobs for hadoop versions**
9161
Change the default hadoop-3 version to 3.1.2. Drop support for the releases that are affected by CVE-2018-8029; see this email: https://lists.apache.org/thread.html/3d6831c3893cd27b6850aea2feff7d536888286d588e703c6ffd2e82@%3Cuser.hadoop.apache.org%3E
9163
9164
9165 ---
9166
9167 * [HBASE-22459](https://issues.apache.org/jira/browse/HBASE-22459) | *Minor* | **Expose store reader reference count**
9168
9169 This change exposes the aggregate count of store reader references for a given store as 'storeRefCount' in region metrics and ClusterStatus.
9170
9171
9172 ---
9173
9174 * [HBASE-22469](https://issues.apache.org/jira/browse/HBASE-22469) | *Minor* | **replace md5 checksum in saveVersion script with sha512 for hbase version information**
9175
9176 The HBase "source checksum" now uses SHA512 instead of MD5.
9177
9178
9179 ---
9180
9181 * [HBASE-22148](https://issues.apache.org/jira/browse/HBASE-22148) | *Blocker* | **Provide an alternative to CellUtil.setTimestamp**
9182
9183 <!-- markdown -->
9184
In HBase 3.0, the `CellUtil.setTimestamp` method becomes an API with audience `LimitedPrivate(COPROC)`. With that designation the API should remain stable within a given minor release line, but may change between minor releases.
9186
9187 Previously, this method was deprecated in HBase 2.0 for removal in HBase 3.0. Deprecation messages in HBase 2.y releases have been updated to indicate the expected API audience change.
9188
9189
9190 ---
9191
9192 * [HBASE-20782](https://issues.apache.org/jira/browse/HBASE-20782) | *Minor* | **Fix duplication of TestServletFilter.access**
9193
The access method was moved to the HttpServerFunctionalTest class as a common place.
9195
9196
9197 ---
9198
9199 * [HBASE-21991](https://issues.apache.org/jira/browse/HBASE-21991) | *Major* | **Fix MetaMetrics issues - [Race condition, Faulty remove logic], few improvements**
9200
9201 The class LossyCounting was unintentionally marked Public but was never intended to be part of our public API. This oversight has been corrected and LossyCounting is now marked as Private and going forward may be subject to additional breaking changes or removal without notice. If you have taken a dependency on this class we recommend cloning it locally into your project before upgrading to this release.
9202
9203
9204 ---
9205
9206 * [HBASE-22226](https://issues.apache.org/jira/browse/HBASE-22226) | *Trivial* | **Incorrect level for headings in asciidoc**
9207
9208 Warnings for level headings are corrected in the book for the HBase Incompatibilities section.
9209
9210
9211 ---
9212
9213 * [HBASE-20970](https://issues.apache.org/jira/browse/HBASE-20970) | *Major* | **Update hadoop check versions for hadoop3 in hbase-personality**
9214
Add Hadoop 3.0.3, 3.1.1, and 3.1.2 to our Hadoop check jobs.
9216
9217
9218 ---
9219
9220 * [HBASE-21784](https://issues.apache.org/jira/browse/HBASE-21784) | *Major* | **Dump replication queue should show list of wal files ordered chronologically**
9221
9222 The DumpReplicationQueues tool will now list replication queues sorted in chronological order.
9223
9224
9225 ---
9226
9227 * [HBASE-21048](https://issues.apache.org/jira/browse/HBASE-21048) | *Major* | **Get LogLevel is not working from console in secure environment**
9228
Supports get\|set LogLevel in a secure (kerberized) environment.
9230
9231
9232 ---
9233
9234 * [HBASE-22384](https://issues.apache.org/jira/browse/HBASE-22384) | *Minor* | **Formatting issues in administration section of book**
9235
Fixes a formatting issue in the administration section of the book, where the listing indentation was a little bit off.
9237
9238
9239 ---
9240
9241 * [HBASE-22377](https://issues.apache.org/jira/browse/HBASE-22377) | *Major* | **Provide API to check the existence of a namespace which does not require ADMIN permissions**
9242
9243 This change adds the new method listNamespaces to the Admin interface, which can be used to retrieve a list of the namespaces present in the schema as an unprivileged operation. Formerly the only available method for accomplishing this was listNamespaceDescriptors, which requires GLOBAL CREATE or ADMIN permissions.
9244
9245
9246 ---
9247
9248 * [HBASE-22399](https://issues.apache.org/jira/browse/HBASE-22399) | *Major* | **Change default hadoop-two.version to 2.8.x and remove the 2.7.x hadoop checks**
9249
The default hadoop-two.version has been changed to 2.8.5, and Hadoop versions earlier than 2.8.2 are no longer supported.
9251
9252
9253 ---
9254
9255 * [HBASE-22392](https://issues.apache.org/jira/browse/HBASE-22392) | *Trivial* | **Remove extra/useless +**
9256
9257 Removed extra + in HRegion, HStore and LoadIncrementalHFiles for branch-2 and HRegion and HStore for branch-1.
9258
9259
9260 ---
9261
9262 * [HBASE-20494](https://issues.apache.org/jira/browse/HBASE-20494) | *Major* | **Upgrade com.yammer.metrics dependency**
9263
9264 Updated metrics core from 3.2.1 to 3.2.6.
9265
9266
9267 ---
9268
9269 * [HBASE-22358](https://issues.apache.org/jira/browse/HBASE-22358) | *Minor* | **Change rubocop configuration for method length**
9270
9271 The rubocop definition for the maximum method length was set to 75.
9272
9273
9274 ---
9275
9276 * [HBASE-22379](https://issues.apache.org/jira/browse/HBASE-22379) | *Minor* | **Fix Markdown for "Voting on Release Candidates" in book**
9277
9278 Fixes the formatting of the "Voting on Release Candidates" to actually show the quote and code formatting of the RAT check.
9279
9280
9281 ---
9282
9283 * [HBASE-20851](https://issues.apache.org/jira/browse/HBASE-20851) | *Minor* | **Change rubocop config for max line length of 100**
9284
The rubocop configuration in the hbase-shell module now allows a line length of 100 characters, instead of 80 as before. For everything before 2.1.5 this change introduces rubocop itself.
9286
9287
9288 ---
9289
9290 * [HBASE-22301](https://issues.apache.org/jira/browse/HBASE-22301) | *Minor* | **Consider rolling the WAL if the HDFS write pipeline is slow**
9291
9292 This change adds new conditions for rolling the WAL for when syncs on the HDFS writer pipeline are perceived to be slow.
9293
9294 As before the configuration parameter hbase.regionserver.wal.slowsync.ms sets the slow sync warning threshold.
9295
9296 If we encounter hbase.regionserver.wal.slowsync.roll.threshold number of slow syncs (default 100) within the interval defined by hbase.regionserver.wal.slowsync.roll.interval.ms (default 1 minute), we will request a WAL roll.
9297
9298 Or, if the time for any sync exceeds the threshold set by hbase.regionserver.wal.roll.on.sync.ms (default 10 seconds) we will request a WAL roll immediately.
9299
9300 Operators can monitor how often these new thresholds result in a WAL roll by looking at newly added metrics to the WAL related metric group:
9301 \* slowSyncRollRequest - How many times a roll was requested due to sync too slow on the write pipeline.
9302
9303 Additionally, as a part of this change there are also additional metrics for existing reasons for a WAL roll:
9304 \* errorRollRequest - How many times a roll was requested due to I/O or other errors.
9305 \* sizeRollRequest - How many times a roll was requested due to file size roll threshold.
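
The two roll conditions described above can be sketched as a small decision helper. This is an illustration under stated assumptions, not the HBase WAL code: the class and method names are hypothetical, a sliding window is used to count slow syncs (the real implementation may count them differently), and the 100 ms slow sync threshold in the example is an assumed value.

```java
import java.util.ArrayDeque;
import java.util.Deque;

/** Hypothetical sketch of the slow-sync roll decision; not the HBase implementation. */
class SlowSyncRollSketch {
  private final long slowSyncMs;     // stands in for hbase.regionserver.wal.slowsync.ms
  private final int rollThreshold;   // stands in for hbase.regionserver.wal.slowsync.roll.threshold
  private final long rollIntervalMs; // stands in for hbase.regionserver.wal.slowsync.roll.interval.ms
  private final long rollOnSyncMs;   // stands in for hbase.regionserver.wal.roll.on.sync.ms

  // Timestamps of recent slow syncs, oldest first.
  private final Deque<Long> slowSyncTimes = new ArrayDeque<>();

  SlowSyncRollSketch(long slowSyncMs, int rollThreshold, long rollIntervalMs, long rollOnSyncMs) {
    this.slowSyncMs = slowSyncMs;
    this.rollThreshold = rollThreshold;
    this.rollIntervalMs = rollIntervalMs;
    this.rollOnSyncMs = rollOnSyncMs;
  }

  /** Records one sync and returns true if a WAL roll should be requested. */
  boolean onSync(long nowMs, long syncDurationMs) {
    if (syncDurationMs > rollOnSyncMs) {
      slowSyncTimes.clear();
      return true; // a single very slow sync requests a roll immediately
    }
    if (syncDurationMs > slowSyncMs) {
      slowSyncTimes.addLast(nowMs);
    }
    // Forget slow syncs that are older than the configured interval.
    while (!slowSyncTimes.isEmpty() && nowMs - slowSyncTimes.peekFirst() > rollIntervalMs) {
      slowSyncTimes.removeFirst();
    }
    if (slowSyncTimes.size() >= rollThreshold) {
      slowSyncTimes.clear();
      return true; // too many slow syncs within the interval
    }
    return false;
  }

  public static void main(String[] args) {
    // Defaults quoted in the note above, except the 100 ms slow sync threshold (assumed).
    SlowSyncRollSketch policy = new SlowSyncRollSketch(100, 100, 60_000, 10_000);
    System.out.println(policy.onSync(0, 50));         // false: fast sync
    System.out.println(policy.onSync(1_000, 12_000)); // true: exceeds the roll-on-sync threshold
  }
}
```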
9306
9307
9308 ---
9309
9310 * [HBASE-21883](https://issues.apache.org/jira/browse/HBASE-21883) | *Minor* | **Enhancements to Major Compaction tool**
9311
The MajorCompactorTTL tool compacts the regions in a table whose data has expired under its TTL. This saves space on DFS and is useful for tables that hold time-series-like data. It is typically scheduled to run frequently (say, via cron) to clean up old data on an ongoing basis.
9313
9314 RSGroupMajorCompactionTTL tool is similar to MajorCompactorTTL but runs at a region server group level. If multiple tables in an rsgroup are similar to time-series data, then it runs a single command to clean them up. As more tables are added/removed from rsgroup, it's easy to have a single command to take care of all of them.
9315
9316
9317 ---
9318
9319 * [HBASE-22054](https://issues.apache.org/jira/browse/HBASE-22054) | *Minor* | **Space Quota: Compaction is not working for super user in case of NO\_WRITES\_COMPACTIONS**
9320
9321 This change allows the system and superusers to initiate compactions, even when a space quota violation policy disallows compactions from happening. The original intent behind disallowing of compactions was to prevent end-user compactions from creating undue I/O load, not disallowing \*any\* compaction in the system.
9322
9323
9324 ---
9325
9326 * [HBASE-22083](https://issues.apache.org/jira/browse/HBASE-22083) | *Minor* | **move eclipse specific configs into a profile**
9327
9328 <!-- markdown -->
9329 Maven project integration for Eclipse has been isolated into a maven profile to ensure it only is active when in an Eclipse project.
9330
9331 Things should continue to behave the same for Eclipse users. If something should go wrong folks should manually activate the `eclipse-specific` profile.
9332
9333
9334 ---
9335
9336 * [HBASE-22307](https://issues.apache.org/jira/browse/HBASE-22307) | *Major* | **Deprecated Preemptive Fail Fast**
9337
Deprecated the Preemptive Fail Fast related constants in HConstants. Support for this feature will be removed in 3.0.0, so using these constants will have no effect in 3.0.0+ releases. The constants themselves will be kept until 4.0.0.
9339
9340 Users can use 'hbase.client.perserver.requests.threshold' to control the number of concurrent requests to the same region server. Please see the release note of HBASE-16388 for more details.
9341
9342
9343 ---
9344
9345 * [HBASE-22292](https://issues.apache.org/jira/browse/HBASE-22292) | *Blocker* | **PreemptiveFastFailInterceptor clean repeatedFailuresMap issue**
9346
9347 Adds new configuration hbase.client.failure.map.cleanup.interval which defaults to ten minutes.
9348
9349
9350 ---
9351
9352 * [HBASE-19222](https://issues.apache.org/jira/browse/HBASE-19222) | *Major* | **update jruby to 9.1.17.0**
9353
9354 <!-- markdown -->
9355
9356 The default version of JRuby shipped with HBase has been updated to the JRuby 9.1.17.0 release.
9357
9358 For details on changes see [the release notes for JRuby 9.1.17.0](https://www.jruby.org/2018/04/23/jruby-9-1-17-0)
9359
9360
9361 ---
9362
9363 * [HBASE-22279](https://issues.apache.org/jira/browse/HBASE-22279) | *Major* | **Add a getRegionLocator method in Table/AsyncTable interface**
9364
9365 Add below method in Table interface:
9366
9367 RegionLocator getRegionLocator() throws IOException;
9368
9369 Add below methods in AsyncTable interface:
9370
9371 AsyncTableRegionLocator getRegionLocator();
9372 CompletableFuture\<TableDescriptor\> getDescriptor();
9373
9374
9375 ---
9376
9377 * [HBASE-15560](https://issues.apache.org/jira/browse/HBASE-15560) | *Major* | **TinyLFU-based BlockCache**
9378
9379 LruBlockCache uses the Segmented LRU (SLRU) policy to capture frequency and recency of the working set. It achieves concurrency by using an O(n) background thread to prioritize the entries and evict. Accessing an entry is O(1) by a hash table lookup, recording its logical access time, and setting a frequency flag. A write is performed in O(1) time by updating the hash table and triggering an async eviction thread. This provides ideal concurrency and minimizes the latencies by penalizing the thread instead of the caller. However the policy does not age the frequencies and may not be resilient to various workload patterns.
9380
This change introduces a new L1 policy, TinyLfuBlockCache, which records the frequency in a counting sketch, ages periodically by halving the counters, and orders entries by SLRU. An entry is discarded by comparing the frequency of the new arrival to the SLRU's victim, and keeping the one with the highest frequency. This allows the operations to be performed in O(1) time and, through the use of a compact sketch, a much larger history is retained beyond the current working set. In a variety of real world traces the policy had near optimal hit rates.
9382
9383 New configuration variable hfile.block.cache.policy sets the eviction policy for the L1 block cache. The default is "LRU" (LruBlockCache). Set to "TinyLFU" to use TinyLfuBlockCache instead.
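
To make the admission idea concrete, here is a heavily simplified frequency sketch with periodic halving and a candidate-versus-victim comparison. It is a teaching sketch, not the TinyLfuBlockCache code: the class name, the hash seeds, the counter cap of 15, and the tie-breaking rule (the victim stays on a tie) are all assumptions made for this example.

```java
/** Hypothetical, simplified TinyLFU-style admission filter; not the HBase implementation. */
class TinyLfuSketch {
  // Odd multipliers so that each row hashes keys differently (arbitrary choices).
  private static final int[] SEEDS = {0x9E3779B1, 0x85EBCA6B, 0xC2B2AE35, 0x27D4EB2F};
  private static final int MAX_COUNT = 15; // assumed cap, as if counters were 4 bits wide

  private final int[][] counters;
  private final int mask;
  private final int sampleSize;
  private int additions;

  /**
   * @param width counters per row; must be a power of two
   * @param sampleSize number of recorded accesses after which all counters are halved
   */
  TinyLfuSketch(int width, int sampleSize) {
    if (width <= 0 || Integer.bitCount(width) != 1) {
      throw new IllegalArgumentException("width must be a power of two");
    }
    this.counters = new int[SEEDS.length][width];
    this.mask = width - 1;
    this.sampleSize = sampleSize;
  }

  private int index(Object key, int row) {
    int h = key.hashCode() * SEEDS[row];
    h ^= h >>> 16;
    return h & mask;
  }

  /** Records one access to the key, aging the sketch when the sample is full. */
  void record(Object key) {
    for (int row = 0; row < SEEDS.length; row++) {
      int i = index(key, row);
      if (counters[row][i] < MAX_COUNT) {
        counters[row][i]++;
      }
    }
    if (++additions >= sampleSize) {
      age();
    }
  }

  /** Halves every counter so that old popularity fades over time. */
  private void age() {
    for (int[] row : counters) {
      for (int i = 0; i < row.length; i++) {
        row[i] >>>= 1;
      }
    }
    additions = 0;
  }

  /** Estimated access frequency: the minimum counter across all rows. */
  int frequency(Object key) {
    int min = Integer.MAX_VALUE;
    for (int row = 0; row < SEEDS.length; row++) {
      min = Math.min(min, counters[row][index(key, row)]);
    }
    return min;
  }

  /** Admits the new arrival only if it is estimated to be more frequent than the victim. */
  boolean admit(Object candidate, Object victim) {
    return frequency(candidate) > frequency(victim);
  }

  public static void main(String[] args) {
    TinyLfuSketch sketch = new TinyLfuSketch(64, 1000);
    for (int i = 0; i < 5; i++) {
      sketch.record(1); // block 1 is accessed often
    }
    sketch.record(2);   // block 2 is accessed once
    System.out.println(sketch.admit(1, 2)); // true: the frequent block wins
    System.out.println(sketch.admit(2, 1)); // false: the rare block is rejected
  }
}
```

The sketch only covers the frequency estimate and the admission decision; the SLRU ordering that picks the victim is left out.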
9384
9385
9386 ---
9387
9388 * [HBASE-22178](https://issues.apache.org/jira/browse/HBASE-22178) | *Major* | **Introduce a createTableAsync with TableDescriptor method in Admin**
9389
9390 Introduced
9391
9392 Future\<Void\> createTableAsync(TableDescriptor);
9393
9394
9395 ---
9396
9397 * [HBASE-22108](https://issues.apache.org/jira/browse/HBASE-22108) | *Major* | **Avoid passing null in Admin methods**
9398
9399 Introduced these methods:
9400 void move(byte[]);
9401 void move(byte[], ServerName);
9402 Future\<Void\> splitRegionAsync(byte[]);
9403
9404 These methods are deprecated:
9405 void move(byte[], byte[])
9406
9407
9408 ---
9409
9410 * [HBASE-22152](https://issues.apache.org/jira/browse/HBASE-22152) | *Major* | **Create a jenkins file for yetus to processing GitHub PR**
9411
Adds a new Jenkinsfile for running pre-commit checks on GitHub PRs.
9413
9414
9415 ---
9416
9417 * [HBASE-22007](https://issues.apache.org/jira/browse/HBASE-22007) | *Major* | **Add restoreSnapshot and cloneSnapshot with acl methods in AsyncAdmin**
9418
9419 Add cloneSnapshot/restoreSnapshot with acl methods in AsyncAdmin.
9420
9421
9422 ---
9423
9424 * [HBASE-22123](https://issues.apache.org/jira/browse/HBASE-22123) | *Minor* | **REST gateway reports Insufficient permissions exceptions as 404 Not Found**
9425
When permissions are insufficient, you now get:
9427
9428 HTTP/1.1 403 Forbidden
9429
9430 on the HTTP side, and in the message
9431
9432 Forbidden
org.apache.hadoop.hbase.security.AccessDeniedException: org.apache.hadoop.hbase.security.AccessDeniedException: Insufficient permissions for user 'myuser',action: get, tableName:mytable, family:cf.
9434 at org.apache.ranger.authorization.hbase.RangerAuthorizationCoprocessor.authorizeAccess(RangerAuthorizationCoprocessor.java:547)
9435 and the rest of the ADE stack
9436
9437
9438 ---
9439
9440 * [HBASE-22100](https://issues.apache.org/jira/browse/HBASE-22100) | *Minor* | **False positive for error prone warnings in pre commit job**
9441
We now sort the javac WARNING/ERROR lines before generating the diff in pre-commit, so that we get stable output for error prone. The downside is that the output is sorted lexicographically, so the line numbers are also sorted lexicographically, which looks a bit strange to a human reader.
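
A small illustration of why lexicographically sorted line numbers look odd: strings are compared character by character, so line 10 sorts before line 9. The file name and class name below are made up for the example.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.List;

/** Hypothetical illustration of lexicographic ordering of warning locations. */
class LexicographicSortSketch {

  /** Returns a copy of the lines sorted in natural String (lexicographic) order. */
  static List<String> sortWarnings(List<String> lines) {
    List<String> sorted = new ArrayList<>(lines);
    Collections.sort(sorted);
    return sorted;
  }

  public static void main(String[] args) {
    List<String> warnings = Arrays.asList("Foo.java:9", "Foo.java:10", "Foo.java:100");
    // prints [Foo.java:10, Foo.java:100, Foo.java:9]
    System.out.println(sortWarnings(warnings));
  }
}
```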
9443
9444
9445 ---
9446
9447 * [HBASE-22057](https://issues.apache.org/jira/browse/HBASE-22057) | *Major* | **Impose upper-bound on size of ZK ops sent in a single multi()**
9448
9449 Exposes a new configuration property "zookeeper.multi.max.size" which dictates the maximum size of deletes that HBase will make to ZooKeeper in a single RPC. This property defaults to 1MB, which should fall beneath the default ZooKeeper limit of 2MB, controlled by "jute.maxbuffer".
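
The idea of bounding the size of each multi() call can be sketched as a simple greedy partitioning of the pending operations. This is not the HBase code: the class and method names are hypothetical, operations are represented by their znode paths only, and the size estimate (the UTF-8 length of the path) is an assumption made for this example.

```java
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

/** Hypothetical sketch of size-bounded batching; not the HBase implementation. */
class ZkMultiBatchSketch {

  /** Rough per-operation size: here simply the UTF-8 length of the znode path (assumption). */
  static int estimateSize(String path) {
    return path.getBytes(StandardCharsets.UTF_8).length;
  }

  /**
   * Splits the paths into consecutive batches whose estimated size stays within maxBytes.
   * A single path larger than maxBytes still gets a batch of its own.
   */
  static List<List<String>> partition(List<String> paths, int maxBytes) {
    List<List<String>> batches = new ArrayList<>();
    List<String> current = new ArrayList<>();
    int currentBytes = 0;
    for (String path : paths) {
      int size = estimateSize(path);
      if (!current.isEmpty() && currentBytes + size > maxBytes) {
        batches.add(current); // close the batch before it would exceed the limit
        current = new ArrayList<>();
        currentBytes = 0;
      }
      current.add(path);
      currentBytes += size;
    }
    if (!current.isEmpty()) {
      batches.add(current);
    }
    return batches;
  }

  public static void main(String[] args) {
    List<String> paths = Arrays.asList("/a", "/bb", "/ccc");
    // With a 5-byte limit: "/a" (2) and "/bb" (3) fit together, "/ccc" (4) starts a new batch.
    System.out.println(partition(paths, 5)); // prints [[/a, /bb], [/ccc]]
  }
}
```

Each resulting batch would then be sent as its own multi() request.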
9450
9451
9452 ---
9453
9454 * [HBASE-22052](https://issues.apache.org/jira/browse/HBASE-22052) | *Major* | **pom cleaning; filter out jersey-core in hadoop2 to match hadoop3 and remove redunant version specifications**
9455
9456 <!-- markdown -->
9457 Fixed awkward dependency issue that prevented site building.
9458
9459 #### note specific to HBase 2.1.4
9460 HBase 2.1.4 shipped with an early version of this fix that incorrectly altered the libraries included in our binary assembly for using Apache Hadoop 2.7 (the current build default Hadoop version for 2.1.z). For folks running out of the box against a Hadoop 2.7 cluster (or folks who skip the installation step of [replacing the bundled Hadoop libraries](http://hbase.apache.org/book.html#hadoop)) this will result in a failure at Region Server startup due to a missing class definition. e.g.:
9461 ```
9462 2019-03-27 09:02:05,779 ERROR [main] regionserver.HRegionServer: Failed construction RegionServer
9463 java.lang.NoClassDefFoundError: org/apache/htrace/SamplerBuilder
9464         at org.apache.hadoop.hdfs.DFSClient.<init>(DFSClient.java:644)
9465         at org.apache.hadoop.hdfs.DFSClient.<init>(DFSClient.java:628)
9466         at org.apache.hadoop.hdfs.DistributedFileSystem.initialize(DistributedFileSystem.java:149)
9467         at org.apache.hadoop.fs.FileSystem.createFileSystem(FileSystem.java:2667)
9468         at org.apache.hadoop.fs.FileSystem.access$200(FileSystem.java:93)
9469         at org.apache.hadoop.fs.FileSystem$Cache.getInternal(FileSystem.java:2701)
9470         at org.apache.hadoop.fs.FileSystem$Cache.get(FileSystem.java:2683)
9471         at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:372)
9472         at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:171)
9473         at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:356)
9474         at org.apache.hadoop.fs.Path.getFileSystem(Path.java:295)
9475         at org.apache.hadoop.hbase.util.CommonFSUtils.getRootDir(CommonFSUtils.java:362)
9476         at org.apache.hadoop.hbase.util.CommonFSUtils.isValidWALRootDir(CommonFSUtils.java:411)
9477         at org.apache.hadoop.hbase.util.CommonFSUtils.getWALRootDir(CommonFSUtils.java:387)
9478         at org.apache.hadoop.hbase.regionserver.HRegionServer.initializeFileSystem(HRegionServer.java:704)
9479         at org.apache.hadoop.hbase.regionserver.HRegionServer.<init>(HRegionServer.java:613)
9480         at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
9481         at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:62)
9482         at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)
9483         at java.lang.reflect.Constructor.newInstance(Constructor.java:423)
9484         at org.apache.hadoop.hbase.regionserver.HRegionServer.constructRegionServer(HRegionServer.java:3029)
9485         at org.apache.hadoop.hbase.regionserver.HRegionServerCommandLine.start(HRegionServerCommandLine.java:63)
9486         at org.apache.hadoop.hbase.regionserver.HRegionServerCommandLine.run(HRegionServerCommandLine.java:87)
9487         at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
9488         at org.apache.hadoop.hbase.util.ServerCommandLine.doMain(ServerCommandLine.java:149)
9489         at org.apache.hadoop.hbase.regionserver.HRegionServer.main(HRegionServer.java:3047)
9490 Caused by: java.lang.ClassNotFoundException: org.apache.htrace.SamplerBuilder
9491         at java.net.URLClassLoader.findClass(URLClassLoader.java:381)
9492         at java.lang.ClassLoader.loadClass(ClassLoader.java:424)
9493         at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:349)
9494         at java.lang.ClassLoader.loadClass(ClassLoader.java:357)
9495         ... 26 more
9496
9497 ```
9498
9499 Workaround via any _one_ of the following:
* If you are running against a Hadoop cluster that is 2.8+, ensure you replace the Hadoop libraries in the default binary assembly with those for your version.
9501 * If you are running against a Hadoop cluster that is 2.8+, build the binary assembly from the source release while specifying your Hadoop version.
9502 * If you are running against a Hadoop cluster that is a supported 2.7 release, ensure the `hadoop` executable is in the `PATH` seen at Region Server startup and that you are not using the `HBASE_DISABLE_HADOOP_CLASSPATH_LOOKUP` bypass.
* For any supported Hadoop version, manually make the Apache HTrace artifact `htrace-core-3.1.0-incubating.jar` available to all Region Servers via the `HBASE_CLASSPATH` environment variable.
9504 * For any supported Hadoop version, manually make the Apache HTrace artifact `htrace-core-3.1.0-incubating.jar` available to all Region Servers by copying it into the directory `${HBASE_HOME}/lib/client-facing-thirdparty/`.
9505
9506
9507 ---
9508
9509 * [HBASE-22065](https://issues.apache.org/jira/browse/HBASE-22065) | *Major* | **Add listTableDescriptors(List\<TableName\>) method in AsyncAdmin**
9510
9511 Add a listTableDescriptors(List\<TableName\>) method in the AsyncAdmin interface, to align with the Admin interface.
9512
9513
9514 ---
9515
9516 * [HBASE-22063](https://issues.apache.org/jira/browse/HBASE-22063) | *Major* | **Deprecated Admin.deleteSnapshot(byte[])**
9517
Deprecate Admin.deleteSnapshot(byte[]). Please use the String version instead.
9519
9520
9521 ---
9522
9523 * [HBASE-22040](https://issues.apache.org/jira/browse/HBASE-22040) | *Major* | **Add mergeRegionsAsync with a List of region names method in AsyncAdmin**
9524
9525 Add a mergeRegionsAsync(byte[][], boolean) method in the AsyncAdmin interface.
9526
Instead of using assert, the client side now throws IllegalArgumentException when you try to merge fewer than 2 regions. Likewise, instead of using assert, the master side now throws DoNotRetryIOException if you try to merge more than 2 regions, since we only support merging two regions at once for now.
9528
9529
9530 ---
9531
9532 * [HBASE-22039](https://issues.apache.org/jira/browse/HBASE-22039) | *Major* | **Should add the synchronous parameter for the XXXSwitch method in AsyncAdmin**
9533
Add a drainXXX parameter to the balancerSwitch/splitSwitch/mergeSwitch methods in the AsyncAdmin interface, which has the same meaning as the synchronous parameter for these methods in the Admin interface.
9535
9536
9537 ---
9538
9539 * [HBASE-22044](https://issues.apache.org/jira/browse/HBASE-22044) | *Major* | **ByteBufferUtils should not be IA.Public API**
9540
9541 <!-- markdown -->
9542
9543 As of HBase 3.0, the ByteBufferUtils class is now marked as a Private API for internal project use only. Downstream users are advised that it no longer has any compatibility promises across releases.
9544
In earlier HBase release lines, the class is now marked as deprecated to call attention to this planned transition.
9546
9547
9548 ---
9549
9550 * [HBASE-21810](https://issues.apache.org/jira/browse/HBASE-21810) | *Major* | **bulkload  support set hfile compression on client**
9551
Bulk load (HFileOutputFormat2) now supports configuring the compression on the client side. Set the job configuration "hbase.mapreduce.hfileoutputformat.compression" to override the auto-detection of the target table's compression.
9553
9554
9555 ---
9556
9557 * [HBASE-22001](https://issues.apache.org/jira/browse/HBASE-22001) | *Major* | **Polish the Admin interface**
9558
Add a cloneSnapshotAsync method with a restoreAcl parameter.
Deprecate the restoreSnapshotAsync method, as it ignores the failsafe configuration.
Make the snapshotAsync method return a Future\<Void\>.
Deprecate the snapshot-related methods that take a 'byte[]' as the snapshot name.
Use default methods to reduce the code base for implementation classes.
9564
9565
9566 ---
9567
9568 * [HBASE-22000](https://issues.apache.org/jira/browse/HBASE-22000) | *Major* | **Deprecated isTableAvailable with splitKeys**
9569
9570 Deprecated AsyncTable.isTableAvailable(TableName, byte[][]).
9571
9572
9573 ---
9574
9575 * [HBASE-21871](https://issues.apache.org/jira/browse/HBASE-21871) | *Major* | **Support to specify a peer table name in VerifyReplication tool**
9576
After HBASE-21871, a peer table name can be specified with --peerTableName in the VerifyReplication tool, for example:

hbase org.apache.hadoop.hbase.mapreduce.replication.VerifyReplication --peerTableName=peerTable 5 TestTable

In addition, any two tables in any remote clusters can be compared by specifying both the peerId argument and --peerTableName.

For example, passing a peer quorum address in the peerId position:

hbase org.apache.hadoop.hbase.mapreduce.replication.VerifyReplication --peerTableName=peerTable zk1,zk2,zk3:2181/hbase TestTable
9584
9585
9586 ---
9587
9588 * [HBASE-15728](https://issues.apache.org/jira/browse/HBASE-15728) | *Major* | **Add remaining per-table region / store / flush / compaction related metrics**
9589
Adds the following flush, split, and compaction metrics:

Split related metrics:

* splitRequest (MutableFastCounter)
* splitSuccess (MutableFastCounter)
* splitTimeHisto (MetricHistogram)

Flush related metrics:

* flushTimeHisto (MetricHistogram)
* flushMemstoreSizeHisto (MetricHistogram)
* flushOutputSizeHisto (MetricHistogram)
* flushedMemstoreBytes (MutableFastCounter)
* flushedOutputBytes (MutableFastCounter)

Compaction related metrics:

* compactionTimeHisto (MetricHistogram)
* compactionInputFileCountHisto (MetricHistogram)
* compactionInputSizeHisto (MetricHistogram)
* compactionOutputFileCountHisto (MetricHistogram)
* compactionOutputSizeHisto (MetricHistogram)
* compactedInputBytes (MutableFastCounter)
* compactedOutputBytes (MutableFastCounter)

Major compaction related metrics:

* majorCompactionTimeHisto (MetricHistogram)
* majorCompactionInputFileCountHisto (MetricHistogram)
* majorCompactionInputSizeHisto (MetricHistogram)
* majorCompactionOutputFileCountHisto (MetricHistogram)
* majorCompactionOutputSizeHisto (MetricHistogram)
* majorCompactedInputBytes (MutableFastCounter)
* majorCompactedOutputBytes (MutableFastCounter)
9620
9621
9622 ---
9623
9624 * [HBASE-21481](https://issues.apache.org/jira/browse/HBASE-21481) | *Major* | **[acl] Superuser's permissions should not be granted or revoked by any non-su global admin**
9625
HBASE-21481 tightens access control by strengthening the protection of superusers' privileges: a global admin who is not a superuser can no longer grant or revoke a superuser's permissions.
9627
9628
9629 ---
9630
9631 * [HBASE-21082](https://issues.apache.org/jira/browse/HBASE-21082) | *Critical* | **Reimplement assign/unassign related procedure metrics**
9632
There are now four types of RIT procedure metrics: assign, unassign, move, and reopen. The meaning of assign/unassign has changed: moving a region no longer increments the unassign metric followed by the assign metric.
Two new procedure metrics, open and close, are also introduced to track the open/close region calls sent to region servers. Open/close may be sent multiple times to finish a single RIT, since the calls may be retried.
9635
9636
9637 ---
9638
9639 * [HBASE-20724](https://issues.apache.org/jira/browse/HBASE-20724) | *Critical* | **Sometimes some compacted storefiles are still opened after region failover**
9640
Problem: This is an old problem dating back to HBASE-2231. The compaction event marker was written only to the WAL. After a flush, however, the WAL may be archived, which means a still-useful compaction event marker may be deleted too. As a result, the compacted store files cannot be archived when the region opens and replays the WAL.

Solution: After this change, the compaction event tracker is written to the HFile. When a region opens and loads its store files, it reads the compaction event tracker from the HFile and archives any compacted store files that still exist.
9644
9645
9646 ---
9647
9648 * [HBASE-21820](https://issues.apache.org/jira/browse/HBASE-21820) | *Major* | **Implement CLUSTER quota scope**
9649
HBase has two quota scopes: MACHINE and CLUSTER. Before this patch, set quota operations did not expose a scope option in the client API and always used MACHINE, so CLUSTER scope could not be set or used.
Before this patch, the shell command was:
set\_quota TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec'

This issue implements CLUSTER scope in a simple way: for user, namespace, and user over namespace quotas, [ClusterLimit / RSNum] is used as the machine limit. For table and user over table quotas, [ClusterLimit / TotalTableRegionNum \* MachineTableRegionNum] is used as the machine limit.
After this patch, users can set a CLUSTER scope quota, but MACHINE remains the default if no scope is given.
The shell commands are now:
set\_quota TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec'
set\_quota TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec', SCOPE =\> MACHINE
set\_quota TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec', SCOPE =\> CLUSTER
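
As a rough worked example of the two formulas above, the following self-contained Java sketch computes the per-machine limit for a hypothetical cluster. The class and method names are illustrative only and are not HBase APIs. The use of floating-point division is also an assumption; the actual implementation may round differently.

```java
// Illustrative arithmetic only: names here are hypothetical, not HBase APIs.
final class ClusterQuotaSketch {

  // User, namespace, and user over namespace quotas: ClusterLimit / RSNum.
  static double machineLimit(double clusterLimit, int regionServerCount) {
    if (regionServerCount <= 0) {
      throw new IllegalArgumentException("regionServerCount must be positive");
    }
    return clusterLimit / regionServerCount;
  }

  // Table and user over table quotas:
  // ClusterLimit / TotalTableRegionNum * MachineTableRegionNum.
  static double machineTableLimit(double clusterLimit, int totalTableRegions,
      int tableRegionsOnThisMachine) {
    if (totalTableRegions <= 0) {
      throw new IllegalArgumentException("totalTableRegions must be positive");
    }
    return clusterLimit / totalTableRegions * tableRegionsOnThisMachine;
  }

  public static void main(String[] args) {
    // A 100 req/sec CLUSTER limit spread over 4 region servers.
    System.out.println(machineLimit(100, 4)); // prints 25.0
    // The same limit for a table with 10 regions, 3 of them on this server.
    System.out.println(machineTableLimit(100, 10, 3)); // prints 30.0
  }
}
```

Under these assumptions, a region server hosting more of a table's regions gets a proportionally larger share of that table's CLUSTER limit.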
9660
9661
9662 ---
9663
9664 * [HBASE-21057](https://issues.apache.org/jira/browse/HBASE-21057) | *Minor* | **upgrade to latest spotbugs**
9665
9666 Change spotbugs version to 3.1.11.
9667
9668
9669 ---
9670
9671 * [HBASE-21505](https://issues.apache.org/jira/browse/HBASE-21505) | *Major* | **Several inconsistencies on information reported for Replication Sources by hbase shell status 'replication' command.**
9672
This modifies the "status 'replication'" output, fixing inconsistencies in the reported times and ages of the last shipped edits, as well as the wrong calculation of replication lags.

It also introduces additional info for each recovery queue, which this command did not previously report.

The new output of the "status 'replication'" command is explained in detail below:
9678 a) Source started, target stopped, no edits arrived on source yet:
9679 ...
9680  SOURCE: PeerID=1
9681          Normal Queue: 1
9682            No Ops shipped since last restart, SizeOfLogQueue=1, No edits for this source since it started, Replication Lag=0
9683 ...
9684 b) Source started, target stopped, add edit on source:
9685 ...
9686 Normal Queue: 1
9687            No Ops shipped since last restart, SizeOfLogQueue=1, TimeStampOfLastArrivedInSource=Wed Nov 21 07:21:00 GMT 2018, Replication Lag=2459
9688 ...
9689 c) Source started, target stopped, edit added on source, restart source:
9690 ...
9691 SOURCE: PeerID=1
9692          Normal Queue: 1
9693            No Ops shipped since last restart, SizeOfLogQueue=1, No edits for this source since it started, Replication Lag=0
9694          Recovered Queue: 1-hbase01.home,16020,1542784524057
9695            No Ops shipped since last restart, SizeOfLogQueue=1, TimeStampOfLastArrivedInSource=Wed Nov 21 07:23:00 GMT 2018, Replication Lag=201495
9696 ...
9697 d) Source started, target stopped, add edit on source, restart source, add another edit on source:
9698 ...
9699 SOURCE: PeerID=1
9700          Normal Queue: 1
9701            No Ops shipped since last restart, SizeOfLogQueue=1, TimeStampOfLastArrivedInSource=Wed Nov 21 07:02:28 GMT 2018, Replication Lag=6349
9702          Recovered Queue: 1-hbase01.home,16020,1542782758742
9703            No Ops shipped since last restart, SizeOfLogQueue=0, TimeStampOfLastArrivedInSource=Wed Nov 21 06:53:05 GMT 2018, Replication Lag=569394
9704 ...
9705 e) Source started, target stopped, add edit on source, restart source, add another edit on source, start target:
9706 ...
9707        SOURCE: PeerID=1
9708          Normal Queue: 1
9709            AgeOfLastShippedOp=30000, TimeStampOfLastShippedOp=Wed Nov 21 07:07:58 GMT 2018, SizeOfLogQueue=1, TimeStampOfLastArrivedInSource=Wed Nov 21 07:02:28 GMT 2018, Replication Lag=0
9710 ...
9711 f) Source started, target stopped, add edit on source, restart source, restart target:
9712 ...
9713 SOURCE: PeerID=1
9714          Normal Queue: 1
9715            No Ops shipped since last restart, SizeOfLogQueue=1, No edits for this source since it started, Replication Lag=0
9716 ...
9717
9718
9719 ---
9720
9721 * [HBASE-21922](https://issues.apache.org/jira/browse/HBASE-21922) | *Major* | **BloomContext#sanityCheck may failed when use ROWPREFIX\_DELIMITED bloom filter**
9722
Removes the ROWPREFIX\_DELIMITED bloom filter type. It may be added back once a better solution is found.
9724
9725
9726 ---
9727
9728 * [HBASE-21783](https://issues.apache.org/jira/browse/HBASE-21783) | *Major* | **Support exceed user/table/ns throttle quota if region server has available quota**
9729
Adds support for enabling or disabling exceed throttle quota. With exceed throttle quota enabled, a user can consume more than their user/namespace/table quota if the region server has additional quota available because other users are not consuming theirs at the same time.
Use the following shell commands to enable/disable exceed throttle quota:
enable\_exceed\_throttle\_quota
disable\_exceed\_throttle\_quota
There are two restrictions when enabling exceed throttle quota:
1. At least one read and one write region server throttle quota must be set;
2. All region server throttle quotas must use the seconds time unit. Once earlier requests exceed their own quota and consume region server quota, a quota in another time unit may take a long time to refill, which may affect later requests.
9736
9737
9738 ---
9739
9740 * [HBASE-20587](https://issues.apache.org/jira/browse/HBASE-20587) | *Major* | **Replace Jackson with shaded thirdparty gson**
9741
Removes the jackson dependencies from most hbase modules except hbase-rest and uses the shaded gson instead. The JSON output will differ slightly, since jackson can use getters/setters while gson always uses the fields.
9743
9744
9745 ---
9746
9747 * [HBASE-21928](https://issues.apache.org/jira/browse/HBASE-21928) | *Major* | **Deprecated HConstants.META\_QOS**
9748
Mark HConstants.META\_QOS as deprecated. It is the highest priority and is for internal use only. Do not try to set a priority greater than or equal to this value; doing so is harmless but has no effect.
9750
9751
9752 ---
9753
9754 * [HBASE-17942](https://issues.apache.org/jira/browse/HBASE-17942) | *Major* | **Disable region splits and merges per table**
9755
9756 This patch adds the ability to disable split and/or merge for a table (By default, split and merge are enabled for a table).
9757
9758
9759 ---
9760
9761 * [HBASE-21636](https://issues.apache.org/jira/browse/HBASE-21636) | *Major* | **Enhance the shell scan command to support missing scanner specifications like ReadType, IsolationLevel etc.**
9762
Allows the shell to set Scan options that were previously not exposed. See the additions in the scan help by typing the following in the hbase shell:
9764
9765 hbase\> help 'scan'
9766
9767
9768 ---
9769
9770 * [HBASE-21201](https://issues.apache.org/jira/browse/HBASE-21201) | *Major* | **Support to run VerifyReplication MR tool without peerid**
9771
The VerifyReplication tool now accepts a peerQuorumAddress instead of a peerId, so a peerId no longer needs to be set up to use this tool.
9773
9774 For example:
9775 hbase org.apache.hadoop.hbase.mapreduce.replication.VerifyReplication zk1,zk2,zk3:2181/hbase testTable
9776
9777
9778 ---
9779
9780 * [HBASE-21838](https://issues.apache.org/jira/browse/HBASE-21838) | *Major* | **Create a special ReplicationEndpoint just for verifying the WAL entries are fine**
9781
Introduce a VerifyWALEntriesReplicationEndpoint which replicates nothing and only verifies that all the cells are valid.
It can be used to catch bugs in WAL writing, since most of the time the WALs are never read again after being written unless a region server crashes.
9784
9785
9786 ---
9787
9788 * [HBASE-21764](https://issues.apache.org/jira/browse/HBASE-21764) | *Major* | **Size of in-memory compaction thread pool should be configurable**
9789
Introduces a new config key: hbase.regionserver.inmemory.compaction.pool.size, with a default value of 10. Use it to set the size of the in-memory compaction thread pool. Note that all memstores in a region server share the same pool, so if a region server hosts many regions, set this value higher to compact faster and get better read performance.
9791
9792
9793 ---
9794
9795 * [HBASE-21684](https://issues.apache.org/jira/browse/HBASE-21684) | *Major* | **Throw DNRIOE when connection or rpc client is closed**
9796
9797 Make StoppedRpcClientException extend DoNotRetryIOException.
9798
9799
9800 ---
9801
9802 * [HBASE-21739](https://issues.apache.org/jira/browse/HBASE-21739) | *Major* | **Move grant/revoke from regionserver to master**
9803
As a first step toward implementing user permission control in Procedure V2, the grant and revoke methods are moved from AccessController to the master.
AccessController#grant and AccessController#revoke are marked as deprecated; please use Admin#grant and Admin#revoke instead.
9806
9807
9808 ---
9809
9810 * [HBASE-21791](https://issues.apache.org/jira/browse/HBASE-21791) | *Blocker* | **Upgrade thrift dependency to 0.12.0**
9811
9812 IMPORTANT: Due to security issues, all users who use hbase thrift should avoid using releases which do not have this fix.
9813
The affected releases are:
2.1.x: 2.1.2 and below
2.0.x: 2.0.4 and below
1.x: 1.4.x and below

If you are using one of the affected releases above, please consider upgrading to a newer release ASAP.
9820
9821
9822 ---
9823
9824 * [HBASE-20894](https://issues.apache.org/jira/browse/HBASE-20894) | *Major* | **Move BucketCache from java serialization to protobuf**
9825
9826 For users who have configured hbase.bucketcache.ioengine with either the file:, files:, or mmap: prefix, and configured it to be persistent via the hbase.bucketcache.persistent.path property, the serialization format of the bucket cache has changed between versions. The old state will not be read during startup, and there is currently no migration path. The impact is expected to be minimal, however, since the cache will rebuild over time as access patterns dictate.
9827
9828
9829
9830
9831 # HBASE  2.2.0 Release Notes
9832
9833 These release notes cover new developer and user-facing incompatibilities, important issues, features, and major improvements.
9834
9835
9836 ---
9837
9838 * [HBASE-21970](https://issues.apache.org/jira/browse/HBASE-21970) | *Major* | **Document that how to upgrade from 2.0 or 2.1 to 2.2+**
9839
9840 See the document http://hbase.apache.org/book.html#upgrade2.2 about how to upgrade from 2.0 or 2.1 to 2.2+.
9841
HBase 2.2+ uses a new Procedure form for assigning/unassigning/moving Regions. It does not process the Unassign/Assign Procedure types of HBase 2.1 and 2.0. The upgrade therefore requires first draining the Master Procedure Store of old-style Procedures before starting the new 2.2 Master. Before you kill the old version (2.0 or 2.1) Master, make sure that there is no region in transition. Once the new version (2.2+) Master is up, you can do a rolling upgrade of the RegionServers one by one.

There is also a safer way if you are running a 2.1.1+ or 2.0.3+ cluster. It takes four steps to upgrade the Master.

1. Shut down both the active and standby Masters (your cluster will continue to serve reads and writes without interruption).
2. Set the property hbase.procedure.upgrade-to-2-2 to true in hbase-site.xml for the Master, and start only one Master, still using the 2.1.1+ (or 2.0.3+) version.
3. Wait until the Master quits. Confirm that there is a 'READY TO ROLLING UPGRADE' message in the Master log as the cause of the shutdown. The Procedure Store is now empty.
4. Start new Masters with the new 2.2+ version.

Then you can do a rolling upgrade of the RegionServers one by one. See HBASE-21075 for more details.
9852
9853
9854 ---
9855
9856 * [HBASE-21536](https://issues.apache.org/jira/browse/HBASE-21536) | *Trivial* | **Fix completebulkload usage instructions**
9857
9858 Added completebulkload short name for BulkLoadHFilesTool to bin/hbase.
9859
9860
9861 ---
9862
9863 * [HBASE-22500](https://issues.apache.org/jira/browse/HBASE-22500) | *Blocker* | **Modify pom and jenkins jobs for hadoop versions**
9864
Change the default hadoop-3 version to 3.1.2. Drop support for the releases which are affected by CVE-2018-8029; see this email: https://lists.apache.org/thread.html/3d6831c3893cd27b6850aea2feff7d536888286d588e703c6ffd2e82@%3Cuser.hadoop.apache.org%3E
9866
9867
9868 ---
9869
9870 * [HBASE-22148](https://issues.apache.org/jira/browse/HBASE-22148) | *Blocker* | **Provide an alternative to CellUtil.setTimestamp**
9871
9872 <!-- markdown -->
9873
9874 The `CellUtil.setTimestamp` method changes to be an API with audience `LimitedPrivate(COPROC)` in HBase 3.0. With that designation the API should remain stable within a given minor release line, but may change between minor releases.
9875
9876 Previously, this method was deprecated in HBase 2.0 for removal in HBase 3.0. Deprecation messages in HBase 2.y releases have been updated to indicate the expected API audience change.
9877
9878
9879 ---
9880
9881 * [HBASE-21991](https://issues.apache.org/jira/browse/HBASE-21991) | *Major* | **Fix MetaMetrics issues - [Race condition, Faulty remove logic], few improvements**
9882
9883 The class LossyCounting was unintentionally marked Public but was never intended to be part of our public API. This oversight has been corrected and LossyCounting is now marked as Private and going forward may be subject to additional breaking changes or removal without notice. If you have taken a dependency on this class we recommend cloning it locally into your project before upgrading to this release.
9884
9885
9886 ---
9887
9888 * [HBASE-22226](https://issues.apache.org/jira/browse/HBASE-22226) | *Trivial* | **Incorrect level for headings in asciidoc**
9889
9890 Warnings for level headings are corrected in the book for the HBase Incompatibilities section.
9891
9892
9893 ---
9894
9895 * [HBASE-20970](https://issues.apache.org/jira/browse/HBASE-20970) | *Major* | **Update hadoop check versions for hadoop3 in hbase-personality**
9896
Add hadoop 3.0.3, 3.1.1, and 3.1.2 to our hadoop check jobs.
9898
9899
9900 ---
9901
9902 * [HBASE-21784](https://issues.apache.org/jira/browse/HBASE-21784) | *Major* | **Dump replication queue should show list of wal files ordered chronologically**
9903
9904 The DumpReplicationQueues tool will now list replication queues sorted in chronological order.
9905
9906
9907 ---
9908
9909 * [HBASE-22384](https://issues.apache.org/jira/browse/HBASE-22384) | *Minor* | **Formatting issues in administration section of book**
9910
Fixes a formatting issue in the administration section of the book, where the listing indentation was slightly off.
9912
9913
9914 ---
9915
9916 * [HBASE-22399](https://issues.apache.org/jira/browse/HBASE-22399) | *Major* | **Change default hadoop-two.version to 2.8.x and remove the 2.7.x hadoop checks**
9917
The default hadoop-two.version has been changed to 2.8.5, and hadoop versions earlier than 2.8.2 (exclusive) are no longer supported.
9919
9920
9921 ---
9922
9923 * [HBASE-22392](https://issues.apache.org/jira/browse/HBASE-22392) | *Trivial* | **Remove extra/useless +**
9924
9925 Removed extra + in HRegion, HStore and LoadIncrementalHFiles for branch-2 and HRegion and HStore for branch-1.
9926
9927
9928 ---
9929
9930 * [HBASE-20494](https://issues.apache.org/jira/browse/HBASE-20494) | *Major* | **Upgrade com.yammer.metrics dependency**
9931
9932 Updated metrics core from 3.2.1 to 3.2.6.
9933
9934
9935 ---
9936
9937 * [HBASE-22358](https://issues.apache.org/jira/browse/HBASE-22358) | *Minor* | **Change rubocop configuration for method length**
9938
9939 The rubocop definition for the maximum method length was set to 75.
9940
9941
9942 ---
9943
9944 * [HBASE-22379](https://issues.apache.org/jira/browse/HBASE-22379) | *Minor* | **Fix Markdown for "Voting on Release Candidates" in book**
9945
9946 Fixes the formatting of the "Voting on Release Candidates" to actually show the quote and code formatting of the RAT check.
9947
9948
9949 ---
9950
9951 * [HBASE-20851](https://issues.apache.org/jira/browse/HBASE-20851) | *Minor* | **Change rubocop config for max line length of 100**
9952
The rubocop configuration in the hbase-shell module now allows a line length of 100 characters, instead of 80 as before. For everything before 2.1.5 this change introduces rubocop itself.
9954
9955
9956 ---
9957
9958 * [HBASE-22054](https://issues.apache.org/jira/browse/HBASE-22054) | *Minor* | **Space Quota: Compaction is not working for super user in case of NO\_WRITES\_COMPACTIONS**
9959
9960 This change allows the system and superusers to initiate compactions, even when a space quota violation policy disallows compactions from happening. The original intent behind disallowing of compactions was to prevent end-user compactions from creating undue I/O load, not disallowing \*any\* compaction in the system.
9961
9962
9963 ---
9964
9965 * [HBASE-22292](https://issues.apache.org/jira/browse/HBASE-22292) | *Blocker* | **PreemptiveFastFailInterceptor clean repeatedFailuresMap issue**
9966
9967 Adds new configuration hbase.client.failure.map.cleanup.interval which defaults to ten minutes.
9968
9969
9970 ---
9971
9972 * [HBASE-22155](https://issues.apache.org/jira/browse/HBASE-22155) | *Major* | **Move 2.2.0 on to hbase-thirdparty-2.2.0**
9973
Updates libs used internally by hbase via hbase-thirdparty as follows:

 gson 2.8.1 -\> 2.8.5
 guava 22.0 -\> 27.1-jre
 pb 3.5.1 -\> 3.7.0
 netty 4.1.17 -\> 4.1.34
 commons-collections4 4.1 -\> 4.3
9981
9982
9983 ---
9984
9985 * [HBASE-22178](https://issues.apache.org/jira/browse/HBASE-22178) | *Major* | **Introduce a createTableAsync with TableDescriptor method in Admin**
9986
9987 Introduced
9988
9989 Future\<Void\> createTableAsync(TableDescriptor);
9990
9991
9992 ---
9993
9994 * [HBASE-22108](https://issues.apache.org/jira/browse/HBASE-22108) | *Major* | **Avoid passing null in Admin methods**
9995
9996 Introduced these methods:
9997 void move(byte[]);
9998 void move(byte[], ServerName);
9999 Future\<Void\> splitRegionAsync(byte[]);
10000
10001 These methods are deprecated:
10002 void move(byte[], byte[])
10003
10004
10005 ---
10006
10007 * [HBASE-22152](https://issues.apache.org/jira/browse/HBASE-22152) | *Major* | **Create a jenkins file for yetus to processing GitHub PR**
10008
Add a new jenkins file for running the pre-commit check on GitHub PRs.
10010
10011
10012 ---
10013
10014 * [HBASE-22007](https://issues.apache.org/jira/browse/HBASE-22007) | *Major* | **Add restoreSnapshot and cloneSnapshot with acl methods in AsyncAdmin**
10015
10016 Add cloneSnapshot/restoreSnapshot with acl methods in AsyncAdmin.
10017
10018
10019 ---
10020
10021 * [HBASE-22123](https://issues.apache.org/jira/browse/HBASE-22123) | *Minor* | **REST gateway reports Insufficient permissions exceptions as 404 Not Found**
10022
When permissions are insufficient, you now get:
10024
10025 HTTP/1.1 403 Forbidden
10026
10027 on the HTTP side, and in the message
10028
10029 Forbidden
10030 org.apache.hadoop.hbase.security.AccessDeniedException: org.apache.hadoop.hbase.security.AccessDeniedException: Insufficient permissions for user ‘myuser',action: get, tableName:mytable, family:cf.
10031 at org.apache.ranger.authorization.hbase.RangerAuthorizationCoprocessor.authorizeAccess(RangerAuthorizationCoprocessor.java:547)
10032 and the rest of the ADE stack
10033
10034
10035 ---
10036
10037 * [HBASE-22100](https://issues.apache.org/jira/browse/HBASE-22100) | *Minor* | **False positive for error prone warnings in pre commit job**
10038
The javac WARNING/ERROR output is now sorted before the diff is generated in pre-commit, which gives a stable output for error prone. The downside is that the output is sorted lexicographically, so the line numbers are also ordered lexicographically, which looks a bit strange to a human reader.
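
To illustrate why lexicographically sorted line numbers look odd, here is a small self-contained Java sketch. The warning strings are made-up placeholders and do not reproduce the actual javac or pre-commit output format; the sketch only shows how plain string ordering treats embedded numbers.

```java
import java.util.Arrays;

// Illustrative only: the warning text below is a hypothetical placeholder.
final class LexicographicSortSketch {

  // Sorts a copy of the lines using String's natural (character by character) order.
  static String[] sortLexicographically(String... lines) {
    String[] copy = lines.clone();
    Arrays.sort(copy);
    return copy;
  }

  public static void main(String[] args) {
    String[] sorted = sortLexicographically(
        "Foo.java:9: warning",
        "Foo.java:10: warning",
        "Foo.java:100: warning");
    // Prints the lines for 100, then 10, then 9:
    // '1' sorts before '9', and '0' sorts before ':'.
    for (String line : sorted) {
      System.out.println(line);
    }
  }
}
```

The order is stable from run to run, which is what the diff needs, even though it is not the numeric order a reader would expect.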
10040
10041
10042 ---
10043
10044 * [HBASE-22057](https://issues.apache.org/jira/browse/HBASE-22057) | *Major* | **Impose upper-bound on size of ZK ops sent in a single multi()**
10045
10046 Exposes a new configuration property "zookeeper.multi.max.size" which dictates the maximum size of deletes that HBase will make to ZooKeeper in a single RPC. This property defaults to 1MB, which should fall beneath the default ZooKeeper limit of 2MB, controlled by "jute.maxbuffer".
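
The idea of capping the size of a single multi() can be sketched as follows. This is a minimal illustration under stated assumptions, not HBase's actual implementation: the class and method names are hypothetical, each operation is represented only by an assumed size in bytes, and the note does not say how HBase measures operation size or handles an operation larger than the cap.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical sketch: groups operation sizes into batches under a byte cap.
final class MultiBatchSketch {

  // Splits opSizes (bytes per operation) into consecutive batches whose
  // total size stays at or below maxBytes where possible.
  static List<List<Integer>> partition(List<Integer> opSizes, int maxBytes) {
    List<List<Integer>> batches = new ArrayList<>();
    List<Integer> current = new ArrayList<>();
    long currentBytes = 0;
    for (int size : opSizes) {
      // Close the current batch if adding this op would exceed the cap.
      if (!current.isEmpty() && currentBytes + size > maxBytes) {
        batches.add(current);
        current = new ArrayList<>();
        currentBytes = 0;
      }
      // Assumption: an op larger than the cap is still sent, alone in its batch.
      current.add(size);
      currentBytes += size;
    }
    if (!current.isEmpty()) {
      batches.add(current);
    }
    return batches;
  }

  public static void main(String[] args) {
    int maxBytes = 1024 * 1024; // the 1MB default named in the note
    List<Integer> sizes = Arrays.asList(400_000, 400_000, 400_000, 700_000);
    // Prints [[400000, 400000], [400000], [700000]]
    System.out.println(partition(sizes, maxBytes));
  }
}
```

Each resulting batch would then correspond to one RPC, keeping every request below the ZooKeeper "jute.maxbuffer" limit when the cap is set beneath it.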
10047
10048
10049 ---
10050
10051 * [HBASE-22052](https://issues.apache.org/jira/browse/HBASE-22052) | *Major* | **pom cleaning; filter out jersey-core in hadoop2 to match hadoop3 and remove redunant version specifications**
10052
10053 <!-- markdown -->
10054 Fixed awkward dependency issue that prevented site building.
10055
10056 #### note specific to HBase 2.1.4
10057 HBase 2.1.4 shipped with an early version of this fix that incorrectly altered the libraries included in our binary assembly for using Apache Hadoop 2.7 (the current build default Hadoop version for 2.1.z). For folks running out of the box against a Hadoop 2.7 cluster (or folks who skip the installation step of [replacing the bundled Hadoop libraries](http://hbase.apache.org/book.html#hadoop)) this will result in a failure at Region Server startup due to a missing class definition. e.g.:
10058 ```
10059 2019-03-27 09:02:05,779 ERROR [main] regionserver.HRegionServer: Failed construction RegionServer
10060 java.lang.NoClassDefFoundError: org/apache/htrace/SamplerBuilder
10061         at org.apache.hadoop.hdfs.DFSClient.<init>(DFSClient.java:644)
10062         at org.apache.hadoop.hdfs.DFSClient.<init>(DFSClient.java:628)
10063         at org.apache.hadoop.hdfs.DistributedFileSystem.initialize(DistributedFileSystem.java:149)
10064         at org.apache.hadoop.fs.FileSystem.createFileSystem(FileSystem.java:2667)
10065         at org.apache.hadoop.fs.FileSystem.access$200(FileSystem.java:93)
10066         at org.apache.hadoop.fs.FileSystem$Cache.getInternal(FileSystem.java:2701)
10067         at org.apache.hadoop.fs.FileSystem$Cache.get(FileSystem.java:2683)
10068         at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:372)
10069         at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:171)
10070         at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:356)
10071         at org.apache.hadoop.fs.Path.getFileSystem(Path.java:295)
10072         at org.apache.hadoop.hbase.util.CommonFSUtils.getRootDir(CommonFSUtils.java:362)
10073         at org.apache.hadoop.hbase.util.CommonFSUtils.isValidWALRootDir(CommonFSUtils.java:411)
10074         at org.apache.hadoop.hbase.util.CommonFSUtils.getWALRootDir(CommonFSUtils.java:387)
10075         at org.apache.hadoop.hbase.regionserver.HRegionServer.initializeFileSystem(HRegionServer.java:704)
10076         at org.apache.hadoop.hbase.regionserver.HRegionServer.<init>(HRegionServer.java:613)
10077         at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
10078         at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:62)
10079         at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)
10080         at java.lang.reflect.Constructor.newInstance(Constructor.java:423)
10081         at org.apache.hadoop.hbase.regionserver.HRegionServer.constructRegionServer(HRegionServer.java:3029)
10082         at org.apache.hadoop.hbase.regionserver.HRegionServerCommandLine.start(HRegionServerCommandLine.java:63)
10083         at org.apache.hadoop.hbase.regionserver.HRegionServerCommandLine.run(HRegionServerCommandLine.java:87)
10084         at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
10085         at org.apache.hadoop.hbase.util.ServerCommandLine.doMain(ServerCommandLine.java:149)
10086         at org.apache.hadoop.hbase.regionserver.HRegionServer.main(HRegionServer.java:3047)
10087 Caused by: java.lang.ClassNotFoundException: org.apache.htrace.SamplerBuilder
10088         at java.net.URLClassLoader.findClass(URLClassLoader.java:381)
10089         at java.lang.ClassLoader.loadClass(ClassLoader.java:424)
10090         at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:349)
10091         at java.lang.ClassLoader.loadClass(ClassLoader.java:357)
10092         ... 26 more
10093
10094 ```
10095
10096 Workaround via any _one_ of the following:
* If you are running against a Hadoop cluster that is 2.8+, ensure you replace the Hadoop libraries in the default binary assembly with those for your version.
10098 * If you are running against a Hadoop cluster that is 2.8+, build the binary assembly from the source release while specifying your Hadoop version.
10099 * If you are running against a Hadoop cluster that is a supported 2.7 release, ensure the `hadoop` executable is in the `PATH` seen at Region Server startup and that you are not using the `HBASE_DISABLE_HADOOP_CLASSPATH_LOOKUP` bypass.
10100 * For any supported Hadoop version, manually make the Apache HTrace artifact `htrace-core-3.1.0-incubating.jar` available to all Region Servers via the HBASE_CLASSPATH environment variable.
10101 * For any supported Hadoop version, manually make the Apache HTrace artifact `htrace-core-3.1.0-incubating.jar` available to all Region Servers by copying it into the directory `${HBASE_HOME}/lib/client-facing-thirdparty/`.
10102
10103
10104 ---
10105
10106 * [HBASE-22065](https://issues.apache.org/jira/browse/HBASE-22065) | *Major* | **Add listTableDescriptors(List\<TableName\>) method in AsyncAdmin**
10107
10108 Add a listTableDescriptors(List\<TableName\>) method in the AsyncAdmin interface, to align with the Admin interface.
10109
10110
10111 ---
10112
10113 * [HBASE-22040](https://issues.apache.org/jira/browse/HBASE-22040) | *Major* | **Add mergeRegionsAsync with a List of region names method in AsyncAdmin**
10114
10115 Add a mergeRegionsAsync(byte[][], boolean) method in the AsyncAdmin interface.
10116
The client now throws IllegalArgumentException, rather than failing an assert, when asked to merge fewer than 2 regions. Likewise, the master now throws DoNotRetryIOException, rather than failing an assert, when asked to merge more than 2 regions, since only two regions can be merged at once for now.
10118
10119
10120 ---
10121
10122 * [HBASE-22039](https://issues.apache.org/jira/browse/HBASE-22039) | *Major* | **Should add the synchronous parameter for the XXXSwitch method in AsyncAdmin**
10123
Add a drainXXX parameter to the balancerSwitch/splitSwitch/mergeSwitch methods in the AsyncAdmin interface. It has the same meaning as the synchronous parameter of these methods in the Admin interface.
10125
10126
10127 ---
10128
10129 * [HBASE-21810](https://issues.apache.org/jira/browse/HBASE-21810) | *Major* | **bulkload  support set hfile compression on client**
10130
Bulk load (HFileOutputFormat2) now supports configuring the compression on the client. Set the job configuration "hbase.mapreduce.hfileoutputformat.compression" to override the auto-detection of the target table's compression.
10132
10133
10134 ---
10135
10136 * [HBASE-22000](https://issues.apache.org/jira/browse/HBASE-22000) | *Major* | **Deprecated isTableAvailable with splitKeys**
10137
10138 Deprecated AsyncTable.isTableAvailable(TableName, byte[][]).
10139
10140
10141 ---
10142
10143 * [HBASE-21871](https://issues.apache.org/jira/browse/HBASE-21871) | *Major* | **Support to specify a peer table name in VerifyReplication tool**
10144
10145 After HBASE-21871, we can specify a peer table name with --peerTableName in VerifyReplication tool like the following:
10146 hbase org.apache.hadoop.hbase.mapreduce.replication.VerifyReplication --peerTableName=peerTable 5 TestTable
10147
In addition, we can compare any 2 tables in any remote clusters by specifying both peerId and --peerTableName.
10149
10150 For example:
10151 hbase org.apache.hadoop.hbase.mapreduce.replication.VerifyReplication --peerTableName=peerTable zk1,zk2,zk3:2181/hbase TestTable
10152
10153
10154 ---
10155
10156 * [HBASE-15728](https://issues.apache.org/jira/browse/HBASE-15728) | *Major* | **Add remaining per-table region / store / flush / compaction related metrics**
10157
Adds the following per-table flush, split, and compaction metrics:
10159
    // split related metrics
    private MutableFastCounter splitRequest;
    private MutableFastCounter splitSuccess;
    private MetricHistogram splitTimeHisto;

    // flush related metrics
    private MetricHistogram flushTimeHisto;
    private MetricHistogram flushMemstoreSizeHisto;
    private MetricHistogram flushOutputSizeHisto;
    private MutableFastCounter flushedMemstoreBytes;
    private MutableFastCounter flushedOutputBytes;

    // compaction related metrics
    private MetricHistogram compactionTimeHisto;
    private MetricHistogram compactionInputFileCountHisto;
    private MetricHistogram compactionInputSizeHisto;
    private MetricHistogram compactionOutputFileCountHisto;
    private MetricHistogram compactionOutputSizeHisto;
    private MutableFastCounter compactedInputBytes;
    private MutableFastCounter compactedOutputBytes;

    private MetricHistogram majorCompactionTimeHisto;
    private MetricHistogram majorCompactionInputFileCountHisto;
    private MetricHistogram majorCompactionInputSizeHisto;
    private MetricHistogram majorCompactionOutputFileCountHisto;
    private MetricHistogram majorCompactionOutputSizeHisto;
    private MutableFastCounter majorCompactedInputBytes;
    private MutableFastCounter majorCompactedOutputBytes;
10188
10189
10190 ---
10191
10192 * [HBASE-20886](https://issues.apache.org/jira/browse/HBASE-20886) | *Critical* | **[Auth] Support keytab login in hbase client**
10193
From 2.2.0, HBase supports client login via keytab. To use this feature, the client should specify \`hbase.client.keytab.file\` and \`hbase.client.keytab.principal\` in hbase-site.xml. The connection will then contain the needed credentials, which are renewed periodically, to communicate with a kerberized HBase cluster.
10195
10196
10197 ---
10198
10199 * [HBASE-21410](https://issues.apache.org/jira/browse/HBASE-21410) | *Major* | **A helper page that help find all problematic regions and procedures**
10200
After HBASE-21410, the Master UI has a helper page that helps HBase operators quickly find all regions and procedure IDs (pids) that are stuck.
There are 2 entry points to this page.
One is in the Regions in Transition section: "num region(s) in transition" is now a link that you can click to see all regions in transition and their related procedure IDs.
The other is in the table details section: the number of CLOSING or OPENING regions is now a link that you can click to see the CLOSING or OPENING regions of a certain table and their related procedure IDs.
Besides listing all regions and related procedures, the helper page has 2 buttons at the top that show these regions or procedure IDs in text format. This is mainly to help operators copy and paste all problematic procedure IDs and encoded region names into HBCK2's command line, with which an operator can bypass these procedures or assign these regions.
10206
10207
10208 ---
10209
10210 * [HBASE-21588](https://issues.apache.org/jira/browse/HBASE-21588) | *Major* | **Procedure v2 wal splitting implementation**
10211
After HBASE-21588, we introduce a new way to coordinate WAL splitting using the procedure framework. This simplifies the WAL splitting process and removes the need to connect to ZooKeeper.
During ServerCrashProcedure, a SplitWALProcedure is created for each WAL that needs to be split. Each SplitWALProcedure then spawns a SplitWALRemoteProcedure to send the request to a RegionServer.
On the RegionServer side, the whole process is handled by SplitWALCallable. It splits the WAL and returns the result to the master.
According to the author's tests, this patch performs better as the number of WALs that need to be split increases, and it relieves pressure on ZooKeeper.
10216
10217
10218 ---
10219
10220 * [HBASE-20734](https://issues.apache.org/jira/browse/HBASE-20734) | *Major* | **Colocate recovered edits directory with hbase.wal.dir**
10221
Previously the recovered.edits directory was under the root directory. This JIRA moves the recovered.edits directory under hbase.wal.dir, if set. It also checks for any recovered.edits found under the root directory, for backwards compatibility. This is an improvement when hbase.wal.dir uses faster media (like SSD) or a more local FileSystem than the root dir.
10223
10224
10225 ---
10226
10227 * [HBASE-20401](https://issues.apache.org/jira/browse/HBASE-20401) | *Minor* | **Make \`MAX\_WAIT\` and \`waitIfNotFinished\` in CleanerContext configurable**
10228
When the oldwals (and hfile) cleaner cleans stale WALs (and HFiles), it periodically checks and waits for the clean results from the filesystem. The total wait time will be no more than a max time.

The check interval configurations are hbase.oldwals.cleaner.thread.check.interval.msec (default is 500 ms) and hbase.regionserver.hfilecleaner.thread.check.interval.msec (default is 1000 ms).

The max time configurations are hbase.oldwals.cleaner.thread.timeout.msec and hbase.regionserver.hfilecleaner.thread.timeout.msec. Both are set to 60 seconds by default.

All support dynamic configuration.

For example, in the oldwals cleaning scenario, one may consider tuning hbase.oldwals.cleaner.thread.timeout.msec and hbase.oldwals.cleaner.thread.check.interval.msec:

1. If deleting an oldwal never completes (strange but possible), the delete file task waits for a max of 60 seconds. 60 seconds might be too long here; conversely, it may need to be increased beyond 60 seconds for use cases with slow file deletes.
2. The check-and-wait period for a file delete defaults to 500 milliseconds. One might tune this period to a shorter interval to check more frequently, or to a longer interval to avoid checking too often (a longer interval may be used to avoid checking too fast on high-latency storage).
10241
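The interplay between the check interval and the max wait time can be illustrated with a minimal polling loop. This is only a sketch under assumptions, not the cleaner's actual code: `CleanerWaitSketch` and `waitUntilDone` are hypothetical names, and the two parameters simply mirror the roles of the `check.interval.msec` and `timeout.msec` settings.

```java
import java.util.function.BooleanSupplier;

final class CleanerWaitSketch {
  /**
   * Polls isDone every checkIntervalMs until it reports completion or timeoutMs has elapsed.
   * Returns true if the task finished in time, false if the wait timed out or was interrupted.
   */
  static boolean waitUntilDone(BooleanSupplier isDone, long checkIntervalMs, long timeoutMs) {
    long deadline = System.nanoTime() + timeoutMs * 1_000_000L;
    while (!isDone.getAsBoolean()) {
      long remainingMs = (deadline - System.nanoTime()) / 1_000_000L;
      if (remainingMs <= 0) {
        return false; // gave up: the total wait reached the max time
      }
      try {
        // Never sleep past the deadline, even if the check interval is longer.
        Thread.sleep(Math.min(checkIntervalMs, remainingMs));
      } catch (InterruptedException e) {
        Thread.currentThread().interrupt(); // preserve the interrupt for the caller
        return false;
      }
    }
    return true;
  }
}
```

A shorter interval notices completion sooner at the cost of more checks; a longer one suits high-latency storage, as the second point above describes.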
10242
10243 ---
10244
10245 * [HBASE-21481](https://issues.apache.org/jira/browse/HBASE-21481) | *Major* | **[acl] Superuser's permissions should not be granted or revoked by any non-su global admin**
10246
HBASE-21481 improves access control by strengthening the protection of superusers' privileges.
10248
10249
10250 ---
10251
10252 * [HBASE-21082](https://issues.apache.org/jira/browse/HBASE-21082) | *Critical* | **Reimplement assign/unassign related procedure metrics**
10253
Now we have four types of RIT procedure metrics: assign, unassign, move, and reopen. The meaning of assign/unassign has changed: moving a region no longer increases the unassign metric and then the assign metric.
Also introduces two new procedure metrics, open and close, which track the open/close region calls to the region server. Open/close may be sent multiple times to finish a RIT because of retries.
10256
10257
10258 ---
10259
10260 * [HBASE-20724](https://issues.apache.org/jira/browse/HBASE-20724) | *Critical* | **Sometimes some compacted storefiles are still opened after region failover**
10261
Problem: This is an old problem dating back to HBASE-2231. The compaction event marker was only written to the WAL. But after a flush, the WAL may be archived, which means a useful compaction event marker may be deleted, too. The compacted store files then cannot be archived when the region opens and replays the WAL.

Solution: After this JIRA, the compaction event tracker is written to the HFile. When a region opens and loads store files, it reads the compaction event tracker from the HFile and archives the compacted store files that still exist.
10265
10266
10267 ---
10268
10269 * [HBASE-21820](https://issues.apache.org/jira/browse/HBASE-21820) | *Major* | **Implement CLUSTER quota scope**
10270
HBase contains two quota scopes: MACHINE and CLUSTER. Before this patch, set quota operations did not expose the scope option to the client API and used MACHINE as the default, so CLUSTER scope could not be set or used.
Shell commands were as follows:
set\_quota TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec'

This issue implements CLUSTER scope in a simple way: For user, namespace, and user over namespace quotas, use [ClusterLimit / RSNum] as the machine limit. For table and user over table quotas, use [ClusterLimit / TotalTableRegionNum \* MachineTableRegionNum] as the machine limit.
After this patch, users can set a CLUSTER scope quota, but MACHINE is still the default if the scope is omitted.
Shell commands are as follows:
set\_quota TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec'
set\_quota TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec', SCOPE =\> MACHINE
set\_quota TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec', SCOPE =\> CLUSTER
10281
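The two formulas above can be worked through with a small calculation. This is a sketch only: `ClusterQuotaSketch` and its method names are hypothetical, and using floating-point division is an assumption, since the note does not say how HBase rounds the result.

```java
final class ClusterQuotaSketch {
  /** User, namespace, and user-over-namespace quotas: ClusterLimit / RSNum. */
  static double machineLimit(long clusterLimit, int regionServerCount) {
    return (double) clusterLimit / regionServerCount;
  }

  /** Table and user-over-table quotas: ClusterLimit / TotalTableRegionNum * MachineTableRegionNum. */
  static double machineTableLimit(long clusterLimit, int totalTableRegions,
      int regionsOnThisServer) {
    return (double) clusterLimit / totalTableRegions * regionsOnThisServer;
  }
}
```

For instance, a cluster limit of 100 requests per second across 4 region servers gives each server a limit of 25. For a table with 10 regions, a server hosting 3 of them gets 100 / 10 \* 3 = 30.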
10282
10283 ---
10284
10285 * [HBASE-21057](https://issues.apache.org/jira/browse/HBASE-21057) | *Minor* | **upgrade to latest spotbugs**
10286
10287 Change spotbugs version to 3.1.11.
10288
10289
10290 ---
10291
10292 * [HBASE-21922](https://issues.apache.org/jira/browse/HBASE-21922) | *Major* | **BloomContext#sanityCheck may failed when use ROWPREFIX\_DELIMITED bloom filter**
10293
Remove bloom filter type ROWPREFIX\_DELIMITED. It may be added back when a better solution is found.
10295
10296
10297 ---
10298
10299 * [HBASE-21783](https://issues.apache.org/jira/browse/HBASE-21783) | *Major* | **Support exceed user/table/ns throttle quota if region server has available quota**
10300
Support enabling or disabling exceed throttle quota. Exceed throttle quota means a user can consume more than the user/namespace/table quota if the region server has additional available quota because other users are not consuming it at the same time.
Use the following shell commands to enable/disable exceed throttle quota:
enable\_exceed\_throttle\_quota
disable\_exceed\_throttle\_quota
There are two limitations when enabling exceed throttle quota:
1. At least one read and one write region server throttle quota must be set;
2. All region server throttle quotas must use the seconds time unit. Once previous requests exceed their quota and consume region server quota, quota in other time units may take a long time to refill, which may affect later requests.
10307
10308
10309 ---
10310
10311 * [HBASE-20587](https://issues.apache.org/jira/browse/HBASE-20587) | *Major* | **Replace Jackson with shaded thirdparty gson**
10312
Remove Jackson dependencies from most HBase modules except hbase-rest, and use shaded Gson instead. The output JSON will be a bit different since Jackson can use getters/setters, but Gson always uses the fields.
10314
10315
10316 ---
10317
10318 * [HBASE-21928](https://issues.apache.org/jira/browse/HBASE-21928) | *Major* | **Deprecated HConstants.META\_QOS**
10319
Mark HConstants.META\_QOS as deprecated. It is the highest priority and is for internal use only. You should not try to set a priority greater than or equal to this value; doing so is harmless but also useless.
10321
10322
10323 ---
10324
10325 * [HBASE-17942](https://issues.apache.org/jira/browse/HBASE-17942) | *Major* | **Disable region splits and merges per table**
10326
10327 This patch adds the ability to disable split and/or merge for a table (By default, split and merge are enabled for a table).
10328
10329
10330 ---
10331
10332 * [HBASE-21636](https://issues.apache.org/jira/browse/HBASE-21636) | *Major* | **Enhance the shell scan command to support missing scanner specifications like ReadType, IsolationLevel etc.**
10333
Allows the shell to set Scan options previously not exposed. See the additions in the scan help by typing the following in the hbase shell:
10335
10336 hbase\> help 'scan'
10337
10338
10339 ---
10340
10341 * [HBASE-21201](https://issues.apache.org/jira/browse/HBASE-21201) | *Major* | **Support to run VerifyReplication MR tool without peerid**
10342
We can specify peerQuorumAddress instead of peerId in the VerifyReplication tool, so a peerId no longer needs to be set up when using this tool.
10344
10345 For example:
10346 hbase org.apache.hadoop.hbase.mapreduce.replication.VerifyReplication zk1,zk2,zk3:2181/hbase testTable
10347
10348
10349 ---
10350
10351 * [HBASE-21838](https://issues.apache.org/jira/browse/HBASE-21838) | *Major* | **Create a special ReplicationEndpoint just for verifying the WAL entries are fine**
10352
Introduce a VerifyWALEntriesReplicationEndpoint which replicates nothing and only verifies that all the cells are valid.
It can be used to catch bugs in WAL writing, since most of the time the WALs are not read again after being written unless a region server crashes.
10355
10356
10357 ---
10358
10359 * [HBASE-21727](https://issues.apache.org/jira/browse/HBASE-21727) | *Minor* | **Simplify documentation around client timeout**
10360
10361 Deprecated HBaseConfiguration#getInt(Configuration, String, String, int) method and removed it from 3.0.0 version.
10362
10363
10364 ---
10365
10366 * [HBASE-21764](https://issues.apache.org/jira/browse/HBASE-21764) | *Major* | **Size of in-memory compaction thread pool should be configurable**
10367
Introduced a new config key in this issue: hbase.regionserver.inmemory.compaction.pool.size. The default value is 10. You can configure this to set the size of the in-memory compaction pool. Note that all memstores in one region server share the same pool, so if you have many regions in one region server, you need to set this larger to compact faster for better read performance.
10369
10370
10371 ---
10372
10373 * [HBASE-21684](https://issues.apache.org/jira/browse/HBASE-21684) | *Major* | **Throw DNRIOE when connection or rpc client is closed**
10374
10375 Make StoppedRpcClientException extend DoNotRetryIOException.
10376
10377
10378 ---
10379
10380 * [HBASE-21739](https://issues.apache.org/jira/browse/HBASE-21739) | *Major* | **Move grant/revoke from regionserver to master**
10381
To implement user permission control in Procedure V2, first move the grant and revoke methods from AccessController to the master.
Mark AccessController#grant and AccessController#revoke as deprecated; please use Admin#grant and Admin#revoke instead.
10384
10385
10386 ---
10387
10388 * [HBASE-21791](https://issues.apache.org/jira/browse/HBASE-21791) | *Blocker* | **Upgrade thrift dependency to 0.12.0**
10389
10390 IMPORTANT: Due to security issues, all users who use hbase thrift should avoid using releases which do not have this fix.
10391
The affected releases are:
10393 2.1.x: 2.1.2 and below
10394 2.0.x: 2.0.4 and below
10395 1.x: 1.4.x and below
10396
If you are using one of the affected releases above, please consider upgrading to a newer release ASAP.
10398
10399
10400 ---
10401
10402 * [HBASE-21792](https://issues.apache.org/jira/browse/HBASE-21792) | *Major* | **Mark HTableMultiplexer as deprecated and remove it in 3.0.0**
10403
10404 HTableMultiplexer exposes the implementation class, and it is incomplete, so we mark it as deprecated and remove it in 3.0.0 release.
10405
10406 There is no direct replacement for HTableMultiplexer, please use BufferedMutator if you want to batch mutations to a table.
10407
10408
10409 ---
10410
10411 * [HBASE-21782](https://issues.apache.org/jira/browse/HBASE-21782) | *Major* | **LoadIncrementalHFiles should not be IA.Public**
10412
10413 Introduce a BulkLoadHFiles interface which is marked as IA.Public, for doing bulk load programmatically.
10414 Introduce a BulkLoadHFilesTool which extends BulkLoadHFiles, and is marked as IA.LimitedPrivate(TOOLS), for using from command line.
10415 The old LoadIncrementalHFiles is deprecated and will be removed in 3.0.0.
10416
10417
10418 ---
10419
10420 * [HBASE-21762](https://issues.apache.org/jira/browse/HBASE-21762) | *Major* | **Move some methods in ClusterConnection to Connection**
10421
Move the two getHbck methods from ClusterConnection to Connection, and mark the methods as IA.LimitedPrivate(HBCK), as ClusterConnection is IA.Private and should not be depended on by HBCK2.
10423
10424 Add a clearRegionLocationCache method in Connection to clear the region location cache for all the tables. As in RegionLocator, most of the methods have a 'reload' parameter, which implicitly tells user that we have a region location cache, so adding a method to clear the cache is fine.
10425
10426
10427 ---
10428
10429 * [HBASE-21713](https://issues.apache.org/jira/browse/HBASE-21713) | *Major* | **Support set region server throttle quota**
10430
Support setting a region server RPC throttle quota, which represents the read/write capacity of region servers and throttles when a region server's total requests exceed the limit.
10432
10433 Use the following shell command to set RS quota:
10434 set\_quota TYPE =\> THROTTLE, REGIONSERVER =\> 'all', THROTTLE\_TYPE =\> WRITE, LIMIT =\> '20000req/sec'
10435 set\_quota TYPE =\> THROTTLE, REGIONSERVER =\> 'all', LIMIT =\> NONE
"all" represents the throttle quota of all region servers; setting a quota for a specific region server isn't supported currently.
10437
10438
10439 ---
10440
10441 * [HBASE-21689](https://issues.apache.org/jira/browse/HBASE-21689) | *Minor* | **Make table/namespace specific current quota info available in shell(describe\_namespace & describe)**
10442
10443 In shell commands "describe\_namespace" and "describe", which are used to see the descriptors of the namespaces and tables respectively, quotas set on that particular namespace/table will also be printed along.
10444
10445
10446 ---
10447
10448 * [HBASE-17370](https://issues.apache.org/jira/browse/HBASE-17370) | *Major* | **Fix or provide shell scripts to drain and decommission region server**
10449
10450 Adds shell support for the following:
10451 - List decommissioned/draining region servers
10452 - Decommission a list of region servers, optionally offload corresponding regions
10453 - Recommission a region server, optionally load a list of passed regions
10454
10455
10456 ---
10457
10458 * [HBASE-21734](https://issues.apache.org/jira/browse/HBASE-21734) | *Major* | **Some optimization in FilterListWithOR**
10459
After HBASE-21620, FilterListWithOR became a bit slow because each sub-filter's return code (RC) must be merged, whereas before HBASE-21620 much of the RC merging was skipped, but the logic was wrong. So here we choose another way to optimize performance: removing KeyValueUtil#toNewKeyCell.
Anoop Sam John suggested that KeyValueUtil#toNewKeyCell used to save some GC: by copying the key part of a cell into a single byte[], the block the cell refers to is no longer referenced by the filter list, so the upper layer can GC the data block quickly. After HBASE-21620, however, the prevCellList is updated for every encountered cell, so the lifecycle of a cell in the FilterList's prevCellList is much shorter. We therefore just use the cell reference to save CPU.
We also removed all Arrays stream usage in the filter list, because it was also quite time-consuming in our tests.
10463
10464
10465 ---
10466
10467 * [HBASE-21738](https://issues.apache.org/jira/browse/HBASE-21738) | *Critical* | **Remove all the CSLM#size operation in our memstore because it's an quite time consuming.**
10468
We found that memstore snapshotting took a long time because it called the time-consuming ConcurrentSkipListMap#size, which caused p999 latency spikes. In this issue, we remove all ConcurrentSkipListMap#size calls in the memstore by tracking cellsCount in MemStoreSizing. As described in the issue, the p999 latency spike was mitigated.
10470
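The idea of keeping a running count instead of calling ConcurrentSkipListMap#size can be sketched as follows. This is an illustration, not the memstore code: `CountedSkipListSketch` and its String keys and values are hypothetical simplifications.

```java
import java.util.concurrent.ConcurrentSkipListMap;
import java.util.concurrent.atomic.AtomicLong;

final class CountedSkipListSketch {
  private final ConcurrentSkipListMap<String, String> cells = new ConcurrentSkipListMap<>();
  private final AtomicLong cellsCount = new AtomicLong();

  /** Adds a cell and bumps the counter only when the key was not already present. */
  void add(String key, String value) {
    if (cells.put(key, value) == null) {
      cellsCount.incrementAndGet();
    }
  }

  /** Constant-time count, unlike ConcurrentSkipListMap#size, which traverses the elements. */
  long getCellsCount() {
    return cellsCount.get();
  }
}
```

The counter costs one atomic increment per insert, in exchange for never walking the whole map when the count is needed.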
10471
10472 ---
10473
10474 * [HBASE-21034](https://issues.apache.org/jira/browse/HBASE-21034) | *Major* | **Add new throttle type: read/write capacity unit**
10475
Provides a new throttle type: capacity unit. One read/write/request capacity unit represents a read/write/read+write of up to 1K of data. If the data size is more than 1K, additional capacity units are consumed.
10477
10478 Use shell command to set capacity unit(CU):
10479 set\_quota TYPE =\> THROTTLE, THROTTLE\_TYPE =\> WRITE, USER =\> 'u1', LIMIT =\> '10CU/sec'
10480
Use the "hbase.quota.read.capacity.unit" property to set the data size of one read capacity unit in bytes; the default value is 1K. Use the "hbase.quota.write.capacity.unit" property to set the data size of one write capacity unit in bytes; the default value is also 1K.
10482
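As a worked example of the capacity unit arithmetic, the sketch below rounds any partial unit up to a whole one. `CapacityUnitSketch` and its names are hypothetical, and rounding up is an assumption drawn from "consume additional capacity units"; how HBase accounts for a zero-byte operation is not covered by this note.

```java
final class CapacityUnitSketch {
  /** Default size of one capacity unit in bytes (1K), per the note above. */
  static final long DEFAULT_CAPACITY_UNIT_BYTES = 1024;

  /** Capacity units consumed by an operation of dataSizeBytes, rounding partial units up. */
  static long capacityUnits(long dataSizeBytes, long unitBytes) {
    return (dataSizeBytes + unitBytes - 1) / unitBytes;
  }
}
```

With the default unit size, a 1024-byte write consumes 1 CU and a 1025-byte write consumes 2 CUs, so a '10CU/sec' limit allows roughly ten writes of up to 1K each per second.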
10483
10484 ---
10485
10486 * [HBASE-21595](https://issues.apache.org/jira/browse/HBASE-21595) | *Minor* | **Print thread's information and stack traces when RS is aborting forcibly**
10487
10488 Does thread dump on stdout on abort.
10489
10490
10491 ---
10492
10493 * [HBASE-21732](https://issues.apache.org/jira/browse/HBASE-21732) | *Critical* | **Should call toUpperCase before using Enum.valueOf in some methods for ColumnFamilyDescriptor**
10494
10495 Now all the Enum configs in ColumnFamilyDescriptor can accept lower case config value.
10496
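The fix named in the issue title, calling toUpperCase before Enum.valueOf, can be shown with a minimal example. `EnumParseSketch` and its `Compression` enum are hypothetical stand-ins, not the ColumnFamilyDescriptor code.

```java
import java.util.Locale;

final class EnumParseSketch {
  /** Hypothetical enum standing in for an enum-valued column family setting. */
  enum Compression { NONE, GZ, SNAPPY }

  /** Upper-cases the configured value before Enum.valueOf so "snappy" and "SNAPPY" both parse. */
  static Compression parse(String configured) {
    return Compression.valueOf(configured.toUpperCase(Locale.ROOT));
  }
}
```

Without the toUpperCase call, Enum.valueOf("snappy") would throw IllegalArgumentException because enum constant lookup is case-sensitive. Locale.ROOT is used here so the conversion does not depend on the JVM's default locale.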
10497
10498 ---
10499
10500 * [HBASE-21712](https://issues.apache.org/jira/browse/HBASE-21712) | *Minor* | **Make submit-patch.py python3 compatible**
10501
10502 Python3 support was added to dev-support/submit-patch.py. To install newly required dependencies run \`pip install -r dev-support/python-requirements.txt\` command.
10503
10504
10505 ---
10506
10507 * [HBASE-21657](https://issues.apache.org/jira/browse/HBASE-21657) | *Major* | **PrivateCellUtil#estimatedSerializedSizeOf has been the bottleneck in 100% scan case.**
10508
In HBASE-21657, the path of estimatedSerializedSize() and estimatedSerializedSizeOfCell() was simplified by moving the general getSerializedSize() and heapSize() from ExtendedCell to the Cell interface. The patch also includes some other improvements:

1. In 99% of cases, our cells have no tags, so HFileScannerImpl just returns a NoTagsByteBufferKeyValue when there are no tags. This saves lots of CPU time when sending tag-less cells to RPC, because the length can be returned directly instead of computing the serialized size from the offset/length of each field (row/cf/cq..).
2. Move the subclasses' getSerializedSize implementation from ExtendedCell to their own classes, so we no longer need to call ExtendedCell's getSerializedSize() first and then forward to the subclass's getSerializedSize(withTags).
3. Give an estimated size for the result ArrayList to avoid frequent list growth in a big scan; the array size is now estimated as min(scan.rows, 512). This also helps a lot.

We gain almost 40% throughput improvement in the 100% scan case for branch-2 (cacheHitRatio~100%)[1]. However, it is an incompatible change in some cases: if upstream users implemented their own Cells (rare, but possible), their code will fail to compile.
10522
10523
10524 ---
10525
10526 * [HBASE-21647](https://issues.apache.org/jira/browse/HBASE-21647) | *Major* | **Add status track for splitting WAL tasks**
10527
10528 Adds task monitor that shows ServerCrashProcedure progress in UI.
10529
10530
10531 ---
10532
10533 * [HBASE-21652](https://issues.apache.org/jira/browse/HBASE-21652) | *Major* | **Refactor ThriftServer making thrift2 server inherited from thrift1 server**
10534
Before this issue, the thrift1 server and thrift2 server were totally different servers. If a new feature was added to the thrift1 server, the thrift2 server had to make the same change to support it (e.g. authorization). After this issue, the thrift2 server inherits from the thrift1 server, so it now has all the features the thrift1 server has (e.g. HTTP support, which the thrift2 server did not have before). The way to start the thrift1 or thrift2 server remains the same after this issue.
10536
10537
10538 ---
10539
10540 * [HBASE-21661](https://issues.apache.org/jira/browse/HBASE-21661) | *Major* | **Provide Thrift2 implementation of Table/Admin**
10541
ThriftAdmin/ThriftTable are implemented based on Thrift2. With ThriftAdmin/ThriftTable, people can use the thrift2 protocol just like HTable/HBaseAdmin.
Example of using ThriftConnection:
Configuration conf = HBaseConfiguration.create();
conf.set(ClusterConnection.HBASE\_CLIENT\_CONNECTION\_IMPL, ThriftConnection.class.getName());
Connection conn = ConnectionFactory.createConnection(conf);
Table table = conn.getTable(tablename);
It is just like a normal Connection, with a usage experience similar to the default ConnectionImplementation.
10549
10550
10551 ---
10552
10553 * [HBASE-21618](https://issues.apache.org/jira/browse/HBASE-21618) | *Critical* | **Scan with the same startRow(inclusive=true) and stopRow(inclusive=false) returns one result**
10554
There was a bug when scanning with the same startRow(inclusive=true) and stopRow(inclusive=false). The old, incorrect behavior was to return one result. After this fix, the new, correct behavior is to return nothing.
10556
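The corrected semantics match a half-open range [startRow, stopRow), which is empty when both bounds are equal. The sketch below shows this with a plain `TreeMap` rather than an HBase Scan; `ScanRangeSketch` is a hypothetical name used only for this example.

```java
import java.util.NavigableMap;
import java.util.TreeMap;

final class ScanRangeSketch {
  /** Returns the rows in [startRow, stopRow), i.e. start inclusive and stop exclusive. */
  static NavigableMap<String, String> scan(TreeMap<String, String> rows, String startRow,
      String stopRow) {
    return rows.subMap(startRow, true, stopRow, false);
  }
}
```

With rows "row1", "row2", and "row3", scanning from "row2" to "row2" yields an empty result, because the exclusive stop row rules out the only candidate.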
10557
10558 ---
10559
10560 * [HBASE-21159](https://issues.apache.org/jira/browse/HBASE-21159) | *Major* | **Add shell command to switch throttle on or off**
10561
Support enabling or disabling RPC throttle when HBase quota is enabled. If HBase quota is enabled, RPC throttle is enabled by default. When RPC throttle is disabled, HBase will not throttle any request. Use the following commands to switch RPC throttle: enable\_rpc\_throttle / disable\_rpc\_throttle.
10563
10564
10565 ---
10566
10567 * [HBASE-21659](https://issues.apache.org/jira/browse/HBASE-21659) | *Minor* | **Avoid to load duplicate coprocessors in system config and table descriptor**
10568
Add a new configuration "hbase.skip.load.duplicate.table.coprocessor". The default value is false, to stay compatible with the old behavior. Set it to true to skip loading duplicate table coprocessors.
10570
10571
10572 ---
10573
10574 * [HBASE-21650](https://issues.apache.org/jira/browse/HBASE-21650) | *Major* | **Add DDL operation and some other miscellaneous to thrift2**
10575
Added DDL operations and some other structure definitions to thrift2. Methods added:
create/modify/addColumnFamily/deleteColumnFamily/modifyColumnFamily/enable/disable/truncate/delete table
create/modify/delete namespace
get(list)TableDescriptor(s)/get(list)NamespaceDescriptor(s)
tableExists/isTableEnabled/isTableDisabled/isTableAvailable
And some class definitions along with those methods.
10582
10583
10584 ---
10585
10586 * [HBASE-21643](https://issues.apache.org/jira/browse/HBASE-21643) | *Major* | **Introduce two new region coprocessor method and deprecated postMutationBeforeWAL**
10587
Deprecate the region coprocessor method postMutationBeforeWAL and introduce two new region coprocessor methods, postIncrementBeforeWAL and postAppendBeforeWAL, instead.
10589
10590
10591 ---
10592
10593 * [HBASE-21635](https://issues.apache.org/jira/browse/HBASE-21635) | *Major* | **Use maven enforcer to ban imports from illegal packages**
10594
10595 Use de.skuzzle.enforcer.restrict-imports-enforcer-rule extension for maven enforcer plugin to ban illegal imports at compile time. Now if you use illegal imports, for example, import com.google.common.\*, there will be a compile error, instead of a checkstyle warning.
10596
10597
10598 ---
10599
10600 * [HBASE-21401](https://issues.apache.org/jira/browse/HBASE-21401) | *Critical* | **Sanity check when constructing the KeyValue**
10601
Add a sanity check when constructing a KeyValue from a byte[]. This constructor is used when reading a KeyValue from a socket, an HFile, or a WAL (replication). The sanity check isn't designed to discover bit corruption during network transfer or disk IO; it is designed to detect bugs inside HBase in advance. HBASE-21459 indicated that the performance loss is extremely small for different kinds of KeyValue.
10603
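To give a flavor of such a check, the sketch below validates only the two leading length fields of a byte[] laid out as a 4-byte key length, a 4-byte value length, then the key and value bytes. It is a heavily simplified assumption of the layout, and `KeyValueSanitySketch` is a hypothetical name; the real check inspects more fields (row, family, tags, and so on).

```java
import java.nio.ByteBuffer;

final class KeyValueSanitySketch {
  private static final int LENGTH_FIELDS_BYTES = 2 * Integer.BYTES;

  /** Throws IllegalArgumentException if the length fields do not fit inside the backing array. */
  static void check(byte[] buf) {
    if (buf.length < LENGTH_FIELDS_BYTES) {
      throw new IllegalArgumentException("Too short for the two length fields: " + buf.length);
    }
    ByteBuffer bb = ByteBuffer.wrap(buf); // big-endian by default
    int keyLength = bb.getInt();
    int valueLength = bb.getInt();
    if (keyLength <= 0 || valueLength < 0) {
      throw new IllegalArgumentException(
          "Invalid key/value length: " + keyLength + "/" + valueLength);
    }
    // Use long arithmetic so huge length fields cannot overflow the sum.
    long expected = (long) LENGTH_FIELDS_BYTES + keyLength + valueLength;
    if (expected > buf.length) {
      throw new IllegalArgumentException(
          "Lengths overflow the buffer: " + expected + " > " + buf.length);
    }
  }
}
```

A check like this fails fast at construction time, so a malformed buffer surfaces where it was produced rather than later as an out-of-bounds read.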
10604
10605 ---
10606
10607 * [HBASE-21554](https://issues.apache.org/jira/browse/HBASE-21554) | *Minor* | **Show replication endpoint classname for replication peer on master web UI**
10608
10609 The replication UI on master will show the replication endpoint classname.
10610
10611
10612 ---
10613
10614 * [HBASE-21549](https://issues.apache.org/jira/browse/HBASE-21549) | *Major* | **Add shell command for serial replication peer**
10615
Add a SERIAL flag to the add\_peer command to identify whether or not the replication peer is a serial replication peer. The default serial flag is false.
10617
10618
10619 ---
10620
10621 * [HBASE-21453](https://issues.apache.org/jira/browse/HBASE-21453) | *Major* | **Convert ReadOnlyZKClient to DEBUG instead of INFO**
10622
10623 Log level of ReadOnlyZKClient moved to debug.
10624
10625
10626 ---
10627
10628 * [HBASE-21283](https://issues.apache.org/jira/browse/HBASE-21283) | *Minor* | **Add new shell command 'rit' for listing regions in transition**
10629
10630 <!-- markdown -->
10631
10632 The HBase `shell` now includes a command to list regions currently in transition.
10633
10634 ```
10635 HBase Shell
10636 Use "help" to get list of supported commands.
10637 Use "exit" to quit this interactive shell.
10638 Version 1.5.0-SNAPSHOT, r9bb6d2fa8b760f16cd046657240ebd4ad91cb6de, Mon Oct  8 21:05:50 UTC 2018
10639
10640 hbase(main):001:0> help 'rit'
10641 List all regions in transition.
10642 Examples:
10643   hbase> rit
10644
10645 hbase(main):002:0> create ...
10646 0 row(s) in 2.5150 seconds
10647 => Hbase::Table - IntegrationTestBigLinkedList
10648
10649 hbase(main):003:0> rit
10650 0 row(s) in 0.0340 seconds
10651
10652 hbase(main):004:0> unassign '56f0c38c81ae453d19906ce156a2d6a1'
10653 0 row(s) in 0.0540 seconds
10654
10655 hbase(main):005:0> rit
10656 IntegrationTestBigLinkedList,L\xCC\xCC\xCC\xCC\xCC\xCC\xCB,1539117183224.56f0c38c81ae453d19906ce156a2d6a1. state=PENDING_CLOSE, ts=Tue Oct 09 20:33:34 UTC 2018 (0s ago), server=null
10657 1 row(s) in 0.0170 seconds
10658 ```
10659
10660
10661 ---
10662
10663 * [HBASE-21567](https://issues.apache.org/jira/browse/HBASE-21567) | *Major* | **Allow overriding configs starting up the shell**
10664
Allow passing -Dkey=value options to the shell to override hbase-\* configuration, e.g.:
10666
10667 $ ./bin/hbase shell -Dhbase.zookeeper.quorum=ZK0.remote.cluster.example.org,ZK1.remote.cluster.example.org,ZK2.remote.cluster.example.org -Draining=false
10668 ...
10669 hbase(main):001:0\> @shell.hbase.configuration.get("hbase.zookeeper.quorum")
10670 =\> "ZK0.remote.cluster.example.org,ZK1.remote.cluster.example.org,ZK2.remote.cluster.example.org"
10671 hbase(main):002:0\> @shell.hbase.configuration.get("raining")
10672 =\> "false"
10673
10674
10675 ---
10676
10677 * [HBASE-21560](https://issues.apache.org/jira/browse/HBASE-21560) | *Major* | **Return a new TableDescriptor for MasterObserver#preModifyTable to allow coprocessor modify the TableDescriptor**
10678
Incompatible change. Allow MasterObserver#preModifyTable to return a new TableDescriptor. The master will use the returned TableDescriptor to modify the table.
10680
10681
10682 ---
10683
10684 * [HBASE-21551](https://issues.apache.org/jira/browse/HBASE-21551) | *Blocker* | **Memory leak when use scan with STREAM at server side**
10685
10686 <!-- markdown -->
10687 ### Summary
HBase clusters will experience Region Server failures from out-of-memory errors caused by a leak, given any of the following:
10689
10690 * User initiates Scan operations set to use the STREAM reading type
* User initiates Scan operations set to use the default reading type that read more than 4 times the block size of the column families involved in the scan (e.g. by default 4 * 64 KiB)
10692 * Compactions run
10693
10694 ### Root cause
10695
10696 When there are long running scans the Region Server process attempts to optimize access by using a different API geared towards sequential access. Due to an error in HBASE-20704 for HBase 2.0+ the Region Server fails to release related resources when those scans finish. That same optimization path is always used for the HBase internal file compaction process.
10697
10698 ### Workaround
10699
The impact of this error can be minimized by setting the config value `hbase.storescanner.pread.max.bytes` to MAX_INT to avoid the optimization for default user scans. Clients should also be checked to ensure they do not pass the STREAM read type to the Scan API. This workaround will have a severe impact on performance for long scans.
10701
10702 Compactions always use this sequential optimized reading mechanism so downstream users will need to periodically restart Region Server roles after compactions have happened.
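
To put numbers on the threshold described in the summary, here is a minimal sketch of the arithmetic. The class and method names are hypothetical and are not HBase code. It assumes the switch-over point for default scans is 4 times the column family block size, as stated above, and it reads "MAX_INT" as Java's `Integer.MAX_VALUE`, which is an assumption.

```java
// Illustrative sketch only: a hypothetical helper, not part of HBase.
final class PreadThresholdSketch {
    // Default column family block size cited in the summary: 64 KiB.
    static final long DEFAULT_BLOCK_SIZE_BYTES = 64L * 1024;

    // Bytes a default-type scan may read before the switch to the
    // sequential (STREAM) path, per the summary: 4 * block size.
    static long defaultSwitchThreshold(long blockSizeBytes) {
        return 4L * blockSizeBytes;
    }

    public static void main(String[] args) {
        System.out.println("default threshold in bytes: "
            + defaultSwitchThreshold(DEFAULT_BLOCK_SIZE_BYTES));
        // Assumption: "MAX_INT" in the workaround means Integer.MAX_VALUE.
        System.out.println("workaround value: " + Integer.MAX_VALUE);
    }
}
```

With the default block size the threshold is small, so most non-trivial scans would cross it. That is why the workaround raises the configured value rather than relying on the default.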
10703
10704
10705 ---
10706
10707 * [HBASE-21550](https://issues.apache.org/jira/browse/HBASE-21550) | *Major* | **Add a new method preCreateTableRegionInfos for MasterObserver which allows CPs to modify the TableDescriptor**
10708
Adds a new method, preCreateTableRegionInfos, to MasterObserver. It is called before the region infos for the given table are created, and before the preCreateTable method. It allows you to return a new TableDescriptor to override the original one. Returning null or throwing an exception stops the creation.
10710
10711
10712 ---
10713
10714 * [HBASE-21492](https://issues.apache.org/jira/browse/HBASE-21492) | *Critical* | **CellCodec Written To WAL Before It's Verified**
10715
10716 After HBASE-21492 the return type of WALCellCodec#getWALCellCodecClass has been changed from String to Class
10717
10718
10719 ---
10720
10721 * [HBASE-21387](https://issues.apache.org/jira/browse/HBASE-21387) | *Major* | **Race condition surrounding in progress snapshot handling in snapshot cache leads to loss of snapshot files**
10722
To prevent a race condition between an in-progress snapshot (performed by TakeSnapshotHandler) and HFileCleaner, which results in data loss, this JIRA introduced mutual exclusion between taking a snapshot and running HFileCleaner. That is, at any given moment, either a snapshot can be taken or HFileCleaner can check for hfiles which are not referenced, but the two cannot run at the same time.
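
The mutual exclusion described above can be pictured with a single-permit gate. This is only a sketch under stated assumptions: `SnapshotCleanerGate` and its methods are hypothetical names, it models a single snapshot at a time, and it is not how HBase implements the fix.

```java
import java.util.concurrent.Semaphore;

// Illustrative sketch only: a hypothetical gate, not the HBase implementation.
final class SnapshotCleanerGate {
    // One permit: whoever holds it (snapshot or cleaner) excludes the other.
    private final Semaphore permit = new Semaphore(1);

    // Returns true if a snapshot may start now (the cleaner is not running).
    boolean tryBeginSnapshot() {
        return permit.tryAcquire();
    }

    void endSnapshot() {
        permit.release();
    }

    // Returns true if the cleaner may run now (no snapshot is in progress).
    boolean tryBeginCleaning() {
        return permit.tryAcquire();
    }

    void endCleaning() {
        permit.release();
    }

    public static void main(String[] args) {
        SnapshotCleanerGate gate = new SnapshotCleanerGate();
        System.out.println("snapshot started: " + gate.tryBeginSnapshot());
        System.out.println("cleaner may run during snapshot: " + gate.tryBeginCleaning());
        gate.endSnapshot();
        System.out.println("cleaner may run after snapshot: " + gate.tryBeginCleaning());
        gate.endCleaning();
    }
}
```

A semaphore is used here instead of a reentrant lock so that the exclusion does not depend on which thread asks.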
10724
10725
10726 ---
10727
10728 * [HBASE-21452](https://issues.apache.org/jira/browse/HBASE-21452) | *Major* | **Illegal character in hbase counters group name**
10729
10730 Changes group name of hbase metrics from "HBase Counters" to "HBaseCounters".
10731
10732
10733 ---
10734
10735 * [HBASE-21443](https://issues.apache.org/jira/browse/HBASE-21443) | *Major* | **[hbase-connectors] Purge hbase-\* modules from core now they've been moved to hbase-connectors**
10736
10737 Parent issue moved hbase-spark\* modules to hbase-connectors. This issue removes hbase-spark\* modules from hbase core repo.
10738
10739
10740 ---
10741
10742 * [HBASE-21430](https://issues.apache.org/jira/browse/HBASE-21430) | *Major* | **[hbase-connectors] Move hbase-spark\* modules to hbase-connectors repo**
10743
hbase-spark\* modules have been cloned to the https://github.com/apache/hbase-connectors repo. All spark connector dev is to happen in that repo from here on out.

A subtask, HBASE-21443, removes the hbase-spark\* modules from hbase core.
10747
10748
10749 ---
10750
10751 * [HBASE-21417](https://issues.apache.org/jira/browse/HBASE-21417) | *Critical* | **Pre commit build is broken due to surefire plugin crashes**
10752
10753 Add -Djdk.net.URLClassPath.disableClassPathURLCheck=true when executing surefire plugin.
10754
10755
10756 ---
10757
10758 * [HBASE-21191](https://issues.apache.org/jira/browse/HBASE-21191) | *Major* | **Add a holding-pattern if no assign for meta or namespace (Can happen if masterprocwals have been cleared).**
10759
Puts master startup into a holding pattern if meta is not assigned (previously it would exit). To make progress again, the operator needs to inject an assign (caveats and instructions can be found in HBASE-21035).
10761
10762
10763 ---
10764
10765 * [HBASE-21322](https://issues.apache.org/jira/browse/HBASE-21322) | *Critical* | **Add a scheduleServerCrashProcedure() API to HbckService**
10766
10767 Adds scheduleServerCrashProcedure to the HbckService.
10768
10769
10770 ---
10771
10772 * [HBASE-21325](https://issues.apache.org/jira/browse/HBASE-21325) | *Major* | **Force to terminate regionserver when abort hang in somewhere**
10773
Adds two new configs, hbase.regionserver.abort.timeout and hbase.regionserver.abort.timeout.task. If a regionserver abort times out, an abort timeout task is scheduled to run. The default abort timeout task is SystemExitWhenAbortTimeout, which forcibly terminates the region server when the abort times out. You can configure a custom abort timeout task via hbase.regionserver.abort.timeout.task.
10775
10776
10777 ---
10778
10779 * [HBASE-21215](https://issues.apache.org/jira/browse/HBASE-21215) | *Major* | **Figure how to invoke hbck2; make it easy to find**
10780
Adds a means of invoking hbck2 to bin/hbase. Pass the new '-j' option on the 'hbck' command with a value of the full path to the HBCK2.jar.
10782
10783 E.g:
10784
10785 $ ./bin/hbase hbck -j ~/checkouts/hbase-operator-tools/hbase-hbck2/target/hbase-hbck2-1.0.0-SNAPSHOT.jar  setTableState x ENABLED
10786
10787
10788 ---
10789
10790 * [HBASE-21372](https://issues.apache.org/jira/browse/HBASE-21372) | *Major* | **Set hbase.assignment.maximum.attempts to Long.MAX**
10791
10792 Retry assigns 'forever' (or until an intervention such as a ServerCrashProcedure).
10793
Previously, assigns were retried a maximum of ten times, and handling on failure was indeterminate.
10795
10796
10797 ---
10798
10799 * [HBASE-21338](https://issues.apache.org/jira/browse/HBASE-21338) | *Major* | **[balancer] If balancer is an ill-fit for cluster size, it gives little indication**
10800
The description claims the balancer is not dynamically configurable, but this is an error; it is. See http://hbase.apache.org/book.html#dyn\_config
10802
10803 Also, if balancer is seen to be cutting out too soon, try setting "hbase.master.balancer.stochastic.runMaxSteps" to true.
10804
10805 Adds cleaner logging around balancer start.
10806
10807
10808 ---
10809
10810 * [HBASE-21073](https://issues.apache.org/jira/browse/HBASE-21073) | *Major* | **"Maintenance mode" master**
10811
Instead of being an ephemeral state set by hbck, maintenance mode is now an explicit toggle set by either configuration property or environment variable. In maintenance mode, master will host system tables and not assign any user-space tables to RSs. This gives operators the ability to effect repairs to meta table with fewer moving parts.
10817
10818
10819 ---
10820
10821 * [HBASE-21335](https://issues.apache.org/jira/browse/HBASE-21335) | *Critical* | **Change the default wait time of HBCK2 tool**
10822
Renamed the waitTime parameter on bypass to lockWait. Changed the default from 0 (i.e. wait forever) to 1ms so that if the lock is held, we go past it and, if override is set, enforce the bypass.
10824
10825
10826 ---
10827
10828 * [HBASE-21291](https://issues.apache.org/jira/browse/HBASE-21291) | *Major* | **Add a test for bypassing stuck state-machine procedures**
10829
bypass will now throw an Exception if passed a lockWait \<= 0; i.e. bypass will prevent an operator from getting stuck waiting forever on an entity lock (lockWait == 0).
10831
10832
10833 ---
10834
10835 * [HBASE-21320](https://issues.apache.org/jira/browse/HBASE-21320) | *Major* | **[canary] Cleanup of usage and add commentary**
10836
10837 Cleans up usage and docs around Canary.  Does not change command-line args (though we should -- smile).
10838
10839
10840 ---
10841
10842 * [HBASE-21278](https://issues.apache.org/jira/browse/HBASE-21278) | *Critical* | **Do not rollback successful sub procedures when rolling back a procedure**
10843
10844 For the sub procedures which are successfully finished, do not do rollback. This is a change in rollback behavior.
10845
10846 State changes which are done by sub procedures should be handled by parent procedures when rolling back. For example, when rolling back a MergeTableProcedure, we will schedule new procedures to bring the offline regions online instead of rolling back the original procedures which off-lined the regions (in fact these procedures can not be rolled back...).
10847
10848
10849 ---
10850
10851 * [HBASE-21158](https://issues.apache.org/jira/browse/HBASE-21158) | *Critical* | **Empty qualifier cell should not be returned if it does not match QualifierFilter**
10852
10853 <!-- markdown -->
10854
Scans that make use of `QualifierFilter` previously would erroneously return columns with an empty qualifier along with those that matched. After this change, only the columns that match are returned.
10856
10857
10858 ---
10859
10860 * [HBASE-21098](https://issues.apache.org/jira/browse/HBASE-21098) | *Major* | **Improve Snapshot Performance with Temporary Snapshot Directory when rootDir on S3**
10861
10862 It is recommended to place the working directory on-cluster on HDFS as doing so has shown a strong performance increase due to data locality. It is important to note that the working directory should not overlap with any existing directories as the working directory will be cleaned out during the snapshot process. Beyond that, any well-named directory on HDFS should be sufficient.
10863
10864
10865 ---
10866
10867 * [HBASE-21185](https://issues.apache.org/jira/browse/HBASE-21185) | *Minor* | **WALPrettyPrinter: Additional useful info to be printed by wal printer tool, for debugability purposes**
10868
10869 This adds two extra features to WALPrettyPrinter tool:
10870
1) Outputs, for each cell in a given WAL edit, the combined size of the cell descriptors plus the cell value itself. This is printed in the results as "cell total size sum:" info by default;
10872
2) An optional -g/--goto argument that seeks straight to the given WAL file position and then reads the WAL sequentially from that point to its end.
10874
10875
10876 ---
10877
10878 * [HBASE-21287](https://issues.apache.org/jira/browse/HBASE-21287) | *Major* | **JVMClusterUtil Master initialization wait time not configurable**
10879
10880 Local HBase cluster (as used by unit tests) wait times on startup and initialization can be configured via \`hbase.master.start.timeout.localHBaseCluster\` and \`hbase.master.init.timeout.localHBaseCluster\`
10881
10882
10883 ---
10884
10885 * [HBASE-21280](https://issues.apache.org/jira/browse/HBASE-21280) | *Trivial* | **Add anchors for each heading in UI**
10886
10887 Adds anchors #tables, #tasks, etc.
10888
10889
10890 ---
10891
10892 * [HBASE-21232](https://issues.apache.org/jira/browse/HBASE-21232) | *Major* | **Show table state in Tables view on Master home page**
10893
10894 Add table state column to the tables panel
10895
10896
10897 ---
10898
10899 * [HBASE-21223](https://issues.apache.org/jira/browse/HBASE-21223) | *Critical* | **[amv2] Remove abort\_procedure from shell**
10900
10901 Removed the abort\_procedure command from shell -- dangerous -- and deprecated abortProcedure in Admin API.
10902
10903
10904 ---
10905
10906 * [HBASE-20636](https://issues.apache.org/jira/browse/HBASE-20636) | *Major* | **Introduce two bloom filter type : ROWPREFIX\_FIXED\_LENGTH and ROWPREFIX\_DELIMITED**
10907
Adds two bloom filter types: ROWPREFIX\_FIXED\_LENGTH and ROWPREFIX\_DELIMITED
1. ROWPREFIX\_FIXED\_LENGTH: specify the length of the prefix
2. ROWPREFIX\_DELIMITED: specify the delimiter of the prefix

Parameters must be specified for these two bloom filter types; otherwise table creation will fail.
Example:
10913 create 't1', {NAME =\> 'f1', BLOOMFILTER =\> 'ROWPREFIX\_FIXED\_LENGTH', CONFIGURATION =\> {'RowPrefixBloomFilter.prefix\_length' =\> '10'}}
10914 create 't1', {NAME =\> 'f1', BLOOMFILTER =\> 'ROWPREFIX\_DELIMITED', CONFIGURATION =\> {'RowPrefixDelimitedBloomFilter.delimiter' =\> '#'}}
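
To illustrate what the two parameters in the examples above select, here is a small sketch of extracting a row prefix by fixed length and by delimiter. It assumes these bloom filter types key on a prefix of the row key. The class and method names are hypothetical, and the handling of rows that are too short or lack the delimiter is an assumption, not documented HBase behavior.

```java
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

// Illustrative sketch only: hypothetical helpers, not HBase code.
final class RowPrefixSketch {
    // Fixed-length prefix: the first prefixLength bytes of the row key.
    // Assumption: rows shorter than prefixLength are used whole.
    static byte[] fixedLengthPrefix(byte[] row, int prefixLength) {
        return Arrays.copyOf(row, Math.min(prefixLength, row.length));
    }

    // Delimited prefix: the bytes before the first occurrence of the delimiter.
    // Assumption: rows without the delimiter are used whole.
    static byte[] delimitedPrefix(byte[] row, byte delimiter) {
        for (int i = 0; i < row.length; i++) {
            if (row[i] == delimiter) {
                return Arrays.copyOf(row, i);
            }
        }
        return row.clone();
    }

    public static void main(String[] args) {
        byte[] row = "user000042#2018-10-09".getBytes(StandardCharsets.UTF_8);
        // Mirrors prefix_length => '10' and delimiter => '#' from the shell examples.
        System.out.println(new String(fixedLengthPrefix(row, 10), StandardCharsets.UTF_8));
        System.out.println(new String(delimitedPrefix(row, (byte) '#'), StandardCharsets.UTF_8));
    }
}
```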
10915
10916
10917 ---
10918
10919 * [HBASE-21156](https://issues.apache.org/jira/browse/HBASE-21156) | *Critical* | **[hbck2] Queue an assign of hbase:meta and bulk assign/unassign**
10920
10921 Adds 'raw' assigns/unassigns to the Hbck Service. Takes a list of encoded region names and bulk assigns/unassigns. Skirts Master 'state' check and does not invoke Coprocessors. For repair only.
10922
10923 Here is what HBCK2 usage looks like now:
10924
10925 {code}
10926 $ java -cp hbase-hbck2-1.0.0-SNAPSHOT.jar  org.apache.hbase.HBCK2
10927 usage: HBCK2 \<OPTIONS\> COMMAND [\<ARGS\>]
10928
10929 Options:
10930  -d,--debug                      run with debug output
10931  -h,--help                       output this help message
10932     --hbase.zookeeper.peerport   peerport of target hbase ensemble
10933     --hbase.zookeeper.quorum     ensemble of target hbase
10934     --zookeeper.znode.parent     parent znode of target hbase
10935
10936 Commands:
10937  setTableState \<TABLENAME\> \<STATE\>
10938    Possible table states: ENABLED, DISABLED, DISABLING, ENABLING
10939    To read current table state, in the hbase shell run:
10940      hbase\> get 'hbase:meta', '\<TABLENAME\>', 'table:state'
10941    A value of \\x08\\x00 == ENABLED, \\x08\\x01 == DISABLED, etc.
10942    An example making table name 'user' ENABLED:
10943      $ HBCK2 setTableState users ENABLED
10944    Returns whatever the previous table state was.
10945
10946  assign \<ENCODED\_REGIONNAME\> ...
10947    A 'raw' assign that can be used even during Master initialization.
10948    Skirts Coprocessors. Pass one or more encoded RegionNames:
10949    e.g. 1588230740 is hard-coded encoding for hbase:meta region and
10950    de00010733901a05f5a2a3a382e27dd4 is an example of what a random
10951    user-space encoded Region name looks like. For example:
10952      $ HBCK2 assign 1588230740 de00010733901a05f5a2a3a382e27dd4
10953    Returns the pid of the created AssignProcedure or -1 if none.
10954
10955  unassign \<ENCODED\_REGIONNAME\> ...
10956    A 'raw' unassign that can be used even during Master initialization.
10957    Skirts Coprocessors. Pass one or more encoded RegionNames:
10959    de00010733901a05f5a2a3a382e27dd4 is an example of what a random
10960    user-space encoded Region name looks like. For example:
10961      $ HBCK2 unassign 1588230740 de00010733901a05f5a2a3a382e27dd4
10962    Returns the pid of the created UnassignProcedure or -1 if none.
10963 {code}
10964
10965
10966 ---
10967
10968 * [HBASE-21021](https://issues.apache.org/jira/browse/HBASE-21021) | *Major* | **Result returned by Append operation should be ordered**
10969
10970 This change ensures Append operations are assembled into the expected order.
10971
10972
10973 ---
10974
10975 * [HBASE-21171](https://issues.apache.org/jira/browse/HBASE-21171) | *Major* | **[amv2] Tool to parse a directory of MasterProcWALs standalone**
10976
Makes it possible to run the WAL parse and load system in isolation. Here is an example:
10978
10979 {code}$ HBASE\_OPTS=" -XX:+UnlockDiagnosticVMOptions -XX:+UnlockCommercialFeatures -XX:+FlightRecorder -XX:+DebugNonSafepoints" ./bin/hbase org.apache.hadoop.hbase.procedure2.store.wal.WALProcedureStore ~/big\_set\_of\_masterprocwals/
10980 {code}
10981
10982
10983 ---
10984
10985 * [HBASE-21107](https://issues.apache.org/jira/browse/HBASE-21107) | *Minor* | **add a metrics for netty direct memory**
10986
10987 Add a new nettyDirectMemoryUsage under server's ipc metrics to show direct memory usage for netty rpc server.
10988
10989
10990 ---
10991
10992 * [HBASE-21153](https://issues.apache.org/jira/browse/HBASE-21153) | *Major* | **Shaded client jars should always build in relevant phase to avoid confusion**
10993
10994 Client facing artifacts are now built whenever Maven is run through the "package" goal. Previously, the client facing artifacts would create placeholder jars that skipped repackaging HBase and third-party dependencies unless the "release" profile was active.
10995
10996 Build times may be noticeably longer depending on your build hardware. For example, the Jenkins worker nodes maintained by ASF Infra take ~14% longer to do a full packaging build. An example portability-focused personal laptop took ~25% longer.
10997
10998
10999 ---
11000
11001 * [HBASE-20942](https://issues.apache.org/jira/browse/HBASE-20942) | *Major* | **Improve RpcServer TRACE logging**
11002
11003 Allows configuration of the length of RPC messages printed to the log at TRACE level via "hbase.ipc.trace.param.size" in RpcServer.
11004
11005
11006 ---
11007
11008 * [HBASE-20649](https://issues.apache.org/jira/browse/HBASE-20649) | *Minor* | **Validate HFiles do not have PREFIX\_TREE DataBlockEncoding**
11009
11010 <!-- markdown -->
11011 Users who have previously made use of prefix tree encoding can now check that their existing HFiles no longer contain data that uses it with an additional preupgrade check command.
11012
11013 ```
11014 hbase pre-upgrade validate-hfile
11015 ```
11016
11017 Please see the "HFile Content validation" section of the ref guide's coverage of the pre-upgrade validator tool for usage details.
11018
11019
11020 ---
11021
11022 * [HBASE-20941](https://issues.apache.org/jira/browse/HBASE-20941) | *Major* | **Create and implement HbckService in master**
11023
11024 Adds an HBCK Service and a first method to force-change-in-table-state for use by an HBCK client effecting 'repair' to a malfunctioning HBase.
11025
11026
11027 ---
11028
11029 * [HBASE-21071](https://issues.apache.org/jira/browse/HBASE-21071) | *Major* | **HBaseTestingUtility::startMiniCluster() to use builder pattern**
11030
11031 Cleanup all the cluster start override combos in HBaseTestingUtility by adding a StartMiniClusterOption and Builder.
11032
11033
11034 ---
11035
11036 * [HBASE-21072](https://issues.apache.org/jira/browse/HBASE-21072) | *Major* | **Block out HBCK1 in hbase2**
11037
11038 Fence out hbase-1.x hbck1 instances. Stop them making state changes on an hbase-2.x cluster; they could do damage. We do this by writing the hbck1 lock file into place on hbase-2.x Master start-up.
11039
11040 To disable this new behavior, set hbase.write.hbck1.lock.file to false
11041
11042
11043 ---
11044
11045 * [HBASE-20881](https://issues.apache.org/jira/browse/HBASE-20881) | *Major* | **Introduce a region transition procedure to handle all the state transition for a region**
11046
Introduced a new TransitRegionStateProcedure (TRSP) to replace the old AssignProcedure/UnassignProcedure/MoveRegionProcedure. In the old code, MoveRegionProcedure (MRP) was not attached to the RegionStateNode (RSN), so it could not be interrupted by ServerCrashProcedure (SCP). This introduced lots of tricky code to deal with races, and also made it difficult to prevent scheduling redundant or even conflicting procedures for a region.

TRSP is now the only procedure which can bring a region online or offline. When you want to schedule one, you need to check, under the lock of the RegionStateNode, whether there is already one attached to the RegionStateNode. If not, just go ahead; if there is one, then you should do something about it, for example give up and fail directly, or tell the TRSP to give up (this is what SCP does). Since the check and attach are both done under the lock of the RSN, this greatly reduces the possible races and makes the code much simpler.
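
The check-and-attach step can be sketched as follows. This is a simplified model, not HBase code: `RegionStateNodeSketch` is a hypothetical stand-in for RegionStateNode, a plain `Object` stands in for a TRSP, and the node's monitor stands in for its lock.

```java
// Illustrative sketch only: hypothetical stand-ins, not the HBase classes.
final class RegionStateNodeSketch {
    // The procedure currently attached to this region, guarded by this node's monitor.
    private Object attachedProcedure;

    // Check and attach happen atomically under the node's lock, so two
    // callers can never both believe they own the region's transition.
    synchronized boolean tryAttach(Object procedure) {
        if (attachedProcedure != null) {
            return false; // another procedure is already transitioning this region
        }
        attachedProcedure = procedure;
        return true;
    }

    // Detaches the procedure once it finishes; ignored if it is not the attached one.
    synchronized void detach(Object procedure) {
        if (attachedProcedure == procedure) {
            attachedProcedure = null;
        }
    }

    public static void main(String[] args) {
        RegionStateNodeSketch node = new RegionStateNodeSketch();
        Object first = new Object();
        Object second = new Object();
        System.out.println("first attached: " + node.tryAttach(first));
        System.out.println("second attached while first runs: " + node.tryAttach(second));
        node.detach(first);
        System.out.println("second attached after first ends: " + node.tryAttach(second));
    }
}
```

A caller that gets `false` back is in the "there is already one attached" case and has to decide whether to give up or ask the attached procedure to give up.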
11050
11051
11052 ---
11053
11054 * [HBASE-21012](https://issues.apache.org/jira/browse/HBASE-21012) | *Critical* | **Revert the change of serializing TimeRangeTracker**
11055
HFiles generated by 2.0.0, 2.0.1, and 2.1.0 are not forward compatible with 1.4.6 and earlier, 1.3.2.1 and earlier, 1.2.6.1 and earlier, and other inactive releases. The HFiles lose compatibility because the new versions (2.0.0, 2.0.1, 2.1.0) use protobuf to serialize/deserialize TimeRangeTracker (TRT), while old versions use DataInput/DataOutput. To solve this, HBASE-21012 goes into 2.x and HBASE-21013 goes into 1.x. For more information, please check HBASE-21008.
11057
11058
11059 ---
11060
11061 * [HBASE-20965](https://issues.apache.org/jira/browse/HBASE-20965) | *Major* | **Separate region server report requests to new handlers**
11062
After HBASE-20965, we can use MasterFifoRpcScheduler in the master to separate RegionServerReport requests into independent handlers. To use this feature, set "hbase.master.rpc.scheduler.factory.class" to "org.apache.hadoop.hbase.ipc.MasterFifoRpcScheduler". Use "hbase.master.server.report.handler.count" to set the number of RegionServerReport handlers; the default is half of the "hbase.regionserver.handler.count" value, but at least 1. The number of other handlers in the master is the "hbase.regionserver.handler.count" value minus the number of RegionServerReport handlers, also at least 1.
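
The default handler split described above is simple arithmetic, sketched below. The class and method names are hypothetical, "half" is assumed to mean integer division, and the input values are samples only.

```java
// Illustrative sketch only: hypothetical helper, not HBase code.
final class MasterHandlerCountsSketch {
    // Default RegionServerReport handlers: half of the regionserver
    // handler count (integer division assumed), but at least 1.
    static int reportHandlers(int regionServerHandlerCount) {
        return Math.max(1, regionServerHandlerCount / 2);
    }

    // Remaining master handlers: the regionserver handler count minus
    // the report handlers, but at least 1.
    static int otherHandlers(int regionServerHandlerCount, int reportHandlerCount) {
        return Math.max(1, regionServerHandlerCount - reportHandlerCount);
    }

    public static void main(String[] args) {
        for (int handlerCount : new int[] {30, 3, 1}) {
            int report = reportHandlers(handlerCount);
            int other = otherHandlers(handlerCount, report);
            System.out.println("handler.count=" + handlerCount
                + " -> report=" + report + ", other=" + other);
        }
    }
}
```

Because both values are clamped to at least 1, a very small handler count yields more handlers in total than the configured count.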
11065
11066
11067 ---
11068
11069 * [HBASE-20813](https://issues.apache.org/jira/browse/HBASE-20813) | *Minor* | **Remove RPC quotas when the associated table/Namespace is dropped off**
11070
11071 In previous releases, when a Space Quota was configured on a table or namespace and that table or namespace was deleted, the Space Quota was also deleted. This change improves the implementation so that the same is also done for RPC Quotas.
11072
11073
11074 ---
11075
11076 * [HBASE-20986](https://issues.apache.org/jira/browse/HBASE-20986) | *Major* | **Separate the config of block size when we do log splitting and write Hlog**
11077
After HBASE-20986, the block size of the WAL and of recovered edits can be set separately. Both default to 2 \* the default HDFS block size. hbase.regionserver.recoverededits.blocksize sets the block size of recovered edits, while hbase.regionserver.hlog.blocksize sets the block size of the WAL.
11079
11080
11081 ---
11082
11083 * [HBASE-20856](https://issues.apache.org/jira/browse/HBASE-20856) | *Minor* | **PITA having to set WAL provider in two places**
11084
With this change, if a WAL's meta provider (hbase.wal.meta\_provider) is not explicitly set, it now defaults to whatever hbase.wal.provider is set to. Previously, the two settings operated independently, each with its own default.
11086
11087 This change is operationally incompatible with previous HBase versions because the default WAL meta provider no longer defaults to AsyncFSWALProvider but to hbase.wal.provider.
11088
11089 The thought is that this is more in line with an operator's expectation, that a change in hbase.wal.provider is sufficient to change how WALs are written, especially given hbase.wal.meta\_provider is an obscure configuration and that the very idea that meta regions would have their own wal provider would likely come as a surprise.
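
The new resolution order can be sketched with a plain map standing in for the HBase configuration. The class and method names are hypothetical, and the built-in default provider is passed in as a parameter rather than assumed.

```java
import java.util.HashMap;
import java.util.Map;

// Illustrative sketch only: a Map stands in for the HBase Configuration.
final class WalProviderDefaultsSketch {
    static final String WAL_PROVIDER = "hbase.wal.provider";
    static final String META_WAL_PROVIDER = "hbase.wal.meta_provider";

    // After this change: an explicit meta provider wins; otherwise the meta
    // provider follows whatever hbase.wal.provider resolves to.
    static String resolveMetaProvider(Map<String, String> conf, String defaultProvider) {
        String walProvider = conf.getOrDefault(WAL_PROVIDER, defaultProvider);
        return conf.getOrDefault(META_WAL_PROVIDER, walProvider);
    }

    public static void main(String[] args) {
        Map<String, String> conf = new HashMap<>();
        conf.put(WAL_PROVIDER, "filesystem");
        // Only hbase.wal.provider is set, so the meta provider follows it.
        System.out.println(resolveMetaProvider(conf, "asyncfs"));
        conf.put(META_WAL_PROVIDER, "asyncfs");
        // An explicit meta provider still takes precedence.
        System.out.println(resolveMetaProvider(conf, "asyncfs"));
    }
}
```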
11090
11091
11092 ---
11093
11094 * [HBASE-20538](https://issues.apache.org/jira/browse/HBASE-20538) | *Critical* | **Upgrade our hadoop versions to 2.7.7 and 3.0.3**
11095
11096 Update hadoop-two.version to 2.7.7 and hadoop-three.version to 3.0.3 due to a JDK issue which is solved by HADOOP-15473.
11097
11098
11099 ---
11100
11101 * [HBASE-20846](https://issues.apache.org/jira/browse/HBASE-20846) | *Major* | **Restore procedure locks when master restarts**
11102
11103 1. Make hasLock method final, and add a locked field in Procedure to record whether we have the lock. We will set it to true in doAcquireLock and to false in doReleaseLock. The sub procedures do not need to manage it any more.
11104
11105 2. Also added a locked field in the proto message. When storing, the field will be set according to the return value of hasLock. And when loading, there is a new field in Procedure called lockedWhenLoading. We will set it to true if the locked field in proto message is true.
11106
3. The reason why we can not set the locked field directly to true by calling doAcquireLock is that, during initialization, most procedures need to wait until master is initialized. So the solution here is that we introduced a new method called waitInitialized in Procedure, and moved the wait-for-master-initialized code from acquireLock to this method. We also added a restoreLock method to Procedure: if lockedWhenLoading is true, we call acquireLock to get the lock, but do not set locked to true. Later, when we call doAcquireLock and pass the waitInitialized check, we test lockedWhenLoading; if it is true, we just set the locked field to true and return, without actually calling the acquireLock method, since we have already called it once.
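
The three steps can be modeled in a few lines. This is a simplified sketch, not the Procedure class itself: the method and field names follow the description above, while the class name, the boolean return values, the call counter, and resetting lockedWhenLoading after use are assumptions made for illustration.

```java
// Illustrative sketch only: a simplified model of the described lock restore.
final class ProcedureLockSketch {
    boolean locked;            // whether this procedure currently holds the lock
    boolean lockedWhenLoading; // set from the stored 'locked' field on load
    boolean masterInitialized; // stands in for the master's initialization state
    int acquireLockCalls;      // counts real lock acquisitions, for illustration

    ProcedureLockSketch(boolean lockedInStore) {
        this.lockedWhenLoading = lockedInStore;
    }

    private void acquireLock() {
        acquireLockCalls++;
    }

    // Returns true if the procedure still has to wait for the master.
    private boolean waitInitialized() {
        return !masterInitialized;
    }

    // Called while loading: take the lock back, but do not set locked yet.
    void restoreLock() {
        if (lockedWhenLoading) {
            acquireLock();
        }
    }

    // Returns true if the lock is held after the call.
    boolean doAcquireLock() {
        if (waitInitialized()) {
            return false;
        }
        if (lockedWhenLoading) {
            lockedWhenLoading = false; // assumption: the flag is consumed once
            locked = true;             // lock was already taken in restoreLock()
            return true;
        }
        acquireLock();
        locked = true;
        return true;
    }

    public static void main(String[] args) {
        ProcedureLockSketch proc = new ProcedureLockSketch(true);
        proc.restoreLock();
        System.out.println("before master init: " + proc.doAcquireLock());
        proc.masterInitialized = true;
        System.out.println("after master init: " + proc.doAcquireLock());
        System.out.println("acquireLock calls: " + proc.acquireLockCalls);
    }
}
```

The point of the model is that the underlying lock is acquired exactly once for a procedure that was locked when it was stored.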
11108
11109
11110 ---
11111
11112 * [HBASE-20672](https://issues.apache.org/jira/browse/HBASE-20672) | *Minor* | **New metrics ReadRequestRate and WriteRequestRate**
11113
Exposes two new metrics in HBase, ReadRequestRate and WriteRequestRate, at the region server level. These metrics give the rate of requests handled by the region server and are reset after every monitoring interval.
11115
11116
11117 ---
11118
11119 * [HBASE-6028](https://issues.apache.org/jira/browse/HBASE-6028) | *Minor* | **Implement a cancel for in-progress compactions**
11120
11121 Added a new command to the shell to switch on/off compactions called "compaction\_switch". Disabling compactions will interrupt any currently ongoing compactions. This setting will be lost on restart of the server. Added the configuration hbase.regionserver.compaction.enabled so user can enable/disable compactions via hbase-site.xml.
11122
11123
11124 ---
11125
11126 * [HBASE-20884](https://issues.apache.org/jira/browse/HBASE-20884) | *Major* | **Replace usage of our Base64 implementation with java.util.Base64**
11127
Class org.apache.hadoop.hbase.util.Base64 has been removed in its entirety from HBase 2+. In HBase 1, unused methods have been removed from the class and the audience was changed from Public to Private. This class was originally intended as an internal utility class that could be used externally, but thinking has since changed; these classes should not have been advertised as public to end-users.
11129
11130 This represents an incompatible change for users who relied on this implementation. An alternative implementation for affected clients is available at java.util.Base64 when using Java 8 or newer; be aware, it may encode/decode differently. For clients seeking to restore this specific implementation, it is available in the public domain for download at http://iharder.sourceforge.net/current/java/base64/
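
For clients moving to the JDK implementation, a minimal round trip with java.util.Base64 looks like the sketch below. The wrapper class is hypothetical. As noted above, the output is not guaranteed to match what the removed HBase class produced, so stored values encoded by the old class should be verified before switching.

```java
import java.nio.charset.StandardCharsets;
import java.util.Base64;

// Illustrative sketch only: a thin wrapper over the JDK's Base64 (Java 8+).
final class Base64MigrationSketch {
    // Encodes with the basic (RFC 4648) alphabet and no line breaks.
    static String encode(byte[] raw) {
        return Base64.getEncoder().encodeToString(raw);
    }

    // Decodes a string produced by the basic encoder.
    static byte[] decode(String encoded) {
        return Base64.getDecoder().decode(encoded);
    }

    public static void main(String[] args) {
        byte[] raw = "hbase".getBytes(StandardCharsets.UTF_8);
        String encoded = encode(raw);
        System.out.println(encoded);
        System.out.println(new String(decode(encoded), StandardCharsets.UTF_8));
    }
}
```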
11131
11132
11133 ---
11134
11135 * [HBASE-20357](https://issues.apache.org/jira/browse/HBASE-20357) | *Major* | **AccessControlClient API Enhancement**
11136
This enhances the AccessControlClient APIs to retrieve permissions based on namespace, table name, family and qualifier for a specific user. AccessControlClient can also validate whether a user is allowed to perform specified operations on a particular table.
The following APIs have been added:
1) getUserPermissions(Connection connection, String tableRegex, byte[] columnFamily, byte[] columnQualifier, String userName)
     The scope of retrieving permissions is the same as for the existing APIs.
2) hasPermission(Connection connection, String tableName, byte[] columnFamily, byte[] columnQualifier, String userName, Permission.Action... actions)
     Scope of validating user privilege:
       A user can perform a self check without any special privilege, but ADMIN privilege is required to perform the check for other users.
       For example, suppose there are two users, "userA" and "userB". Then there are the following scenarios:
         a. When userA wants to check whether userA has the privilege to perform the mentioned actions,
            userA doesn't need ADMIN privilege, as it's a self query.
         b. When userA wants to check whether userB has the privilege to perform the mentioned actions,
            userA must have ADMIN or superuser privilege, as it's querying for another user.
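
The scoping rule for hasPermission boils down to one condition, sketched here. The class and method are hypothetical and only model who is allowed to ask; they do not call AccessControlClient.

```java
// Illustrative sketch only: models the scoping rule, not the HBase API.
final class PermissionCheckScopeSketch {
    // A caller may always check its own permissions; checking another
    // user's permissions requires ADMIN or superuser privilege.
    static boolean mayCheckPermissions(String caller, String target,
                                       boolean callerIsAdminOrSuperuser) {
        return caller.equals(target) || callerIsAdminOrSuperuser;
    }

    public static void main(String[] args) {
        // Scenario a: userA checks userA, no special privilege needed.
        System.out.println(mayCheckPermissions("userA", "userA", false));
        // Scenario b: userA checks userB without ADMIN privilege.
        System.out.println(mayCheckPermissions("userA", "userB", false));
        // Scenario b again, this time with ADMIN or superuser privilege.
        System.out.println(mayCheckPermissions("userA", "userB", true));
    }
}
```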
11149
11150
11151
11152 # HBASE  2.1.0 Release Notes
11153
11154 These release notes cover new developer and user-facing incompatibilities, important issues, features, and major improvements.
11155
11156
11157 ---
11158
11159 * [HBASE-20691](https://issues.apache.org/jira/browse/HBASE-20691) | *Blocker* | **Storage policy should allow deferring to HDFS**
11160
After HBASE-20691 we have changed the default setting of hbase.wal.storage.policy from "HOT" back to "NONE", which means we defer the policy to HDFS. This fixes the problem in release 2.0.0 where the storage policy of the WAL directory deferred to HDFS and might not be "HOT" even if you explicitly set hbase.wal.storage.policy to "HOT".
11162
11163
11164 ---
11165
11166 * [HBASE-20839](https://issues.apache.org/jira/browse/HBASE-20839) | *Blocker* | **Fallback to FSHLog if we can not instantiated AsyncFSWAL when user does not specify AsyncFSWAL explicitly**
11167
Because we hack into the internals of DFSClient when implementing AsyncFSWAL to get better performance, a patch release of hadoop can break it.

So now, if the user does not specify a wal provider, we will first try to use 'asyncfs', i.e. the AsyncFSWALProvider. If that fails due to compatibility issues, we will fall back to 'filesystem', i.e. FSHLog.
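
The selection logic can be sketched as below. The class, the method, and the `canInstantiate` predicate are hypothetical stand-ins for whatever compatibility probe HBase performs. Only the provider names 'asyncfs' and 'filesystem' come from the note above.

```java
import java.util.function.Predicate;

// Illustrative sketch only: hypothetical selection logic, not HBase code.
final class WalProviderFallbackSketch {
    // configured == null models "the user did not specify a wal provider".
    static String chooseProvider(String configured, Predicate<String> canInstantiate) {
        if (configured != null) {
            return configured; // an explicit choice is honored, with no fallback
        }
        // Unspecified: prefer asyncfs, fall back to filesystem if it cannot be used.
        return canInstantiate.test("asyncfs") ? "asyncfs" : "filesystem";
    }

    public static void main(String[] args) {
        // Simulates a hadoop release on which AsyncFSWAL cannot be instantiated.
        Predicate<String> incompatibleHadoop = provider -> !provider.equals("asyncfs");
        System.out.println(chooseProvider(null, incompatibleHadoop));
        System.out.println(chooseProvider(null, provider -> true));
    }
}
```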
11171
11172
11173 ---
11174
11175 * [HBASE-20193](https://issues.apache.org/jira/browse/HBASE-20193) | *Critical* | **Basic Replication Web UI - Regionserver**
11176
After HBASE-20193, we added a section to the web UI to show the replication status of each wal group. This section has 2 parts; both show the peerId, wal group and current replicating log of each replication source. One shows information about the replication log queue, i.e. size of current log, log queue size and replicating offset. The other shows the delay of replication, i.e. last shipped age and replication delay.
If the offset shows -1 and the replication delay is UNKNOWN, replication has not started. This may be because the peer is disabled or the replicationEndpoint is sleeping for some reason.
11179
11180
11181 ---
11182
11183 * [HBASE-19997](https://issues.apache.org/jira/browse/HBASE-19997) | *Blocker* | **[rolling upgrade] 1.x =\> 2.x**
11184
Now we have a solution that 'basically works' for rolling upgrade from 1.4.x to 2.x. Please see the "Rolling Upgrade from 1.x to 2.x" section in the ref guide for more details.
11186
11187
11188 ---
11189
11190 * [HBASE-20270](https://issues.apache.org/jira/browse/HBASE-20270) | *Major* | **Turn off command help that follows all errors in shell**
11191
11192 <!-- markdown -->
The command help that previously followed all errors is no longer shown. Erroneous command inputs now just show the error text followed by the shell command to run to see the help message. It looks like: For usage try 'help "create"'. Operators can copy-paste the command to get the help message.
11194
11195
11196 ---
11197
11198 * [HBASE-20194](https://issues.apache.org/jira/browse/HBASE-20194) | *Critical* | **Basic Replication WebUI - Master**
11199
11200 After HBASE-20194, we added 2 parts to master's web page.
One is Peers, which shows all replication peers and some of their configurations, like peer id, cluster key, state, bandwidth, and which namespaces or tables it will replicate.
The other is the replication status of all regionservers. We added a tab to the region servers division, where we can check the replication delay of all region servers for any peer. This table shows AgeOfLastShippedOp, SizeOfLogQueue and ReplicationLag for each regionserver, and the table is sorted by ReplicationLag in descending order. This way we can easily find the problematic region server. If the replication delay is UNKNOWN, the walGroup has not started replicating yet and it may be disabled. ReplicationLag will update once this peer starts replicating.
11203
11204
11205 ---
11206
11207 * [HBASE-18569](https://issues.apache.org/jira/browse/HBASE-18569) | *Major* | **Add prefetch support for async region locator**
11208
11209 Adds prefetch support to the async region locator. The default prefetch limit is 10. Set 'hbase.client.locate.prefetch.limit' in hbase-site.xml to use another value.
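
As a minimal sketch, an override in hbase-site.xml might look like the following. The property name and the default of 10 come from the note above; the value 20 is only an assumed example.

```xml
<!-- hbase-site.xml: illustrative override of the prefetch limit.
     The value 20 is an assumed example; the default is 10. -->
<property>
  <name>hbase.client.locate.prefetch.limit</name>
  <value>20</value>
</property>
```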
11210
11211
11212 ---
11213
11214 * [HBASE-20642](https://issues.apache.org/jira/browse/HBASE-20642) | *Major* | **IntegrationTestDDLMasterFailover throws 'InvalidFamilyOperationException**
11215
11216 This changes client-side nonce generation to use the same nonce for re-submissions of client RPC DDL operations.
11217
11218
11219 ---
11220
11221 * [HBASE-20708](https://issues.apache.org/jira/browse/HBASE-20708) | *Blocker* | **Remove the usage of RecoverMetaProcedure in master startup**
11222
11223 Introduces an InitMetaProcedure to initialize the meta table for a new HBase deploy. Marks RecoverMetaProcedure as deprecated and removes its usage from the current code base. We still need to keep it in place for compatibility. The code in RecoverMetaProcedure has been moved to ServerCrashProcedure; SCP will always be enabled and we will rely on it to bring the meta region online.
11224
11225 For more on the issue addressed by this commit, see the design doc for overview and plan: https://docs.google.com/document/d/1\_872oHzrhJq4ck7f6zmp1J--zMhsIFvXSZyX1Mxg5MA/edit#heading=h.xy1z4alsq7uy
11226
11227
11228 ---
11229
11230 * [HBASE-20334](https://issues.apache.org/jira/browse/HBASE-20334) | *Major* | **add a test that expressly uses both our shaded client and the one from hadoop 3**
11231
11232 <!-- markdown -->
11233
11234 HBase now includes a helper script in `dev-support` that can be used to run a basic functionality test for a given HBase installation. The test can optionally be given an HBase client artifact to rely on and can optionally be given specific Hadoop client artifacts to use.
11235
11236 For usage information see `./dev-support/hbase_nightly_pseudo-distributed-test.sh --help`.
11237
11238 The project nightly tests now make use of this test to check running on top of Hadoop 2, Hadoop 3, and Hadoop 3 with shaded client artifacts.
11239
11240
11241 ---
11242
11243 * [HBASE-19735](https://issues.apache.org/jira/browse/HBASE-19735) | *Major* | **Create a minimal "client" tarball installation**
11244
11245 <!-- markdown -->
11246
11247 The HBase convenience binary artifacts now include a client-focused tarball that a) includes more docs and b) does not include scripts or jars only needed for running HBase cluster services.
11248
11249 The new artifact is made as a normal part of the `assembly:single` maven command.
11250
11251
11252 ---
11253
11254 * [HBASE-20615](https://issues.apache.org/jira/browse/HBASE-20615) | *Major* | **emphasize use of shaded client jars when they're present in an install**
11255
11256 <!-- markdown -->
11257
11258 HBase's built-in scripts now rely on the downstream-facing shaded artifacts where possible. Of particular interest to downstream users, the `hbase classpath` and `hbase mapredcp` commands now return the relevant shaded client artifact and only those third-party jars needed to make use of them (e.g. slf4j-api, commons-logging, htrace, etc).
11259
11260 Downstream users should note that by default the `hbase classpath` command will treat having `hadoop` on the shell's PATH as an implicit request to include the output of the `hadoop classpath` command in the returned classpath. This long-existing behavior can be opted out of by setting the environment variable `HBASE_DISABLE_HADOOP_CLASSPATH_LOOKUP` to the value "true". For example: `HBASE_DISABLE_HADOOP_CLASSPATH_LOOKUP="true" bin/hbase classpath`.
11261
11262
11263 ---
11264
11265 * [HBASE-20333](https://issues.apache.org/jira/browse/HBASE-20333) | *Critical* | **break up shaded client into one with no Hadoop and one that's standalone**
11266
11267 <!-- markdown -->
11268
11269 Downstream users who need to use both HBase and Hadoop APIs should switch to relying on the new `hbase-shaded-client-byo-hadoop` artifact rather than the existing `hbase-shaded-client` artifact. The new artifact no longer includes any Hadoop classes.
11270
11271 It should work in combination with either the output of `hadoop classpath` or the Hadoop provided client-facing shaded artifacts in Hadoop 3+.
11272
11273
11274 ---
11275
11276 * [HBASE-20332](https://issues.apache.org/jira/browse/HBASE-20332) | *Critical* | **shaded mapreduce module shouldn't include hadoop**
11277
11278 <!-- markdown -->
11279
11280 The `hbase-shaded-mapreduce` artifact no longer includes its own copy of Hadoop classes. Users who make use of the artifact via YARN should be able to get these classes from YARN's classpath without having to make any changes.
11281
11282
11283 ---
11284
11285 * [HBASE-20681](https://issues.apache.org/jira/browse/HBASE-20681) | *Major* | **IntegrationTestDriver fails after HADOOP-15406 due to missing hamcrest-core**
11286
11287 <!-- markdown -->
11288
11289 Users of our integration tests on Hadoop 3 can now add all needed dependencies by pointing at jars included in our binary convenience artifact.
11290
11291 Prior to this fix, downstream users on Hadoop 3 would need to get a copy of the Hamcrest v1.3 jar from elsewhere.
11292
11293
11294 ---
11295
11296 * [HBASE-19852](https://issues.apache.org/jira/browse/HBASE-19852) | *Major* | **HBase Thrift 1 server SPNEGO Improvements**
11297
11298 Adds two new properties for hbase-site.xml for THRIFT SPNEGO when in HTTP mode:
11299 \* hbase.thrift.spnego.keytab.file
11300 \* hbase.thrift.spnego.principal
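
A hedged sketch of how these two properties might be set in hbase-site.xml follows. Only the property names come from the note above; the keytab path and the principal are hypothetical placeholders to be replaced with values from your own Kerberos setup.

```xml
<!-- hbase-site.xml: Thrift SPNEGO settings for HTTP mode.
     Both values are hypothetical placeholders, not defaults. -->
<property>
  <name>hbase.thrift.spnego.keytab.file</name>
  <value>/path/to/spnego.service.keytab</value>
</property>
<property>
  <name>hbase.thrift.spnego.principal</name>
  <value>HTTP/thrift-host.example.com@EXAMPLE.COM</value>
</property>
```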
11301
11302
11303 ---
11304
11305 * [HBASE-20590](https://issues.apache.org/jira/browse/HBASE-20590) | *Critical* | **REST Java client is not able to negotiate with the server in the secure mode**
11306
11307 Adds negotiation logic between a secure Java REST client and server. After this change, the Java REST client responds to the Negotiate challenge sent by the server. Adds RESTDemoClient, which can be used to verify whether the secure Java REST client works against a secure REST server.
11308
11309
11310 ---
11311
11312 * [HBASE-20634](https://issues.apache.org/jira/browse/HBASE-20634) | *Critical* | **Reopen region while server crash can cause the procedure to be stuck**
11313
11314 A second attempt at fixing HBASE-20173. Fixes unfinished keeping of server state inside AM (ONLINE=\>SPLITTING=\>OFFLINE=\>null). Concurrent unassigns look at server state to figure if they should wait on SCP to wake them up or not.
11315
11316
11317 ---
11318
11319 * [HBASE-20579](https://issues.apache.org/jira/browse/HBASE-20579) | *Minor* | **Improve snapshot manifest copy in ExportSnapshot**
11320
11321 This patch adds an FSUtil.copyFilesParallel() to help copy files in parallel; it returns the paths of all directories and files traversed. When we copy the manifest in ExportSnapshot, we can therefore copy reference files concurrently and use the returned paths for setOwner and setPermission.
11322 The size of the thread pool is determined by the configuration snapshot.export.copy.references.threads, and its default value is the number of processors available to the runtime.
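
As an illustration only, the thread pool size could be pinned with a fragment like the one below, assuming the property is supplied through the configuration that ExportSnapshot loads (for example hbase-site.xml). The value 8 is an assumed example, not a recommendation.

```xml
<!-- Illustrative: fix the ExportSnapshot reference-copy pool at 8 threads.
     8 is an assumed example; the default is the number of available processors. -->
<property>
  <name>snapshot.export.copy.references.threads</name>
  <value>8</value>
</property>
```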
11323
11324
11325 ---
11326
11327 * [HBASE-18116](https://issues.apache.org/jira/browse/HBASE-18116) | *Major* | **Replication source in-memory accounting should not include bulk transfer hfiles**
11328
11329 Before this change we would incorrectly include the size of enqueued store files for bulk replication in the calculation for determining whether or not to rate limit the transfer of WAL edits. Because bulk replication uses a separate and asynchronous mechanism for file transfer, this could incorrectly limit the batch sizes for WAL replication while bulk replication was in progress, with negative impact on latency and throughput.
11330
11331
11332 ---
11333
11334 * [HBASE-20592](https://issues.apache.org/jira/browse/HBASE-20592) | *Minor* | **Create a tool to verify tables do not have prefix tree encoding**
11335
11336 PreUpgradeValidator tool with DataBlockEncoding validator was added to verify cluster is upgradable to HBase 2.
11337
11338
11339 ---
11340
11341 * [HBASE-20501](https://issues.apache.org/jira/browse/HBASE-20501) | *Blocker* | **Change the Hadoop minimum version to 2.7.1**
11342
11343 <!-- markdown -->
11344 HBase is no longer able to maintain compatibility with Apache Hadoop versions that are no longer receiving updates. This release raises the minimum supported version to Hadoop 2.7.1. Downstream users are strongly advised to upgrade to the latest Hadoop 2.7 maintenance release.
11345
11346 Downstream users of earlier HBase versions are similarly advised to upgrade to Hadoop 2.7.1+. When doing so, it is especially important to follow the guidance from [the HBase Reference Guide's Hadoop section](http://hbase.apache.org/book.html#hadoop) on replacing the Hadoop artifacts bundled with HBase.
11347
11348
11349 ---
11350
11351 * [HBASE-20601](https://issues.apache.org/jira/browse/HBASE-20601) | *Minor* | **Add multiPut support and other miscellaneous to PE**
11352
11353 1. Add multiPut support
11354 Set --multiPut=number to enable batch puts (--autoflush must also be set to false).
11355
11356 2. Add Connection Count support
11357 Added a new parameter connCount to PE. Setting --connCount=2 means all threads will share 2 connections.
11358 The oneCon and connCount options shouldn't be set at the same time.
11359
11360 3. Add avg RT and avg TPS/QPS statistics for all threads
11361
11362 4. Delete some redundant code
11363 RandomWriteTest now inherits from SequentialWrite.
11364
11365
11366 ---
11367
11368 * [HBASE-20544](https://issues.apache.org/jira/browse/HBASE-20544) | *Blocker* | **downstream HBaseTestingUtility fails with invalid port**
11369
11370 <!-- markdown -->
11371
11372 HBase now relies on an internal mechanism to determine when it is running a local hbase cluster meant for external interaction vs an encapsulated test. When created via the `HBaseTestingUtility`, ports for Master and RegionServer services and UIs will be set to random ports to allow for multiple parallel uses on a single machine. Normally when running a Standalone HBase Deployment (as described in the HBase Reference Guide) the ports will be picked according to the same defaults used in a full cluster set up. If you wish to instead use the random port assignment set `hbase.localcluster.assign.random.ports` to true.
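
A minimal sketch of opting a standalone deployment into random port assignment, using the property named above in hbase-site.xml:

```xml
<!-- hbase-site.xml: request random ports for Master and RegionServer
     services and UIs when running a local cluster. -->
<property>
  <name>hbase.localcluster.assign.random.ports</name>
  <value>true</value>
</property>
```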
11373
11374
11375 ---
11376
11377 * [HBASE-20004](https://issues.apache.org/jira/browse/HBASE-20004) | *Minor* | **Client is not able to execute REST queries in a secure cluster**
11378
11379 Added the 'hbase.rest.http.allow.options.method' configuration property to let users decide whether the REST Server HTTP endpoint should allow the OPTIONS method. It is enabled by default in HBase 2.1.0+ versions and disabled by default in other versions.
11380 Similarly, 'hbase.thrift.http.allow.options.method' is added in HBase 1.5, 2.1.0 and 3.0.0 versions. It is disabled by default.
11381
11382
11383 ---
11384
11385 * [HBASE-20327](https://issues.apache.org/jira/browse/HBASE-20327) | *Minor* | **When qualifier is not specified, append and incr operation do not work (shell)**
11386
11387 This change enables users to perform append and increment operations with a null qualifier via hbase-shell.
11388
11389
11390 ---
11391
11392 * [HBASE-18842](https://issues.apache.org/jira/browse/HBASE-18842) | *Minor* | **The hbase shell clone\_snaphost command returns bad error message**
11393
11394 <!-- markdown -->
11395
11396 When attempting to clone a snapshot but using a namespace that does not exist, the HBase shell will now correctly report the exception as caused by the passed namespace. Previously, the shell would report that the problem was an unknown namespace but it would claim the user provided table name was not found as a namespace. Both before and after this change the shell properly used the passed namespace to attempt to handle the request.
11397
11398
11399 ---
11400
11401 * [HBASE-20406](https://issues.apache.org/jira/browse/HBASE-20406) | *Major* | **HBase Thrift HTTP - Shouldn't handle TRACE/OPTIONS methods**
11402
11403 <!-- markdown -->
11404 When configured to do thrift-over-http, the HBase Thrift API Server no longer accepts the HTTP methods TRACE nor OPTIONS.
11405
11406
11407 ---
11408
11409 * [HBASE-20046](https://issues.apache.org/jira/browse/HBASE-20046) | *Major* | **Reconsider the implementation for serial replication**
11410
11411 Replication can now ensure that logs are pushed in the same order as the requests from the client. Set the serial flag to true for a replication peer to enable this feature.
11412
11413
11414 ---
11415
11416 * [HBASE-20159](https://issues.apache.org/jira/browse/HBASE-20159) | *Major* | **Support using separate ZK quorums for client**
11417
11418 After HBASE-20159, clients can use separate ZK quorums via three new properties: hbase.client.zookeeper.quorum and hbase.client.zookeeper.property.clientPort to specify the client ZooKeeper properties (note that the combination of these two properties should be different from the server ZK quorums), and hbase.client.zookeeper.observer.mode to indicate whether the client ZK nodes are in observer mode (false by default).
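
A hedged sketch of the three client-side properties in hbase-site.xml is shown below. The property names and the observer-mode default come from this note; the host names and the port are hypothetical placeholders.

```xml
<!-- hbase-site.xml: separate ZooKeeper quorum for clients.
     Host names and port are hypothetical; they must differ from the server quorum. -->
<property>
  <name>hbase.client.zookeeper.quorum</name>
  <value>client-zk1.example.com,client-zk2.example.com,client-zk3.example.com</value>
</property>
<property>
  <name>hbase.client.zookeeper.property.clientPort</name>
  <value>2181</value>
</property>
<property>
  <!-- false is the default stated in the note -->
  <name>hbase.client.zookeeper.observer.mode</name>
  <value>false</value>
</property>
```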
11419
11420 HConstants.DEFAULT\_ZOOKEPER\_CLIENT\_PORT has been removed in HBase 3.0 and replaced by the correctly spelled DEFAULT\_ZOOKEEPER\_CLIENT\_PORT.
11421
11422
11423 ---
11424
11425 * [HBASE-20242](https://issues.apache.org/jira/browse/HBASE-20242) | *Major* | **The open sequence number will grow if we fail to open a region after writing the max sequence id file**
11426
11427 Now when opening a region, we store the current max sequence id of the region in its max sequence id file instead of the 'next sequence id'. This avoids bumping the sequence id when we fail to open a region, and also aligns with the behavior when we close a region.
11428
11429
11430 ---
11431
11432 * [HBASE-19024](https://issues.apache.org/jira/browse/HBASE-19024) | *Critical* | **Configurable default durability for synchronous WAL**
11433
11434 The default durability setting for the synchronous WAL is Durability.SYNC\_WAL, which triggers HDFS hflush() to flush edits to the datanodes. We also support Durability.FSYNC\_WAL, which instead triggers HDFS hsync() to flush \_and\_ fsync edits. This change introduces the new configuration setting "hbase.wal.hsync", defaulting to FALSE, that if set to TRUE changes the default durability setting for the synchronous WAL to FSYNC\_WAL.
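
As a minimal sketch, switching the default durability to FSYNC\_WAL would be a single property, assuming it is placed in hbase-site.xml:

```xml
<!-- hbase-site.xml: make hsync() (flush and fsync) the default for the
     synchronous WAL. The default value of this property is false. -->
<property>
  <name>hbase.wal.hsync</name>
  <value>true</value>
</property>
```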
11435
11436
11437 ---
11438
11439 * [HBASE-19389](https://issues.apache.org/jira/browse/HBASE-19389) | *Critical* | **Limit concurrency of put with dense (hundreds) columns to prevent write handler exhausted**
11440
11441 HBASE-19389 introduces a RegionServer self-protection mechanism that prevents write handlers from being exhausted by highly concurrent puts with dense columns. It is controlled mainly through two new properties: hbase.region.store.parallel.put.limit.min.column.count decides which puts to limit, based on the number of columns within a single column family (100 by default), and hbase.region.store.parallel.put.limit limits the concurrency (10 by default). There is another property for advanced users; please check the source and javadoc of StoreHotnessProtector for more details.
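
For illustration, the two main properties could be written out in hbase-site.xml as below. The values shown are simply the defaults stated in the note, made explicit; any tuning away from them is left to the operator.

```xml
<!-- hbase-site.xml: put concurrency protection, shown with the stated defaults. -->
<property>
  <!-- puts touching at least this many columns in one family are limited -->
  <name>hbase.region.store.parallel.put.limit.min.column.count</name>
  <value>100</value>
</property>
<property>
  <!-- maximum concurrency allowed for such puts -->
  <name>hbase.region.store.parallel.put.limit</name>
  <value>10</value>
</property>
```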
11442
11443
11444 ---
11445
11446 * [HBASE-20148](https://issues.apache.org/jira/browse/HBASE-20148) | *Major* | **Make serial replication as a option for a peer instead of a table**
11447
11448 A new method setSerial has been added to the interface ReplicationPeerConfigBuilder which is marked as IA.Public. This interface is not supposed to be implemented by client code, but if you do, this will be an incompatible change as you need to add this method to your implementation too.
11449
11450
11451 ---
11452
11453 * [HBASE-19397](https://issues.apache.org/jira/browse/HBASE-19397) | *Major* | **Design  procedures for ReplicationManager to notify peer change event from master**
11454
11455 Introduce 5 procedures to do peer modifications:
11456 AddPeerProcedure
11457 RemovePeerProcedure
11458 UpdatePeerConfigProcedure
11459 EnablePeerProcedure
11460 DisablePeerProcedure
11461
11462 The procedures are all executed in the following stages:
11463 1. Call pre CP hook, if an exception is thrown then give up
11464 2. Check whether the operation is valid, if not then give up
11465 3. Update peer storage. Notice that if we have entered this stage, then we can not rollback any more.
11466 4. Schedule sub procedures to refresh the peer config on every RS.
11467 5. Do post cleanup if any.
11468 6. Call post CP hook. The exception thrown will be ignored since we have already done the work.
11469
11470 The procedure holds an exclusive lock on the peer id, so there are no longer concurrent modifications on a single peer.
11471
11472 And now it is guaranteed that once the procedure is done, the peer modification has already taken effect on all RSes.
11473
11474 Abstracted a storage layer for replication peer/queue management, and refactored the upper layer to remove zk related naming/code/comment.
11475
11476 Add pre/postExecuteProcedures CP hooks to RegionServerObserver, and add permission check for executeProcedures method which requires the caller to be system user or super user.
11477
11478 On rolling upgrade: do not make any replication peer modifications during the rolling upgrade. There are no pb/layout changes to the peer/queue storage on zk.
11479 # HBASE  2.0.0 Release Notes
11480
11481
11482 These release notes cover new developer and user-facing incompatibilities, important issues, features, and major improvements.
11483
11484
11485 ---
11486
11487 * [HBASE-20464](https://issues.apache.org/jira/browse/HBASE-20464) | *Major* | **Disable IMC**
11488
11489 Change the default so that on creation of new tables, In-Memory Compaction BASIC is NOT enabled.
11490
11491 This change is in branch-2.0 only, not in branch-2.
11492
11493
11494 ---
11495
11496 * [HBASE-20276](https://issues.apache.org/jira/browse/HBASE-20276) | *Blocker* | **[shell] Revert shell REPL change and document**
11497
11498 <!-- markdown -->
11499
11500
11501
11502 The HBase shell now behaves as it did prior to the changes that started in HBASE-15965. Namely, some shell commands return values that may be further manipulated within the shell's IRB session.
11503
11504 The command line option `--return-values` is no longer acted on by the shell since it now always behaves as it did when passed this parameter. Passing the option results in a harmless warning about this change.
11505
11506 Users who wish to maintain the behavior seen in the 1.4.0-1.4.2 releases of the HBase shell should refer to the section _irbrc_ in the reference guide for how to configure their IRB session to avoid echoing expression results to the console.
11507
11508
11509 ---
11510
11511 * [HBASE-18792](https://issues.apache.org/jira/browse/HBASE-18792) | *Blocker* | **hbase-2 needs to defend against hbck operations**
11512
11513 As of HBase version 2.0, the hbck tool is significantly changed. In general, all Read-Only options are supported and can be used safely. Most -fix/ -repair options are NOT supported. Please see usage below for details on which options are not supported:
11514
11515
11516 Usage: fsck [opts] {only tables}
11517  where [opts] are:
11518    -help Display help options (this)
11519    -details Display full report of all regions.
11520    -timelag \<timeInSeconds\>  Process only regions that  have not experienced any metadata updates in the last  \<timeInSeconds\> seconds.
11521    -sleepBeforeRerun \<timeInSeconds\> Sleep this many seconds before checking if the fix worked if run with -fix
11522    -summary Print only summary of the tables and status.
11523    -metaonly Only check the state of the hbase:meta table.
11524    -sidelineDir \<hdfs://\> HDFS path to backup existing meta.
11525    -boundaries Verify that regions boundaries are the same between META and store files.
11526    -exclusive Abort if another hbck is exclusive or fixing.
11527
11528   Datafile Repair options: (expert features, use with caution!)
11529    -checkCorruptHFiles     Check all Hfiles by opening them to make sure they are valid
11530    -sidelineCorruptHFiles  Quarantine corrupted HFiles.  implies -checkCorruptHFiles
11531
11532  Replication options
11533    -fixReplication   Deletes replication queues for removed peers
11534
11535   Metadata Repair options supported as of version 2.0: (expert features, use with caution!)
11536    -fixVersionFile   Try to fix missing hbase.version file in hdfs.
11537    -fixReferenceFiles  Try to offline lingering reference store files
11538    -fixHFileLinks  Try to offline lingering HFileLinks
11539    -noHdfsChecking   Don't load/check region info from HDFS. Assumes hbase:meta region info is good. Won't check/fix any HDFS issue, e.g. hole, orphan, or overlap
11540    -ignorePreCheckPermission  ignore filesystem permission pre-check
11541
11542 NOTE: Following options are NOT supported as of HBase version 2.0+.
11543
11544   UNSUPPORTED Metadata Repair options: (expert features, use with caution!)
11545    -fix              Try to fix region assignments.  This is for backwards compatiblity
11546    -fixAssignments   Try to fix region assignments.  Replaces the old -fix
11547    -fixMeta          Try to fix meta problems.  This assumes HDFS region info is good.
11548    -fixHdfsHoles     Try to fix region holes in hdfs.
11549    -fixHdfsOrphans   Try to fix region dirs with no .regioninfo file in hdfs
11550    -fixTableOrphans  Try to fix table dirs with no .tableinfo file in hdfs (online mode only)
11551    -fixHdfsOverlaps  Try to fix region overlaps in hdfs.
11552    -maxMerge \<n\>     When fixing region overlaps, allow at most \<n\> regions to merge. (n=5 by default)
11553    -sidelineBigOverlaps  When fixing region overlaps, allow to sideline big overlaps
11554    -maxOverlapsToSideline \<n\>  When fixing region overlaps, allow at most \<n\> regions to sideline per group. (n=2 by default)
11555    -fixSplitParents  Try to force offline split parents to be online.
11556    -removeParents    Try to offline and sideline lingering parents and keep daughter regions.
11557    -fixEmptyMetaCells  Try to fix hbase:meta entries not referencing any region (empty REGIONINFO\_QUALIFIER rows)
11558
11559   UNSUPPORTED Metadata Repair shortcuts
11560    -repair           Shortcut for -fixAssignments -fixMeta -fixHdfsHoles -fixHdfsOrphans -fixHdfsOverlaps -fixVersionFile -sidelineBigOverlaps -fixReferenceFiles-fixHFileLinks
11561    -repairHoles      Shortcut for -fixAssignments -fixMeta -fixHdfsHoles
11562
11563
11564 ---
11565
11566 * [HBASE-19994](https://issues.apache.org/jira/browse/HBASE-19994) | *Major* | **Create a new class for RPC throttling exception, make it retryable.**
11567
11568 A new RpcThrottlingException deprecates ThrottlingException. The new RpcThrottlingException is a retryable Exception that clients will retry when Rpc throttling quota is exceeded. The deprecated ThrottlingException is a nonretryable Exception.
11569
11570
11571 ---
11572
11573 * [HBASE-20224](https://issues.apache.org/jira/browse/HBASE-20224) | *Blocker* | **Web UI is broken in standalone mode**
11574
11575 Standalone webui was broken inadvertently by HBASE-20027.
11576
11577
11578 ---
11579
11580 * [HBASE-18784](https://issues.apache.org/jira/browse/HBASE-18784) | *Major* | **Use of filesystem that requires hflush / hsync / append / etc should query outputstream capabilities**
11581
11582 <!-- markdown -->
11583
11584
11585
11586 If HBase is run on top of Apache Hadoop libraries that support the needed APIs it will verify that underlying Filesystem implementations provide the needed durability mechanisms to safely operate. The needed APIs *should* be present in Hadoop 3 releases and Hadoop 2 releases starting in the Hadoop 2.9 series. If the APIs are not available, HBase behaves as it has in previous releases (that is, it moves forward assuming such a check would pass).
11587
11588 Where this check fails, it is unsafe to rely on HBase in a production setting. In the event of process or node failure, the HBase RegionServer process may fail to have access to all the data it previously wrote to its write ahead log, resulting in data loss. In the event of process or node failure, the HBase master process may lose all or part of the write ahead log that it relies on for cluster management operations, leaving the cluster in an inconsistent state that we aren't sure it could recover from.
11589
11590 Notably, the LocalFileSystem implementation provided by Hadoop reports (accurately) via these new APIs that it can not provide the durability HBase needs to operate. As such, the current instructions for single-node HBase operation have been updated both with a) how to bypass this safety check and b) a strong warning about the dire consequences of doing so outside of a dev/test environment.
11591
11592
11593 ---
11594
11595 * [HBASE-20219](https://issues.apache.org/jira/browse/HBASE-20219) | *Critical* | **An error occurs when scanning with reversed=true and loadColumnFamiliesOnDemand=true**
11596
11597 Throws DoNotRetryIOException when you ask for a reverse scan loading adjacent column families on demand. Previously it threw IllegalStateException.
11598
11599
11600 ---
11601
11602 * [HBASE-20358](https://issues.apache.org/jira/browse/HBASE-20358) | *Minor* | **Fix bin/hbase thrift usage text**
11603
11604 Cleanup usage message and command-line processing (no functional change).
11605
11606
11607 ---
11608
11609 * [HBASE-20182](https://issues.apache.org/jira/browse/HBASE-20182) | *Blocker* | **Can not locate region after split and merge**
11610
11611 Now if we hit a split parent when locating a region, we will skip to the next row and try again until the region does not contain our row. So there will be no RegionOfflineException for a split parent any more, instead, if the split children have not been onlined yet, i.e, we finally arrive at a region which does not contain our row, an IOException will be thrown.
11612
11613
11614 ---
11615
11616 * [HBASE-20149](https://issues.apache.org/jira/browse/HBASE-20149) | *Critical* | **Purge dev javadoc from bin tarball (or make a separate tarball of javadoc)**
11617
11618 We no longer include dev or dev test javadocs in our binary bundle. We still build them; they are just not included because they were half the size of the resultant tarball.
11619
11620 Here is our story on javadoc as of this commit:
11621
11622  \* apidocs - user facing main api javadocs. currently for a release line, published on website and linked from menu. included in the bin tarball
11623  \* devapidocs - hbase internal javadocs. currently for a release line, published on the website but not linked from the menu. no longer included in the bin tarball.
11624  \* testapidocs - user facing test scope api javadocs. currently for a release line, not published. included in the bin tarball.
11625  \* testdevapidocs - hbase internal test scope javadocs. currently for a release line, not published. no longer included in the bin tarball
11626
11627
11628 ---
11629
11630 * [HBASE-18828](https://issues.apache.org/jira/browse/HBASE-18828) | *Blocker* | **[2.0] Generate CHANGES.txt**
11631
11632 Moves us over to yetus releasedocmaker tooling for generating CHANGES. CHANGES is now markdown (CHANGES.md) as opposed to CHANGES.txt. We've also added a new RELEASENOTES.md that lists JIRA release notes (courtesy of releasedocmaker).
11633
11634 CHANGES/RELEASENOTES are current as of now. Will need a 'freshening' when we cut the RC.
11635
11636
11637 ---
11638
11639 * [HBASE-14175](https://issues.apache.org/jira/browse/HBASE-14175) | *Critical* | **Adopt releasedocmaker for better generated release notes**
11640
11641 We will use yetus releasedocmaker to make our changes doc from here on out. A CHANGELOG.md will replace our current CHANGES.txt. Adjacent, we'll keep up a RELEASENOTES.md doc courtesy of releasedocmaker.
11642
11643 Over in HBASE-18828 is where we are working through steps for the RM integrating this new tooling.
11644
11645
11646 ---
11647
11648 * [HBASE-16499](https://issues.apache.org/jira/browse/HBASE-16499) | *Critical* | **slow replication for small HBase clusters**
11649
11650 Changed the default value for replication.source.ratio from 0.1 to 0.5, which means that by default 50% of the total RegionServers in peer cluster(s) will now participate in replication.
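
As a hedged example, an operator who prefers the earlier behavior could presumably set the ratio back explicitly. The sketch below assumes the property is read from hbase-site.xml on the source cluster; 0.1 is the previous default named in the note.

```xml
<!-- hbase-site.xml: illustrative return to the previous default ratio.
     With 0.1, roughly 10% of the peer cluster's RegionServers participate. -->
<property>
  <name>replication.source.ratio</name>
  <value>0.1</value>
</property>
```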
11651
11652
11653 ---
11654
11655 * [HBASE-16459](https://issues.apache.org/jira/browse/HBASE-16459) | *Trivial* | **Remove unused hbase shell --format option**
11656
11657 <!-- markdown -->
11658
11659
11660
11661
11662 The HBase `shell` command no longer recognizes the option `--format`. Previously this option only recognized the default value of 'console'. The default value is now always used.
11663
11664
11665 ---
11666
11667 * [HBASE-20259](https://issues.apache.org/jira/browse/HBASE-20259) | *Critical* | **Doc configs for in-memory-compaction and add detail to in-memory-compaction logging**
11668
11669 Disables in-memory compaction by default.
11670
11671 Adds logging of in-memory compaction configuration on creation.
11672
11673 Adds a chapter to the refguide on this new feature.
11674
11675
11676 ---
11677
11678 * [HBASE-20282](https://issues.apache.org/jira/browse/HBASE-20282) | *Major* | **Provide short name invocations for useful tools**
11679
11680 \`hbase regionsplitter\` is a new short invocation for \`hbase org.apache.hadoop.hbase.util.RegionSplitter\`
11681
11682
11683 ---
11684
11685 * [HBASE-20314](https://issues.apache.org/jira/browse/HBASE-20314) | *Major* | **Precommit build for master branch fails because of surefire fork fails**
11686
11687 Upgrade surefire plugin to 2.21.0.
11688
11689
11690 ---
11691
11692 * [HBASE-20130](https://issues.apache.org/jira/browse/HBASE-20130) | *Critical* | **Use defaults (16020 & 16030) as base ports when the RS is bound to localhost**
11693
11694 <!-- markdown -->
11695
11696
11697
11698 When region servers bind to localhost (mostly in pseudo distributed mode), default ports (16020 & 16030) are used as base ports. This will support up to 9 instances of region servers by default with `local-regionservers.sh` script. If additional instances are needed, see the reference guide on how to deploy with a different range using the environment variables `HBASE_RS_BASE_PORT` and `HBASE_RS_INFO_BASE_PORT`.
11699
11700
11701 ---
11702
11703 * [HBASE-20111](https://issues.apache.org/jira/browse/HBASE-20111) | *Critical* | **Able to split region explicitly even on shouldSplit return false from split policy**
11704
11705 When a split is requested on a Region, the RegionServer hosting that Region will now consult the configured SplitPolicy for that table when determining if a split of that Region is allowed. When a split is disallowed (due to the Region not being OPEN or the SplitPolicy denying the request), the operation will \*not\* be implicitly retried as it has previously done. Users will need to guard against and explicitly retry region split requests which are denied by the system.
11706
11707
11708 ---
11709
11710 * [HBASE-20223](https://issues.apache.org/jira/browse/HBASE-20223) | *Blocker* | **Use hbase-thirdparty 2.1.0**
11711
11712 Moves commons-cli and commons-collections4 into the HBase thirdparty shaded jar which means that these are no longer generally available for users on the classpath.
11713
11714
11715 ---
11716
11717 * [HBASE-19128](https://issues.apache.org/jira/browse/HBASE-19128) | *Major* | **Purge Distributed Log Replay from codebase, configurations, text; mark the feature as unsupported, broken.**
11718
11719 Removes Distributed Log Replay feature. Disable the feature before upgrading.
11720
11721
11722 ---
11723
11724 * [HBASE-19504](https://issues.apache.org/jira/browse/HBASE-19504) | *Major* | **Add TimeRange support into checkAndMutate**
11725
1) checkAndMutate accepts a TimeRange to query the specified cell
2) removes the writeToWAL flag from Region#checkAndMutate since it is useless (this is an incompatible change)
11728
11729
11730 ---
11731
11732 * [HBASE-20237](https://issues.apache.org/jira/browse/HBASE-20237) | *Critical* | **Put back getClosestRowBefore and throw UnknownProtocolException instead... for asynchbase client**
11733
Throw UnknownProtocolException if a client connects and tries to invoke the old getClosestRowOrBefore method. Pre-hbase-1.0.0 clients and asynchbase do this instead of using its replacement, the reverse Scan.

getClosestRowOrBefore was implemented as a flag on Get. Before this patch, hbase2 ignored the flag even when it was set. This made it look like a pre-1.0.0 client was 'working', but it would then fail to find the appropriate Region for a client-specified row when doing lookups into hbase:meta.
11737
11738
11739 ---
11740
11741 * [HBASE-20247](https://issues.apache.org/jira/browse/HBASE-20247) | *Major* | **Set version as 2.0.0 in branch-2.0 in prep for first RC**
11742
11743 Set version as 2.0.0 on branch-2.0.
11744
11745
11746 ---
11747
11748 * [HBASE-20090](https://issues.apache.org/jira/browse/HBASE-20090) | *Major* | **Properly handle Preconditions check failure in MemStoreFlusher$FlushHandler.run**
11749
When there is a concurrent region split, MemStoreFlusher may not find a flushable region if the only candidate region left hasn't received writes (resulting in 0 data size).
After this JIRA, such a scenario no longer triggers a Precondition assertion (it is replaced by an if statement that checks whether there is any flushable region).
If there is no flushable region, a DEBUG log appears in the region server log, saying "Above memory mark but there is no flushable region".
11753
11754
11755 ---
11756
11757 * [HBASE-19552](https://issues.apache.org/jira/browse/HBASE-19552) | *Major* | **update hbase to use new thirdparty libs**
11758
11759 hbase-thirdparty libs have moved to o.a.h.thirdparty offset. Netty shading system property is no longer necessary.
11760
11761
11762 ---
11763
11764 * [HBASE-20119](https://issues.apache.org/jira/browse/HBASE-20119) | *Minor* | **Introduce a pojo class to carry coprocessor information in order to make TableDescriptorBuilder accept multiple cp at once**
11765
1) Make all methods in TableDescriptorBuilder follow the setter pattern.
11767 addCoprocessor -\> setCoprocessor
11768 addColumnFamily -\> setColumnFamily
11769 (addCoprocessor and addColumnFamily are still in branch-2 but they are marked as deprecated)
11770 2) add CoprocessorDescriptor to carry cp information
11771 3) add CoprocessorDescriptorBuilder to build CoprocessorDescriptor
4) TD disallows setting a negative priority on a coprocessor since parsing the negative value will cause an exception
11773
11774
11775 ---
11776
11777 * [HBASE-17165](https://issues.apache.org/jira/browse/HBASE-17165) | *Critical* | **Add retry to LoadIncrementalHFiles tool**
11778
11779 Adds retry to load of incremental hfiles. Pertinent key is HConstants.HBASE\_CLIENT\_RETRIES\_NUMBER. Default is HConstants.DEFAULT\_HBASE\_CLIENT\_RETRIES\_NUMBER.
11780
11781
11782 ---
11783
11784 * [HBASE-20108](https://issues.apache.org/jira/browse/HBASE-20108) | *Critical* | **\`hbase zkcli\` falls into a non-interactive prompt after HBASE-15199**
11785
This issue fixes a runtime dependency issue where JLine is not made available on the classpath, which causes the ZooKeeper CLI to appear non-interactive. JLine was being made available unintentionally via the JRuby jar file on the classpath for the HBase shell. Because the JRuby jar is not always present, the fix made here was to selectively include the JLine dependency on the zkcli command's classpath.
11787
11788
11789 ---
11790
11791 * [HBASE-8770](https://issues.apache.org/jira/browse/HBASE-8770) | *Blocker* | **deletes and puts with the same ts should be resolved according to mvcc/seqNum**
11792
11793 This behavior is available as a new feature. See HBASE-15968 release note.
11794
This issue is just about adding refguide documentation on the HBASE-15968 feature.
11796
11797
11798 ---
11799
11800 * [HBASE-19114](https://issues.apache.org/jira/browse/HBASE-19114) | *Major* | **Split out o.a.h.h.zookeeper from hbase-server and hbase-client**
11801
11802 Splits out most of ZooKeeper related code into a separate new module: hbase-zookeeper.
11803 Also, renames some ZooKeeper related classes to follow a common naming pattern - "ZK" prefix - as compared to many different styles earlier.
11804
11805
11806 ---
11807
11808 * [HBASE-19437](https://issues.apache.org/jira/browse/HBASE-19437) | *Critical* | **Batch operation can't handle the null result for Append/Increment**
11809
The result from the server is changed from null to Result.EMPTY\_RESULT when an Append/Increment operation can't retrieve any data from the server.
11811
11812
11813 ---
11814
11815 * [HBASE-17448](https://issues.apache.org/jira/browse/HBASE-17448) | *Major* | **Export metrics from RecoverableZooKeeper**
11816
11817 Committed to master and branch-1
11818
11819
11820 ---
11821
11822 * [HBASE-19400](https://issues.apache.org/jira/browse/HBASE-19400) | *Major* | **Add missing security checks in MasterRpcServices**
11823
Added ACL check to the following Admin functions:
11825 enableCatalogJanitor, runCatalogJanitor, cleanerChoreSwitch, runCleanerChore, execProcedure, execProcedureWithReturn, normalize, normalizerSwitch, coprocessorService.
11826 When ACL is enabled, only those with ADMIN rights will be able to invoke these operations successfully.
11827
11828
11829 ---
11830
11831 * [HBASE-20048](https://issues.apache.org/jira/browse/HBASE-20048) | *Blocker* | **Revert serial replication feature**
11832
Reverts the serial replication feature from all branches. The plan is to reimplement it soon and land it on the 2.1 release line.
11834
11835
11836 ---
11837
11838 * [HBASE-19166](https://issues.apache.org/jira/browse/HBASE-19166) | *Blocker* | **AsyncProtobufLogWriter persists ProtobufLogWriter as class name for backward compatibility**
11839
11840 For backward compatibility, AsyncProtobufLogWriter uses "ProtobufLogWriter" as writer class name and SecureAsyncProtobufLogWriter uses "SecureProtobufLogWriter" as writer class name.
11841
11842
11843 ---
11844
11845 * [HBASE-18596](https://issues.apache.org/jira/browse/HBASE-18596) | *Blocker* | **[TEST] A hbase1 cluster should be able to replicate to a hbase2 cluster; verify**
11846
11847 Replication between versions verified as basically working. 0.98.25-SNAPSHOT to beta-2 hbase2 and a 1.2-ish version tried.
11848
11849
11850 ---
11851
11852 * [HBASE-20017](https://issues.apache.org/jira/browse/HBASE-20017) | *Blocker* | **BufferedMutatorImpl submit the same mutation repeatedly**
11853
11854 This change fixes multithreading issues in the implementation of BufferedMutator. BufferedMutator should not be used with 1.4 releases prior to 1.4.2.
11855
11856
11857 ---
11858
11859 * [HBASE-20032](https://issues.apache.org/jira/browse/HBASE-20032) | *Minor* | **Receving multiple warnings for missing reporting.plugins.plugin.version**
11860
11861 Add (latest) version elements missing from reporting plugins in top-level pom.
11862
11863
11864 ---
11865
11866 * [HBASE-19954](https://issues.apache.org/jira/browse/HBASE-19954) | *Major* | **Separate TestBlockReorder into individual tests to avoid ShutdownHook suppression error against hadoop3**
11867
11868 hadoop3 minidfscluster removes all shutdown handlers when the cluster goes down which made this test that does FS-stuff fail (Fix was to break up the test so each test method ran with an unadulterated FS).
11869
11870
11871 ---
11872
11873 * [HBASE-20014](https://issues.apache.org/jira/browse/HBASE-20014) | *Major* | **TestAdmin1 Times out**
11874
Ups the overall test timeout from 10 minutes to 13 minutes. 15 minutes is the surefire timeout.
11876
11877
11878 ---
11879
11880 * [HBASE-20020](https://issues.apache.org/jira/browse/HBASE-20020) | *Critical* | **Make sure we throw DoNotRetryIOException when ConnectionImplementation is closed**
11881
11882 Add checkClosed to core Client methods. Avoid unnecessary retry.
11883
11884
11885 ---
11886
11887 * [HBASE-19978](https://issues.apache.org/jira/browse/HBASE-19978) | *Major* | **The keepalive logic is incomplete in ProcedureExecutor**
11888
Completes keep-alive logic and then enables it; ProcedureExecutor Workers will spin up more threads when needed, settling back to the core count after the burst in demand has passed. Default keep-alive is one minute. Default core-count is CPUs/4 or 16, whichever is greater. Maximum is an arbitrary core-count \* 10 (a limit that should never be hit and if it is, there is something else very wrong).
11890
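The sizing rule stated above can be written out as a small sketch. The class and method names are hypothetical; only the arithmetic (CPUs/4 or 16, whichever is greater, and a maximum of ten times that) comes from this note.

```java
// Illustrative sketch only: worker thread sizing as described in this note.
final class ProcedureWorkerSizing {
    static final long KEEP_ALIVE_MILLIS = 60_000L; // default keep-alive of one minute

    // Core count: CPUs/4 or 16, whichever is greater.
    static int coreCount(int cpus) {
        return Math.max(cpus / 4, 16);
    }

    // Upper bound on workers during a burst: core count * 10.
    static int maxCount(int cpus) {
        return coreCount(cpus) * 10;
    }
}
```

For example, an 8-CPU host gets 16 core workers and at most 160, while a 128-CPU host gets 32 core workers and at most 320.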
11891
11892 ---
11893
11894 * [HBASE-19950](https://issues.apache.org/jira/browse/HBASE-19950) | *Minor* | **Introduce a ColumnValueFilter**
11895
ColumnValueFilter provides a way to fetch only the matched cells, given a specified column, value and comparator. This differs from SingleColumnValueFilter, which fetches an entire row as soon as a matched cell is found.
11897
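The difference between the two filters can be modeled without the HBase API. The sketch below treats a row as a plain map of column to value and uses equality as the comparator; these simplifications and all names are assumptions made for illustration only.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Illustrative model only: a row is a map of column name to value.
final class FilterSemantics {
    // Cell-level behavior: return only the cell that matches.
    static Map<String, String> matchedCellsOnly(Map<String, String> row, String column, String value) {
        Map<String, String> out = new LinkedHashMap<>();
        if (value.equals(row.get(column))) {
            out.put(column, value);
        }
        return out;
    }

    // Row-level behavior: return the entire row once a matching cell is found.
    static Map<String, String> wholeRowOnMatch(Map<String, String> row, String column, String value) {
        if (value.equals(row.get(column))) {
            return new LinkedHashMap<>(row);
        }
        return new LinkedHashMap<>();
    }
}
```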
11898
11899 ---
11900
11901 * [HBASE-18294](https://issues.apache.org/jira/browse/HBASE-18294) | *Major* | **Reduce global heap pressure: flush based on heap occupancy**
11902
11903 A region is flushed if its memory component exceeds the region flush threshold.
11904 A flush policy decides which stores to flush by comparing the size of the store to a column-family-flush threshold.
11905 If the overall size of all memstores in the machine exceeds the bounds defined by the administrator (denoted global pressure) a region is selected and flushed.
11906 HBASE-18294 changes flush decisions to be based on heap-occupancy and not data (key-value) size, consistently across levels. This rolls back some of the changes by HBASE-16747. Specifically,
11907 (1) RSs, Regions and stores track their overall on-heap and off-heap occupancy,
11908 (2) A region is flushed when its on-heap+off-heap size exceeds the region flush threshold specified in hbase.hregion.memstore.flush.size,
11909 (3) The store to be flushed is chosen based on its on-heap+off-heap size
11910 (4) At the RS level, a flush is triggered when the overall on-heap exceeds the on-heap limit, or when the overall off-heap size exceeds the off-heap limit (low/high water marks).
11911
11912 Note that when the region flush size is set to XXmb a region flush may be triggered even before writing keys and values of size XX because the total heap occupancy of the region which includes additional metadata exceeded the threshold.
11913
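The checks in (2) and (4) above can be sketched as follows. The names are hypothetical and the byte figures in the usage note are illustrative assumptions; the sketch only restates the comparisons described in this note.

```java
// Illustrative sketch only: heap-occupancy based flush checks.
final class FlushDecision {
    // Region level: flush when on-heap + off-heap occupancy exceeds the flush threshold.
    static boolean regionShouldFlush(long onHeapBytes, long offHeapBytes, long flushSizeBytes) {
        return onHeapBytes + offHeapBytes > flushSizeBytes;
    }

    // RegionServer level: flush when either the on-heap or the off-heap limit is exceeded.
    static boolean serverShouldFlush(long totalOnHeap, long onHeapLimit,
                                     long totalOffHeap, long offHeapLimit) {
        return totalOnHeap > onHeapLimit || totalOffHeap > offHeapLimit;
    }
}
```

With a 128 MB flush size, a region holding 120 MB of keys and values plus 10 MB of metadata overhead already exceeds the threshold, which is why a flush can happen before 128 MB of data has been written.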
11914
11915 ---
11916
11917 * [HBASE-19116](https://issues.apache.org/jira/browse/HBASE-19116) | *Critical* | **Currently the tail of hfiles with CellComparator\* classname makes it so hbase1 can't open hbase2 written hfiles; fix**
11918
11919 hbase-2.x sets KeyValue Comparators into the tail of hfiles rather than CellComparator, what it uses internally, just so hbase-1.x can continue to read hbase-2.x written hfiles.
11920
11921
11922 ---
11923
11924 * [HBASE-19948](https://issues.apache.org/jira/browse/HBASE-19948) | *Major* | **Since HBASE-19873, HBaseClassTestRule, Small/Medium/Large has different semantic**
11925
In subtask, fixed doc and annotations to be more explicit that test timings are for the whole Test Fixture/Test Class/Test Suite, NOT for the test method only as we'd been measuring up to this (other subtasks untethered Categorization and test timeout such that all categories now have a ten minute timeout -- no test can run longer than ten minutes or it gets killed/timed out).
11927
11928
11929 ---
11930
11931 * [HBASE-16060](https://issues.apache.org/jira/browse/HBASE-16060) | *Blocker* | **1.x clients cannot access table state talking to 2.0 cluster**
11932
11933 By default, we mirror table state to zookeeper so hbase-1.x clients will work against an hbase-2 cluster (With this patch, hbase-1.x clients can do most Admin functions including table create; hbase-1.x clients can do all Table/DML against hbase-2 cluster).
11934
11935 Flag to disable mirroring is hbase.mirror.table.state.to.zookeeper; set it to false in Configuration.
11936
11937 Related, Master on startup will look to see if there are table state znodes left over by an hbase-1 instance. If any found, it will migrate the table state to hbase-2 setting the state into the hbase:meta table where table state is now kept. We will do this check on every Master start. Notion is that this will be overall beneficial with low impediment. To disable the migration check, set hbase.migrate.table.state.from.zookeeper to false.
11938
11939
11940 ---
11941
11942 * [HBASE-19900](https://issues.apache.org/jira/browse/HBASE-19900) | *Critical* | **Region-level exception destroy the result of batch**
11943
This fix makes the following changes to how the client handles both the action result and the region exception.
1) honor the action result rather than the region exception. If the action has both a true result and a region exception, the action is fine as the exception is caused by other actions which are in the same region.
2) honor the action exception rather than the region exception. If the action has both an action exception and a region exception, we deal with the action exception only. If we also handled the region exception for the same action, it would introduce a negative count of actions in progress, and AsyncRequestFuture#waitUntilDone would block forever.
11947
11948
11949 ---
11950
11951 * [HBASE-19841](https://issues.apache.org/jira/browse/HBASE-19841) | *Major* | **Tests against hadoop3 fail with StreamLacksCapabilityException**
11952
11953 HBaseTestingUtility now assumes that all clusters will use local storage until a MiniDFSCluster is started or assigned.
11954
11955
11956 ---
11957
11958 * [HBASE-19528](https://issues.apache.org/jira/browse/HBASE-19528) | *Major* | **Major Compaction Tool**
11959
11960 Tool allows you to compact a cluster with given concurrency of regionservers compacting at a given time.  If tool completes successfully everything requested for compaction will be compacted, regardless of region moves, splits and merges.
11961
11962
11963 ---
11964
11965 * [HBASE-19919](https://issues.apache.org/jira/browse/HBASE-19919) | *Major* | **Tidying up logging**
11966
11967 (I thought this change innocuous but I made work for a co-worker when I upped interval between log cleaner runs -- meant a smoke test failed because we were slow doing an expected cleanup).
11968
11969 Edit of log lines removing redundancy. Shorten thread names shown in log.  Made some log TRACE instead of DEBUG.  Capitalizations.
11970
Upped log cleaner interval (hbase.master.cleaner.interval) from every minute to every ten minutes.
11972
11973 Lowered default count of threads started by Procedure Executor from count of CPUs to 1/4 of count of CPUs.
11974
11975
11976 ---
11977
11978 * [HBASE-19901](https://issues.apache.org/jira/browse/HBASE-19901) | *Major* | **Up yetus proclimit on nightlies**
11979
11980 Pass to yetus a dockermemlimit of 20G and a proclimit of 10000. Defaults are 4G and 1G respectively.
11981
11982
11983 ---
11984
11985 * [HBASE-19912](https://issues.apache.org/jira/browse/HBASE-19912) | *Minor* | **The flag "writeToWAL" of Region#checkAndRowMutate is useless**
11986
11987 Remove useless 'writeToWAL' flag of Region#checkAndRowMutate & related class
11988
11989
11990 ---
11991
11992 * [HBASE-19911](https://issues.apache.org/jira/browse/HBASE-19911) | *Major* | **Convert some tests from small to medium because they are timing out: TestNettyRpcServer, TestClientClusterStatus, TestCheckTestClasses**
11993
11994 Changed a few tests so they are medium sized rather than small size.
11995
Also, upped the time we wait on small tests to 60 seconds from 30 seconds. Small tests are tests that run in 15 seconds or less. What we changed was the timeout watcher. It is now more lax, more tolerant of dodgy infrastructure that might be running tests slowly.
11997
11998
11999 ---
12000
12001 * [HBASE-19892](https://issues.apache.org/jira/browse/HBASE-19892) | *Major* | **Checking 'patch attach' and yetus 0.7.0 and move to Yetus 0.7.0**
12002
12003 Moved our internal yetus reference from 0.6.0 to 0.7.0. Concurrently, I changed hadoopqa to run with 0.7.0 (by editing the config in jenkins).
12004
12005
12006 ---
12007
12008 * [HBASE-19873](https://issues.apache.org/jira/browse/HBASE-19873) | *Major* | **Add a CategoryBasedTimeout ClassRule for all UTs**
12009
12010 Along with @category -- small, medium, large -- all hbase tests must now carry a ClassRule as follows:
12011
12012 +  @ClassRule
12013 +  public static final HBaseClassTestRule CLASS\_RULE =
12014 +      HBaseClassTestRule.forClass(TestInterfaceAudienceAnnotations.class);
12015
12016 where the class changes by test.
12017
Currently the classrule enforces a timeout for the whole test suite -- i.e. if a SmallTest Category then all the tests in the TestSuite must complete inside 60 seconds, the timeout we set on the SmallTest Category test suite -- but it is meant to be a repository for general, runtime, hbase test facility.
12019
12020
12021 ---
12022
12023 * [HBASE-19770](https://issues.apache.org/jira/browse/HBASE-19770) | *Critical* | **Add '--return-values' option to Shell to print return values of commands in interactive mode**
12024
Introduces a new option to the HBase shell: -r, --return-values. When the shell is in "interactive" mode (default), the return values of shell commands are not returned to the user as they dirty the console output. For those who desire this functionality, the "--return-values" option restores the old functionality of the commands passing their return value to the user.
12026
12027
12028 ---
12029
12030 * [HBASE-15321](https://issues.apache.org/jira/browse/HBASE-15321) | *Major* | **Ability to open a HRegion from hdfs snapshot.**
12031
12032 HRegion.openReadOnlyFileSystemHRegion() provides the ability to open HRegion from a read-only hdfs snapshot.  Because hdfs snapshots are read-only, no cleanup happens when using this API.
12033
12034
12035 ---
12036
12037 * [HBASE-17513](https://issues.apache.org/jira/browse/HBASE-17513) | *Critical* | **Thrift Server 1 uses different QOP settings than RPC and Thrift Server 2 and can easily be misconfigured so there is no encryption when the operator expects it.**
12038
This change fixes an issue where users could have unintentionally configured the HBase Thrift1 server to run without wire-encryption when they believed they had configured the Thrift1 server to use it.
12040
12041
12042 ---
12043
12044 * [HBASE-19828](https://issues.apache.org/jira/browse/HBASE-19828) | *Major* | **Flakey TestRegionsOnMasterOptions.testRegionsOnAllServers**
12045
12046 Disables TestRegionsOnMasterOptions because Regions on Master does not work reliably; see HBASE-19831.
12047
12048
12049 ---
12050
12051 * [HBASE-18963](https://issues.apache.org/jira/browse/HBASE-18963) | *Major* | **Remove MultiRowMutationProcessor and implement mutateRows... methods using batchMutate()**
12052
12053 Modified HRegion.mutateRow() APIs to use batchMutate() instead of processRowsWithLocks() with MultiRowMutationProcessor. MultiRowMutationProcessor is removed to have single write path that uses batchMutate().
12054
12055
12056 ---
12057
12058 * [HBASE-19163](https://issues.apache.org/jira/browse/HBASE-19163) | *Major* | **"Maximum lock count exceeded" from region server's batch processing**
12059
When there are many mutations against the same row in a batch, each mutation acquires a shared row lock, so the batch can exceed the maximum shared lock count that the Java ReentrantReadWriteLock supports (64k). Along with other optimizations, the batch is divided into multiple possible minibatches. A new config is added to limit the maximum number of mutations in a minibatch.
12061
12062    \<property\>
12063     \<name\>hbase.regionserver.minibatch.size\</name\>
12064     \<value\>20000\</value\>
12065    \</property\>
12066 The default value is 20000.
12067
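The effect of the size limit can be sketched as a plain partitioning step. This is a simplified illustration with hypothetical names; the actual minibatch boundaries in the region server may depend on other factors as well.

```java
import java.util.ArrayList;
import java.util.List;

// Illustrative sketch only: cap the number of mutations handled per minibatch.
final class MiniBatcher {
    static final int DEFAULT_MINIBATCH_SIZE = 20000; // default of hbase.regionserver.minibatch.size

    // Splits the batch into consecutive chunks of at most maxSize elements.
    static <T> List<List<T>> split(List<T> batch, int maxSize) {
        if (maxSize <= 0) {
            throw new IllegalArgumentException("maxSize must be positive");
        }
        List<List<T>> out = new ArrayList<>();
        for (int start = 0; start < batch.size(); start += maxSize) {
            int end = Math.min(start + maxSize, batch.size());
            out.add(new ArrayList<>(batch.subList(start, end)));
        }
        return out;
    }
}
```

A batch of 45000 mutations against one row would be processed as chunks of 20000, 20000 and 5000 under the default, keeping each chunk below the 64k shared lock limit.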
12068
12069 ---
12070
12071 * [HBASE-19739](https://issues.apache.org/jira/browse/HBASE-19739) | *Minor* | **Include thrift IDL files in HBase binary distribution**
12072
12073 Thrift IDLs are now shipped, bundled up in the respective hbase-\*thrift.jars (look for files ending in .thrift).
12074
12075
12076 ---
12077
12078 * [HBASE-11409](https://issues.apache.org/jira/browse/HBASE-11409) | *Major* | **Add more flexibility for input directory structure to LoadIncrementalHFiles**
12079
Allows users to bulk load entire tables from hdfs by specifying the parameter -loadTable. This allows you to pass in a table-level directory and have all regions' column families bulk loaded. If you do not specify the -loadTable parameter, LoadIncrementalHFiles will work as before. Note: you must have a pre-created table to run with -loadTable; it will not create one for you.
12081
12082
12083 ---
12084
12085 * [HBASE-19769](https://issues.apache.org/jira/browse/HBASE-19769) | *Critical* | **IllegalAccessError on package-private Hadoop metrics2 classes in MapReduce jobs**
12086
Client-side ZooKeeper metrics which were added to 2.0.0 alpha/beta releases cause issues when launching MapReduce jobs via `yarn jar` on the command line. This stems from the ClassLoader separation that YARN implements. The easiest solution was to remove these ZooKeeper metrics entirely.
12088
12089
12090 ---
12091
12092 * [HBASE-19783](https://issues.apache.org/jira/browse/HBASE-19783) | *Minor* | **Change replication peer cluster key/endpoint from a not-null value to null is not allowed**
12093
To reduce confusing behavior, calling updatePeerConfig with an empty ClusterKey or ReplicationEndpointImpl when the corresponding field of the to-be-updated ReplicationPeerConfig is not null now throws an exception instead of ignoring the empty value.
12095
12096
12097 ---
12098
12099 * [HBASE-19483](https://issues.apache.org/jira/browse/HBASE-19483) | *Major* | **Add proper privilege check for rsgroup commands**
12100
This JIRA aims at refactoring AccessController, using ACL as a core library in CPs.
1. Strips out a public class AccessChecker from AccessController. AccessChecker doesn't have any dependency on anything CP related. Create its instance from other CPs.
2. Changes the default value of hbase.security.authorization to false.
3. Doesn't use CP hooks to check access in RSGroup. Uses the access checker instance directly in functions of RSGroupAdminServiceImpl.
12105
12106
12107 ---
12108
12109 * [HBASE-19358](https://issues.apache.org/jira/browse/HBASE-19358) | *Major* | **Improve the stability of splitting log when do fail over**
12110
HBASE-19358 introduces a new property, hbase.split.writer.creation.bounded, to limit the number of open writers for each WALSplitter. If set to true, we won't open any writer for recovered.edits until the entries accumulated in memory reach hbase.regionserver.hlog.splitlog.buffersize (which defaults to 128M), and we will write and close the file in one go instead of keeping the writer open. It's false by default; we recommend setting it to true if your cluster has a high region load (like more than 300 regions per RS), especially if you have observed an obvious NN/HDFS slowdown during hbase (single RS or cluster) failover.
12112
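The bounded behavior can be pictured with a small buffering sketch. It is a simplified model with hypothetical names: edits are accumulated in memory and only "written and closed" once the configured buffer size is reached, so no writer stays open in between.

```java
import java.util.ArrayList;
import java.util.List;

// Illustrative model only: buffer edits and write them out in one go at the limit.
final class BoundedEditBuffer {
    private final long limitBytes;
    private final List<byte[]> pending = new ArrayList<>();
    private long pendingBytes;
    private int filesWritten;

    BoundedEditBuffer(long limitBytes) {
        this.limitBytes = limitBytes;
    }

    // Buffers one edit; returns true if this append triggered a write-and-close.
    boolean append(byte[] edit) {
        pending.add(edit);
        pendingBytes += edit.length;
        if (pendingBytes < limitBytes) {
            return false; // keep accumulating, no writer is open
        }
        writeAndClose();
        return true;
    }

    // Stand-in for opening a writer, writing all pending edits and closing it.
    private void writeAndClose() {
        filesWritten++;
        pending.clear();
        pendingBytes = 0;
    }

    int filesWritten() {
        return filesWritten;
    }

    long pendingBytes() {
        return pendingBytes;
    }
}
```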
12113
12114 ---
12115
12116 * [HBASE-19651](https://issues.apache.org/jira/browse/HBASE-19651) | *Minor* | **Remove LimitInputStream**
12117
HBase had copied the file LimitInputStream from guava. This commit removes the copied file in favor of (our internal, shaded) guava's ByteStreams.limit. Guava 14.0's LIS noted: "Use ByteStreams.limit(java.io.InputStream, long) instead. This class is scheduled to be removed in Guava release 15.0."
12119
12120
12121 ---
12122
12123 * [HBASE-19691](https://issues.apache.org/jira/browse/HBASE-19691) | *Critical* | **Do not require ADMIN permission for obtaining ClusterStatus**
12124
12125 This change reverts an unintentional requirement for global ADMIN permission to obtain cluster status from the active HMaster.
12126
12127
12128 ---
12129
12130 * [HBASE-19486](https://issues.apache.org/jira/browse/HBASE-19486) | *Major* | ** Periodically ensure records are not buffered too long by BufferedMutator**
12131
The BufferedMutator now supports two settings that are used to ensure records do not stay too long in the buffer of a BufferedMutator. For periodically flushing the BufferedMutator there is now a "Timeout" (how old may the oldest record in the buffer be before we force a flush) and a "TimerTick" (how often do we check whether the timeout has been exceeded). Using these settings you can make the BufferedMutator automatically flush the write buffer if no flush has occurred after the specified number of milliseconds.
12133
This is mainly useful in streaming scenarios (e.g. writing data into HBase using Apache Flink/Beam/Storm) where it is common (especially in a test/development situation) to see small unpredictable bursts of data that need to be written into HBase. Until now, records would remain in the BufferedMutator's write buffer until the buffer was full or an explicit flush was triggered. In practice this meant that the 'last few records' of a burst would remain in the write buffer until the next burst arrived, filling the buffer to capacity and thus triggering a flush.
12135
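How the two settings interact can be modeled with an explicit clock, as in the sketch below. The class is hypothetical and is not the BufferedMutator implementation; it only shows a timer tick checking the age of the oldest buffered record against the timeout.

```java
// Illustrative model only: timeout check performed on every timer tick.
final class PeriodicFlushPolicy {
    private final long timeoutMs;
    private long oldestRecordTs = -1L; // -1 means the buffer is empty

    PeriodicFlushPolicy(long timeoutMs) {
        this.timeoutMs = timeoutMs;
    }

    // Remember when the oldest record currently in the buffer was written.
    void onWrite(long nowMs) {
        if (oldestRecordTs < 0) {
            oldestRecordTs = nowMs;
        }
    }

    // Any flush (explicit, buffer full, or periodic) empties the buffer.
    void onFlush() {
        oldestRecordTs = -1L;
    }

    // Called once per TimerTick; returns true when a periodic flush was forced.
    boolean onTimerTick(long nowMs) {
        if (oldestRecordTs >= 0 && nowMs - oldestRecordTs >= timeoutMs) {
            onFlush();
            return true;
        }
        return false;
    }
}
```

A shorter TimerTick detects an expired Timeout sooner at the cost of more frequent checks.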
12136
12137 ---
12138
12139 * [HBASE-19670](https://issues.apache.org/jira/browse/HBASE-19670) | *Major* | **Workaround: Purge User API building from branch-2 so can make a beta-1**
12140
12141 Disable filtering of User API based off yetus annotation done in doclet. See parent issue for build failure currently being worked on but not done in time for a beta-1.
12142
12143
12144 ---
12145
12146 * [HBASE-19282](https://issues.apache.org/jira/browse/HBASE-19282) | *Major* | **CellChunkMap Benchmarking and User Interface**
12147
When MSLAB is in use (that is the default config), we will always use the CellChunkMap indexing variant for in-memory flushed Immutable segments. When MSLAB is turned off, we will use CellArrayMap. These cannot be changed with any configs. The in-memory flush threshold now defaults to 10% of the region flush size. This can be tuned using 'hbase.memstore.inmemoryflush.threshold.factor'.
12149
12150
12151 ---
12152
12153 * [HBASE-19628](https://issues.apache.org/jira/browse/HBASE-19628) | *Major* | **ByteBufferCell should extend ExtendedCell**
12154
12155 ByteBufferCell → ByteBufferExtendedCell
12156 MapReduceCell → MapReduceExtendedCell
12157 ByteBufferChunkCell → ByteBufferChunkKeyValue
12158 NoTagByteBufferChunkCell → NoTagByteBufferChunkKeyValue
12159 KeyOnlyByteBufferCell → KeyOnlyByteBufferExtendedCell
12160 TagRewriteByteBufferCell → TagRewriteByteBufferExtendedCell
12161 ValueAndTagRewriteByteBufferCell → ValueAndTagRewriteByteBufferExtendedCell
12162 EmptyByteBufferCell → EmptyByteBufferExtendedCell
12163 FirstOnRowByteBufferCell → FirstOnRowByteBufferExtendedCell
12164 LastOnRowByteBufferCell → LastOnRowByteBufferExtendedCell
12165 FirstOnRowColByteBufferCell → FirstOnRowColByteBufferExtendedCell
12166 FirstOnRowColTSByteBufferCell → FirstOnRowColTSByteBufferExtendedCell
LastOnRowColByteBufferCell → LastOnRowColByteBufferExtendedCell
12168 OffheapDecodedCell → OffheapDecodedExtendedCell
12169
12170
12171 ---
12172
12173 * [HBASE-19576](https://issues.apache.org/jira/browse/HBASE-19576) | *Major* | **Introduce builder for ReplicationPeerConfig and make it immutable**
12174
Adds a ReplicationPeerConfigBuilder to create ReplicationPeerConfig and makes ReplicationPeerConfig immutable. Also deprecates the set\* methods in ReplicationPeerConfig.
12176
12177
12178 ---
12179
12180 * [HBASE-10092](https://issues.apache.org/jira/browse/HBASE-10092) | *Critical* | **Move to slf4j**
12181
We now have slf4j as our front-end. Be careful adding logging from here on out; make sure it is slf4j.
12183
12184 From here on out, as us devs go, we need to convert log messages from being 'guarded' -- i.e. surrounded by if (LOG.isDebugEnabled...) -- to instead being parameterized log messages. e.g. the latter rather than the former in the below:
12185
12186 logger.debug("The new entry is "+entry+".");
12187 logger.debug("The new entry is {}.", entry);
12188
12189 See [1] for background on perf benefits.
12190
12191 Note, FATAL log level is not present in slf4j. It is noted as a Marker but won't show in logs as a LEVEL.
12192
12193 1.  https://www.slf4j.org/faq.html#logging\_performance
12194
12195
12196 ---
12197
12198 * [HBASE-19148](https://issues.apache.org/jira/browse/HBASE-19148) | *Blocker* | **Reevaluate default values of configurations**
12199
12200 Removed unused hbase.fs.tmp.dir from hbase-default.xml.
12201
Upped hbase.master.fileSplitTimeout from 30s to 10 minutes (suggested by production experience)
12203
12204 Added note that handler-count should be ~CPU count.
12205
12206 hbase.regionserver.logroll.multiplier has been changed from 0.95 to 0.5 AND the default block size has been doubled.
12207
12208 A few of the core configs are now dumped to the log on startup.
12209
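The combined effect of the logroll multiplier change can be worked through with a little arithmetic. The sketch assumes that the WAL roll threshold is the WAL block size times hbase.regionserver.logroll.multiplier; the 128 MB and 256 MB block sizes in the usage note are illustrative, and the class name is hypothetical.

```java
// Illustrative arithmetic only. Assumption: roll size = WAL block size * multiplier.
final class WalRollSize {
    static long rollSizeBytes(long walBlockSizeBytes, double multiplier) {
        return (long) (walBlockSizeBytes * multiplier);
    }
}
```

Under these assumptions, a 128 MB block with the old multiplier of 0.95 rolls at roughly 121.6 MB, while a doubled 256 MB block with the new multiplier of 0.5 rolls at 128 MB.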
12210
12211 ---
12212
12213 * [HBASE-19492](https://issues.apache.org/jira/browse/HBASE-19492) | *Major* | **Add EXCLUDE\_NAMESPACE and EXCLUDE\_TABLECFS support to replication peer config**
12214
Adds two new fields, EXCLUDE\_NAMESPACE and EXCLUDE\_TABLECFS, to the replication peer config.

If the replicate\_all flag is true, all user tables are replicated to the peer cluster. You can then configure excluded namespaces or excluded table-cfs, which will not be replicated to the peer cluster.

If the replicate\_all flag is false, no user tables are replicated to the peer cluster by default. You can then configure the namespaces or table-cfs which will be replicated to the peer cluster.
12220
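The two modes can be summarized at the namespace level with the sketch below. The class and method are hypothetical and cover namespaces only; table-cfs follow the same include/exclude logic.

```java
import java.util.Set;

// Illustrative sketch only: namespace-level replication decision for a peer.
final class PeerReplicationScope {
    static boolean replicatesNamespace(boolean replicateAll,
                                       Set<String> namespaces,
                                       Set<String> excludeNamespaces,
                                       String namespace) {
        if (replicateAll) {
            // Everything is replicated except what is explicitly excluded.
            return !excludeNamespaces.contains(namespace);
        }
        // Nothing is replicated except what is explicitly listed.
        return namespaces.contains(namespace);
    }
}
```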
12221
12222 ---
12223
12224 * [HBASE-19494](https://issues.apache.org/jira/browse/HBASE-19494) | *Major* | **Create simple WALKey filter that can be plugged in on the Replication Sink**
12225
Adds a means of installing a very basic filter on the sink side of replication. We already have a means of installing a filter source-side, which is a better place to filter edits before they are shipped over the network, but this facility is needed by hbase-indexer.

Set hbase.replication.sink.walentrysinkfilter to an implementation that has a no-param constructor. See test in patch for example.
12229
12230
12231 ---
12232
12233 * [HBASE-19112](https://issues.apache.org/jira/browse/HBASE-19112) | *Blocker* | **Suspect methods on Cell to be deprecated**
12234
12235 Adds method Cell#getType which returns enum describing Cell Type.
12236
12237 Deprecates the following Cell methods:
12238
12239  getTypeByte
12240  getSequenceId
12241  getTagsArray
12242  getTagsOffset
12243  getTagsLength
12244
12245 CPs trying to build cells should use RawCellBuilderFactory that supports  building cells with tags.
12246
12247
12248 ---
12249
12250 * [HBASE-14790](https://issues.apache.org/jira/browse/HBASE-14790) | *Major* | **Implement a new DFSOutputStream for logging WAL only**
12251
Implements a FanOutOneBlockAsyncDFSOutput for writing WAL only; the WAL provider which uses this class is AsyncFSWALProvider.

It is based on netty, and writes to 3 DNs concurrently (fan-out), so generally it leads to lower latency. It is also fail-fast: the stream becomes unwritable immediately after any read/write error, with no pipeline recovery. You need to call recoverLease to force close the output in this case. It only supports writing a file with a single block. For WAL this is good behavior, as we can always open a new file when the old one is broken. The performance analysis in HBASE-16890 shows that it has better performance.

Behavior changes:
1. As we now write to 3 DNs concurrently, according to the visibility guarantee of HDFS, the data will be available immediately when arriving at a DN since all the DNs will be considered as the last one in the pipeline. This means replication may read uncommitted data and replicate it to the remote cluster and cause data inconsistency. HBASE-14004 is used to solve the problem.
2. There will be no sync failure. When the output is broken, we will open a new file and write all the unacked wal entries to the new file. This means that we may have duplicated entries in wal files. HBASE-14949 is used to solve this problem.
12259
12260
12261 ---
12262
12263 * [HBASE-15536](https://issues.apache.org/jira/browse/HBASE-15536) | *Critical* | **Make AsyncFSWAL as our default WAL**
12264
The default WALProvider is now AsyncFSWALProvider, i.e. 'asyncfs'.
To switch back to FSHLog, add this to hbase-site.xml:
12267 {code}
12268 \<property\>
12269 \<name\>hbase.wal.provider\</name\>
12270 \<value\>filesystem\</value\>
12271 \</property\>
12272 {code}
To use FSHLog with multiwal, add this to hbase-site.xml:
12274 {code}
12275 \<property\>
12276 \<name\>hbase.wal.regiongrouping.delegate.provider\</name\>
12277 \<value\>filesystem\</value\>
12278 \</property\>
12279 {code}
12280
12281 This patch also sets hbase.wal.async.use-shared-event-loop to false so WAL has its own netty event group.
12282
12283
12284 ---
12285
12286 * [HBASE-19462](https://issues.apache.org/jira/browse/HBASE-19462) | *Major* | **Deprecate all addImmutable methods in Put**
12287
Deprecates Put#addImmutable as of release 2.0.0; it will be removed in HBase 3.0.0. Use Put#add(Cell) and org.apache.hadoop.hbase.CellBuilder instead.
12289
12290
12291 ---
12292
12293 * [HBASE-19213](https://issues.apache.org/jira/browse/HBASE-19213) | *Minor* | **Align check and mutate operations in Table and AsyncTable**
12294
Deprecates the checkAndPut, checkAndDelete and checkAndMutate methods in the Table interface.
As in AsyncTable, a new method replaces the deprecated ones: CheckAndMutateBuilder checkAndMutate(byte[] row, byte[] family). The returned CheckAndMutateBuilder is used to construct the checkAnd\*() operations.
12297
12298
12299 ---
12300
12301 * [HBASE-19134](https://issues.apache.org/jira/browse/HBASE-19134) | *Major* | **Make WALKey an Interface; expose Read-Only version to CPs**
12302
12303 Made WALKey an Interface and added a WALKeyImpl implementation. WALKey comes through to Coprocessors. WALKey is read-only.
12304
12305
12306 ---
12307
12308 * [HBASE-18169](https://issues.apache.org/jira/browse/HBASE-18169) | *Blocker* | **Coprocessor fix and cleanup before 2.0.0 release**
12309
12310 Refactor of Coprocessor API for hbase2. Purged methods that exposed too much of our internals. Other hooks were recast so they no longer took or returned internal classes; instead we pass Interfaces or read-only versions of implementations.
12311
12312 Here is some overview doc on changes in hbase2 for Coprocessors including detail on why the change was made:
12313 https://github.com/apache/hbase/blob/branch-2.0/dev-support/design-docs/Coprocessor\_Design\_Improvements-Use\_composition\_instead\_of\_inheritance-HBASE-17732.adoc
12314
12315
12316 ---
12317
12318 * [HBASE-19301](https://issues.apache.org/jira/browse/HBASE-19301) | *Major* | **Provide way for CPs to create short circuited connection with custom configurations**
12319
12320 Provided a way for the CP users to create a short circuitable connection with custom configs.
12321
12322 createConnection(Configuration) is added to MasterCoprocessorEnvironment, RegionServerCoprocessorEnvironment and RegionCoprocessorEnvironment.
12323
The getConnection() method already available in these Env interfaces returns the cluster connection that the server itself uses, whereas the new method creates a new connection on request. Unlike a connection created through the ConnectionFactory APIs, this connection can short-circuit calls to the same server, avoiding the RPC path. The connection is NOT cached or maintained by the server; that should be done by the CPs.

Be careful creating Connections out of a Coprocessor. See the javadoc on createConnection and getConnection.
12327
12328
12329 ---
12330
12331 * [HBASE-19357](https://issues.apache.org/jira/browse/HBASE-19357) | *Major* | **Bucket cache no longer L2 for LRU cache**
12332
Removed the cacheDataInL1 option for HCD.
BucketCache (BC) is no longer the L2 for the on-heap LRU cache. When BC is used, data blocks are kept strictly in BC, whereas index/bloom blocks are kept in the LRU L1 cache.
Config 'hbase.bucketcache.combinedcache.enabled' is removed. There is no longer a way to set combined mode to false, that is, to make BC the victim handler for the LRU cache.
One more noticeable change applies when BucketCache is used in file mode: data blocks of system tables (including the META table) are cached in BucketCache files only. A test doing plain scans from META files alone showed that throughput with file-mode BC is only about half. However, client-side connections keep a RegionLocation cache for META entries, so this should not be a big concern in real cluster usage. We will check more on this and probably fix it when we do tiered BucketCache.
12337
12338
12339 ---
12340
12341 * [HBASE-19430](https://issues.apache.org/jira/browse/HBASE-19430) | *Major* | **Remove the SettableTimestamp and SettableSequenceId**
12342
All cells used on the server side are now of type ExtendedCell.
12344
12345
12346 ---
12347
12348 * [HBASE-19295](https://issues.apache.org/jira/browse/HBASE-19295) | *Major* | **The Configuration returned by CPEnv should be read-only.**
12349
12350 CoprocessorEnvironment#getConfiguration returns a READ-ONLY Configuration. Attempts at altering the returned Configuration -- whether setting or adding resources -- will result in an IllegalStateException warning of the Read-only condition of the returned Configuration.
12351
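As a rough illustration of the read-only behavior described above, the sketch below wraps a map of settings and rejects writes. It is plain Java and does not use the HBase or Hadoop Configuration classes; ReadOnlyConfigModel and its methods are hypothetical names chosen for this example, and the exception type simply follows the one named in the note.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical model, not the HBase or Hadoop Configuration class.
final class ReadOnlyConfigModel {
    private final Map<String, String> values;

    ReadOnlyConfigModel(Map<String, String> source) {
        // Defensive copy so later changes to the source map are not visible here.
        this.values = new HashMap<>(source);
    }

    // Reads pass through to the stored values.
    String get(String key) {
        return values.get(key);
    }

    // Any mutation is rejected, following the IllegalStateException the note describes.
    void set(String key, String value) {
        throw new IllegalStateException("Read-only Configuration");
    }
}
```

A Coprocessor that needs different settings would copy the values it cares about into its own object rather than calling set on the returned one.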
12352
12353 ---
12354
12355 * [HBASE-19410](https://issues.apache.org/jira/browse/HBASE-19410) | *Major* | **Move zookeeper related UTs to hbase-zookeeper and mark them as ZKTests**
12356
There is a new HBaseZKTestingUtility, which can only start a mini ZooKeeper cluster. We will also publish sources for the test-jar of all modules.
12358
12359
12360 ---
12361
12362 * [HBASE-19323](https://issues.apache.org/jira/browse/HBASE-19323) | *Major* | **Make netty engine default in hbase2**
12363
12364 NettyRpcServer is now our default RPC server replacing SimpleRpcServer.
12365
12366
12367 ---
12368
12369 * [HBASE-19426](https://issues.apache.org/jira/browse/HBASE-19426) | *Major* | **Move has() and setTimestamp() to Mutation**
12370
Moves #has and #setTimestamp back up to Mutation from the subclass Put so that they are available to other Mutation implementations.
12372
12373
12374 ---
12375
12376 * [HBASE-19384](https://issues.apache.org/jira/browse/HBASE-19384) | *Critical* | **Results returned by preAppend hook in a coprocessor are replaced with null from other coprocessor even on bypass**
12377
12378 When a coprocessor sets 'bypass', we will skip calling subsequent Coprocessors that may be stacked-up on the method invocation; e.g. if a prePut has three coprocessors hooked up, if the first coprocessor decides to set 'bypass', we will not call the two subsequent coprocessors (this is similar to the 'complete' functionality that was in hbase1, removed in hbase2).
12379
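The chain behavior described above can be modeled in a few lines. This is a sketch in plain Java under simplifying assumptions: Context, PreHook and runHooks are hypothetical stand-ins, not the HBase ObserverContext or RegionObserver types.

```java
import java.util.List;

// Hypothetical model of a coprocessor invocation chain; none of these types
// are HBase classes.
final class BypassChainModel {
    // Minimal stand-in for an observer context that carries the bypass flag.
    static final class Context {
        private boolean bypass;

        void bypass() {
            bypass = true;
        }

        boolean isBypassed() {
            return bypass;
        }
    }

    // Stand-in for a pre-hook such as prePut.
    interface PreHook {
        void call(Context ctx);
    }

    // Runs the hooks in order and stops as soon as one sets bypass.
    // Returns the number of hooks that were actually invoked.
    static int runHooks(List<PreHook> hooks) {
        Context ctx = new Context();
        int invoked = 0;
        for (PreHook hook : hooks) {
            hook.call(ctx);
            invoked++;
            if (ctx.isBypassed()) {
                break; // the remaining coprocessors are skipped
            }
        }
        return invoked;
    }
}
```

With three hooks where the first one calls bypass, runHooks invokes only that first hook, which matches the prePut example in the note.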
12380
12381 ---
12382
12383 * [HBASE-19408](https://issues.apache.org/jira/browse/HBASE-19408) | *Trivial* | **Remove WALActionsListener.Base**
12384
1) Removes WALActionsListener.Base.
2) Provides default method implementations in WALActionsListener.
Anyone who wants to receive notifications of WAL events should implement WALActionsListener directly instead of extending WALActionsListener.Base.
12388
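The pattern behind this change, default methods on the interface taking the place of an abstract Base class, can be sketched as follows. The listener and its method names here are invented for illustration and are not the real WALActionsListener methods.

```java
// Hypothetical listener used only to illustrate the pattern; the method names
// are invented and are not the real WALActionsListener methods.
interface WalEventListener {
    // Default no-op bodies give implementors what a Base class used to provide.
    default void onRoll(String newPath) {
    }

    default void onClose() {
    }
}

// An implementor overrides only the events it cares about.
final class RollCounter implements WalEventListener {
    int rolls;

    @Override
    public void onRoll(String newPath) {
        rolls++;
    }
}
```

Because the interface carries the no-op defaults, an implementor can still extend some other class, which was not possible when the defaults lived in a Base class.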
12389
12390 ---
12391
12392 * [HBASE-19339](https://issues.apache.org/jira/browse/HBASE-19339) | *Critical* | **Eager policy results in the negative size of memstore**
12393
12394 Enable TestAcidGuaranteesWithEagerPolicy and TestAcidGuaranteesWithAdaptivePolicy
12395
12396
12397 ---
12398
12399 * [HBASE-19336](https://issues.apache.org/jira/browse/HBASE-19336) | *Major* | **Improve rsgroup to allow assign all tables within a specified namespace by only writing namespace**
12400
Adds two new shell commands.
move\_namespaces\_rsgroup reassigns the tables of the specified namespaces from one RegionServer group to another.
move\_servers\_namespaces\_rsgroup reassigns RegionServers and the tables of the specified namespaces from one group to another.
12404
12405
12406 ---
12407
12408 * [HBASE-19285](https://issues.apache.org/jira/browse/HBASE-19285) | *Critical* | **Add per-table latency histograms**
12409
12410 Per-RegionServer table latency histograms have been returned to HBase (after being removed due to impacting performance). These metrics are exposed via a new JMX bean "TableLatencies" with the typical naming conventions: namespace, table, and histogram component.
12411
12412
12413 ---
12414
12415 * [HBASE-19359](https://issues.apache.org/jira/browse/HBASE-19359) | *Major* | **Revisit the default config of hbase client retries number**
12416
The default value of hbase.client.retries.number was 35. It is now 10.
On the server side, the default hbase.client.serverside.retries.multiplier was 10, so the server-side retries number was 35 \* 10 = 350. The multiplier default is now 3, so the server-side retries number is 10 \* 3 = 30.
12419
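The arithmetic in this note can be checked with a small sketch. It is not HBase code: RetryDefaults is a hypothetical name, the new value 3 is read as the multiplier default, and the server-side retry count is assumed to be the client retry count times the multiplier, as the 35 \* 10 = 350 example implies.

```java
// Hypothetical helper, not part of HBase: models how the server-side retry
// count is derived from the two settings named in the note.
final class RetryDefaults {
    // Defaults before this change, as stated in the note.
    static final int OLD_CLIENT_RETRIES = 35;
    static final int OLD_SERVERSIDE_MULTIPLIER = 10;
    // Defaults after this change (3 is read as the new multiplier default).
    static final int NEW_CLIENT_RETRIES = 10;
    static final int NEW_SERVERSIDE_MULTIPLIER = 3;

    // Assumption: server-side retries = client retries * multiplier.
    static int serverSideRetries(int clientRetries, int multiplier) {
        return Math.multiplyExact(clientRetries, multiplier);
    }
}
```

Under these assumptions the old defaults give 350 server-side retries and the new defaults give 30.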
12420
12421 ---
12422
12423 * [HBASE-18090](https://issues.apache.org/jira/browse/HBASE-18090) | *Major* | **Improve TableSnapshotInputFormat to allow more multiple mappers per region**
12424
In this task, we make it possible to run multiple mappers per region in a table snapshot. The following code is the primary table snapshot mapper initialization:

    TableMapReduceUtil.initTableSnapshotMapperJob(
        snapshotName,      // The name of the snapshot (of a table) to read from
        scan,              // Scan instance to control CF and attribute selection
        mapper,            // mapper
        outputKeyClass,    // mapper output key
        outputValueClass,  // mapper output value
        job,               // The current job to adjust
        true,              // upload HBase jars and jars for any of the configured job classes via the distributed cache (tmpjars)
        restoreDir         // a temporary directory to copy the snapshot files into
    );

With this call, the job runs only one map task per region in the table snapshot. With this feature, the client can specify the desired number of mappers when initializing the table snapshot mapper job:

    TableMapReduceUtil.initTableSnapshotMapperJob(
        snapshotName,      // The name of the snapshot (of a table) to read from
        scan,              // Scan instance to control CF and attribute selection
        mapper,            // mapper
        outputKeyClass,    // mapper output key
        outputValueClass,  // mapper output value
        job,               // The current job to adjust
        true,              // upload HBase jars and jars for any of the configured job classes via the distributed cache (tmpjars)
        restoreDir,        // a temporary directory to copy the snapshot files into
        splitAlgorithm,    // split algorithm; currently supported are RegionSplitter.UniformSplit() and RegionSplitter.HexStringSplit()
        n                  // how many input splits to generate per region
    );
12452
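The effect of the last parameter can be pictured with the sketch below, which cuts one region's key range into n sub-ranges, one per mapper. It is a simplified model in plain Java: keys are represented as longs rather than byte arrays, and RegionSplitModel is a hypothetical name, not the RegionSplitter.UniformSplit implementation.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical model: splits one region's key range, represented here as a
// half-open range [start, end) of longs, into n input splits of near-equal width.
final class RegionSplitModel {
    static List<long[]> split(long start, long end, int n) {
        if (n < 1 || end <= start) {
            throw new IllegalArgumentException("need n >= 1 and start < end");
        }
        List<long[]> splits = new ArrayList<>();
        long width = end - start; // kept small in this sketch; no overflow handling
        for (int i = 0; i < n; i++) {
            long lo = start + width * i / n;
            long hi = start + width * (i + 1) / n;
            if (hi > lo) {
                splits.add(new long[] {lo, hi}); // skip empty sub-ranges
            }
        }
        return splits;
    }
}
```

For a range of 0 to 100 and n = 4, the model yields four sub-ranges of width 25, so four mappers would each read a quarter of the region.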
12453
12454 ---
12455
12456 * [HBASE-19035](https://issues.apache.org/jira/browse/HBASE-19035) | *Major* | **Miss metrics when coprocessor use region scanner to read data**
12457
1. Moves the read request count to the region level, because RegionScanner is exposed to CPs.
2. Updates the write request count in processRowsWithLocks.
3. Removes requestRowActionCount from RSRpcServices. This metric can be computed from the region's readRequestsCount and writeRequestsCount.
12461
12462
12463 ---
12464
12465 * [HBASE-19318](https://issues.apache.org/jira/browse/HBASE-19318) | *Critical* | **MasterRpcServices#getSecurityCapabilities explicitly checks for the HBase AccessController implementation**
12466
Fixes an issue with loading custom coprocessor endpoint implementations inside the HBase Master that broke Apache Ranger.
12468
12469
12470 ---
12471
12472 * [HBASE-19092](https://issues.apache.org/jira/browse/HBASE-19092) | *Critical* | **Make Tag IA.LimitedPrivate and expose for CPs**
12473
12474 This JIRA aims at exposing Tags for Coprocessor usage.
12475 Tag interface is now exposed to Coprocessors and CPs can make use of this interface to create their own Tags.
12476 RawCell is a new interface that is a subtype of Cell and that is exposed to CPs. RawCell has the following APIs
12477
12478 List\<Tag\> getTags()
12479 Optional\<Tag\> getTag(byte type)
12480 byte[] cloneTags()
12481
The above APIs help to read tags from the Cell.
12483
12484 CellUtil#createCell(Cell cell, List\<Tag\> tags)
12485 CellUtil#createCell(Cell cell, byte[] tags)
12486 CellUtil#createCell(Cell cell, byte[] value, byte[] tags)
12487 are deprecated.
12488 If CPs want to create a cell with Tags they can use the RegionCoprocessorEnvironment#getCellBuilder() that returns an ExtendedCellBuilder.
12489 Using ExtendedCellBuilder the CP can create Cells with Tags. Other helper methods to work on Tags are available as static APIs in Tag interface.
12490
12491
12492 ---
12493
12494 * [HBASE-19266](https://issues.apache.org/jira/browse/HBASE-19266) | *Minor* | **TestAcidGuarantees should cover adaptive in-memory compaction**
12495
Separates TestAcidGuarantees by policy:
12497 1) NONE -\> TestAcidGuaranteesWithNoInMemCompaction
12498 2) BASIC -\> TestAcidGuaranteesWithBasicPolicy
12499 3) EAGER -\> TestAcidGuaranteesWithEagerPolicy
12500 4) ADAPTIVE -\> TestAcidGuaranteesWithAdaptivePolicy
12501
TestAcidGuaranteesWithEagerPolicy and TestAcidGuaranteesWithAdaptivePolicy are disabled by default, as the eager policy may cause a negative memstore size.
12503
12504
12505 ---
12506
12507 * [HBASE-16868](https://issues.apache.org/jira/browse/HBASE-16868) | *Critical* | **Add a replicate\_all flag to avoid misuse the namespaces and table-cfs config of replication peer**
12508
Adds a replicate\_all flag to the replication peer config. The default value is true, which means all user tables (REPLICATION\_SCOPE != 0) will be replicated to the peer cluster.

How do I change a peer from replicating everything to replicating only specific namespaces/table-cfs?
Step 1. Add a new peer with no namespace/table-cfs config; the replicate\_all flag will be true automatically.
Step 2. To replicate only some namespaces or tables, first set the replicate\_all flag to false.
Step 3. Add the specific namespaces or table-cfs config to the replication peer.

How do I change a peer from replicating specific namespaces/table-cfs to replicating everything?
Step 1. Add a new peer with a specific namespace/table-cfs config; the replicate\_all flag will be false automatically.
Step 2. To replicate all user tables, first remove the specific namespace/table-cfs config.
Step 3. Set the replicate\_all flag to true.

How do I configure a peer to replicate nothing?
Set the replicate\_all flag to false and leave the namespace/table-cfs config empty; then no tables are replicated to the peer cluster.
12523
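The rules above can be summarized as a small decision function. This is a sketch in plain Java, not HBase code: ReplicateAllModel is a hypothetical name, tables are identified as "namespace:table" strings, column families are ignored, and every table passed in is assumed to be a user table with REPLICATION\_SCOPE != 0.

```java
import java.util.Set;

// Hypothetical model of the peer-config decision; not HBase code.
final class ReplicateAllModel {
    // Decides whether a user table is replicated to the peer under the rules
    // described in the note. Column families are not modeled.
    static boolean isReplicated(boolean replicateAll,
                                Set<String> namespaces,
                                Set<String> tables,
                                String namespace,
                                String table) {
        if (replicateAll) {
            return true; // every user table goes to the peer
        }
        // replicate_all is false: only configured namespaces or tables replicate.
        return namespaces.contains(namespace)
            || tables.contains(namespace + ":" + table);
    }
}
```

With the flag set to false and both sets empty, the function returns false for every table, which is the "replicate nothing" case.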
12524
12525 ---
12526
12527 * [HBASE-19122](https://issues.apache.org/jira/browse/HBASE-19122) | *Critical* | **preCompact and preFlush can bypass by returning null scanner; shut it down**
12528
12529 Remove the ability to 'bypass' preFlush and preCompact by returning a null Scanner. Bypass is disallowed on these methods in hbase2.
12530
12531
12532 ---
12533
12534 * [HBASE-19200](https://issues.apache.org/jira/browse/HBASE-19200) | *Major* | **make hbase-client only depend on ZKAsyncRegistry and ZNodePaths**
12535
12536 ConnectionImplementation now uses asynchronous connections to zookeeper via ZKAsyncRegistry to get cluster id, master address, meta region location, etc.
12537 Since ZKAsyncRegistry uses curator framework, this change purges a lot of zookeeper dependencies in hbase-client.
Now hbase-client depends only on ZKAsyncRegistry, ZNodePaths and the newly introduced ZKMetadata.
12539
12540
12541 ---
12542
12543 * [HBASE-19311](https://issues.apache.org/jira/browse/HBASE-19311) | *Major* | **Promote TestAcidGuarantees to LargeTests and start mini cluster once to make it faster**
12544
Introduces an AcidGuaranteesTestTool and exposes it as the tool instead of TestAcidGuarantees. TestAcidGuarantees is now just a UT.
12546
12547
12548 ---
12549
12550 * [HBASE-19293](https://issues.apache.org/jira/browse/HBASE-19293) | *Major* | **Support adding a new replication peer in disabled state**
12551
Adds a boolean parameter to the addReplicationPeer method of Admin/AsyncAdmin that sets whether the new replication peer starts enabled or disabled. You can also use the shell to add an enabled or disabled replication peer. The STATE parameter is optional and the default state is enabled.
12553
12554 hbase\> add\_peer '1', CLUSTER\_KEY =\> "server1.cie.com:2181:/hbase", STATE =\> "ENABLED"
12555 hbase\> add\_peer '1', CLUSTER\_KEY =\> "server1.cie.com:2181:/hbase", STATE =\> "DISABLED"
12556
12557
12558 ---
12559
12560 * [HBASE-19123](https://issues.apache.org/jira/browse/HBASE-19123) | *Major* | **Purge 'complete' support from Coprocesor Observers**
12561
12562 This issue removes the 'complete' facility that was in ObserverContext. It is no longer possible for a Coprocessor to cut the chain-of-invocation and insist its response prevails.
12563
12564
12565 ---
12566
12567 * [HBASE-18911](https://issues.apache.org/jira/browse/HBASE-18911) | *Major* | **Unify Admin and AsyncAdmin's methods name**
12568
Deprecates 4 methods in the Admin interface.
Deprecated compactRegionServer(ServerName, boolean). Use compactRegionServer(ServerName) and majorCompactRegionServer(ServerName) instead.
Deprecated getRegionLoad(ServerName) method. Use getRegionLoads(ServerName) instead.
Deprecated getRegionLoad(ServerName, TableName) method. Use getRegionLoads(ServerName, TableName) instead.
Deprecated getQuotaRetriever(QuotaFilter). Use getQuota(QuotaFilter) instead.

Adds 7 methods to the Admin interface.
12576 ServerName getMaster();
12577 Collection\<ServerName\> getBackupMasters();
12578 Collection\<ServerName\> getRegionServers();
12579 boolean splitSwitch(boolean enabled, boolean synchronous);
12580 boolean mergeSwitch(boolean enabled, boolean synchronous);
12581 boolean isSplitEnabled();
12582 boolean isMergeEnabled();
12583
12584
12585 ---
12586
12587 * [HBASE-18703](https://issues.apache.org/jira/browse/HBASE-18703) | *Critical* | **Inconsistent behavior for preBatchMutate in doMiniBatchMutate and processRowsWithLocks**
12588
12589 Two write paths Region.batchMutate() and Region.mutateRows() are unified and inconsistencies are resolved.
12590
12591
12592 ---
12593
12594 * [HBASE-18964](https://issues.apache.org/jira/browse/HBASE-18964) | *Major* | **Deprecate RowProcessor and processRowsWithLocks() APIs that take RowProcessor as an argument**
12595
12596 RowProcessor and Region#processRowsWithLocks() methods that take RowProcessor as an argument are deprecated. Use Coprocessors if you want to customize handling.
12597
12598
12599 ---
12600
12601 * [HBASE-19251](https://issues.apache.org/jira/browse/HBASE-19251) | *Major* | **Merge RawAsyncTable and AsyncTable**
12602
Merges the RawAsyncTable and AsyncTable interfaces. Generics are used to reflect the difference between the observer-style scan APIs. For the implementation that does not have a user-specified thread pool, the observer is AdvancedScanResultConsumer. For the implementation that needs a user-specified thread pool, the observer is ScanResultConsumer.
12604
12605
12606 ---
12607
12608 * [HBASE-19262](https://issues.apache.org/jira/browse/HBASE-19262) | *Major* | **Revisit checkstyle rules**
12609
Changes the import order rule so that shaded imports now go at the bottom. Ignores the VisibilityModifier warnings for test code.
12611
12612
12613 ---
12614
12615 * [HBASE-19187](https://issues.apache.org/jira/browse/HBASE-19187) | *Minor* | **Remove option to create on heap bucket cache**
12616
Removes the on-heap BucketCache feature.
The config "hbase.bucketcache.ioengine" no longer supports the 'heap' value.
Its supported values are now 'offheap', 'file:\<path\>', 'files:\<path\>' and 'mmap:\<path\>'.
12620
12621
12622 ---
12623
12624 * [HBASE-12350](https://issues.apache.org/jira/browse/HBASE-12350) | *Minor* | **Backport error-prone build support to branch-1 and branch-2**
12625
12626 This change introduces compile time support for running the error-prone suite of static analyses. Enable with -PerrorProne on the Maven command line. Requires JDK 8 or higher. (Don't enable if building with JDK 7.)
12627
12628
12629 ---
12630
12631 * [HBASE-14350](https://issues.apache.org/jira/browse/HBASE-14350) | *Blocker* | **Procedure V2 Phase 2: Assignment Manager**
12632
12633 (Incomplete)
12634
= Incompatibles
12636
12637 == Coprocessor Incompatibilities
12638
Split/Merge have moved to the Master, which now runs them. This means the hooks around Split/Merge are now no-ops. To intercept Split/Merge phases, CPs need to intercept on MasterObserver.
12640
12641
12642 ---
12643
12644 * [HBASE-19189](https://issues.apache.org/jira/browse/HBASE-19189) | *Major* | **Ad-hoc test job for running a subset of tests lots of times**
12645
12646 <!-- markdown -->
12647
12648
12649 Folks can now test out tests on an arbitrary release branch. Head over to [builds.a.o job "HBase-adhoc-run-tests"](https://builds.apache.org/view/H-L/view/HBase/job/HBase-adhoc-run-tests/), then pick "Build with parameters".
Tests are specified by name only, e.g. TestLogRollingNoCluster. A name can also be a glob, e.g. TestHFile*
12651
12652
12653 ---
12654
12655 * [HBASE-19220](https://issues.apache.org/jira/browse/HBASE-19220) | *Major* | **Async tests time out talking to zk; 'clusterid came back null'**
12656
12657 Changed retries from 3 to 30 for zk initial connect for registry.
12658
12659
12660 ---
12661
12662 * [HBASE-19002](https://issues.apache.org/jira/browse/HBASE-19002) | *Minor* | **Introduce more examples to show how to intercept normal region operations**
12663
12664 With the change in Coprocessor APIs, the hbase-examples module has been updated to provide additional examples that show how to write Coprocessors against the new API.
12665
12666
12667 ---
12668
12669 * [HBASE-18961](https://issues.apache.org/jira/browse/HBASE-18961) | *Major* | **doMiniBatchMutate() is big, split it into smaller methods**
12670
HRegion.batchMutate()/doMiniBatchMutate() is refactored with the aim of unifying the batchMutate() and mutateRows() code paths later. batchMutate() currently handles 2 types of batches: MutationBatchOperations and ReplayBatchOperations. The common base class BatchOperations is augmented with common methods, which are overridden in derived classes as needed. doMiniBatchMutate() is implemented using the common methods in the base class BatchOperations.
12672
12673
12674 ---
12675
12676 * [HBASE-19103](https://issues.apache.org/jira/browse/HBASE-19103) | *Minor* | **Add BigDecimalComparator for filter**
12677
If values are stored as BigDecimal and you need a matching comparator for the value filter when scanning, you can use BigDecimalComparator.
12679
12680
12681 ---
12682
12683 * [HBASE-19111](https://issues.apache.org/jira/browse/HBASE-19111) | *Critical* | **Add missing CellUtil#isPut(Cell) methods**
12684
12685 A new public API method was added to CellUtil "isPut(Cell)" for clients to use to determine if the Cell is for a Put operation.
12686
12687 Additionally, other CellUtil API calls which expose Cell-implementation were marked as deprecated and will be removed in a future version.
12688
12689
12690 ---
12691
12692 * [HBASE-19160](https://issues.apache.org/jira/browse/HBASE-19160) | *Critical* | **Re-expose CellComparator**
12693
12694 CellComparator is now InterfaceAudience.Public
12695
12696
12697 ---
12698
12699 * [HBASE-19131](https://issues.apache.org/jira/browse/HBASE-19131) | *Major* | **Add the ClusterStatus hook and cleanup other hooks which can be replaced by ClusterStatus hook**
12700
12701 1) Add preGetClusterStatus() and postGetClusterStatus() hooks
12702 2) add preGetClusterStatus() to access control check - an admin action
12703
12704
12705 ---
12706
12707 * [HBASE-19095](https://issues.apache.org/jira/browse/HBASE-19095) | *Major* | **Add CP hooks in RegionObserver for in memory compaction**
12708
12709 Add 4 methods in RegionObserver:
12710 preMemStoreCompaction
12711 preMemStoreCompactionCompactScannerOpen
12712 preMemStoreCompactionCompact
12713 postMemStoreCompaction
preMemStoreCompaction and postMemStoreCompaction are always called for all in-memory compactions. Under eager mode, preMemStoreCompactionCompactScannerOpen is called before opening the store scanner to let you change the max versions and TTL, and preMemStoreCompactionCompact is called after the scanner is created to let you wrap it.
12715
12716
12717 ---
12718
12719 * [HBASE-19152](https://issues.apache.org/jira/browse/HBASE-19152) | *Trivial* | **Update refguide 'how to build an RC' and the make\_rc.sh script**
12720
12721 The make\_rc.sh script can run an hbase2 build now generating tarballs and pushing up to maven repository. TODO: Sign and checksum, check tarball, push to apache dist.....
12722
12723
12724 ---
12725
12726 * [HBASE-19179](https://issues.apache.org/jira/browse/HBASE-19179) | *Critical* | **Remove hbase-prefix-tree**
12727
12728 Purged the hbase-prefix-tree module and all references from the code base.
12729
12730 prefix-tree data block encoding was a super cool experimental feature that saw some usage initially but has since languished. If interested in carrying this sweet facility forward, write the dev list and we'll restore this module.
12731
12732
12733 ---
12734
12735 * [HBASE-19176](https://issues.apache.org/jira/browse/HBASE-19176) | *Major* | **Remove hbase-native-client from branch-2**
12736
12737 Removed the hbase-native-client module from branch-2 (it is still in Master). It is not complete. Look for a finished C++ client in the near future. Will restore native client to branch-2 at that point.
12738
12739
12740 ---
12741
12742 * [HBASE-19144](https://issues.apache.org/jira/browse/HBASE-19144) | *Major* | **[RSgroups] Retry assignments in FAILED\_OPEN state when servers (re)join the cluster**
12743
12744 When regionserver placement groups (RSGroups) is active, as servers join the cluster the Master will attempt to reassign regions in FAILED\_OPEN state.
12745
12746
12747 ---
12748
12749 * [HBASE-18770](https://issues.apache.org/jira/browse/HBASE-18770) | *Critical* | **Remove bypass method in ObserverContext and implement the 'bypass' logic case by case**
12750
12751 Removes blanket bypass mechanism (Observer#bypass). Instead, a curated subset of methods are bypassable.
12752
12753     Changes Coprocessor ObserverContext 'bypass' semantic. We flip the
12754     default so bypass is NOT supported on Observer invocations; only a
12755     couple of preXXX methods in RegionObserver allow it: e.g.  preGet
    and prePut but not preFlush, etc. Everywhere else, we throw
    an Exception if a Coprocessor Observer tries to invoke bypass. Master
12758     Observers can no longer stop or change move, split, assign, create table, etc.
12759     preBatchMutate can no longer be bypassed (bypass the finer-grained
12760     prePut, preDelete, etc. instead)
12761
    Ditto on complete, the mechanism that allowed a Coprocessor
    to rule that all subsequent Coprocessors are skipped in an
12764     invocation chain; now, complete is only available to
12765     bypassable methods (and Coprocessors will get an exception if
12766     they try to 'complete' when it is not allowed).
12767
12768     See javadoc for whether a Coprocessor Observer method supports
12769     'bypass'. If no mention, 'bypass' is NOT supported.
12770
12771 The below methods have been marked deprecated in hbase2. We would have liked to have removed them because they use IA.Private parameters but they are in use by CoreCoprocessors or are critical to downstreamers and we have no alternatives to provide currently.
12772
12773 @Deprecated public boolean prePrepareTimeStampForDeleteVersion(final Mutation mutation, final Cell kv, final byte[] byteNow, final Get get) throws IOException {
12774
12775 @Deprecated public boolean preWALRestore(final RegionInfo info, final WALKey logKey, final WALEdit logEdit) throws IOException {
12776
12777 @Deprecated public void postWALRestore(final RegionInfo info, final WALKey logKey, final WALEdit logEdit) throws IOException {
12778
12779 @Deprecated public DeleteTracker postInstantiateDeleteTracker(DeleteTracker result) throws IOException
12780
12781 Metrics are updated now even if the Coprocessor does a bypass; e.g. The put count is updated even if a Coprocessor bypasses the core put operation (We do it this way so no need for Coprocessors to have access to our core metrics system).
12782
12783
12784 ---
12785
12786 * [HBASE-19033](https://issues.apache.org/jira/browse/HBASE-19033) | *Blocker* | **Allow CP users to change versions and TTL before opening StoreScanner**
12787
12788 Add back the three methods without a return value:
12789 preFlushScannerOpen
12790 preCompactScannerOpen
12791 preStoreScannerOpen
12792
12793 Introduce a ScanOptions interface to let CP users change the max versions and TTL of a ScanInfo. It will be passed as a parameter in the three methods above.
12794
Introduces a new example, WriteHeavyIncrementObserver, which converts increments to puts and aggregates them on get. It uses the three methods above.
12796
12797
12798 ---
12799
12800 * [HBASE-19110](https://issues.apache.org/jira/browse/HBASE-19110) | *Minor* | **Add default for Server#isStopping & #getFileSystem**
12801
12802 Made defaults for Server#isStopping and Server#getFileSystem. Should have done this when I added them (lesson learned, was actually mentioned in a review).
12803
12804
12805 ---
12806
12807 * [HBASE-19047](https://issues.apache.org/jira/browse/HBASE-19047) | *Critical* | **CP exposed Scanner types should not extend Shipper**
12808
12809 RegionObserver#preScannerOpen signature changed
12810 RegionScanner preScannerOpen( ObserverContext\<RegionCoprocessorEnvironment\> c, Scan scan,  RegionScanner s)   -\>   void preScannerOpen( ObserverContext\<RegionCoprocessorEnvironment\> c, Scan scan)
12811 The pre hook can no longer return a RegionScanner instance.
12812
12813
12814 ---
12815
12816 * [HBASE-18995](https://issues.apache.org/jira/browse/HBASE-18995) | *Critical* | **Move methods that are for internal usage from CellUtil to Private util class**
12817
12818 Split CellUtil into public CellUtil and PrivateCellUtil for Internal use only.
12819
12820
12821 ---
12822
12823 * [HBASE-18906](https://issues.apache.org/jira/browse/HBASE-18906) | *Critical* | **Provide Region#waitForFlushes API**
12824
Provided an API in Region (exposed to CPs):
boolean waitForFlushes(long timeout)
This call makes the current thread wait until all flushes in this region have finished, or until the specified timeout elapses. The boolean return value indicates which of the two happened: true if the flushes are over, false if the timeout elapsed while flushes were still in progress.
12828
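The return-value contract above can be sketched with a plain monitor. This is a minimal model, not the Region implementation: `FlushWaiterSketch`, `beginFlush`, and `endFlush` are hypothetical names, the timeout is assumed to be in milliseconds, and an interrupt is treated here as "flushes not confirmed finished".

```java
import java.util.concurrent.TimeUnit;

// Hypothetical model of "wait for all flushes, bounded by a timeout".
class FlushWaiterSketch {
  private int flushesInProgress = 0;

  synchronized void beginFlush() {
    flushesInProgress++;
  }

  synchronized void endFlush() {
    flushesInProgress--;
    notifyAll(); // wake up any thread blocked in waitForFlushes
  }

  // Returns true if no flush is in progress before the timeout elapses,
  // false if the timeout elapsed (or the thread was interrupted) first.
  synchronized boolean waitForFlushes(long timeoutMillis) {
    long deadline = System.nanoTime() + TimeUnit.MILLISECONDS.toNanos(timeoutMillis);
    try {
      while (flushesInProgress > 0) {
        long remainingNanos = deadline - System.nanoTime();
        if (remainingNanos <= 0) {
          return false;
        }
        TimeUnit.NANOSECONDS.timedWait(this, remainingNanos);
      }
      return true;
    } catch (InterruptedException e) {
      // Restore the interrupt flag so callers can still observe it.
      Thread.currentThread().interrupt();
      return false;
    }
  }
}
```

The loop re-checks the condition after every wake-up, so a spurious wake-up or a flush that starts while waiting does not produce a false "true".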
12829
12830 ---
12831
12832 * [HBASE-18905](https://issues.apache.org/jira/browse/HBASE-18905) | *Major* | **Allow CPs to request flush on Region and know the completion of the requested flush**
12833
Add a FlushLifeCycleTracker, which is similar to CompactionLifeCycleTracker, for tracking flushes.
Add a requestFlush method to the Region interface to let CP users request a flush on a region. The operation is asynchronous; use the FlushLifeCycleTracker to track the flush.
The difference from CompactionLifeCycleTracker is that a flush is per region, so the methods do not take a Store parameter. Also, notExecuted means the whole flush has not been executed and afterExecution means the whole flush has finished, so there is no separate completed method. A flush ends with either notExecuted or afterExecution.
12837
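Because the request is asynchronous and ends with exactly one of the two callbacks, a caller that needs to block can hang a latch off the tracker. The sketch below assumes that shape: `FlushTrackerSketch`, `LatchFlushTracker`, and `FakeRegion` are hypothetical stand-ins written for this example, and the callback parameter lists are assumptions rather than the HBase signatures.

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.TimeUnit;

// Hypothetical stand-in mirroring the two terminal callbacks described above.
interface FlushTrackerSketch {
  void notExecuted(String reason);
  void afterExecution();
}

// Lets the requesting thread block until the flush ends, one way or the other.
class LatchFlushTracker implements FlushTrackerSketch {
  private final CountDownLatch done = new CountDownLatch(1);
  private volatile String skippedReason; // stays null when the flush really ran

  @Override
  public void notExecuted(String reason) {
    skippedReason = reason;
    done.countDown();
  }

  @Override
  public void afterExecution() {
    done.countDown();
  }

  // Returns true if a terminal callback arrived within the timeout.
  boolean await(long timeoutMillis) {
    try {
      return done.await(timeoutMillis, TimeUnit.MILLISECONDS);
    } catch (InterruptedException e) {
      Thread.currentThread().interrupt();
      return false;
    }
  }

  boolean wasExecuted() {
    return done.getCount() == 0 && skippedReason == null;
  }

  String getSkippedReason() {
    return skippedReason;
  }
}

// Hypothetical region whose flush request returns immediately and reports later.
class FakeRegion {
  void requestFlush(FlushTrackerSketch tracker, boolean memstoreEmpty) {
    new Thread(() -> {
      if (memstoreEmpty) {
        tracker.notExecuted("nothing to flush");
      } else {
        tracker.afterExecution();
      }
    }).start();
  }
}
```

A caller would create the tracker, pass it with the request, and then call `await` with a timeout of its choosing.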
12838
12839 ---
12840
12841 * [HBASE-19048](https://issues.apache.org/jira/browse/HBASE-19048) | *Major* | **Cleanup MasterObserver hooks which takes IA private params**
12842
12843 Purged InterfaceAudience.Private parameters from methods in MasterObserver.
12844
12845 preAbortProcedure no longer takes a ProcedureExecutor.
12846
12847 postGetProcedures no longer takes a list of Procedures.
12848
12849 postGetLocks no longer takes a list of locks.
12850
12851 preRequestLock and postRequestLock no longer take lock type.
12852
preLockHeartbeat and postLockHeartbeat no longer take a lock procedure.

The implication is that Coprocessors that depended on these params have had to coarsen their checks. For example, the AccessController can no longer check access per Procedure or Lock; instead it makes a judgement on general access (you'll need to be ADMIN to see the list of procedures and locks).
12856
12857
12858 ---
12859
12860 * [HBASE-18994](https://issues.apache.org/jira/browse/HBASE-18994) | *Major* | **Decide if META/System tables should use Compacting Memstore or Default Memstore**
12861
Added a new config, 'hbase.systemtables.compacting.memstore.type', for the system tables. By default all system tables have 'NONE' as the type, so they use the default memstore.
{code}
  \<property\>
    \<name\>hbase.systemtables.compacting.memstore.type\</name\>
    \<value\>NONE\</value\>
  \</property\>
{code}
12869
12870
12871 ---
12872
12873 * [HBASE-19029](https://issues.apache.org/jira/browse/HBASE-19029) | *Critical* | **Align RPC timout methods in Table and AsyncTableBase**
12874
12875 Deprecate the following methods in Table:
12876 - int getRpcTimeout()
12877 - int getReadRpcTimeout()
12878 - int getWriteRpcTimeout()
12879 - int getOperationTimeout()
12880
12881 Add the following methods to Table:
12882 - long getRpcTimeout(TimeUnit)
12883 - long getReadRpcTimeout(TimeUnit)
12884 - long getWriteRpcTimeout(TimeUnit)
12885 - long getOperationTimeout(TimeUnit)
12886
12887 Add missing deprecation tag for long getRpcTimeout(TimeUnit unit) in AsyncTableBase
12888
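To show how an int getter and its TimeUnit-based replacement can sit side by side, here is a small sketch. `TimeoutHolderSketch` is a hypothetical class, and storing the timeout internally in milliseconds is an assumption made for the example.

```java
import java.util.concurrent.TimeUnit;

// Hypothetical holder; assumes the timeout is stored in milliseconds.
class TimeoutHolderSketch {
  private final int rpcTimeoutMs;

  TimeoutHolderSketch(int rpcTimeoutMs) {
    this.rpcTimeoutMs = rpcTimeoutMs;
  }

  // Old style: the unit is implicit, so callers have to know it is milliseconds.
  @Deprecated
  int getRpcTimeout() {
    return rpcTimeoutMs;
  }

  // New style: the caller names the unit it wants the value in.
  long getRpcTimeout(TimeUnit unit) {
    return unit.convert(rpcTimeoutMs, TimeUnit.MILLISECONDS);
  }
}
```

Note that `TimeUnit.convert` truncates when converting to a coarser unit, so a 1500 ms timeout read in seconds comes back as 1.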
12889
12890 ---
12891
12892 * [HBASE-18410](https://issues.apache.org/jira/browse/HBASE-18410) | *Major* | **FilterList  Improvement.**
12893
In this task, we fixed the existing bugs in FilterList and refactored the code while preserving interface compatibility.

The primary bug fixes are:
1. For a sub-filter in a FilterList with MUST\_PASS\_ONE, if the sub-filter's previous filterKeyValue() returned NEXT\_COL, we cannot be sure that the next cell will be the first cell in the next column, because FilterList chooses the minimal forward step among sub-filters and so may return SKIP. We therefore added an extra check to ensure that the next cell matches the previous return code for each sub-filter.
2. The previous logic for transforming cells in FilterList was incorrect. We now set the previous transform result (rather than the given cell in question) as the initial value of the transform cell before calling filterKeyValue() of FilterList.
3. Handle the ReturnCodes that the previous code did not handle.

As for the refactor, we divided FilterList into two separate sub-classes: FilterListWithOR and FilterListWithAND. FilterListWithOR has been optimised to choose the minimal next step to seek to rather than SKIP cells one by one, and FilterListWithAND has been optimised to choose the maximal next key to seek to among the sub-filters in the filter list. All in all, the code in FilterList is now cleaner and easier to follow.

Note that ReturnCode NEXT\_ROW has been redefined as skipping to the next row in the current family, not to the next row across all families. This is more reasonable because ReturnCode is a store-level concept, not a region-level one.

Another change that needs attention: filterAllRemaining() in a FilterList with MUST\_PASS\_ONE now returns false if the filter list is empty, whereas earlier it returned true for Operator.MUST\_PASS\_ONE. The new behaviour is more reasonable.
12906
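The minimal-step and maximal-step rules can be modelled in a few lines. This is a simplified sketch, not the FilterList code: `StepSketch` is a hypothetical three-value subset of the return codes, ordered from the smallest forward step to the largest, and seek hints and include codes are left out.

```java
import java.util.Collections;
import java.util.List;

// Simplified, hypothetical subset of return codes, declared from the
// smallest forward step to the largest.
enum StepSketch { SKIP, NEXT_COL, NEXT_ROW }

class FilterListSketch {
  // MUST_PASS_ONE (OR): some sub-filter may still want the cells in between,
  // so only the minimal forward step is safe. Expects a non-empty list.
  static StepSketch mergeOr(List<StepSketch> subFilterSteps) {
    return Collections.min(subFilterSteps);
  }

  // MUST_PASS_ALL (AND): every sub-filter must accept a cell, so the scan can
  // jump as far as the most aggressive sub-filter asks. Expects a non-empty list.
  static StepSketch mergeAnd(List<StepSketch> subFilterSteps) {
    return Collections.max(subFilterSteps);
  }

  // OR-list filterAllRemaining: an empty list filters nothing out; otherwise
  // the scan may stop only when every sub-filter says it is done.
  static boolean filterAllRemainingOr(List<Boolean> subFilterDone) {
    if (subFilterDone.isEmpty()) {
      return false;
    }
    for (boolean done : subFilterDone) {
      if (!done) {
        return false;
      }
    }
    return true;
  }
}
```

With this ordering, an OR list whose sub-filters return NEXT\_COL and SKIP merges to SKIP, which is exactly why a sub-filter that asked for NEXT\_COL may see another cell from the same column and needs the extra check described in fix 1.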
12907
12908 ---
12909
12910 * [HBASE-19077](https://issues.apache.org/jira/browse/HBASE-19077) | *Critical* | **Have Region\*CoprocessorEnvironment provide an ImmutableOnlineRegions**
12911
Adds getOnlineRegions to the RegionCoprocessorEnvironment (Context) and ditto to RegionServerCoprocessorEnvironment. This allows a Coprocessor to get the list of Regions online on the RegionServer currently hosting it.
12913
12914
12915 ---
12916
12917 * [HBASE-19021](https://issues.apache.org/jira/browse/HBASE-19021) | *Critical* | **Restore a few important missing logics for balancer in 2.0**
12918
12919 Re-enabled 'hbase.master.loadbalance.bytable', default 'false'.
Draining servers are removed from consideration by the balancer.balanceCluster() call.
12921
12922
12923 ---
12924
12925 * [HBASE-19049](https://issues.apache.org/jira/browse/HBASE-19049) | *Major* | **Update kerby to 1.0.1 GA release**
12926
12927 HBase now relies on Kerby version 1.0.1 for its test environment. No downstream facing change is expected.
12928
12929
12930 ---
12931
12932 * [HBASE-16290](https://issues.apache.org/jira/browse/HBASE-16290) | *Major* | **Dump summary of callQueue content; can help debugging**
12933
Patch to print a summary of call queues by size and count. This is displayed on the debug dump page of the region server UI.
12935
12936
12937 ---
12938
12939 * [HBASE-18846](https://issues.apache.org/jira/browse/HBASE-18846) | *Major* | **Accommodate the hbase-indexer/lily/SEP consumer deploy-type**
12940
Makes it so hbase-indexer/lily can move off its dependence on internal APIs and onto public APIs.

Adds the ability to disable nearly all HRegionServer services. This, along with an existing plugin mechanism that allows configuring the RegionServer to host an alternate Connection implementation, makes it possible to put up a cluster of hollowed-out HRegionServers purposed to pose as a Replication Sink for a source HBase cluster (users do not need to figure out our RPC or our PB encodings, build a distributed service, etc.). In the alternate supplied Connection implementation, hbase-indexer would install its own code to catch the Replication.

Below and attached are a sample hbase-site.xml file and an alternate Connection implementation. To start up an HRegionServer as a sink, first make sure there is a ZooKeeper ensemble we can talk to. If there is none, just start one:
12946 {code}
12947 ./bin/hbase-daemon.sh start zookeeper
12948 {code}
12949
To start up a single RegionServer, put the below sample hbase-site.xml and a derivative of the below IndexerConnection on the CLASSPATH, and then start the RegionServer:
12951 {code}
12952 ./bin/hbase-daemon.sh  start  org.apache.hadoop.hbase.regionserver.HRegionServer
12953 {code}
Stdout and stderr will go into files under the configured logs directory. Browse to localhost:16030 to find the web UI (unless disabled).
12955
12956 DETAILS
12957
This patch adds configuration to disable RegionServer internal Services, Managers, Caches, etc., from starting up.
12959
12960 By default a RegionServer starts up an Admin and Client Service. To disable either or both, use the below booleans:
12961 {code}
12962 hbase.regionserver.admin.service
12963 hbase.regionserver.client.service
12964 {code}
12965
12966 Both default true.
12967
To make an HRegionServer start up and stay up without expecting to communicate with a master, set the below boolean to true:

{code}
hbase.masterless
{code}
Default is false.
12974
12975 h3. Sample hbase-site.xml that disables internal HRegionServer Services
Below is an example hbase-site.xml that turns off most Services and then installs an alternate Connection implementation, one that is nulled out in all regards except in being able to return a "Table" that can catch a Replication Stream in its {code}batch(List\<? extends Row\> actions, Object[] results){code} method, i.e. what the hbase-indexer wants. I also add the example alternate Connection implementation below (both of these files are also attached to this issue). It expects there to be an up-and-running ZooKeeper ensemble.
12977
12978 {code}
12979 \<configuration\>
12980   \<!-- This file is an example for hbase-indexer. It shuts down
12981        facility in the regionserver and interjects a special
12982        Connection implementation which is how hbase-indexer will
12983        receive the replication stream from source hbase cluster.
12984        See the class referenced in the config.
12985
       Most of the config in here is booleans set to off and
       values set to zero so services don't start. Some of
       the flags are new via this patch.
12989 --\>
12990   \<!--Need this for the RegionServer to come up standalone--\>
12991   \<property\>
12992     \<name\>hbase.cluster.distributed\</name\>
12993     \<value\>true\</value\>
12994   \</property\>
12995
12996   \<!--This is what you implement, a Connection that returns a Table that
12997        overrides the batch call. It is at this point you do your indexer inserts.
12998     --\>
12999   \<property\>
13000     \<name\>hbase.client.connection.impl\</name\>
13001     \<value\>org.apache.hadoop.hbase.client.IndexerConnection\</value\>
    \<description\>A custom connection implementation just so we can interject our
      own Table class, one that has an override for the batch call which receives
      the replication stream edits; i.e. it is called by the replication sink
      #replicateEntries method.\</description\>
13006   \</property\>
13007
13008   \<!--Set hbase.regionserver.info.port to -1 for no webui--\>
13009
13010   \<!--Below are configs to shut down unused services in hregionserver--\>
13011   \<property\>
13012     \<name\>hbase.regionserver.admin.service\</name\>
13013     \<value\>false\</value\>
13014     \<description\>Do NOT stand up an Admin Service Interface on RPC\</description\>
13015   \</property\>
13016   \<property\>
13017     \<name\>hbase.regionserver.client.service\</name\>
13018     \<value\>false\</value\>
13019     \<description\>Do NOT stand up a client-facing Service on RPC\</description\>
13020   \</property\>
13021   \<property\>
13022     \<name\>hbase.wal.provider\</name\>
13023     \<value\>org.apache.hadoop.hbase.wal.DisabledWALProvider\</value\>
13024     \<description\>Set WAL service to be the null WAL\</description\>
13025   \</property\>
13026   \<property\>
13027     \<name\>hbase.regionserver.workers\</name\>
13028     \<value\>false\</value\>
13029     \<description\>Turn off all background workers, log splitters, executors, etc.\</description\>
13030   \</property\>
13031   \<property\>
13032     \<name\>hfile.block.cache.size\</name\>
13033     \<value\>0.0001\</value\>
13034     \<description\>Turn off block cache completely\</description\>
13035   \</property\>
13036   \<property\>
13037     \<name\>hbase.mob.file.cache.size\</name\>
13038     \<value\>0\</value\>
13039     \<description\>Disable MOB cache.\</description\>
13040   \</property\>
13041   \<property\>
13042     \<name\>hbase.masterless\</name\>
13043     \<value\>true\</value\>
13044     \<description\>Do not expect Master in cluster.\</description\>
13045   \</property\>
13046   \<property\>
13047     \<name\>hbase.regionserver.metahandler.count\</name\>
13048     \<value\>1\</value\>
13049     \<description\>How many priority handlers to run; we probably need none.
13050     Default is 20 which is too much on a server like this.\</description\>
13051   \</property\>
13052   \<property\>
13053     \<name\>hbase.regionserver.replication.handler.count\</name\>
13054     \<value\>1\</value\>
13055     \<description\>How many replication handlers to run; we probably need none.
13056     Default is 3 which is too much on a server like this.\</description\>
13057   \</property\>
13058   \<property\>
13059     \<name\>hbase.regionserver.handler.count\</name\>
13060     \<value\>10\</value\>
13061     \<description\>How many default handlers to run; tie to # of CPUs.
13062     Default is 30 which is too much on a server like this.\</description\>
13063   \</property\>
13064   \<property\>
13065     \<name\>hbase.ipc.server.read.threadpool.size\</name\>
13066     \<value\>3\</value\>
    \<description\>How many Listener request readers to run; tie to a portion of the # of CPUs (1/4?).
    Default is 10 which is too much on a server like this.\</description\>
13069   \</property\>
13070 \</configuration\>
13071 {code}
13072
h3. Sample Connection Implementation
It has a call-out for where an hbase-indexer would insert its capture code.
13075 {code}
13076 package org.apache.hadoop.hbase.client;
13077
13078 import com.google.protobuf.Descriptors;
13079 import com.google.protobuf.Message;
13080 import com.google.protobuf.Service;
13081 import com.google.protobuf.ServiceException;
13082 import org.apache.hadoop.conf.Configuration;
13083 import org.apache.hadoop.hbase.CompareOperator;
13084 import org.apache.hadoop.hbase.HTableDescriptor;
13085 import org.apache.hadoop.hbase.TableName;
13086 import org.apache.hadoop.hbase.client.coprocessor.Batch;
13087 import org.apache.hadoop.hbase.filter.CompareFilter;
13088 import org.apache.hadoop.hbase.ipc.CoprocessorRpcChannel;
13089 import org.apache.hadoop.hbase.security.User;
13090
13091 import java.io.IOException;
13092 import java.util.List;
13093 import java.util.Map;
13094 import java.util.concurrent.ExecutorService;
13095
13096
13097 /\*\*
13098  \* Sample class for hbase-indexer.
13099  \* DO NOT COMMIT TO HBASE CODEBASE!!!
13100  \* Overrides Connection just so we can return a Table that has the
13101  \* method that the replication sink calls, i.e. Table#batch.
13102  \* It is at this point that the hbase-indexer catches the replication
13103  \* stream so it can insert into the lucene index.
13104  \*/
13105 public class IndexerConnection implements Connection {
13106   private final Configuration conf;
13107   private final User user;
13108   private final ExecutorService pool;
13109   private volatile boolean closed = false;
13110
13111   public IndexerConnection(Configuration conf, ExecutorService pool, User user) throws IOException {
13112     this.conf = conf;
13113     this.user = user;
13114     this.pool = pool;
13115   }
13116
13117   @Override
13118   public void abort(String why, Throwable e) {}
13119
13120   @Override
13121   public boolean isAborted() {
13122     return false;
13123   }
13124
13125   @Override
13126   public Configuration getConfiguration() {
13127     return this.conf;
13128   }
13129
13130   @Override
13131   public BufferedMutator getBufferedMutator(TableName tableName) throws IOException {
13132     return null;
13133   }
13134
13135   @Override
13136   public BufferedMutator getBufferedMutator(BufferedMutatorParams params) throws IOException {
13137     return null;
13138   }
13139
13140   @Override
13141   public RegionLocator getRegionLocator(TableName tableName) throws IOException {
13142     return null;
13143   }
13144
13145   @Override
13146   public Admin getAdmin() throws IOException {
13147     return null;
13148   }
13149
13150   @Override
13151   public void close() throws IOException {
13152     if (!this.closed) this.closed = true;
13153   }
13154
13155   @Override
13156   public boolean isClosed() {
13157     return this.closed;
13158   }
13159
13160   @Override
13161   public TableBuilder getTableBuilder(final TableName tn, ExecutorService pool) {
13162     if (isClosed()) {
13163       throw new RuntimeException("IndexerConnection is closed.");
13164     }
13165     final Configuration passedInConfiguration = getConfiguration();
13166     return new TableBuilder() {
13167       @Override
13168       public TableBuilder setOperationTimeout(int timeout) {
13169         return null;
13170       }
13171
13172       @Override
13173       public TableBuilder setRpcTimeout(int timeout) {
13174         return null;
13175       }
13176
13177       @Override
13178       public TableBuilder setReadRpcTimeout(int timeout) {
13179         return null;
13180       }
13181
13182       @Override
13183       public TableBuilder setWriteRpcTimeout(int timeout) {
13184         return null;
13185       }
13186
13187       @Override
13188       public Table build() {
13189         return new Table() {
13190           private final Configuration conf = passedInConfiguration;
13191           private final TableName tableName = tn;
13192
13193           @Override
13194           public TableName getName() {
13195             return this.tableName;
13196           }
13197
13198           @Override
13199           public Configuration getConfiguration() {
13200             return this.conf;
13201           }
13202
13203           @Override
13204           public void batch(List\<? extends Row\> actions, Object[] results)
13205           throws IOException, InterruptedException {
13206             // Implementation goes here.
13207           }
13208
13209           @Override
13210           public HTableDescriptor getTableDescriptor() throws IOException {
13211             return null;
13212           }
13213
13214           @Override
13215           public TableDescriptor getDescriptor() throws IOException {
13216             return null;
13217           }
13218
13219           @Override
13220           public boolean exists(Get get) throws IOException {
13221             return false;
13222           }
13223
13224           @Override
13225           public boolean[] existsAll(List\<Get\> gets) throws IOException {
13226             return new boolean[0];
13227           }
13228
13229           @Override
13230           public \<R\> void batchCallback(List\<? extends Row\> actions, Object[] results, Batch.Callback\<R\> callback) throws IOException, InterruptedException {
13231
13232           }
13233
13234           @Override
13235           public Result get(Get get) throws IOException {
13236             return null;
13237           }
13238
13239           @Override
13240           public Result[] get(List\<Get\> gets) throws IOException {
13241             return new Result[0];
13242           }
13243
13244           @Override
13245           public ResultScanner getScanner(Scan scan) throws IOException {
13246             return null;
13247           }
13248
13249           @Override
13250           public ResultScanner getScanner(byte[] family) throws IOException {
13251             return null;
13252           }
13253
13254           @Override
13255           public ResultScanner getScanner(byte[] family, byte[] qualifier) throws IOException {
13256             return null;
13257           }
13258
13259           @Override
13260           public void put(Put put) throws IOException {
13261
13262           }
13263
13264           @Override
13265           public void put(List\<Put\> puts) throws IOException {
13266
13267           }
13268
13269           @Override
13270           public boolean checkAndPut(byte[] row, byte[] family, byte[] qualifier, byte[] value, Put put) throws IOException {
13271             return false;
13272           }
13273
13274           @Override
13275           public boolean checkAndPut(byte[] row, byte[] family, byte[] qualifier, CompareFilter.CompareOp compareOp, byte[] value, Put put) throws IOException {
13276             return false;
13277           }
13278
13279           @Override
13280           public boolean checkAndPut(byte[] row, byte[] family, byte[] qualifier, CompareOperator op, byte[] value, Put put) throws IOException {
13281             return false;
13282           }
13283
13284           @Override
13285           public void delete(Delete delete) throws IOException {
13286
13287           }
13288
13289           @Override
13290           public void delete(List\<Delete\> deletes) throws IOException {
13291
13292           }
13293
13294           @Override
13295           public boolean checkAndDelete(byte[] row, byte[] family, byte[] qualifier, byte[] value, Delete delete) throws IOException {
13296             return false;
13297           }
13298
13299           @Override
13300           public boolean checkAndDelete(byte[] row, byte[] family, byte[] qualifier, CompareFilter.CompareOp compareOp, byte[] value, Delete delete) throws IOException {
13301             return false;
13302           }
13303
13304           @Override
13305           public boolean checkAndDelete(byte[] row, byte[] family, byte[] qualifier, CompareOperator op, byte[] value, Delete delete) throws IOException {
13306             return false;
13307           }
13308
13309           @Override
13310           public void mutateRow(RowMutations rm) throws IOException {
13311
13312           }
13313
13314           @Override
13315           public Result append(Append append) throws IOException {
13316             return null;
13317           }
13318
13319           @Override
13320           public Result increment(Increment increment) throws IOException {
13321             return null;
13322           }
13323
13324           @Override
13325           public long incrementColumnValue(byte[] row, byte[] family, byte[] qualifier, long amount) throws IOException {
13326             return 0;
13327           }
13328
13329           @Override
13330           public long incrementColumnValue(byte[] row, byte[] family, byte[] qualifier, long amount, Durability durability) throws IOException {
13331             return 0;
13332           }
13333
13334           @Override
13335           public void close() throws IOException {
13336
13337           }
13338
13339           @Override
13340           public CoprocessorRpcChannel coprocessorService(byte[] row) {
13341             return null;
13342           }
13343
13344           @Override
13345           public \<T extends Service, R\> Map\<byte[], R\> coprocessorService(Class\<T\> service, byte[] startKey, byte[] endKey, Batch.Call\<T, R\> callable) throws ServiceException, Throwable {
13346             return null;
13347           }
13348
13349           @Override
13350           public \<T extends Service, R\> void coprocessorService(Class\<T\> service, byte[] startKey, byte[] endKey, Batch.Call\<T, R\> callable, Batch.Callback\<R\> callback) throws ServiceException, Throwable {
13351
13352           }
13353
13354           @Override
13355           public \<R extends Message\> Map\<byte[], R\> batchCoprocessorService(Descriptors.MethodDescriptor methodDescriptor, Message request, byte[] startKey, byte[] endKey, R responsePrototype) throws ServiceException, Throwable {
13356             return null;
13357           }
13358
13359           @Override
13360           public \<R extends Message\> void batchCoprocessorService(Descriptors.MethodDescriptor methodDescriptor, Message request, byte[] startKey, byte[] endKey, R responsePrototype, Batch.Callback\<R\> callback) throws ServiceException, Throwable {
13361
13362           }
13363
13364           @Override
13365           public boolean checkAndMutate(byte[] row, byte[] family, byte[] qualifier, CompareFilter.CompareOp compareOp, byte[] value, RowMutations mutation) throws IOException {
13366             return false;
13367           }
13368
13369           @Override
13370           public boolean checkAndMutate(byte[] row, byte[] family, byte[] qualifier, CompareOperator op, byte[] value, RowMutations mutation) throws IOException {
13371             return false;
13372           }
13373
13374           @Override
13375           public void setOperationTimeout(int operationTimeout) {
13376
13377           }
13378
13379           @Override
13380           public int getOperationTimeout() {
13381             return 0;
13382           }
13383
13384           @Override
13385           public int getRpcTimeout() {
13386             return 0;
13387           }
13388
13389           @Override
13390           public void setRpcTimeout(int rpcTimeout) {
13391
13392           }
13393
13394           @Override
13395           public int getReadRpcTimeout() {
13396             return 0;
13397           }
13398
13399           @Override
13400           public void setReadRpcTimeout(int readRpcTimeout) {
13401
13402           }
13403
13404           @Override
13405           public int getWriteRpcTimeout() {
13406             return 0;
13407           }
13408
13409           @Override
13410           public void setWriteRpcTimeout(int writeRpcTimeout) {
13411
13412           }
13413         };
13414       }
13415     };
13416   }
13417 }
13418 {code}
13419
13420
13421 ---
13422
13423 * [HBASE-18873](https://issues.apache.org/jira/browse/HBASE-18873) | *Critical* | **Hide protobufs in GlobalQuotaSettings**
13424
GlobalQuotaSettings was introduced to keep protocol-specific Java classes from leaking into API that users may leverage. This class has a number of methods which return plain Java objects instead of these protocol-specific classes, in an effort to provide better stability in the future.
13426
13427
13428 ---
13429
13430 * [HBASE-18893](https://issues.apache.org/jira/browse/HBASE-18893) | *Major* | **Remove Add/Modify/DeleteColumnFamilyProcedure in favor of using ModifyTableProcedure**
13431
13432 The RPC calls for Add/Modify/DeleteColumn have been removed and are now backed by ModifyTable functionality. The corresponding permissions in AccessController have been removed as well.
13433
The shell already bypassed these RPCs and used ModifyTable directly, and thus was not subject to these permission checks; this change brings the rest of the RPCs in line with that.
13435
13436 Coprocessor hooks for pre/post Add/Modify/DeleteColumn have likewise been removed. Coprocessors needing to take special actions on schema change should instead process ModifyTable events (which they should have been doing already, but it was easy for developers to miss this nuance).
13437
13438
13439 ---
13440
13441 * [HBASE-16338](https://issues.apache.org/jira/browse/HBASE-16338) | *Major* | **update jackson to 2.y**
13442
13443 HBase has upgraded from Jackson 1 to Jackson 2. JSON output should not have changed and this should not be user facing, but server classpaths should be adjusted accordingly.
13444
13445
13446 ---
13447
13448 * [HBASE-19051](https://issues.apache.org/jira/browse/HBASE-19051) | *Minor* | **Add new split algorithm for num string**
13449
Add a new split algorithm, DecimalStringSplit. Rows are decimal-encoded long values in the range "00000000" =\> "99999999".
create 't1','f', { NUMREGIONS =\> 10 , SPLITALGO =\> 'DecimalStringSplit' }
The split points will be 10000000, 20000000, ..., 90000000.
13453
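The split points in the example above are what you get from dividing the 8-digit key space evenly. The sketch below reproduces that arithmetic; `DecimalSplitSketch` is a hypothetical helper, and the actual DecimalStringSplit implementation may compute its boundaries differently for region counts that do not divide the range evenly.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper reproducing the evenly spaced split points above.
class DecimalSplitSketch {
  static List<String> splitPoints(int numRegions) {
    final long keySpace = 100_000_000L; // keys "00000000" through "99999999"
    List<String> points = new ArrayList<>();
    // numRegions regions need numRegions - 1 boundaries.
    for (int i = 1; i < numRegions; i++) {
      points.add(String.format("%08d", i * keySpace / numRegions));
    }
    return points;
  }
}
```

For 10 regions this yields nine boundaries, from 10000000 through 90000000, matching the shell example.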
13454
13455 ---
13456
13457 * [HBASE-19067](https://issues.apache.org/jira/browse/HBASE-19067) | *Major* | **Do not expose getHDFSBlockDistribution in StoreFile**
13458
13459 Removed CP exposed StoreFile#getHDFSBlockDistribution
13460
13461
13462 ---
13463
13464 * [HBASE-18989](https://issues.apache.org/jira/browse/HBASE-18989) | *Major* | **Polish the compaction related CP hooks**
13465
Add two new methods in CompactionLifeCycleTracker.
The notExecuted method will be called if selectCompaction fails or the space quota limit has been reached.
The completed method will be called after all the requested compactions have finished. Compaction scheduling is per Store, so requesting a compaction on a region may lead to multiple compactions.
Remove the User parameter from the Region.requestCompaction methods as it is useless for CP users.
Add a boolean parameter to indicate whether you want a major compaction; as a result, the triggerMajorCompaction method is removed.
Remove the getCompactionProgress method from the Store interface.
Add a UT to confirm that CompactionLifeCycleTracker works correctly; it also shows how to use CompactionLifeCycleTracker to wait for the completion of a compaction.
13473
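Since one request on a region can fan out into one compaction per Store, a waiting caller wants the final completed signal rather than any single per-store callback. The sketch below models that under stated assumptions: `CompactionTrackerSketch`, `WaitingCompactionTracker`, and `FakeCompactingRegion` are hypothetical stand-ins, stores are represented by plain names, and the callback parameter lists are not taken from the HBase interface.

```java
import java.util.List;
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.atomic.AtomicInteger;

// Hypothetical stand-in for a compaction tracker; stores are plain names here.
interface CompactionTrackerSketch {
  void notExecuted(String store, String reason);
  void afterExecution(String store);
  void completed();
}

// Counts per-store outcomes and lets a caller block until completed() fires.
class WaitingCompactionTracker implements CompactionTrackerSketch {
  private final CountDownLatch allDone = new CountDownLatch(1);
  private final AtomicInteger executedStores = new AtomicInteger();
  private final AtomicInteger skippedStores = new AtomicInteger();

  @Override
  public void notExecuted(String store, String reason) {
    skippedStores.incrementAndGet();
  }

  @Override
  public void afterExecution(String store) {
    executedStores.incrementAndGet();
  }

  @Override
  public void completed() {
    allDone.countDown();
  }

  // Returns true if completed() was called within the timeout.
  boolean awaitCompletion(long timeoutMillis) {
    try {
      return allDone.await(timeoutMillis, TimeUnit.MILLISECONDS);
    } catch (InterruptedException e) {
      Thread.currentThread().interrupt();
      return false;
    }
  }

  int getExecutedStores() {
    return executedStores.get();
  }

  int getSkippedStores() {
    return skippedStores.get();
  }
}

// Hypothetical region: one request fans out into one compaction per store.
class FakeCompactingRegion {
  private final List<String> stores;

  FakeCompactingRegion(List<String> stores) {
    this.stores = stores;
  }

  void requestCompaction(CompactionTrackerSketch tracker) {
    new Thread(() -> {
      for (String store : stores) {
        tracker.afterExecution(store);
      }
      tracker.completed(); // fires once, after every store has been handled
    }).start();
  }
}
```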
13474
13475 ---
13476
13477 * [HBASE-19046](https://issues.apache.org/jira/browse/HBASE-19046) | *Major* | **RegionObserver#postCompactSelection  Avoid passing shaded ImmutableList param**
13478
13479 RegionObserver#postCompactSelection signature is changed.
13480 Arg type org.apache.hadoop.hbase.shaded.com.google.common.collect.ImmutableList is replaced with java.util.List
13481
13482
13483 ---
13484
13485 * [HBASE-19043](https://issues.apache.org/jira/browse/HBASE-19043) | *Major* | **Purge TableWrapper and CoprocessorHConnnection**
13486
Removes getTable from the CoprocessorEnvironment Interface and from the BaseEnvironment implementation. Also removes TableWrapper and CoprocessorHConnection, two classes that BaseEnvironment used to keep tabs on Tables created by Coprocessors so that it could close them out on #shutdown.

Long after these classes and methods were added, in HBase 1.0.0, we moved to a mode where management of Tables shifted from HBase to the Client; the Client is to manage the lifecycle. Table also became a (relatively) lightweight construct, so folks are used to getting a Table instance, using it, and then immediately closing it when done.

Coprocessors should do the same in HBase 2.0.0.

CoprocessorHConnection short-circuited RPC. This feature has since been integrated into Server Connections; when they create a Connection, they get one that will short-circuit if the request is to localhost, so there is no need for CoprocessorHConnection any more.
13494
13495 Coprocessors get the Server Connection when they ask for a Connection from their \*CoprocessorEnvironment.
13496
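The "get a Table, use it, close it right away" pattern maps naturally onto try-with-resources. The following is a self-contained sketch of that lifecycle only: `TableSketch`, `ConnectionSketch`, and `AuditCoprocessorSketch` are hypothetical stand-ins and do not reflect the HBase Table or Connection interfaces.

```java
import java.util.concurrent.atomic.AtomicInteger;

// Hypothetical lightweight table handle that tracks whether it is still open.
class TableSketch implements AutoCloseable {
  private final AtomicInteger openTables;
  private boolean closed;

  TableSketch(AtomicInteger openTables) {
    this.openTables = openTables;
    openTables.incrementAndGet();
  }

  void put(String row, String value) {
    if (closed) {
      throw new IllegalStateException("table is closed");
    }
    // A real table would send the mutation here.
  }

  @Override
  public void close() {
    if (!closed) {
      closed = true;
      openTables.decrementAndGet();
    }
  }
}

// Hypothetical shared connection; handing out tables is cheap.
class ConnectionSketch {
  final AtomicInteger openTables = new AtomicInteger();

  TableSketch getTable(String name) {
    return new TableSketch(openTables);
  }
}

// Hypothetical coprocessor: it owns the table's lifecycle, not the connection's.
class AuditCoprocessorSketch {
  void recordEvent(ConnectionSketch sharedConnection, String row, String value) {
    try (TableSketch table = sharedConnection.getTable("audit")) {
      table.put(row, value);
    } // the table is closed here; the shared connection stays open
  }
}
```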
13497
13498 ---
13499
13500 * [HBASE-19014](https://issues.apache.org/jira/browse/HBASE-19014) | *Major* | **surefire fails; When writing xml report stdout/stderr ... No such file or directory**
13501
Running tests with a wildcard selector, i.e. `-Dtest=org.apache.hadoop.hbase.server.*`, no longer works.
13503
13504
13505 ---
13506
13507 * [HBASE-10367](https://issues.apache.org/jira/browse/HBASE-10367) | *Major* | **RegionServer graceful stop / decommissioning**
13508
Added three top-level Admin APIs to help with decommissioning and graceful stop of region servers.
13510
13511   /\*\*
13512    \* Mark region server(s) as decommissioned to prevent additional regions from getting
13513    \* assigned to them. Optionally unload the regions on the servers. If there are multiple servers
13514    \* to be decommissioned, decommissioning them at the same time can prevent wasteful region
13515    \* movements. Region unloading is asynchronous.
13516    \* @param servers The list of servers to decommission.
13517    \* @param offload True to offload the regions from the decommissioned servers
13518    \*/
13519   void decommissionRegionServers(List\<ServerName\> servers, boolean offload) throws IOException;
13520
13521   /\*\*
13522    \* List region servers marked as decommissioned, which can not be assigned regions.
13523    \* @return List of decommissioned region servers.
13524    \*/
13525   List\<ServerName\> listDecommissionedRegionServers() throws IOException;
13526
13527   /\*\*
13528    \* Remove decommission marker from a region server to allow regions assignments.
13529    \* Load regions onto the server if a list of regions is given. Region loading is
13530    \* asynchronous.
13531    \* @param server The server to recommission.
13532    \* @param encodedRegionNames Regions to load onto the server.
13533    \*/
13534   void recommissionRegionServer(ServerName server, List\<byte[]\> encodedRegionNames)  throws IOException;
13535
13536
13537 ---
13538
13539 * [HBASE-19042](https://issues.apache.org/jira/browse/HBASE-19042) | *Blocker* | **Oracle Java 8u144 downloader broken in precommit check**
13540
13541 Precommit switched from Oracle JDK 8 to OpenJDK-8.
13542
13543
13544 ---
13545
13546 * [HBASE-18945](https://issues.apache.org/jira/browse/HBASE-18945) | *Major* | **Make a IA.LimitedPrivate interface for CellComparator**
13547
CellComparator has been added as an interface with IA.LimitedPrivate. It has the following methods:

- int compare(Cell leftCell, Cell rightCell)
- int compareRows(Cell leftCell, Cell rightCell)
- int compareRows(Cell cell, byte[] bytes, int offset, int length)
- int compareWithoutRow(Cell leftCell, Cell rightCell)
- int compareFamilies(Cell leftCell, Cell rightCell)
- int compareQualifiers(Cell leftCell, Cell rightCell)
- int compareTimestamps(Cell leftCell, Cell rightCell)
- int compareTimestamps(long leftCellts, long rightCellts)

This is exposed to CPs, which can use the above methods to compare cells.
For internal usage we have CellComparatorImpl, which has static references to COMPARATOR and META\_CELL\_COMPARATOR.
When a region or store is initialized, it should use one of these comparators: the META table needs META\_CELL\_COMPARATOR, and the regions/stores of all other tables use COMPARATOR.
When writing the comparator name in the FixedFileTrailer of the HFile, we now ensure that the rename of CellComparator.COMPARATOR/CellComparator.META\_CELL\_COMPARATOR to CellComparatorImpl.COMPARATOR/CellComparatorImpl.META\_CELL\_COMPARATOR is handled.

CellUtil is a utility class that provides many APIs for comparing and matching two cells, or a cell and a corresponding byte[]. Some of these APIs are used only internally and will be cleaned up in a follow-on JIRA, HBASE-18995.
13564
13565
13566 ---
13567
13568 * [HBASE-19001](https://issues.apache.org/jira/browse/HBASE-19001) | *Major* | **Remove the hooks in RegionObserver which are designed to construct a StoreScanner which is marked as IA.Private**
13569
13570 These methods are removed:
13571 KeyValueScanner preStoreScannerOpen(ObserverContext\<RegionCoprocessorEnvironment\> c,
13572       Store store, Scan scan, NavigableSet\<byte[]\> targetCols, KeyValueScanner s, long readPt)
13573       throws IOException;
13574 InternalScanner preFlushScannerOpen(ObserverContext\<RegionCoprocessorEnvironment\> c,
13575       Store store, List\<KeyValueScanner\> scanners, InternalScanner s, long readPoint)
13576       throws IOException;
13577 InternalScanner preCompactScannerOpen(ObserverContext\<RegionCoprocessorEnvironment\> c,
13578       Store store, List\<? extends KeyValueScanner\> scanners, ScanType scanType, long earliestPutTs,
13579       InternalScanner s, CompactionLifeCycleTracker tracker, CompactionRequest request,
13580       long readPoint) throws IOException;
13581
13582 For flush and compaction, CP users are expected to wrap the InternalScanner in preFlush/preCompact. And for normal region operation, just use preGetOp/preScannerOpen to modify the Get/Scan object.
13583
13584 This method in Region interface is also removed as we do not need to use read point in CP hooks anymore:
13585 long getReadPoint(IsolationLevel isolationLevel);
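
As a rough sketch of what "wrap the InternalScanner" can look like, the following uses a decorator that filters cells as they stream through. It is not HBase code: SimpleScanner is a hypothetical stand-in for InternalScanner (reduced to one method, with String in place of Cell), and wrap() stands in for whatever a preFlush/preCompact hook would return.

```java
import java.io.IOException;
import java.util.ArrayList;
import java.util.Iterator;
import java.util.List;
import java.util.function.Predicate;

class WrapScannerDemo {
  // Hypothetical stand-in for the body of a preFlush/preCompact hook:
  // return a scanner that wraps the one passed in.
  static SimpleScanner wrap(SimpleScanner original) {
    return new FilteringScanner(original, cell -> !cell.startsWith("tmp:"));
  }

  // Test helper: a scanner that hands out one cell per call from a fixed list.
  static SimpleScanner fromList(List<String> cells) {
    Iterator<String> it = cells.iterator();
    return out -> {
      if (it.hasNext()) {
        out.add(it.next());
      }
      return it.hasNext();
    };
  }

  public static void main(String[] args) throws IOException {
    SimpleScanner scanner = wrap(fromList(List.of("a:1", "tmp:2", "b:3")));
    List<String> result = new ArrayList<>();
    boolean more;
    do {
      more = scanner.next(result);
    } while (more);
    System.out.println(result);
  }
}

// Stand-in for InternalScanner (assumption: simplified to a single method).
interface SimpleScanner {
  // Adds the next batch of cells to 'out'; returns true if more batches remain.
  boolean next(List<String> out) throws IOException;
}

// Decorator: delegates to the wrapped scanner and drops cells rejected by 'keep'.
class FilteringScanner implements SimpleScanner {
  private final SimpleScanner delegate;
  private final Predicate<String> keep;

  FilteringScanner(SimpleScanner delegate, Predicate<String> keep) {
    this.delegate = delegate;
    this.keep = keep;
  }

  @Override
  public boolean next(List<String> out) throws IOException {
    List<String> batch = new ArrayList<>();
    boolean more = delegate.next(batch);
    for (String cell : batch) {
      if (keep.test(cell)) {
        out.add(cell);
      }
    }
    return more;
  }
}
```

The wrapper never needs a read point or a StoreScanner; it only sees the cells the original scanner produces, which is why the removed hooks are no longer required for this kind of filtering.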
13586
13587
13588 ---
13589
13590 * [HBASE-18350](https://issues.apache.org/jira/browse/HBASE-18350) | *Blocker* | **RSGroups are broken under AMv2**
13591
13592 Moves RSGroup on to AMv2. Reenables disabled RSGroups tests.
13593
13594
13595 ---
13596
13597 * [HBASE-18960](https://issues.apache.org/jira/browse/HBASE-18960) | *Major* | **A few bug fixes and minor improvements around batchMutate()**
13598
13599 All operations for which further processing is skipped by preBatchMutate coprocessor hook are treated as SUCCESS instead of FAILED.
13600
13601
13602 ---
13603
13604 * [HBASE-14247](https://issues.apache.org/jira/browse/HBASE-14247) | *Critical* | **Separate the old WALs into different regionserver directories**
13605
Adds a new config, hbase.separate.oldlogdir.by.regionserver, which defaults to false. When it is true, the old WAL directory is separated by RegionServer, which changes the oldWALs layout. The oldWALs directory is used by replication, so a cluster that does not use replication can switch this config from false to true in a single rolling upgrade. On a cluster that uses replication, the old WALs would not be found once the layout changes, so the cluster needs two rolling restarts: first roll the cluster onto the new version of the code only, then roll the config change from false to true. Because every server is already running the new code by the second pass, it can find the old WALs in the new directory layout.
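
For reference, the setting would look roughly like this in hbase-site.xml. This is only a sketch of the fragment; on a cluster with replication it belongs to the second rolling restart described above.

```xml
<!-- hbase-site.xml fragment: enable only once every server runs the new code -->
<property>
  <name>hbase.separate.oldlogdir.by.regionserver</name>
  <value>true</value>
</property>
```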
13607
13608
13609 ---
13610
13611 * [HBASE-18954](https://issues.apache.org/jira/browse/HBASE-18954) | *Major* | **Make \*CoprocessorHost classes private**
13612
13613 - Make CoprocessorHost and its implementations InterfaceAudience.Private
13614 - Configurations from "CoprocessorHost" have been moved to new "CoprocessorConfigurations" class.
13615
13616
13617 ---
13618
13619 * [HBASE-15410](https://issues.apache.org/jira/browse/HBASE-15410) | *Major* | **Utilize the max seek value when all Filters in MUST\_PASS\_ALL FilterList return SEEK\_NEXT\_USING\_HINT**
13620
13621 This optimization, targeting SEEK\_NEXT\_USING\_HINT return values, utilizes the max seek value and is transparent to Filters.
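
The idea can be sketched as follows: in a MUST\_PASS\_ALL list every filter must accept a cell, so when each filter returns a seek hint, nothing before the largest hint can pass and the scanner may seek straight to it. This is a minimal illustration, not the FilterList implementation; the class and method names are hypothetical, and hints are modeled as raw row keys compared as unsigned bytes rather than as Cells.

```java
import java.nio.charset.StandardCharsets;
import java.util.Arrays;
import java.util.List;

// Hypothetical helper, not part of HBase.
class MaxSeekHint {
  // Returns the largest hint under unsigned lexicographic byte order.
  static byte[] maxHint(List<byte[]> hints) {
    if (hints.isEmpty()) {
      throw new IllegalArgumentException("at least one hint is required");
    }
    byte[] max = hints.get(0);
    for (byte[] hint : hints) {
      if (Arrays.compareUnsigned(hint, max) > 0) {
        max = hint;
      }
    }
    return max;
  }

  public static void main(String[] args) {
    List<byte[]> hints = List.of(
        "row-100".getBytes(StandardCharsets.UTF_8),
        "row-250".getBytes(StandardCharsets.UTF_8),
        "row-175".getBytes(StandardCharsets.UTF_8));
    // Seek target is the maximum of the three hints.
    System.out.println(new String(maxHint(hints), StandardCharsets.UTF_8));
  }
}
```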
13622
13623
13624 ---
13625
13626 * [HBASE-18747](https://issues.apache.org/jira/browse/HBASE-18747) | *Critical* | **Introduce new example and helper classes to tell CP users how to do filtering on scanners**
13627
13628 Modify ZooKeeperScanPolicyObserver in hbase-examples to show how to do filtering in the CP hooks of flush and compaction in hbase-2.0.
13629
13630
13631 ---
13632
13633 * [HBASE-18108](https://issues.apache.org/jira/browse/HBASE-18108) | *Blocker* | **Procedure WALs are archived but not cleaned; fix**
13634
13635 The archived Procedure WALs are moved to \<hbase\_root\>/oldWALs/masterProcedureWALs
13636 directory. TimeToLiveProcedureWALCleaner class was added which regularly cleans the Procedure WAL files from there.
13637
13638 The TimeToLiveProcedureWALCleaner is added to hbase.master.logcleaner.plugins configuration value.
13639
13640 A new config parameter is added: hbase.master.procedurewalcleaner.ttl, which specifies how long a Procedure WAL should stay in the archive directory.
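
A sketch of how the TTL might be set in hbase-site.xml. The value shown is illustrative, and the unit is assumed here to be milliseconds; check the shipped hbase-default.xml before relying on either.

```xml
<property>
  <name>hbase.master.procedurewalcleaner.ttl</name>
  <!-- Illustrative value, assumed to be milliseconds: 7 days -->
  <value>604800000</value>
</property>
```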
13641
13642
13643 ---
13644
13645 * [HBASE-18183](https://issues.apache.org/jira/browse/HBASE-18183) | *Major* | **Region interface cleanup for CP expose**
13646
The following methods are removed from the CP-exposed Region interface:
13648 getOpenSeqNum
13649 getOldestSeqIdOfStore
13650 isLoadingCfsOnDemandDefault
13651 getReadpoint
13652 updateReadRequestsCount
13653 updateWriteRequestsCount
13654 getRegionServicesForStores
13655 getMetrics
13656 getHDFSBlocksDistribution
13657 releaseRowLocks
13658 batchReplay
13659 get(Get get, boolean withCoprocessor, long nonceGroup, long nonce)
13660 bulkLoadHFiles
13661 execService
13662 registerService
13663 checkFamilies
13664 checkTimestamps
13665 prepareDelete
13666 prepareDeleteTimestamps
13667 updateCellTimestamps
13668 flush
13669 compact
13670 waitForFlushesAndCompactions
13671 waitForFlushes
13672
The signatures of the following methods change by dropping the params 'nonceGroup' and 'nonce':
13674 append(Append append, long nonceGroup, long nonce)
13675 batchMutate(Mutation[] mutations, long nonceGroup, long nonce)
13676 increment(Increment increment, long nonceGroup, long nonce)
13677
13678
13679 ---
13680
13681 * [HBASE-18949](https://issues.apache.org/jira/browse/HBASE-18949) | *Major* | **Remove the CompactionRequest parameter in preCompactSelection**
13682
13683 Remove the CompactionRequest parameter in preCompactSelection as we do not have a CompactionRequest at that time.
13684
13685
13686 ---
13687
13688 * [HBASE-18909](https://issues.apache.org/jira/browse/HBASE-18909) | *Major* | **Deprecate Admin's methods which used String regex**
13689
13690 Pushed to master and branch-2. Thanks all for reviewing.
13691
13692
13693 ---
13694
13695 * [HBASE-18931](https://issues.apache.org/jira/browse/HBASE-18931) | *Major* | **Make ObserverContext an interface and remove private/testing methods**
13696
13697 Changes ObserverContext from a class to an interface and hides away constructor, testing functions and other internal-only functions in the implementation class.
13698
13699
13700 ---
13701
13702 * [HBASE-18878](https://issues.apache.org/jira/browse/HBASE-18878) | *Major* | **Use Optional\<T\> return types when T can be null**
13703
13704 **WARNING: No release note provided for this change.**
13705
13706
13707 ---
13708
13709 * [HBASE-18649](https://issues.apache.org/jira/browse/HBASE-18649) | *Major* | **Deprecate KV Usage in MR to move to Cells in 3.0**
13710
The output type of all mappers and reducers is now MapReduceCell rather than KeyValue. However, in branch-2 the older interfaces/classes that work with KeyValue remain in the code base for compatibility, marked as deprecated.
The following interfaces/classes have been deprecated in branch-2:
13713 Import#KeyValueWritableComparablePartitioner
13714 Import#KeyValueWritableComparator
13715 Import#KeyValueWritableComparable
13716 Import#KeyValueReducer
13717 Import#KeyValueSortImporter
13718 Import#KeyValueImporter
13719 KeyValueSortReducer
13720 KeyValueSerialization
13721 WALPlayer#WALKeyValueMapper
13722
So any existing MR jobs that use the above public interfaces/classes will continue to work in branch-2, and the expected output value type of those mappers and reducers can continue to be KeyValue.
13724
In branch-3 the mappers and reducers will only output MapReduceCell and will no longer work with KeyValue.
The new public classes/interfaces added in branch-3 and branch-2 are:
13727 CellSerialization
13728 CellSortReducer
13729 Import#CellWritableComparablePartitioner
13730 Import#CellWritableComparable
13731 Import#CellWritableComparator
13732 Import#CellReducer
13733 Import#CellSortImporter
13734 Import#CellImporter
13735 WALPlayer#WALCellMapper
13736
13737
13738 ---
13739
13740 * [HBASE-18897](https://issues.apache.org/jira/browse/HBASE-18897) | *Major* | **Substitute MemStore for Memstore**
13741
13742 The changes of IA.Public/IA.LimitedPrivate classes are shown below:
13743 HTableDescriptor class
13744 \* boolean hasRegionMemstoreReplication()
13745 + boolean hasRegionMemStoreReplication()
13746 \* HTableDescriptor setRegionMemstoreReplication(boolean)
13747 + HTableDescriptor setRegionMemStoreReplication(boolean)
13748
13749 RegionLoadStats class
13750 \* int getMemstoreLoad()
13751 + int getMemStoreLoad()
13752
13753 ServerLoad class
13754 \* int getMemstoreSizeInMB()
13755 + int getMemStoreSizeMB()
13756
13757 Region class
13758 - long getMemstoreSize()
13759 + long getMemStoreSize()
13760
13761 Store class
13762 - MemstoreSize getMemStoreSize()
13763 + MemStoreSize getMemStoreSize()
13764 - MemstoreSize getFlushableSize()
13765 + MemStoreSize getFlushableSize()
13766 - MemstoreSize getSnapshotSize()
13767 + MemStoreSize getSnapshotSize()
13768
13769 StoreFile class
13770 - long getMaxMemstoreTS()
13771 + long getMaxMemStoreTS()
13772
13773
13774 ---
13775
13776 * [HBASE-18010](https://issues.apache.org/jira/browse/HBASE-18010) | *Major* | **Connect CellChunkMap to be used for flattening in CompactingMemStore**
13777
The CellChunkMap is a very dense index for the MemStore ImmutableSegment and the only one that can be taken off-heap; it works on-heap as well. The code for the entire flow of working with CellChunkMap is not yet finished, so CellChunkMap is disabled for now. The work continues under HBASE-18232.
13779
13780
13781 ---
13782
13783 * [HBASE-18883](https://issues.apache.org/jira/browse/HBASE-18883) | *Major* | **Upgrade to Curator 4.0**
13784
13785 Curator version has been updated from 2.x to 4.0 (running in ZK 3.4 compatibility mode).
13786
13787 Users who experience classpath issues due to version conflicts are recommended to use either the hbase-shaded-client or hbase-shaded-mapreduce artifacts.
13788
13789
13790 ---
13791
13792 * [HBASE-13844](https://issues.apache.org/jira/browse/HBASE-13844) | *Minor* | **Move static helper methods from KeyValue into CellUtils**
13793
13794 Move KeyValue.parseColumn() to CellUtil
13795
13796
13797 ---
13798
13799 * [HBASE-18839](https://issues.apache.org/jira/browse/HBASE-18839) | *Major* | **Apply RegionInfo to code base**
13800
13801 The incompatible changes of IA.Public/LimitedPrivate classes are shown below.
13802 + new method
13803 - removed method
13804 \* deprecated method
13805 -------------------------------------
13806 HRegionLocation class
13807 + RegionInfo getRegion()
13808 \* HRegionInfo getRegionInfo()
13809
13810 AsyncAdmin class
13811 + CompletableFuture\<List\<RegionInfo\>\> getOnlineRegions(ServerName serverName);
13812 - CompletableFuture\<List\<HRegionInfo\>\> getOnlineRegions(ServerName serverName);
13813 + CompletableFuture\<List\<RegionInfo\>\> getTableRegions(TableName tableName);
13814 - CompletableFuture\<List\<HRegionInfo\>\> getTableRegions(TableName tableName);
13815
13816 HBaseTestingUtility class
13817 - Table createTable(HTableDescriptor htd, byte[][] families, Configuration c)
13818 - Table createTable(HTableDescriptor htd, byte[][] families, byte[][] splitKeys, Configuration c)
13819 - Table createTable(HTableDescriptor htd, byte[][] splitRows)
13820 - void modifyTableSync(Admin admin, HTableDescriptor desc)
13821 - HRegion createLocalHRegion(HTableDescriptor desc, byte [] startKey, byte [] endKey)
13822 - HRegion createLocalHRegion(HRegionInfo info, HTableDescriptor desc)
13823 - HRegion createLocalHRegion(HRegionInfo info, TableDescriptor desc)
13824 + HRegion createLocalHRegion(RegionInfo info, TableDescriptor desc)
13825 - HRegion createLocalHRegion(HRegionInfo info, HTableDescriptor desc, WAL wal)
13826 - HRegion createLocalHRegion(HRegionInfo info, TableDescriptor desc, WAL wal)
13827 + HRegion createLocalHRegion(RegionInfo info, TableDescriptor desc, WAL wal)
13828 - List\<HRegionInfo\> createMultiRegionsInMeta(final Configuration conf,final TableDescriptor htd, byte [][] startKeys)
+ List\<RegionInfo\> createMultiRegionsInMeta(final Configuration conf,final TableDescriptor htd, byte [][] startKeys)
13830 - WAL createWal(final Configuration conf, final Path rootDir, final HRegionInfo hri)
13831 + WAL createWal(final Configuration conf, final Path rootDir, final RegionInfo hri)
13832 - HRegion createRegionAndWAL(final HRegionInfo info, final Path rootDir,final Configuration conf, final HTableDescriptor htd)
13833 - HRegion createRegionAndWAL(final HRegionInfo info, final Path rootDir, final Configuration conf, final TableDescriptor htd)
13834 + HRegion createRegionAndWAL(final RegionInfo info, final Path rootDir, final Configuration conf, final TableDescriptor htd)
13835 - HRegion createRegionAndWAL(final HRegionInfo info, final Path rootDir, final Configuration conf, final HTableDescriptor htd, boolean initialize)
13836 + HRegion createRegionAndWAL(final RegionInfo info, final Path rootDir, final Configuration conf, final HTableDescriptor htd, boolean initialize)
13837 - boolean assignRegion(final HRegionInfo regionInfo)
13838 + boolean assignRegion(final RegionInfo regionInfo)
13839 - void moveRegionAndWait(HRegionInfo destRegion, ServerName destServer)
13840 + void moveRegionAndWait(RegionInfo destRegion, ServerName destServer)
13841 - int createPreSplitLoadTestTable(Configuration conf, HTableDescriptor desc, HColumnDescriptor hcd)
13842 - int createPreSplitLoadTestTable(Configuration conf, HTableDescriptor desc, HColumnDescriptor hcd, int numRegionsPerServer)
13843 - int createPreSplitLoadTestTable(Configuration conf, HTableDescriptor desc, HColumnDescriptor[] hcds, int numRegionsPerServer)
13844 - HRegion createTestRegion(String tableName, HColumnDescriptor cd)
13845
13846 WALEdit class
13847 - WALEdit createFlushWALEdit(HRegionInfo hri, FlushDescriptor f)
13848 + WALEdit createFlushWALEdit(RegionInfo hri, FlushDescriptor f)
13849 - WALEdit createRegionEventWALEdit(HRegionInfo hri,RegionEventDescriptor regionEventDesc)
13850 + WALEdit createRegionEventWALEdit(RegionInfo hri,RegionEventDescriptor regionEventDesc)
13851 - WALEdit createCompaction(final HRegionInfo hri, final CompactionDescriptor c)
13852 + WALEdit createCompaction(final RegionInfo hri, final CompactionDescriptor c)
13853 - byte[] getRowForRegion(HRegionInfo hri)
13854 + byte[] getRowForRegion(RegionInfo hri)
13855 - WALEdit createBulkLoadEvent(HRegionInfo hri, WALProtos.BulkLoadDescriptor bulkLoadDescriptor)
+ WALEdit createBulkLoadEvent(RegionInfo hri, WALProtos.BulkLoadDescriptor bulkLoadDescriptor)
13857
13858 RegionScanner class
13859 - HRegionInfo getRegionInfo();
13860 + RegionInfo getRegionInfo();
13861
13862 RegionPlan class
13863 - RegionPlan(final HRegionInfo hri, ServerName source, ServerName dest)
13864 + RegionPlan(final RegionInfo hri, ServerName source, ServerName dest)
13865
13866 Region class
13867 - HRegionInfo getRegionInfo();
13868 + RegionInfo getRegionInfo();
13869
13870 TableSnapshotInputFormat.TableSnapshotRegionSplit class
13871 \* HRegionInfo getRegionInfo()
13872 + RegionInfo getRegion()
13873
13874 RawAsyncTable.CoprocessorCallback class
13875 - void onRegionComplete(HRegionInfo region, R resp)
13876 + void onRegionComplete(RegionInfo region, R resp)
- void onRegionError(HRegionInfo region, Throwable error);
+ void onRegionError(RegionInfo region, Throwable error);
13879
13880
13881 ---
13882
13883 * [HBASE-18826](https://issues.apache.org/jira/browse/HBASE-18826) | *Major* | **Use HStore instead of Store in our own code base and remove unnecessary methods in Store interface**
13884
13885 **WARNING: No release note provided for this change.**
13886
13887
13888 ---
13889
13890 * [HBASE-17732](https://issues.apache.org/jira/browse/HBASE-17732) | *Critical* | **Coprocessor Design Improvements**
13891
We are moving from Inheritance
13893 - Observer \*is\* Coprocessor
13894 - FooService \*is\* CoprocessorService
13895 To Composition
13896 - Coprocessor \*has\* Observer
13897 - Coprocessor \*has\* Service
13898 ------------------------------------------------------
13899 Summary
13900 ------------------------------------------------------
- Adds four new interfaces - MasterCoprocessor, RegionCoprocessor, RegionServerCoprocessor,
13902   WALCoprocessor
13903 - These new \*Coprocessor interfaces have a get\*Observer() function for each observer type
13904   supported by them.
13905 - Added Coprocessor#getService() to base interface. All extending \*Coprocessor interfaces will
13906   get it from the base interface.
- Added BulkLoadObserver hooks to RegionCoprocessorHost instead of SecureBulkLoadManager doing its
13908   own trickery.
- CoprocessorHost#find\*() functions: Too many testing hooks digging into CP internals.
  Deleted where possible, otherwise marked @VisibleForTesting.
13911 ------------------------------------------------------
13912 Backward Compatibility
13913 ------------------------------------------------------
13914 - Old coprocessors implementing \*Observer won't get loaded (no backward compatibility guarantees).
13915 - Third party coprocessors only implementing Coprocessor will not get loaded (just like Observers).
13916 - Old coprocessors implementing CoprocessorService (for master/region host)
13917   /SingletonCoprocessorService (for RegionServer host) will continue to work with 2.0.
13918 - Added test to ensure backward compatibility of CoprocessorService/SingletonCoprocessorService
13919 - Note that if a coprocessor implements both observer and service in same class, its service
  component will continue to work but its observer component won't work.
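
The shift from "is a" to "has a" can be sketched with heavily simplified stand-ins. The interfaces below only mimic the shape of the HBase ones (the real ones carry many more methods and different hook signatures), and loadable() is a hypothetical approximation of the host-side check.

```java
import java.util.Optional;

class CompositionDemo {
  // Hypothetical host-side check: only classes implementing a *Coprocessor
  // interface are loaded; a bare observer is ignored.
  static boolean loadable(Object candidate) {
    return candidate instanceof RegionCoprocessor;
  }

  public static void main(String[] args) {
    AuditCoprocessor cp = new AuditCoprocessor();
    // The host asks the coprocessor for its observer rather than casting it.
    cp.getRegionObserver().ifPresent(o -> o.prePut("row-1"));
    System.out.println("puts seen: " + cp.seen());
    System.out.println("legacy loadable: " + loadable(new LegacyObserverOnly()));
  }
}

// Simplified stand-ins for the HBase interfaces (assumption: shapes only).
interface Coprocessor {}

interface RegionObserver {
  default void prePut(String row) {}
}

interface RegionCoprocessor extends Coprocessor {
  // Empty by default: a coprocessor opts in by returning an observer.
  default Optional<RegionObserver> getRegionObserver() {
    return Optional.empty();
  }
}

// New style: the coprocessor *has* an observer (here, itself).
class AuditCoprocessor implements RegionCoprocessor, RegionObserver {
  private int puts = 0;

  @Override
  public Optional<RegionObserver> getRegionObserver() {
    return Optional.of(this);
  }

  @Override
  public void prePut(String row) {
    puts++;
  }

  int seen() {
    return puts;
  }
}

// Old style: implements only the observer, so the host has nothing to load.
class LegacyObserverOnly implements RegionObserver {}
```

An existing observer can usually be migrated the way AuditCoprocessor does it: implement the matching \*Coprocessor interface alongside the observer and return `this` from the getter.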
13921
13922
13923 ---
13924
13925 * [HBASE-18298](https://issues.apache.org/jira/browse/HBASE-18298) | *Critical* | **RegionServerServices Interface cleanup for CP expose**
13926
We used to pass the RegionServerServices (RSS), which gave Coprocessors (CP) all sorts of access to internal Server machinery. We now only allow CPs a subset of the RSS, in the form of the CPRSS Interface. Particulars:

Removed the method getRegionServerServices from the CP-exposed RegionCoprocessorEnvironment and RegionServerCoprocessorEnvironment and replaced it with getCoprocessorRegionServerServices. This returns a new interface, CoprocessorRegionServerServices, which is only a subset of RegionServerServices. As a result, the following methods are no longer exposed to CPs:
13930 WAL getWAL(HRegionInfo regionInfo)
13931 List\<WAL\> getWALs()
13932 FlushRequester getFlushRequester()
13933 RegionServerAccounting getRegionServerAccounting()
13934 RegionServerRpcQuotaManager getRegionServerRpcQuotaManager()
13935 SecureBulkLoadManager getSecureBulkLoadManager()
13936 RegionServerSpaceQuotaManager getRegionServerSpaceQuotaManager()
13937 void postOpenDeployTasks(final PostOpenDeployContext context)
13938 void postOpenDeployTasks(final Region r)
13939 boolean reportRegionStateTransition(final RegionStateTransitionContext context)
13940 boolean reportRegionStateTransition(TransitionCode code, long openSeqNum, HRegionInfo... hris)
13941 boolean reportRegionStateTransition(TransitionCode code, HRegionInfo... hris)
13942 RpcServerInterface getRpcServer()
13943 ConcurrentMap\<byte[], Boolean\> getRegionsInTransitionInRS()
13944 Leases getLeases()
13945 ExecutorService getExecutorService()
13946 Map\<String, Region\> getRecoveringRegions()
13947 public ServerNonceManager getNonceManager()
13948 boolean registerService(Service service)
13949 HeapMemoryManager getHeapMemoryManager()
13950 double getCompactionPressure()
13951 ThroughputController getFlushThroughputController()
13952 double getFlushPressure()
13953 MetricsRegionServer getMetrics()
13954 EntityLock regionLock(List\<HRegionInfo\> regionInfos, String description, Abortable abort)
13955 void unassign(byte[] regionName)
13956 Configuration getConfiguration()
13957 ZooKeeperWatcher getZooKeeper()
13958 ClusterConnection getClusterConnection()
13959 MetaTableLocator getMetaTableLocator()
13960 CoordinatedStateManager getCoordinatedStateManager()
13961 ChoreService getChoreService()
13962 void stop(String why)
13963 void abort(String why, Throwable e)
13964 boolean isAborted()
13965 void updateRegionFavoredNodesMapping(String encodedRegionName, List\<ServerName\> favoredNodes)
13966 InetSocketAddress[] getFavoredNodesForRegion(String encodedRegionName)
13967 void addToOnlineRegions(Region region)
13968 boolean removeFromOnlineRegions(final Region r, ServerName destination)
13969
Three method names have also changed:
13971 List\<Region\> getOnlineRegions(TableName tableName) -\> List\<Region\> getRegions(TableName tableName)
13972 List\<Region\> getOnlineRegions() -\> List\<Region\> getRegions()
13973 Region getFromOnlineRegions(final String encodedRegionName) -\> Region getRegion(final String encodedRegionName)
13974
13975
13976 ---
13977
13978 * [HBASE-16769](https://issues.apache.org/jira/browse/HBASE-16769) | *Blocker* | **Deprecate/remove PB references from MasterObserver and RegionServerObserver**
13979
The signatures of the following methods in MasterObserver have changed: instead of an org.apache.hadoop.hbase.shaded.protobuf.generated.SnapshotDescription param, they now take an org.apache.hadoop.hbase.client.SnapshotDescription:
13981 preListSnapshot
13982 postListSnapshot
13983 preSnapshot
13984 postSnapshot
13985 preCloneSnapshot
13986 postCloneSnapshot
13987 preRestoreSnapshot
13988 postRestoreSnapshot
13989 preDeleteSnapshot
13990 postDeleteSnapshot
13991
Also changed the signatures of RegionServerObserver#preReplicateLogEntries and postReplicateLogEntries by removing the params List\<org.apache.hadoop.hbase.shaded.protobuf.generated.AdminProtos.WALEntry\> and org.apache.hadoop.hbase.CellScanner.
13993
13994
13995 ---
13996
13997 * [HBASE-18859](https://issues.apache.org/jira/browse/HBASE-18859) | *Major* | **Purge PB from BulkLoadObserver**
13998
13999 No longer pass the protobuf request to prePrepareBulkLoad and preCleanupBulkLoad in BulkLoadObserver as part of our effort to purge protobuf from our Coprocessor API Interface (if you need to read the Table and RegionInfo, pull it from the passed in RegionCoprocessorEnvironment ObserverContext).
14000
14001
14002 ---
14003
14004 * [HBASE-18731](https://issues.apache.org/jira/browse/HBASE-18731) | *Major* | **[compat 1-2] Mark protected methods of QuotaSettings that touch Protobuf internals as IA.Private**
14005
14006 The following methods in QuotaSettings were annotated InterfaceAudience.Private; they are for internal use only in hbase-2.0.0
14007
14008 buildSetQuotaRequestProto(final QuotaSettings settings)
14009 setupSetQuotaRequest(SetQuotaRequest.Builder builder)
14010
Note that there were versions of these methods in HBase 1.y that used classes in the `org.apache.hadoop.hbase.protobuf.generated` package. That package no longer exists, as part of our cleanup of protobufs from our public-facing API, and the related methods have been removed.
14012
14013
14014 ---
14015
14016 * [HBASE-18825](https://issues.apache.org/jira/browse/HBASE-18825) | *Major* | **Use HStoreFile instead of StoreFile in our own code base and remove unnecessary methods in StoreFile interface**
14017
Clean up the StoreFile interface.
14019
14020 The metadata keys are moved to HStoreFile.
14021
14022 These methods are removed:
14023 CacheConfig getCacheConf();
14024 byte[] getMetadataValue(byte[] key);
14025 boolean isCompactedAway();
14026 boolean isReferencedInReads();
14027 void initReader() throws IOException;
14028 StoreFileScanner getPreadScanner(boolean cacheBlocks, long readPt, long scannerOrder, boolean canOptimizeForNonNullColumn);
14029 StoreFileScanner getStreamScanner(boolean canUseDropBehind, boolean cacheBlocks, boolean isCompaction, long readPt, long scannerOrder, boolean canOptimizeForNonNullColumn) throws IOException;
14030 StoreFileReader getReader();
14031 void closeReader(boolean evictOnClose) throws IOException;
14032 void markCompactedAway();
14033 void deleteReader() throws IOException;
14034
Note that these methods are still available in HStoreFile.

The return types of getFirstKey and getLastKey have changed from Cell to Optional\<Cell\> to better indicate that they may not be available.
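
Callers that previously null-checked the result can switch to the Optional idiom. A minimal sketch, assuming a hypothetical SimpleStoreFile stand-in with String in place of Cell so that it compiles without HBase on the classpath:

```java
import java.util.Optional;

class FirstKeyDemo {
  // Stand-in for a store file (assumption: only the one method matters here).
  interface SimpleStoreFile {
    Optional<String> getFirstKey();
  }

  // Describes the first key without a null check at the call site.
  static String describe(SimpleStoreFile file) {
    return file.getFirstKey()
        .map(key -> "first key: " + key)
        .orElse("first key not available");
  }

  public static void main(String[] args) {
    System.out.println(describe(() -> Optional.of("row-001")));
    System.out.println(describe(() -> Optional.empty()));
  }
}
```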
14038
14039
14040 ---
14041
14042 * [HBASE-18786](https://issues.apache.org/jira/browse/HBASE-18786) | *Major* | **FileNotFoundException should not be silently handled for primary region replicas**
14043
14044 FileNotFoundException opening a StoreFile in a primary replica now causes a RegionServer to crash out where before it would be ignored (or optionally handled via close/reopen).
14045
14046
14047 ---
14048
14049 * [HBASE-10504](https://issues.apache.org/jira/browse/HBASE-10504) | *Blocker* | **Define Replication Interface**
14050
Adds a new plugin point, ReplicationEndpoint. ReplicationSource, internal to hbase, tails the WAL and calls registered ReplicationEndpoints. ReplicationEndpoint implementations are responsible for actually shipping the edits to the other (hbase or non-hbase) cluster. A ReplicationEndpoint can be defined per peer. Default inter-cluster replication works without any changes (lily etc. should still work). ReplicationEndpoints offer various facilities, including a means of filtering out WAL edits source-side before they are shipped to remote peers.
14052
14053
14054 ---
14055
14056 * [HBASE-18142](https://issues.apache.org/jira/browse/HBASE-18142) | *Major* | **Deletion of a cell deletes the previous versions too**
14057
delete.rb no longer deletes all versions of the specified column. It deletes only the specified version (if the user supplies a timestamp) or the latest version (the default behavior).
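
The new behavior can be modeled with a toy map of timestamp to value. This is purely illustrative: it is not how the shell or HBase implements deletes (which use delete markers), and all names are hypothetical.

```java
import java.util.NavigableMap;
import java.util.OptionalLong;
import java.util.TreeMap;

class DeleteVersionDemo {
  // Removes one version from 'versions' (timestamp -> value): the one at 'ts'
  // when given, otherwise the newest. Returns true if something was removed.
  static boolean deleteOne(NavigableMap<Long, String> versions, OptionalLong ts) {
    if (versions.isEmpty()) {
      return false;
    }
    long target = ts.isPresent() ? ts.getAsLong() : versions.lastKey();
    return versions.remove(target) != null;
  }

  public static void main(String[] args) {
    NavigableMap<Long, String> versions = new TreeMap<>();
    versions.put(10L, "v1");
    versions.put(20L, "v2");
    versions.put(30L, "v3");
    deleteOne(versions, OptionalLong.empty()); // removes the newest version (30)
    deleteOne(versions, OptionalLong.of(10L)); // removes exactly timestamp 10
    System.out.println(versions);              // only timestamp 20 remains
  }
}
```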
14059
14060
14061 ---
14062
14063 * [HBASE-18446](https://issues.apache.org/jira/browse/HBASE-18446) | *Critical* | **Mark StoreFileScanner/StoreFileReader as IA.LimitedPrivate(Phoenix)**
14064
Mark StoreFileScanner and StoreFileReader as IA.LimitedPrivate(Phoenix).
Deprecated the preStoreFileReaderOpen and postStoreFileReaderOpen methods in RegionObserver to indicate that these methods are only supposed to be used by Phoenix.
14067
14068
14069 ---
14070
14071 * [HBASE-18798](https://issues.apache.org/jira/browse/HBASE-18798) | *Major* | **Remove the unused methods in RegionServerObserver**
14072
Remove the following APIs from RegionServerObserver:

- preRollBackMerge
- postRollBackMerge
- preMergeCommit
- postMergeCommit
- postMerge
- preMerge
14080
14081
14082 ---
14083
14084 * [HBASE-18831](https://issues.apache.org/jira/browse/HBASE-18831) | *Major* | **Add explicit dependency on javax.el**
14085
Specify an explicit version for javax.el. Without it, we rely on repository-cached metadata; a prevalent version of that metadata seems to list all versions between b01 and b08 but finishes with b08-jbossorg, which lives in the jboss repo, a repo most of us do not list in our poms.
14087
14088
14089 ---
14090
14091 * [HBASE-17980](https://issues.apache.org/jira/browse/HBASE-17980) | *Major* | **Any HRegionInfo we give out should be immutable**
14092
14093 Provide alternate user-facing API that takes a RegionInfo Interface instead of a HRegionInfo; the old HRegionInfo methods have been deprecated in 2.0.0 and will be removed in 3.0.0.
14094
14095
14096 ---
14097
14098 * [HBASE-14004](https://issues.apache.org/jira/browse/HBASE-14004) | *Critical* | **[Replication] Inconsistency between Memstore and WAL may result in data in remote cluster that is not in the origin**
14099
Now when replicating a WAL file that is still open for writing, we get its committed length from the WAL instance in the same RS to avoid replicating uncommitted WALEdits.
14101
14102 This is very important if you use AsyncFSWAL, as we use fan-out in AsyncFSWAL. The data written to DN will be visible immediately as all DNs think it is the end of a pipeline, although the client has not received an ack, and also NN may truncate the file if the client crashes at the same time.
14103
14104
14105 ---
14106
14107 * [HBASE-18819](https://issues.apache.org/jira/browse/HBASE-18819) | *Major* | **Set version number to 2.0.0-alpha3 from 2.0.0-alpha3-SNAPSHOT**
14108
14109 Set version on branch-2 to be 2.0.0-alpha3 as part of RC making.
14110
14111
14112 ---
14113
14114 * [HBASE-18683](https://issues.apache.org/jira/browse/HBASE-18683) | *Major* | **Upgrade hbase to commons-math 3**
14115
14116 Moved on to commons-math3. Removed commons-math2.
14117
14118
14119 ---
14120
14121 * [HBASE-18453](https://issues.apache.org/jira/browse/HBASE-18453) | *Major* | **CompactionRequest should not be exposed to user directly**
14122
Introduce a CompactionLifeCycleTracker to let CP users know when a compaction starts and ends. CompactionRequest is marked as IA.Private and should not be used in CP implementations any more.
14124
14125
14126 ---
14127
14128 * [HBASE-18794](https://issues.apache.org/jira/browse/HBASE-18794) | *Major* | **Remove deprecated methods in MasterObserver**
14129
14130 The removed APIs are shown below.
- preCreateTableHandler
- postCreateTableHandler
- preDeleteTableHandler
- postDeleteTableHandler
- preTruncateTableHandler
- postTruncateTableHandler
- preModifyTableHandler
- postModifyTableHandler
- preAddColumn
- postAddColumn
- preAddColumnHandler
- postAddColumnHandler
- preModifyColumn
- postModifyColumn
- preModifyColumnHandler
- postModifyColumnHandler
- preDeleteColumn
- postDeleteColumn
- preDeleteColumnHandler
- postDeleteColumnHandler
- preEnableTableHandler
- postEnableTableHandler
- preDisableTableHandler
- postDisableTableHandler
- preDispatchMerge
- postDispatchMerge
14157
14158
14159 ---
14160
14161 * [HBASE-14998](https://issues.apache.org/jira/browse/HBASE-14998) | *Blocker* | **Unify synchronous and asynchronous methods in Admin and cleanup**
14162
 \* Deprecates getAlterStatus. Everywhere else we talk of 'modify' rather
       than 'alter'; callers should use the Future returned from async instead.
 \* isTableAvailable(TableName, byte [][]) has been deprecated to be
       removed; use the override instead. This is a weird method.
14167  \* Changed listTableDescriptor to getDescriptor.
14168  \* Renamed other like methods to have same pattern (deprecating the old):
14169         balancer =\> balance
14170         setBalancerRunning =\> balancerSwitch
14171         setNormalizerRunning =\> normalizerSwitch
14172         enableCatalogJanitor =\> catalogJanitorSwitch
14173         setCleanerChoreRunning =\> cleanerChoreSwitch
14174         setSplitOrMergeEnabled =\> splitOrMergeEnabledSwitch
14175
14176  \* Renamed (with deprecation of old) runCatalogScan =\> runCatalogJanitor.
14177  \* Reviewed generated javadoc and made some edits; purged reference to
14178        hbase issues from our API, fixed param names, etc.
14179  \* Made all the enable services methods have same pattern.
14180  \* Renamed takeSnapshotAsync as snapshotAsync (with deprecation of old)
14181  \* Renamed execProcedureWithRet as execProcedureWithReturn (with
14182        deprecation)
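
For anyone scripting a migration, the renames listed above can be kept as a simple lookup table. The sketch below is only a convenience built from the names in this note; `AdminRenames` and `replacementFor` are hypothetical helpers, not part of HBase.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical helper: old Admin method name -> new name, as listed in this note.
final class AdminRenames {

    static final Map<String, String> RENAMES = new LinkedHashMap<>();

    static {
        RENAMES.put("listTableDescriptor", "getDescriptor");
        RENAMES.put("balancer", "balance");
        RENAMES.put("setBalancerRunning", "balancerSwitch");
        RENAMES.put("setNormalizerRunning", "normalizerSwitch");
        RENAMES.put("enableCatalogJanitor", "catalogJanitorSwitch");
        RENAMES.put("setCleanerChoreRunning", "cleanerChoreSwitch");
        RENAMES.put("setSplitOrMergeEnabled", "splitOrMergeEnabledSwitch");
        RENAMES.put("runCatalogScan", "runCatalogJanitor");
        RENAMES.put("takeSnapshotAsync", "snapshotAsync");
        RENAMES.put("execProcedureWithRet", "execProcedureWithReturn");
    }

    /** Returns the new name for an old method name, or the input if it was not renamed. */
    static String replacementFor(String methodName) {
        return RENAMES.getOrDefault(methodName, methodName);
    }

    public static void main(String[] args) {
        for (Map.Entry<String, String> e : RENAMES.entrySet()) {
            System.out.println(e.getKey() + " => " + e.getValue());
        }
    }
}
```

The table covers names only; argument lists and return types should be checked against the Admin javadoc for the target release.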
14183
14184
14185 ---
14186
14187 * [HBASE-18723](https://issues.apache.org/jira/browse/HBASE-18723) | *Major* | **[pom cleanup] Do a pass with dependency:analyze; remove unused and explicity list the dependencies we exploit**
14188
Purged a number of dependencies that were included but unused. Added explicit references to dependencies we do use but did not list (they were included transitively). Purged all but junit from the parent pom dependency set and added explicit includes in modules instead; not all modules need mockito, etc. Work remaining: the grey area around hadoop and its transitive includes still needs cleanup before dependency:analyze runs clean. We also need to figure out how to purge junit from the parent dependency list.
14190
14191
14192 ---
14193
14194 * [HBASE-17823](https://issues.apache.org/jira/browse/HBASE-17823) | *Major* | **Migrate to Apache Yetus Audience Annotations**
14195
14196 HBase now uses stability and audience annotations sourced from Apache Yetus, instead of the custom annotations that were previously in place.
14197
14198
14199 ---
14200
14201 * [HBASE-18793](https://issues.apache.org/jira/browse/HBASE-18793) | *Major* | **Remove deprecated methods in RegionObserver**
14202
14203 These deprecated methods are removed from RegionObserver:
14204 InternalScanner preFlushScannerOpen(ObserverContext, Store, List, InternalScanner) throws IOException;
14205 void preCompactSelection(ObserverContext, Store, List) throws IOException;
14206 void postCompactSelection(ObserverContext, Store, ImmutableList);
14207 InternalScanner preCompact(ObserverContext, Store, InternalScanner, ScanType) throws IOException;
14208 InternalScanner preCompactScannerOpen(ObserverContext, Store, List, ScanType, long, InternalScanner, CompactionRequest) throws IOException;
InternalScanner preCompactScannerOpen(ObserverContext, Store, List, ScanType, long, InternalScanner) throws IOException;
14210 void preSplit(ObserverContext) throws IOException;
14211 void preSplit(ObserverContext, byte[]) throws IOException;
14212 void postSplit(ObserverContext, Region, Region) throws IOException;
14213 void preSplitBeforePONR(ObserverContext, byte[], List) throws IOException;
14214 void preSplitAfterPONR(ObserverContext) throws IOException;
14215 void preRollBackSplit(ObserverContext) throws IOException;
14216 void postRollBackSplit(ObserverContext) throws IOException;
14217 void postCompleteSplit(ObserverContext) throws IOException;
14218 long preIncrementColumnValue(ObserverContext, byte[], byte[], byte[], long, boolean) throws IOException;
long postIncrementColumnValue(ObserverContext, byte[], byte[], byte[], long, boolean, long) throws IOException;
14220 KeyValueScanner preStoreScannerOpen(ObserverContext, Store, Scan, NavigableSet, KeyValueScanner) throws IOException;
14221 boolean postScannerFilterRow(ObserverContext, InternalScanner, byte[], int, short, boolean) throws IOException;
14222 boolean postBulkLoadHFile(ObserverContext, List, boolean) throws IOException;
14223
14224 And this method is also removed since we never call it in our code base:
14225 InternalScanner preFlushScannerOpen(ObserverContext, Store, KeyValueScanner, InternalScanner, long) throws IOException;
14226
14227 The deprecated annotation is removed for these two methods as they are still being used:
14228 void preFlush(ObserverContext) throws IOException;
void postFlush(ObserverContext) throws IOException;
14230
14231
14232 ---
14233
14234 * [HBASE-18733](https://issues.apache.org/jira/browse/HBASE-18733) | *Major* | **[compat 1-2] Hide WALKey**
14235
WALKey, annotated @InterfaceAudience.LimitedPrivate(HBaseInterfaceAudience.REPLICATION), changed significantly for 2.0.0; see the issue for details. We judged it acceptable to hide it since it should be internal anyway: only HBase itself should be creating WALKeys.
14237
14238
14239 ---
14240
14241 * [HBASE-13271](https://issues.apache.org/jira/browse/HBASE-13271) | *Critical* | **Table#puts(List\<Put\>) operation is indeterminate; needs fixing**
14242
14243 Adds more spec on how Get, Delete, and Put work and how they differ to help the user.
14244
14245
14246 ---
14247
14248 * [HBASE-16479](https://issues.apache.org/jira/browse/HBASE-16479) | *Major* | **Move WALEdit from hbase.regionserver.wal package to hbase.wal package**
14249
Incompatible move of the WALEdit class from the regionserver.wal package to the wal package. Affects the @InterfaceAudience.LimitedPrivate({HBaseInterfaceAudience.REPLICATION, HBaseInterfaceAudience.COPROC}) audiences.
14254
14255
14256 ---
14257
14258 * [HBASE-10240](https://issues.apache.org/jira/browse/HBASE-10240) | *Critical* | **Remove 0.94-\>0.96 migration code**
14259
14260 Purge 0.94=\>0.96 deprecated, migration code. This means that if you are on 0.94 and wish to go to hbase 2.0, you must first migrate to a version of hbase that is \>= 0.96.
14261
14262
14263 ---
14264
14265 * [HBASE-18783](https://issues.apache.org/jira/browse/HBASE-18783) | *Minor* | **Declare the builder of ClusterStatus as IA.Private, and remove the Writables from ClusterStatus**
14266
14267 **WARNING: No release note provided for this change.**
14268
14269
14270 ---
14271
14272 * [HBASE-18106](https://issues.apache.org/jira/browse/HBASE-18106) | *Critical* | **Redo ProcedureInfo and LockInfo**
14273
14274 Admin.listProcedures and Admin.listLocks were renamed to getProcedures and getLocks (listProcedures was added to hbase 1.2). This change was done in an incompatible way -- we just yanked listProcedures (Because Admin Interface is not compatible with hbase1).
14275
14276     Main changes:
14277     - ProcedureInfo and LockInfo were removed, we use JSON instead of them
14278     - Procedure and LockedResource are their server side equivalent
    - Procedure protobuf state\_data became obsolete; it is only kept for
14280       reading previously written WAL
14281     - Procedure protobuf contains a state\_message field, which stores the internal
14282       state messages (Any type instead of bytes)
14283     - Procedure.serializeStateData and deserializeStateData were changed slightly
14284     - Procedures internal states are available on client side
14285     - Procedures are displayed on web UI and in shell in the following jruby format:
      { ID =\> '1', PARENT\_ID =\> '-1', PARAMETERS =\> [ ..extra state information.. ] }
14287
14288
14289 ---
14290
14291 * [HBASE-18621](https://issues.apache.org/jira/browse/HBASE-18621) | *Major* | **Refactor ClusterOptions before applying to code base**
14292
14293 Provide a new way to get desired ClusterStatus with a set of ClusterStatus.Option, such that the response back to client can be limited.
Note that creating a ClusterStatus through its constructors will no longer be supported after 2.0.0; use ClusterStatus.Builder instead.
14295
14296
14297 ---
14298
14299 * [HBASE-18780](https://issues.apache.org/jira/browse/HBASE-18780) | *Minor* | **Remove HLogPrettyPrinter and hlog command**
14300
14301 **WARNING: No release note provided for this change.**
14302
14303
14304 ---
14305
14306 * [HBASE-14997](https://issues.apache.org/jira/browse/HBASE-14997) | *Critical* | **Move compareOp and Comparators out of filter to client package**
14307
14308 Deprecate checkAnd\* APIs that take the filter CompareOp. Added new overrides that take a generic CompareOperator instead. CompareOperator will be used by checkAnd\* in Table API and by filters going forward.
14309
14310 Other nice improvements suggested by this issue have been moved out to HBASE-18774.
14311
14312
14313 ---
14314
14315 * [HBASE-17972](https://issues.apache.org/jira/browse/HBASE-17972) | *Minor* | **Remove mergePool from CompactSplitThread**
14316
14317 After this jira, mergePool will be permanently removed from CompactSplitThread.
14318
14319
14320 ---
14321
14322 * [HBASE-18704](https://issues.apache.org/jira/browse/HBASE-18704) | *Major* | **Upgrade hbase to commons-collections 4**
14323
14324 **WARNING: No release note provided for this change.**
14325
14326
14327 ---
14328
14329 * [HBASE-18697](https://issues.apache.org/jira/browse/HBASE-18697) | *Major* | **Need a shaded hbase-mapreduce module**
14330
14331 Replaces hbase-shaded-server-\<version\>.jar with hbase-shaded-mapreduce-\<version\>.jar.
14332
14333
14334 ---
14335
14336 * [HBASE-15607](https://issues.apache.org/jira/browse/HBASE-15607) | *Blocker* | **Remove PB references from Admin for 2.0**
14337
14338 All the references to Protos in Admin.java have been removed and replaced with respective POJO classes.
14339 The references to Protos that were removed are
14340 AdminProtos.GetRegionInfoResponse,
14341 HBaseProtos.SnapshotDescription, HBaseProtos.SnapshotDescription.Type,
14342  MasterProtos.SnapshotResponse.
14343 CompactionType, CompactionState and MasterSwitchType Enums have been moved out of Admin.java to standalone Enums.
14344
14345
14346 ---
14347
14348 * [HBASE-18674](https://issues.apache.org/jira/browse/HBASE-18674) | *Major* | **upgrade hbase to commons-lang3**
14349
Move to commons-lang3 from commons-lang (check it out; it is a nice library with some nice utilities).
14351
14352
14353 ---
14354
14355 * [HBASE-18736](https://issues.apache.org/jira/browse/HBASE-18736) | *Major* | **Cleanup the HTD/HCD for Admin**
14356
Changed the arguments passed to Admin methods from HTableDescriptor/HColumnDescriptor (HTD/HCD) to TableDescriptor/ColumnFamilyDescriptor (TD/CFD).
14358
14359
14360 ---
14361
14362 * [HBASE-18699](https://issues.apache.org/jira/browse/HBASE-18699) | *Major* | **Copy LoadIncrementalHFiles to another package and mark the old one as deprecated**
14363
14364 Introduce a new o.a.h.h.tool.LoadIncrementalHFiles. The old o.a.h.h.mapreduce.LoadIncrementalHFiles is deprecated and will be removed in 3.0.0.
14365
14366
14367 ---
14368
14369 * [HBASE-18739](https://issues.apache.org/jira/browse/HBASE-18739) | *Major* | **Make all TimeRange Constructors InterfaceAudience Private.**
14370
14371 All constructors have already been deprecated. This change makes them InterfaceAudience Private.
14372
14373
14374 ---
14375
14376 * [HBASE-18675](https://issues.apache.org/jira/browse/HBASE-18675) | *Minor* | **Making {max,min}SessionTimeout configurable for MiniZooKeeperCluster**
14377
14378 <!-- markdown -->
14379
14380
14381 Standalone clusters and minicluster instances can now configure the session timeout for our embedded ZooKeeper quorum using `hbase.zookeeper.property.minSessionTimeout` and `hbase.zookeeper.property.maxSessionTimeout`.
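
As a rough illustration of what these two settings bound: a ZooKeeper server clamps the session timeout a client requests into the range between the minimum and the maximum. The sketch below models that clamp with plain `java.util.Properties`; the class name and the millisecond values are assumptions for illustration, not HBase defaults, and both properties must be set for this sketch to work.

```java
import java.util.Properties;

// Illustrative model of how min/max session timeouts bound a requested timeout.
final class SessionTimeoutBounds {

    static final String MIN_KEY = "hbase.zookeeper.property.minSessionTimeout";
    static final String MAX_KEY = "hbase.zookeeper.property.maxSessionTimeout";

    /** Clamps the requested timeout (ms) into [min, max] as read from the configuration. */
    static int negotiate(Properties conf, int requestedMs) {
        int min = Integer.parseInt(conf.getProperty(MIN_KEY));
        int max = Integer.parseInt(conf.getProperty(MAX_KEY));
        return Math.max(min, Math.min(max, requestedMs));
    }

    public static void main(String[] args) {
        Properties conf = new Properties();
        conf.setProperty(MIN_KEY, "4000");   // example value, in milliseconds
        conf.setProperty(MAX_KEY, "40000");  // example value, in milliseconds
        System.out.println(negotiate(conf, 90000)); // request above the maximum
        System.out.println(negotiate(conf, 1000));  // request below the minimum
        System.out.println(negotiate(conf, 30000)); // request within bounds
    }
}
```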
14382
14383
14384 ---
14385
14386 * [HBASE-15806](https://issues.apache.org/jira/browse/HBASE-15806) | *Critical* | **An endpoint-based export tool**
14387
14388 org.apache.hadoop.hbase.coprocessor.Export
Instructs HBase to dump the contents of a table to HDFS in a sequence file
+ replaces MR by endpoint (see org.apache.hadoop.hbase.mapreduce.Export)
+ no large data to be transferred between hbase server and client
14392 + same command line as org.apache.hadoop.hbase.mapreduce.Export
14393 - user needs to alter table for deploying ExportEndpoint
14394 - user needs to adjust the endpoint timeout for dumping large data
14395 - user needs to get the EXECUTE permission
14396
14397
14398 ---
14399
14400 * [HBASE-18577](https://issues.apache.org/jira/browse/HBASE-18577) | *Critical* | **shaded client includes several non-relocated third party dependencies**
14401
14402 <!-- markdown -->
14403
14404
14405 The HBase shaded artifacts (hbase-shaded-client and hbase-shaded-server) no longer contain several non-relocated third party dependency classes that were mistakenly included. Downstream users who relied on these classes being present will need to add a runtime dependency onto an appropriate third party artifact.
14406
14407 Previously, we erroneously packaged several third party libs without relocating them. In some cases these libraries have now been relocated; in some cases they are no longer included at all.
14408
14409 Includes:
14410
14411 * jaxb
14412 * jetty
14413 * jersey
14414 * codahale metrics (HBase 1.4+ only)
14415 * commons-crypto
14416 * jets3t
14417 * junit
14418 * curator (HBase 1.4+)
14419 * netty 3 (HBase 1.1)
* mockito-junit4 (HBase 1.1)
14421
14422 There is now testing to ensure that the shaded artifacts only contain expected relocated content. It can be run via `mvn -Dtest=noUnitTests -pl hbase-shaded/hbase-shaded-check-invariants -am -Prelease verify`.
14423
14424 For version 2.0+ this patch removes hadoop-mapreduce-client-core from the set of dependencies included for the hbase-client and hbase-shaded-client artifacts.
14425
14426 For 2.0+, the slf4j-log4j12 dependency is now optional for both shaded artifacts.
14427
14428
14429 ---
14430
14431 * [HBASE-14745](https://issues.apache.org/jira/browse/HBASE-14745) | *Blocker* | **Shade the last few dependencies in hbase-shaded-client**
14432
14433 Previously some dependencies in hbase-shaded-client were still leaking into the un-shaded namespace. This should now be fixed.
14434
14435 Additionally the rat checking on generated intermediate files from shading should be skipped.
14436
14437
14438 ---
14439
14440 * [HBASE-18665](https://issues.apache.org/jira/browse/HBASE-18665) | *Critical* | **ReversedScannerCallable invokes getRegionLocations incorrectly**
14441
Reverse scans on tables used the meta cache incorrectly and fetched data from the meta table every time. This fix resolves the issue, which improves the performance of reverse scans.
14443
14444
14445 ---
14446
14447 * [HBASE-3935](https://issues.apache.org/jira/browse/HBASE-3935) | *Major* | **HServerLoad.storefileIndexSizeMB should be changed to storefileIndexSizeKB**
14448
This patch removed storefile\_index\_size\_MB from the protobuf. As a result, users still on hbase-client 1.x will see a value of zero for storefile\_index\_size\_MB.
14450
14451
14452 ---
14453
14454 * [HBASE-18640](https://issues.apache.org/jira/browse/HBASE-18640) | *Major* | **Move mapreduce out of hbase-server into separate hbase-mapreduce module**
14455
14456 - Moves all org.apache.hadoop.hbase.mapreduce.\* (except LoadIncrementalHFiles) and org.apache.hadoop.hbase.mapred.\* classes from hbase-server module to new hbase-mapreduce module.
14457 - Also moves following tools from hbase-server module to hbase-mapreduce module: CompactionTool, ExportSnapshot, PerformanceEvaluation, LoadTestTool
14458 - Very minor breakages in  LoadTestTool(LimitedPrivate HBaseInterfaceAudience.TOOLS)
14459
14460
14461 ---
14462
14463 * [HBASE-18519](https://issues.apache.org/jira/browse/HBASE-18519) | *Major* | **Use builder pattern to create cell**
14464
Introduce the CellBuilder helper.
1) Use CellBuilderFactory to get a CellBuilder for creating a cell with row, column family, qualifier, type, and value.
2) For internal use, the ExtendedCellBuilder, which is created by ExtendedCellBuilderFactory, can build a cell with extra fields: sequence id and tags.
14469
14470
14471 ---
14472
14473 * [HBASE-18448](https://issues.apache.org/jira/browse/HBASE-18448) | *Minor* | **EndPoint example  for refreshing HFiles for stores**
14474
14475 Adds a new RefreshHFiles Coprocessor Endpoint example. Includes client and serverside-endpoint that iterates region Stores to call #refreshStoreFiles.
14476
14477
14478 ---
14479
14480 * [HBASE-18658](https://issues.apache.org/jira/browse/HBASE-18658) | *Major* | **Purge hokey hbase Service implementation; use (internal) Guava Service instead**
14481
Removed the hbase Service class. It was not fully formed. Now that Guava is relocated, we use its Service internally instead; its AbstractService also provides a nice facility for implementations.
14483
14484
14485 ---
14486
14487 * [HBASE-15982](https://issues.apache.org/jira/browse/HBASE-15982) | *Blocker* | **Interface ReplicationEndpoint extends Guava's Service**
14488
14489     Breaking change to our ReplicationEndpoint and BaseReplicationEndpoint.
14490
    ReplicationEndpoint extended the Guava 12 Service interface. An abstract
    subclass, BaseReplicationEndpoint, provided default implementations
    and facility, among other things, by extending Guava's
    AbstractService class.

    Both of these HBase classes were marked LimitedPrivate for
    REPLICATION, so these classes were semi-public and made it so
    Guava 12 was part of our API.

    Having Guava in our API was a mistake. It anchors us and the
    implementation of the Interface to Guava 12. This is untenable
    given Guava changes and that the Service Interface in particular
    has had extensive revamp and improvement done. We can't hold to
    the Guava Interface. It changed. We can't stay on Guava 12;
    implementors and others on our CLASSPATH won't abide being stuck
    on an old Guava.
14507
14508     So we make breaking changes. The unhitching of our Interface
14509     from Guava could only be done in a breaking manner. It undoes the
14510     LimitedPrivate on BaseReplicationEndpoint while keeping it for the RE
14511     Interface. It means consumers will have to copy/paste the
14512     AbstractService-based BRE into their own codebase also supplying their
14513     own Guava; HBase no longer 'supplies' this (our Guava usage has
14514     been internalized, relocated).
14515
    This patch then adds into RE the basic methods RE needs of the old
    Guava Service rather than return a Service to start/stop only to go
    back to the RE instance to do actual work. A few method names had to
    be changed so that implementations could use a Guava Service internally
    without RE method names and types clashing. Semantics remained the
    same otherwise. For example, startAsync and stopAsync in Guava are start
    and stop in RE.
14523
14524
14525 ---
14526
14527 * [HBASE-18347](https://issues.apache.org/jira/browse/HBASE-18347) | *Major* | **Implement a BufferedMutator for async client**
14528
14529 Introduce an AsyncBufferedMutator for batching requests to HBase for a single table.
14530
14531 Use AsyncConnection.getBufferedMutator method to get an AsyncBufferedMutator instance.
14532
14533
14534 ---
14535
14536 * [HBASE-18546](https://issues.apache.org/jira/browse/HBASE-18546) | *Critical* | **Always overwrite the TS for Append/Increment unless no existing cells are found**
14537
If an Append/Increment finds no existing cell, the custom timestamp supplied with it is not overridden. Otherwise, the cell's timestamp is always overridden by the server.
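
The rule can be summarized in a few lines. This is a model of the behavior described here, not the server code; `AppendIncrementTimestamp` is a hypothetical name, and how the server picks its own timestamp is out of scope.

```java
// Illustrative model of which timestamp an Append/Increment result carries.
final class AppendIncrementTimestamp {

    /**
     * @param existingCellFound whether the server found a cell to append to or increment
     * @param customTs          timestamp supplied by the client on the mutation
     * @param serverTs          timestamp chosen by the server
     */
    static long resolve(boolean existingCellFound, long customTs, long serverTs) {
        // Only when nothing exists yet does the client-supplied timestamp survive.
        return existingCellFound ? serverTs : customTs;
    }

    public static void main(String[] args) {
        System.out.println(resolve(false, 100L, 999L)); // no existing cell
        System.out.println(resolve(true, 100L, 999L));  // existing cell found
    }
}
```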
14539
14540
14541 ---
14542
14543 * [HBASE-18224](https://issues.apache.org/jira/browse/HBASE-18224) | *Critical* | **Upgrade jetty**
14544
14545 Moved from Jetty 9.3.x to 9.4.x.
14546
14547 Jetty returns more correct HTTP code when Header is too long, 431 instead of 413, and it requires more threads to start up (made default 16 instead of 10).
14548
14549
14550 ---
14551
14552 * [HBASE-17442](https://issues.apache.org/jira/browse/HBASE-17442) | *Critical* | **Move most of the replication related classes from hbase-client to hbase-replication package**
14553
14554 Move replication implementation's classes from hbase-client to hbase-replication package.
14555
14556
14557 ---
14558
14559 * [HBASE-18653](https://issues.apache.org/jira/browse/HBASE-18653) | *Major* | **Undo hbase2 check against \< hadoop2.6.x; i.e. implement agreed drop of hadoop 2.4 and 2.5 support in hbase2**
14560
14561 Change the yetus profile for branch-2 so it no longer runs hadoop 2.4.x and 2.5.x build checks.
14562
14563
14564 ---
14565
14566 * [HBASE-18630](https://issues.apache.org/jira/browse/HBASE-18630) | *Major* | **Prune dependencies; as is branch-2 has duplicates**
14567
14568 Removed doubled instances of javax.inject and commons-beanutils where the versions were close.
14569
Other 'doubled' includes have different groupIds, so we were wary of pruning them, especially when they are transitive includes (from hadoop, jetty, et al.).
14571
14572
14573 ---
14574
14575 * [HBASE-18631](https://issues.apache.org/jira/browse/HBASE-18631) | *Minor* | **Allow configuration of ChaosMonkey properties via hbase-site**
14576
This change removes the need for a separate Java properties file to configure the ChaosMonkey included with HBase. These properties can be provided directly in hbase-site.xml. If configuration is provided in both locations, the Java properties file takes precedence.
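
The precedence rule can be sketched with plain `java.util.Properties`. The class below is a hypothetical model, not the ChaosMonkey loader, and the `monkey.example.*` keys are made up for illustration.

```java
import java.util.Properties;

// Illustrative model: the properties file wins over values from hbase-site.xml.
final class ChaosMonkeyConfig {

    static Properties effective(Properties fromHbaseSite, Properties fromPropertiesFile) {
        // Values from hbase-site.xml act as defaults...
        Properties merged = new Properties(fromHbaseSite);
        // ...and anything present in the properties file overrides them.
        merged.putAll(fromPropertiesFile);
        return merged;
    }

    public static void main(String[] args) {
        Properties site = new Properties();
        site.setProperty("monkey.example.period", "60000");    // hypothetical key
        site.setProperty("monkey.example.site.only", "true");  // hypothetical key

        Properties file = new Properties();
        file.setProperty("monkey.example.period", "5000");

        Properties eff = effective(site, file);
        // Use getProperty so that lookups fall back to the defaults.
        System.out.println(eff.getProperty("monkey.example.period"));
        System.out.println(eff.getProperty("monkey.example.site.only"));
    }
}
```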
14578
14579
14580 ---
14581
14582 * [HBASE-18489](https://issues.apache.org/jira/browse/HBASE-18489) | *Major* | **Expose scan cursor in RawScanResultConsumer**
14583
14584 Add a 'cursor' method which returns an 'Optional\<Cursor\>' in 'RawScanResultConsumer.ScanController'. You can use this method to obtain the scan cursor if available.
14585
14586
14587 ---
14588
14589 * [HBASE-18511](https://issues.apache.org/jira/browse/HBASE-18511) | *Blocker* | **Default no regions on master**
14590
Changes the configuration hbase.balancer.tablesOnMaster from a list of table names that the master can carry (with 'none' meaning no tables on the master) to a boolean that is set to true if the master carries tables/regions and false if it does not. If true, the master acts like any regionserver.
14592
14593 If false, then the master carries no tables. This is the default for hbase-2.0.0.
14594
14595 Another boolean configuration, hbase.balancer.tablesOnMaster.systemTablesOnly, when set to true, enables hbase.balancer.tablesOnMaster and makes it so the master hosts system tables exclusively (the long-time deploy mode of master branch and branch-2 up until this commit).
14596
UPDATE: This is broken. See HBASE-19785.
14598 UPDATE2: Master carrying Regions does not work reliably, see HBASE-19828.
14599
14600 See HBASE-19831, the issue to fix regions on Master
14601
14602 The change of hbase.balancer.tablesOnMaster from String list to boolean and
14603 the addition of a simple boolean to enable system-tables on Master was done
14604 to constrain what operators might ask for via this master configuration.
14605 Stipulating what tables are bound to the Master server verges into
14606 regionserver grouping territory, a more robust means of specifying table
14607 and server combinations. Operators should use this latter if they want
14608 layouts more exotic than those supplied by the provided booleans.
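
How the two booleans were intended to combine can be sketched as below. This models only the description in this note (and, per the UPDATE lines, not necessarily what works in practice); `MasterRegionPolicy` and its `Role` enum are hypothetical names.

```java
// Illustrative model of hbase.balancer.tablesOnMaster and
// hbase.balancer.tablesOnMaster.systemTablesOnly as described in this note.
final class MasterRegionPolicy {

    enum Role { NO_REGIONS, SYSTEM_TABLES_ONLY, LIKE_REGIONSERVER }

    static Role resolve(boolean tablesOnMaster, boolean systemTablesOnly) {
        if (systemTablesOnly) {
            // Setting systemTablesOnly enables tablesOnMaster implicitly.
            return Role.SYSTEM_TABLES_ONLY;
        }
        return tablesOnMaster ? Role.LIKE_REGIONSERVER : Role.NO_REGIONS;
    }

    public static void main(String[] args) {
        System.out.println(resolve(false, false)); // the hbase-2.0.0 default
        System.out.println(resolve(true, false));
        System.out.println(resolve(false, true));
    }
}
```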
14609
14610
14611 ---
14612
14613 * [HBASE-18553](https://issues.apache.org/jira/browse/HBASE-18553) | *Major* | **Expose scan cursor for asynchronous scanner**
14614
The ResultScanner obtained from an AsyncTable will also return cursor results if Scan.isNeedCursorResult is true.
14616
14617
14618 ---
14619
14620 * [HBASE-18598](https://issues.apache.org/jira/browse/HBASE-18598) | *Minor* | **AsyncNonMetaRegionLocator use FIFO algorithm to get a candidate locate request**
14621
14622 Introduce FIFO algorithm to get a candidate locate request for AsyncNonMetaRegionLocator.
14623
14624
14625 ---
14626
14627 * [HBASE-18533](https://issues.apache.org/jira/browse/HBASE-18533) | *Major* | **Expose BucketCache values to be configured**
14628
14629 This patch exposes configuration for Bucketcache. These configs are very similar to those for the LRU cache, but are described below:
14630
- `hbase.bucketcache.single.factor`: single-access bucket size.
- `hbase.bucketcache.multi.factor`: multiple-access bucket size.
- `hbase.bucketcache.memory.factor`: in-memory bucket size.
- `hbase.bucketcache.extrafreefactor`: floating-point factor of extra blocks to free when evicting. For example, free the number of blocks requested \* (1 + extraFreeFactor).
- `hbase.bucketcache.acceptfactor`: acceptable size of the cache (no evictions if size \< acceptable).
- `hbase.bucketcache.minfactor`: minimum threshold of the cache (when evicting, evict until size \< min).
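
The arithmetic behind three of these settings can be sketched as follows. This is a reading of the descriptions above, not the BucketCache code: the class and method names are hypothetical, the factor values in `main` are examples rather than defaults, and the rounding is an assumption.

```java
// Illustrative arithmetic for extrafreefactor, acceptfactor and minfactor.
final class BucketCacheSizing {

    /** Blocks to free when evicting: requested * (1 + extraFreeFactor), rounded down. */
    static long blocksToFree(long blocksRequested, double extraFreeFactor) {
        return (long) Math.floor(blocksRequested * (1.0 + extraFreeFactor));
    }

    /** No evictions while size is below the acceptable size (capacity * acceptFactor). */
    static boolean shouldEvict(long currentSize, long capacity, double acceptFactor) {
        return currentSize >= (long) (capacity * acceptFactor);
    }

    /** When evicting, evict until size drops below capacity * minFactor. */
    static long evictionTarget(long capacity, double minFactor) {
        return (long) (capacity * minFactor);
    }

    public static void main(String[] args) {
        System.out.println(blocksToFree(100L, 0.25));
        System.out.println(shouldEvict(700L, 1000L, 0.75));
        System.out.println(evictionTarget(1000L, 0.5));
    }
}
```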
14637
14638
14639 ---
14640
14641 * [HBASE-18528](https://issues.apache.org/jira/browse/HBASE-18528) | *Critical* | **DON'T allow user to modify the passed table/column descriptor**
14642
14643 **WARNING: No release note provided for this change.**
14644
14645
14646 ---
14647
14648 * [HBASE-18271](https://issues.apache.org/jira/browse/HBASE-18271) | *Blocker* | **Shade netty**
14649
Depend on hbase-thirdparty for our netty instead of directly relying on netty-all. netty is relocated in hbase-thirdparty from io.netty to org.apache.hadoop.hbase.shaded.io.netty. One kink is that netty bundles an .so, and its files are also relocated. For netty to find the .so content, a system property telling netty about the shading must be specified on the command line.
14651
14652 The .so trick is from
14653              https://stackoverflow.com/questions/33825743/rename-files-inside-a-jar-using-some-maven-plugin
14654
14655 In essence we need the below defined whenever we run tests or deploy:
14656
14657 -Dorg.apache.hadoop.hbase.shaded.io.netty.packagePrefix=org.apache.hadoop.hbase.shaded.
14658
14659 (The trailing '.' is required)
14660
14661 See toward the end of this issue for how to pass config: https://github.com/netty/netty/issues/6665
14662
The system property has been added to bin/hbase. If you start hbase by means other than bin/hbase, add this system property yourself (at least on linux).
14664
For devs, going forward, do not reference io.netty. Reference org.apache.hadoop.hbase.shaded.io.netty instead. Here is a sample:
14666
14667 {code}
14668 -import io.netty.channel.Channel;
14669 -import io.netty.channel.EventLoop;
14670 +import org.apache.hadoop.hbase.shaded.io.netty.channel.Channel;
14671 +import org.apache.hadoop.hbase.shaded.io.netty.channel.EventLoop;
14672 {code}
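
To see why the trailing '.' matters, consider the prefix as a plain string that is prepended to the original package name. The sketch below only illustrates that concatenation for class names; it does not model how netty itself consumes the property, and `ShadedNettyPrefix` is a hypothetical helper.

```java
// Illustrative only: the relocation prefix is prepended verbatim, so it must end with '.'.
final class ShadedNettyPrefix {

    static final String PROPERTY = "org.apache.hadoop.hbase.shaded.io.netty.packagePrefix";

    static boolean isUsablePrefix(String prefix) {
        return prefix != null && prefix.endsWith(".");
    }

    /** Returns the relocated name for an original io.netty class name. */
    static String relocated(String prefix, String originalClassName) {
        if (!isUsablePrefix(prefix)) {
            throw new IllegalArgumentException("prefix must end with '.': " + prefix);
        }
        return prefix + originalClassName;
    }

    public static void main(String[] args) {
        // Falls back to the value from this note when the property is not set.
        String prefix = System.getProperty(PROPERTY, "org.apache.hadoop.hbase.shaded.");
        System.out.println(relocated(prefix, "io.netty.channel.Channel"));
    }
}
```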
14673
14674
14675 ---
14676
14677 * [HBASE-15511](https://issues.apache.org/jira/browse/HBASE-15511) | *Major* | **ClusterStatus should be able to return responses by scope**
14678
14679 Provide a new way to get desired ClusterStatus with a set of ClusterStatus.Option, such that the response back to client can be limited.
Note that creating a ClusterStatus through its constructors will no longer be supported after 2.0.0; use ClusterStatus.Builder instead.
14681
14682
14683 ---
14684
14685 * [HBASE-18551](https://issues.apache.org/jira/browse/HBASE-18551) | *Major* | **[AMv2] UnassignProcedure and crashed regionservers**
14686
Unassign will not proceed if it is unable to talk to the remote server. Now it will expire the server it is unable to communicate with and then wait until it is signaled by ServerCrashProcedure that the server's logs have been split. Only then will it judge the unassign successful.
14688
14689 We do this because a subsequent assign lacking the crashed server context might open a region w/o first splitting logs.
14690
14691
14692 ---
14693
14694 * [HBASE-18469](https://issues.apache.org/jira/browse/HBASE-18469) | *Critical* | **Correct  RegionServer metric of  totalRequestCount**
14695
HBASE-18469 introduces a new RegionServer metric named "totalRowActionRequestCount", which counts all row actions and equals the sum of "readRequestCount" and "writeRequestCount". In addition, "totalRequestCount" now counts a multi request only once, whereas previously it counted every action in the request. As a result, existing monitoring of totalRequestCount will still work but will see smaller values; we strongly recommend switching to the new metric to monitor server load.
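
The difference between the two counters can be shown with a small model. This is not the RegionServer metrics code; `RequestMetrics` and `onRpc` are hypothetical names, and each call to `onRpc` stands for one request carrying some number of read and write row actions.

```java
// Illustrative model of totalRequestCount vs. totalRowActionRequestCount.
final class RequestMetrics {

    long totalRequestCount;
    long readRequestCount;
    long writeRequestCount;

    /** Records one RPC that carries the given number of read and write row actions. */
    void onRpc(int readActions, int writeActions) {
        totalRequestCount += 1;            // a multi request is counted only once
        readRequestCount += readActions;   // row actions are counted individually
        writeRequestCount += writeActions;
    }

    long totalRowActionRequestCount() {
        return readRequestCount + writeRequestCount;
    }

    public static void main(String[] args) {
        RequestMetrics m = new RequestMetrics();
        m.onRpc(3, 2); // a multi request with five row actions
        m.onRpc(1, 0); // a single get
        System.out.println("totalRequestCount=" + m.totalRequestCount);
        System.out.println("totalRowActionRequestCount=" + m.totalRowActionRequestCount());
    }
}
```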
14697
14698
14699 ---
14700
14701 * [HBASE-18500](https://issues.apache.org/jira/browse/HBASE-18500) | *Major* | **Performance issue: Don't use BufferedMutator for HTable's put method**
14702
Remove the deprecated methods getWriteBufferSize/setWriteBufferSize from Table and remove writeBufferSize from TableBuilder. Remove the BufferedMutatorImpl from HTable.
14704
14705
14706 ---
14707
14708 * [HBASE-18387](https://issues.apache.org/jira/browse/HBASE-18387) | *Minor* | **[Thrift] Make principal configurable in DemoClient.java**
14709
14710 This change allows the demonstration Thrift client to customize the server principal used by the Thrift server for instances secured with Kerberos.
14711
14712
14713 ---
14714
14715 * [HBASE-17125](https://issues.apache.org/jira/browse/HBASE-17125) | *Critical* | **Inconsistent result when use filter to read data**
14716
Marked Scan's and Get's setMaxVersions() and setMaxVersions(int) as deprecated. They are easily confused with the column family's max versions setting, so use readAllVersions() and readVersions(int) instead.
14718
14719
14720 ---
14721
14722 * [HBASE-18492](https://issues.apache.org/jira/browse/HBASE-18492) | *Major* | **[AMv2] Embed code for selecting highest versioned region server for system table regions in AssignmentManager.processAssignQueue()**
14723
14724 Favors new servers over older versions when assigning system table regions (more to follow in this area; i.e. changes in the AM itself).
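
The selection idea can be sketched as follows. This is not the AssignmentManager code: `HighestVersionServers` is a hypothetical helper, and it assumes purely numeric dotted versions (a qualifier such as `-SNAPSHOT` would need extra handling).

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Illustrative model: keep only the servers running the highest version.
final class HighestVersionServers {

    /** Compares dotted numeric versions such as "1.2.6" and "2.0.0". */
    static int compareVersions(String a, String b) {
        String[] x = a.split("\\.");
        String[] y = b.split("\\.");
        int n = Math.max(x.length, y.length);
        for (int i = 0; i < n; i++) {
            int xi = i < x.length ? Integer.parseInt(x[i]) : 0;
            int yi = i < y.length ? Integer.parseInt(y[i]) : 0;
            if (xi != yi) {
                return Integer.compare(xi, yi);
            }
        }
        return 0;
    }

    /** Returns the names of the servers whose version is the highest seen. */
    static List<String> pick(Map<String, String> serverToVersion) {
        String best = null;
        List<String> result = new ArrayList<>();
        for (Map.Entry<String, String> e : serverToVersion.entrySet()) {
            int c = best == null ? 1 : compareVersions(e.getValue(), best);
            if (c > 0) {
                // Found a newer version: discard the candidates collected so far.
                best = e.getValue();
                result.clear();
            }
            if (c >= 0) {
                result.add(e.getKey());
            }
        }
        return result;
    }

    public static void main(String[] args) {
        Map<String, String> servers = new TreeMap<>();
        servers.put("rs1", "1.2.6");
        servers.put("rs2", "2.0.0");
        servers.put("rs3", "2.0.0");
        System.out.println(pick(servers));
    }
}
```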
14725
14726
14727 ---
14728
14729 * [HBASE-18517](https://issues.apache.org/jira/browse/HBASE-18517) | *Major* | **limit max log message width in log4j**
14730
14731 Sets a log length max of 1000 characters.
14732
14733
14734 ---
14735
14736 * [HBASE-18502](https://issues.apache.org/jira/browse/HBASE-18502) | *Critical* | **Change MasterObserver to use TableDescriptor and ColumnFamilyDescriptor**
14737
14738 The following methods now use TableDescriptor/ColumnFamilyDescriptor:
14739 + preCreateTable(ObserverContext\<MasterCoprocessorEnvironment\>, TableDescriptor, HRegionInfo[])
14740 + postCreateTable(ObserverContext\<MasterCoprocessorEnvironment\>, TableDescriptor, HRegionInfo[])
14741 + preCreateTableAction(ObserverContext\<MasterCoprocessorEnvironment\>, TableDescriptor, HRegionInfo[])
14742 + postCompletedCreateTableAction(ObserverContext\<MasterCoprocessorEnvironment\>, TableDescriptor, HRegionInfo[])
14743 + preModifyTable(ObserverContext\<MasterCoprocessorEnvironment\>, TableName, TableDescriptor)
14744 + postModifyTable(ObserverContext\<MasterCoprocessorEnvironment\>, TableName, TableDescriptor)
14745 + preModifyTableAction(ObserverContext\<MasterCoprocessorEnvironment\>, TableName, TableDescriptor)
14746 + postCompletedModifyTableAction(ObserverContext\<MasterCoprocessorEnvironment\>, TableName, TableDescriptor)
14747 + preAddColumnFamily(ObserverContext\<MasterCoprocessorEnvironment\>, TableName, ColumnFamilyDescriptor)
14748 + postAddColumnFamily(ObserverContext\<MasterCoprocessorEnvironment\>, TableName, ColumnFamilyDescriptor)
14749 + preAddColumnFamilyAction(ObserverContext\<MasterCoprocessorEnvironment\>, TableName, ColumnFamilyDescriptor)
14750 + postCompletedAddColumnFamilyAction(ObserverContext\<MasterCoprocessorEnvironment\>, TableName, ColumnFamilyDescriptor)
14751 + preModifyColumnFamily(ObserverContext\<MasterCoprocessorEnvironment\>, TableName, ColumnFamilyDescriptor)
14752 + preModifyColumnFamilyAction(ObserverContext\<MasterCoprocessorEnvironment\>, TableName, ColumnFamilyDescriptor)
14753 + postCompletedModifyColumnFamilyAction(ObserverContext\<MasterCoprocessorEnvironment\>, TableName, ColumnFamilyDescriptor)
14754 + preCloneSnapshot(ObserverContext\<MasterCoprocessorEnvironment\>, SnapshotDescription, TableDescriptor)
14755 + postCloneSnapshot(ObserverContext\<MasterCoprocessorEnvironment\>, SnapshotDescription, TableDescriptor)
14756 + preRestoreSnapshot(ObserverContext\<MasterCoprocessorEnvironment\>, SnapshotDescription, TableDescriptor)
14757 + postRestoreSnapshot(ObserverContext\<MasterCoprocessorEnvironment\>, SnapshotDescription, TableDescriptor)
14758 + preGetTableDescriptors(ObserverContext\<MasterCoprocessorEnvironment\>, List\<TableName\>, List\<TableDescriptor\>, String)
14759 + postGetTableDescriptors(ObserverContext\<MasterCoprocessorEnvironment\>, List\<TableName\>, List\<TableDescriptor\>, String)
14760 + preGetTableNames(ObserverContext\<MasterCoprocessorEnvironment\>, List\<TableDescriptor\>, String)
14761 + postGetTableNames(ObserverContext\<MasterCoprocessorEnvironment\>, List\<TableDescriptor\>, String)
14762
14763
14764 ---
14765
14766 * [HBASE-18520](https://issues.apache.org/jira/browse/HBASE-18520) | *Minor* | **Add jmx value to determine true Master Start time**
14767
14768 This JIRA adds a JMX value that tracks when the Master finished initializing.
14769 The value is named 'masterFinishedInitializationTime' and reports the time, in milliseconds, at which the Master became fully usable and ready to serve requests.
14770
14771
14772 ---
14773
14774 * [HBASE-17056](https://issues.apache.org/jira/browse/HBASE-17056) | *Critical* | **Remove checked in PB generated files**
14775
14776 Purge all checked in generated protobuf files (30MB). Generate protobuf files inline with the build. Remove checked-in and patched protobuf. Get it from new hbase-thirdparty instead.
14777
14778 Side-effect: Our protobuf went from 3.1.0 to 3.3.1.
14779
14780 Build does not take noticeably longer (still about 2.5 minutes to do a mvn clean install -DskipTests).
14781
14782 IDEs will probably require a mvn build first else they'll complain about missing (generated) files.
14783
14784
14785 ---
14786
14787 * [HBASE-18374](https://issues.apache.org/jira/browse/HBASE-18374) | *Major* | **RegionServer Metrics improvements**
14788
14789 This change adds latency metrics for checkAndPut, checkAndDelete, putBatch and deleteBatch. The previous regionserver "mutate" latency metrics are renamed to "put" metrics. Batch metrics capture the latency of an entire batch containing puts/deletes, whereas put/delete metrics capture latency per operation. Note that this change breaks existing monitoring based on the regionserver "mutate" latency metric.
14790
14791
14792 ---
14793
14794 * [HBASE-18023](https://issues.apache.org/jira/browse/HBASE-18023) | *Minor* | **Log multi-\* requests for more than threshold number of rows**
14795
14796 HBASE-18023 introduces a warning message in the RegionServer log when an RPC is received from a client that has more than 5000 "actions" (where an "action" is a collection of mutations for a specific row) in a single RPC. Misbehaving clients who send large RPCs to RegionServers can be malicious, causing temporary pauses via garbage collection or denial of service via crashes. The threshold of 5000 actions per RPC is defined by the property "hbase.rpc.rows.warning.threshold" in hbase-site.xml.
14797
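One way a well-behaved client can stay at or below the warning threshold is to split a large list of actions into smaller batches before sending them. The sketch below assumes the default threshold of 5000 described above; `BatchSplitter` and `split` are hypothetical helper names and are not part of the HBase client API.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical client-side helper; not part of HBase.
final class BatchSplitter {
    // Mirrors the default of hbase.rpc.rows.warning.threshold stated in the note above.
    static final int DEFAULT_ROWS_WARNING_THRESHOLD = 5000;

    // Splits actions into consecutive chunks of at most maxPerBatch elements.
    static <T> List<List<T>> split(List<T> actions, int maxPerBatch) {
        if (maxPerBatch <= 0) {
            throw new IllegalArgumentException("maxPerBatch must be positive");
        }
        List<List<T>> batches = new ArrayList<>();
        for (int from = 0; from < actions.size(); from += maxPerBatch) {
            int to = Math.min(from + maxPerBatch, actions.size());
            batches.add(new ArrayList<>(actions.subList(from, to)));
        }
        return batches;
    }

    public static void main(String[] args) {
        List<Integer> actions = new ArrayList<>();
        for (int i = 0; i < 12000; i++) {
            actions.add(i);
        }
        for (List<Integer> batch : split(actions, DEFAULT_ROWS_WARNING_THRESHOLD)) {
            System.out.println("batch size: " + batch.size());
        }
    }
}
```

Each resulting batch would then be sent as its own RPC. If the threshold has been changed in hbase-site.xml, the same limit should be passed to the helper.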
14798
14799 ---
14800
14801 * [HBASE-15968](https://issues.apache.org/jira/browse/HBASE-15968) | *Major* | **New behavior of versions considering mvcc and ts rather than ts only**
14802
14803 This issue resolves two long-standing problems in HBase:
14804 Puts may be masked by a delete issued before them.
14805 Major compactions change query results.
14806
14807 This issue offers a new behavior that fixes both problems at the cost of a small performance reduction. Set NEW\_VERSION\_BEHAVIOR to true at the column family level to enable it. See HBASE-15968 for details.
14808 Note that if you enable this feature, the order of Mutations matters. Replication does not preserve the order of entries by default, so you must enable serial replication if you have slave clusters. See HBASE-9465 for details.
14809
14810
14811 ---
14812
14813 * [HBASE-18107](https://issues.apache.org/jira/browse/HBASE-18107) | *Major* | **[AMv2] Remove DispatchMergingRegionsRequest & DispatchMergingRegions**
14814
14815 Removes merge region code added into branch-2 but that was not needed after all. Branch-2 replaced dispatchMergingRegions with MergeTableRegionsProcedure.
14816
14817 Removed:
14818
14819 1. dispatchMergingRegions from Connection (was superseded long ago in branch-1).
14820 2. mergeRegions from RsRpcServices (was not used).
14821
14822
14823 ---
14824
14825 * [HBASE-15816](https://issues.apache.org/jira/browse/HBASE-15816) | *Major* | **Provide client with ability to set priority on Operations**
14826
14827 Added setPriority(int priority) API to Put, Delete, Increment, Append, Get and Scan pojos.  So for all these ops, the user can provide a custom priority level.
14828
14829
14830 ---
14831
14832 * [HBASE-18430](https://issues.apache.org/jira/browse/HBASE-18430) | *Major* | **Typo in "contributing to documentation" page**
14833
14834 Pushed to master. Thanks, Coral! Congratulations on your first Apache HBase commit!
14835
14836
14837 ---
14838
14839 * [HBASE-17908](https://issues.apache.org/jira/browse/HBASE-17908) | *Critical* | **Upgrade guava**
14840
14841 Use relocated guava 22.0 gotten from the new hbase-thirdparty ancillary project.
14842
14843 Incompatible change. ReplicationEndpoint and subclasses extend guava Service which changed pretty radically between 12.0 and 22.0. Change is kosher because implementations are marked audience private. Still, this will likely cause grief for the likes of the downstream lily indexer.
14844
14845
14846 ---
14847
14848 * [HBASE-16993](https://issues.apache.org/jira/browse/HBASE-16993) | *Major* | **BucketCache throw java.io.IOException: Invalid HFile block magic when configuring hbase.bucketcache.bucket.sizes**
14849
14850 Every value in the hbase.bucketcache.bucket.sizes configuration must be a multiple of 256. Otherwise, instantiation of the L2 BucketCache itself fails with an IllegalArgumentException.
14851
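The rule can be checked ahead of deployment with a few lines of code. The following is a minimal sketch, assuming the property holds a comma-separated list of integer sizes; `BucketSizes.parse` is a hypothetical helper and does not reproduce the actual BucketCache implementation.

```java
// Hypothetical pre-flight check; not the actual BucketCache code.
final class BucketSizes {
    // Parses a comma-separated list and rejects any size that is not a multiple of 256.
    static int[] parse(String configured) {
        String[] parts = configured.split(",");
        int[] sizes = new int[parts.length];
        for (int i = 0; i < parts.length; i++) {
            int size = Integer.parseInt(parts[i].trim());
            if (size % 256 != 0) {
                throw new IllegalArgumentException(
                    "Bucket size " + size + " is not a multiple of 256");
            }
            sizes[i] = size;
        }
        return sizes;
    }

    public static void main(String[] args) {
        // 5120, 9216 and 17408 are all multiples of 256.
        int[] ok = parse("5120, 9216, 17408");
        System.out.println("accepted " + ok.length + " sizes");
        try {
            parse("5120, 5000");
        } catch (IllegalArgumentException e) {
            System.out.println("rejected: " + e.getMessage());
        }
    }
}
```

Running such a check against the value intended for hbase-site.xml surfaces a bad size before the RegionServer fails to start its L2 cache.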
14852
14853 ---
14854
14855 * [HBASE-16090](https://issues.apache.org/jira/browse/HBASE-16090) | *Major* | **ResultScanner is not closed in SyncTable#finishRemainingHashRanges()**
14856
14857 Pushed to 1.3 and 1.2. SyncTable was introduced in 1.2, so 1.1 was skipped.
14858
14859
14860 ---
14861
14862 * [HBASE-18332](https://issues.apache.org/jira/browse/HBASE-18332) | *Minor* | **Upgrade asciidoctor-maven-plugin**
14863
14864 Committed to master and branch-2. Thanks!
14865
14866
14867 ---
14868
14869 * [HBASE-18161](https://issues.apache.org/jira/browse/HBASE-18161) | *Minor* | **Incremental Load support for Multiple-Table HFileOutputFormat**
14870
14871 To use this feature, a user must:
14872 1. Register their tables when configuring their job.
14873 2. Create a composite key of the table name and original rowkey to send as the mapper output key.
14874
14875 To register their tables (and configure their job for incremental load into multiple tables), a user must call the static MultiHFileOutputFormat.configureIncrementalLoad function with the HBase tables that data will be ingested into.
14876
14877 To create the composite key, call the helper function MultiHFileOutputFormat2.createCompositeKey with the destination table name and rowkey as arguments, and emit the result as the mapper output key.
14878
14879 Before this JIRA, the storage policy for HFileOutputFormat2 was configured per column family and set manually by the user. This is unchanged when using HFileOutputFormat2. However, when using MultiHFileOutputFormat2, the user must now manually set the prefix by creating a composite of the table name and the column family. The composite value can be created by calling MultiHFileOutputFormat2.createCompositeKey with the table name and column family as arguments.
14880
14881 Changes added through this JIRA are backwards compatible with existing HFileOutputFormat2 APIs and functionality.
14882
14883 The configuration parameter "hbase.mapreduce.hfileoutputformat.table.name" is now a REQUIRED parameter, though it is normally set automatically when the configureIncrementalLoad method is called within HFileOutputFormat2.
14884
14885
14886 ---
14887
14888 * [HBASE-18229](https://issues.apache.org/jira/browse/HBASE-18229) | *Critical* | **create new Async Split API to embrace AM v2**
14889
14890 A new splitRegionAsync() API is added to the client. The existing splitRegion() and split() APIs call the new API, so clients do not have to change their code.
14891
14892 The HBaseAdmin.splitXXX() logic has moved to the Master; the client splitXXX() APIs now go to the Master directly instead of going to a RegionServer first.
14893
14894 A splitSync() API was also added.
14895
14896
14897 ---
14898
14899 * [HBASE-18339](https://issues.apache.org/jira/browse/HBASE-18339) | *Major* | **Update test-patch to use hadoop 3.0.0-alpha4**
14900
14901 HBase now defaults to Apache Hadoop 3.0.0-alpha4 when the Hadoop 3 profile is active.
14902
14903
14904 ---
14905
14906 * [HBASE-18267](https://issues.apache.org/jira/browse/HBASE-18267) | *Major* | **The result from the postAppend is ignored**
14907
14908 **WARNING: No release note provided for this change.**
14909
14910
14911 ---
14912
14913 * [HBASE-18307](https://issues.apache.org/jira/browse/HBASE-18307) | *Major* | **Share the same EventLoopGroup for NettyRpcServer, NettyRpcClient and AsyncFSWALProvider at RS side**
14914
14915 Two configuration names have changed, because the event loop configs no longer affect only the RPC server but are shared by different components in the same RS instance.
14916
14917 'hbase.rpc.server.nativetransport' -\> 'hbase.netty.nativetransport'
14918
14919 'hbase.netty.rpc.server.worker.count' -\> 'hbase.netty.worker.count'
14920
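Deployments that still carry the old names can translate them mechanically. The sketch below operates on a plain map of property names to values; `NettyConfigRenames` is a hypothetical helper, and the policy that an explicitly set new name takes precedence over a migrated old one is an assumption of this example, not documented HBase behavior.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical migration helper; not part of HBase.
final class NettyConfigRenames {
    // Old name -> new name, as listed in the release note above.
    static final Map<String, String> RENAMES = new LinkedHashMap<>();
    static {
        RENAMES.put("hbase.rpc.server.nativetransport", "hbase.netty.nativetransport");
        RENAMES.put("hbase.netty.rpc.server.worker.count", "hbase.netty.worker.count");
    }

    // Returns a copy of site in which old names are replaced by their new names.
    // Assumption: a new name that is already set explicitly wins over the old value.
    static Map<String, String> migrate(Map<String, String> site) {
        Map<String, String> out = new LinkedHashMap<>();
        for (Map.Entry<String, String> e : site.entrySet()) {
            if (!RENAMES.containsKey(e.getKey())) {
                out.put(e.getKey(), e.getValue());
            }
        }
        for (Map.Entry<String, String> e : site.entrySet()) {
            String newName = RENAMES.get(e.getKey());
            if (newName != null) {
                out.putIfAbsent(newName, e.getValue());
            }
        }
        return out;
    }

    public static void main(String[] args) {
        Map<String, String> site = new LinkedHashMap<>();
        site.put("hbase.rpc.server.nativetransport", "true");
        site.put("hbase.netty.rpc.server.worker.count", "8");
        System.out.println(migrate(site));
    }
}
```

The same mapping can be applied by hand when editing hbase-site.xml; the helper only makes the rename table explicit.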
14921
14922 ---
14923
14924 * [HBASE-18241](https://issues.apache.org/jira/browse/HBASE-18241) | *Critical* | **Change client.Table, client.Admin, Region, Store, and HBaseTestingUtility to not use HTableDescriptor or HColumnDescriptor**
14925
14926 - : removed API
14927 + : new API
14928 \* : deprecated API
14929 ---------------------------
14930 Region class
14931 - HTableDescriptor getTableDesc()
14932 + TableDescriptor getTableDescriptor()
14933
14934 Store class
14935 - HColumnDescriptor getFamily()
14936 + ColumnFamilyDescriptor getColumnFamilyDescriptor()
14937
14938 Table class
14939 \* HTableDescriptor getTableDescriptor()
14940 + TableDescriptor getDescriptor()
14941
14942 Admin class
14943 \* HTableDescriptor getTableDescriptor(TableName)
14944 + List\<TableDescriptor\> listTableDescriptor(TableName)
14945 \* HTableDescriptor[] getTableDescriptors(List\<String\>)
14946 \* HTableDescriptor[] getTableDescriptorsByTableName(List\<TableName\>)
14947 + List\<TableDescriptor\> listTableDescriptors(List\<TableName\>)
14948 \* HTableDescriptor[] listTables()
14949 + List\<TableDescriptor\> listTableDescriptors()
14950 \* HTableDescriptor[] listTables(Pattern)
14951 + List\<TableDescriptor\> listTableDescriptors(Pattern)
14952 \* HTableDescriptor[] listTables(String)
14953 + List\<TableDescriptor\> listTableDescriptors(String)
14954 \* HTableDescriptor[] listTables(Pattern, boolean)
14955 + List\<TableDescriptor\> listTableDescriptors(Pattern, boolean)
14956 \* HTableDescriptor[] listTables(String, boolean)
14957 + List\<TableDescriptor\> listTableDescriptors(String, boolean)
14958 \* HTableDescriptor[] deleteTables(String)
14959 \* HTableDescriptor[] deleteTables(Pattern)
14960 \* HTableDescriptor[] enableTables(String)
14961 \* HTableDescriptor[] enableTables(Pattern)
14962 \* HTableDescriptor[] disableTables(String)
14963 \* HTableDescriptor[] disableTables(Pattern)
14964 \* void modifyTable(TableName, HTableDescriptor)
14965 + void modifyTable(TableDescriptor)
14966 \* void modifyTableAsync(TableName, HTableDescriptor)
14967 + void modifyTableAsync(TableDescriptor)
14968 \* HTableDescriptor[] listTableDescriptorsByNamespace(String)
14969 + List\<TableDescriptor\> listTableDescriptorsByNamespace(byte[])
14970 \* void createTable(HTableDescriptor)
14971 + void createTable(TableDescriptor)
14972 \* void createTable(HTableDescriptor, byte[], byte[], int)
14973 + void createTable(TableDescriptor, byte[], byte[], int)
14974 \* void createTable(HTableDescriptor, byte[][])
14975 + void createTable(TableDescriptor, byte[][])
14976 \* Future\<Void\> createTableAsync(HTableDescriptor, byte[][])
14977 + Future\<Void\> createTableAsync(TableDescriptor, byte[][])
14978
14979 HBaseTestingUtility class
14980 \* Table createTable(HTableDescriptor, byte[][], Configuration)
14981 + Table createTable(TableDescriptor, byte[][], Configuration)
14982 \* Table createTable(HTableDescriptor, byte[][], byte[][], Configuration)
14983 + Table createTable(TableDescriptor, byte[][], byte[][], Configuration)
14984 \* public Table createTable(HTableDescriptor, byte[][])
14985 + public Table createTable(TableDescriptor, byte[][])
14986 \* void modifyTableSync(Admin, HTableDescriptor)
14987 + void modifyTableSync(Admin, TableDescriptor)
14988 \* HRegion createLocalHRegion(HTableDescriptor, byte [], byte [])
14989 + HRegion createLocalHRegion(TableDescriptor, byte [], byte [])
14990 \* HRegion createLocalHRegion(HRegionInfo, HTableDescriptor)
14991 + HRegion createLocalHRegion(HRegionInfo, TableDescriptor)
14992 \* HRegion createLocalHRegion(HRegionInfo, HTableDescriptor, WAL)
14993 + HRegion createLocalHRegion(HRegionInfo, TableDescriptor, WAL)
14994 \* List createMultiRegionsInMeta(final Configuration, HTableDescriptor, byte [][])
14995 + List createMultiRegionsInMeta(final Configuration, TableDescriptor, byte [][])
14996 \* HRegion createRegionAndWAL(HRegionInfo, Path, Configuration, HTableDescriptor)
14997 + HRegion createRegionAndWAL(HRegionInfo, Path, Configuration, TableDescriptor)
14998 \* HRegion createRegionAndWAL(HRegionInfo, Path, Configuration, HTableDescriptor, boolean)
14999 + HRegion createRegionAndWAL(HRegionInfo, Path, Configuration, TableDescriptor, boolean)
15000 \* int createPreSplitLoadTestTable(Configuration,HTableDescriptor, HColumnDescriptor)
15001 + int createPreSplitLoadTestTable(Configuration,TableDescriptor, ColumnFamilyDescriptor)
15002 \* int createPreSplitLoadTestTable(Configuration, HTableDescriptor, HColumnDescriptor, int)
15003 + int createPreSplitLoadTestTable(Configuration, TableDescriptor, ColumnFamilyDescriptor, int)
15004 \* int createPreSplitLoadTestTable(Configuration, HTableDescriptor, HColumnDescriptor[], int)
15005 + int createPreSplitLoadTestTable(Configuration, TableDescriptor, ColumnFamilyDescriptor[], int)
15006 \* int createPreSplitLoadTestTable(Configuration,HTableDescriptor, HColumnDescriptor[],SplitAlgorithm, int)
15007 + int createPreSplitLoadTestTable(Configuration,TableDescriptor, ColumnFamilyDescriptor[],SplitAlgorithm, int)
15008 \* HRegion createTestRegion(String, HColumnDescriptor)
15009 + HRegion createTestRegion(String, ColumnFamilyDescriptor)
15010
15011
15012 ---
15013
15014 * [HBASE-18083](https://issues.apache.org/jira/browse/HBASE-18083) | *Major* | **Make large/small file clean thread number configurable in HFileCleaner**
15015
15016 After HBASE-18083, HFileCleaner can be configured to use multiple threads for cleaning large/small (archived) hfiles via hbase.regionserver.hfilecleaner.large.thread.count and hbase.regionserver.hfilecleaner.small.thread.count; both default to 1. These properties support online configuration change.
15017
15018
15019 ---
15020
15021 * [HBASE-17931](https://issues.apache.org/jira/browse/HBASE-17931) | *Blocker* | **Assign system tables to servers with highest version**
15022
15023 We usually keep compatibility between an old client and a new server so that we can do a rolling upgrade: HBase cluster first, then HBase client. But we do not guarantee that a new client can access an old server.
15024 In an HBase cluster, region servers access the system tables, so the servers are themselves HBase clients. If the system tables are hosted on region servers with a lower version, we may run into trouble because region servers with a higher version may not be able to access them.
15025 After this patch, we move all system regions to the region servers with the highest version. So when doing a rolling upgrade across two major or minor versions, ALWAYS UPGRADE THE MASTER FIRST and then upgrade the region servers. The new master will handle system tables correctly.
15026
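The selection rule described above, picking the servers that run the highest version, can be sketched as follows. This is an illustration only: `HighestVersionServers` is a hypothetical class, server names are plain strings, and versions are assumed to be purely numeric dotted strings such as "1.2.6" (suffixes like "-alpha1" are not handled).

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical sketch; not the AssignmentManager implementation.
final class HighestVersionServers {
    // Compares dotted numeric versions component by component; missing components count as 0.
    static int compareVersions(String a, String b) {
        String[] x = a.split("\\.");
        String[] y = b.split("\\.");
        int n = Math.max(x.length, y.length);
        for (int i = 0; i < n; i++) {
            int xi = i < x.length ? Integer.parseInt(x[i]) : 0;
            int yi = i < y.length ? Integer.parseInt(y[i]) : 0;
            if (xi != yi) {
                return Integer.compare(xi, yi);
            }
        }
        return 0;
    }

    // Returns the names of all servers running the highest version, sorted by name.
    static List<String> pick(Map<String, String> serverVersions) {
        String highest = null;
        for (String v : serverVersions.values()) {
            if (highest == null || compareVersions(v, highest) > 0) {
                highest = v;
            }
        }
        List<String> result = new ArrayList<>();
        for (Map.Entry<String, String> e : serverVersions.entrySet()) {
            if (compareVersions(e.getValue(), highest) == 0) {
                result.add(e.getKey());
            }
        }
        Collections.sort(result);
        return result;
    }

    public static void main(String[] args) {
        Map<String, String> servers = new TreeMap<>();
        servers.put("rs1", "1.2.6");
        servers.put("rs2", "2.0.0");
        servers.put("rs3", "2.0.0");
        System.out.println("system regions go to: " + pick(servers));
    }
}
```

During a rolling upgrade the set of candidates shrinks to the already upgraded servers, which is why the master, which makes this decision, should be upgraded first.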
15027
15028 ---
15029
15030 * [HBASE-6581](https://issues.apache.org/jira/browse/HBASE-6581) | *Major* | **Build with hadoop.profile=3.0**
15031
15032 Make us build against hadoop trunk (3.0)
15033
15034
15035 ---
15036
15037 * [HBASE-16120](https://issues.apache.org/jira/browse/HBASE-16120) | *Minor* | **Add shell test for truncate\_preserve**
15038
15039 Add unit tests for truncate\_preserve
15040
15041
15042 ---
15043
15044 * [HBASE-18240](https://issues.apache.org/jira/browse/HBASE-18240) | *Major* | **Add hbase-thirdparty, a project with hbase utility including an hbase-shaded-thirdparty module with guava, netty, etc.**
15045
15046 Adds a new project, hbase-thirdparty, at https://git-wip-us.apache.org/repos/asf/hbase-thirdparty used by core hbase. GroupID org.apache.hbase.thirdparty. Version 1.0.0.
15047
15048 This project packages relocated third-party libraries used by Apache HBase such as protobuf, guava, and netty among others. HBase core depends on it.
15049
15050 It has three submodules: one to patch and then relocate (shade) protobuf, one to do messy .so renaming (netty), and a remainder module that relocates a bundle of other (unpatched) libs used by hbase. This latter set includes protobuf-util, netty-all, gson, and guava.
15051
15052 All shading is done using the same relocation offset of org.apache.hadoop.hbase.shaded; we add this prefix to the relocated thirdparty library class names.
15053
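The naming rule above amounts to prefixing the original class name with the relocation offset. A minimal sketch follows; `ShadedNames` is a hypothetical helper, and the Guava class name is used only as an example input.

```java
// Hypothetical helper illustrating the relocation naming rule; not part of HBase.
final class ShadedNames {
    // Relocation offset quoted in the release note above.
    static final String RELOCATION_PREFIX = "org.apache.hadoop.hbase.shaded";

    // Returns the relocated fully qualified name for an original third-party class name.
    static String relocate(String originalClassName) {
        return RELOCATION_PREFIX + "." + originalClassName;
    }

    public static void main(String[] args) {
        System.out.println(relocate("com.google.common.collect.Lists"));
    }
}
```

Code inside HBase imports the relocated name, so an application remains free to put its own version of the same library on the classpath.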
15054 See the pom.xml in hbase-thirdparty for the explicit version of each third-party lib included (of note, we update our internal protobuf from 3.1.0 to 3.3.1).
15055
15056
15057 ---
15058
15059 * [HBASE-15943](https://issues.apache.org/jira/browse/HBASE-15943) | *Major* | **Add page displaying JVM process metrics**
15060
15061 Adds a new "Process Metrics" tab along the top that leads to a new page dumping mbean (mostly JVM) metrics.
15062
15063
15064 ---
15065
15066 * [HBASE-14902](https://issues.apache.org/jira/browse/HBASE-14902) | *Major* | **Revert some of the stringency recently introduced by checkstyle tightening**
15067
15068 Changes checkstyle so that a javadoc continuation line is indented two spaces instead of the default four. Also, one-line statements such as if (true) x =1; now pass checkstyle.
15069
15070
15071 ---
15072
15073 * [HBASE-17110](https://issues.apache.org/jira/browse/HBASE-17110) | *Major* | **Improve SimpleLoadBalancer to always take server-level balance into account**
15074
15075 After HBASE-17110, the bytable strategy for SimpleLoadBalancer also takes server-level balance into account.
15076
15077
15078 ---
15079
15080 * [HBASE-17928](https://issues.apache.org/jira/browse/HBASE-17928) | *Major* | **Shell tool to clear compaction queues**
15081
15082 Adds clear\_compaction\_queues to the hbase shell.
15083 {code}
15084   Clear compaction queues on a regionserver.
15085   The queue\_name contains short and long.
15086   short is shortCompactions's queue,long is longCompactions's queue.
15087
15088   Examples:
15089   hbase\> clear\_compaction\_queues 'host187.example.com,60020'
15090   hbase\> clear\_compaction\_queues 'host187.example.com,60020','long'
15091   hbase\> clear\_compaction\_queues 'host187.example.com,60020', ['long','short']
15092 {code}
15093
15094
15095 ---
15096
15097 * [HBASE-18164](https://issues.apache.org/jira/browse/HBASE-18164) | *Critical* | **Much faster locality cost function and candidate generator**
15098
15099 New locality cost function and candidate generator that use caching and incremental computation to allow the stochastic load balancer to consider ~20x more cluster configurations for big clusters.
15100
15101
15102 ---
15103
15104 * [HBASE-18226](https://issues.apache.org/jira/browse/HBASE-18226) | *Major* | **Disable reverse DNS lookup at HMaster and use the hostname provided by RegionServer**
15105
15106 The following config is added by this JIRA:
15107
15108 hbase.regionserver.hostname.disable.master.reversedns
15109
15110 This config is for experts: don't set its value unless you really know what you are doing.
15111 When set to true, regionserver will use the current node hostname for the servername and HMaster will skip reverse DNS lookup and use the hostname sent by regionserver instead. Note that this config and hbase.regionserver.hostname are mutually exclusive. See https://issues.apache.org/jira/browse/HBASE-18226 for more details.
15112
15113 Caution: please make sure rolling upgrade succeeds before turning on this feature.
15114
15115
15116 ---
15117
15118 * [HBASE-16242](https://issues.apache.org/jira/browse/HBASE-16242) | *Major* | **Upgrade Avro to 1.7.7**
15119
15120 Apache HBase now specifies that version 1.7.7 of the Apache Avro library should be pulled in by maven and included in the convenience binary tarball.
15121
15122
15123 ---
15124
15125 * [HBASE-18213](https://issues.apache.org/jira/browse/HBASE-18213) | *Major* | **Add documentation about the new async client**
15126
15127 Add documentation for async client in section '66. Client' in ref guide.
15128
15129
15130 ---
15131
15132 * [HBASE-17008](https://issues.apache.org/jira/browse/HBASE-17008) | *Critical* | **Examples to make AsyncClient go down easy**
15133
15134 Adds two examples for the async client. AsyncClientExample is a simple example showing how to use AsyncTable. HttpProxyExample is an example for advanced users showing how to use RawAsyncTable to write a fully asynchronous HTTP proxy server. There is no extra thread pool; all operations are executed inside netty's event loop.
15135
15136
15137 ---
15138
15139 * [HBASE-18200](https://issues.apache.org/jira/browse/HBASE-18200) | *Major* | **Set hadoop check versions for branch-2 and branch-2.x in pre commit**
15140
15141 Allow setting different hadoop check versions for branch-2 and branch-2.x when running pre commit check.
15142
15143
15144 ---
15145
15146 * [HBASE-18187](https://issues.apache.org/jira/browse/HBASE-18187) | *Major* | **Release hbase-2.0.0-alpha1**
15147
15148 Pushed the release. For detail: http://apache-hbase.679495.n3.nabble.com/ANNOUNCE-Apache-HBase-2-0-0-alpha-1-is-now-available-for-download-td4088484.html
15149
15150
15151 ---
15152
15153 * [HBASE-18137](https://issues.apache.org/jira/browse/HBASE-18137) | *Critical* | **Replication gets stuck for empty WALs**
15154
15155 0-length WAL files can potentially cause the replication queue to get stuck.  A new config "replication.source.eof.autorecovery" has been added: if set to true (default is false), the 0-length WAL file will be skipped after 1) the max number of retries has been hit, and 2) there are more WAL files in the queue.  The risk of enabling this is that there is a chance the 0-length WAL file actually has some data (e.g. block went missing and will come back once a datanode is recovered).
15156
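The skip condition described above can be written as a single predicate. This is a sketch of the stated rule only: `EofAutoRecovery.shouldSkip` and its parameter names are hypothetical and do not mirror the actual replication source code.

```java
// Hypothetical sketch of the decision rule; not the actual replication source code.
final class EofAutoRecovery {
    // Returns true when a WAL file may be skipped under the rule in the release note:
    // autorecovery is enabled, the file has length 0, the retry limit has been reached,
    // and other WAL files are still waiting in the queue.
    static boolean shouldSkip(boolean autoRecoveryEnabled, long walLength,
                              int retries, int maxRetries, int otherWalsInQueue) {
        return autoRecoveryEnabled
            && walLength == 0
            && retries >= maxRetries
            && otherWalsInQueue > 0;
    }

    public static void main(String[] args) {
        // Enabled, empty file, retries exhausted, two more WALs queued.
        System.out.println(shouldSkip(true, 0L, 10, 10, 2));
        // Same situation with the default setting (autorecovery disabled).
        System.out.println(shouldSkip(false, 0L, 10, 10, 2));
    }
}
```

Because the last file in the queue is never skipped under this rule, a stuck queue with a single 0-length WAL still needs operator attention.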
15157
15158 ---
15159
15160 * [HBASE-18192](https://issues.apache.org/jira/browse/HBASE-18192) | *Blocker* | **Replication drops recovered queues on region server shutdown**
15161
15162 If a region server that is processing a recovered queue for another, previously dead region server is gracefully shut down, it can drop the recovered queue under certain conditions. Running without this fix on a 1.2+ release risks continuing data loss in replication, irrespective of which WALProvider is used.
15163 If a single WAL group (or DefaultWALProvider) is used, running without this fix will always cause data loss in replication whenever a region server processing recovered queues is gracefully shut down.
15164
15165
15166 ---
15167
15168 * [HBASE-18109](https://issues.apache.org/jira/browse/HBASE-18109) | *Critical* | **Assign system tables first (priority)**
15169
15170 Adds a sort of procedures before submission so system tables are queued first (which will help ensure they go out first). This should be good enough along w/ existing scheduling mechanisms to ensure system/meta are assigned first (See reasoning below). Open new issue if insufficient.
15171
15172
15173 ---
15174
15175 * [HBASE-18008](https://issues.apache.org/jira/browse/HBASE-18008) | *Major* | **Any HColumnDescriptor we give out should be immutable**
15176
15177 1) The HColumnDescriptor obtained from Admin, AsyncAdmin, and Table is immutable.
15178 2) HColumnDescriptor has been marked as "Deprecated"; users should substitute
15179      ColumnFamilyDescriptor for HColumnDescriptor.
15180 3) ColumnFamilyDescriptor is constructed through ColumnFamilyDescriptorBuilder and contains all of the read-only methods from HColumnDescriptor.
15181 4) The values to which IS\_MOB/MOB\_THRESHOLD are mapped are stored as String rather than Boolean/Long. MOB is a new feature in 2.0, so this change should be acceptable.
15182
15183
15184 ---
15185
15186 * [HBASE-18149](https://issues.apache.org/jira/browse/HBASE-18149) | *Major* | **The setting rules for table-scope attributes and family-scope attributes should keep consistent**
15187
15188 If a table-scope attribute value is false, you do not need to enclose 'false' in single quotes. Both COMPACTION\_ENABLED =\> false and COMPACTION\_ENABLED =\> 'false' take effect.
15189
15190
15191 ---
15192
15193 * [HBASE-17849](https://issues.apache.org/jira/browse/HBASE-17849) | *Major* | **PE tool random read is not totally random**
15194
15195 When randomRead or randomSeekScan is used with the PE tool, both --size and --rows may now be given. --size specifies the total size of the data (the range) on which the reads should be performed, and --rows specifies the number of rows to be read by each client within that range.
15196
15197
15198 ---
15199
15200 * [HBASE-15576](https://issues.apache.org/jira/browse/HBASE-15576) | *Major* | **Scanning cursor to prevent blocking long time on ResultScanner.next()**
15201
15202 If you do not want scanning to block for too long because of heartbeats and partial results, you can use Scan#setNeedCursorResult(true) to get a special result within the scanning timeout that tells you which row the server is currently scanning. See its javadoc for more details.
15203
15204
15205 ---
15206
15207 * [HBASE-16549](https://issues.apache.org/jira/browse/HBASE-16549) | *Major* | **Procedure v2 - Add new AM metrics**
15208
15209 The following AMv2 procedures are modified to override the onSubmit() and onFinish() hooks provided by HBASE-17888 to do
15210 metrics calculations when procedures are submitted and finished:
15211 \* AssignProcedure
15212 \* UnassignProcedure
15213 \* MergeTableRegionProcedure
15214 \* SplitTableRegionProcedure
15215 \* ServerCrashProcedure
15216
15217 The following metrics are collected for each of the above procedures during the lifetime of a process:
15218 \* Total number of requests submitted for a type of procedure
15219 \* Histogram of runtime in milliseconds for successfully completed procedures
15220 \* Total number of failed procedures
15221
15222 As we are moving away from Hadoop's metrics2, the hbase-metrics-api module is used for newly added metrics.
15223
15224
15225 ---
15226
15227 * [HBASE-9393](https://issues.apache.org/jira/browse/HBASE-9393) | *Critical* | **Hbase does not closing a closed socket resulting in many CLOSE\_WAIT**
15228
15229 To handle this issue, the client needs Hadoop client version 2.6.4 or 2.7.0+, because the CanUnBuffer interface, which was added as part of HDFS-7694, is available only in those versions.
15230
15231
15232 ---
15233
15234 * [HBASE-18038](https://issues.apache.org/jira/browse/HBASE-18038) | *Critical* | **Rename StoreFile to HStoreFile and add a StoreFile interface for CP**
15235
15236 StoreFile is now changed to an interface. This is an incompatible change. The coprocessors which implement RegionObserver may need to modify their code.
15237
15238
15239 ---
15240
15241 * [HBASE-16196](https://issues.apache.org/jira/browse/HBASE-16196) | *Critical* | **Update jruby to a newer version.**
15242
15243 The bundled JRuby 1.6.8 has been updated to version 9.1.9.0. This represents a change from Ruby 1.8 to Ruby 2.3.3, which introduces incompatible language changes for user scripts.
15244
15245 This JRuby version update required an update to joni-2.1.11 and jcodings-1.0.18, used for regular expression matching, as well as several transitive dependency updates that should not be user-visible.
15246
15247
15248 ---
15249
15250 * [HBASE-14614](https://issues.apache.org/jira/browse/HBASE-14614) | *Major* | **Procedure v2: Core Assignment Manager**
15251
15252 Replaces the AssignmentManager with a new procedurev2-based AssignmentManager
15253
15254 **AMv2**
15255 Puts AssignmentManager on top of the ProcedureV2 state machine with its persistence engine. Each assignment atom is now a Procedure implementation; e.g. an AssignProcedure and an UnassignProcedure. Molecules of aggregated Procedures are used to do more involved assignment steps: e.g. the move region procedure is made of an Unassign followed by an Assign subprocedure.
15256
15257 AMv2 is 1500 lines. The old AM was near 4000. Functionality has been moved out to Procedures. The in-memory state of regions and servers has been cleaned up and is now stored in a new RegionStates implementation. RegionStateStore takes care of publishing final region state out to the hbase:meta table.
15258
15259 The new RemoteProcedureDispatcher/RSProcedureDispatcher runs the Procedure-based assignments ‘remotely’. It knows about ‘servers’. It aggregates assignments on a time/count basis so that procedures can be sent in batches rather than one per RPC. Procedure status comes back on the back of the RegionServer heartbeat reporting online regions. The response is passed to the AMv2 to ‘process’. It will check against the in-memory state. If there is a mismatch, it fences out the RegionServer on the assumption that something went wrong on the RS side. Timeouts trigger retries. The Procedure machine ensures only one operation at a time on any one region/table using locking and smarts about what is serial and what can be run concurrently.
15260
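The time/count aggregation performed by the dispatcher can be pictured with the sketch below. It is an interpretation, not the RemoteProcedureDispatcher code: `TimeCountBatcher` is a hypothetical class, and the example limits of 32 operations and 150 ms are taken from the defaults listed under New Configs in this note on the assumption that they play these two roles. Time is passed in explicitly so the behavior is deterministic.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch of time/count batching; not the RemoteProcedureDispatcher implementation.
final class TimeCountBatcher<T> {
    private final int maxQueueSize;
    private final long delayMillis;
    private final List<T> pending = new ArrayList<>();
    private long firstEnqueuedAt;

    TimeCountBatcher(int maxQueueSize, long delayMillis) {
        this.maxQueueSize = maxQueueSize;
        this.delayMillis = delayMillis;
    }

    // Adds an operation; returns a full batch once the count limit is reached, else an empty list.
    List<T> add(T operation, long nowMillis) {
        if (pending.isEmpty()) {
            firstEnqueuedAt = nowMillis;
        }
        pending.add(operation);
        return pending.size() >= maxQueueSize ? drain() : new ArrayList<>();
    }

    // Called periodically; returns the pending batch once its oldest entry has waited delayMillis.
    List<T> poll(long nowMillis) {
        boolean due = !pending.isEmpty() && nowMillis - firstEnqueuedAt >= delayMillis;
        return due ? drain() : new ArrayList<>();
    }

    // Hands out everything that is pending and resets the buffer.
    private List<T> drain() {
        List<T> batch = new ArrayList<>(pending);
        pending.clear();
        return batch;
    }

    public static void main(String[] args) {
        TimeCountBatcher<String> batcher = new TimeCountBatcher<>(32, 150);
        batcher.add("open region A", 0);
        batcher.add("open region B", 20);
        System.out.println("sent after delay: " + batcher.poll(150));
    }
}
```

A batch therefore leaves either when enough operations have accumulated for a server or when the oldest one has waited long enough, whichever comes first.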
15261 New accounting of RegionServer versions will be used when running rolling restarts.
15262
15263 ‘States’ (OPENING, CLOSING, etc.) are now kept in memory in the Master only and serialized out to the ProcedureV2 WAL. They are no longer persisted to ZooKeeper.
15264
**Assign Detail**

The Assign starts by pushing the "assign" operation to the AssignmentManager and then goes into a "waiting" state. The AM batches the "assign" requests and asks the Balancer where to put the region (the various policies are respected: retain, round-robin, random). Once the AM and the Balancer have found a place for the region, the procedure is resumed and an "open region" request is placed in the Remote Dispatcher queue, and the procedure once again goes into a "waiting" state. The Remote Dispatcher batches the various requests for that server and sends them to the RS for execution. The RS completes the open operation by calling master.reportRegionStateTransition(). The AM intercepts the transition report and notifies the procedure. The procedure finishes the assignment by publishing its new state to hbase:meta, or it retries the assignment.
15267
**Unassign Detail**

The Unassign starts by placing a "close region" request in the Remote Dispatcher queue, and the procedure then goes into a "waiting" state. The Remote Dispatcher batches the various requests for that server and sends them to the RS for execution. The RS completes the close operation by calling master.reportRegionStateTransition(). The AM intercepts the transition report and notifies the procedure. The procedure finishes the unassign by publishing its new state to hbase:meta, or it retries the unassign.
15270
**New Configs**

- "hbase.procedure.remote.dispatcher.threadpool.size", default 128
- "hbase.procedure.remote.dispatcher.delay.msec", default 150 ms
- "hbase.procedure.remote.dispatcher.max.queue.size", default 32
- "hbase.regionserver.rpc.startup.waittime", default 60 seconds

**TODO**

As of this writing:

Put up a model diagram.

- Handle region migration
- Handle meta assignment first
- Handle sys table assignment first (e.g. acl, namespace)
- Handle table priorities
- Do we report the same AM metrics as we used to? We do it all in here now.
15286
**INCOMPATIBLE**

A known incompatibility is that, because splits and merges are now run from the master, Coprocessors that used to watch for merges/splits from a RegionObserver no longer work; to watch splits/merges, you need an observer on the Master instead.
15289
15290
15291 ---
15292
15293 * [HBASE-3462](https://issues.apache.org/jira/browse/HBASE-3462) | *Major* | **Fix table.jsp in regards to splitting a region/table with an optional splitkey**
15294
15295 UI pages for splitting/merging now operate by taking a row key prefix from the user rather than a full region name.
15296
15297
15298 ---
15299
15300 * [HBASE-18129](https://issues.apache.org/jira/browse/HBASE-18129) | *Major* | **truncate\_preserve fails when the truncate method doesn't exists on the master**
15301
The truncate\_preserve command now works when the truncate method doesn't exist on the master.
15303
15304
15305 ---
15306
15307 * [HBASE-18122](https://issues.apache.org/jira/browse/HBASE-18122) | *Major* | **Scanner id should include ServerName of region server**
15308
Scanner ids no longer start from 1.
The first (highest) 32 bits are the MurmurHash32 of the ServerName string "host,port,ts". The ServerName contains the host, port, and start timestamp, which helps prevent collisions. The lowest 32 bits are generated by an atomic int.
15311
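The id layout can be sketched as below. This is a minimal illustration, not the RegionServer implementation: `ScannerIdSketch` and its methods are hypothetical names, and `String.hashCode()` stands in for MurmurHash32 only to keep the example self-contained.

```java
import java.util.concurrent.atomic.AtomicInteger;

/** Hypothetical generator: server hash in the upper 32 bits, counter in the lower 32 bits. */
final class ScannerIdSketch {
    private final int serverHash;
    private final AtomicInteger counter = new AtomicInteger(0);

    ScannerIdSketch(String serverName) {
        // Stand-in hash (assumption): the release note names MurmurHash32.
        this.serverHash = serverName.hashCode();
    }

    /** Packs the two 32-bit halves into one 64-bit id. */
    static long compose(int serverHash, int sequence) {
        // Mask the sequence so a negative int does not spill into the upper half.
        return ((long) serverHash << 32) | (sequence & 0xFFFFFFFFL);
    }

    long nextScannerId() {
        return compose(serverHash, counter.incrementAndGet());
    }
}
```

For example, `ScannerIdSketch.compose(1, 2)` yields `0x0000000100000002L`: the hash occupies the upper half and the sequence number the lower half.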
15312
15313 ---
15314
15315 * [HBASE-17997](https://issues.apache.org/jira/browse/HBASE-17997) | *Major* | **In dev environment, add jruby-complete jar to classpath only when jruby is needed**
15316
When JRUBY\_HOME is specified and the command is "hbase shell" or "hbase org.jruby.Main", CLASSPATH and HBASE\_OPTS are updated according to JRUBY\_HOME:

- The jar under JRUBY\_HOME is added to CLASSPATH.
- The following is added to HBASE\_OPTS:

-Djruby.home=$JRUBY\_HOME -Djruby.lib=$JRUBY\_HOME/lib

That is, whenever JRUBY\_HOME is specified, it takes precedence:

- In a dev environment, the jar recorded in cached\_classpath\_jruby.txt is ignored.
- In a non-dev environment, the jruby-complete jar packaged with HBase is ignored.
15327
15328
15329 ---
15330
15331 * [HBASE-15616](https://issues.apache.org/jira/browse/HBASE-15616) | *Major* | **Allow null qualifier for all table operations**
15332
15333 After this issue, all table operations will support null qualifier, such as put/get/scan/increment/append/checkAndMutate/checkAndPut/checkAndDelete.
15334
15335
15336 ---
15337
15338 * [HBASE-18035](https://issues.apache.org/jira/browse/HBASE-18035) | *Critical* | **Meta replica does not give any primaryOperationTimeout to primary meta region**
15339
When a client is configured to use meta replicas, it sends the scan request to all meta replicas at almost the same time. Since a meta replica may contain stale data, the client may get wrong region locations if the result from a replica comes back first. To fix this, "hbase.client.meta.replica.scan.timeout" is introduced: a client always sends the request to the primary meta region first and waits up to the configured timeout for a reply. If no result is received in that time, it sends the request to the replica meta regions. The unit of "hbase.client.meta.replica.scan.timeout" is microseconds; the default value is 1000000 (1 second).
15341
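The primary-first lookup can be sketched as follows. This is a simplified illustration, not the HBase client code: `PrimaryFirstSketch.locate` and its supplier arguments are hypothetical, and using whichever answer arrives first once the timeout has passed is an assumption of this sketch. The timeout is passed in microseconds to match the unit of the setting.

```java
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.TimeoutException;
import java.util.function.Supplier;

/** Hypothetical helper: ask the primary first, involve a replica only after a timeout. */
final class PrimaryFirstSketch {
    static <T> T locate(Supplier<T> primary, Supplier<T> replica, long timeoutMicros) {
        CompletableFuture<T> fromPrimary = CompletableFuture.supplyAsync(primary);
        try {
            // Give the primary the whole timeout before any replica is contacted.
            return fromPrimary.get(timeoutMicros, TimeUnit.MICROSECONDS);
        } catch (TimeoutException slowPrimary) {
            CompletableFuture<T> fromReplica = CompletableFuture.supplyAsync(replica);
            // Assumption: after the timeout, the first answer from either side is used.
            return fromPrimary.applyToEither(fromReplica, result -> result).join();
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException(e);
        } catch (ExecutionException e) {
            throw new IllegalStateException(e.getCause());
        }
    }
}
```

With a fast primary the replica supplier is never invoked, which is the point of the change: possibly stale replica data is only consulted when the primary is slow.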
15342
15343 ---
15344
15345 * [HBASE-11013](https://issues.apache.org/jira/browse/HBASE-11013) | *Major* | **Clone Snapshots on Secure Cluster Should provide option to apply Retained User Permissions**
15346
When a snapshot is created, the permissions of the original table are saved into the .snapshotinfo file (backward compatible), which is in the snapshot root directory. For the clone\_snapshot/restore\_snapshot commands, an additional option (RESTORE\_ACL) decides whether the permissions of the original table are granted to the newly created table.
15348
15349
15350 ---
15351
15352 * [HBASE-18018](https://issues.apache.org/jira/browse/HBASE-18018) | *Major* | **Support abort for all procedures by default**
15353
The default behavior of the abort() method of the StateMachineProcedure class is changed to support aborting all procedures, regardless of whether the procedure supports rollback.
15355
15356
15357 ---
15358
15359 * [HBASE-16851](https://issues.apache.org/jira/browse/HBASE-16851) | *Major* | **User-facing documentation for the In-Memory Compaction feature**
15360
15361 Two blog posts on Apache HBase blog: user manual and programmer manual.
15362 Ref. guide draft published: https://docs.google.com/document/d/1Xi1jh\_30NKnjE3wSR-XF5JQixtyT6H\_CdFTaVi78LKw/edit
15363
15364
15365 ---
15366
15367 * [HBASE-17343](https://issues.apache.org/jira/browse/HBASE-17343) | *Blocker* | **Make Compacting Memstore default in 2.0 with BASIC as the default type**
15368
15369  This JIRA changes the default MemStore to be CompactingMemStore instead of DefaultMemStore. In-memory compaction of CompactingMemStore demonstrated sizable improvement in HBase’s write amplification and read/write performance.
15370
15371 CompactingMemStore achieves these gains through smart use of RAM. The algorithm periodically re-organizes the in-memory data in efficient data structures and reduces redundancies. The  HBase server’s memory footprint therefore periodically expands and contracts. The outcome is longer lifetime of data in memory, less I/O, and overall faster performance. More details about the algorithm and its use appear in the Apache HBase Blog: https://blogs.apache.org/hbase/
15372
15373 How To Use:
15374 The in-memory compaction level can be configured both globally and per column family. The supported levels are none (DefaultMemStore), basic, and eager.
15375
15376 By default, all tables apply basic in-memory compaction. This global configuration can be overridden in hbase-site.xml, as follows:
15377
15378 \<property\>
15379  \<name\>hbase.hregion.compacting.memstore.type\</name\>
15380  \<value\>\<none\|basic\|eager\>\</value\>
15381  \</property\>
15382
15383 The level can also be configured in the HBase shell per column family, as follows:
15384
create '\<tablename\>',
{NAME =\> '\<cfname\>', IN\_MEMORY\_COMPACTION =\> '\<NONE\|BASIC\|EAGER\>'}
15387
15388
15389 ---
15390
15391 * [HBASE-17786](https://issues.apache.org/jira/browse/HBASE-17786) | *Major* | **Create LoadBalancer perf-tests (test balancer algorithm decoupled from workload)**
15392
15393 $ bin/hbase org.apache.hadoop.hbase.master.balancer.LoadBalancerPerformanceEvaluation -help
15394 usage: hbase org.apache.hadoop.hbase.master.balancer.LoadBalancerPerformanceEvaluation \<options\>
15395 Options:
15396  -regions \<arg\>         Number of regions to consider by load balancer. Default: 1000000
15397  -servers \<arg\>         Number of servers to consider by load balancer. Default: 1000
15398  -load\_balancer \<arg\>   Type of Load Balancer to use. Default:
15399                         org.apache.hadoop.hbase.master.balancer.StochasticLoadBalancer
15400
15401
15402 ---
15403
15404 * [HBASE-17887](https://issues.apache.org/jira/browse/HBASE-17887) | *Blocker* | **Row-level consistency is broken for read**
15405
We now pass the list of memstoreScanners to the StoreScanner along with the new files, to ensure that the StoreScanner sees the latest memstore after a flush.
15407
15408
15409 ---
15410
15411 * [HBASE-15296](https://issues.apache.org/jira/browse/HBASE-15296) | *Major* | **Break out writer and reader from StoreFile**
15412
15414 Refactor that breaks out StoreFile Reader and Writer inner classes as StoreFileReader and StoreFileWriter.
15415
NOTE! This changes the RegionObserver Coprocessor Interface, so it is an incompatible change (discussed on the dev list in the thread "[Note breaking change on RegionObserver in hbase-2.0.0](https://s.apache.org/hbase-dev-note-about-HBASE-15296)").
15417
15418
15419 ---
15420
15421 * [HBASE-15199](https://issues.apache.org/jira/browse/HBASE-15199) | *Critical* | **Move jruby jar so only on hbase-shell module classpath; currently globally available**
15422
15423 The JRuby jar is no longer automatically included in classpaths for HBase server processes nor clients. It is still included in the classpath for the HBase shell and for invocations of org.jruby.Main, which should cover HBase provided support scripts.
15424
15425
15426 ---
15427
15428 * [HBASE-18009](https://issues.apache.org/jira/browse/HBASE-18009) | *Major* | **Move RpcServer.Call to a separated file**
15429
The return value of CallRunner.getCall is changed, which is an incompatible change because CallRunner is declared as IA.LimitedPrivate. However, CallRunner is also declared as IS.Evolving, so this does not break the compatibility rules. The getCall method itself is kept to reduce the impact on user code.
15431
15432
15433 ---
15434
15435 * [HBASE-14925](https://issues.apache.org/jira/browse/HBASE-14925) | *Major* | **Develop HBase shell command/tool to list table's region info through command line**
15436
15437 Added a shell command 'list\_regions' for displaying the table's region info through command line.
15438
15439         List all regions for a particular table as an array and also filter them by server name (optional) as prefix
15440         and maximum locality (optional). By default, it will return all the regions for the table with any locality.
15441         The command displays server name, region name, start key, end key, size of the region in MB, number of requests
        and the locality. The information can be projected out via an array as the third parameter. By default all of this information
15443         is displayed. Possible array values are SERVER\_NAME, REGION\_NAME, START\_KEY, END\_KEY, SIZE, REQ and LOCALITY. Values
15444         are not case sensitive. If you don't want to filter by server name, pass an empty hash / string as shown below.
15445
15446         Examples:
15447         hbase\> list\_regions 'table\_name'
15448         hbase\> list\_regions 'table\_name', 'server\_name'
15449         hbase\> list\_regions 'table\_name', {SERVER\_NAME =\> 'server\_name', LOCALITY\_THRESHOLD =\> 0.8}
15450         hbase\> list\_regions 'table\_name', {SERVER\_NAME =\> 'server\_name', LOCALITY\_THRESHOLD =\> 0.8}, ['SERVER\_NAME']
15451         hbase\> list\_regions 'table\_name', {}, ['SERVER\_NAME', 'start\_key']
15452         hbase\> list\_regions 'table\_name', '', ['SERVER\_NAME', 'start\_key']
15453
15454
15455 ---
15456
15457 * [HBASE-17471](https://issues.apache.org/jira/browse/HBASE-17471) | *Critical* | **Region Seqid will be out of order in WAL if using mvccPreAssign**
15458
MVCCPreAssign was added by HBASE-16698, but pre-assigned mvcc is only used in the put/delete path. Other write paths, like increment/append, still assign the mvcc in the ringbuffer's consumer thread. If put and increment are used in parallel, the seqid in the WAL may not increase monotonically, and disorder in the WALs can lead to data loss. This patch brings all mvcc/seqid events into wal.append and synchronizes the WAL append with mvcc acquisition, so no disorder in the WAL will happen. Performance tests show no regression with this patch.
15460
15461
15462 ---
15463
15464 * [HBASE-16466](https://issues.apache.org/jira/browse/HBASE-16466) | *Major* | **HBase snapshots support in VerifyReplication tool to reduce load on live HBase cluster with large tables**
15465
Adds support for snapshots to the VerifyReplication tool, i.e. verifyrep can compare a source table snapshot against a peer table snapshot, which reduces load on the RegionServers by reading data from HDFS directly using snapshot scanners.
Instead of comparing against live tables, whose state changes due to writes and compactions, it's better to compare HBase snapshots, which are immutable.
15468
15469
15470 ---
15471
15472 * [HBASE-17263](https://issues.apache.org/jira/browse/HBASE-17263) | *Major* | **  Netty based rpc server impl**
15473
A new RPC server based on Netty4, which can improve random read (get) performance. It is off by default. To use this feature, set "hbase.rpc.server.impl" to "org.apache.hadoop.hbase.ipc.NettyRpcServer".

In one deployment, it doubled the throughput and lowered the latency significantly: see https://www.slideshare.net/HBaseCon/lift-the-ceiling-of-hbase-throughputs?qid=597ee2fa-8125-4faa-bb3b-2bf1ba9ccafb&v=&b=&from\_search=6
15477
15478
15479 ---
15480
15481 * [HBASE-17957](https://issues.apache.org/jira/browse/HBASE-17957) | *Minor* | ** Custom metrics of replicate endpoints don't prepend "source." to global metrics**
15482
15483 Global custom metrics names follow the "source.metricsName" format.
15484
15485
15486 ---
15487
15488 * [HBASE-17757](https://issues.apache.org/jira/browse/HBASE-17757) | *Major* | **Unify blocksize after encoding to decrease memory fragment**
15489
Blocksize is set in a column family's attributes. It is used to control block sizes when generating blocks, but it does not take encoding into account: if you set an encoding on blocks, the block size varies after encoding. Since blocks are cached in memory after encoding (by default), this causes memory fragmentation when using the blockcache, or decreases pool efficiency when using bucketCache. This issue introduces a new config named 'hbase.writer.unified.encoded.blocksize.ratio'. The default value of this config is 1, meaning do nothing. If it is set to a smaller value such as 0.5, and the blocksize is set to 64KB (the default blocksize), the blocksize after encoding is unified to 64KB \* 0.5 = 32KB. A unified blocksize relieves the memory problems mentioned above.
15491
15492
15493 ---
15494
15495 * [HBASE-14286](https://issues.apache.org/jira/browse/HBASE-14286) | *Trivial* | **Correct typo in argument name for WALSplitter.writeRegionSequenceIdFile**
15496
15497 HBASE-14286 Correct typo in argument name for WALSplitter.writeRegionSequenceIdFile
15498
15499
15500 ---
15501
15502 * [HBASE-17817](https://issues.apache.org/jira/browse/HBASE-17817) | *Major* | **Make Regionservers log which tables it removed coprocessors from when aborting**
15503
15504 Add table name to exception logging when a coprocessor is removed from a table by the region server
15505
15506
15507 ---
15508
15509 * [HBASE-17877](https://issues.apache.org/jira/browse/HBASE-17877) | *Major* | **Improve HBase's byte[] comparator**
15510
Updated the lexicographic byte array comparator to use a slightly more optimized version, similar to the one available in the Guava library, that compares only at the first index where left[index] != right[index]. The comparator also returns the diff directly instead of mapping it to the -1, 0, +1 range as the earlier version did. We have seen significant performance gains, measured in terms of throughput (ops/ms), with these changes, ranging from approximately 20% for smaller byte arrays of up to 200 bytes to almost 100% for large byte arrays of a few KB. We benchmarked with arrays of up to 16KB, and the general trend indicates that the performance improvement increases as the size of the byte array increases.
15512
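The comparison rule can be sketched as below. This is a plain-loop illustration under stated assumptions, not the optimized HBase or Guava code (which reads several bytes at a time); `ByteArrayComparatorSketch` is a hypothetical name. It shows the two points made in the note: the result is decided at the first differing index, and the raw difference is returned instead of -1/0/+1.

```java
/** Hypothetical lexicographic comparator over unsigned bytes. */
final class ByteArrayComparatorSketch {
    static int compare(byte[] left, byte[] right) {
        int minLength = Math.min(left.length, right.length);
        for (int i = 0; i < minLength; i++) {
            if (left[i] != right[i]) {
                // Treat bytes as unsigned and return the raw difference.
                return (left[i] & 0xff) - (right[i] & 0xff);
            }
        }
        // One array is a prefix of the other: the shorter one sorts first.
        return left.length - right.length;
    }
}
```

With this sketch, comparing `{1, 2, 3}` with `{1, 2, 5}` returns -2 rather than -1, so callers should rely only on the sign of the result.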
15513
15514 ---
15515
15516 * [HBASE-9899](https://issues.apache.org/jira/browse/HBASE-9899) | *Major* | **for idempotent operation dups, return the result instead of throwing conflict exception**
15517
Non-idempotent operations (increment/append/checkAndPut/...) may throw OperationConflictException even though the increment/append succeeded. For example (with the client rpc retries number set to 3):

1. The first increment rpc request succeeds.
2. The client times out and sends a second rpc request with the same nonce, which is already saved in the server. The server finds that the operation has already succeeded, so it returns an OperationConflictException to make sure that the increment is applied only once on the server.

This patch solves the problem by reading the previous result when a duplicate rpc request is received:
1. Store the mvcc in the OperationContext. When the first rpc request succeeds, store the mvcc for this operation nonce.
2. When a duplicate rpc request arrives, convert it to a read of the result by that mvcc.
15526
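The effect of the change can be sketched as below. This is a simplification, not the server code: `NonceReplaySketch` is a hypothetical class, and it remembers the result itself per nonce, whereas the patch stores the mvcc and re-reads the result by it.

```java
import java.util.HashMap;
import java.util.Map;

/** Hypothetical counter that answers a retried nonce with the first attempt's result. */
final class NonceReplaySketch {
    private long value;
    private final Map<Long, Long> resultByNonce = new HashMap<>();

    /** Applies the increment once per nonce; a duplicate nonce gets the stored result. */
    synchronized long increment(long nonce, long delta) {
        Long previous = resultByNonce.get(nonce);
        if (previous != null) {
            return previous;   // duplicate RPC: no second increment, no exception
        }
        value += delta;
        resultByNonce.put(nonce, value);
        return value;
    }

    synchronized long current() {
        return value;
    }
}
```

A retry with the same nonce therefore sees the same answer as the first attempt, and the counter is incremented only once.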
15527
15528 ---
15529
15530 * [HBASE-15583](https://issues.apache.org/jira/browse/HBASE-15583) | *Minor* | **Any HTableDescriptor we give out should be immutable**
15531
1. The HTD obtained from Admin, AsyncAdmin, and Table is immutable.
2. DEFERRED\_LOG\_FLUSH is removed.
3. The deprecated construction of HTD is cleaned up.
15535
15536
15537 ---
15538
15539 * [HBASE-17956](https://issues.apache.org/jira/browse/HBASE-17956) | *Major* | **Raw scan should ignore TTL**
15540
15541 Now raw scan can also read expired cells.
15542
15543
15544 ---
15545
15546 * [HBASE-15143](https://issues.apache.org/jira/browse/HBASE-15143) | *Minor* | **Procedure v2 - Web UI displaying queues**
15547
15548 Adds a new Admin#listLocks, a panel on the procedures page to list procedure locks, and a list\_locks command to the shell. Use it to see current state of procedure locking in Master process.
15549
15550
15551 ---
15552
15553 * [HBASE-17514](https://issues.apache.org/jira/browse/HBASE-17514) | *Minor* | **Warn when Thrift Server 1 is configured for proxy users but not the HTTP transport**
15554
15555 If users of the Thrift 1 Server enable proxy user support without enabling the prerequisite HTTP transport, we now log a WARN message about the mismatch.
15556
15557
15558 ---
15559
15560 * [HBASE-17914](https://issues.apache.org/jira/browse/HBASE-17914) | *Major* | **Create a new reader instead of cloning a new StoreFile when compaction**
15561
15562 StoreFile.createReader method is gone. Call initReader and then getReader instead.
15563
15564
15565 ---
15566
15567 * [HBASE-16477](https://issues.apache.org/jira/browse/HBASE-16477) | *Major* | **Remove Writable interface and related code from WALEdit/WALKey**
15568
15569 Removes the Writables, and related code from WALEdit class. HBase-2.0 will not be able to read WAL files written with 0.94.x and before.
15570
15571
15572 ---
15573
15574 * [HBASE-17858](https://issues.apache.org/jira/browse/HBASE-17858) | *Major* | **Update refguide about the IS annotation if necessary**
15575
15576 Updated refguide to tell users that IS annotation is only valid for IA.LimitedPrivate classes.
15577
15578
15579 ---
15580
15581 * [HBASE-17857](https://issues.apache.org/jira/browse/HBASE-17857) | *Major* | **Remove IS annotations from IA.Public classes**
15582
Now we do not have InterfaceStability annotations for the IA.Public API. The stability of these classes will follow the rule of 'Semantic Versioning'.
15584
15585
15586 ---
15587
15588 * [HBASE-17215](https://issues.apache.org/jira/browse/HBASE-17215) | *Major* | **Separate small/large file delete threads in HFileCleaner to accelerate archived hfile cleanup speed**
15589
After HBASE-17215, two threads are used for (archived) hfile cleaning, one for large files and one for small files. The size threshold separating large from small files can be set through "hbase.regionserver.thread.hfilecleaner.throttle" and defaults to 67108864 (64M). It supports online configuration change: find the active master address through zookeeper dump and use it in the update\_config command, e.g. update\_config 'hbasem1.et2.tbsite.net,60100,1488038696741'
15591
15592
15593 ---
15594
15595 * [HBASE-16780](https://issues.apache.org/jira/browse/HBASE-16780) | *Critical* | **Since move to protobuf3.1, Cells are limited to 64MB where previous they had no limit**
15596
Upgrades the internal protobuf from 3.1 to 3.2, which has a fix for the 64MB limit.
15598
15599
15600 ---
15601
15602 * [HBASE-17287](https://issues.apache.org/jira/browse/HBASE-17287) | *Blocker* | **Master becomes a zombie if filesystem object closes**
15603
15604 If filesystem is not available during log split, abort master server.
15605
15606
15607 ---
15608
15609 * [HBASE-17765](https://issues.apache.org/jira/browse/HBASE-17765) | *Major* | **Reviving the merge possibility in the CompactingMemStore**
15610
Revives the merge of the compacting pipeline: makes the limit on the number of segments in the pipeline configurable and adds a merge test.

To customize the pipeline size limit, change the value of "hbase.hregion.compacting.pipeline.segments.limit" in hbase-site.xml.

A value of 1 means merging the segments on every flush-in-memory. A value higher than 16 means no merge.
15616
15617
15618 ---
15619
15620 * [HBASE-13395](https://issues.apache.org/jira/browse/HBASE-13395) | *Major* | **Remove HTableInterface**
15621
15622 HTableInterface was deprecated in 0.21.0 and is removed in 2.0.0. Use org.apache.hadoop.hbase.client.Table instead.
15623
15624
15625 ---
15626
15627 * [HBASE-17595](https://issues.apache.org/jira/browse/HBASE-17595) | *Critical* | **Add partial result support for small/limited scan**
15628
Small scans and limited scans can now also return partial results.
15630
15631
15632 ---
15633
15634 * [HBASE-16014](https://issues.apache.org/jira/browse/HBASE-16014) | *Major* | **Get and Put constructor argument lists are divergent**
15635
Add 2 constructors for API Get
15637 1. Get(byte[], int, int)
15638 2. Get(ByteBuffer)
15639
15640
15641 ---
15642
15643 * [HBASE-17584](https://issues.apache.org/jira/browse/HBASE-17584) | *Major* | **Expose ScanMetrics with ResultScanner rather than Scan**
15644
Now you can use ResultScanner.getScanMetrics to get the scan metrics at any time during the scan operation. The old Scan.getScanMetrics is deprecated but still works; however, if you use ResultScanner.getScanMetrics to get the scan metrics and reset them, the metrics published to the Scan instance will be messed up.
15646
15647
15648 ---
15649
15650 * [HBASE-17802](https://issues.apache.org/jira/browse/HBASE-17802) | *Major* | **Add note that minor versions can add methods to Interfaces**
15651
15652 Update our semver section to include a note on our allowing ourselves the right to add methods to an Interface over a minor version as agreed to up on the dev list:  "If a Client implements an HBase Interface, a recompile MAY be required upgrading to a newer minor version (See release notes for warning about incompatible changes). All effort will be made to provide a default implementation so this case should not arise."
15653
15654
15655 ---
15656
15657 * [HBASE-17426](https://issues.apache.org/jira/browse/HBASE-17426) | *Major* | **Inconsistent environment variable names for enabling JMX**
15658
In bin/hbase-config.sh:
If the value of HBASE\_JMX\_BASE is empty, keep the current behavior.
If HBASE\_JMX\_OPTS is not empty, keep the current behavior.
Otherwise, use the value of HBASE\_JMX\_BASE.
15663
15664
15665 ---
15666
15667 * [HBASE-17740](https://issues.apache.org/jira/browse/HBASE-17740) | *Critical* | **Correct the semantic of batch and partial for async client**
15668
The async client now has the same semantics as the sync client for batch and partial results:

Now setBatch doesn't mean setAllowPartialResults(true).
If the user calls setBatch(5) and the rpc returns 3+5+5+5+3 cells, we should return 5+5+5+5+1 to the user.
15674
15675 Also a minor API change:
15676 Result#createCompleteResult(List\<Result\>) is changed to Result#createCompleteResult(Iterable\<Result\>).
15677
15678
15679 ---
15680
15681 * [HBASE-17746](https://issues.apache.org/jira/browse/HBASE-17746) | *Major* | **TestSimpleRpcScheduler.testCoDelScheduling is broken**
15682
15683 The executor for CoDel is changed to FastPathBalancedQueueRpcExecutor
15684
15685
15686 ---
15687
15688 * [HBASE-17712](https://issues.apache.org/jira/browse/HBASE-17712) | *Major* | **Remove/Simplify the logic of RegionScannerImpl.handleFileNotFound**
15689
15690 Add a config named 'hbase.hregion.unassign.for.fnfe'. It is used to control whether to reopen a region when hitting FileNotFoundException. The default value is true.
15691
15692
15693 ---
15694
15695 * [HBASE-15941](https://issues.apache.org/jira/browse/HBASE-15941) | *Major* | **HBCK repair should not unsplit healthy splitted region**
15696
A new option -removeParents is now available that will remove an old parent when two valid daughters for that parent exist and -fixHdfsOverlaps is used. If there is an issue trying to remove the parent from META or sidelining the parent from HDFS, we will fall back to a regular merge. For now this option only works when the overlap group consists of exactly 3 regions (a parent, daughter A and daughter B).
15698
15699
15700 ---
15701
15702 * [HBASE-17737](https://issues.apache.org/jira/browse/HBASE-17737) | *Major* | **Thrift2 proxy should support scan timeRange per column family**
15703
15704 Thrift2 proxy supports scan timeRange per column family
15705
15706
15707 ---
15708
15709 * [HBASE-17718](https://issues.apache.org/jira/browse/HBASE-17718) | *Major* | **Difference between RS's servername and its ephemeral node cause SSH stop working**
15710
Fix our accidentally registering a RegionServer's ephemeral znode BEFORE we checked in with the master.
15712
15713
15714 ---
15715
15716 * [HBASE-17717](https://issues.apache.org/jira/browse/HBASE-17717) | *Critical* | **Incorrect ZK ACL set for HBase superuser**
15717
15718 In previous versions of HBase, the system intended to set a ZooKeeper ACL on all "sensitive" ZNodes for the user specified in the hbase.superuser configuration property. Unfortunately, the ACL was malformed which resulted in the hbase.superuser being unable to access the sensitive ZNodes that HBase creates. This JIRA issue fixes this bug. HBase will automatically correct the ACLs on start so users do not need to manually correct the ACLs.
15719
15720
15721 ---
15722
15723 * [HBASE-17716](https://issues.apache.org/jira/browse/HBASE-17716) | *Minor* | **Formalize Scan Metric names**
15724
15725 HBASE-17716 breaks compatibility of ServerSideScanMetrics by changing public field names, and the issue is fixed through HBASE-17886
15726
15727
15728 ---
15729
15730 * [HBASE-15484](https://issues.apache.org/jira/browse/HBASE-15484) | *Blocker* | **Correct the semantic of batch and partial**
15731
Now setBatch doesn't mean setAllowPartialResults(true).
If the user calls setBatch(5) and the rpc returns 3+5+5+5+3 cells, we should return 5+5+5+5+1 to the user.
Scan#setBatch is helpful for paging queries; if you just want to prevent OOM at the client, setAllowPartialResults(true) is the better choice.
We deprecated isPartial in favor of mayHaveMoreCellsInRow. If it returns false, the current Result must be the last one of this row.
15736
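The regrouping in the 3+5+5+5+3 example can be reproduced with a few lines of arithmetic. This is a minimal sketch: `BatchRegroupSketch` is a hypothetical helper that works on cell counts for a single row rather than on real Result objects, and it assumes batch is positive.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical helper that regroups one row's RPC chunks into batch-sized results. */
final class BatchRegroupSketch {
    /** Returns the sizes of the results handed to the user for one row. */
    static List<Integer> regroup(int[] rpcChunkSizes, int batch) {
        int total = 0;
        for (int size : rpcChunkSizes) {
            total += size;   // cells of the row, regardless of how the RPCs split them
        }
        List<Integer> resultSizes = new ArrayList<>();
        while (total > 0) {
            int size = Math.min(batch, total);
            resultSizes.add(size);   // every result is full except possibly the last
            total -= size;
        }
        return resultSizes;
    }
}
```

For the chunk sizes 3, 5, 5, 5, 3 and a batch of 5, the sketch returns 5, 5, 5, 5, 1, matching the example above.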
15737
15738 ---
15739
15740 * [HBASE-17312](https://issues.apache.org/jira/browse/HBASE-17312) | *Major* | **[JDK8] Use default method for Observer Coprocessors**
15741
15742 Deletes BaseMasterAndRegionObserver, BaseMasterObserver, BaseRegionObserver, BaseRegionServerObserver and BaseWALObserver.
15743 Their corresponding interface classes now use JDK8's 'default' keyword to provide empty/no-op implementations so that:
1. Derived classes don't break when more coprocessor hooks are added in the future.
15745 2. Derived classes don't have to redundantly override functions they don't care about with empty implementations.
15746
15747 Earlier, BaseXXXObserver classes provided these exact two benefits, but with 'default' keyword in JDK8, they are not needed anymore.
15748
15749 To fix the breakages because of this change, simply change "Foo extends BaseXXXObserver" to "Foo implements XXXObserver".
15750
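The pattern behind "Foo implements XXXObserver" can be sketched in plain Java. The interface and class below are hypothetical stand-ins, not HBase types: `ExampleObserver` plays the role of an observer interface whose hooks have no-op default implementations, and `OpenLogger` overrides only the hook it needs.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical observer interface: every hook has an empty default body. */
interface ExampleObserver {
    default void preOpen(String region) { }
    default void postOpen(String region) { }
    default void preClose(String region) { }
}

/** Overrides a single hook; the remaining hooks stay no-ops without a base class. */
final class OpenLogger implements ExampleObserver {
    final List<String> opened = new ArrayList<>();

    @Override
    public void postOpen(String region) {
        opened.add(region);
    }
}
```

If a new default hook is later added to the interface, `OpenLogger` keeps compiling unchanged, which is the benefit the BaseXXXObserver classes used to provide.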
15751
15752 ---
15753
15754 * [HBASE-17647](https://issues.apache.org/jira/browse/HBASE-17647) | *Major* | **OffheapKeyValue#heapSize() implementation is wrong**
15755
15756 **WARNING: No release note provided for this change.**
15757
15758
15759 ---
15760
15761 * [HBASE-13718](https://issues.apache.org/jira/browse/HBASE-13718) | *Minor* | **Add a pretty printed table description to the table detail page of HBase's master**
15762
15763 <!-- markdown -->
15764
15765
15766 The table information page in the Master UI now includes a schema section that describes the column families defined for that table as well as any column family specific properties that are set.
15767
15768
15769 ---
15770
15771 * [HBASE-17472](https://issues.apache.org/jira/browse/HBASE-17472) | *Major* | **Correct the semantic of  permission grant**
15772
Before this patch, later granted permissions would override previously granted permissions, and the previously granted permissions were LOST. This issue redefines the grant semantics: for the master branch, later granted permissions are merged with previously granted permissions. For branch-1.4, grant keeps the override behavior for compatibility purposes, and a grant with a mergeExistingPermission flag is provided.
15774
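The difference between the two grant semantics can be sketched with plain sets. This is an illustration only: `GrantSemanticsSketch`, its `Action` enum, and the `grant` method are hypothetical and do not reproduce the AccessController code; the boolean parameter mirrors the mergeExistingPermission flag mentioned above.

```java
import java.util.EnumSet;
import java.util.Set;

/** Hypothetical model of override versus merge when granting permissions. */
final class GrantSemanticsSketch {
    enum Action { READ, WRITE, EXEC, CREATE, ADMIN }

    static Set<Action> grant(Set<Action> existing, Set<Action> granted, boolean mergeExisting) {
        EnumSet<Action> result = EnumSet.noneOf(Action.class);
        if (mergeExisting) {
            result.addAll(existing);   // merge: keep what was granted earlier
        }
        result.addAll(granted);        // override: only the latest grant survives
        return result;
    }
}
```

Granting WRITE to a user who already has READ yields READ and WRITE with merging, and only WRITE with the override behavior.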
15775
15776 ---
15777
15778 * [HBASE-17583](https://issues.apache.org/jira/browse/HBASE-17583) | *Major* | **Add inclusive/exclusive support for startRow and endRow of scan for sync client**
15779
Now you can include/exclude the startRow and stopRow for a scan. The new methods to specify startRow and stopRow are withStartRow and withStopRow. The old methods to specify startRow and stopRow (including constructors) are marked as deprecated because, in the old behavior, if startRow and stopRow are equal we consider the scan a get scan and include the stopRow implicitly. This is strange now that inclusiveness can be set explicitly, so we add new methods and deprecate the old ones. The deprecated methods will be removed in the future.
15781
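What explicit inclusiveness means for row selection can be sketched as a range check. This is a minimal illustration under stated assumptions: `RowRangeSketch.contains` is a hypothetical helper, rows are compared as Strings rather than byte arrays, and empty (unbounded) start or stop rows are not modeled.

```java
/** Hypothetical range check with explicit inclusiveness for both ends. */
final class RowRangeSketch {
    static boolean contains(String row, String startRow, boolean startInclusive,
                            String stopRow, boolean stopInclusive) {
        int fromStart = row.compareTo(startRow);
        int fromStop = row.compareTo(stopRow);
        boolean afterStart = startInclusive ? fromStart >= 0 : fromStart > 0;
        boolean beforeStop = stopInclusive ? fromStop <= 0 : fromStop < 0;
        return afterStart && beforeStop;
    }
}
```

With equal start and stop rows, the row is selected only when both ends are inclusive; nothing is included implicitly.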
15782
15783 ---
15784
15785 * [HBASE-9702](https://issues.apache.org/jira/browse/HBASE-9702) | *Major* | **Change unittests that use "table" or "testtable" to use method names.**
15786
15787 Changes all tests to use the TestName JUnit Rule everywhere rather than hardcode table/region/store names.
15788
15789
15790 ---
15791
15792 * [HBASE-17280](https://issues.apache.org/jira/browse/HBASE-17280) | *Minor* | **Add mechanism to control hbase cleaner behavior**
15793
15794 The HBase cleaner chore process cleans up old WAL files and archived HFiles. Cleaner operation can affect query performance when running heavy workloads, so disable the cleaner during peak hours. The cleaner has the following HBase shell commands:
15795
15796 - cleaner\_chore\_enabled: Queries whether the cleaner chore is enabled or disabled.
15797 - cleaner\_chore\_run: Manually runs the cleaner to remove files.
15798 - cleaner\_chore\_switch: Enables or disables the cleaner and returns the previous state of the cleaner. For example, cleaner\_chore\_switch true enables the cleaner.
15799
15800 The following APIs are added to Admin:
15801 - setCleanerChoreRunning(boolean on): Enables or disables the cleaner chore.
15802 - runCleanerChore(): Asks for the cleaner chore to run.
15803 - isCleanerChoreEnabled(): Queries whether the cleaner chore is enabled or disabled.
15804
15805
15806 ---
15807
15808 * [HBASE-17599](https://issues.apache.org/jira/browse/HBASE-17599) | *Major* | **Use mayHaveMoreCellsInRow instead of isPartial**
15809
15810 The name 'isPartial' is ambiguous, so a new method 'mayHaveMoreCellsInRow' is introduced to replace it. The old meaning of 'isPartial' was not the same as 'mayHaveMoreCellsInRow': for a batched scan, if the number of returned cells equaled the batch size, isPartial was false. After this change 'isPartial' has the same meaning as 'mayHaveMoreCellsInRow'. This is an incompatible change, but it is unlikely to break much, because for a batched scan the old 'isPartial' was redundant information when the number of returned cells reached the batch limit: you already know the number of returned cells and the value of batch.
15811
15812
15813 ---
15814
15815 * [HBASE-17437](https://issues.apache.org/jira/browse/HBASE-17437) | *Major* | **Support specifying a WAL directory outside of the root directory**
15816
15817 This patch adds support for specifying a WAL directory outside of the HBase root directory.
15818
15819 Multiple configuration variables were added to accomplish this:
15820 hbase.wal.dir: Configures where the root WAL directory is located. It can be on a different FileSystem than the root directory. The WAL directory cannot be set to a subdirectory of the root directory. If unset, it defaults to the root directory.
15821
15822 hbase.rootdir.perms: Configures FileSystem permissions to set on the root directory. This is '700' by default.
15823
15824 hbase.wal.dir.perms: Configures FileSystem permissions to set on the WAL directory FileSystem. This is '700' by default.
15825
15826
15827 ---
15828
15829 * [HBASE-17350](https://issues.apache.org/jira/browse/HBASE-17350) | *Critical* | **Fixup of regionserver group-based assignment**
15830
15831 A few bug fixes and tweaks to the rsgroup feature.
15832
15833 Renamed shell command move\_rsgroup\_servers as move\_servers\_rsgroup
15834 Renamed shell command move\_rsgroup\_tables as move\_tables\_rsgroup
15835
15836 Made the 'default' group more 'dynamic'; i.e. dead servers no longer show in the 'default' group.
15837
15838
15839 ---
15840
15841 * [HBASE-17578](https://issues.apache.org/jira/browse/HBASE-17578) | *Major* | **Thrift per-method metrics should still update in the case of exceptions**
15842
15843 In prior versions, the HBase Thrift handlers failed to increment per-method metrics when an exception was encountered.  These metrics will now always be incremented, whether an exception is encountered or not.  This change also adds exception-type metrics, similar to those exposed in regionservers, for individual exceptions which are received by the Thrift handlers.
15844
15845
15846 ---
15847
15848 * [HBASE-17508](https://issues.apache.org/jira/browse/HBASE-17508) | *Major* | **Unify the implementation of small scan and regular scan for sync client**
15849
15850 The scan.setSmall method is now deprecated; use scan.setLimit and scan.setReadType instead. The scanner is now opened lazily when you call scanner.next. This is an incompatible change because it delays the table existence check and the permission check.
15851
15852
15853 ---
15854
15855 * [HBASE-16981](https://issues.apache.org/jira/browse/HBASE-16981) | *Major* | **Expand Mob Compaction Partition policy from daily to weekly, monthly**
15856
15857 Mob compaction partition policy can be set by
15858 hbase\> create 't1', {NAME =\> 'f1', IS\_MOB =\> true, MOB\_THRESHOLD =\> 1000000, MOB\_COMPACT\_PARTITION\_POLICY =\> 'weekly'}
15859
15860 or
15861
15862 hbase\> alter 't1', {NAME =\> 'f1', IS\_MOB =\> true, MOB\_THRESHOLD =\> 1000000, MOB\_COMPACT\_PARTITION\_POLICY =\> 'monthly'}
15863
15864 Available MOB\_COMPACT\_PARTITION\_POLICY options are "daily", "weekly" and "monthly", the default is "daily".
15865
15866 With the "weekly" policy, mob compaction tries to compact the files within one calendar week into one file for a specific partition; "daily" and "monthly" work similarly.
15867
15868 With the "weekly" policy, one mob file is normally compacted twice during its lifetime (first on a daily basis, and then all the daily compacted files belonging to a week are compacted at the weekly interval); for one region there are normally 52 files for one year. With the "monthly" policy, one mob file is normally compacted 3 times during its lifetime (first daily, then weekly, then monthly at the end of every month), and there are normally 12 files for one year.
15869
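One way to picture how a policy groups files is to map each file date to the first day of the period it falls in. The following is a minimal sketch under stated assumptions, not the HBase implementation: the class `MobPartition` and method `partitionStart` are hypothetical, and the calendar week is assumed to start on Monday.

```java
import java.time.DayOfWeek;
import java.time.LocalDate;
import java.time.temporal.TemporalAdjusters;

/** Toy model of partition grouping by date; illustrative only, not HBase code. */
final class MobPartition {
    enum Policy { DAILY, WEEKLY, MONTHLY }

    /** Returns the first day of the period that fileDate falls into under the given policy. */
    static LocalDate partitionStart(LocalDate fileDate, Policy policy) {
        switch (policy) {
            case WEEKLY:
                // Assumption: weeks start on Monday.
                return fileDate.with(TemporalAdjusters.previousOrSame(DayOfWeek.MONDAY));
            case MONTHLY:
                return fileDate.withDayOfMonth(1);
            default:
                return fileDate;
        }
    }

    public static void main(String[] args) {
        LocalDate date = LocalDate.of(2017, 2, 15);
        for (Policy policy : Policy.values()) {
            System.out.println(policy + " -> " + partitionStart(date, policy));
        }
    }
}
```

Files whose dates map to the same start day would be candidates for compaction into one file under that policy.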
15870
15871 ---
15872
15873 * [HBASE-17197](https://issues.apache.org/jira/browse/HBASE-17197) | *Major* | **hfile does not work in 2.0**
15874
15875 The -f argument is no longer required to specify the target file; just pass the file as an argument.
15876
15877
15878 ---
15879
15880 * [HBASE-16812](https://issues.apache.org/jira/browse/HBASE-16812) | *Minor* | **Clean up the locks in MOB**
15881
15882 In a MOB-enabled column family, the lock in major compaction is removed. All the delete markers are retained in the major compaction, and a MOB reference tag is appended to each of the retained delete markers.
15883
15884
15885 ---
15886
15887 * [HBASE-12894](https://issues.apache.org/jira/browse/HBASE-12894) | *Critical* | **Upgrade Jetty to 9.2.6**
15888
15889 Upgrades Jetty to 9.x from 6.x (Jetty9 is in different namespace from Jetty6). Also updated Jersey to 2.x and Servlet to 3.x.
15890
15891
15892 ---
15893
15894 * [HBASE-17566](https://issues.apache.org/jira/browse/HBASE-17566) | *Major* | **Jetty upgrade fixes**
15895
15896 Fixes the inability to find static content after the parent issue moved us to jetty9.
15897
15898
15899 ---
15900
15901 * [HBASE-9774](https://issues.apache.org/jira/browse/HBASE-9774) | *Major* | **HBase native metrics and metric collection for coprocessors**
15902
15903 This issue adds two new modules, hbase-metrics and hbase-metrics-api, which define and implement the "new" metric system used internally within HBase. These two modules (and some other code in the hbase-hadoop2-compat module) are referred to as the "HBase metrics framework", which is HBase-specific and independent of any other metrics library (including Hadoop metrics2 and Dropwizard metrics).
15904
15905 HBase Metrics API (hbase-metrics-api) contains the interface that HBase exposes internally and to third party code (including coprocessors). It is a thin abstraction over the actual implementation for backwards compatibility guarantees. The metrics API in this hbase-metrics-api module is inspired by the Dropwizard metrics 3.1 API, however, the API is completely independent.
15907
15908 hbase-metrics module contains implementation of the "HBase Metrics API", including MetricRegistry, Counter, Histogram, etc. These are highly concurrent implementations of the Metric interfaces. Metrics in HBase are grouped into different sets (like WAL, RPC, RegionServer, etc). Each group of metrics should be tracked via a MetricRegistry specific to that group.
15909
15910 Historically, HBase has been using Hadoop's Metrics2 framework [3] for collecting and reporting the metrics internally. However, due to the difficulty of dealing with the Metrics2 framework, HBase is moving away from Hadoop's metrics implementation to its custom implementation. The move will happen incrementally, and during that time, both Hadoop Metrics2-based metrics and hbase-metrics module based classes will be in the source code. All new implementations for metrics SHOULD use the new API and framework.
15911
15912 This jira also introduces the metrics API to coprocessor implementations. Coprocessor writers can export custom metrics using the API and have those collected via metrics2 sinks, as well as exported via JMX in regionserver metrics.
15913
15914 More documentation available at: hbase-metrics-api/README.txt
15915
15916
15917 ---
15918
15919 * [HBASE-17491](https://issues.apache.org/jira/browse/HBASE-17491) | *Major* | **Remove all setters from HTable interface and introduce a TableBuilder to build Table instance**
15920
15921 After HBASE-17491 all setter methods in HTable are marked as deprecated, moved into TableBuilder, and will be removed later.
15922
15923
15924 ---
15925
15926 * [HBASE-17067](https://issues.apache.org/jira/browse/HBASE-17067) | *Major* | **Procedure v2 - remove tryAcquire\*Lock and use wait/wake to make framework event based**
15927
15928 Make the framework more 'lively'; undo 'suspend' notion in Procedure, rely on eventing mechanism instead. Lets us remove no longer needed synchronizations. Framework can now do more ops per second.
15929
15930
15931 ---
15932
15933 * [HBASE-16698](https://issues.apache.org/jira/browse/HBASE-16698) | *Major* | **Performance issue: handlers stuck waiting for CountDownLatch inside WALKey#getWriteEntry under high writing workload**
15934
15935 Assign sequenceid to an edit before we go on the ringbuffer; undoes contention on WALKey latch. Adds a new config "hbase.hregion.mvcc.preassign" which defaults to true: i.e. this speedup is enabled.
15936
15937 Users can set this at the per-table level, for example:
15938 create 'table',{NAME=\>'f1',CONFIGURATION=\>{'hbase.hregion.mvcc.preassign'=\>'false'}}
15939
15940
15941 ---
15942
15943 * [HBASE-17488](https://issues.apache.org/jira/browse/HBASE-17488) | *Trivial* | **WALEdit should be lazily instantiated**
15944
15945 Prevents creating unused objects during WALEdit construction.
15946 - If cp#preBatchMutate returns true, the WALEdit is useless, so the WALEdit should be created after step 2.
15947 - The cells that come from the cp should be counted because they are added to the WALEdit. The use case is the local index of Phoenix.
15948 - If the mutation contains the SKIP\_WAL property, its cells aren't added to the WALEdit, so these cells shouldn't be counted.
15949
15950
15951 ---
15952
15953 * [HBASE-16831](https://issues.apache.org/jira/browse/HBASE-16831) | *Minor* | **Procedure V2 - Remove org.apache.hadoop.hbase.zookeeper.lock**
15954
15955 Purges code that did zk-hosted locks for table ops (we do procedure-based locks now)
15956
15957
15958 ---
15959
15960 * [HBASE-16867](https://issues.apache.org/jira/browse/HBASE-16867) | *Major* | **Procedure V2 - Check ACLs for remote HBaseLock**
15961
15962 Adds ACL checks when taking locks.
15963
15964
15965 ---
15966
15967 * [HBASE-16786](https://issues.apache.org/jira/browse/HBASE-16786) | *Major* | **Procedure V2 - Move ZK-lock's uses to Procedure framework locks (LockProcedure)**
15968
15969 Moves locking to be procedure (Pv2) based rather than zookeeper based. All locking is moved over to the new infrastructure, including MOB locking.
15970
15971
15972 ---
15973
15974 * [HBASE-17470](https://issues.apache.org/jira/browse/HBASE-17470) | *Major* | **Remove merge region code from region server**
15975
15976 In 1.x branches, Admin.mergeRegions calls MASTER via dispatchMergingRegions RPC; when executing dispatchMergingRegions RPC, MASTER calls RS via MergeRegions to complete the merge in RS-side.
15977
15978 With HBASE-16119, the merge logic moves to the master side. This JIRA cleans up unused RPCs (dispatchMergingRegions and MergeRegions), removes dangerous tools such as Merge and HMerge, and deletes the unused RegionServer-side merge region logic in the 2.0 release.
15979
15980
15981 ---
15982
15983 * [HBASE-16744](https://issues.apache.org/jira/browse/HBASE-16744) | *Major* | **Procedure V2 - Lock procedures to allow clients to acquire locks on tables/namespaces/regions**
15984
15985 A lock for an HBase entity: either a Table, a Namespace, or Regions.
15986
15987 These are remote locks which live on master, and need periodic heartbeats to keep them alive. (Once we request the lock, a heartbeat thread is started internally.) If master doesn't receive the heartbeat in time, it'll release the lock and make it available to other users.
15988
15989 Use LockServiceClient to build instances, then call requestLock(). requestLock() will contact master to queue the lock and start the heartbeat thread, which checks the lock's status periodically and, once the lock is acquired, sends heartbeats to the master.
15990
15991 Use await() or await(long, TimeUnit) to wait for the lock to be acquired. Always call unlock(), irrespective of whether the lock was acquired or not. If the lock was acquired, it'll be released. If it was not acquired, it is possible that master grants the lock in the future and the heartbeat thread keeps it alive forever by sending heartbeats. Calling unlock() will stop the heartbeat thread and cancel the lock queued on master.
15992
15993 There are 4 ways in which these remote locks may be released or lost:
15994   \* Call unlock().
15995   \* Lock times out on master: can happen because of network issues, GC pauses, etc. The worker thread will call the given abortable as soon as it detects such a situation.
15996   \* Fail to contact master: if the worker thread cannot contact master and thus fails to send the heartbeat before the timeout expires, it assumes that the lock is lost and calls the abortable.
15997   \* Worker thread is interrupted.
15998
15999 Use example:
16000
16001 EntityLock lock = lockServiceClient.\*Lock(...., "example lock", abortable);
16002 lock.requestLock();
16003 ....
16004 ....can do other initializations here since lock is 'asynchronous'...
16005 ....
16006 if (lock.await(timeout)) {
16007   ....logic requiring mutual exclusion
16008 }
16009 lock.unlock();
16010
16011
16012 ---
16013
16014 * [HBASE-14061](https://issues.apache.org/jira/browse/HBASE-14061) | *Major* | **Support CF-level Storage Policy**
16015
16016 After HBASE-14061, the storage policy for HFiles can be set through the "hbase.hstore.block.storage.policy" configuration, and a CF-level setting can override the setting from the configuration file. Currently supported storage policies include ALL\_SSD/ONE\_SSD/HOT/WARM/COLD; refer to http://hadoop.apache.org/docs/r2.6.0/hadoop-project-dist/hadoop-hdfs/ArchivalStorage.html for more details.
16017
16018 For example, to create a table with two families, "f1" with the "ALL\_SSD" storage policy and "f2" with "ONE\_SSD", use the following command in the hbase shell:
16019 create 'table',{NAME=\>'f1',STORAGE\_POLICY=\>'ALL\_SSD'},{NAME=\>'f2',STORAGE\_POLICY=\>'ONE\_SSD'}
16020
16021 The configuration can also be set through the CONFIGURATION attribute, like all other configurations:
16022 create 'table',{NAME=\>'f1',CONFIGURATION=\>{'hbase.hstore.block.storage.policy'=\>'ONE\_SSD'}}
16023
16024
16025 ---
16026
16027 * [HBASE-17337](https://issues.apache.org/jira/browse/HBASE-17337) | *Major* | **list replication peers request should be routed through master**
16028
16029 List replication peers requests will be routed through master.
16030
16031
16032 ---
16033
16034 * [HBASE-15172](https://issues.apache.org/jira/browse/HBASE-15172) | *Major* | **Support setting storage policy in bulkload**
16035
16036 After HBASE-15172/HBASE-19016, the storage policy for bulkload can be set through the "hbase.hstore.block.storage.policy" property, or through "hbase.hstore.block.storage.policy.\<family\_name\>" for a specified family. Supported storage policies include ALL\_SSD, ONE\_SSD, HOT, WARM, COLD, etc.
16037
16038
16039 ---
16040
16041 * [HBASE-17336](https://issues.apache.org/jira/browse/HBASE-17336) | *Major* | **get/update replication peer config requests should be routed through master**
16042
16043 Get/update replication peer config requests will be routed through master.
16044
16045
16046 ---
16047
16048 * [HBASE-17320](https://issues.apache.org/jira/browse/HBASE-17320) | *Major* | **Add inclusive/exclusive support for startRow and endRow of scan**
16049
16050 You can now specify the inclusiveness of startRow and stopRow for a scan using the new methods withStartRow(byte[] startRow, boolean inclusive) and withStopRow(byte[] stopRow, boolean inclusive). The old setStartRow and setStopRow methods, and the constructors, are marked as deprecated because of a strange behavior: the stopRow is included implicitly if startRow equals stopRow. This was used to support get scans in the past. Use withStartRow and withStopRow instead.
16051
16052 For developers, ConnectionUtils.createClosestRowBefore is also marked as deprecated, because the row returned by this method is only very close to the current row, not the closest. Avoid using this method in the future.
16053
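The explicit inclusiveness and the old implicit rule for equal start and stop rows can be compared with a small stand-alone model. This is a sketch, not the HBase client: `RowRange`, `contains`, `containsLegacy` and `bytes` are hypothetical names, rows are compared as unsigned bytes in lexicographic order, and non-empty start and stop rows are assumed.

```java
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

/** Toy model of scan row-range membership; illustrative only, not the HBase implementation. */
final class RowRange {
    /** Membership with explicit inclusiveness, mirroring the withStartRow/withStopRow flags. */
    static boolean contains(byte[] row, byte[] start, boolean startInclusive,
                            byte[] stop, boolean stopInclusive) {
        int cmpStart = Arrays.compareUnsigned(row, start);
        if (cmpStart < 0 || (cmpStart == 0 && !startInclusive)) {
            return false;
        }
        int cmpStop = Arrays.compareUnsigned(row, stop);
        return cmpStop < 0 || (cmpStop == 0 && stopInclusive);
    }

    /** Old rule: start inclusive, stop exclusive, except that start == stop includes the stop row. */
    static boolean containsLegacy(byte[] row, byte[] start, byte[] stop) {
        boolean getScan = Arrays.equals(start, stop);
        return contains(row, start, true, stop, getScan);
    }

    /** Helper for building row keys from strings. */
    static byte[] bytes(String s) {
        return s.getBytes(StandardCharsets.UTF_8);
    }

    public static void main(String[] args) {
        System.out.println(contains(bytes("c"), bytes("a"), true, bytes("c"), false));
        System.out.println(contains(bytes("c"), bytes("a"), true, bytes("c"), true));
        System.out.println(containsLegacy(bytes("a"), bytes("a"), bytes("a")));
    }
}
```

In this model the legacy rule returns the row only because start and stop are equal, which is the implicit case the new methods make explicit.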
16054
16055 ---
16056
16057 * [HBASE-17314](https://issues.apache.org/jira/browse/HBASE-17314) | *Major* | **Limit total buffered size for all replication sources**
16058
16059 Adds a configuration "replication.total.buffer.quota" to limit the total size of buffered entries across all replication peers. It prevents the server from running out of memory when there are many peers. The default value is 256MB.
16060
16061
16062 ---
16063
16064 * [HBASE-17174](https://issues.apache.org/jira/browse/HBASE-17174) | *Minor* | **Refactor the AsyncProcess, BufferedMutatorImpl, and HTable**
16065
16066 + cleans up some unused code
16067 + allows sharing the pool between BufferedMutatorImpl instances
16068 + set "hbase.client.request.controller.impl" in Configuration to the name of an alternate RequestController (traffic control) implementation class
16069 + the default RequestController implementation is SimpleRequestController
16070 + set "hbase.client.log.detail.period.ms" to control the period at which the logger is called while waiting for tasks to complete
16071
16072
16073 ---
16074
16075 * [HBASE-17335](https://issues.apache.org/jira/browse/HBASE-17335) | *Major* | **enable/disable replication peer requests should be routed through master**
16076
16077 Enable/Disable replication peer requests will be routed through master.
16078
16079
16080 ---
16081
16082 * [HBASE-5401](https://issues.apache.org/jira/browse/HBASE-5401) | *Major* | **PerformanceEvaluation generates 10x the number of expected mappers**
16083
16084 Changes how many tasks PE runs when clients are mapreduce. Now tasks == client count. Previously we hardcoded ten tasks per client instance.
16085
16086
16087 ---
16088
16089 * [HBASE-11392](https://issues.apache.org/jira/browse/HBASE-11392) | *Critical* | **add/remove peer requests should be routed through master**
16090
16091 Add/Remove replication peer requests will be routed through master. ReplicationAdmin is marked as deprecated.
16092
16093
16094 ---
16095
16096 * [HBASE-15924](https://issues.apache.org/jira/browse/HBASE-15924) | *Major* | **Enhance hbase services autorestart capability to hbase-daemon.sh**
16097
16098 HBase services can now be started with the "autostart/autorestart" feature enabled in a controlled fashion: "--autostart-window-size" defines the window period, and "--autostart-window-retry-limit" defines the number of times the hbase services may be restarted after being killed or terminated abnormally within the provided window period.
16099
16100 The following cases are supported with "autostart/autorestart":
16101
16102 a) --autostart-window-size=0 and --autostart-window-retry-limit=0 indicates an infinite window size and no retry limit.
16103 b) Not providing the args defaults to a).
16104 c) --autostart-window-size=0 and --autostart-window-retry-limit=\<positive value\> makes the autostart process bail out once the retry limit is exceeded, irrespective of the window period.
16105 d) --autostart-window-size=\<x\> and --autostart-window-retry-limit=\<y\> makes the autostart process bail out once the retry limit "y" is exceeded within the last window period "x".
16106
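The four cases above can be modelled as a sliding-window counter. This is only a sketch of the rule as described, not the logic in hbase-daemon.sh: `AutoRestartPolicy` and `tryRestart` are hypothetical names, timestamps and window size share an unspecified unit, and a retry limit of 0 combined with a non-zero window is assumed to mean "no limit".

```java
import java.util.ArrayDeque;
import java.util.Deque;

/** Toy model of the autorestart bail-out rule; illustrative only. */
final class AutoRestartPolicy {
    private final long windowSize;  // 0 means an infinite window
    private final int retryLimit;   // 0 means no retry limit (assumption for windowSize > 0)
    private final Deque<Long> restartTimes = new ArrayDeque<>();

    AutoRestartPolicy(long windowSize, int retryLimit) {
        this.windowSize = windowSize;
        this.retryLimit = retryLimit;
    }

    /** Returns true if a restart at time 'now' is allowed, false if the process should bail out. */
    boolean tryRestart(long now) {
        if (retryLimit == 0) {
            return true;
        }
        if (windowSize > 0) {
            // Forget restarts that fall outside the last window period.
            while (!restartTimes.isEmpty() && now - restartTimes.peekFirst() >= windowSize) {
                restartTimes.removeFirst();
            }
        }
        if (restartTimes.size() >= retryLimit) {
            return false;
        }
        restartTimes.addLast(now);
        return true;
    }

    public static void main(String[] args) {
        AutoRestartPolicy policy = new AutoRestartPolicy(10, 2);
        for (long t : new long[] {0, 1, 2, 20}) {
            System.out.println("t=" + t + " restart allowed: " + policy.tryRestart(t));
        }
    }
}
```

With a window of 10 and a limit of 2, the third restart inside the window is refused, and restarts are allowed again once the earlier ones have aged out of the window.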
16107
16108 ---
16109
16110 * [HBASE-17331](https://issues.apache.org/jira/browse/HBASE-17331) | *Minor* | **Avoid busy waiting in ThrottledInputStream**
16111
16112 For each read(), the old ThrottledInputStream slept, woke and checked many times to control the throughput. After this patch, ThrottledInputStream sleeps, wakes and checks only once per read(), which reduces CPU usage.
16113
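Sleeping once instead of polling is possible when the required sleep time is computed up front from the bytes read so far and the time already elapsed. The following is a minimal sketch of that calculation under assumed names (`ThrottleMath`, `sleepTimeMs`); it is not claimed to be the exact code in ThrottledInputStream.

```java
/** Toy model of a one-shot throttle calculation; illustrative only. */
final class ThrottleMath {
    /**
     * Returns how many milliseconds to sleep so that bytesRead, spread over
     * elapsedMs plus the sleep, does not exceed maxBytesPerSec.
     */
    static long sleepTimeMs(long bytesRead, long maxBytesPerSec, long elapsedMs) {
        if (maxBytesPerSec <= 0) {
            throw new IllegalArgumentException("maxBytesPerSec must be positive");
        }
        // Minimum wall-clock time the transfer may take at the allowed rate.
        long minDurationMs = (long) Math.ceil(bytesRead * 1000.0 / maxBytesPerSec);
        return Math.max(0L, minDurationMs - elapsedMs);
    }

    public static void main(String[] args) {
        System.out.println(sleepTimeMs(2000, 1000, 500));
        System.out.println(sleepTimeMs(500, 1000, 1000));
    }
}
```

A caller would sleep for the returned duration once and then continue reading, rather than waking repeatedly to re-check the rate.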
16114
16115 ---
16116
16117 * [HBASE-17296](https://issues.apache.org/jira/browse/HBASE-17296) | *Major* | **Provide per peer throttling for replication**
16118
16119 Provides per-peer throttling for replication. Adds a bandwidth upper limit to ReplicationPeerConfig and a new shell command set\_peer\_bandwidth to update the bandwidth as needed.
16120
16121
16122 ---
16123
16124 * [HBASE-17277](https://issues.apache.org/jira/browse/HBASE-17277) | *Major* | **Allow alternate BufferedMutator implementation**
16125
16126 Specify the name of an alternate BufferedMutator implementation by either:
16127
16128  \* Setting "hbase.client.bufferedmutator.classname" to the name of the alternate implementation class in Configuration
16129  \* Or, by setting BufferedMutatorParams#implementationClassName and passing the amended BufferedMutatorParams when calling Connection#getBufferedMutator.
16130
16131
16132 ---
16133
16134 * [HBASE-17294](https://issues.apache.org/jira/browse/HBASE-17294) | *Major* | **External Configuration for Memory Compaction**
16135
16136 This patch provides a single external knob to control memstore compaction. It also makes in-memory compaction with the BASIC policy our default (AFTERWORD: in-memory compaction as default was undone in HBASE-17333 because of test failures; it will be re-enabled in a later, dedicated issue).
16137
16138 Possible memstore compaction policies are:
16139 (1) None - no memory compaction, when size threshold is exceeded data is flushed to disk
16140 (2) Basic policy applies optimizations which modify the index to a more compacted representation. This is beneficial in all access patterns. The smaller the cells are the greater the benefit of this policy. This is the default policy.
16141 (3) Eager - in addition to compacting the index representation as the basic policy, eager policy eliminates duplication while the data is still in memory (much like the on-disk compaction does after the data is flushed to disk). This policy is most useful for applications with high data churn or small working sets.
16142
16143 The memory compaction policy can be set at the column family level at table creation time:
16144 {code}
16145 create '\<tablename\>',
16146    {NAME =\> '\<cfname\>',
16147     IN\_MEMORY\_COMPACTION =\> '\<NONE\|BASIC\|EAGER\>'}
16148 {code}
16149 or as a property at the global configuration level by setting the property in hbase-site.xml, with BASIC being the default value:
16150 {code}
16151 \<property\>
16152         \<name\>hbase.hregion.compacting.memstore.type\</name\>
16153         \<value\>\<NONE\|BASIC\|EAGER\>\</value\>
16154 \</property\>
16155 {code}
16156 The values used in this property can change as memstore compaction policies evolve over time.
16157
16158
16159 ---
16160
16161 * [HBASE-16336](https://issues.apache.org/jira/browse/HBASE-16336) | *Major* | **Removing peers seems to be leaving spare queues**
16162
16163 Adds a ReplicationZKNodeCleaner that periodically checks for and deletes useless replication queue zk nodes belonging to peers that no longer exist.
16164
16165
16166 ---
16167
16168 * [HBASE-17272](https://issues.apache.org/jira/browse/HBASE-17272) | *Major* | **Doc how to run Standalone HBase over an HDFS instance; all daemons in one JVM but persisting to an HDFS instance**
16169
16170 Adds section at http://hbase.apache.org/book.html#standalone.over.hdfs on how to make standalone persist to an hdfs instance (where standalone is all daemons in the one jvm).
16171
16172
16173 ---
16174
16175 * [HBASE-16700](https://issues.apache.org/jira/browse/HBASE-16700) | *Minor* | **Allow for coprocessor whitelisting**
16176
16177 Provides ability to restrict table coprocessors based on HDFS path whitelist. (Particularly useful for allowing Phoenix coprocessors but not arbitrary user created coprocessors.)
16178
16179
16180 ---
16181
16182 * [HBASE-17221](https://issues.apache.org/jira/browse/HBASE-17221) | *Major* | **Abstract out an interface for RpcServer.Call**
16183
16184 Provide an interface RpcCall on the server side.
16185 RpcServer.Call is now marked as @InterfaceAudience.Private, and implements the interface RpcCall.
16186
16187
16188 ---
16189
16190 * [HBASE-16119](https://issues.apache.org/jira/browse/HBASE-16119) | *Major* | **Procedure v2 - Reimplement merge**
16191
16192 The merge region logic is controlled by the master in 2.0.0 (in 1.x, the core merge region logic is on the region server side). The RS-side coprocessors related to merge region are no-ops in 2.0.0 and later releases; therefore, this is an incompatible change. Users need to move the CP logic to the new master CP and register it.
16193
16194 A new mergeRegionsAsync() API is added in client.  The existing mergeRegions() API will call the new API so client does not have to change its code.
16195
16196
16197 ---
16198
16199 * [HBASE-17112](https://issues.apache.org/jira/browse/HBASE-17112) | *Major* | **Prevent setting timestamp of delta operations the same as previous value's**
16200
16201 Before this issue, two concurrent Increments/Appends done in the same millisecond, or the RS's clock going backwards, could produce two results with the same TS, which is not friendly to versioning and gives wrong results in the slave cluster if replication is out of order.
16202 After this issue, the result of an Increment/Append always has an increasing TS, so there is no inconsistency in replication for these operations. There is a rare case, however: if there is a Delete in the same millisecond, the later result cannot be masked by this Delete. This can be fixed once we have new semantics in which a previous Delete never masks a later Put even if the Delete's timestamp is higher.
16203
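One simple way to guarantee an always-increasing timestamp is to take the later of the current time and the previous value's timestamp plus one. The sketch below illustrates that rule only; `DeltaTimestamp` and `next` are hypothetical names, and the rule is an assumption about how such a guarantee can be met rather than a quote of the patch.

```java
/** Toy model of picking a strictly increasing timestamp; illustrative only. */
final class DeltaTimestamp {
    /** Returns a timestamp that is never less than now and always greater than previousTs. */
    static long next(long nowMs, long previousTs) {
        return Math.max(nowMs, previousTs + 1);
    }

    public static void main(String[] args) {
        System.out.println(next(100L, 100L)); // same millisecond as the previous value
        System.out.println(next(100L, 150L)); // clock went backwards
        System.out.println(next(200L, 100L)); // normal case
    }
}
```

In the first two calls the clock alone would repeat or go behind the previous timestamp, so the previous timestamp plus one is used instead.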
16204
16205 ---
16206
16207 * [HBASE-17181](https://issues.apache.org/jira/browse/HBASE-17181) | *Minor* | **Let HBase thrift2 support TThreadedSelectorServer**
16208
16209 Add TThreadedSelectorServer support for HBase Thrift2
16210
16211
16212 ---
16213
16214 * [HBASE-17178](https://issues.apache.org/jira/browse/HBASE-17178) | *Major* | **Add region balance throttling**
16215
16216 Adds region balance throttling. The master executes one region balance plan per balance interval, where the interval equals the max balancing time divided by the number of region balance plans. Also introduces a new config hbase.master.balancer.maxRitPercent to protect availability. If it is set to 0.01, at most 1% of regions are in transition while balancing, so the cluster's availability is at least 99% while balancing.
16217
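The arithmetic behind the two limits can be written out as follows. This is a sketch with hypothetical names (`BalanceThrottle`, `intervalMs`, `maxRegionsInTransition`), and rounding down is an assumption of the sketch, not a statement about the balancer.

```java
/** Toy model of the balance throttling arithmetic; illustrative only. */
final class BalanceThrottle {
    /** Interval between plans: max balancing time divided by the number of plans. */
    static long intervalMs(long maxBalancingTimeMs, int planCount) {
        if (planCount <= 0) {
            throw new IllegalArgumentException("planCount must be positive");
        }
        return maxBalancingTimeMs / planCount;
    }

    /** Upper bound on regions in transition for a given maxRitPercent (rounded down). */
    static int maxRegionsInTransition(int totalRegions, double maxRitPercent) {
        return (int) Math.floor(totalRegions * maxRitPercent);
    }

    public static void main(String[] args) {
        System.out.println(intervalMs(600_000L, 20));
        System.out.println(maxRegionsInTransition(200, 0.5));
    }
}
```

For example, 20 plans spread over a max balancing time of 600000 ms give one plan every 30000 ms.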
16218
16219 ---
16220
16221 * [HBASE-15786](https://issues.apache.org/jira/browse/HBASE-15786) | *Major* | **Create DBB backed MSLAB pool**
16222
16223 Added a new config hbase.regionserver.offheap.global.memstore.size, which specifies the global off heap limit that all memstores can use. When this config is in use, MSLAB should be turned ON, and the entire size is used for the MSLAB pool, which then creates and pools off heap chunks. The system behaves as if it is working with off heap memstores. When this config has a valid value and MSLAB is turned OFF, the system ignores the off heap size, continues to use the global max heap space % for memstores, and works with on heap memstores.
16224
16225
16226 ---
16227
16228 * [HBASE-17132](https://issues.apache.org/jira/browse/HBASE-17132) | *Major* | **Cleanup deprecated code for WAL**
16229
16230 Remove HLogKey and related classes and methods. Remove SequenceFile based log reader and writer. WALObserver and RegionObserver are changed so this is an incompatible change.
16231
16232
16233 ---
16234
16235 * [HBASE-16169](https://issues.apache.org/jira/browse/HBASE-16169) | *Major* | **Make RegionSizeCalculator scalable**
16236
16237 Added a couple of APIs to Admin.java:
16238
16239 Returns region load map of all regions hosted on a region server
16240 Map\<byte[], RegionLoad\> getRegionLoad(ServerName sn) throws IOException;
16241
16242 Returns region load map of all regions of a table hosted on a region server
16243 Map\<byte[], RegionLoad\> getRegionLoad(ServerName sn, TableName tableName) throws IOException
16244
16245 Added an API to region server:
16246
16247 public GetRegionLoadResponse getRegionLoad(RpcController controller,
16248     GetRegionLoadRequest request) throws ServiceException;
16249
16250 The primary intention is to use this API for RegionSizeCalculator and not rely on the Master for ClusterStatus. On large clusters, ClusterStatus() can take a long time. If the Master is down or busy, some jobs time out or fail. Other possible uses:
16251 1. If there is a lighter version of the GetClusterStatus API (i.e. without the ServerLoad for each RS), custom maintenance tools can work better. Currently ClusterStatus is heavy. With the new APIs, each API's payload is smaller and distributed, so custom tools can call getRegionLoad() when needed and get more accurate results. This helps with large clusters. For tools that don't need RegionLoad, the lighter version of the API is sufficient.
16252 2. Another use case is a tool like RSTop, since we can see selective metrics at the region level (possibly even deltas between each RPC to the server).
16253
16254
16255 ---
16256
16257 * [HBASE-15788](https://issues.apache.org/jira/browse/HBASE-15788) | *Major* | **Use Offheap ByteBuffers from BufferPool to read RPC requests.**
16258
16259 Uses ByteBuffers from the ByteBufferPool to read the request bytes at the server. When the size of the request is smaller than 1/6th of the size of a BB in the pool, the pool is not used; instead the request is read into an on-demand created, properly sized on-heap ByteBuffer.
16260
16261
16262 ---
16263
16264 * [HBASE-17046](https://issues.apache.org/jira/browse/HBASE-17046) | *Major* | **Add 1.1 doc to hbase.apache.org**
16265
16266 Adds a 1.1. item to our 'Documentation and API' tab. Gives access to 1.1 APIs, XRef, etc.
16267
16268
16269 ---
16270
16271 * [HBASE-16962](https://issues.apache.org/jira/browse/HBASE-16962) | *Major* | **Add readPoint to preCompactScannerOpen() and preFlushScannerOpen() API**
16272
16273 The following RegionObserver methods are deprecated
16274
16275 InternalScanner preFlushScannerOpen(final ObserverContext\<RegionCoprocessorEnvironment\> c,
16276     final Store store, final KeyValueScanner memstoreScanner, final InternalScanner s)
16277     throws IOException;
16278
16279 InternalScanner preCompactScannerOpen(final ObserverContext\<RegionCoprocessorEnvironment\> c,
16280     final Store store, List\<? extends KeyValueScanner\> scanners, final ScanType scanType,
16281     final long earliestPutTs, final InternalScanner s, CompactionRequest request)
16282
16283 Instead, use the following methods:
16284
16285 InternalScanner preFlushScannerOpen(final ObserverContext\<RegionCoprocessorEnvironment\> c,
16286     final Store store, final KeyValueScanner memstoreScanner, final InternalScanner s,
16287     final long readPoint) throws IOException;
16288
16289 InternalScanner preCompactScannerOpen(final ObserverContext\<RegionCoprocessorEnvironment\> c,
16290     final Store store, List\<? extends KeyValueScanner\> scanners, final ScanType scanType,
16291     final long earliestPutTs, final InternalScanner s, final CompactionRequest request,
16292     final long readPoint) throws IOException
16293
16294
16295 ---
16296
16297 * [HBASE-17017](https://issues.apache.org/jira/browse/HBASE-17017) | *Major* | **Remove the current per-region latency histogram metrics**
16298
16299 Removes the per-region level metrics (get size, get time, scan size and scan time histograms) that were exposed before. Per-region histogram metrics with 1000+ regions cause millions of objects to be allocated on heap. The patch introduces getCount and scanCount as counters rather than histograms. Other per-region level metrics are kept as they are.
16300
16301
16302 ---
16303
16304 * [HBASE-16955](https://issues.apache.org/jira/browse/HBASE-16955) | *Major* | **Fixup precommit protoc check to do new distributed protos and pb 3.1.0 build**
16305
16306 The test environment no longer has to have protoc (2.5 and 3.1) available. A small adjustment was needed in the yetus protoc build, but otherwise all works.
16307
16308
16309 ---
16310
16311 * [HBASE-17050](https://issues.apache.org/jira/browse/HBASE-17050) | *Minor* | **Upgrade Apache CLI version from 1.2 to 1.3.1**
16312
16313 Upgrade Apache CLI version from 1.2 to 1.3.1.
16314
16315 These are a few of the important changes included in this update:
16316 - HelpFormatter now prints command-line options in the same order as they
16317   have been added. Fixes CLI-212.
16318 - Standard help text now shows mandatory arguments also for the first
16319   option. Fixes CLI-186.
16320 - A new parser is available: DefaultParser. It combines the features of the
16321   GnuParser and the PosixParser. It also provides additional features like
16322   partial matching for the long options, and long options without separator
16323   (e.g. the JVM memory settings: -Xmx512m). This new parser deprecates
16324   the previous ones. Fixes CLI-161, CLI-167, CLI-181.
16325
16326 For the full list of changes:
16327   https://commons.apache.org/proper/commons-cli/changes-report.html#a1.3
16328
16329
16330 ---
16331
16332 * [HBASE-15513](https://issues.apache.org/jira/browse/HBASE-15513) | *Major* | **hbase.hregion.memstore.chunkpool.maxsize is 0.0 by default**
16333
16334 MSLAB chunk pool is on by default in hbase-2.0.0.
16335
16336
16337 ---
16338
16339 * [HBASE-16972](https://issues.apache.org/jira/browse/HBASE-16972) | *Major* | **Log more details for Scan#next request when responseTooSlow**
16340
16341 **WARNING: No release note provided for this change.**
16342
16343
16344 ---
16345
16346 * [HBASE-17014](https://issues.apache.org/jira/browse/HBASE-17014) | *Minor* | **Add clearly marked starting and shutdown log messages for all services.**
16347
16348 Delimits START, STOP, and ABORT messages with '\*\*\*\*\*' so they are clearly marked in the logs.
16349
16350
16351 ---
16352
16353 * [HBASE-16765](https://issues.apache.org/jira/browse/HBASE-16765) | *Critical* | **New SteppingRegionSplitPolicy, avoid too aggressive spread of regions for small tables.**
16354
16355 Introduces a new split policy: SteppingSplitPolicy.
16356 This uses a simple step function to split a region at (by default) 2 x flushSize when no other region of the same table is seen on the region server, or at max-file-size when one or more other regions of the same table are seen.
16357
16358 In HBase 2.0 this is going to be the default. In previous versions it can be configured.
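
The step function described above can be sketched in a few lines. This is a minimal, self-contained illustration and not the SteppingSplitPolicy source; the class name `SteppingSplitSketch`, the method `sizeToCheck`, and the sizes used in `main` are assumptions made for this example.

```java
// Hypothetical sketch of the step function; not HBase source code.
final class SteppingSplitSketch {

  /**
   * Returns the size (in bytes) a region must reach before it is split.
   *
   * @param regionsOfTableOnServer regions of the table hosted on this server, this one included
   * @param flushSize memstore flush size in bytes
   * @param maxFileSize configured maximum file size in bytes
   */
  static long sizeToCheck(int regionsOfTableOnServer, long flushSize, long maxFileSize) {
    // No other region of the table on this server: split early, at 2 x flushSize.
    // Otherwise wait until the maximum file size is reached.
    return regionsOfTableOnServer == 1 ? 2 * flushSize : maxFileSize;
  }

  public static void main(String[] args) {
    long flushSize = 128L * 1024 * 1024;          // example value: 128 MB
    long maxFileSize = 10L * 1024 * 1024 * 1024;  // example value: 10 GB
    System.out.println("single region splits at " + sizeToCheck(1, flushSize, maxFileSize));
    System.out.println("otherwise splits at " + sizeToCheck(3, flushSize, maxFileSize));
  }
}
```

The effect is that a small table spreads to a second region quickly, and after that its regions grow to full size before splitting again.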
16359
16360
16361 ---
16362
16363 * [HBASE-16608](https://issues.apache.org/jira/browse/HBASE-16608) | *Major* | **Introducing the ability to merge ImmutableSegments without copy-compaction or SQM usage**
16364
16365 The index-compaction and data-compaction variants of CompactingMemStore are introduced. In both types the active (mutable) segment is periodically flushed in memory and added as an immutable segment to the compaction pipeline. A CompactingMemStore of index-compaction type merges all immutable segments of the compaction pipeline into one. The merging of N segments is explained below. A CompactingMemStore of data-compaction type compacts all immutable segments of the compaction pipeline into one. After the merge/compaction, the old segments in the compaction pipeline are replaced with the single new one.
16366
16367 Before explaining the process of merging N old segments into a new one, note that the segment structure includes an ordered index that allows the cell data to be traversed efficiently. The merge copies the ordered indexes of the old segments into one ordered index of the new segment. No data is copied and no cells are filtered. Alternatively, in the process of compacting N old segments into a new one, both data and index are copied. The old cells are filtered, meaning that upon compaction unused versions of the cells are not copied, so the new segment has less data than all the old ones.
16368
16369 This issue introduces only the merging ability and simplifies the user intervention for switching between types. The previous CompactingMemStore structure was added by HBASE-16420 and HBASE-16421. The future refinements of the policy or merging/compacting will come in HBASE-16417.
16370
16371 To create a table with CompactingMemStore as its MemStore, use:
16372 create '\<tablename\>', {NAME =\> '\<cfname\>', IN\_MEMORY\_COMPACTION =\> true}
16373 IN\_MEMORY\_COMPACTION defaults to false, so a table created as follows will have the familiar DefaultMemStore as its MemStore:
16374 create '\<tablename\>', {NAME =\> '\<cfname\>'}
16375
16376 The default type of CompactingMemStore is index-compaction. To change it to data-compaction, add the following to hbase-site.xml:
16377 \<property\>
16378     \<name\>hbase.hregion.compacting.memstore.type\</name\>
16379     \<value\>data-compaction\</value\>
16380 \</property\>
16381
16382 in addition to creating the table as follows:
16383 create '\<tablename\>', {NAME =\> '\<cfname\>', IN\_MEMORY\_COMPACTION =\> true}
16384
16385
16386 ---
16387
16388 * [HBASE-16747](https://issues.apache.org/jira/browse/HBASE-16747) | *Major* | **Track memstore data size and heap overhead separately**
16389
16390 Marked as an incompatible change because the behavior of the region flush decision changes. Previously, the default flush size of 128 MB per region was tracked against both the actual data bytes and the overhead of these cells in memstore memory (overhead due to Cell Java objects and CSLM entries).  As part of this jira we keep track of cell data size only at the region level, so a 128 MB flush size means 128 MB of cell data bytes (key + value + ...).
16391
16392 Globally we track cell data size and heap overhead separately and consider both for forced flushes. We still do not allow all memstores together to over-consume heap memory; this is as before. Only the way of tracking has changed.
16393
16394
16395 ---
16396
16397 * [HBASE-16974](https://issues.apache.org/jira/browse/HBASE-16974) | *Minor* | **Update os-maven-plugin to 1.4.1.final+ for building shade file on RHEL/CentOS**
16398
16399 Upgrades the os-maven-plugin mvn extension, which figures out the OS we are running on, from 1.4 to 1.5.
16400
16401
16402 ---
16403
16404 * [HBASE-16952](https://issues.apache.org/jira/browse/HBASE-16952) | *Major* | **Replace hadoop-maven-plugins with protobuf-maven-plugin for building protos**
16405
16406 Simplifies .proto manipulations. One step only now -- no need to keep pom.xml listing up to date with the protobuf protos directory content -- and no need to preinstall protoc; mvn does it all for you now.
16407
16408
16409 ---
16410
16411 * [HBASE-14551](https://issues.apache.org/jira/browse/HBASE-14551) | *Minor* | **Procedure v2 - Reimplement split**
16412
16413 Moved the split region logic to the Master; most of the split region coprocessor hooks are now in the Master.  Dependent projects such as Phoenix need to change accordingly.
16414
16415
16416 ---
16417
16418 * [HBASE-15789](https://issues.apache.org/jira/browse/HBASE-15789) | *Major* | **PB related changes to work with offheap**
16419
16420 This issue adds a patch to our checked-in internal, shaded protobuf, but it also adds a general means of applying patches to our version of protobuf. Patches found in the new src/main/patches directory are all applied as the last task when you run a build with the -Pcompile-protobuf profile under the hbase-protocol-shaded module. This commit also includes our first patch to protobuf; it adds ByteInput to mimic pb3.1's ByteOutput (src/main/patches/HBASE-15789\_V2.patch attached here).
16421
16422
16423 ---
16424
16425 * [HBASE-16930](https://issues.apache.org/jira/browse/HBASE-16930) | *Major* | **AssignmentManager#checkWals() function can recur infinitely**
16426
16427 Fixed potential infinite recursion in AssignmentManager.checkWals().
16428
16429
16430 ---
16431
16432 * [HBASE-16463](https://issues.apache.org/jira/browse/HBASE-16463) | *Major* | **Improve transparent table/CF encryption with Commons Crypto**
16433
16434 Improve transparent table/CF encryption with Commons Crypto. The change introduces a new optional CryptoCipherProvider (CommonsCryptoAES) for transparent table/CF encryption. Encryption performance can be accelerated by hardware in modern CPUs (AES-NI). This feature can be enabled by setting the configuration "hbase.crypto.cipherprovider" to "org.apache.hadoop.hbase.io.crypto.CryptoCipherProvider" in hbase-site.xml. For detailed information about transparent table/CF encryption, including configuration examples, see the Security section of the HBase manual.
16435
16436
16437 ---
16438
16439 * [HBASE-16414](https://issues.apache.org/jira/browse/HBASE-16414) | *Major* | **Improve performance for RPC encryption with Apache Common Crypto**
16440
16441 With secure RPC and encryption enabled, Apache Commons Crypto is introduced to do the encryption/decryption; it supports both JCE Cipher and OpenSSL Cipher. Adds new configs "hbase.rpc.crypto.encryption.aes.enabled", which defaults to false, and "hbase.rpc.crypto.encryption.aes.cipher.class", which defaults to "org.apache.commons.crypto.cipher.JceCipher" to support JCE Cipher; it can also be set to "org.apache.hadoop.crypto.OpensslCipher" to support OpenSSL Cipher.
16442
16443
16444 ---
16445
16446 * [HBASE-16721](https://issues.apache.org/jira/browse/HBASE-16721) | *Critical* | **Concurrency issue in WAL unflushed seqId tracking**
16447
16448 Fixed a bug in sequenceId tracking for the WALs that caused WAL files to accumulate without being deleted due to a rare race condition.
16449
16450
16451 ---
16452
16453 * [HBASE-16834](https://issues.apache.org/jira/browse/HBASE-16834) | *Major* | **Add AsyncConnection support for ConnectionFactory**
16454
16455 Add createAsyncConnection method to ConnectionFactory for creating AsyncConnection. The default implementation is org.apache.hadoop.hbase.client.AsyncConnectionImpl. You can use 'hbase.client.async.connection.impl' to plug in your own AsyncConnection implementation.
16456
16457
16458 ---
16459
16460 * [HBASE-16729](https://issues.apache.org/jira/browse/HBASE-16729) | *Trivial* | **Define the behavior of (default) empty FilterList**
16461
16462 An empty filter list now behaves the same as when no filter is added. This is a behavioral change for those who rely on the previous behavior of an empty filter list.
16463
16464
16465 ---
16466
16467 * [HBASE-16799](https://issues.apache.org/jira/browse/HBASE-16799) | *Major* | **CP exposed Store should not expose unwanted APIs**
16468
16469 The following APIs are removed from the CP-exposed Store interface:
16470 upsert(Iterable\<Cell\> cells, long readpoint)
16471 add(Cell cell)
16472 add(Iterable\<Cell\> cells)
16473 replayCompactionMarker(CompactionDescriptor compaction, boolean pickCompactionFiles,  boolean removeFiles)
16474 assertBulkLoadHFileOk(Path srcPath)
16475 bulkLoadHFile(String srcPathStr, long sequenceId)
16476 bulkLoadHFile(StoreFileInfo fileInfo)
16477
16478
16479 ---
16480
16481 * [HBASE-15921](https://issues.apache.org/jira/browse/HBASE-15921) | *Major* | **Add first AsyncTable impl and create TableImpl based on it**
16482
16483 Adds AsyncConnection, AsyncTable and AsyncTableRegionLocator. For now, AsyncTable only supports get, put and delete, and the implementation of AsyncTableRegionLocator is actually synchronous.
16484
16485
16486 ---
16487
16488 * [HBASE-16664](https://issues.apache.org/jira/browse/HBASE-16664) | *Major* | **Timeout logic in AsyncProcess is broken**
16489
16490 This issue fixes three bugs:
16491 1.  The rpcTimeout configuration did not work for a single rpc call in AP.
16492 2.  The operationTimeout configuration did not work for multi-requests (batch, put) in AP.
16493 3.  setRpcTimeout and setOperationTimeout in HTable did not work for AP and BufferedMutator.
16494
16495
16496 ---
16497
16498 * [HBASE-16661](https://issues.apache.org/jira/browse/HBASE-16661) | *Minor* | **Add last major compaction age to per-region metrics**
16499
16500 This adds a new per-region metric named "lastMajorCompactionAge" for tracking time since the last major compaction ran on a given region.  If a major compaction has never run, the age will be equal to the current timestamp.
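
The "never run" case follows directly from how such an age is computed. Below is a minimal sketch, assuming the last major compaction timestamp is reported as 0 when no major compaction has happened; the class and method names are hypothetical and do not come from the HBase metrics code.

```java
// Hypothetical sketch; names are made up for illustration.
final class MajorCompactionAgeSketch {

  /**
   * @param nowMillis current time in milliseconds since the epoch
   * @param lastMajorCompactionTsMillis time of the last major compaction, 0 if none has run
   *        (assumption of this sketch)
   */
  static long lastMajorCompactionAge(long nowMillis, long lastMajorCompactionTsMillis) {
    return nowMillis - lastMajorCompactionTsMillis;
  }

  public static void main(String[] args) {
    long now = System.currentTimeMillis();
    // A region compacted one hour ago reports an age of one hour.
    System.out.println(lastMajorCompactionAge(now, now - 3_600_000L));
    // A region that was never major compacted reports the current timestamp as its age.
    System.out.println(lastMajorCompactionAge(now, 0L) == now);
  }
}
```

Alerting on this metric should therefore treat very large values as "never compacted" rather than as a real elapsed time.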
16501
16502
16503 ---
16504
16505 * [HBASE-16117](https://issues.apache.org/jira/browse/HBASE-16117) | *Major* | **Fix Connection leak in mapred.TableOutputFormat**
16506
16507 (This change will be irrelevant after HBASE-16774 lands).
16508 There is a subtle change with error handling when a connection is not able to connect to ZK.  Attempts to create a connection when ZK is not up will now fail immediately instead of silently creating and then failing on a subsequent HBaseAdmin call.
16509
16510
16511 ---
16512
16513 * [HBASE-15984](https://issues.apache.org/jira/browse/HBASE-15984) | *Critical* | **Given failure to parse a given WAL that was closed cleanly, replay the WAL.**
16514
16515 In some particular deployments, the Replication code believes it has
16516 reached EOF for a WAL prior to successfully parsing all bytes known to
16517 exist in a cleanly closed file.
16518
16519 If an EOF is detected due to parsing or other errors while there are still unparsed bytes before the end-of-file trailer, we now reset the WAL to the very beginning and attempt a clean read-through. Because we will retry these failures indefinitely, two additional changes are made to help with diagnostics:
16520
16521 \* On each retry attempt, a log message like the below will be emitted at the WARN level:
16522
16523       Processing end of WAL file '{}'. At position {}, which is too far away
16524       from reported file length {}. Restarting WAL reading (see HBASE-15983
16525       for details).
16526
16527 \*  Additional metrics measure the use of this recovery mechanism. They are described in the reference guide.
16528
16529
16530 ---
16531
16532 * [HBASE-16753](https://issues.apache.org/jira/browse/HBASE-16753) | *Minor* | **There is a mismatch between suggested Java version in hbase-env.sh**
16533
16534 Updates the comments and default values in a few scripts and docs to reflect our Java 1.8+ requirement.
16535
16536
16537 ---
16538
16539 * [HBASE-16567](https://issues.apache.org/jira/browse/HBASE-16567) | *Critical* | **Upgrade to protobuf-3.1.x**
16540
16541 Core is now up on protobuf 3.1.0 (Coprocessor Endpoints and REST are still on protobuf 2.5.0).
16542
16543
16544 ---
16545
16546 * [HBASE-15638](https://issues.apache.org/jira/browse/HBASE-15638) | *Critical* | **Shade protobuf**
16547
16548 Shade/relocate and include the protobuf we use internally. See the protobuf chapter in the refguide for more on how we use protobuf in hbase-2.0.0 and going forward.
16549
16550 See https://docs.google.com/document/d/1H4NgLXQ9Y9KejwobddCqaVMEDCGbyDcXtdF5iAfDIEk/edit# for how we arrived at this approach.
16551
16552 See http://mail-archives.apache.org/mod\_mbox/hbase-dev/201610.mbox/%3C07850EDD-7230-431B-9AB0-C5C91B105EEC%40gmail.com%3E for discussion around merging this change and of how we might revert if an alternative to this awkward patch presents itself; e.g. a hadoop with CLASSPATH isolation (and a means of dealing with Spark's use of protobuf 2.5.0, etc.).
16553
16554
16555 ---
16556
16557 * [HBASE-16264](https://issues.apache.org/jira/browse/HBASE-16264) | *Critical* | **Figure how to deal with endpoints and shaded pb**
16558
16559 Shade/relocate the protobuf hbase uses internally. All core now refers to the new module added in this patch, hbase-protocol-shaded. Coprocessor Endpoints carry on with references to the original hbase-protocol module. See the new chapter on protobufs in the book for how to proceed going forward.
16560
16561
16562 ---
16563
16564 * [HBASE-16672](https://issues.apache.org/jira/browse/HBASE-16672) | *Major* | **Add option for bulk load to always copy hfile(s) instead of renaming**
16565
16566 This issue adds a config, always.copy.files, to LoadIncrementalHFiles.
16567 When set to true, source hfiles would be copied. Meaning source hfiles would be kept after bulk load is done.
16568 Default value is false.
16569
16570
16571 ---
16572
16573 * [HBASE-16660](https://issues.apache.org/jira/browse/HBASE-16660) | *Critical* | **ArrayIndexOutOfBounds during the majorCompactionCheck in DateTieredCompaction**
16574
16575 "Please do not use DateTieredCompaction with Major Compaction unless you have a version with this. Otherwise your cluster will not compact any store files and you can end up running out of file descriptors." @churro morales
16576
16577
16578 ---
16579
16580 * [HBASE-16257](https://issues.apache.org/jira/browse/HBASE-16257) | *Blocker* | **Move staging dir to be under hbase root dir**
16581
16582 The HBase property 'hbase.bulkload.staging.dir' is deprecated and is ignored from HBase 2.0.  The staging directory now defaults to hbase.rootdir/staging automatically, with the correct permissions.
16583
16584
16585 ---
16586
16587 * [HBASE-16650](https://issues.apache.org/jira/browse/HBASE-16650) | *Major* | **Wrong usage of BlockCache eviction stat for heap memory tuning**
16588
16589 Changed tracking of the evictedBlocks count NOT to include evictions of blocks for a removed HFile. HFiles get removed after compaction.
16590
16591
16592 ---
16593
16594 * [HBASE-16294](https://issues.apache.org/jira/browse/HBASE-16294) | *Minor* | **hbck reporting "No HDFS region dir found" for replicas**
16595
16596 Fixed the warning message that hbck displayed about a missing region directory for non-default (non-primary) replicas.
16597
16598
16599 ---
16600
16601 * [HBASE-16540](https://issues.apache.org/jira/browse/HBASE-16540) | *Major* | **Scan should do additional validation on start and stop row**
16602
16603 Scan#setStartRow() and Scan#setStopRow() now validate the argument passed for each row key.  If the length of the byte[] passed exceeds Short.MAX\_VALUE, an IllegalArgumentException will be thrown.
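
The check amounts to a length comparison against Short.MAX\_VALUE (32767). The following is a minimal stand-alone sketch of that kind of validation, not the Scan source; `RowKeyLengthCheck` and `checkRow` are hypothetical names, and the exception message is made up.

```java
// Hypothetical sketch of the validation; not the HBase Scan implementation.
final class RowKeyLengthCheck {

  /** Returns the row unchanged, or throws if it is longer than Short.MAX_VALUE bytes. */
  static byte[] checkRow(byte[] row) {
    if (row != null && row.length > Short.MAX_VALUE) {
      throw new IllegalArgumentException(
          "Row key length " + row.length + " exceeds the maximum of " + Short.MAX_VALUE);
    }
    return row;
  }

  public static void main(String[] args) {
    checkRow(new byte[Short.MAX_VALUE]); // accepted: exactly at the limit
    try {
      checkRow(new byte[Short.MAX_VALUE + 1]); // one byte too long
    } catch (IllegalArgumentException e) {
      System.out.println("rejected: " + e.getMessage());
    }
  }
}
```

Callers that build start or stop rows from user input may want a similar guard of their own, so that an oversized key is reported before the Scan is constructed.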
16604
16605
16606 ---
16607
16608 * [HBASE-7612](https://issues.apache.org/jira/browse/HBASE-7612) | *Trivial* | **[JDK8] Replace use of high-scale-lib counters with intrinsic facilities**
16609
16610 org.apache.hadoop.hbase.util.Counter is deprecated now and will be removed in 3.0. Use LongAdder instead.
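
For code that needs to migrate, here is a small sketch of LongAdder from java.util.concurrent.atomic used as a concurrent counter. The class name `LongAdderExample` and the counting scenario are assumptions for illustration; only the LongAdder calls themselves (`increment()`, `add(long)`, `sum()`) are standard JDK API.

```java
import java.util.concurrent.atomic.LongAdder;

// Illustrative use of the JDK class that replaces the deprecated Counter.
final class LongAdderExample {

  /** Counts n single events plus one batch of extra events and returns the total. */
  static long countEvents(int n, long extra) {
    LongAdder requests = new LongAdder();
    for (int i = 0; i < n; i++) {
      requests.increment();   // add one
    }
    requests.add(extra);      // add an arbitrary amount
    return requests.sum();    // read the current total
  }

  public static void main(String[] args) {
    System.out.println(countEvents(10, 5));
  }
}
```

LongAdder is designed for counters that are updated by many threads and read comparatively rarely, which is the typical metrics use case.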
16611
16612
16613 ---
16614
16615 * [HBASE-16447](https://issues.apache.org/jira/browse/HBASE-16447) | *Critical* | **Replication by namespaces config in peer**
16616
16617 Support replication by namespaces config in peer.
16618 1. Setting a namespace in the peer config means that all tables in this namespace will be replicated.
16619 2. If the namespaces config is null, then the table-cfs config decides which tables' edits can be replicated. If the table-cfs config is null, then the namespaces config decides which tables' edits can be replicated.
16620 3. If you have already set a namespace in the peer config, then you can't add any table of this namespace to the peer config. If you have already set a table in the peer config, then you can't add this table's namespace to the peer config.
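
The three rules above can be read as a small decision procedure. The sketch below is an illustration under stated assumptions, not the replication code: tables are identified by plain "namespace:table" strings, column families are ignored, a peer with neither config set is assumed to replicate everything, and all class and method names are hypothetical.

```java
import java.util.Set;

// Hypothetical sketch of the rules above; not the HBase implementation.
final class PeerNamespaceFilterSketch {

  /** Rules 1 and 2: decide whether edits of namespace:table are replicated to the peer. */
  static boolean isReplicated(String namespace, String table,
      Set<String> namespaces, Set<String> tables) {
    boolean noNamespaces = namespaces == null || namespaces.isEmpty();
    boolean noTables = tables == null || tables.isEmpty();
    if (noNamespaces && noTables) {
      return true; // assumption of this sketch: nothing configured means no restriction
    }
    if (!noNamespaces && namespaces.contains(namespace)) {
      return true; // rule 1: the whole namespace is replicated
    }
    return !noTables && tables.contains(namespace + ":" + table);
  }

  /** Rule 3: a namespace and a table from that namespace must not both be configured. */
  static void checkNoOverlap(Set<String> namespaces, Set<String> tables) {
    if (namespaces == null || tables == null) {
      return;
    }
    for (String qualifiedTable : tables) {
      int sep = qualifiedTable.indexOf(':');
      String namespace = sep < 0 ? "default" : qualifiedTable.substring(0, sep);
      if (namespaces.contains(namespace)) {
        throw new IllegalArgumentException(
            "Table " + qualifiedTable + " conflicts with configured namespace " + namespace);
      }
    }
  }

  public static void main(String[] args) {
    Set<String> namespaces = java.util.Collections.singleton("ns1");
    Set<String> tables = java.util.Collections.singleton("ns2:t1");
    checkNoOverlap(namespaces, tables);
    System.out.println(isReplicated("ns1", "anything", namespaces, tables));
    System.out.println(isReplicated("ns2", "t2", namespaces, tables));
  }
}
```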
16621
16622
16623 ---
16624
16625 * [HBASE-16598](https://issues.apache.org/jira/browse/HBASE-16598) | *Major* | **Enable zookeeper useMulti always and clean up in HBase code**
16626
16627 Deprecate the configuration property 'hbase.zookeeper.useMulti'.
16628 useMulti will always be enabled. ZooKeeper 3.4.x and newer is required.
16629
16630 Internal:
16631
16632 ZKUtil#multiOrSequential(ZooKeeperWatcher zkw, List\<ZKUtilOp\> ops, boolean runSequentialOnMultiFailure) no longer checks 'hbase.zookeeper.useMulti' and always uses multi.
16633 It can still fall back to sequential operations if:
16634
16635 - runSequentialOnMultiFailure is true, and
16636 - on calling multi, we get a ZooKeeper exception that can be handled by a sequential call.
16637
16638
16639 ---
16640
16641 * [HBASE-16388](https://issues.apache.org/jira/browse/HBASE-16388) | *Major* | **Prevent client threads being blocked by only one slow region server**
16642
16643 Adds a new configuration, hbase.client.perserver.requests.threshold, to limit the maximum number of concurrent requests to one region server. If the user still creates new requests after reaching the limit, the client throws ServerTooBusyException and does not send the request to the server. This is a client-side feature that prevents the client's threads from being blocked by one slow region server, which would make the availability of the client much lower than the availability of the region servers.
16644
16645 For completeness, here is the extract on the new config from hbase-default.xml:
16646
16647 Property: hbase.client.perserver.requests.threshold
16648 Default: 2147483647
16649 Description: The max number of concurrent pending requests for one server in all client threads (process level). Exceeding requests will be thrown ServerTooBusyException immediately to prevent user's threads being occupied and blocked by only one slow region server. If you use a fix number of threads to access HBase in a synchronous way, set this to a suitable value which is  related to the number of threads will help you. See https://issues.apache.org/jira/browse/HBASE-16388 for details.
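
The fail-fast behavior can be pictured as a per-server counter that rejects new requests once the threshold is reached. This is a minimal sketch of that idea, not the client implementation: `PerServerLimiterSketch` and its methods are hypothetical, and IllegalStateException stands in for ServerTooBusyException so the example needs only the JDK.

```java
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.atomic.AtomicInteger;

// Hypothetical sketch of a per-server concurrency limit; not HBase client code.
final class PerServerLimiterSketch {
  private final int threshold;
  private final ConcurrentHashMap<String, AtomicInteger> pending = new ConcurrentHashMap<>();

  PerServerLimiterSketch(int threshold) {
    this.threshold = threshold;
  }

  /** Reserves a slot for a request to the server, or fails immediately if it is saturated. */
  void acquire(String server) {
    AtomicInteger count = pending.computeIfAbsent(server, s -> new AtomicInteger());
    if (count.incrementAndGet() > threshold) {
      count.decrementAndGet(); // undo the reservation before failing fast
      // Stand-in for ServerTooBusyException.
      throw new IllegalStateException("Too many concurrent requests to " + server);
    }
  }

  /** Frees the slot once the request has completed. */
  void release(String server) {
    AtomicInteger count = pending.get(server);
    if (count != null) {
      count.decrementAndGet();
    }
  }

  public static void main(String[] args) {
    PerServerLimiterSketch limiter = new PerServerLimiterSketch(2);
    limiter.acquire("rs1");
    limiter.acquire("rs1");
    try {
      limiter.acquire("rs1");
    } catch (IllegalStateException e) {
      System.out.println("rejected: " + e.getMessage());
    }
    limiter.acquire("rs2"); // other servers are unaffected
  }
}
```

Because the limit is tracked per server, a single slow region server saturates only its own counter, and threads talking to other servers keep making progress.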
16650
16651
16652 ---
16653
16654 * [HBASE-15297](https://issues.apache.org/jira/browse/HBASE-15297) | *Minor* | **error message is wrong when a wrong namspace is specified in grant in hbase shell**
16655
16656 The security admin instance available within the HBase shell now returns "false" from the namespace\_exists? method for non-existent namespaces rather than raising a wrapped NamespaceNotFoundException.
16657
16658 As a side effect, when the "grant" and "revoke" commands in the HBase shell are invoked with a non-existent namespace the resulting error message now properly refers to said namespace rather than to the user.
16659
16660
16661 ---
16662
16663 * [HBASE-16086](https://issues.apache.org/jira/browse/HBASE-16086) | *Major* | **TableCfWALEntryFilter and ScopeWALEntryFilter should not redundantly iterate over cells.**
16664
16665 push to branch-1.3+
16666
16667
16668 ---
16669
16670 * [HBASE-16340](https://issues.apache.org/jira/browse/HBASE-16340) | *Critical* | **ensure no Xerces jars included**
16671
16672 HBase no longer includes Xerces implementation jars that were previously included via transitive dependencies. Downstream users relying on HBase for these artifacts will need to update their dependencies.
16673
16674
16675 ---
16676
16677 * [HBASE-16213](https://issues.apache.org/jira/browse/HBASE-16213) | *Major* | **A new HFileBlock structure for fast random get**
16678
16679 HBASE-16213 introduced a new DataBlockEncoding named ROW\_INDEX\_V1, which can improve random read (get) performance, especially when the average record size (key-value size per row) is small. To use this feature, set DATA\_BLOCK\_ENCODING to ROW\_INDEX\_V1 for the CF of a newly created table, or change an existing CF with the command below:
16680 alter 'table\_name',{NAME =\> 'cf', DATA\_BLOCK\_ENCODING =\> 'ROW\_INDEX\_V1'}
16681
16682 Please note that if this DBE is turned on, HFile blocks will be bigger than with NONE encoding because it adds some meta information for binary search:
16683 /\*\*
16684  \* Store cells following every row's start offset, so we can binary search to a row's cells.
16685  \*
16686  \* Format:
16687  \* flat cells
16688  \* integer: number of rows
16689  \* integer: row0's offset
16690  \* integer: row1's offset
16691  \* ....
16692  \* integer: dataSize
16693  \*
16694 \*/
16695
16696 Seek in row when random reading is one of the main consumers of CPU. This helps. See slide #7 here https://www.slideshare.net/HBaseCon/lift-the-ceiling-of-hbase-throughputs?qid=597ee2fa-8125-4faa-bb3b-2bf1ba9ccafb&v=&b=&from\_search=6
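
To see why a list of row offsets makes seeking cheaper, consider the simplified model below. It is only a sketch of the idea and not the ROW\_INDEX\_V1 codec: each "cell" is reduced to a length-prefixed row key, row keys are compared as Java strings, and every name in it is hypothetical.

```java
import java.nio.ByteBuffer;
import java.nio.charset.StandardCharsets;

// Simplified model of a row index for illustration; not the ROW_INDEX_V1 codec.
final class RowIndexSketch {

  /** Writes each row key as [short length][bytes] and records where each row starts. */
  static ByteBuffer encodeRows(String[] sortedRows, int[] rowOffsetsOut) {
    int size = 0;
    for (String row : sortedRows) {
      size += Short.BYTES + row.getBytes(StandardCharsets.UTF_8).length;
    }
    ByteBuffer cells = ByteBuffer.allocate(size);
    for (int i = 0; i < sortedRows.length; i++) {
      rowOffsetsOut[i] = cells.position();
      byte[] bytes = sortedRows[i].getBytes(StandardCharsets.UTF_8);
      cells.putShort((short) bytes.length);
      cells.put(bytes);
    }
    return cells;
  }

  /** Reads the row key stored at the given offset using absolute reads only. */
  static String rowAt(ByteBuffer cells, int offset) {
    int length = cells.getShort(offset);
    byte[] bytes = new byte[length];
    for (int i = 0; i < length; i++) {
      bytes[i] = cells.get(offset + Short.BYTES + i);
    }
    return new String(bytes, StandardCharsets.UTF_8);
  }

  /** Binary search over the row offsets; returns the row's start offset, or -1 if absent. */
  static int seekRow(ByteBuffer cells, int[] rowOffsets, String row) {
    int lo = 0;
    int hi = rowOffsets.length - 1;
    while (lo <= hi) {
      int mid = (lo + hi) >>> 1;
      int cmp = rowAt(cells, rowOffsets[mid]).compareTo(row);
      if (cmp == 0) {
        return rowOffsets[mid];
      }
      if (cmp < 0) {
        lo = mid + 1;
      } else {
        hi = mid - 1;
      }
    }
    return -1;
  }

  public static void main(String[] args) {
    String[] rows = {"row-a", "row-b", "row-c"};
    int[] offsets = new int[rows.length];
    ByteBuffer cells = encodeRows(rows, offsets);
    System.out.println("row-b starts at offset " + seekRow(cells, offsets, "row-b"));
  }
}
```

Without the offsets, finding a row means walking the flat cells from the start of the block; with them, the number of row comparisons grows logarithmically with the number of rows, at the cost of one extra integer per row.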
16697
16698
16699 ---
16700
16701 * [HBASE-16409](https://issues.apache.org/jira/browse/HBASE-16409) | *Minor* | **Row key for bad row should be properly delimited in VerifyReplication**
16702
16703 --delimiter= option is added to verifyrep.
16704 The delimiter would wrap bad rows in log output.
16705
16706
16707 ---
16708
16709 * [HBASE-14921](https://issues.apache.org/jira/browse/HBASE-14921) | *Major* | **Inmemory Compaction Optimizations; Segment Structure**
16710
16711 A long, working issue that discussed Segment formats introducing CellArrayMap (delivered as the patch attached to this issue) and CellChunkMap (to be delivered later in HBASE-16421 but see patch v02 for an embryonic form named CellBlockSerialized); when to copy Segment data (and when not to); and then what to include at flush time (the suffix Segment or all Segments). Designs that evolved as discussion went on are attached. Outstanding issues turned up here, not including a CellChunkMap implementation, are listed below but are to be addressed in follow-ons (See HBASE-16417):
16712
16713 1. The flattening without compaction is causing many small segments in pipeline, and they are not flushed all together.
16714 2. The issue of compaction prediction cost.
16715
16716
16717 ---
16718
16719 * [HBASE-16450](https://issues.apache.org/jira/browse/HBASE-16450) | *Major* | **Shell tool to dump replication queues**
16720
16721 New tool to dump existing replication peers, configurations and queues when using HBase Replication. The tool provides two flags:
16722
16723  --distributed  This flag will poll each RS for information about the replication queues being processed on this RS.
16724 By default this is not enabled and the information about the replication queues and configuration will be obtained from ZooKeeper.
16725  --hdfs   When --distributed is used, this flag will attempt to calculate the total size of the WAL files used by the replication queues. Since it's possible that multiple peers are configured, this value can be overestimated.
16726
16727
16728 ---
16729
16730 * [HBASE-16422](https://issues.apache.org/jira/browse/HBASE-16422) | *Major* | **Tighten our guarantees on compatibility across patch versions**
16731
16732 Adds below change to our compat guarantees:
16733
16734 {code}
16735 -\* Example: A user using a newly deprecated api does not need to modify application code with hbase api calls until the next major version.
16736 +\* New APIs introduced in a patch version will only be added in a source compatible way footnote:[See 'Source Compatibility' https://blogs.oracle.com/darcy/entry/kinds\_of\_compatibility]: i.e. code that implements public APIs will continue to compile.
16737 {code}
16738
16739
16740 ---
16741
16742 * [HBASE-7621](https://issues.apache.org/jira/browse/HBASE-7621) | *Major* | **REST client (RemoteHTable) doesn't support binary row keys**
16743
16744 RemoteHTable now supports binary row keys with any character or byte by properly encoding request URLs. This is both a behavioral change from earlier versions and an important fix for protocol correctness.
16745
16746
16747 ---
16748
16749 * [HBASE-12721](https://issues.apache.org/jira/browse/HBASE-12721) | *Major* | **Create Docker container cluster infrastructure to enable better testing**
16750
16751 Downstream users wishing to test HBase in a "distributed" fashion (multiple "nodes" running as separate containers on the same host) can now do so in an automated fashion while leveraging Docker for process isolation via the clusterdock project.
16752
16753 For details see the README.md in the dev-support/apache\_hbase\_topology folder.
16754
16755
16756 ---
16757
16758 * [HBASE-16267](https://issues.apache.org/jira/browse/HBASE-16267) | *Critical* | **Remove commons-httpclient dependency from hbase-rest module**
16759
16760 This issue upgrades httpclient to 4.5.2 and httpcore to 4.4.4 which are the versions used by hadoop-2.
16761 This is to handle the following CVEs:
16762
16763 https://web.nvd.nist.gov/view/vuln/detail?vulnId=CVE-2015-5262 : http/conn/ssl/SSLConnectionSocketFactory.java in Apache HttpComponents HttpClient before 4.3.6 ignores the http.socket.timeout configuration setting during an SSL handshake, which allows remote attackers to cause a denial of service (HTTPS call hang) via unspecified vectors.
16764
16765 https://web.nvd.nist.gov/view/vuln/detail?vulnId=CVE-2012-6153
16766 https://web.nvd.nist.gov/view/vuln/detail?vulnId=CVE-2012-5783
16767 Apache Commons HttpClient 3.x, as used in Amazon Flexible Payments Service (FPS) merchant Java SDK and other products, does not verify that the server hostname matches a domain name in the subject's Common Name (CN) or subjectAltName field of the X.509 certificate, which allows man-in-the-middle attackers to spoof SSL servers via an arbitrary valid certificate.
16768
16769 Downstream users who are exposed to commons-httpclient via the HBase classpath will have to similarly update their dependency.
16770
16771
16772 ---
16773
16774 * [HBASE-16308](https://issues.apache.org/jira/browse/HBASE-16308) | *Major* | **Contain protobuf references**
16775
16776 Undo protobuf references through the codebase so protobuf references are contained rather than spread about the codebase. For example, moved protobuf-ing up into the various Callables rather than repeating it on each method invocation, cleaning up boilerplate around rpc calls. Having only a few protobuf reference locations simplifies the parent issue's shading project.
16777
16778
16779 ---
16780
16781 * [HBASE-16321](https://issues.apache.org/jira/browse/HBASE-16321) | *Blocker* | **Ensure findbugs jsr305 jar isn't present**
16782
16783 HBase now ensures the jsr305 implementation from the findbugs project is not included in its binary artifacts or the compile / runtime dependencies of its user facing modules. Downstream users that rely on this jar will need to update their dependencies.
16784
16785
16786 ---
16787
16788 * [HBASE-8386](https://issues.apache.org/jira/browse/HBASE-8386) | *Major* | **deprecate TableMapReduce.addDependencyJars(Configuration, class\<?\> ...)**
16789
16790 The MapReduce helper function \`TableMapReduceUtil.addDependencyJars(Configuration, Class\<?\> ...)\` has been deprecated since it is easy to use incorrectly. Most users should rely on addDependencyJars(Job) instead.
16791
16792
16793 ---
16794
16795 * [HBASE-16287](https://issues.apache.org/jira/browse/HBASE-16287) | *Major* | **LruBlockCache size should not exceed acceptableSize too many**
16796
16797 To prevent the blockcache size from exceeding the acceptable size by too much, we add a configuration, "hbase.lru.blockcache.hard.capacity.limit.factor", that decides whether a block can be put into the LruBlockCache or not.  This factor defaults to 1.2.
16798 If blockcache size \>= factor\*acceptableSize, the block is not admitted into the cache.
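
The admission rule is a single comparison. Here is a minimal sketch of it, assuming sizes are tracked in bytes; `HardCapacityCheck` and `shouldReject` are hypothetical names and the numbers in `main` are examples only.

```java
// Hypothetical sketch of the admission rule; not LruBlockCache source code.
final class HardCapacityCheck {

  /** Returns true when the cache is already at or beyond factor * acceptableSize. */
  static boolean shouldReject(long currentSize, long acceptableSize, double hardLimitFactor) {
    return currentSize >= hardLimitFactor * acceptableSize;
  }

  public static void main(String[] args) {
    long acceptableSize = 1_000_000L; // example value in bytes
    double factor = 1.2;              // the default named in the note above
    System.out.println(shouldReject(1_100_000L, acceptableSize, factor)); // below the hard limit
    System.out.println(shouldReject(1_300_000L, acceptableSize, factor)); // beyond the hard limit
  }
}
```

The factor leaves headroom above the acceptable size, so short bursts are absorbed, while growth beyond that headroom is stopped at insertion time.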
16799
16800
16801 ---
16802
16803 * [HBASE-16355](https://issues.apache.org/jira/browse/HBASE-16355) | *Major* | **hbase-client dependency on hbase-common test-jar should be test scope**
16804
16805 The HBase client artifact previously incorrectly included the hbase-common test jar as a runtime dependency. With this change, that dependency has been moved to test scope. Downstream users are not expected to be impacted, unless they relied on the transitive dependency for these HBase internal test classes.
16806
16807
16808 ---
16809
16810 * [HBASE-16317](https://issues.apache.org/jira/browse/HBASE-16317) | *Blocker* | **revert all ESAPI changes**
16811
16812 This issue reverts fixes designed to prevent malicious content from rendering in HBase's UIs. Specifically, these changes shipped in 1.1.4+ and 1.2.0+. They were removed due to licensing issues discovered in the dependencies they introduced. Their implementation and those dependencies have been removed from HBase! Removal of these dependencies is against the strict definition of our version compatibility guidelines. However, inclusion of non-Apache approved licenses cannot be tolerated. Implementation of these fixes using an Apache-appropriate means is tracked in HBASE-16328.
16813
16814
16815 ---
16816
16817 * [HBASE-16288](https://issues.apache.org/jira/browse/HBASE-16288) | *Critical* | **HFile intermediate block level indexes might recurse forever creating multi TB files**
16818
16819 A new hfile configuration, "hfile.index.block.min.entries" (default 16), sets the minimum number of entries an hfile index block must hold. The configuration that caps the index block size (hfile.index.block.max.size) is ignored while a block holds fewer than hfile.index.block.min.entries entries. This keeps a multi-level index from building up too many levels.
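
A minimal sketch of how the two settings interact, assuming the size cap is only consulted once the minimum entry count is reached. The class and method names and the cap value used in `main` are hypothetical, not taken from the HBase source.

```java
// Sketch of the "close this index block?" decision; names are hypothetical.
class IndexBlockSketch {
    // Documented default of hfile.index.block.min.entries.
    static final int DEFAULT_MIN_ENTRIES = 16;

    /**
     * An index block is closed only when it holds at least minEntries entries
     * AND has reached the size cap. Below minEntries the cap is ignored.
     */
    static boolean shouldCloseBlock(long blockSizeBytes, int numEntries,
                                    long maxSizeBytes, int minEntries) {
        if (numEntries < minEntries) {
            return false; // too few entries: keep filling regardless of size
        }
        return blockSizeBytes >= maxSizeBytes;
    }

    public static void main(String[] args) {
        long cap = 131_072L; // hypothetical value for hfile.index.block.max.size
        // Three huge keys exceed the cap, but the block is not closed yet.
        System.out.println(shouldCloseBlock(1_000_000L, 3, cap, DEFAULT_MIN_ENTRIES));
        // Enough entries and over the cap: the block is closed.
        System.out.println(shouldCloseBlock(200_000L, 16, cap, DEFAULT_MIN_ENTRIES));
    }
}
```

With a size cap alone, very large keys could produce index blocks of one entry each, so each index level would be no smaller than the one below it.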
16820
16821
16822 ---
16823
16824 * [HBASE-16186](https://issues.apache.org/jira/browse/HBASE-16186) | *Major* | **Fix AssignmentManager MBean name**
16825
16826 The AssignmentManager MBean was named AssignmentManger (note misspelling). This patch fixed the misspelling.
16827
16828
16829 ---
16830
16831 * [HBASE-16289](https://issues.apache.org/jira/browse/HBASE-16289) | *Critical* | **AsyncProcess stuck messages need to print region/server**
16832
16833 Adds logging of region and server, which helps with debugging. Logging now looks like this:
16834 {code}
16835 2016-06-23 17:07:18,759 INFO  [Thread-1] client.AsyncProcess$AsyncRequestFutureImpl(1601): #1, waiting for 1  actions to finish on table: DUMMY\_TABLE
16836 2016-06-23 17:07:18,759 INFO  [Thread-1] client.AsyncProcess(1720): Left over 1 task(s) are processed on server(s): [s1:1,1,1]
16837 2016-06-23 17:07:18,759 INFO  [Thread-1] client.AsyncProcess(1728): Regions against which left over task(s) are processed: [DUMMY\_TABLE,DUMMY\_BYTES\_1,1.3fd12ea80b4df621fb15497ba75f7368.,DUMMY\_TABLE,DUMMY\_BYTES\_2,2.924207e242e313d2e5491c625e0a296e.]
16838 {code}
16839
16840
16841 ---
16842
16843 * [HBASE-14743](https://issues.apache.org/jira/browse/HBASE-14743) | *Minor* | **Add metrics around HeapMemoryManager**
16844
16845 New memory metrics show what is happening in the MemStores and the BlockCache of a RegionServer. From these metrics, users/operators can see:
16846 1). The current size of the MemStores and the BlockCache, in bytes.
16847 2). Occurrences of MemStore minor and major flushes (named unblocked flush and blocked flush respectively, shown as histograms).
16848 3). Dynamic changes in size between the MemStores and the BlockCache (with Increase/Decrease as prefix, shown as histograms), plus a counter for no change, named DoNothingCounter.
16849 4). Occurrences of the memory usage alarm (raised above 95% usage by default) in the RegionServer (named AboveHeapOccupancyLowWatermarkCounter).
16850
16851
16852 ---
16853
16854 * [HBASE-13701](https://issues.apache.org/jira/browse/HBASE-13701) | *Major* | **Consolidate SecureBulkLoadEndpoint into HBase core as default for bulk load**
16855
16856 SecureBulkLoadEndpoint has been integrated into HBase core as the default bulk load mechanism. It no longer needs to be installed as a coprocessor endpoint.
16857 The new server is backward compatible, accommodating non-secure old clients and secure old clients requesting the SecureBulkLoadEndpoint service.
16858 SecureBulkLoadEndpoint is deprecated. The backward compatibility support may be removed in future releases.
16859
16860
16861 ---
16862
16863 * [HBASE-16244](https://issues.apache.org/jira/browse/HBASE-16244) | *Major* | **LocalHBaseCluster start timeout should be configurable**
16864
16865 When LocalHBaseCluster is started from the command line, the Master would give up after 30 seconds due to a hardcoded timeout meant for unit tests. This change makes the timeout configurable via hbase-site and sets it to 5 minutes when LocalHBaseCluster is started from the command line.
16866
16867
16868 ---
16869
16870 * [HBASE-16052](https://issues.apache.org/jira/browse/HBASE-16052) | *Major* | **Improve HBaseFsck Scalability**
16871
16872 HBASE-16052 improves the performance and scalability of HBaseFsck, especially for large clusters with a small number of large tables.
16873
16874 Searching for lingering reference files is now a multi-threaded operation.  Loading HDFS region directory information is now multi-threaded at the region-level instead of the table-level to maximize concurrency.  A performance bug in HBaseFsck that resulted in redundant I/O and RPCs was fixed by introducing a FileStatusFilter that filters FileStatus objects directly.
16875
16876
16877 ---
16878
16879 * [HBASE-16144](https://issues.apache.org/jira/browse/HBASE-16144) | *Major* | **Replication queue's lock will live forever if RS acquiring the lock has died prematurely**
16880
16881 If the zk-based replication queue is used and useMulti is false, a chore is scheduled to clean up orphaned replication queue locks on zk.
16882
16883
16884 ---
16885
16886 * [HBASE-3727](https://issues.apache.org/jira/browse/HBASE-3727) | *Minor* | **MultiHFileOutputFormat**
16887
16888 MultiHFileOutputFormat supports output of HFiles from multiple tables. It writes directories and hfiles as follows:
16889      --table1
16890        --family1
16891        --family2
16892          --hfiles
16893      --table2
16894        --family3
16895          --hfiles
16896        --family4
16897
16898 The family directory and its hfiles match the output of HFileOutputFormat2.
16899
16900
16901 ---
16902
16903 * [HBASE-16231](https://issues.apache.org/jira/browse/HBASE-16231) | *Major* | **Integration tests should support client keytab login for secure clusters**
16904
16905 Prior to this change, the integration test clients (IntegrationTest\*) relied on the Kerberos credential cache for authentication against secured clusters.  This could lead to the tests failing due to authentication failures when the tickets in the credential cache expired.  With this change, the integration test clients will make use of the configuration properties for "hbase.client.keytab.file" and "hbase.client.kerberos.principal", when available.  This will perform a login from the configured keytab file and automatically refresh the credentials in the background for the process lifetime.
16906
16907
16908 ---
16909
16910 * [HBASE-13823](https://issues.apache.org/jira/browse/HBASE-13823) | *Major* | **Procedure V2: unnecessaery operations on AssignmentManager#recoverTableInDisablingState() and recoverTableInEnablingState()**
16911
16912 For a cluster upgraded from 1.0.x or older releases, master startup will not continue an in-progress enable/disable table process. If an orphaned znode with ENABLING/DISABLING state exists in the cluster, run hbck or fix the issue manually.
16913
16914 For a new cluster, or a cluster upgraded from 1.1.x or newer releases, there is no issue to worry about.
16915
16916
16917 ---
16918
16919 * [HBASE-16095](https://issues.apache.org/jira/browse/HBASE-16095) | *Major* | **Add priority to TableDescriptor and priority region open thread pool**
16920
16921 Adds a PRIORITY property to the HTableDescriptor. PRIORITY should be in the same range as the RpcScheduler defines it (HConstants.XXX\_QOS).
16922
16923 Table priorities are only used for region opening for now. There can be other uses later (like RpcScheduling).
16924
16925 Regions of high priority tables (priority \>= HIGH\_QOS) are opened from a different thread pool than the regular region open thread pool. However, table priorities are not used as a global order for region assignment or opening.
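
The pool selection described above amounts to a single threshold check. A minimal sketch follows; the class name, pool labels, and the numeric value used for HIGH\_QOS are assumptions for illustration, not taken from HConstants.

```java
// Sketch of choosing a region-open pool by table priority; names are hypothetical.
class RegionOpenPoolSketch {
    // Assumed stand-in for HConstants.HIGH_QOS; the real value may differ.
    static final int HIGH_QOS = 200;

    /** Returns a label for the pool that would open regions of a table with this priority. */
    static String poolFor(int tablePriority) {
        return tablePriority >= HIGH_QOS ? "priority-region-open" : "regular-region-open";
    }

    public static void main(String[] args) {
        System.out.println(poolFor(0));        // default priority table
        System.out.println(poolFor(HIGH_QOS)); // high priority table
    }
}
```

The priority picks the pool only; it does not order regions within a pool or across the cluster.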
16926
16927
16928 ---
16929
16930 * [HBASE-16081](https://issues.apache.org/jira/browse/HBASE-16081) | *Blocker* | **Replication remove\_peer gets stuck and blocks WAL rolling**
16931
16932 When a replication endpoint is sent a shutdown request by the replication source in situations like removing a peer, we now try to gracefully shut it down by draining the items already sent for replication to the peer cluster. If the drain does not complete in the specified time (hbase.rpc.timeout \* replication.source.maxterminationmultiplier), the regionserver is aborted to avoid blocking the WAL roll.
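
As a rough sketch of the bounded drain described above: the deadline is the product of the two settings, and a drain that misses it is reported as failed so the caller can abort. Class and method names are hypothetical, and the numbers in `main` are example values rather than documented defaults.

```java
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.Future;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.TimeoutException;

// Sketch of "drain, but only for a bounded time"; names are hypothetical.
class DrainTimeoutSketch {
    /** hbase.rpc.timeout * replication.source.maxterminationmultiplier, in milliseconds. */
    static long terminationTimeoutMs(long rpcTimeoutMs, int maxTerminationMultiplier) {
        return rpcTimeoutMs * maxTerminationMultiplier;
    }

    /** Returns true if the drain finished in time; false means the caller would abort. */
    static boolean drainedInTime(Future<?> drain, long timeoutMs) {
        try {
            drain.get(timeoutMs, TimeUnit.MILLISECONDS);
            return true;
        } catch (TimeoutException e) {
            drain.cancel(true); // stop waiting on the endpoint
            return false;
        } catch (ExecutionException e) {
            return false; // the drain itself failed
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            return false;
        }
    }

    public static void main(String[] args) {
        // Example values only; not taken from hbase-default.xml.
        System.out.println(terminationTimeoutMs(60_000L, 4));
        System.out.println(drainedInTime(CompletableFuture.completedFuture(null), 50L));
    }
}
```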
16933
16934
16935 ---
16936
16937 * [HBASE-16087](https://issues.apache.org/jira/browse/HBASE-16087) | *Major* | **Replication shouldn't start on a master if if only hosts system tables**
16938
16939 Masters will no longer start any replication threads if they are hosting only system tables.
16940
16941 To change this, add something to the config for tables on master that doesn't start with "hbase:". (Replicating system tables is currently unsupported and can open up security holes, so do this at your own peril.)
16942
16943
16944 ---
16945
16946 * [HBASE-14548](https://issues.apache.org/jira/browse/HBASE-14548) | *Major* | **Expand how table coprocessor jar and dependency path can be specified**
16947
16948 Allow a directory containing the jars or some wildcards to be specified, such as: hdfs://namenode:port/user/hadoop-user/
16949 or
16950 hdfs://namenode:port/user/hadoop-user/\*.jar
16951
16952 Please note that if a directory is specified, all jar files (.jar) directly in the directory are added, but files in subdirectories are not searched.
16953 Do not include any wildcard if you would like to specify a directory.
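
The "directly in the directory, not the subtree" rule can be illustrated with a local filesystem stand-in. This sketch uses java.nio rather than the HDFS API that HBase itself uses, and all names in it are hypothetical.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.file.DirectoryStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

// Local-filesystem illustration of non-recursive jar discovery; names are hypothetical.
class JarPathSketch {
    /** Lists the .jar files directly inside dir; subdirectories are not searched. */
    static List<String> jarsDirectlyIn(Path dir) {
        List<String> names = new ArrayList<>();
        try (DirectoryStream<Path> stream = Files.newDirectoryStream(dir, "*.jar")) {
            for (Path p : stream) {
                if (Files.isRegularFile(p)) {
                    names.add(p.getFileName().toString());
                }
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        Collections.sort(names);
        return names;
    }

    /** Builds a small temp tree (a.jar, b.jar, notes.txt, sub/c.jar) and lists its jars. */
    static List<String> demo() {
        try {
            Path dir = Files.createTempDirectory("cp-jars");
            Files.createFile(dir.resolve("a.jar"));
            Files.createFile(dir.resolve("b.jar"));
            Files.createFile(dir.resolve("notes.txt"));
            Path sub = Files.createDirectory(dir.resolve("sub"));
            Files.createFile(sub.resolve("c.jar")); // nested: must not be picked up
            return jarsDirectlyIn(dir);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        System.out.println(demo());
    }
}
```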
16954
16955
16956 ---
16957
16958 * [HBASE-15925](https://issues.apache.org/jira/browse/HBASE-15925) | *Blocker* | **compat-module maven variable not evaluated**
16959
16960 Downstream users of HBase dependencies that do not properly activate Maven profiles should now see a correct transitive dependency on the default hadoop-compatibility-module.
16961
16962
16963 ---
16964
16965 * [HBASE-16140](https://issues.apache.org/jira/browse/HBASE-16140) | *Major* | **bump owasp.esapi from 2.1.0 to 2.1.0.1**
16966
16967 The dependency owasp.esapi had a compatible change from 2.1.0 to 2.1.0.1. As a result, the transitive dependency commons-fileupload had a change from 1.2 to 1.3.1, which has some minor class changes that impact binary compatibility. Interested users should check the release notes of commons-fileupload to see if any of the incompatible changes impact them.
16968
16969 http://commons.apache.org/proper/commons-fileupload/changes-report.html
16970
16971
16972 ---
16973
16974 * [HBASE-16147](https://issues.apache.org/jira/browse/HBASE-16147) | *Major* | **Shell command for getting compaction state**
16975
16976 The compaction\_state shell command returns the compaction state as a string:
16977 NONE, MINOR, MAJOR, MAJOR\_AND\_MINOR
16978
16979
16980 ---
16981
16982 * [HBASE-14878](https://issues.apache.org/jira/browse/HBASE-14878) | *Major* | **maven archetype: client application with shaded jars**
16983
16984 Adds new hbase-shaded-client archetype; also corrects an omission found in hbase-archetypes/README.md in the section headed "How to add a new archetype".
16985
16986
16987 ---
16988
16989 * [HBASE-14877](https://issues.apache.org/jira/browse/HBASE-14877) | *Major* | **maven archetype: client application**
16990
16991 This patch introduces a new infrastructure for creation and maintenance of Maven archetypes in the context of the hbase project, and it also introduces the first archetype, which end-users may utilize to generate a simple hbase-client dependent project.
16992
16993 NOTE that this patch should introduce two new WARNINGs ("Using platform encoding ... to copy filtered resources") into the hbase install process. These warnings are hard-wired into the maven-archetype-plugin:create-from-project goal. See hbase/hbase-archetypes/README.md, footnote [6] for details.
16994
16995 After applying the patch, see hbase/hbase-archetypes/README.md for details regarding the new archetype infrastructure introduced by this patch. (The README text is also conveniently positioned at the top of the patch itself.)
16996
16997 Here is the opening paragraph of the README.md file:
16998 =================
16999 The hbase-archetypes subproject of hbase provides an infrastructure for creation and maintenance of Maven archetypes pertinent to HBase. Upon deployment to the archetype catalog of the central Maven repository, these archetypes may be used by end-user developers to autogenerate completely configured Maven projects (including fully-functioning sample code) through invocation of the archetype:generate goal of the maven-archetype-plugin.
17000 ========
17001 The README.md file also contains several paragraphs under the heading, "Notes for contributors and committers to the HBase project", which explains the layout of 'hbase-archetypes', and how archetypes are created and installed into the local Maven repository, ready for deployment to the central Maven repository. It also outlines how new archetypes may be developed and added to the collection in the future.
17002
17003
17004 ---
17005
17006 * [HBASE-15977](https://issues.apache.org/jira/browse/HBASE-15977) | *Major* | **Failed variable substitution on home page**
17007
17008 Done. Thanks, Dima, Andrew!
17009
17010
17011 ---
17012
17013 * [HBASE-5291](https://issues.apache.org/jira/browse/HBASE-5291) | *Major* | **Add Kerberos HTTP SPNEGO authentication support to HBase web consoles**
17014
17015 HBase Web UIs can be secured from general public access using SPNEGO to require a valid Kerberos ticket.
17016
17017 Setting 'hbase.security.authentication.ui' to 'kerberos' in hbase-site.xml is a global switch that makes all Web UIs accept only clients authenticated via Kerberos. 'hbase.security.authentication.spnego.kerberos.principal' and 'hbase.security.authentication.spnego.kerberos.keytab' are two other required properties in hbase-site.xml: the Kerberos principal and keytab the server uses to log in. The primary in the Kerberos principal must be 'HTTP' as required by the SPNEGO mechanism, e.g. 'HTTP/host.domain.com@DOMAIN.COM'.
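
A quick way to sanity-check a principal before putting it into hbase-site.xml is to extract its primary and compare it with 'HTTP'. The helper below is a hypothetical sketch with plain string handling; it is not part of HBase or of any Kerberos library.

```java
// Sketch of validating the primary of a SPNEGO principal; names are hypothetical.
class SpnegoPrincipalSketch {
    /** Returns the primary: the part before the first '/' or '@'. */
    static String primaryOf(String principal) {
        int end = principal.length();
        int slash = principal.indexOf('/');
        int at = principal.indexOf('@');
        if (slash >= 0) {
            end = Math.min(end, slash);
        }
        if (at >= 0) {
            end = Math.min(end, at);
        }
        return principal.substring(0, end);
    }

    /** SPNEGO requires the primary to be exactly "HTTP". */
    static boolean hasSpnegoPrimary(String principal) {
        return "HTTP".equals(primaryOf(principal));
    }

    public static void main(String[] args) {
        System.out.println(hasSpnegoPrimary("HTTP/host.domain.com@DOMAIN.COM"));
        System.out.println(hasSpnegoPrimary("hbase/host.domain.com@DOMAIN.COM"));
    }
}
```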
17018
17019
17020 ---
17021
17022 * [HBASE-15950](https://issues.apache.org/jira/browse/HBASE-15950) | *Major* | **Fix memstore size estimates to be more tighter**
17023
17024 The estimates of heap usage by the memstore objects (KeyValue, object and array header sizes, etc) have been made more accurate for heap sizes up to 32G (using CompressedOops), resulting in them dropping by 10-50% in practice. This also results in fewer flushes and compactions due to "fatter" flushes. YMMV. As a result, the actual heap usage of the memstore before being flushed may increase by up to 100%. If configured memory limits for the region server had been tuned based on observed usage, this change could result in worse GC behavior or even OutOfMemory errors. Set the environment property (not hbase-site.xml) "hbase.memorylayout.use.unsafe" to false to disable.
17025
17026
17027 ---
17028
17029 * [HBASE-16023](https://issues.apache.org/jira/browse/HBASE-16023) | *Major* | **Fastpath for the FIFO rpcscheduler**
17030
17031 Adds a 'fastpath' when using the default FIFO rpc scheduler ('fifo'). It hands a call directly from the Reader thread to a Handler if one is ready and willing. It shines most under a high random-read workload (YCSB workloadc, for instance).
17032
17033
17034 ---
17035
17036 * [HBASE-15971](https://issues.apache.org/jira/browse/HBASE-15971) | *Critical* | **Regression: Random Read/WorkloadC slower in 1.x than 0.98**
17037
17038 Changes the default rpc scheduler from 'deadline' to 'fifo' so it is the same as in branch 0.98. 'deadline' was of questionable benefit and carried a high scheduling cost. To re-enable 'deadline', set hbase.ipc.server.callqueue.type to 'deadline' in your hbase-site.xml.
17039
17040
17041 ---
17042
17043 * [HBASE-15525](https://issues.apache.org/jira/browse/HBASE-15525) | *Critical* | **OutOfMemory could occur when using BoundedByteBufferPool during RPC bursts**
17044
17045 Added a new ByteBufferPool which pools N ByteBuffers. By default it makes off-heap ByteBuffers when getBuffer() is called. The size of each buffer defaults to 64KB and can be configured using 'hbase.ipc.server.reservoir.initial.buffer.size'. The max number of buffers which can be pooled defaults to twice the number of handler threads in the RS and can be configured with the key 'hbase.ipc.server.reservoir.initial.max'. When responding to read requests from a client that supports Codec, we create CellBlocks and return them directly as the PB payload. To build such a block, we use N ByteBuffers from the pool, as needed for the total size of the response cells. The default buffer size of 64KB is in line with the number of bytes written to the RPC layer in one shot (also 64KB). If at some point the caller cannot get a free buffer from the pool (it returns null), it makes an on-heap buffer of the same size as the pooled buffers and uses that to create the cell block.
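
The pooling behavior described above can be sketched in a few lines: a bounded pool that creates off-heap buffers lazily, returns null when exhausted, and leaves the on-heap fallback to the caller. This is an illustrative model, not the HBase ByteBufferPool; apart from getBuffer(), the class and method names are hypothetical.

```java
import java.nio.ByteBuffer;
import java.util.Queue;
import java.util.concurrent.ConcurrentLinkedQueue;
import java.util.concurrent.atomic.AtomicInteger;

// Simplified model of a bounded off-heap buffer pool; not the HBase class.
class BufferPoolSketch {
    private final Queue<ByteBuffer> free = new ConcurrentLinkedQueue<>();
    private final AtomicInteger created = new AtomicInteger();
    private final int bufferSize;
    private final int maxPoolSize;

    BufferPoolSketch(int bufferSize, int maxPoolSize) {
        this.bufferSize = bufferSize;
        this.maxPoolSize = maxPoolSize;
    }

    /** Returns a pooled off-heap buffer, or null when none is free and the pool is at its cap. */
    ByteBuffer getBuffer() {
        ByteBuffer pooled = free.poll();
        if (pooled != null) {
            pooled.clear();
            return pooled;
        }
        while (true) {
            int count = created.get();
            if (count >= maxPoolSize) {
                return null; // exhausted: the caller must fall back
            }
            if (created.compareAndSet(count, count + 1)) {
                return ByteBuffer.allocateDirect(bufferSize);
            }
        }
    }

    /** Returns a buffer to the pool; buffers that did not come from the pool are ignored. */
    void putBack(ByteBuffer buffer) {
        if (buffer.isDirect() && buffer.capacity() == bufferSize) {
            free.offer(buffer);
        }
    }

    /** Caller-side fallback: an on-heap buffer of the same size when the pool returns null. */
    ByteBuffer getOrAllocateOnHeap() {
        ByteBuffer pooled = getBuffer();
        return pooled != null ? pooled : ByteBuffer.allocate(bufferSize);
    }

    public static void main(String[] args) {
        BufferPoolSketch pool = new BufferPoolSketch(64 * 1024, 2);
        pool.getBuffer();
        pool.getBuffer();
        // Pool is exhausted now, so this buffer comes from the heap.
        System.out.println(pool.getOrAllocateOnHeap().isDirect());
    }
}
```

Capping the number of pooled buffers is what bounds off-heap memory during RPC bursts; the heap fallback trades some GC pressure for not failing the request.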
17046
17047
17048 ---
17049
17050 * [HBASE-15994](https://issues.apache.org/jira/browse/HBASE-15994) | *Major* | **Allow selection of RpcSchedulers**
17051
17052 Adds a FifoRpcSchedulerFactory so you can try the FifoRpcScheduler by setting  "hbase.region.server.rpc.scheduler.factory.class"
17053
17054
17055 ---
17056
17057 * [HBASE-15989](https://issues.apache.org/jira/browse/HBASE-15989) | *Major* | **Remove hbase.online.schema.update.enable**
17058
17059 Removes the "hbase.online.schema.update.enable" property.
17060 From now on, every operation that alters the schema (e.g. modifyTable, addFamily, removeFamily, ...) uses the online schema update. There is no need to disable/enable the table.
17061
17062
17063 ---
17064
17065 * [HBASE-15981](https://issues.apache.org/jira/browse/HBASE-15981) | *Minor* | **Stripe and Date-tiered compactions inaccurately suggest disabling table in docs**
17066
17067 Removes reference to disabling table in docs for stripe and date-tiered compactions
17068
17069
17070 ---
17071
17072 * [HBASE-15931](https://issues.apache.org/jira/browse/HBASE-15931) | *Critical* | **Add log for long-running tasks in AsyncProcess**
17073
17074 After HBASE-15931, we log more details for long-running tasks in AsyncProcess#waitForMaximumCurrentTasks every 10 seconds:
17075 1. The table name is included in the task status log.
17076 2. The regionserver(s) on which the tasks are running are logged when fewer than hbase.client.threshold.log.details tasks are left (10 by default).
17077 3. The regions against which the tasks are running are logged when fewer than 2 tasks are left.
17078
17079
17080 ---
17081
17082 * [HBASE-15907](https://issues.apache.org/jira/browse/HBASE-15907) | *Major* | **Missing documentation of create table split options**
17083
17084 Documentation changes only: added a section to Shell tricks and a cross reference from the region splitting section.
17085
17086
17087 ---
17088
17089 * [HBASE-15915](https://issues.apache.org/jira/browse/HBASE-15915) | *Major* | **Set timeouts on hanging tests**
17090
17091 Use @ClassRule to set timeout on test case level (instead of @Rule which sets timeout for the test methods). CategoryBasedTimeout.forClass(..) determines the timeout value based on category annotation (small/medium/large) on the test case.
17092
17093
17094 ---
17095
17096 * [HBASE-15875](https://issues.apache.org/jira/browse/HBASE-15875) | *Major* | **Remove HTable references and HTableInterface**
17097
17098 **WARNING: No release note provided for this change.**
17099
17100
17101 ---
17102
17103 * [HBASE-15610](https://issues.apache.org/jira/browse/HBASE-15610) | *Blocker* | **Remove deprecated HConnection for 2.0 thus removing all PB references for 2.0**
17104
17105 **WARNING: No release note provided for this change.**
17106
17107
17108 ---
17109
17110 * [HBASE-15890](https://issues.apache.org/jira/browse/HBASE-15890) | *Major* | **Allow thrift to set/unset "cacheBlocks" for Scans**
17111
17112 Adds cacheBlocks to Scan
17113
17114
17115 ---
17116
17117 * [HBASE-15876](https://issues.apache.org/jira/browse/HBASE-15876) | *Blocker* | **Remove doBulkLoad(Path hfofDir, final HTable table) though it has not been through a full deprecation cycle**
17118
17119 Removes a doBulkLoad method though it has not been through a full deprecation cycle (but it is 'damaged' because it has a parameter that has been properly deprecated). Use the alternative {code}public void doBulkLoad(Path hfofDir, final Admin admin, Table table, RegionLocator regionLocator){code}
17120
17121 See http://mail-archives.apache.org/mod\_mbox/hbase-dev/201605.mbox/%3CCAMUu0w-ZiLoLBLO3D76=n3AjUr=VMtTUeYA28weLHYeq8+e3bQ@mail.gmail.com%3E for NOTICE on this 'premature' removal.
17122
17123
17124 ---
17125
17126 * [HBASE-15228](https://issues.apache.org/jira/browse/HBASE-15228) | *Major* | **Add the methods to RegionObserver to trigger start/complete restoring WALs**
17127
17128 Added two hooks around WAL restore.
17129 preReplayWALs(final ObserverContext\<? extends RegionCoprocessorEnvironment\> ctx,  HRegionInfo info, Path edits)
17130 and
17131 postReplayWALs(final ObserverContext\<? extends RegionCoprocessorEnvironment\> ctx,  HRegionInfo info, Path edits)
17132
17133 These are called at the start and end of the restore of a WAL file.
17134 The other hook around WAL restore (preWALRestore) is called before the restore of every entry within the WAL file.
17135
17136
17137 ---
17138
17139 * [HBASE-15856](https://issues.apache.org/jira/browse/HBASE-15856) | *Critical* | **Cached Connection instances can wind up with addresses never resolved**
17140
17141 During periods where DNS resolution was not available or not working correctly, we could previously cache unresolved hostnames forever, in some cases preventing further connections to these hosts even when DNS service was restored.  With this change, unresolved hostnames will no longer be cached, and will instead throw an UnknownHostException during connection setup.
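
The fail-fast behavior can be illustrated with the JDK's own notion of an unresolved address. This is a sketch of the idea only, with hypothetical class and method names; the actual check lives inside the HBase connection code.

```java
import java.net.InetAddress;
import java.net.InetSocketAddress;
import java.net.UnknownHostException;

// Sketch of rejecting unresolved addresses instead of caching them; names are hypothetical.
class ResolveCheckSketch {
    /** Returns addr if it is resolved, otherwise throws instead of letting it be cached. */
    static InetSocketAddress requireResolved(InetSocketAddress addr) throws UnknownHostException {
        if (addr.isUnresolved()) {
            throw new UnknownHostException("cannot resolve " + addr.getHostString());
        }
        return addr;
    }

    /** Convenience wrapper: true when requireResolved would throw. */
    static boolean failsFast(InetSocketAddress addr) {
        try {
            requireResolved(addr);
            return false;
        } catch (UnknownHostException e) {
            return true;
        }
    }

    public static void main(String[] args) {
        // createUnresolved never performs a DNS lookup, so this runs offline.
        InetSocketAddress bad = InetSocketAddress.createUnresolved("no-such-host.invalid", 16020);
        InetSocketAddress good = new InetSocketAddress(InetAddress.getLoopbackAddress(), 16020);
        System.out.println(failsFast(bad));
        System.out.println(failsFast(good));
    }
}
```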
17142
17143
17144 ---
17145
17146 * [HBASE-15593](https://issues.apache.org/jira/browse/HBASE-15593) | *Major* | **Time limit of scanning should be offered by client**
17147
17148 Adds a new configuration: hbase.ipc.min.client.request.timeout
17149 The minimum allowable timeout (in milliseconds) in an rpc request's header. This configuration exists to prevent the rpc service from treating a request as timed out immediately.
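
One plausible reading of this floor, shown as a sketch: the server uses the larger of the client-supplied timeout and the configured minimum. The clamp and all names below are assumptions for illustration, not a quote of the RPC server code.

```java
// Sketch of applying a minimum to a client-supplied timeout; names are hypothetical.
class MinTimeoutSketch {
    /** Effective timeout: never below hbase.ipc.min.client.request.timeout. */
    static int effectiveTimeoutMs(int clientTimeoutMs, int minClientRequestTimeoutMs) {
        return Math.max(clientTimeoutMs, minClientRequestTimeoutMs);
    }

    public static void main(String[] args) {
        int min = 20; // example value, not a documented default
        System.out.println(effectiveTimeoutMs(1, min));     // raised to the minimum
        System.out.println(effectiveTimeoutMs(5_000, min)); // left unchanged
    }
}
```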
17150
17151
17152 ---
17153
17154 * [HBASE-15784](https://issues.apache.org/jira/browse/HBASE-15784) | *Major* | **Misuse core/maxPoolSize of LinkedBlockingQueue in ThreadPoolExecutor**
17155
17156 The core pool size and max pool size of ThreadPoolExecutor should be the same when LinkedBlockingQueue is used. Thus the configurations hbase.hconnection.threads.max, hbase.hconnection.meta.lookup.threads.max, hbase.region.replica.replication.threads.max and hbase.multihconnection.threads.max are used as the number of the core threads, and the related configurations \*.thread.core are not used any more.
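
The reason behind this is standard java.util.concurrent behavior: a ThreadPoolExecutor only grows past its core size when the queue refuses a task, and an unbounded LinkedBlockingQueue never does. The sketch below shows the pitfall and one way to size such a pool. The class and method names are hypothetical, and letting core threads time out is a design choice of this sketch.

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.LinkedBlockingQueue;
import java.util.concurrent.ThreadPoolExecutor;
import java.util.concurrent.TimeUnit;

// Sketch of sizing a pool that uses an unbounded queue; names are hypothetical.
class PoolSizingSketch {
    /** core == max, so the configured thread count is actually reachable. */
    static ThreadPoolExecutor newPool(int threads, long keepAliveSeconds) {
        ThreadPoolExecutor pool = new ThreadPoolExecutor(
            threads, threads, keepAliveSeconds, TimeUnit.SECONDS,
            new LinkedBlockingQueue<Runnable>());
        pool.allowCoreThreadTimeOut(true); // idle threads may still exit
        return pool;
    }

    /** Pitfall: core=1, max=4 with an unbounded queue never uses more than one thread. */
    static int observedThreadsWithMismatchedSizes() {
        ThreadPoolExecutor pool = new ThreadPoolExecutor(
            1, 4, 60L, TimeUnit.SECONDS, new LinkedBlockingQueue<Runnable>());
        CountDownLatch release = new CountDownLatch(1);
        try {
            for (int i = 0; i < 4; i++) {
                pool.execute(() -> {
                    try {
                        release.await(); // keep the worker busy
                    } catch (InterruptedException e) {
                        Thread.currentThread().interrupt();
                    }
                });
            }
            // One task runs, three wait in the queue; no extra threads are started.
            return pool.getPoolSize();
        } finally {
            release.countDown();
            pool.shutdown();
        }
    }

    public static void main(String[] args) {
        System.out.println(observedThreadsWithMismatchedSizes());
        ThreadPoolExecutor pool = newPool(4, 60L);
        System.out.println(pool.getCorePoolSize() == pool.getMaximumPoolSize());
        pool.shutdown();
    }
}
```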
17157
17158
17159 ---
17160
17161 * [HBASE-15651](https://issues.apache.org/jira/browse/HBASE-15651) | *Major* | **Add report-flakies.py to use jenkins api to get failing tests**
17162
17163 To find the recent set of flakies, run the script added by this patch. Pass -h to get usage information:
17164
17165 {code}
17166 $ ./dev-support/report-flakies.py -h
17167 {code}
17168
17169 If you get the below:
17170
17171 {code}
17172 $ python ./dev-support/report-flakies.py
17173 Traceback (most recent call last):
17174   File "./dev-support/report-flakies.py", line 25, in \<module\>
17175     import requests
17176 ImportError: No module named requests
17177 {code}
17178
17179 ... install the requests module:
17180
17181 {code}
17182 $ sudo pip install requests
17183 {code}
17184
17185
17186 ---
17187
17188 * [HBASE-15780](https://issues.apache.org/jira/browse/HBASE-15780) | *Critical* | **Expose AuthUtil as IA.Public**
17189
17190 Downstream users with long lived applications that need to communicate with secure HBase instances can now rely on the AuthUtil class to handle authenticating via keytab.
17191
17192 For more information, see the javadoc for the org.apache.hadoop.hbase.AuthUtil class.
17193
17194
17195 ---
17196
17197 * [HBASE-15811](https://issues.apache.org/jira/browse/HBASE-15811) | *Blocker* | **Batch Get after batch Put does not fetch all Cells**
17198
17199 We were not waiting on all executors in a batch to complete, which meant a read-your-own-writes could sometimes fail, especially if the client is loaded, i.e. putting to multiple machines in a cluster. The test for no-more-executors was damaged by the 0.99/0.98.4 fix "HBASE-11403 Fix race conditions around Object#notify".
17200
17201
17202 ---
17203
17204 * [HBASE-15801](https://issues.apache.org/jira/browse/HBASE-15801) | *Major* | **Upgrade checkstyle for all branches**
17205
17206 All active branches now use maven-checkstyle-plugin 2.17 and checkstyle 6.18.
17207
17208
17209 ---
17210
17211 * [HBASE-15236](https://issues.apache.org/jira/browse/HBASE-15236) | *Major* | **Inconsistent cell reads over multiple bulk-loaded HFiles**
17212
17213 This jira fixes the following bug:
17214 During bulk loading, if multiple hfiles correspond to the same region, and they have the same timestamps (which may have been set using importtsv.timestamp) and duplicate keys across them, then get and scan may return values coming from different hfiles.
17215
17216
17217 ---
17218
17219 * [HBASE-15740](https://issues.apache.org/jira/browse/HBASE-15740) | *Major* | **Replication source.shippedKBs metric is undercounting because it is in KB**
17220
17221 Removed Replication source.shippedKBs metric in favor of source.shippedBytes
17222
17223
17224 ---
17225
17226 * [HBASE-15773](https://issues.apache.org/jira/browse/HBASE-15773) | *Major* | **CellCounter improvements**
17227
17228 The CellCounter map reduce job now supports additional configuration options on the Scan instance it creates, using the org.apache.hadoop.hbase.mapreduce.TableInputFormat defined property names.  For a full list of the options, run ./hbase org.apache.hadoop.hbase.mapreduce.CellCounter with no arguments.
17229
17230 CellCounter also no longer creates job counters for per-rowkey and per-rowkey/qualifier cell counts.  For most tables, these counters would cause the job to fail due to mapreduce job counter limits.
17231
17232
17233 ---
17234
17235 * [HBASE-15759](https://issues.apache.org/jira/browse/HBASE-15759) | *Minor* | **RegionObserver.preStoreScannerOpen() doesn't have acces to current readpoint**
17236
17237 The following RegionObserver method is deprecated and will no longer be called in hbase 2.0:
17238
17239   public KeyValueScanner preStoreScannerOpen(final ObserverContext\<RegionCoprocessorEnvironment\> c,
17240       final Store store, final Scan scan, final NavigableSet\<byte[]\> targetCols,
17241       final KeyValueScanner s) throws IOException {
17242
17243 Instead, override this method:
17244
17245   public KeyValueScanner preStoreScannerOpen(final ObserverContext\<RegionCoprocessorEnvironment\> c,
17246       final Store store, final Scan scan, final NavigableSet\<byte[]\> targetCols,
17247       final KeyValueScanner s, final long readPt) throws IOException {
17248
17249
17250 ---
17251
17252 * [HBASE-15743](https://issues.apache.org/jira/browse/HBASE-15743) | *Major* | **Add Transparent Data Encryption support for FanOutOneBlockAsyncDFSOutput**
17253
17254 The AsyncFSWAL can now write data to an encryption zone on HDFS.
17255
17256
17257 ---
17258
17259 * [HBASE-15767](https://issues.apache.org/jira/browse/HBASE-15767) | *Major* | **Upgrade httpclient dependency**
17260
17261 HBase now relies on version 4.3.6 of the Apache HttpComponents HttpClient library. Downstream users who are exposed to it via the HBase classpath will have to similarly update their dependency.
17262
17263
17264 ---
17265
17266 * [HBASE-15575](https://issues.apache.org/jira/browse/HBASE-15575) | *Minor* | **Rename table DDL \*Handler methods in MasterObserver to more meaningful names**
17267
17268 **WARNING: No release note provided for this change.**
17269
17270
17271 ---
17272
17273 * [HBASE-15720](https://issues.apache.org/jira/browse/HBASE-15720) | *Major* | **Print row locks at the debug dump page**
17274
17275 Adds a section to the debug dump page listing current row locks held.
17276
17277
17278 ---
17279
17280 * [HBASE-15703](https://issues.apache.org/jira/browse/HBASE-15703) | *Critical* | **Deadline scheduler needs to return to the client info about skipped calls, not just drop them**
17281
17282 With the previous deadline mode of RPC scheduling (the implementation in SimpleRpcScheduler, which is basically a FIFO except that long-running scans are de-prioritized) and with the FIFO-based RPC scheduler, clients get CallQueueTooBigException when the RPC call queue is full.
17283
17284 With this patch, when the hbase.ipc.server.callqueue.type property is set to "codel" mode, clients may also get CallDroppedException, which means the server discarded the request because it considers itself overloaded and has started dropping requests to avoid going down under the load. Clients retry upon receiving this exception. It doesn't clear the MetaCache of region locations.
17285
17286
17287 ---
17288
17289 * [HBASE-15281](https://issues.apache.org/jira/browse/HBASE-15281) | *Major* | **Allow the FileSystem inside HFileSystem to be wrapped**
17290
17291 This patch adds a new configuration property, hbase.fs.wrapper. If provided, it should be the fully qualified name of the class used as a pluggable wrapper for HFileSystem. This may be useful for specific debugging/tracing needs.
17292
17293
17294 ---
17295
17296 * [HBASE-15551](https://issues.apache.org/jira/browse/HBASE-15551) | *Minor* | **Make call queue too big exception use servername**
17297
17298 Fixes an issue where the CallQueueTooBig exception returned to the client could print useless address info (like 0.0.0.0) if the RPC server is listening on something other than the host name, making troubleshooting inconvenient.
17299
17300
17301 ---
17302
17303 * [HBASE-15711](https://issues.apache.org/jira/browse/HBASE-15711) | *Major* | **Add client side property to allow logging details for batch errors**
17304
17305 HBASE-15711 introduces a new client side property, hbase.client.log.batcherrors.details, to allow logging the full stacktrace of exceptions for batch errors. It is disabled by default; setting the property to true enables it.
17306
17307
17308 ---
17309
17310 * [HBASE-15686](https://issues.apache.org/jira/browse/HBASE-15686) | *Major* | **Add override mechanism for the exempt classes when dynamically loading table coprocessor**
17311
17312 A new coprocessor table descriptor attribute, hbase.coprocessor.classloader.included.classes, is added.
17313 Users can specify class name prefixes (semicolon separated) that should be loaded by CoprocessorClassLoader through this attribute, using the following syntax:
17314 {code}
17315   hbase\> alter 't1',    'coprocessor'=\>'hdfs:///foo.jar\|com.foo.FooRegionObserver\|1001\|arg1=1,arg2=2'
17316 {code}
17317
17318
17319 ---
17320
17321 * [HBASE-15645](https://issues.apache.org/jira/browse/HBASE-15645) | *Critical* | **hbase.rpc.timeout is not used in operations of HTable**
17322
17323 Fixes regression where hbase.rpc.timeout configuration was ignored in branch-1.0+
17324
17325 Adds new methods setOperationTimeout, getOperationTimeout, setRpcTimeout, and getRpcTimeout to Table. In branch-1.3+ they are public interfaces and in 1.0-1.2 they are labeled as @InterfaceAudience.Private.
17326
17327 Adds hbase.client.operation.timeout to hbase-default.xml with default of 1200000
17328
17329
17330 ---
17331
17332 * [HBASE-15477](https://issues.apache.org/jira/browse/HBASE-15477) | *Major* | **Do not save 'next block header' when we cache hfileblocks**
17333
17334 Fix over-persisting in blockcache; no longer save the block PLUS the header of the next block (33 bytes) when writing the cache.
17335
17336 Also removes support for hfileblock v1; hfile block v1 was used when writing hfile v1. hfile v1 was the default in hbase before hbase-0.92. hbase-0.96 would not start unless all v1 hfiles had been compacted out of the cluster.
17337
17338
17339 ---
17340
17341 * [HBASE-15628](https://issues.apache.org/jira/browse/HBASE-15628) | *Major* | **Implement an AsyncOutputStream which can work with any FileSystem implementation**
17342
17343 Introduce an AsyncFSOutput interface which is an abstraction of the original FanOutOneBlockAsyncDFSOutput. Now you can create AsyncFSOutput on any FileSystem using the method AsyncFSOutputHelper.createOutput. The returned AsyncFSOutput will be FanOutOneBlockAsyncDFSOutput if the given FileSystem is a DistributedFileSystem.
17344
17345
17346 ---
17347
17348 * [HBASE-15392](https://issues.apache.org/jira/browse/HBASE-15392) | *Major* | **Single Cell Get reads two HFileBlocks**
17349
Previously, for an explicit Get with one or more columns specified, we were at a minimum overseeking: we read until we tripped over the next row, regardless, and only then returned. If the next row was in the same block, we just did too much seeking; but if the next row was in the next block (or in the block beyond that), we kept seeking and loading blocks until we found the next row before returning.
17351
17352 There remains one case where we will still 'overread'. It is when the row end aligns with the end of the block. In this case we will load the next block just to find that there are no more cells in the current row. See HBASE-15457.
17353
17354
17355 ---
17356
17357 * [HBASE-15671](https://issues.apache.org/jira/browse/HBASE-15671) | *Major* | **Add per-table metrics on memstore, storefile and regionsize**
17358
17359 Adds storeFileSize, memstoreSize and tableSize to the per-table metrics.
17360
17361
17362 ---
17363
17364 * [HBASE-15366](https://issues.apache.org/jira/browse/HBASE-15366) | *Major* | **Add doc, trace-level logging, and test around hfileblock**
17365
17366 No functional change. Added javadoc, comments, and extra trace-level logging to make clear what is happening around the reading and caching of hfile blocks.
17367
17368
17369 ---
17370
17371 * [HBASE-15368](https://issues.apache.org/jira/browse/HBASE-15368) | *Major* | **Add pluggable window support**
17372
Use 'hbase.hstore.compaction.date.tiered.window.factory.class' to specify the window implementation to use for date tiered compaction. Currently the only implementation, which is also the default, is org.apache.hadoop.hbase.regionserver.compactions.ExponentialCompactionWindowFactory.
17374
17375 {code}
17376 \<property\>
17377 \<name\>hbase.hstore.compaction.date.tiered.window.factory.class\</name\>
17378 \<value\>org.apache.hadoop.hbase.regionserver.compactions.ExponentialCompactionWindowFactory\</value\>
\</property\>
17381 {code}
17382
17383
17384 ---
17385
17386 * [HBASE-15518](https://issues.apache.org/jira/browse/HBASE-15518) | *Major* | **Add Per-Table metrics back**
17387
17388 Adds per-table metrics aggregated from per-region metrics in region server metrics. New metrics are available under JMX section "Hadoop:service=HBase,name=RegionServer,sub=Tables" and they are available via hadoop metrics2 collectors.
17389
17390
17391 ---
17392
17393 * [HBASE-15640](https://issues.apache.org/jira/browse/HBASE-15640) | *Major* | **L1 cache doesn't give fair warning that it is showing partial stats only when it hits limit**
17394
The blockcache UI tab would stop refreshing at 100k blocks (configurable, see "hbase.ui.blockcache.by.file.max"), which isn't very many blocks when doing a big cache, giving a misleading picture of the content of the L1 and/or L2 cache. The default limit is raised to 1M blocks (the UI takes a while, but only a few seconds, to count over 1M blocks).
17396
17397 Also, when beyond the limit give the user a noticeable WARNING in the UI.
17398
17399
17400 ---
17401
17402 * [HBASE-15386](https://issues.apache.org/jira/browse/HBASE-15386) | *Major* | **PREFETCH\_BLOCKS\_ON\_OPEN in HColumnDescriptor is ignored**
17403
17404 This was a non-issue. The PREFETCH\_... flag actually works. While here though made the following additions.
17405
17406 Changes the prefetch TRACE-level loggings to include the word 'Prefetch' in them so you know what they are about.
17407
17408 Changes the cryptic logging of the CacheConfig#toString to have some preamble saying why and what column family is responsible (helps figure what is going on)
17409
17410 Add test that verifies setting flag on HColumnDescriptor actually works.
17411
17412
17413 ---
17414
17415 * [HBASE-13372](https://issues.apache.org/jira/browse/HBASE-13372) | *Major* | **Unit tests for SplitTransaction and RegionMergeTransaction listeners**
17416
17417 HBASE-13372 Add unit tests for SplitTransaction and RegionMergeTransaction listeners
17418
17419
17420 ---
17421
17422 * [HBASE-15187](https://issues.apache.org/jira/browse/HBASE-15187) | *Major* | **Integrate CSRF prevention filter to REST gateway**
17423
17424 Protection against CSRF attack can be turned on with config parameter, hbase.rest.csrf.enabled - default value is false.
17425
17426 The custom header to be sent can be changed via config parameter, hbase.rest.csrf.custom.header whose default value is "X-XSRF-HEADER".
17427
Config parameter hbase.rest.csrf.methods.to.ignore controls which HTTP methods are exempt from the custom header check.

Config parameter hbase.rest-csrf.browser-useragents-regex is a comma-separated list of regular expressions used to match against an HTTP request's User-Agent header when protection against cross-site request forgery (CSRF) is enabled for the REST server by setting hbase.rest.csrf.enabled to true.
17431
17432 The implementation came from hadoop/hadoop-common-project/hadoop-common/src/main/java/org/apache/hadoop/security/http/RestCsrfPreventionFilter.java
17433
17434 We should periodically update the RestCsrfPreventionFilter.java in hbase codebase to include fixes to the hadoop implementation.
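
As a rough illustration of how these settings interact, the following self-contained Java sketch models the check. It assumes the behavior of the Hadoop filter the implementation was taken from: a request is only checked when its User-Agent matches one of the browser regexes, requests using an ignored method pass, and any other request must carry the custom header. The class and method names, and the example values for ignored methods and user-agent patterns, are illustrative; they are not HBase APIs or confirmed defaults.

```java
import java.util.List;
import java.util.Set;
import java.util.regex.Pattern;

/** Simplified model of the CSRF header check; not the actual filter class. */
final class CsrfCheckSketch {
    private final Set<String> methodsToIgnore;      // cf. hbase.rest.csrf.methods.to.ignore
    private final List<Pattern> browserUserAgents;  // cf. hbase.rest-csrf.browser-useragents-regex

    CsrfCheckSketch(Set<String> methodsToIgnore, List<Pattern> browserUserAgents) {
        this.methodsToIgnore = methodsToIgnore;
        this.browserUserAgents = browserUserAgents;
    }

    /** Example settings; the values are assumptions chosen for illustration. */
    static CsrfCheckSketch withExampleSettings() {
        return new CsrfCheckSketch(
            Set.of("GET", "OPTIONS", "HEAD", "TRACE"),
            List.of(Pattern.compile("^Mozilla.*"), Pattern.compile("^Opera.*")));
    }

    /**
     * @param customHeaderValue value of the custom header (X-XSRF-HEADER by default),
     *                          or null if the request did not send it
     */
    boolean isAllowed(String method, String userAgent, String customHeaderValue) {
        if (!looksLikeBrowser(userAgent)) {
            return true;  // non-browser clients are not subject to the check
        }
        if (methodsToIgnore.contains(method)) {
            return true;  // ignored methods skip the header check
        }
        return customHeaderValue != null;  // presence of the header is enough
    }

    private boolean looksLikeBrowser(String userAgent) {
        if (userAgent == null) {
            return false;
        }
        for (Pattern p : browserUserAgents) {
            if (p.matcher(userAgent).matches()) {
                return true;
            }
        }
        return false;
    }
}
```

Under these example settings, a browser PUT without the header is refused, while the same PUT with the header, or from a non-browser client, goes through.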
17435
17436
17437 ---
17438
17439 * [HBASE-15481](https://issues.apache.org/jira/browse/HBASE-15481) | *Trivial* | **Add pre/post roll to WALObserver**
17440
17441 <!-- markdown -->
17442
17443
17444 WALObserver coprocessors now can receive notifications of WAL rolling via the new methods `preWALRoll` and `postWALRoll`.
17445
17446 This change is incompatible due to the addition of these methods to the `WALObserver` interface. Downstream users are encouraged to instead extend the `BaseWALObserver` class, which remains compatible through this change.
17447
17448
17449 ---
17450
17451 * [HBASE-15507](https://issues.apache.org/jira/browse/HBASE-15507) | *Major* | **Online modification of enabled ReplicationPeerConfig**
17452
17453 Added update\_peer\_config to the HBase shell and ReplicationAdmin, and provided a callback for custom replication endpoints to be notified of changes to their configuration and peer data
17454
17455
17456 ---
17457
17458 * [HBASE-15537](https://issues.apache.org/jira/browse/HBASE-15537) | *Major* | **Make multi WAL work with WALs other than FSHLog**
17459
17460 Add the delegate config for multiwal back. Now you can use 'hbase.wal.regiongrouping.delegate.provider' to specify the wal provider you want to use for multiwal. For example:
17461 {code}
17462 \<property\>
17463 \<name\>hbase.wal.regiongrouping.delegate.provider\</name\>
17464 \<value\>asyncfs\</value\>
17465 \</property\>
17466 {code}
The default value is filesystem, which is an alias for DefaultWALProvider, i.e., the FSHLog.
17468
17469
17470 ---
17471
17472 * [HBASE-15400](https://issues.apache.org/jira/browse/HBASE-15400) | *Major* | **Use DateTieredCompactor for Date Tiered Compaction**
17473
17474 With this patch combined with HBASE-15389, when we compact, we can output multiple files along the current window boundaries. There are two use cases:
1. Major compaction: We want to output date tiered store files with data older than max age archived in chunks of the window size on the higher tier. Once a window is old enough, we don't combine the windows to promote to the next tier any further. So files in these windows retain the same timespan as when they were last minor-compacted, which is the window size of the highest tier. Major compaction will touch these files and we want to maintain the same layout. This way, TTL and archiving will be simpler and more efficient.
17476 2. Bulk load files and the old file generated by major compaction before upgrading to DTCP.
17477
17478 This will change the way to enable date tiered compaction.
17479 To turn it on:
17480 hbase.hstore.engine.class: org.apache.hadoop.hbase.regionserver.DateTieredStoreEngine
17481
17482 With tiered compaction all servers in the cluster will promote windows to higher tier at the same time, so using a compaction throttle is recommended:
17483 hbase.regionserver.throughput.controller:org.apache.hadoop.hbase.regionserver.compactions.PressureAwareCompactionThroughputController
17484 hbase.hstore.compaction.throughput.higher.bound and hbase.hstore.compaction.throughput.lower.bound need to be set for desired throughput range as uncompressed rates.
17485
17486 Because there will most likely be more store files around, we need to adjust the configuration so that flush won't be blocked and compaction will be properly throttled:
hbase.hstore.blockingStoreFiles: change to 50 if using all default parameters when turning on date tiered compaction. Use 1.5~2 x projected file count if changing the parameters, where projected file count = windows per tier x tier count + incoming window min + files older than max age.
17488
17489 Because major compaction is turned on now, we also need to adjust the configuration for max file to compact according to the larger file count:
17490 hbase.hstore.compaction.max: set to the same number as hbase.hstore.blockingStoreFiles.
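
The sizing rule above can be worked through with a small Java sketch. The class and method names are hypothetical, and the tier count and number of files older than max age are made-up inputs; only the formula and the 1.5x to 2x range come from this note.

```java
/** Sizing helper for date tiered compaction settings; all names here are hypothetical. */
final class DateTieredSizing {
    private DateTieredSizing() {}

    /** Projected file count = windows per tier x tier count + incoming window min + files older than max age. */
    static int projectedFileCount(int windowsPerTier, int tierCount,
                                  int incomingWindowMin, int filesOlderThanMaxAge) {
        return windowsPerTier * tierCount + incomingWindowMin + filesOlderThanMaxAge;
    }

    /** Suggested range for hbase.hstore.blockingStoreFiles: 1.5x (rounded up) to 2x the projected count. */
    static int[] blockingStoreFilesRange(int projectedFileCount) {
        int low = (int) Math.ceil(1.5 * projectedFileCount);
        int high = 2 * projectedFileCount;
        return new int[] {low, high};
    }

    public static void main(String[] args) {
        // Example inputs only: 4 windows per tier, 4 tiers, incoming window min of 6,
        // and 2 files older than max age.
        int projected = projectedFileCount(4, 4, 6, 2);
        int[] range = blockingStoreFilesRange(projected);
        System.out.println(projected + " projected files; blockingStoreFiles between "
            + range[0] + " and " + range[1]);
    }
}
```

Whatever value is chosen for hbase.hstore.blockingStoreFiles, this note recommends setting hbase.hstore.compaction.max to the same number.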
17491
17492 For more details, please refer to the design spec at https://docs.google.com/document/d/1\_AmlNb2N8Us1xICsTeGDLKIqL6T-oHoRLZ323MG\_uy8/edit#
17493
17494
17495 ---
17496
17497 * [HBASE-15592](https://issues.apache.org/jira/browse/HBASE-15592) | *Major* | **Print Procedure WAL content**
17498
17499 Use hbase org.apache.hadoop.hbase.procedure2.store.wal.ProcedureWALPrettyPrinter
17500 to print the content of a Procedure WAL.
17501 e.g.
17502 hbase org.apache.hadoop.hbase.procedure2.store.wal.ProcedureWALPrettyPrinter -f /hbase/MasterProcWALs/state-00000000000000002571.log
17503
17504
17505 ---
17506
17507 * [HBASE-15396](https://issues.apache.org/jira/browse/HBASE-15396) | *Minor* | **Enhance mapreduce.TableSplit to add encoded region name**
17508
To aid troubleshooting of MapReduce jobs that rely on the HBase-provided input format, splits now include the encoded region name they cover.
17510
17511
17512 ---
17513
17514 * [HBASE-15568](https://issues.apache.org/jira/browse/HBASE-15568) | *Major* | **Procedure V2 - Remove CreateTableHandler in HBase Apache 2.0 release**
17515
17516 **WARNING: No release note provided for this change.**
17517
17518
17519 ---
17520
17521 * [HBASE-15521](https://issues.apache.org/jira/browse/HBASE-15521) | *Major* | **Procedure V2 - RestoreSnapshot and CloneSnapshot**
17522
17523 **WARNING: No release note provided for this change.**
17524
17525
17526 ---
17527
17528 * [HBASE-15538](https://issues.apache.org/jira/browse/HBASE-15538) | *Major* | **Implement secure async protobuf wal writer**
17529
17530 Add the following config in hbase-site.xml if you want to use secure protobuf wal writer together with AsyncFSWAL
17531 {code}
17532 \<property\>
17533 \<name\>hbase.regionserver.hlog.async.writer.impl\</name\>
17534 \<value\>org.apache.hadoop.hbase.regionserver.wal.SecureAsyncProtobufLogWriter\</value\>
\</property\>
17537 {code}
17538
17539
17540 ---
17541
17542 * [HBASE-11393](https://issues.apache.org/jira/browse/HBASE-11393) | *Major* | **Replication TableCfs should be a PB object rather than a string**
17543
17544 **WARNING: No release note provided for this change.**
17545
17546
17547 ---
17548
17549 * [HBASE-15265](https://issues.apache.org/jira/browse/HBASE-15265) | *Major* | **Implement an asynchronous FSHLog**
17550
17551 To enable, set the WALProvider as follows:
17552
17553 {code}
17554 \<property\>
17555 \<name\>hbase.wal.provider\</name\>
17556 \<value\>asyncfs\</value\>
\</property\>
17559 {code}
17560
17561 To check which provider is active, look for the log line:
17562
17563 LOG.info("Instantiating WALProvider of type " + clazz);
17564
17565
17566 ---
17567
17568 * [HBASE-14256](https://issues.apache.org/jira/browse/HBASE-14256) | *Major* | **Flush task message may be confusing when region is recovered**
17569
17570 HBASE-14256 Correct confusing flush task message
17571
17572
17573 ---
17574
17575 * [HBASE-15212](https://issues.apache.org/jira/browse/HBASE-15212) | *Major* | **RPCServer should enforce max request size**
17576
17577 Adds a configuration parameter "hbase.ipc.max.request.size" which defaults to 256MB to protect the server against very large incoming RPC requests. All requests larger than this size will be immediately rejected before allocating any resources (memory allocation, etc).
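
The idea of rejecting on the declared size, before allocating anything, can be sketched as follows. This is a minimal illustration with a hypothetical class, not the RPC server code; it assumes 256MB means 256 x 1024 x 1024 bytes.

```java
/** Illustrative size guard; the class is hypothetical, not part of HBase. */
final class RequestSizeGuard {
    /** 256MB, the default this note gives for hbase.ipc.max.request.size. */
    static final long DEFAULT_MAX_REQUEST_SIZE = 256L * 1024 * 1024;

    private final long maxRequestSize;

    RequestSizeGuard(long maxRequestSize) {
        this.maxRequestSize = maxRequestSize;
    }

    /**
     * Decides from the declared length alone, so no buffer needs to be
     * allocated for a request that is going to be rejected anyway.
     */
    boolean accept(long declaredLength) {
        return declaredLength >= 0 && declaredLength <= maxRequestSize;
    }
}
```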
17578
17579
17580 ---
17581
17582 * [HBASE-15412](https://issues.apache.org/jira/browse/HBASE-15412) | *Major* | **Add average region size metric**
17583
Adds a new metric called "averageRegionSize" that is emitted as a regionserver metric. Metric description:
17585 Average region size over the region server including memstore and storefile sizes
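
One way to read the metric description is as the plain mean of per-region (memstore + storefile) sizes. The sketch below makes that assumption explicit; the class is hypothetical and the exact computation in the regionserver may differ.

```java
/** Mean of per-region sizes, where a region's size is memstore plus storefile bytes. */
final class AverageRegionSize {
    private AverageRegionSize() {}

    static long average(long[] memstoreSizes, long[] storeFileSizes) {
        if (memstoreSizes.length != storeFileSizes.length) {
            throw new IllegalArgumentException("one memstore and one storefile size per region expected");
        }
        if (memstoreSizes.length == 0) {
            return 0;  // no regions hosted, nothing to average
        }
        long total = 0;
        for (int i = 0; i < memstoreSizes.length; i++) {
            total += memstoreSizes[i] + storeFileSizes[i];
        }
        return total / memstoreSizes.length;
    }
}
```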
17586
17587
17588 ---
17589
17590 * [HBASE-15479](https://issues.apache.org/jira/browse/HBASE-15479) | *Major* | **No more garbage or beware of autoboxing**
17591
17592 This fix decreases client's memory allocation during writes by more than 50%.
17593
17594
17595 ---
17596
17597 * [HBASE-15322](https://issues.apache.org/jira/browse/HBASE-15322) | *Critical* | **Operations using Unsafe path broken for platforms not having sun.misc.Unsafe**
17598
17599 **WARNING: No release note provided for this change.**
17600
17601
17602 ---
17603
17604 * [HBASE-12940](https://issues.apache.org/jira/browse/HBASE-12940) | *Major* | **Expose listPeerConfigs and getPeerConfig to the HBase shell**
17605
17606 Adds get\_peer\_config and list\_peer\_configs to the hbase shell.
17607
17608
17609 ---
17610
17611 * [HBASE-15430](https://issues.apache.org/jira/browse/HBASE-15430) | *Critical* | **Failed taking snapshot - Manifest proto-message too large**
17612
Fixes snapshot failures caused by the manifest proto-message being too large. Adds a property ("snapshot.manifest.size.limit") to change the max size of the proto-message.
17614
17615
17616 ---
17617
17618 * [HBASE-15323](https://issues.apache.org/jira/browse/HBASE-15323) | *Major* | **Hbase Rest CheckAndDeleteAPi should be able to delete more cells**
17619
Fixed an issue in the REST server checkAndDelete operation so that the remaining cells, other than the to-be-checked column, are also applied in the Delete operation. Also fixed an issue in RemoteHTable where the Delete object was not passed correctly to the REST server side.
17621
17622
17623 ---
17624
17625 * [HBASE-15377](https://issues.apache.org/jira/browse/HBASE-15377) | *Major* | **Per-RS Get metric is time based, per-region metric is size-based**
17626
17627 Per-region metrics related to Get histograms are changed from being response size based into being latency based similar to the per-regionserver metrics of the same name.
17628
17629 Added GetSize histogram metrics at the per-regionserver and per-region level for the response sizes.
17630
17631
17632 ---
17633
17634 * [HBASE-6721](https://issues.apache.org/jira/browse/HBASE-6721) | *Major* | **RegionServer Group based Assignment**
17635
17636 [ADVANCED USERS ONLY] This patch adds a new experimental module hbase-rsgroup. It is an advanced feature for partitioning regionservers into distinctive groups for strict isolation, and should only be used by users who are sophisticated enough to understand the full implications and have a sufficient background in managing HBase clusters.
17637
17638 RSGroups can be defined and managed with shell commands or corresponding Java APIs. A server can be added to a group with hostname and port pair, and tables can be moved to this group so that only regionservers in the same rsgroup can host the regions of the table. RegionServers and tables can only belong to 1 group at a time. By default, all tables and regionservers belong to the "default" group. System tables can also be put into a group using the regular APIs. A custom balancer implementation tracks assignments per rsgroup and makes sure to move regions to the relevant regionservers in that group. The group information is stored in a regular HBase table, and a zookeeper-based read-only cache is used at the cluster bootstrap time.
17639
17640 To enable, add the following to your hbase-site.xml and restart your Master:
17641
17642
17643  \<property\>
17644    \<name\>hbase.coprocessor.master.classes\</name\>
17645    \<value\>org.apache.hadoop.hbase.rsgroup.RSGroupAdminEndpoint\</value\>
17646  \</property\>
17647  \<property\>
17648    \<name\>hbase.master.loadbalancer.class\</name\>
17649    \<value\>org.apache.hadoop.hbase.rsgroup.RSGroupBasedLoadBalancer\</value\>
17650  \</property\>
17651
17652
17653 Then use the shell 'rsgroup' commands to create and manipulate regionserver groups: e.g. to add a group and then add a server to it, do as follows:
17654
17655  hbase(main):008:0\> add\_rsgroup 'my\_group'
17656  Took 0.5610 seconds
17657
17658 This adds a group to the 'hbase:rsgroup' system table. Add a server (hostname + port) to the group using the 'move\_rsgroup\_servers' command as follows:
17659
17660  hbase(main):010:0\> move\_rsgroup\_servers 'my\_group',['k.att.net:51129']
17661
17662
17663 ---
17664
17665 * [HBASE-15435](https://issues.apache.org/jira/browse/HBASE-15435) | *Major* | **Add WAL (in bytes) written metric**
17666
17667 Adds a new metric named "writtenBytes" as a per-regionserver metric. Metric Description:
17668 Size (in bytes) of the data written to the WAL.
17669
17670
17671 ---
17672
17673 * [HBASE-13963](https://issues.apache.org/jira/browse/HBASE-13963) | *Critical* | **avoid leaking jdk.tools**
17674
17675 HBase now ensures that the JDK tools jar used during the build process is not exposed to downstream clients as a transitive dependency of hbase-annotations.
17676
17677 If you need to have the JDK tools jar in your classpath, you should add a system dependency on it. See the hbase-annotations pom for an example of the necessary pom additions.
17678
17679
17680 ---
17681
17682 * [HBASE-15271](https://issues.apache.org/jira/browse/HBASE-15271) | *Major* | **Spark Bulk Load: Need to write HFiles to tmp location then rename to protect from Spark Executor Failures**
17683
17684 When using the bulk load helper provided by the hbase-spark module, output files will now be written into temporary files and only made available when the executor has successfully completed.
17685
17686 Previously, failed executors would leave their files in place in a way that would be picked up by a bulk load command. This caused retried failures to include spurious copies of some cells.
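
The write-then-rename pattern described here can be shown with a short Java sketch. It uses the local filesystem as a stand-in for HDFS, and the file names and the temporary-name prefix are hypothetical; it is not the hbase-spark implementation.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardCopyOption;

/** Write to a temporary name first, then rename, so readers never see partial output. */
final class WriteThenRename {
    private WriteThenRename() {}

    static Path publish(Path finalFile, byte[] contents) {
        // Hypothetical naming scheme for the temporary file.
        Path tmp = finalFile.resolveSibling("_tmp_" + finalFile.getFileName());
        try {
            Files.write(tmp, contents);
            // If the writer dies before this line, only the temporary file exists.
            return Files.move(tmp, finalFile, StandardCopyOption.ATOMIC_MOVE);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    /** Publishes one small file into a fresh temp directory and checks the result. */
    static boolean demo() {
        try {
            Path dir = Files.createTempDirectory("bulkload-sketch");
            Path published = publish(dir.resolve("hfile-0001"), new byte[] {1, 2, 3});
            return Files.exists(published) && !Files.exists(dir.resolve("_tmp_hfile-0001"));
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }
}
```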
17687
17688
17689 ---
17690
17691 * [HBASE-15364](https://issues.apache.org/jira/browse/HBASE-15364) | *Major* | **Fix unescaped \< characters in Javadoc**
17692
17693 HBASE-15364 Fix unescaped \< and \> characters in Javadoc
17694
17695
17696 ---
17697
17698 * [HBASE-15243](https://issues.apache.org/jira/browse/HBASE-15243) | *Major* | **Utilize the lowest seek value when all Filters in MUST\_PASS\_ONE FilterList return SEEK\_NEXT\_USING\_HINT**
17699
When all filters in a MUST\_PASS\_ONE FilterList return a SEEK\_NEXT\_USING\_HINT code, we return SEEK\_NEXT\_USING\_HINT from FilterList#filterKeyValue() to utilize the lowest seek value.
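
Under MUST\_PASS\_ONE a cell is kept if any filter accepts it, so the scan can only skip to a position that no filter is still interested in, which is the smallest of the hints. The sketch below shows that selection using plain byte arrays compared in unsigned lexicographic order; this is a simplification, since the real code compares Cells, and the class name is hypothetical.

```java
import java.util.Arrays;

/** Picks the smallest of several seek hints; a simplified stand-in for Cell comparison. */
final class LowestSeekHint {
    private LowestSeekHint() {}

    /** Returns the smallest hint in unsigned lexicographic byte order, or null if none are given. */
    static byte[] lowest(byte[]... hints) {
        byte[] min = null;
        for (byte[] hint : hints) {
            if (min == null || Arrays.compareUnsigned(hint, min) < 0) {
                min = hint;
            }
        }
        return min;
    }
}
```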
17701
17702
17703 ---
17704
17705 * [HBASE-15354](https://issues.apache.org/jira/browse/HBASE-15354) | *Major* | **Use same criteria for clearing meta cache for all operations**
17706
This patch fixes some issues where the MetaCache (region location cache) gets unnecessarily dropped on the client.

On the master branch, RegionServerCallable and RegionServerAdminCallable now pass the actual exception down to Connection#updateCachedLocation, so we can check there whether the exception is "meta-clearing" or not.

On branch-1, branch-1.2 and branch-1.3 we now check whether the exception is meta-clearing in AsyncProcess (this check was already there on master, but not on earlier branches).
17712
17713
17714 ---
17715
17716 * [HBASE-15376](https://issues.apache.org/jira/browse/HBASE-15376) | *Major* | **ScanNext metric is size-based while every other per-operation metric is time based**
17717
17718 Removed ScanNext histogram metrics as regionserver level and per-region level metrics since the semantics is not compatible with other similar metrics (size histogram vs latency histogram).
17719
17720 Instead, this patch adds ScanTime and ScanSize histogram metrics at the regionserver and per-region level.
17721
17722
17723 ---
17724
17725 * [HBASE-15338](https://issues.apache.org/jira/browse/HBASE-15338) | *Minor* | **Add a option to disable the data block cache for testing the performance of underlying file system**
17726
Adds a new config, hbase.block.data.cacheonread, which is a global switch for caching data blocks on read. The default value of this switch is true, and data blocks will be cached on read if the block cache is enabled for the family and the cacheBlocks flag is set to true for get and scan operations. If this global switch is set to false, data blocks won't be cached even if the block cache is enabled for the family and the cacheBlocks flag of Gets or Scans is set to true. Bloom blocks and index blocks are always cached if the block cache of the regionserver is enabled. One use of this switch is performance testing of the extreme case where every data block lookup misses the cache and all data blocks are read from the underlying file system.
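
The rules in this note reduce to two boolean conditions, sketched below. The class and parameter names are hypothetical; only the conditions themselves are taken from the text.

```java
/** Decision table for cache-on-read as described in this note; names are hypothetical. */
final class CacheOnReadDecision {
    private CacheOnReadDecision() {}

    /** Data blocks need the global switch, the family setting and the per-request flag. */
    static boolean shouldCacheDataBlock(boolean globalCacheDataOnRead,
                                        boolean familyBlockCacheEnabled,
                                        boolean requestCacheBlocks) {
        return globalCacheDataOnRead && familyBlockCacheEnabled && requestCacheBlocks;
    }

    /** Bloom and index blocks only depend on the regionserver block cache being enabled. */
    static boolean shouldCacheIndexOrBloomBlock(boolean regionServerBlockCacheEnabled) {
        return regionServerBlockCacheEnabled;
    }
}
```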
17728
17729
17730 ---
17731
17732 * [HBASE-15136](https://issues.apache.org/jira/browse/HBASE-15136) | *Critical* | **Explore different queuing behaviors while busy**
17733
Previously, the RPC request scheduler in HBase had two modes it could operate in:
17735
17736  - simple FIFO
17737  - "partial" deadline, where deadline constraints are only imposed on long-running scan requests.
17738
This patch adds a new type of scheduler to HBase, based on the research around the controlled delay (CoDel) algorithm [1], used in networking to combat bufferbloat, as well as some analysis on generalizing it to generic request queues [2]. The purpose of that work is to prevent long-standing call queues caused by a discrepancy between request rate and available throughput, which in turn is caused by kernel/disk IO/networking stalls.

The new RPC scheduler can be enabled by setting hbase.ipc.server.callqueue.type=codel in the configuration. Several additional parameters configure the algorithm's behavior:
17742
17743 hbase.ipc.server.callqueue.codel.target.delay
17744 hbase.ipc.server.callqueue.codel.interval
17745 hbase.ipc.server.callqueue.codel.lifo.threshold
17746
17747 [1] Controlling Queue Delay / A modern AQM is just one piece of the solution to bufferbloat. http://queue.acm.org/detail.cfm?id=2209336
17748 [2] Fail at Scale / Reliability in the face of rapid change. http://queue.acm.org/detail.cfm?id=2839461
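
To give a feel for the adaptive LIFO idea discussed in [2], here is a minimal Java sketch of a queue that serves calls first-in-first-out until it fills past a threshold, then serves the newest call first. Treating the lifo.threshold parameter as a fraction of queue capacity is an assumption, the class is hypothetical, and the delay-based dropping governed by the target delay and interval parameters is left out entirely.

```java
import java.util.ArrayDeque;

/** FIFO queue that switches to LIFO when it fills past a threshold; illustrative only. */
final class AdaptiveLifoSketch<T> {
    private final ArrayDeque<T> queue = new ArrayDeque<>();
    private final int capacity;
    private final double lifoThreshold;  // assumed to be a fraction of capacity, e.g. 0.8

    AdaptiveLifoSketch(int capacity, double lifoThreshold) {
        this.capacity = capacity;
        this.lifoThreshold = lifoThreshold;
    }

    /** Enqueues a call, or returns false if the queue is already full. */
    boolean offer(T call) {
        if (queue.size() >= capacity) {
            return false;
        }
        queue.addLast(call);
        return true;
    }

    /** Returns the next call to serve, or null if the queue is empty. */
    T poll() {
        boolean overloaded = queue.size() > capacity * lifoThreshold;
        // Under overload the newest call is the one most likely to still be awaited by its client.
        return overloaded ? queue.pollLast() : queue.pollFirst();
    }
}
```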
17749
17750
17751 ---
17752
17753 * [HBASE-15181](https://issues.apache.org/jira/browse/HBASE-15181) | *Major* | **A simple implementation of date based tiered compaction**
17754
17755 Date tiered compaction policy is a date-aware store file layout that is beneficial for time-range scans for time-series data.
17756
17757 When it performs well:
17758
 - reads for limited time ranges, especially scans of recent data
17760
17761 When it doesn't perform as well:
17762
 - random gets without a time range
 - frequent deletes and updates
 - out of order data writes, especially writes with timestamps in the future
 - bulk loads of historical data
17767
17768 Recommended configuration:
17769 To turn on Date Tiered Compaction (It is not recommended to turn on for the whole cluster because that will put meta table on it too and random get on meta table will be impacted):
17770 hbase.hstore.compaction.compaction.policy: org.apache.hadoop.hbase.regionserver.compactions.DateTieredCompactionPolicy
17771
17772 Parameters for Date Tiered Compaction:
hbase.hstore.compaction.date.tiered.max.storefile.age.millis: Files with max-timestamp smaller than this will no longer be compacted. Default at Long.MAX\_VALUE.
17774 hbase.hstore.compaction.date.tiered.base.window.millis: base window size in milliseconds. Default at 6 hours.
17775 hbase.hstore.compaction.date.tiered.windows.per.tier: number of windows per tier. Default at 4.
17776 hbase.hstore.compaction.date.tiered.incoming.window.min: minimal number of files to compact in the incoming window. Set it to expected number of files in the window to avoid wasteful compaction. Default at 6.
17777 hbase.hstore.compaction.date.tiered.window.policy.class: the policy to select store files within the same time window. It doesn’t apply to the incoming window. Default at exploring compaction. This is to avoid wasteful compaction.
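
To see how the base window and windows-per-tier parameters combine, the following Java sketch computes the window size at each tier. It assumes that each tier's window is windows-per-tier times the size of the window in the tier below; the class name is hypothetical.

```java
/** Window size per tier, assuming each tier multiplies the window by windowsPerTier. */
final class TierWindowSizes {
    private TierWindowSizes() {}

    /** Tier 0 is the base window; each higher tier is windowsPerTier times larger. */
    static long windowMillis(long baseWindowMillis, int windowsPerTier, int tier) {
        long size = baseWindowMillis;
        for (int i = 0; i < tier; i++) {
            size = Math.multiplyExact(size, (long) windowsPerTier);  // throws on overflow
        }
        return size;
    }

    public static void main(String[] args) {
        long sixHours = 6L * 60 * 60 * 1000;  // default base window
        for (int tier = 0; tier < 4; tier++) {
            long hours = windowMillis(sixHours, 4, tier) / (60 * 60 * 1000);
            System.out.println("tier " + tier + ": " + hours + " hours");
        }
    }
}
```

With the defaults (6 hour base window, 4 windows per tier), and under the assumption above, the windows are 6 hours, 24 hours, 96 hours, and so on.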
17778
17779 With tiered compaction all servers in the cluster will promote windows to higher tier at the same time, so using a compaction throttle is recommended:
17780 hbase.regionserver.throughput.controller:org.apache.hadoop.hbase.regionserver.compactions.PressureAwareCompactionThroughputController
17781
17782 Because there will most likely be more store files around, we need to adjust the configuration so that flush won't be blocked and compaction will be properly throttled:
hbase.hstore.blockingStoreFiles: change to 50 if using all default parameters when turning on date tiered compaction. Use 1.5~2 x projected file count if changing the parameters, where projected file count = windows per tier x tier count + incoming window min + files older than max age.
17784
17785 For more details, please refer to the design spec at https://docs.google.com/document/d/1\_AmlNb2N8Us1xICsTeGDLKIqL6T-oHoRLZ323MG\_uy8/edit#
17786
17787
17788 ---
17789
17790 * [HBASE-15290](https://issues.apache.org/jira/browse/HBASE-15290) | *Major* | **Hbase Rest CheckAndAPI should save other cells along with compared cell**
17791
Fixed an issue in the REST server checkAndPut operation so that the remaining cells, other than the to-be-checked column, are also applied in the put operation.
17793
17794
17795 ---
17796
17797 * [HBASE-15264](https://issues.apache.org/jira/browse/HBASE-15264) | *Major* | **Implement a fan out HDFS OutputStream**
17798
17799 Implement a fan-out asynchronous DFSOutputStream for implementing new WAL writer.
17800
17801
17802 ---
17803
17804 * [HBASE-13259](https://issues.apache.org/jira/browse/HBASE-13259) | *Critical* | **mmap() based BucketCache IOEngine**
17805
17806 mmap() based bucket cache can be configured by specifying the property
17807 {code}
17808 \<property\>
17809   \<name\>hbase.bucketcache.ioengine\</name\>
17810   \<value\> mmap://filepath \</value\>
17811 \</property\>
17812 {code}
This mode of bucket cache is ideal when your file-based bucket cache size is smaller than the available RAM. When the cache is bigger than the available RAM, kernel page faults will make this cache perform worse, particularly for scans.
17814
17815
17816 ---
17817
17818 * [HBASE-11927](https://issues.apache.org/jira/browse/HBASE-11927) | *Major* | **Use Native Hadoop Library for HFile checksum (And flip default from CRC32 to CRC32C)**
17819
17820 Checksumming is cpu intensive. HBase computes additional checksums for HFiles (hdfs does checksums too) and stores them inline with file data. During reading, these checksums are verified to ensure data is not corrupted. This patch tries to use Hadoop Native Library for checksum computation, if it’s available, otherwise falls back to standard Java libraries. Instructions to load NHL in HBase can be found here (http://hbase.apache.org/book.html#hadoop.native.lib).
17821
17822 Default checksum algorithm has been changed from CRC32 to CRC32C primarily because of two reasons: 1) CRC32C has better error detection properties, and 2) New Intel processors have a dedicated instruction for crc32c computation (SSE4.2 instruction set)\*. This change is fully backward compatible. Also, users should not see any differences except decrease in cpu usage. To keep old settings, set configuration ‘hbase.hstore.checksum.algorithm’ to ‘CRC32’.
17823
\* On Linux, run 'cat /proc/cpuinfo' and look for sse4\_2 in the list of flags to see if your processor supports SSE4.2.
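
CRC32 and CRC32C are different polynomials and produce different checksums for the same bytes. The sketch below shows this with the JDK's own `java.util.zip` classes (CRC32C requires Java 9 or later); it does not use the Hadoop native library or any HBase code, and the class name is hypothetical.

```java
import java.nio.charset.StandardCharsets;
import java.util.zip.CRC32;
import java.util.zip.CRC32C;
import java.util.zip.Checksum;

/** Computes CRC32 and CRC32C over the same input using JDK classes only. */
final class ChecksumComparison {
    private ChecksumComparison() {}

    static long checksum(Checksum algorithm, byte[] data) {
        algorithm.reset();
        algorithm.update(data, 0, data.length);
        return algorithm.getValue();
    }

    static long crc32(String text) {
        return checksum(new CRC32(), text.getBytes(StandardCharsets.US_ASCII));
    }

    static long crc32c(String text) {
        return checksum(new CRC32C(), text.getBytes(StandardCharsets.US_ASCII));
    }

    public static void main(String[] args) {
        // "123456789" is the conventional check input for CRC algorithms.
        System.out.printf("CRC32  = %08X%n", crc32("123456789"));
        System.out.printf("CRC32C = %08X%n", crc32c("123456789"));
    }
}
```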
17825
17826
17827 ---
17828
17829 * [HBASE-15219](https://issues.apache.org/jira/browse/HBASE-15219) | *Critical* | **Canary tool does not return non-zero exit code when one of regions is in stuck state**
17830
17831 A new flag is added for Canary tool: -treatFailureAsError
When this flag is specified, a read or write failure results in a Canary tool exit code of 5.
17833
17834
17835 ---
17836
17837 * [HBASE-14949](https://issues.apache.org/jira/browse/HBASE-14949) | *Major* | **Resolve name conflict when splitting if there are duplicated WAL entries**
17838
17839 Now we can write duplicated WAL entries into different WAL files. This feature is required by the replication consistency fix and new implementation of WAL writer.
17840
17841
17842 ---
17843
17844 * [HBASE-15100](https://issues.apache.org/jira/browse/HBASE-15100) | *Blocker* | **Master WALProcs still never clean up**
17845
17846 The constructor for o.a.h.hbase.ProcedureInfo was mistakenly labeled IA.Public in previous releases and has now changed to IA.Private. Downstream users are safe to consume ProcedureInfo objects returned from HBase public interfaces, but should not expect to be able to reliably create new instances themselves.
17847
17848 The method ProcedureInfo.setNonceKey has been removed, because it should not have been exposed to clients.
17849
17850
17851 ---
17852
17853 * [HBASE-14355](https://issues.apache.org/jira/browse/HBASE-14355) | *Major* | **Scan different TimeRange for each column family**
17854
Adds the ability to Scan each column family with a different time range. Adds new methods setColumnFamilyTimeRange and getColumnFamilyTimeRange to Scan.
17856
17857
17858 ---
17859
17860 * [HBASE-14460](https://issues.apache.org/jira/browse/HBASE-14460) | *Critical* | **[Perf Regression] Merge of MVCC and SequenceId (HBASE-8763) slowed Increments, CheckAndPuts, batch operations**
17861
17862 This release note tries to tell the general story. Dive into sub-tasks for more specific release noting.
17863
Increments, appends, checkAnd\* have been slow since hbase-1.0.0. The unification of mvcc and sequence id done by HBASE-8763 was responsible.

A ‘fast-path’ workaround was added by HBASE-15031 “Fix merge of MVCC and SequenceID performance regression in branch-1.0 for Increments”. It became available in 1.0.3 and 1.1.3. To enable the fast path, set "hbase.increment.fast.but.narrow.consistency" and then do a rolling restart. The workaround was for increments only (appends, checkAndPut, etc., were not addressed; see the HBASE-15031 release note for more detail).
17867
17868 Subsequently, the regression was properly identified and fixed in HBASE-15213 and the fix applied to branch-1.0 and branch-1.1. As it happens, hbase-1.2.0 does not suffer from the performance regression (though the thought was that it did -- and so it got the fast-path patch too via HBASE-15092) nor does the master branch. HBASE-15213 identified that HBASE-12751 (as a side effect) had cured the regression.
17869
17870 hbase-1.0.4 (if it is ever released -- 1.0 has been end-of-lifed) and hbase-1.1.4 will have the HBASE-15213 fix.  If you are suffering from the increment regression and you are on 1.0.3 or 1.1.3, you can enable the work around to get back your increment performance but you should upgrade.
17871
17872
17873 ---
17874
17875 * [HBASE-15046](https://issues.apache.org/jira/browse/HBASE-15046) | *Major* | **Perf test doing all mutation steps under row lock**
17876
Here we perf tested a realignment of the write pipeline and mvcc handling. The thought was that this work was a prerequisite for a general fix of HBASE-14460 (it turns out that realignment of the write path was not needed to fix the increment perf regression). The perf testing here made it possible to simplify writing. HBASE-15158 was just committed. This work is done.
17878
17879
17880 ---
17881
17882 * [HBASE-15158](https://issues.apache.org/jira/browse/HBASE-15158) | *Major* | **Change order in which we do write pipeline operations; do all under row locks!**
17883
Changed the write pipeline order to make it more rational and easier to reason about: all updates to the WAL, MemStore, and mvcc are now done while the read/write row lock is held, where before we'd release the lock after the WAL append and then do the sync and mvcc.
17885
17886
17887 ---
17888
17889 * [HBASE-15157](https://issues.apache.org/jira/browse/HBASE-15157) | *Major* | **Add \*PerformanceTest for Append, CheckAnd\***
17890
17891 Add append, increment, checkAndMutate, checkAndPut, and checkAndDelete tests to PerformanceEvaluation tool. Below are excerpts from new usage from PE:
17892
17893 ....
17894 Command:
17895  append          Append on each row; clients overlap on keyspace so some concurrent operations
17896  checkAndDelete  CheckAndDelete on each row; clients overlap on keyspace so some concurrent operations
17897  checkAndMutate  CheckAndMutate on each row; clients overlap on keyspace so some concurrent operations
17898  checkAndPut     CheckAndPut on each row; clients overlap on keyspace so some concurrent operations
17899  filterScan      Run scan test using a filter to find a specific row based on it's value (make sure to use --rows=20)
17900  increment       Increment on each row; clients overlap on keyspace so some concurrent operations
17901  randomRead      Run random read test
17902 ....
17903 Examples:
17904 ...
17905  To run 10 clients doing increments over ten rows:
17906  $ bin/hbase org.apache.hadoop.hbase.PerformanceEvaluation --rows=10 --nomapred increment 10
17907
17908 Removed IncrementPerformanceTest. It is not as configurable as the additions made here.
17909
17910
17911 ---
17912
17913 * [HBASE-15218](https://issues.apache.org/jira/browse/HBASE-15218) | *Blocker* | **On RS crash and replay of WAL, loosing all Tags in Cells**
17914
This issue fixes the following:
- With a normal (not encrypted) WAL, we were losing all cell tags on WAL replay after an RS crash.
- With an encrypted WAL, we were not even persisting Cell tags in the WAL. Tags from all Cells not yet flushed to an HFile were lost even after WAL replay recovery completed.

As we use tags for Cell level security, this fixes 2 security issues:
 - Cell level visibility labels security breach: a visibility-restricted cell could become globally readable.
 - Cell level ACL availability issue: a user who is authorized at the cell level to read a cell could not read it. For that user this amounts to data loss.
17922
17923
17924 ---
17925
17926 * [HBASE-15129](https://issues.apache.org/jira/browse/HBASE-15129) | *Major* | **Set default value for hbase.fs.tmp.dir rather than fully depend on hbase-default.xml**
17927
Before HBASE-15129, if hbase-default.xml was somehow not on the classpath, the default values for hbase.fs.tmp.dir and hbase.bulkload.staging.dir were left empty. After HBASE-15129, the default values of both properties are set to "/user/\<user.name\>/hbase-staging".
17929
17930
17931 ---
17932
17933 * [HBASE-14969](https://issues.apache.org/jira/browse/HBASE-14969) | *Major* | **Add throughput controller for flush**
17934
17935 Adds means of throttling flush throughput. By default there is no limit; we use NoLimitThroughputController. An alternative controller, PressureAwareFlushThroughputController, allows specifying throughput bounds. A new simple factor, flush pressure, influences throughput. See PressureAwareFlushThroughputController.java class for detail.
17936
17937
17938 ---
17939
17940 * [HBASE-11425](https://issues.apache.org/jira/browse/HBASE-11425) | *Major* | **Cell/DBB end-to-end on the read-path**
17941
For an end-to-end (E2E) off-heap read path, there must first be an off-heap backed BucketCache (BC). Set 'hbase.bucketcache.ioengine' to offheap in hbase-site.xml, and specify the total capacity of the BC using the hbase.bucketcache.size config. Remember to adjust the value of 'HBASE\_OFFHEAPSIZE' in hbase-env.sh according to this capacity. This setting specifies the maximum off-heap memory allocation for the RS java process, so it should be bigger than the off-heap BC size. Keep in mind that there is no default for hbase.bucketcache.ioengine, which means the BC is turned OFF by default.
17943
The next thing to tune is the ByteBuffer pool on the RPC server side. The buffers from this pool are used to accumulate the cell bytes and create a result cell block to send back to the client side. 'hbase.ipc.server.reservoir.enabled' can be used to turn this pool ON or OFF. By default this pool is ON and available. HBase will create off-heap ByteBuffers and pool them. Make sure not to turn this OFF if you want E2E off heaping in the read path. If this pool is turned off, the server will create temporary buffers on heap to accumulate the cell bytes and make a result cell block. This can impact the GC on a server under heavy read load. The user can tune this pool with respect to how many buffers are in the pool and the size of each ByteBuffer.
Use the config 'hbase.ipc.server.reservoir.initial.buffer.size' to tune the size of each buffer. The default is 64 KB.
17946
When the read pattern is random row reads and each row is small compared to this 64 KB, try reducing this. When the result size is larger than one ByteBuffer size, the server will try to grab more than one buffer and make a result cell block out of these. When the pool is running out of buffers, the server will end up creating temporary on-heap buffers.
17948
The maximum number of ByteBuffers in the pool can be tuned using the config 'hbase.ipc.server.reservoir.initial.max'. Its value defaults to 64 \* the number of region server handlers configured (see the config 'hbase.regionserver.handler.count'). The math is as follows: by default we consider 2 MB as the result cell block size per read result, and each handler will be handling a read. For a 2 MB size, we need 32 buffers, each of size 64 KB (see the default buffer size in the pool), so 32 ByteBuffers (BB) per handler. We allocate twice this number as the max BB count so that one handler can be creating a response and handing it to the RPC Responder thread, and then handling a new request and creating a new response cell block (using pooled buffers). Even if the responder could not send back the first TCP reply immediately, this count should leave enough buffers in the pool without having to make temporary buffers on the heap. Again, for smaller sized random row reads, tune this max count. The buffers are created lazily, and the count is the maximum number to be pooled.
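
The default described above can be worked through as plain arithmetic. The following is a minimal sketch, not HBase code: the function name is hypothetical, and the handler count of 30 is only an assumed value for 'hbase.regionserver.handler.count'.

```python
def default_reservoir_max(handler_count: int,
                          result_block_bytes: int = 2 * 1024 * 1024,
                          buffer_bytes: int = 64 * 1024) -> int:
    """Estimate the default cap on pooled ByteBuffers (hypothetical helper)."""
    # Buffers needed for one result cell block: 2 MB / 64 KB = 32.
    buffers_per_response = -(-result_block_bytes // buffer_bytes)  # ceiling division
    # Doubled so a handler can build a second response while the first
    # is still waiting on the RPC Responder thread.
    return 2 * buffers_per_response * handler_count


handlers = 30  # assumed handler count, adjust to your configuration
max_buffers = default_reservoir_max(handlers)
pool_mb = max_buffers * 64 * 1024 // (1024 * 1024)
print(f"{max_buffers} buffers, up to {pool_mb} MB off heap")
```

Passing a smaller `result_block_bytes` shows how far the cap could be lowered for small random-row reads.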
17950
The setting for HBASE\_OFFHEAPSIZE in hbase-env.sh should also take this off-heap buffer pool on the RPC side into account. We need to configure the max off-heap size for the RS to be a bit higher than the sum of this max pool size and the off-heap cache size. The TCP layer will also need to create direct ByteBuffers for TCP communication, and the DFS client will need some off-heap memory to do its work, especially if short-circuit reads are configured. Allocating an extra 1 - 2 GB for the max direct memory size has worked in tests.
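
As a rough sizing sketch under the guidance above: the function name, the 8 GB BucketCache size, and the 1920-buffer pool are illustrative assumptions, not HBase defaults.

```python
GB = 1024 ** 3


def suggested_offheap_bytes(bucket_cache_bytes: int,
                            reservoir_max_buffers: int,
                            buffer_bytes: int = 64 * 1024,
                            headroom_bytes: int = 2 * GB) -> int:
    """Sum the off-heap consumers named above plus 1 - 2 GB of headroom."""
    pool_bytes = reservoir_max_buffers * buffer_bytes  # RPC ByteBuffer pool
    # Headroom covers TCP direct buffers and the DFS client.
    return bucket_cache_bytes + pool_bytes + headroom_bytes


# Hypothetical RegionServer: 8 GB BucketCache, 1920 pooled buffers of 64 KB.
total = suggested_offheap_bytes(8 * GB, 1920)
print(f"HBASE_OFFHEAPSIZE of at least {total / GB:.2f} GB")
```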
17952
If you still see GC issues even after making the E2E read path off heap, look for issues in the appropriate buffer pool. Check the RS log for the following INFO-level message:
17954
17955   "Pool already reached its max capacity : XXX and no free buffers now. Consider increasing the value for 'hbase.ipc.server.reservoir.initial.max' ?"
17956
If you are using coprocessors and refer to the Cells in the read results, DO NOT store references to these Cells outside the scope of the CP hook methods. Sometimes a CP needs to store info about a cell (like its row key) for use in the next CP hook call. In such cases, clone the required fields of the Cell as the use case requires. [ See CellUtil#cloneXXX(Cell) APIs ]
17958
17959
17960 ---
17961
17962 * [HBASE-15145](https://issues.apache.org/jira/browse/HBASE-15145) | *Major* | **HBCK and Replication should authenticate to zookepeer using server principal**
17963
17964 Added a new command line argument: --auth-as-server to enable authenticating to ZooKeeper as the HBase Server principal. This is required for secure clusters for doing replication operations like add\_peer, list\_peers, etc until HBASE-11392 is fixed. This advanced option can also be used for manually fixing secure znodes.
17965
17966 Commands can now be invoked like:
17967 hbase --auth-as-server shell
17968 hbase --auth-as-server zkcli
17969
In a secure setup, HBCK also needs to authenticate to ZK using the server principal. This is turned on by default (no need to pass an additional argument).
17971
17972 When authenticating as server, HBASE\_SERVER\_JAAS\_OPTS is concatenated to HBASE\_OPTS if defined in hbase-env.sh. Otherwise, HBASE\_REGIONSERVER\_OPTS is concatenated.
17973
17974
17975 ---
17976
17977 * [HBASE-15125](https://issues.apache.org/jira/browse/HBASE-15125) | *Major* | **HBaseFsck's adoptHdfsOrphan function creates region with wrong end key boundary**
17978
17979 **WARNING: No release note provided for this change.**
17980
17981
17982 ---
17983
17984 * [HBASE-13082](https://issues.apache.org/jira/browse/HBASE-13082) | *Major* | **Coarsen StoreScanner locks to RegionScanner**
17985
After this JIRA we no longer reset scanners after a compaction that happens during the course of a scan. The files that were compacted continue to be used by the scan. The compacted files are archived by a background thread that runs every 2 mins by default, and only when there are no active scanners on those compacted files. This interval can be controlled using the knob 'hbase.hfile.compactions.cleaner.interval'.
17987
17988
17989 ---
17990
17991 * [HBASE-14865](https://issues.apache.org/jira/browse/HBASE-14865) | *Major* | **Support passing multiple QOPs to SaslClient/Server via hbase.rpc.protection**
17992
With this patch, hbase.rpc.protection can now take multiple comma-separated QOP values. Accepted QOP values remain unchanged and are 'authentication', 'integrity', and 'privacy'. Server or client can use this configuration to specify their preference (in decreasing order) while negotiating QOP.
This feature can be used to upgrade or downgrade QOP in an online cluster without compromising availability (i.e. without taking the cluster offline). For example, to change QOP from A to B, typical steps would be:
"A" --\> "B,A" --\> rolling restart --\> "B" --\> rolling restart
17996
Sidenote: Based on experimentation, the server's choice is given higher preference than the client's choice, i.e. if the server's choices are "A,B,C" and the client's choices are "B,C,A", both A and B are acceptable, but A is chosen.
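
The observed behavior can be modeled with a small sketch. This only illustrates the preference order reported above and is not the SASL negotiation code itself; the function names are hypothetical.

```python
from typing import List, Optional


def parse_protection(value: str) -> List[str]:
    """Split a comma-separated hbase.rpc.protection value into QOPs."""
    return [qop.strip() for qop in value.split(",") if qop.strip()]


def negotiate_qop(server_value: str, client_value: str) -> Optional[str]:
    """Pick the first QOP in the server's list that the client also accepts."""
    client_qops = set(parse_protection(client_value))
    for qop in parse_protection(server_value):
        if qop in client_qops:
            return qop
    return None  # no common QOP, so the connection cannot be set up


# Mid-upgrade from 'authentication' to 'privacy': restarted nodes use
# "privacy,authentication" while the others still use "authentication".
print(negotiate_qop("privacy,authentication", "authentication"))
print(negotiate_qop("privacy,authentication", "privacy,authentication"))
```

Under this model, listing both values during the first rolling restart keeps old and new nodes able to talk to each other, which is why the intermediate "B,A" step is needed.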
17998
17999
18000 ---
18001
18002 * [HBASE-15098](https://issues.apache.org/jira/browse/HBASE-15098) | *Blocker* | **Normalizer switch in configuration is not used**
18003
18004 The config parameter, hbase.normalizer.enabled, has been dropped since it is not used in the code base.
18005
18006
18007 ---
18008
18009 * [HBASE-15111](https://issues.apache.org/jira/browse/HBASE-15111) | *Trivial* | **"hbase version" should write to stdout**
18010
The \`hbase version\` command now outputs directly to stdout rather than to a logger. This change allows the version information to be output consistently regardless of logger configuration. Naturally, this also means the command output ignores all logger configuration. Furthermore, the move from loggers to direct output changes the output of the command to omit metadata commonly included in logger output such as a timestamp, log level, and logger name.
18012
18013
18014 ---
18015
18016 * [HBASE-15027](https://issues.apache.org/jira/browse/HBASE-15027) | *Major* | **Refactor the way the CompactedHFileDischarger threads are created**
18017
The property 'hbase.hfile.compactions.discharger.interval' has been renamed to 'hbase.hfile.compaction.discharger.interval'. It sets the interval at which the compaction discharger chore service runs.
The property 'hbase.hfile.compaction.discharger.thread.count' sets the number of threads that do the compaction discharge work.
The CompactedHFilesDischarger is a chore service now started as part of the RegionServer. It iterates over all the onlineRegions in that RS and uses the RegionServer's executor service to launch a set of threads that clean up the compacted files.
18021
18022
18023 ---
18024
18025 * [HBASE-14468](https://issues.apache.org/jira/browse/HBASE-14468) | *Major* | **Compaction improvements: FIFO compaction policy**
18026
The FIFO compaction policy selects only files in which all cells have expired. The column family MUST have a non-default TTL.
Essentially, the FIFO compactor does only one job: it collects expired store files.

Because we do not do any real compaction, we do not use CPU and IO (disk and network), and we do not evict hot data from the block cache. The result: improved throughput and latency for both writes and reads.
18031 See: https://github.com/facebook/rocksdb/wiki/FIFO-compaction-style
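
The selection rule can be sketched as follows. This is an illustration only: `StoreFileInfo` is a hypothetical stand-in for store file metadata, and the sketch assumes a file's newest cell timestamp is known.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class StoreFileInfo:
    """Hypothetical stand-in for store file metadata."""
    name: str
    max_timestamp_ms: int  # timestamp of the newest cell in the file


def select_expired_files(files: List[StoreFileInfo], ttl_ms: int,
                         now_ms: int) -> List[StoreFileInfo]:
    """Return files whose newest cell is already older than the TTL."""
    # If the newest cell has expired, every cell in the file has expired.
    return [f for f in files if f.max_timestamp_ms + ttl_ms < now_ms]


files = [StoreFileInfo("a", 1_000), StoreFileInfo("b", 9_000)]
expired = select_expired_files(files, ttl_ms=5_000, now_ms=10_000)
print([f.name for f in expired])
```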
18032
18033
18034 ---
18035
18036 * [HBASE-14888](https://issues.apache.org/jira/browse/HBASE-14888) | *Major* | **ClusterSchema: Add Namespace Operations**
18037
This patch changes the semantics around namespace create/delete/modify when a coprocessor asks that the invocation be bypassed. Previously the bypass was done silently: the method would just return with no indication as to whether the bypass route had been taken or not. This patch adds a BypassCoprocessorException, which is thrown if we have been asked to bypass a call.

The bypass facility has been in place since hbase 1.0.0, when namespace creation/deletion, etc., was originally added in HBASE-8408 (HBASE-15071 is about addressing bypass handling in a general way).
18041
18042
18043 ---
18044
18045 * [HBASE-15018](https://issues.apache.org/jira/browse/HBASE-15018) | *Major* | **Inconsistent way of handling TimeoutException in the rpc client implementations**
18046
When using the new AsyncRpcClient introduced in HBase 1.1.0 (HBASE-12684), timeouts now result in an IOException wrapped around a CallTimeoutException instead of a bare CallTimeoutException. This change makes the AsyncRpcClient behave the same as the default HBase 1.y RPC client implementation.
18048
18049
18050 ---
18051
18052 * [HBASE-14796](https://issues.apache.org/jira/browse/HBASE-14796) | *Minor* | **Enhance the Gets in the connector**
18053
spark.hbase.bulkGetSize in HBaseSparkConf is used for grouping bulkGet; the default value is 1000.
18055
18056
18057 ---
18058
18059 * [HBASE-14976](https://issues.apache.org/jira/browse/HBASE-14976) | *Minor* | **Add RPC call queues to the web ui**
18060
18061 Adds column displaying current aggregated call queues size in region server queues tab UI.
18062
18063
18064 ---
18065
18066 * [HBASE-14822](https://issues.apache.org/jira/browse/HBASE-14822) | *Major* | **Renewing leases of scanners doesn't work**
18067
18068 And 1.1, 1.0, and 0.98.
18069
18070
18071 ---
18072
18073 * [HBASE-14205](https://issues.apache.org/jira/browse/HBASE-14205) | *Critical* | **RegionCoprocessorHost System.nanoTime() performance bottleneck**
18074
18075 **WARNING: No release note provided for this change.**
18076
18077
18078 ---
18079
18080 * [HBASE-14978](https://issues.apache.org/jira/browse/HBASE-14978) | *Blocker* | **Don't allow Multi to retain too many blocks**
18081
Limiting the amount of memory resident for any one request allows the server to handle concurrent requests smoothly. To this end we added the ability to limit the size of responses to a multi request. That worked well; however, it does not correctly represent the amount of memory resident. So this issue adds an approximation of the number of blocks held for a request.
18083
Clients before 1.2.0 will not get this multi request chunking based upon blocks kept. Clients 1.2.0 and after will.
18085
18086
18087 ---
18088
18089 * [HBASE-14951](https://issues.apache.org/jira/browse/HBASE-14951) | *Minor* | **Make hbase.regionserver.maxlogs obsolete**
18090
WAL roll events across a cluster can be highly correlated. They lead to memstore flushes, which in turn trigger minor compactions that can be promoted to major ones. These events are highly correlated in time if there is a balanced write load on the regions in a table. The default value for the maximum number of WAL files (\*hbase.regionserver.maxlogs\*), which controls WAL rolling events, is 32, which is too small for many modern deployments.
Now we calculate this value dynamically (if not defined by the user), using the following formula:
18093
maxLogs = Math.max(32, HBASE\_HEAP\_SIZE \* memstoreRatio \* 2 / LogRollSize), where

- memstoreRatio is \*hbase.regionserver.global.memstore.size\*
- LogRollSize is the maximum WAL file size (default 0.95 \* HDFS block size)
18098
We want to avoid, or at least minimize, events where the RS has to flush memstores prematurely only because it reached the artificial limit of hbase.regionserver.maxlogs. This is why the equation includes the 2x multiplier: it gives a maximum WAL capacity of 2 x the RS memstore size.
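
The formula can be evaluated with a short sketch. The function name is hypothetical, and the 128 MB HDFS block size, the 0.4 memstore ratio, and the truncation to an integer are assumptions of this sketch. Results scale with the block size, so they will not necessarily match figures computed for a different roll size.

```python
MB = 1024 * 1024
GB = 1024 * MB


def default_max_logs(heap_bytes: int,
                     memstore_ratio: float = 0.4,
                     hdfs_block_bytes: int = 128 * MB) -> int:
    """Evaluate max(32, heap * memstoreRatio * 2 / LogRollSize)."""
    log_roll_size = 0.95 * hdfs_block_bytes  # default maximum WAL file size
    # Truncating to an integer is an assumption of this sketch.
    return max(32, int(heap_bytes * memstore_ratio * 2 / log_roll_size))


for heap_gb in (1, 2, 32):
    print(f"{heap_gb}G heap -> maxLogs {default_max_logs(heap_gb * GB)}")
```

Small heaps stay at the floor of 32, while larger heaps get a limit that grows linearly with the memstore size.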
18100
18101 Runaway WAL files.
18102
The default log rolling period (1h) allows up to 2 x the memstore size of data to accumulate in the WAL. For a 32G heap and all other default settings, this gives ~26GB of data. Under heavy write load, the number of WAL files can increase dramatically. The RegionServer LogRoller will archive old WALs periodically. To control runaway WALs, the user has three options: override the default hbase.regionserver.maxlogs, override (decrease) the default hbase.regionserver.logroll.period, or both.
18104
For a system with a bursty write load, hbase.regionserver.logroll.period can be decreased to a lower value. In this case the maximum number of WAL files will be defined by the total size of the memstore (unflushed data), not by hbase.regionserver.maxlogs. For the majority of applications there will be no issues with the defaults: data will be flushed periodically from the memstore, the LogRoller will archive old WAL files, and the system will never reach the new defaults for hbase.regionserver.maxlogs unless it is under extreme load for a prolonged period of time. Even in that case, decreasing hbase.regionserver.logroll.period allows us to control runaway WAL files.
18106
18107 The following table gives the new default maximum log files values for several different Region Server heap sizes:
18108
| heap | memstore perc | maxLogs |
|------|---------------|---------|
| 1G   | 40%           | 32      |
| 2G   | 40%           | 32      |
| 10G  | 40%           | 80      |
| 20G  | 40%           | 160     |
| 32G  | 40%           | 256     |
18115
18116
18117 ---
18118
18119 * [HBASE-14984](https://issues.apache.org/jira/browse/HBASE-14984) | *Major* | **Allow memcached block cache to set optimze to false**
18120
18121 Setting hbase.cache.memcached.spy.optimze to true will allow the spy memcached client to try and optimize for the number of requests outstanding. This can increase throughput but can also increase variance for request times.
18122
18123 Setting it to true will help when round trip times are longer.
18124 Setting it to false ( the default ) will help ensure a more even distribution of response times.
18125
18126
18127 ---
18128
18129 * [HBASE-14534](https://issues.apache.org/jira/browse/HBASE-14534) | *Minor* | **Bump yammer/coda/dropwizard metrics dependency version**
18130
18131 Updated yammer metrics to version 3.1.2 (now it's been renamed to dropwizard). API has changed quite a bit, consult https://dropwizard.github.io/metrics/3.1.0/manual/core/ for additional information.
18132
18133 Note that among other things, in yammer 2.2.0 histograms were by default created in non-biased mode (uniform sampling), while in 3.1.0 histograms created via MetricsRegistry.histogram(...) are by default exponentially decayed. This shouldn't affect end users, though.
18134
18135
18136 ---
18137
18138 * [HBASE-14960](https://issues.apache.org/jira/browse/HBASE-14960) | *Major* | **Fallback to using default RPCControllerFactory if class cannot be loaded**
18139
18140 If the configured RPC controller factory (via hbase.rpc.controllerfactory.class) cannot be found in the classpath or loaded, we fall back to using the default RPC controller factory in HBase.
18141
18142
18143 ---
18144
18145 * [HBASE-14946](https://issues.apache.org/jira/browse/HBASE-14946) | *Critical* | **Don't allow multi's to over run the max result size.**
18146
18147 The HBase region server will now send a chunk of get responses to a client if the total response size is too large. This will only be done for clients 1.2.0 and beyond. Older clients by default will have the old behavior.
18148
18149 This patch is for the case where the basic flow is like this:
18150
18151 I want to get a single column from lots of rows. So I create a list of gets. Then I send them to table.get(List\<Get\>). If the regions for that table are spread out then those requests get chunked out to all the region servers. No one regionserver gets too many. However if one region server contains lots of regions for that table then a multi action can contain lots of gets. No single get is too onerous. However the regionserver won't return until every get is complete. So if there are thousands of gets that are sent in one multi then the regionserver can retain lots of data in one thread.
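
The chunking idea can be sketched in a few lines. This is a simplified model, not the server implementation: the function names are hypothetical, and how the unserved gets are signalled back to the client is an assumption of the sketch.

```python
from typing import Callable, List, Sequence, Tuple


def serve_multi(rows: Sequence[str], fetch: Callable[[str], bytes],
                max_response_bytes: int) -> Tuple[List[bytes], List[str]]:
    """Serve gets until the response is too large; return (results, unserved rows)."""
    if max_response_bytes <= 0:
        raise ValueError("max_response_bytes must be positive")
    results: List[bytes] = []
    used = 0
    for i, row in enumerate(rows):
        if used >= max_response_bytes:
            # Stop early and hand the remaining gets back to the client.
            return results, list(rows[i:])
        value = fetch(row)
        results.append(value)
        used += len(value)
    return results, []


def get_all(rows, fetch, max_response_bytes):
    """Client side: resubmit whatever the server did not serve."""
    pending, out = list(rows), []
    while pending:
        chunk, pending = serve_multi(pending, fetch, max_response_bytes)
        out.extend(chunk)
    return out


rows = [f"row{i}" for i in range(10)]
first_chunk, remaining = serve_multi(rows, lambda r: b"x" * 100, 250)
print(len(first_chunk), len(remaining))
```

Each call serves at least one get, so the client loop always makes progress while the server never holds much more than the configured limit for a single request.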
18152
18153
18154 ---
18155
18156 * [HBASE-14906](https://issues.apache.org/jira/browse/HBASE-14906) | *Major* | **Improvements on FlushLargeStoresPolicy**
18157
In HBASE-14906 we use "hbase.hregion.memstore.flush.size/column\_family\_number" as the default threshold for memstore flush instead of the fixed value set through the "hbase.hregion.percolumnfamilyflush.size.lower.bound" property, which makes the default threshold more flexible across use cases. We also introduce a new property named "hbase.hregion.percolumnfamilyflush.size.lower.bound.min", with 16M as the default value, to avoid small flushes in cases such as hundreds of column families.

After this change, setting "hbase.hregion.percolumnfamilyflush.size.lower.bound" in hbase-site.xml no longer takes effect, but expert users can still set this property in the table descriptor to override the default value, just as before.
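
A minimal sketch of the default threshold described above. The function name is hypothetical, the 128 MB flush size is an assumed value for hbase.hregion.memstore.flush.size, and the sketch assumes the larger of the two values wins.

```python
MB = 1024 * 1024


def flush_size_lower_bound(memstore_flush_size: int, family_count: int,
                           lower_bound_min: int = 16 * MB) -> int:
    """Default per-family flush threshold: flush.size / families, never below the minimum."""
    return max(memstore_flush_size // family_count, lower_bound_min)


flush_size = 128 * MB  # assumed hbase.hregion.memstore.flush.size
print(flush_size_lower_bound(flush_size, 3) // MB, "MB with 3 families")
print(flush_size_lower_bound(flush_size, 200) // MB, "MB with 200 families")
```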
18161
18162
18163 ---
18164
18165 * [HBASE-14769](https://issues.apache.org/jira/browse/HBASE-14769) | *Major* | **Remove unused functions and duplicate javadocs from HBaseAdmin**
18166
18167 - Removes functions from HBaseAdmin which require table name parameter as either byte[] or String. Use their counterparts which take TableName instead.
18168 - Removes redundant javadocs from HBaseAdmin as they will be automatically inherited from Admin interface.
- HBaseAdmin is marked Audience.private, so it should have been straightforward to remove the functions. But HBaseTestingUtility, which is marked Audience.public, had a public function returning an instance of it, which moved this decision into a gray area. After discussion in the community, it was decided that it would be okay to do so in this particular case.
18170
18171
18172 ---
18173
18174 * [HBASE-13153](https://issues.apache.org/jira/browse/HBASE-13153) | *Major* | **Bulk Loaded HFile Replication**
18175
This enhances HBase replication to support replication of bulk loaded data. This is configurable; by default it is set to false, which means bulk loaded data will not be replicated to peer(s). To enable it, set "hbase.replication.bulkload.enabled" to true.
18177
The following additional configurations were added for this enhancement:
 a. hbase.replication.cluster.id - This is mandatory in a cluster where replication of bulk loaded data is enabled. The sink cluster uses this id to uniquely identify a source cluster. This should be configured in the source cluster configuration file for all the RS.
18180  b. hbase.replication.conf.dir - This represents the directory where all the active cluster's file system client configurations are defined in subfolders corresponding to their respective replication cluster id in peer cluster. This should be configured in the peer cluster configuration file for all the RS. Default is HBASE\_CONF\_DIR.
18181  c. hbase.replication.source.fs.conf.provider - This represents the class which provides the source cluster file system client configuration to peer cluster. This should be configured in the peer cluster configuration file for all the RS. Default is org.apache.hadoop.hbase.replication.regionserver.DefaultSourceFSConfigurationProvider
18182
18183  For example: If source cluster FS client configurations are copied in peer cluster under directory /home/user/dc1/ then  hbase.replication.cluster.id should be configured as dc1 and hbase.replication.conf.dir as /home/user
18184
18185 Note:
 a. If the source cluster FS client configuration files in the peer cluster's replication configuration directory are modified, all RS of the peer cluster(s) must be restarted when the default hbase.replication.source.fs.conf.provider is in use.
18187  b. Only 'xml' type files will be loaded by the default hbase.replication.source.fs.conf.provider.
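
The directory layout in the example can be sketched as a lookup. This is only an illustration of the convention described above; the function name is hypothetical and the default provider's actual loading logic may differ.

```python
import os
from typing import List


def source_fs_conf_files(replication_conf_dir: str, cluster_id: str) -> List[str]:
    """List the config files a default-style provider would consider."""
    # Configs for a source cluster live in a subfolder named after its cluster id.
    cluster_dir = os.path.join(replication_conf_dir, cluster_id)
    return sorted(
        os.path.join(cluster_dir, name)
        for name in os.listdir(cluster_dir)
        if name.endswith(".xml")  # only 'xml' type files are loaded
    )
```

With hbase.replication.conf.dir set to /home/user and hbase.replication.cluster.id set to dc1, `source_fs_conf_files("/home/user", "dc1")` would look in /home/user/dc1.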
18188
18189 As part of this we have made following changes to LoadIncrementalHFiles class which is marked as Public and Stable class,
18190  a. Raised the visibility scope of LoadQueueItem class from package private to public.
18191  b. Added a new method loadHFileQueue, which loads the queue of LoadQueueItem into the table as per the region keys provided.
18192
18193
18194 ---
18195
18196 * [HBASE-7171](https://issues.apache.org/jira/browse/HBASE-7171) | *Major* | **Initial web UI for region/memstore/storefiles details**
18197
18198 HBASE-7171 adds 2 new pages to the region server Web UI to ease debugging and provide greater insight into the physical data layout.
18199
Region names in the UI table listing all regions (on the RS status page) are now hyperlinks leading to a region detail page, which shows some aggregate memstore information (currently just memory used) along with the list of all Store Files (HFiles) in the region. Names of Store Files are also hyperlinks, leading to a Store File detail page, which currently runs the 'hbase hfile' command behind the scenes and displays statistics about the store file.
18201
18202
18203 ---
18204
18205 * [HBASE-14655](https://issues.apache.org/jira/browse/HBASE-14655) | *Blocker* | **Narrow the scope of doAs() calls to region observer notifications for compaction**
18206
18207 Region observer notifications w.r.t. compaction request are now audited with request user through proper scope of doAs() calls.
18208
18209
18210 ---
18211
18212 * [HBASE-14631](https://issues.apache.org/jira/browse/HBASE-14631) | *Blocker* | **Region merge request should be audited with request user through proper scope of doAs() calls to region observer notifications**
18213
18214 Region observer notifications w.r.t. merge request are now audited with request user through proper scope of doAs() calls.
18215
18216
18217 ---
18218
18219 * [HBASE-14605](https://issues.apache.org/jira/browse/HBASE-14605) | *Blocker* | **Split fails due to 'No valid credentials' error when SecureBulkLoadEndpoint#start tries to access hdfs**
18220
18221 When split is requested by non-super user, split related notifications for Coprocessor are executed using the login of the request user.
18222 Previously the notifications were carried out as super user.
18223
18224
18225 ---
18226
18227 * [HBASE-14926](https://issues.apache.org/jira/browse/HBASE-14926) | *Major* | **Hung ThriftServer; no timeout on read from client; if client crashes, worker thread gets stuck reading**
18228
Adds a timeout to server reads from clients. Adds a new config, hbase.thrift.server.socket.read.timeout, for setting the read timeout on the server socket in milliseconds. The default is 60000.
18230
18231
18232 ---
18233
18234 * [HBASE-14825](https://issues.apache.org/jira/browse/HBASE-14825) | *Minor* | **HBase Ref Guide corrections of typos/misspellings**
18235
18236 Corrections to content of "book.html", which is pulled from various \*.adoc files and \*.xml files.
18237 -- corrects typos/misspellings
18238 -- corrects incorrectly formatted links
18239
18240
18241 ---
18242
18243 * [HBASE-14821](https://issues.apache.org/jira/browse/HBASE-14821) | *Major* | **CopyTable should allow overriding more config properties for peer cluster**
18244
Configuration properties for org.apache.hadoop.hbase.mapreduce.TableOutputFormat can now be overridden by prefixing the property keys with "hbase.mapred.output.". When the configuration is applied to TableOutputFormat, these entries will be rewritten with the prefix removed, i.e. "hbase.mapred.output.hbase.security.authentication" becomes "hbase.security.authentication". This can be useful when directing output to a peer cluster with a different security configuration, for example.
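
The rewrite can be pictured with a small sketch over a plain dictionary. The function name is hypothetical, and keeping the original prefixed entries in the result is an assumption of the sketch.

```python
from typing import Dict

OUTPUT_PREFIX = "hbase.mapred.output."


def apply_output_overrides(conf: Dict[str, str]) -> Dict[str, str]:
    """Return a copy of conf with prefixed entries also set under the unprefixed key."""
    result = dict(conf)
    for key, value in conf.items():
        if key.startswith(OUTPUT_PREFIX) and len(key) > len(OUTPUT_PREFIX):
            result[key[len(OUTPUT_PREFIX):]] = value
    return result


job_conf = {
    "hbase.security.authentication": "simple",
    "hbase.mapred.output.hbase.security.authentication": "kerberos",
}
print(apply_output_overrides(job_conf)["hbase.security.authentication"])
```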
18246
18247
18248 ---
18249
18250 * [HBASE-14799](https://issues.apache.org/jira/browse/HBASE-14799) | *Critical* | **Commons-collections object deserialization remote command execution vulnerability**
18251
18252 This issue resolves a potential security vulnerability. For all versions we update our commons-collections dependency to the release that fixes the reported vulnerability in that library. In 0.98 we additionally disable by default a feature of code carried from 0.94 for backwards compatibility that is not needed.
18253
18254
18255 ---
18256
18257 * [HBASE-12751](https://issues.apache.org/jira/browse/HBASE-12751) | *Major* | **Allow RowLock to be reader writer**
18258
18259 Locks on row are now reader/writer rather than exclusive.
18260
Moves sequenceid out of HRegion and into the MVCC class; MVCC is now in charge. A WAL append is still stamped in the same way (we pass the MVCC context in a few places where previously we did not).

MVCC methods cleaned up. They make a bit more sense now, and there are fewer of them.

Simplifies our update of MemStore/WAL. Now we update the memstore AFTER we add to the WAL (but before we sync). This fixes possible data loss when two edits came in with the same coordinates; we could order the edits in the memstore differently from how they arrived in the WAL.
18266
18267 Marked as an incompatible change because it breaks Distributed Log Replay, a feature we'd determined already was unreliable and to be removed.
18268
18269
18270 ---
18271
18272 * [HBASE-14793](https://issues.apache.org/jira/browse/HBASE-14793) | *Major* | **Allow limiting size of block into L1 block cache.**
18273
Very large blocks can fragment the heap and cause problems for the garbage collector, especially G1GC. There is now a maximum size that a block can have and still be kept in the LruBlockCache. That size defaults to 16 MB but can be controlled by changing "hbase.lru.max.block.size".
18275
18276
18277 ---
18278
18279 * [HBASE-14387](https://issues.apache.org/jira/browse/HBASE-14387) | *Major* | **Compaction improvements: Maximum off-peak compaction size**
18280
New configuration option: hbase.hstore.compaction.max.size.offpeak - the maximum selection size eligible for minor compaction during off-peak hours.
hbase.hstore.compaction.max.size - the default maximum, used if no off-peak hours are defined or if no off-peak maximum size is defined.
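
How the two limits interact can be sketched as a simple selection. The function name and the sizes in the usage lines are hypothetical; this only illustrates the fallback rule stated above.

```python
from typing import Optional

GB = 1024 ** 3


def effective_max_compaction_size(max_size: int,
                                  max_size_offpeak: Optional[int],
                                  in_offpeak_hours: bool) -> int:
    """Choose the maximum selection size for a minor compaction."""
    if in_offpeak_hours and max_size_offpeak is not None:
        return max_size_offpeak
    # No off-peak hours in effect, or no off-peak maximum configured.
    return max_size


print(effective_max_compaction_size(2 * GB, 10 * GB, in_offpeak_hours=True) // GB)
print(effective_max_compaction_size(2 * GB, None, in_offpeak_hours=True) // GB)
```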
18283
18284
18285 ---
18286
18287 * [HBASE-12822](https://issues.apache.org/jira/browse/HBASE-12822) | *Minor* | **Option for Unloading regions through region\_mover.rb without Acknowledging**
18288
18289 Incorporated in HBASE-13014.
18290
18291
18292 ---
18293
18294 * [HBASE-14700](https://issues.apache.org/jira/browse/HBASE-14700) | *Major* | **Support a "permissive" mode for secure clusters to allow "simple" auth clients**
18295
18296 Secure HBase now supports a permissive mode to allow mixed secure and insecure clients.  This allows clients to be incrementally migrated over to a secure configuration.  To enable clients to continue to connect using SIMPLE authentication when the cluster is configured for security, set "hbase.ipc.server.fallback-to-simple-auth-allowed" equal to "true" in hbase-site.xml.  NOTE: This setting should ONLY be used as a temporary measure while converting clients over to secure authentication.  It MUST BE DISABLED for secure operation.
18297
18298
18299 ---
18300
18301 * [HBASE-14257](https://issues.apache.org/jira/browse/HBASE-14257) | *Major* | **Periodic flusher only handles hbase:meta, not other system tables**
18302
18303 Memstore periodic flusher used to flush META table every 5 minutes but not any other system tables. This jira extends it to flush all system tables within this time period.
18304
18305
18306 ---
18307
18308 * [HBASE-14658](https://issues.apache.org/jira/browse/HBASE-14658) | *Major* | **Allow loading a MonkeyFactory by class name**
18309
18310 You can specify one of the predefined set of Monkeys when you run Integration Tests by passing the -m\|--monkey argument on the command line, e.g. -m CALM or -m SLOW\_DETERMINISTIC
18311
18312 This patch makes it so you can pass the name of a class as the monkey to run, e.g. -m org.example.KingKong
18313
18314
18315 ---
18316
18317 * [HBASE-14521](https://issues.apache.org/jira/browse/HBASE-14521) | *Major* | **Unify the semantic of hbase.client.retries.number**
18318
18319 After this change, hbase.client.retries.number universally means the number of retries, which is one less than the total number of tries. This holds both for non-batch operations such as get/scan/increment, which use RpcRetryingCallerImpl#callWithRetries to submit the call, and for batch operations such as put, which go through AsyncProcess#submit.
18320
18321 Note that this property previously meant the total number of tries for puts, so please adjust its value if necessary. Also be cautious about setting it to zero, since a retry is necessary to update the client cache when a region moves.
18322
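The relationship between retries and total tries can be sketched as below. This is an illustrative model only, not the HBase client code: `RetrySemantics`, `totalTries`, and `attemptsUsed` are hypothetical names.

```java
import java.util.function.BooleanSupplier;

// Hypothetical model of the unified meaning of hbase.client.retries.number.
final class RetrySemantics {
    private RetrySemantics() {}

    /** Total attempts = the initial try plus the configured number of retries. */
    static int totalTries(int retriesNumber) {
        if (retriesNumber < 0) {
            throw new IllegalArgumentException("retries must be >= 0");
        }
        return retriesNumber + 1;
    }

    /** Runs op until it succeeds or all tries are used; returns how many attempts were made. */
    static int attemptsUsed(int retriesNumber, BooleanSupplier op) {
        int tries = totalTries(retriesNumber);
        for (int attempt = 1; attempt <= tries; attempt++) {
            if (op.getAsBoolean()) {
                return attempt; // succeeded on this attempt
            }
        }
        return tries; // every try failed
    }
}
```

In this model a value of zero still allows the first try but leaves no retry for refreshing a stale region location, which is why the note above advises caution.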
18323
18324 ---
18325
18326 * [HBASE-13819](https://issues.apache.org/jira/browse/HBASE-13819) | *Major* | **Make RPC layer CellBlock buffer a DirectByteBuffer**
18327
18328 For the master branch (2.0 version), the BoundedByteBufferPool always creates and returns direct (off-heap) ByteBuffers.
18329 For branch-1 (1.3 version), the buffers returned are off-heap by default. This can be changed to return on-heap ByteBuffers by setting 'hbase.ipc.server.reservoir.direct.buffer' to false.
18330
18331
18332 ---
18333
18334 * [HBASE-14517](https://issues.apache.org/jira/browse/HBASE-14517) | *Minor* | **Show regionserver's version in master status page**
18335
18336 Adds server version to the listing of regionservers on the master home page.
18337
18338 In a cluster where the versions deviate, you will see a note in red at the bottom of the 'Version' column in the 'Region Servers' listing on the master home page that says something like: 'Total:10              9 nodes with inconsistent version'
18339
18340
18341 ---
18342
18343 * [HBASE-12911](https://issues.apache.org/jira/browse/HBASE-12911) | *Major* | **Client-side metrics**
18344
18345 Introduces collection and reporting of various client-perceived metrics. Metrics are exposed via JMX under "org.apache.hadoop.hbase.client.MetricsConnection". Metrics are scoped according to connection instance, so multiple connection objects (i.e., to different clusters) will report their metrics separately. Metrics are disabled by default and must be enabled by configuring "hbase.client.metrics.enable=true".
18346
18347
18348 ---
18349
18350 * [HBASE-14529](https://issues.apache.org/jira/browse/HBASE-14529) | *Major* | **Respond to SIGHUP to reload config**
18351
18352 HBase daemons can now be signaled to reload their config by sending SIGHUP to the java process. Not all config parameters can be reloaded.
18353
18354 In order for this new feature to work the hbase-daemon.sh script was changed to use disown rather than nohup. Functionally this shouldn't change anything but the processes will have a different parent when being run from a connected login shell.
18355
18356
18357 ---
18358
18359 * [HBASE-14502](https://issues.apache.org/jira/browse/HBASE-14502) | *Major* | **Purge use of jmock and remove as dependency**
18360
18361 HBASE-14502 Purge use of jmock and remove as dependency
18362
18363
18364 ---
18365
18366 * [HBASE-14544](https://issues.apache.org/jira/browse/HBASE-14544) | *Major* | **Allow HConnectionImpl to not refresh the dns on errors**
18367
18368 By setting hbase.resolve.hostnames.on.failure to false you can reduce the number of DNS name resolutions that a client will do. However, if machines leave and come back with different IPs, the clients will not notice the changes. So only set hbase.resolve.hostnames.on.failure to false if your cluster's DNS is not changing while clients are connected.
18369
18370
18371 ---
18372
18373 * [HBASE-14367](https://issues.apache.org/jira/browse/HBASE-14367) | *Major* | **Add normalization support to shell**
18374
18375 This patch adds shell support for region normalizer (see HBASE-13103).
18376
18377 3 commands have been added to hbase shell 'tools' command group (modeled on how the balancer works):
18378
18379  - 'normalizer\_enabled' checks whether region normalizer is turned on
18380  - 'normalizer\_switch' allows user to turn normalizer on and off
18381  - 'normalize' runs region normalizer if it's turned on.
18382
18383 The 'alter' command has also been extended to allow the user to enable or disable region normalization per table (disabled by default). Use it as follows:
18384
18385 alter 'testtable', {NORMALIZATION\_MODE =\> 'true'}
18386
18387 Here is the help for the normalize command:
18388
18389 {code}
18390 hbase(main):008:0\> help 'normalize'
18391 Trigger region normalizer for all tables which have NORMALIZATION\_MODE flag set. Returns true
18392  if normalizer ran successfully, false otherwise. Note that this command has no effect
18393  if region normalizer is disabled (make sure it's turned on using 'normalizer\_switch' command).
18394
18395  Examples:
18396
18397    hbase\> normalize
18398 {code}
18399
18400
18401 ---
18402
18403 * [HBASE-14475](https://issues.apache.org/jira/browse/HBASE-14475) | *Major* | **Region split requests are always audited with "hbase" user rather than request user**
18404
18405 Region observer notifications w.r.t. split request are now audited with request user through proper scope of doAs() calls.
18406
18407
18408 ---
18409
18410 * [HBASE-14230](https://issues.apache.org/jira/browse/HBASE-14230) | *Minor* | **replace reflection in FSHlog with HdfsDataOutputStream#getCurrentBlockReplication()**
18411
18412 Remove calling getNumCurrentReplicas on HdfsDataOutputStream via reflection. getNumCurrentReplicas showed up in hadoop 1+ and hadoop 0.2x. In hadoop-2 it was deprecated.
18413
18414
18415 ---
18416
18417 * [HBASE-14495](https://issues.apache.org/jira/browse/HBASE-14495) | *Major* | **TestHRegion#testFlushCacheWhileScanning goes zombie**
18418
18419 The WAL append was changed by HBASE-12751. Every append now sets a latch on an edit. The latch needs to be cleared or else the WAL will hang. The original failures in TestHRegion turned up 'holes' where we were failing to throw the latch if we skipped out early because we were interrupted. Other 'holes' were found where we had mocked up a WAL so the latch would just stay in place. Further holes were found when appending WAL markers; here we were skipping the mvcc completely for a few edits. A cleanup of WALUtils made all markers take the same code paths.
18420
18421
18422 ---
18423
18424 * [HBASE-14280](https://issues.apache.org/jira/browse/HBASE-14280) | *Minor* | **Bulk Upload from HA cluster to remote HA hbase cluster fails**
18425
18426 The patch takes effect only with Hadoop 2.6 or greater, which introduced "internal.nameservices".
18427 There is no change in behavior with versions older than 2.6.
18428
18429
18430 ---
18431
18432 * [HBASE-14334](https://issues.apache.org/jira/browse/HBASE-14334) | *Major* | **Move Memcached block cache in to it's own optional module.**
18433
18434 Moves the external block cache to its own module. This reduces dependencies for people who use hbase-server.
18435 Currently Memcached is the reference implementation for the external block cache. External block caches allow HBase to take advantage of other, more complex caches that can live longer than the HBase regionserver process and are not necessarily tied to the lifetime of a single computer. However, external block caches add extra operational overhead.
18437
18438
18439 ---
18440
18441 * [HBASE-14433](https://issues.apache.org/jira/browse/HBASE-14433) | *Major* | **Set down the client executor core thread count from 256 in tests**
18442
18443 Tests run with client executors that have core thread count of 4 and a keepalive of 3 seconds. They used to default to 256 core threads and 60 seconds  for keepalive.
18444
18445
18446 ---
18447
18448 * [HBASE-14400](https://issues.apache.org/jira/browse/HBASE-14400) | *Critical* | **Fix HBase RPC protection documentation**
18449
18450 To use RPC protection in HBase, set the value of 'hbase.rpc.protection' to one of:
18451  - 'authentication' : simple authentication using Kerberos
18452  - 'integrity' : authentication and integrity
18453  - 'privacy' : authentication and confidentiality
18454
18455 Earlier, the HBase reference guide erroneously said in some places to set the value to 'auth-conf'. This patch fixes the guide and adds temporary support for the erroneously recommended values.
18456
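The three values correspond to the standard SASL quality-of-protection tokens `auth`, `auth-int`, and `auth-conf`. A minimal sketch of a parser that accepts both the documented names and the raw SASL tokens might look like this. The enum `RpcProtection` and its `parse` method are hypothetical names for illustration, and accepting the raw tokens is only an assumption about what the temporary support for the erroneous values amounts to.

```java
import java.util.Locale;

// Hypothetical enum, not HBase's own class: maps hbase.rpc.protection values to SASL QOP tokens.
enum RpcProtection {
    AUTHENTICATION("auth"),
    INTEGRITY("auth-int"),
    PRIVACY("auth-conf");

    final String saslQop;

    RpcProtection(String saslQop) {
        this.saslQop = saslQop;
    }

    /** Accepts the documented name (e.g. "privacy") or, as an assumption, the raw SASL token. */
    static RpcProtection parse(String value) {
        String v = value.trim().toLowerCase(Locale.ROOT);
        for (RpcProtection p : values()) {
            if (p.name().toLowerCase(Locale.ROOT).equals(v) || p.saslQop.equals(v)) {
                return p;
            }
        }
        throw new IllegalArgumentException("Unknown hbase.rpc.protection value: " + value);
    }
}
```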
18457
18458 ---
18459
18460 * [HBASE-14306](https://issues.apache.org/jira/browse/HBASE-14306) | *Major* | **Refine RegionGroupingProvider: fix issues and make it more scalable**
18461
18462 In HBASE-14306 we've changed the default strategy of RegionGroupingProvider from "identity" to "bounded", so users who still want one WAL per region must explicitly set "hbase.wal.regiongrouping.strategy" to "identity".
18463
18464 Please also note that in the new framework there will be one WAL per group, and the region-group mapping is decided by RegionGroupingStrategy. Accordingly, we've removed BoundedRegionGroupingProvider and added BoundedRegionGroupingStrategy as a replacement. If you already have a customized class for hbase.wal.regiongrouping.strategy, please check the new logic and make updates if necessary.
18465
18466
18467 ---
18468
18469 * [HBASE-6617](https://issues.apache.org/jira/browse/HBASE-6617) | *Major* | **ReplicationSourceManager should be able to track multiple WAL paths**
18470
18471 ReplicationSourceManager can now track multiple WAL paths. Note that although most changes are internal and all metrics names remain the same, the signatures of the following methods in MetricsSource have changed:
18472
18473 1. refreshAgeOfLastShippedOp now requires a String parameter which indicates the wal group id of the reporter
18474 2. setAgeOfLastShippedOp also adds a String parameter for wal group id
18475
18476
18477 ---
18478
18479 * [HBASE-14314](https://issues.apache.org/jira/browse/HBASE-14314) | *Major* | **Metrics for block cache should take region replicas into account**
18480
18481 The following metrics for the primary region replica are added:
18482
18483  - blockCacheHitCountPrimary
18484  - blockCacheMissCountPrimary
18485  - blockCacheEvictionCountPrimary
18486
18487
18488 ---
18489
18490 * [HBASE-14317](https://issues.apache.org/jira/browse/HBASE-14317) | *Blocker* | **Stuck FSHLog: bad disk (HDFS-8960) and can't roll WAL**
18491
18492 Tightens up WAL-use semantics.
18493
18494 1. If an append or a sync throws an exception, all subsequent attempts at using the log will also throw this same exception. The WAL is now a lame duck until you roll it.
18495 2. If an append succeeds and we then fail to sync it, this is a fatal exception. The container must abort to replay the WAL logs even though we have told the client that the appends failed.
18496
18497 The above rules have been applied laxly up to now; it used to be possible to get a good sync to go in over the top of a failed append. This has been fixed in this patch.
18498
18499 Also fixed a hang in the WAL subsystem if a request to pause the write pipeline took on a failed sync before the roll request's sync got scheduled.
18500
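The first rule (a failed log keeps rethrowing the same exception until it is rolled) can be modelled with a small state machine. This is a simplified sketch under stated assumptions, not the FSHLog implementation: `LameDuckLog` and its methods are hypothetical, and the `fail` flag merely simulates an underlying write error.

```java
import java.io.IOException;

// Hypothetical model of the "lame duck" rule for a write-ahead log.
final class LameDuckLog {
    private IOException failure; // first failure seen since the last roll, or null

    /** Appends an edit; pass fail=true to simulate an underlying write error. */
    void append(boolean fail) throws IOException {
        if (failure != null) {
            throw failure; // every later use rethrows the same exception
        }
        if (fail) {
            failure = new IOException("simulated append failure");
            throw failure;
        }
    }

    /** Rolling the log replaces the broken writer and clears the failure. */
    void roll() {
        failure = null;
    }

    boolean isLameDuck() {
        return failure != null;
    }
}
```

The second rule (a successful append followed by a failed sync aborts the server) is deliberately left out of this sketch.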
18501
18502 TODO: Revisit our WAL system. HBASE-12751 helps rationalize our write pipeline. In particular, it manages sequenceid inside mvcc, which should make it so we can purge the mechanism that writes empty, unflushed appends just to get the next sequenceid... problematic when the WAL goes lame-duck. Let's get it in.
18503 TODO: A successful append followed by a failed sync probably only needs us to replace the WAL (if we have signalled the client that the appends failed). The bummer is that, when replicating, these last appends might make it to the sink cluster or get replayed during recovery. HBase should keep its own WAL length? Or sequenceid of last successful sync should be passed when doing recovery and replication?
18504
18505
18506 ---
18507
18508 * [HBASE-14261](https://issues.apache.org/jira/browse/HBASE-14261) | *Major* | **Enhance Chaos Monkey framework by adding zookeeper and datanode fault injections.**
18509
18510 This change augments the existing chaos monkey framework with actions for restarting the underlying ZooKeeper quorum and HDFS nodes of a distributed HBase cluster. One assumption made while creating the ZK actions is that the ZooKeeper ensemble is an independent external service and is not managed by the HBase cluster. For these actions to work as expected, the following parameters need to be configured appropriately.
18511
18512 {code}
18513 \<property\>
18514   \<name\>hbase.it.clustermanager.hadoop.home\</name\>
18515   \<value\>$HADOOP\_HOME\</value\>
18516 \</property\>
18517 \<property\>
18518   \<name\>hbase.it.clustermanager.zookeeper.home\</name\>
18519   \<value\>$ZOOKEEPER\_HOME\</value\>
18520 \</property\>
18521 \<property\>
18522   \<name\>hbase.it.clustermanager.hbase.user\</name\>
18523   \<value\>hbase\</value\>
18524 \</property\>
18525 \<property\>
18526   \<name\>hbase.it.clustermanager.hadoop.hdfs.user\</name\>
18527   \<value\>hdfs\</value\>
18528 \</property\>
18529 \<property\>
18530   \<name\>hbase.it.clustermanager.zookeeper.user\</name\>
18531   \<value\>zookeeper\</value\>
18532 \</property\>
18533 {code}
18534
18535 The service-user configurations are newly introduced because in prod/test environments each service is managed by a different user. Once the above parameters are configured properly, you can start using the actions as needed. An example invocation of these new actions is:
18536
18537 {{./hbase org.apache.hadoop.hbase.IntegrationTestAcidGuarantees -m serverAndDependenciesKilling}}
18538
18539
18540 ---
18541
18542 * [HBASE-14309](https://issues.apache.org/jira/browse/HBASE-14309) | *Major* | **Allow load balancer to operate when there is region in transition by adding force flag**
18543
18544 This issue adds a boolean parameter, force, to the 'balancer' command so that an admin can force region balancing even when there is a region (other than hbase:meta) in transition, on the assumption that the RIT is transient.
18545 If hbase:meta is in transition, the balancer command returns false.
18546
18547 WARNING: For experts only. Forcing a balance may do more damage than repair when assignment is confused.
18548 Note: enclose the force parameter in double quotes.
18549
18550
18551 ---
18552
18553 * [HBASE-14313](https://issues.apache.org/jira/browse/HBASE-14313) | *Critical* | **After a Connection sees ConnectionClosingException it never recovers**
18554
18555 HConnection could get stuck when talking to a host that went down and then returned. This has been fixed by closing the connection in all paths.
18556
18557
18558 ---
18559
18560 * [HBASE-13339](https://issues.apache.org/jira/browse/HBASE-13339) | *Blocker* | **Update default Hadoop version to latest for master**
18561
18562 Master/2.0.0 now builds on the latest stable hadoop by default.
18563
18564
18565 ---
18566
18567 * [HBASE-14224](https://issues.apache.org/jira/browse/HBASE-14224) | *Critical* | **Fix coprocessor handling of duplicate classes**
18568
18569 Prevent Coprocessors being doubly-loaded; a particular coprocessor can only be loaded once.
18570
18571
18572 ---
18573
18574 * [HBASE-13127](https://issues.apache.org/jira/browse/HBASE-13127) | *Major* | **Add timeouts on all tests so less zombie sightings**
18575
18576 Use the junit facility to impose a timeout on tests. Use the test category to choose which timeout to apply: small tests time out after 30 seconds, medium tests after 180 seconds, and large tests after ten minutes.
18577
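The category-to-timeout mapping stated in this note can be written down as a small lookup. This is only a sketch of the numbers in the note: the enum `TestSizeTimeout` is a hypothetical name and is not the class HBase uses to pick the timeout.

```java
import java.util.concurrent.TimeUnit;

// Hypothetical enum for illustration: one entry per test category and its timeout.
enum TestSizeTimeout {
    SMALL(30, TimeUnit.SECONDS),
    MEDIUM(180, TimeUnit.SECONDS),
    LARGE(10, TimeUnit.MINUTES);

    private final long amount;
    private final TimeUnit unit;

    TestSizeTimeout(long amount, TimeUnit unit) {
        this.amount = amount;
        this.unit = unit;
    }

    /** Timeout for this category expressed in milliseconds. */
    long toMillis() {
        return unit.toMillis(amount);
    }
}
```

A test runner could call `TestSizeTimeout.MEDIUM.toMillis()` to obtain the limit for a medium test.
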
18578 Updated the junit version from 4.11 to 4.12; 4.12 supports the feature used here.
18579
18580 Add this at the head of your junit4 class to add a category-based timeout:
18581
18582 {code}
18583 @Rule public final TestRule timeout =   CategoryBasedTimeout.builder().withTimeout(this.getClass()).
18584       withLookingForStuckThread(true).build();
18585 {code}
18586
18587 For example:
18588
18589
18590 ---
18591
18592 * [HBASE-14148](https://issues.apache.org/jira/browse/HBASE-14148) | *Major* | **Web UI Framable Page**
18593
18594 Security fix: Adds protection from clickjacking using X-Frame-Options header.
18595 This will prevent use of HBase UI in frames. To disable this feature, set the configuration 'hbase.http.filter.xframeoptions.mode' to 'ALLOW' (default is 'DENY').
18596
18597
18598 ---
18599
18600 * [HBASE-10844](https://issues.apache.org/jira/browse/HBASE-10844) | *Major* | **Coprocessor failure during batchmutation leaves the memstore datastructs in an inconsistent state**
18601
18602 Promotes an -ea assert to logged FATAL and RS abort when memstore is found to be in an inconsistent state.
18603
18604
18605 ---
18606
18607 * [HBASE-13966](https://issues.apache.org/jira/browse/HBASE-13966) | *Minor* | **Limit column width in table.jsp**
18608
18609 Wraps region, start key, end key columns if too long.
18610
18611
18612 ---
18613
18614 * [HBASE-13706](https://issues.apache.org/jira/browse/HBASE-13706) | *Minor* | **CoprocessorClassLoader should not exempt Hive classes**
18615
18616 Starting from HBase 2.0, CoprocessorClassLoader will not exempt hadoop classes or zookeeper classes.  This means that if the custom coprocessor jar contains hadoop or zookeeper packages and classes, they will be loaded by the CoprocessorClassLoader.  Only hbase packages and classes  are exempted from the CoprocessorClassLoader. They (and their dependencies) are loaded by the parent server class loader.
18617
18618
18619 ---
18620
18621 * [HBASE-14054](https://issues.apache.org/jira/browse/HBASE-14054) | *Major* | **Acknowledged writes may get lost if regionserver clock is set backwards**
18622
18623 In the {{checkAndPut}} write path, use max(max timestamp for the row, System.currentTimeMillis()) instead of blindly taking System.currentTimeMillis(), to ensure that checkAndPut() cannot do writes which are already eclipsed. This is similar to what has been done in HBASE-12449 for increment and append.
18624
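The timestamp choice described above reduces to a single `max`. The sketch below uses hypothetical names (`CheckAndPutTimestamp`, `choose`) and takes the clock reading as a parameter so the behaviour with a backwards-set clock is easy to see; it is not the region server code.

```java
// Hypothetical helper illustrating the timestamp rule for the checkAndPut write path.
final class CheckAndPutTimestamp {
    private CheckAndPutTimestamp() {}

    /**
     * @param maxRowTimestamp newest timestamp already stored for the row
     * @param nowMillis       current wall-clock reading, e.g. System.currentTimeMillis()
     * @return a timestamp that is never older than the data already in the row
     */
    static long choose(long maxRowTimestamp, long nowMillis) {
        return Math.max(maxRowTimestamp, nowMillis);
    }
}
```

If the clock has been set backwards so that `nowMillis` is smaller than `maxRowTimestamp`, the write keeps the row's newest timestamp rather than being eclipsed by older data.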
18625
18626 ---
18627
18628 * [HBASE-13985](https://issues.apache.org/jira/browse/HBASE-13985) | *Minor* | **Add configuration to skip validating HFile format when bulk loading**
18629
18630 A new config, hbase.loadincremental.validate.hfile, is introduced; it defaults to true.
18631 When set to false, checking of the HFile format is skipped during bulk loading.
18632
18633
18634 ---
18635
18636 * [HBASE-14201](https://issues.apache.org/jira/browse/HBASE-14201) | *Major* | **hbck should not take a lock unless fixing errors**
18637
18638 HBCK no longer takes a lock until there are changes to the cluster being made.
18639
18640 The old behavior can be achieved by passing the -exclusive flag.
18641
18642
18643 ---
18644
18645 * [HBASE-14081](https://issues.apache.org/jira/browse/HBASE-14081) | *Minor* | **(outdated) references to SVN/trunk in documentation**
18646
18647 HBASE-14081 Remove (outdated) references to SVN/trunk from documentation
18648
18649
18650 ---
18651
18652 * [HBASE-13865](https://issues.apache.org/jira/browse/HBASE-13865) | *Trivial* | **Increase the default value for hbase.hregion.memstore.block.multipler from 2 to 4 (part 2)**
18653
18654 Increase default hbase.hregion.memstore.block.multiplier from 2 to 4 in the code to match the default value in the config files.
18655
18656
18657 ---
18658
18659 * [HBASE-12295](https://issues.apache.org/jira/browse/HBASE-12295) | *Major* | **Prevent block eviction under us if reads are in progress from the BBs**
18660
18661 We try to delay the eviction of a block until the cellblocks are formed at the RPC layer. A simple reference-counting mechanism is applied whenever a block is accessed from the bucket cache. Once a scanner finishes using a block, the reference count is decremented. The block is evicted only when its reference count is 0.
18662 We also introduce a concept of ShareableMemory based on the type of blocks we create from the block cache. The blocks from the ByteBufferIOEngine directly refer to the buckets in offheap memory, and such blocks are marked with the SHARED memory type. The blocks from LRU, HDFS and the file mode of the bucket cache are all marked EXCLUSIVE because these blocks have their own exclusive memory.
18663 For the coprocessor (CP) case, any cell coming out of a SHARED memory block is copied before the results are returned; CPs may keep the results as their own state, and the copy ensures that a later eviction cannot corrupt them.
18664
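The reference-counting idea can be sketched in a few lines. This is a minimal illustration, not the BucketCache code: `RefCountedBlock`, `retain`, `release`, and `isEvictable` are hypothetical names.

```java
import java.util.concurrent.atomic.AtomicInteger;

// Hypothetical model: a cached block that may only be evicted when no reader holds it.
final class RefCountedBlock {
    private final AtomicInteger refCount = new AtomicInteger(0);

    /** Called when a scanner starts reading from the block. */
    void retain() {
        refCount.incrementAndGet();
    }

    /** Called when a scanner is done with the block. */
    void release() {
        // Decrement only if the count is positive, so it can never go below zero.
        int previous = refCount.getAndUpdate(c -> c > 0 ? c - 1 : c);
        if (previous == 0) {
            throw new IllegalStateException("release() without matching retain()");
        }
    }

    /** Eviction is allowed only when no reader references the block. */
    boolean isEvictable() {
        return refCount.get() == 0;
    }
}
```

An eviction thread in this model would skip any block for which `isEvictable()` returns false and revisit it later.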
18665
18666 ---
18667
18668 * [HBASE-11339](https://issues.apache.org/jira/browse/HBASE-11339) | *Major* | **HBase MOB**
18669
18670 The Moderate Object Storage (MOB) feature (HBASE-11339) is a modified I/O and compaction path that allows individual moderately sized values (100KB-10MB) to be stored in a way that reduces write amplification compared to the normal I/O path. MOB is defined per column family and is almost completely isolated from other components; the features and performance of normal columns are not affected.
18671
18672 For more details on how to use the feature, please consult the HBase Reference Guide.
18673
18674
18675 ---
18676
18677 * [HBASE-13954](https://issues.apache.org/jira/browse/HBASE-13954) | *Major* | **Remove HTableInterface#getRowOrBefore related server side code**
18678
18679 Removed the Table#getRowOrBefore, Region#getClosestRowBefore, Store#getRowKeyAtOrBefore and RemoteHTable#getRowOrBefore APIs, along with Thrift support for getRowOrBefore.
18680 Also removed the two coprocessor hooks preGetClosestRowBefore and postGetClosestRowBefore.
18681 Users of this API can use a reverse scan instead, something like the following:
18682 {code}
18683   Scan scan = new Scan(row);
18684   scan.setSmall(true);
18685   scan.setCaching(1);
18686   scan.setReversed(true);
18687   scan.addFamily(family);
18688 {code}
18689 Pass this Scan object to the scanner and retrieve the first Result from the scanner output.
18690
18691
18692 ---
18693
18694 * [HBASE-12296](https://issues.apache.org/jira/browse/HBASE-12296) | *Major* | **Filters should work with ByteBufferedCell**
18695
18696 Change to support offheaping.
18697
18698 Incompatible change for the filters ColumnPrefixFilter and MultipleColumnPrefixFilter.
18699
18700 Changes the parameters of filterColumn so that it takes a Cell rather than a byte[].
18701
18702 hbase-client-1.2.7-SNAPSHOT.jar, ColumnPrefixFilter.class
18703 package org.apache.hadoop.hbase.filter
18704 ColumnPrefixFilter.filterColumn ( byte[ ] buffer, int qualifierOffset, int qualifierLength )  :  Filter.ReturnCode
18705 org/apache/hadoop/hbase/filter/ColumnPrefixFilter.filterColumn:([BII)Lorg/apache/hadoop/hbase/filter/Filter$ReturnCode;
18706
18707 Ditto for filterColumnValue in SingleColumnValueFilter: it now takes a Cell instead of a byte array.
18708
18709
18710 ---
18711
18712 * [HBASE-14045](https://issues.apache.org/jira/browse/HBASE-14045) | *Major* | **Bumping thrift version to 0.9.2.**
18713
18714 This change upgrades the Thrift dependency of HBase to 0.9.2. Though this doesn't break any HBase compatibility promises, it might impact downstream projects that share the Thrift dependency with HBase.
18715
18716
18717 ---
18718
18719 * [HBASE-14027](https://issues.apache.org/jira/browse/HBASE-14027) | *Major* | **Clean up netty dependencies**
18720
18721 HBase's convenience binary artifact no longer contains the netty 3.2.4 jar. This jar was not directly used by HBase, but may have been relied on by downstream applications.
18722
18723
18724 ---
18725
18726 * [HBASE-7782](https://issues.apache.org/jira/browse/HBASE-7782) | *Minor* | **HBaseTestingUtility.truncateTable() not acting like CLI**
18727
18728 HBaseTestingUtility now uses the truncate API added in HBASE-8332 so that calls to HBTU.truncateTable will behave like the shell command: effectively dropping the table and recreating a new one with the same split points.
18729
18730 Previously, HBTU.truncateTable instead issued deletes for all the data already in the table. If you wish to maintain the same behavior, you should use the newly added HBTU.deleteTableData method.
18731
18732
18733 ---
18734
18735 * [HBASE-14047](https://issues.apache.org/jira/browse/HBASE-14047) | *Major* | **Cleanup deprecated APIs from Cell class**
18736
18737 The following APIs from Cell (which have been deprecated for the past few major versions) are now removed:
18738 getRow
18739 getFamily
18740 getQualifier
18741 getValue
18742 getMvccVersion
18743 The above APIs can be replaced with their respective CellUtil#cloneXXX (allocates a copy) or Cell#getXXXArray (essentially just returns a pointer) methods, depending on the use case.
18744
18745
18746 ---
18747
18748 * [HBASE-14029](https://issues.apache.org/jira/browse/HBASE-14029) | *Major* | **getting started for standalone still references hadoop-version-specific binary artifacts**
18749
18750 HBASE-14029 Correct documentation for Hadoop version specific artifacts
18751
18752
18753 ---
18754
18755 * [HBASE-13849](https://issues.apache.org/jira/browse/HBASE-13849) | *Major* | **Remove restore and clone snapshot from the WebUI**
18756
18757 The HBase master status web page no longer allows operators to clone or restore snapshots.
18758
18759
18760 ---
18761
18762 * [HBASE-13646](https://issues.apache.org/jira/browse/HBASE-13646) | *Major* | **HRegion#execService should not try to build incomplete messages**
18763
18764 When RegionServerCoprocessors throw an exception we will no longer attempt to build an incomplete RPC response message. Instead, the response message will be null.
18765
18766
18767 ---
18768
18769 * [HBASE-13639](https://issues.apache.org/jira/browse/HBASE-13639) | *Major* | **SyncTable - rsync for HBase tables**
18770
18771 A tool to sync two tables that tries to send only the differences, like rsync.
18772
18773 Adds two new MapReduce jobs, SyncTable and HashTable. See the usage output of these jobs for how to use them. See the design doc for a general overview: https://docs.google.com/document/d/1-2c9kJEWNrXf5V4q\_wBcoIXfdchN7Pxvxv1IO6PW0-U/edit
18774
18775 From the comments on the JIRA: "It can be challenging to run against a table getting live writes, if those writes are updates/overwrites. In general, you can run it against a time range to ignore new writes, but if those writes update existing cells, then the time range scan may or may not see older versions of those cells depending on whether major compaction has happened, which may be different in remote clusters."
18776
18777
18778 ---
18779
18780 * [HBASE-13895](https://issues.apache.org/jira/browse/HBASE-13895) | *Critical* | **DATALOSS: Region assigned before WAL replay when abort**
18781
18782 If the master went to assign a region concurrent with a RegionServer abort, the returned RegionServerAbortedException was being handled as though the region had been cleanly offlined, so the assign was allowed to proceed. If the region was opened in its new location before WAL replay completed, the replayed edits were, worst case, ignored, or were later played over the top of edits that had come in since the open and so were susceptible to overwrite. In either case, DATALOSS.
18783
18784
18785 ---
18786
18787 * [HBASE-13983](https://issues.apache.org/jira/browse/HBASE-13983) | *Minor* | **Doc how the oddball HTable methods getStartKey, getEndKey, etc. will be removed in 2.0.0**
18788
18789 Adds extra doc on getStartKeys, getEndKeys, and getStartEndKeys in HTable explaining that they will be removed in 2.0.0 (these methods did not get the proper full major version deprecation cycle).
18790
18791 In this issue, we actually also remove these methods in master/2.0.0 branch.
18792
18793
18794 ---
18795
18796 * [HBASE-13747](https://issues.apache.org/jira/browse/HBASE-13747) | *Critical* | **Promote Java 8 to "yes" in support matrix**
18797
18798 Java 8 is considered supported and tested as of HBase 1.2+
18799
18800
18801 ---
18802
18803 * [HBASE-13959](https://issues.apache.org/jira/browse/HBASE-13959) | *Critical* | **Region splitting uses a single thread in most common cases**
18804
18805 The performance of region splitting has been improved by using a thread pool to split the store files concurrently. Prior to this change, the store files were always split sequentially in a single thread, so splitting a region with multiple store files could take several seconds. The thread pool is sized dynamically with the aim of getting maximum concurrency without exceeding the number of cores available to the HBase Java process. A lower limit for the thread pool can be explicitly set using the property hbase.regionserver.region.split.threads.max.
18806
18807
18808 ---
18809
18810 * [HBASE-13930](https://issues.apache.org/jira/browse/HBASE-13930) | *Major* | **Exclude Findbugs packages from shaded jars**
18811
18812 Exclude Findbugs packages from shaded jars
18813
18814
18815 ---
18816
18817 * [HBASE-13214](https://issues.apache.org/jira/browse/HBASE-13214) | *Major* | **Remove deprecated and unused methods from HTable class**
18818
18819 **WARNING: No release note provided for this change.**
18820
18821
18822 ---
18823
18824 * [HBASE-13869](https://issues.apache.org/jira/browse/HBASE-13869) | *Trivial* | **Fix typo in HBase book**
18825
18826 Fix typo in HBase book
18827
18828
18829 ---
18830
18831 * [HBASE-13938](https://issues.apache.org/jira/browse/HBASE-13938) | *Major* | **Deletes done during the region merge transaction may get eclipsed**
18832
18833 Use the master's timestamp when sending hbase:meta edits on region merge to ensure proper ordering of new region addition and old region deletes.
18834
18835
18836 ---
18837
18838 * [HBASE-13898](https://issues.apache.org/jira/browse/HBASE-13898) | *Minor* | **correct additional javadoc failures under java 8**
18839
18840 Correct Javadoc generation errors
18841
18842
18843 ---
18844
18845 * [HBASE-13103](https://issues.apache.org/jira/browse/HBASE-13103) | *Major* | **[ergonomics] add region size balancing as a feature of master**
18846
18847 This patch adds optional ability for HMaster to normalize regions in size (disabled by default, change hbase.normalizer.enabled property to true to turn it on). If enabled, HMaster periodically (every 30 minutes by default) monitors tables for which normalization is enabled in table configuration and performs splits/merges as seems appropriate. Users may implement their own normalization strategies by implementing RegionNormalizer interface and configuring it in hbase-site.xml.
18848
18849
18850 ---
18851
18852 * [HBASE-13900](https://issues.apache.org/jira/browse/HBASE-13900) | *Minor* | **duplicate methods between ProtobufMagic and ProtobufUtil**
18853
18854 Use ProtobufMagic methods in ProtobufUtil
18855
18856
18857 ---
18858
18859 * [HBASE-13843](https://issues.apache.org/jira/browse/HBASE-13843) | *Trivial* | **Fix internal constant text in ReplicationManager.java**
18860
18861 In previous versions of HBase, the ReplicationAdmin utility erroneously used the string key "columnFamlyName" when listing replicated column families. It now uses the corrected spelling of "columnFamilyName" (note the added "i").
18862
18863 Downstream code that parsed the replication entries returned from listReplicated will need to be updated to use the new key. Previously compiled code that relied on the static CFNAME member of ReplicationAdmin will need to be recompiled in order to see the updated value.
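
Downstream parsers that have to work against both older and newer servers could look up the corrected key first and fall back to the misspelled one. The following is a minimal sketch, assuming each replication entry is available as a `Map<String, String>`; the class and method names are hypothetical and not part of the HBase API.

```java
import java.util.Collections;
import java.util.Map;
import java.util.Optional;

/** Hypothetical helper for downstream code; not part of the HBase API. */
final class ReplicatedEntryKeys {
    /** Corrected key, used once HBASE-13843 is applied. */
    static final String CF_KEY = "columnFamilyName";
    /** Misspelled key used by earlier releases. */
    static final String LEGACY_CF_KEY = "columnFamlyName";

    /** Returns the column family name, preferring the corrected key. */
    static Optional<String> columnFamily(Map<String, String> entry) {
        String value = entry.get(CF_KEY);
        if (value == null) {
            value = entry.get(LEGACY_CF_KEY); // fall back for older servers
        }
        return Optional.ofNullable(value);
    }

    public static void main(String[] args) {
        Map<String, String> oldStyle = Collections.singletonMap(LEGACY_CF_KEY, "cf1");
        Map<String, String> newStyle = Collections.singletonMap(CF_KEY, "cf1");
        System.out.println(columnFamily(oldStyle).orElse("(none)"));
        System.out.println(columnFamily(newStyle).orElse("(none)"));
    }
}
```

Code that only ever talks to upgraded clusters can drop the fallback and read the corrected key directly.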
18864
18865
18866 ---
18867
18868 * [HBASE-13886](https://issues.apache.org/jira/browse/HBASE-13886) | *Major* | **Return empty value when the mob file is corrupt instead of throwing exceptions**
18869
By default, a Get or Scan throws an exception when it cannot find a mob cell because the mob file is missing or corrupt. This change adds the option to continue the Get or Scan and return the other cells, with the mob cell value left empty. To get this behaviour, set the attribute MobConstants.EMPTY\_VALUE\_ON\_MOBCELL\_MISS to true on the Scan or Get.
18871
18872
18873 ---
18874
18875 * [HBASE-13686](https://issues.apache.org/jira/browse/HBASE-13686) | *Major* | **Fail to limit rate in RateLimiter**
18876
Two kinds of RateLimiter are now supported:

1) org.apache.hadoop.hbase.quotas.AverageIntervalRateLimiter: This limiter refills resources at every TimeUnit/resources interval.
Example: For a limiter configured with 10 resources/second, 1 resource is refilled every 100ms.

2) org.apache.hadoop.hbase.quotas.FixedIntervalRateLimiter: This limiter refills resources only after a given fixed interval of time.

Either rate limiter can be configured for the cluster by setting the property "hbase.quota.rate.limiter" in hbase-site.xml. org.apache.hadoop.hbase.quotas.AverageIntervalRateLimiter is the default value.
Note: The cluster must be restarted for the configuration to take effect.
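
The difference between the two refill policies can be illustrated with a little arithmetic. This is only a sketch of the behaviour described above, not the code of the HBase limiter classes; the class and method names are made up for the illustration.

```java
/** Illustrative refill arithmetic only; not the HBase RateLimiter classes. */
final class RefillSketch {
    /** Average-interval style: resources come back in proportion to elapsed time. */
    static long averageIntervalRefill(long limit, long unitMillis, long elapsedMillis) {
        long refill = elapsedMillis * limit / unitMillis;
        return Math.min(limit, refill); // never refill beyond the configured limit
    }

    /** Fixed-interval style: nothing comes back until a full interval has passed. */
    static long fixedIntervalRefill(long limit, long unitMillis, long elapsedMillis) {
        return elapsedMillis >= unitMillis ? limit : 0;
    }

    public static void main(String[] args) {
        long limit = 10;        // 10 resources ...
        long unitMillis = 1000; // ... per second
        for (long elapsed : new long[] {100, 550, 1000}) {
            System.out.println(elapsed + "ms: average="
                + averageIntervalRefill(limit, unitMillis, elapsed)
                + " fixed=" + fixedIntervalRefill(limit, unitMillis, elapsed));
        }
    }
}
```

With 10 resources per second, the average-interval arithmetic gives back 1 resource after 100ms, while the fixed-interval arithmetic gives back nothing until the full second has elapsed.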
18885
18886
18887 ---
18888
18889 * [HBASE-13816](https://issues.apache.org/jira/browse/HBASE-13816) | *Major* | **Build shaded modules only in release profile**
18890
18891 hbase-shaded-client and hbase-shaded-server modules will not build the actual jars unless -Prelease is supplied in mvn.
18892
18893
18894 ---
18895
18896 * [HBASE-13754](https://issues.apache.org/jira/browse/HBASE-13754) | *Major* | **Allow non KeyValue Cell types also to oswrite**
18897
18898 This jira has removed the already deprecated method
18899 KeyValue#oswrite(final KeyValue kv, final OutputStream out)
18900
18901
18902 ---
18903
18904 * [HBASE-13375](https://issues.apache.org/jira/browse/HBASE-13375) | *Major* | **Provide HBase superuser higher priority over other users in the RPC handling**
18905
18906 This JIRA modifies the signature of PriorityFunction#getPriority() method to also take request user as a parameter; all RPC requests sent by super users (as determined by cluster configuration) are executed with Admin QoS.
18907
18908
18909 ---
18910
18911 * [HBASE-5980](https://issues.apache.org/jira/browse/HBASE-5980) | *Minor* | **Scanner responses from RS should include metrics on rows/KVs filtered**
18912
Adds scan metrics to the result. In the shell, set the ALL\_METRICS attribute to true on your scan to see a dump of the metrics after the results (see the scan help for examples).
18914
18915 If you would prefer to see only a subset of the metrics, the METRICS array can be defined to include the names of only the metrics you care about.
18916
18917
18918 ---
18919
18920 * [HBASE-13698](https://issues.apache.org/jira/browse/HBASE-13698) | *Major* | **Add RegionLocator methods to Thrift2 proxy.**
18921
18922 Added getRegionLocation and getAllRegionLocations to the thrift2 interface.
18923
18924
18925 ---
18926
18927 * [HBASE-13636](https://issues.apache.org/jira/browse/HBASE-13636) | *Major* | **Remove deprecation for HBASE-4072 (Reading of zoo.cfg)**
18928
Purge support for parsing ZooKeeper's zoo.cfg, which has been deprecated since hbase-0.96.0.
18930
18931
18932 ---
18933
18934 * [HBASE-13071](https://issues.apache.org/jira/browse/HBASE-13071) | *Major* | **Hbase Streaming Scan Feature**
18935
18936 MOTIVATION
18937
18938 A pipelined scan API is introduced for speeding up applications that combine massive data traversal with compute-intensive processing. Traditional HBase scans save network trips through prefetching the data to the client side cache. However, they prefetch synchronously: the fetch request to regionserver is invoked only when the entire cache is consumed. This leads to a stop-and-wait access pattern, in which the client stalls until the next chunk of data is fetched. Applications that do significant processing can benefit from background data prefetching, which eliminates this bottleneck. The pipelined scan implementation overlaps the cache population at the client side with application processing. Namely, it issues a new scan RPC when the iteration retrieves 50% of the cache. If the application processing (that is, the time between invocations of next()) is substantial, the new chunk of data will be available before the previous one is exhausted, and the client will not experience any delay. Ideally, the prefetch and the processing times should be balanced.
18939
18940 API AND CONFIGURATION
18941
18942 Asynchronous scanning can be configured either globally for all tables and scans, or on per-scan basis via a new Scan class API.
18943
18944 Configuration in hbase-site.xml: hbase.client.scanner.async.prefetch, default false:
18945
18946  \<property\>
18947    \<name\>hbase.client.scanner.async.prefetch\</name\>
18948    \<value\>true\</value\>
18949  \</property\>
18950
18951 API - Scan#setAsyncPrefetch(boolean)
18952
18953       Scan scan = new Scan();
18954       scan.setCaching(1000);
18955       scan.setMaxResultSize(BIG\_SIZE);
18956       scan.setAsyncPrefetch(true);
18957         ...
18958       ResultScanner scanner = table.getScanner(scan);
18959
18960 IMPLEMENTATION NOTES
18961
18962 Pipelined scan is implemented by a new ClientAsyncPrefetchScanner class, which is fully API-compatible with the synchronous ClientSimpleScanner. ClientAsyncPrefetchScanner is not instantiated in case of small (Scan#setSmall) and reversed (Scan#setReversed) scanners. The application is responsible for setting the prefetch size in a way that the prefetch time and the processing times are balanced. Note that due to double buffering, the client side cache can use twice as much memory as the synchronous scanner.
18963
18964 Generally, this feature will put more load on the server (higher fetch rate -- which is the whole point).  Also, YMMV.
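
The overlap between fetching and processing can be sketched without any HBase classes. The iterator below is an illustration of the idea only (it is not ClientAsyncPrefetchScanner): it assumes a hypothetical `fetchChunk` supplier that returns the next batch of items, or an empty list when the source is exhausted, and it starts the next background fetch once half of the last chunk has been consumed.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Iterator;
import java.util.List;
import java.util.NoSuchElementException;
import java.util.concurrent.CompletableFuture;
import java.util.function.Supplier;

/** Illustrative pipelined iterator; not the HBase ClientAsyncPrefetchScanner. */
final class PrefetchingIterator<T> implements Iterator<T> {
    private final Supplier<List<T>> fetchChunk;  // returns an empty list when exhausted
    private final ArrayDeque<T> cache = new ArrayDeque<>();
    private CompletableFuture<List<T>> inFlight; // background fetch, if any
    private int lastChunkSize;
    private boolean exhausted;

    PrefetchingIterator(Supplier<List<T>> fetchChunk) {
        this.fetchChunk = fetchChunk;
        this.inFlight = CompletableFuture.supplyAsync(fetchChunk);
    }

    @Override
    public boolean hasNext() {
        if (cache.isEmpty() && !exhausted) {
            drainInFlight(); // blocks only if the prefetch has not finished yet
        }
        return !cache.isEmpty();
    }

    @Override
    public T next() {
        if (!hasNext()) {
            throw new NoSuchElementException();
        }
        T item = cache.poll();
        // Start the next fetch once half of the last chunk has been consumed.
        if (inFlight == null && !exhausted && cache.size() <= lastChunkSize / 2) {
            inFlight = CompletableFuture.supplyAsync(fetchChunk);
        }
        return item;
    }

    private void drainInFlight() {
        if (inFlight == null) {
            inFlight = CompletableFuture.supplyAsync(fetchChunk);
        }
        List<T> chunk = inFlight.join();
        inFlight = null;
        if (chunk.isEmpty()) {
            exhausted = true;
        } else {
            lastChunkSize = chunk.size();
            cache.addAll(chunk);
        }
    }

    public static void main(String[] args) {
        int[] next = {0}; // stand-in for a remote scanner position
        Supplier<List<Integer>> fetch = () -> {
            List<Integer> chunk = new ArrayList<>();
            while (chunk.size() < 4 && next[0] < 10) {
                chunk.add(next[0]++);
            }
            return chunk;
        };
        PrefetchingIterator<Integer> it = new PrefetchingIterator<>(fetch);
        while (it.hasNext()) {
            System.out.println(it.next());
        }
    }
}
```

While a prefetched chunk is waiting to be drained, the sketch holds both the unread part of the current chunk and the whole next chunk, which is the double buffering cost mentioned above.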
18965
18966
18967 ---
18968
18969 * [HBASE-13533](https://issues.apache.org/jira/browse/HBASE-13533) | *Trivial* | **section on configuring ~/.m2/settings.xml has no anchor**
18970
Correct settings.xml anchor in book
18972
18973
18974 ---
18975
18976 * [HBASE-13625](https://issues.apache.org/jira/browse/HBASE-13625) | *Major* | **Use HDFS for HFileOutputFormat2 partitioner's path**
18977
18978 Introduces a new config hbase.fs.tmp.dir which is a directory in HDFS (or default file system) to use as a staging directory for HFileOutputFormat2. This is also used as the default for hbase.bulkload.staging.dir
18979
18980
18981 ---
18982
18983 * [HBASE-10800](https://issues.apache.org/jira/browse/HBASE-10800) | *Major* | **Use CellComparator instead of KVComparator**
18984
From the 2.0 branch onwards, KVComparator and its subclasses MetaComparator and RawBytesComparator are all deprecated.
All the comparators are moved to CellComparator.  MetaCellComparator, a subclass of CellComparator, will be used to compare hbase:meta cells.
The previously exposed static instances KeyValue.COMPARATOR, KeyValue.META\_COMPARATOR and KeyValue.RAW\_COMPARATOR are deprecated; use CellComparator.COMPARATOR and CellComparator.META\_COMPARATOR instead.
Also note that there will be no RawBytesComparator.  Wherever raw bytes need to be compared, use Bytes.BYTES\_RAWCOMPARATOR.
CellComparator always operates on cells and their components and does not assume that a cell is backed by a single byte[], as the KVComparators did.
18990
18991
18992 ---
18993
18994 * [HBASE-13333](https://issues.apache.org/jira/browse/HBASE-13333) | *Major* | **Renew Scanner Lease without advancing the RegionScanner**
18995
18996 Adds a renewLease call to ClientScanner
18997
18998
18999 ---
19000
19001 * [HBASE-13564](https://issues.apache.org/jira/browse/HBASE-13564) | *Major* | **Master MBeans are not published**
19002
To use the coprocessor-based JMX implementation provided by HBase for the Master, add the following property to hbase-site.xml:
19005
19006 \<property\>
19007   \<name\>hbase.coprocessor.master.classes\</name\>
19008   \<value\>org.apache.hadoop.hbase.JMXListener\</value\>
19009 \</property\>
19010
19011 NOTE: DO NOT set \`com.sun.management.jmxremote.port\` for Java VM at the same time.
19012
By default, JMX listens on TCP port 10101 for the Master. The ports can be changed with the following properties:
19014
19015 \<property\>
19016   \<name\>master.rmi.registry.port\</name\>
19017   \<value\>61110\</value\>
19018 \</property\>
19019 \<property\>
19020   \<name\>master.rmi.connector.port\</name\>
19021   \<value\>61120\</value\>
19022 \</property\>
19024
19025 The registry port can be shared with connector port in most cases, so you only need to configure master.rmi.registry.port.
19026 However if you want to use SSL communication, the 2 ports must be configured to different values.
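
A JMX client would then point at the registry port. The sketch below only builds the address; it assumes the standard JMX RMI connector URL form and a hypothetical host name (`master.example.com`), and it does not open a connection.

```java
import java.net.MalformedURLException;
import javax.management.remote.JMXServiceURL;

/** Illustrative only; the host name is hypothetical. */
final class MasterJmxUrl {
    /** Builds a client-side JMX URL that looks up the connector in the RMI registry. */
    static String clientUrl(String host, int registryPort) {
        return "service:jmx:rmi:///jndi/rmi://" + host + ":" + registryPort + "/jmxrmi";
    }

    public static void main(String[] args) throws MalformedURLException {
        // 10101 is the default Master port mentioned above.
        JMXServiceURL url = new JMXServiceURL(clientUrl("master.example.com", 10101));
        System.out.println(url);
    }
}
```

If master.rmi.registry.port has been changed, pass that value instead of 10101.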
19027
19028
19029 ---
19030
19031 * [HBASE-13537](https://issues.apache.org/jira/browse/HBASE-13537) | *Major* | **Procedure V2 - Change the admin interface for async operations to return Future (incompatible with branch-1.x)**
19032
Because the return types of the asynchronous methods in the Admin API have changed, this change breaks binary compatibility. Source compatibility is kept intact. Applications running against this change need to be recompiled to keep working.
19034
19035
19036 ---
19037
19038 * [HBASE-13517](https://issues.apache.org/jira/browse/HBASE-13517) | *Major* | **Publish a client artifact with shaded dependencies**
19039
19040 HBase now provides added convenience artifacts that shade most dependencies. These jars hbase-shaded-client and hbase-shaded-server are meant to be used when dependency conflicts can not be solved any other way. The normal jars hbase-client and hbase-server should still be preferred when possible.
19041
19042 Do not use hbase-shaded-server or hbase-shaded-client inside of a co-processor as bad things will happen.
19043
19044
19045 ---
19046
19047 * [HBASE-13149](https://issues.apache.org/jira/browse/HBASE-13149) | *Blocker* | **HBase MR is broken on Hadoop 2.5+ Yarn**
19048
In HBase 1.1.0 and above we have upgraded the version of Jackson dependencies (jackson-core-asl, jackson-mapper-asl, jackson-jaxrs and jackson-xc) from 1.8.8 to 1.9.13. This is to follow the upgrade to Jackson 1.9.13 in Hadoop 2.5 and above, which causes Jackson class incompatibility for HBase as reported in HBASE-13149.  Refer to HADOOP-10104 and YARN-2092 for additional information. Jackson 1.9.13 is not completely backward compatible with the prior version 1.8.8 used in HBase. See the Compatibility reports attached in HBASE-13149 and http://svn.codehaus.org/jackson/trunk/release-notes/VERSION for more information.

This upgrade does not have a direct impact on HBase users and HBase applications in most cases. In the rare case where your HBase application uses Jackson directly AND your application has a compatibility issue with Jackson 1.9.13, you can do the following to mitigate the problem.
19052
19053 1. If you are on Hadoop 2.5 or above, and your HBase application involves running Yarn jobs, we recommend you update your application to use Jackson 1.9.13. You may be able to explore classpath isolation options (e.g. HADOOP-10893) or have your own classpath isolation strategy that works for you, but the general recommendation is that you upgrade to Jackson 1.9.13.
19054 2. You may choose to continue using Jackson 1.8.8 and not to use Jackson 1.9.13 in your classpath.  You can also choose to replace the Jackson 1.9.13 jars in $HBASE\_HOME/lib with 1.8.8 jars.  It can work for you in the following cases:
19055 a) You are on a Hadoop version earlier than Hadoop 2.5,  or
19056 b) You are on Hadoop 2.5 or above, but your HBase application does not involve running Yarn jobs.
19057 3. You may experiment with further isolation using the shaded jars introduced with 1.1.0 via HBASE-13517.
19058
19059 Note that it may not be tested or guaranteed that using Jackson 1.8.8 in $HBASE\_HOME/lib will work in future HBase releases.
19060 It is recommended that your HBase application matches the Jackson version provided in HBase.
19061
In HBase 0.98.x and HBase 1.0.x, we have NOT upgraded the version of Jackson dependencies. If you are on Hadoop 2.5 or above, and your HBase application involves running Yarn jobs, you may encounter a Jackson class incompatibility issue, as reported in HBASE-13149.
19063
19064 You can do the following to mitigate the problem:
19065 1. Use 'hadoop jar' command to run your HBase jobs.
19066 2. Explore classpath isolation options (e.g. HADOOP-10893) or have your own classpath isolation strategy that works for you.
19067 3. You can also choose to replace the Jackson 1.8.8 jars in $HBASE\_HOME/lib with 1.9.13 jars from your Hadoop lib directory. We have tested HBase 0.98 with Jackson 1.9.13.
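
When diagnosing which Jackson jar actually wins on the classpath, it can help to print where a class was loaded from. The following is a generic JVM diagnostic sketch, not something provided by HBase; the helper name is hypothetical, and `org.codehaus.jackson.map.ObjectMapper` is used only as a representative Jackson 1.x class.

```java
import java.security.CodeSource;
import java.util.Optional;

/** Generic classpath diagnostic; not part of HBase. */
final class WhichJar {
    /** Returns the location a class was loaded from, or empty if it is not on the classpath. */
    static Optional<String> locate(String className) {
        try {
            // Load without initializing the class; only its origin is of interest.
            Class<?> c = Class.forName(className, false, WhichJar.class.getClassLoader());
            CodeSource src = c.getProtectionDomain().getCodeSource();
            return Optional.of(src == null
                ? "(bootstrap class path)"
                : String.valueOf(src.getLocation()));
        } catch (ClassNotFoundException | LinkageError e) {
            return Optional.empty();
        }
    }

    public static void main(String[] args) {
        String name = "org.codehaus.jackson.map.ObjectMapper";
        System.out.println(name + " -> " + locate(name).orElse("(not on classpath)"));
    }
}
```

Run it with the same classpath as the failing job; the jar file name in the printed location usually reveals the Jackson version in use.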
19068
19069
19070 ---
19071
19072 * [HBASE-13481](https://issues.apache.org/jira/browse/HBASE-13481) | *Major* | **Master should respect master (old) DNS/bind related configurations**
19073
Master once again honors the following configuration options, as it did in releases before 1.0.0:

- hbase.master.ipc.address
- hbase.master.dns.interface
- hbase.master.dns.nameserver
- hbase.master.info.bindAddress

This jira also adds the hbase.master.hostname parameter as an extension to HBASE-12954.
19080
19081
19082 ---
19083
19084 * [HBASE-13090](https://issues.apache.org/jira/browse/HBASE-13090) | *Major* | **Progress heartbeats for long running scanners**
19085
19086 Previously, there was no way to enforce a time limit on scan RPC requests. The server would receive a scan RPC request and take as much time as it needed to accumulate enough results to reach a limit or exhaust the region. The problem with this approach was that, in the case of a very selective scan, the processing of the scan could take too long and cause timeouts client side.
19087
With this fix, the server will now enforce a time limit on the execution of scan RPC requests. When a scan RPC request arrives at the server, a time limit is calculated to be half of whichever timeout value is more restrictive between the configurations "hbase.client.scanner.timeout.period" and "hbase.rpc.timeout". When the time limit is reached, the server will return whatever results it has accumulated up to that point. The results may be empty.

To ensure that timeout checks do not occur too often (which would hurt the performance of scans), the configuration "hbase.cells.scanned.per.heartbeat.check" has been introduced. This configuration controls how often System.currentTimeMillis() is called to update the progress towards the time limit. Currently, the default value of this configuration is 10000. Specifying a smaller value will provide a tighter bound on the time limit, but may hurt scan performance due to the higher frequency of calls to System.currentTimeMillis().

Protobuf models for ScanRequest and ScanResponse have been updated so that heartbeat support can be communicated. Support for heartbeat messages is specified in the request sent to the server via ScanRequest.Builder#setClientHandlesHeartbeats. Only when the server sees that ScanRequest#getClientHandlesHeartbeats() is true will it send heartbeat messages back to the client. A response is marked as a heartbeat message via the boolean flag ScanResponse#getHeartbeatMessage.
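
The time-limit arithmetic and the periodic clock check can be sketched as follows. This is an illustration of the behaviour described above, not the server code; the class, the method names and the timeout values used in `main` are made up for the example.

```java
import java.util.function.LongSupplier;

/** Illustrative only; not the HBase server implementation. */
final class ScanTimeLimitSketch {
    /** Half of the more restrictive of the two timeouts. */
    static long timeLimitMillis(long scannerTimeoutMillis, long rpcTimeoutMillis) {
        return Math.min(scannerTimeoutMillis, rpcTimeoutMillis) / 2;
    }

    /**
     * Scans up to totalCells cells and returns how many were scanned before
     * stopping. The clock is consulted only once every cellsPerCheck cells.
     */
    static int scan(int totalCells, long cellsPerCheck, long deadlineMillis, LongSupplier clock) {
        int scanned = 0;
        while (scanned < totalCells) {
            scanned++;
            if (scanned % cellsPerCheck == 0 && clock.getAsLong() >= deadlineMillis) {
                break; // time limit reached: return whatever has been accumulated
            }
        }
        return scanned;
    }

    public static void main(String[] args) {
        // Example timeouts only; they are not claimed to be the HBase defaults.
        long limit = timeLimitMillis(60_000, 30_000);
        long deadline = System.currentTimeMillis() + limit;
        System.out.println("time limit (ms): " + limit);
        System.out.println("cells scanned: "
            + scan(25_000, 10_000, deadline, System::currentTimeMillis));
    }
}
```

Because the clock is only read every `cellsPerCheck` cells, a scan can overrun its deadline by up to that many cells, which is the trade-off the configuration controls.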
19093
19094
19095 ---
19096
19097 * [HBASE-13307](https://issues.apache.org/jira/browse/HBASE-13307) | *Major* | **Making methods under ScannerV2#next inlineable, faster**
19098
Made the methods under Scanner#next smaller so that they can be inlined and compiled (HotSpot was reporting 'too big to compile'). Uses Unsafe to parse shorts rather than BB#getShort, which is faster, among other changes.
19100
19101
19102 ---
19103
19104 * [HBASE-13453](https://issues.apache.org/jira/browse/HBASE-13453) | *Critical* | **Master should not bind to region server ports**
19105
In 1.0.x, the master by default binds to the region server ports (both rpc and info). This change brings back the use of the old master rpc and info ports in the 1.1+ and master (2.0) branches. The motivation is to make life easier for users, who no longer need to do anything special to bring up a region server on the same host, and to make the migration from 0.98 to 1.1 hassle-free. However, users going from 1.0 to 1.1 will see a change in the master ports.
19107
19108
19109 ---
19110
19111 * [HBASE-13419](https://issues.apache.org/jira/browse/HBASE-13419) | *Major* | **Thrift gateway should propagate text from exception causes.**
19112
19113 Compose thrift exception text from the text of the entire cause chain of the underlying exception.
19114
19115
19116 ---
19117
19118 * [HBASE-13275](https://issues.apache.org/jira/browse/HBASE-13275) | *Major* | **Setting hbase.security.authorization to false does not disable authorization**
19119
Prior to this change the configuration setting 'hbase.security.authorization' had no effect if security coprocessors were installed. The act of installing the security coprocessors was assumed to indicate active authorization was desired and required. Now it is possible to install the security coprocessors yet have them operate in a passive state with active authorization disabled by setting 'hbase.security.authorization' to false. This can be useful but is probably not what you want. For more information, consult the Security section of the HBase online manual.

'hbase.security.authorization' defaults to true for backwards compatible behavior.
19123
19124
19125 ---
19126
19127 * [HBASE-13118](https://issues.apache.org/jira/browse/HBASE-13118) | *Major* | **[PE] Add being able to write many columns**
19128
19129 Adds a --columns option to PE so you can write more than one column (changes default qualifier from 'data' to '0').
19130
19131
19132 ---
19133
19134 * [HBASE-13270](https://issues.apache.org/jira/browse/HBASE-13270) | *Major* | **Setter for Result#getStats is #addResults; confusing!**
19135
19136 Deprecates Result#addResults in favor of Result#setStatistics
19137
19138
19139 ---
19140
19141 * [HBASE-13362](https://issues.apache.org/jira/browse/HBASE-13362) | *Major* | **Set max result size from client only (like scanner caching).**
19142
This introduces a new config option: hbase.server.scanner.max.result.size
This setting enforces a maximum result size (in bytes); when it is reached, the server returns the results it has so far.
This is a safety setting and should be kept large. The default is infinite in 0.98 and 1.0.x and 100 MB in 1.1 and later.

Use hbase.client.scanner.max.result.size instead to enforce practical chunk sizes of a few MB (defaults to 2 MB).
19148
19149
19150 ---
19151
19152 * [HBASE-11544](https://issues.apache.org/jira/browse/HBASE-11544) | *Critical* | **[Ergonomics] hbase.client.scanner.caching is dogged and will try to return batch even if it means OOME**
19153
Results returned from RPC calls may now be returned as partials.

When is a Result marked as a partial?
When the server must stop the scan because the max size limit has been reached. This means that the LAST Result returned within the ScanResult's Result array may be marked as a partial if the scan's max size limit caused it to stop in the middle of a row.

Incompatible Change: The return type of InternalScanner#next and RegionScanner#nextRaw has been changed from boolean to NextState.

- The previous boolean return value can be accessed via NextState#hasMoreValues().
- NextState provides more context as to what happened inside the scanner.

The scan caching default has been changed to Integer.MAX\_VALUE.

- This value works together with the new maxResultSize value from HBASE-12976 (defaults to 2MB).
- Results are returned from the server on the basis of size rather than number of rows.
- This makes better use of the network, since row size varies amongst tables.
19167
19168 Protobuf models have changed for Result, ScanRequest, and ScanResponse to support new partial Results
19169
19170 Partial Results should be invisible to application layer unless Scan#setAllowPartials is set
19171
19172 Scan#setAllowPartials has been added to allow the application to request to see the partial Results returned by the server rather than have the ClientScanner form the complete Result prior to returning it to the application
19173
19174 To disable the use of partial Results on the server, set ScanRequest.Builder#setClientHandlesPartials() to be false in the ScanRequest issued to server
19175
19176 Partial Results should allow the server to return large rows in parts rather than accumulate all the cells for that particular row and run out of memory
19177
19178
19179 ---
19180
19181 * [HBASE-11864](https://issues.apache.org/jira/browse/HBASE-11864) | *Minor* | **Enhance HLogPrettyPrinter to print information from WAL Header**
19182
19183 Enhance WALPrettyPrinter to print information (writer classnames and cell codec classname) from WAL Header
19184
19185
19186 ---
19187
19188 * [HBASE-13289](https://issues.apache.org/jira/browse/HBASE-13289) | *Major* | **typo in splitSuccessCount  metric**
19189
19190 In hbase 1.0.0, 0.98.10, 0.98.10.1, 0.98.11, and 0.98.12 'splitSuccessCount' was misspelled as 'splitSuccessCounnt'
19191
19192
19193 ---
19194
19195 * [HBASE-12990](https://issues.apache.org/jira/browse/HBASE-12990) | *Major* | **MetaScanner should be replaced by MetaTableAccessor**
19196
19197 Removes MetaScanner. Use MetaTableAccessor instead.
19198
19199
19200 ---
19201
19202 * [HBASE-13373](https://issues.apache.org/jira/browse/HBASE-13373) | *Major* | **Squash HFileReaderV3 together with HFileReaderV2 and AbstractHFileReader; ditto for Scanners and BlockReader, etc.**
19203
Marking as incompatible change. Requires hfiles to be major version \>= 2 and minor version \>= 3.  Version 3 files are enabled by default in 1.0.  0.98 writes version 2, minor version 3.  You cannot go to 1.0 from anything before 0.98.
19205
19206
19207 ---
19208
19209 * [HBASE-13252](https://issues.apache.org/jira/browse/HBASE-13252) | *Major* | **Get rid of managed connections and connection caching**
19210
19211 For a long time, HBase supported 2 types of connections - managed, which were cached and closed automatically when not needed, and unmanaged, where user is responsible for closing the connections by calling #close() on them.
19212
19213 The concept of managed connections in HBase (deprecated before) has now been extinguished completely, and now all callers are responsible for managing the lifecycle of connections they acquire.
19214
19215
19216 ---
19217
19218 * [HBASE-12954](https://issues.apache.org/jira/browse/HBASE-12954) | *Minor* | **Ability impaired using HBase on multihomed hosts**
19219
19220 The following config is added by this JIRA:
19221
19222 hbase.regionserver.hostname
19223
19224 This config is for experts: don't set its value unless you really know what you are doing.
19225 When set to a non-empty value, this represents the (external facing) hostname for the underlying server.
19226 See https://issues.apache.org/jira/browse/HBASE-12954 for details.
19227
19228 Caution: please make sure rolling upgrade succeeds before turning on this feature.
19229
19230
19231 ---
19232
19233 * [HBASE-13187](https://issues.apache.org/jira/browse/HBASE-13187) | *Critical* | **Add ITBLL that exercises per CF flush**
19234
Pass the -D flag generator.multiple.columnfamilies on the command line if you want the generator to write three column families rather than the default one. When set, we write the usual 'meta' column family and use it to check that the linked list is intact, but we also write a 'tiny' column family and a 'big' column family to provoke uneven flushing; this is good for testing the flush-by-columnfamily feature.
19236
19237
19238 ---
19239
19240 * [HBASE-13361](https://issues.apache.org/jira/browse/HBASE-13361) | *Minor* | **Remove or undeprecate {get\|set}ScannerCaching in HTable**
19241
19242 Removed getScannerCaching and setScannerCaching from Table
19243
19244
19245 ---
19246
19247 * [HBASE-10728](https://issues.apache.org/jira/browse/HBASE-10728) | *Major* | **get\_counter value is never used.**
19248
For 0.98 and 1.0, the changes are compatible (due to mitigation by HBASE-13433):

- The "get\_counter" command no longer requires a dummy 4th argument. Downstream users are encouraged to migrate code to not pass this argument because it will result in an error for HBase 1.1+.
- The "incr" command now outputs the current value of the counter to stdout.

Example:

    jruby-1.6.8 :005 > incr 'counter_example', 'r1', 'cf1:foo', 10
    COUNTER VALUE = 1772
    0 row(s) in 0.1180 seconds

For 1.1+, the changes are incompatible:

- The "get\_counter" command no longer accepts a dummy 4th argument. Downstream users will need to update their code to not pass this argument.

Example:

    jruby-1.6.8 :006 > get_counter 'counter_example', 'r1', 'cf1:foo'
    COUNTER VALUE = 1772

- The "incr" command now outputs the current value of the counter to stdout.

Example:

    jruby-1.6.8 :005 > incr 'counter_example', 'r1', 'cf1:foo', 10
    COUNTER VALUE = 1772
    0 row(s) in 0.1180 seconds
19276
19277
19278 ---
19279
19280 * [HBASE-13170](https://issues.apache.org/jira/browse/HBASE-13170) | *Major* | **Allow block cache to be external**
19281
HBase can use memcached as an external block cache. To use this, set hbase.blockcache.use.external to true and set hbase.cache.memcached.servers to the list of memcached servers to use.
19283
19284
19285 ---
19286
19287 * [HBASE-13316](https://issues.apache.org/jira/browse/HBASE-13316) | *Minor* | **Reduce the downtime on planned moves of regions**
19288
When an Admin.move command is issued, the RegionServer that receives the region will try to open the StoreFiles of that region to prime the block cache with index blocks.
19290
19291
19292 ---
19293
19294 * [HBASE-13298](https://issues.apache.org/jira/browse/HBASE-13298) | *Critical* | **Clarify if Table.{set\|get}WriteBufferSize() is deprecated or not**
19295
19296 Deprecate said methods. They were mistakenly included in Table Interface.
19297
19298
19299 ---
19300
19301 * [HBASE-13248](https://issues.apache.org/jira/browse/HBASE-13248) | *Major* | **Make HConnectionImplementation top-level class.**
19302
19303 **WARNING: No release note provided for this change.**
19304
19305
19306 ---
19307
19308 * [HBASE-13331](https://issues.apache.org/jira/browse/HBASE-13331) | *Blocker* | **Exceptions from DFS client can cause CatalogJanitor to delete referenced files**
19309
19310 Fixes an issue where files from a split region that were still referenced were erroneously deleted leading to data loss.
19311
19312
19313 ---
19314
19315 * [HBASE-13273](https://issues.apache.org/jira/browse/HBASE-13273) | *Major* | **Make Result.EMPTY\_RESULT read-only; currently it can be modified**
19316
19317 The Result.EMPTY\_RESULT object is now immutable. In previous releases, the object could be modified by a caller to no longer be empty. Code that relies on this behavior will now receive an UnsupportedOperationException.
19318
19319
19320 ---
19321
19322 * [HBASE-12867](https://issues.apache.org/jira/browse/HBASE-12867) | *Major* | **Shell does not support custom replication endpoint specification**
19323
19324 Adds support to add\_peer in hbase shell to add a custom replication endpoint from HBASE-12254.
19325
19326
19327 ---
19328
19329 * [HBASE-13198](https://issues.apache.org/jira/browse/HBASE-13198) | *Major* | **Remove HConnectionManager**
19330
19331 **WARNING: No release note provided for this change.**
19332
19333
19334 ---
19335
19336 * [HBASE-12586](https://issues.apache.org/jira/browse/HBASE-12586) | *Major* | **Task 6 & 7 from HBASE-9117,  delete all public HTable constructors and delete ConnectionManager#{delete,get}Connection**
19337
19338 HTable class has been marked as private API before, and now it's no longer directly instantiable from client code (all public constructors have been removed). All clients should use Connection#getTable() and Connection#getRegionLocator() when appropriate to obtain Table and RegionLocator implementations to work with.
19339
19340
19341 ---
19342
19343 * [HBASE-13171](https://issues.apache.org/jira/browse/HBASE-13171) | *Minor* | **Change AccessControlClient methods to accept connection object to reduce setup time.**
19344
19345 **WARNING: No release note provided for this change.**
19346
19347
19348 ---
19349
19350 * [HBASE-12706](https://issues.apache.org/jira/browse/HBASE-12706) | *Critical* | **Support multiple port numbers in ZK quorum string**
19351
The hbase.zookeeper.quorum configuration now allows servers to be listed together with their client ports, consistent with the way the ZooKeeper Java client accepts the quorum string. In this case, hbase.zookeeper.clientPort is not needed. For example: hbase.zookeeper.quorum=myserver1:2181,myserver2:20000,myserver3:31111
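
One way to picture the resulting quorum string is a small normalizer that keeps explicit ports and applies a default port only where none is given. This is a sketch with hypothetical names, not the HBase parsing code, and it deliberately ignores IPv6 literals.

```java
import java.util.ArrayList;
import java.util.List;

/** Illustrative only; not the HBase quorum parser. IPv6 literals are not handled. */
final class QuorumStringSketch {
    /** Appends defaultPort to every server that does not carry its own port. */
    static List<String> withPorts(String quorum, int defaultPort) {
        List<String> servers = new ArrayList<>();
        for (String part : quorum.split(",")) {
            String server = part.trim();
            if (server.isEmpty()) {
                continue; // tolerate stray commas
            }
            servers.add(server.contains(":") ? server : server + ":" + defaultPort);
        }
        return servers;
    }

    public static void main(String[] args) {
        System.out.println(withPorts("myserver1:2181,myserver2:20000,myserver3:31111", 2181));
        System.out.println(withPorts("myserver1,myserver2:20000", 2181));
    }
}
```

Servers that already name a port are passed through untouched, so a quorum string that mixes both styles still resolves to one port per server.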
19353
19354
19355 ---
19356
19357 * [HBASE-13142](https://issues.apache.org/jira/browse/HBASE-13142) | *Major* | **[PERF] Reuse the IPCUtil#buildCellBlock buffer**
19358
Adds buffer reuse when sending Cell results. It is on by default and should not need configuration. It improves the GC profile and increases throughput. The benefit grows with the size of the rows returned.

The buffer reservoir is bounded at a maximum count after which we will start logging at WARN level that the reservoir is running at capacity (returned buffers will be discarded and not added back to the reservoir pool). Default maximum is twice the handler count: i.e. 2 \* hbase.regionserver.handler.count. This should be more than enough. Set the maximum with the new configuration: hbase.ipc.server.reservoir.max

The reservoir will not cache buffers in excess of hbase.ipc.server.reservoir.max.buffer.size. The default is 10MB. This means that if a row is very large, we allocate a buffer of the average size currently in the pool and then resize it until it can accommodate the return. These resizes are expensive. The resultant buffer will be used and then discarded.

To check how the reservoir is doing, enable trace level logging for a few seconds on a regionserver. You can do this from the regionserver UI. See 'Log Level'. Set org.apache.hadoop.hbase.io.BoundedByteBufferPool to TRACE. The BoundedByteBufferPool will write reports to the log. Disable the TRACE level and then check the log. You'll see allocation rate, size of pool, size of buffers in pool, etc.
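
The two bounds described above (a maximum buffer count and a maximum cached buffer size) can be sketched as a small pool. This is a simplified illustration, not the HBase BoundedByteBufferPool: the class name is hypothetical, and new buffers are allocated at a fixed initial size rather than at the running average size.

```java
import java.nio.ByteBuffer;
import java.util.ArrayDeque;

/** Simplified illustration; not the HBase BoundedByteBufferPool. */
final class ReservoirSketch {
    private final ArrayDeque<ByteBuffer> pool = new ArrayDeque<>();
    private final int maxCount;      // role of hbase.ipc.server.reservoir.max
    private final int maxBufferSize; // role of hbase.ipc.server.reservoir.max.buffer.size
    private final int initialSize;   // simplification: fixed size for fresh buffers

    ReservoirSketch(int maxCount, int maxBufferSize, int initialSize) {
        this.maxCount = maxCount;
        this.maxBufferSize = maxBufferSize;
        this.initialSize = initialSize;
    }

    /** Hands out a pooled buffer if one is available, otherwise allocates a new one. */
    synchronized ByteBuffer get() {
        ByteBuffer buffer = pool.poll();
        if (buffer == null) {
            return ByteBuffer.allocate(initialSize);
        }
        buffer.clear();
        return buffer;
    }

    /** Returns true if the buffer was kept for reuse, false if it was discarded. */
    synchronized boolean put(ByteBuffer buffer) {
        if (buffer.capacity() > maxBufferSize || pool.size() >= maxCount) {
            return false; // oversized, or the reservoir is already at capacity
        }
        pool.add(buffer);
        return true;
    }

    synchronized int size() {
        return pool.size();
    }

    public static void main(String[] args) {
        ReservoirSketch reservoir = new ReservoirSketch(2, 1024, 64);
        System.out.println(reservoir.put(ByteBuffer.allocate(64)));   // kept
        System.out.println(reservoir.put(ByteBuffer.allocate(2048))); // too large, discarded
        System.out.println(reservoir.size());
    }
}
```

Discarding oversized buffers keeps a few very large rows from pinning memory in the pool, at the price of reallocating for those rows each time.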
19366
19367
19368 ---
19369
19370 * [HBASE-13012](https://issues.apache.org/jira/browse/HBASE-13012) | *Major* | **Add shell commands to trigger the mob file compactor**
19371
19372 This adds two new shell commands -- compact\_mob and major\_compact\_mob to the hbase shell.
19373
19374 Run compaction on a mob enabled column family or all mob enabled column families within a table
19375           Examples:
19376           Compact a column family within a table:
19377           hbase\> compact\_mob 't1', 'c1'
19378           Compact all mob enabled column families
19379           hbase\> compact\_mob 't1'
19380
19381 Run major compaction on a mob enabled column family or all mob enabled column families within a table
19382           Examples:
19383           Compact a column family within a table:
19384           hbase\> major\_compact\_mob 't1', 'c1'
19385           Compact all mob enabled column families within a table
19386           hbase\> major\_compact\_mob 't1'
19387
19388
19389 ---
19390
19391 * [HBASE-12869](https://issues.apache.org/jira/browse/HBASE-12869) | *Major* | **Add a REST API implementation of the ClusterManager interface**
19392
19393 Adds an implementation of ClusterManager to control REST API-managed HBase clusters.
19394
19395
19396 ---
19397
19398 * [HBASE-13047](https://issues.apache.org/jira/browse/HBASE-13047) | *Trivial* | **Add "HBase Configuration" link missing on the table details pages**
19399
19400 Add a '/conf' link to UI
19401
19402
19403 ---
19404
19405 * [HBASE-13044](https://issues.apache.org/jira/browse/HBASE-13044) | *Minor* | **Configuration option for disabling coprocessor loading**
19406
19407 This change adds two new configuration options:
19408 - "hbase.coprocessor.enabled" controls globally if any coprocessors will be loaded. Set to "false" to disable. Defaults to "true" for compatibility with previous releases.
19409 - "hbase.coprocessor.user.enabled" controls if any user (aka table) coprocessors will be loaded. Set to "false" to disable. Defaults to "true" for compatibility with previous releases.
19410
19411
19412 ---
19413
19414 * [HBASE-12961](https://issues.apache.org/jira/browse/HBASE-12961) | *Minor* | **Negative values in read and write region server metrics**
19415
Changes the read and write request counts in ServerLoad from int to long.
19417
19418
19419 ---
19420
19421 * [HBASE-7332](https://issues.apache.org/jira/browse/HBASE-7332) | *Minor* | **[webui] HMaster webui should display the number of regions a table has.**
19422
Adds counts for the various region states to the table listing on the main page. See attached screenshot.
19424
19425
19426 ---
19427
19428 * [HBASE-8329](https://issues.apache.org/jira/browse/HBASE-8329) | *Major* | **Limit compaction speed**
19429
Adds a compaction throughput limit mechanism (the word "throttle" is already used when choosing the compaction thread pool, so a different word is used here to avoid ambiguity). The default controller is org.apache.hadoop.hbase.regionserver.compactions.PressureAwareCompactionThroughputController, which limits throughput as follows:
1. In off-peak hours, it uses a fixed limit, "hbase.hstore.compaction.throughput.offpeak" (default is Long.MAX\_VALUE, which means no limit).
2. In normal hours, the limit is tuned between "hbase.hstore.compaction.throughput.lower.bound" (default 10MB/sec) and "hbase.hstore.compaction.throughput.higher.bound" (default 20MB/sec), using the formula "lower + (higher - lower) \* param", where param is in the range [0.0, 1.0] and is calculated from the store file count on this regionserver.
3. If some stores have too many store files (storefilesCount \> blockingFileCount), there is no limit, whether in peak or off-peak hours.
You can set "hbase.regionserver.throughput.controller" to org.apache.hadoop.hbase.regionserver.throttle.NoLimitThroughputController to disable throughput control.
ConfigurationObserver is implemented, so all of the configurations above can be changed without restarting the cluster.
19436
19437 The throttle is on by default in hbase-2.0.0. There is no limit in hbase-1.x.
19438
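The three rules above can be written out as a small function. This is a sketch under stated assumptions, not the controller's actual code: the class and method names are hypothetical, limits are modelled as doubles in bytes per second, and `param` is taken as an input because the note does not spell out how it is derived from the store file count.

```java
/** Illustrative only: restates the three documented rules, not HBase's controller. */
final class CompactionThroughputSketch {
    /** Stand-in for "no limit". */
    static final double NO_LIMIT = Double.MAX_VALUE;

    static double limitBytesPerSec(boolean tooManyStoreFiles, boolean offPeak,
                                   double offPeakLimit, double lower, double higher,
                                   double param) {
        if (tooManyStoreFiles) {
            return NO_LIMIT; // rule 3: no limit, peak or off-peak
        }
        if (offPeak) {
            return offPeakLimit; // rule 1: fixed off-peak limit
        }
        // rule 2: interpolate between the bounds, with param clamped to [0.0, 1.0]
        double clamped = Math.max(0.0, Math.min(1.0, param));
        return lower + (higher - lower) * clamped;
    }

    public static void main(String[] args) {
        double mb = 1024.0 * 1024.0;
        // Default bounds of 10MB/sec and 20MB/sec with param = 0.5: prints 15.0
        System.out.println(
            limitBytesPerSec(false, false, NO_LIMIT, 10 * mb, 20 * mb, 0.5) / mb);
    }
}
```

With the default bounds, a `param` of 0.0 yields 10MB/sec and a `param` of 1.0 yields 20MB/sec.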
19439
19440 ---
19441
19442 * [HBASE-6778](https://issues.apache.org/jira/browse/HBASE-6778) | *Major* | **Deprecate Chore; its a thread per task when we should have one thread to do all tasks**
19443
19444 Corresponding usages for new ScheduledChore vs. Deprecated Chore:
19445 Chore.interrupt() -\> ScheduledChore.cancel(mayInterruptWhileRunning = true)
19446 Threads.setDaemonThreadRunning(Chore) -\> ChoreService.scheduleChore(ScheduledChore)
19447 Chore.isAlive -\> ScheduledChore.isScheduled()
19448 Chore.getSleeper().skipSleepCycle() -\> ScheduledChore.triggerNow()
19449
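The motivation in the issue title, one thread servicing all periodic tasks instead of a thread per task, can be illustrated with plain java.util.concurrent. The sketch below is an analogy only and does not use ChoreService or ScheduledChore; the class and method names are hypothetical.

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.Executors;
import java.util.concurrent.ScheduledExecutorService;
import java.util.concurrent.ScheduledFuture;
import java.util.concurrent.TimeUnit;

/** Illustrative only: two periodic tasks sharing one scheduler thread. */
final class SharedSchedulerSketch {
    /**
     * Schedules two periodic tasks on a single-threaded scheduler, waits until each
     * has run at least `runs` times, then cancels both. Returns true on success.
     */
    static boolean runTwoTasksOnOneThread(int runs) {
        ScheduledExecutorService scheduler = Executors.newScheduledThreadPool(1);
        CountDownLatch first = new CountDownLatch(runs);
        CountDownLatch second = new CountDownLatch(runs);
        try {
            ScheduledFuture<?> a =
                scheduler.scheduleAtFixedRate(first::countDown, 0, 10, TimeUnit.MILLISECONDS);
            ScheduledFuture<?> b =
                scheduler.scheduleAtFixedRate(second::countDown, 0, 10, TimeUnit.MILLISECONDS);
            boolean done = first.await(5, TimeUnit.SECONDS)
                && second.await(5, TimeUnit.SECONDS);
            // Roughly analogous to cancelling with mayInterruptWhileRunning = true.
            a.cancel(true);
            b.cancel(true);
            return done && a.isCancelled() && b.isCancelled();
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            return false;
        } finally {
            scheduler.shutdownNow();
        }
    }

    public static void main(String[] args) {
        System.out.println(runTwoTasksOnOneThread(3));
    }
}
```

Both tasks make progress even though the pool holds a single thread, which is the property the switch from Chore to ScheduledChore is after.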
19450
19451 ---
19452
19453 * [HBASE-11574](https://issues.apache.org/jira/browse/HBASE-11574) | *Major* | **hbase:meta's regions can be replicated**
19454
On the server side, set hbase.meta.replica.count to the number of replicas of meta that you want in the cluster (defaults to 1). hbase.regionserver.meta.storefile.refresh.period should be set to a non-zero number of milliseconds, something like 30000 (defaults to 0).
19456 On the client/user side, set hbase.meta.replicas.use to true.
19457
19458
19459 ---
19460
19461 * [HBASE-12808](https://issues.apache.org/jira/browse/HBASE-12808) | *Major* | **Use Java API Compliance Checker for binary/source compatibility**
19462
19463 Adds a dev-support/check\_compatibility.sh script for comparing versions. Run the script to see usage.
19464
19465
19466 ---
19467
19468 * [HBASE-12684](https://issues.apache.org/jira/browse/HBASE-12684) | *Major* | **Add new AsyncRpcClient**
19469
Retrofits a new, netty-based rpc transport on the client. This client is slightly slower when there is little contention, given the extra tier or so that netty adds and given that we block on a Future waiting for the call to finish. This client opens the way for HBase to have a native Async API.
19471
19472 This client is on by default in master branch (2.0 hbase). It is off in branch-1.0 (hbase-1.1.x).  To enable it, set "hbase.rpc.client.impl" to "org.apache.hadoop.hbase.ipc.AsyncRpcClient"
19473
19474
19475 ---
19476
19477 * [HBASE-8410](https://issues.apache.org/jira/browse/HBASE-8410) | *Major* | **Basic quota support for namespaces**
19478
Namespace auditor provides basic quota support for namespaces in terms of number of tables and number of regions. To use namespace quotas, quota support must be enabled by setting the "hbase.quota.enabled" property to true in the hbase-site.xml file.

Users can add quota information to a namespace when creating it or by altering an existing one.
19483
19484 Examples:
19485 1. create\_namespace 'ns1', {'hbase.namespace.quota.maxregions'=\>'10'}
19486 2. create\_namespace 'ns2', {'hbase.namespace.quota.maxtables'=\>'2','hbase.namespace.quota.maxregions'=\>'5'}
19487 3. alter\_namespace 'ns3', {METHOD =\> 'set', 'hbase.namespace.quota.maxtables'=\>'5','hbase.namespace.quota.maxregions'=\>'25'}
19488
Quotas can be added to or modified on a namespace at any time. To remove quotas, use the following commands:
19490
19491 alter\_namespace 'ns3', {METHOD =\> 'unset', NAME =\> 'hbase.namespace.quota.maxtables'}
19492 alter\_namespace 'ns3', {METHOD =\> 'unset', NAME =\> 'hbase.namespace.quota.maxregions'}
19493
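The kind of check that these two properties imply can be sketched as below. This is not the namespace auditor's code: the class and method names are hypothetical, and treating an unset property as "no limit" is an assumption of this example.

```java
import java.util.HashMap;
import java.util.Map;

/** Illustrative only: a hypothetical quota check, not HBase's namespace auditor. */
final class NamespaceQuotaSketch {
    static final String MAX_TABLES = "hbase.namespace.quota.maxtables";
    static final String MAX_REGIONS = "hbase.namespace.quota.maxregions";

    /** Returns true if the given table and region counts fit within the namespace's quotas. */
    static boolean withinQuota(Map<String, String> namespaceProps, int tables, int regions) {
        return tables <= limit(namespaceProps, MAX_TABLES)
            && regions <= limit(namespaceProps, MAX_REGIONS);
    }

    private static int limit(Map<String, String> props, String key) {
        String value = props.get(key);
        // Assumption: an unset quota means the dimension is unlimited.
        return value == null ? Integer.MAX_VALUE : Integer.parseInt(value);
    }

    public static void main(String[] args) {
        // Mirrors example 2 above: at most 2 tables and 5 regions in 'ns2'.
        Map<String, String> ns2 = new HashMap<>();
        ns2.put(MAX_TABLES, "2");
        ns2.put(MAX_REGIONS, "5");
        System.out.println(withinQuota(ns2, 2, 5)); // fits within both quotas
        System.out.println(withinQuota(ns2, 3, 5)); // exceeds the table quota
    }
}
```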
19494
19495 ---
19496
19497 * [HBASE-12902](https://issues.apache.org/jira/browse/HBASE-12902) | *Major* | **Post-asciidoc conversion fix-ups**
19498
19499 Pushed to master. Shout if there are any issues.
19500
19501
19502 ---
19503
19504 * [HBASE-12848](https://issues.apache.org/jira/browse/HBASE-12848) | *Major* | **Utilize Flash storage for WAL**
19505
19506 For users on a version of Hadoop that supports tiered storage policies (i.e. Apache Hadoop 2.6.0+), HBase now allows users to opt-in to having the write ahead log placed on the SSD tier. Users on earlier versions of Hadoop will be unable to take advantage of this feature.
19507
19508 Use of tiered storage is controlled by a new RegionServer config, hbase.wal.storage.policy. It defaults to the value 'NONE', which will rely on HDFS defaults for a policy decision.
19509
Users can specify ONE\_SSD or ALL\_SSD as the value:

- ONE\_SSD: place only one replica of WAL files on SSD and the remaining replicas in default storage
- ALL\_SSD: place all replicas of WAL files on SSD

See [the HDFS docs on storage policy](http://hadoop.apache.org/docs/r2.6.0/hadoop-project-dist/hadoop-hdfs/ArchivalStorage.html).
19515
19516
19517 ---
19518
19519 * [HBASE-11144](https://issues.apache.org/jira/browse/HBASE-11144) | *Major* | **Filter to support scanning multiple row key ranges**
19520
MultiRowRangeFilter is a filter that supports scanning multiple row key ranges. If the number of ranges is small, multiple scans can do the same job and work well. But when the number of ranges is large (e.g. millions), MultiRowRangeFilter is the better choice. The filter sorts and merges the ranges, so users do not have to worry about ranges that are not contiguous. For users accessing data through something like REST, Thrift or Pig, the filter might be the practical solution.
19522
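The sort-and-merge step mentioned above can be sketched as follows. This is a generic interval merge, not the filter's own code: the class names are hypothetical, row keys are modelled as Strings rather than byte arrays, and ranges are assumed to be half-open [start, stop).

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

/** Illustrative only: generic interval merging over String keys. */
final class RowRangeMergeSketch {
    /** Half-open range [start, stop). */
    static final class Range {
        final String start;
        final String stop;

        Range(String start, String stop) {
            this.start = start;
            this.stop = stop;
        }

        @Override
        public String toString() {
            return "[" + start + ", " + stop + ")";
        }
    }

    /** Sorts ranges by start key and merges any that overlap or touch. */
    static List<Range> sortAndMerge(List<Range> ranges) {
        List<Range> sorted = new ArrayList<>(ranges);
        sorted.sort(Comparator.comparing((Range r) -> r.start));
        List<Range> merged = new ArrayList<>();
        for (Range current : sorted) {
            if (!merged.isEmpty()) {
                Range last = merged.get(merged.size() - 1);
                if (current.start.compareTo(last.stop) <= 0) {
                    // Overlapping or adjacent: extend the previous range if needed.
                    String stop = last.stop.compareTo(current.stop) >= 0 ? last.stop : current.stop;
                    merged.set(merged.size() - 1, new Range(last.start, stop));
                    continue;
                }
            }
            merged.add(current);
        }
        return merged;
    }

    public static void main(String[] args) {
        List<Range> input = new ArrayList<>();
        input.add(new Range("m", "p"));
        input.add(new Range("a", "c"));
        input.add(new Range("b", "f"));
        System.out.println(sortAndMerge(input)); // prints [[a, f), [m, p)]
    }
}
```

After merging, the ranges are disjoint and ordered, so a scanner can move from one range to the next without revisiting rows.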
19523
19524 ---
19525
19526 * [HBASE-12268](https://issues.apache.org/jira/browse/HBASE-12268) | *Major* | **Add support for Scan.setRowPrefixFilter to shell**
19527
19528 Added new option, ROWPREFIXFILTER, to the scan command in the HBase shell to easily scan for a specific row prefix.
19529
19530
19531 ---
19532
19533 * [HBASE-12775](https://issues.apache.org/jira/browse/HBASE-12775) | *Major* | **CompressionTest ate my HFile (sigh!)**
19534
19535 CompressionTest will now abort when the target path exists.
19536
19537
19538 ---
19539
19540 * [HBASE-12695](https://issues.apache.org/jira/browse/HBASE-12695) | *Critical* | **JDK 1.8 compilation broken**
19541
19542 Use the -Pjavac maven profile in order to compile HBase using the compiler provided by the JDK instead of the default error-prone compiler plugin. This is useful for now if you are building HBase with JDK 1.8 or a JDK that doesn't support error-prone.
19543
19544
19545 ---
19546
19547 * [HBASE-10201](https://issues.apache.org/jira/browse/HBASE-10201) | *Major* | **Port 'Make flush decisions per column family' to trunk**
19548
Adds a new flushing policy mechanism. The default, org.apache.hadoop.hbase.regionserver.FlushLargeStoresPolicy, will try to avoid flushing out the small column families in a region, those whose memstores are \< hbase.hregion.percolumnfamilyflush.size.lower.bound. To restore the old behavior of flushes writing out all column families, set hbase.regionserver.flush.policy to org.apache.hadoop.hbase.regionserver.FlushAllStoresPolicy, either in hbase-default.xml or on a per-table basis by setting the policy to use with HTableDescriptor.setFlushPolicyClassName().
19550
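The selection rule described above can be sketched as a small function. It is not the FlushLargeStoresPolicy source: the class and method names are hypothetical, and the fallback of flushing every family when none reaches the lower bound is an assumption of this example.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

/** Illustrative only: picks column families to flush by memstore size. */
final class FlushSelectionSketch {
    /** Returns the families whose memstore size (in bytes) is at or above the lower bound. */
    static List<String> selectStoresToFlush(Map<String, Long> memstoreSizes, long lowerBound) {
        List<String> large = new ArrayList<>();
        for (Map.Entry<String, Long> entry : memstoreSizes.entrySet()) {
            if (entry.getValue() >= lowerBound) {
                large.add(entry.getKey());
            }
        }
        // Assumed fallback: if no family is large enough, flush all of them.
        return large.isEmpty() ? new ArrayList<>(memstoreSizes.keySet()) : large;
    }

    public static void main(String[] args) {
        Map<String, Long> sizes = new LinkedHashMap<>();
        sizes.put("big", 64L * 1024 * 1024);
        sizes.put("small", 1L * 1024 * 1024);
        System.out.println(selectStoresToFlush(sizes, 16L * 1024 * 1024)); // prints [big]
    }
}
```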
19551
19552 ---
19553
19554 * [HBASE-12559](https://issues.apache.org/jira/browse/HBASE-12559) | *Major* | **Provide LoadBalancer with online configuration capability**
19555
The updateConfiguration(ServerName server) method of Admin now updates the configuration of the HMaster as well.
Specifically, the load balancer picks up the configuration update.
19558
19559
19560 ---
19561
19562 * [HBASE-10378](https://issues.apache.org/jira/browse/HBASE-10378) | *Major* | **Divide HLog interface into User and Implementor specific interfaces**
19563
19564 HBase internals for the write ahead log have been refactored. Advanced users of HBase should be aware of the following changes.
19565
19566 Public Audience
19567   - The Admin API for asking a region server to roll WAL files has changed from a synchronous command that returns a set of regions the WAL implementation would like flushed into an asynchronous command that returns nothing. Older clients relying on the former behavior will still be able to interact with newer servers, but the response body will always contain an empty list of regions to flush.
19568   - The shell command "hlog\_roll" has been deprecated. Operators should use the "wal\_roll" command instead. This command is subject to the changes described above for the Admin API to roll WAL files.
19569   - The command for analyzing write ahead logs has been renamed from 'hlog' to 'wal'. The old usage is deprecated and will be removed in a future version.
  - Some utility methods in HBaseTestingUtility related to testing write-ahead logs were changed in incompatible ways. No functionality has been removed, but method names and arguments have changed. See the HBaseTestingUtility javadoc for details.
19571   - The WALPlayer utility has deprecated the configuration keys used for advanced customization. Users should switch to the updated configuration keys. See the usage information on the WALPlayer tool for details.
19572   - The HLogInputFormat utility class for processing logs with MapReduce has been deprecated and will be removed in a future version. Users should switch to the WALInputFormat.
19573   - The labeling of server metrics on the region server status pages changed. Previously, the number of backing files for the write ahead log was labeled 'Num. HLog Files'. If you wish to see this statistic now, please look for the label 'Num. WAL Files.'  If you rely on JMX for these metrics, their location has not changed.
19574
19575 LimitedPrivate(COPROC) Audience, LimitedPrivate(PHOENIX)
19576   - The RegionObserver API has been updated. The changes are both binary and source backwards compatible for coprocessors that use the BaseRegionObserver class. For those that implement RegionObserver directly the changes are binary backwards compatible. Depending on the internals of future HBase versions, coprocessors using the deprecated API may not see all WAL related events. Users are strongly encouraged to update their use of the API; see the RegionObserver javadoc for details.
19577   - Classes related to reading WAL entries (ReaderBase, ProtobufLogReader, SequenceFileLogReader) have changed in a backwards incompatible way. Users who referenced HLog.Reader directly or HLog.Entry will have to update. These changes do not impact compatibility with extant wal files.
19578   - The WALObserver API has been updated. The changes are both binary and source backwards compatible for coprocessors that use the BaseWALObserver class. For those that implement WALObserver directly the changes are binary backwards compatible. Depending on the internals of future HBase versions, coprocessors using the deprecated API may not see all WAL related events. Users are strongly encouraged to update their use of the API; see the WALObserver javadoc for details.
  - The WALCoprocessorEnvironment has changed in a backwards incompatible way. WALObserver coprocessors that relied on retrieving an object representing the write ahead log instance will have to be updated.
19580
19581 LimitedPrivate(REPLICATION) Audience
  - The WALEntryFilter API has changed in a backwards incompatible way. Implementers will have to be updated.
  - The ReplicationEndpoint.ReplicateContext API has changed in a backwards incompatible way. Implementers who use this interface will have to be updated. These changes do not impact wire compatibility for replicating between clusters.
  - The HLogKey API is deprecated in favor of the WALKey API. Additionally, the HLogKey API has changed in a backwards incompatible way by changing from implementing WritableComparable\<HLogKey\> to implementing Writable and Comparable\<WALKey\>.
19585
19586
19587 ---
19588
19589 * [HBASE-11683](https://issues.apache.org/jira/browse/HBASE-11683) | *Major* | **Metrics for MOB**
19590
19591 Adds new mob related metrics:
19592
19593 mobCompactedIntoMobCellsCount
19594 mobCompactedIntoMobCellsSize
19595 mobCompactedFromMobCellsCount
19596 mobCompactedFromMobCellsSize
19597 mobFlushCount
19598 mobFlushedCellsCount
19599 mobFlushedCellsSize
19600 mobScanCellsCount
19601 mobScanCellsSize
19602 mobFileCacheAccessCount
19603 mobFileCacheMissCount
19604 mobFileCacheHitPercent
19605 mobFileCacheEvictedCount
19606 mobFileCacheCount
19607
19608
19609 ---
19610
19611 * [HBASE-11912](https://issues.apache.org/jira/browse/HBASE-11912) | *Major* | **Catch some bad practices at compile time with error-prone**
19612
Errors from error-prone will fail the build in the compile phase. Warnings look like javac warnings and are counted as such by test-patch, etc.
19614
19615
19616 ---
19617
19618 * [HBASE-12220](https://issues.apache.org/jira/browse/HBASE-12220) | *Major* | **Add hedgedReads and hedgedReadWins metrics**
19619
Adds the hedgedReads and hedgedReadWins count metrics.
19621
19622
19623 ---
19624
19625 * [HBASE-6290](https://issues.apache.org/jira/browse/HBASE-6290) | *Minor* | **Add a function a mark a server as dead and start the recovery the process**
19626
19627 Adds a script to mark a server as dead.
19628
19629 Usage: considerAsDead.sh --hostname serverName
19630
19631
19632 ---
19633
19634 * [HBASE-12111](https://issues.apache.org/jira/browse/HBASE-12111) | *Major* | **Remove deprecated APIs from Mutation(s)**
19635
The methods below were removed from hbase-2 (they were deprecated as of the release of hbase-1.0.0):
19637
19638 Mutation setWriteToWAL(boolean)
19639 boolean getWriteToWAL()
19640 Mutation setFamilyMap(NavigableMap\<byte [], List\<KeyValue\>\>)
19641 NavigableMap\<byte [], List\<KeyValue\>\> getFamilyMap()
19642
19643
19644 ---
19645
19646 * [HBASE-12084](https://issues.apache.org/jira/browse/HBASE-12084) | *Major* | **Remove deprecated APIs from Result**
19647
19648 The below KeyValue based APIs are removed from Result
19649 KeyValue[] raw()
19650 List\<KeyValue\> list()
19651 List\<KeyValue\> getColumn(byte [] family, byte [] qualifier)
19652 KeyValue getColumnLatest(byte [] family, byte [] qualifier)
19653 KeyValue getColumnLatest(byte [] family, int foffset, int flength, byte [] qualifier, int qoffset, int qlength)
19654
19655 They are replaced with
19656 Cell[] rawCells()
19657 List\<Cell\> listCells()
19658 List\<Cell\> getColumnCells(byte [] family, byte [] qualifier)
19659 Cell getColumnLatestCell(byte [] family, byte [] qualifier)
19660 Cell getColumnLatestCell(byte [] family, int foffset, int flength, byte [] qualifier, int qoffset, int qlength)
19661 respectively
19662
The constructors that took KeyValues were also removed:
19664 Result(KeyValue [] cells)
19665 Result(List\<KeyValue\> kvs)
19666
19667
19668 ---
19669
19670 * [HBASE-12048](https://issues.apache.org/jira/browse/HBASE-12048) | *Major* | **Remove deprecated APIs from Filter**
19671
19672 The following APIs are removed from Filter
19673 KeyValue transform(KeyValue)
19674 KeyValue getNextKeyHint(KeyValue)
19675 and replaced with
19676 Cell transformCell(Cell)
19677 Cell getNextCellHint(Cell)
19678 respectively.
If a custom Filter implementation overrides any of the removed methods, they will no longer be called. Users have to change the custom Filter to override the Cell-based methods shown above.
19680
19681
19682 ---
19683
19684 * [HBASE-7767](https://issues.apache.org/jira/browse/HBASE-7767) | *Major* | **Get rid of ZKTable, and table enable/disable state in ZK**
19685
19686 Keeps table enabled/disabled state in HDFS rather than up in ZooKeeper.  Auto-migrates any existing zk state.
19687
19688
19689 ---
19690
19691 * [HBASE-11911](https://issues.apache.org/jira/browse/HBASE-11911) | *Major* | **Break up tests into more fine grained categories**
19692
19693 Adds new test categories besides the class smalltests, mediumtests, and largetests.  Adds:
19694
19695 ClientTests
19696 CoprocessorTests
19697 FilterTests
19698 FlakeyTests
19699 IOTests
19700 MapReduceTests
19701 MasterTests
19702 MiscTests
19703 RegionServerTests
19704 ReplicationTests
19705 RestTests
19706 SecurityTests
19707 VerySlowMapReduceTests
19708 VerySlowRegionServerTests
19709
19710 See description for examples on how to use them.
19711
19712
19713 ---
19714
19715 * [HBASE-11658](https://issues.apache.org/jira/browse/HBASE-11658) | *Major* | **Piped commands to hbase shell should return non-zero if shell command failed.**
19716
19717 Adds a noninteractive mode (-n or --noninteractive) to the hbase shell that exits with a non-zero error code on failed or invalid shell command executions, and exits with a zero error code upon successful execution.
19718
19719
19720 ---
19721
19722 * [HBASE-11640](https://issues.apache.org/jira/browse/HBASE-11640) | *Major* | **Add syntax highlighting support to HBase Ref Guide programlistings**
19723
19724 This got committed, so I guess it is safe to resolve it?
19725
19726
19727 ---
19728
19729 * [HBASE-11606](https://issues.apache.org/jira/browse/HBASE-11606) | *Minor* | **Enable ZK-less region assignment by default**
19730
19731 By default, we don't use ZK for region assignment now. To fall back to the old way, you can set hbase.assignment.usezk to true.
19732
19733
19734 ---
19735
19736 * [HBASE-3135](https://issues.apache.org/jira/browse/HBASE-3135) | *Major* | **Make our MR jobs implement Tool and use ToolRunner so can do -D trickery, etc.**
19737
All MR jobs implement the Tool interface (http://hadoop.apache.org/docs/current/api/org/apache/hadoop/util/Tool.html), so you can now pass properties on the command line with the -D flag, etc.
19739
19740
19741 ---
19742
19743 * [HBASE-11556](https://issues.apache.org/jira/browse/HBASE-11556) | *Major* | **Move HTablePool to hbase-thrift module.**
19744
19745 HTablePool was deprecated in 0.98.1 but was still present and usable by apps built against versions before HBase 2.0.  It has been moved and is not intended to be used by user applications, and is now an internal part of the thrift2 proxy server only.
19746
19747
19748 ---
19749
19750 * [HBASE-11548](https://issues.apache.org/jira/browse/HBASE-11548) | *Trivial* | **[PE] Add 'cycling' test N times and unit tests for size/zipf/valueSize calculations**
19751
19752 Adds --cycles=N argument.
19753
19754
19755 ---
19756
19757 * [HBASE-11344](https://issues.apache.org/jira/browse/HBASE-11344) | *Major* | **Hide row keys and such from the web UIs**
19758
19759 Configure "hbase.display.keys" to false (default: true) in the master/regionservers if the row-keys should be hidden in the webUIs (like in the webUI for table details).
19760
19761
19762 ---
19763
19764 * [HBASE-6580](https://issues.apache.org/jira/browse/HBASE-6580) | *Major* | **Deprecate HTablePool in favor of HConnection.getTable(...)**
19765
This issue introduces a few new APIs.

HConnectionManager:

    public static HConnection createConnection(Configuration conf)
    public static HConnection createConnection(Configuration conf, ExecutorService pool)

HConnection:

    public HTableInterface getTable(String tableName) throws IOException
    public HTableInterface getTable(byte[] tableName) throws IOException
    public HTableInterface getTable(String tableName, ExecutorService pool) throws IOException
    public HTableInterface getTable(byte[] tableName, ExecutorService pool) throws IOException

By default, HConnectionImplementation creates an ExecutorService when needed. An ExecutorService can optionally be passed in.
HTableInterfaces are retrieved from the HConnection. By default the HConnection's ExecutorService is used, but that can optionally be overridden for each HTable.
19782
19783
19784 ---
19785
19786 * [HBASE-8450](https://issues.apache.org/jira/browse/HBASE-8450) | *Critical* | **Update hbase-default.xml and general recommendations to better suit current hw, h2, experience, etc.**
19787
19788 Changed defaults:
19789
19790 + max versions now 1 instead of 3
19791 + row blooms on by default (except on .META. table)
19792 + handlers 30 instead of 10
19793 + upped memstore lower limit from .35 to .38
+ zookeeper timeout default is 90 seconds instead of 180
19795 + client pause is 100ms instead of 1000ms
19796 + retries are now 20 instead of 10 (so overall we still wait same amount of time)
19797 + bulkload retries is 10 instead of infinite
19798 + major compactions are now once a week instead of once every 24 hours; they are staggered so all regionservers do not start compacting at the same time
19799 + blockingstorefiles is 10 instead of 7
19800 + block cache is 0.4 instead of 0.25
+ Previously, the default for hbase.rootdir was /tmp/hbase-${user.name}. Now it is ${java.io.tmpdir}/hbase-${user.name}, which is usually the same location but may not be (on macOS, it points to /var/tmp....).
19802
19803
19804 ---
19805
19806 * [HBASE-4072](https://issues.apache.org/jira/browse/HBASE-4072) | *Major* | **Deprecate/disable and remove support for reading ZooKeeper zoo.cfg files from the classpath**
19807
19808 The Apache ZooKeeper config file zoo.cfg will no longer be read when instantiating a HBaseConfiguration object, as it causes various inconsistency issues. Instead, users have to specify all HBase-relevant ZooKeeper properties in the hbase-site.xml using the various "hbase.zookeeper" prefixed properties. For example, specify "hbase.zookeeper.quorum" to provide a ZK quorum server list.
19809
19810 To enable zoo.cfg reading, for which support may be removed in a future release, set the property "hbase.config.read.zookeeper.config" to true in the hbase-site.xml at the client and servers like so:
19811
19812 \<property\>
19813   \<name\>hbase.config.read.zookeeper.config\</name\>
19814   \<value\>true\</value\>
19815   \<description\>
19816         Set to true to allow HBaseConfiguration to read the
19817         zoo.cfg file for ZooKeeper properties. Switching this to true
19818         is not recommended, since the functionality of reading ZK
19819         properties from a zoo.cfg file has been deprecated.
19820   \</description\>
19821 \</property\>
19822
19823
19824