RELEASENOTES.md

   1 # RELEASENOTES
   2
   3 <!---
   4 # Licensed to the Apache Software Foundation (ASF) under one
   5 # or more contributor license agreements.  See the NOTICE file
   6 # distributed with this work for additional information
   7 # regarding copyright ownership.  The ASF licenses this file
   8 # to you under the Apache License, Version 2.0 (the
   9 # "License"); you may not use this file except in compliance
  10 # with the License.  You may obtain a copy of the License at
  11 #
  12 #     http://www.apache.org/licenses/LICENSE-2.0
  13 #
  14 # Unless required by applicable law or agreed to in writing, software
  15 # distributed under the License is distributed on an "AS IS" BASIS,
  16 # WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
  17 # See the License for the specific language governing permissions and
  18 # limitations under the License.
  19
  20 # Be careful doing manual edits in this file. Do not change format
  21 # of release header or remove the below marker. This file is generated.
  22 # DO NOT REMOVE THIS MARKER; FOR INTERPOLATING CHANGES!-->
  23 # HBASE  2.4.5 Release Notes
  24
  25 These release notes cover new developer and user-facing incompatibilities, important issues, features, and major improvements.
  26
  27
  28 ---
  29
  30 * [HBASE-26088](https://issues.apache.org/jira/browse/HBASE-26088) | *Critical* | **conn.getBufferedMutator(tableName) leaks thread executors and other problems**
  31
  32 The API doc for Connection#getBufferedMutator(TableName) and Connection#getBufferedMutator(BufferedMutatorParams) mentioned that when user dont pass a ThreadPool to be used, we use the ThreadPool in the Connection.  But in reality, we were creating new ThreadPool in such cases.
  33
  34 We are keeping the behaviour of code as is but corrected the Javadoc and also a bug of not closing this new pool while Closing the BufferedMutator.
  35
  36
  37 ---
  38
  39 * [HBASE-25986](https://issues.apache.org/jira/browse/HBASE-25986) | *Minor* | **Expose the NORMALIZARION\_ENABLED table descriptor through a property in hbase-site**
  40
  41 New config: hbase.table.normalization.enabled
  42
  43 Default value: false
  44
  45 Description: This config is used to set default behaviour of normalizer at table level. To override this at table level one can set NORMALIZATION\_ENABLED at table descriptor level and that property will be honored. Of course, this property at table level can only work if normalizer is enabled at cluster level using "normalizer\_switch true" command.
  46
  47
  48 ---
  49
  50 * [HBASE-22923](https://issues.apache.org/jira/browse/HBASE-22923) | *Major* | **hbase:meta is assigned to localhost when we downgrade the hbase version**
  51
  52 Introduced new config: hbase.min.version.move.system.tables
  53
  54 When the operator uses this configuration option, any version between
  55 the current cluster version and the value of "hbase.min.version.move.system.tables"
  56 does not trigger any auto-region movement. Auto-region movement here
  57 refers to auto-migration of system table regions to newer server versions.
  58 It is assumed that the configured range of versions does not require special
  59 handling of moving system table regions to higher versioned RegionServer.
  60 This auto-migration is done by AssignmentManager#checkIfShouldMoveSystemRegionAsync().
  61 Example: Let's assume the cluster is on version 1.4.0 and we have
  62 set "hbase.min.version.move.system.tables" as "2.0.0". Now if we upgrade
  63 one RegionServer on 1.4.0 cluster to 1.6.0 (\< 2.0.0), then AssignmentManager will
  64 not move hbase:meta, hbase:namespace and other system table regions
  65 to newly brought up RegionServer 1.6.0 as part of auto-migration.
  66 However, if we upgrade one RegionServer on 1.4.0 cluster to 2.2.0 (\> 2.0.0),
  67 then AssignmentManager will move all system table regions to newly brought
  68 up RegionServer 2.2.0 as part of auto-migration done by
  69 AssignmentManager#checkIfShouldMoveSystemRegionAsync().
  70
  71 Overall, assuming we have system RSGroup where we keep HBase system tables, if we use
  72 config "hbase.min.version.move.system.tables" with value x.y.z then while upgrading cluster to
  73 version greater than or equal to x.y.z, the first RegionServer that we upgrade must
  74 belong to system RSGroup only.
  75
  76
  77 ---
  78
  79 * [HBASE-25902](https://issues.apache.org/jira/browse/HBASE-25902) | *Critical* | **Add missing CFs in meta during HBase 1 to 2.3+ Upgrade**
  80
  81 While upgrading cluster from 1.x to 2.3+ versions, after the active master is done setting it's status as 'Initialized', it attempts to add 'table' and 'repl\_barrier' CFs in meta. Once CFs are added successfully, master is aborted with PleaseRestartMasterException because master has missed certain initialization events (e.g ClusterSchemaService is not initialized and tableStateManager fails to migrate table states from ZK to meta due to missing CFs). Subsequent active master initialization is expected to be smooth.
  82 In the presence of multi masters, when one of them becomes active for the first time after upgrading to HBase 2.3+, it is aborted after fixing CFs in meta and one of the other backup masters will take over and become active soon. Hence, overall this is expected to be smooth upgrade if we have backup masters configured. If not, operator is expected to restart same master again manually.
  83
  84
  85 ---
  86
  87 * [HBASE-25877](https://issues.apache.org/jira/browse/HBASE-25877) | *Major* | **Add access  check for compactionSwitch**
  88
  89 Now calling RSRpcService.compactionSwitch, i.e, Admin.compactionSwitch at client side, requires ADMIN permission.
  90 This is an incompatible change but it is also a bug, as we should not allow any users to disable compaction on a regionserver, so we apply this to all active branches.
  91
  92
  93 ---
  94
  95 * [HBASE-25984](https://issues.apache.org/jira/browse/HBASE-25984) | *Critical* | **FSHLog WAL lockup with sync future reuse [RS deadlock]**
  96
  97 Fixes a WAL lockup issue due to premature reuse of the sync futures by the WAL consumers. The lockup causes the WAL system to hang resulting in blocked appends and syncs thus holding up the RPC handlers from progressing. Only workaround without this fix is to force abort the region server.
  98
  99
 100 ---
 101
 102 * [HBASE-25993](https://issues.apache.org/jira/browse/HBASE-25993) | *Major* | **Make excluded SSL cipher suites configurable for all Web UIs**
 103
 104 Add "ssl.server.exclude.cipher.list" configuration to excluded cipher suites for the http server started by the InfoServer.
 105
 106
 107 ---
 108
 109 * [HBASE-25969](https://issues.apache.org/jira/browse/HBASE-25969) | *Major* | **Cleanup netty-all transitive includes**
 110
 111 We have an (old) netty-all in our produced artifacts. It is transitively included from hadoop. It is needed by MiniMRCluster referenced from a few MR tests in hbase. This commit adds netty-all excludes everywhere else but where tests will fail unless the transitive is allowed through. TODO: move MR and/or MR tests out of hbase core.
 112
 113
 114
 115 # HBASE  2.4.4 Release Notes
 116
 117 These release notes cover new developer and user-facing incompatibilities, important issues, features, and major improvements.
 118
 119
 120 ---
 121
 122 * [HBASE-25963](https://issues.apache.org/jira/browse/HBASE-25963) | *Major* | **HBaseCluster should be marked as IA.Public**
 123
 124 Change HBaseCluster to IA.Public as its sub class MiniHBaseCluster is IA.Public.
 125
 126
 127
 128 # HBASE  2.4.3 Release Notes
 129
 130 These release notes cover new developer and user-facing incompatibilities, important issues, features, and major improvements.
 131
 132
 133 ---
 134
 135 * [HBASE-25766](https://issues.apache.org/jira/browse/HBASE-25766) | *Major* | **Introduce RegionSplitRestriction that restricts the pattern of the split point**
 136
 137 After HBASE-25766, we can specify a split restriction, "KeyPrefix" or "DelimitedKeyPrefix", to a table with the "hbase.regionserver.region.split\_restriction.type" property. The "KeyPrefix" split restriction groups rows by a prefix of the row-key. And the "DelimitedKeyPrefix" split restriction groups rows by a prefix of the row-key with a delimiter.
 138
 139 For example:
 140 \`\`\`
 141 # Create a table with a "KeyPrefix" split restriction, where the prefix length is 2 bytes
 142 hbase\> create 'tbl1', 'fam', {CONFIGURATION =\> {'hbase.regionserver.region.split\_restriction.type' =\> 'KeyPrefix', 'hbase.regionserver.region.split\_restriction.prefix\_length' =\> '2'}}
 143
 144 # Create a table with a "DelimitedKeyPrefix" split restriction, where the delimiter is a comma (,)
 145 hbase\> create 'tbl2', 'fam', {CONFIGURATION =\> {'hbase.regionserver.region.split\_restriction.type' =\> 'DelimitedKeyPrefix', 'hbase.regionserver.region.split\_restriction.delimiter' =\> ','}}
 146 \`\`\`
 147
 148 Instead of specifying a split restriction to a table directly, we can also set the properties in hbase-site.xml. In this case, the specified split restriction is applied for all the tables.
 149
 150 Note that the split restriction is also applied to a user-specified split point so that we don't allow users to break the restriction, which is different behavior from the existing KeyPrefixRegionSplitPolicy and DelimitedKeyPrefixRegionSplitPolicy.
 151
 152
 153 ---
 154
 155 * [HBASE-25775](https://issues.apache.org/jira/browse/HBASE-25775) | *Major* | **Use a special balancer to deal with maintenance mode**
 156
 157 Introduced a MaintenanceLoadBalancer to be used only under maintenance mode. Typically you should not use it as your balancer implementation.
 158
 159
 160 ---
 161
 162 * [HBASE-25767](https://issues.apache.org/jira/browse/HBASE-25767) | *Major* | **CandidateGenerator.getRandomIterationOrder is too slow on large cluster**
 163
 164 In the actual implementation classes of CandidateGenerator, now we just random select a start point and then iterate sequentially, instead of using the old way, where we will create a big array to hold all the integers in [0, num\_regions\_in\_cluster), shuffle the array, and then iterate on the array.
 165 The new implementation is 'random' enough as every time we just select one candidate. The problem for the old implementation is that, it will create an array every time when we want to get a candidate, if we have tens of thousands regions, we will create an array with tens of thousands length everytime, which causes big GC pressure and slow down the balancer execution.
 166
 167
 168 ---
 169
 170 * [HBASE-25734](https://issues.apache.org/jira/browse/HBASE-25734) | *Minor* | **Backport HBASE-24305 to branch-2.4**
 171
 172 The following method was added to ServerName
 173
 174 - #valueOf(Address, long)
 175
 176
 177 ---
 178
 179 * [HBASE-25199](https://issues.apache.org/jira/browse/HBASE-25199) | *Minor* | **Remove HStore#getStoreHomedir**
 180
 181 Moved the following methods from HStore to HRegionFileSystem
 182
 183 - #getStoreHomedir(Path, RegionInfo, byte[])
 184 - #getStoreHomedir(Path, String, byte[])
 185
 186
 187 ---
 188
 189 * [HBASE-25685](https://issues.apache.org/jira/browse/HBASE-25685) | *Major* | **asyncprofiler2.0 no longer supports svg; wants html**
 190
 191 If asyncprofiler 1.x, all is good. If asyncprofiler 2.x and it is hbase-2.3.x or hbase-2.4.x, add '?output=html' to get flamegraphs from the profiler.
 192
 193 Otherwise, if hbase-2.5+ and asyncprofiler2, all works. If asyncprofiler1 and hbase-2.5+, you may have to add '?output=svg' to the query.
 194
 195
 196 ---
 197
 198 * [HBASE-25518](https://issues.apache.org/jira/browse/HBASE-25518) | *Major* | **Support separate child regions to different region servers**
 199
 200 Config key for enable/disable automatically separate child regions to different region servers in the procedure of split regions. One child will be kept to the server where parent region is on, and the other child will be assigned to a random server.
 201
 202 hbase.master.auto.separate.child.regions.after.split.enabled
 203
 204 Default setting is false/off.
 205
 206
 207 ---
 208
 209 * [HBASE-25374](https://issues.apache.org/jira/browse/HBASE-25374) | *Minor* | **Make REST Client connection and socket time out configurable**
 210
 211 Configuration parameter to set rest client connection timeout
 212
 213 "hbase.rest.client.conn.timeout" Default is 2 \* 1000
 214
 215 "hbase.rest.client.socket.timeout" Default of 30 \* 1000
 216
 217
 218 ---
 219
 220 * [HBASE-25587](https://issues.apache.org/jira/browse/HBASE-25587) | *Major* | **[hbck2] Schedule SCP for all unknown servers**
 221
 222 Adds scheduleSCPsForUnknownServers to Hbck Service.
 223
 224
 225 ---
 226
 227 * [HBASE-25636](https://issues.apache.org/jira/browse/HBASE-25636) | *Minor* | **Expose HBCK report as metrics**
 228
 229 Expose HBCK repost results in metrics, includes: "orphanRegionsOnRS", "orphanRegionsOnFS", "inconsistentRegions", "holes", "overlaps", "unknownServerRegions" and "emptyRegionInfoRegions".
 230
 231
 232 ---
 233
 234 * [HBASE-24305](https://issues.apache.org/jira/browse/HBASE-24305) | *Minor* | **Handle deprecations in ServerName**
 235
 236 The following methods were removed or made private from ServerName (due to HBASE-17624):
 237
 238 - getHostNameMinusDomain(String): Was made private without a replacement.
 239 - parseHostname(String): Use #valueOf(String) instead.
 240 - parsePort(String): Use #valueOf(String) instead.
 241 - parseStartcode(String): Use #valueOf(String) instead.
 242 - getServerName(String, int, long): Was made private. Use #valueOf(String, int, long) instead.
 243 - getServerName(String, long): Use #valueOf(String, long) instead.
 244 - getHostAndPort(): Use #getAddress() instead.
 245 - getServerStartcodeFromServerName(String): Use instance of ServerName to pull out start code)
 246 - getServerNameLessStartCode(String): Use #getAddress() instead.
 247
 248
 249
 250 # HBASE  2.4.2 Release Notes
 251
 252 These release notes cover new developer and user-facing incompatibilities, important issues, features, and major improvements.
 253
 254
 255 ---
 256
 257 * [HBASE-25492](https://issues.apache.org/jira/browse/HBASE-25492) | *Major* | **Create table with rsgroup info in branch-2**
 258
 259 HBASE-25492 added a new interface in TableDescriptor which allows user to define RSGroup name while creating or modifying a table.
 260
 261
 262 ---
 263
 264 * [HBASE-25460](https://issues.apache.org/jira/browse/HBASE-25460) | *Major* | **Expose drainingServers as cluster metric**
 265
 266 Exposed new jmx metrics: "draininigRegionServers" and "numDrainingRegionServers" to provide "comma separated names for regionservers that are put in draining mode" and "num of such regionservers" respectively.
 267
 268
 269 ---
 270
 271 * [HBASE-25615](https://issues.apache.org/jira/browse/HBASE-25615) | *Major* | **Upgrade java version in pre commit docker file**
 272
 273 jdk8u232-b09 -\> jdk8u282-b08
 274 jdk-11.0.6\_10 -\> jdk-11.0.10\_9
 275
 276
 277 ---
 278
 279 * [HBASE-23887](https://issues.apache.org/jira/browse/HBASE-23887) | *Major* | **New L1 cache : AdaptiveLRU**
 280
 281 Introduced new L1 cache: AdaptiveLRU. This is supposed to provide better performance than default LRU cache.
 282 Set config key "hfile.block.cache.policy" to "AdaptiveLRU" in hbase-site in order to start using this new cache.
 283
 284
 285
 286 # HBASE  2.4.1 Release Notes
 287
 288 These release notes cover new developer and user-facing incompatibilities, important issues, features, and major improvements.
 289
 290
 291 ---
 292
 293 * [HBASE-25449](https://issues.apache.org/jira/browse/HBASE-25449) | *Major* | **'dfs.client.read.shortcircuit' should not be set in hbase-default.xml**
 294
 295 The presence of HDFS short-circuit read configuration properties in hbase-default.xml inadvertently causes short-circuit reads to not happen inside of RegionServers, despite short-circuit reads being enabled in hdfs-site.xml.
 296
 297
 298 ---
 299
 300 * [HBASE-25333](https://issues.apache.org/jira/browse/HBASE-25333) | *Major* | **Add maven enforcer rule to ban VisibleForTesting imports**
 301
 302 Ban the imports of guava VisiableForTesting, which means you should not use this annotation in HBase any more.
 303 For IA.Public and IA.LimitedPrivate classes, typically you should not expose any test related fields/methods there, and if you want to hide something, use IA.Private on the specific fields/methods.
 304 For IA.Private classes, if you want to expose something only for tests, use the RestrictedApi annotation from error prone, where it could cause a compilation error if someone break the rule in the future.
 305
 306
 307 ---
 308
 309 * [HBASE-25441](https://issues.apache.org/jira/browse/HBASE-25441) | *Critical* | **add security check for some APIs in RSRpcServices**
 310
 311 RsRpcServices APIs that can be accessed only through Admin rights:
 312 - stopServer
 313 - updateFavoredNodes
 314 - updateConfiguration
 315 - clearRegionBlockCache
 316 - clearSlowLogsResponses
 317
 318
 319 ---
 320
 321 * [HBASE-25432](https://issues.apache.org/jira/browse/HBASE-25432) | *Blocker* | **we should add security checks for setTableStateInMeta and fixMeta**
 322
 323 setTableStateInMeta and fixMeta can be accessed only through Admin rights
 324
 325
 326 ---
 327
 328 * [HBASE-25318](https://issues.apache.org/jira/browse/HBASE-25318) | *Minor* | **Configure where IntegrationTestImportTsv generates HFiles**
 329
 330 Added IntegrationTestImportTsv.generatedHFileFolder configuration property to override the default location in IntegrationTestImportTsv. Useful for running the integration test when HDFS Transparent Encryption is enabled.
 331
 332
 333 ---
 334
 335 * [HBASE-25456](https://issues.apache.org/jira/browse/HBASE-25456) | *Critical* | **setRegionStateInMeta need security check**
 336
 337 setRegionStateInMeta can be accessed only through Admin rights
 338
 339
 340
 341 # HBASE  2.4.0 Release Notes
 342
 343 These release notes cover new developer and user-facing incompatibilities, important issues, features, and major improvements.
 344
 345
 346 ---
 347
 348 * [HBASE-25127](https://issues.apache.org/jira/browse/HBASE-25127) | *Major* | **Enhance PerformanceEvaluation to profile meta replica performance.**
 349
 350 Three new commands are added to PE:
 351
 352 metaWrite, metaRandomRead and cleanMeta.
 353
 354 Usage example:
 355 hbase pe  --rows=100000 metaWrite  1
 356 hbase pe  --nomapreduce --rows=100000 metaRandomRead  32
 357 hbase pe  --rows=100000 cleanMeta 1
 358
 359 metaWrite and cleanMeta should be run with only 1 thread and the same number of rows so all the rows inserted will be cleaned up properly.
 360
 361 metaRandomRead can be run with multiple threads. The rows option should set to within the range of rows inserted by metaWrite
 362
 363
 364 ---
 365
 366 * [HBASE-25237](https://issues.apache.org/jira/browse/HBASE-25237) | *Major* | **'hbase master stop' shuts down the cluster, not the master only**
 367
 368 \`hbase master stop\` should shutdown only master by default.
 369 1. Help added to \`hbase master stop\`:
 370 To stop cluster, use \`stop-hbase.sh\` or \`hbase master stop --shutDownCluster\`
 371
 372 2. Help added to \`stop-hbase.sh\`:
 373 stop-hbase.sh can only be used for shutting down entire cluster. To shut down (HMaster\|HRegionServer) use hbase-daemon.sh stop (master\|regionserver)
 374
 375
 376 ---
 377
 378 * [HBASE-25242](https://issues.apache.org/jira/browse/HBASE-25242) | *Critical* | **Add Increment/Append support to RowMutations**
 379
 380 After HBASE-25242, we can add Increment/Append operations to RowMutations and perform those operations atomically in a single row.
 381 HBASE-25242 includes an API change where the mutateRow() API returns a Result object to get the result of the Increment/Append operations.
 382
 383
 384 ---
 385
 386 * [HBASE-25263](https://issues.apache.org/jira/browse/HBASE-25263) | *Major* | **Change encryption key generation algorithm used in the HBase shell**
 387
 388 Since the backward-compatible change we introduced in HBASE-25263,  we use the more secure PBKDF2WithHmacSHA384  key generation algorithm (instead of PBKDF2WithHmacSHA1) to generate a secret key for HFile / WalFile encryption, when the user is defining a string encryption key in the hbase shell.
 389
 390
 391 ---
 392
 393 * [HBASE-24268](https://issues.apache.org/jira/browse/HBASE-24268) | *Minor* | **REST and Thrift server do not handle the "doAs" parameter case insensitively**
 394
 395 This change allows the REST and Thrift servers to handle the "doAs" parameter case-insensitively, which is deemed as correct per the "specification" provided by the Hadoop community.
 396
 397
 398 ---
 399
 400 * [HBASE-25278](https://issues.apache.org/jira/browse/HBASE-25278) | *Minor* | **Add option to toggle CACHE\_BLOCKS in count.rb**
 401
 402 A new option, CACHE\_BLOCKS, was added to the \`count\` shell command which will force the data for a table to be loaded into the block cache. By default, the \`count\` command will not cache any blocks. This option can serve as a means to for a table's data to be loaded into block cache on demand. See the help message on the count shell command for usage details.
 403
 404
 405 ---
 406
 407 * [HBASE-18070](https://issues.apache.org/jira/browse/HBASE-18070) | *Critical* | **Enable memstore replication for meta replica**
 408
 409 "Async WAL Replication" [1] was added by HBASE-11183 "Timeline Consistent region replicas - Phase 2 design" but only for user-space tables. This feature adds "Async WAL Replication" for the hbase:meta table.  It also adds a client 'LoadBalance' mode that has reads go to replicas first and to the primary only on fail so as to shed read load from the primary to alleviate \*hotspotting\* on the hbase:meta Region.
 410
 411 Configuration is as it was for the user-space 'Async WAL Replication'. See [2] and [3] for details on how to enable.
 412
 413 1. http://hbase.apache.org/book.html#async.wal.replication
 414 2. http://hbase.apache.org/book.html#async.wal.replication.meta
 415 3. http://hbase.apache.org/book.html#\_async\_wal\_replication\_for\_meta\_table\_as\_of\_hbase\_2\_4\_0
 416
 417
 418 ---
 419
 420 * [HBASE-25126](https://issues.apache.org/jira/browse/HBASE-25126) | *Major* | **Add load balance logic in hbase-client to distribute read load over meta replica regions.**
 421
 422 See parent issue, HBASE-18070, release notes for how to enable.
 423
 424
 425 ---
 426
 427 * [HBASE-25026](https://issues.apache.org/jira/browse/HBASE-25026) | *Minor* | **Create a metric to track full region scans RPCs**
 428
 429 Adds a new metric where we collect the number of full region scan requests at the RPC layer. This will be collected under "name" : "Hadoop:service=HBase,name=RegionServer,sub=Server"
 430
 431
 432 ---
 433
 434 * [HBASE-25253](https://issues.apache.org/jira/browse/HBASE-25253) | *Major* | **Deprecated master carrys regions related methods and configs**
 435
 436 Since 2.4.0, deprecated all master carrys regions related methods(LoadBalancer,BaseLoadBalancer,ZNodeClearer) and configs(hbase.balancer.tablesOnMaster, hbase.balancer.tablesOnMaster.systemTablesOnly), they will be removed in 3.0.0.
 437
 438
 439 ---
 440
 441 * [HBASE-20598](https://issues.apache.org/jira/browse/HBASE-20598) | *Major* | **Upgrade to JRuby 9.2**
 442
 443 <!-- markdown -->
 444 The HBase shell now relies on JRuby 9.2. This is a new major version change for JRuby. The most significant change is Ruby compatibility changed from Ruby 2.3 to Ruby 2.5. For more detailed changes please see [the JRuby release announcement for the start of the 9.2 series](https://www.jruby.org/2018/05/24/jruby-9-2-0-0.html) as well as the [general release announcement page for updates since that version](https://www.jruby.org/news).
 445
 446 The runtime dependency versions present on the server side classpath for the Joni (now 2.1.31) and JCodings (now 1.0.55) libraries have also been updated to match those found in the JRuby version shipped with HBase. These version changes are maintenance releases and should be backwards compatible when updated in tandem.
 447
 448
 449 ---
 450
 451 * [HBASE-25181](https://issues.apache.org/jira/browse/HBASE-25181) | *Major* | **Add options for disabling column family encryption and choosing hash algorithm for wrapped encryption keys.**
 452
 453 <!-- markdown -->
 454 This change adds options for disabling column family encryption and choosing hash algorithm for wrapped encryption keys. Changes are done such that defaults will keep the same behavior prior to this issue.
 455
 456 Prior to this change HBase always used the MD5 hash algorithm to store a hash for encryption keys. This hash is needed to verify the secret key of the subject. (e.g. making sure that the same secrey key is used during encrypted HFile read and write). The MD5 algorithm is considered weak, and can not be used in some (e.g. FIPS compliant) clusters. Having a configurable hash enables us to use newer and more secure hash algorithms like SHA-384 or SHA-512 (which are FIPS compliant).
 457
 458 The hash is set via the configuration option `hbase.crypto.key.hash.algorithm`. It should be set to a JDK `MessageDigest` algorithm like "MD5", "SHA-256" or "SHA-384". The default is "MD5" for backward compatibility.
 459
 460 Alternatively, clusters which rely on an encryption at rest mechanism outside of HBase (e.g. those offered by HDFS) and wish to ensure HBase's encryption at rest system is inactive can set `hbase.crypto.enabled` to `false`.
 461
 462
 463 ---
 464
 465 * [HBASE-25238](https://issues.apache.org/jira/browse/HBASE-25238) | *Critical* | **Upgrading HBase from 2.2.0 to 2.3.x fails because of “Message missing required fields: state”**
 466
 467 Fixes master procedure store migration issues going from 2.0.x to 2.2.x and/or 2.3.x. Also fixes failed heartbeat parse during rolling upgrade from 2.0.x. to 2.3.x.
 468
 469
 470 ---
 471
 472 * [HBASE-25234](https://issues.apache.org/jira/browse/HBASE-25234) | *Major* | **[Upgrade]Incompatibility in reading RS report from 2.1 RS when Master is upgraded to a version containing HBASE-21406**
 473
 474 Fixes so auto-migration of master procedure store works again going from 2.0.x =\> 2.2+. Also make it so heartbeats work when rolling upgrading from 2.0.x =\> 2.3+.
 475
 476
 477 ---
 478
 479 * [HBASE-25212](https://issues.apache.org/jira/browse/HBASE-25212) | *Major* | **Optionally abort requests in progress after deciding a region should close**
 480
 481 If hbase.regionserver.close.wait.abort is set to true, interrupt RPC handler threads holding the region close lock.
 482
 483 Until requests in progress can be aborted, wait on the region close lock for a configurable interval (specified by hbase.regionserver.close.wait.time.ms, default 60000 (1 minute)). If we have failed to acquire the close lock after this interval elapses, if allowed (also specified by hbase.regionserver.close.wait.abort), abort the regionserver.
 484
 485 We will attempt to interrupt any running handlers every hbase.regionserver.close.wait.interval.ms (default 10000 (10 seconds)) until either the close lock is acquired or we reach the maximum wait time.
 486
 487
 488 ---
 489
 490 * [HBASE-25167](https://issues.apache.org/jira/browse/HBASE-25167) | *Major* | **Normalizer support for hot config reloading**
 491
 492 <!-- markdown -->
 493 This patch adds [dynamic configuration](https://hbase.apache.org/book.html#dyn_config) support for the following configuration keys related to the normalizer:
 494 * hbase.normalizer.throughput.max_bytes_per_sec
 495 * hbase.normalizer.split.enabled
 496 * hbase.normalizer.merge.enabled
 497 * hbase.normalizer.min.region.count
 498 * hbase.normalizer.merge.min_region_age.days
 499 * hbase.normalizer.merge.min_region_size.mb
 500
 501
 502 ---
 503
 504 * [HBASE-25224](https://issues.apache.org/jira/browse/HBASE-25224) | *Major* | **Maximize sleep for checking meta and namespace regions availability**
 505
 506 Changed the max sleep time during meta and namespace regions availability check to be 60 sec. Previously there was no such cap
 507
 508
 509 ---
 510
 511 * [HBASE-24628](https://issues.apache.org/jira/browse/HBASE-24628) | *Major* | **Region normalizer now respects a rate limit**
 512
 513 <!-- markdown -->
 514 Introduces a new configuration, `hbase.normalizer.throughput.max_bytes_per_sec`, for specifying a limit on the throughput of actions executed by the normalizer. Note that while this configuration value is in bytes, the minimum honored valued is `1,000,000`, or `1m`. Supports values configured using the human-readable suffixes honored by [`Configuration.getLongBytes`](https://hadoop.apache.org/docs/current/api/org/apache/hadoop/conf/Configuration.html#getLongBytes-java.lang.String-long-)
 515
 516
 517 ---
 518
 519 * [HBASE-14067](https://issues.apache.org/jira/browse/HBASE-14067) | *Major* | **bundle ruby files for hbase shell into a jar.**
 520
 521 <!-- markdown -->
 522 The `hbase-shell` artifact now contains the ruby files that implement the hbase shell. There should be no downstream impact for users of the shell that rely on the `hbase shell` command.
 523
 524 Folks that wish to include the HBase ruby classes defined for the shell in their own JRuby scripts should add the `hbase-shell.jar` file to their classpath rather than add `${HBASE_HOME}/lib/ruby` to their load paths.
 525
 526
 527 ---
 528
 529 * [HBASE-24875](https://issues.apache.org/jira/browse/HBASE-24875) | *Major* | **Remove the force param for unassign since it dose not take effect any more**
 530
 531 <!-- markdown -->
 532 The "force" flag to various unassign commands (java api, shell, etc) has been ignored since HBase 2. As of this change the methods that take it are now deprecated. Downstream users should stop passing/using this flag.
 533
 534 The Admin and AsyncAdmin Java APIs will have the deprecated version of the unassign method with a force flag removed in HBase 4. Callers can safely continue to use the deprecated API until then; the internal implementation just calls the new method.
 535
 536 The MasterObserver coprocessor API deprecates the `preUnassign` and `postUnassign` methods that include the force parameter and replaces them with versions that omit this parameter. The deprecated methods will be removed from the API in HBase 3. Until then downstream coprocessor implementations can safely continue to *just* implement the deprecated method if they wish; the replacement methods provide a default implementation that calls the deprecated method with force set to `false`.
 537
 538
 539 ---
 540
 541 * [HBASE-25099](https://issues.apache.org/jira/browse/HBASE-25099) | *Major* | **Change meta replica count by altering meta table descriptor**
 542
 543 Now you can change the region replication config for meta table by altering meta table.
 544 The old "hbase.meta.replica.count" is deprecated and will be removed in 4.0.0. But if it is set, we will still honor it, which means, when master restart, if we find out that the value of 'hbase.meta.replica.count' is different with the region replication config of meta table, we will schedule an alter table operation to change the region replication config to the value you configured for 'hbase.meta.replica.count'.
 545
 546
 547 ---
 548
 549 * [HBASE-23834](https://issues.apache.org/jira/browse/HBASE-23834) | *Major* | **HBase fails to run on Hadoop 3.3.0/3.2.2/3.1.4 due to jetty version mismatch**
 550
 551 Use shaded json and jersey in HBase.
 552 Ban the imports of unshaded json and jersey in code.
 553
 554
 555 ---
 556
 557 * [HBASE-25163](https://issues.apache.org/jira/browse/HBASE-25163) | *Major* | **Increase the timeout value for nightly jobs**
 558
 559 Increase timeout value for nightly jobs to 16 hours since the new build machines are dedicated to hbase project, so we are allowed to use it all the time.
 560
 561
 562 ---
 563
 564 * [HBASE-22976](https://issues.apache.org/jira/browse/HBASE-22976) | *Major* | **[HBCK2] Add RecoveredEditsPlayer**
 565
 566 WALPlayer can replay the content of recovered.edits directories.
 567
 568 Side-effect is that WAL filename timestamp is now factored when setting start/end times for WALInputFormat; i.e. wal.start.time and wal.end.time values on a job context. Previous we looked at wal.end.time only. Now we consider wal.start.time too. If a file has a name outside of wal.start.time\<-\>wal.end.time, it'll be by-passed. This change-in-behavior will make it easier on operator crafting timestamp filters processing WALs.
 569
 570
 571 ---
 572
 573 * [HBASE-25165](https://issues.apache.org/jira/browse/HBASE-25165) | *Minor* | **Change 'State time' in UI so sorts**
 574
 575 Start time on the Master UI is now displayed using ISO8601 format instead of java Date#toString().
 576
 577
 578 ---
 579
 580 * [HBASE-25124](https://issues.apache.org/jira/browse/HBASE-25124) | *Major* | **Support changing region replica count without disabling table**
 581
 582 Now you do not need to disable a table before changing its 'region replication' property.
 583 If you are decreasing the replica count, the excess region replicas will be closed before reopening other replicas.
 584 If you are increasing the replica count, the new region replicas will be opened after reopening the existing replicas.
 585
 586
 587 ---
 588
 589 * [HBASE-25154](https://issues.apache.org/jira/browse/HBASE-25154) | *Major* | **Set java.io.tmpdir to project build directory to avoid writing std\*deferred files to /tmp**
 590
 591 Change the java.io.tmpdir to project.build.directory in surefire-maven-plugin, to avoid writing std\*deferred files to /tmp which may blow up the /tmp disk on our jenkins build node.
 592
 593
 594 ---
 595
 596 * [HBASE-25055](https://issues.apache.org/jira/browse/HBASE-25055) | *Major* | **Add ReplicationSource for meta WALs; add enable/disable when hbase:meta assigned to RS**
 597
 598 Set hbase.region.replica.replication.catalog.enabled to enable async WAL Replication for hbase:meta region replicas. Its off by default.
 599
 600 Defaults to the RegionReadReplicaEndpoint.class shipping edits -- set hbase.region.replica.catalog.replication to target a different endpoint implementation.
 601
 602
 603 ---
 604
 605 * [HBASE-25109](https://issues.apache.org/jira/browse/HBASE-25109) | *Major* | **Add MR Counters to WALPlayer; currently hard to tell if it is doing anything**
 606
 607 Adds a WALPlayer to MR Counter output:
 608
 609         org.apache.hadoop.hbase.mapreduce.WALPlayer$Counter
 610                 CELLS\_READ=89574
 611                 CELLS\_WRITTEN=89572
 612                 DELETES=64
 613                 PUTS=5305
 614                 WALEDITS=4375
 615
 616
 617 ---
 618
 619 * [HBASE-24896](https://issues.apache.org/jira/browse/HBASE-24896) | *Major* | **'Stuck' in static initialization creating RegionInfo instance**
 620
 621 1. Untangle RegionInfo, RegionInfoBuilder, and MutableRegionInfo static
 622 initializations.
 623 2. Undo static initializing references from RegionInfo to RegionInfoBuilder.
 624 3. Mark RegionInfo#UNDEFINED IA.Private and deprecated;
 625 it is for internal use only and likely to be removed in HBase4. (sub-task HBASE-24918)
 626 4. Move MutableRegionInfo from inner-class of
 627 RegionInfoBuilder to be (package private) standalone. (sub-task HBASE-24918)
 628
 629
 630 ---
 631
 632 * [HBASE-24956](https://issues.apache.org/jira/browse/HBASE-24956) | *Major* | **ConnectionManager#locateRegionInMeta waits for user region lock indefinitely.**
 633
 634 <!-- markdown -->
 635
 636 Without this fix there are situations in which locateRegionInMeta() on a client is not bound by a timeout. This happens because of a global lock whose acquisition was not under any lock scope. This affects client facing API calls that rely on this method to locate a table region in meta. This fix brings the lock acquisition under the scope of "hbase.client.meta.operation.timeout" and that guarantees a bounded wait time.
 637
 638
 639 ---
 640
 641 * [HBASE-24764](https://issues.apache.org/jira/browse/HBASE-24764) | *Minor* | **Add support of adding base peer configs via hbase-site.xml for all replication peers.**
 642
 643 <!-- markdown -->
 644
 645 Adds a new configuration parameter "hbase.replication.peer.base.config" which accepts a semi-colon separated key=CSV pairs (example: k1=v1;k2=v2_1,v3...). When this configuration is set on the server side, these kv pairs are added to every peer configuration if not already set. Peer specific configuration overrides have precedence over the above default configuration. This is useful in cases when some configuration has to be set for all the peers by default and one does not want to add to every peer definition.
 646
 647
 648 ---
 649
 650 * [HBASE-24994](https://issues.apache.org/jira/browse/HBASE-24994) | *Minor* | **Add hedgedReadOpsInCurThread metric**
 651
 652 Expose Hadoop hedgedReadOpsInCurThread metric to HBase.
 653 This metric counts the number of times the hedged reads service executor rejected a read task, falling back to the current thread.
 654 This will help determine the proper size of the thread pool (dfs.client.hedged.read.threadpool.size).
 655
 656
 657 ---
 658
 659 * [HBASE-24776](https://issues.apache.org/jira/browse/HBASE-24776) | *Major* | **[hbtop] Support Batch mode**
 660
 661 HBASE-24776 added the following command line parameters to hbtop:
 662 \| Argument \| Description \|
 663 \|---\|---\|
 664 \| -n,--numberOfIterations \<arg\> \| The number of iterations \|
 665 \| -O,--outputFieldNames \| Print each of the available field names on a separate line, then quit \|
 666 \| -f,--fields \<arg\> \| Show only the given fields. Specify comma separated fields to show multiple fields \|
 667 \| -s,--sortField \<arg\> \| The initial sort field. You can prepend a \`+' or \`-' to the field name to also override the sort direction. A leading \`+' will force sorting high to low, whereas a \`-' will ensure a low to high ordering \|
 668 \| -i,--filters \<arg\> \| The initial filters. Specify comma separated filters to set multiple filters \|
 669 \| -b,--batchMode \| Starts hbtop in Batch mode, which could be useful for sending output from hbtop to other programs or to a file. In this mode, hbtop will not accept input and runs until the iterations limit you've set with the \`-n' command-line option or until killed \|
 670
 671
 672 ---
 673
 674 * [HBASE-24602](https://issues.apache.org/jira/browse/HBASE-24602) | *Major* | **Add Increment and Append support to CheckAndMutate**
 675
 676 Summary of the change of HBASE-24602:
 677 - Add \`build(Increment)\` and \`build(Append)\` methods to the \`Builder\` class of the \`CheckAndMutate\` class. After this change, we can perform checkAndIncrement/Append operations as follows:
 678 \`\`\`
 679 // Build a CheckAndMutate object with a Increment object
 680 CheckAndMutate checkAndMutate = CheckAndMutate.newBuilder(row)
 681   .ifEquals(family, qualifier, value)
 682   .build(increment);
 683
 684 // Perform a CheckAndIncrement operation
 685 CheckAndMutateResult checkAndMutateResult = table.checkAndMutate(checkAndMutate);
 686
 687 // Get whether or not the CheckAndIncrement operation is successful
 688 boolean success = checkAndMutateResult.isSuccess();
 689
 690 // Get the result of the increment operation
 691 Result result = checkAndMutateResult.getResult();
 692 \`\`\`
 693 - After this change, \`HRegion.batchMutate()\` is used for increment/append operations.
 694 - As the side effect of the above change, the following coprocessor methods of RegionObserver are called when increment/append operations are performed:
 695   - preBatchMutate()
 696   - postBatchMutate()
 697   - postBatchMutateIndispensably()
 698
 699
 700 ---
 701
 702 * [HBASE-24694](https://issues.apache.org/jira/browse/HBASE-24694) | *Major* | **Support flush a single column family of table**
 703
 704 Adds option for the flush command to flush all stores from the specified column family only, among all regions of the given table (stores from other column families on this table would not get flushed).
 705
 706
 707 ---
 708
 709 * [HBASE-24625](https://issues.apache.org/jira/browse/HBASE-24625) | *Critical* | **AsyncFSWAL.getLogFileSizeIfBeingWritten does not return the expected synced file length.**
 710
 711 We add a method getSyncedLength in  WALProvider.WriterBase interface for  WALFileLengthProvider used for replication, considering the case if we use  AsyncFSWAL,we write to 3 DNs concurrently,according to the visibility guarantee of HDFS, the data will be available immediately
 712 when arriving at DN since all the DNs will be considered as the last one in pipeline.This means replication may read uncommitted data and replicate it to the remote cluster and cause data inconsistency.The method WriterBase#getLength may return length which just in hdfs client buffer and not successfully synced to HDFS, so we use this method WriterBase#getSyncedLength to return the length successfully synced to HDFS and replication thread could only read writing WAL file limited by this length.
 713 see also HBASE-14004 and this document for more details:
 714 https://docs.google.com/document/d/11AyWtGhItQs6vsLRIx32PwTxmBY3libXwGXI25obVEY/edit#
 715
 716 Before this patch, replication may read uncommitted data and replicate it to the slave cluster and cause data inconsistency between master and slave cluster, we could use FSHLog instead of AsyncFSWAL  to reduce probability of inconsistency without this patch applied.
 717
 718
 719 ---
 720
 721 * [HBASE-24779](https://issues.apache.org/jira/browse/HBASE-24779) | *Minor* | **Improve insight into replication WAL readers hung on checkQuota**
 722
 723 New metrics are exposed, on the global source, for replication which indicate the "WAL entry buffer" that was introduced in HBASE-15995. When this usage reaches the limit, that RegionServer will cease to read more data for the sake of trying to replicate it. This usage (and limit) is local to each RegionServer is shared across all peers being handled by that RegionServer.
 724
 725
 726 ---
 727
 728 * [HBASE-24404](https://issues.apache.org/jira/browse/HBASE-24404) | *Major* | **Support flush a single column family of region**
 729
 730 This adds an extra "flush" command option that allows for specifying an individual family to have its store flushed.
 731
 732 Usage:
 733 flush 'REGIONNAME','FAMILYNAME'
 734 flush 'ENCODED\_REGIONNAME','FAMILYNAME'
 735
 736
 737 ---
 738
 739 * [HBASE-24805](https://issues.apache.org/jira/browse/HBASE-24805) | *Major* | **HBaseTestingUtility.getConnection should be threadsafe**
 740
 741 <!-- markdown -->
 742 Users of `HBaseTestingUtility` can now safely call the `getConnection` method from multiple threads.
 743
 744 As a consequence of refactoring to improve the thread safety of the HBase testing classes, the protected `conf` member of the  `HBaseCommonTestingUtility` class has been marked final. Downstream users who extend from the class hierarchy rooted at this class will need to pass the Configuration instance they want used to their super constructor rather than overwriting the instance variable.
 745
 746
 747 ---
 748
 749 * [HBASE-24767](https://issues.apache.org/jira/browse/HBASE-24767) | *Major* | **Change default to false for HBASE-15519 per-user metrics**
 750
 751 Disables per-user metrics. They were enabled by default for the first time in hbase-2.3.0 but they need some work before they can be on all the time (See HBASE-15519)
 752
 753
 754 ---
 755
 756 * [HBASE-24704](https://issues.apache.org/jira/browse/HBASE-24704) | *Major* | **Make the Table Schema easier to view even there are multiple families**
 757
 758 Improve the layout of column family from vertical to horizontal in table UI.
 759
 760
 761 ---
 762
 763 * [HBASE-11686](https://issues.apache.org/jira/browse/HBASE-11686) | *Minor* | **Shell code should create a binding / irb workspace instead of polluting the root namespace**
 764
 765 In shell, all HBase constants and commands have been moved out of the top-level and into an IRB Workspace. Piped stdin and scripts passed by name to the shell will be evaluated within this workspace. If you absolutely need the top-level definitions, use the new compatibility flag, ie. hbase shell --top-level-defs or hbase shell --top-level-defs script2run.rb.
 766
 767
 768 ---
 769
 770 * [HBASE-24632](https://issues.apache.org/jira/browse/HBASE-24632) | *Major* | **Enable procedure-based log splitting as default in hbase3**
 771
 772 Enables procedure-based distributed WAL splitting as default (HBASE-20610). To use 'classic' zk-coordinated splitting instead, set 'hbase.split.wal.zk.coordinated' to 'true'.
 773
 774
 775 ---
 776
 777 * [HBASE-24698](https://issues.apache.org/jira/browse/HBASE-24698) | *Major* | **Turn OFF Canary WebUI as default**
 778
 779 Flips default for 'HBASE-23994 Add WebUI to Canary' The UI defaulted to on at port 16050. This JIRA changes it so new UI is off by default.
 780
 781 To enable the UI, set property 'hbase.canary.info.port' to the port you want the UI to use.
 782
 783
 784 ---
 785
 786 * [HBASE-24650](https://issues.apache.org/jira/browse/HBASE-24650) | *Major* | **Change the return types of the new checkAndMutate methods introduced in HBASE-8458**
 787
 788 HBASE-24650 introduced CheckAndMutateResult class and changed the return type of checkAndMutate methods to this class in order to support CheckAndMutate with Increment/Append. CheckAndMutateResult class has two fields, one is \*success\* that indicates whether the operation is successful or not, and the other one is \*result\* that's the result of the operation and is used for  CheckAndMutate with Increment/Append.
 789
 790 The new APIs for the Table interface:
 791 \`\`\`
 792 /\*\*
 793  \* checkAndMutate that atomically checks if a row matches the specified condition. If it does,
 794  \* it performs the specified action.
 795  \*
 796  \* @param checkAndMutate The CheckAndMutate object.
 797  \* @return A CheckAndMutateResult object that represents the result for the CheckAndMutate.
 798  \* @throws IOException if a remote or network exception occurs.
 799  \*/
 800 default CheckAndMutateResult checkAndMutate(CheckAndMutate checkAndMutate) throws IOException {
 801   return checkAndMutate(Collections.singletonList(checkAndMutate)).get(0);
 802 }
 803
 804 /\*\*
 805  \* Batch version of checkAndMutate. The specified CheckAndMutates are batched only in the sense
 806  \* that they are sent to a RS in one RPC, but each CheckAndMutate operation is still executed
 807  \* atomically (and thus, each may fail independently of others).
 808  \*
 809  \* @param checkAndMutates The list of CheckAndMutate.
 810  \* @return A list of CheckAndMutateResult objects that represents the result for each
 811  \*   CheckAndMutate.
 812  \* @throws IOException if a remote or network exception occurs.
 813  \*/
 814 default List\<CheckAndMutateResult\> checkAndMutate(List\<CheckAndMutate\> checkAndMutates)
 815   throws IOException {
 816   throw new NotImplementedException("Add an implementation!");
 817 }
 818 {code}
 819
 820 The new APIs for the AsyncTable interface:
 821 {code}
 822 /\*\*
 823  \* checkAndMutate that atomically checks if a row matches the specified condition. If it does,
 824  \* it performs the specified action.
 825  \*
 826  \* @param checkAndMutate The CheckAndMutate object.
 827  \* @return A {@link CompletableFuture}s that represent the result for the CheckAndMutate.
 828  \*/
 829 CompletableFuture\<CheckAndMutateResult\> checkAndMutate(CheckAndMutate checkAndMutate);
 830
 831 /\*\*
 832  \* Batch version of checkAndMutate. The specified CheckAndMutates are batched only in the sense
 833  \* that they are sent to a RS in one RPC, but each CheckAndMutate operation is still executed
 834  \* atomically (and thus, each may fail independently of others).
 835  \*
 836  \* @param checkAndMutates The list of CheckAndMutate.
 837  \* @return A list of {@link CompletableFuture}s that represent the result for each
 838  \*   CheckAndMutate.
 839  \*/
 840 List\<CompletableFuture\<CheckAndMutateResult\>\> checkAndMutate(
 841   List\<CheckAndMutate\> checkAndMutates);
 842
 843 /\*\*
 844  \* A simple version of batch checkAndMutate. It will fail if there are any failures.
 845  \*
 846  \* @param checkAndMutates The list of rows to apply.
 847  \* @return A {@link CompletableFuture} that wrapper the result list.
 848  \*/
 849 default CompletableFuture\<List\<CheckAndMutateResult\>\> checkAndMutateAll(
 850   List\<CheckAndMutate\> checkAndMutates) {
 851   return allOf(checkAndMutate(checkAndMutates));
 852 }
 853 \`\`\`
 854
 855
 856 ---
 857
 858 * [HBASE-24671](https://issues.apache.org/jira/browse/HBASE-24671) | *Major* | **Add excludefile and designatedfile options to graceful\_stop.sh**
 859
 860 Add excludefile and designatedfile options to graceful\_stop.sh.
 861
 862 Designated file with \<hostname:port\> per line as unload targets.
 863
 864 Exclude file should have \<hostname:port\> per line. We do not unload regions to hostnames given in exclude file.
 865
 866 Here is a simple example using graceful\_stop.sh with designatedfile option:
 867 ./bin/graceful\_stop.sh --maxthreads 4 --designatedfile /path/designatedfile hostname
 868 The usage of the excludefile option is the same as the above.
 869
 870
 871 ---
 872
 873 * [HBASE-24560](https://issues.apache.org/jira/browse/HBASE-24560) | *Major* | **Add a new option of designatedfile in RegionMover**
 874
 875 Add a new option "designatedfile" in RegionMover.
 876
 877 If designated file is present with some contents, we will unload regions to hostnames provided in designated file.
 878
 879 Designated file should have 'host:port' per line.
 880
 881
 882 ---
 883
 884 * [HBASE-24289](https://issues.apache.org/jira/browse/HBASE-24289) | *Major* | **Heterogeneous Storage for Date Tiered Compaction**
 885
 886 Enhance DateTieredCompaction to support HDFS storage policy within one class family.
 887 # First you need enable DTCP.
 888 To turn on Date Tiered Compaction (It is not recommended to turn on for the whole cluster because that will put meta table on it too and random get on meta table will be impacted):
 889 hbase.hstore.compaction.compaction.policy=org.apache.hadoop.hbase.regionserver.compactions.DateTieredCompactionPolicy
 890 ## Parameters for Date Tiered Compaction:
 891 hbase.hstore.compaction.date.tiered.max.storefile.age.millis: Files with max-timestamp smaller than this will no longer be compacted.Default at Long.MAX\_VALUE.
 892 hbase.hstore.compaction.date.tiered.base.window.millis: base window size in milliseconds. Default at 6 hours.
 893 hbase.hstore.compaction.date.tiered.windows.per.tier: number of windows per tier. Default at 4.
 894 hbase.hstore.compaction.date.tiered.incoming.window.min: minimal number of files to compact in the incoming window. Set it to expected number of files in the window to avoid wasteful compaction. Default at 6.
 895
 896 # Then enable HDTCP(Heterogeneous Date Tiered Compaction) as follow example configurations:
 897 hbase.hstore.compaction.date.tiered.storage.policy.enable=true
 898 hbase.hstore.compaction.date.tiered.hot.window.age.millis=3600000
 899 hbase.hstore.compaction.date.tiered.hot.window.storage.policy=ALL\_SSD
 900 hbase.hstore.compaction.date.tiered.warm.window.age.millis=20600000
 901 hbase.hstore.compaction.date.tiered.warm.window.storage.policy=ONE\_SSD
 902 hbase.hstore.compaction.date.tiered.cold.window.storage.policy=HOT
 903 ## It is better to enable WAL and flushing HFile storage policy with HDTCP. You can tune follow settings as well:
 904 hbase.wal.storage.policy=ALL\_SSD
 905 create 'table',{NAME=\>'f1',CONFIGURATION=\>{'hbase.hstore.block.storage.policy'=\>'ALL\_SSD'}}
 906
 907 # Disable HDTCP as follow:
 908 hbase.hstore.compaction.date.tiered.storage.policy.enable=false
 909
 910
 911 ---
 912
 913 * [HBASE-24648](https://issues.apache.org/jira/browse/HBASE-24648) | *Major* | **Remove the legacy 'forceSplit' related code at region server side**
 914
 915 Add a canSplit method to RegionSplitPolicy to determine whether we can split a region. Usually it is not related to RegionSplitPolicy so in the default implementation, it will test whether region is available and does not have reference file, but in DisabledRegionSplitPolicy, we will always return false.
 916
 917
 918 ---
 919
 920 * [HBASE-24382](https://issues.apache.org/jira/browse/HBASE-24382) | *Major* | **Flush partial stores of region filtered by seqId when archive wal due to too many wals**
 921
 922 Change the flush level from region to store when there are too many wals, benefit from this we can reduce unnessary flush tasks and small hfiles.
 923
 924
 925 ---
 926
 927 * [HBASE-24038](https://issues.apache.org/jira/browse/HBASE-24038) | *Major* | **Add a metric to show the locality of ssd in table.jsp**
 928
 929 Add a metric to show the locality of ssd in table.jsp, and move the locality related metrics to a new tab named localities.
 930
 931
 932 ---
 933
 934 * [HBASE-8458](https://issues.apache.org/jira/browse/HBASE-8458) | *Major* | **Support for batch version of checkAndMutate()**
 935
 936 HBASE-8458 introduced CheckAndMutate class that's used to perform CheckAndMutate operations. Use the builder class to instantiate a CheckAndMutate object. This builder class is fluent style APIs, the code are like:
 937 \`\`\`
 938 // A CheckAndMutate operation where do the specified action if the column (specified by the
 939 family and the qualifier) of the row equals to the specified value
 940 CheckAndMutate checkAndMutate = CheckAndMutate.newBuilder(row)
 941   .ifEquals(family, qualifier, value)
 942   .build(put);
 943
 944 // A CheckAndMutate operation where do the specified action if the column (specified by the
 945 // family and the qualifier) of the row doesn't exist
 946 CheckAndMutate checkAndMutate = CheckAndMutate.newBuilder(row)
 947   .ifNotExists(family, qualifier)
 948   .build(put);
 949
 950 // A CheckAndMutate operation where do the specified action if the row matches the filter
 951 CheckAndMutate checkAndMutate = CheckAndMutate.newBuilder(row)
 952   .ifMatches(filter)
 953   .build(delete);
 954 \`\`\`
 955
 956 And This added new checkAndMutate APIs to the Table and AsyncTable interfaces, and deprecated the old checkAndMutate APIs. The example code for the new APIs are as follows:
 957 \`\`\`
 958 Table table = ...;
 959
 960 CheckAndMutate checkAndMutate = ...;
 961
 962 // Perform the checkAndMutate operation
 963 boolean success = table.checkAndMutate(checkAndMutate);
 964
 965 CheckAndMutate checkAndMutate1 = ...;
 966 CheckAndMutate checkAndMutate2 = ...;
 967
 968 // Batch version
 969 List\<Boolean\> successList = table.checkAndMutate(Arrays.asList(checkAndMutate1, checkAndMutate2));
 970 \`\`\`
 971
 972 This also has Protocol Buffers level changes. Old clients without this patch will work against new servers with this patch. However, new clients will break against old servers without this patch for checkAndMutate with RM and mutateRow. So, for rolling upgrade, we will need to upgrade servers first, and then roll out the new clients.
 973
 974
 975 ---
 976
 977 * [HBASE-24471](https://issues.apache.org/jira/browse/HBASE-24471) | *Major* | **The way we bootstrap meta table is confusing**
 978
 979 Move all the meta initialization code in MasterFileSystem and HRegionServer to InitMetaProcedure. Add a new step for InitMetaProcedure called INIT\_META\_WRITE\_FS\_LAYOUT to place the moved code.
 980
 981 This is an incompatible change, but should not have much impact. InitMetaProcedure will only be executed once when bootstraping a fresh new cluster, so typically this will not effect rolling upgrading. And even if you hit this problem, as long as InitMetaProcedure has not been finished, we can make sure that there is no user data in the cluster, you can just clean up the cluster and try again. There will be no data loss.
 982
 983
 984 ---
 985
 986 * [HBASE-24017](https://issues.apache.org/jira/browse/HBASE-24017) | *Major* | **Turn down flakey rerun rate on all but hot branches**
 987
 988 Changed master, branch-2, and branch-2.1 to twice a day.
 989 Left branch-2.3, branch-2.2, and branch-1 at every 4 hours.
 990 Changed branch-1.4 and branch-1.3 to @daily (1.3 was running every hour).
 991
 992
 993
 994 # HBASE  2.3.0 Release Notes
 995
 996 These release notes cover new developer and user-facing incompatibilities, important issues, features, and major improvements.
 997
 998
 999 ---
1000
1001 * [HBASE-24603](https://issues.apache.org/jira/browse/HBASE-24603) | *Critical* | **Zookeeper sync() call is async**
1002
1003 <!-- markdown -->
1004
1005 Fixes a couple of bugs in ZooKeeper interaction. Firstly, zk sync() call that is used to sync the lagging followers with leader so that the client sees a consistent snapshot state was actually asynchronous under the hood. We make it synchronous for correctness. Second, zookeeper events are now processed in a separate thread rather than doing it in the thread context of zookeeper client connection. This decoupling frees up client connection quickly and avoids deadlocks.
1006
1007
1008 ---
1009
1010 * [HBASE-24631](https://issues.apache.org/jira/browse/HBASE-24631) | *Major* | **Loosen Dockerfile pinned package versions of the "debian-revision"**
1011
1012 <!-- markdown -->
1013 Update our package version numbers throughout the Dockerfiles to be pinned to their epic:upstream-version components only. Previously we'd specify the full debian package version number, including the debian-revision. This lead to instability as debian packaging details changed.
1014 See also [man deb-version](http://manpages.ubuntu.com/manpages/xenial/en/man5/deb-version.5.html)
1015
1016
1017 ---
1018
1019 * [HBASE-24205](https://issues.apache.org/jira/browse/HBASE-24205) | *Major* | **Create metric to know the number of reads that happens from memstore**
1020
1021 Adds a new metric where we collect the number of read requests (tracked per row) whether the row was fetched completely from memstore or it was pulled from files  and memstore.
1022 The metric is now collected under the mbean for Tables and under the mbean for regions.
1023 Under table mbean ie.-
1024 'name": "Hadoop:service=HBase,name=RegionServer,sub=Tables'
1025 The new metrics will be listed as
1026 {code}
1027     "Namespace\_default\_table\_t3\_columnfamily\_f1\_metric\_memstoreOnlyRowReadsCount": 5,
1028  "Namespace\_default\_table\_t3\_columnfamily\_f1\_metric\_mixedRowReadsCount": 1,
1029 {code}
1030 Where the format is Namespace\_\<namespacename\>\_table\_\<tableName\>\_columnfamily\_\<columnfamilyname\>\_metric\_memstoreOnlyRowReadsCount
1031 Namespace\_\<namespacename\>\_table\_\<tableName\>\_columnfamily\_\<columnfamilyname\>\_metric\_mixedRowReadsCount
1032 {code}
1033
1034 The same one under the region ie.
1035 "name": "Hadoop:service=HBase,name=RegionServer,sub=Regions",
1036 comes as
1037 {code}
1038    "Namespace\_default\_table\_t3\_region\_75a7846f4ac4a2805071a855f7d0dbdc\_store\_f1\_metric\_memstoreOnlyRowReadsCount": 5,
1039     "Namespace\_default\_table\_t3\_region\_75a7846f4ac4a2805071a855f7d0dbdc\_store\_f1\_metric\_mixedRowReadsCount": 1,
1040 {code}
1041 where
1042 Namespace\_\<namespacename\_table\_\<tableName\>\_region\_\<regionName\>\_store\_\<storeName\>\_metric\_memstoreOnlyRowReadsCount
1043 Namespace\_\<namespacename\_table\_\<tableName\>\_region\_\<regionName\>\_store\_\<storeName\>\_metric\_mixedRowReadsCount
1044 This is also an aggregate against every store the number of reads that happened purely from the memstore or it was a  mixed read that happened from memstore and file.
1045
1046
1047 ---
1048
1049 * [HBASE-21773](https://issues.apache.org/jira/browse/HBASE-21773) | *Critical* | **rowcounter utility should respond to pleas for help**
1050
1051 This adds [-h\|-help] options to rowcounter. Passing either -h or -help will print rowcounter guide as below:
1052
1053 $hbase rowcounter -h
1054
1055 usage: hbase rowcounter \<tablename\> [options] [\<column1\> \<column2\>...]
1056 Options:
1057     --starttime=\<arg\>       starting time filter to start counting rows from.
1058     --endtime=\<arg\>         end time filter limit, to only count rows up to this timestamp.
1059     --range=\<arg\>           [startKey],[endKey][;[startKey],[endKey]...]]
1060     --expectedCount=\<arg\>   expected number of rows to be count.
1061 For performance, consider the following configuration properties:
1062 -Dhbase.client.scanner.caching=100
1063 -Dmapreduce.map.speculative=false
1064
1065
1066 ---
1067
1068 * [HBASE-24217](https://issues.apache.org/jira/browse/HBASE-24217) | *Major* | **Add hadoop 3.2.x support**
1069
1070 CI coverage has been extended to include Hadoop 3.2.x for HBase 2.2+.
1071
1072
1073 ---
1074
1075 * [HBASE-23055](https://issues.apache.org/jira/browse/HBASE-23055) | *Major* | **Alter hbase:meta**
1076
1077 Adds being able to edit hbase:meta table schema. For example,
1078
1079 hbase(main):006:0\> alter 'hbase:meta', {NAME =\> 'info', DATA\_BLOCK\_ENCODING =\> 'ROW\_INDEX\_V1'}
1080 Updating all regions with the new schema...
1081 All regions updated.
1082 Done.
1083 Took 1.2138 seconds
1084
1085 You can even add columnfamilies. Howevert, you cannot delete any of the core hbase:meta column families such as 'info' and 'table'.
1086
1087
1088 ---
1089
1090 * [HBASE-15161](https://issues.apache.org/jira/browse/HBASE-15161) | *Major* | **Umbrella: Miscellaneous improvements from production usage**
1091
1092 This ticket summarizes significant improvements and expansion to the metrics surface area. Interested users should review the individual sub-tasks.
1093
1094
1095 ---
1096
1097 * [HBASE-24545](https://issues.apache.org/jira/browse/HBASE-24545) | *Major* | **Add backoff to SCP check on WAL split completion**
1098
1099 Adds backoff in ServerCrashProcedure wait on WAL split to complete if large backlog of files to split (Its possible to avoid SCP blocking, waiting on WALs to split if you use procedure-based splitting --  set 'hbase.split.wal.zk.coordinated' to false to enable procedure based wal splitting.)
1100
1101
1102 ---
1103
1104 * [HBASE-24524](https://issues.apache.org/jira/browse/HBASE-24524) | *Minor* | **SyncTable logging improvements**
1105
1106 Notice this has changed log level for mismatching row keys, originally those were being logged at INFO level, now it's logged at DEBUG level. This is consistent with the logging of mismatching cells. Also, for missing row keys, it now logs row key values in human readable format, making it more meaningful for operators troubleshooting mismatches.
1107
1108
1109 ---
1110
1111 * [HBASE-24359](https://issues.apache.org/jira/browse/HBASE-24359) | *Major* | **Optionally ignore edits for deleted CFs for replication.**
1112
1113 Introduce a new config hbase.replication.drop.on.deleted.columnfamily, default is false. When config to true, the replication will drop the edits for columnfamily that has been deleted from the replication source and target.
1114
1115
1116 ---
1117
1118 * [HBASE-24418](https://issues.apache.org/jira/browse/HBASE-24418) | *Major* | **Consolidate Normalizer implementations**
1119
1120 <!-- markdown -->
1121 This change extends the Normalizer with a handful of new configurations. The configuration points supported are:
1122 * `hbase.normalizer.split.enabled` Whether to split a region as part of normalization. Default: `true`.
1123 * `hbase.normalizer.merge.enabled` Whether to merge a region as part of normalization. Default `true`.
1124 * `hbase.normalizer.min.region.count` The minimum number of regions in a table to consider it for merge normalization. Default: 3.
1125 * `hbase.normalizer.merge.min_region_age.days` The minimum age for a region to be considered for a merge, in days. Default: 3.
1126 * `hbase.normalizer.merge.min_region_size.mb` The minimum size for a region to be considered for a merge, in whole MBs. Default: 1.
1127
1128
1129 ---
1130
1131 * [HBASE-24309](https://issues.apache.org/jira/browse/HBASE-24309) | *Major* | **Avoid introducing log4j and slf4j-log4j dependencies for modules other than hbase-assembly**
1132
1133 Add a hbase-logging module, put the log4j related code in this module only so other modules do not need to depend on log4j at compile scope. See the comments of Log4jUtils and InternalLog4jUtils for more details.
1134
1135 Add a log4j.properties to the test jar of hbase-logging module, so for other sub modules we just need to depend on the test jar of hbase-logging module at test scope to output the log to console, without placing a log4j.properties in the test resources as they all (almost) have the same content. And this test module will not be included in the assembly tarball so it will not mess up the binary distribution.
1136
1137 Ban direct commons-logging dependency, and ban commons-logging and log4j imports in non-test code, to avoid mess up the downstream users logging framework. In hbase-logging module we do need to use log4j classes and the trick is to use full class name.
1138
1139 Add jcl-over-slf4j and jul-to-slf4j dependencies, as some of our dependencies use jcl or jul as logging framework, we should also redirect their log message to slf4j.
1140
1141
1142 ---
1143
1144 * [HBASE-21406](https://issues.apache.org/jira/browse/HBASE-21406) | *Minor* | **"status 'replication'" should not show SINK if the cluster does not act as sink**
1145
1146 Added new metric to differentiate sink startup time from last OP applied time.
1147
1148 Original behaviour was to always set startup time to TimestampsOfLastAppliedOp, and always show it on "status 'replication'" command, regardless if the sink ever applied any OP.
1149
1150 This was confusing, specially for scenarios where cluster was just acting as source, the output could lead to wrong interpretations about sink not applying edits or replication being stuck.
1151
1152 With the new metric, we now compare the two metrics values, assuming that if both are the same, there's never been any OP shipped to the given sink, so output would reflect it more clearly, to something as for example:
1153
1154 SINK: TimeStampStarted=Thu Dec 06 23:59:47 GMT 2018, Waiting for OPs...
1155
1156
1157 ---
1158
1159 * [HBASE-24132](https://issues.apache.org/jira/browse/HBASE-24132) | *Major* | **Upgrade to Apache ZooKeeper 3.5.7**
1160
1161 <!-- markdown -->
1162 HBase ships ZooKeeper 3.5.x. Was the EOL'd 3.4.x. 3.5.x client can talk to 3.4.x ensemble.
1163
1164 The ZooKeeper project has built a [FAQ](https://cwiki.apache.org/confluence/display/ZOOKEEPER/Upgrade+FAQ) that documents known issues and work-arounds when upgrading existing deployments.
1165
1166
1167 ---
1168
1169 * [HBASE-22287](https://issues.apache.org/jira/browse/HBASE-22287) | *Major* | **inifinite retries on failed server in RSProcedureDispatcher**
1170
1171 Add backoff. Avoid retrying every 100ms.
1172
1173
1174 ---
1175
1176 * [HBASE-24425](https://issues.apache.org/jira/browse/HBASE-24425) | *Major* | **Run hbck\_chore\_run and catalogjanitor\_run on draw of 'HBCK Report' page**
1177
1178 Runs 'catalogjanitor\_run' and 'hbck\_chore\_run' inline with the loading of the 'HBCK Report' page.
1179
1180 Pass '?cache=true' to skip inline invocation of 'catalogjanitor\_run' and 'hbck\_chore\_run' drawing the page.
1181
1182
1183 ---
1184
1185 * [HBASE-24408](https://issues.apache.org/jira/browse/HBASE-24408) | *Blocker* | **Introduce a general 'local region' to store data on master**
1186
1187 Introduced a general 'local region' at master side to store the procedure data, etc.
1188
1189 The hfile of this region will be stored on the root fs while the wal will be stored on the wal fs. This issue supercedes part of the code for HBASE-23326, as now we store the data in 'MasterData' directory instead of 'MasterProcs'.
1190
1191 The old hfiles will be moved to the global hfile archived directory with the suffix $-masterlocalhfile-$. The wal files will be moved to the global old wal directory with the suffix $masterlocalwal$. The TimeToLiveMasterLocalStoreHFileCleaner and TimeToLiveMasterLocalStoreWALCleaner are configured by default for cleaning the old hfiles and wal files, and the default TTLs are both 7 days.
1192
1193
1194 ---
1195
1196 * [HBASE-24115](https://issues.apache.org/jira/browse/HBASE-24115) | *Major* | **Relocate test-only REST "client" from src/ to test/ and mark Private**
1197
1198 Relocate test-only REST RemoteHTable and RemoteAdmin from src/ to test/. And mark them as InterfaceAudience.Private.
1199
1200
1201 ---
1202
1203 * [HBASE-23938](https://issues.apache.org/jira/browse/HBASE-23938) | *Major* | **Replicate slow/large RPC calls to HDFS**
1204
1205 Config key: hbase.regionserver.slowlog.systable.enabled
1206 Default value: false
1207
1208 This config can be enabled if hbase.regionserver.slowlog.buffer.enabled is already enabled. While hbase.regionserver.slowlog.buffer.enabled ensures that any slow/large RPC logs with complete details are written to ring buffer available at each RegionServer, hbase.regionserver.slowlog.systable.enabled would ensure that all such logs are also persisted in new system table hbase:slowlog.
1209 Operator can scan hbase:slowlog with filters to retrieve specific attribute matching records and this table would be useful to capture historical performance of slowness of RPC calls with detailed analysis.
1210
1211 hbase:slowlog consists of single ColumnFamily info. info consists of multiple qualifiers similar to the attributes available to query as part of Admin API: get\_slowlog\_responses.
1212
1213 One example of a row from hbase:slowlog scan result (Attached a sample screenshot in the Jira) :
1214
1215  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:call\_details, timestamp=2020-05-16T14:59:58.764Z, value=Scan(org.apache.hadoop.hbase.shaded.protobuf.generated.ClientProtos$ScanRequest)
1216  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:client\_address, timestamp=2020-05-16T14:59:58.764Z, value=172.20.10.2:57348
1217  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:method\_name, timestamp=2020-05-16T14:59:58.764Z, value=Scan
1218  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:param, timestamp=2020-05-16T14:59:58.764Z, value=region { type: REGION\_NAME value: "cluster\_test,cccccccc,1589635796466.aa45e1571d533f5ed0bb31cdccaaf9cf." } scan { a
1219                                                              ttribute { name: "\_isolationlevel\_" value: "\\x5C000" } start\_row: "cccccccc" time\_range { from: 0 to: 9223372036854775807 } max\_versions: 1 cache\_blocks: true max\_result\_size: 2
1220                                                              097152 caching: 2147483647 include\_stop\_row: false } number\_of\_rows: 2147483647 close\_scanner: false client\_handles\_partials: true client\_handles\_heartbeats: true track\_scan\_met
1221                                                              rics: false
1222  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:processing\_time, timestamp=2020-05-16T14:59:58.764Z, value=24
1223  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:queue\_time, timestamp=2020-05-16T14:59:58.764Z, value=0
1224  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:region\_name, timestamp=2020-05-16T14:59:58.764Z, value=cluster\_test,cccccccc,1589635796466.aa45e1571d533f5ed0bb31cdccaaf9cf.
1225  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:response\_size, timestamp=2020-05-16T14:59:58.764Z, value=211227
1226  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:server\_class, timestamp=2020-05-16T14:59:58.764Z, value=HRegionServer
1227  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:start\_time, timestamp=2020-05-16T14:59:58.764Z, value=1589640743932
1228  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:type, timestamp=2020-05-16T14:59:58.764Z, value=ALL
1229  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:username, timestamp=2020-05-16T14:59:58.764Z, value=vjasani
1230
1231
1232 ---
1233
1234 * [HBASE-24271](https://issues.apache.org/jira/browse/HBASE-24271) | *Major* | **Set values in \`conf/hbase-site.xml\` that enable running on \`LocalFileSystem\` out of the box**
1235
1236 <!-- markdown -->
1237 HBASE-24271 makes changes the the default `conf/hbase-site.xml` such that `bin/hbase` will run directly out of the binary tarball or a compiled source tree without any configuration modifications vs. Hadoop 2.8+. This changes our long-standing history of shipping no configured values in `conf/hbase-site.xml`, so existing processes that assume this file is empty of configuration properties may require attention.
1238
1239
1240 ---
1241
1242 * [HBASE-24310](https://issues.apache.org/jira/browse/HBASE-24310) | *Major* | **Use Slf4jRequestLog for hbase-http**
1243
1244 Use Slf4jRequestLog instead of the log4j HttpRequestLogAppender in HttpServer.
1245
1246 The request log is disabled by default in conf/log4j.properties by the following lines:
1247
1248 # Disable request log by default, you can enable this by changing the appender
1249 log4j.category.http.requests=INFO,NullAppender
1250 log4j.additivity.http.requests=false
1251
1252 Change the 'NullAppender' to what ever you want if you want to enable request log.
1253
1254 Notice that, the logger name for master status http server is 'http.requests.master', and for region server it is 'http.requests.regionserver'
1255
1256
1257 ---
1258
1259 * [HBASE-24335](https://issues.apache.org/jira/browse/HBASE-24335) | *Major* | **Support deleteall with ts but without column in shell mode**
1260
1261 Use a empty string to represent no column specified for deleteall in shell mode.
1262 useage:
1263 deleteall 'test','r1','',12345
1264 deleteall 'test', {ROWPREFIXFILTER =\> 'prefix'}, '', 12345
1265
1266
1267 ---
1268
1269 * [HBASE-24304](https://issues.apache.org/jira/browse/HBASE-24304) | *Major* | **Separate a hbase-asyncfs module**
1270
1271 Added a new hbase-asyncfs module to hold the asynchronous dfs output stream implementation for implementing WAL.
1272
1273
1274 ---
1275
1276 * [HBASE-22710](https://issues.apache.org/jira/browse/HBASE-22710) | *Major* | **Wrong result in one case of scan that use  raw and versions and filter together**
1277
1278 Make the logic of the versions chosen more reasonable for raw scan, to avoid lose result when using filter.
1279
1280
1281 ---
1282
1283 * [HBASE-24285](https://issues.apache.org/jira/browse/HBASE-24285) | *Major* | **Move to hbase-thirdparty-3.3.0**
1284
1285 Moved to hbase-thirdparty 3.3.0.
1286
1287
1288 ---
1289
1290 * [HBASE-24252](https://issues.apache.org/jira/browse/HBASE-24252) | *Major* | **Implement proxyuser/doAs mechanism for hbase-http**
1291
1292 This feature enables the HBase Web UI's to accept a 'proxyuser' via the HTTP Request's query string. When the parameter \`hbase.security.authentication.spnego.kerberos.proxyuser.enable\` is set to \`true\` in hbase-site.xml (default is \`false\`), the HBase UI will attempt to impersonate the user specified by the query parameter "doAs". This query parameter is checked case-insensitively. When this option is not provided, the user who executed the request is the "real" user and there is no ability to execute impersonation against the WebUI.
1293
1294 For example, if the user "bob" with Kerberos credentials executes a request against the WebUI with this feature enabled and a query string which includes \`doAs=alice\`, the HBase UI will treat this request as executed as \`alice\`, not \`bob\`.
1295
1296 The standard Hadoop proxyuser configuration properties to limit users who may impersonate others apply to this change (e.g. to enable \`bob\` to impersonate \`alice\`). See the Hadoop documentation for more information on how to configure these proxyuser rules.
1297
1298
1299 ---
1300
1301 * [HBASE-24143](https://issues.apache.org/jira/browse/HBASE-24143) | *Major* | **[JDK11] Switch default garbage collector from CMS**
1302
1303 <!-- markdown -->
1304 `bin/hbase` will now dynamically select a Garbage Collector implementation based on the detected JVM version. JDKs 8,9,10 use `-XX:+UseConcMarkSweepGC`, while JDK11+ use `-XX:+UseG1GC`.
1305
1306 Notice a slight compatibility change. Previously, the garbage collector choice would always be appended to a user-provided value for `HBASE_OPTS`. As of this change, this setting will only be applied when `HBASE_OPTS` is unset. That means that operators who provide a value for this variable will now need to also specify the collector. This is especially important for those on JDK8, where the vm default GC is not the recommended ConcMarkSweep.
1307
1308
1309 ---
1310
1311 * [HBASE-24024](https://issues.apache.org/jira/browse/HBASE-24024) | *Major* | **Optionally reject multi() requests with very high no of rows**
1312
1313 New Config: hbase.rpc.rows.size.threshold.reject
1314 -----------------------------------------------------------------------
1315
1316 Default value: false
1317 Description:
1318 If value is true, RegionServer will abort batch requests of Put/Delete with number of rows in a batch operation exceeding threshold defined by value of config: hbase.rpc.rows.warning.threshold.
1319
1320
1321 ---
1322
1323 * [HBASE-24139](https://issues.apache.org/jira/browse/HBASE-24139) | *Critical* | **Balancer should avoid leaving idle region servers**
1324
1325 StochasticLoadBalancer functional improvement:
1326
1327 StochasticLoadBalancer would rebalance the cluster if there are any idle RegionServers in the cluster (RegionServer having no region), while other RegionServers have at least 1 region available.
1328
1329
1330 ---
1331
1332 * [HBASE-24196](https://issues.apache.org/jira/browse/HBASE-24196) | *Major* | **[Shell] Add rename rsgroup command in hbase shell**
1333
1334 user or admin can now use
1335 hbase shell \> rename\_rsgroup 'oldname', 'newname'
1336 to rename rsgroup.
1337
1338
1339 ---
1340
1341 * [HBASE-24218](https://issues.apache.org/jira/browse/HBASE-24218) | *Major* | **Add hadoop 3.2.x in hadoop check**
1342
1343 Add hadoop-3.2.0 and hadoop-3.2.1 in hadoop check and when '--quick-hadoopcheck' we will only check hadoop-3.2.1.
1344
1345 Notice that, for aligning the personality scripts across all the active branches, we will commit the patch to all active branches, but the hadoop-3.2.x support in hadoopcheck is only applied to branch-2.2+.
1346
1347
1348 ---
1349
1350 * [HBASE-23829](https://issues.apache.org/jira/browse/HBASE-23829) | *Major* | **Get \`-PrunSmallTests\` passing on JDK11**
1351
1352 \`-PrunSmallTests\` now pass on JDK11 when using \`-Phadoop.profile=3.0\`.
1353
1354
1355 ---
1356
1357 * [HBASE-24185](https://issues.apache.org/jira/browse/HBASE-24185) | *Major* | **Junit tests do not behave well with System.exit or Runtime.halt or JVM exits in general.**
1358
1359 Tests that fail because a process -- RegionServer or Master -- called System.exit, will now instead throw an exception.
1360
1361
1362 ---
1363
1364 * [HBASE-24072](https://issues.apache.org/jira/browse/HBASE-24072) | *Major* | **Nightlies reporting OutOfMemoryError: unable to create new native thread**
1365
1366 Hadoop hosts have had their ulimit -u raised from 10000 to 30000 (per user, by INFRA). The Docker build container has had its limit raised from 10000 to 12500.
1367
1368
1369 ---
1370
1371 * [HBASE-24112](https://issues.apache.org/jira/browse/HBASE-24112) | *Major* | **[RSGroup] Support renaming rsgroup**
1372
1373 Support RSGroup renaming in core codebase. New API Admin#renameRSGroup(String, String) is introduced in 3.0.0.
1374
1375
1376 ---
1377
1378 * [HBASE-23994](https://issues.apache.org/jira/browse/HBASE-23994) | *Trivial* | ** Add WebUI to Canary**
1379
1380 <!-- markdown -->
1381 The Canary tool now offers a WebUI when run in `region` mode (the default mode). It is enabled by default, and by default, it binds to `0.0.0.0:16050`. This can be overridden by setting `hbase.canary.info.bindAddress` and `hbase.canary.info.port`. To disable entirely, set the port to `-1`.
1382
1383
1384 ---
1385
1386 * [HBASE-23779](https://issues.apache.org/jira/browse/HBASE-23779) | *Major* | **Up the default fork count to make builds complete faster; make count relative to CPU count**
1387
1388 Pass --threads=2 building on jenkins. It shortens nightly build times by about ~25%.
1389
1390 It works by running module build/test in parallel when dependencies allow. Upping the forkcount beyond the pom default of 0.25C would have us broach our CPU budget on jenkins when two modules are running in parallel (2 modules at 0.25% of CPU each makes 0.5C and on jenkins, hadoop nodes run two jenkins executors per host).  Higher forkcounts also seems to threaten build stability.
1391
1392 For running tests locally, to go faster, up fork count.
1393
1394 $ x="0.5C"  ;  mvn --threads=2  -Dsurefire.firstPartForkCount=$x -Dsurefire.secondPartForkCount=$x test -PrunAllTests
1395
1396 You could up the x from 0.5C to 1.0C but YMMV (On overcommitted hardware, tests start bombing out pretty soon after startup). You could try upping thread count but on occasion are likely to overcommit hardware.
1397
1398
1399 ---
1400
1401 * [HBASE-24126](https://issues.apache.org/jira/browse/HBASE-24126) | *Major* | **Up the container nproc uplimit from 10000 to 12500**
1402
1403 Start docker with upped ulimit for nproc passing '--ulimit nproc=12500'. It was 10000, the default, but made it 12500. Then, set PROC\_LIMIT in hbase-personality so when yetus runs, it is w/ the new 12500 value.
1404
1405
1406 ---
1407
1408 * [HBASE-24150](https://issues.apache.org/jira/browse/HBASE-24150) | *Major* | **Allow module tests run in parallel**
1409
1410 Pass -T2 to mvn. Makes it so we do two modules-at-a-time dependencies willing. Helps speed build and testing. Doubles the resource usage when running modules in parallel.
1411
1412
1413 ---
1414
1415 * [HBASE-24121](https://issues.apache.org/jira/browse/HBASE-24121) | *Major* | **[Authorization] ServiceAuthorizationManager isn't dynamically updatable. And it should be.**
1416
1417 Master & RegionService now support refresh policy authorization defined in hbase-policy.xml without restarting service. To refresh policy, please execute hbase shell command: update\_config or update\_config\_all after policy file updated and synced on all nodes.
1418
1419
1420 ---
1421
1422 * [HBASE-24099](https://issues.apache.org/jira/browse/HBASE-24099) | *Major* | **Use a fair ReentrantReadWriteLock for the region close lock**
1423
1424 This change modifies the default acquisition policy for the region's close lock in order to prevent observed starvation of close requests. The new boolean configuration parameter 'hbase.regionserver.fair.region.close.lock' controls the lock acquisition policy: if true, the lock is created in fair mode (default); if false, the lock is created in nonfair mode (the old default).
1425
1426
1427 ---
1428
1429 * [HBASE-23153](https://issues.apache.org/jira/browse/HBASE-23153) | *Major* | **PrimaryRegionCountSkewCostFunction SLB function should implement CostFunction#isNeeded**
1430
1431 <!-- markdown -->
1432 The `PrimaryRegionCountSkewCostFunction` for the `StochasticLoadBalancer` is only needed when the read replicas feature is enabled. With this change, that function now properly indicates that it is not needed when the read replica feature is off.
1433
1434 If this improvement is not available, operators with clusters that are not using the read replica feature should manually disable it by setting `hbase.master.balancer.stochastic.primaryRegionCountCost` to `0.0` in hbase-site.xml for all HBase Masters.
1435
1436
1437 ---
1438
1439 * [HBASE-24055](https://issues.apache.org/jira/browse/HBASE-24055) | *Major* | **Make AsyncFSWAL can run on EC cluster**
1440
1441 Now AsyncFSWAL can also be used against the directory which has EC enabled. Need to make sure you also make use of the hadoop 3.x client as the option is only available in hadoop 3.x.
1442
1443
1444 ---
1445
1446 * [HBASE-24113](https://issues.apache.org/jira/browse/HBASE-24113) | *Major* | **Upgrade the maven we use from 3.5.4 to 3.6.3 in nightlies**
1447
1448 Branches-2.3+ use maven 3.5.3 building. Older branches use 3.5.4 still.
1449
1450
1451 ---
1452
1453 * [HBASE-24122](https://issues.apache.org/jira/browse/HBASE-24122) | *Major* | **Change machine ulimit-l to ulimit-a so dumps full ulimit rather than just 'max locked memory'**
1454
1455 Our 'Build Artifacts' have a machine directory under which we emit vitals on the host the build was run on. We used to emit the result of 'ulimit -l' as a file named 'ulimit-l'. This has been hijacked to instead emit result of running 'ulimit -a' which includes stat on ulimit -l.
1456
1457
1458 ---
1459
1460 * [HBASE-23678](https://issues.apache.org/jira/browse/HBASE-23678) | *Major* | **Literate builder API for version management in schema**
1461
1462 ColumnFamilyDescriptor new builder API:
1463
1464     /\*\*
1465      \* Retain all versions for a given TTL(retentionInterval), and then only a specific number
1466      \* of versions(versionAfterInterval) after that interval elapses.
1467      \*
1468      \* @param retentionInterval Retain all versions for this interval
1469      \* @param versionAfterInterval Retain no of versions to retain after retentionInterval
1470      \*/
1471     public ModifyableColumnFamilyDescriptor setVersionsWithTimeToLive(
1472         final int retentionInterval, final int versionAfterInterval)
1473
1474
1475 ---
1476
1477 * [HBASE-24050](https://issues.apache.org/jira/browse/HBASE-24050) | *Major* | **Deprecated PBType on all 2.x branches**
1478
1479 org.apache.hadoop.hbase.types.PBType is marked as deprecated without any replacement. It will be moved to hbase-example module and marked as IA.Private in 3.0.0. This is a mistake as it should not be part of our public API. Users who depend on this class should just copy the code your own code base.
1480
1481
1482 ---
1483
1484 * [HBASE-8868](https://issues.apache.org/jira/browse/HBASE-8868) | *Minor* | **add metric to report client shortcircuit reads**
1485
1486 Expose file system level read metrics for RegionServer.
1487
1488 If the HBase RS runs on top of HDFS, calculate the aggregation of
1489 ReadStatistics of each HdfsFileInputStream. These metrics include:
1490 (1) total number of bytes read from HDFS.
1491 (2) total number of bytes read from local DataNode.
1492 (3) total number of bytes read locally through short-circuit read.
1493 (4) total number of bytes read locally through zero-copy read.
1494
1495 Because HDFS ReadStatistics is calculated per input stream, it is not
1496 feasible to update the aggregated number in real time. Instead, the
1497 metrics are updated when an input stream is closed.
1498
1499
1500 ---
1501
1502 * [HBASE-24032](https://issues.apache.org/jira/browse/HBASE-24032) | *Major* | **[RSGroup] Assign created tables to respective rsgroup automatically instead of manual operations**
1503
1504 Admin can determine which tables go to which rsgroup by script  (setting hbase.rsgroup.table.mapping.script with local filystem path) on Master side which aims to lighten the burden of admin operations.  Note, since HBase 3+, rsgroup can be specified in TableDescriptor as well, if clients specify this, master will skip the determination from script.
1505
1506 Here is a simple example of script:
1507 {code}
1508 # Input consists of two string, 1st is the namespace of the table, 2nd is the table name of the table
1509 #!/bin/bash
1510 namespace=$1
1511 tablename=$2
1512 if [[ $namespace == test ]]; then
1513   echo test
1514 elif [[ $tablename == \*foo\* ]]; then
1515   echo other
1516 else
1517   echo default
1518 fi
1519 {code}
1520
1521
1522 ---
1523
1524 * [HBASE-23993](https://issues.apache.org/jira/browse/HBASE-23993) | *Major* | **Use loopback for zk standalone server in minizkcluster**
1525
1526 MiniZKCluster now puts up its standalone node listening on loopback/127.0.0.1 rather than "localhost".
1527
1528
1529 ---
1530
1531 * [HBASE-23986](https://issues.apache.org/jira/browse/HBASE-23986) | *Major* | **Bump hadoop-two.version to 2.10.0 on master and branch-2**
1532
1533 Bumped hadoop-two.version to 2.10.0, which means we will drop the support for hadoop-2.8.x and hadoop-2.9.x.
1534
1535
1536 ---
1537
1538 * [HBASE-23930](https://issues.apache.org/jira/browse/HBASE-23930) | *Minor* | **Shell should attempt to format \`timestamp\` attributes as ISO-8601**
1539
1540 Change timestamp display to be ISO8601 when toString on Cell and outputting in shell....
1541
1542 User used to see....
1543
1544   column=table:state, timestamp=1583967620343 .....
1545
1546 ... but now sees:
1547
1548   column=table:state, timestamp=2020-03-11T23:00:20.343Z ....
1549
1550
1551 ---
1552
1553 * [HBASE-22827](https://issues.apache.org/jira/browse/HBASE-22827) | *Major* | **Expose multi-region merge in shell and Admin API**
1554
1555 merge\_region shell command can now be used to merge more than 2 regions as well. It takes a list of regions as comma separated values or as an array of regions, and not just 2 regions. The full regionnames and encoded regionnames are continued to be accepted.
1556
1557
1558 ---
1559
1560 * [HBASE-23767](https://issues.apache.org/jira/browse/HBASE-23767) | *Major* | **Add JDK11 compilation and unit test support to Github precommit**
1561
1562 Rebuild our Dockerfile with support for multiple JDK versions. Use multiple stages in the Jenkinsfile instead of yetus's multijdk because of YETUS-953. Run those multiple stages in parallel to speed up results.
1563
1564 Note that multiple stages means multiple Yetus invocations means multiple comments on the PreCommit. This should become more obvious to users once we can make use of GitHub Checks API, HBASE-23902.
1565
1566
1567 ---
1568
1569 * [HBASE-22978](https://issues.apache.org/jira/browse/HBASE-22978) | *Minor* | **Online slow response log**
1570
1571 get\_slowlog\_responses and clear\_slowlog\_responses are used to retrieve and clear slow RPC logs from RingBuffer maintained by RegionServers.
1572
1573 New Admin APIs:
1574 1.   List\<SlowLogRecord\> getSlowLogResponses(final Set\<ServerName\> serverNames,
1575       final SlowLogQueryFilter slowLogQueryFilter) throws IOException;
1576
1577 2.   List\<Boolean\> clearSlowLogResponses(final Set\<ServerName\> serverNames)
1578       throws IOException;
1579
1580 Configs:
1581
1582 1. hbase.regionserver.slowlog.ringbuffer.size:
1583 Default size of ringbuffer to be maintained by each RegionServer in order to store online slowlog responses. This is an in-memory ring buffer of requests that were judged to be too slow in addition to the responseTooSlow logging. The in-memory representation would be complete. For more details, please look into Doc Section: Get Slow Response Log from shell
1584
1585 Default
1586 256
1587
1588 2. hbase.regionserver.slowlog.buffer.enabled:
1589 Indicates whether RegionServers have ring buffer running for storing Online Slow logs in FIFO manner with limited entries. The size of the ring buffer is indicated by config: hbase.regionserver.slowlog.ringbuffer.size The default value is false, turn this on and get latest slowlog responses with complete data.
1590
1591 Default
1592 false
1593
1594
1595 For more details, please look into "Get Slow Response Log from shell" section from HBase book.
1596
1597
1598 ---
1599
1600 * [HBASE-23926](https://issues.apache.org/jira/browse/HBASE-23926) | *Major* | **[Flakey Tests] Down the flakies re-run ferocity; it makes for too many fails.**
1601
1602 Down the flakey re-rerun fork count from 1.0C -- i.e. a fork per CPU -- to 0.25C. On a recent run, the machine had 16 cores. 0.25 is 4 cores. We'd hardcoded fork count at 3 previous to changes made by parent.
1603
1604
1605 ---
1606
1607 * [HBASE-23146](https://issues.apache.org/jira/browse/HBASE-23146) | *Major* | **Support CheckAndMutate with multiple conditions**
1608
1609 Add a checkAndMutate(row, filter) method in the AsyncTable interface and the Table interface.
1610
1611 This method atomically checks if the row matches the specified filter. If it does, it adds the Put/Delete/RowMutations.
1612
1613 This is a fluent style API, the code is like:
1614
1615 For Table interface:
1616 {code}
1617 table.checkAndMutate(row, filter).thenPut(put);
1618 {code}
1619
1620 For AsyncTable interface:
1621 {code}
1622 table.checkAndMutate(row, filter).thenPut(put)
1623     .thenAccept(succ -\> {
1624       if (succ) {
1625         System.out.println("Check and put succeeded");
1626       } else {
1627         System.out.println("Check and put failed");
1628       }
1629     });
1630 {code}
1631
1632
1633 ---
1634
1635 * [HBASE-23874](https://issues.apache.org/jira/browse/HBASE-23874) | *Minor* | **Move Jira-attached file precommit definition from script in Jenkins config to dev-support**
1636
1637 The Jira Precommit job (https://builds.apache.org/job/PreCommit-HBASE-Build/) will now look for a file within the source tree (dev-support/jenkins\_precommit\_jira\_yetus.sh) instead of depending on a script section embedded in the job.
1638
1639
1640 ---
1641
1642 * [HBASE-23865](https://issues.apache.org/jira/browse/HBASE-23865) | *Major* | **Up flakey history from 5 to 10**
1643
1644 Changed flakey list reporting to show 5 rather than 10 items. Also changed the second and first part fort counts to be 1C rather than hardcoded 3.
1645
1646
1647 ---
1648
1649 * [HBASE-23554](https://issues.apache.org/jira/browse/HBASE-23554) | *Major* | **Encoded regionname to regionname utility**
1650
1651     Adds shell command regioninfo:
1652
1653       hbase(main):001:0\>  regioninfo '0e6aa5c19ae2b2627649dc7708ce27d0'
1654       {ENCODED =\> 0e6aa5c19ae2b2627649dc7708ce27d0, NAME =\> 'TestTable,,1575941375972.0e6aa5c19ae2b2627649dc7708ce27d0.', STARTKEY =\> '', ENDKEY =\> '00000000000000000000299441'}
1655       Took 0.4737 seconds
1656
1657
1658 ---
1659
1660 * [HBASE-23350](https://issues.apache.org/jira/browse/HBASE-23350) | *Major* | **Make compaction files cacheonWrite configurable based on threshold**
1661
1662 This JIRA adds a new configuration - \`hbase.rs.cachecompactedblocksonwrite.threshold\`. This configuration is the maximum total size (in bytes) of the compacted files below which the configuration \`hbase.rs.cachecompactedblocksonwrite\` is honoured. If the total size of the compacted fies exceeds this threshold, even when \`hbase.rs.cachecompactedblocksonwrite\` is enabled, the data blocks are not cached. Caching index and bloom blocks is not affected by this configuration (user configuration is always honoured).
1663
1664 Default value of this configuration is Long.MAX\_VALUE. This means whatever the total size of the compacted files, it wil be cached.
1665
1666
1667 ---
1668
1669 * [HBASE-17115](https://issues.apache.org/jira/browse/HBASE-17115) | *Major* | **HMaster/HRegion Info Server does not honour admin.acl**
1670
1671 Implements authorization for the HBase Web UI by limiting access to certain endpoints which could be used to extract sensitive information from HBase.
1672
1673 Access to these restricted endpoints can be limited to a group of administrators, identified either by a list of users (hbase.security.authentication.spnego.admin.users) or by a list of groups
1674 (hbase.security.authentication.spnego.admin.groups).  By default, neither of these values are set which will preserve backwards compatibility (allowing all authenticated users to access all endpoints).
1675
1676 Further, users who have sensitive information in the HBase service configuration can set hbase.security.authentication.ui.config.protected to true which will treat the configuration endpoint as a protected, admin-only resource. By default, all authenticated users may access the configuration endpoint.
1677
1678
1679 ---
1680
1681 * [HBASE-23647](https://issues.apache.org/jira/browse/HBASE-23647) | *Major* | **Make MasterRegistry the default registry impl**
1682
1683 <!-- markdown -->
1684 Enables master based registry as the default registry used by clients to fetch connection metadata.
1685 Refer to the section "Master Registry" in the client documentation for more details and advantages
1686 of this implementation over the default Zookeeper based registry.
1687
1688 Configuration parameter that controls the registry in use: `hbase.client.registry.impl`
1689
1690 Where to set this: HBase client configuration (hbase-site.xml)
1691
1692 Possible values:
1693 - `org.apache.hadoop.hbase.client.ZKConnectionRegistry` (For ZK based registry implementation)
1694 - `org.apache.hadoop.hbase.client.MasterRegistry` (New, for master based registry implementation)
1695
1696 Notes on defaults:
1697
1698 - For v3.0.0 and later, MasterRegistry is the default registry
1699 - For all releases in 2.x line, ZK based registry is the default.
1700
1701 This feature has been back ported to 2.3.0 and later releases. MasterRegistry can be enabled by setting the following client configuration.
1702
1703 ```
1704 <property>
1705   <name>hbase.client.registry.impl</name>
1706   <value>org.apache.hadoop.hbase.client.MasterRegistry</value>
1707 </property>
1708 ```
1709
1710
1711 ---
1712
1713 * [HBASE-23069](https://issues.apache.org/jira/browse/HBASE-23069) | *Critical* | **periodic dependency bump for Sep 2019**
1714
1715 caffeine: 2.6.2 =\> 2.8.1
1716 commons-codec: 1.10 =\> 1.13
1717 commons-io: 2.5 =\> 2.6
1718 disrupter: 3.3.6 =\> 3.4.2
1719 httpcore: 4.4.6 =\> 4.4.13
1720 jackson: 2.9.10 =\> 2.10.1
1721 jackson.databind: 2.9.10.1 =\> 2.10.1
1722 jetty: 9.3.27.v20190418 =\> 9.3.28.v20191105
1723 protobuf.plugin: 0.5.0 =\> 0.6.1
1724 zookeeper: 3.4.10 =\> 3.4.14
1725 slf4j: 1.7.25 =\> 1.7.30
1726 rat: 0.12 =\> 0.13
1727 asciidoctor: 1.5.5 =\> 1.5.8
1728 asciidoctor.pdf: 1.5.0-alpha.15 =\> 1.5.0-rc.2
1729 error-prone: 2.3.3 =\> 2.3.4
1730
1731
1732 ---
1733
1734 * [HBASE-23686](https://issues.apache.org/jira/browse/HBASE-23686) | *Major* | **Revert binary incompatible change and remove reflection**
1735
1736 - Reverts a binary incompatible binary change for ByteRangeUtils
1737 - Usage of reflection inside CommonFSUtils removed
1738
1739
1740 ---
1741
1742 * [HBASE-23347](https://issues.apache.org/jira/browse/HBASE-23347) | *Major* | **Pluggable RPC authentication**
1743
1744 This change introduces an internal abstraction layer which allows for new SASL-based authentication mechanisms to be used inside HBase services. All existing SASL-based authentication mechanism were ported to the new abstraction, making no external change in runtime semantics, client API, or RPC serialization format.
1745
1746 Developers familiar with extending HBase can implement authentication mechanism beyond simple Kerberos and DelegationTokens which authenticate HBase users against some other user database. HBase service authentication (Master to/from RegionServer) continue to operate solely over Kerberos.
1747
1748
1749 ---
1750
1751 * [HBASE-23156](https://issues.apache.org/jira/browse/HBASE-23156) | *Major* | **start-hbase.sh failed with ClassNotFoundException when build with hadoop3**
1752
1753 Introduce a new hbase-assembly/src/main/assembly/hadoop-three-compat.xml for build with hadoop 3.x.
1754
1755
1756 ---
1757
1758 * [HBASE-23680](https://issues.apache.org/jira/browse/HBASE-23680) | *Major* | **RegionProcedureStore missing cleaning of hfile archive**
1759
1760 Add a new config to hbase-default.xml
1761
1762   \<property\>
1763     \<name\>hbase.procedure.store.region.hfilecleaner.plugins\</name\>
1764     \<value\>org.apache.hadoop.hbase.master.cleaner.TimeToLiveHFileCleaner\</value\>
1765     \<description\>A comma-separated list of BaseHFileCleanerDelegate invoked by
1766     the RegionProcedureStore HFileCleaner service. These HFiles cleaners are
1767     called in order, so put the cleaner that prunes the most files in front. To
1768     implement your own BaseHFileCleanerDelegate, just put it in HBase's classpath
1769     and add the fully qualified class name here. Always add the above
1770     default hfile cleaners in the list as they will be overwritten in
1771     hbase-site.xml.\</description\>
1772   \</property\>
1773
1774 It will share the same TTL with other HFileCleaners. And you can also implement your own cleaner and change this property to enable it.
1775
1776
1777 ---
1778
1779 * [HBASE-23675](https://issues.apache.org/jira/browse/HBASE-23675) | *Minor* | **Move to Apache parent POM version 22**
1780
1781 Updated parent pom to Apache version 22.
1782
1783
1784 ---
1785
1786 * [HBASE-23679](https://issues.apache.org/jira/browse/HBASE-23679) | *Critical* | **FileSystem instance leaks due to bulk loads with Kerberos enabled**
1787
1788 This issues fixes an issue with Bulk Loading on installations with Kerberos enabled and more than a single RegionServer. When multiple tables are involved in hosting a table's regions which are being bulk-loaded into, all but the RegionServer hosting the table's first Region will "leak" one DistributedFileSystem object onto the heap, never freeing that memory. Eventually, with enough bulk loads, this will create a situation for RegionServers where they have no free heap space and will either spend all time in JVM GC, lose their ZK session, or crash with an OutOfMemoryError.
1789
1790 The only mitigation for this issue is to periodically restart RegionServers. All earlier versions of HBase 2.x are subject to this issue (2.0.x, \<=2.1.8, \<=2.2.3)
1791
1792
1793 ---
1794
1795 * [HBASE-23286](https://issues.apache.org/jira/browse/HBASE-23286) | *Major* | **Improve MTTR: Split WAL to HFile**
1796
1797 Add a new feature to improve MTTR which have 3 steps to failover:
1798 1. Read WAL and write HFile to region’s column family’s recovered.hfiles directory.
1799 2. Open region.
1800 3. Bulkload the recovered.hfiles for every column family.
1801
1802 Compared to DLS(distributed log split), this feature will reduce region open time significantly.
1803
1804 Config hbase.wal.split.to.hfile to true to enable this featue.
1805
1806
1807 ---
1808
1809 * [HBASE-23619](https://issues.apache.org/jira/browse/HBASE-23619) | *Trivial* | **Use built-in formatting for logging in hbase-zookeeper**
1810
1811 Changed the logging in hbase-zookeeper to use built-in formatting
1812
1813
1814 ---
1815
1816 * [HBASE-23628](https://issues.apache.org/jira/browse/HBASE-23628) | *Minor* | **Replace Apache Commons Digest Base64 with JDK8 Base64**
1817
1818 From the PR:
1819
1820 "Yes. The two create the same output... I just wrote a small test suite to increase my confidence on that. I generated many tens of millions of random byte patterns and compared the output of the two algorithms. They came back identical every time.
1821
1822 "Just in case any inquiring minds would like to know, there is no longer an encoding required when generating the strings. The JDK implementation specifically specifies that strings returned are StandardCharsets.ISO\_8859\_1. This does not change anything because UTF8 and ISO\_8859 overlap for the limited character set (64 characters) the encoding uses."
1823
1824
1825 ---
1826
1827 * [HBASE-23651](https://issues.apache.org/jira/browse/HBASE-23651) | *Major* | **Region balance throttling can be disabled**
1828
1829 Set hbase.balancer.max.balancing to a int value which \<=0 will disable region balance throttling.
1830
1831
1832 ---
1833
1834 * [HBASE-23588](https://issues.apache.org/jira/browse/HBASE-23588) | *Major* | **Cache index blocks and bloom blocks on write if CacheCompactedBlocksOnWrite is enabled**
1835
1836 If cacheOnWrite is enabled during flush or compaction, index and bloom blocks(with data blocks) would be automatically cached during write.
1837
1838
1839 ---
1840
1841 * [HBASE-23369](https://issues.apache.org/jira/browse/HBASE-23369) | *Major* | **Auto-close 'unknown' Regions reported as OPEN on RegionServers**
1842
1843 If a RegionServer reports a Region as OPEN in disagreement with Master's status on the Region, the Master now tells the RegionServer to silently close the Region.
1844
1845
1846 ---
1847
1848 * [HBASE-23596](https://issues.apache.org/jira/browse/HBASE-23596) | *Major* | **HBCKServerCrashProcedure can double assign**
1849
1850 Makes it so the recently added HBCKServerCrashProcedure -- the SCP that gets invoked when an operator schedules an SCP via hbck2 scheduleRecoveries command -- now works the same as SCP EXCEPT if master knows nothing of the scheduled servername. In this latter case, HBCKSCP will do a full scan of hbase:meta looking for instances of the passed servername. If any found it will attempt cleanup of hbase:meta references by reassigning any found OPEN or OPENING and by closing any in CLOSING state.
1851
1852 Used to fix instances of what the 'HBCK Report' page shows as 'Unknown Servers'.
1853
1854
1855 ---
1856
1857 * [HBASE-23624](https://issues.apache.org/jira/browse/HBASE-23624) | *Major* | **Add a tool to dump the procedure info in HFile**
1858
1859 Use ./hbase org.apache.hadoop.hbase.procedure2.store.region.HFileProcedurePrettyPrinter to run the tool.
1860
1861
1862 ---
1863
1864 * [HBASE-23590](https://issues.apache.org/jira/browse/HBASE-23590) | *Major* | **Update maxStoreFileRefCount to maxCompactedStoreFileRefCount**
1865
1866 RegionsRecoveryChore introduced as part of HBASE-22460 tries to reopen regions based on config: hbase.regions.recovery.store.file.ref.count.
1867 Region reopen needs to take into consideration all compacted away store files that belong to the region and not store files(non-compacted).
1868
1869 Fixed this bug as part of this Jira.
1870 Updated description for corresponding configs:
1871
1872 1. hbase.master.regions.recovery.check.interval :
1873
1874 Regions Recovery Chore interval in milliseconds. This chore keeps running at this interval to find all regions with configurable max store file ref count and reopens them. Defaults to 20 mins
1875
1876 2. hbase.regions.recovery.store.file.ref.count :
1877
1878 Very large number of ref count on a compacted store file indicates that it is a ref leak on that object(compacted store file). Such files can not be removed after it is invalidated via compaction. Only way to recover in such scenario is to reopen the region which can release all resources, like the refcount, leases, etc. This config represents Store files Ref Count threshold value considered for reopening regions. Any region with compacted store files ref count \> this value would be eligible for reopening by master. Here, we get the max refCount among all refCounts on all compacted away store files that belong to a particular region. Default value -1 indicates this feature is turned off. Only positive integer value should be provided to enable this feature.
1879
1880
1881 ---
1882
1883 * [HBASE-23618](https://issues.apache.org/jira/browse/HBASE-23618) | *Major* | **Add a tool to dump procedure info in the WAL file**
1884
1885 Use ./hbase org.apache.hadoop.hbase.procedure2.store.region.WALProcedurePrettyPrinter to run the tool.
1886
1887
1888 ---
1889
1890 * [HBASE-23617](https://issues.apache.org/jira/browse/HBASE-23617) | *Major* | **Add a stress test tool for region based procedure store**
1891
1892 Use ./hbase org.apache.hadoop.hbase.procedure2.store.region.RegionProcedureStorePerformanceEvaluation to run the tool.
1893
1894
1895 ---
1896
1897 * [HBASE-23326](https://issues.apache.org/jira/browse/HBASE-23326) | *Critical* | **Implement a ProcedureStore which stores procedures in a HRegion**
1898
1899 Use a region based procedure store to replace the old customized WAL based procedure store. The procedure data migration is done automatically during upgrading. After upgrading, the MasterProcWALs directory will be deleted and a new MasterProc directory will be created. And notice that a region will still write WAL so we still have WAL files and they will be moved to the oldWALs directory. The file name is mostly like a normal WAL file, and the only difference is that it is ended with "$masterproc$".
1900
1901
1902 ---
1903
1904 * [HBASE-23320](https://issues.apache.org/jira/browse/HBASE-23320) | *Major* | **Upgrade surefire plugin to 3.0.0-M4**
1905
1906 Bumped surefire plugin to 3.0.0-M4
1907
1908
1909 ---
1910
1911 * [HBASE-20461](https://issues.apache.org/jira/browse/HBASE-20461) | *Major* | **Implement fsync for AsyncFSWAL**
1912
1913 Now AsyncFSWAL also supports Durability.FSYNC\_WAL.
1914
1915
1916 ---
1917
1918 * [HBASE-23066](https://issues.apache.org/jira/browse/HBASE-23066) | *Minor* | **Create a config that forces to cache blocks on compaction**
1919
1920 The configuration 'hbase.rs.cacheblocksonwrite' was used to enable caching the blocks on write. But purposefully we were not caching the blocks when we do compaction (since it may be very aggressive) as the caching happens as and when the writer completes a block.
1921 In cloud environments since they have bigger sized caches - though they try to enable 'hbase.rs.prefetchblocksonopen' (non - aggressive way of caching the blocks proactively on reader creation) it does not help them because it takes time to cache the compacted blocks.
1922 This feature creates a new configuration  'hbase.rs.cachecompactedblocksonwrite' which when set to 'true' will enable the blocks created out of compaction.
1923 Remember that since it is aggressive caching the user should be having enough cache space - if not it may lead to other active blocks getting evicted.
1924 From the shell this can be enabled by using the option per Column Family also by using the below format
1925 {code}
1926 create 't1', 'f1', {NUMREGIONS =\> 15, SPLITALGO =\> 'HexStringSplit', CONFIGURATION =\> {'hbase.rs.cachecompactedblocksonwrite' =\> 'true'}}
1927 {code}
1928
1929
1930 ---
1931
1932 * [HBASE-23239](https://issues.apache.org/jira/browse/HBASE-23239) | *Major* | **Reporting on status of backing MOB files from client-facing cells**
1933
1934 <!-- markdown -->
1935
1936 Users of the MOB feature can now use the `mobrefs` utility to get statistics about data in the MOB system and verify the health of backing files on HDFS.
1937
1938 ```
1939 HADOOP_CLASSPATH=/etc/hbase/conf:$(hbase mapredcp) yarn jar \
1940     /some/path/to/hbase-shaded-mapreduce.jar mobrefs mobrefs-report-output some_table foo
1941 ```
1942
1943 See javadocs of the class `MobRefReporter` for more details.
1944
1945 the reference guide has added some information about MOB internals and troubleshooting.
1946
1947
1948 ---
1949
1950 * [HBASE-23549](https://issues.apache.org/jira/browse/HBASE-23549) | *Minor* | **Document steps to disable MOB for a column family**
1951
1952 The reference guide now includes a walk through of disabling the MOB feature if needed while maintaining availability.
1953
1954
1955 ---
1956
1957 * [HBASE-23582](https://issues.apache.org/jira/browse/HBASE-23582) | *Minor* | **Unbalanced braces in string representation of table descriptor**
1958
1959 Fixed unbalanced braces in string representation within HBase shell
1960
1961
1962 ---
1963
1964 * [HBASE-23293](https://issues.apache.org/jira/browse/HBASE-23293) | *Minor* | **[REPLICATION] make ship edits timeout configurable**
1965
1966 The default rpc timeout for ReplicationSourceShipper#shipEdits is 60s, when bulkload replication enabled, timeout exception may be occurred.
1967 Now we can conf the timeout value through replication.source.shipedits.timeout, and it’s adaptive.
1968
1969
1970 ---
1971
1972 * [HBASE-23312](https://issues.apache.org/jira/browse/HBASE-23312) | *Major* | **HBase Thrift SPNEGO configs (HBASE-19852) should be backwards compatible**
1973
1974 The newer HBase Thrift SPNEGO configs should not be required. The hbase.thrift.spnego.keytab.file and hbase.thrift.spnego.principal configs will fall back to the hbase.thrift.keytab.file and hbase.thrift.kerberos.principal original configs. The older configs will log a deprecation warning. It is preferred to new the newer SPNEGO configurations.
1975
1976
1977 ---
1978
1979 * [HBASE-22969](https://issues.apache.org/jira/browse/HBASE-22969) | *Minor* | **A new binary component comparator(BinaryComponentComparator) to perform comparison of arbitrary length and position**
1980
1981 With BinaryComponentCompartor applications will be able to design diverse and powerful set of filters for rows and columns. See https://issues.apache.org/jira/browse/HBASE-22969 for example. In general, the comparator can be used with any filter taking ByteArrayComparable. As of now, following filters take ByteArrayComparable:
1982
1983 1. RowFilter
1984 2. ValueFilter
1985 3. QualifierFilter
1986 4. FamilyFilter
1987 5. ColumnValueFilter
1988
1989
1990 ---
1991
1992 * [HBASE-23234](https://issues.apache.org/jira/browse/HBASE-23234) | *Major* | **Provide .editorconfig based on checkstyle configuration**
1993
1994 Adds a .editorconfig file with configurations populated by IntelliJ, based on our checkstyle configuration. There's lots of IntelliJ-specific configs in here that I assume are not replicated to Eclipse or Netbeans users. Any devs using those tools should push whatever updates they see fit, but please start with the checkstyle configs as the origin of truth.
1995
1996
1997 ---
1998
1999 * [HBASE-23322](https://issues.apache.org/jira/browse/HBASE-23322) | *Minor* | **[hbck2] Simplification on HBCKSCP scheduling**
2000
2001 An hbck2 scheduleRecoveries will run a subclass of ServerCrashProcedure which asks Master what Regions were on the dead Server but it will also do a hbase:meta table scan to see if any vestiges of the old Server remain (for the case where an SCP failed mid-point leaving references in place or where Master and hbase:meta deviated in accounting).
2002
2003
2004 ---
2005
2006 * [HBASE-23321](https://issues.apache.org/jira/browse/HBASE-23321) | *Minor* | **[hbck2] fixHoles of fixMeta doesn't update in-memory state**
2007
2008 If holes in hbase:meta, hbck2 fixMeta now will update Master in-memory state so you do not need to restart master just so you can assign the new hole-bridging regions.
2009
2010
2011 ---
2012
2013 * [HBASE-23282](https://issues.apache.org/jira/browse/HBASE-23282) | *Major* | **HBCKServerCrashProcedure for 'Unknown Servers'**
2014
2015 hbck2 scheduleRecoveries will now run a SCP that also looks in hbase:meta for any references to the scheduled server -- not just consult Master in-memory state -- just in case vestiges of the server are leftover in hbase:meta
2016
2017
2018 ---
2019
2020 * [HBASE-19450](https://issues.apache.org/jira/browse/HBASE-19450) | *Minor* | **Add log about average execution time for ScheduledChore**
2021
2022 <!-- markdown -->
2023 HBase internal chores now log a moving average of how long execution of each chore takes at `INFO` level for the logger `org.apache.hadoop.hbase.ScheduledChore`.
2024
2025 Such messages will happen at most once per five minutes.
2026
2027
2028 ---
2029
2030 * [HBASE-23250](https://issues.apache.org/jira/browse/HBASE-23250) | *Minor* | **Log message about CleanerChore delegate initialization should be at INFO**
2031
2032 CleanerChore delegate initialization is now logged at INFO level instead of DEBUG
2033
2034
2035 ---
2036
2037 * [HBASE-23243](https://issues.apache.org/jira/browse/HBASE-23243) | *Major* | **[pv2] Filter out SUCCESS procedures; on decent-sized cluster, plethora overwhelms problems**
2038
2039 The 'Procedures & Locks' tab in Master UI only displays problematic Procedures now (RUNNABLE, WAITING-TIMEOUT, etc.). It no longer notes procedures whose state is SUCCESS.
2040
2041
2042 ---
2043
2044 * [HBASE-23227](https://issues.apache.org/jira/browse/HBASE-23227) | *Blocker* | **Upgrade jackson-databind to 2.9.10.1 to avoid recent CVEs**
2045
2046 <!-- markdown -->
2047
2048 the Apache HBase REST Proxy now uses Jackson Databind version 2.9.10.1 to address the following CVEs
2049
2050   - CVE-2019-16942
2051   - CVE-2019-16943
2052
2053 Users of prior releases with Jackson Databind 2.9.10 are advised to either upgrade to this release or to upgrade their local Jackson Databind jar directly.
2054
2055
2056 ---
2057
2058 * [HBASE-23222](https://issues.apache.org/jira/browse/HBASE-23222) | *Critical* | **Better logging and mitigation for MOB compaction failures**
2059
2060 <!-- markdown -->
2061
2062 The MOB compaction process in the HBase Master now logs more about its activity.
2063
2064 In the event that you run into the problems described in HBASE-22075, there is a new HFileCleanerDelegate that will stop all removal of MOB hfiles from the archive area. It can be configured by adding `org.apache.hadoop.hbase.mob.ManualMobMaintHFileCleaner` to the list configured for `hbase.master.hfilecleaner.plugins`. This new cleaner delegate will cause your archive area to grow unbounded; you will have to manually prune files which may be prohibitively complex. Consider if your use case will allow you to mitigate by disabling mob compactions instead.
2065
2066 Caveats:
2067 * Be sure the list of cleaner delegates still includes the default cleaners you will likely need: ttl, snapshot, and hlink.
2068 * Be mindful that if you enable this cleaner delegate then there will be *no* automated process for removing these mob hfiles. You should see a single region per table in `%hbase_root%/archive` that accumulates files over time. You will have to determine which of these files are safe or not to remove.
2069 * You should list this cleaner delegate after the snapshot and hlink delegates so that you can enable sufficient logging to determine when an archived mob hfile is needed by those subsystems. When set to `TRACE` logging, the CleanerChore logger will include archive retention decision justifications.
2070 * If your use case creates a large number of uniquely named tables, this new delegate will cause memory pressure on the master.
2071
2072
2073 ---
2074
2075 * [HBASE-15519](https://issues.apache.org/jira/browse/HBASE-15519) | *Major* | **Add per-user metrics**
2076
2077 Adds per-user metrics for reads/writes to each RegionServer. These metrics are exported by default. hbase.regionserver.user.metrics.enabled can be used to disable the feature if desired for any reason.
2078
2079
2080 ---
2081
2082 * [HBASE-22460](https://issues.apache.org/jira/browse/HBASE-22460) | *Minor* | **Reopen a region if store reader references may have leaked**
2083
2084 Leaked store files can not be removed even after it is invalidated via compaction. A reasonable mitigation for a reader reference leak would be a fast reopen of the region on the same server.
2085
2086 Configs:
2087
2088 1. hbase.master.regions.recovery.check.interval :
2089
2090 Regions Recovery Chore interval in milliseconds. This chore keeps running at this interval to find all regions with configurable max store file ref count and reopens them. Defaults to 20 mins
2091
2092 2. hbase.regions.recovery.store.file.ref.count :
2093
2094 This config represents Store files Ref Count threshold value considered for reopening regions. Any region with store files ref count \> this value would be eligible for reopening by master. Default value -1 indicates this feature is turned off. Only positive integer value should be provided to enable this feature.
2095
2096
2097 ---
2098
2099 * [HBASE-23172](https://issues.apache.org/jira/browse/HBASE-23172) | *Minor* | **HBase Canary region success count metrics reflect column family successes, not region successes**
2100
2101 Added a comment to make clear that read/write success counts are tallying column family success counts, not region success counts.
2102
2103 Additionally, the region read and write latencies previously only stored the latencies of the last column family of the region reads/writes. This has been fixed by using a map of each region to a list of read and write latency values.
2104
2105
2106 ---
2107
2108 * [HBASE-23177](https://issues.apache.org/jira/browse/HBASE-23177) | *Major* | **If fail to open reference because FNFE, make it plain it is a Reference**
2109
2110 Changes the message on the FNFE exception thrown when the file a Reference points to is missing; the message now includes detail on Reference as well as pointed-to file so can connect how FNFE relates to region open.
2111
2112
2113 ---
2114
2115 * [HBASE-20626](https://issues.apache.org/jira/browse/HBASE-20626) | *Major* | **Change the value of "Requests Per Second" on WEBUI**
2116
2117 Use 'totalRowActionRequestCount' to calculate QPS on web UI.
2118
2119
2120 ---
2121
2122 * [HBASE-22874](https://issues.apache.org/jira/browse/HBASE-22874) | *Critical* | **Define a public interface for Canary and move existing implementation to LimitedPrivate**
2123
2124 <!-- markdown -->
2125 Downstream users who wish to programmatically check the health of their HBase cluster may now rely on a public interface derived from the previously private implementation of the canary cli tool. The interface is named `Canary` and can be found in the user facing javadocs.
2126
2127 Downstream users who previously relied on the invoking the canary via the Java classname (either on the command line or programmatically) will need to change how they do so because the non-public implementation has moved.
2128
2129
2130 ---
2131
2132 * [HBASE-23035](https://issues.apache.org/jira/browse/HBASE-23035) | *Major* | **Retain region to the last RegionServer make the failover slower**
2133
2134 Since 2.0.0，when one regionserver crashed and back online again, AssignmentManager will retain the region locations and try assign the regions to this regionserver(same host:port with the crashed one) again. But for 1.x.x, the behavior is round-robin assignment for the regions belong to the crashed regionserver. This jira change the "retain" assignment to round-robin assignment, which is same with 1.x.x version. This change will make the failover faster and improve availability.
2135
2136
2137 ---
2138
2139 * [HBASE-23046](https://issues.apache.org/jira/browse/HBASE-23046) | *Minor* | **Remove compatibility case from truncate command**
2140
2141 Remove backward compatibility from \`truncate\` and \`truncate\_preserve\` shell commands. This means that these commands from HBase Clients are not compatible with pre-0.99 HBase clusters.
2142
2143
2144 ---
2145
2146 * [HBASE-23040](https://issues.apache.org/jira/browse/HBASE-23040) | *Minor* | **region mover gives NullPointerException instead of saying a host isn't in the cluster**
2147
2148 giving the region mover "unload" command a region server name that isn't recognized by the cluster results in a "I don't know about that host" message instead of a NPE.
2149
2150 set log level to DEBUG if you'd like the region mover to log the set of region server names it got back from the cluster.
2151
2152
2153 ---
2154
2155 * [HBASE-21874](https://issues.apache.org/jira/browse/HBASE-21874) | *Major* | **Bucket cache on Persistent memory**
2156
2157 Added a new IOEngine type for Bucket cache ie Persistent memory. In order to use BC over pmem configure IOEngine as
2158 \<property\>
2159     \<name\>hbase.bucketcache.ioengine\</name\>
2160     \<value\> pmem:///path in persistent memory \</value\>
2161   \</property\>
2162
2163
2164 ---
2165
2166 * [HBASE-22760](https://issues.apache.org/jira/browse/HBASE-22760) | *Major* | **Stop/Resume Snapshot Auto-Cleanup activity with shell command**
2167
2168 By default, snapshot auto cleanup based on TTL would be enabled for any new cluster. At any point in time, if snapshot cleanup is supposed to be stopped due to some snapshot restore activity or any other reason, it is advisable to disable it using shell command:
2169 hbase\> snapshot\_cleanup\_switch false
2170
2171 We can re-enable it using:
2172 hbase\> snapshot\_cleanup\_switch true
2173
2174 We can query whether snapshot auto cleanup is enabled for cluster using:
2175 hbase\> snapshot\_cleanup\_enabled
2176
2177
2178 ---
2179
2180 * [HBASE-22796](https://issues.apache.org/jira/browse/HBASE-22796) | *Major* | **[HBCK2] Add fix of overlaps to fixMeta hbck Service**
2181
2182 Adds fix of overlaps to the fixMeta hbck service method. Uses the bulk-merge facility. Merges a max of 10 at a time. Set hbase.master.metafixer.max.merge.count to higher if you want to do more than 10 in the one go.
2183
2184
2185 ---
2186
2187 * [HBASE-21745](https://issues.apache.org/jira/browse/HBASE-21745) | *Critical* | **Make HBCK2 be able to fix issues other than region assignment**
2188
2189 This issue adds via its subtasks:
2190
2191  \* An 'HBCK Report' page to the Master UI added by HBASE-22527+HBASE-22709+HBASE-22723+ (since 2.1.6, 2.2.1, 2.3.0). Lists consistency or anomalies found via new hbase:meta consistency checking extensions added to CatalogJanitor (holes, overlaps, bad servers) and by a new 'HBCK chore' that runs at a lesser periodicity that will note filesystem orphans and overlaps as well as the following conditions:
2192  \*\* Master thought this region opened, but no regionserver reported it.
2193  \*\* Master thought this region opened on Server1, but regionserver reported Server2
2194  \*\* More than one regionservers reported opened this region
2195  Both chores can be triggered from the shell to regenerate ‘new’ reports.
2196  \* Means of scheduling a ServerCrashProcedure (HBASE-21393).
2197  \* An ‘offline’ hbase:meta rebuild (HBASE-22680).
2198  \* Offline replace of hbase.version and hbase.id
2199  \* Documentation on how to use completebulkload tool to ‘adopt’ orphaned data found by new HBCK2 ‘filesystem’ check (see below) and ‘HBCK chore’ (HBASE-22859)
2200  \* A ‘holes’ and ‘overlaps’ fix that runs in the master that uses new bulk-merge facility to collapse many overlaps in the one go.
2201  \* hbase-operator-tools HBCK2 client tool got a bunch of additions:
2202  \*\* A specialized 'fix' for the case where operators ran old hbck 'offlinemeta' repair and destroyed their hbase:meta; it ties together holes in meta with orphaned data in the fs (HBASE-22567)
2203  \*\* A ‘filesystem’ command that reports on orphan data as well as bad references and hlinks with a ‘fix’ for the latter two options (based on hbck1 facility updated).
2204  \*\* Adds back the ‘replication’ fix facility from hbck1 (HBASE-22717)
2205
2206 The compound result is that hbck2 is now in excess of hbck1 abilities. The provided functionality is disaggregated as per the hbck2 philosophy of providing 'plumbing' rather than 'porcelain' so there is work to do still adding fix-it playbooks, scripting across outages, and automation.
2207
2208
2209 ---
2210
2211 * [HBASE-22802](https://issues.apache.org/jira/browse/HBASE-22802) | *Major* | **Avoid temp ByteBuffer allocation in FileIOEngine#read**
2212
2213 HBASE-21879 introduces a utility class (org.apache.hadoop.hbase.io.ByteBuffAllocator) used for allocating/freeing ByteBuffers from/to NIO ByteBuffer pool, when BucketCache enabled with file or mmap engine, we will use this ByteBuffer pool to avoid temp ByteBuffer allocation a lot.
2214
2215
2216 ---
2217
2218 * [HBASE-11062](https://issues.apache.org/jira/browse/HBASE-11062) | *Major* | **hbtop**
2219
2220 Introduces hbtop that's a real-time monitoring tool for HBase like Unix's top command. See the ref guide for the details: https://hbase.apache.org/book.html#hbtop
2221
2222
2223 ---
2224
2225 * [HBASE-21879](https://issues.apache.org/jira/browse/HBASE-21879) | *Major* | **Read HFile's block to ByteBuffer directly instead of to byte for reducing young gc purpose**
2226
2227 Before this issue, read path was 100% offheap when block is in the BucketCache. But if a cache miss, then the RS needs to read the block via an on-heap API which causes high young-GC pressure.
2228
2229 This issue adds reading the block via offheap even if reading the block from filesystem directly.  It requires hadoop version(\>=2.9.3) but can also work with older hadoop versions (all works but we continue to read block onheap). It also requires HBASE-21946 which is not yet in place as of this writing/hbase-2.3.0.
2230
2231 We have written a careful doc about the implementation, performance and practice here: https://docs.google.com/document/d/1xSy9axGxafoH-Qc17zbD2Bd--rWjjI00xTWQZ8ZwI\_E/edit#heading=h.nch5d72p27ex
2232
2233
2234 ---
2235
2236 * [HBASE-22618](https://issues.apache.org/jira/browse/HBASE-22618) | *Major* | **added the possibility to load custom cost functions**
2237
2238 <!-- markdown -->
2239 Extends `StochasticLoadBalancer` to support user-provided cost function. These are loaded in addition to the default set of cost functions. Custom function implementations must extend `StochasticLoadBalancer$CostFunction`. Enable any additional functions by placing them on the master class path and configuring `hbase.master.balancer.stochastic.additionalCostFunctions` with a comma-separated list of fully-qualified class names.
2240
2241
2242 ---
2243
2244 * [HBASE-22867](https://issues.apache.org/jira/browse/HBASE-22867) | *Critical* | **The ForkJoinPool in CleanerChore will spawn thousands of threads in our cluster with thousands table**
2245
2246 Replace the ForkJoinPool in CleanerChore by ThreadPoolExecutor which can limit the spawn thread size and avoid  the master GC frequently.  The replacement is an internal implementation in CleanerChore,  so no config key change, the upstream users can just upgrade the hbase master without any other change.
2247
2248
2249 ---
2250
2251 * [HBASE-22810](https://issues.apache.org/jira/browse/HBASE-22810) | *Major* | **Initialize an separate ThreadPoolExecutor for taking/restoring snapshot**
2252
2253 Introduced a new config key for the snapshot taking/restoring operations at master side:  hbase.master.executor.snapshot.threads, its default value is 3.  means we can have 3 snapshot operations running at the same time.
2254
2255
2256 ---
2257
2258 * [HBASE-22863](https://issues.apache.org/jira/browse/HBASE-22863) | *Major* | **Avoid Jackson versions and dependencies with known CVEs**
2259
2260 1. Stopped exposing vulnerable Jackson1 dependencies so that downstreamers would not pull it in from HBase.
2261 2. However, since Hadoop requires some Jackson1 dependencies, put vulnerable Jackson mapper at test scope in some HBase modules and hence, HBase tarball created by hbase-assembly contains Jackson1 mapper jar in lib. Still, downsteam applications can't pull in Jackson1 from HBase.
2262
2263
2264 ---
2265
2266 * [HBASE-22841](https://issues.apache.org/jira/browse/HBASE-22841) | *Major* | **TimeRange's factory functions do not support ranges, only \`allTime\` and \`at\`**
2267
2268 Add serveral API in TimeRange class for avoiding using the deprecated TimeRange constructor:
2269 \* TimeRange#from: Represents the time interval [minStamp, Long.MAX\_VALUE)
2270 \* TimeRange#until: Represents the time interval [0, maxStamp)
2271 \* TimeRange#between: Represents the time interval [minStamp, maxStamp)
2272
2273
2274 ---
2275
2276 * [HBASE-22833](https://issues.apache.org/jira/browse/HBASE-22833) | *Minor* | **MultiRowRangeFilter should provide a method for creating a filter which is functionally equivalent to multiple prefix filters**
2277
2278 Provide a public method in MultiRowRangeFilter class to speed the requirement of filtering with multiple row prefixes, it will expand the row prefixes as multiple rowkey ranges by MultiRowRangeFilter, it's more efficient.
2279 {code}
2280 public MultiRowRangeFilter(byte[][] rowKeyPrefixes);
2281 {code}
2282
2283
2284 ---
2285
2286 * [HBASE-22856](https://issues.apache.org/jira/browse/HBASE-22856) | *Major* | **HBASE-Find-Flaky-Tests fails with pip error**
2287
2288 Update the base docker image to ubuntu 18.04 for the find flaky tests jenkins job.
2289
2290
2291 ---
2292
2293 * [HBASE-22771](https://issues.apache.org/jira/browse/HBASE-22771) | *Major* | **[HBCK2] fixMeta method and server-side support**
2294
2295 Adds a fixMeta method to hbck Service. Fixes holes in hbase:meta. Follow-up to fix overlaps. See HBASE-22567 also.
2296
2297 Follow-on is adding a client-side to hbase-operator-tools that can exploit this new addition (HBASE-22825)
2298
2299
2300 ---
2301
2302 * [HBASE-22777](https://issues.apache.org/jira/browse/HBASE-22777) | *Major* | **Add a multi-region merge (for fixing overlaps, etc.)**
2303
2304 Changes merge so you can merge more than two regions at a time.  Currently only available inside HBase. HBASE-22827, a follow-on, is about exposing the facility in the Admin API (and then via the shell).
2305
2306
2307 ---
2308
2309 * [HBASE-15666](https://issues.apache.org/jira/browse/HBASE-15666) | *Critical* | **shaded dependencies for hbase-testing-util**
2310
2311 New shaded artifact for testing: hbase-shaded-testing-util.
2312
2313
2314 ---
2315
2316 * [HBASE-22776](https://issues.apache.org/jira/browse/HBASE-22776) | *Major* | **Rename config names in user scan snapshot feature**
2317
2318 After HBASE-22776, the steps to config user scan snapshot feature is as followings:
2319 1. Check HDFS configuration
2320 2. Add master coprocessor:
2321     hbase.coprocessor.master.classes=
2322     “org.apache.hadoop.hbase.security.access.AccessController,
2323 org.apache.hadoop.hbase.security.access.SnapshotScannerHDFSAclController”
2324 3. Enable this feature:
2325     hbase.acl.sync.to.hdfs.enable=true
2326 4. Modify table scheme to enable this feature for a table:
2327     alter 't1', CONFIGURATION =\> {'hbase.acl.sync.to.hdfs.enable' =\> 'true'}
2328
2329
2330 ---
2331
2332 * [HBASE-22539](https://issues.apache.org/jira/browse/HBASE-22539) | *Blocker* | **WAL corruption due to early DBBs re-use when Durability.ASYNC\_WAL is used**
2333
2334 We found a critical bug which can lead to WAL corruption when Durability.ASYNC\_WAL is used. The reason is that we release a ByteBuffer before actually persist the content into WAL file.
2335
2336 The problem maybe lead to several errors, for example, ArrayIndexOfOutBounds when replaying WAL. This is because that the ByteBuffer is reused by others.
2337
2338 ERROR org.apache.hadoop.hbase.executor.EventHandler: Caught throwable while processing event RS\_LOG\_REPLAY
2339 java.lang.ArrayIndexOutOfBoundsException: 18056
2340         at org.apache.hadoop.hbase.KeyValue.getFamilyLength(KeyValue.java:1365)
2341         at org.apache.hadoop.hbase.KeyValue.getFamilyLength(KeyValue.java:1358)
2342         at org.apache.hadoop.hbase.PrivateCellUtil.matchingFamily(PrivateCellUtil.java:735)
2343         at org.apache.hadoop.hbase.CellUtil.matchingFamily(CellUtil.java:816)
2344         at org.apache.hadoop.hbase.wal.WALEdit.isMetaEditFamily(WALEdit.java:143)
2345         at org.apache.hadoop.hbase.wal.WALEdit.isMetaEdit(WALEdit.java:148)
2346         at org.apache.hadoop.hbase.wal.WALSplitter.splitLogFile(WALSplitter.java:297)
2347         at org.apache.hadoop.hbase.wal.WALSplitter.splitLogFile(WALSplitter.java:195)
2348         at org.apache.hadoop.hbase.regionserver.SplitLogWorker$1.exec(SplitLogWorker.java:100)
2349
2350 And may even cause segmentation fault and crash the JVM directly. You will see a hs\_err\_pidXXX.log file and usually the problem is SIGSEGV. This is usually because that the ByteBuffer has already been returned to the OS and used for other purpose.
2351
2352 The problem has been reported several times in the past and this time Wellington Ramos Chevreuil provided the full logs and deeply analyzed the logs so we can find the root cause. And Lijin Bin figured out that the problem may only happen when Durability.ASYNC\_WAL is used. Thanks to them.
2353
2354 The problem only effects the 2.x releases, all users are highly recommand to upgrade to a release which has this fix in, especially that if you use Durability.ASYNC\_WAL.
2355
2356
2357 ---
2358
2359 * [HBASE-22737](https://issues.apache.org/jira/browse/HBASE-22737) | *Major* | **Add a new admin method and shell cmd to trigger the hbck chore to run**
2360
2361 Add a new method runHbckChore in Hbck interface and a new shell cmd hbck\_chore\_run to request HBCK chore to run at master side.
2362
2363
2364 ---
2365
2366 * [HBASE-22741](https://issues.apache.org/jira/browse/HBASE-22741) | *Major* | **Show catalogjanitor consistency complaints in new 'HBCK Report' page**
2367
2368 Adds a "CatalogJanitor hbase:meta Consistency Issues" section to the new 'HBCK Report' page added by HBASE-22709. This section is empty unless the most recent CatalogJanitor scan turned up problems. If so, will show table of issues found.
2369
2370
2371 ---
2372
2373 * [HBASE-22723](https://issues.apache.org/jira/browse/HBASE-22723) | *Major* | **Have CatalogJanitor report holes and overlaps; i.e. problems it sees when doing its regular scan of hbase:meta**
2374
2375 When CatalogJanitor runs, it now checks for holes, overlaps, empty info:regioninfo columns and bad servers. Dumps findings into log. Follow-up adds report to new 'HBCK Report' linked off the Master UI.
2376
2377 NOTE: All features but the badserver check made it into branch-2.1 and branch-2.0 backports.
2378
2379
2380 ---
2381
2382 * [HBASE-22714](https://issues.apache.org/jira/browse/HBASE-22714) | *Trivial* | **BuffferedMutatorParams opertationTimeOut() is misspelt**
2383
2384 The misspelled BufferedMutatorParams.opertationTimeout method has been marked as deprecated, and will be removed in 4.0.0. Please use the BufferedMutatorParams.operationTimeout method instead.
2385
2386
2387 ---
2388
2389 * [HBASE-22580](https://issues.apache.org/jira/browse/HBASE-22580) | *Major* | **Add a table attribute to make user scan snapshot feature configurable for table**
2390
2391 If a table user scan snapshots of the table, please config the following table scheme attribute to make granted users' ACLs are added to hfiles:
2392 alter 't1', CONFIGURATION =\> {'hbase.user.scan.snapshot.enable' =\> 'true'}
2393
2394
2395 ---
2396
2397 * [HBASE-22709](https://issues.apache.org/jira/browse/HBASE-22709) | *Major* | **Add a chore thread in master to do hbck checking and display results in 'HBCK Report' page**
2398
2399 1. Add a new chore thread in master to do hbck checking
2400 2. Add a new web ui "HBCK Report" page to display checking results.
2401
2402 This feature is enabled by default. And the hbck chore run per 60 minutes by default. You can config "hbase.master.hbck.checker.interval" to a value lesser than or equal to 0 for disabling the chore.
2403
2404 Notice: the config "hbase.master.hbck.checker.interval" was renamed to "hbase.master.hbck.chore.interval" in HBASE-22737.
2405
2406
2407 ---
2408
2409 * [HBASE-22578](https://issues.apache.org/jira/browse/HBASE-22578) | *Major* | **HFileCleaner should not delete empty ns/table directories used for user san snapshot feature**
2410
2411 The HFileCleaner will clean the empty directories under archive, but if enable user scan snaphot feature, the user ACLs are set at there directories, so please config the following cleaner to make the directories with user ACLs not be cleaned:
2412 hbase.master.hfilecleaner.plugins=org.apache.hadoop.hbase.security.access.SnapshotScannerHDFSAclCleaner
2413
2414
2415 ---
2416
2417 * [HBASE-22722](https://issues.apache.org/jira/browse/HBASE-22722) | *Blocker* | **Upgrade jackson databind dependencies to 2.9.9.1**
2418
2419 Upgrade jackson databind dependency to 2.9.9.1 due to CVEs
2420
2421 https://nvd.nist.gov/vuln/detail/CVE-2019-12814
2422
2423 https://nvd.nist.gov/vuln/detail/CVE-2019-12384
2424
2425
2426 ---
2427
2428 * [HBASE-22527](https://issues.apache.org/jira/browse/HBASE-22527) | *Major* | **[hbck2] Add a master web ui to show the problematic regions**
2429
2430 Add a new master web UI to show the potentially problematic opened regions. There are three case:
2431 1. Master thought this region opened, but no regionserver reported it.
2432 2. Master thought this region opened on Server1, but regionserver reported Server2
2433 3. More than one regionservers reported opened this region
2434
2435
2436 ---
2437
2438 * [HBASE-22648](https://issues.apache.org/jira/browse/HBASE-22648) | *Minor* | **Snapshot TTL**
2439
2440 Feature: Take a Snapshot With TTL for auto-cleanup
2441
2442 Attribute:
2443 1. TTL
2444      - Specify TTL in sec while creating snapshot. e.g. snapshot 'mytable', 'snapshot1234', {TTL =\> 86400}  (snapshot to be auto-cleaned after 24 hr)
2445
2446 Configs:
2447 1. Default Snapshot TTL:
2448      - FOREVER by default
2449      - User specified Default TTL(sec) with config: hbase.master.snapshot.ttl
2450
2451 2. If Snapshot cleanup is supposed to be stopped due to some snapshot restore activity, disable it with config:
2452      - hbase.master.cleaner.snapshot.disable: "true"
2453     With this config, HMaster needs restart just like any other hbase-site config.
2454
2455
2456 For more details, see the section "Take a Snapshot With TTL" in the HBase Reference Guide.
2457
2458
2459 ---
2460
2461 * [HBASE-22610](https://issues.apache.org/jira/browse/HBASE-22610) | *Trivial* | **[BucketCache] Rename "hbase.offheapcache.minblocksize"**
2462
2463 The config point "hbase.offheapcache.minblocksize" was wrong and is now deprecated. The new config point is "hbase.blockcache.minblocksize".
2464
2465
2466 ---
2467
2468 * [HBASE-22690](https://issues.apache.org/jira/browse/HBASE-22690) | *Major* | **Deprecate / Remove OfflineMetaRepair in hbase-2+**
2469
2470 OfflineMetaRepair is no longer supported in HBase-2+. Please refer to https://hbase.apache.org/book.html#HBCK2
2471
2472 This tool is deprecated in 2.x and will be removed in 3.0.
2473
2474
2475 ---
2476
2477 * [HBASE-22673](https://issues.apache.org/jira/browse/HBASE-22673) | *Major* | **Avoid to expose protobuf stuff in Hbck interface**
2478
2479 Mark the Hbck#scheduleServerCrashProcedure(List\<HBaseProtos.ServerName\> serverNames) as deprecated. Use Hbck#scheduleServerCrashProcedures(List\<ServerName\> serverNames) instead.
2480
2481
2482 ---
2483
2484 * [HBASE-22617](https://issues.apache.org/jira/browse/HBASE-22617) | *Blocker* | **Recovered WAL directories not getting cleaned up**
2485
2486 In HBASE-20734 we moved the recovered.edits onto the wal file system but when constructing the directory we missed the BASE\_NAMESPACE\_DIR('data'). So when using the default config, you will find that there are lots of new directories at the same level with the 'data' directory.
2487
2488 In this issue, we add the BASE\_NAMESPACE\_DIR back, and also try our best to clean up the wrong directories. But we can only clean up the region level directories, so if you want a clean fs layout on HDFS you still need to manually delete the empty directories at the same level with 'data'.
2489
2490 The effect versions are 2.2.0, 2.1.[1-5], 1.4.[8-10], 1.3.[3-5].
2491
2492
2493 ---
2494
2495 * [HBASE-21995](https://issues.apache.org/jira/browse/HBASE-21995) | *Major* | **Add a coprocessor to set HDFS ACL for hbase granted user**
2496
2497 Add a coprocessor to set HDFS acls to make hbase granted users with READ permission have the access to scan snapshots.
2498 To use this feature, please make sure the HDFS config is set:
2499 dfs.namenode.acls.enabled=true
2500 fs.permissions.umask-mode=027
2501
2502 and set the HBase config:
2503 hbase.coprocessor.master.classes="org.apache.hadoop.hbase.security.access.AccessController,org.apache.hadoop.hbase.security.access.SnapshotScannerHDFSAclController"
2504 hbase.user.scan.snapshot.enable=true
2505
2506
2507 ---
2508
2509 * [HBASE-22596](https://issues.apache.org/jira/browse/HBASE-22596) | *Minor* | **[Chore] Separate the execution period between CompactionChecker and PeriodicMemStoreFlusher**
2510
2511 hbase.regionserver.compaction.check.period is used for controlling how often the compaction checker runs. If unset, will use hbase.server.thread.wakefrequency as default value.
2512
2513 hbase.regionserver.flush.check.period is used for controlling how ofter the flush checker runs. If unset, will use hbase.server.thread.wakefrequency as default value.
2514
2515
2516 ---
2517
2518 * [HBASE-22588](https://issues.apache.org/jira/browse/HBASE-22588) | *Major* | **Upgrade jaxws-ri dependency to 2.3.2**
2519
2520 <!-- markdown -->
2521
2522 When run with JDK11 HBase now uses more recent version of the jaxws reference implementation (v2.3.2).
2523
2524
2525 ---
2526
2527 * [HBASE-21536](https://issues.apache.org/jira/browse/HBASE-21536) | *Trivial* | **Fix completebulkload usage instructions**
2528
2529 Added completebulkload short name for BulkLoadHFilesTool to bin/hbase.
2530
2531
2532 ---
2533
2534 * [HBASE-22500](https://issues.apache.org/jira/browse/HBASE-22500) | *Blocker* | **Modify pom and jenkins jobs for hadoop versions**
2535
2536 Change the default hadoop-3 version to 3.1.2. Drop the support for the releases which are effected by CVE-2018-8029, see this email https://lists.apache.org/thread.html/3d6831c3893cd27b6850aea2feff7d536888286d588e703c6ffd2e82@%3Cuser.hadoop.apache.org%3E
2537
2538
2539 ---
2540
2541 * [HBASE-22459](https://issues.apache.org/jira/browse/HBASE-22459) | *Minor* | **Expose store reader reference count**
2542
2543 This change exposes the aggregate count of store reader references for a given store as 'storeRefCount' in region metrics and ClusterStatus.
2544
2545
2546 ---
2547
2548 * [HBASE-22469](https://issues.apache.org/jira/browse/HBASE-22469) | *Minor* | **replace md5 checksum in saveVersion script with sha512 for hbase version information**
2549
2550 The HBase "source checksum" now uses SHA512 instead of MD5.
2551
2552
2553 ---
2554
2555 * [HBASE-22148](https://issues.apache.org/jira/browse/HBASE-22148) | *Blocker* | **Provide an alternative to CellUtil.setTimestamp**
2556
2557 <!-- markdown -->
2558
2559 The `CellUtil.setTimestamp` method changes to be an API with audience `LimitedPrivate(COPROC)` in HBase 3.0. With that designation the API should remain stable within a given minor release line, but may change between minor releases.
2560
2561 Previously, this method was deprecated in HBase 2.0 for removal in HBase 3.0. Deprecation messages in HBase 2.y releases have been updated to indicate the expected API audience change.
2562
2563
2564 ---
2565
2566 * [HBASE-20782](https://issues.apache.org/jira/browse/HBASE-20782) | *Minor* | **Fix duplication of TestServletFilter.access**
2567
2568 The access method was used to the HttpServerFunctionalTest class as a common place.
2569
2570
2571 ---
2572
2573 * [HBASE-21991](https://issues.apache.org/jira/browse/HBASE-21991) | *Major* | **Fix MetaMetrics issues - [Race condition, Faulty remove logic], few improvements**
2574
2575 The class LossyCounting was unintentionally marked Public but was never intended to be part of our public API. This oversight has been corrected and LossyCounting is now marked as Private and going forward may be subject to additional breaking changes or removal without notice. If you have taken a dependency on this class we recommend cloning it locally into your project before upgrading to this release.
2576
2577
2578 ---
2579
2580 * [HBASE-22226](https://issues.apache.org/jira/browse/HBASE-22226) | *Trivial* | **Incorrect level for headings in asciidoc**
2581
2582 Warnings for level headings are corrected in the book for the HBase Incompatibilities section.
2583
2584
2585 ---
2586
2587 * [HBASE-20970](https://issues.apache.org/jira/browse/HBASE-20970) | *Major* | **Update hadoop check versions for hadoop3 in hbase-personality**
2588
2589 Add hadoop 3.0.3, 3.1.1 3.1.2 in our hadoop check jobs.
2590
2591
2592 ---
2593
2594 * [HBASE-21784](https://issues.apache.org/jira/browse/HBASE-21784) | *Major* | **Dump replication queue should show list of wal files ordered chronologically**
2595
2596 The DumpReplicationQueues tool will now list replication queues sorted in chronological order.
2597
2598
2599 ---
2600
2601 * [HBASE-21048](https://issues.apache.org/jira/browse/HBASE-21048) | *Major* | **Get LogLevel is not working from console in secure environment**
2602
2603 Support get\|set LogLevel in secure(kerberized) environment.
2604
2605
2606 ---
2607
2608 * [HBASE-22384](https://issues.apache.org/jira/browse/HBASE-22384) | *Minor* | **Formatting issues in administration section of book**
2609
2610 Fixes a formatting issue in the administration section of the book, where listing indentation were a little bit off.
2611
2612
2613 ---
2614
2615 * [HBASE-22377](https://issues.apache.org/jira/browse/HBASE-22377) | *Major* | **Provide API to check the existence of a namespace which does not require ADMIN permissions**
2616
2617 This change adds the new method listNamespaces to the Admin interface, which can be used to retrieve a list of the namespaces present in the schema as an unprivileged operation. Formerly the only available method for accomplishing this was listNamespaceDescriptors, which requires GLOBAL CREATE or ADMIN permissions.
2618
2619
2620 ---
2621
2622 * [HBASE-22399](https://issues.apache.org/jira/browse/HBASE-22399) | *Major* | **Change default hadoop-two.version to 2.8.x and remove the 2.7.x hadoop checks**
2623
2624 Now the default hadoop-two.version has been changed to 2.8.5, and all hadoop versions before 2.8.2(exclude) will not be supported any more.
2625
2626
2627 ---
2628
2629 * [HBASE-22392](https://issues.apache.org/jira/browse/HBASE-22392) | *Trivial* | **Remove extra/useless +**
2630
2631 Removed extra + in HRegion, HStore and LoadIncrementalHFiles for branch-2 and HRegion and HStore for branch-1.
2632
2633
2634 ---
2635
2636 * [HBASE-20494](https://issues.apache.org/jira/browse/HBASE-20494) | *Major* | **Upgrade com.yammer.metrics dependency**
2637
2638 Updated metrics core from 3.2.1 to 3.2.6.
2639
2640
2641 ---
2642
2643 * [HBASE-22358](https://issues.apache.org/jira/browse/HBASE-22358) | *Minor* | **Change rubocop configuration for method length**
2644
2645 The rubocop definition for the maximum method length was set to 75.
2646
2647
2648 ---
2649
2650 * [HBASE-22379](https://issues.apache.org/jira/browse/HBASE-22379) | *Minor* | **Fix Markdown for "Voting on Release Candidates" in book**
2651
2652 Fixes the formatting of the "Voting on Release Candidates" to actually show the quote and code formatting of the RAT check.
2653
2654
2655 ---
2656
2657 * [HBASE-20851](https://issues.apache.org/jira/browse/HBASE-20851) | *Minor* | **Change rubocop config for max line length of 100**
2658
2659 The rubocop configuration in the hbase-shell module now allows a line length with 100 characters, instead of 80 as before. For everything before 2.1.5 this change introduces rubocop itself.
2660
2661
2662 ---
2663
2664 * [HBASE-22301](https://issues.apache.org/jira/browse/HBASE-22301) | *Minor* | **Consider rolling the WAL if the HDFS write pipeline is slow**
2665
2666 This change adds new conditions for rolling the WAL for when syncs on the HDFS writer pipeline are perceived to be slow.
2667
2668 As before the configuration parameter hbase.regionserver.wal.slowsync.ms sets the slow sync warning threshold.
2669
2670 If we encounter hbase.regionserver.wal.slowsync.roll.threshold number of slow syncs (default 100) within the interval defined by hbase.regionserver.wal.slowsync.roll.interval.ms (default 1 minute), we will request a WAL roll.
2671
2672 Or, if the time for any sync exceeds the threshold set by hbase.regionserver.wal.roll.on.sync.ms (default 10 seconds) we will request a WAL roll immediately.
2673
2674 Operators can monitor how often these new thresholds result in a WAL roll by looking at newly added metrics to the WAL related metric group:
2675 \* slowSyncRollRequest - How many times a roll was requested due to sync too slow on the write pipeline.
2676
2677 Additionally, as a part of this change there are also additional metrics for existing reasons for a WAL roll:
2678 \* errorRollRequest - How many times a roll was requested due to I/O or other errors.
2679 \* sizeRollRequest - How many times a roll was requested due to file size roll threshold.
2680
2681
2682 ---
2683
2684 * [HBASE-21883](https://issues.apache.org/jira/browse/HBASE-21883) | *Minor* | **Enhancements to Major Compaction tool**
2685
2686 MajorCompactorTTL Tool allows to compact all regions in a table that have been TTLed out. This saves space on DFS and is useful for tables which are similar to time series data. This is typically scheduled to run frequently (say via cron) to cleanup old data on an ongoing basis.
2687
2688 RSGroupMajorCompactionTTL tool is similar to MajorCompactorTTL but runs at a region server group level. If multiple tables in an rsgroup are similar to time-series data, then it runs a single command to clean them up. As more tables are added/removed from rsgroup, it's easy to have a single command to take care of all of them.
2689
2690
2691 ---
2692
2693 * [HBASE-22054](https://issues.apache.org/jira/browse/HBASE-22054) | *Minor* | **Space Quota: Compaction is not working for super user in case of NO\_WRITES\_COMPACTIONS**
2694
2695 This change allows the system and superusers to initiate compactions, even when a space quota violation policy disallows compactions from happening. The original intent behind disallowing of compactions was to prevent end-user compactions from creating undue I/O load, not disallowing \*any\* compaction in the system.
2696
2697
2698 ---
2699
2700 * [HBASE-22083](https://issues.apache.org/jira/browse/HBASE-22083) | *Minor* | **move eclipse specific configs into a profile**
2701
2702 <!-- markdown -->
2703 Maven project integration for Eclipse has been isolated into a maven profile to ensure it only is active when in an Eclipse project.
2704
2705 Things should continue to behave the same for Eclipse users. If something should go wrong folks should manually activate the `eclipse-specific` profile.
2706
2707
2708 ---
2709
2710 * [HBASE-22307](https://issues.apache.org/jira/browse/HBASE-22307) | *Major* | **Deprecated Preemptive Fail Fast**
2711
2712 Deprecated Preemptive Fail Fast related constants in HConstants, the support of this feature will be removed in 3.0.0 so use these constants will have no effect for 3.0.0+ releases. And the constants will be kept till 4.0.0.
2713
2714 Users can use 'hbase.client.perserver.requests.threshold' to control the number of concurrent requests to the same region server. Please see the release note of HBASE-16388 for more details.
2715
2716
2717 ---
2718
2719 * [HBASE-22292](https://issues.apache.org/jira/browse/HBASE-22292) | *Blocker* | **PreemptiveFastFailInterceptor clean repeatedFailuresMap issue**
2720
2721 Adds new configuration hbase.client.failure.map.cleanup.interval which defaults to ten minutes.
2722
2723
2724 ---
2725
2726 * [HBASE-19222](https://issues.apache.org/jira/browse/HBASE-19222) | *Major* | **update jruby to 9.1.17.0**
2727
2728 <!-- markdown -->
2729
2730 The default version of JRuby shipped with HBase has been updated to the JRuby 9.1.17.0 release.
2731
2732 For details on changes see [the release notes for JRuby 9.1.17.0](https://www.jruby.org/2018/04/23/jruby-9-1-17-0)
2733
2734
2735 ---
2736
2737 * [HBASE-22279](https://issues.apache.org/jira/browse/HBASE-22279) | *Major* | **Add a getRegionLocator method in Table/AsyncTable interface**
2738
2739 Add below method in Table interface:
2740
2741 RegionLocator getRegionLocator() throws IOException;
2742
2743 Add below methods in AsyncTable interface:
2744
2745 AsyncTableRegionLocator getRegionLocator();
2746 CompletableFuture\<TableDescriptor\> getDescriptor();
2747
2748
2749 ---
2750
2751 * [HBASE-15560](https://issues.apache.org/jira/browse/HBASE-15560) | *Major* | **TinyLFU-based BlockCache**
2752
2753 LruBlockCache uses the Segmented LRU (SLRU) policy to capture frequency and recency of the working set. It achieves concurrency by using an O(n) background thread to prioritize the entries and evict. Accessing an entry is O(1) by a hash table lookup, recording its logical access time, and setting a frequency flag. A write is performed in O(1) time by updating the hash table and triggering an async eviction thread. This provides ideal concurrency and minimizes the latencies by penalizing the thread instead of the caller. However the policy does not age the frequencies and may not be resilient to various workload patterns.
2754
2755 This change introduces a new L1 policy, TinyLfuBlockCache, which records the frequency in a counting sketch, ages periodically by halving the counters, and orders entries by SLRU. An entry is discarded by comparing the frequency of the new arrival to the SLRU's victim, and keeping the one with the highest frequency. This allows the operations to be performed in O(1) time and, though the use of a compact sketch, a much larger history is retained beyond the current working set. In a variety of real world traces the policy had near optimal hit rates.
2756
2757 New configuration variable hfile.block.cache.policy sets the eviction policy for the L1 block cache. The default is "LRU" (LruBlockCache). Set to "TinyLFU" to use TinyLfuBlockCache instead.
2758
2759
2760 ---
2761
2762 * [HBASE-22178](https://issues.apache.org/jira/browse/HBASE-22178) | *Major* | **Introduce a createTableAsync with TableDescriptor method in Admin**
2763
2764 Introduced
2765
2766 Future\<Void\> createTableAsync(TableDescriptor);
2767
2768
2769 ---
2770
2771 * [HBASE-22108](https://issues.apache.org/jira/browse/HBASE-22108) | *Major* | **Avoid passing null in Admin methods**
2772
2773 Introduced these methods:
2774 void move(byte[]);
2775 void move(byte[], ServerName);
2776 Future\<Void\> splitRegionAsync(byte[]);
2777
2778 These methods are deprecated:
2779 void move(byte[], byte[])
2780
2781
2782 ---
2783
2784 * [HBASE-22152](https://issues.apache.org/jira/browse/HBASE-22152) | *Major* | **Create a jenkins file for yetus to processing GitHub PR**
2785
2786 Add a new jenkins file for running pre commit check for GitHub PR.
2787
2788
2789 ---
2790
2791 * [HBASE-22007](https://issues.apache.org/jira/browse/HBASE-22007) | *Major* | **Add restoreSnapshot and cloneSnapshot with acl methods in AsyncAdmin**
2792
2793 Add cloneSnapshot/restoreSnapshot with acl methods in AsyncAdmin.
2794
2795
2796 ---
2797
2798 * [HBASE-22123](https://issues.apache.org/jira/browse/HBASE-22123) | *Minor* | **REST gateway reports Insufficient permissions exceptions as 404 Not Found**
2799
2800 When insufficient permissions, you now get:
2801
2802 HTTP/1.1 403 Forbidden
2803
2804 on the HTTP side, and in the message
2805
2806 Forbidden
2807 org.apache.hadoop.hbase.security.AccessDeniedException: org.apache.hadoop.hbase.security.AccessDeniedException: Insufficient permissions for user ‘myuser',action: get, tableName:mytable, family:cf.
2808 at org.apache.ranger.authorization.hbase.RangerAuthorizationCoprocessor.authorizeAccess(RangerAuthorizationCoprocessor.java:547)
2809 and the rest of the ADE stack
2810
2811
2812 ---
2813
2814 * [HBASE-22100](https://issues.apache.org/jira/browse/HBASE-22100) | *Minor* | **False positive for error prone warnings in pre commit job**
2815
2816 Now we will sort the javac WARNING/ERROR before generating diff in pre-commit so we can get a stable output for the error prone. The downside is that we just sort the output lexicographically so the line number will also be sorted lexicographically, which is a bit strange to human.
2817
2818
2819 ---
2820
2821 * [HBASE-22057](https://issues.apache.org/jira/browse/HBASE-22057) | *Major* | **Impose upper-bound on size of ZK ops sent in a single multi()**
2822
2823 Exposes a new configuration property "zookeeper.multi.max.size" which dictates the maximum size of deletes that HBase will make to ZooKeeper in a single RPC. This property defaults to 1MB, which should fall beneath the default ZooKeeper limit of 2MB, controlled by "jute.maxbuffer".
2824
2825
2826 ---
2827
2828 * [HBASE-22052](https://issues.apache.org/jira/browse/HBASE-22052) | *Major* | **pom cleaning; filter out jersey-core in hadoop2 to match hadoop3 and remove redunant version specifications**
2829
2830 <!-- markdown -->
2831 Fixed awkward dependency issue that prevented site building.
2832
2833 #### note specific to HBase 2.1.4
2834 HBase 2.1.4 shipped with an early version of this fix that incorrectly altered the libraries included in our binary assembly for using Apache Hadoop 2.7 (the current build default Hadoop version for 2.1.z). For folks running out of the box against a Hadoop 2.7 cluster (or folks who skip the installation step of [replacing the bundled Hadoop libraries](http://hbase.apache.org/book.html#hadoop)) this will result in a failure at Region Server startup due to a missing class definition. e.g.:
2835 ```
2836 2019-03-27 09:02:05,779 ERROR [main] regionserver.HRegionServer: Failed construction RegionServer
2837 java.lang.NoClassDefFoundError: org/apache/htrace/SamplerBuilder
2838         at org.apache.hadoop.hdfs.DFSClient.<init>(DFSClient.java:644)
2839         at org.apache.hadoop.hdfs.DFSClient.<init>(DFSClient.java:628)
2840         at org.apache.hadoop.hdfs.DistributedFileSystem.initialize(DistributedFileSystem.java:149)
2841         at org.apache.hadoop.fs.FileSystem.createFileSystem(FileSystem.java:2667)
2842         at org.apache.hadoop.fs.FileSystem.access$200(FileSystem.java:93)
2843         at org.apache.hadoop.fs.FileSystem$Cache.getInternal(FileSystem.java:2701)
2844         at org.apache.hadoop.fs.FileSystem$Cache.get(FileSystem.java:2683)
2845         at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:372)
2846         at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:171)
2847         at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:356)
2848         at org.apache.hadoop.fs.Path.getFileSystem(Path.java:295)
2849         at org.apache.hadoop.hbase.util.CommonFSUtils.getRootDir(CommonFSUtils.java:362)
2850         at org.apache.hadoop.hbase.util.CommonFSUtils.isValidWALRootDir(CommonFSUtils.java:411)
2851         at org.apache.hadoop.hbase.util.CommonFSUtils.getWALRootDir(CommonFSUtils.java:387)
2852         at org.apache.hadoop.hbase.regionserver.HRegionServer.initializeFileSystem(HRegionServer.java:704)
2853         at org.apache.hadoop.hbase.regionserver.HRegionServer.<init>(HRegionServer.java:613)
2854         at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
2855         at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:62)
2856         at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)
2857         at java.lang.reflect.Constructor.newInstance(Constructor.java:423)
2858         at org.apache.hadoop.hbase.regionserver.HRegionServer.constructRegionServer(HRegionServer.java:3029)
2859         at org.apache.hadoop.hbase.regionserver.HRegionServerCommandLine.start(HRegionServerCommandLine.java:63)
2860         at org.apache.hadoop.hbase.regionserver.HRegionServerCommandLine.run(HRegionServerCommandLine.java:87)
2861         at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
2862         at org.apache.hadoop.hbase.util.ServerCommandLine.doMain(ServerCommandLine.java:149)
2863         at org.apache.hadoop.hbase.regionserver.HRegionServer.main(HRegionServer.java:3047)
2864 Caused by: java.lang.ClassNotFoundException: org.apache.htrace.SamplerBuilder
2865         at java.net.URLClassLoader.findClass(URLClassLoader.java:381)
2866         at java.lang.ClassLoader.loadClass(ClassLoader.java:424)
2867         at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:349)
2868         at java.lang.ClassLoader.loadClass(ClassLoader.java:357)
2869         ... 26 more
2870
2871 ```
2872
2873 Workaround via any _one_ of the following:
2874 * If you are running against a Hadoop cluster that is 2.8+, ensure you replace the Hadoop libaries in the default binary assembly with those for your version.
2875 * If you are running against a Hadoop cluster that is 2.8+, build the binary assembly from the source release while specifying your Hadoop version.
2876 * If you are running against a Hadoop cluster that is a supported 2.7 release, ensure the `hadoop` executable is in the `PATH` seen at Region Server startup and that you are not using the `HBASE_DISABLE_HADOOP_CLASSPATH_LOOKUP` bypass.
2877 * For any supported Hadoop version, manually make the Apache HTrace artifact `htrace-core-3.1.0-incubating.jar` available to all Region Servers via the HBASE_CLASSPATH environment variable.
2878 * For any supported Hadoop version, manually make the Apache HTrace artifact `htrace-core-3.1.0-incubating.jar` available to all Region Servers by copying it into the directory `${HBASE_HOME}/lib/client-facing-thirdparty/`.
2879
2880
2881 ---
2882
2883 * [HBASE-22065](https://issues.apache.org/jira/browse/HBASE-22065) | *Major* | **Add listTableDescriptors(List\<TableName\>) method in AsyncAdmin**
2884
2885 Add a listTableDescriptors(List\<TableName\>) method in the AsyncAdmin interface, to align with the Admin interface.
2886
2887
2888 ---
2889
2890 * [HBASE-22063](https://issues.apache.org/jira/browse/HBASE-22063) | *Major* | **Deprecated Admin.deleteSnapshot(byte[])**
2891
2892 Deprecate Admin.deleteSnapshot(byte[]), please use the String version instead.
2893
2894
2895 ---
2896
2897 * [HBASE-22040](https://issues.apache.org/jira/browse/HBASE-22040) | *Major* | **Add mergeRegionsAsync with a List of region names method in AsyncAdmin**
2898
2899 Add a mergeRegionsAsync(byte[][], boolean) method in the AsyncAdmin interface.
2900
2901 Instead of using assert, now we will throw IllegalArgumentException when you want to merge less than 2 regions at client side. And also, at master side, instead of using assert, now we will throw DoNotRetryIOException if you want merge more than 2 regions, since we only support merging two regions at once for now.
2902
2903
2904 ---
2905
2906 * [HBASE-22039](https://issues.apache.org/jira/browse/HBASE-22039) | *Major* | **Should add the synchronous parameter for the XXXSwitch method in AsyncAdmin**
2907
2908 Add drainXXX parameter for balancerSwitch/splitSwitch/mergeSwitch methods in the AsyncAdmin interface, which has the same meaning with the synchronous parameter for these methods in the Admin interface.
2909
2910
2911 ---
2912
2913 * [HBASE-22044](https://issues.apache.org/jira/browse/HBASE-22044) | *Major* | **ByteBufferUtils should not be IA.Public API**
2914
2915 <!-- markdown -->
2916
2917 As of HBase 3.0, the ByteBufferUtils class is now marked as a Private API for internal project use only. Downstream users are advised that it no longer has any compatibility promises across releases.
2918
2919 As of earlier HBase release lines the class is now marked as deprecated to call attention to this planned transition.
2920
2921
2922 ---
2923
2924 * [HBASE-21810](https://issues.apache.org/jira/browse/HBASE-21810) | *Major* | **bulkload  support set hfile compression on client**
2925
2926 bulkload (HFileOutputFormat2)  support config the compression on client ,you can set the job configuration "hbase.mapreduce.hfileoutputformat.compression"  override the auto-detection of the target table's compression
2927
2928
2929 ---
2930
2931 * [HBASE-22001](https://issues.apache.org/jira/browse/HBASE-22001) | *Major* | **Polish the Admin interface**
2932
2933 Add a cloneSnapshotAsync method with restoreAcl parameter.
2934 Deprecated restoreSnapshotAsync method as it just ignores the failsafe configuration.
2935 Make snapshotAsync method returns a Future\<Void\>.
2936 Deprecated the snapshot related methods which take a 'byte[]' as the snapshot name.
2937 Use default methods to reduce the code base for implementation classes.
2938
2939
2940 ---
2941
2942 * [HBASE-22000](https://issues.apache.org/jira/browse/HBASE-22000) | *Major* | **Deprecated isTableAvailable with splitKeys**
2943
2944 Deprecated AsyncTable.isTableAvailable(TableName, byte[][]).
2945
2946
2947 ---
2948
2949 * [HBASE-21871](https://issues.apache.org/jira/browse/HBASE-21871) | *Major* | **Support to specify a peer table name in VerifyReplication tool**
2950
2951 After HBASE-21871, we can specify a peer table name with --peerTableName in VerifyReplication tool like the following:
2952 hbase org.apache.hadoop.hbase.mapreduce.replication.VerifyReplication --peerTableName=peerTable 5 TestTable
2953
2954 In addition, we can compare any 2 tables in any remote clusters with specifying both peerId and --peerTableName.
2955
2956 For example:
2957 hbase org.apache.hadoop.hbase.mapreduce.replication.VerifyReplication --peerTableName=peerTable zk1,zk2,zk3:2181/hbase TestTable
2958
2959
2960 ---
2961
2962 * [HBASE-15728](https://issues.apache.org/jira/browse/HBASE-15728) | *Major* | **Add remaining per-table region / store / flush / compaction related metrics**
2963
2964 Adds below flush, split, and compaction metrics
2965
2966  +  // split related metrics
2967  +  private MutableFastCounter splitRequest;
2968  +  private MutableFastCounter splitSuccess;
2969  +  private MetricHistogram splitTimeHisto;
2970  +
2971  +  // flush related metrics
2972  +  private MetricHistogram flushTimeHisto;
2973  +  private MetricHistogram flushMemstoreSizeHisto;
2974  +  private MetricHistogram flushOutputSizeHisto;
2975  +  private MutableFastCounter flushedMemstoreBytes;
2976  +  private MutableFastCounter flushedOutputBytes;
2977  +
2978  +  // compaction related metrics
2979  +  private MetricHistogram compactionTimeHisto;
2980  +  private MetricHistogram compactionInputFileCountHisto;
2981  +  private MetricHistogram compactionInputSizeHisto;
2982  +  private MetricHistogram compactionOutputFileCountHisto;
2983  +  private MetricHistogram compactionOutputSizeHisto;
2984  +  private MutableFastCounter compactedInputBytes;
2985  +  private MutableFastCounter compactedOutputBytes;
2986  +
2987  +  private MetricHistogram majorCompactionTimeHisto;
2988  +  private MetricHistogram majorCompactionInputFileCountHisto;
2989  +  private MetricHistogram majorCompactionInputSizeHisto;
2990  +  private MetricHistogram majorCompactionOutputFileCountHisto;
2991  +  private MetricHistogram majorCompactionOutputSizeHisto;
2992  +  private MutableFastCounter majorCompactedInputBytes;
2993  +  private MutableFastCounter majorCompactedOutputBytes;
2994
2995
2996 ---
2997
2998 * [HBASE-21481](https://issues.apache.org/jira/browse/HBASE-21481) | *Major* | **[acl] Superuser's permissions should not be granted or revoked by any non-su global admin**
2999
3000 HBASE-21481 improves the quality of access control, by strengthening the protection of super users's privileges.
3001
3002
3003 ---
3004
3005 * [HBASE-21082](https://issues.apache.org/jira/browse/HBASE-21082) | *Critical* | **Reimplement assign/unassign related procedure metrics**
3006
3007 Now we have four types of RIT procedure metrics, assign, unassign, move, reopen. The meaning of assign/unassign is changed, as we will not increase the unassign metric and then the assign metric when moving a region.
3008 Also introduced two new procedure metrics, open and close, which are used to track the open/close region calls to region server. We may send open/close multiple times to finish a RIT since we may retry multiple times.
3009
3010
3011 ---
3012
3013 * [HBASE-20724](https://issues.apache.org/jira/browse/HBASE-20724) | *Critical* | **Sometimes some compacted storefiles are still opened after region failover**
3014
3015 Problem: This is an old problem since HBASE-2231. The compaction event marker was only writed to WAL. But after flush, the WAL may be archived, which means an useful compaction event marker be deleted, too. So the compacted store files cannot be archived when region open and replay WAL.
3016
3017 Solution: After this jira, the compaction event tracker will be writed to HFile. When region open and load store files, read the compaction evnet tracker from HFile and archive the compacted store files which still exist.
3018
3019
3020 ---
3021
3022 * [HBASE-21820](https://issues.apache.org/jira/browse/HBASE-21820) | *Major* | **Implement CLUSTER quota scope**
3023
3024 HBase contains two quota scopes: MACHINE and CLUSTER. Before this patch, set quota operations did not expose scope option to client api and use MACHINE as default, CLUSTER scope can not be set and used.
3025 Shell commands are as follows:
3026 set\_quota, TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec'
3027
3028 This issue implements CLUSTER scope in a simple way: For user, namespace, user over namespace quota, use [ClusterLimit / RSNum] as machine limit. For table and user over table quota, use [ClusterLimit / TotalTableRegionNum \* MachineTableRegionNum] as machine limit.
3029 After this patch, user can set CLUSTER scope quota, but MACHINE is still default if user ignore scope.
3030 Shell commands are as follows:
3031 set\_quota, TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec'
3032 set\_quota, TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec', SCOPE =\> MACHINE
3033 set\_quota, TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec', SCOPE =\> CLUSTER
3034
3035
3036 ---
3037
3038 * [HBASE-21057](https://issues.apache.org/jira/browse/HBASE-21057) | *Minor* | **upgrade to latest spotbugs**
3039
3040 Change spotbugs version to 3.1.11.
3041
3042
3043 ---
3044
3045 * [HBASE-21505](https://issues.apache.org/jira/browse/HBASE-21505) | *Major* | **Several inconsistencies on information reported for Replication Sources by hbase shell status 'replication' command.**
3046
3047 This modifies "status 'replication'" output, fixing inconsistencies on the reporting times and ages of last shipped edits, as well as wrong calculation of replication lags.
3048
3049 It also introduces additional info for each recovery queue, which was not accounted by this command before.
3050
3051 The new output for "status 'replication'" command is explained in details below:
3052 a) Source started, target stopped, no edits arrived on source yet:
3053 ...
3054  SOURCE: PeerID=1
3055          Normal Queue: 1
3056            No Ops shipped since last restart, SizeOfLogQueue=1, No edits for this source since it started, Replication Lag=0
3057 ...
3058 b) Source started, target stopped, add edit on source:
3059 ...
3060 Normal Queue: 1
3061            No Ops shipped since last restart, SizeOfLogQueue=1, TimeStampOfLastArrivedInSource=Wed Nov 21 07:21:00 GMT 2018, Replication Lag=2459
3062 ...
3063 c) Source started, target stopped, edit added on source, restart source:
3064 ...
3065 SOURCE: PeerID=1
3066          Normal Queue: 1
3067            No Ops shipped since last restart, SizeOfLogQueue=1, No edits for this source since it started, Replication Lag=0
3068          Recovered Queue: 1-hbase01.home,16020,1542784524057
3069            No Ops shipped since last restart, SizeOfLogQueue=1, TimeStampOfLastArrivedInSource=Wed Nov 21 07:23:00 GMT 2018, Replication Lag=201495
3070 ...
3071 d) Source started, target stopped, add edit on source, restart source, add another edit on source:
3072 ...
3073 SOURCE: PeerID=1
3074          Normal Queue: 1
3075            No Ops shipped since last restart, SizeOfLogQueue=1, TimeStampOfLastArrivedInSource=Wed Nov 21 07:02:28 GMT 2018, Replication Lag=6349
3076          Recovered Queue: 1-hbase01.home,16020,1542782758742
3077            No Ops shipped since last restart, SizeOfLogQueue=0, TimeStampOfLastArrivedInSource=Wed Nov 21 06:53:05 GMT 2018, Replication Lag=569394
3078 ...
3079 e) Source started, target stopped, add edit on source, restart source, add another edit on source, start target:
3080 ...
3081        SOURCE: PeerID=1
3082          Normal Queue: 1
3083            AgeOfLastShippedOp=30000, TimeStampOfLastShippedOp=Wed Nov 21 07:07:58 GMT 2018, SizeOfLogQueue=1, TimeStampOfLastArrivedInSource=Wed Nov 21 07:02:28 GMT 2018, Replication Lag=0
3084 ...
3085 f) Source started, target stopped, add edit on source, restart source, restart target:
3086 ...
3087 SOURCE: PeerID=1
3088          Normal Queue: 1
3089            No Ops shipped since last restart, SizeOfLogQueue=1, No edits for this source since it started, Replication Lag=0
3090 ...
3091
3092
3093 ---
3094
3095 * [HBASE-21922](https://issues.apache.org/jira/browse/HBASE-21922) | *Major* | **BloomContext#sanityCheck may failed when use ROWPREFIX\_DELIMITED bloom filter**
3096
3097 Remove bloom filter type ROWPREFIX\_DELIMITED. May add it back when find a better solution.
3098
3099
3100 ---
3101
3102 * [HBASE-21783](https://issues.apache.org/jira/browse/HBASE-21783) | *Major* | **Support exceed user/table/ns throttle quota if region server has available quota**
3103
3104 Support enable or disable exceed throttle quota. Exceed throttle quota means, user can over consume user/namespace/table quota if region server has additional available quota because other users don't consume at the same time.
3105 Use the following shell commands to enable/disable exceed throttle quota: enable\_exceed\_throttle\_quota
3106 disable\_exceed\_throttle\_quota
3107 There are two limits when enable exceed throttle quota:
3108 1. Must set at least one read and one write region server throttle quota;
3109 2. All region server throttle quotas must be in seconds time unit. Because once previous requests exceed their quota and consume region server quota, quota in other time units may be refilled in a long time, this may affect later requests.
3110
3111
3112 ---
3113
3114 * [HBASE-20587](https://issues.apache.org/jira/browse/HBASE-20587) | *Major* | **Replace Jackson with shaded thirdparty gson**
3115
3116 Remove jackson dependencies from most hbase modules except hbase-rest, use shaded gson instead. The output json will be a bit different since jackson can use getter/setter, but gson will always use the fields.
3117
3118
3119 ---
3120
3121 * [HBASE-21928](https://issues.apache.org/jira/browse/HBASE-21928) | *Major* | **Deprecated HConstants.META\_QOS**
3122
3123 Mark HConstants.META\_QOS as deprecated. It is for internal use only, which is the highest priority. You should not try to set a priority greater than or equal to this value, although it is no harm but also useless.
3124
3125
3126 ---
3127
3128 * [HBASE-17942](https://issues.apache.org/jira/browse/HBASE-17942) | *Major* | **Disable region splits and merges per table**
3129
3130 This patch adds the ability to disable split and/or merge for a table (By default, split and merge are enabled for a table).
3131
3132
3133 ---
3134
3135 * [HBASE-21636](https://issues.apache.org/jira/browse/HBASE-21636) | *Major* | **Enhance the shell scan command to support missing scanner specifications like ReadType, IsolationLevel etc.**
3136
3137 Allows shell to set Scan options previously not exposed. See additions as part of the scan help by typing following hbase shell:
3138
3139 hbase\> help 'scan'
3140
3141
3142 ---
3143
3144 * [HBASE-21201](https://issues.apache.org/jira/browse/HBASE-21201) | *Major* | **Support to run VerifyReplication MR tool without peerid**
3145
3146 We can specify peerQuorumAddress instead of peerId in VerifyReplication tool. So it no longer requires peerId to be setup when using this tool.
3147
3148 For example:
3149 hbase org.apache.hadoop.hbase.mapreduce.replication.VerifyReplication zk1,zk2,zk3:2181/hbase testTable
3150
3151
3152 ---
3153
3154 * [HBASE-21838](https://issues.apache.org/jira/browse/HBASE-21838) | *Major* | **Create a special ReplicationEndpoint just for verifying the WAL entries are fine**
3155
3156 Introduce a VerifyWALEntriesReplicationEndpoint which replicates nothing but only verifies if all the cells are valid.
3157 It can be used to capture bugs for writing WAL, as most times we will not read the WALs again after writing it if there are no region server crashes.
3158
3159
3160 ---
3161
3162 * [HBASE-21764](https://issues.apache.org/jira/browse/HBASE-21764) | *Major* | **Size of in-memory compaction thread pool should be configurable**
3163
3164 Introduced an new config key in this issue: hbase.regionserver.inmemory.compaction.pool.size. the default value would be 10.  you can configure this to set the pool size of in-memory compaction pool. Note that all memstores in one region server will share the same pool, so if you have many regions in one region server,  you need to set this larger to compact faster for better read performance.
3165
3166
3167 ---
3168
3169 * [HBASE-21684](https://issues.apache.org/jira/browse/HBASE-21684) | *Major* | **Throw DNRIOE when connection or rpc client is closed**
3170
3171 Make StoppedRpcClientException extend DoNotRetryIOException.
3172
3173
3174 ---
3175
3176 * [HBASE-21739](https://issues.apache.org/jira/browse/HBASE-21739) | *Major* | **Move grant/revoke from regionserver to master**
3177
3178 To implement user permission control in Precedure V2, move grant and revoke method from AccessController to master firstly.
3179 Mark AccessController#grant and AccessController#revoke as deprecated and please use Admin#grant and Admin#revoke instead.
3180
3181
3182 ---
3183
3184 * [HBASE-21791](https://issues.apache.org/jira/browse/HBASE-21791) | *Blocker* | **Upgrade thrift dependency to 0.12.0**
3185
3186 IMPORTANT: Due to security issues, all users who use hbase thrift should avoid using releases which do not have this fix.
3187
3188 The effect releases are:
3189 2.1.x: 2.1.2 and below
3190 2.0.x: 2.0.4 and below
3191 1.x: 1.4.x and below
3192
3193 If you are using the effect releases above, please consider upgrading to a newer release ASAP.
3194
3195
3196 ---
3197
3198 * [HBASE-20894](https://issues.apache.org/jira/browse/HBASE-20894) | *Major* | **Move BucketCache from java serialization to protobuf**
3199
3200 For users who have configured hbase.bucketcache.ioengine with either the file:, files:, or mmap: prefix, and configured it to be persistent via the hbase.bucketcache.persistent.path property, the serialization format of the bucket cache has changed between versions. The old state will not be read during startup, and there is currently no migration path. The impact is expected to be minimal, however, since the cache will rebuild over time as access patterns dictate.
3201
3202
3203
3204 # HBASE  2.3.0 Release Notes
3205
3206 These release notes cover new developer and user-facing incompatibilities, important issues, features, and major improvements.
3207
3208
3209 ---
3210
3211 * [HBASE-24603](https://issues.apache.org/jira/browse/HBASE-24603) | *Critical* | **Zookeeper sync() call is async**
3212
3213 <!-- markdown -->
3214
3215 Fixes a couple of bugs in ZooKeeper interaction. Firstly, zk sync() call that is used to sync the lagging followers with leader so that the client sees a consistent snapshot state was actually asynchronous under the hood. We make it synchronous for correctness. Second, zookeeper events are now processed in a separate thread rather than doing it in the thread context of zookeeper client connection. This decoupling frees up client connection quickly and avoids deadlocks.
3216
3217
3218 ---
3219
3220 * [HBASE-24631](https://issues.apache.org/jira/browse/HBASE-24631) | *Major* | **Loosen Dockerfile pinned package versions of the "debian-revision"**
3221
3222 <!-- markdown -->
3223 Update our package version numbers throughout the Dockerfiles to be pinned to their epic:upstream-version components only. Previously we'd specify the full debian package version number, including the debian-revision. This lead to instability as debian packaging details changed.
3224 See also [man deb-version](http://manpages.ubuntu.com/manpages/xenial/en/man5/deb-version.5.html)
3225
3226
3227 ---
3228
3229 * [HBASE-24205](https://issues.apache.org/jira/browse/HBASE-24205) | *Major* | **Create metric to know the number of reads that happens from memstore**
3230
3231 Adds a new metric where we collect the number of read requests (tracked per row) whether the row was fetched completely from memstore or it was pulled from files  and memstore.
3232 The metric is now collected under the mbean for Tables and under the mbean for regions.
3233 Under table mbean ie.-
3234 'name": "Hadoop:service=HBase,name=RegionServer,sub=Tables'
3235 The new metrics will be listed as
3236 {code}
3237     "Namespace\_default\_table\_t3\_columnfamily\_f1\_metric\_memstoreOnlyRowReadsCount": 5,
3238  "Namespace\_default\_table\_t3\_columnfamily\_f1\_metric\_mixedRowReadsCount": 1,
3239 {code}
3240 Where the format is Namespace\_\<namespacename\>\_table\_\<tableName\>\_columnfamily\_\<columnfamilyname\>\_metric\_memstoreOnlyRowReadsCount
3241 Namespace\_\<namespacename\>\_table\_\<tableName\>\_columnfamily\_\<columnfamilyname\>\_metric\_mixedRowReadsCount
3242 {code}
3243
3244 The same one under the region ie.
3245 "name": "Hadoop:service=HBase,name=RegionServer,sub=Regions",
3246 comes as
3247 {code}
3248    "Namespace\_default\_table\_t3\_region\_75a7846f4ac4a2805071a855f7d0dbdc\_store\_f1\_metric\_memstoreOnlyRowReadsCount": 5,
3249     "Namespace\_default\_table\_t3\_region\_75a7846f4ac4a2805071a855f7d0dbdc\_store\_f1\_metric\_mixedRowReadsCount": 1,
3250 {code}
3251 where
3252 Namespace\_\<namespacename\_table\_\<tableName\>\_region\_\<regionName\>\_store\_\<storeName\>\_metric\_memstoreOnlyRowReadsCount
3253 Namespace\_\<namespacename\_table\_\<tableName\>\_region\_\<regionName\>\_store\_\<storeName\>\_metric\_mixedRowReadsCount
3254 This is also an aggregate against every store the number of reads that happened purely from the memstore or it was a  mixed read that happened from memstore and file.
3255
3256
3257 ---
3258
3259 * [HBASE-21773](https://issues.apache.org/jira/browse/HBASE-21773) | *Critical* | **rowcounter utility should respond to pleas for help**
3260
3261 This adds [-h\|-help] options to rowcounter. Passing either -h or -help will print rowcounter guide as below:
3262
3263 $hbase rowcounter -h
3264
3265 usage: hbase rowcounter \<tablename\> [options] [\<column1\> \<column2\>...]
3266 Options:
3267     --starttime=\<arg\>       starting time filter to start counting rows from.
3268     --endtime=\<arg\>         end time filter limit, to only count rows up to this timestamp.
3269     --range=\<arg\>           [startKey],[endKey][;[startKey],[endKey]...]]
3270     --expectedCount=\<arg\>   expected number of rows to be count.
3271 For performance, consider the following configuration properties:
3272 -Dhbase.client.scanner.caching=100
3273 -Dmapreduce.map.speculative=false
3274
3275
3276 ---
3277
3278 * [HBASE-24217](https://issues.apache.org/jira/browse/HBASE-24217) | *Major* | **Add hadoop 3.2.x support**
3279
3280 CI coverage has been extended to include Hadoop 3.2.x for HBase 2.2+.
3281
3282
3283 ---
3284
3285 * [HBASE-23055](https://issues.apache.org/jira/browse/HBASE-23055) | *Major* | **Alter hbase:meta**
3286
3287 Adds being able to edit hbase:meta table schema. For example,
3288
3289 hbase(main):006:0\> alter 'hbase:meta', {NAME =\> 'info', DATA\_BLOCK\_ENCODING =\> 'ROW\_INDEX\_V1'}
3290 Updating all regions with the new schema...
3291 All regions updated.
3292 Done.
3293 Took 1.2138 seconds
3294
3295 You can even add columnfamilies. Howevert, you cannot delete any of the core hbase:meta column families such as 'info' and 'table'.
3296
3297
3298 ---
3299
3300 * [HBASE-15161](https://issues.apache.org/jira/browse/HBASE-15161) | *Major* | **Umbrella: Miscellaneous improvements from production usage**
3301
3302 This ticket summarizes significant improvements and expansion to the metrics surface area. Interested users should review the individual sub-tasks.
3303
3304
3305 ---
3306
3307 * [HBASE-24545](https://issues.apache.org/jira/browse/HBASE-24545) | *Major* | **Add backoff to SCP check on WAL split completion**
3308
3309 Adds backoff in ServerCrashProcedure wait on WAL split to complete if large backlog of files to split (Its possible to avoid SCP blocking, waiting on WALs to split if you use procedure-based splitting --  set 'hbase.split.wal.zk.coordinated' to false to enable procedure based wal splitting.)
3310
3311
3312 ---
3313
3314 * [HBASE-24524](https://issues.apache.org/jira/browse/HBASE-24524) | *Minor* | **SyncTable logging improvements**
3315
3316 Notice this has changed log level for mismatching row keys, originally those were being logged at INFO level, now it's logged at DEBUG level. This is consistent with the logging of mismatching cells. Also, for missing row keys, it now logs row key values in human readable format, making it more meaningful for operators troubleshooting mismatches.
3317
3318
3319 ---
3320
3321 * [HBASE-24359](https://issues.apache.org/jira/browse/HBASE-24359) | *Major* | **Optionally ignore edits for deleted CFs for replication.**
3322
3323 Introduce a new config hbase.replication.drop.on.deleted.columnfamily, default is false. When config to true, the replication will drop the edits for columnfamily that has been deleted from the replication source and target.
3324
3325
3326 ---
3327
3328 * [HBASE-24418](https://issues.apache.org/jira/browse/HBASE-24418) | *Major* | **Consolidate Normalizer implementations**
3329
3330 <!-- markdown -->
3331 This change extends the Normalizer with a handful of new configurations. The configuration points supported are:
3332 * `hbase.normalizer.split.enabled` Whether to split a region as part of normalization. Default: `true`.
3333 * `hbase.normalizer.merge.enabled` Whether to merge a region as part of normalization. Default `true`.
3334 * `hbase.normalizer.min.region.count` The minimum number of regions in a table to consider it for merge normalization. Default: 3.
3335 * `hbase.normalizer.merge.min_region_age.days` The minimum age for a region to be considered for a merge, in days. Default: 3.
3336 * `hbase.normalizer.merge.min_region_size.mb` The minimum size for a region to be considered for a merge, in whole MBs. Default: 1.
3337
3338
3339 ---
3340
3341 * [HBASE-24309](https://issues.apache.org/jira/browse/HBASE-24309) | *Major* | **Avoid introducing log4j and slf4j-log4j dependencies for modules other than hbase-assembly**
3342
3343 Add a hbase-logging module, put the log4j related code in this module only so other modules do not need to depend on log4j at compile scope. See the comments of Log4jUtils and InternalLog4jUtils for more details.
3344
3345 Add a log4j.properties to the test jar of hbase-logging module, so for other sub modules we just need to depend on the test jar of hbase-logging module at test scope to output the log to console, without placing a log4j.properties in the test resources as they all (almost) have the same content. And this test module will not be included in the assembly tarball so it will not mess up the binary distribution.
3346
3347 Ban direct commons-logging dependency, and ban commons-logging and log4j imports in non-test code, to avoid mess up the downstream users logging framework. In hbase-logging module we do need to use log4j classes and the trick is to use full class name.
3348
3349 Add jcl-over-slf4j and jul-to-slf4j dependencies, as some of our dependencies use jcl or jul as logging framework, we should also redirect their log message to slf4j.
3350
3351
3352 ---
3353
3354 * [HBASE-21406](https://issues.apache.org/jira/browse/HBASE-21406) | *Minor* | **"status 'replication'" should not show SINK if the cluster does not act as sink**
3355
3356 Added new metric to differentiate sink startup time from last OP applied time.
3357
3358 Original behaviour was to always set startup time to TimestampsOfLastAppliedOp, and always show it on "status 'replication'" command, regardless if the sink ever applied any OP.
3359
3360 This was confusing, specially for scenarios where cluster was just acting as source, the output could lead to wrong interpretations about sink not applying edits or replication being stuck.
3361
3362 With the new metric, we now compare the two metrics values, assuming that if both are the same, there's never been any OP shipped to the given sink, so output would reflect it more clearly, to something as for example:
3363
3364 SINK: TimeStampStarted=Thu Dec 06 23:59:47 GMT 2018, Waiting for OPs...
3365
3366
3367 ---
3368
3369 * [HBASE-24132](https://issues.apache.org/jira/browse/HBASE-24132) | *Major* | **Upgrade to Apache ZooKeeper 3.5.7**
3370
3371 <!-- markdown -->
3372 HBase ships ZooKeeper 3.5.x. Was the EOL'd 3.4.x. 3.5.x client can talk to 3.4.x ensemble.
3373
3374 The ZooKeeper project has built a [FAQ](https://cwiki.apache.org/confluence/display/ZOOKEEPER/Upgrade+FAQ) that documents known issues and work-arounds when upgrading existing deployments.
3375
3376
3377 ---
3378
3379 * [HBASE-22287](https://issues.apache.org/jira/browse/HBASE-22287) | *Major* | **inifinite retries on failed server in RSProcedureDispatcher**
3380
3381 Add backoff. Avoid retrying every 100ms.
3382
3383
3384 ---
3385
3386 * [HBASE-24425](https://issues.apache.org/jira/browse/HBASE-24425) | *Major* | **Run hbck\_chore\_run and catalogjanitor\_run on draw of 'HBCK Report' page**
3387
3388 Runs 'catalogjanitor\_run' and 'hbck\_chore\_run' inline with the loading of the 'HBCK Report' page.
3389
3390 Pass '?cache=true' to skip inline invocation of 'catalogjanitor\_run' and 'hbck\_chore\_run' drawing the page.
3391
3392
3393 ---
3394
3395 * [HBASE-24408](https://issues.apache.org/jira/browse/HBASE-24408) | *Blocker* | **Introduce a general 'local region' to store data on master**
3396
3397 Introduced a general 'local region' at master side to store the procedure data, etc.
3398
3399 The hfile of this region will be stored on the root fs while the wal will be stored on the wal fs. This issue supercedes part of the code for HBASE-23326, as now we store the data in 'MasterData' directory instead of 'MasterProcs'.
3400
3401 The old hfiles will be moved to the global hfile archived directory with the suffix $-masterlocalhfile-$. The wal files will be moved to the global old wal directory with the suffix $masterlocalwal$. The TimeToLiveMasterLocalStoreHFileCleaner and TimeToLiveMasterLocalStoreWALCleaner are configured by default for cleaning the old hfiles and wal files, and the default TTLs are both 7 days.
3402
3403
3404 ---
3405
3406 * [HBASE-24115](https://issues.apache.org/jira/browse/HBASE-24115) | *Major* | **Relocate test-only REST "client" from src/ to test/ and mark Private**
3407
3408 Relocate test-only REST RemoteHTable and RemoteAdmin from src/ to test/. And mark them as InterfaceAudience.Private.
3409
3410
3411 ---
3412
3413 * [HBASE-23938](https://issues.apache.org/jira/browse/HBASE-23938) | *Major* | **Replicate slow/large RPC calls to HDFS**
3414
3415 Config key: hbase.regionserver.slowlog.systable.enabled
3416 Default value: false
3417
3418 This config can be enabled if hbase.regionserver.slowlog.buffer.enabled is already enabled. While hbase.regionserver.slowlog.buffer.enabled ensures that any slow/large RPC logs with complete details are written to ring buffer available at each RegionServer, hbase.regionserver.slowlog.systable.enabled would ensure that all such logs are also persisted in new system table hbase:slowlog.
3419 Operator can scan hbase:slowlog with filters to retrieve specific attribute matching records and this table would be useful to capture historical performance of slowness of RPC calls with detailed analysis.
3420
3421 hbase:slowlog consists of single ColumnFamily info. info consists of multiple qualifiers similar to the attributes available to query as part of Admin API: get\_slowlog\_responses.
3422
3423 One example of a row from hbase:slowlog scan result (Attached a sample screenshot in the Jira) :
3424
3425  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:call\_details, timestamp=2020-05-16T14:59:58.764Z, value=Scan(org.apache.hadoop.hbase.shaded.protobuf.generated.ClientProtos$ScanRequest)
3426  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:client\_address, timestamp=2020-05-16T14:59:58.764Z, value=172.20.10.2:57348
3427  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:method\_name, timestamp=2020-05-16T14:59:58.764Z, value=Scan
3428  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:param, timestamp=2020-05-16T14:59:58.764Z, value=region { type: REGION\_NAME value: "cluster\_test,cccccccc,1589635796466.aa45e1571d533f5ed0bb31cdccaaf9cf." } scan { a
3429                                                              ttribute { name: "\_isolationlevel\_" value: "\\x5C000" } start\_row: "cccccccc" time\_range { from: 0 to: 9223372036854775807 } max\_versions: 1 cache\_blocks: true max\_result\_size: 2
3430                                                              097152 caching: 2147483647 include\_stop\_row: false } number\_of\_rows: 2147483647 close\_scanner: false client\_handles\_partials: true client\_handles\_heartbeats: true track\_scan\_met
3431                                                              rics: false
3432  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:processing\_time, timestamp=2020-05-16T14:59:58.764Z, value=24
3433  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:queue\_time, timestamp=2020-05-16T14:59:58.764Z, value=0
3434  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:region\_name, timestamp=2020-05-16T14:59:58.764Z, value=cluster\_test,cccccccc,1589635796466.aa45e1571d533f5ed0bb31cdccaaf9cf.
3435  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:response\_size, timestamp=2020-05-16T14:59:58.764Z, value=211227
3436  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:server\_class, timestamp=2020-05-16T14:59:58.764Z, value=HRegionServer
3437  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:start\_time, timestamp=2020-05-16T14:59:58.764Z, value=1589640743932
3438  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:type, timestamp=2020-05-16T14:59:58.764Z, value=ALL
3439  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:username, timestamp=2020-05-16T14:59:58.764Z, value=vjasani
3440
3441
3442 ---
3443
3444 * [HBASE-24271](https://issues.apache.org/jira/browse/HBASE-24271) | *Major* | **Set values in \`conf/hbase-site.xml\` that enable running on \`LocalFileSystem\` out of the box**
3445
3446 <!-- markdown -->
3447 HBASE-24271 makes changes the the default `conf/hbase-site.xml` such that `bin/hbase` will run directly out of the binary tarball or a compiled source tree without any configuration modifications vs. Hadoop 2.8+. This changes our long-standing history of shipping no configured values in `conf/hbase-site.xml`, so existing processes that assume this file is empty of configuration properties may require attention.
3448
3449
3450 ---
3451
3452 * [HBASE-24310](https://issues.apache.org/jira/browse/HBASE-24310) | *Major* | **Use Slf4jRequestLog for hbase-http**
3453
3454 Use Slf4jRequestLog instead of the log4j HttpRequestLogAppender in HttpServer.
3455
3456 The request log is disabled by default in conf/log4j.properties by the following lines:
3457
3458 # Disable request log by default, you can enable this by changing the appender
3459 log4j.category.http.requests=INFO,NullAppender
3460 log4j.additivity.http.requests=false
3461
3462 Change the 'NullAppender' to what ever you want if you want to enable request log.
3463
3464 Notice that, the logger name for master status http server is 'http.requests.master', and for region server it is 'http.requests.regionserver'
3465
3466
3467 ---
3468
3469 * [HBASE-24335](https://issues.apache.org/jira/browse/HBASE-24335) | *Major* | **Support deleteall with ts but without column in shell mode**
3470
3471 Use a empty string to represent no column specified for deleteall in shell mode.
3472 useage:
3473 deleteall 'test','r1','',12345
3474 deleteall 'test', {ROWPREFIXFILTER =\> 'prefix'}, '', 12345
3475
3476
3477 ---
3478
3479 * [HBASE-24304](https://issues.apache.org/jira/browse/HBASE-24304) | *Major* | **Separate a hbase-asyncfs module**
3480
3481 Added a new hbase-asyncfs module to hold the asynchronous dfs output stream implementation for implementing WAL.
3482
3483
3484 ---
3485
3486 * [HBASE-22710](https://issues.apache.org/jira/browse/HBASE-22710) | *Major* | **Wrong result in one case of scan that use  raw and versions and filter together**
3487
3488 Make the logic of the versions chosen more reasonable for raw scan, to avoid lose result when using filter.
3489
3490
3491 ---
3492
3493 * [HBASE-24285](https://issues.apache.org/jira/browse/HBASE-24285) | *Major* | **Move to hbase-thirdparty-3.3.0**
3494
3495 Moved to hbase-thirdparty 3.3.0.
3496
3497
3498 ---
3499
3500 * [HBASE-24252](https://issues.apache.org/jira/browse/HBASE-24252) | *Major* | **Implement proxyuser/doAs mechanism for hbase-http**
3501
3502 This feature enables the HBase Web UI's to accept a 'proxyuser' via the HTTP Request's query string. When the parameter \`hbase.security.authentication.spnego.kerberos.proxyuser.enable\` is set to \`true\` in hbase-site.xml (default is \`false\`), the HBase UI will attempt to impersonate the user specified by the query parameter "doAs". This query parameter is checked case-insensitively. When this option is not provided, the user who executed the request is the "real" user and there is no ability to execute impersonation against the WebUI.
3503
3504 For example, if the user "bob" with Kerberos credentials executes a request against the WebUI with this feature enabled and a query string which includes \`doAs=alice\`, the HBase UI will treat this request as executed as \`alice\`, not \`bob\`.
3505
3506 The standard Hadoop proxyuser configuration properties to limit users who may impersonate others apply to this change (e.g. to enable \`bob\` to impersonate \`alice\`). See the Hadoop documentation for more information on how to configure these proxyuser rules.
3507
3508
3509 ---
3510
3511 * [HBASE-24143](https://issues.apache.org/jira/browse/HBASE-24143) | *Major* | **[JDK11] Switch default garbage collector from CMS**
3512
3513 <!-- markdown -->
3514 `bin/hbase` will now dynamically select a Garbage Collector implementation based on the detected JVM version. JDKs 8,9,10 use `-XX:+UseConcMarkSweepGC`, while JDK11+ use `-XX:+UseG1GC`.
3515
3516 Notice a slight compatibility change. Previously, the garbage collector choice would always be appended to a user-provided value for `HBASE_OPTS`. As of this change, this setting will only be applied when `HBASE_OPTS` is unset. That means that operators who provide a value for this variable will now need to also specify the collector. This is especially important for those on JDK8, where the vm default GC is not the recommended ConcMarkSweep.
3517
3518
3519 ---
3520
3521 * [HBASE-24024](https://issues.apache.org/jira/browse/HBASE-24024) | *Major* | **Optionally reject multi() requests with very high no of rows**
3522
3523 New Config: hbase.rpc.rows.size.threshold.reject
3524 -----------------------------------------------------------------------
3525
3526 Default value: false
3527 Description:
3528 If value is true, RegionServer will abort batch requests of Put/Delete with number of rows in a batch operation exceeding threshold defined by value of config: hbase.rpc.rows.warning.threshold.
3529
3530
3531 ---
3532
3533 * [HBASE-24139](https://issues.apache.org/jira/browse/HBASE-24139) | *Critical* | **Balancer should avoid leaving idle region servers**
3534
3535 StochasticLoadBalancer functional improvement:
3536
3537 StochasticLoadBalancer would rebalance the cluster if there are any idle RegionServers in the cluster (RegionServer having no region), while other RegionServers have at least 1 region available.
3538
3539
3540 ---
3541
3542 * [HBASE-24196](https://issues.apache.org/jira/browse/HBASE-24196) | *Major* | **[Shell] Add rename rsgroup command in hbase shell**
3543
3544 user or admin can now use
3545 hbase shell \> rename\_rsgroup 'oldname', 'newname'
3546 to rename rsgroup.
3547
3548
3549 ---
3550
3551 * [HBASE-24218](https://issues.apache.org/jira/browse/HBASE-24218) | *Major* | **Add hadoop 3.2.x in hadoop check**
3552
3553 Add hadoop-3.2.0 and hadoop-3.2.1 in hadoop check and when '--quick-hadoopcheck' we will only check hadoop-3.2.1.
3554
3555 Notice that, for aligning the personality scripts across all the active branches, we will commit the patch to all active branches, but the hadoop-3.2.x support in hadoopcheck is only applied to branch-2.2+.
3556
3557
3558 ---
3559
3560 * [HBASE-23829](https://issues.apache.org/jira/browse/HBASE-23829) | *Major* | **Get \`-PrunSmallTests\` passing on JDK11**
3561
3562 \`-PrunSmallTests\` now pass on JDK11 when using \`-Phadoop.profile=3.0\`.
3563
3564
3565 ---
3566
3567 * [HBASE-24185](https://issues.apache.org/jira/browse/HBASE-24185) | *Major* | **Junit tests do not behave well with System.exit or Runtime.halt or JVM exits in general.**
3568
3569 Tests that fail because a process -- RegionServer or Master -- called System.exit, will now instead throw an exception.
3570
3571
3572 ---
3573
3574 * [HBASE-24072](https://issues.apache.org/jira/browse/HBASE-24072) | *Major* | **Nightlies reporting OutOfMemoryError: unable to create new native thread**
3575
3576 Hadoop hosts have had their ulimit -u raised from 10000 to 30000 (per user, by INFRA). The Docker build container has had its limit raised from 10000 to 12500.
3577
3578
3579 ---
3580
3581 * [HBASE-24112](https://issues.apache.org/jira/browse/HBASE-24112) | *Major* | **[RSGroup] Support renaming rsgroup**
3582
3583 Support RSGroup renaming in core codebase. New API Admin#renameRSGroup(String, String) is introduced in 3.0.0.
3584
3585
3586 ---
3587
3588 * [HBASE-23994](https://issues.apache.org/jira/browse/HBASE-23994) | *Trivial* | ** Add WebUI to Canary**
3589
3590 <!-- markdown -->
3591 The Canary tool now offers a WebUI when run in `region` mode (the default mode). It is enabled by default, and by default, it binds to `0.0.0.0:16050`. This can be overridden by setting `hbase.canary.info.bindAddress` and `hbase.canary.info.port`. To disable entirely, set the port to `-1`.
3592
3593
3594 ---
3595
3596 * [HBASE-23779](https://issues.apache.org/jira/browse/HBASE-23779) | *Major* | **Up the default fork count to make builds complete faster; make count relative to CPU count**
3597
3598 Pass --threads=2 building on jenkins. It shortens nightly build times by about ~25%.
3599
3600 It works by running module build/test in parallel when dependencies allow. Upping the forkcount beyond the pom default of 0.25C would have us broach our CPU budget on jenkins when two modules are running in parallel (2 modules at 0.25% of CPU each makes 0.5C and on jenkins, hadoop nodes run two jenkins executors per host).  Higher forkcounts also seems to threaten build stability.
3601
3602 For running tests locally, to go faster, up fork count.
3603
3604 $ x="0.5C"  ;  mvn --threads=2  -Dsurefire.firstPartForkCount=$x -Dsurefire.secondPartForkCount=$x test -PrunAllTests
3605
3606 You could up the x from 0.5C to 1.0C but YMMV (On overcommitted hardware, tests start bombing out pretty soon after startup). You could try upping thread count but on occasion are likely to overcommit hardware.
3607
3608
3609 ---
3610
3611 * [HBASE-24126](https://issues.apache.org/jira/browse/HBASE-24126) | *Major* | **Up the container nproc uplimit from 10000 to 12500**
3612
3613 Start docker with upped ulimit for nproc passing '--ulimit nproc=12500'. It was 10000, the default, but made it 12500. Then, set PROC\_LIMIT in hbase-personality so when yetus runs, it is w/ the new 12500 value.
3614
3615
3616 ---
3617
3618 * [HBASE-24150](https://issues.apache.org/jira/browse/HBASE-24150) | *Major* | **Allow module tests run in parallel**
3619
3620 Pass -T2 to mvn. Makes it so we do two modules-at-a-time dependencies willing. Helps speed build and testing. Doubles the resource usage when running modules in parallel.
3621
3622
3623 ---
3624
3625 * [HBASE-24121](https://issues.apache.org/jira/browse/HBASE-24121) | *Major* | **[Authorization] ServiceAuthorizationManager isn't dynamically updatable. And it should be.**
3626
3627 Master & RegionService now support refresh policy authorization defined in hbase-policy.xml without restarting service. To refresh policy, please execute hbase shell command: update\_config or update\_config\_all after policy file updated and synced on all nodes.
3628
3629
3630 ---
3631
3632 * [HBASE-24099](https://issues.apache.org/jira/browse/HBASE-24099) | *Major* | **Use a fair ReentrantReadWriteLock for the region close lock**
3633
3634 This change modifies the default acquisition policy for the region's close lock in order to prevent observed starvation of close requests. The new boolean configuration parameter 'hbase.regionserver.fair.region.close.lock' controls the lock acquisition policy: if true, the lock is created in fair mode (default); if false, the lock is created in nonfair mode (the old default).
3635
3636
3637 ---
3638
3639 * [HBASE-23153](https://issues.apache.org/jira/browse/HBASE-23153) | *Major* | **PrimaryRegionCountSkewCostFunction SLB function should implement CostFunction#isNeeded**
3640
3641 <!-- markdown -->
3642 The `PrimaryRegionCountSkewCostFunction` for the `StochasticLoadBalancer` is only needed when the read replicas feature is enabled. With this change, that function now properly indicates that it is not needed when the read replica feature is off.
3643
3644 If this improvement is not available, operators with clusters that are not using the read replica feature should manually disable it by setting `hbase.master.balancer.stochastic.primaryRegionCountCost` to `0.0` in hbase-site.xml for all HBase Masters.
3645
3646
3647 ---
3648
3649 * [HBASE-24055](https://issues.apache.org/jira/browse/HBASE-24055) | *Major* | **Make AsyncFSWAL can run on EC cluster**
3650
3651 Now AsyncFSWAL can also be used against the directory which has EC enabled. Need to make sure you also make use of the hadoop 3.x client as the option is only available in hadoop 3.x.
3652
3653
3654 ---
3655
3656 * [HBASE-24113](https://issues.apache.org/jira/browse/HBASE-24113) | *Major* | **Upgrade the maven we use from 3.5.4 to 3.6.3 in nightlies**
3657
3658 Branches-2.3+ use maven 3.5.3 building. Older branches use 3.5.4 still.
3659
3660
3661 ---
3662
3663 * [HBASE-24122](https://issues.apache.org/jira/browse/HBASE-24122) | *Major* | **Change machine ulimit-l to ulimit-a so dumps full ulimit rather than just 'max locked memory'**
3664
3665 Our 'Build Artifacts' have a machine directory under which we emit vitals on the host the build was run on. We used to emit the result of 'ulimit -l' as a file named 'ulimit-l'. This has been hijacked to instead emit result of running 'ulimit -a' which includes stat on ulimit -l.
3666
3667
3668 ---
3669
3670 * [HBASE-23678](https://issues.apache.org/jira/browse/HBASE-23678) | *Major* | **Literate builder API for version management in schema**
3671
3672 ColumnFamilyDescriptor new builder API:
3673
3674     /\*\*
3675      \* Retain all versions for a given TTL(retentionInterval), and then only a specific number
3676      \* of versions(versionAfterInterval) after that interval elapses.
3677      \*
3678      \* @param retentionInterval Retain all versions for this interval
3679      \* @param versionAfterInterval Retain no of versions to retain after retentionInterval
3680      \*/
3681     public ModifyableColumnFamilyDescriptor setVersionsWithTimeToLive(
3682         final int retentionInterval, final int versionAfterInterval)
3683
3684
3685 ---
3686
3687 * [HBASE-24050](https://issues.apache.org/jira/browse/HBASE-24050) | *Major* | **Deprecated PBType on all 2.x branches**
3688
3689 org.apache.hadoop.hbase.types.PBType is marked as deprecated without any replacement. It will be moved to hbase-example module and marked as IA.Private in 3.0.0. This is a mistake as it should not be part of our public API. Users who depend on this class should just copy the code your own code base.
3690
3691
3692 ---
3693
3694 * [HBASE-8868](https://issues.apache.org/jira/browse/HBASE-8868) | *Minor* | **add metric to report client shortcircuit reads**
3695
3696 Expose file system level read metrics for RegionServer.
3697
3698 If the HBase RS runs on top of HDFS, calculate the aggregation of
3699 ReadStatistics of each HdfsFileInputStream. These metrics include:
3700 (1) total number of bytes read from HDFS.
3701 (2) total number of bytes read from local DataNode.
3702 (3) total number of bytes read locally through short-circuit read.
3703 (4) total number of bytes read locally through zero-copy read.
3704
3705 Because HDFS ReadStatistics is calculated per input stream, it is not
3706 feasible to update the aggregated number in real time. Instead, the
3707 metrics are updated when an input stream is closed.
3708
3709
3710 ---
3711
3712 * [HBASE-24032](https://issues.apache.org/jira/browse/HBASE-24032) | *Major* | **[RSGroup] Assign created tables to respective rsgroup automatically instead of manual operations**
3713
3714 Admin can determine which tables go to which rsgroup by script  (setting hbase.rsgroup.table.mapping.script with local filystem path) on Master side which aims to lighten the burden of admin operations.  Note, since HBase 3+, rsgroup can be specified in TableDescriptor as well, if clients specify this, master will skip the determination from script.
3715
3716 Here is a simple example of script:
3717 {code}
3718 # Input consists of two string, 1st is the namespace of the table, 2nd is the table name of the table
3719 #!/bin/bash
3720 namespace=$1
3721 tablename=$2
3722 if [[ $namespace == test ]]; then
3723   echo test
3724 elif [[ $tablename == \*foo\* ]]; then
3725   echo other
3726 else
3727   echo default
3728 fi
3729 {code}
3730
3731
3732 ---
3733
3734 * [HBASE-23993](https://issues.apache.org/jira/browse/HBASE-23993) | *Major* | **Use loopback for zk standalone server in minizkcluster**
3735
3736 MiniZKCluster now puts up its standalone node listening on loopback/127.0.0.1 rather than "localhost".
3737
3738
3739 ---
3740
3741 * [HBASE-23986](https://issues.apache.org/jira/browse/HBASE-23986) | *Major* | **Bump hadoop-two.version to 2.10.0 on master and branch-2**
3742
3743 Bumped hadoop-two.version to 2.10.0, which means we will drop the support for hadoop-2.8.x and hadoop-2.9.x.
3744
3745
3746 ---
3747
3748 * [HBASE-23930](https://issues.apache.org/jira/browse/HBASE-23930) | *Minor* | **Shell should attempt to format \`timestamp\` attributes as ISO-8601**
3749
3750 Change timestamp display to be ISO8601 when toString on Cell and outputting in shell....
3751
3752 User used to see....
3753
3754   column=table:state, timestamp=1583967620343 .....
3755
3756 ... but now sees:
3757
3758   column=table:state, timestamp=2020-03-11T23:00:20.343Z ....
3759
3760
3761 ---
3762
3763 * [HBASE-22827](https://issues.apache.org/jira/browse/HBASE-22827) | *Major* | **Expose multi-region merge in shell and Admin API**
3764
3765 merge\_region shell command can now be used to merge more than 2 regions as well. It takes a list of regions as comma separated values or as an array of regions, and not just 2 regions. The full regionnames and encoded regionnames are continued to be accepted.
3766
3767
3768 ---
3769
3770 * [HBASE-23767](https://issues.apache.org/jira/browse/HBASE-23767) | *Major* | **Add JDK11 compilation and unit test support to Github precommit**
3771
3772 Rebuild our Dockerfile with support for multiple JDK versions. Use multiple stages in the Jenkinsfile instead of yetus's multijdk because of YETUS-953. Run those multiple stages in parallel to speed up results.
3773
3774 Note that multiple stages means multiple Yetus invocations means multiple comments on the PreCommit. This should become more obvious to users once we can make use of GitHub Checks API, HBASE-23902.
3775
3776
3777 ---
3778
3779 * [HBASE-22978](https://issues.apache.org/jira/browse/HBASE-22978) | *Minor* | **Online slow response log**
3780
3781 get\_slowlog\_responses and clear\_slowlog\_responses are used to retrieve and clear slow RPC logs from RingBuffer maintained by RegionServers.
3782
3783 New Admin APIs:
3784 1.   List\<SlowLogRecord\> getSlowLogResponses(final Set\<ServerName\> serverNames,
3785       final SlowLogQueryFilter slowLogQueryFilter) throws IOException;
3786
3787 2.   List\<Boolean\> clearSlowLogResponses(final Set\<ServerName\> serverNames)
3788       throws IOException;
3789
3790 Configs:
3791
3792 1. hbase.regionserver.slowlog.ringbuffer.size:
3793 Default size of ringbuffer to be maintained by each RegionServer in order to store online slowlog responses. This is an in-memory ring buffer of requests that were judged to be too slow in addition to the responseTooSlow logging. The in-memory representation would be complete. For more details, please look into Doc Section: Get Slow Response Log from shell
3794
3795 Default
3796 256
3797
3798 2. hbase.regionserver.slowlog.buffer.enabled:
3799 Indicates whether RegionServers have ring buffer running for storing Online Slow logs in FIFO manner with limited entries. The size of the ring buffer is indicated by config: hbase.regionserver.slowlog.ringbuffer.size The default value is false, turn this on and get latest slowlog responses with complete data.
3800
3801 Default
3802 false
3803
3804
3805 For more details, please look into "Get Slow Response Log from shell" section from HBase book.
3806
3807
3808 ---
3809
3810 * [HBASE-23926](https://issues.apache.org/jira/browse/HBASE-23926) | *Major* | **[Flakey Tests] Down the flakies re-run ferocity; it makes for too many fails.**
3811
3812 Down the flakey re-rerun fork count from 1.0C -- i.e. a fork per CPU -- to 0.25C. On a recent run, the machine had 16 cores. 0.25 is 4 cores. We'd hardcoded fork count at 3 previous to changes made by parent.
3813
3814
3815 ---
3816
3817 * [HBASE-23146](https://issues.apache.org/jira/browse/HBASE-23146) | *Major* | **Support CheckAndMutate with multiple conditions**
3818
3819 Add a checkAndMutate(row, filter) method in the AsyncTable interface and the Table interface.
3820
3821 This method atomically checks if the row matches the specified filter. If it does, it adds the Put/Delete/RowMutations.
3822
3823 This is a fluent style API, the code is like:
3824
3825 For Table interface:
3826 {code}
3827 table.checkAndMutate(row, filter).thenPut(put);
3828 {code}
3829
3830 For AsyncTable interface:
3831 {code}
3832 table.checkAndMutate(row, filter).thenPut(put)
3833     .thenAccept(succ -\> {
3834       if (succ) {
3835         System.out.println("Check and put succeeded");
3836       } else {
3837         System.out.println("Check and put failed");
3838       }
3839     });
3840 {code}
3841
3842
3843 ---
3844
3845 * [HBASE-23874](https://issues.apache.org/jira/browse/HBASE-23874) | *Minor* | **Move Jira-attached file precommit definition from script in Jenkins config to dev-support**
3846
3847 The Jira Precommit job (https://builds.apache.org/job/PreCommit-HBASE-Build/) will now look for a file within the source tree (dev-support/jenkins\_precommit\_jira\_yetus.sh) instead of depending on a script section embedded in the job.
3848
3849
3850 ---
3851
3852 * [HBASE-23865](https://issues.apache.org/jira/browse/HBASE-23865) | *Major* | **Up flakey history from 5 to 10**
3853
3854 Changed flakey list reporting to show 5 rather than 10 items. Also changed the second and first part fort counts to be 1C rather than hardcoded 3.
3855
3856
3857 ---
3858
3859 * [HBASE-23554](https://issues.apache.org/jira/browse/HBASE-23554) | *Major* | **Encoded regionname to regionname utility**
3860
3861     Adds shell command regioninfo:
3862
3863       hbase(main):001:0\>  regioninfo '0e6aa5c19ae2b2627649dc7708ce27d0'
3864       {ENCODED =\> 0e6aa5c19ae2b2627649dc7708ce27d0, NAME =\> 'TestTable,,1575941375972.0e6aa5c19ae2b2627649dc7708ce27d0.', STARTKEY =\> '', ENDKEY =\> '00000000000000000000299441'}
3865       Took 0.4737 seconds
3866
3867
3868 ---
3869
3870 * [HBASE-23350](https://issues.apache.org/jira/browse/HBASE-23350) | *Major* | **Make compaction files cacheonWrite configurable based on threshold**
3871
3872 This JIRA adds a new configuration - \`hbase.rs.cachecompactedblocksonwrite.threshold\`. This configuration is the maximum total size (in bytes) of the compacted files below which the configuration \`hbase.rs.cachecompactedblocksonwrite\` is honoured. If the total size of the compacted fies exceeds this threshold, even when \`hbase.rs.cachecompactedblocksonwrite\` is enabled, the data blocks are not cached. Caching index and bloom blocks is not affected by this configuration (user configuration is always honoured).
3873
3874 Default value of this configuration is Long.MAX\_VALUE. This means whatever the total size of the compacted files, it wil be cached.
3875
3876
3877 ---
3878
3879 * [HBASE-17115](https://issues.apache.org/jira/browse/HBASE-17115) | *Major* | **HMaster/HRegion Info Server does not honour admin.acl**
3880
3881 Implements authorization for the HBase Web UI by limiting access to certain endpoints which could be used to extract sensitive information from HBase.
3882
3883 Access to these restricted endpoints can be limited to a group of administrators, identified either by a list of users (hbase.security.authentication.spnego.admin.users) or by a list of groups
3884 (hbase.security.authentication.spnego.admin.groups).  By default, neither of these values are set which will preserve backwards compatibility (allowing all authenticated users to access all endpoints).
3885
3886 Further, users who have sensitive information in the HBase service configuration can set hbase.security.authentication.ui.config.protected to true which will treat the configuration endpoint as a protected, admin-only resource. By default, all authenticated users may access the configuration endpoint.
3887
3888
3889 ---
3890
3891 * [HBASE-23647](https://issues.apache.org/jira/browse/HBASE-23647) | *Major* | **Make MasterRegistry the default registry impl**
3892
3893 <!-- markdown -->
3894 Enables master based registry as the default registry used by clients to fetch connection metadata.
3895 Refer to the section "Master Registry" in the client documentation for more details and advantages
3896 of this implementation over the default Zookeeper based registry.
3897
3898 Configuration parameter that controls the registry in use: `hbase.client.registry.impl`
3899
3900 Where to set this: HBase client configuration (hbase-site.xml)
3901
3902 Possible values:
3903 - `org.apache.hadoop.hbase.client.ZKConnectionRegistry` (For ZK based registry implementation)
3904 - `org.apache.hadoop.hbase.client.MasterRegistry` (New, for master based registry implementation)
3905
3906 Notes on defaults:
3907
3908 - For v3.0.0 and later, MasterRegistry is the default registry
3909 - For all releases in 2.x line, ZK based registry is the default.
3910
3911 This feature has been back ported to 2.3.0 and later releases. MasterRegistry can be enabled by setting the following client configuration.
3912
3913 ```
3914 <property>
3915   <name>hbase.client.registry.impl</name>
3916   <value>org.apache.hadoop.hbase.client.MasterRegistry</value>
3917 </property>
3918 ```
3919
3920
3921 ---
3922
3923 * [HBASE-23069](https://issues.apache.org/jira/browse/HBASE-23069) | *Critical* | **periodic dependency bump for Sep 2019**
3924
3925 caffeine: 2.6.2 =\> 2.8.1
3926 commons-codec: 1.10 =\> 1.13
3927 commons-io: 2.5 =\> 2.6
3928 disrupter: 3.3.6 =\> 3.4.2
3929 httpcore: 4.4.6 =\> 4.4.13
3930 jackson: 2.9.10 =\> 2.10.1
3931 jackson.databind: 2.9.10.1 =\> 2.10.1
3932 jetty: 9.3.27.v20190418 =\> 9.3.28.v20191105
3933 protobuf.plugin: 0.5.0 =\> 0.6.1
3934 zookeeper: 3.4.10 =\> 3.4.14
3935 slf4j: 1.7.25 =\> 1.7.30
3936 rat: 0.12 =\> 0.13
3937 asciidoctor: 1.5.5 =\> 1.5.8
3938 asciidoctor.pdf: 1.5.0-alpha.15 =\> 1.5.0-rc.2
3939 error-prone: 2.3.3 =\> 2.3.4
3940
3941
3942 ---
3943
3944 * [HBASE-23686](https://issues.apache.org/jira/browse/HBASE-23686) | *Major* | **Revert binary incompatible change and remove reflection**
3945
3946 - Reverts a binary incompatible binary change for ByteRangeUtils
3947 - Usage of reflection inside CommonFSUtils removed
3948
3949
3950 ---
3951
3952 * [HBASE-23347](https://issues.apache.org/jira/browse/HBASE-23347) | *Major* | **Pluggable RPC authentication**
3953
3954 This change introduces an internal abstraction layer which allows for new SASL-based authentication mechanisms to be used inside HBase services. All existing SASL-based authentication mechanism were ported to the new abstraction, making no external change in runtime semantics, client API, or RPC serialization format.
3955
3956 Developers familiar with extending HBase can implement authentication mechanism beyond simple Kerberos and DelegationTokens which authenticate HBase users against some other user database. HBase service authentication (Master to/from RegionServer) continue to operate solely over Kerberos.
3957
3958
3959 ---
3960
3961 * [HBASE-23156](https://issues.apache.org/jira/browse/HBASE-23156) | *Major* | **start-hbase.sh failed with ClassNotFoundException when build with hadoop3**
3962
3963 Introduce a new hbase-assembly/src/main/assembly/hadoop-three-compat.xml for build with hadoop 3.x.
3964
3965
3966 ---
3967
3968 * [HBASE-23680](https://issues.apache.org/jira/browse/HBASE-23680) | *Major* | **RegionProcedureStore missing cleaning of hfile archive**
3969
3970 Add a new config to hbase-default.xml
3971
3972   \<property\>
3973     \<name\>hbase.procedure.store.region.hfilecleaner.plugins\</name\>
3974     \<value\>org.apache.hadoop.hbase.master.cleaner.TimeToLiveHFileCleaner\</value\>
3975     \<description\>A comma-separated list of BaseHFileCleanerDelegate invoked by
3976     the RegionProcedureStore HFileCleaner service. These HFiles cleaners are
3977     called in order, so put the cleaner that prunes the most files in front. To
3978     implement your own BaseHFileCleanerDelegate, just put it in HBase's classpath
3979     and add the fully qualified class name here. Always add the above
3980     default hfile cleaners in the list as they will be overwritten in
3981     hbase-site.xml.\</description\>
3982   \</property\>
3983
3984 It will share the same TTL with other HFileCleaners. And you can also implement your own cleaner and change this property to enable it.
3985
3986
3987 ---
3988
3989 * [HBASE-23675](https://issues.apache.org/jira/browse/HBASE-23675) | *Minor* | **Move to Apache parent POM version 22**
3990
3991 Updated parent pom to Apache version 22.
3992
3993
3994 ---
3995
3996 * [HBASE-23679](https://issues.apache.org/jira/browse/HBASE-23679) | *Critical* | **FileSystem instance leaks due to bulk loads with Kerberos enabled**
3997
3998 This issues fixes an issue with Bulk Loading on installations with Kerberos enabled and more than a single RegionServer. When multiple tables are involved in hosting a table's regions which are being bulk-loaded into, all but the RegionServer hosting the table's first Region will "leak" one DistributedFileSystem object onto the heap, never freeing that memory. Eventually, with enough bulk loads, this will create a situation for RegionServers where they have no free heap space and will either spend all time in JVM GC, lose their ZK session, or crash with an OutOfMemoryError.
3999
4000 The only mitigation for this issue is to periodically restart RegionServers. All earlier versions of HBase 2.x are subject to this issue (2.0.x, \<=2.1.8, \<=2.2.3)
4001
4002
4003 ---
4004
4005 * [HBASE-23286](https://issues.apache.org/jira/browse/HBASE-23286) | *Major* | **Improve MTTR: Split WAL to HFile**
4006
4007 Add a new feature to improve MTTR which have 3 steps to failover:
4008 1. Read WAL and write HFile to region’s column family’s recovered.hfiles directory.
4009 2. Open region.
4010 3. Bulkload the recovered.hfiles for every column family.
4011
4012 Compared to DLS(distributed log split), this feature will reduce region open time significantly.
4013
4014 Config hbase.wal.split.to.hfile to true to enable this featue.
4015
4016
4017 ---
4018
4019 * [HBASE-23619](https://issues.apache.org/jira/browse/HBASE-23619) | *Trivial* | **Use built-in formatting for logging in hbase-zookeeper**
4020
4021 Changed the logging in hbase-zookeeper to use built-in formatting
4022
4023
4024 ---
4025
4026 * [HBASE-23628](https://issues.apache.org/jira/browse/HBASE-23628) | *Minor* | **Replace Apache Commons Digest Base64 with JDK8 Base64**
4027
4028 From the PR:
4029
4030 "Yes. The two create the same output... I just wrote a small test suite to increase my confidence on that. I generated many tens of millions of random byte patterns and compared the output of the two algorithms. They came back identical every time.
4031
4032 "Just in case any inquiring minds would like to know, there is no longer an encoding required when generating the strings. The JDK implementation specifically specifies that strings returned are StandardCharsets.ISO\_8859\_1. This does not change anything because UTF8 and ISO\_8859 overlap for the limited character set (64 characters) the encoding uses."
4033
4034
4035 ---
4036
4037 * [HBASE-23651](https://issues.apache.org/jira/browse/HBASE-23651) | *Major* | **Region balance throttling can be disabled**
4038
4039 Set hbase.balancer.max.balancing to a int value which \<=0 will disable region balance throttling.
4040
4041
4042 ---
4043
4044 * [HBASE-23588](https://issues.apache.org/jira/browse/HBASE-23588) | *Major* | **Cache index blocks and bloom blocks on write if CacheCompactedBlocksOnWrite is enabled**
4045
4046 If cacheOnWrite is enabled during flush or compaction, index and bloom blocks(with data blocks) would be automatically cached during write.
4047
4048
4049 ---
4050
4051 * [HBASE-23369](https://issues.apache.org/jira/browse/HBASE-23369) | *Major* | **Auto-close 'unknown' Regions reported as OPEN on RegionServers**
4052
4053 If a RegionServer reports a Region as OPEN in disagreement with Master's status on the Region, the Master now tells the RegionServer to silently close the Region.
4054
4055
4056 ---
4057
4058 * [HBASE-23596](https://issues.apache.org/jira/browse/HBASE-23596) | *Major* | **HBCKServerCrashProcedure can double assign**
4059
4060 Makes it so the recently added HBCKServerCrashProcedure -- the SCP that gets invoked when an operator schedules an SCP via hbck2 scheduleRecoveries command -- now works the same as SCP EXCEPT if master knows nothing of the scheduled servername. In this latter case, HBCKSCP will do a full scan of hbase:meta looking for instances of the passed servername. If any found it will attempt cleanup of hbase:meta references by reassigning any found OPEN or OPENING and by closing any in CLOSING state.
4061
4062 Used to fix instances of what the 'HBCK Report' page shows as 'Unknown Servers'.
4063
4064
4065 ---
4066
4067 * [HBASE-23624](https://issues.apache.org/jira/browse/HBASE-23624) | *Major* | **Add a tool to dump the procedure info in HFile**
4068
4069 Use ./hbase org.apache.hadoop.hbase.procedure2.store.region.HFileProcedurePrettyPrinter to run the tool.
4070
4071
4072 ---
4073
4074 * [HBASE-23590](https://issues.apache.org/jira/browse/HBASE-23590) | *Major* | **Update maxStoreFileRefCount to maxCompactedStoreFileRefCount**
4075
4076 RegionsRecoveryChore introduced as part of HBASE-22460 tries to reopen regions based on config: hbase.regions.recovery.store.file.ref.count.
4077 Region reopen needs to take into consideration all compacted away store files that belong to the region and not store files(non-compacted).
4078
4079 Fixed this bug as part of this Jira.
4080 Updated description for corresponding configs:
4081
4082 1. hbase.master.regions.recovery.check.interval :
4083
4084 Regions Recovery Chore interval in milliseconds. This chore keeps running at this interval to find all regions with configurable max store file ref count and reopens them. Defaults to 20 mins
4085
4086 2. hbase.regions.recovery.store.file.ref.count :
4087
4088 Very large number of ref count on a compacted store file indicates that it is a ref leak on that object(compacted store file). Such files can not be removed after it is invalidated via compaction. Only way to recover in such scenario is to reopen the region which can release all resources, like the refcount, leases, etc. This config represents Store files Ref Count threshold value considered for reopening regions. Any region with compacted store files ref count \> this value would be eligible for reopening by master. Here, we get the max refCount among all refCounts on all compacted away store files that belong to a particular region. Default value -1 indicates this feature is turned off. Only positive integer value should be provided to enable this feature.
4089
4090
4091 ---
4092
4093 * [HBASE-23618](https://issues.apache.org/jira/browse/HBASE-23618) | *Major* | **Add a tool to dump procedure info in the WAL file**
4094
4095 Use ./hbase org.apache.hadoop.hbase.procedure2.store.region.WALProcedurePrettyPrinter to run the tool.
4096
4097
4098 ---
4099
4100 * [HBASE-23617](https://issues.apache.org/jira/browse/HBASE-23617) | *Major* | **Add a stress test tool for region based procedure store**
4101
4102 Use ./hbase org.apache.hadoop.hbase.procedure2.store.region.RegionProcedureStorePerformanceEvaluation to run the tool.
4103
4104
4105 ---
4106
4107 * [HBASE-23326](https://issues.apache.org/jira/browse/HBASE-23326) | *Critical* | **Implement a ProcedureStore which stores procedures in a HRegion**
4108
4109 Use a region based procedure store to replace the old customized WAL based procedure store. The procedure data migration is done automatically during upgrading. After upgrading, the MasterProcWALs directory will be deleted and a new MasterProc directory will be created. And notice that a region will still write WAL so we still have WAL files and they will be moved to the oldWALs directory. The file name is mostly like a normal WAL file, and the only difference is that it is ended with "$masterproc$".
4110
4111
4112 ---
4113
4114 * [HBASE-23320](https://issues.apache.org/jira/browse/HBASE-23320) | *Major* | **Upgrade surefire plugin to 3.0.0-M4**
4115
4116 Bumped surefire plugin to 3.0.0-M4
4117
4118
4119 ---
4120
4121 * [HBASE-20461](https://issues.apache.org/jira/browse/HBASE-20461) | *Major* | **Implement fsync for AsyncFSWAL**
4122
4123 Now AsyncFSWAL also supports Durability.FSYNC\_WAL.
4124
4125
4126 ---
4127
4128 * [HBASE-23066](https://issues.apache.org/jira/browse/HBASE-23066) | *Minor* | **Create a config that forces to cache blocks on compaction**
4129
4130 The configuration 'hbase.rs.cacheblocksonwrite' was used to enable caching the blocks on write. But purposefully we were not caching the blocks when we do compaction (since it may be very aggressive) as the caching happens as and when the writer completes a block.
4131 In cloud environments since they have bigger sized caches - though they try to enable 'hbase.rs.prefetchblocksonopen' (non - aggressive way of caching the blocks proactively on reader creation) it does not help them because it takes time to cache the compacted blocks.
4132 This feature creates a new configuration  'hbase.rs.cachecompactedblocksonwrite' which when set to 'true' will enable the blocks created out of compaction.
4133 Remember that since it is aggressive caching the user should be having enough cache space - if not it may lead to other active blocks getting evicted.
4134 From the shell this can be enabled by using the option per Column Family also by using the below format
4135 {code}
4136 create 't1', 'f1', {NUMREGIONS =\> 15, SPLITALGO =\> 'HexStringSplit', CONFIGURATION =\> {'hbase.rs.cachecompactedblocksonwrite' =\> 'true'}}
4137 {code}
4138
4139
4140 ---
4141
4142 * [HBASE-23239](https://issues.apache.org/jira/browse/HBASE-23239) | *Major* | **Reporting on status of backing MOB files from client-facing cells**
4143
4144 <!-- markdown -->
4145
4146 Users of the MOB feature can now use the `mobrefs` utility to get statistics about data in the MOB system and verify the health of backing files on HDFS.
4147
4148 ```
4149 HADOOP_CLASSPATH=/etc/hbase/conf:$(hbase mapredcp) yarn jar \
4150     /some/path/to/hbase-shaded-mapreduce.jar mobrefs mobrefs-report-output some_table foo
4151 ```
4152
4153 See javadocs of the class `MobRefReporter` for more details.
4154
4155 the reference guide has added some information about MOB internals and troubleshooting.
4156
4157
4158 ---
4159
4160 * [HBASE-23549](https://issues.apache.org/jira/browse/HBASE-23549) | *Minor* | **Document steps to disable MOB for a column family**
4161
4162 The reference guide now includes a walk through of disabling the MOB feature if needed while maintaining availability.
4163
4164
4165 ---
4166
4167 * [HBASE-23582](https://issues.apache.org/jira/browse/HBASE-23582) | *Minor* | **Unbalanced braces in string representation of table descriptor**
4168
4169 Fixed unbalanced braces in string representation within HBase shell
4170
4171
4172 ---
4173
4174 * [HBASE-23293](https://issues.apache.org/jira/browse/HBASE-23293) | *Minor* | **[REPLICATION] make ship edits timeout configurable**
4175
4176 The default rpc timeout for ReplicationSourceShipper#shipEdits is 60s, when bulkload replication enabled, timeout exception may be occurred.
4177 Now we can conf the timeout value through replication.source.shipedits.timeout, and it’s adaptive.
4178
4179
4180 ---
4181
4182 * [HBASE-23312](https://issues.apache.org/jira/browse/HBASE-23312) | *Major* | **HBase Thrift SPNEGO configs (HBASE-19852) should be backwards compatible**
4183
4184 The newer HBase Thrift SPNEGO configs should not be required. The hbase.thrift.spnego.keytab.file and hbase.thrift.spnego.principal configs will fall back to the hbase.thrift.keytab.file and hbase.thrift.kerberos.principal original configs. The older configs will log a deprecation warning. It is preferred to new the newer SPNEGO configurations.
4185
4186
4187 ---
4188
4189 * [HBASE-22969](https://issues.apache.org/jira/browse/HBASE-22969) | *Minor* | **A new binary component comparator(BinaryComponentComparator) to perform comparison of arbitrary length and position**
4190
4191 With BinaryComponentCompartor applications will be able to design diverse and powerful set of filters for rows and columns. See https://issues.apache.org/jira/browse/HBASE-22969 for example. In general, the comparator can be used with any filter taking ByteArrayComparable. As of now, following filters take ByteArrayComparable:
4192
4193 1. RowFilter
4194 2. ValueFilter
4195 3. QualifierFilter
4196 4. FamilyFilter
4197 5. ColumnValueFilter
4198
4199
4200 ---
4201
4202 * [HBASE-23234](https://issues.apache.org/jira/browse/HBASE-23234) | *Major* | **Provide .editorconfig based on checkstyle configuration**
4203
4204 Adds a .editorconfig file with configurations populated by IntelliJ, based on our checkstyle configuration. There's lots of IntelliJ-specific configs in here that I assume are not replicated to Eclipse or Netbeans users. Any devs using those tools should push whatever updates they see fit, but please start with the checkstyle configs as the origin of truth.
4205
4206
4207 ---
4208
4209 * [HBASE-23322](https://issues.apache.org/jira/browse/HBASE-23322) | *Minor* | **[hbck2] Simplification on HBCKSCP scheduling**
4210
4211 An hbck2 scheduleRecoveries will run a subclass of ServerCrashProcedure which asks Master what Regions were on the dead Server but it will also do a hbase:meta table scan to see if any vestiges of the old Server remain (for the case where an SCP failed mid-point leaving references in place or where Master and hbase:meta deviated in accounting).
4212
4213
4214 ---
4215
4216 * [HBASE-23321](https://issues.apache.org/jira/browse/HBASE-23321) | *Minor* | **[hbck2] fixHoles of fixMeta doesn't update in-memory state**
4217
4218 If holes in hbase:meta, hbck2 fixMeta now will update Master in-memory state so you do not need to restart master just so you can assign the new hole-bridging regions.
4219
4220
4221 ---
4222
4223 * [HBASE-23282](https://issues.apache.org/jira/browse/HBASE-23282) | *Major* | **HBCKServerCrashProcedure for 'Unknown Servers'**
4224
4225 hbck2 scheduleRecoveries will now run a SCP that also looks in hbase:meta for any references to the scheduled server -- not just consult Master in-memory state -- just in case vestiges of the server are leftover in hbase:meta
4226
4227
4228 ---
4229
4230 * [HBASE-19450](https://issues.apache.org/jira/browse/HBASE-19450) | *Minor* | **Add log about average execution time for ScheduledChore**
4231
4232 <!-- markdown -->
4233 HBase internal chores now log a moving average of how long execution of each chore takes at `INFO` level for the logger `org.apache.hadoop.hbase.ScheduledChore`.
4234
4235 Such messages will happen at most once per five minutes.
4236
4237
4238 ---
4239
4240 * [HBASE-23250](https://issues.apache.org/jira/browse/HBASE-23250) | *Minor* | **Log message about CleanerChore delegate initialization should be at INFO**
4241
4242 CleanerChore delegate initialization is now logged at INFO level instead of DEBUG
4243
4244
4245 ---
4246
4247 * [HBASE-23243](https://issues.apache.org/jira/browse/HBASE-23243) | *Major* | **[pv2] Filter out SUCCESS procedures; on decent-sized cluster, plethora overwhelms problems**
4248
4249 The 'Procedures & Locks' tab in Master UI only displays problematic Procedures now (RUNNABLE, WAITING-TIMEOUT, etc.). It no longer notes procedures whose state is SUCCESS.
4250
4251
4252 ---
4253
4254 * [HBASE-23227](https://issues.apache.org/jira/browse/HBASE-23227) | *Blocker* | **Upgrade jackson-databind to 2.9.10.1 to avoid recent CVEs**
4255
4256 <!-- markdown -->
4257
4258 the Apache HBase REST Proxy now uses Jackson Databind version 2.9.10.1 to address the following CVEs
4259
4260   - CVE-2019-16942
4261   - CVE-2019-16943
4262
4263 Users of prior releases with Jackson Databind 2.9.10 are advised to either upgrade to this release or to upgrade their local Jackson Databind jar directly.
4264
4265
4266 ---
4267
4268 * [HBASE-23222](https://issues.apache.org/jira/browse/HBASE-23222) | *Critical* | **Better logging and mitigation for MOB compaction failures**
4269
4270 <!-- markdown -->
4271
4272 The MOB compaction process in the HBase Master now logs more about its activity.
4273
4274 In the event that you run into the problems described in HBASE-22075, there is a new HFileCleanerDelegate that will stop all removal of MOB hfiles from the archive area. It can be configured by adding `org.apache.hadoop.hbase.mob.ManualMobMaintHFileCleaner` to the list configured for `hbase.master.hfilecleaner.plugins`. This new cleaner delegate will cause your archive area to grow unbounded; you will have to manually prune files which may be prohibitively complex. Consider if your use case will allow you to mitigate by disabling mob compactions instead.
4275
4276 Caveats:
4277 * Be sure the list of cleaner delegates still includes the default cleaners you will likely need: ttl, snapshot, and hlink.
4278 * Be mindful that if you enable this cleaner delegate then there will be *no* automated process for removing these mob hfiles. You should see a single region per table in `%hbase_root%/archive` that accumulates files over time. You will have to determine which of these files are safe or not to remove.
4279 * You should list this cleaner delegate after the snapshot and hlink delegates so that you can enable sufficient logging to determine when an archived mob hfile is needed by those subsystems. When set to `TRACE` logging, the CleanerChore logger will include archive retention decision justifications.
4280 * If your use case creates a large number of uniquely named tables, this new delegate will cause memory pressure on the master.
4281
4282
4283 ---
4284
4285 * [HBASE-15519](https://issues.apache.org/jira/browse/HBASE-15519) | *Major* | **Add per-user metrics**
4286
4287 Adds per-user metrics for reads/writes to each RegionServer. These metrics are exported by default. hbase.regionserver.user.metrics.enabled can be used to disable the feature if desired for any reason.
4288
4289
4290 ---
4291
4292 * [HBASE-22460](https://issues.apache.org/jira/browse/HBASE-22460) | *Minor* | **Reopen a region if store reader references may have leaked**
4293
4294 Leaked store files can not be removed even after it is invalidated via compaction. A reasonable mitigation for a reader reference leak would be a fast reopen of the region on the same server.
4295
4296 Configs:
4297
4298 1. hbase.master.regions.recovery.check.interval :
4299
4300 Regions Recovery Chore interval in milliseconds. This chore keeps running at this interval to find all regions with configurable max store file ref count and reopens them. Defaults to 20 mins
4301
4302 2. hbase.regions.recovery.store.file.ref.count :
4303
4304 This config represents Store files Ref Count threshold value considered for reopening regions. Any region with store files ref count \> this value would be eligible for reopening by master. Default value -1 indicates this feature is turned off. Only positive integer value should be provided to enable this feature.
4305
4306
4307 ---
4308
4309 * [HBASE-23172](https://issues.apache.org/jira/browse/HBASE-23172) | *Minor* | **HBase Canary region success count metrics reflect column family successes, not region successes**
4310
4311 Added a comment to make clear that read/write success counts are tallying column family success counts, not region success counts.
4312
4313 Additionally, the region read and write latencies previously only stored the latencies of the last column family of the region reads/writes. This has been fixed by using a map of each region to a list of read and write latency values.
4314
4315
4316 ---
4317
4318 * [HBASE-23177](https://issues.apache.org/jira/browse/HBASE-23177) | *Major* | **If fail to open reference because FNFE, make it plain it is a Reference**
4319
4320 Changes the message on the FNFE exception thrown when the file a Reference points to is missing; the message now includes detail on Reference as well as pointed-to file so can connect how FNFE relates to region open.
4321
4322
4323 ---
4324
4325 * [HBASE-20626](https://issues.apache.org/jira/browse/HBASE-20626) | *Major* | **Change the value of "Requests Per Second" on WEBUI**
4326
4327 Use 'totalRowActionRequestCount' to calculate QPS on web UI.
4328
4329
4330 ---
4331
4332 * [HBASE-22874](https://issues.apache.org/jira/browse/HBASE-22874) | *Critical* | **Define a public interface for Canary and move existing implementation to LimitedPrivate**
4333
4334 <!-- markdown -->
4335 Downstream users who wish to programmatically check the health of their HBase cluster may now rely on a public interface derived from the previously private implementation of the canary cli tool. The interface is named `Canary` and can be found in the user facing javadocs.
4336
4337 Downstream users who previously relied on the invoking the canary via the Java classname (either on the command line or programmatically) will need to change how they do so because the non-public implementation has moved.
4338
4339
4340 ---
4341
4342 * [HBASE-23035](https://issues.apache.org/jira/browse/HBASE-23035) | *Major* | **Retain region to the last RegionServer make the failover slower**
4343
4344 Since 2.0.0，when one regionserver crashed and back online again, AssignmentManager will retain the region locations and try assign the regions to this regionserver(same host:port with the crashed one) again. But for 1.x.x, the behavior is round-robin assignment for the regions belong to the crashed regionserver. This jira change the "retain" assignment to round-robin assignment, which is same with 1.x.x version. This change will make the failover faster and improve availability.
4345
4346
4347 ---
4348
4349 * [HBASE-23046](https://issues.apache.org/jira/browse/HBASE-23046) | *Minor* | **Remove compatibility case from truncate command**
4350
4351 Remove backward compatibility from \`truncate\` and \`truncate\_preserve\` shell commands. This means that these commands from HBase Clients are not compatible with pre-0.99 HBase clusters.
4352
4353
4354 ---
4355
4356 * [HBASE-23040](https://issues.apache.org/jira/browse/HBASE-23040) | *Minor* | **region mover gives NullPointerException instead of saying a host isn't in the cluster**
4357
4358 giving the region mover "unload" command a region server name that isn't recognized by the cluster results in a "I don't know about that host" message instead of a NPE.
4359
4360 set log level to DEBUG if you'd like the region mover to log the set of region server names it got back from the cluster.
4361
4362
4363 ---
4364
4365 * [HBASE-21874](https://issues.apache.org/jira/browse/HBASE-21874) | *Major* | **Bucket cache on Persistent memory**
4366
4367 Added a new IOEngine type for Bucket cache ie Persistent memory. In order to use BC over pmem configure IOEngine as
4368 \<property\>
4369     \<name\>hbase.bucketcache.ioengine\</name\>
4370     \<value\> pmem:///path in persistent memory \</value\>
4371   \</property\>
4372
4373
4374 ---
4375
4376 * [HBASE-22760](https://issues.apache.org/jira/browse/HBASE-22760) | *Major* | **Stop/Resume Snapshot Auto-Cleanup activity with shell command**
4377
4378 By default, snapshot auto cleanup based on TTL would be enabled for any new cluster. At any point in time, if snapshot cleanup is supposed to be stopped due to some snapshot restore activity or any other reason, it is advisable to disable it using shell command:
4379 hbase\> snapshot\_cleanup\_switch false
4380
4381 We can re-enable it using:
4382 hbase\> snapshot\_cleanup\_switch true
4383
4384 We can query whether snapshot auto cleanup is enabled for cluster using:
4385 hbase\> snapshot\_cleanup\_enabled
4386
4387
4388 ---
4389
4390 * [HBASE-22796](https://issues.apache.org/jira/browse/HBASE-22796) | *Major* | **[HBCK2] Add fix of overlaps to fixMeta hbck Service**
4391
4392 Adds fix of overlaps to the fixMeta hbck service method. Uses the bulk-merge facility. Merges a max of 10 at a time. Set hbase.master.metafixer.max.merge.count to higher if you want to do more than 10 in the one go.
4393
4394
4395 ---
4396
4397 * [HBASE-21745](https://issues.apache.org/jira/browse/HBASE-21745) | *Critical* | **Make HBCK2 be able to fix issues other than region assignment**
4398
4399 This issue adds via its subtasks:
4400
4401  \* An 'HBCK Report' page to the Master UI added by HBASE-22527+HBASE-22709+HBASE-22723+ (since 2.1.6, 2.2.1, 2.3.0). Lists consistency or anomalies found via new hbase:meta consistency checking extensions added to CatalogJanitor (holes, overlaps, bad servers) and by a new 'HBCK chore' that runs at a lesser periodicity that will note filesystem orphans and overlaps as well as the following conditions:
4402  \*\* Master thought this region opened, but no regionserver reported it.
4403  \*\* Master thought this region opened on Server1, but regionserver reported Server2
4404  \*\* More than one regionservers reported opened this region
4405  Both chores can be triggered from the shell to regenerate ‘new’ reports.
4406  \* Means of scheduling a ServerCrashProcedure (HBASE-21393).
4407  \* An ‘offline’ hbase:meta rebuild (HBASE-22680).
4408  \* Offline replace of hbase.version and hbase.id
4409  \* Documentation on how to use completebulkload tool to ‘adopt’ orphaned data found by new HBCK2 ‘filesystem’ check (see below) and ‘HBCK chore’ (HBASE-22859)
4410  \* A ‘holes’ and ‘overlaps’ fix that runs in the master that uses new bulk-merge facility to collapse many overlaps in the one go.
4411  \* hbase-operator-tools HBCK2 client tool got a bunch of additions:
4412  \*\* A specialized 'fix' for the case where operators ran old hbck 'offlinemeta' repair and destroyed their hbase:meta; it ties together holes in meta with orphaned data in the fs (HBASE-22567)
4413  \*\* A ‘filesystem’ command that reports on orphan data as well as bad references and hlinks with a ‘fix’ for the latter two options (based on hbck1 facility updated).
4414  \*\* Adds back the ‘replication’ fix facility from hbck1 (HBASE-22717)
4415
4416 The compound result is that hbck2 is now in excess of hbck1 abilities. The provided functionality is disaggregated as per the hbck2 philosophy of providing 'plumbing' rather than 'porcelain' so there is work to do still adding fix-it playbooks, scripting across outages, and automation.
4417
4418
4419 ---
4420
4421 * [HBASE-22802](https://issues.apache.org/jira/browse/HBASE-22802) | *Major* | **Avoid temp ByteBuffer allocation in FileIOEngine#read**
4422
4423 HBASE-21879 introduces a utility class (org.apache.hadoop.hbase.io.ByteBuffAllocator) used for allocating/freeing ByteBuffers from/to NIO ByteBuffer pool, when BucketCache enabled with file or mmap engine, we will use this ByteBuffer pool to avoid temp ByteBuffer allocation a lot.
4424
4425
4426 ---
4427
4428 * [HBASE-11062](https://issues.apache.org/jira/browse/HBASE-11062) | *Major* | **hbtop**
4429
4430 Introduces hbtop that's a real-time monitoring tool for HBase like Unix's top command. See the ref guide for the details: https://hbase.apache.org/book.html#hbtop
4431
4432
4433 ---
4434
4435 * [HBASE-21879](https://issues.apache.org/jira/browse/HBASE-21879) | *Major* | **Read HFile's block to ByteBuffer directly instead of to byte for reducing young gc purpose**
4436
4437 Before this issue, read path was 100% offheap when block is in the BucketCache. But if a cache miss, then the RS needs to read the block via an on-heap API which causes high young-GC pressure.
4438
4439 This issue adds reading the block via offheap even if reading the block from filesystem directly.  It requires hadoop version(\>=2.9.3) but can also work with older hadoop versions (all works but we continue to read block onheap). It also requires HBASE-21946 which is not yet in place as of this writing/hbase-2.3.0.
4440
4441 We have written a careful doc about the implementation, performance and practice here: https://docs.google.com/document/d/1xSy9axGxafoH-Qc17zbD2Bd--rWjjI00xTWQZ8ZwI\_E/edit#heading=h.nch5d72p27ex
4442
4443
4444 ---
4445
4446 * [HBASE-22618](https://issues.apache.org/jira/browse/HBASE-22618) | *Major* | **added the possibility to load custom cost functions**
4447
4448 <!-- markdown -->
4449 Extends `StochasticLoadBalancer` to support user-provided cost function. These are loaded in addition to the default set of cost functions. Custom function implementations must extend `StochasticLoadBalancer$CostFunction`. Enable any additional functions by placing them on the master class path and configuring `hbase.master.balancer.stochastic.additionalCostFunctions` with a comma-separated list of fully-qualified class names.
4450
4451
4452 ---
4453
4454 * [HBASE-22867](https://issues.apache.org/jira/browse/HBASE-22867) | *Critical* | **The ForkJoinPool in CleanerChore will spawn thousands of threads in our cluster with thousands table**
4455
4456 Replace the ForkJoinPool in CleanerChore by ThreadPoolExecutor which can limit the spawn thread size and avoid  the master GC frequently.  The replacement is an internal implementation in CleanerChore,  so no config key change, the upstream users can just upgrade the hbase master without any other change.
4457
4458
4459 ---
4460
4461 * [HBASE-22810](https://issues.apache.org/jira/browse/HBASE-22810) | *Major* | **Initialize an separate ThreadPoolExecutor for taking/restoring snapshot**
4462
4463 Introduced a new config key for the snapshot taking/restoring operations at master side:  hbase.master.executor.snapshot.threads, its default value is 3.  means we can have 3 snapshot operations running at the same time.
4464
4465
4466 ---
4467
4468 * [HBASE-22863](https://issues.apache.org/jira/browse/HBASE-22863) | *Major* | **Avoid Jackson versions and dependencies with known CVEs**
4469
4470 1. Stopped exposing vulnerable Jackson1 dependencies so that downstreamers would not pull it in from HBase.
4471 2. However, since Hadoop requires some Jackson1 dependencies, put vulnerable Jackson mapper at test scope in some HBase modules and hence, HBase tarball created by hbase-assembly contains Jackson1 mapper jar in lib. Still, downsteam applications can't pull in Jackson1 from HBase.
4472
4473
4474 ---
4475
4476 * [HBASE-22841](https://issues.apache.org/jira/browse/HBASE-22841) | *Major* | **TimeRange's factory functions do not support ranges, only \`allTime\` and \`at\`**
4477
4478 Add serveral API in TimeRange class for avoiding using the deprecated TimeRange constructor:
4479 \* TimeRange#from: Represents the time interval [minStamp, Long.MAX\_VALUE)
4480 \* TimeRange#until: Represents the time interval [0, maxStamp)
4481 \* TimeRange#between: Represents the time interval [minStamp, maxStamp)
4482
4483
4484 ---
4485
4486 * [HBASE-22833](https://issues.apache.org/jira/browse/HBASE-22833) | *Minor* | **MultiRowRangeFilter should provide a method for creating a filter which is functionally equivalent to multiple prefix filters**
4487
4488 Provide a public method in MultiRowRangeFilter class to speed the requirement of filtering with multiple row prefixes, it will expand the row prefixes as multiple rowkey ranges by MultiRowRangeFilter, it's more efficient.
4489 {code}
4490 public MultiRowRangeFilter(byte[][] rowKeyPrefixes);
4491 {code}
4492
4493
4494 ---
4495
4496 * [HBASE-22856](https://issues.apache.org/jira/browse/HBASE-22856) | *Major* | **HBASE-Find-Flaky-Tests fails with pip error**
4497
4498 Update the base docker image to ubuntu 18.04 for the find flaky tests jenkins job.
4499
4500
4501 ---
4502
4503 * [HBASE-22771](https://issues.apache.org/jira/browse/HBASE-22771) | *Major* | **[HBCK2] fixMeta method and server-side support**
4504
4505 Adds a fixMeta method to hbck Service. Fixes holes in hbase:meta. Follow-up to fix overlaps. See HBASE-22567 also.
4506
4507 Follow-on is adding a client-side to hbase-operator-tools that can exploit this new addition (HBASE-22825)
4508
4509
4510 ---
4511
4512 * [HBASE-22777](https://issues.apache.org/jira/browse/HBASE-22777) | *Major* | **Add a multi-region merge (for fixing overlaps, etc.)**
4513
4514 Changes merge so you can merge more than two regions at a time.  Currently only available inside HBase. HBASE-22827, a follow-on, is about exposing the facility in the Admin API (and then via the shell).
4515
4516
4517 ---
4518
4519 * [HBASE-15666](https://issues.apache.org/jira/browse/HBASE-15666) | *Critical* | **shaded dependencies for hbase-testing-util**
4520
4521 New shaded artifact for testing: hbase-shaded-testing-util.
4522
4523
4524 ---
4525
4526 * [HBASE-22776](https://issues.apache.org/jira/browse/HBASE-22776) | *Major* | **Rename config names in user scan snapshot feature**
4527
4528 After HBASE-22776, the steps to config user scan snapshot feature is as followings:
4529 1. Check HDFS configuration
4530 2. Add master coprocessor:
4531     hbase.coprocessor.master.classes=
4532     “org.apache.hadoop.hbase.security.access.AccessController,
4533 org.apache.hadoop.hbase.security.access.SnapshotScannerHDFSAclController”
4534 3. Enable this feature:
4535     hbase.acl.sync.to.hdfs.enable=true
4536 4. Modify table scheme to enable this feature for a table:
4537     alter 't1', CONFIGURATION =\> {'hbase.acl.sync.to.hdfs.enable' =\> 'true'}
4538
4539
4540 ---
4541
4542 * [HBASE-22539](https://issues.apache.org/jira/browse/HBASE-22539) | *Blocker* | **WAL corruption due to early DBBs re-use when Durability.ASYNC\_WAL is used**
4543
4544 We found a critical bug which can lead to WAL corruption when Durability.ASYNC\_WAL is used. The reason is that we release a ByteBuffer before actually persist the content into WAL file.
4545
4546 The problem maybe lead to several errors, for example, ArrayIndexOfOutBounds when replaying WAL. This is because that the ByteBuffer is reused by others.
4547
4548 ERROR org.apache.hadoop.hbase.executor.EventHandler: Caught throwable while processing event RS\_LOG\_REPLAY
4549 java.lang.ArrayIndexOutOfBoundsException: 18056
4550         at org.apache.hadoop.hbase.KeyValue.getFamilyLength(KeyValue.java:1365)
4551         at org.apache.hadoop.hbase.KeyValue.getFamilyLength(KeyValue.java:1358)
4552         at org.apache.hadoop.hbase.PrivateCellUtil.matchingFamily(PrivateCellUtil.java:735)
4553         at org.apache.hadoop.hbase.CellUtil.matchingFamily(CellUtil.java:816)
4554         at org.apache.hadoop.hbase.wal.WALEdit.isMetaEditFamily(WALEdit.java:143)
4555         at org.apache.hadoop.hbase.wal.WALEdit.isMetaEdit(WALEdit.java:148)
4556         at org.apache.hadoop.hbase.wal.WALSplitter.splitLogFile(WALSplitter.java:297)
4557         at org.apache.hadoop.hbase.wal.WALSplitter.splitLogFile(WALSplitter.java:195)
4558         at org.apache.hadoop.hbase.regionserver.SplitLogWorker$1.exec(SplitLogWorker.java:100)
4559
4560 And may even cause segmentation fault and crash the JVM directly. You will see a hs\_err\_pidXXX.log file and usually the problem is SIGSEGV. This is usually because that the ByteBuffer has already been returned to the OS and used for other purpose.
4561
4562 The problem has been reported several times in the past and this time Wellington Ramos Chevreuil provided the full logs and deeply analyzed the logs so we can find the root cause. And Lijin Bin figured out that the problem may only happen when Durability.ASYNC\_WAL is used. Thanks to them.
4563
4564 The problem only effects the 2.x releases, all users are highly recommand to upgrade to a release which has this fix in, especially that if you use Durability.ASYNC\_WAL.
4565
4566
4567 ---
4568
4569 * [HBASE-22737](https://issues.apache.org/jira/browse/HBASE-22737) | *Major* | **Add a new admin method and shell cmd to trigger the hbck chore to run**
4570
4571 Add a new method runHbckChore in Hbck interface and a new shell cmd hbck\_chore\_run to request HBCK chore to run at master side.
4572
4573
4574 ---
4575
4576 * [HBASE-22741](https://issues.apache.org/jira/browse/HBASE-22741) | *Major* | **Show catalogjanitor consistency complaints in new 'HBCK Report' page**
4577
4578 Adds a "CatalogJanitor hbase:meta Consistency Issues" section to the new 'HBCK Report' page added by HBASE-22709. This section is empty unless the most recent CatalogJanitor scan turned up problems. If so, will show table of issues found.
4579
4580
4581 ---
4582
4583 * [HBASE-22723](https://issues.apache.org/jira/browse/HBASE-22723) | *Major* | **Have CatalogJanitor report holes and overlaps; i.e. problems it sees when doing its regular scan of hbase:meta**
4584
4585 When CatalogJanitor runs, it now checks for holes, overlaps, empty info:regioninfo columns and bad servers. Dumps findings into log. Follow-up adds report to new 'HBCK Report' linked off the Master UI.
4586
4587 NOTE: All features but the badserver check made it into branch-2.1 and branch-2.0 backports.
4588
4589
4590 ---
4591
4592 * [HBASE-22714](https://issues.apache.org/jira/browse/HBASE-22714) | *Trivial* | **BuffferedMutatorParams opertationTimeOut() is misspelt**
4593
4594 The misspelled BufferedMutatorParams.opertationTimeout method has been marked as deprecated, and will be removed in 4.0.0. Please use the BufferedMutatorParams.operationTimeout method instead.
4595
4596
4597 ---
4598
4599 * [HBASE-22580](https://issues.apache.org/jira/browse/HBASE-22580) | *Major* | **Add a table attribute to make user scan snapshot feature configurable for table**
4600
4601 If a table user scan snapshots of the table, please config the following table scheme attribute to make granted users' ACLs are added to hfiles:
4602 alter 't1', CONFIGURATION =\> {'hbase.user.scan.snapshot.enable' =\> 'true'}
4603
4604
4605 ---
4606
4607 * [HBASE-22709](https://issues.apache.org/jira/browse/HBASE-22709) | *Major* | **Add a chore thread in master to do hbck checking and display results in 'HBCK Report' page**
4608
4609 1. Add a new chore thread in master to do hbck checking
4610 2. Add a new web ui "HBCK Report" page to display checking results.
4611
4612 This feature is enabled by default. And the hbck chore run per 60 minutes by default. You can config "hbase.master.hbck.checker.interval" to a value lesser than or equal to 0 for disabling the chore.
4613
4614 Notice: the config "hbase.master.hbck.checker.interval" was renamed to "hbase.master.hbck.chore.interval" in HBASE-22737.
4615
4616
4617 ---
4618
4619 * [HBASE-22578](https://issues.apache.org/jira/browse/HBASE-22578) | *Major* | **HFileCleaner should not delete empty ns/table directories used for user san snapshot feature**
4620
4621 The HFileCleaner will clean the empty directories under archive, but if enable user scan snaphot feature, the user ACLs are set at there directories, so please config the following cleaner to make the directories with user ACLs not be cleaned:
4622 hbase.master.hfilecleaner.plugins=org.apache.hadoop.hbase.security.access.SnapshotScannerHDFSAclCleaner
4623
4624
4625 ---
4626
4627 * [HBASE-22722](https://issues.apache.org/jira/browse/HBASE-22722) | *Blocker* | **Upgrade jackson databind dependencies to 2.9.9.1**
4628
4629 Upgrade jackson databind dependency to 2.9.9.1 due to CVEs
4630
4631 https://nvd.nist.gov/vuln/detail/CVE-2019-12814
4632
4633 https://nvd.nist.gov/vuln/detail/CVE-2019-12384
4634
4635
4636 ---
4637
4638 * [HBASE-22527](https://issues.apache.org/jira/browse/HBASE-22527) | *Major* | **[hbck2] Add a master web ui to show the problematic regions**
4639
4640 Add a new master web UI to show the potentially problematic opened regions. There are three case:
4641 1. Master thought this region opened, but no regionserver reported it.
4642 2. Master thought this region opened on Server1, but regionserver reported Server2
4643 3. More than one regionservers reported opened this region
4644
4645
4646 ---
4647
4648 * [HBASE-22648](https://issues.apache.org/jira/browse/HBASE-22648) | *Minor* | **Snapshot TTL**
4649
4650 Feature: Take a Snapshot With TTL for auto-cleanup
4651
4652 Attribute:
4653 1. TTL
4654      - Specify TTL in sec while creating snapshot. e.g. snapshot 'mytable', 'snapshot1234', {TTL =\> 86400}  (snapshot to be auto-cleaned after 24 hr)
4655
4656 Configs:
4657 1. Default Snapshot TTL:
4658      - FOREVER by default
4659      - User specified Default TTL(sec) with config: hbase.master.snapshot.ttl
4660
4661 2. If Snapshot cleanup is supposed to be stopped due to some snapshot restore activity, disable it with config:
4662      - hbase.master.cleaner.snapshot.disable: "true"
4663     With this config, HMaster needs restart just like any other hbase-site config.
4664
4665
4666 For more details, see the section "Take a Snapshot With TTL" in the HBase Reference Guide.
4667
4668
4669 ---
4670
4671 * [HBASE-22610](https://issues.apache.org/jira/browse/HBASE-22610) | *Trivial* | **[BucketCache] Rename "hbase.offheapcache.minblocksize"**
4672
4673 The config point "hbase.offheapcache.minblocksize" was wrong and is now deprecated. The new config point is "hbase.blockcache.minblocksize".
4674
4675
4676 ---
4677
4678 * [HBASE-22690](https://issues.apache.org/jira/browse/HBASE-22690) | *Major* | **Deprecate / Remove OfflineMetaRepair in hbase-2+**
4679
4680 OfflineMetaRepair is no longer supported in HBase-2+. Please refer to https://hbase.apache.org/book.html#HBCK2
4681
4682 This tool is deprecated in 2.x and will be removed in 3.0.
4683
4684
4685 ---
4686
4687 * [HBASE-22673](https://issues.apache.org/jira/browse/HBASE-22673) | *Major* | **Avoid to expose protobuf stuff in Hbck interface**
4688
4689 Mark the Hbck#scheduleServerCrashProcedure(List\<HBaseProtos.ServerName\> serverNames) as deprecated. Use Hbck#scheduleServerCrashProcedures(List\<ServerName\> serverNames) instead.
4690
4691
4692 ---
4693
4694 * [HBASE-22617](https://issues.apache.org/jira/browse/HBASE-22617) | *Blocker* | **Recovered WAL directories not getting cleaned up**
4695
4696 In HBASE-20734 we moved the recovered.edits onto the wal file system but when constructing the directory we missed the BASE\_NAMESPACE\_DIR('data'). So when using the default config, you will find that there are lots of new directories at the same level with the 'data' directory.
4697
4698 In this issue, we add the BASE\_NAMESPACE\_DIR back, and also try our best to clean up the wrong directories. But we can only clean up the region level directories, so if you want a clean fs layout on HDFS you still need to manually delete the empty directories at the same level with 'data'.
4699
4700 The effect versions are 2.2.0, 2.1.[1-5], 1.4.[8-10], 1.3.[3-5].
4701
4702
4703 ---
4704
4705 * [HBASE-21995](https://issues.apache.org/jira/browse/HBASE-21995) | *Major* | **Add a coprocessor to set HDFS ACL for hbase granted user**
4706
4707 Add a coprocessor to set HDFS acls to make hbase granted users with READ permission have the access to scan snapshots.
4708 To use this feature, please make sure the HDFS config is set:
4709 dfs.namenode.acls.enabled=true
4710 fs.permissions.umask-mode=027
4711
4712 and set the HBase config:
4713 hbase.coprocessor.master.classes="org.apache.hadoop.hbase.security.access.AccessController,org.apache.hadoop.hbase.security.access.SnapshotScannerHDFSAclController"
4714 hbase.user.scan.snapshot.enable=true
4715
4716
4717 ---
4718
4719 * [HBASE-22596](https://issues.apache.org/jira/browse/HBASE-22596) | *Minor* | **[Chore] Separate the execution period between CompactionChecker and PeriodicMemStoreFlusher**
4720
4721 hbase.regionserver.compaction.check.period is used for controlling how often the compaction checker runs. If unset, will use hbase.server.thread.wakefrequency as default value.
4722
4723 hbase.regionserver.flush.check.period is used for controlling how ofter the flush checker runs. If unset, will use hbase.server.thread.wakefrequency as default value.
4724
4725
4726 ---
4727
4728 * [HBASE-22588](https://issues.apache.org/jira/browse/HBASE-22588) | *Major* | **Upgrade jaxws-ri dependency to 2.3.2**
4729
4730 <!-- markdown -->
4731
4732 When run with JDK11 HBase now uses more recent version of the jaxws reference implementation (v2.3.2).
4733
4734
4735 ---
4736
4737 * [HBASE-21536](https://issues.apache.org/jira/browse/HBASE-21536) | *Trivial* | **Fix completebulkload usage instructions**
4738
4739 Added completebulkload short name for BulkLoadHFilesTool to bin/hbase.
4740
4741
4742 ---
4743
4744 * [HBASE-22500](https://issues.apache.org/jira/browse/HBASE-22500) | *Blocker* | **Modify pom and jenkins jobs for hadoop versions**
4745
4746 Change the default hadoop-3 version to 3.1.2. Drop the support for the releases which are effected by CVE-2018-8029, see this email https://lists.apache.org/thread.html/3d6831c3893cd27b6850aea2feff7d536888286d588e703c6ffd2e82@%3Cuser.hadoop.apache.org%3E
4747
4748
4749 ---
4750
4751 * [HBASE-22459](https://issues.apache.org/jira/browse/HBASE-22459) | *Minor* | **Expose store reader reference count**
4752
4753 This change exposes the aggregate count of store reader references for a given store as 'storeRefCount' in region metrics and ClusterStatus.
4754
4755
4756 ---
4757
4758 * [HBASE-22469](https://issues.apache.org/jira/browse/HBASE-22469) | *Minor* | **replace md5 checksum in saveVersion script with sha512 for hbase version information**
4759
4760 The HBase "source checksum" now uses SHA512 instead of MD5.
4761
4762
4763 ---
4764
4765 * [HBASE-22148](https://issues.apache.org/jira/browse/HBASE-22148) | *Blocker* | **Provide an alternative to CellUtil.setTimestamp**
4766
4767 <!-- markdown -->
4768
4769 The `CellUtil.setTimestamp` method changes to be an API with audience `LimitedPrivate(COPROC)` in HBase 3.0. With that designation the API should remain stable within a given minor release line, but may change between minor releases.
4770
4771 Previously, this method was deprecated in HBase 2.0 for removal in HBase 3.0. Deprecation messages in HBase 2.y releases have been updated to indicate the expected API audience change.
4772
4773
4774 ---
4775
4776 * [HBASE-20782](https://issues.apache.org/jira/browse/HBASE-20782) | *Minor* | **Fix duplication of TestServletFilter.access**
4777
4778 The access method was used to the HttpServerFunctionalTest class as a common place.
4779
4780
4781 ---
4782
4783 * [HBASE-21991](https://issues.apache.org/jira/browse/HBASE-21991) | *Major* | **Fix MetaMetrics issues - [Race condition, Faulty remove logic], few improvements**
4784
4785 The class LossyCounting was unintentionally marked Public but was never intended to be part of our public API. This oversight has been corrected and LossyCounting is now marked as Private and going forward may be subject to additional breaking changes or removal without notice. If you have taken a dependency on this class we recommend cloning it locally into your project before upgrading to this release.
4786
4787
4788 ---
4789
4790 * [HBASE-22226](https://issues.apache.org/jira/browse/HBASE-22226) | *Trivial* | **Incorrect level for headings in asciidoc**
4791
4792 Warnings for level headings are corrected in the book for the HBase Incompatibilities section.
4793
4794
4795 ---
4796
4797 * [HBASE-20970](https://issues.apache.org/jira/browse/HBASE-20970) | *Major* | **Update hadoop check versions for hadoop3 in hbase-personality**
4798
4799 Add hadoop 3.0.3, 3.1.1 3.1.2 in our hadoop check jobs.
4800
4801
4802 ---
4803
4804 * [HBASE-21784](https://issues.apache.org/jira/browse/HBASE-21784) | *Major* | **Dump replication queue should show list of wal files ordered chronologically**
4805
4806 The DumpReplicationQueues tool will now list replication queues sorted in chronological order.
4807
4808
4809 ---
4810
4811 * [HBASE-21048](https://issues.apache.org/jira/browse/HBASE-21048) | *Major* | **Get LogLevel is not working from console in secure environment**
4812
4813 Support get\|set LogLevel in secure(kerberized) environment.
4814
4815
4816 ---
4817
4818 * [HBASE-22384](https://issues.apache.org/jira/browse/HBASE-22384) | *Minor* | **Formatting issues in administration section of book**
4819
4820 Fixes a formatting issue in the administration section of the book, where listing indentation were a little bit off.
4821
4822
4823 ---
4824
4825 * [HBASE-22377](https://issues.apache.org/jira/browse/HBASE-22377) | *Major* | **Provide API to check the existence of a namespace which does not require ADMIN permissions**
4826
4827 This change adds the new method listNamespaces to the Admin interface, which can be used to retrieve a list of the namespaces present in the schema as an unprivileged operation. Formerly the only available method for accomplishing this was listNamespaceDescriptors, which requires GLOBAL CREATE or ADMIN permissions.
4828
4829
4830 ---
4831
4832 * [HBASE-22399](https://issues.apache.org/jira/browse/HBASE-22399) | *Major* | **Change default hadoop-two.version to 2.8.x and remove the 2.7.x hadoop checks**
4833
4834 Now the default hadoop-two.version has been changed to 2.8.5, and all hadoop versions before 2.8.2(exclude) will not be supported any more.
4835
4836
4837 ---
4838
4839 * [HBASE-22392](https://issues.apache.org/jira/browse/HBASE-22392) | *Trivial* | **Remove extra/useless +**
4840
4841 Removed extra + in HRegion, HStore and LoadIncrementalHFiles for branch-2 and HRegion and HStore for branch-1.
4842
4843
4844 ---
4845
4846 * [HBASE-20494](https://issues.apache.org/jira/browse/HBASE-20494) | *Major* | **Upgrade com.yammer.metrics dependency**
4847
4848 Updated metrics core from 3.2.1 to 3.2.6.
4849
4850
4851 ---
4852
4853 * [HBASE-22358](https://issues.apache.org/jira/browse/HBASE-22358) | *Minor* | **Change rubocop configuration for method length**
4854
4855 The rubocop definition for the maximum method length was set to 75.
4856
4857
4858 ---
4859
4860 * [HBASE-22379](https://issues.apache.org/jira/browse/HBASE-22379) | *Minor* | **Fix Markdown for "Voting on Release Candidates" in book**
4861
4862 Fixes the formatting of the "Voting on Release Candidates" to actually show the quote and code formatting of the RAT check.
4863
4864
4865 ---
4866
4867 * [HBASE-20851](https://issues.apache.org/jira/browse/HBASE-20851) | *Minor* | **Change rubocop config for max line length of 100**
4868
4869 The rubocop configuration in the hbase-shell module now allows a line length with 100 characters, instead of 80 as before. For everything before 2.1.5 this change introduces rubocop itself.
4870
4871
4872 ---
4873
4874 * [HBASE-22301](https://issues.apache.org/jira/browse/HBASE-22301) | *Minor* | **Consider rolling the WAL if the HDFS write pipeline is slow**
4875
4876 This change adds new conditions for rolling the WAL for when syncs on the HDFS writer pipeline are perceived to be slow.
4877
4878 As before the configuration parameter hbase.regionserver.wal.slowsync.ms sets the slow sync warning threshold.
4879
4880 If we encounter hbase.regionserver.wal.slowsync.roll.threshold number of slow syncs (default 100) within the interval defined by hbase.regionserver.wal.slowsync.roll.interval.ms (default 1 minute), we will request a WAL roll.
4881
4882 Or, if the time for any sync exceeds the threshold set by hbase.regionserver.wal.roll.on.sync.ms (default 10 seconds) we will request a WAL roll immediately.
4883
4884 Operators can monitor how often these new thresholds result in a WAL roll by looking at newly added metrics to the WAL related metric group:
4885 \* slowSyncRollRequest - How many times a roll was requested due to sync too slow on the write pipeline.
4886
4887 Additionally, as a part of this change there are also additional metrics for existing reasons for a WAL roll:
4888 \* errorRollRequest - How many times a roll was requested due to I/O or other errors.
4889 \* sizeRollRequest - How many times a roll was requested due to file size roll threshold.
4890
4891
4892 ---
4893
4894 * [HBASE-21883](https://issues.apache.org/jira/browse/HBASE-21883) | *Minor* | **Enhancements to Major Compaction tool**
4895
4896 MajorCompactorTTL Tool allows to compact all regions in a table that have been TTLed out. This saves space on DFS and is useful for tables which are similar to time series data. This is typically scheduled to run frequently (say via cron) to cleanup old data on an ongoing basis.
4897
4898 RSGroupMajorCompactionTTL tool is similar to MajorCompactorTTL but runs at a region server group level. If multiple tables in an rsgroup are similar to time-series data, then it runs a single command to clean them up. As more tables are added/removed from rsgroup, it's easy to have a single command to take care of all of them.
4899
4900
4901 ---
4902
4903 * [HBASE-22054](https://issues.apache.org/jira/browse/HBASE-22054) | *Minor* | **Space Quota: Compaction is not working for super user in case of NO\_WRITES\_COMPACTIONS**
4904
4905 This change allows the system and superusers to initiate compactions, even when a space quota violation policy disallows compactions from happening. The original intent behind disallowing of compactions was to prevent end-user compactions from creating undue I/O load, not disallowing \*any\* compaction in the system.
4906
4907
4908 ---
4909
4910 * [HBASE-22083](https://issues.apache.org/jira/browse/HBASE-22083) | *Minor* | **move eclipse specific configs into a profile**
4911
4912 <!-- markdown -->
4913 Maven project integration for Eclipse has been isolated into a maven profile to ensure it only is active when in an Eclipse project.
4914
4915 Things should continue to behave the same for Eclipse users. If something should go wrong folks should manually activate the `eclipse-specific` profile.
4916
4917
4918 ---
4919
4920 * [HBASE-22307](https://issues.apache.org/jira/browse/HBASE-22307) | *Major* | **Deprecated Preemptive Fail Fast**
4921
4922 Deprecated Preemptive Fail Fast related constants in HConstants, the support of this feature will be removed in 3.0.0 so use these constants will have no effect for 3.0.0+ releases. And the constants will be kept till 4.0.0.
4923
4924 Users can use 'hbase.client.perserver.requests.threshold' to control the number of concurrent requests to the same region server. Please see the release note of HBASE-16388 for more details.
4925
4926
4927 ---
4928
4929 * [HBASE-22292](https://issues.apache.org/jira/browse/HBASE-22292) | *Blocker* | **PreemptiveFastFailInterceptor clean repeatedFailuresMap issue**
4930
4931 Adds new configuration hbase.client.failure.map.cleanup.interval which defaults to ten minutes.
4932
4933
4934 ---
4935
4936 * [HBASE-19222](https://issues.apache.org/jira/browse/HBASE-19222) | *Major* | **update jruby to 9.1.17.0**
4937
4938 <!-- markdown -->
4939
4940 The default version of JRuby shipped with HBase has been updated to the JRuby 9.1.17.0 release.
4941
4942 For details on changes see [the release notes for JRuby 9.1.17.0](https://www.jruby.org/2018/04/23/jruby-9-1-17-0)
4943
4944
4945 ---
4946
4947 * [HBASE-22279](https://issues.apache.org/jira/browse/HBASE-22279) | *Major* | **Add a getRegionLocator method in Table/AsyncTable interface**
4948
4949 Add below method in Table interface:
4950
4951 RegionLocator getRegionLocator() throws IOException;
4952
4953 Add below methods in AsyncTable interface:
4954
4955 AsyncTableRegionLocator getRegionLocator();
4956 CompletableFuture\<TableDescriptor\> getDescriptor();
4957
4958
4959 ---
4960
4961 * [HBASE-15560](https://issues.apache.org/jira/browse/HBASE-15560) | *Major* | **TinyLFU-based BlockCache**
4962
4963 LruBlockCache uses the Segmented LRU (SLRU) policy to capture frequency and recency of the working set. It achieves concurrency by using an O(n) background thread to prioritize the entries and evict. Accessing an entry is O(1) by a hash table lookup, recording its logical access time, and setting a frequency flag. A write is performed in O(1) time by updating the hash table and triggering an async eviction thread. This provides ideal concurrency and minimizes the latencies by penalizing the thread instead of the caller. However the policy does not age the frequencies and may not be resilient to various workload patterns.
4964
4965 This change introduces a new L1 policy, TinyLfuBlockCache, which records the frequency in a counting sketch, ages periodically by halving the counters, and orders entries by SLRU. An entry is discarded by comparing the frequency of the new arrival to the SLRU's victim, and keeping the one with the highest frequency. This allows the operations to be performed in O(1) time and, though the use of a compact sketch, a much larger history is retained beyond the current working set. In a variety of real world traces the policy had near optimal hit rates.
4966
4967 New configuration variable hfile.block.cache.policy sets the eviction policy for the L1 block cache. The default is "LRU" (LruBlockCache). Set to "TinyLFU" to use TinyLfuBlockCache instead.
4968
4969
4970 ---
4971
4972 * [HBASE-22178](https://issues.apache.org/jira/browse/HBASE-22178) | *Major* | **Introduce a createTableAsync with TableDescriptor method in Admin**
4973
4974 Introduced
4975
4976 Future\<Void\> createTableAsync(TableDescriptor);
4977
4978
4979 ---
4980
4981 * [HBASE-22108](https://issues.apache.org/jira/browse/HBASE-22108) | *Major* | **Avoid passing null in Admin methods**
4982
4983 Introduced these methods:
4984 void move(byte[]);
4985 void move(byte[], ServerName);
4986 Future\<Void\> splitRegionAsync(byte[]);
4987
4988 These methods are deprecated:
4989 void move(byte[], byte[])
4990
4991
4992 ---
4993
4994 * [HBASE-22152](https://issues.apache.org/jira/browse/HBASE-22152) | *Major* | **Create a jenkins file for yetus to processing GitHub PR**
4995
4996 Add a new jenkins file for running pre commit check for GitHub PR.
4997
4998
4999 ---
5000
5001 * [HBASE-22007](https://issues.apache.org/jira/browse/HBASE-22007) | *Major* | **Add restoreSnapshot and cloneSnapshot with acl methods in AsyncAdmin**
5002
5003 Add cloneSnapshot/restoreSnapshot with acl methods in AsyncAdmin.
5004
5005
5006 ---
5007
5008 * [HBASE-22123](https://issues.apache.org/jira/browse/HBASE-22123) | *Minor* | **REST gateway reports Insufficient permissions exceptions as 404 Not Found**
5009
5010 When insufficient permissions, you now get:
5011
5012 HTTP/1.1 403 Forbidden
5013
5014 on the HTTP side, and in the message
5015
5016 Forbidden
5017 org.apache.hadoop.hbase.security.AccessDeniedException: org.apache.hadoop.hbase.security.AccessDeniedException: Insufficient permissions for user ‘myuser',action: get, tableName:mytable, family:cf.
5018 at org.apache.ranger.authorization.hbase.RangerAuthorizationCoprocessor.authorizeAccess(RangerAuthorizationCoprocessor.java:547)
5019 and the rest of the ADE stack
5020
5021
5022 ---
5023
5024 * [HBASE-22100](https://issues.apache.org/jira/browse/HBASE-22100) | *Minor* | **False positive for error prone warnings in pre commit job**
5025
5026 Now we will sort the javac WARNING/ERROR before generating diff in pre-commit so we can get a stable output for the error prone. The downside is that we just sort the output lexicographically so the line number will also be sorted lexicographically, which is a bit strange to human.
5027
5028
5029 ---
5030
5031 * [HBASE-22057](https://issues.apache.org/jira/browse/HBASE-22057) | *Major* | **Impose upper-bound on size of ZK ops sent in a single multi()**
5032
5033 Exposes a new configuration property "zookeeper.multi.max.size" which dictates the maximum size of deletes that HBase will make to ZooKeeper in a single RPC. This property defaults to 1MB, which should fall beneath the default ZooKeeper limit of 2MB, controlled by "jute.maxbuffer".
5034
5035
5036 ---
5037
5038 * [HBASE-22052](https://issues.apache.org/jira/browse/HBASE-22052) | *Major* | **pom cleaning; filter out jersey-core in hadoop2 to match hadoop3 and remove redunant version specifications**
5039
5040 <!-- markdown -->
5041 Fixed awkward dependency issue that prevented site building.
5042
5043 #### note specific to HBase 2.1.4
5044 HBase 2.1.4 shipped with an early version of this fix that incorrectly altered the libraries included in our binary assembly for using Apache Hadoop 2.7 (the current build default Hadoop version for 2.1.z). For folks running out of the box against a Hadoop 2.7 cluster (or folks who skip the installation step of [replacing the bundled Hadoop libraries](http://hbase.apache.org/book.html#hadoop)) this will result in a failure at Region Server startup due to a missing class definition. e.g.:
5045 ```
5046 2019-03-27 09:02:05,779 ERROR [main] regionserver.HRegionServer: Failed construction RegionServer
5047 java.lang.NoClassDefFoundError: org/apache/htrace/SamplerBuilder
5048         at org.apache.hadoop.hdfs.DFSClient.<init>(DFSClient.java:644)
5049         at org.apache.hadoop.hdfs.DFSClient.<init>(DFSClient.java:628)
5050         at org.apache.hadoop.hdfs.DistributedFileSystem.initialize(DistributedFileSystem.java:149)
5051         at org.apache.hadoop.fs.FileSystem.createFileSystem(FileSystem.java:2667)
5052         at org.apache.hadoop.fs.FileSystem.access$200(FileSystem.java:93)
5053         at org.apache.hadoop.fs.FileSystem$Cache.getInternal(FileSystem.java:2701)
5054         at org.apache.hadoop.fs.FileSystem$Cache.get(FileSystem.java:2683)
5055         at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:372)
5056         at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:171)
5057         at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:356)
5058         at org.apache.hadoop.fs.Path.getFileSystem(Path.java:295)
5059         at org.apache.hadoop.hbase.util.CommonFSUtils.getRootDir(CommonFSUtils.java:362)
5060         at org.apache.hadoop.hbase.util.CommonFSUtils.isValidWALRootDir(CommonFSUtils.java:411)
5061         at org.apache.hadoop.hbase.util.CommonFSUtils.getWALRootDir(CommonFSUtils.java:387)
5062         at org.apache.hadoop.hbase.regionserver.HRegionServer.initializeFileSystem(HRegionServer.java:704)
5063         at org.apache.hadoop.hbase.regionserver.HRegionServer.<init>(HRegionServer.java:613)
5064         at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
5065         at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:62)
5066         at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)
5067         at java.lang.reflect.Constructor.newInstance(Constructor.java:423)
5068         at org.apache.hadoop.hbase.regionserver.HRegionServer.constructRegionServer(HRegionServer.java:3029)
5069         at org.apache.hadoop.hbase.regionserver.HRegionServerCommandLine.start(HRegionServerCommandLine.java:63)
5070         at org.apache.hadoop.hbase.regionserver.HRegionServerCommandLine.run(HRegionServerCommandLine.java:87)
5071         at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
5072         at org.apache.hadoop.hbase.util.ServerCommandLine.doMain(ServerCommandLine.java:149)
5073         at org.apache.hadoop.hbase.regionserver.HRegionServer.main(HRegionServer.java:3047)
5074 Caused by: java.lang.ClassNotFoundException: org.apache.htrace.SamplerBuilder
5075         at java.net.URLClassLoader.findClass(URLClassLoader.java:381)
5076         at java.lang.ClassLoader.loadClass(ClassLoader.java:424)
5077         at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:349)
5078         at java.lang.ClassLoader.loadClass(ClassLoader.java:357)
5079         ... 26 more
5080
5081 ```
5082
5083 Workaround via any _one_ of the following:
5084 * If you are running against a Hadoop cluster that is 2.8+, ensure you replace the Hadoop libaries in the default binary assembly with those for your version.
5085 * If you are running against a Hadoop cluster that is 2.8+, build the binary assembly from the source release while specifying your Hadoop version.
5086 * If you are running against a Hadoop cluster that is a supported 2.7 release, ensure the `hadoop` executable is in the `PATH` seen at Region Server startup and that you are not using the `HBASE_DISABLE_HADOOP_CLASSPATH_LOOKUP` bypass.
5087 * For any supported Hadoop version, manually make the Apache HTrace artifact `htrace-core-3.1.0-incubating.jar` available to all Region Servers via the HBASE_CLASSPATH environment variable.
5088 * For any supported Hadoop version, manually make the Apache HTrace artifact `htrace-core-3.1.0-incubating.jar` available to all Region Servers by copying it into the directory `${HBASE_HOME}/lib/client-facing-thirdparty/`.
5089
5090
5091 ---
5092
5093 * [HBASE-22065](https://issues.apache.org/jira/browse/HBASE-22065) | *Major* | **Add listTableDescriptors(List\<TableName\>) method in AsyncAdmin**
5094
5095 Add a listTableDescriptors(List\<TableName\>) method in the AsyncAdmin interface, to align with the Admin interface.
5096
5097
5098 ---
5099
5100 * [HBASE-22063](https://issues.apache.org/jira/browse/HBASE-22063) | *Major* | **Deprecated Admin.deleteSnapshot(byte[])**
5101
5102 Deprecate Admin.deleteSnapshot(byte[]), please use the String version instead.
5103
5104
5105 ---
5106
5107 * [HBASE-22040](https://issues.apache.org/jira/browse/HBASE-22040) | *Major* | **Add mergeRegionsAsync with a List of region names method in AsyncAdmin**
5108
5109 Add a mergeRegionsAsync(byte[][], boolean) method in the AsyncAdmin interface.
5110
5111 Instead of using assert, now we will throw IllegalArgumentException when you want to merge less than 2 regions at client side. And also, at master side, instead of using assert, now we will throw DoNotRetryIOException if you want merge more than 2 regions, since we only support merging two regions at once for now.
5112
5113
5114 ---
5115
5116 * [HBASE-22039](https://issues.apache.org/jira/browse/HBASE-22039) | *Major* | **Should add the synchronous parameter for the XXXSwitch method in AsyncAdmin**
5117
5118 Add drainXXX parameter for balancerSwitch/splitSwitch/mergeSwitch methods in the AsyncAdmin interface, which has the same meaning with the synchronous parameter for these methods in the Admin interface.
5119
5120
5121 ---
5122
5123 * [HBASE-22044](https://issues.apache.org/jira/browse/HBASE-22044) | *Major* | **ByteBufferUtils should not be IA.Public API**
5124
5125 <!-- markdown -->
5126
5127 As of HBase 3.0, the ByteBufferUtils class is now marked as a Private API for internal project use only. Downstream users are advised that it no longer has any compatibility promises across releases.
5128
5129 As of earlier HBase release lines the class is now marked as deprecated to call attention to this planned transition.
5130
5131
5132 ---
5133
5134 * [HBASE-21810](https://issues.apache.org/jira/browse/HBASE-21810) | *Major* | **bulkload  support set hfile compression on client**
5135
5136 bulkload (HFileOutputFormat2)  support config the compression on client ,you can set the job configuration "hbase.mapreduce.hfileoutputformat.compression"  override the auto-detection of the target table's compression
5137
5138
5139 ---
5140
5141 * [HBASE-22001](https://issues.apache.org/jira/browse/HBASE-22001) | *Major* | **Polish the Admin interface**
5142
5143 Add a cloneSnapshotAsync method with restoreAcl parameter.
5144 Deprecated restoreSnapshotAsync method as it just ignores the failsafe configuration.
5145 Make snapshotAsync method returns a Future\<Void\>.
5146 Deprecated the snapshot related methods which take a 'byte[]' as the snapshot name.
5147 Use default methods to reduce the code base for implementation classes.
5148
5149
5150 ---
5151
5152 * [HBASE-22000](https://issues.apache.org/jira/browse/HBASE-22000) | *Major* | **Deprecated isTableAvailable with splitKeys**
5153
5154 Deprecated AsyncTable.isTableAvailable(TableName, byte[][]).
5155
5156
5157 ---
5158
5159 * [HBASE-21871](https://issues.apache.org/jira/browse/HBASE-21871) | *Major* | **Support to specify a peer table name in VerifyReplication tool**
5160
5161 After HBASE-21871, we can specify a peer table name with --peerTableName in VerifyReplication tool like the following:
5162 hbase org.apache.hadoop.hbase.mapreduce.replication.VerifyReplication --peerTableName=peerTable 5 TestTable
5163
5164 In addition, we can compare any 2 tables in any remote clusters with specifying both peerId and --peerTableName.
5165
5166 For example:
5167 hbase org.apache.hadoop.hbase.mapreduce.replication.VerifyReplication --peerTableName=peerTable zk1,zk2,zk3:2181/hbase TestTable
5168
5169
5170 ---
5171
5172 * [HBASE-15728](https://issues.apache.org/jira/browse/HBASE-15728) | *Major* | **Add remaining per-table region / store / flush / compaction related metrics**
5173
5174 Adds below flush, split, and compaction metrics
5175
5176  +  // split related metrics
5177  +  private MutableFastCounter splitRequest;
5178  +  private MutableFastCounter splitSuccess;
5179  +  private MetricHistogram splitTimeHisto;
5180  +
5181  +  // flush related metrics
5182  +  private MetricHistogram flushTimeHisto;
5183  +  private MetricHistogram flushMemstoreSizeHisto;
5184  +  private MetricHistogram flushOutputSizeHisto;
5185  +  private MutableFastCounter flushedMemstoreBytes;
5186  +  private MutableFastCounter flushedOutputBytes;
5187  +
5188  +  // compaction related metrics
5189  +  private MetricHistogram compactionTimeHisto;
5190  +  private MetricHistogram compactionInputFileCountHisto;
5191  +  private MetricHistogram compactionInputSizeHisto;
5192  +  private MetricHistogram compactionOutputFileCountHisto;
5193  +  private MetricHistogram compactionOutputSizeHisto;
5194  +  private MutableFastCounter compactedInputBytes;
5195  +  private MutableFastCounter compactedOutputBytes;
5196  +
5197  +  private MetricHistogram majorCompactionTimeHisto;
5198  +  private MetricHistogram majorCompactionInputFileCountHisto;
5199  +  private MetricHistogram majorCompactionInputSizeHisto;
5200  +  private MetricHistogram majorCompactionOutputFileCountHisto;
5201  +  private MetricHistogram majorCompactionOutputSizeHisto;
5202  +  private MutableFastCounter majorCompactedInputBytes;
5203  +  private MutableFastCounter majorCompactedOutputBytes;
5204
5205
5206 ---
5207
5208 * [HBASE-21481](https://issues.apache.org/jira/browse/HBASE-21481) | *Major* | **[acl] Superuser's permissions should not be granted or revoked by any non-su global admin**
5209
5210 HBASE-21481 improves the quality of access control, by strengthening the protection of super users's privileges.
5211
5212
5213 ---
5214
5215 * [HBASE-21082](https://issues.apache.org/jira/browse/HBASE-21082) | *Critical* | **Reimplement assign/unassign related procedure metrics**
5216
5217 Now we have four types of RIT procedure metrics, assign, unassign, move, reopen. The meaning of assign/unassign is changed, as we will not increase the unassign metric and then the assign metric when moving a region.
5218 Also introduced two new procedure metrics, open and close, which are used to track the open/close region calls to region server. We may send open/close multiple times to finish a RIT since we may retry multiple times.
5219
5220
5221 ---
5222
5223 * [HBASE-20724](https://issues.apache.org/jira/browse/HBASE-20724) | *Critical* | **Sometimes some compacted storefiles are still opened after region failover**
5224
5225 Problem: This is an old problem since HBASE-2231. The compaction event marker was only writed to WAL. But after flush, the WAL may be archived, which means an useful compaction event marker be deleted, too. So the compacted store files cannot be archived when region open and replay WAL.
5226
5227 Solution: After this jira, the compaction event tracker will be writed to HFile. When region open and load store files, read the compaction evnet tracker from HFile and archive the compacted store files which still exist.
5228
5229
5230 ---
5231
5232 * [HBASE-21820](https://issues.apache.org/jira/browse/HBASE-21820) | *Major* | **Implement CLUSTER quota scope**
5233
5234 HBase contains two quota scopes: MACHINE and CLUSTER. Before this patch, set quota operations did not expose scope option to client api and use MACHINE as default, CLUSTER scope can not be set and used.
5235 Shell commands are as follows:
5236 set\_quota, TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec'
5237
5238 This issue implements CLUSTER scope in a simple way: For user, namespace, user over namespace quota, use [ClusterLimit / RSNum] as machine limit. For table and user over table quota, use [ClusterLimit / TotalTableRegionNum \* MachineTableRegionNum] as machine limit.
5239 After this patch, user can set CLUSTER scope quota, but MACHINE is still default if user ignore scope.
5240 Shell commands are as follows:
5241 set\_quota, TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec'
5242 set\_quota, TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec', SCOPE =\> MACHINE
5243 set\_quota, TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec', SCOPE =\> CLUSTER
5244
5245
5246 ---
5247
5248 * [HBASE-21057](https://issues.apache.org/jira/browse/HBASE-21057) | *Minor* | **upgrade to latest spotbugs**
5249
5250 Change spotbugs version to 3.1.11.
5251
5252
5253 ---
5254
5255 * [HBASE-21505](https://issues.apache.org/jira/browse/HBASE-21505) | *Major* | **Several inconsistencies on information reported for Replication Sources by hbase shell status 'replication' command.**
5256
5257 This modifies "status 'replication'" output, fixing inconsistencies on the reporting times and ages of last shipped edits, as well as wrong calculation of replication lags.
5258
5259 It also introduces additional info for each recovery queue, which was not accounted by this command before.
5260
5261 The new output for "status 'replication'" command is explained in details below:
5262 a) Source started, target stopped, no edits arrived on source yet:
5263 ...
5264  SOURCE: PeerID=1
5265          Normal Queue: 1
5266            No Ops shipped since last restart, SizeOfLogQueue=1, No edits for this source since it started, Replication Lag=0
5267 ...
5268 b) Source started, target stopped, add edit on source:
5269 ...
5270 Normal Queue: 1
5271            No Ops shipped since last restart, SizeOfLogQueue=1, TimeStampOfLastArrivedInSource=Wed Nov 21 07:21:00 GMT 2018, Replication Lag=2459
5272 ...
5273 c) Source started, target stopped, edit added on source, restart source:
5274 ...
5275 SOURCE: PeerID=1
5276          Normal Queue: 1
5277            No Ops shipped since last restart, SizeOfLogQueue=1, No edits for this source since it started, Replication Lag=0
5278          Recovered Queue: 1-hbase01.home,16020,1542784524057
5279            No Ops shipped since last restart, SizeOfLogQueue=1, TimeStampOfLastArrivedInSource=Wed Nov 21 07:23:00 GMT 2018, Replication Lag=201495
5280 ...
5281 d) Source started, target stopped, add edit on source, restart source, add another edit on source:
5282 ...
5283 SOURCE: PeerID=1
5284          Normal Queue: 1
5285            No Ops shipped since last restart, SizeOfLogQueue=1, TimeStampOfLastArrivedInSource=Wed Nov 21 07:02:28 GMT 2018, Replication Lag=6349
5286          Recovered Queue: 1-hbase01.home,16020,1542782758742
5287            No Ops shipped since last restart, SizeOfLogQueue=0, TimeStampOfLastArrivedInSource=Wed Nov 21 06:53:05 GMT 2018, Replication Lag=569394
5288 ...
5289 e) Source started, target stopped, add edit on source, restart source, add another edit on source, start target:
5290 ...
5291        SOURCE: PeerID=1
5292          Normal Queue: 1
5293            AgeOfLastShippedOp=30000, TimeStampOfLastShippedOp=Wed Nov 21 07:07:58 GMT 2018, SizeOfLogQueue=1, TimeStampOfLastArrivedInSource=Wed Nov 21 07:02:28 GMT 2018, Replication Lag=0
5294 ...
5295 f) Source started, target stopped, add edit on source, restart source, restart target:
5296 ...
5297 SOURCE: PeerID=1
5298          Normal Queue: 1
5299            No Ops shipped since last restart, SizeOfLogQueue=1, No edits for this source since it started, Replication Lag=0
5300 ...
5301
5302
5303 ---
5304
5305 * [HBASE-21922](https://issues.apache.org/jira/browse/HBASE-21922) | *Major* | **BloomContext#sanityCheck may failed when use ROWPREFIX\_DELIMITED bloom filter**
5306
5307 Remove bloom filter type ROWPREFIX\_DELIMITED. May add it back when find a better solution.
5308
5309
5310 ---
5311
5312 * [HBASE-21783](https://issues.apache.org/jira/browse/HBASE-21783) | *Major* | **Support exceed user/table/ns throttle quota if region server has available quota**
5313
5314 Support enable or disable exceed throttle quota. Exceed throttle quota means, user can over consume user/namespace/table quota if region server has additional available quota because other users don't consume at the same time.
5315 Use the following shell commands to enable/disable exceed throttle quota: enable\_exceed\_throttle\_quota
5316 disable\_exceed\_throttle\_quota
5317 There are two limits when enable exceed throttle quota:
5318 1. Must set at least one read and one write region server throttle quota;
5319 2. All region server throttle quotas must be in seconds time unit. Because once previous requests exceed their quota and consume region server quota, quota in other time units may be refilled in a long time, this may affect later requests.
5320
5321
5322 ---
5323
5324 * [HBASE-20587](https://issues.apache.org/jira/browse/HBASE-20587) | *Major* | **Replace Jackson with shaded thirdparty gson**
5325
5326 Remove jackson dependencies from most hbase modules except hbase-rest, use shaded gson instead. The output json will be a bit different since jackson can use getter/setter, but gson will always use the fields.
5327
5328
5329 ---
5330
5331 * [HBASE-21928](https://issues.apache.org/jira/browse/HBASE-21928) | *Major* | **Deprecated HConstants.META\_QOS**
5332
5333 Mark HConstants.META\_QOS as deprecated. It is for internal use only, which is the highest priority. You should not try to set a priority greater than or equal to this value, although it is no harm but also useless.
5334
5335
5336 ---
5337
5338 * [HBASE-17942](https://issues.apache.org/jira/browse/HBASE-17942) | *Major* | **Disable region splits and merges per table**
5339
5340 This patch adds the ability to disable split and/or merge for a table (By default, split and merge are enabled for a table).
5341
5342
5343 ---
5344
5345 * [HBASE-21636](https://issues.apache.org/jira/browse/HBASE-21636) | *Major* | **Enhance the shell scan command to support missing scanner specifications like ReadType, IsolationLevel etc.**
5346
5347 Allows shell to set Scan options previously not exposed. See additions as part of the scan help by typing following hbase shell:
5348
5349 hbase\> help 'scan'
5350
5351
5352 ---
5353
5354 * [HBASE-21201](https://issues.apache.org/jira/browse/HBASE-21201) | *Major* | **Support to run VerifyReplication MR tool without peerid**
5355
5356 We can specify peerQuorumAddress instead of peerId in VerifyReplication tool. So it no longer requires peerId to be setup when using this tool.
5357
5358 For example:
5359 hbase org.apache.hadoop.hbase.mapreduce.replication.VerifyReplication zk1,zk2,zk3:2181/hbase testTable
5360
5361
5362 ---
5363
5364 * [HBASE-21838](https://issues.apache.org/jira/browse/HBASE-21838) | *Major* | **Create a special ReplicationEndpoint just for verifying the WAL entries are fine**
5365
5366 Introduce a VerifyWALEntriesReplicationEndpoint which replicates nothing but only verifies if all the cells are valid.
5367 It can be used to capture bugs for writing WAL, as most times we will not read the WALs again after writing it if there are no region server crashes.
5368
5369
5370 ---
5371
5372 * [HBASE-21764](https://issues.apache.org/jira/browse/HBASE-21764) | *Major* | **Size of in-memory compaction thread pool should be configurable**
5373
5374 Introduced an new config key in this issue: hbase.regionserver.inmemory.compaction.pool.size. the default value would be 10.  you can configure this to set the pool size of in-memory compaction pool. Note that all memstores in one region server will share the same pool, so if you have many regions in one region server,  you need to set this larger to compact faster for better read performance.
5375
5376
5377 ---
5378
5379 * [HBASE-21684](https://issues.apache.org/jira/browse/HBASE-21684) | *Major* | **Throw DNRIOE when connection or rpc client is closed**
5380
5381 Make StoppedRpcClientException extend DoNotRetryIOException.
5382
5383
5384 ---
5385
5386 * [HBASE-21739](https://issues.apache.org/jira/browse/HBASE-21739) | *Major* | **Move grant/revoke from regionserver to master**
5387
5388 To implement user permission control in Precedure V2, move grant and revoke method from AccessController to master firstly.
5389 Mark AccessController#grant and AccessController#revoke as deprecated and please use Admin#grant and Admin#revoke instead.
5390
5391
5392 ---
5393
5394 * [HBASE-21791](https://issues.apache.org/jira/browse/HBASE-21791) | *Blocker* | **Upgrade thrift dependency to 0.12.0**
5395
5396 IMPORTANT: Due to security issues, all users who use hbase thrift should avoid using releases which do not have this fix.
5397
5398 The effect releases are:
5399 2.1.x: 2.1.2 and below
5400 2.0.x: 2.0.4 and below
5401 1.x: 1.4.x and below
5402
5403 If you are using the effect releases above, please consider upgrading to a newer release ASAP.
5404
5405
5406 ---
5407
5408 * [HBASE-20894](https://issues.apache.org/jira/browse/HBASE-20894) | *Major* | **Move BucketCache from java serialization to protobuf**
5409
5410 For users who have configured hbase.bucketcache.ioengine with either the file:, files:, or mmap: prefix, and configured it to be persistent via the hbase.bucketcache.persistent.path property, the serialization format of the bucket cache has changed between versions. The old state will not be read during startup, and there is currently no migration path. The impact is expected to be minimal, however, since the cache will rebuild over time as access patterns dictate.
5411
5412
5413
5414 # HBASE  2.3.0 Release Notes
5415
5416 These release notes cover new developer and user-facing incompatibilities, important issues, features, and major improvements.
5417
5418
5419 ---
5420
5421 * [HBASE-24631](https://issues.apache.org/jira/browse/HBASE-24631) | *Major* | **Loosen Dockerfile pinned package versions of the "debian-revision"**
5422
5423 <!-- markdown -->
5424 Update our package version numbers throughout the Dockerfiles to be pinned to their epic:upstream-version components only. Previously we'd specify the full debian package version number, including the debian-revision. This lead to instability as debian packaging details changed.
5425 See also [man deb-version](http://manpages.ubuntu.com/manpages/xenial/en/man5/deb-version.5.html)
5426
5427
5428 ---
5429
5430 * [HBASE-24205](https://issues.apache.org/jira/browse/HBASE-24205) | *Major* | **Create metric to know the number of reads that happens from memstore**
5431
5432 Adds a new metric where we collect the number of read requests (tracked per row) whether the row was fetched completely from memstore or it was pulled from files  and memstore.
5433 The metric is now collected under the mbean for Tables and under the mbean for regions.
5434 Under table mbean ie.-
5435 'name": "Hadoop:service=HBase,name=RegionServer,sub=Tables'
5436 The new metrics will be listed as
5437 {code}
5438     "Namespace\_default\_table\_t3\_columnfamily\_f1\_metric\_memstoreOnlyRowReadsCount": 5,
5439  "Namespace\_default\_table\_t3\_columnfamily\_f1\_metric\_mixedRowReadsCount": 1,
5440 {code}
5441 Where the format is Namespace\_\<namespacename\>\_table\_\<tableName\>\_columnfamily\_\<columnfamilyname\>\_metric\_memstoreOnlyRowReadsCount
5442 Namespace\_\<namespacename\>\_table\_\<tableName\>\_columnfamily\_\<columnfamilyname\>\_metric\_mixedRowReadsCount
5443 {code}
5444
5445 The same one under the region ie.
5446 "name": "Hadoop:service=HBase,name=RegionServer,sub=Regions",
5447 comes as
5448 {code}
5449    "Namespace\_default\_table\_t3\_region\_75a7846f4ac4a2805071a855f7d0dbdc\_store\_f1\_metric\_memstoreOnlyRowReadsCount": 5,
5450     "Namespace\_default\_table\_t3\_region\_75a7846f4ac4a2805071a855f7d0dbdc\_store\_f1\_metric\_mixedRowReadsCount": 1,
5451 {code}
5452 where
5453 Namespace\_\<namespacename\_table\_\<tableName\>\_region\_\<regionName\>\_store\_\<storeName\>\_metric\_memstoreOnlyRowReadsCount
5454 Namespace\_\<namespacename\_table\_\<tableName\>\_region\_\<regionName\>\_store\_\<storeName\>\_metric\_mixedRowReadsCount
5455 This is also an aggregate against every store the number of reads that happened purely from the memstore or it was a  mixed read that happened from memstore and file.
5456
5457
5458 ---
5459
5460 * [HBASE-21773](https://issues.apache.org/jira/browse/HBASE-21773) | *Critical* | **rowcounter utility should respond to pleas for help**
5461
5462 This adds [-h\|-help] options to rowcounter. Passing either -h or -help will print rowcounter guide as below:
5463
5464 $hbase rowcounter -h
5465
5466 usage: hbase rowcounter \<tablename\> [options] [\<column1\> \<column2\>...]
5467 Options:
5468     --starttime=\<arg\>       starting time filter to start counting rows from.
5469     --endtime=\<arg\>         end time filter limit, to only count rows up to this timestamp.
5470     --range=\<arg\>           [startKey],[endKey][;[startKey],[endKey]...]]
5471     --expectedCount=\<arg\>   expected number of rows to be count.
5472 For performance, consider the following configuration properties:
5473 -Dhbase.client.scanner.caching=100
5474 -Dmapreduce.map.speculative=false
5475
5476
5477 ---
5478
5479 * [HBASE-24217](https://issues.apache.org/jira/browse/HBASE-24217) | *Major* | **Add hadoop 3.2.x support**
5480
5481 CI coverage has been extended to include Hadoop 3.2.x for HBase 2.2+.
5482
5483
5484 ---
5485
5486 * [HBASE-23055](https://issues.apache.org/jira/browse/HBASE-23055) | *Major* | **Alter hbase:meta**
5487
5488 Adds being able to edit hbase:meta table schema. For example,
5489
5490 hbase(main):006:0\> alter 'hbase:meta', {NAME =\> 'info', DATA\_BLOCK\_ENCODING =\> 'ROW\_INDEX\_V1'}
5491 Updating all regions with the new schema...
5492 All regions updated.
5493 Done.
5494 Took 1.2138 seconds
5495
5496 You can even add columnfamilies. Howevert, you cannot delete any of the core hbase:meta column families such as 'info' and 'table'.
5497
5498
5499 ---
5500
5501 * [HBASE-15161](https://issues.apache.org/jira/browse/HBASE-15161) | *Major* | **Umbrella: Miscellaneous improvements from production usage**
5502
5503 This ticket summarizes significant improvements and expansion to the metrics surface area. Interested users should review the individual sub-tasks.
5504
5505
5506 ---
5507
5508 * [HBASE-24545](https://issues.apache.org/jira/browse/HBASE-24545) | *Major* | **Add backoff to SCP check on WAL split completion**
5509
5510 Adds backoff in ServerCrashProcedure wait on WAL split to complete if large backlog of files to split (Its possible to avoid SCP blocking, waiting on WALs to split if you use procedure-based splitting --  set 'hbase.split.wal.zk.coordinated' to false to enable procedure based wal splitting.)
5511
5512
5513 ---
5514
5515 * [HBASE-24524](https://issues.apache.org/jira/browse/HBASE-24524) | *Minor* | **SyncTable logging improvements**
5516
5517 Notice this has changed log level for mismatching row keys, originally those were being logged at INFO level, now it's logged at DEBUG level. This is consistent with the logging of mismatching cells. Also, for missing row keys, it now logs row key values in human readable format, making it more meaningful for operators troubleshooting mismatches.
5518
5519
5520 ---
5521
5522 * [HBASE-24359](https://issues.apache.org/jira/browse/HBASE-24359) | *Major* | **Optionally ignore edits for deleted CFs for replication.**
5523
5524 Introduce a new config hbase.replication.drop.on.deleted.columnfamily, default is false. When config to true, the replication will drop the edits for columnfamily that has been deleted from the replication source and target.
5525
5526
5527 ---
5528
5529 * [HBASE-24418](https://issues.apache.org/jira/browse/HBASE-24418) | *Major* | **Consolidate Normalizer implementations**
5530
5531 <!-- markdown -->
5532 This change extends the Normalizer with a handful of new configurations. The configuration points supported are:
5533 * `hbase.normalizer.split.enabled` Whether to split a region as part of normalization. Default: `true`.
5534 * `hbase.normalizer.merge.enabled` Whether to merge a region as part of normalization. Default `true`.
5535 * `hbase.normalizer.min.region.count` The minimum number of regions in a table to consider it for merge normalization. Default: 3.
5536 * `hbase.normalizer.merge.min_region_age.days` The minimum age for a region to be considered for a merge, in days. Default: 3.
5537 * `hbase.normalizer.merge.min_region_size.mb` The minimum size for a region to be considered for a merge, in whole MBs. Default: 1.
5538
5539
5540 ---
5541
5542 * [HBASE-24309](https://issues.apache.org/jira/browse/HBASE-24309) | *Major* | **Avoid introducing log4j and slf4j-log4j dependencies for modules other than hbase-assembly**
5543
5544 Add a hbase-logging module, put the log4j related code in this module only so other modules do not need to depend on log4j at compile scope. See the comments of Log4jUtils and InternalLog4jUtils for more details.
5545
5546 Add a log4j.properties to the test jar of hbase-logging module, so for other sub modules we just need to depend on the test jar of hbase-logging module at test scope to output the log to console, without placing a log4j.properties in the test resources as they all (almost) have the same content. And this test module will not be included in the assembly tarball so it will not mess up the binary distribution.
5547
5548 Ban direct commons-logging dependency, and ban commons-logging and log4j imports in non-test code, to avoid mess up the downstream users logging framework. In hbase-logging module we do need to use log4j classes and the trick is to use full class name.
5549
5550 Add jcl-over-slf4j and jul-to-slf4j dependencies, as some of our dependencies use jcl or jul as logging framework, we should also redirect their log message to slf4j.
5551
5552
5553 ---
5554
5555 * [HBASE-21406](https://issues.apache.org/jira/browse/HBASE-21406) | *Minor* | **"status 'replication'" should not show SINK if the cluster does not act as sink**
5556
5557 Added new metric to differentiate sink startup time from last OP applied time.
5558
5559 Original behaviour was to always set startup time to TimestampsOfLastAppliedOp, and always show it on "status 'replication'" command, regardless if the sink ever applied any OP.
5560
5561 This was confusing, specially for scenarios where cluster was just acting as source, the output could lead to wrong interpretations about sink not applying edits or replication being stuck.
5562
5563 With the new metric, we now compare the two metrics values, assuming that if both are the same, there's never been any OP shipped to the given sink, so output would reflect it more clearly, to something as for example:
5564
5565 SINK: TimeStampStarted=Thu Dec 06 23:59:47 GMT 2018, Waiting for OPs...
5566
5567
5568 ---
5569
5570 * [HBASE-24132](https://issues.apache.org/jira/browse/HBASE-24132) | *Major* | **Upgrade to Apache ZooKeeper 3.5.7**
5571
5572 <!-- markdown -->
5573 HBase ships ZooKeeper 3.5.x. Was the EOL'd 3.4.x. 3.5.x client can talk to 3.4.x ensemble.
5574
5575 The ZooKeeper project has built a [FAQ](https://cwiki.apache.org/confluence/display/ZOOKEEPER/Upgrade+FAQ) that documents known issues and work-arounds when upgrading existing deployments.
5576
5577
5578 ---
5579
5580 * [HBASE-22287](https://issues.apache.org/jira/browse/HBASE-22287) | *Major* | **inifinite retries on failed server in RSProcedureDispatcher**
5581
5582 Add backoff. Avoid retrying every 100ms.
5583
5584
5585 ---
5586
5587 * [HBASE-24425](https://issues.apache.org/jira/browse/HBASE-24425) | *Major* | **Run hbck\_chore\_run and catalogjanitor\_run on draw of 'HBCK Report' page**
5588
5589 Runs 'catalogjanitor\_run' and 'hbck\_chore\_run' inline with the loading of the 'HBCK Report' page.
5590
5591 Pass '?cache=true' to skip inline invocation of 'catalogjanitor\_run' and 'hbck\_chore\_run' drawing the page.
5592
5593
5594 ---
5595
5596 * [HBASE-24408](https://issues.apache.org/jira/browse/HBASE-24408) | *Blocker* | **Introduce a general 'local region' to store data on master**
5597
5598 Introduced a general 'local region' at master side to store the procedure data, etc.
5599
5600 The hfile of this region will be stored on the root fs while the wal will be stored on the wal fs. This issue supercedes part of the code for HBASE-23326, as now we store the data in 'MasterData' directory instead of 'MasterProcs'.
5601
5602 The old hfiles will be moved to the global hfile archived directory with the suffix $-masterlocalhfile-$. The wal files will be moved to the global old wal directory with the suffix $masterlocalwal$. The TimeToLiveMasterLocalStoreHFileCleaner and TimeToLiveMasterLocalStoreWALCleaner are configured by default for cleaning the old hfiles and wal files, and the default TTLs are both 7 days.
5603
5604
5605 ---
5606
5607 * [HBASE-24115](https://issues.apache.org/jira/browse/HBASE-24115) | *Major* | **Relocate test-only REST "client" from src/ to test/ and mark Private**
5608
5609 Relocate test-only REST RemoteHTable and RemoteAdmin from src/ to test/. And mark them as InterfaceAudience.Private.
5610
5611
5612 ---
5613
5614 * [HBASE-23938](https://issues.apache.org/jira/browse/HBASE-23938) | *Major* | **Replicate slow/large RPC calls to HDFS**
5615
5616 Config key: hbase.regionserver.slowlog.systable.enabled
5617 Default value: false
5618
5619 This config can be enabled if hbase.regionserver.slowlog.buffer.enabled is already enabled. While hbase.regionserver.slowlog.buffer.enabled ensures that any slow/large RPC logs with complete details are written to ring buffer available at each RegionServer, hbase.regionserver.slowlog.systable.enabled would ensure that all such logs are also persisted in new system table hbase:slowlog.
5620 Operator can scan hbase:slowlog with filters to retrieve specific attribute matching records and this table would be useful to capture historical performance of slowness of RPC calls with detailed analysis.
5621
5622 hbase:slowlog consists of single ColumnFamily info. info consists of multiple qualifiers similar to the attributes available to query as part of Admin API: get\_slowlog\_responses.
5623
5624 One example of a row from hbase:slowlog scan result (Attached a sample screenshot in the Jira) :
5625
5626  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:call\_details, timestamp=2020-05-16T14:59:58.764Z, value=Scan(org.apache.hadoop.hbase.shaded.protobuf.generated.ClientProtos$ScanRequest)
5627  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:client\_address, timestamp=2020-05-16T14:59:58.764Z, value=172.20.10.2:57348
5628  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:method\_name, timestamp=2020-05-16T14:59:58.764Z, value=Scan
5629  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:param, timestamp=2020-05-16T14:59:58.764Z, value=region { type: REGION\_NAME value: "cluster\_test,cccccccc,1589635796466.aa45e1571d533f5ed0bb31cdccaaf9cf." } scan { a
5630                                                              ttribute { name: "\_isolationlevel\_" value: "\\x5C000" } start\_row: "cccccccc" time\_range { from: 0 to: 9223372036854775807 } max\_versions: 1 cache\_blocks: true max\_result\_size: 2
5631                                                              097152 caching: 2147483647 include\_stop\_row: false } number\_of\_rows: 2147483647 close\_scanner: false client\_handles\_partials: true client\_handles\_heartbeats: true track\_scan\_met
5632                                                              rics: false
5633  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:processing\_time, timestamp=2020-05-16T14:59:58.764Z, value=24
5634  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:queue\_time, timestamp=2020-05-16T14:59:58.764Z, value=0
5635  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:region\_name, timestamp=2020-05-16T14:59:58.764Z, value=cluster\_test,cccccccc,1589635796466.aa45e1571d533f5ed0bb31cdccaaf9cf.
5636  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:response\_size, timestamp=2020-05-16T14:59:58.764Z, value=211227
5637  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:server\_class, timestamp=2020-05-16T14:59:58.764Z, value=HRegionServer
5638  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:start\_time, timestamp=2020-05-16T14:59:58.764Z, value=1589640743932
5639  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:type, timestamp=2020-05-16T14:59:58.764Z, value=ALL
5640  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:username, timestamp=2020-05-16T14:59:58.764Z, value=vjasani
5641
5642
5643 ---
5644
5645 * [HBASE-24271](https://issues.apache.org/jira/browse/HBASE-24271) | *Major* | **Set values in \`conf/hbase-site.xml\` that enable running on \`LocalFileSystem\` out of the box**
5646
5647 <!-- markdown -->
5648 HBASE-24271 makes changes the the default `conf/hbase-site.xml` such that `bin/hbase` will run directly out of the binary tarball or a compiled source tree without any configuration modifications vs. Hadoop 2.8+. This changes our long-standing history of shipping no configured values in `conf/hbase-site.xml`, so existing processes that assume this file is empty of configuration properties may require attention.
5649
5650
5651 ---
5652
5653 * [HBASE-24310](https://issues.apache.org/jira/browse/HBASE-24310) | *Major* | **Use Slf4jRequestLog for hbase-http**
5654
5655 Use Slf4jRequestLog instead of the log4j HttpRequestLogAppender in HttpServer.
5656
5657 The request log is disabled by default in conf/log4j.properties by the following lines:
5658
5659 # Disable request log by default, you can enable this by changing the appender
5660 log4j.category.http.requests=INFO,NullAppender
5661 log4j.additivity.http.requests=false
5662
5663 Change the 'NullAppender' to what ever you want if you want to enable request log.
5664
5665 Notice that, the logger name for master status http server is 'http.requests.master', and for region server it is 'http.requests.regionserver'
5666
5667
5668 ---
5669
5670 * [HBASE-24335](https://issues.apache.org/jira/browse/HBASE-24335) | *Major* | **Support deleteall with ts but without column in shell mode**
5671
5672 Use a empty string to represent no column specified for deleteall in shell mode.
5673 useage:
5674 deleteall 'test','r1','',12345
5675 deleteall 'test', {ROWPREFIXFILTER =\> 'prefix'}, '', 12345
5676
5677
5678 ---
5679
5680 * [HBASE-24304](https://issues.apache.org/jira/browse/HBASE-24304) | *Major* | **Separate a hbase-asyncfs module**
5681
5682 Added a new hbase-asyncfs module to hold the asynchronous dfs output stream implementation for implementing WAL.
5683
5684
5685 ---
5686
5687 * [HBASE-22710](https://issues.apache.org/jira/browse/HBASE-22710) | *Major* | **Wrong result in one case of scan that use  raw and versions and filter together**
5688
5689 Make the logic of the versions chosen more reasonable for raw scan, to avoid lose result when using filter.
5690
5691
5692 ---
5693
5694 * [HBASE-24285](https://issues.apache.org/jira/browse/HBASE-24285) | *Major* | **Move to hbase-thirdparty-3.3.0**
5695
5696 Moved to hbase-thirdparty 3.3.0.
5697
5698
5699 ---
5700
5701 * [HBASE-24252](https://issues.apache.org/jira/browse/HBASE-24252) | *Major* | **Implement proxyuser/doAs mechanism for hbase-http**
5702
5703 This feature enables the HBase Web UI's to accept a 'proxyuser' via the HTTP Request's query string. When the parameter \`hbase.security.authentication.spnego.kerberos.proxyuser.enable\` is set to \`true\` in hbase-site.xml (default is \`false\`), the HBase UI will attempt to impersonate the user specified by the query parameter "doAs". This query parameter is checked case-insensitively. When this option is not provided, the user who executed the request is the "real" user and there is no ability to execute impersonation against the WebUI.
5704
5705 For example, if the user "bob" with Kerberos credentials executes a request against the WebUI with this feature enabled and a query string which includes \`doAs=alice\`, the HBase UI will treat this request as executed as \`alice\`, not \`bob\`.
5706
5707 The standard Hadoop proxyuser configuration properties to limit users who may impersonate others apply to this change (e.g. to enable \`bob\` to impersonate \`alice\`). See the Hadoop documentation for more information on how to configure these proxyuser rules.
5708
5709
5710 ---
5711
5712 * [HBASE-24143](https://issues.apache.org/jira/browse/HBASE-24143) | *Major* | **[JDK11] Switch default garbage collector from CMS**
5713
5714 <!-- markdown -->
5715 `bin/hbase` will now dynamically select a Garbage Collector implementation based on the detected JVM version. JDKs 8,9,10 use `-XX:+UseConcMarkSweepGC`, while JDK11+ use `-XX:+UseG1GC`.
5716
5717 Notice a slight compatibility change. Previously, the garbage collector choice would always be appended to a user-provided value for `HBASE_OPTS`. As of this change, this setting will only be applied when `HBASE_OPTS` is unset. That means that operators who provide a value for this variable will now need to also specify the collector. This is especially important for those on JDK8, where the vm default GC is not the recommended ConcMarkSweep.
5718
5719
5720 ---
5721
5722 * [HBASE-24024](https://issues.apache.org/jira/browse/HBASE-24024) | *Major* | **Optionally reject multi() requests with very high no of rows**
5723
5724 New Config: hbase.rpc.rows.size.threshold.reject
5725 -----------------------------------------------------------------------
5726
5727 Default value: false
5728 Description:
5729 If value is true, RegionServer will abort batch requests of Put/Delete with number of rows in a batch operation exceeding threshold defined by value of config: hbase.rpc.rows.warning.threshold.
5730
5731
5732 ---
5733
5734 * [HBASE-24139](https://issues.apache.org/jira/browse/HBASE-24139) | *Critical* | **Balancer should avoid leaving idle region servers**
5735
5736 StochasticLoadBalancer functional improvement:
5737
5738 StochasticLoadBalancer would rebalance the cluster if there are any idle RegionServers in the cluster (RegionServer having no region), while other RegionServers have at least 1 region available.
5739
5740
5741 ---
5742
5743 * [HBASE-24196](https://issues.apache.org/jira/browse/HBASE-24196) | *Major* | **[Shell] Add rename rsgroup command in hbase shell**
5744
5745 user or admin can now use
5746 hbase shell \> rename\_rsgroup 'oldname', 'newname'
5747 to rename rsgroup.
5748
5749
5750 ---
5751
5752 * [HBASE-24218](https://issues.apache.org/jira/browse/HBASE-24218) | *Major* | **Add hadoop 3.2.x in hadoop check**
5753
5754 Add hadoop-3.2.0 and hadoop-3.2.1 in hadoop check and when '--quick-hadoopcheck' we will only check hadoop-3.2.1.
5755
5756 Notice that, for aligning the personality scripts across all the active branches, we will commit the patch to all active branches, but the hadoop-3.2.x support in hadoopcheck is only applied to branch-2.2+.
5757
5758
5759 ---
5760
5761 * [HBASE-23829](https://issues.apache.org/jira/browse/HBASE-23829) | *Major* | **Get \`-PrunSmallTests\` passing on JDK11**
5762
5763 \`-PrunSmallTests\` now pass on JDK11 when using \`-Phadoop.profile=3.0\`.
5764
5765
5766 ---
5767
5768 * [HBASE-24185](https://issues.apache.org/jira/browse/HBASE-24185) | *Major* | **Junit tests do not behave well with System.exit or Runtime.halt or JVM exits in general.**
5769
5770 Tests that fail because a process -- RegionServer or Master -- called System.exit, will now instead throw an exception.
5771
5772
5773 ---
5774
5775 * [HBASE-24072](https://issues.apache.org/jira/browse/HBASE-24072) | *Major* | **Nightlies reporting OutOfMemoryError: unable to create new native thread**
5776
5777 Hadoop hosts have had their ulimit -u raised from 10000 to 30000 (per user, by INFRA). The Docker build container has had its limit raised from 10000 to 12500.
5778
5779
5780 ---
5781
5782 * [HBASE-24112](https://issues.apache.org/jira/browse/HBASE-24112) | *Major* | **[RSGroup] Support renaming rsgroup**
5783
5784 Support RSGroup renaming in core codebase. New API Admin#renameRSGroup(String, String) is introduced in 3.0.0.
5785
5786
5787 ---
5788
5789 * [HBASE-23994](https://issues.apache.org/jira/browse/HBASE-23994) | *Trivial* | ** Add WebUI to Canary**
5790
5791 <!-- markdown -->
5792 The Canary tool now offers a WebUI when run in `region` mode (the default mode). It is enabled by default, and by default, it binds to `0.0.0.0:16050`. This can be overridden by setting `hbase.canary.info.bindAddress` and `hbase.canary.info.port`. To disable entirely, set the port to `-1`.
5793
5794
5795 ---
5796
5797 * [HBASE-23779](https://issues.apache.org/jira/browse/HBASE-23779) | *Major* | **Up the default fork count to make builds complete faster; make count relative to CPU count**
5798
5799 Pass --threads=2 building on jenkins. It shortens nightly build times by about ~25%.
5800
5801 It works by running module build/test in parallel when dependencies allow. Upping the forkcount beyond the pom default of 0.25C would have us broach our CPU budget on jenkins when two modules are running in parallel (2 modules at 0.25% of CPU each makes 0.5C and on jenkins, hadoop nodes run two jenkins executors per host).  Higher forkcounts also seems to threaten build stability.
5802
5803 For running tests locally, to go faster, up fork count.
5804
5805 $ x="0.5C"  ;  mvn --threads=2  -Dsurefire.firstPartForkCount=$x -Dsurefire.secondPartForkCount=$x test -PrunAllTests
5806
5807 You could up the x from 0.5C to 1.0C but YMMV (On overcommitted hardware, tests start bombing out pretty soon after startup). You could try upping thread count but on occasion are likely to overcommit hardware.
5808
5809
5810 ---
5811
5812 * [HBASE-24126](https://issues.apache.org/jira/browse/HBASE-24126) | *Major* | **Up the container nproc uplimit from 10000 to 12500**
5813
5814 Start docker with upped ulimit for nproc passing '--ulimit nproc=12500'. It was 10000, the default, but made it 12500. Then, set PROC\_LIMIT in hbase-personality so when yetus runs, it is w/ the new 12500 value.
5815
5816
5817 ---
5818
5819 * [HBASE-24150](https://issues.apache.org/jira/browse/HBASE-24150) | *Major* | **Allow module tests run in parallel**
5820
5821 Pass -T2 to mvn. Makes it so we do two modules-at-a-time dependencies willing. Helps speed build and testing. Doubles the resource usage when running modules in parallel.
5822
5823
5824 ---
5825
5826 * [HBASE-24121](https://issues.apache.org/jira/browse/HBASE-24121) | *Major* | **[Authorization] ServiceAuthorizationManager isn't dynamically updatable. And it should be.**
5827
5828 Master & RegionService now support refresh policy authorization defined in hbase-policy.xml without restarting service. To refresh policy, please execute hbase shell command: update\_config or update\_config\_all after policy file updated and synced on all nodes.
5829
5830
5831 ---
5832
5833 * [HBASE-24099](https://issues.apache.org/jira/browse/HBASE-24099) | *Major* | **Use a fair ReentrantReadWriteLock for the region close lock**
5834
5835 This change modifies the default acquisition policy for the region's close lock in order to prevent observed starvation of close requests. The new boolean configuration parameter 'hbase.regionserver.fair.region.close.lock' controls the lock acquisition policy: if true, the lock is created in fair mode (default); if false, the lock is created in nonfair mode (the old default).
5836
5837
5838 ---
5839
5840 * [HBASE-23153](https://issues.apache.org/jira/browse/HBASE-23153) | *Major* | **PrimaryRegionCountSkewCostFunction SLB function should implement CostFunction#isNeeded**
5841
5842 <!-- markdown -->
5843 The `PrimaryRegionCountSkewCostFunction` for the `StochasticLoadBalancer` is only needed when the read replicas feature is enabled. With this change, that function now properly indicates that it is not needed when the read replica feature is off.
5844
5845 If this improvement is not available, operators with clusters that are not using the read replica feature should manually disable it by setting `hbase.master.balancer.stochastic.primaryRegionCountCost` to `0.0` in hbase-site.xml for all HBase Masters.
5846
5847
5848 ---
5849
5850 * [HBASE-24055](https://issues.apache.org/jira/browse/HBASE-24055) | *Major* | **Make AsyncFSWAL can run on EC cluster**
5851
5852 Now AsyncFSWAL can also be used against the directory which has EC enabled. Need to make sure you also make use of the hadoop 3.x client as the option is only available in hadoop 3.x.
5853
5854
5855 ---
5856
5857 * [HBASE-24113](https://issues.apache.org/jira/browse/HBASE-24113) | *Major* | **Upgrade the maven we use from 3.5.4 to 3.6.3 in nightlies**
5858
5859 Branches-2.3+ use maven 3.5.3 building. Older branches use 3.5.4 still.
5860
5861
5862 ---
5863
5864 * [HBASE-24122](https://issues.apache.org/jira/browse/HBASE-24122) | *Major* | **Change machine ulimit-l to ulimit-a so dumps full ulimit rather than just 'max locked memory'**
5865
5866 Our 'Build Artifacts' have a machine directory under which we emit vitals on the host the build was run on. We used to emit the result of 'ulimit -l' as a file named 'ulimit-l'. This has been hijacked to instead emit result of running 'ulimit -a' which includes stat on ulimit -l.
5867
5868
5869 ---
5870
5871 * [HBASE-23678](https://issues.apache.org/jira/browse/HBASE-23678) | *Major* | **Literate builder API for version management in schema**
5872
5873 ColumnFamilyDescriptor new builder API:
5874
5875     /\*\*
5876      \* Retain all versions for a given TTL(retentionInterval), and then only a specific number
5877      \* of versions(versionAfterInterval) after that interval elapses.
5878      \*
5879      \* @param retentionInterval Retain all versions for this interval
5880      \* @param versionAfterInterval Retain no of versions to retain after retentionInterval
5881      \*/
5882     public ModifyableColumnFamilyDescriptor setVersionsWithTimeToLive(
5883         final int retentionInterval, final int versionAfterInterval)
5884
5885
5886 ---
5887
5888 * [HBASE-24050](https://issues.apache.org/jira/browse/HBASE-24050) | *Major* | **Deprecated PBType on all 2.x branches**
5889
5890 org.apache.hadoop.hbase.types.PBType is marked as deprecated without any replacement. It will be moved to hbase-example module and marked as IA.Private in 3.0.0. This is a mistake as it should not be part of our public API. Users who depend on this class should just copy the code your own code base.
5891
5892
5893 ---
5894
5895 * [HBASE-8868](https://issues.apache.org/jira/browse/HBASE-8868) | *Minor* | **add metric to report client shortcircuit reads**
5896
5897 Expose file system level read metrics for RegionServer.
5898
5899 If the HBase RS runs on top of HDFS, calculate the aggregation of
5900 ReadStatistics of each HdfsFileInputStream. These metrics include:
5901 (1) total number of bytes read from HDFS.
5902 (2) total number of bytes read from local DataNode.
5903 (3) total number of bytes read locally through short-circuit read.
5904 (4) total number of bytes read locally through zero-copy read.
5905
5906 Because HDFS ReadStatistics is calculated per input stream, it is not
5907 feasible to update the aggregated number in real time. Instead, the
5908 metrics are updated when an input stream is closed.
5909
5910
5911 ---
5912
5913 * [HBASE-24032](https://issues.apache.org/jira/browse/HBASE-24032) | *Major* | **[RSGroup] Assign created tables to respective rsgroup automatically instead of manual operations**
5914
5915 Admin can determine which tables go to which rsgroup by script  (setting hbase.rsgroup.table.mapping.script with local filystem path) on Master side which aims to lighten the burden of admin operations.  Note, since HBase 3+, rsgroup can be specified in TableDescriptor as well, if clients specify this, master will skip the determination from script.
5916
5917 Here is a simple example of script:
5918 {code}
5919 # Input consists of two string, 1st is the namespace of the table, 2nd is the table name of the table
5920 #!/bin/bash
5921 namespace=$1
5922 tablename=$2
5923 if [[ $namespace == test ]]; then
5924   echo test
5925 elif [[ $tablename == \*foo\* ]]; then
5926   echo other
5927 else
5928   echo default
5929 fi
5930 {code}
5931
5932
5933 ---
5934
5935 * [HBASE-23993](https://issues.apache.org/jira/browse/HBASE-23993) | *Major* | **Use loopback for zk standalone server in minizkcluster**
5936
5937 MiniZKCluster now puts up its standalone node listening on loopback/127.0.0.1 rather than "localhost".
5938
5939
5940 ---
5941
5942 * [HBASE-23986](https://issues.apache.org/jira/browse/HBASE-23986) | *Major* | **Bump hadoop-two.version to 2.10.0 on master and branch-2**
5943
5944 Bumped hadoop-two.version to 2.10.0, which means we will drop the support for hadoop-2.8.x and hadoop-2.9.x.
5945
5946
5947 ---
5948
5949 * [HBASE-23930](https://issues.apache.org/jira/browse/HBASE-23930) | *Minor* | **Shell should attempt to format \`timestamp\` attributes as ISO-8601**
5950
5951 Change timestamp display to be ISO8601 when toString on Cell and outputting in shell....
5952
5953 User used to see....
5954
5955   column=table:state, timestamp=1583967620343 .....
5956
5957 ... but now sees:
5958
5959   column=table:state, timestamp=2020-03-11T23:00:20.343Z ....
5960
5961
5962 ---
5963
5964 * [HBASE-22827](https://issues.apache.org/jira/browse/HBASE-22827) | *Major* | **Expose multi-region merge in shell and Admin API**
5965
5966 merge\_region shell command can now be used to merge more than 2 regions as well. It takes a list of regions as comma separated values or as an array of regions, and not just 2 regions. The full regionnames and encoded regionnames are continued to be accepted.
5967
5968
5969 ---
5970
5971 * [HBASE-23767](https://issues.apache.org/jira/browse/HBASE-23767) | *Major* | **Add JDK11 compilation and unit test support to Github precommit**
5972
5973 Rebuild our Dockerfile with support for multiple JDK versions. Use multiple stages in the Jenkinsfile instead of yetus's multijdk because of YETUS-953. Run those multiple stages in parallel to speed up results.
5974
5975 Note that multiple stages means multiple Yetus invocations means multiple comments on the PreCommit. This should become more obvious to users once we can make use of GitHub Checks API, HBASE-23902.
5976
5977
5978 ---
5979
5980 * [HBASE-22978](https://issues.apache.org/jira/browse/HBASE-22978) | *Minor* | **Online slow response log**
5981
5982 get\_slowlog\_responses and clear\_slowlog\_responses are used to retrieve and clear slow RPC logs from RingBuffer maintained by RegionServers.
5983
5984 New Admin APIs:
5985 1.   List\<SlowLogRecord\> getSlowLogResponses(final Set\<ServerName\> serverNames,
5986       final SlowLogQueryFilter slowLogQueryFilter) throws IOException;
5987
5988 2.   List\<Boolean\> clearSlowLogResponses(final Set\<ServerName\> serverNames)
5989       throws IOException;
5990
5991 Configs:
5992
5993 1. hbase.regionserver.slowlog.ringbuffer.size:
5994 Default size of ringbuffer to be maintained by each RegionServer in order to store online slowlog responses. This is an in-memory ring buffer of requests that were judged to be too slow in addition to the responseTooSlow logging. The in-memory representation would be complete. For more details, please look into Doc Section: Get Slow Response Log from shell
5995
5996 Default
5997 256
5998
5999 2. hbase.regionserver.slowlog.buffer.enabled:
6000 Indicates whether RegionServers have ring buffer running for storing Online Slow logs in FIFO manner with limited entries. The size of the ring buffer is indicated by config: hbase.regionserver.slowlog.ringbuffer.size The default value is false, turn this on and get latest slowlog responses with complete data.
6001
6002 Default
6003 false
6004
6005
6006 For more details, please look into "Get Slow Response Log from shell" section from HBase book.
6007
6008
6009 ---
6010
6011 * [HBASE-23926](https://issues.apache.org/jira/browse/HBASE-23926) | *Major* | **[Flakey Tests] Down the flakies re-run ferocity; it makes for too many fails.**
6012
6013 Down the flakey re-rerun fork count from 1.0C -- i.e. a fork per CPU -- to 0.25C. On a recent run, the machine had 16 cores. 0.25 is 4 cores. We'd hardcoded fork count at 3 previous to changes made by parent.
6014
6015
6016 ---
6017
6018 * [HBASE-23146](https://issues.apache.org/jira/browse/HBASE-23146) | *Major* | **Support CheckAndMutate with multiple conditions**
6019
6020 Add a checkAndMutate(row, filter) method in the AsyncTable interface and the Table interface.
6021
6022 This method atomically checks if the row matches the specified filter. If it does, it adds the Put/Delete/RowMutations.
6023
6024 This is a fluent style API, the code is like:
6025
6026 For Table interface:
6027 {code}
6028 table.checkAndMutate(row, filter).thenPut(put);
6029 {code}
6030
6031 For AsyncTable interface:
6032 {code}
6033 table.checkAndMutate(row, filter).thenPut(put)
6034     .thenAccept(succ -\> {
6035       if (succ) {
6036         System.out.println("Check and put succeeded");
6037       } else {
6038         System.out.println("Check and put failed");
6039       }
6040     });
6041 {code}
6042
6043
6044 ---
6045
6046 * [HBASE-23874](https://issues.apache.org/jira/browse/HBASE-23874) | *Minor* | **Move Jira-attached file precommit definition from script in Jenkins config to dev-support**
6047
6048 The Jira Precommit job (https://builds.apache.org/job/PreCommit-HBASE-Build/) will now look for a file within the source tree (dev-support/jenkins\_precommit\_jira\_yetus.sh) instead of depending on a script section embedded in the job.
6049
6050
6051 ---
6052
6053 * [HBASE-23865](https://issues.apache.org/jira/browse/HBASE-23865) | *Major* | **Up flakey history from 5 to 10**
6054
6055 Changed flakey list reporting to show 5 rather than 10 items. Also changed the second and first part fort counts to be 1C rather than hardcoded 3.
6056
6057
6058 ---
6059
6060 * [HBASE-23554](https://issues.apache.org/jira/browse/HBASE-23554) | *Major* | **Encoded regionname to regionname utility**
6061
6062     Adds shell command regioninfo:
6063
6064       hbase(main):001:0\>  regioninfo '0e6aa5c19ae2b2627649dc7708ce27d0'
6065       {ENCODED =\> 0e6aa5c19ae2b2627649dc7708ce27d0, NAME =\> 'TestTable,,1575941375972.0e6aa5c19ae2b2627649dc7708ce27d0.', STARTKEY =\> '', ENDKEY =\> '00000000000000000000299441'}
6066       Took 0.4737 seconds
6067
6068
6069 ---
6070
6071 * [HBASE-23350](https://issues.apache.org/jira/browse/HBASE-23350) | *Major* | **Make compaction files cacheonWrite configurable based on threshold**
6072
6073 This JIRA adds a new configuration - \`hbase.rs.cachecompactedblocksonwrite.threshold\`. This configuration is the maximum total size (in bytes) of the compacted files below which the configuration \`hbase.rs.cachecompactedblocksonwrite\` is honoured. If the total size of the compacted fies exceeds this threshold, even when \`hbase.rs.cachecompactedblocksonwrite\` is enabled, the data blocks are not cached. Caching index and bloom blocks is not affected by this configuration (user configuration is always honoured).
6074
6075 Default value of this configuration is Long.MAX\_VALUE. This means whatever the total size of the compacted files, it wil be cached.
6076
6077
6078 ---
6079
6080 * [HBASE-17115](https://issues.apache.org/jira/browse/HBASE-17115) | *Major* | **HMaster/HRegion Info Server does not honour admin.acl**
6081
6082 Implements authorization for the HBase Web UI by limiting access to certain endpoints which could be used to extract sensitive information from HBase.
6083
6084 Access to these restricted endpoints can be limited to a group of administrators, identified either by a list of users (hbase.security.authentication.spnego.admin.users) or by a list of groups
6085 (hbase.security.authentication.spnego.admin.groups).  By default, neither of these values are set which will preserve backwards compatibility (allowing all authenticated users to access all endpoints).
6086
6087 Further, users who have sensitive information in the HBase service configuration can set hbase.security.authentication.ui.config.protected to true which will treat the configuration endpoint as a protected, admin-only resource. By default, all authenticated users may access the configuration endpoint.
6088
6089
6090 ---
6091
6092 * [HBASE-23647](https://issues.apache.org/jira/browse/HBASE-23647) | *Major* | **Make MasterRegistry the default registry impl**
6093
6094 <!-- markdown -->
6095 Enables master based registry as the default registry used by clients to fetch connection metadata.
6096 Refer to the section "Master Registry" in the client documentation for more details and advantages
6097 of this implementation over the default Zookeeper based registry.
6098
6099 Configuration parameter that controls the registry in use: `hbase.client.registry.impl`
6100
6101 Where to set this: HBase client configuration (hbase-site.xml)
6102
6103 Possible values:
6104 - `org.apache.hadoop.hbase.client.ZKConnectionRegistry` (For ZK based registry implementation)
6105 - `org.apache.hadoop.hbase.client.MasterRegistry` (New, for master based registry implementation)
6106
6107 Notes on defaults:
6108
6109 - For v3.0.0 and later, MasterRegistry is the default registry
6110 - For all releases in 2.x line, ZK based registry is the default.
6111
6112 This feature has been back ported to 2.3.0 and later releases. MasterRegistry can be enabled by setting the following client configuration.
6113
6114 ```
6115 <property>
6116   <name>hbase.client.registry.impl</name>
6117   <value>org.apache.hadoop.hbase.client.MasterRegistry</value>
6118 </property>
6119 ```
6120
6121
6122 ---
6123
6124 * [HBASE-23069](https://issues.apache.org/jira/browse/HBASE-23069) | *Critical* | **periodic dependency bump for Sep 2019**
6125
6126 caffeine: 2.6.2 =\> 2.8.1
6127 commons-codec: 1.10 =\> 1.13
6128 commons-io: 2.5 =\> 2.6
6129 disrupter: 3.3.6 =\> 3.4.2
6130 httpcore: 4.4.6 =\> 4.4.13
6131 jackson: 2.9.10 =\> 2.10.1
6132 jackson.databind: 2.9.10.1 =\> 2.10.1
6133 jetty: 9.3.27.v20190418 =\> 9.3.28.v20191105
6134 protobuf.plugin: 0.5.0 =\> 0.6.1
6135 zookeeper: 3.4.10 =\> 3.4.14
6136 slf4j: 1.7.25 =\> 1.7.30
6137 rat: 0.12 =\> 0.13
6138 asciidoctor: 1.5.5 =\> 1.5.8
6139 asciidoctor.pdf: 1.5.0-alpha.15 =\> 1.5.0-rc.2
6140 error-prone: 2.3.3 =\> 2.3.4
6141
6142
6143 ---
6144
6145 * [HBASE-23686](https://issues.apache.org/jira/browse/HBASE-23686) | *Major* | **Revert binary incompatible change and remove reflection**
6146
6147 - Reverts a binary incompatible binary change for ByteRangeUtils
6148 - Usage of reflection inside CommonFSUtils removed
6149
6150
6151 ---
6152
6153 * [HBASE-23347](https://issues.apache.org/jira/browse/HBASE-23347) | *Major* | **Pluggable RPC authentication**
6154
6155 This change introduces an internal abstraction layer which allows for new SASL-based authentication mechanisms to be used inside HBase services. All existing SASL-based authentication mechanism were ported to the new abstraction, making no external change in runtime semantics, client API, or RPC serialization format.
6156
6157 Developers familiar with extending HBase can implement authentication mechanism beyond simple Kerberos and DelegationTokens which authenticate HBase users against some other user database. HBase service authentication (Master to/from RegionServer) continue to operate solely over Kerberos.
6158
6159
6160 ---
6161
6162 * [HBASE-23156](https://issues.apache.org/jira/browse/HBASE-23156) | *Major* | **start-hbase.sh failed with ClassNotFoundException when build with hadoop3**
6163
6164 Introduce a new hbase-assembly/src/main/assembly/hadoop-three-compat.xml for build with hadoop 3.x.
6165
6166
6167 ---
6168
6169 * [HBASE-23680](https://issues.apache.org/jira/browse/HBASE-23680) | *Major* | **RegionProcedureStore missing cleaning of hfile archive**
6170
6171 Add a new config to hbase-default.xml
6172
6173   \<property\>
6174     \<name\>hbase.procedure.store.region.hfilecleaner.plugins\</name\>
6175     \<value\>org.apache.hadoop.hbase.master.cleaner.TimeToLiveHFileCleaner\</value\>
6176     \<description\>A comma-separated list of BaseHFileCleanerDelegate invoked by
6177     the RegionProcedureStore HFileCleaner service. These HFiles cleaners are
6178     called in order, so put the cleaner that prunes the most files in front. To
6179     implement your own BaseHFileCleanerDelegate, just put it in HBase's classpath
6180     and add the fully qualified class name here. Always add the above
6181     default hfile cleaners in the list as they will be overwritten in
6182     hbase-site.xml.\</description\>
6183   \</property\>
6184
6185 It will share the same TTL with other HFileCleaners. And you can also implement your own cleaner and change this property to enable it.
6186
6187
6188 ---
6189
6190 * [HBASE-23675](https://issues.apache.org/jira/browse/HBASE-23675) | *Minor* | **Move to Apache parent POM version 22**
6191
6192 Updated parent pom to Apache version 22.
6193
6194
6195 ---
6196
6197 * [HBASE-23679](https://issues.apache.org/jira/browse/HBASE-23679) | *Critical* | **FileSystem instance leaks due to bulk loads with Kerberos enabled**
6198
6199 This issues fixes an issue with Bulk Loading on installations with Kerberos enabled and more than a single RegionServer. When multiple tables are involved in hosting a table's regions which are being bulk-loaded into, all but the RegionServer hosting the table's first Region will "leak" one DistributedFileSystem object onto the heap, never freeing that memory. Eventually, with enough bulk loads, this will create a situation for RegionServers where they have no free heap space and will either spend all time in JVM GC, lose their ZK session, or crash with an OutOfMemoryError.
6200
6201 The only mitigation for this issue is to periodically restart RegionServers. All earlier versions of HBase 2.x are subject to this issue (2.0.x, \<=2.1.8, \<=2.2.3)
6202
6203
6204 ---
6205
6206 * [HBASE-23286](https://issues.apache.org/jira/browse/HBASE-23286) | *Major* | **Improve MTTR: Split WAL to HFile**
6207
6208 Add a new feature to improve MTTR which have 3 steps to failover:
6209 1. Read WAL and write HFile to region’s column family’s recovered.hfiles directory.
6210 2. Open region.
6211 3. Bulkload the recovered.hfiles for every column family.
6212
6213 Compared to DLS(distributed log split), this feature will reduce region open time significantly.
6214
6215 Config hbase.wal.split.to.hfile to true to enable this featue.
6216
6217
6218 ---
6219
6220 * [HBASE-23619](https://issues.apache.org/jira/browse/HBASE-23619) | *Trivial* | **Use built-in formatting for logging in hbase-zookeeper**
6221
6222 Changed the logging in hbase-zookeeper to use built-in formatting
6223
6224
6225 ---
6226
6227 * [HBASE-23628](https://issues.apache.org/jira/browse/HBASE-23628) | *Minor* | **Replace Apache Commons Digest Base64 with JDK8 Base64**
6228
6229 From the PR:
6230
6231 "Yes. The two create the same output... I just wrote a small test suite to increase my confidence on that. I generated many tens of millions of random byte patterns and compared the output of the two algorithms. They came back identical every time.
6232
6233 "Just in case any inquiring minds would like to know, there is no longer an encoding required when generating the strings. The JDK implementation specifically specifies that strings returned are StandardCharsets.ISO\_8859\_1. This does not change anything because UTF8 and ISO\_8859 overlap for the limited character set (64 characters) the encoding uses."
6234
6235
6236 ---
6237
6238 * [HBASE-23651](https://issues.apache.org/jira/browse/HBASE-23651) | *Major* | **Region balance throttling can be disabled**
6239
6240 Set hbase.balancer.max.balancing to a int value which \<=0 will disable region balance throttling.
6241
6242
6243 ---
6244
6245 * [HBASE-23588](https://issues.apache.org/jira/browse/HBASE-23588) | *Major* | **Cache index blocks and bloom blocks on write if CacheCompactedBlocksOnWrite is enabled**
6246
6247 If cacheOnWrite is enabled during flush or compaction, index and bloom blocks(with data blocks) would be automatically cached during write.
6248
6249
6250 ---
6251
6252 * [HBASE-23369](https://issues.apache.org/jira/browse/HBASE-23369) | *Major* | **Auto-close 'unknown' Regions reported as OPEN on RegionServers**
6253
6254 If a RegionServer reports a Region as OPEN in disagreement with Master's status on the Region, the Master now tells the RegionServer to silently close the Region.
6255
6256
6257 ---
6258
6259 * [HBASE-23596](https://issues.apache.org/jira/browse/HBASE-23596) | *Major* | **HBCKServerCrashProcedure can double assign**
6260
6261 Makes it so the recently added HBCKServerCrashProcedure -- the SCP that gets invoked when an operator schedules an SCP via hbck2 scheduleRecoveries command -- now works the same as SCP EXCEPT if master knows nothing of the scheduled servername. In this latter case, HBCKSCP will do a full scan of hbase:meta looking for instances of the passed servername. If any found it will attempt cleanup of hbase:meta references by reassigning any found OPEN or OPENING and by closing any in CLOSING state.
6262
6263 Used to fix instances of what the 'HBCK Report' page shows as 'Unknown Servers'.
6264
6265
6266 ---
6267
6268 * [HBASE-23624](https://issues.apache.org/jira/browse/HBASE-23624) | *Major* | **Add a tool to dump the procedure info in HFile**
6269
6270 Use ./hbase org.apache.hadoop.hbase.procedure2.store.region.HFileProcedurePrettyPrinter to run the tool.
6271
6272
6273 ---
6274
6275 * [HBASE-23590](https://issues.apache.org/jira/browse/HBASE-23590) | *Major* | **Update maxStoreFileRefCount to maxCompactedStoreFileRefCount**
6276
6277 RegionsRecoveryChore introduced as part of HBASE-22460 tries to reopen regions based on config: hbase.regions.recovery.store.file.ref.count.
6278 Region reopen needs to take into consideration all compacted away store files that belong to the region and not store files(non-compacted).
6279
6280 Fixed this bug as part of this Jira.
6281 Updated description for corresponding configs:
6282
6283 1. hbase.master.regions.recovery.check.interval :
6284
6285 Regions Recovery Chore interval in milliseconds. This chore keeps running at this interval to find all regions with configurable max store file ref count and reopens them. Defaults to 20 mins
6286
6287 2. hbase.regions.recovery.store.file.ref.count :
6288
6289 Very large number of ref count on a compacted store file indicates that it is a ref leak on that object(compacted store file). Such files can not be removed after it is invalidated via compaction. Only way to recover in such scenario is to reopen the region which can release all resources, like the refcount, leases, etc. This config represents Store files Ref Count threshold value considered for reopening regions. Any region with compacted store files ref count \> this value would be eligible for reopening by master. Here, we get the max refCount among all refCounts on all compacted away store files that belong to a particular region. Default value -1 indicates this feature is turned off. Only positive integer value should be provided to enable this feature.
6290
6291
6292 ---
6293
6294 * [HBASE-23618](https://issues.apache.org/jira/browse/HBASE-23618) | *Major* | **Add a tool to dump procedure info in the WAL file**
6295
6296 Use ./hbase org.apache.hadoop.hbase.procedure2.store.region.WALProcedurePrettyPrinter to run the tool.
6297
6298
6299 ---
6300
6301 * [HBASE-23617](https://issues.apache.org/jira/browse/HBASE-23617) | *Major* | **Add a stress test tool for region based procedure store**
6302
6303 Use ./hbase org.apache.hadoop.hbase.procedure2.store.region.RegionProcedureStorePerformanceEvaluation to run the tool.
6304
6305
6306 ---
6307
6308 * [HBASE-23326](https://issues.apache.org/jira/browse/HBASE-23326) | *Critical* | **Implement a ProcedureStore which stores procedures in a HRegion**
6309
6310 Use a region based procedure store to replace the old customized WAL based procedure store. The procedure data migration is done automatically during upgrading. After upgrading, the MasterProcWALs directory will be deleted and a new MasterProc directory will be created. And notice that a region will still write WAL so we still have WAL files and they will be moved to the oldWALs directory. The file name is mostly like a normal WAL file, and the only difference is that it is ended with "$masterproc$".
6311
6312
6313 ---
6314
6315 * [HBASE-23320](https://issues.apache.org/jira/browse/HBASE-23320) | *Major* | **Upgrade surefire plugin to 3.0.0-M4**
6316
6317 Bumped surefire plugin to 3.0.0-M4
6318
6319
6320 ---
6321
6322 * [HBASE-20461](https://issues.apache.org/jira/browse/HBASE-20461) | *Major* | **Implement fsync for AsyncFSWAL**
6323
6324 Now AsyncFSWAL also supports Durability.FSYNC\_WAL.
6325
6326
6327 ---
6328
6329 * [HBASE-23066](https://issues.apache.org/jira/browse/HBASE-23066) | *Minor* | **Create a config that forces to cache blocks on compaction**
6330
6331 The configuration 'hbase.rs.cacheblocksonwrite' was used to enable caching the blocks on write. But purposefully we were not caching the blocks when we do compaction (since it may be very aggressive) as the caching happens as and when the writer completes a block.
6332 In cloud environments since they have bigger sized caches - though they try to enable 'hbase.rs.prefetchblocksonopen' (non - aggressive way of caching the blocks proactively on reader creation) it does not help them because it takes time to cache the compacted blocks.
6333 This feature creates a new configuration  'hbase.rs.cachecompactedblocksonwrite' which when set to 'true' will enable the blocks created out of compaction.
6334 Remember that since it is aggressive caching the user should be having enough cache space - if not it may lead to other active blocks getting evicted.
6335 From the shell this can be enabled by using the option per Column Family also by using the below format
6336 {code}
6337 create 't1', 'f1', {NUMREGIONS =\> 15, SPLITALGO =\> 'HexStringSplit', CONFIGURATION =\> {'hbase.rs.cachecompactedblocksonwrite' =\> 'true'}}
6338 {code}
6339
6340
6341 ---
6342
6343 * [HBASE-23239](https://issues.apache.org/jira/browse/HBASE-23239) | *Major* | **Reporting on status of backing MOB files from client-facing cells**
6344
6345 <!-- markdown -->
6346
6347 Users of the MOB feature can now use the `mobrefs` utility to get statistics about data in the MOB system and verify the health of backing files on HDFS.
6348
6349 ```
6350 HADOOP_CLASSPATH=/etc/hbase/conf:$(hbase mapredcp) yarn jar \
6351     /some/path/to/hbase-shaded-mapreduce.jar mobrefs mobrefs-report-output some_table foo
6352 ```
6353
6354 See javadocs of the class `MobRefReporter` for more details.
6355
6356 the reference guide has added some information about MOB internals and troubleshooting.
6357
6358
6359 ---
6360
6361 * [HBASE-23549](https://issues.apache.org/jira/browse/HBASE-23549) | *Minor* | **Document steps to disable MOB for a column family**
6362
6363 The reference guide now includes a walk through of disabling the MOB feature if needed while maintaining availability.
6364
6365
6366 ---
6367
6368 * [HBASE-23582](https://issues.apache.org/jira/browse/HBASE-23582) | *Minor* | **Unbalanced braces in string representation of table descriptor**
6369
6370 Fixed unbalanced braces in string representation within HBase shell
6371
6372
6373 ---
6374
6375 * [HBASE-23293](https://issues.apache.org/jira/browse/HBASE-23293) | *Minor* | **[REPLICATION] make ship edits timeout configurable**
6376
6377 The default rpc timeout for ReplicationSourceShipper#shipEdits is 60s, when bulkload replication enabled, timeout exception may be occurred.
6378 Now we can conf the timeout value through replication.source.shipedits.timeout, and it’s adaptive.
6379
6380
6381 ---
6382
6383 * [HBASE-23312](https://issues.apache.org/jira/browse/HBASE-23312) | *Major* | **HBase Thrift SPNEGO configs (HBASE-19852) should be backwards compatible**
6384
6385 The newer HBase Thrift SPNEGO configs should not be required. The hbase.thrift.spnego.keytab.file and hbase.thrift.spnego.principal configs will fall back to the hbase.thrift.keytab.file and hbase.thrift.kerberos.principal original configs. The older configs will log a deprecation warning. It is preferred to new the newer SPNEGO configurations.
6386
6387
6388 ---
6389
6390 * [HBASE-22969](https://issues.apache.org/jira/browse/HBASE-22969) | *Minor* | **A new binary component comparator(BinaryComponentComparator) to perform comparison of arbitrary length and position**
6391
6392 With BinaryComponentCompartor applications will be able to design diverse and powerful set of filters for rows and columns. See https://issues.apache.org/jira/browse/HBASE-22969 for example. In general, the comparator can be used with any filter taking ByteArrayComparable. As of now, following filters take ByteArrayComparable:
6393
6394 1. RowFilter
6395 2. ValueFilter
6396 3. QualifierFilter
6397 4. FamilyFilter
6398 5. ColumnValueFilter
6399
6400
6401 ---
6402
6403 * [HBASE-23234](https://issues.apache.org/jira/browse/HBASE-23234) | *Major* | **Provide .editorconfig based on checkstyle configuration**
6404
6405 Adds a .editorconfig file with configurations populated by IntelliJ, based on our checkstyle configuration. There's lots of IntelliJ-specific configs in here that I assume are not replicated to Eclipse or Netbeans users. Any devs using those tools should push whatever updates they see fit, but please start with the checkstyle configs as the origin of truth.
6406
6407
6408 ---
6409
6410 * [HBASE-23322](https://issues.apache.org/jira/browse/HBASE-23322) | *Minor* | **[hbck2] Simplification on HBCKSCP scheduling**
6411
6412 An hbck2 scheduleRecoveries will run a subclass of ServerCrashProcedure which asks Master what Regions were on the dead Server but it will also do a hbase:meta table scan to see if any vestiges of the old Server remain (for the case where an SCP failed mid-point leaving references in place or where Master and hbase:meta deviated in accounting).
6413
6414
6415 ---
6416
6417 * [HBASE-23321](https://issues.apache.org/jira/browse/HBASE-23321) | *Minor* | **[hbck2] fixHoles of fixMeta doesn't update in-memory state**
6418
6419 If holes in hbase:meta, hbck2 fixMeta now will update Master in-memory state so you do not need to restart master just so you can assign the new hole-bridging regions.
6420
6421
6422 ---
6423
6424 * [HBASE-23282](https://issues.apache.org/jira/browse/HBASE-23282) | *Major* | **HBCKServerCrashProcedure for 'Unknown Servers'**
6425
6426 hbck2 scheduleRecoveries will now run a SCP that also looks in hbase:meta for any references to the scheduled server -- not just consult Master in-memory state -- just in case vestiges of the server are leftover in hbase:meta
6427
6428
6429 ---
6430
6431 * [HBASE-19450](https://issues.apache.org/jira/browse/HBASE-19450) | *Minor* | **Add log about average execution time for ScheduledChore**
6432
6433 <!-- markdown -->
6434 HBase internal chores now log a moving average of how long execution of each chore takes at `INFO` level for the logger `org.apache.hadoop.hbase.ScheduledChore`.
6435
6436 Such messages will happen at most once per five minutes.
6437
6438
6439 ---
6440
6441 * [HBASE-23250](https://issues.apache.org/jira/browse/HBASE-23250) | *Minor* | **Log message about CleanerChore delegate initialization should be at INFO**
6442
6443 CleanerChore delegate initialization is now logged at INFO level instead of DEBUG
6444
6445
6446 ---
6447
6448 * [HBASE-23243](https://issues.apache.org/jira/browse/HBASE-23243) | *Major* | **[pv2] Filter out SUCCESS procedures; on decent-sized cluster, plethora overwhelms problems**
6449
6450 The 'Procedures & Locks' tab in Master UI only displays problematic Procedures now (RUNNABLE, WAITING-TIMEOUT, etc.). It no longer notes procedures whose state is SUCCESS.
6451
6452
6453 ---
6454
6455 * [HBASE-23227](https://issues.apache.org/jira/browse/HBASE-23227) | *Blocker* | **Upgrade jackson-databind to 2.9.10.1 to avoid recent CVEs**
6456
6457 <!-- markdown -->
6458
6459 the Apache HBase REST Proxy now uses Jackson Databind version 2.9.10.1 to address the following CVEs
6460
6461   - CVE-2019-16942
6462   - CVE-2019-16943
6463
6464 Users of prior releases with Jackson Databind 2.9.10 are advised to either upgrade to this release or to upgrade their local Jackson Databind jar directly.
6465
6466
6467 ---
6468
6469 * [HBASE-23222](https://issues.apache.org/jira/browse/HBASE-23222) | *Critical* | **Better logging and mitigation for MOB compaction failures**
6470
6471 <!-- markdown -->
6472
6473 The MOB compaction process in the HBase Master now logs more about its activity.
6474
6475 In the event that you run into the problems described in HBASE-22075, there is a new HFileCleanerDelegate that will stop all removal of MOB hfiles from the archive area. It can be configured by adding `org.apache.hadoop.hbase.mob.ManualMobMaintHFileCleaner` to the list configured for `hbase.master.hfilecleaner.plugins`. This new cleaner delegate will cause your archive area to grow unbounded; you will have to manually prune files which may be prohibitively complex. Consider if your use case will allow you to mitigate by disabling mob compactions instead.
6476
6477 Caveats:
6478 * Be sure the list of cleaner delegates still includes the default cleaners you will likely need: ttl, snapshot, and hlink.
6479 * Be mindful that if you enable this cleaner delegate then there will be *no* automated process for removing these mob hfiles. You should see a single region per table in `%hbase_root%/archive` that accumulates files over time. You will have to determine which of these files are safe or not to remove.
6480 * You should list this cleaner delegate after the snapshot and hlink delegates so that you can enable sufficient logging to determine when an archived mob hfile is needed by those subsystems. When set to `TRACE` logging, the CleanerChore logger will include archive retention decision justifications.
6481 * If your use case creates a large number of uniquely named tables, this new delegate will cause memory pressure on the master.
6482
6483
6484 ---
6485
6486 * [HBASE-15519](https://issues.apache.org/jira/browse/HBASE-15519) | *Major* | **Add per-user metrics**
6487
6488 Adds per-user metrics for reads/writes to each RegionServer. These metrics are exported by default. hbase.regionserver.user.metrics.enabled can be used to disable the feature if desired for any reason.
6489
6490
6491 ---
6492
6493 * [HBASE-22460](https://issues.apache.org/jira/browse/HBASE-22460) | *Minor* | **Reopen a region if store reader references may have leaked**
6494
6495 Leaked store files can not be removed even after it is invalidated via compaction. A reasonable mitigation for a reader reference leak would be a fast reopen of the region on the same server.
6496
6497 Configs:
6498
6499 1. hbase.master.regions.recovery.check.interval :
6500
6501 Regions Recovery Chore interval in milliseconds. This chore keeps running at this interval to find all regions with configurable max store file ref count and reopens them. Defaults to 20 mins
6502
6503 2. hbase.regions.recovery.store.file.ref.count :
6504
6505 This config represents Store files Ref Count threshold value considered for reopening regions. Any region with store files ref count \> this value would be eligible for reopening by master. Default value -1 indicates this feature is turned off. Only positive integer value should be provided to enable this feature.
6506
6507
6508 ---
6509
6510 * [HBASE-23172](https://issues.apache.org/jira/browse/HBASE-23172) | *Minor* | **HBase Canary region success count metrics reflect column family successes, not region successes**
6511
6512 Added a comment to make clear that read/write success counts are tallying column family success counts, not region success counts.
6513
6514 Additionally, the region read and write latencies previously only stored the latencies of the last column family of the region reads/writes. This has been fixed by using a map of each region to a list of read and write latency values.
6515
6516
6517 ---
6518
6519 * [HBASE-23177](https://issues.apache.org/jira/browse/HBASE-23177) | *Major* | **If fail to open reference because FNFE, make it plain it is a Reference**
6520
6521 Changes the message on the FNFE exception thrown when the file a Reference points to is missing; the message now includes detail on Reference as well as pointed-to file so can connect how FNFE relates to region open.
6522
6523
6524 ---
6525
6526 * [HBASE-20626](https://issues.apache.org/jira/browse/HBASE-20626) | *Major* | **Change the value of "Requests Per Second" on WEBUI**
6527
6528 Use 'totalRowActionRequestCount' to calculate QPS on web UI.
6529
6530
6531 ---
6532
6533 * [HBASE-22874](https://issues.apache.org/jira/browse/HBASE-22874) | *Critical* | **Define a public interface for Canary and move existing implementation to LimitedPrivate**
6534
6535 <!-- markdown -->
6536 Downstream users who wish to programmatically check the health of their HBase cluster may now rely on a public interface derived from the previously private implementation of the canary cli tool. The interface is named `Canary` and can be found in the user facing javadocs.
6537
6538 Downstream users who previously relied on the invoking the canary via the Java classname (either on the command line or programmatically) will need to change how they do so because the non-public implementation has moved.
6539
6540
6541 ---
6542
6543 * [HBASE-23035](https://issues.apache.org/jira/browse/HBASE-23035) | *Major* | **Retain region to the last RegionServer make the failover slower**
6544
6545 Since 2.0.0，when one regionserver crashed and back online again, AssignmentManager will retain the region locations and try assign the regions to this regionserver(same host:port with the crashed one) again. But for 1.x.x, the behavior is round-robin assignment for the regions belong to the crashed regionserver. This jira change the "retain" assignment to round-robin assignment, which is same with 1.x.x version. This change will make the failover faster and improve availability.
6546
6547
6548 ---
6549
6550 * [HBASE-23046](https://issues.apache.org/jira/browse/HBASE-23046) | *Minor* | **Remove compatibility case from truncate command**
6551
6552 Remove backward compatibility from \`truncate\` and \`truncate\_preserve\` shell commands. This means that these commands from HBase Clients are not compatible with pre-0.99 HBase clusters.
6553
6554
6555 ---
6556
6557 * [HBASE-23040](https://issues.apache.org/jira/browse/HBASE-23040) | *Minor* | **region mover gives NullPointerException instead of saying a host isn't in the cluster**
6558
6559 giving the region mover "unload" command a region server name that isn't recognized by the cluster results in a "I don't know about that host" message instead of a NPE.
6560
6561 set log level to DEBUG if you'd like the region mover to log the set of region server names it got back from the cluster.
6562
6563
6564 ---
6565
6566 * [HBASE-21874](https://issues.apache.org/jira/browse/HBASE-21874) | *Major* | **Bucket cache on Persistent memory**
6567
6568 Added a new IOEngine type for Bucket cache ie Persistent memory. In order to use BC over pmem configure IOEngine as
6569 \<property\>
6570     \<name\>hbase.bucketcache.ioengine\</name\>
6571     \<value\> pmem:///path in persistent memory \</value\>
6572   \</property\>
6573
6574
6575 ---
6576
6577 * [HBASE-22760](https://issues.apache.org/jira/browse/HBASE-22760) | *Major* | **Stop/Resume Snapshot Auto-Cleanup activity with shell command**
6578
6579 By default, snapshot auto cleanup based on TTL would be enabled for any new cluster. At any point in time, if snapshot cleanup is supposed to be stopped due to some snapshot restore activity or any other reason, it is advisable to disable it using shell command:
6580 hbase\> snapshot\_cleanup\_switch false
6581
6582 We can re-enable it using:
6583 hbase\> snapshot\_cleanup\_switch true
6584
6585 We can query whether snapshot auto cleanup is enabled for cluster using:
6586 hbase\> snapshot\_cleanup\_enabled
6587
6588
6589 ---
6590
6591 * [HBASE-22796](https://issues.apache.org/jira/browse/HBASE-22796) | *Major* | **[HBCK2] Add fix of overlaps to fixMeta hbck Service**
6592
6593 Adds fix of overlaps to the fixMeta hbck service method. Uses the bulk-merge facility. Merges a max of 10 at a time. Set hbase.master.metafixer.max.merge.count to higher if you want to do more than 10 in the one go.
6594
6595
6596 ---
6597
6598 * [HBASE-21745](https://issues.apache.org/jira/browse/HBASE-21745) | *Critical* | **Make HBCK2 be able to fix issues other than region assignment**
6599
6600 This issue adds via its subtasks:
6601
6602  \* An 'HBCK Report' page to the Master UI added by HBASE-22527+HBASE-22709+HBASE-22723+ (since 2.1.6, 2.2.1, 2.3.0). Lists consistency or anomalies found via new hbase:meta consistency checking extensions added to CatalogJanitor (holes, overlaps, bad servers) and by a new 'HBCK chore' that runs at a lesser periodicity that will note filesystem orphans and overlaps as well as the following conditions:
6603  \*\* Master thought this region opened, but no regionserver reported it.
6604  \*\* Master thought this region opened on Server1, but regionserver reported Server2
6605  \*\* More than one regionservers reported opened this region
6606  Both chores can be triggered from the shell to regenerate ‘new’ reports.
6607  \* Means of scheduling a ServerCrashProcedure (HBASE-21393).
6608  \* An ‘offline’ hbase:meta rebuild (HBASE-22680).
6609  \* Offline replace of hbase.version and hbase.id
6610  \* Documentation on how to use completebulkload tool to ‘adopt’ orphaned data found by new HBCK2 ‘filesystem’ check (see below) and ‘HBCK chore’ (HBASE-22859)
6611  \* A ‘holes’ and ‘overlaps’ fix that runs in the master that uses new bulk-merge facility to collapse many overlaps in the one go.
6612  \* hbase-operator-tools HBCK2 client tool got a bunch of additions:
6613  \*\* A specialized 'fix' for the case where operators ran old hbck 'offlinemeta' repair and destroyed their hbase:meta; it ties together holes in meta with orphaned data in the fs (HBASE-22567)
6614  \*\* A ‘filesystem’ command that reports on orphan data as well as bad references and hlinks with a ‘fix’ for the latter two options (based on hbck1 facility updated).
6615  \*\* Adds back the ‘replication’ fix facility from hbck1 (HBASE-22717)
6616
6617 The compound result is that hbck2 is now in excess of hbck1 abilities. The provided functionality is disaggregated as per the hbck2 philosophy of providing 'plumbing' rather than 'porcelain' so there is work to do still adding fix-it playbooks, scripting across outages, and automation.
6618
6619
6620 ---
6621
6622 * [HBASE-22802](https://issues.apache.org/jira/browse/HBASE-22802) | *Major* | **Avoid temp ByteBuffer allocation in FileIOEngine#read**
6623
6624 HBASE-21879 introduces a utility class (org.apache.hadoop.hbase.io.ByteBuffAllocator) used for allocating/freeing ByteBuffers from/to NIO ByteBuffer pool, when BucketCache enabled with file or mmap engine, we will use this ByteBuffer pool to avoid temp ByteBuffer allocation a lot.
6625
6626
6627 ---
6628
6629 * [HBASE-11062](https://issues.apache.org/jira/browse/HBASE-11062) | *Major* | **hbtop**
6630
6631 Introduces hbtop that's a real-time monitoring tool for HBase like Unix's top command. See the ref guide for the details: https://hbase.apache.org/book.html#hbtop
6632
6633
6634 ---
6635
6636 * [HBASE-21879](https://issues.apache.org/jira/browse/HBASE-21879) | *Major* | **Read HFile's block to ByteBuffer directly instead of to byte for reducing young gc purpose**
6637
6638 Before this issue, read path was 100% offheap when block is in the BucketCache. But if a cache miss, then the RS needs to read the block via an on-heap API which causes high young-GC pressure.
6639
6640 This issue adds reading the block via offheap even if reading the block from filesystem directly.  It requires hadoop version(\>=2.9.3) but can also work with older hadoop versions (all works but we continue to read block onheap). It also requires HBASE-21946 which is not yet in place as of this writing/hbase-2.3.0.
6641
6642 We have written a careful doc about the implementation, performance and practice here: https://docs.google.com/document/d/1xSy9axGxafoH-Qc17zbD2Bd--rWjjI00xTWQZ8ZwI\_E/edit#heading=h.nch5d72p27ex
6643
6644
6645 ---
6646
6647 * [HBASE-22618](https://issues.apache.org/jira/browse/HBASE-22618) | *Major* | **added the possibility to load custom cost functions**
6648
6649 <!-- markdown -->
6650 Extends `StochasticLoadBalancer` to support user-provided cost function. These are loaded in addition to the default set of cost functions. Custom function implementations must extend `StochasticLoadBalancer$CostFunction`. Enable any additional functions by placing them on the master class path and configuring `hbase.master.balancer.stochastic.additionalCostFunctions` with a comma-separated list of fully-qualified class names.
6651
6652
6653 ---
6654
6655 * [HBASE-22867](https://issues.apache.org/jira/browse/HBASE-22867) | *Critical* | **The ForkJoinPool in CleanerChore will spawn thousands of threads in our cluster with thousands table**
6656
6657 Replace the ForkJoinPool in CleanerChore by ThreadPoolExecutor which can limit the spawn thread size and avoid  the master GC frequently.  The replacement is an internal implementation in CleanerChore,  so no config key change, the upstream users can just upgrade the hbase master without any other change.
6658
6659
6660 ---
6661
6662 * [HBASE-22810](https://issues.apache.org/jira/browse/HBASE-22810) | *Major* | **Initialize an separate ThreadPoolExecutor for taking/restoring snapshot**
6663
6664 Introduced a new config key for the snapshot taking/restoring operations at master side:  hbase.master.executor.snapshot.threads, its default value is 3.  means we can have 3 snapshot operations running at the same time.
6665
6666
6667 ---
6668
6669 * [HBASE-22863](https://issues.apache.org/jira/browse/HBASE-22863) | *Major* | **Avoid Jackson versions and dependencies with known CVEs**
6670
6671 1. Stopped exposing vulnerable Jackson1 dependencies so that downstreamers would not pull it in from HBase.
6672 2. However, since Hadoop requires some Jackson1 dependencies, put vulnerable Jackson mapper at test scope in some HBase modules and hence, HBase tarball created by hbase-assembly contains Jackson1 mapper jar in lib. Still, downsteam applications can't pull in Jackson1 from HBase.
6673
6674
6675 ---
6676
6677 * [HBASE-22841](https://issues.apache.org/jira/browse/HBASE-22841) | *Major* | **TimeRange's factory functions do not support ranges, only \`allTime\` and \`at\`**
6678
6679 Add serveral API in TimeRange class for avoiding using the deprecated TimeRange constructor:
6680 \* TimeRange#from: Represents the time interval [minStamp, Long.MAX\_VALUE)
6681 \* TimeRange#until: Represents the time interval [0, maxStamp)
6682 \* TimeRange#between: Represents the time interval [minStamp, maxStamp)
6683
6684
6685 ---
6686
6687 * [HBASE-22833](https://issues.apache.org/jira/browse/HBASE-22833) | *Minor* | **MultiRowRangeFilter should provide a method for creating a filter which is functionally equivalent to multiple prefix filters**
6688
6689 Provide a public method in MultiRowRangeFilter class to speed the requirement of filtering with multiple row prefixes, it will expand the row prefixes as multiple rowkey ranges by MultiRowRangeFilter, it's more efficient.
6690 {code}
6691 public MultiRowRangeFilter(byte[][] rowKeyPrefixes);
6692 {code}
6693
6694
6695 ---
6696
6697 * [HBASE-22856](https://issues.apache.org/jira/browse/HBASE-22856) | *Major* | **HBASE-Find-Flaky-Tests fails with pip error**
6698
6699 Update the base docker image to ubuntu 18.04 for the find flaky tests jenkins job.
6700
6701
6702 ---
6703
6704 * [HBASE-22771](https://issues.apache.org/jira/browse/HBASE-22771) | *Major* | **[HBCK2] fixMeta method and server-side support**
6705
6706 Adds a fixMeta method to hbck Service. Fixes holes in hbase:meta. Follow-up to fix overlaps. See HBASE-22567 also.
6707
6708 Follow-on is adding a client-side to hbase-operator-tools that can exploit this new addition (HBASE-22825)
6709
6710
6711 ---
6712
6713 * [HBASE-22777](https://issues.apache.org/jira/browse/HBASE-22777) | *Major* | **Add a multi-region merge (for fixing overlaps, etc.)**
6714
6715 Changes merge so you can merge more than two regions at a time.  Currently only available inside HBase. HBASE-22827, a follow-on, is about exposing the facility in the Admin API (and then via the shell).
6716
6717
6718 ---
6719
6720 * [HBASE-15666](https://issues.apache.org/jira/browse/HBASE-15666) | *Critical* | **shaded dependencies for hbase-testing-util**
6721
6722 New shaded artifact for testing: hbase-shaded-testing-util.
6723
6724
6725 ---
6726
6727 * [HBASE-22776](https://issues.apache.org/jira/browse/HBASE-22776) | *Major* | **Rename config names in user scan snapshot feature**
6728
6729 After HBASE-22776, the steps to config user scan snapshot feature is as followings:
6730 1. Check HDFS configuration
6731 2. Add master coprocessor:
6732     hbase.coprocessor.master.classes=
6733     “org.apache.hadoop.hbase.security.access.AccessController,
6734 org.apache.hadoop.hbase.security.access.SnapshotScannerHDFSAclController”
6735 3. Enable this feature:
6736     hbase.acl.sync.to.hdfs.enable=true
6737 4. Modify table scheme to enable this feature for a table:
6738     alter 't1', CONFIGURATION =\> {'hbase.acl.sync.to.hdfs.enable' =\> 'true'}
6739
6740
6741 ---
6742
6743 * [HBASE-22539](https://issues.apache.org/jira/browse/HBASE-22539) | *Blocker* | **WAL corruption due to early DBBs re-use when Durability.ASYNC\_WAL is used**
6744
6745 We found a critical bug which can lead to WAL corruption when Durability.ASYNC\_WAL is used. The reason is that we release a ByteBuffer before actually persist the content into WAL file.
6746
6747 The problem maybe lead to several errors, for example, ArrayIndexOfOutBounds when replaying WAL. This is because that the ByteBuffer is reused by others.
6748
6749 ERROR org.apache.hadoop.hbase.executor.EventHandler: Caught throwable while processing event RS\_LOG\_REPLAY
6750 java.lang.ArrayIndexOutOfBoundsException: 18056
6751         at org.apache.hadoop.hbase.KeyValue.getFamilyLength(KeyValue.java:1365)
6752         at org.apache.hadoop.hbase.KeyValue.getFamilyLength(KeyValue.java:1358)
6753         at org.apache.hadoop.hbase.PrivateCellUtil.matchingFamily(PrivateCellUtil.java:735)
6754         at org.apache.hadoop.hbase.CellUtil.matchingFamily(CellUtil.java:816)
6755         at org.apache.hadoop.hbase.wal.WALEdit.isMetaEditFamily(WALEdit.java:143)
6756         at org.apache.hadoop.hbase.wal.WALEdit.isMetaEdit(WALEdit.java:148)
6757         at org.apache.hadoop.hbase.wal.WALSplitter.splitLogFile(WALSplitter.java:297)
6758         at org.apache.hadoop.hbase.wal.WALSplitter.splitLogFile(WALSplitter.java:195)
6759         at org.apache.hadoop.hbase.regionserver.SplitLogWorker$1.exec(SplitLogWorker.java:100)
6760
6761 And may even cause segmentation fault and crash the JVM directly. You will see a hs\_err\_pidXXX.log file and usually the problem is SIGSEGV. This is usually because that the ByteBuffer has already been returned to the OS and used for other purpose.
6762
6763 The problem has been reported several times in the past and this time Wellington Ramos Chevreuil provided the full logs and deeply analyzed the logs so we can find the root cause. And Lijin Bin figured out that the problem may only happen when Durability.ASYNC\_WAL is used. Thanks to them.
6764
6765 The problem only effects the 2.x releases, all users are highly recommand to upgrade to a release which has this fix in, especially that if you use Durability.ASYNC\_WAL.
6766
6767
6768 ---
6769
6770 * [HBASE-22737](https://issues.apache.org/jira/browse/HBASE-22737) | *Major* | **Add a new admin method and shell cmd to trigger the hbck chore to run**
6771
6772 Add a new method runHbckChore in Hbck interface and a new shell cmd hbck\_chore\_run to request HBCK chore to run at master side.
6773
6774
6775 ---
6776
6777 * [HBASE-22741](https://issues.apache.org/jira/browse/HBASE-22741) | *Major* | **Show catalogjanitor consistency complaints in new 'HBCK Report' page**
6778
6779 Adds a "CatalogJanitor hbase:meta Consistency Issues" section to the new 'HBCK Report' page added by HBASE-22709. This section is empty unless the most recent CatalogJanitor scan turned up problems. If so, will show table of issues found.
6780
6781
6782 ---
6783
6784 * [HBASE-22723](https://issues.apache.org/jira/browse/HBASE-22723) | *Major* | **Have CatalogJanitor report holes and overlaps; i.e. problems it sees when doing its regular scan of hbase:meta**
6785
6786 When CatalogJanitor runs, it now checks for holes, overlaps, empty info:regioninfo columns and bad servers. Dumps findings into log. Follow-up adds report to new 'HBCK Report' linked off the Master UI.
6787
6788 NOTE: All features but the badserver check made it into branch-2.1 and branch-2.0 backports.
6789
6790
6791 ---
6792
6793 * [HBASE-22714](https://issues.apache.org/jira/browse/HBASE-22714) | *Trivial* | **BuffferedMutatorParams opertationTimeOut() is misspelt**
6794
6795 The misspelled BufferedMutatorParams.opertationTimeout method has been marked as deprecated, and will be removed in 4.0.0. Please use the BufferedMutatorParams.operationTimeout method instead.
6796
6797
6798 ---
6799
6800 * [HBASE-22580](https://issues.apache.org/jira/browse/HBASE-22580) | *Major* | **Add a table attribute to make user scan snapshot feature configurable for table**
6801
6802 If a table user scan snapshots of the table, please config the following table scheme attribute to make granted users' ACLs are added to hfiles:
6803 alter 't1', CONFIGURATION =\> {'hbase.user.scan.snapshot.enable' =\> 'true'}
6804
6805
6806 ---
6807
6808 * [HBASE-22709](https://issues.apache.org/jira/browse/HBASE-22709) | *Major* | **Add a chore thread in master to do hbck checking and display results in 'HBCK Report' page**
6809
6810 1. Add a new chore thread in master to do hbck checking
6811 2. Add a new web ui "HBCK Report" page to display checking results.
6812
6813 This feature is enabled by default. And the hbck chore run per 60 minutes by default. You can config "hbase.master.hbck.checker.interval" to a value lesser than or equal to 0 for disabling the chore.
6814
6815 Notice: the config "hbase.master.hbck.checker.interval" was renamed to "hbase.master.hbck.chore.interval" in HBASE-22737.
6816
6817
6818 ---
6819
6820 * [HBASE-22578](https://issues.apache.org/jira/browse/HBASE-22578) | *Major* | **HFileCleaner should not delete empty ns/table directories used for user san snapshot feature**
6821
6822 The HFileCleaner will clean the empty directories under archive, but if enable user scan snaphot feature, the user ACLs are set at there directories, so please config the following cleaner to make the directories with user ACLs not be cleaned:
6823 hbase.master.hfilecleaner.plugins=org.apache.hadoop.hbase.security.access.SnapshotScannerHDFSAclCleaner
6824
6825
6826 ---
6827
6828 * [HBASE-22722](https://issues.apache.org/jira/browse/HBASE-22722) | *Blocker* | **Upgrade jackson databind dependencies to 2.9.9.1**
6829
6830 Upgrade jackson databind dependency to 2.9.9.1 due to CVEs
6831
6832 https://nvd.nist.gov/vuln/detail/CVE-2019-12814
6833
6834 https://nvd.nist.gov/vuln/detail/CVE-2019-12384
6835
6836
6837 ---
6838
6839 * [HBASE-22527](https://issues.apache.org/jira/browse/HBASE-22527) | *Major* | **[hbck2] Add a master web ui to show the problematic regions**
6840
6841 Add a new master web UI to show the potentially problematic opened regions. There are three case:
6842 1. Master thought this region opened, but no regionserver reported it.
6843 2. Master thought this region opened on Server1, but regionserver reported Server2
6844 3. More than one regionservers reported opened this region
6845
6846
6847 ---
6848
6849 * [HBASE-22648](https://issues.apache.org/jira/browse/HBASE-22648) | *Minor* | **Snapshot TTL**
6850
6851 Feature: Take a Snapshot With TTL for auto-cleanup
6852
6853 Attribute:
6854 1. TTL
6855      - Specify TTL in sec while creating snapshot. e.g. snapshot 'mytable', 'snapshot1234', {TTL =\> 86400}  (snapshot to be auto-cleaned after 24 hr)
6856
6857 Configs:
6858 1. Default Snapshot TTL:
6859      - FOREVER by default
6860      - User specified Default TTL(sec) with config: hbase.master.snapshot.ttl
6861
6862 2. If Snapshot cleanup is supposed to be stopped due to some snapshot restore activity, disable it with config:
6863      - hbase.master.cleaner.snapshot.disable: "true"
6864     With this config, HMaster needs restart just like any other hbase-site config.
6865
6866
6867 For more details, see the section "Take a Snapshot With TTL" in the HBase Reference Guide.
6868
6869
6870 ---
6871
6872 * [HBASE-22610](https://issues.apache.org/jira/browse/HBASE-22610) | *Trivial* | **[BucketCache] Rename "hbase.offheapcache.minblocksize"**
6873
6874 The config point "hbase.offheapcache.minblocksize" was wrong and is now deprecated. The new config point is "hbase.blockcache.minblocksize".
6875
6876
6877 ---
6878
6879 * [HBASE-22690](https://issues.apache.org/jira/browse/HBASE-22690) | *Major* | **Deprecate / Remove OfflineMetaRepair in hbase-2+**
6880
6881 OfflineMetaRepair is no longer supported in HBase-2+. Please refer to https://hbase.apache.org/book.html#HBCK2
6882
6883 This tool is deprecated in 2.x and will be removed in 3.0.
6884
6885
6886 ---
6887
6888 * [HBASE-22673](https://issues.apache.org/jira/browse/HBASE-22673) | *Major* | **Avoid to expose protobuf stuff in Hbck interface**
6889
6890 Mark the Hbck#scheduleServerCrashProcedure(List\<HBaseProtos.ServerName\> serverNames) as deprecated. Use Hbck#scheduleServerCrashProcedures(List\<ServerName\> serverNames) instead.
6891
6892
6893 ---
6894
6895 * [HBASE-22617](https://issues.apache.org/jira/browse/HBASE-22617) | *Blocker* | **Recovered WAL directories not getting cleaned up**
6896
6897 In HBASE-20734 we moved the recovered.edits onto the wal file system but when constructing the directory we missed the BASE\_NAMESPACE\_DIR('data'). So when using the default config, you will find that there are lots of new directories at the same level with the 'data' directory.
6898
6899 In this issue, we add the BASE\_NAMESPACE\_DIR back, and also try our best to clean up the wrong directories. But we can only clean up the region level directories, so if you want a clean fs layout on HDFS you still need to manually delete the empty directories at the same level with 'data'.
6900
6901 The effect versions are 2.2.0, 2.1.[1-5], 1.4.[8-10], 1.3.[3-5].
6902
6903
6904 ---
6905
6906 * [HBASE-21995](https://issues.apache.org/jira/browse/HBASE-21995) | *Major* | **Add a coprocessor to set HDFS ACL for hbase granted user**
6907
6908 Add a coprocessor to set HDFS acls to make hbase granted users with READ permission have the access to scan snapshots.
6909 To use this feature, please make sure the HDFS config is set:
6910 dfs.namenode.acls.enabled=true
6911 fs.permissions.umask-mode=027
6912
6913 and set the HBase config:
6914 hbase.coprocessor.master.classes="org.apache.hadoop.hbase.security.access.AccessController,org.apache.hadoop.hbase.security.access.SnapshotScannerHDFSAclController"
6915 hbase.user.scan.snapshot.enable=true
6916
6917
6918 ---
6919
6920 * [HBASE-22596](https://issues.apache.org/jira/browse/HBASE-22596) | *Minor* | **[Chore] Separate the execution period between CompactionChecker and PeriodicMemStoreFlusher**
6921
6922 hbase.regionserver.compaction.check.period is used for controlling how often the compaction checker runs. If unset, will use hbase.server.thread.wakefrequency as default value.
6923
6924 hbase.regionserver.flush.check.period is used for controlling how ofter the flush checker runs. If unset, will use hbase.server.thread.wakefrequency as default value.
6925
6926
6927 ---
6928
6929 * [HBASE-22588](https://issues.apache.org/jira/browse/HBASE-22588) | *Major* | **Upgrade jaxws-ri dependency to 2.3.2**
6930
6931 <!-- markdown -->
6932
6933 When run with JDK11 HBase now uses more recent version of the jaxws reference implementation (v2.3.2).
6934
6935
6936 ---
6937
6938 * [HBASE-21536](https://issues.apache.org/jira/browse/HBASE-21536) | *Trivial* | **Fix completebulkload usage instructions**
6939
6940 Added completebulkload short name for BulkLoadHFilesTool to bin/hbase.
6941
6942
6943 ---
6944
6945 * [HBASE-22500](https://issues.apache.org/jira/browse/HBASE-22500) | *Blocker* | **Modify pom and jenkins jobs for hadoop versions**
6946
6947 Change the default hadoop-3 version to 3.1.2. Drop the support for the releases which are effected by CVE-2018-8029, see this email https://lists.apache.org/thread.html/3d6831c3893cd27b6850aea2feff7d536888286d588e703c6ffd2e82@%3Cuser.hadoop.apache.org%3E
6948
6949
6950 ---
6951
6952 * [HBASE-22459](https://issues.apache.org/jira/browse/HBASE-22459) | *Minor* | **Expose store reader reference count**
6953
6954 This change exposes the aggregate count of store reader references for a given store as 'storeRefCount' in region metrics and ClusterStatus.
6955
6956
6957 ---
6958
6959 * [HBASE-22469](https://issues.apache.org/jira/browse/HBASE-22469) | *Minor* | **replace md5 checksum in saveVersion script with sha512 for hbase version information**
6960
6961 The HBase "source checksum" now uses SHA512 instead of MD5.
6962
6963
6964 ---
6965
6966 * [HBASE-22148](https://issues.apache.org/jira/browse/HBASE-22148) | *Blocker* | **Provide an alternative to CellUtil.setTimestamp**
6967
6968 <!-- markdown -->
6969
6970 The `CellUtil.setTimestamp` method changes to be an API with audience `LimitedPrivate(COPROC)` in HBase 3.0. With that designation the API should remain stable within a given minor release line, but may change between minor releases.
6971
6972 Previously, this method was deprecated in HBase 2.0 for removal in HBase 3.0. Deprecation messages in HBase 2.y releases have been updated to indicate the expected API audience change.
6973
6974
6975 ---
6976
6977 * [HBASE-20782](https://issues.apache.org/jira/browse/HBASE-20782) | *Minor* | **Fix duplication of TestServletFilter.access**
6978
6979 The access method was used to the HttpServerFunctionalTest class as a common place.
6980
6981
6982 ---
6983
6984 * [HBASE-21991](https://issues.apache.org/jira/browse/HBASE-21991) | *Major* | **Fix MetaMetrics issues - [Race condition, Faulty remove logic], few improvements**
6985
6986 The class LossyCounting was unintentionally marked Public but was never intended to be part of our public API. This oversight has been corrected and LossyCounting is now marked as Private and going forward may be subject to additional breaking changes or removal without notice. If you have taken a dependency on this class we recommend cloning it locally into your project before upgrading to this release.
6987
6988
6989 ---
6990
6991 * [HBASE-22226](https://issues.apache.org/jira/browse/HBASE-22226) | *Trivial* | **Incorrect level for headings in asciidoc**
6992
6993 Warnings for level headings are corrected in the book for the HBase Incompatibilities section.
6994
6995
6996 ---
6997
6998 * [HBASE-20970](https://issues.apache.org/jira/browse/HBASE-20970) | *Major* | **Update hadoop check versions for hadoop3 in hbase-personality**
6999
7000 Add hadoop 3.0.3, 3.1.1 3.1.2 in our hadoop check jobs.
7001
7002
7003 ---
7004
7005 * [HBASE-21784](https://issues.apache.org/jira/browse/HBASE-21784) | *Major* | **Dump replication queue should show list of wal files ordered chronologically**
7006
7007 The DumpReplicationQueues tool will now list replication queues sorted in chronological order.
7008
7009
7010 ---
7011
7012 * [HBASE-21048](https://issues.apache.org/jira/browse/HBASE-21048) | *Major* | **Get LogLevel is not working from console in secure environment**
7013
7014 Support get\|set LogLevel in secure(kerberized) environment.
7015
7016
7017 ---
7018
7019 * [HBASE-22384](https://issues.apache.org/jira/browse/HBASE-22384) | *Minor* | **Formatting issues in administration section of book**
7020
7021 Fixes a formatting issue in the administration section of the book, where listing indentation were a little bit off.
7022
7023
7024 ---
7025
7026 * [HBASE-22377](https://issues.apache.org/jira/browse/HBASE-22377) | *Major* | **Provide API to check the existence of a namespace which does not require ADMIN permissions**
7027
7028 This change adds the new method listNamespaces to the Admin interface, which can be used to retrieve a list of the namespaces present in the schema as an unprivileged operation. Formerly the only available method for accomplishing this was listNamespaceDescriptors, which requires GLOBAL CREATE or ADMIN permissions.
7029
7030
7031 ---
7032
7033 * [HBASE-22399](https://issues.apache.org/jira/browse/HBASE-22399) | *Major* | **Change default hadoop-two.version to 2.8.x and remove the 2.7.x hadoop checks**
7034
7035 Now the default hadoop-two.version has been changed to 2.8.5, and all hadoop versions before 2.8.2(exclude) will not be supported any more.
7036
7037
7038 ---
7039
7040 * [HBASE-22392](https://issues.apache.org/jira/browse/HBASE-22392) | *Trivial* | **Remove extra/useless +**
7041
7042 Removed extra + in HRegion, HStore and LoadIncrementalHFiles for branch-2 and HRegion and HStore for branch-1.
7043
7044
7045 ---
7046
7047 * [HBASE-20494](https://issues.apache.org/jira/browse/HBASE-20494) | *Major* | **Upgrade com.yammer.metrics dependency**
7048
7049 Updated metrics core from 3.2.1 to 3.2.6.
7050
7051
7052 ---
7053
7054 * [HBASE-22358](https://issues.apache.org/jira/browse/HBASE-22358) | *Minor* | **Change rubocop configuration for method length**
7055
7056 The rubocop definition for the maximum method length was set to 75.
7057
7058
7059 ---
7060
7061 * [HBASE-22379](https://issues.apache.org/jira/browse/HBASE-22379) | *Minor* | **Fix Markdown for "Voting on Release Candidates" in book**
7062
7063 Fixes the formatting of the "Voting on Release Candidates" to actually show the quote and code formatting of the RAT check.
7064
7065
7066 ---
7067
7068 * [HBASE-20851](https://issues.apache.org/jira/browse/HBASE-20851) | *Minor* | **Change rubocop config for max line length of 100**
7069
7070 The rubocop configuration in the hbase-shell module now allows a line length with 100 characters, instead of 80 as before. For everything before 2.1.5 this change introduces rubocop itself.
7071
7072
7073 ---
7074
7075 * [HBASE-22301](https://issues.apache.org/jira/browse/HBASE-22301) | *Minor* | **Consider rolling the WAL if the HDFS write pipeline is slow**
7076
7077 This change adds new conditions for rolling the WAL for when syncs on the HDFS writer pipeline are perceived to be slow.
7078
7079 As before the configuration parameter hbase.regionserver.wal.slowsync.ms sets the slow sync warning threshold.
7080
7081 If we encounter hbase.regionserver.wal.slowsync.roll.threshold number of slow syncs (default 100) within the interval defined by hbase.regionserver.wal.slowsync.roll.interval.ms (default 1 minute), we will request a WAL roll.
7082
7083 Or, if the time for any sync exceeds the threshold set by hbase.regionserver.wal.roll.on.sync.ms (default 10 seconds) we will request a WAL roll immediately.
7084
7085 Operators can monitor how often these new thresholds result in a WAL roll by looking at newly added metrics to the WAL related metric group:
7086 \* slowSyncRollRequest - How many times a roll was requested due to sync too slow on the write pipeline.
7087
7088 Additionally, as a part of this change there are also additional metrics for existing reasons for a WAL roll:
7089 \* errorRollRequest - How many times a roll was requested due to I/O or other errors.
7090 \* sizeRollRequest - How many times a roll was requested due to file size roll threshold.
7091
7092
7093 ---
7094
7095 * [HBASE-21883](https://issues.apache.org/jira/browse/HBASE-21883) | *Minor* | **Enhancements to Major Compaction tool**
7096
7097 MajorCompactorTTL Tool allows to compact all regions in a table that have been TTLed out. This saves space on DFS and is useful for tables which are similar to time series data. This is typically scheduled to run frequently (say via cron) to cleanup old data on an ongoing basis.
7098
7099 RSGroupMajorCompactionTTL tool is similar to MajorCompactorTTL but runs at a region server group level. If multiple tables in an rsgroup are similar to time-series data, then it runs a single command to clean them up. As more tables are added/removed from rsgroup, it's easy to have a single command to take care of all of them.
7100
7101
7102 ---
7103
7104 * [HBASE-22054](https://issues.apache.org/jira/browse/HBASE-22054) | *Minor* | **Space Quota: Compaction is not working for super user in case of NO\_WRITES\_COMPACTIONS**
7105
7106 This change allows the system and superusers to initiate compactions, even when a space quota violation policy disallows compactions from happening. The original intent behind disallowing of compactions was to prevent end-user compactions from creating undue I/O load, not disallowing \*any\* compaction in the system.
7107
7108
7109 ---
7110
7111 * [HBASE-22083](https://issues.apache.org/jira/browse/HBASE-22083) | *Minor* | **move eclipse specific configs into a profile**
7112
7113 <!-- markdown -->
7114 Maven project integration for Eclipse has been isolated into a maven profile to ensure it only is active when in an Eclipse project.
7115
7116 Things should continue to behave the same for Eclipse users. If something should go wrong folks should manually activate the `eclipse-specific` profile.
7117
7118
7119 ---
7120
7121 * [HBASE-22307](https://issues.apache.org/jira/browse/HBASE-22307) | *Major* | **Deprecated Preemptive Fail Fast**
7122
7123 Deprecated Preemptive Fail Fast related constants in HConstants, the support of this feature will be removed in 3.0.0 so use these constants will have no effect for 3.0.0+ releases. And the constants will be kept till 4.0.0.
7124
7125 Users can use 'hbase.client.perserver.requests.threshold' to control the number of concurrent requests to the same region server. Please see the release note of HBASE-16388 for more details.
7126
7127
7128 ---
7129
7130 * [HBASE-22292](https://issues.apache.org/jira/browse/HBASE-22292) | *Blocker* | **PreemptiveFastFailInterceptor clean repeatedFailuresMap issue**
7131
7132 Adds new configuration hbase.client.failure.map.cleanup.interval which defaults to ten minutes.
7133
7134
7135 ---
7136
7137 * [HBASE-19222](https://issues.apache.org/jira/browse/HBASE-19222) | *Major* | **update jruby to 9.1.17.0**
7138
7139 <!-- markdown -->
7140
7141 The default version of JRuby shipped with HBase has been updated to the JRuby 9.1.17.0 release.
7142
7143 For details on changes see [the release notes for JRuby 9.1.17.0](https://www.jruby.org/2018/04/23/jruby-9-1-17-0)
7144
7145
7146 ---
7147
7148 * [HBASE-22279](https://issues.apache.org/jira/browse/HBASE-22279) | *Major* | **Add a getRegionLocator method in Table/AsyncTable interface**
7149
7150 Add below method in Table interface:
7151
7152 RegionLocator getRegionLocator() throws IOException;
7153
7154 Add below methods in AsyncTable interface:
7155
7156 AsyncTableRegionLocator getRegionLocator();
7157 CompletableFuture\<TableDescriptor\> getDescriptor();
7158
7159
7160 ---
7161
7162 * [HBASE-15560](https://issues.apache.org/jira/browse/HBASE-15560) | *Major* | **TinyLFU-based BlockCache**
7163
7164 LruBlockCache uses the Segmented LRU (SLRU) policy to capture frequency and recency of the working set. It achieves concurrency by using an O(n) background thread to prioritize the entries and evict. Accessing an entry is O(1) by a hash table lookup, recording its logical access time, and setting a frequency flag. A write is performed in O(1) time by updating the hash table and triggering an async eviction thread. This provides ideal concurrency and minimizes the latencies by penalizing the thread instead of the caller. However the policy does not age the frequencies and may not be resilient to various workload patterns.
7165
7166 This change introduces a new L1 policy, TinyLfuBlockCache, which records the frequency in a counting sketch, ages periodically by halving the counters, and orders entries by SLRU. An entry is discarded by comparing the frequency of the new arrival to the SLRU's victim, and keeping the one with the highest frequency. This allows the operations to be performed in O(1) time and, though the use of a compact sketch, a much larger history is retained beyond the current working set. In a variety of real world traces the policy had near optimal hit rates.
7167
7168 New configuration variable hfile.block.cache.policy sets the eviction policy for the L1 block cache. The default is "LRU" (LruBlockCache). Set to "TinyLFU" to use TinyLfuBlockCache instead.
7169
7170
7171 ---
7172
7173 * [HBASE-22178](https://issues.apache.org/jira/browse/HBASE-22178) | *Major* | **Introduce a createTableAsync with TableDescriptor method in Admin**
7174
7175 Introduced
7176
7177 Future\<Void\> createTableAsync(TableDescriptor);
7178
7179
7180 ---
7181
7182 * [HBASE-22108](https://issues.apache.org/jira/browse/HBASE-22108) | *Major* | **Avoid passing null in Admin methods**
7183
7184 Introduced these methods:
7185 void move(byte[]);
7186 void move(byte[], ServerName);
7187 Future\<Void\> splitRegionAsync(byte[]);
7188
7189 These methods are deprecated:
7190 void move(byte[], byte[])
7191
7192
7193 ---
7194
7195 * [HBASE-22152](https://issues.apache.org/jira/browse/HBASE-22152) | *Major* | **Create a jenkins file for yetus to processing GitHub PR**
7196
7197 Add a new jenkins file for running pre commit check for GitHub PR.
7198
7199
7200 ---
7201
7202 * [HBASE-22007](https://issues.apache.org/jira/browse/HBASE-22007) | *Major* | **Add restoreSnapshot and cloneSnapshot with acl methods in AsyncAdmin**
7203
7204 Add cloneSnapshot/restoreSnapshot with acl methods in AsyncAdmin.
7205
7206
7207 ---
7208
7209 * [HBASE-22123](https://issues.apache.org/jira/browse/HBASE-22123) | *Minor* | **REST gateway reports Insufficient permissions exceptions as 404 Not Found**
7210
7211 When insufficient permissions, you now get:
7212
7213 HTTP/1.1 403 Forbidden
7214
7215 on the HTTP side, and in the message
7216
7217 Forbidden
7218 org.apache.hadoop.hbase.security.AccessDeniedException: org.apache.hadoop.hbase.security.AccessDeniedException: Insufficient permissions for user ‘myuser',action: get, tableName:mytable, family:cf.
7219 at org.apache.ranger.authorization.hbase.RangerAuthorizationCoprocessor.authorizeAccess(RangerAuthorizationCoprocessor.java:547)
7220 and the rest of the ADE stack
7221
7222
7223 ---
7224
7225 * [HBASE-22100](https://issues.apache.org/jira/browse/HBASE-22100) | *Minor* | **False positive for error prone warnings in pre commit job**
7226
7227 Now we will sort the javac WARNING/ERROR before generating diff in pre-commit so we can get a stable output for the error prone. The downside is that we just sort the output lexicographically so the line number will also be sorted lexicographically, which is a bit strange to human.
7228
7229
7230 ---
7231
7232 * [HBASE-22057](https://issues.apache.org/jira/browse/HBASE-22057) | *Major* | **Impose upper-bound on size of ZK ops sent in a single multi()**
7233
7234 Exposes a new configuration property "zookeeper.multi.max.size" which dictates the maximum size of deletes that HBase will make to ZooKeeper in a single RPC. This property defaults to 1MB, which should fall beneath the default ZooKeeper limit of 2MB, controlled by "jute.maxbuffer".
7235
7236
7237 ---
7238
7239 * [HBASE-22052](https://issues.apache.org/jira/browse/HBASE-22052) | *Major* | **pom cleaning; filter out jersey-core in hadoop2 to match hadoop3 and remove redunant version specifications**
7240
7241 <!-- markdown -->
7242 Fixed awkward dependency issue that prevented site building.
7243
7244 #### note specific to HBase 2.1.4
7245 HBase 2.1.4 shipped with an early version of this fix that incorrectly altered the libraries included in our binary assembly for using Apache Hadoop 2.7 (the current build default Hadoop version for 2.1.z). For folks running out of the box against a Hadoop 2.7 cluster (or folks who skip the installation step of [replacing the bundled Hadoop libraries](http://hbase.apache.org/book.html#hadoop)) this will result in a failure at Region Server startup due to a missing class definition. e.g.:
7246 ```
7247 2019-03-27 09:02:05,779 ERROR [main] regionserver.HRegionServer: Failed construction RegionServer
7248 java.lang.NoClassDefFoundError: org/apache/htrace/SamplerBuilder
7249         at org.apache.hadoop.hdfs.DFSClient.<init>(DFSClient.java:644)
7250         at org.apache.hadoop.hdfs.DFSClient.<init>(DFSClient.java:628)
7251         at org.apache.hadoop.hdfs.DistributedFileSystem.initialize(DistributedFileSystem.java:149)
7252         at org.apache.hadoop.fs.FileSystem.createFileSystem(FileSystem.java:2667)
7253         at org.apache.hadoop.fs.FileSystem.access$200(FileSystem.java:93)
7254         at org.apache.hadoop.fs.FileSystem$Cache.getInternal(FileSystem.java:2701)
7255         at org.apache.hadoop.fs.FileSystem$Cache.get(FileSystem.java:2683)
7256         at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:372)
7257         at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:171)
7258         at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:356)
7259         at org.apache.hadoop.fs.Path.getFileSystem(Path.java:295)
7260         at org.apache.hadoop.hbase.util.CommonFSUtils.getRootDir(CommonFSUtils.java:362)
7261         at org.apache.hadoop.hbase.util.CommonFSUtils.isValidWALRootDir(CommonFSUtils.java:411)
7262         at org.apache.hadoop.hbase.util.CommonFSUtils.getWALRootDir(CommonFSUtils.java:387)
7263         at org.apache.hadoop.hbase.regionserver.HRegionServer.initializeFileSystem(HRegionServer.java:704)
7264         at org.apache.hadoop.hbase.regionserver.HRegionServer.<init>(HRegionServer.java:613)
7265         at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
7266         at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:62)
7267         at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)
7268         at java.lang.reflect.Constructor.newInstance(Constructor.java:423)
7269         at org.apache.hadoop.hbase.regionserver.HRegionServer.constructRegionServer(HRegionServer.java:3029)
7270         at org.apache.hadoop.hbase.regionserver.HRegionServerCommandLine.start(HRegionServerCommandLine.java:63)
7271         at org.apache.hadoop.hbase.regionserver.HRegionServerCommandLine.run(HRegionServerCommandLine.java:87)
7272         at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
7273         at org.apache.hadoop.hbase.util.ServerCommandLine.doMain(ServerCommandLine.java:149)
7274         at org.apache.hadoop.hbase.regionserver.HRegionServer.main(HRegionServer.java:3047)
7275 Caused by: java.lang.ClassNotFoundException: org.apache.htrace.SamplerBuilder
7276         at java.net.URLClassLoader.findClass(URLClassLoader.java:381)
7277         at java.lang.ClassLoader.loadClass(ClassLoader.java:424)
7278         at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:349)
7279         at java.lang.ClassLoader.loadClass(ClassLoader.java:357)
7280         ... 26 more
7281
7282 ```
7283
7284 Workaround via any _one_ of the following:
7285 * If you are running against a Hadoop cluster that is 2.8+, ensure you replace the Hadoop libaries in the default binary assembly with those for your version.
7286 * If you are running against a Hadoop cluster that is 2.8+, build the binary assembly from the source release while specifying your Hadoop version.
7287 * If you are running against a Hadoop cluster that is a supported 2.7 release, ensure the `hadoop` executable is in the `PATH` seen at Region Server startup and that you are not using the `HBASE_DISABLE_HADOOP_CLASSPATH_LOOKUP` bypass.
7288 * For any supported Hadoop version, manually make the Apache HTrace artifact `htrace-core-3.1.0-incubating.jar` available to all Region Servers via the HBASE_CLASSPATH environment variable.
7289 * For any supported Hadoop version, manually make the Apache HTrace artifact `htrace-core-3.1.0-incubating.jar` available to all Region Servers by copying it into the directory `${HBASE_HOME}/lib/client-facing-thirdparty/`.
7290
7291
7292 ---
7293
7294 * [HBASE-22065](https://issues.apache.org/jira/browse/HBASE-22065) | *Major* | **Add listTableDescriptors(List\<TableName\>) method in AsyncAdmin**
7295
7296 Add a listTableDescriptors(List\<TableName\>) method in the AsyncAdmin interface, to align with the Admin interface.
7297
7298
7299 ---
7300
7301 * [HBASE-22063](https://issues.apache.org/jira/browse/HBASE-22063) | *Major* | **Deprecated Admin.deleteSnapshot(byte[])**
7302
7303 Deprecate Admin.deleteSnapshot(byte[]), please use the String version instead.
7304
7305
7306 ---
7307
7308 * [HBASE-22040](https://issues.apache.org/jira/browse/HBASE-22040) | *Major* | **Add mergeRegionsAsync with a List of region names method in AsyncAdmin**
7309
7310 Add a mergeRegionsAsync(byte[][], boolean) method in the AsyncAdmin interface.
7311
7312 Instead of using assert, now we will throw IllegalArgumentException when you want to merge less than 2 regions at client side. And also, at master side, instead of using assert, now we will throw DoNotRetryIOException if you want merge more than 2 regions, since we only support merging two regions at once for now.
7313
7314
7315 ---
7316
7317 * [HBASE-22039](https://issues.apache.org/jira/browse/HBASE-22039) | *Major* | **Should add the synchronous parameter for the XXXSwitch method in AsyncAdmin**
7318
7319 Add drainXXX parameter for balancerSwitch/splitSwitch/mergeSwitch methods in the AsyncAdmin interface, which has the same meaning with the synchronous parameter for these methods in the Admin interface.
7320
7321
7322 ---
7323
7324 * [HBASE-22044](https://issues.apache.org/jira/browse/HBASE-22044) | *Major* | **ByteBufferUtils should not be IA.Public API**
7325
7326 <!-- markdown -->
7327
7328 As of HBase 3.0, the ByteBufferUtils class is now marked as a Private API for internal project use only. Downstream users are advised that it no longer has any compatibility promises across releases.
7329
7330 As of earlier HBase release lines the class is now marked as deprecated to call attention to this planned transition.
7331
7332
7333 ---
7334
7335 * [HBASE-21810](https://issues.apache.org/jira/browse/HBASE-21810) | *Major* | **bulkload  support set hfile compression on client**
7336
7337 bulkload (HFileOutputFormat2)  support config the compression on client ,you can set the job configuration "hbase.mapreduce.hfileoutputformat.compression"  override the auto-detection of the target table's compression
7338
7339
7340 ---
7341
7342 * [HBASE-22001](https://issues.apache.org/jira/browse/HBASE-22001) | *Major* | **Polish the Admin interface**
7343
7344 Add a cloneSnapshotAsync method with restoreAcl parameter.
7345 Deprecated restoreSnapshotAsync method as it just ignores the failsafe configuration.
7346 Make snapshotAsync method returns a Future\<Void\>.
7347 Deprecated the snapshot related methods which take a 'byte[]' as the snapshot name.
7348 Use default methods to reduce the code base for implementation classes.
7349
7350
7351 ---
7352
7353 * [HBASE-22000](https://issues.apache.org/jira/browse/HBASE-22000) | *Major* | **Deprecated isTableAvailable with splitKeys**
7354
7355 Deprecated AsyncTable.isTableAvailable(TableName, byte[][]).
7356
7357
7358 ---
7359
7360 * [HBASE-21871](https://issues.apache.org/jira/browse/HBASE-21871) | *Major* | **Support to specify a peer table name in VerifyReplication tool**
7361
7362 After HBASE-21871, we can specify a peer table name with --peerTableName in VerifyReplication tool like the following:
7363 hbase org.apache.hadoop.hbase.mapreduce.replication.VerifyReplication --peerTableName=peerTable 5 TestTable
7364
7365 In addition, we can compare any 2 tables in any remote clusters with specifying both peerId and --peerTableName.
7366
7367 For example:
7368 hbase org.apache.hadoop.hbase.mapreduce.replication.VerifyReplication --peerTableName=peerTable zk1,zk2,zk3:2181/hbase TestTable
7369
7370
7371 ---
7372
7373 * [HBASE-15728](https://issues.apache.org/jira/browse/HBASE-15728) | *Major* | **Add remaining per-table region / store / flush / compaction related metrics**
7374
7375 Adds below flush, split, and compaction metrics
7376
7377  +  // split related metrics
7378  +  private MutableFastCounter splitRequest;
7379  +  private MutableFastCounter splitSuccess;
7380  +  private MetricHistogram splitTimeHisto;
7381  +
7382  +  // flush related metrics
7383  +  private MetricHistogram flushTimeHisto;
7384  +  private MetricHistogram flushMemstoreSizeHisto;
7385  +  private MetricHistogram flushOutputSizeHisto;
7386  +  private MutableFastCounter flushedMemstoreBytes;
7387  +  private MutableFastCounter flushedOutputBytes;
7388  +
7389  +  // compaction related metrics
7390  +  private MetricHistogram compactionTimeHisto;
7391  +  private MetricHistogram compactionInputFileCountHisto;
7392  +  private MetricHistogram compactionInputSizeHisto;
7393  +  private MetricHistogram compactionOutputFileCountHisto;
7394  +  private MetricHistogram compactionOutputSizeHisto;
7395  +  private MutableFastCounter compactedInputBytes;
7396  +  private MutableFastCounter compactedOutputBytes;
7397  +
7398  +  private MetricHistogram majorCompactionTimeHisto;
7399  +  private MetricHistogram majorCompactionInputFileCountHisto;
7400  +  private MetricHistogram majorCompactionInputSizeHisto;
7401  +  private MetricHistogram majorCompactionOutputFileCountHisto;
7402  +  private MetricHistogram majorCompactionOutputSizeHisto;
7403  +  private MutableFastCounter majorCompactedInputBytes;
7404  +  private MutableFastCounter majorCompactedOutputBytes;
7405
7406
7407 ---
7408
7409 * [HBASE-21481](https://issues.apache.org/jira/browse/HBASE-21481) | *Major* | **[acl] Superuser's permissions should not be granted or revoked by any non-su global admin**
7410
7411 HBASE-21481 improves the quality of access control, by strengthening the protection of super users's privileges.
7412
7413
7414 ---
7415
7416 * [HBASE-21082](https://issues.apache.org/jira/browse/HBASE-21082) | *Critical* | **Reimplement assign/unassign related procedure metrics**
7417
7418 Now we have four types of RIT procedure metrics, assign, unassign, move, reopen. The meaning of assign/unassign is changed, as we will not increase the unassign metric and then the assign metric when moving a region.
7419 Also introduced two new procedure metrics, open and close, which are used to track the open/close region calls to region server. We may send open/close multiple times to finish a RIT since we may retry multiple times.
7420
7421
7422 ---
7423
7424 * [HBASE-20724](https://issues.apache.org/jira/browse/HBASE-20724) | *Critical* | **Sometimes some compacted storefiles are still opened after region failover**
7425
7426 Problem: This is an old problem since HBASE-2231. The compaction event marker was only writed to WAL. But after flush, the WAL may be archived, which means an useful compaction event marker be deleted, too. So the compacted store files cannot be archived when region open and replay WAL.
7427
7428 Solution: After this jira, the compaction event tracker will be writed to HFile. When region open and load store files, read the compaction evnet tracker from HFile and archive the compacted store files which still exist.
7429
7430
7431 ---
7432
7433 * [HBASE-21820](https://issues.apache.org/jira/browse/HBASE-21820) | *Major* | **Implement CLUSTER quota scope**
7434
7435 HBase contains two quota scopes: MACHINE and CLUSTER. Before this patch, set quota operations did not expose scope option to client api and use MACHINE as default, CLUSTER scope can not be set and used.
7436 Shell commands are as follows:
7437 set\_quota, TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec'
7438
7439 This issue implements CLUSTER scope in a simple way: For user, namespace, user over namespace quota, use [ClusterLimit / RSNum] as machine limit. For table and user over table quota, use [ClusterLimit / TotalTableRegionNum \* MachineTableRegionNum] as machine limit.
7440 After this patch, user can set CLUSTER scope quota, but MACHINE is still default if user ignore scope.
7441 Shell commands are as follows:
7442 set\_quota, TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec'
7443 set\_quota, TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec', SCOPE =\> MACHINE
7444 set\_quota, TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec', SCOPE =\> CLUSTER
7445
7446
7447 ---
7448
7449 * [HBASE-21057](https://issues.apache.org/jira/browse/HBASE-21057) | *Minor* | **upgrade to latest spotbugs**
7450
7451 Change spotbugs version to 3.1.11.
7452
7453
7454 ---
7455
7456 * [HBASE-21505](https://issues.apache.org/jira/browse/HBASE-21505) | *Major* | **Several inconsistencies on information reported for Replication Sources by hbase shell status 'replication' command.**
7457
7458 This modifies "status 'replication'" output, fixing inconsistencies on the reporting times and ages of last shipped edits, as well as wrong calculation of replication lags.
7459
7460 It also introduces additional info for each recovery queue, which was not accounted by this command before.
7461
7462 The new output for "status 'replication'" command is explained in details below:
7463 a) Source started, target stopped, no edits arrived on source yet:
7464 ...
7465  SOURCE: PeerID=1
7466          Normal Queue: 1
7467            No Ops shipped since last restart, SizeOfLogQueue=1, No edits for this source since it started, Replication Lag=0
7468 ...
7469 b) Source started, target stopped, add edit on source:
7470 ...
7471 Normal Queue: 1
7472            No Ops shipped since last restart, SizeOfLogQueue=1, TimeStampOfLastArrivedInSource=Wed Nov 21 07:21:00 GMT 2018, Replication Lag=2459
7473 ...
7474 c) Source started, target stopped, edit added on source, restart source:
7475 ...
7476 SOURCE: PeerID=1
7477          Normal Queue: 1
7478            No Ops shipped since last restart, SizeOfLogQueue=1, No edits for this source since it started, Replication Lag=0
7479          Recovered Queue: 1-hbase01.home,16020,1542784524057
7480            No Ops shipped since last restart, SizeOfLogQueue=1, TimeStampOfLastArrivedInSource=Wed Nov 21 07:23:00 GMT 2018, Replication Lag=201495
7481 ...
7482 d) Source started, target stopped, add edit on source, restart source, add another edit on source:
7483 ...
7484 SOURCE: PeerID=1
7485          Normal Queue: 1
7486            No Ops shipped since last restart, SizeOfLogQueue=1, TimeStampOfLastArrivedInSource=Wed Nov 21 07:02:28 GMT 2018, Replication Lag=6349
7487          Recovered Queue: 1-hbase01.home,16020,1542782758742
7488            No Ops shipped since last restart, SizeOfLogQueue=0, TimeStampOfLastArrivedInSource=Wed Nov 21 06:53:05 GMT 2018, Replication Lag=569394
7489 ...
7490 e) Source started, target stopped, add edit on source, restart source, add another edit on source, start target:
7491 ...
7492        SOURCE: PeerID=1
7493          Normal Queue: 1
7494            AgeOfLastShippedOp=30000, TimeStampOfLastShippedOp=Wed Nov 21 07:07:58 GMT 2018, SizeOfLogQueue=1, TimeStampOfLastArrivedInSource=Wed Nov 21 07:02:28 GMT 2018, Replication Lag=0
7495 ...
7496 f) Source started, target stopped, add edit on source, restart source, restart target:
7497 ...
7498 SOURCE: PeerID=1
7499          Normal Queue: 1
7500            No Ops shipped since last restart, SizeOfLogQueue=1, No edits for this source since it started, Replication Lag=0
7501 ...
7502
7503
7504 ---
7505
7506 * [HBASE-21922](https://issues.apache.org/jira/browse/HBASE-21922) | *Major* | **BloomContext#sanityCheck may failed when use ROWPREFIX\_DELIMITED bloom filter**
7507
7508 Remove bloom filter type ROWPREFIX\_DELIMITED. May add it back when find a better solution.
7509
7510
7511 ---
7512
7513 * [HBASE-21783](https://issues.apache.org/jira/browse/HBASE-21783) | *Major* | **Support exceed user/table/ns throttle quota if region server has available quota**
7514
7515 Support enable or disable exceed throttle quota. Exceed throttle quota means, user can over consume user/namespace/table quota if region server has additional available quota because other users don't consume at the same time.
7516 Use the following shell commands to enable/disable exceed throttle quota: enable\_exceed\_throttle\_quota
7517 disable\_exceed\_throttle\_quota
7518 There are two limits when enable exceed throttle quota:
7519 1. Must set at least one read and one write region server throttle quota;
7520 2. All region server throttle quotas must be in seconds time unit. Because once previous requests exceed their quota and consume region server quota, quota in other time units may be refilled in a long time, this may affect later requests.
7521
7522
7523 ---
7524
7525 * [HBASE-20587](https://issues.apache.org/jira/browse/HBASE-20587) | *Major* | **Replace Jackson with shaded thirdparty gson**
7526
7527 Remove jackson dependencies from most hbase modules except hbase-rest, use shaded gson instead. The output json will be a bit different since jackson can use getter/setter, but gson will always use the fields.
7528
7529
7530 ---
7531
7532 * [HBASE-21928](https://issues.apache.org/jira/browse/HBASE-21928) | *Major* | **Deprecated HConstants.META\_QOS**
7533
7534 Mark HConstants.META\_QOS as deprecated. It is for internal use only, which is the highest priority. You should not try to set a priority greater than or equal to this value, although it is no harm but also useless.
7535
7536
7537 ---
7538
7539 * [HBASE-17942](https://issues.apache.org/jira/browse/HBASE-17942) | *Major* | **Disable region splits and merges per table**
7540
7541 This patch adds the ability to disable split and/or merge for a table (By default, split and merge are enabled for a table).
7542
7543
7544 ---
7545
7546 * [HBASE-21636](https://issues.apache.org/jira/browse/HBASE-21636) | *Major* | **Enhance the shell scan command to support missing scanner specifications like ReadType, IsolationLevel etc.**
7547
7548 Allows shell to set Scan options previously not exposed. See additions as part of the scan help by typing following hbase shell:
7549
7550 hbase\> help 'scan'
7551
7552
7553 ---
7554
7555 * [HBASE-21201](https://issues.apache.org/jira/browse/HBASE-21201) | *Major* | **Support to run VerifyReplication MR tool without peerid**
7556
7557 We can specify peerQuorumAddress instead of peerId in VerifyReplication tool. So it no longer requires peerId to be setup when using this tool.
7558
7559 For example:
7560 hbase org.apache.hadoop.hbase.mapreduce.replication.VerifyReplication zk1,zk2,zk3:2181/hbase testTable
7561
7562
7563 ---
7564
7565 * [HBASE-21838](https://issues.apache.org/jira/browse/HBASE-21838) | *Major* | **Create a special ReplicationEndpoint just for verifying the WAL entries are fine**
7566
7567 Introduce a VerifyWALEntriesReplicationEndpoint which replicates nothing but only verifies if all the cells are valid.
7568 It can be used to capture bugs for writing WAL, as most times we will not read the WALs again after writing it if there are no region server crashes.
7569
7570
7571 ---
7572
7573 * [HBASE-21764](https://issues.apache.org/jira/browse/HBASE-21764) | *Major* | **Size of in-memory compaction thread pool should be configurable**
7574
7575 Introduced an new config key in this issue: hbase.regionserver.inmemory.compaction.pool.size. the default value would be 10.  you can configure this to set the pool size of in-memory compaction pool. Note that all memstores in one region server will share the same pool, so if you have many regions in one region server,  you need to set this larger to compact faster for better read performance.
7576
7577
7578 ---
7579
7580 * [HBASE-21684](https://issues.apache.org/jira/browse/HBASE-21684) | *Major* | **Throw DNRIOE when connection or rpc client is closed**
7581
7582 Make StoppedRpcClientException extend DoNotRetryIOException.
7583
7584
7585 ---
7586
7587 * [HBASE-21739](https://issues.apache.org/jira/browse/HBASE-21739) | *Major* | **Move grant/revoke from regionserver to master**
7588
7589 To implement user permission control in Precedure V2, move grant and revoke method from AccessController to master firstly.
7590 Mark AccessController#grant and AccessController#revoke as deprecated and please use Admin#grant and Admin#revoke instead.
7591
7592
7593 ---
7594
7595 * [HBASE-21791](https://issues.apache.org/jira/browse/HBASE-21791) | *Blocker* | **Upgrade thrift dependency to 0.12.0**
7596
7597 IMPORTANT: Due to security issues, all users who use hbase thrift should avoid using releases which do not have this fix.
7598
7599 The effect releases are:
7600 2.1.x: 2.1.2 and below
7601 2.0.x: 2.0.4 and below
7602 1.x: 1.4.x and below
7603
7604 If you are using the effect releases above, please consider upgrading to a newer release ASAP.
7605
7606
7607 ---
7608
7609 * [HBASE-20894](https://issues.apache.org/jira/browse/HBASE-20894) | *Major* | **Move BucketCache from java serialization to protobuf**
7610
7611 For users who have configured hbase.bucketcache.ioengine with either the file:, files:, or mmap: prefix, and configured it to be persistent via the hbase.bucketcache.persistent.path property, the serialization format of the bucket cache has changed between versions. The old state will not be read during startup, and there is currently no migration path. The impact is expected to be minimal, however, since the cache will rebuild over time as access patterns dictate.
7612
7613
7614
7615 # HBASE  2.3.0 Release Notes
7616
7617 These release notes cover new developer and user-facing incompatibilities, important issues, features, and major improvements.
7618
7619
7620 ---
7621
7622 * [HBASE-24545](https://issues.apache.org/jira/browse/HBASE-24545) | *Major* | **Add backoff to SCP check on WAL split completion**
7623
7624 Adds backoff in ServerCrashProcedure wait on WAL split to complete if large backlog of files to split (Its possible to avoid SCP blocking, waiting on WALs to split if you use procedure-based splitting --  set 'hbase.split.wal.zk.coordinated' to false to enable procedure based wal splitting.)
7625
7626
7627 ---
7628
7629 * [HBASE-24524](https://issues.apache.org/jira/browse/HBASE-24524) | *Minor* | **SyncTable logging improvements**
7630
7631 Notice this has changed log level for mismatching row keys, originally those were being logged at INFO level, now it's logged at DEBUG level. This is consistent with the logging of mismatching cells. Also, for missing row keys, it now logs row key values in human readable format, making it more meaningful for operators troubleshooting mismatches.
7632
7633
7634 ---
7635
7636 * [HBASE-24359](https://issues.apache.org/jira/browse/HBASE-24359) | *Major* | **Optionally ignore edits for deleted CFs for replication.**
7637
7638 Introduce a new config hbase.replication.drop.on.deleted.columnfamily, default is false. When config to true, the replication will drop the edits for columnfamily that has been deleted from the replication source and target.
7639
7640
7641 ---
7642
7643 * [HBASE-24418](https://issues.apache.org/jira/browse/HBASE-24418) | *Major* | **Consolidate Normalizer implementations**
7644
7645 <!-- markdown -->
7646 This change extends the Normalizer with a handful of new configurations. The configuration points supported are:
7647 * `hbase.normalizer.split.enabled` Whether to split a region as part of normalization. Default: `true`.
7648 * `hbase.normalizer.merge.enabled` Whether to merge a region as part of normalization. Default `true`.
7649 * `hbase.normalizer.min.region.count` The minimum number of regions in a table to consider it for merge normalization. Default: 3.
7650 * `hbase.normalizer.merge.min_region_age.days` The minimum age for a region to be considered for a merge, in days. Default: 3.
7651 * `hbase.normalizer.merge.min_region_size.mb` The minimum size for a region to be considered for a merge, in whole MBs. Default: 1.
7652
7653
7654 ---
7655
7656 * [HBASE-24309](https://issues.apache.org/jira/browse/HBASE-24309) | *Major* | **Avoid introducing log4j and slf4j-log4j dependencies for modules other than hbase-assembly**
7657
7658 Add a hbase-logging module, put the log4j related code in this module only so other modules do not need to depend on log4j at compile scope. See the comments of Log4jUtils and InternalLog4jUtils for more details.
7659
7660 Add a log4j.properties to the test jar of hbase-logging module, so for other sub modules we just need to depend on the test jar of hbase-logging module at test scope to output the log to console, without placing a log4j.properties in the test resources as they all (almost) have the same content. And this test module will not be included in the assembly tarball so it will not mess up the binary distribution.
7661
7662 Ban direct commons-logging dependency, and ban commons-logging and log4j imports in non-test code, to avoid mess up the downstream users logging framework. In hbase-logging module we do need to use log4j classes and the trick is to use full class name.
7663
7664 Add jcl-over-slf4j and jul-to-slf4j dependencies, as some of our dependencies use jcl or jul as logging framework, we should also redirect their log message to slf4j.
7665
7666
7667 ---
7668
7669 * [HBASE-21406](https://issues.apache.org/jira/browse/HBASE-21406) | *Minor* | **"status 'replication'" should not show SINK if the cluster does not act as sink**
7670
7671 Added new metric to differentiate sink startup time from last OP applied time.
7672
7673 Original behaviour was to always set startup time to TimestampsOfLastAppliedOp, and always show it on "status 'replication'" command, regardless if the sink ever applied any OP.
7674
7675 This was confusing, specially for scenarios where cluster was just acting as source, the output could lead to wrong interpretations about sink not applying edits or replication being stuck.
7676
7677 With the new metric, we now compare the two metrics values, assuming that if both are the same, there's never been any OP shipped to the given sink, so output would reflect it more clearly, to something as for example:
7678
7679 SINK: TimeStampStarted=Thu Dec 06 23:59:47 GMT 2018, Waiting for OPs...
7680
7681
7682 ---
7683
7684 * [HBASE-24132](https://issues.apache.org/jira/browse/HBASE-24132) | *Major* | **Upgrade to Apache ZooKeeper 3.5.7**
7685
7686 <!-- markdown -->
7687 HBase ships ZooKeeper 3.5.x. Was the EOL'd 3.4.x. 3.5.x client can talk to 3.4.x ensemble.
7688
7689 The ZooKeeper project has built a [FAQ](https://cwiki.apache.org/confluence/display/ZOOKEEPER/Upgrade+FAQ) that documents known issues and work-arounds when upgrading existing deployments.
7690
7691
7692 ---
7693
7694 * [HBASE-22287](https://issues.apache.org/jira/browse/HBASE-22287) | *Major* | **inifinite retries on failed server in RSProcedureDispatcher**
7695
7696 Add backoff. Avoid retrying every 100ms.
7697
7698
7699 ---
7700
7701 * [HBASE-24425](https://issues.apache.org/jira/browse/HBASE-24425) | *Major* | **Run hbck\_chore\_run and catalogjanitor\_run on draw of 'HBCK Report' page**
7702
7703 Runs 'catalogjanitor\_run' and 'hbck\_chore\_run' inline with the loading of the 'HBCK Report' page.
7704
7705 Pass '?cache=true' to skip inline invocation of 'catalogjanitor\_run' and 'hbck\_chore\_run' drawing the page.
7706
7707
7708 ---
7709
7710 * [HBASE-24408](https://issues.apache.org/jira/browse/HBASE-24408) | *Blocker* | **Introduce a general 'local region' to store data on master**
7711
7712 Introduced a general 'local region' at master side to store the procedure data, etc.
7713
7714 The hfile of this region will be stored on the root fs while the wal will be stored on the wal fs. This issue supercedes part of the code for HBASE-23326, as now we store the data in 'MasterData' directory instead of 'MasterProcs'.
7715
7716 The old hfiles will be moved to the global hfile archived directory with the suffix $-masterlocalhfile-$. The wal files will be moved to the global old wal directory with the suffix $masterlocalwal$. The TimeToLiveMasterLocalStoreHFileCleaner and TimeToLiveMasterLocalStoreWALCleaner are configured by default for cleaning the old hfiles and wal files, and the default TTLs are both 7 days.
7717
7718
7719 ---
7720
7721 * [HBASE-24115](https://issues.apache.org/jira/browse/HBASE-24115) | *Major* | **Relocate test-only REST "client" from src/ to test/ and mark Private**
7722
7723 Relocate test-only REST RemoteHTable and RemoteAdmin from src/ to test/. And mark them as InterfaceAudience.Private.
7724
7725
7726 ---
7727
7728 * [HBASE-23938](https://issues.apache.org/jira/browse/HBASE-23938) | *Major* | **Replicate slow/large RPC calls to HDFS**
7729
7730 Config key: hbase.regionserver.slowlog.systable.enabled
7731 Default value: false
7732
7733 This config can be enabled if hbase.regionserver.slowlog.buffer.enabled is already enabled. While hbase.regionserver.slowlog.buffer.enabled ensures that any slow/large RPC logs with complete details are written to ring buffer available at each RegionServer, hbase.regionserver.slowlog.systable.enabled would ensure that all such logs are also persisted in new system table hbase:slowlog.
7734 Operator can scan hbase:slowlog with filters to retrieve specific attribute matching records and this table would be useful to capture historical performance of slowness of RPC calls with detailed analysis.
7735
7736 hbase:slowlog consists of single ColumnFamily info. info consists of multiple qualifiers similar to the attributes available to query as part of Admin API: get\_slowlog\_responses.
7737
7738 One example of a row from hbase:slowlog scan result (Attached a sample screenshot in the Jira) :
7739
7740  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:call\_details, timestamp=2020-05-16T14:59:58.764Z, value=Scan(org.apache.hadoop.hbase.shaded.protobuf.generated.ClientProtos$ScanRequest)
7741  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:client\_address, timestamp=2020-05-16T14:59:58.764Z, value=172.20.10.2:57348
7742  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:method\_name, timestamp=2020-05-16T14:59:58.764Z, value=Scan
7743  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:param, timestamp=2020-05-16T14:59:58.764Z, value=region { type: REGION\_NAME value: "cluster\_test,cccccccc,1589635796466.aa45e1571d533f5ed0bb31cdccaaf9cf." } scan { a
7744                                                              ttribute { name: "\_isolationlevel\_" value: "\\x5C000" } start\_row: "cccccccc" time\_range { from: 0 to: 9223372036854775807 } max\_versions: 1 cache\_blocks: true max\_result\_size: 2
7745                                                              097152 caching: 2147483647 include\_stop\_row: false } number\_of\_rows: 2147483647 close\_scanner: false client\_handles\_partials: true client\_handles\_heartbeats: true track\_scan\_met
7746                                                              rics: false
7747  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:processing\_time, timestamp=2020-05-16T14:59:58.764Z, value=24
7748  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:queue\_time, timestamp=2020-05-16T14:59:58.764Z, value=0
7749  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:region\_name, timestamp=2020-05-16T14:59:58.764Z, value=cluster\_test,cccccccc,1589635796466.aa45e1571d533f5ed0bb31cdccaaf9cf.
7750  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:response\_size, timestamp=2020-05-16T14:59:58.764Z, value=211227
7751  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:server\_class, timestamp=2020-05-16T14:59:58.764Z, value=HRegionServer
7752  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:start\_time, timestamp=2020-05-16T14:59:58.764Z, value=1589640743932
7753  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:type, timestamp=2020-05-16T14:59:58.764Z, value=ALL
7754  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:username, timestamp=2020-05-16T14:59:58.764Z, value=vjasani
7755
7756
7757 ---
7758
7759 * [HBASE-24271](https://issues.apache.org/jira/browse/HBASE-24271) | *Major* | **Set values in \`conf/hbase-site.xml\` that enable running on \`LocalFileSystem\` out of the box**
7760
7761 <!-- markdown -->
7762 HBASE-24271 makes changes the the default `conf/hbase-site.xml` such that `bin/hbase` will run directly out of the binary tarball or a compiled source tree without any configuration modifications vs. Hadoop 2.8+. This changes our long-standing history of shipping no configured values in `conf/hbase-site.xml`, so existing processes that assume this file is empty of configuration properties may require attention.
7763
7764
7765 ---
7766
7767 * [HBASE-24310](https://issues.apache.org/jira/browse/HBASE-24310) | *Major* | **Use Slf4jRequestLog for hbase-http**
7768
7769 Use Slf4jRequestLog instead of the log4j HttpRequestLogAppender in HttpServer.
7770
7771 The request log is disabled by default in conf/log4j.properties by the following lines:
7772
7773 # Disable request log by default, you can enable this by changing the appender
7774 log4j.category.http.requests=INFO,NullAppender
7775 log4j.additivity.http.requests=false
7776
7777 Change the 'NullAppender' to what ever you want if you want to enable request log.
7778
7779 Notice that, the logger name for master status http server is 'http.requests.master', and for region server it is 'http.requests.regionserver'
7780
7781
7782 ---
7783
7784 * [HBASE-24335](https://issues.apache.org/jira/browse/HBASE-24335) | *Major* | **Support deleteall with ts but without column in shell mode**
7785
7786 Use a empty string to represent no column specified for deleteall in shell mode.
7787 useage:
7788 deleteall 'test','r1','',12345
7789 deleteall 'test', {ROWPREFIXFILTER =\> 'prefix'}, '', 12345
7790
7791
7792 ---
7793
7794 * [HBASE-24304](https://issues.apache.org/jira/browse/HBASE-24304) | *Major* | **Separate a hbase-asyncfs module**
7795
7796 Added a new hbase-asyncfs module to hold the asynchronous dfs output stream implementation for implementing WAL.
7797
7798
7799 ---
7800
7801 * [HBASE-22710](https://issues.apache.org/jira/browse/HBASE-22710) | *Major* | **Wrong result in one case of scan that use  raw and versions and filter together**
7802
7803 Make the logic of the versions chosen more reasonable for raw scan, to avoid lose result when using filter.
7804
7805
7806 ---
7807
7808 * [HBASE-24285](https://issues.apache.org/jira/browse/HBASE-24285) | *Major* | **Move to hbase-thirdparty-3.3.0**
7809
7810 Moved to hbase-thirdparty 3.3.0.
7811
7812
7813 ---
7814
7815 * [HBASE-24252](https://issues.apache.org/jira/browse/HBASE-24252) | *Major* | **Implement proxyuser/doAs mechanism for hbase-http**
7816
7817 This feature enables the HBase Web UI's to accept a 'proxyuser' via the HTTP Request's query string. When the parameter \`hbase.security.authentication.spnego.kerberos.proxyuser.enable\` is set to \`true\` in hbase-site.xml (default is \`false\`), the HBase UI will attempt to impersonate the user specified by the query parameter "doAs". This query parameter is checked case-insensitively. When this option is not provided, the user who executed the request is the "real" user and there is no ability to execute impersonation against the WebUI.
7818
7819 For example, if the user "bob" with Kerberos credentials executes a request against the WebUI with this feature enabled and a query string which includes \`doAs=alice\`, the HBase UI will treat this request as executed as \`alice\`, not \`bob\`.
7820
7821 The standard Hadoop proxyuser configuration properties to limit users who may impersonate others apply to this change (e.g. to enable \`bob\` to impersonate \`alice\`). See the Hadoop documentation for more information on how to configure these proxyuser rules.
7822
7823
7824 ---
7825
7826 * [HBASE-24143](https://issues.apache.org/jira/browse/HBASE-24143) | *Major* | **[JDK11] Switch default garbage collector from CMS**
7827
7828 <!-- markdown -->
7829 `bin/hbase` will now dynamically select a Garbage Collector implementation based on the detected JVM version. JDKs 8,9,10 use `-XX:+UseConcMarkSweepGC`, while JDK11+ use `-XX:+UseG1GC`.
7830
7831 Notice a slight compatibility change. Previously, the garbage collector choice would always be appended to a user-provided value for `HBASE_OPTS`. As of this change, this setting will only be applied when `HBASE_OPTS` is unset. That means that operators who provide a value for this variable will now need to also specify the collector. This is especially important for those on JDK8, where the vm default GC is not the recommended ConcMarkSweep.
7832
7833
7834 ---
7835
7836 * [HBASE-24024](https://issues.apache.org/jira/browse/HBASE-24024) | *Major* | **Optionally reject multi() requests with very high no of rows**
7837
7838 New Config: hbase.rpc.rows.size.threshold.reject
7839 -----------------------------------------------------------------------
7840
7841 Default value: false
7842 Description:
7843 If value is true, RegionServer will abort batch requests of Put/Delete with number of rows in a batch operation exceeding threshold defined by value of config: hbase.rpc.rows.warning.threshold.
7844
7845
7846 ---
7847
7848 * [HBASE-24139](https://issues.apache.org/jira/browse/HBASE-24139) | *Critical* | **Balancer should avoid leaving idle region servers**
7849
7850 StochasticLoadBalancer functional improvement:
7851
7852 StochasticLoadBalancer would rebalance the cluster if there are any idle RegionServers in the cluster (RegionServer having no region), while other RegionServers have at least 1 region available.
7853
7854
7855 ---
7856
7857 * [HBASE-24196](https://issues.apache.org/jira/browse/HBASE-24196) | *Major* | **[Shell] Add rename rsgroup command in hbase shell**
7858
7859 user or admin can now use
7860 hbase shell \> rename\_rsgroup 'oldname', 'newname'
7861 to rename rsgroup.
7862
7863
7864 ---
7865
7866 * [HBASE-24218](https://issues.apache.org/jira/browse/HBASE-24218) | *Major* | **Add hadoop 3.2.x in hadoop check**
7867
7868 Add hadoop-3.2.0 and hadoop-3.2.1 in hadoop check and when '--quick-hadoopcheck' we will only check hadoop-3.2.1.
7869
7870 Notice that, for aligning the personality scripts across all the active branches, we will commit the patch to all active branches, but the hadoop-3.2.x support in hadoopcheck is only applied to branch-2.2+.
7871
7872
7873 ---
7874
7875 * [HBASE-23829](https://issues.apache.org/jira/browse/HBASE-23829) | *Major* | **Get \`-PrunSmallTests\` passing on JDK11**
7876
7877 \`-PrunSmallTests\` now pass on JDK11 when using \`-Phadoop.profile=3.0\`.
7878
7879
7880 ---
7881
7882 * [HBASE-24185](https://issues.apache.org/jira/browse/HBASE-24185) | *Major* | **Junit tests do not behave well with System.exit or Runtime.halt or JVM exits in general.**
7883
7884 Tests that fail because a process -- RegionServer or Master -- called System.exit, will now instead throw an exception.
7885
7886
7887 ---
7888
7889 * [HBASE-24072](https://issues.apache.org/jira/browse/HBASE-24072) | *Major* | **Nightlies reporting OutOfMemoryError: unable to create new native thread**
7890
7891 Hadoop hosts have had their ulimit -u raised from 10000 to 30000 (per user, by INFRA). The Docker build container has had its limit raised from 10000 to 12500.
7892
7893
7894 ---
7895
7896 * [HBASE-24112](https://issues.apache.org/jira/browse/HBASE-24112) | *Major* | **[RSGroup] Support renaming rsgroup**
7897
7898 Support RSGroup renaming in core codebase. New API Admin#renameRSGroup(String, String) is introduced in 3.0.0.
7899
7900
7901 ---
7902
7903 * [HBASE-23994](https://issues.apache.org/jira/browse/HBASE-23994) | *Trivial* | ** Add WebUI to Canary**
7904
7905 <!-- markdown -->
7906 The Canary tool now offers a WebUI when run in `region` mode (the default mode). It is enabled by default, and by default, it binds to `0.0.0.0:16050`. This can be overridden by setting `hbase.canary.info.bindAddress` and `hbase.canary.info.port`. To disable entirely, set the port to `-1`.
7907
7908
7909 ---
7910
7911 * [HBASE-23779](https://issues.apache.org/jira/browse/HBASE-23779) | *Major* | **Up the default fork count to make builds complete faster; make count relative to CPU count**
7912
7913 Pass --threads=2 building on jenkins. It shortens nightly build times by about ~25%.
7914
7915 It works by running module build/test in parallel when dependencies allow. Upping the forkcount beyond the pom default of 0.25C would have us broach our CPU budget on jenkins when two modules are running in parallel (2 modules at 0.25% of CPU each makes 0.5C and on jenkins, hadoop nodes run two jenkins executors per host).  Higher forkcounts also seems to threaten build stability.
7916
7917 For running tests locally, to go faster, up fork count.
7918
7919 $ x="0.5C"  ;  mvn --threads=2  -Dsurefire.firstPartForkCount=$x -Dsurefire.secondPartForkCount=$x test -PrunAllTests
7920
7921 You could up the x from 0.5C to 1.0C but YMMV (On overcommitted hardware, tests start bombing out pretty soon after startup). You could try upping thread count but on occasion are likely to overcommit hardware.
7922
7923
7924 ---
7925
7926 * [HBASE-24126](https://issues.apache.org/jira/browse/HBASE-24126) | *Major* | **Up the container nproc uplimit from 10000 to 12500**
7927
7928 Start docker with upped ulimit for nproc passing '--ulimit nproc=12500'. It was 10000, the default, but made it 12500. Then, set PROC\_LIMIT in hbase-personality so when yetus runs, it is w/ the new 12500 value.
7929
7930
7931 ---
7932
7933 * [HBASE-24150](https://issues.apache.org/jira/browse/HBASE-24150) | *Major* | **Allow module tests run in parallel**
7934
7935 Pass -T2 to mvn. Makes it so we do two modules-at-a-time dependencies willing. Helps speed build and testing. Doubles the resource usage when running modules in parallel.
7936
7937
7938 ---
7939
7940 * [HBASE-24121](https://issues.apache.org/jira/browse/HBASE-24121) | *Major* | **[Authorization] ServiceAuthorizationManager isn't dynamically updatable. And it should be.**
7941
7942 Master & RegionService now support refresh policy authorization defined in hbase-policy.xml without restarting service. To refresh policy, please execute hbase shell command: update\_config or update\_config\_all after policy file updated and synced on all nodes.
7943
7944
7945 ---
7946
7947 * [HBASE-24099](https://issues.apache.org/jira/browse/HBASE-24099) | *Major* | **Use a fair ReentrantReadWriteLock for the region close lock**
7948
7949 This change modifies the default acquisition policy for the region's close lock in order to prevent observed starvation of close requests. The new boolean configuration parameter 'hbase.regionserver.fair.region.close.lock' controls the lock acquisition policy: if true, the lock is created in fair mode (default); if false, the lock is created in nonfair mode (the old default).
7950
7951
7952 ---
7953
7954 * [HBASE-23153](https://issues.apache.org/jira/browse/HBASE-23153) | *Major* | **PrimaryRegionCountSkewCostFunction SLB function should implement CostFunction#isNeeded**
7955
7956 <!-- markdown -->
7957 The `PrimaryRegionCountSkewCostFunction` for the `StochasticLoadBalancer` is only needed when the read replicas feature is enabled. With this change, that function now properly indicates that it is not needed when the read replica feature is off.
7958
7959 If this improvement is not available, operators with clusters that are not using the read replica feature should manually disable it by setting `hbase.master.balancer.stochastic.primaryRegionCountCost` to `0.0` in hbase-site.xml for all HBase Masters.
7960
7961
7962 ---
7963
7964 * [HBASE-24055](https://issues.apache.org/jira/browse/HBASE-24055) | *Major* | **Make AsyncFSWAL can run on EC cluster**
7965
7966 Now AsyncFSWAL can also be used against the directory which has EC enabled. Need to make sure you also make use of the hadoop 3.x client as the option is only available in hadoop 3.x.
7967
7968
7969 ---
7970
7971 * [HBASE-24113](https://issues.apache.org/jira/browse/HBASE-24113) | *Major* | **Upgrade the maven we use from 3.5.4 to 3.6.3 in nightlies**
7972
7973 Branches-2.3+ use maven 3.5.3 building. Older branches use 3.5.4 still.
7974
7975
7976 ---
7977
7978 * [HBASE-24122](https://issues.apache.org/jira/browse/HBASE-24122) | *Major* | **Change machine ulimit-l to ulimit-a so dumps full ulimit rather than just 'max locked memory'**
7979
7980 Our 'Build Artifacts' have a machine directory under which we emit vitals on the host the build was run on. We used to emit the result of 'ulimit -l' as a file named 'ulimit-l'. This has been hijacked to instead emit result of running 'ulimit -a' which includes stat on ulimit -l.
7981
7982
7983 ---
7984
7985 * [HBASE-23678](https://issues.apache.org/jira/browse/HBASE-23678) | *Major* | **Literate builder API for version management in schema**
7986
7987 ColumnFamilyDescriptor new builder API:
7988
7989     /\*\*
7990      \* Retain all versions for a given TTL(retentionInterval), and then only a specific number
7991      \* of versions(versionAfterInterval) after that interval elapses.
7992      \*
7993      \* @param retentionInterval Retain all versions for this interval
7994      \* @param versionAfterInterval Retain no of versions to retain after retentionInterval
7995      \*/
7996     public ModifyableColumnFamilyDescriptor setVersionsWithTimeToLive(
7997         final int retentionInterval, final int versionAfterInterval)
7998
7999
8000 ---
8001
8002 * [HBASE-24050](https://issues.apache.org/jira/browse/HBASE-24050) | *Major* | **Deprecated PBType on all 2.x branches**
8003
8004 org.apache.hadoop.hbase.types.PBType is marked as deprecated without any replacement. It will be moved to hbase-example module and marked as IA.Private in 3.0.0. This is a mistake as it should not be part of our public API. Users who depend on this class should just copy the code your own code base.
8005
8006
8007 ---
8008
8009 * [HBASE-8868](https://issues.apache.org/jira/browse/HBASE-8868) | *Minor* | **add metric to report client shortcircuit reads**
8010
8011 Expose file system level read metrics for RegionServer.
8012
8013 If the HBase RS runs on top of HDFS, calculate the aggregation of
8014 ReadStatistics of each HdfsFileInputStream. These metrics include:
8015 (1) total number of bytes read from HDFS.
8016 (2) total number of bytes read from local DataNode.
8017 (3) total number of bytes read locally through short-circuit read.
8018 (4) total number of bytes read locally through zero-copy read.
8019
8020 Because HDFS ReadStatistics is calculated per input stream, it is not
8021 feasible to update the aggregated number in real time. Instead, the
8022 metrics are updated when an input stream is closed.
8023
8024
8025 ---
8026
8027 * [HBASE-24032](https://issues.apache.org/jira/browse/HBASE-24032) | *Major* | **[RSGroup] Assign created tables to respective rsgroup automatically instead of manual operations**
8028
8029 Admin can determine which tables go to which rsgroup by script  (setting hbase.rsgroup.table.mapping.script with local filystem path) on Master side which aims to lighten the burden of admin operations.  Note, since HBase 3+, rsgroup can be specified in TableDescriptor as well, if clients specify this, master will skip the determination from script.
8030
8031 Here is a simple example of script:
8032 {code}
8033 # Input consists of two string, 1st is the namespace of the table, 2nd is the table name of the table
8034 #!/bin/bash
8035 namespace=$1
8036 tablename=$2
8037 if [[ $namespace == test ]]; then
8038   echo test
8039 elif [[ $tablename == \*foo\* ]]; then
8040   echo other
8041 else
8042   echo default
8043 fi
8044 {code}
8045
8046
8047 ---
8048
8049 * [HBASE-23993](https://issues.apache.org/jira/browse/HBASE-23993) | *Major* | **Use loopback for zk standalone server in minizkcluster**
8050
8051 MiniZKCluster now puts up its standalone node listening on loopback/127.0.0.1 rather than "localhost".
8052
8053
8054 ---
8055
8056 * [HBASE-23986](https://issues.apache.org/jira/browse/HBASE-23986) | *Major* | **Bump hadoop-two.version to 2.10.0 on master and branch-2**
8057
8058 Bumped hadoop-two.version to 2.10.0, which means we will drop the support for hadoop-2.8.x and hadoop-2.9.x.
8059
8060
8061 ---
8062
8063 * [HBASE-23930](https://issues.apache.org/jira/browse/HBASE-23930) | *Minor* | **Shell should attempt to format \`timestamp\` attributes as ISO-8601**
8064
8065 Change timestamp display to be ISO8601 when toString on Cell and outputting in shell....
8066
8067 User used to see....
8068
8069   column=table:state, timestamp=1583967620343 .....
8070
8071 ... but now sees:
8072
8073   column=table:state, timestamp=2020-03-11T23:00:20.343Z ....
8074
8075
8076 ---
8077
8078 * [HBASE-22827](https://issues.apache.org/jira/browse/HBASE-22827) | *Major* | **Expose multi-region merge in shell and Admin API**
8079
8080 merge\_region shell command can now be used to merge more than 2 regions as well. It takes a list of regions as comma separated values or as an array of regions, and not just 2 regions. The full regionnames and encoded regionnames are continued to be accepted.
8081
8082
8083 ---
8084
8085 * [HBASE-23767](https://issues.apache.org/jira/browse/HBASE-23767) | *Major* | **Add JDK11 compilation and unit test support to Github precommit**
8086
8087 Rebuild our Dockerfile with support for multiple JDK versions. Use multiple stages in the Jenkinsfile instead of yetus's multijdk because of YETUS-953. Run those multiple stages in parallel to speed up results.
8088
8089 Note that multiple stages means multiple Yetus invocations means multiple comments on the PreCommit. This should become more obvious to users once we can make use of GitHub Checks API, HBASE-23902.
8090
8091
8092 ---
8093
8094 * [HBASE-22978](https://issues.apache.org/jira/browse/HBASE-22978) | *Minor* | **Online slow response log**
8095
8096 get\_slowlog\_responses and clear\_slowlog\_responses are used to retrieve and clear slow RPC logs from RingBuffer maintained by RegionServers.
8097
8098 New Admin APIs:
8099 1.   List\<SlowLogRecord\> getSlowLogResponses(final Set\<ServerName\> serverNames,
8100       final SlowLogQueryFilter slowLogQueryFilter) throws IOException;
8101
8102 2.   List\<Boolean\> clearSlowLogResponses(final Set\<ServerName\> serverNames)
8103       throws IOException;
8104
8105 Configs:
8106
8107 1. hbase.regionserver.slowlog.ringbuffer.size:
8108 Default size of ringbuffer to be maintained by each RegionServer in order to store online slowlog responses. This is an in-memory ring buffer of requests that were judged to be too slow in addition to the responseTooSlow logging. The in-memory representation would be complete. For more details, please look into Doc Section: Get Slow Response Log from shell
8109
8110 Default
8111 256
8112
8113 2. hbase.regionserver.slowlog.buffer.enabled:
8114 Indicates whether RegionServers have ring buffer running for storing Online Slow logs in FIFO manner with limited entries. The size of the ring buffer is indicated by config: hbase.regionserver.slowlog.ringbuffer.size The default value is false, turn this on and get latest slowlog responses with complete data.
8115
8116 Default
8117 false
8118
8119
8120 For more details, please look into "Get Slow Response Log from shell" section from HBase book.
8121
8122
8123 ---
8124
8125 * [HBASE-23926](https://issues.apache.org/jira/browse/HBASE-23926) | *Major* | **[Flakey Tests] Down the flakies re-run ferocity; it makes for too many fails.**
8126
8127 Down the flakey re-rerun fork count from 1.0C -- i.e. a fork per CPU -- to 0.25C. On a recent run, the machine had 16 cores. 0.25 is 4 cores. We'd hardcoded fork count at 3 previous to changes made by parent.
8128
8129
8130 ---
8131
8132 * [HBASE-23146](https://issues.apache.org/jira/browse/HBASE-23146) | *Major* | **Support CheckAndMutate with multiple conditions**
8133
8134 Add a checkAndMutate(row, filter) method in the AsyncTable interface and the Table interface.
8135
8136 This method atomically checks if the row matches the specified filter. If it does, it adds the Put/Delete/RowMutations.
8137
8138 This is a fluent style API, the code is like:
8139
8140 For Table interface:
8141 {code}
8142 table.checkAndMutate(row, filter).thenPut(put);
8143 {code}
8144
8145 For AsyncTable interface:
8146 {code}
8147 table.checkAndMutate(row, filter).thenPut(put)
8148     .thenAccept(succ -\> {
8149       if (succ) {
8150         System.out.println("Check and put succeeded");
8151       } else {
8152         System.out.println("Check and put failed");
8153       }
8154     });
8155 {code}
8156
8157
8158 ---
8159
8160 * [HBASE-23874](https://issues.apache.org/jira/browse/HBASE-23874) | *Minor* | **Move Jira-attached file precommit definition from script in Jenkins config to dev-support**
8161
8162 The Jira Precommit job (https://builds.apache.org/job/PreCommit-HBASE-Build/) will now look for a file within the source tree (dev-support/jenkins\_precommit\_jira\_yetus.sh) instead of depending on a script section embedded in the job.
8163
8164
8165 ---
8166
8167 * [HBASE-23865](https://issues.apache.org/jira/browse/HBASE-23865) | *Major* | **Up flakey history from 5 to 10**
8168
8169 Changed flakey list reporting to show 5 rather than 10 items. Also changed the second and first part fort counts to be 1C rather than hardcoded 3.
8170
8171
8172 ---
8173
8174 * [HBASE-23554](https://issues.apache.org/jira/browse/HBASE-23554) | *Major* | **Encoded regionname to regionname utility**
8175
8176     Adds shell command regioninfo:
8177
8178       hbase(main):001:0\>  regioninfo '0e6aa5c19ae2b2627649dc7708ce27d0'
8179       {ENCODED =\> 0e6aa5c19ae2b2627649dc7708ce27d0, NAME =\> 'TestTable,,1575941375972.0e6aa5c19ae2b2627649dc7708ce27d0.', STARTKEY =\> '', ENDKEY =\> '00000000000000000000299441'}
8180       Took 0.4737 seconds
8181
8182
8183 ---
8184
8185 * [HBASE-23350](https://issues.apache.org/jira/browse/HBASE-23350) | *Major* | **Make compaction files cacheonWrite configurable based on threshold**
8186
8187 This JIRA adds a new configuration - \`hbase.rs.cachecompactedblocksonwrite.threshold\`. This configuration is the maximum total size (in bytes) of the compacted files below which the configuration \`hbase.rs.cachecompactedblocksonwrite\` is honoured. If the total size of the compacted fies exceeds this threshold, even when \`hbase.rs.cachecompactedblocksonwrite\` is enabled, the data blocks are not cached. Caching index and bloom blocks is not affected by this configuration (user configuration is always honoured).
8188
8189 Default value of this configuration is Long.MAX\_VALUE. This means whatever the total size of the compacted files, it wil be cached.
8190
8191
8192 ---
8193
8194 * [HBASE-17115](https://issues.apache.org/jira/browse/HBASE-17115) | *Major* | **HMaster/HRegion Info Server does not honour admin.acl**
8195
8196 Implements authorization for the HBase Web UI by limiting access to certain endpoints which could be used to extract sensitive information from HBase.
8197
8198 Access to these restricted endpoints can be limited to a group of administrators, identified either by a list of users (hbase.security.authentication.spnego.admin.users) or by a list of groups
8199 (hbase.security.authentication.spnego.admin.groups).  By default, neither of these values are set which will preserve backwards compatibility (allowing all authenticated users to access all endpoints).
8200
8201 Further, users who have sensitive information in the HBase service configuration can set hbase.security.authentication.ui.config.protected to true which will treat the configuration endpoint as a protected, admin-only resource. By default, all authenticated users may access the configuration endpoint.
8202
8203
8204 ---
8205
8206 * [HBASE-23647](https://issues.apache.org/jira/browse/HBASE-23647) | *Major* | **Make MasterRegistry the default registry impl**
8207
8208 <!-- markdown -->
8209 Enables master based registry as the default registry used by clients to fetch connection metadata.
8210 Refer to the section "Master Registry" in the client documentation for more details and advantages
8211 of this implementation over the default Zookeeper based registry.
8212
8213 Configuration parameter that controls the registry in use: `hbase.client.registry.impl`
8214
8215 Where to set this: HBase client configuration (hbase-site.xml)
8216
8217 Possible values:
8218 - `org.apache.hadoop.hbase.client.ZKConnectionRegistry` (For ZK based registry implementation)
8219 - `org.apache.hadoop.hbase.client.MasterRegistry` (New, for master based registry implementation)
8220
8221 Notes on defaults:
8222
8223 - For v3.0.0 and later, MasterRegistry is the default registry
8224 - For all releases in 2.x line, ZK based registry is the default.
8225
8226 This feature has been back ported to 2.3.0 and later releases. MasterRegistry can be enabled by setting the following client configuration.
8227
8228 ```
8229 <property>
8230   <name>hbase.client.registry.impl</name>
8231   <value>org.apache.hadoop.hbase.client.MasterRegistry</value>
8232 </property>
8233 ```
8234
8235
8236 ---
8237
8238 * [HBASE-23069](https://issues.apache.org/jira/browse/HBASE-23069) | *Critical* | **periodic dependency bump for Sep 2019**
8239
8240 caffeine: 2.6.2 =\> 2.8.1
8241 commons-codec: 1.10 =\> 1.13
8242 commons-io: 2.5 =\> 2.6
8243 disrupter: 3.3.6 =\> 3.4.2
8244 httpcore: 4.4.6 =\> 4.4.13
8245 jackson: 2.9.10 =\> 2.10.1
8246 jackson.databind: 2.9.10.1 =\> 2.10.1
8247 jetty: 9.3.27.v20190418 =\> 9.3.28.v20191105
8248 protobuf.plugin: 0.5.0 =\> 0.6.1
8249 zookeeper: 3.4.10 =\> 3.4.14
8250 slf4j: 1.7.25 =\> 1.7.30
8251 rat: 0.12 =\> 0.13
8252 asciidoctor: 1.5.5 =\> 1.5.8
8253 asciidoctor.pdf: 1.5.0-alpha.15 =\> 1.5.0-rc.2
8254 error-prone: 2.3.3 =\> 2.3.4
8255
8256
8257 ---
8258
8259 * [HBASE-23686](https://issues.apache.org/jira/browse/HBASE-23686) | *Major* | **Revert binary incompatible change and remove reflection**
8260
8261 - Reverts a binary incompatible binary change for ByteRangeUtils
8262 - Usage of reflection inside CommonFSUtils removed
8263
8264
8265 ---
8266
8267 * [HBASE-23055](https://issues.apache.org/jira/browse/HBASE-23055) | *Major* | **Alter hbase:meta**
8268
8269 Adds being able to edit hbase:meta table schema. For example,
8270
8271 hbase(main):006:0\> alter 'hbase:meta', {NAME =\> 'info', DATA\_BLOCK\_ENCODING =\> 'ROW\_INDEX\_V1'}
8272 Updating all regions with the new schema...
8273 All regions updated.
8274 Done.
8275 Took 1.2138 seconds
8276
8277 You can even add columnfamilies. Howevert, you cannot delete any of the core hbase:meta column families such as 'info' and 'table'.
8278
8279
8280 ---
8281
8282 * [HBASE-23347](https://issues.apache.org/jira/browse/HBASE-23347) | *Major* | **Pluggable RPC authentication**
8283
8284 This change introduces an internal abstraction layer which allows for new SASL-based authentication mechanisms to be used inside HBase services. All existing SASL-based authentication mechanism were ported to the new abstraction, making no external change in runtime semantics, client API, or RPC serialization format.
8285
8286 Developers familiar with extending HBase can implement authentication mechanism beyond simple Kerberos and DelegationTokens which authenticate HBase users against some other user database. HBase service authentication (Master to/from RegionServer) continue to operate solely over Kerberos.
8287
8288
8289 ---
8290
8291 * [HBASE-23156](https://issues.apache.org/jira/browse/HBASE-23156) | *Major* | **start-hbase.sh failed with ClassNotFoundException when build with hadoop3**
8292
8293 Introduce a new hbase-assembly/src/main/assembly/hadoop-three-compat.xml for build with hadoop 3.x.
8294
8295
8296 ---
8297
8298 * [HBASE-23680](https://issues.apache.org/jira/browse/HBASE-23680) | *Major* | **RegionProcedureStore missing cleaning of hfile archive**
8299
8300 Add a new config to hbase-default.xml
8301
8302   \<property\>
8303     \<name\>hbase.procedure.store.region.hfilecleaner.plugins\</name\>
8304     \<value\>org.apache.hadoop.hbase.master.cleaner.TimeToLiveHFileCleaner\</value\>
8305     \<description\>A comma-separated list of BaseHFileCleanerDelegate invoked by
8306     the RegionProcedureStore HFileCleaner service. These HFiles cleaners are
8307     called in order, so put the cleaner that prunes the most files in front. To
8308     implement your own BaseHFileCleanerDelegate, just put it in HBase's classpath
8309     and add the fully qualified class name here. Always add the above
8310     default hfile cleaners in the list as they will be overwritten in
8311     hbase-site.xml.\</description\>
8312   \</property\>
8313
8314 It will share the same TTL with other HFileCleaners. And you can also implement your own cleaner and change this property to enable it.
8315
8316
8317 ---
8318
8319 * [HBASE-23675](https://issues.apache.org/jira/browse/HBASE-23675) | *Minor* | **Move to Apache parent POM version 22**
8320
8321 Updated parent pom to Apache version 22.
8322
8323
8324 ---
8325
8326 * [HBASE-23679](https://issues.apache.org/jira/browse/HBASE-23679) | *Critical* | **FileSystem instance leaks due to bulk loads with Kerberos enabled**
8327
8328 This issues fixes an issue with Bulk Loading on installations with Kerberos enabled and more than a single RegionServer. When multiple tables are involved in hosting a table's regions which are being bulk-loaded into, all but the RegionServer hosting the table's first Region will "leak" one DistributedFileSystem object onto the heap, never freeing that memory. Eventually, with enough bulk loads, this will create a situation for RegionServers where they have no free heap space and will either spend all time in JVM GC, lose their ZK session, or crash with an OutOfMemoryError.
8329
8330 The only mitigation for this issue is to periodically restart RegionServers. All earlier versions of HBase 2.x are subject to this issue (2.0.x, \<=2.1.8, \<=2.2.3)
8331
8332
8333 ---
8334
8335 * [HBASE-23286](https://issues.apache.org/jira/browse/HBASE-23286) | *Major* | **Improve MTTR: Split WAL to HFile**
8336
8337 Add a new feature to improve MTTR which have 3 steps to failover:
8338 1. Read WAL and write HFile to region’s column family’s recovered.hfiles directory.
8339 2. Open region.
8340 3. Bulkload the recovered.hfiles for every column family.
8341
8342 Compared to DLS(distributed log split), this feature will reduce region open time significantly.
8343
8344 Config hbase.wal.split.to.hfile to true to enable this featue.
8345
8346
8347 ---
8348
8349 * [HBASE-23619](https://issues.apache.org/jira/browse/HBASE-23619) | *Trivial* | **Use built-in formatting for logging in hbase-zookeeper**
8350
8351 Changed the logging in hbase-zookeeper to use built-in formatting
8352
8353
8354 ---
8355
8356 * [HBASE-23628](https://issues.apache.org/jira/browse/HBASE-23628) | *Minor* | **Replace Apache Commons Digest Base64 with JDK8 Base64**
8357
8358 From the PR:
8359
8360 "Yes. The two create the same output... I just wrote a small test suite to increase my confidence on that. I generated many tens of millions of random byte patterns and compared the output of the two algorithms. They came back identical every time.
8361
8362 "Just in case any inquiring minds would like to know, there is no longer an encoding required when generating the strings. The JDK implementation specifically specifies that strings returned are StandardCharsets.ISO\_8859\_1. This does not change anything because UTF8 and ISO\_8859 overlap for the limited character set (64 characters) the encoding uses."
8363
8364
8365 ---
8366
8367 * [HBASE-23651](https://issues.apache.org/jira/browse/HBASE-23651) | *Major* | **Region balance throttling can be disabled**
8368
8369 Set hbase.balancer.max.balancing to a int value which \<=0 will disable region balance throttling.
8370
8371
8372 ---
8373
8374 * [HBASE-23588](https://issues.apache.org/jira/browse/HBASE-23588) | *Major* | **Cache index blocks and bloom blocks on write if CacheCompactedBlocksOnWrite is enabled**
8375
8376 If cacheOnWrite is enabled during flush or compaction, index and bloom blocks(with data blocks) would be automatically cached during write.
8377
8378
8379 ---
8380
8381 * [HBASE-23369](https://issues.apache.org/jira/browse/HBASE-23369) | *Major* | **Auto-close 'unknown' Regions reported as OPEN on RegionServers**
8382
8383 If a RegionServer reports a Region as OPEN in disagreement with Master's status on the Region, the Master now tells the RegionServer to silently close the Region.
8384
8385
8386 ---
8387
8388 * [HBASE-23596](https://issues.apache.org/jira/browse/HBASE-23596) | *Major* | **HBCKServerCrashProcedure can double assign**
8389
8390 Makes it so the recently added HBCKServerCrashProcedure -- the SCP that gets invoked when an operator schedules an SCP via hbck2 scheduleRecoveries command -- now works the same as SCP EXCEPT if master knows nothing of the scheduled servername. In this latter case, HBCKSCP will do a full scan of hbase:meta looking for instances of the passed servername. If any found it will attempt cleanup of hbase:meta references by reassigning any found OPEN or OPENING and by closing any in CLOSING state.
8391
8392 Used to fix instances of what the 'HBCK Report' page shows as 'Unknown Servers'.
8393
8394
8395 ---
8396
8397 * [HBASE-23624](https://issues.apache.org/jira/browse/HBASE-23624) | *Major* | **Add a tool to dump the procedure info in HFile**
8398
8399 Use ./hbase org.apache.hadoop.hbase.procedure2.store.region.HFileProcedurePrettyPrinter to run the tool.
8400
8401
8402 ---
8403
8404 * [HBASE-23590](https://issues.apache.org/jira/browse/HBASE-23590) | *Major* | **Update maxStoreFileRefCount to maxCompactedStoreFileRefCount**
8405
8406 RegionsRecoveryChore introduced as part of HBASE-22460 tries to reopen regions based on config: hbase.regions.recovery.store.file.ref.count.
8407 Region reopen needs to take into consideration all compacted away store files that belong to the region and not store files(non-compacted).
8408
8409 Fixed this bug as part of this Jira.
8410 Updated description for corresponding configs:
8411
8412 1. hbase.master.regions.recovery.check.interval :
8413
8414 Regions Recovery Chore interval in milliseconds. This chore keeps running at this interval to find all regions with configurable max store file ref count and reopens them. Defaults to 20 mins
8415
8416 2. hbase.regions.recovery.store.file.ref.count :
8417
8418 Very large number of ref count on a compacted store file indicates that it is a ref leak on that object(compacted store file). Such files can not be removed after it is invalidated via compaction. Only way to recover in such scenario is to reopen the region which can release all resources, like the refcount, leases, etc. This config represents Store files Ref Count threshold value considered for reopening regions. Any region with compacted store files ref count \> this value would be eligible for reopening by master. Here, we get the max refCount among all refCounts on all compacted away store files that belong to a particular region. Default value -1 indicates this feature is turned off. Only positive integer value should be provided to enable this feature.
8419
8420
8421 ---
8422
8423 * [HBASE-23618](https://issues.apache.org/jira/browse/HBASE-23618) | *Major* | **Add a tool to dump procedure info in the WAL file**
8424
8425 Use ./hbase org.apache.hadoop.hbase.procedure2.store.region.WALProcedurePrettyPrinter to run the tool.
8426
8427
8428 ---
8429
8430 * [HBASE-23617](https://issues.apache.org/jira/browse/HBASE-23617) | *Major* | **Add a stress test tool for region based procedure store**
8431
8432 Use ./hbase org.apache.hadoop.hbase.procedure2.store.region.RegionProcedureStorePerformanceEvaluation to run the tool.
8433
8434
8435 ---
8436
8437 * [HBASE-23326](https://issues.apache.org/jira/browse/HBASE-23326) | *Critical* | **Implement a ProcedureStore which stores procedures in a HRegion**
8438
8439 Use a region based procedure store to replace the old customized WAL based procedure store. The procedure data migration is done automatically during upgrading. After upgrading, the MasterProcWALs directory will be deleted and a new MasterProc directory will be created. And notice that a region will still write WAL so we still have WAL files and they will be moved to the oldWALs directory. The file name is mostly like a normal WAL file, and the only difference is that it is ended with "$masterproc$".
8440
8441
8442 ---
8443
8444 * [HBASE-23320](https://issues.apache.org/jira/browse/HBASE-23320) | *Major* | **Upgrade surefire plugin to 3.0.0-M4**
8445
8446 Bumped surefire plugin to 3.0.0-M4
8447
8448
8449 ---
8450
8451 * [HBASE-20461](https://issues.apache.org/jira/browse/HBASE-20461) | *Major* | **Implement fsync for AsyncFSWAL**
8452
8453 Now AsyncFSWAL also supports Durability.FSYNC\_WAL.
8454
8455
8456 ---
8457
8458 * [HBASE-23066](https://issues.apache.org/jira/browse/HBASE-23066) | *Minor* | **Create a config that forces to cache blocks on compaction**
8459
8460 The configuration 'hbase.rs.cacheblocksonwrite' was used to enable caching the blocks on write. But purposefully we were not caching the blocks when we do compaction (since it may be very aggressive) as the caching happens as and when the writer completes a block.
8461 In cloud environments since they have bigger sized caches - though they try to enable 'hbase.rs.prefetchblocksonopen' (non - aggressive way of caching the blocks proactively on reader creation) it does not help them because it takes time to cache the compacted blocks.
8462 This feature creates a new configuration  'hbase.rs.cachecompactedblocksonwrite' which when set to 'true' will enable the blocks created out of compaction.
8463 Remember that since it is aggressive caching the user should be having enough cache space - if not it may lead to other active blocks getting evicted.
8464 From the shell this can be enabled by using the option per Column Family also by using the below format
8465 {code}
8466 create 't1', 'f1', {NUMREGIONS =\> 15, SPLITALGO =\> 'HexStringSplit', CONFIGURATION =\> {'hbase.rs.cachecompactedblocksonwrite' =\> 'true'}}
8467 {code}
8468
8469
8470 ---
8471
8472 * [HBASE-23239](https://issues.apache.org/jira/browse/HBASE-23239) | *Major* | **Reporting on status of backing MOB files from client-facing cells**
8473
8474 <!-- markdown -->
8475
8476 Users of the MOB feature can now use the `mobrefs` utility to get statistics about data in the MOB system and verify the health of backing files on HDFS.
8477
8478 ```
8479 HADOOP_CLASSPATH=/etc/hbase/conf:$(hbase mapredcp) yarn jar \
8480     /some/path/to/hbase-shaded-mapreduce.jar mobrefs mobrefs-report-output some_table foo
8481 ```
8482
8483 See javadocs of the class `MobRefReporter` for more details.
8484
8485 the reference guide has added some information about MOB internals and troubleshooting.
8486
8487
8488 ---
8489
8490 * [HBASE-23549](https://issues.apache.org/jira/browse/HBASE-23549) | *Minor* | **Document steps to disable MOB for a column family**
8491
8492 The reference guide now includes a walk through of disabling the MOB feature if needed while maintaining availability.
8493
8494
8495 ---
8496
8497 * [HBASE-23582](https://issues.apache.org/jira/browse/HBASE-23582) | *Minor* | **Unbalanced braces in string representation of table descriptor**
8498
8499 Fixed unbalanced braces in string representation within HBase shell
8500
8501
8502 ---
8503
8504 * [HBASE-23293](https://issues.apache.org/jira/browse/HBASE-23293) | *Minor* | **[REPLICATION] make ship edits timeout configurable**
8505
8506 The default rpc timeout for ReplicationSourceShipper#shipEdits is 60s, when bulkload replication enabled, timeout exception may be occurred.
8507 Now we can conf the timeout value through replication.source.shipedits.timeout, and it’s adaptive.
8508
8509
8510 ---
8511
8512 * [HBASE-23312](https://issues.apache.org/jira/browse/HBASE-23312) | *Major* | **HBase Thrift SPNEGO configs (HBASE-19852) should be backwards compatible**
8513
8514 The newer HBase Thrift SPNEGO configs should not be required. The hbase.thrift.spnego.keytab.file and hbase.thrift.spnego.principal configs will fall back to the hbase.thrift.keytab.file and hbase.thrift.kerberos.principal original configs. The older configs will log a deprecation warning. It is preferred to new the newer SPNEGO configurations.
8515
8516
8517 ---
8518
8519 * [HBASE-22969](https://issues.apache.org/jira/browse/HBASE-22969) | *Minor* | **A new binary component comparator(BinaryComponentComparator) to perform comparison of arbitrary length and position**
8520
8521 With BinaryComponentCompartor applications will be able to design diverse and powerful set of filters for rows and columns. See https://issues.apache.org/jira/browse/HBASE-22969 for example. In general, the comparator can be used with any filter taking ByteArrayComparable. As of now, following filters take ByteArrayComparable:
8522
8523 1. RowFilter
8524 2. ValueFilter
8525 3. QualifierFilter
8526 4. FamilyFilter
8527 5. ColumnValueFilter
8528
8529
8530 ---
8531
8532 * [HBASE-23234](https://issues.apache.org/jira/browse/HBASE-23234) | *Major* | **Provide .editorconfig based on checkstyle configuration**
8533
8534 Adds a .editorconfig file with configurations populated by IntelliJ, based on our checkstyle configuration. There's lots of IntelliJ-specific configs in here that I assume are not replicated to Eclipse or Netbeans users. Any devs using those tools should push whatever updates they see fit, but please start with the checkstyle configs as the origin of truth.
8535
8536
8537 ---
8538
8539 * [HBASE-23322](https://issues.apache.org/jira/browse/HBASE-23322) | *Minor* | **[hbck2] Simplification on HBCKSCP scheduling**
8540
8541 An hbck2 scheduleRecoveries will run a subclass of ServerCrashProcedure which asks Master what Regions were on the dead Server but it will also do a hbase:meta table scan to see if any vestiges of the old Server remain (for the case where an SCP failed mid-point leaving references in place or where Master and hbase:meta deviated in accounting).
8542
8543
8544 ---
8545
8546 * [HBASE-23321](https://issues.apache.org/jira/browse/HBASE-23321) | *Minor* | **[hbck2] fixHoles of fixMeta doesn't update in-memory state**
8547
8548 If holes in hbase:meta, hbck2 fixMeta now will update Master in-memory state so you do not need to restart master just so you can assign the new hole-bridging regions.
8549
8550
8551 ---
8552
8553 * [HBASE-23282](https://issues.apache.org/jira/browse/HBASE-23282) | *Major* | **HBCKServerCrashProcedure for 'Unknown Servers'**
8554
8555 hbck2 scheduleRecoveries will now run a SCP that also looks in hbase:meta for any references to the scheduled server -- not just consult Master in-memory state -- just in case vestiges of the server are leftover in hbase:meta
8556
8557
8558 ---
8559
8560 * [HBASE-19450](https://issues.apache.org/jira/browse/HBASE-19450) | *Minor* | **Add log about average execution time for ScheduledChore**
8561
8562 <!-- markdown -->
8563 HBase internal chores now log a moving average of how long execution of each chore takes at `INFO` level for the logger `org.apache.hadoop.hbase.ScheduledChore`.
8564
8565 Such messages will happen at most once per five minutes.
8566
8567
8568 ---
8569
8570 * [HBASE-23250](https://issues.apache.org/jira/browse/HBASE-23250) | *Minor* | **Log message about CleanerChore delegate initialization should be at INFO**
8571
8572 CleanerChore delegate initialization is now logged at INFO level instead of DEBUG
8573
8574
8575 ---
8576
8577 * [HBASE-23243](https://issues.apache.org/jira/browse/HBASE-23243) | *Major* | **[pv2] Filter out SUCCESS procedures; on decent-sized cluster, plethora overwhelms problems**
8578
8579 The 'Procedures & Locks' tab in Master UI only displays problematic Procedures now (RUNNABLE, WAITING-TIMEOUT, etc.). It no longer notes procedures whose state is SUCCESS.
8580
8581
8582 ---
8583
8584 * [HBASE-23227](https://issues.apache.org/jira/browse/HBASE-23227) | *Blocker* | **Upgrade jackson-databind to 2.9.10.1 to avoid recent CVEs**
8585
8586 <!-- markdown -->
8587
8588 the Apache HBase REST Proxy now uses Jackson Databind version 2.9.10.1 to address the following CVEs
8589
8590   - CVE-2019-16942
8591   - CVE-2019-16943
8592
8593 Users of prior releases with Jackson Databind 2.9.10 are advised to either upgrade to this release or to upgrade their local Jackson Databind jar directly.
8594
8595
8596 ---
8597
8598 * [HBASE-23222](https://issues.apache.org/jira/browse/HBASE-23222) | *Critical* | **Better logging and mitigation for MOB compaction failures**
8599
8600 <!-- markdown -->
8601
8602 The MOB compaction process in the HBase Master now logs more about its activity.
8603
8604 In the event that you run into the problems described in HBASE-22075, there is a new HFileCleanerDelegate that will stop all removal of MOB hfiles from the archive area. It can be configured by adding `org.apache.hadoop.hbase.mob.ManualMobMaintHFileCleaner` to the list configured for `hbase.master.hfilecleaner.plugins`. This new cleaner delegate will cause your archive area to grow unbounded; you will have to manually prune files which may be prohibitively complex. Consider if your use case will allow you to mitigate by disabling mob compactions instead.
8605
8606 Caveats:
8607 * Be sure the list of cleaner delegates still includes the default cleaners you will likely need: ttl, snapshot, and hlink.
8608 * Be mindful that if you enable this cleaner delegate then there will be *no* automated process for removing these mob hfiles. You should see a single region per table in `%hbase_root%/archive` that accumulates files over time. You will have to determine which of these files are safe or not to remove.
8609 * You should list this cleaner delegate after the snapshot and hlink delegates so that you can enable sufficient logging to determine when an archived mob hfile is needed by those subsystems. When set to `TRACE` logging, the CleanerChore logger will include archive retention decision justifications.
8610 * If your use case creates a large number of uniquely named tables, this new delegate will cause memory pressure on the master.
8611
8612
8613 ---
8614
8615 * [HBASE-15519](https://issues.apache.org/jira/browse/HBASE-15519) | *Major* | **Add per-user metrics**
8616
8617 Adds per-user metrics for reads/writes to each RegionServer. These metrics are exported by default. hbase.regionserver.user.metrics.enabled can be used to disable the feature if desired for any reason.
8618
8619
8620 ---
8621
8622 * [HBASE-22460](https://issues.apache.org/jira/browse/HBASE-22460) | *Minor* | **Reopen a region if store reader references may have leaked**
8623
8624 Leaked store files can not be removed even after it is invalidated via compaction. A reasonable mitigation for a reader reference leak would be a fast reopen of the region on the same server.
8625
8626 Configs:
8627
8628 1. hbase.master.regions.recovery.check.interval :
8629
8630 Regions Recovery Chore interval in milliseconds. This chore keeps running at this interval to find all regions with configurable max store file ref count and reopens them. Defaults to 20 mins
8631
8632 2. hbase.regions.recovery.store.file.ref.count :
8633
8634 This config represents Store files Ref Count threshold value considered for reopening regions. Any region with store files ref count \> this value would be eligible for reopening by master. Default value -1 indicates this feature is turned off. Only positive integer value should be provided to enable this feature.
8635
8636
8637 ---
8638
8639 * [HBASE-23172](https://issues.apache.org/jira/browse/HBASE-23172) | *Minor* | **HBase Canary region success count metrics reflect column family successes, not region successes**
8640
8641 Added a comment to make clear that read/write success counts are tallying column family success counts, not region success counts.
8642
8643 Additionally, the region read and write latencies previously only stored the latencies of the last column family of the region reads/writes. This has been fixed by using a map of each region to a list of read and write latency values.
8644
8645
8646 ---
8647
8648 * [HBASE-23177](https://issues.apache.org/jira/browse/HBASE-23177) | *Major* | **If fail to open reference because FNFE, make it plain it is a Reference**
8649
8650 Changes the message on the FNFE exception thrown when the file a Reference points to is missing; the message now includes detail on Reference as well as pointed-to file so can connect how FNFE relates to region open.
8651
8652
8653 ---
8654
8655 * [HBASE-20626](https://issues.apache.org/jira/browse/HBASE-20626) | *Major* | **Change the value of "Requests Per Second" on WEBUI**
8656
8657 Use 'totalRowActionRequestCount' to calculate QPS on web UI.
8658
8659
8660 ---
8661
8662 * [HBASE-22874](https://issues.apache.org/jira/browse/HBASE-22874) | *Critical* | **Define a public interface for Canary and move existing implementation to LimitedPrivate**
8663
8664 <!-- markdown -->
8665 Downstream users who wish to programmatically check the health of their HBase cluster may now rely on a public interface derived from the previously private implementation of the canary cli tool. The interface is named `Canary` and can be found in the user facing javadocs.
8666
8667 Downstream users who previously relied on the invoking the canary via the Java classname (either on the command line or programmatically) will need to change how they do so because the non-public implementation has moved.
8668
8669
8670 ---
8671
8672 * [HBASE-23035](https://issues.apache.org/jira/browse/HBASE-23035) | *Major* | **Retain region to the last RegionServer make the failover slower**
8673
8674 Since 2.0.0，when one regionserver crashed and back online again, AssignmentManager will retain the region locations and try assign the regions to this regionserver(same host:port with the crashed one) again. But for 1.x.x, the behavior is round-robin assignment for the regions belong to the crashed regionserver. This jira change the "retain" assignment to round-robin assignment, which is same with 1.x.x version. This change will make the failover faster and improve availability.
8675
8676
8677 ---
8678
8679 * [HBASE-23046](https://issues.apache.org/jira/browse/HBASE-23046) | *Minor* | **Remove compatibility case from truncate command**
8680
8681 Remove backward compatibility from \`truncate\` and \`truncate\_preserve\` shell commands. This means that these commands from HBase Clients are not compatible with pre-0.99 HBase clusters.
8682
8683
8684 ---
8685
8686 * [HBASE-23040](https://issues.apache.org/jira/browse/HBASE-23040) | *Minor* | **region mover gives NullPointerException instead of saying a host isn't in the cluster**
8687
8688 giving the region mover "unload" command a region server name that isn't recognized by the cluster results in a "I don't know about that host" message instead of a NPE.
8689
8690 set log level to DEBUG if you'd like the region mover to log the set of region server names it got back from the cluster.
8691
8692
8693 ---
8694
8695 * [HBASE-21874](https://issues.apache.org/jira/browse/HBASE-21874) | *Major* | **Bucket cache on Persistent memory**
8696
8697 Added a new IOEngine type for Bucket cache ie Persistent memory. In order to use BC over pmem configure IOEngine as
8698 \<property\>
8699     \<name\>hbase.bucketcache.ioengine\</name\>
8700     \<value\> pmem:///path in persistent memory \</value\>
8701   \</property\>
8702
8703
8704 ---
8705
8706 * [HBASE-22760](https://issues.apache.org/jira/browse/HBASE-22760) | *Major* | **Stop/Resume Snapshot Auto-Cleanup activity with shell command**
8707
8708 By default, snapshot auto cleanup based on TTL would be enabled for any new cluster. At any point in time, if snapshot cleanup is supposed to be stopped due to some snapshot restore activity or any other reason, it is advisable to disable it using shell command:
8709 hbase\> snapshot\_cleanup\_switch false
8710
8711 We can re-enable it using:
8712 hbase\> snapshot\_cleanup\_switch true
8713
8714 We can query whether snapshot auto cleanup is enabled for cluster using:
8715 hbase\> snapshot\_cleanup\_enabled
8716
8717
8718 ---
8719
8720 * [HBASE-22796](https://issues.apache.org/jira/browse/HBASE-22796) | *Major* | **[HBCK2] Add fix of overlaps to fixMeta hbck Service**
8721
8722 Adds fix of overlaps to the fixMeta hbck service method. Uses the bulk-merge facility. Merges a max of 10 at a time. Set hbase.master.metafixer.max.merge.count to higher if you want to do more than 10 in the one go.
8723
8724
8725 ---
8726
8727 * [HBASE-21745](https://issues.apache.org/jira/browse/HBASE-21745) | *Critical* | **Make HBCK2 be able to fix issues other than region assignment**
8728
8729 This issue adds via its subtasks:
8730
8731  \* An 'HBCK Report' page to the Master UI added by HBASE-22527+HBASE-22709+HBASE-22723+ (since 2.1.6, 2.2.1, 2.3.0). Lists consistency or anomalies found via new hbase:meta consistency checking extensions added to CatalogJanitor (holes, overlaps, bad servers) and by a new 'HBCK chore' that runs at a lesser periodicity that will note filesystem orphans and overlaps as well as the following conditions:
8732  \*\* Master thought this region opened, but no regionserver reported it.
8733  \*\* Master thought this region opened on Server1, but regionserver reported Server2
8734  \*\* More than one regionservers reported opened this region
8735  Both chores can be triggered from the shell to regenerate ‘new’ reports.
8736  \* Means of scheduling a ServerCrashProcedure (HBASE-21393).
8737  \* An ‘offline’ hbase:meta rebuild (HBASE-22680).
8738  \* Offline replace of hbase.version and hbase.id
8739  \* Documentation on how to use completebulkload tool to ‘adopt’ orphaned data found by new HBCK2 ‘filesystem’ check (see below) and ‘HBCK chore’ (HBASE-22859)
8740  \* A ‘holes’ and ‘overlaps’ fix that runs in the master that uses new bulk-merge facility to collapse many overlaps in the one go.
8741  \* hbase-operator-tools HBCK2 client tool got a bunch of additions:
8742  \*\* A specialized 'fix' for the case where operators ran old hbck 'offlinemeta' repair and destroyed their hbase:meta; it ties together holes in meta with orphaned data in the fs (HBASE-22567)
8743  \*\* A ‘filesystem’ command that reports on orphan data as well as bad references and hlinks with a ‘fix’ for the latter two options (based on hbck1 facility updated).
8744  \*\* Adds back the ‘replication’ fix facility from hbck1 (HBASE-22717)
8745
8746 The compound result is that hbck2 is now in excess of hbck1 abilities. The provided functionality is disaggregated as per the hbck2 philosophy of providing 'plumbing' rather than 'porcelain' so there is work to do still adding fix-it playbooks, scripting across outages, and automation.
8747
8748
8749 ---
8750
8751 * [HBASE-22802](https://issues.apache.org/jira/browse/HBASE-22802) | *Major* | **Avoid temp ByteBuffer allocation in FileIOEngine#read**
8752
8753 HBASE-21879 introduces a utility class (org.apache.hadoop.hbase.io.ByteBuffAllocator) used for allocating/freeing ByteBuffers from/to NIO ByteBuffer pool, when BucketCache enabled with file or mmap engine, we will use this ByteBuffer pool to avoid temp ByteBuffer allocation a lot.
8754
8755
8756 ---
8757
8758 * [HBASE-11062](https://issues.apache.org/jira/browse/HBASE-11062) | *Major* | **hbtop**
8759
8760 Introduces hbtop that's a real-time monitoring tool for HBase like Unix's top command. See the ref guide for the details: https://hbase.apache.org/book.html#hbtop
8761
8762
8763 ---
8764
8765 * [HBASE-21879](https://issues.apache.org/jira/browse/HBASE-21879) | *Major* | **Read HFile's block to ByteBuffer directly instead of to byte for reducing young gc purpose**
8766
8767 Before this issue, we've made the read path 100% offheap when block hit the BucketCache 100%, but if the cache missed then RS need to read the block by on-heap API, which would cause high young GC pressure.
8768 This issue will read the block by offheap even if reading the block from filesystem directly, it have some requirement for hadoop version(\>=2.9.3) but can also works with older hadoop version(means still works fine but will read block onheap). We have written a careful doc about the implementation, performance and practice here: https://docs.google.com/document/d/1xSy9axGxafoH-Qc17zbD2Bd--rWjjI00xTWQZ8ZwI\_E/edit#heading=h.nch5d72p27ex, for more details please read it.
8769
8770
8771 ---
8772
8773 * [HBASE-22618](https://issues.apache.org/jira/browse/HBASE-22618) | *Major* | **added the possibility to load custom cost functions**
8774
8775 <!-- markdown -->
8776 Extends `StochasticLoadBalancer` to support user-provided cost function. These are loaded in addition to the default set of cost functions. Custom function implementations must extend `StochasticLoadBalancer$CostFunction`. Enable any additional functions by placing them on the master class path and configuring `hbase.master.balancer.stochastic.additionalCostFunctions` with a comma-separated list of fully-qualified class names.
8777
8778
8779 ---
8780
8781 * [HBASE-22867](https://issues.apache.org/jira/browse/HBASE-22867) | *Critical* | **The ForkJoinPool in CleanerChore will spawn thousands of threads in our cluster with thousands table**
8782
8783 Replace the ForkJoinPool in CleanerChore by ThreadPoolExecutor which can limit the spawn thread size and avoid  the master GC frequently.  The replacement is an internal implementation in CleanerChore,  so no config key change, the upstream users can just upgrade the hbase master without any other change.
8784
8785
8786 ---
8787
8788 * [HBASE-22810](https://issues.apache.org/jira/browse/HBASE-22810) | *Major* | **Initialize an separate ThreadPoolExecutor for taking/restoring snapshot**
8789
8790 Introduced a new config key for the snapshot taking/restoring operations at master side:  hbase.master.executor.snapshot.threads, its default value is 3.  means we can have 3 snapshot operations running at the same time.
8791
8792
8793 ---
8794
8795 * [HBASE-22863](https://issues.apache.org/jira/browse/HBASE-22863) | *Major* | **Avoid Jackson versions and dependencies with known CVEs**
8796
8797 1. Stopped exposing vulnerable Jackson1 dependencies so that downstreamers would not pull it in from HBase.
8798 2. However, since Hadoop requires some Jackson1 dependencies, put vulnerable Jackson mapper at test scope in some HBase modules and hence, HBase tarball created by hbase-assembly contains Jackson1 mapper jar in lib. Still, downsteam applications can't pull in Jackson1 from HBase.
8799
8800
8801 ---
8802
8803 * [HBASE-22841](https://issues.apache.org/jira/browse/HBASE-22841) | *Major* | **TimeRange's factory functions do not support ranges, only \`allTime\` and \`at\`**
8804
8805 Add serveral API in TimeRange class for avoiding using the deprecated TimeRange constructor:
8806 \* TimeRange#from: Represents the time interval [minStamp, Long.MAX\_VALUE)
8807 \* TimeRange#until: Represents the time interval [0, maxStamp)
8808 \* TimeRange#between: Represents the time interval [minStamp, maxStamp)
8809
8810
8811 ---
8812
8813 * [HBASE-22833](https://issues.apache.org/jira/browse/HBASE-22833) | *Minor* | **MultiRowRangeFilter should provide a method for creating a filter which is functionally equivalent to multiple prefix filters**
8814
8815 Provide a public method in MultiRowRangeFilter class to speed the requirement of filtering with multiple row prefixes, it will expand the row prefixes as multiple rowkey ranges by MultiRowRangeFilter, it's more efficient.
8816 {code}
8817 public MultiRowRangeFilter(byte[][] rowKeyPrefixes);
8818 {code}
8819
8820
8821 ---
8822
8823 * [HBASE-22856](https://issues.apache.org/jira/browse/HBASE-22856) | *Major* | **HBASE-Find-Flaky-Tests fails with pip error**
8824
8825 Update the base docker image to ubuntu 18.04 for the find flaky tests jenkins job.
8826
8827
8828 ---
8829
8830 * [HBASE-22771](https://issues.apache.org/jira/browse/HBASE-22771) | *Major* | **[HBCK2] fixMeta method and server-side support**
8831
8832 Adds a fixMeta method to hbck Service. Fixes holes in hbase:meta. Follow-up to fix overlaps. See HBASE-22567 also.
8833
8834 Follow-on is adding a client-side to hbase-operator-tools that can exploit this new addition (HBASE-22825)
8835
8836
8837 ---
8838
8839 * [HBASE-22777](https://issues.apache.org/jira/browse/HBASE-22777) | *Major* | **Add a multi-region merge (for fixing overlaps, etc.)**
8840
8841 Changes merge so you can merge more than two regions at a time.  Currently only available inside HBase. HBASE-22827, a follow-on, is about exposing the facility in the Admin API (and then via the shell).
8842
8843
8844 ---
8845
8846 * [HBASE-15666](https://issues.apache.org/jira/browse/HBASE-15666) | *Critical* | **shaded dependencies for hbase-testing-util**
8847
8848 New shaded artifact for testing: hbase-shaded-testing-util.
8849
8850
8851 ---
8852
8853 * [HBASE-22776](https://issues.apache.org/jira/browse/HBASE-22776) | *Major* | **Rename config names in user scan snapshot feature**
8854
8855 After HBASE-22776, the steps to config user scan snapshot feature is as followings:
8856 1. Check HDFS configuration
8857 2. Add master coprocessor:
8858     hbase.coprocessor.master.classes=
8859     “org.apache.hadoop.hbase.security.access.AccessController,
8860 org.apache.hadoop.hbase.security.access.SnapshotScannerHDFSAclController”
8861 3. Enable this feature:
8862     hbase.acl.sync.to.hdfs.enable=true
8863 4. Modify table scheme to enable this feature for a table:
8864     alter 't1', CONFIGURATION =\> {'hbase.acl.sync.to.hdfs.enable' =\> 'true'}
8865
8866
8867 ---
8868
8869 * [HBASE-22539](https://issues.apache.org/jira/browse/HBASE-22539) | *Blocker* | **WAL corruption due to early DBBs re-use when Durability.ASYNC\_WAL is used**
8870
8871 We found a critical bug which can lead to WAL corruption when Durability.ASYNC\_WAL is used. The reason is that we release a ByteBuffer before actually persist the content into WAL file.
8872
8873 The problem maybe lead to several errors, for example, ArrayIndexOfOutBounds when replaying WAL. This is because that the ByteBuffer is reused by others.
8874
8875 ERROR org.apache.hadoop.hbase.executor.EventHandler: Caught throwable while processing event RS\_LOG\_REPLAY
8876 java.lang.ArrayIndexOutOfBoundsException: 18056
8877         at org.apache.hadoop.hbase.KeyValue.getFamilyLength(KeyValue.java:1365)
8878         at org.apache.hadoop.hbase.KeyValue.getFamilyLength(KeyValue.java:1358)
8879         at org.apache.hadoop.hbase.PrivateCellUtil.matchingFamily(PrivateCellUtil.java:735)
8880         at org.apache.hadoop.hbase.CellUtil.matchingFamily(CellUtil.java:816)
8881         at org.apache.hadoop.hbase.wal.WALEdit.isMetaEditFamily(WALEdit.java:143)
8882         at org.apache.hadoop.hbase.wal.WALEdit.isMetaEdit(WALEdit.java:148)
8883         at org.apache.hadoop.hbase.wal.WALSplitter.splitLogFile(WALSplitter.java:297)
8884         at org.apache.hadoop.hbase.wal.WALSplitter.splitLogFile(WALSplitter.java:195)
8885         at org.apache.hadoop.hbase.regionserver.SplitLogWorker$1.exec(SplitLogWorker.java:100)
8886
8887 And may even cause segmentation fault and crash the JVM directly. You will see a hs\_err\_pidXXX.log file and usually the problem is SIGSEGV. This is usually because that the ByteBuffer has already been returned to the OS and used for other purpose.
8888
8889 The problem has been reported several times in the past and this time Wellington Ramos Chevreuil provided the full logs and deeply analyzed the logs so we can find the root cause. And Lijin Bin figured out that the problem may only happen when Durability.ASYNC\_WAL is used. Thanks to them.
8890
8891 The problem only effects the 2.x releases, all users are highly recommand to upgrade to a release which has this fix in, especially that if you use Durability.ASYNC\_WAL.
8892
8893
8894 ---
8895
8896 * [HBASE-22737](https://issues.apache.org/jira/browse/HBASE-22737) | *Major* | **Add a new admin method and shell cmd to trigger the hbck chore to run**
8897
8898 Add a new method runHbckChore in Hbck interface and a new shell cmd hbck\_chore\_run to request HBCK chore to run at master side.
8899
8900
8901 ---
8902
8903 * [HBASE-22741](https://issues.apache.org/jira/browse/HBASE-22741) | *Major* | **Show catalogjanitor consistency complaints in new 'HBCK Report' page**
8904
8905 Adds a "CatalogJanitor hbase:meta Consistency Issues" section to the new 'HBCK Report' page added by HBASE-22709. This section is empty unless the most recent CatalogJanitor scan turned up problems. If so, will show table of issues found.
8906
8907
8908 ---
8909
8910 * [HBASE-22723](https://issues.apache.org/jira/browse/HBASE-22723) | *Major* | **Have CatalogJanitor report holes and overlaps; i.e. problems it sees when doing its regular scan of hbase:meta**
8911
8912 When CatalogJanitor runs, it now checks for holes, overlaps, empty info:regioninfo columns and bad servers. Dumps findings into log. Follow-up adds report to new 'HBCK Report' linked off the Master UI.
8913
8914 NOTE: All features but the badserver check made it into branch-2.1 and branch-2.0 backports.
8915
8916
8917 ---
8918
8919 * [HBASE-22714](https://issues.apache.org/jira/browse/HBASE-22714) | *Trivial* | **BuffferedMutatorParams opertationTimeOut() is misspelt**
8920
8921 The misspelled BufferedMutatorParams.opertationTimeout method has been marked as deprecated, and will be removed in 4.0.0. Please use the BufferedMutatorParams.operationTimeout method instead.
8922
8923
8924 ---
8925
8926 * [HBASE-22580](https://issues.apache.org/jira/browse/HBASE-22580) | *Major* | **Add a table attribute to make user scan snapshot feature configurable for table**
8927
8928 If a table user scan snapshots of the table, please config the following table scheme attribute to make granted users' ACLs are added to hfiles:
8929 alter 't1', CONFIGURATION =\> {'hbase.user.scan.snapshot.enable' =\> 'true'}
8930
8931
8932 ---
8933
8934 * [HBASE-22709](https://issues.apache.org/jira/browse/HBASE-22709) | *Major* | **Add a chore thread in master to do hbck checking and display results in 'HBCK Report' page**
8935
8936 1. Add a new chore thread in master to do hbck checking
8937 2. Add a new web ui "HBCK Report" page to display checking results.
8938
8939 This feature is enabled by default. And the hbck chore run per 60 minutes by default. You can config "hbase.master.hbck.checker.interval" to a value lesser than or equal to 0 for disabling the chore.
8940
8941 Notice: the config "hbase.master.hbck.checker.interval" was renamed to "hbase.master.hbck.chore.interval" in HBASE-22737.
8942
8943
8944 ---
8945
8946 * [HBASE-21773](https://issues.apache.org/jira/browse/HBASE-21773) | *Critical* | **rowcounter utility should respond to pleas for help**
8947
8948 This adds [-h\|-help] options to rowcounter. Passing either -h or -help will print rowcounter guide as below:
8949
8950 $hbase rowcounter -h
8951
8952 usage: hbase rowcounter \<tablename\> [options] [\<column1\> \<column2\>...]
8953 Options:
8954     --starttime=\<arg\>       starting time filter to start counting rows from.
8955     --endtime=\<arg\>         end time filter limit, to only count rows up to this timestamp.
8956     --range=\<arg\>           [startKey],[endKey][;[startKey],[endKey]...]]
8957     --expectedCount=\<arg\>   expected number of rows to be count.
8958 For performance, consider the following configuration properties:
8959 -Dhbase.client.scanner.caching=100
8960 -Dmapreduce.map.speculative=false
8961
8962
8963 ---
8964
8965 * [HBASE-22578](https://issues.apache.org/jira/browse/HBASE-22578) | *Major* | **HFileCleaner should not delete empty ns/table directories used for user san snapshot feature**
8966
8967 The HFileCleaner will clean the empty directories under archive, but if enable user scan snaphot feature, the user ACLs are set at there directories, so please config the following cleaner to make the directories with user ACLs not be cleaned:
8968 hbase.master.hfilecleaner.plugins=org.apache.hadoop.hbase.security.access.SnapshotScannerHDFSAclCleaner
8969
8970
8971 ---
8972
8973 * [HBASE-22722](https://issues.apache.org/jira/browse/HBASE-22722) | *Blocker* | **Upgrade jackson databind dependencies to 2.9.9.1**
8974
8975 Upgrade jackson databind dependency to 2.9.9.1 due to CVEs
8976
8977 https://nvd.nist.gov/vuln/detail/CVE-2019-12814
8978
8979 https://nvd.nist.gov/vuln/detail/CVE-2019-12384
8980
8981
8982 ---
8983
8984 * [HBASE-22527](https://issues.apache.org/jira/browse/HBASE-22527) | *Major* | **[hbck2] Add a master web ui to show the problematic regions**
8985
8986 Add a new master web UI to show the potentially problematic opened regions. There are three case:
8987 1. Master thought this region opened, but no regionserver reported it.
8988 2. Master thought this region opened on Server1, but regionserver reported Server2
8989 3. More than one regionservers reported opened this region
8990
8991
8992 ---
8993
8994 * [HBASE-22648](https://issues.apache.org/jira/browse/HBASE-22648) | *Minor* | **Snapshot TTL**
8995
8996 Feature: Take a Snapshot With TTL for auto-cleanup
8997
8998 Attribute:
8999 1. TTL
9000      - Specify TTL in sec while creating snapshot. e.g. snapshot 'mytable', 'snapshot1234', {TTL =\> 86400}  (snapshot to be auto-cleaned after 24 hr)
9001
9002 Configs:
9003 1. Default Snapshot TTL:
9004      - FOREVER by default
9005      - User specified Default TTL(sec) with config: hbase.master.snapshot.ttl
9006
9007 2. If Snapshot cleanup is supposed to be stopped due to some snapshot restore activity, disable it with config:
9008      - hbase.master.cleaner.snapshot.disable: "true"
9009     With this config, HMaster needs restart just like any other hbase-site config.
9010
9011
9012 For more details, see the section "Take a Snapshot With TTL" in the HBase Reference Guide.
9013
9014
9015 ---
9016
9017 * [HBASE-22610](https://issues.apache.org/jira/browse/HBASE-22610) | *Trivial* | **[BucketCache] Rename "hbase.offheapcache.minblocksize"**
9018
9019 The config point "hbase.offheapcache.minblocksize" was wrong and is now deprecated. The new config point is "hbase.blockcache.minblocksize".
9020
9021
9022 ---
9023
9024 * [HBASE-22690](https://issues.apache.org/jira/browse/HBASE-22690) | *Major* | **Deprecate / Remove OfflineMetaRepair in hbase-2+**
9025
9026 OfflineMetaRepair is no longer supported in HBase-2+. Please refer to https://hbase.apache.org/book.html#HBCK2
9027
9028 This tool is deprecated in 2.x and will be removed in 3.0.
9029
9030
9031 ---
9032
9033 * [HBASE-22673](https://issues.apache.org/jira/browse/HBASE-22673) | *Major* | **Avoid to expose protobuf stuff in Hbck interface**
9034
9035 Mark the Hbck#scheduleServerCrashProcedure(List\<HBaseProtos.ServerName\> serverNames) as deprecated. Use Hbck#scheduleServerCrashProcedures(List\<ServerName\> serverNames) instead.
9036
9037
9038 ---
9039
9040 * [HBASE-22617](https://issues.apache.org/jira/browse/HBASE-22617) | *Blocker* | **Recovered WAL directories not getting cleaned up**
9041
9042 In HBASE-20734 we moved the recovered.edits onto the wal file system but when constructing the directory we missed the BASE\_NAMESPACE\_DIR('data'). So when using the default config, you will find that there are lots of new directories at the same level with the 'data' directory.
9043
9044 In this issue, we add the BASE\_NAMESPACE\_DIR back, and also try our best to clean up the wrong directories. But we can only clean up the region level directories, so if you want a clean fs layout on HDFS you still need to manually delete the empty directories at the same level with 'data'.
9045
9046 The effect versions are 2.2.0, 2.1.[1-5], 1.4.[8-10], 1.3.[3-5].
9047
9048
9049 ---
9050
9051 * [HBASE-21995](https://issues.apache.org/jira/browse/HBASE-21995) | *Major* | **Add a coprocessor to set HDFS ACL for hbase granted user**
9052
9053 Add a coprocessor to set HDFS acls to make hbase granted users with READ permission have the access to scan snapshots.
9054 To use this feature, please make sure the HDFS config is set:
9055 dfs.namenode.acls.enabled=true
9056 fs.permissions.umask-mode=027
9057
9058 and set the HBase config:
9059 hbase.coprocessor.master.classes="org.apache.hadoop.hbase.security.access.AccessController,org.apache.hadoop.hbase.security.access.SnapshotScannerHDFSAclController"
9060 hbase.user.scan.snapshot.enable=true
9061
9062
9063 ---
9064
9065 * [HBASE-22596](https://issues.apache.org/jira/browse/HBASE-22596) | *Minor* | **[Chore] Separate the execution period between CompactionChecker and PeriodicMemStoreFlusher**
9066
9067 hbase.regionserver.compaction.check.period is used for controlling how often the compaction checker runs. If unset, will use hbase.server.thread.wakefrequency as default value.
9068
9069 hbase.regionserver.flush.check.period is used for controlling how ofter the flush checker runs. If unset, will use hbase.server.thread.wakefrequency as default value.
9070
9071
9072 ---
9073
9074 * [HBASE-22588](https://issues.apache.org/jira/browse/HBASE-22588) | *Major* | **Upgrade jaxws-ri dependency to 2.3.2**
9075
9076 <!-- markdown -->
9077
9078 When run with JDK11 HBase now uses more recent version of the jaxws reference implementation (v2.3.2).
9079
9080
9081 ---
9082
9083 * [HBASE-21536](https://issues.apache.org/jira/browse/HBASE-21536) | *Trivial* | **Fix completebulkload usage instructions**
9084
9085 Added completebulkload short name for BulkLoadHFilesTool to bin/hbase.
9086
9087
9088 ---
9089
9090 * [HBASE-22500](https://issues.apache.org/jira/browse/HBASE-22500) | *Blocker* | **Modify pom and jenkins jobs for hadoop versions**
9091
9092 Change the default hadoop-3 version to 3.1.2. Drop the support for the releases which are effected by CVE-2018-8029, see this email https://lists.apache.org/thread.html/3d6831c3893cd27b6850aea2feff7d536888286d588e703c6ffd2e82@%3Cuser.hadoop.apache.org%3E
9093
9094
9095 ---
9096
9097 * [HBASE-22459](https://issues.apache.org/jira/browse/HBASE-22459) | *Minor* | **Expose store reader reference count**
9098
9099 This change exposes the aggregate count of store reader references for a given store as 'storeRefCount' in region metrics and ClusterStatus.
9100
9101
9102 ---
9103
9104 * [HBASE-22469](https://issues.apache.org/jira/browse/HBASE-22469) | *Minor* | **replace md5 checksum in saveVersion script with sha512 for hbase version information**
9105
9106 The HBase "source checksum" now uses SHA512 instead of MD5.
9107
9108
9109 ---
9110
9111 * [HBASE-22148](https://issues.apache.org/jira/browse/HBASE-22148) | *Blocker* | **Provide an alternative to CellUtil.setTimestamp**
9112
9113 <!-- markdown -->
9114
9115 The `CellUtil.setTimestamp` method changes to be an API with audience `LimitedPrivate(COPROC)` in HBase 3.0. With that designation the API should remain stable within a given minor release line, but may change between minor releases.
9116
9117 Previously, this method was deprecated in HBase 2.0 for removal in HBase 3.0. Deprecation messages in HBase 2.y releases have been updated to indicate the expected API audience change.
9118
9119
9120 ---
9121
9122 * [HBASE-20782](https://issues.apache.org/jira/browse/HBASE-20782) | *Minor* | **Fix duplication of TestServletFilter.access**
9123
9124 The access method was used to the HttpServerFunctionalTest class as a common place.
9125
9126
9127 ---
9128
9129 * [HBASE-21991](https://issues.apache.org/jira/browse/HBASE-21991) | *Major* | **Fix MetaMetrics issues - [Race condition, Faulty remove logic], few improvements**
9130
9131 The class LossyCounting was unintentionally marked Public but was never intended to be part of our public API. This oversight has been corrected and LossyCounting is now marked as Private and going forward may be subject to additional breaking changes or removal without notice. If you have taken a dependency on this class we recommend cloning it locally into your project before upgrading to this release.
9132
9133
9134 ---
9135
9136 * [HBASE-22226](https://issues.apache.org/jira/browse/HBASE-22226) | *Trivial* | **Incorrect level for headings in asciidoc**
9137
9138 Warnings for level headings are corrected in the book for the HBase Incompatibilities section.
9139
9140
9141 ---
9142
9143 * [HBASE-20970](https://issues.apache.org/jira/browse/HBASE-20970) | *Major* | **Update hadoop check versions for hadoop3 in hbase-personality**
9144
9145 Add hadoop 3.0.3, 3.1.1 3.1.2 in our hadoop check jobs.
9146
9147
9148 ---
9149
9150 * [HBASE-21784](https://issues.apache.org/jira/browse/HBASE-21784) | *Major* | **Dump replication queue should show list of wal files ordered chronologically**
9151
9152 The DumpReplicationQueues tool will now list replication queues sorted in chronological order.
9153
9154
9155 ---
9156
9157 * [HBASE-21048](https://issues.apache.org/jira/browse/HBASE-21048) | *Major* | **Get LogLevel is not working from console in secure environment**
9158
9159 Support get\|set LogLevel in secure(kerberized) environment.
9160
9161
9162 ---
9163
9164 * [HBASE-22384](https://issues.apache.org/jira/browse/HBASE-22384) | *Minor* | **Formatting issues in administration section of book**
9165
9166 Fixes a formatting issue in the administration section of the book, where listing indentation were a little bit off.
9167
9168
9169 ---
9170
9171 * [HBASE-22377](https://issues.apache.org/jira/browse/HBASE-22377) | *Major* | **Provide API to check the existence of a namespace which does not require ADMIN permissions**
9172
9173 This change adds the new method listNamespaces to the Admin interface, which can be used to retrieve a list of the namespaces present in the schema as an unprivileged operation. Formerly the only available method for accomplishing this was listNamespaceDescriptors, which requires GLOBAL CREATE or ADMIN permissions.
9174
9175
9176 ---
9177
9178 * [HBASE-22399](https://issues.apache.org/jira/browse/HBASE-22399) | *Major* | **Change default hadoop-two.version to 2.8.x and remove the 2.7.x hadoop checks**
9179
9180 Now the default hadoop-two.version has been changed to 2.8.5, and all hadoop versions before 2.8.2(exclude) will not be supported any more.
9181
9182
9183 ---
9184
9185 * [HBASE-22392](https://issues.apache.org/jira/browse/HBASE-22392) | *Trivial* | **Remove extra/useless +**
9186
9187 Removed extra + in HRegion, HStore and LoadIncrementalHFiles for branch-2 and HRegion and HStore for branch-1.
9188
9189
9190 ---
9191
9192 * [HBASE-20494](https://issues.apache.org/jira/browse/HBASE-20494) | *Major* | **Upgrade com.yammer.metrics dependency**
9193
9194 Updated metrics core from 3.2.1 to 3.2.6.
9195
9196
9197 ---
9198
9199 * [HBASE-22358](https://issues.apache.org/jira/browse/HBASE-22358) | *Minor* | **Change rubocop configuration for method length**
9200
9201 The rubocop definition for the maximum method length was set to 75.
9202
9203
9204 ---
9205
9206 * [HBASE-22379](https://issues.apache.org/jira/browse/HBASE-22379) | *Minor* | **Fix Markdown for "Voting on Release Candidates" in book**
9207
9208 Fixes the formatting of the "Voting on Release Candidates" to actually show the quote and code formatting of the RAT check.
9209
9210
9211 ---
9212
9213 * [HBASE-20851](https://issues.apache.org/jira/browse/HBASE-20851) | *Minor* | **Change rubocop config for max line length of 100**
9214
9215 The rubocop configuration in the hbase-shell module now allows a line length with 100 characters, instead of 80 as before. For everything before 2.1.5 this change introduces rubocop itself.
9216
9217
9218 ---
9219
9220 * [HBASE-22301](https://issues.apache.org/jira/browse/HBASE-22301) | *Minor* | **Consider rolling the WAL if the HDFS write pipeline is slow**
9221
9222 This change adds new conditions for rolling the WAL for when syncs on the HDFS writer pipeline are perceived to be slow.
9223
9224 As before the configuration parameter hbase.regionserver.wal.slowsync.ms sets the slow sync warning threshold.
9225
9226 If we encounter hbase.regionserver.wal.slowsync.roll.threshold number of slow syncs (default 100) within the interval defined by hbase.regionserver.wal.slowsync.roll.interval.ms (default 1 minute), we will request a WAL roll.
9227
9228 Or, if the time for any sync exceeds the threshold set by hbase.regionserver.wal.roll.on.sync.ms (default 10 seconds) we will request a WAL roll immediately.
9229
9230 Operators can monitor how often these new thresholds result in a WAL roll by looking at newly added metrics to the WAL related metric group:
9231 \* slowSyncRollRequest - How many times a roll was requested due to sync too slow on the write pipeline.
9232
9233 Additionally, as a part of this change there are also additional metrics for existing reasons for a WAL roll:
9234 \* errorRollRequest - How many times a roll was requested due to I/O or other errors.
9235 \* sizeRollRequest - How many times a roll was requested due to file size roll threshold.
9236
9237
9238 ---
9239
9240 * [HBASE-21883](https://issues.apache.org/jira/browse/HBASE-21883) | *Minor* | **Enhancements to Major Compaction tool**
9241
9242 MajorCompactorTTL Tool allows to compact all regions in a table that have been TTLed out. This saves space on DFS and is useful for tables which are similar to time series data. This is typically scheduled to run frequently (say via cron) to cleanup old data on an ongoing basis.
9243
9244 RSGroupMajorCompactionTTL tool is similar to MajorCompactorTTL but runs at a region server group level. If multiple tables in an rsgroup are similar to time-series data, then it runs a single command to clean them up. As more tables are added/removed from rsgroup, it's easy to have a single command to take care of all of them.
9245
9246
9247 ---
9248
9249 * [HBASE-22054](https://issues.apache.org/jira/browse/HBASE-22054) | *Minor* | **Space Quota: Compaction is not working for super user in case of NO\_WRITES\_COMPACTIONS**
9250
9251 This change allows the system and superusers to initiate compactions, even when a space quota violation policy disallows compactions from happening. The original intent behind disallowing of compactions was to prevent end-user compactions from creating undue I/O load, not disallowing \*any\* compaction in the system.
9252
9253
9254 ---
9255
9256 * [HBASE-22083](https://issues.apache.org/jira/browse/HBASE-22083) | *Minor* | **move eclipse specific configs into a profile**
9257
9258 <!-- markdown -->
9259 Maven project integration for Eclipse has been isolated into a maven profile to ensure it only is active when in an Eclipse project.
9260
9261 Things should continue to behave the same for Eclipse users. If something should go wrong folks should manually activate the `eclipse-specific` profile.
9262
9263
9264 ---
9265
9266 * [HBASE-22307](https://issues.apache.org/jira/browse/HBASE-22307) | *Major* | **Deprecated Preemptive Fail Fast**
9267
9268 Deprecated Preemptive Fail Fast related constants in HConstants, the support of this feature will be removed in 3.0.0 so use these constants will have no effect for 3.0.0+ releases. And the constants will be kept till 4.0.0.
9269
9270 Users can use 'hbase.client.perserver.requests.threshold' to control the number of concurrent requests to the same region server. Please see the release note of HBASE-16388 for more details.
9271
9272
9273 ---
9274
9275 * [HBASE-22292](https://issues.apache.org/jira/browse/HBASE-22292) | *Blocker* | **PreemptiveFastFailInterceptor clean repeatedFailuresMap issue**
9276
9277 Adds new configuration hbase.client.failure.map.cleanup.interval which defaults to ten minutes.
9278
9279
9280 ---
9281
9282 * [HBASE-19222](https://issues.apache.org/jira/browse/HBASE-19222) | *Major* | **update jruby to 9.1.17.0**
9283
9284 <!-- markdown -->
9285
9286 The default version of JRuby shipped with HBase has been updated to the JRuby 9.1.17.0 release.
9287
9288 For details on changes see [the release notes for JRuby 9.1.17.0](https://www.jruby.org/2018/04/23/jruby-9-1-17-0)
9289
9290
9291 ---
9292
9293 * [HBASE-22279](https://issues.apache.org/jira/browse/HBASE-22279) | *Major* | **Add a getRegionLocator method in Table/AsyncTable interface**
9294
9295 Add below method in Table interface:
9296
9297 RegionLocator getRegionLocator() throws IOException;
9298
9299 Add below methods in AsyncTable interface:
9300
9301 AsyncTableRegionLocator getRegionLocator();
9302 CompletableFuture\<TableDescriptor\> getDescriptor();
9303
9304
9305 ---
9306
9307 * [HBASE-15560](https://issues.apache.org/jira/browse/HBASE-15560) | *Major* | **TinyLFU-based BlockCache**
9308
9309 LruBlockCache uses the Segmented LRU (SLRU) policy to capture frequency and recency of the working set. It achieves concurrency by using an O(n) background thread to prioritize the entries and evict. Accessing an entry is O(1) by a hash table lookup, recording its logical access time, and setting a frequency flag. A write is performed in O(1) time by updating the hash table and triggering an async eviction thread. This provides ideal concurrency and minimizes the latencies by penalizing the thread instead of the caller. However the policy does not age the frequencies and may not be resilient to various workload patterns.
9310
9311 This change introduces a new L1 policy, TinyLfuBlockCache, which records the frequency in a counting sketch, ages periodically by halving the counters, and orders entries by SLRU. An entry is discarded by comparing the frequency of the new arrival to the SLRU's victim, and keeping the one with the highest frequency. This allows the operations to be performed in O(1) time and, though the use of a compact sketch, a much larger history is retained beyond the current working set. In a variety of real world traces the policy had near optimal hit rates.
9312
9313 New configuration variable hfile.block.cache.policy sets the eviction policy for the L1 block cache. The default is "LRU" (LruBlockCache). Set to "TinyLFU" to use TinyLfuBlockCache instead.
9314
9315
9316 ---
9317
9318 * [HBASE-22178](https://issues.apache.org/jira/browse/HBASE-22178) | *Major* | **Introduce a createTableAsync with TableDescriptor method in Admin**
9319
9320 Introduced
9321
9322 Future\<Void\> createTableAsync(TableDescriptor);
9323
9324
9325 ---
9326
9327 * [HBASE-22108](https://issues.apache.org/jira/browse/HBASE-22108) | *Major* | **Avoid passing null in Admin methods**
9328
9329 Introduced these methods:
9330 void move(byte[]);
9331 void move(byte[], ServerName);
9332 Future\<Void\> splitRegionAsync(byte[]);
9333
9334 These methods are deprecated:
9335 void move(byte[], byte[])
9336
9337
9338 ---
9339
9340 * [HBASE-22152](https://issues.apache.org/jira/browse/HBASE-22152) | *Major* | **Create a jenkins file for yetus to processing GitHub PR**
9341
9342 Add a new jenkins file for running pre commit check for GitHub PR.
9343
9344
9345 ---
9346
9347 * [HBASE-22007](https://issues.apache.org/jira/browse/HBASE-22007) | *Major* | **Add restoreSnapshot and cloneSnapshot with acl methods in AsyncAdmin**
9348
9349 Add cloneSnapshot/restoreSnapshot with acl methods in AsyncAdmin.
9350
9351
9352 ---
9353
9354 * [HBASE-22123](https://issues.apache.org/jira/browse/HBASE-22123) | *Minor* | **REST gateway reports Insufficient permissions exceptions as 404 Not Found**
9355
9356 When insufficient permissions, you now get:
9357
9358 HTTP/1.1 403 Forbidden
9359
9360 on the HTTP side, and in the message
9361
9362 Forbidden
9363 org.apache.hadoop.hbase.security.AccessDeniedException: org.apache.hadoop.hbase.security.AccessDeniedException: Insufficient permissions for user ‘myuser',action: get, tableName:mytable, family:cf.
9364 at org.apache.ranger.authorization.hbase.RangerAuthorizationCoprocessor.authorizeAccess(RangerAuthorizationCoprocessor.java:547)
9365 and the rest of the ADE stack
9366
9367
9368 ---
9369
9370 * [HBASE-22100](https://issues.apache.org/jira/browse/HBASE-22100) | *Minor* | **False positive for error prone warnings in pre commit job**
9371
9372 Now we will sort the javac WARNING/ERROR before generating diff in pre-commit so we can get a stable output for the error prone. The downside is that we just sort the output lexicographically so the line number will also be sorted lexicographically, which is a bit strange to human.
9373
9374
9375 ---
9376
9377 * [HBASE-22057](https://issues.apache.org/jira/browse/HBASE-22057) | *Major* | **Impose upper-bound on size of ZK ops sent in a single multi()**
9378
9379 Exposes a new configuration property "zookeeper.multi.max.size" which dictates the maximum size of deletes that HBase will make to ZooKeeper in a single RPC. This property defaults to 1MB, which should fall beneath the default ZooKeeper limit of 2MB, controlled by "jute.maxbuffer".
9380
9381
9382 ---
9383
9384 * [HBASE-22052](https://issues.apache.org/jira/browse/HBASE-22052) | *Major* | **pom cleaning; filter out jersey-core in hadoop2 to match hadoop3 and remove redunant version specifications**
9385
9386 <!-- markdown -->
9387 Fixed awkward dependency issue that prevented site building.
9388
9389 #### note specific to HBase 2.1.4
9390 HBase 2.1.4 shipped with an early version of this fix that incorrectly altered the libraries included in our binary assembly for using Apache Hadoop 2.7 (the current build default Hadoop version for 2.1.z). For folks running out of the box against a Hadoop 2.7 cluster (or folks who skip the installation step of [replacing the bundled Hadoop libraries](http://hbase.apache.org/book.html#hadoop)) this will result in a failure at Region Server startup due to a missing class definition. e.g.:
9391 ```
9392 2019-03-27 09:02:05,779 ERROR [main] regionserver.HRegionServer: Failed construction RegionServer
9393 java.lang.NoClassDefFoundError: org/apache/htrace/SamplerBuilder
9394         at org.apache.hadoop.hdfs.DFSClient.<init>(DFSClient.java:644)
9395         at org.apache.hadoop.hdfs.DFSClient.<init>(DFSClient.java:628)
9396         at org.apache.hadoop.hdfs.DistributedFileSystem.initialize(DistributedFileSystem.java:149)
9397         at org.apache.hadoop.fs.FileSystem.createFileSystem(FileSystem.java:2667)
9398         at org.apache.hadoop.fs.FileSystem.access$200(FileSystem.java:93)
9399         at org.apache.hadoop.fs.FileSystem$Cache.getInternal(FileSystem.java:2701)
9400         at org.apache.hadoop.fs.FileSystem$Cache.get(FileSystem.java:2683)
9401         at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:372)
9402         at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:171)
9403         at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:356)
9404         at org.apache.hadoop.fs.Path.getFileSystem(Path.java:295)
9405         at org.apache.hadoop.hbase.util.CommonFSUtils.getRootDir(CommonFSUtils.java:362)
9406         at org.apache.hadoop.hbase.util.CommonFSUtils.isValidWALRootDir(CommonFSUtils.java:411)
9407         at org.apache.hadoop.hbase.util.CommonFSUtils.getWALRootDir(CommonFSUtils.java:387)
9408         at org.apache.hadoop.hbase.regionserver.HRegionServer.initializeFileSystem(HRegionServer.java:704)
9409         at org.apache.hadoop.hbase.regionserver.HRegionServer.<init>(HRegionServer.java:613)
9410         at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
9411         at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:62)
9412         at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)
9413         at java.lang.reflect.Constructor.newInstance(Constructor.java:423)
9414         at org.apache.hadoop.hbase.regionserver.HRegionServer.constructRegionServer(HRegionServer.java:3029)
9415         at org.apache.hadoop.hbase.regionserver.HRegionServerCommandLine.start(HRegionServerCommandLine.java:63)
9416         at org.apache.hadoop.hbase.regionserver.HRegionServerCommandLine.run(HRegionServerCommandLine.java:87)
9417         at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
9418         at org.apache.hadoop.hbase.util.ServerCommandLine.doMain(ServerCommandLine.java:149)
9419         at org.apache.hadoop.hbase.regionserver.HRegionServer.main(HRegionServer.java:3047)
9420 Caused by: java.lang.ClassNotFoundException: org.apache.htrace.SamplerBuilder
9421         at java.net.URLClassLoader.findClass(URLClassLoader.java:381)
9422         at java.lang.ClassLoader.loadClass(ClassLoader.java:424)
9423         at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:349)
9424         at java.lang.ClassLoader.loadClass(ClassLoader.java:357)
9425         ... 26 more
9426
9427 ```
9428
9429 Workaround via any _one_ of the following:
9430 * If you are running against a Hadoop cluster that is 2.8+, ensure you replace the Hadoop libaries in the default binary assembly with those for your version.
9431 * If you are running against a Hadoop cluster that is 2.8+, build the binary assembly from the source release while specifying your Hadoop version.
9432 * If you are running against a Hadoop cluster that is a supported 2.7 release, ensure the `hadoop` executable is in the `PATH` seen at Region Server startup and that you are not using the `HBASE_DISABLE_HADOOP_CLASSPATH_LOOKUP` bypass.
9433 * For any supported Hadoop version, manually make the Apache HTrace artifact `htrace-core-3.1.0-incubating.jar` available to all Region Servers via the HBASE_CLASSPATH environment variable.
9434 * For any supported Hadoop version, manually make the Apache HTrace artifact `htrace-core-3.1.0-incubating.jar` available to all Region Servers by copying it into the directory `${HBASE_HOME}/lib/client-facing-thirdparty/`.
9435
9436
9437 ---
9438
9439 * [HBASE-22065](https://issues.apache.org/jira/browse/HBASE-22065) | *Major* | **Add listTableDescriptors(List\<TableName\>) method in AsyncAdmin**
9440
9441 Add a listTableDescriptors(List\<TableName\>) method in the AsyncAdmin interface, to align with the Admin interface.
9442
9443
9444 ---
9445
9446 * [HBASE-22063](https://issues.apache.org/jira/browse/HBASE-22063) | *Major* | **Deprecated Admin.deleteSnapshot(byte[])**
9447
9448 Deprecate Admin.deleteSnapshot(byte[]), please use the String version instead.
9449
9450
9451 ---
9452
9453 * [HBASE-22040](https://issues.apache.org/jira/browse/HBASE-22040) | *Major* | **Add mergeRegionsAsync with a List of region names method in AsyncAdmin**
9454
9455 Add a mergeRegionsAsync(byte[][], boolean) method in the AsyncAdmin interface.
9456
9457 Instead of using assert, now we will throw IllegalArgumentException when you want to merge less than 2 regions at client side. And also, at master side, instead of using assert, now we will throw DoNotRetryIOException if you want merge more than 2 regions, since we only support merging two regions at once for now.
9458
9459
9460 ---
9461
9462 * [HBASE-22039](https://issues.apache.org/jira/browse/HBASE-22039) | *Major* | **Should add the synchronous parameter for the XXXSwitch method in AsyncAdmin**
9463
9464 Add drainXXX parameter for balancerSwitch/splitSwitch/mergeSwitch methods in the AsyncAdmin interface, which has the same meaning with the synchronous parameter for these methods in the Admin interface.
9465
9466
9467 ---
9468
9469 * [HBASE-22044](https://issues.apache.org/jira/browse/HBASE-22044) | *Major* | **ByteBufferUtils should not be IA.Public API**
9470
9471 <!-- markdown -->
9472
9473 As of HBase 3.0, the ByteBufferUtils class is now marked as a Private API for internal project use only. Downstream users are advised that it no longer has any compatibility promises across releases.
9474
9475 As of earlier HBase release lines the class is now marked as deprecated to call attention to this planned transition.
9476
9477
9478 ---
9479
9480 * [HBASE-21810](https://issues.apache.org/jira/browse/HBASE-21810) | *Major* | **bulkload  support set hfile compression on client**
9481
9482 bulkload (HFileOutputFormat2)  support config the compression on client ,you can set the job configuration "hbase.mapreduce.hfileoutputformat.compression"  override the auto-detection of the target table's compression
9483
9484
9485 ---
9486
9487 * [HBASE-22001](https://issues.apache.org/jira/browse/HBASE-22001) | *Major* | **Polish the Admin interface**
9488
9489 Add a cloneSnapshotAsync method with restoreAcl parameter.
9490 Deprecated restoreSnapshotAsync method as it just ignores the failsafe configuration.
9491 Make snapshotAsync method returns a Future\<Void\>.
9492 Deprecated the snapshot related methods which take a 'byte[]' as the snapshot name.
9493 Use default methods to reduce the code base for implementation classes.
9494
9495
9496 ---
9497
9498 * [HBASE-22000](https://issues.apache.org/jira/browse/HBASE-22000) | *Major* | **Deprecated isTableAvailable with splitKeys**
9499
9500 Deprecated AsyncTable.isTableAvailable(TableName, byte[][]).
9501
9502
9503 ---
9504
9505 * [HBASE-21871](https://issues.apache.org/jira/browse/HBASE-21871) | *Major* | **Support to specify a peer table name in VerifyReplication tool**
9506
9507 After HBASE-21871, we can specify a peer table name with --peerTableName in VerifyReplication tool like the following:
9508 hbase org.apache.hadoop.hbase.mapreduce.replication.VerifyReplication --peerTableName=peerTable 5 TestTable
9509
9510 In addition, we can compare any 2 tables in any remote clusters with specifying both peerId and --peerTableName.
9511
9512 For example:
9513 hbase org.apache.hadoop.hbase.mapreduce.replication.VerifyReplication --peerTableName=peerTable zk1,zk2,zk3:2181/hbase TestTable
9514
9515
9516 ---
9517
9518 * [HBASE-15728](https://issues.apache.org/jira/browse/HBASE-15728) | *Major* | **Add remaining per-table region / store / flush / compaction related metrics**
9519
9520 Adds below flush, split, and compaction metrics
9521
9522  +  // split related metrics
9523  +  private MutableFastCounter splitRequest;
9524  +  private MutableFastCounter splitSuccess;
9525  +  private MetricHistogram splitTimeHisto;
9526  +
9527  +  // flush related metrics
9528  +  private MetricHistogram flushTimeHisto;
9529  +  private MetricHistogram flushMemstoreSizeHisto;
9530  +  private MetricHistogram flushOutputSizeHisto;
9531  +  private MutableFastCounter flushedMemstoreBytes;
9532  +  private MutableFastCounter flushedOutputBytes;
9533  +
9534  +  // compaction related metrics
9535  +  private MetricHistogram compactionTimeHisto;
9536  +  private MetricHistogram compactionInputFileCountHisto;
9537  +  private MetricHistogram compactionInputSizeHisto;
9538  +  private MetricHistogram compactionOutputFileCountHisto;
9539  +  private MetricHistogram compactionOutputSizeHisto;
9540  +  private MutableFastCounter compactedInputBytes;
9541  +  private MutableFastCounter compactedOutputBytes;
9542  +
9543  +  private MetricHistogram majorCompactionTimeHisto;
9544  +  private MetricHistogram majorCompactionInputFileCountHisto;
9545  +  private MetricHistogram majorCompactionInputSizeHisto;
9546  +  private MetricHistogram majorCompactionOutputFileCountHisto;
9547  +  private MetricHistogram majorCompactionOutputSizeHisto;
9548  +  private MutableFastCounter majorCompactedInputBytes;
9549  +  private MutableFastCounter majorCompactedOutputBytes;
9550
9551
9552 ---
9553
9554 * [HBASE-21481](https://issues.apache.org/jira/browse/HBASE-21481) | *Major* | **[acl] Superuser's permissions should not be granted or revoked by any non-su global admin**
9555
9556 HBASE-21481 improves the quality of access control, by strengthening the protection of super users's privileges.
9557
9558
9559 ---
9560
9561 * [HBASE-21082](https://issues.apache.org/jira/browse/HBASE-21082) | *Critical* | **Reimplement assign/unassign related procedure metrics**
9562
9563 Now we have four types of RIT procedure metrics, assign, unassign, move, reopen. The meaning of assign/unassign is changed, as we will not increase the unassign metric and then the assign metric when moving a region.
9564 Also introduced two new procedure metrics, open and close, which are used to track the open/close region calls to region server. We may send open/close multiple times to finish a RIT since we may retry multiple times.
9565
9566
9567 ---
9568
9569 * [HBASE-20724](https://issues.apache.org/jira/browse/HBASE-20724) | *Critical* | **Sometimes some compacted storefiles are still opened after region failover**
9570
9571 Problem: This is an old problem since HBASE-2231. The compaction event marker was only writed to WAL. But after flush, the WAL may be archived, which means an useful compaction event marker be deleted, too. So the compacted store files cannot be archived when region open and replay WAL.
9572
9573 Solution: After this jira, the compaction event tracker will be writed to HFile. When region open and load store files, read the compaction evnet tracker from HFile and archive the compacted store files which still exist.
9574
9575
9576 ---
9577
9578 * [HBASE-21820](https://issues.apache.org/jira/browse/HBASE-21820) | *Major* | **Implement CLUSTER quota scope**
9579
9580 HBase contains two quota scopes: MACHINE and CLUSTER. Before this patch, set quota operations did not expose scope option to client api and use MACHINE as default, CLUSTER scope can not be set and used.
9581 Shell commands are as follows:
9582 set\_quota, TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec'
9583
9584 This issue implements CLUSTER scope in a simple way: For user, namespace, user over namespace quota, use [ClusterLimit / RSNum] as machine limit. For table and user over table quota, use [ClusterLimit / TotalTableRegionNum \* MachineTableRegionNum] as machine limit.
9585 After this patch, user can set CLUSTER scope quota, but MACHINE is still default if user ignore scope.
9586 Shell commands are as follows:
9587 set\_quota, TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec'
9588 set\_quota, TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec', SCOPE =\> MACHINE
9589 set\_quota, TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec', SCOPE =\> CLUSTER
9590
9591
9592 ---
9593
9594 * [HBASE-21057](https://issues.apache.org/jira/browse/HBASE-21057) | *Minor* | **upgrade to latest spotbugs**
9595
9596 Change spotbugs version to 3.1.11.
9597
9598
9599 ---
9600
9601 * [HBASE-21505](https://issues.apache.org/jira/browse/HBASE-21505) | *Major* | **Several inconsistencies on information reported for Replication Sources by hbase shell status 'replication' command.**
9602
9603 This modifies "status 'replication'" output, fixing inconsistencies on the reporting times and ages of last shipped edits, as well as wrong calculation of replication lags.
9604
9605 It also introduces additional info for each recovery queue, which was not accounted by this command before.
9606
9607 The new output for "status 'replication'" command is explained in details below:
9608 a) Source started, target stopped, no edits arrived on source yet:
9609 ...
9610  SOURCE: PeerID=1
9611          Normal Queue: 1
9612            No Ops shipped since last restart, SizeOfLogQueue=1, No edits for this source since it started, Replication Lag=0
9613 ...
9614 b) Source started, target stopped, add edit on source:
9615 ...
9616 Normal Queue: 1
9617            No Ops shipped since last restart, SizeOfLogQueue=1, TimeStampOfLastArrivedInSource=Wed Nov 21 07:21:00 GMT 2018, Replication Lag=2459
9618 ...
9619 c) Source started, target stopped, edit added on source, restart source:
9620 ...
9621 SOURCE: PeerID=1
9622          Normal Queue: 1
9623            No Ops shipped since last restart, SizeOfLogQueue=1, No edits for this source since it started, Replication Lag=0
9624          Recovered Queue: 1-hbase01.home,16020,1542784524057
9625            No Ops shipped since last restart, SizeOfLogQueue=1, TimeStampOfLastArrivedInSource=Wed Nov 21 07:23:00 GMT 2018, Replication Lag=201495
9626 ...
9627 d) Source started, target stopped, add edit on source, restart source, add another edit on source:
9628 ...
9629 SOURCE: PeerID=1
9630          Normal Queue: 1
9631            No Ops shipped since last restart, SizeOfLogQueue=1, TimeStampOfLastArrivedInSource=Wed Nov 21 07:02:28 GMT 2018, Replication Lag=6349
9632          Recovered Queue: 1-hbase01.home,16020,1542782758742
9633            No Ops shipped since last restart, SizeOfLogQueue=0, TimeStampOfLastArrivedInSource=Wed Nov 21 06:53:05 GMT 2018, Replication Lag=569394
9634 ...
9635 e) Source started, target stopped, add edit on source, restart source, add another edit on source, start target:
9636 ...
9637        SOURCE: PeerID=1
9638          Normal Queue: 1
9639            AgeOfLastShippedOp=30000, TimeStampOfLastShippedOp=Wed Nov 21 07:07:58 GMT 2018, SizeOfLogQueue=1, TimeStampOfLastArrivedInSource=Wed Nov 21 07:02:28 GMT 2018, Replication Lag=0
9640 ...
9641 f) Source started, target stopped, add edit on source, restart source, restart target:
9642 ...
9643 SOURCE: PeerID=1
9644          Normal Queue: 1
9645            No Ops shipped since last restart, SizeOfLogQueue=1, No edits for this source since it started, Replication Lag=0
9646 ...
9647
9648
9649 ---
9650
9651 * [HBASE-21922](https://issues.apache.org/jira/browse/HBASE-21922) | *Major* | **BloomContext#sanityCheck may failed when use ROWPREFIX\_DELIMITED bloom filter**
9652
9653 Remove bloom filter type ROWPREFIX\_DELIMITED. May add it back when find a better solution.
9654
9655
9656 ---
9657
9658 * [HBASE-21783](https://issues.apache.org/jira/browse/HBASE-21783) | *Major* | **Support exceed user/table/ns throttle quota if region server has available quota**
9659
9660 Support enable or disable exceed throttle quota. Exceed throttle quota means, user can over consume user/namespace/table quota if region server has additional available quota because other users don't consume at the same time.
9661 Use the following shell commands to enable/disable exceed throttle quota: enable\_exceed\_throttle\_quota
9662 disable\_exceed\_throttle\_quota
9663 There are two limits when enable exceed throttle quota:
9664 1. Must set at least one read and one write region server throttle quota;
9665 2. All region server throttle quotas must be in seconds time unit. Because once previous requests exceed their quota and consume region server quota, quota in other time units may be refilled in a long time, this may affect later requests.
9666
9667
9668 ---
9669
9670 * [HBASE-20587](https://issues.apache.org/jira/browse/HBASE-20587) | *Major* | **Replace Jackson with shaded thirdparty gson**
9671
9672 Remove jackson dependencies from most hbase modules except hbase-rest, use shaded gson instead. The output json will be a bit different since jackson can use getter/setter, but gson will always use the fields.
9673
9674
9675 ---
9676
9677 * [HBASE-21928](https://issues.apache.org/jira/browse/HBASE-21928) | *Major* | **Deprecated HConstants.META\_QOS**
9678
9679 Mark HConstants.META\_QOS as deprecated. It is for internal use only, which is the highest priority. You should not try to set a priority greater than or equal to this value, although it is no harm but also useless.
9680
9681
9682 ---
9683
9684 * [HBASE-17942](https://issues.apache.org/jira/browse/HBASE-17942) | *Major* | **Disable region splits and merges per table**
9685
9686 This patch adds the ability to disable split and/or merge for a table (By default, split and merge are enabled for a table).
9687
9688
9689 ---
9690
9691 * [HBASE-21636](https://issues.apache.org/jira/browse/HBASE-21636) | *Major* | **Enhance the shell scan command to support missing scanner specifications like ReadType, IsolationLevel etc.**
9692
9693 Allows shell to set Scan options previously not exposed. See additions as part of the scan help by typing following hbase shell:
9694
9695 hbase\> help 'scan'
9696
9697
9698 ---
9699
9700 * [HBASE-21201](https://issues.apache.org/jira/browse/HBASE-21201) | *Major* | **Support to run VerifyReplication MR tool without peerid**
9701
9702 We can specify peerQuorumAddress instead of peerId in VerifyReplication tool. So it no longer requires peerId to be setup when using this tool.
9703
9704 For example:
9705 hbase org.apache.hadoop.hbase.mapreduce.replication.VerifyReplication zk1,zk2,zk3:2181/hbase testTable
9706
9707
9708 ---
9709
9710 * [HBASE-21838](https://issues.apache.org/jira/browse/HBASE-21838) | *Major* | **Create a special ReplicationEndpoint just for verifying the WAL entries are fine**
9711
9712 Introduce a VerifyWALEntriesReplicationEndpoint which replicates nothing but only verifies if all the cells are valid.
9713 It can be used to capture bugs for writing WAL, as most times we will not read the WALs again after writing it if there are no region server crashes.
9714
9715
9716 ---
9717
9718 * [HBASE-21764](https://issues.apache.org/jira/browse/HBASE-21764) | *Major* | **Size of in-memory compaction thread pool should be configurable**
9719
9720 Introduced an new config key in this issue: hbase.regionserver.inmemory.compaction.pool.size. the default value would be 10.  you can configure this to set the pool size of in-memory compaction pool. Note that all memstores in one region server will share the same pool, so if you have many regions in one region server,  you need to set this larger to compact faster for better read performance.
9721
9722
9723 ---
9724
9725 * [HBASE-21684](https://issues.apache.org/jira/browse/HBASE-21684) | *Major* | **Throw DNRIOE when connection or rpc client is closed**
9726
9727 Make StoppedRpcClientException extend DoNotRetryIOException.
9728
9729
9730 ---
9731
9732 * [HBASE-21739](https://issues.apache.org/jira/browse/HBASE-21739) | *Major* | **Move grant/revoke from regionserver to master**
9733
9734 To implement user permission control in Precedure V2, move grant and revoke method from AccessController to master firstly.
9735 Mark AccessController#grant and AccessController#revoke as deprecated and please use Admin#grant and Admin#revoke instead.
9736
9737
9738 ---
9739
9740 * [HBASE-21791](https://issues.apache.org/jira/browse/HBASE-21791) | *Blocker* | **Upgrade thrift dependency to 0.12.0**
9741
9742 IMPORTANT: Due to security issues, all users who use hbase thrift should avoid using releases which do not have this fix.
9743
9744 The effect releases are:
9745 2.1.x: 2.1.2 and below
9746 2.0.x: 2.0.4 and below
9747 1.x: 1.4.x and below
9748
9749 If you are using the effect releases above, please consider upgrading to a newer release ASAP.
9750
9751
9752 ---
9753
9754 * [HBASE-20894](https://issues.apache.org/jira/browse/HBASE-20894) | *Major* | **Move BucketCache from java serialization to protobuf**
9755
9756 For users who have configured hbase.bucketcache.ioengine with either the file:, files:, or mmap: prefix, and configured it to be persistent via the hbase.bucketcache.persistent.path property, the serialization format of the bucket cache has changed between versions. The old state will not be read during startup, and there is currently no migration path. The impact is expected to be minimal, however, since the cache will rebuild over time as access patterns dictate.
9757
9758
9759
9760
9761 # HBASE  2.2.0 Release Notes
9762
9763 These release notes cover new developer and user-facing incompatibilities, important issues, features, and major improvements.
9764
9765
9766 ---
9767
9768 * [HBASE-21970](https://issues.apache.org/jira/browse/HBASE-21970) | *Major* | **Document that how to upgrade from 2.0 or 2.1 to 2.2+**
9769
9770 See the document http://hbase.apache.org/book.html#upgrade2.2 about how to upgrade from 2.0 or 2.1 to 2.2+.
9771
9772 HBase 2.2+ uses a new Procedure form assiging/unassigning/moving Regions. It does not process HBase 2.1 and 2.0's Unassign/Assign Procedure types. Upgrade requires that we first drain the Master Procedure Store of old style Procedures before starting the new 2.2 Master. So you need to make sure that before you kill the old version (2.0 or 2.1) Master, there is no region in transition. And once the new version (2.2+) Master is up, you can rolling upgrade RegionServers one by one.
9773
9774 And there is a more safer way if you are running 2.1.1+ or 2.0.3+ cluster. It need four steps to upgrade Master.
9775
9776 1. Shutdown both active and standby Masters (Your cluster will continue to server reads and writes without interruption).
9777 2. Set the property hbase.procedure.upgrade-to-2-2 to true in hbase-site.xml for the Master, and start only one Master, still using the 2.1.1+ (or 2.0.3+) version.
9778 3. Wait until the Master quits. Confirm that there is a 'READY TO ROLLING UPGRADE' message in the Master log as the cause of the shutdown. The Procedure Store is now empty.
9779 4. Start new Masters with the new 2.2+ version.
9780
9781 Then you can rolling upgrade RegionServers one by one. See HBASE-21075 for more details.
9782
9783
9784 ---
9785
9786 * [HBASE-21536](https://issues.apache.org/jira/browse/HBASE-21536) | *Trivial* | **Fix completebulkload usage instructions**
9787
9788 Added completebulkload short name for BulkLoadHFilesTool to bin/hbase.
9789
9790
9791 ---
9792
9793 * [HBASE-22500](https://issues.apache.org/jira/browse/HBASE-22500) | *Blocker* | **Modify pom and jenkins jobs for hadoop versions**
9794
9795 Change the default hadoop-3 version to 3.1.2. Drop the support for the releases which are effected by CVE-2018-8029, see this email https://lists.apache.org/thread.html/3d6831c3893cd27b6850aea2feff7d536888286d588e703c6ffd2e82@%3Cuser.hadoop.apache.org%3E
9796
9797
9798 ---
9799
9800 * [HBASE-22148](https://issues.apache.org/jira/browse/HBASE-22148) | *Blocker* | **Provide an alternative to CellUtil.setTimestamp**
9801
9802 <!-- markdown -->
9803
9804 The `CellUtil.setTimestamp` method changes to be an API with audience `LimitedPrivate(COPROC)` in HBase 3.0. With that designation the API should remain stable within a given minor release line, but may change between minor releases.
9805
9806 Previously, this method was deprecated in HBase 2.0 for removal in HBase 3.0. Deprecation messages in HBase 2.y releases have been updated to indicate the expected API audience change.
9807
9808
9809 ---
9810
9811 * [HBASE-21991](https://issues.apache.org/jira/browse/HBASE-21991) | *Major* | **Fix MetaMetrics issues - [Race condition, Faulty remove logic], few improvements**
9812
9813 The class LossyCounting was unintentionally marked Public but was never intended to be part of our public API. This oversight has been corrected and LossyCounting is now marked as Private and going forward may be subject to additional breaking changes or removal without notice. If you have taken a dependency on this class we recommend cloning it locally into your project before upgrading to this release.
9814
9815
9816 ---
9817
9818 * [HBASE-22226](https://issues.apache.org/jira/browse/HBASE-22226) | *Trivial* | **Incorrect level for headings in asciidoc**
9819
9820 Warnings for level headings are corrected in the book for the HBase Incompatibilities section.
9821
9822
9823 ---
9824
9825 * [HBASE-20970](https://issues.apache.org/jira/browse/HBASE-20970) | *Major* | **Update hadoop check versions for hadoop3 in hbase-personality**
9826
9827 Add hadoop 3.0.3, 3.1.1 3.1.2 in our hadoop check jobs.
9828
9829
9830 ---
9831
9832 * [HBASE-21784](https://issues.apache.org/jira/browse/HBASE-21784) | *Major* | **Dump replication queue should show list of wal files ordered chronologically**
9833
9834 The DumpReplicationQueues tool will now list replication queues sorted in chronological order.
9835
9836
9837 ---
9838
9839 * [HBASE-22384](https://issues.apache.org/jira/browse/HBASE-22384) | *Minor* | **Formatting issues in administration section of book**
9840
9841 Fixes a formatting issue in the administration section of the book, where listing indentation were a little bit off.
9842
9843
9844 ---
9845
9846 * [HBASE-22399](https://issues.apache.org/jira/browse/HBASE-22399) | *Major* | **Change default hadoop-two.version to 2.8.x and remove the 2.7.x hadoop checks**
9847
9848 Now the default hadoop-two.version has been changed to 2.8.5, and all hadoop versions before 2.8.2(exclude) will not be supported any more.
9849
9850
9851 ---
9852
9853 * [HBASE-22392](https://issues.apache.org/jira/browse/HBASE-22392) | *Trivial* | **Remove extra/useless +**
9854
9855 Removed extra + in HRegion, HStore and LoadIncrementalHFiles for branch-2 and HRegion and HStore for branch-1.
9856
9857
9858 ---
9859
9860 * [HBASE-20494](https://issues.apache.org/jira/browse/HBASE-20494) | *Major* | **Upgrade com.yammer.metrics dependency**
9861
9862 Updated metrics core from 3.2.1 to 3.2.6.
9863
9864
9865 ---
9866
9867 * [HBASE-22358](https://issues.apache.org/jira/browse/HBASE-22358) | *Minor* | **Change rubocop configuration for method length**
9868
9869 The rubocop definition for the maximum method length was set to 75.
9870
9871
9872 ---
9873
9874 * [HBASE-22379](https://issues.apache.org/jira/browse/HBASE-22379) | *Minor* | **Fix Markdown for "Voting on Release Candidates" in book**
9875
9876 Fixes the formatting of the "Voting on Release Candidates" to actually show the quote and code formatting of the RAT check.
9877
9878
9879 ---
9880
9881 * [HBASE-20851](https://issues.apache.org/jira/browse/HBASE-20851) | *Minor* | **Change rubocop config for max line length of 100**
9882
9883 The rubocop configuration in the hbase-shell module now allows a line length with 100 characters, instead of 80 as before. For everything before 2.1.5 this change introduces rubocop itself.
9884
9885
9886 ---
9887
9888 * [HBASE-22054](https://issues.apache.org/jira/browse/HBASE-22054) | *Minor* | **Space Quota: Compaction is not working for super user in case of NO\_WRITES\_COMPACTIONS**
9889
9890 This change allows the system and superusers to initiate compactions, even when a space quota violation policy disallows compactions from happening. The original intent behind disallowing of compactions was to prevent end-user compactions from creating undue I/O load, not disallowing \*any\* compaction in the system.
9891
9892
9893 ---
9894
9895 * [HBASE-22292](https://issues.apache.org/jira/browse/HBASE-22292) | *Blocker* | **PreemptiveFastFailInterceptor clean repeatedFailuresMap issue**
9896
9897 Adds new configuration hbase.client.failure.map.cleanup.interval which defaults to ten minutes.
9898
9899
9900 ---
9901
9902 * [HBASE-22155](https://issues.apache.org/jira/browse/HBASE-22155) | *Major* | **Move 2.2.0 on to hbase-thirdparty-2.2.0**
9903
9904  Updates libs used internally by hbase via hbase-thirdparty as follows:
9905
9906  gson 2.8.1 -\\\> 2.8.5
9907  guava 22.0 -\\\> 27.1-jre
9908  pb 3.5.1 -\\\> 3.7.0
9909  netty 4.1.17 -\\\> 4.1.34
9910  commons-collections4 4.1 -\\\> 4.3
9911
9912
9913 ---
9914
9915 * [HBASE-22178](https://issues.apache.org/jira/browse/HBASE-22178) | *Major* | **Introduce a createTableAsync with TableDescriptor method in Admin**
9916
9917 Introduced
9918
9919 Future\<Void\> createTableAsync(TableDescriptor);
9920
9921
9922 ---
9923
9924 * [HBASE-22108](https://issues.apache.org/jira/browse/HBASE-22108) | *Major* | **Avoid passing null in Admin methods**
9925
9926 Introduced these methods:
9927 void move(byte[]);
9928 void move(byte[], ServerName);
9929 Future\<Void\> splitRegionAsync(byte[]);
9930
9931 These methods are deprecated:
9932 void move(byte[], byte[])
9933
9934
9935 ---
9936
9937 * [HBASE-22152](https://issues.apache.org/jira/browse/HBASE-22152) | *Major* | **Create a jenkins file for yetus to processing GitHub PR**
9938
9939 Add a new jenkins file for running pre commit check for GitHub PR.
9940
9941
9942 ---
9943
9944 * [HBASE-22007](https://issues.apache.org/jira/browse/HBASE-22007) | *Major* | **Add restoreSnapshot and cloneSnapshot with acl methods in AsyncAdmin**
9945
9946 Add cloneSnapshot/restoreSnapshot with acl methods in AsyncAdmin.
9947
9948
9949 ---
9950
9951 * [HBASE-22123](https://issues.apache.org/jira/browse/HBASE-22123) | *Minor* | **REST gateway reports Insufficient permissions exceptions as 404 Not Found**
9952
9953 When insufficient permissions, you now get:
9954
9955 HTTP/1.1 403 Forbidden
9956
9957 on the HTTP side, and in the message
9958
9959 Forbidden
9960 org.apache.hadoop.hbase.security.AccessDeniedException: org.apache.hadoop.hbase.security.AccessDeniedException: Insufficient permissions for user ‘myuser',action: get, tableName:mytable, family:cf.
9961 at org.apache.ranger.authorization.hbase.RangerAuthorizationCoprocessor.authorizeAccess(RangerAuthorizationCoprocessor.java:547)
9962 and the rest of the ADE stack
9963
9964
9965 ---
9966
9967 * [HBASE-22100](https://issues.apache.org/jira/browse/HBASE-22100) | *Minor* | **False positive for error prone warnings in pre commit job**
9968
9969 Now we will sort the javac WARNING/ERROR before generating diff in pre-commit so we can get a stable output for the error prone. The downside is that we just sort the output lexicographically so the line number will also be sorted lexicographically, which is a bit strange to human.
9970
9971
9972 ---
9973
9974 * [HBASE-22057](https://issues.apache.org/jira/browse/HBASE-22057) | *Major* | **Impose upper-bound on size of ZK ops sent in a single multi()**
9975
9976 Exposes a new configuration property "zookeeper.multi.max.size" which dictates the maximum size of deletes that HBase will make to ZooKeeper in a single RPC. This property defaults to 1MB, which should fall beneath the default ZooKeeper limit of 2MB, controlled by "jute.maxbuffer".
9977
9978
9979 ---
9980
9981 * [HBASE-22052](https://issues.apache.org/jira/browse/HBASE-22052) | *Major* | **pom cleaning; filter out jersey-core in hadoop2 to match hadoop3 and remove redunant version specifications**
9982
9983 <!-- markdown -->
9984 Fixed awkward dependency issue that prevented site building.
9985
9986 #### note specific to HBase 2.1.4
9987 HBase 2.1.4 shipped with an early version of this fix that incorrectly altered the libraries included in our binary assembly for using Apache Hadoop 2.7 (the current build default Hadoop version for 2.1.z). For folks running out of the box against a Hadoop 2.7 cluster (or folks who skip the installation step of [replacing the bundled Hadoop libraries](http://hbase.apache.org/book.html#hadoop)) this will result in a failure at Region Server startup due to a missing class definition. e.g.:
9988 ```
9989 2019-03-27 09:02:05,779 ERROR [main] regionserver.HRegionServer: Failed construction RegionServer
9990 java.lang.NoClassDefFoundError: org/apache/htrace/SamplerBuilder
9991         at org.apache.hadoop.hdfs.DFSClient.<init>(DFSClient.java:644)
9992         at org.apache.hadoop.hdfs.DFSClient.<init>(DFSClient.java:628)
9993         at org.apache.hadoop.hdfs.DistributedFileSystem.initialize(DistributedFileSystem.java:149)
9994         at org.apache.hadoop.fs.FileSystem.createFileSystem(FileSystem.java:2667)
9995         at org.apache.hadoop.fs.FileSystem.access$200(FileSystem.java:93)
9996         at org.apache.hadoop.fs.FileSystem$Cache.getInternal(FileSystem.java:2701)
9997         at org.apache.hadoop.fs.FileSystem$Cache.get(FileSystem.java:2683)
9998         at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:372)
9999         at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:171)
10000         at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:356)
10001         at org.apache.hadoop.fs.Path.getFileSystem(Path.java:295)
10002         at org.apache.hadoop.hbase.util.CommonFSUtils.getRootDir(CommonFSUtils.java:362)
10003         at org.apache.hadoop.hbase.util.CommonFSUtils.isValidWALRootDir(CommonFSUtils.java:411)
10004         at org.apache.hadoop.hbase.util.CommonFSUtils.getWALRootDir(CommonFSUtils.java:387)
10005         at org.apache.hadoop.hbase.regionserver.HRegionServer.initializeFileSystem(HRegionServer.java:704)
10006         at org.apache.hadoop.hbase.regionserver.HRegionServer.<init>(HRegionServer.java:613)
10007         at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
10008         at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:62)
10009         at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)
10010         at java.lang.reflect.Constructor.newInstance(Constructor.java:423)
10011         at org.apache.hadoop.hbase.regionserver.HRegionServer.constructRegionServer(HRegionServer.java:3029)
10012         at org.apache.hadoop.hbase.regionserver.HRegionServerCommandLine.start(HRegionServerCommandLine.java:63)
10013         at org.apache.hadoop.hbase.regionserver.HRegionServerCommandLine.run(HRegionServerCommandLine.java:87)
10014         at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
10015         at org.apache.hadoop.hbase.util.ServerCommandLine.doMain(ServerCommandLine.java:149)
10016         at org.apache.hadoop.hbase.regionserver.HRegionServer.main(HRegionServer.java:3047)
10017 Caused by: java.lang.ClassNotFoundException: org.apache.htrace.SamplerBuilder
10018         at java.net.URLClassLoader.findClass(URLClassLoader.java:381)
10019         at java.lang.ClassLoader.loadClass(ClassLoader.java:424)
10020         at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:349)
10021         at java.lang.ClassLoader.loadClass(ClassLoader.java:357)
10022         ... 26 more
10023
10024 ```
10025
10026 Workaround via any _one_ of the following:
10027 * If you are running against a Hadoop cluster that is 2.8+, ensure you replace the Hadoop libaries in the default binary assembly with those for your version.
10028 * If you are running against a Hadoop cluster that is 2.8+, build the binary assembly from the source release while specifying your Hadoop version.
10029 * If you are running against a Hadoop cluster that is a supported 2.7 release, ensure the `hadoop` executable is in the `PATH` seen at Region Server startup and that you are not using the `HBASE_DISABLE_HADOOP_CLASSPATH_LOOKUP` bypass.
10030 * For any supported Hadoop version, manually make the Apache HTrace artifact `htrace-core-3.1.0-incubating.jar` available to all Region Servers via the HBASE_CLASSPATH environment variable.
10031 * For any supported Hadoop version, manually make the Apache HTrace artifact `htrace-core-3.1.0-incubating.jar` available to all Region Servers by copying it into the directory `${HBASE_HOME}/lib/client-facing-thirdparty/`.
10032
10033
10034 ---
10035
10036 * [HBASE-22065](https://issues.apache.org/jira/browse/HBASE-22065) | *Major* | **Add listTableDescriptors(List\<TableName\>) method in AsyncAdmin**
10037
10038 Add a listTableDescriptors(List\<TableName\>) method in the AsyncAdmin interface, to align with the Admin interface.
10039
10040
10041 ---
10042
10043 * [HBASE-22040](https://issues.apache.org/jira/browse/HBASE-22040) | *Major* | **Add mergeRegionsAsync with a List of region names method in AsyncAdmin**
10044
10045 Add a mergeRegionsAsync(byte[][], boolean) method in the AsyncAdmin interface.
10046
10047 Instead of using assert, now we will throw IllegalArgumentException when you want to merge less than 2 regions at client side. And also, at master side, instead of using assert, now we will throw DoNotRetryIOException if you want merge more than 2 regions, since we only support merging two regions at once for now.
10048
10049
10050 ---
10051
10052 * [HBASE-22039](https://issues.apache.org/jira/browse/HBASE-22039) | *Major* | **Should add the synchronous parameter for the XXXSwitch method in AsyncAdmin**
10053
10054 Add drainXXX parameter for balancerSwitch/splitSwitch/mergeSwitch methods in the AsyncAdmin interface, which has the same meaning with the synchronous parameter for these methods in the Admin interface.
10055
10056
10057 ---
10058
10059 * [HBASE-21810](https://issues.apache.org/jira/browse/HBASE-21810) | *Major* | **bulkload  support set hfile compression on client**
10060
10061 bulkload (HFileOutputFormat2)  support config the compression on client ,you can set the job configuration "hbase.mapreduce.hfileoutputformat.compression"  override the auto-detection of the target table's compression
10062
10063
10064 ---
10065
10066 * [HBASE-22000](https://issues.apache.org/jira/browse/HBASE-22000) | *Major* | **Deprecated isTableAvailable with splitKeys**
10067
10068 Deprecated AsyncTable.isTableAvailable(TableName, byte[][]).
10069
10070
10071 ---
10072
10073 * [HBASE-21871](https://issues.apache.org/jira/browse/HBASE-21871) | *Major* | **Support to specify a peer table name in VerifyReplication tool**
10074
10075 After HBASE-21871, we can specify a peer table name with --peerTableName in VerifyReplication tool like the following:
10076 hbase org.apache.hadoop.hbase.mapreduce.replication.VerifyReplication --peerTableName=peerTable 5 TestTable
10077
10078 In addition, we can compare any 2 tables in any remote clusters with specifying both peerId and --peerTableName.
10079
10080 For example:
10081 hbase org.apache.hadoop.hbase.mapreduce.replication.VerifyReplication --peerTableName=peerTable zk1,zk2,zk3:2181/hbase TestTable
10082
10083
10084 ---
10085
10086 * [HBASE-15728](https://issues.apache.org/jira/browse/HBASE-15728) | *Major* | **Add remaining per-table region / store / flush / compaction related metrics**
10087
10088 Adds below flush, split, and compaction metrics
10089
10090  +  // split related metrics
10091  +  private MutableFastCounter splitRequest;
10092  +  private MutableFastCounter splitSuccess;
10093  +  private MetricHistogram splitTimeHisto;
10094  +
10095  +  // flush related metrics
10096  +  private MetricHistogram flushTimeHisto;
10097  +  private MetricHistogram flushMemstoreSizeHisto;
10098  +  private MetricHistogram flushOutputSizeHisto;
10099  +  private MutableFastCounter flushedMemstoreBytes;
10100  +  private MutableFastCounter flushedOutputBytes;
10101  +
10102  +  // compaction related metrics
10103  +  private MetricHistogram compactionTimeHisto;
10104  +  private MetricHistogram compactionInputFileCountHisto;
10105  +  private MetricHistogram compactionInputSizeHisto;
10106  +  private MetricHistogram compactionOutputFileCountHisto;
10107  +  private MetricHistogram compactionOutputSizeHisto;
10108  +  private MutableFastCounter compactedInputBytes;
10109  +  private MutableFastCounter compactedOutputBytes;
10110  +
10111  +  private MetricHistogram majorCompactionTimeHisto;
10112  +  private MetricHistogram majorCompactionInputFileCountHisto;
10113  +  private MetricHistogram majorCompactionInputSizeHisto;
10114  +  private MetricHistogram majorCompactionOutputFileCountHisto;
10115  +  private MetricHistogram majorCompactionOutputSizeHisto;
10116  +  private MutableFastCounter majorCompactedInputBytes;
10117  +  private MutableFastCounter majorCompactedOutputBytes;
10118
10119
10120 ---
10121
10122 * [HBASE-20886](https://issues.apache.org/jira/browse/HBASE-20886) | *Critical* | **[Auth] Support keytab login in hbase client**
10123
10124 From 2.2.0, hbase supports client login via keytab. To use this feature, client should specify \`hbase.client.keytab.file\` and \`hbase.client.keytab.principal\` in hbase-site.xml, then the connection will contain the needed credentials which be renewed periodically to communicate with kerberized hbase cluster.
10125
10126
10127 ---
10128
10129 * [HBASE-21410](https://issues.apache.org/jira/browse/HBASE-21410) | *Major* | **A helper page that help find all problematic regions and procedures**
10130
10131 After HBASE-21410, we add a helper page to Master UI. This helper page is mainly to help HBase operator quickly found all regions and pids that are get stuck.
10132 There are 2 entries to get in this page.
10133 One is showing in the Regions in Transition section, it made "num region(s) in transition" a link that you can click and check all regions in transition and their related procedure IDs.
10134 The other one is showing in the table details section, it made the number of CLOSING or OPENING regions a link, which you can click and check regions and related procedure IDs of CLOSING or OPENING regions of a certain table.
10135 In this helper page, not only you can see all regions and related procedures, there are 2 buttons at the top which will show these regions or procedure IDs in text format. This is mainly aim to help operator to easily copy and paste all problematic procedure IDs and encoded region names to HBCK2's command line, by which we HBase operator can bypass these procedures or assign these regions.
10136
10137
10138 ---
10139
10140 * [HBASE-21588](https://issues.apache.org/jira/browse/HBASE-21588) | *Major* | **Procedure v2 wal splitting implementation**
10141
10142 After HBASE-21588, we introduce a new way to do WAL splitting coordination by procedure framework. This can simplify the process of WAL splitting and no need to connect zookeeper any more.
10143 During ServerCrashProcedure, it will create a SplitWALProcedure for each WAL that need to split. Then each SplitWALProcedure will spawn a SplitWALRemoteProcedure to send the request to regionserver.
10144 At the RegionServer side, whole process is handled by SplitWALCallable. It split the WAL and return the result to master.
10145 According to my test, this patch has a better performance as the number of WALs that need to split increase. And it can relieve the pressure on zookeeper.
10146
10147
10148 ---
10149
10150 * [HBASE-20734](https://issues.apache.org/jira/browse/HBASE-20734) | *Major* | **Colocate recovered edits directory with hbase.wal.dir**
10151
10152 Previously the recovered.edits directory was under the root directory. This JIRA moves the recovered.edits directory to be under the hbase.wal.dir if set. It also adds a check for any recovered.edits found under the root directory for backwards compatibility. This gives improvements when a faster media(like SSD) or more local FileSystem is used for the hbase.wal.dir than the root dir.
10153
10154
10155 ---
10156
10157 * [HBASE-20401](https://issues.apache.org/jira/browse/HBASE-20401) | *Minor* | **Make \`MAX\_WAIT\` and \`waitIfNotFinished\` in CleanerContext configurable**
10158
10159 When oldwals (and hfile) cleaner cleans stale wals (and hfiles), it will periodically check and wait the clean results from filesystem, the total wait time will be no more than a max time.
10160
10161 The periodically wait and check configurations are hbase.oldwals.cleaner.thread.check.interval.msec (default is 500 ms) and hbase.regionserver.hfilecleaner.thread.check.interval.msec (default is 1000 ms).
10162
10163 Meanwhile, The max time configurations are hbase.oldwals.cleaner.thread.timeout.msec and hbase.regionserver.hfilecleaner.thread.timeout.msec, they are set to 60 seconds by default.
10164
10165 All support dynamic configuration.
10166
10167 e.g. in the oldwals cleaning scenario, one may consider tuning hbase.oldwals.cleaner.thread.timeout.msec and hbase.oldwals.cleaner.thread.check.interval.msec
10168
10169 1. While deleting a oldwal never complete (strange but possible), then delete file task needs to wait for a max of 60 seconds. Here, 60 seconds might be too long, or the opposite way is to increase more than 60 seconds in the use cases of slow file delete.
10170 2. The check and wait of a file delete is set to default in the period of 500 milliseconds, one might want to tune this checking period to a short interval to check more frequently or to a longer interval to avoid checking too often to manage their delete file task checking period (the longer interval may be use to avoid checking too fast while using a high latency storage).
10171
10172
10173 ---
10174
10175 * [HBASE-21481](https://issues.apache.org/jira/browse/HBASE-21481) | *Major* | **[acl] Superuser's permissions should not be granted or revoked by any non-su global admin**
10176
10177 HBASE-21481 improves the quality of access control, by strengthening the protection of super users's privileges.
10178
10179
10180 ---
10181
10182 * [HBASE-21082](https://issues.apache.org/jira/browse/HBASE-21082) | *Critical* | **Reimplement assign/unassign related procedure metrics**
10183
10184 Now we have four types of RIT procedure metrics, assign, unassign, move, reopen. The meaning of assign/unassign is changed, as we will not increase the unassign metric and then the assign metric when moving a region.
10185 Also introduced two new procedure metrics, open and close, which are used to track the open/close region calls to region server. We may send open/close multiple times to finish a RIT since we may retry multiple times.
10186
10187
10188 ---
10189
10190 * [HBASE-20724](https://issues.apache.org/jira/browse/HBASE-20724) | *Critical* | **Sometimes some compacted storefiles are still opened after region failover**
10191
10192 Problem: This is an old problem since HBASE-2231. The compaction event marker was only writed to WAL. But after flush, the WAL may be archived, which means an useful compaction event marker be deleted, too. So the compacted store files cannot be archived when region open and replay WAL.
10193
10194 Solution: After this jira, the compaction event tracker will be writed to HFile. When region open and load store files, read the compaction evnet tracker from HFile and archive the compacted store files which still exist.
10195
10196
10197 ---
10198
10199 * [HBASE-21820](https://issues.apache.org/jira/browse/HBASE-21820) | *Major* | **Implement CLUSTER quota scope**
10200
10201 HBase contains two quota scopes: MACHINE and CLUSTER. Before this patch, set quota operations did not expose scope option to client api and use MACHINE as default, CLUSTER scope can not be set and used.
10202 Shell commands are as follows:
10203 set\_quota, TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec'
10204
10205 This issue implements CLUSTER scope in a simple way: For user, namespace, user over namespace quota, use [ClusterLimit / RSNum] as machine limit. For table and user over table quota, use [ClusterLimit / TotalTableRegionNum \* MachineTableRegionNum] as machine limit.
10206 After this patch, user can set CLUSTER scope quota, but MACHINE is still default if user ignore scope.
10207 Shell commands are as follows:
10208 set\_quota, TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec'
10209 set\_quota, TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec', SCOPE =\> MACHINE
10210 set\_quota, TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec', SCOPE =\> CLUSTER
10211
10212
10213 ---
10214
10215 * [HBASE-21057](https://issues.apache.org/jira/browse/HBASE-21057) | *Minor* | **upgrade to latest spotbugs**
10216
10217 Change spotbugs version to 3.1.11.
10218
10219
10220 ---
10221
10222 * [HBASE-21922](https://issues.apache.org/jira/browse/HBASE-21922) | *Major* | **BloomContext#sanityCheck may failed when use ROWPREFIX\_DELIMITED bloom filter**
10223
10224 Remove bloom filter type ROWPREFIX\_DELIMITED. May add it back when find a better solution.
10225
10226
10227 ---
10228
10229 * [HBASE-21783](https://issues.apache.org/jira/browse/HBASE-21783) | *Major* | **Support exceed user/table/ns throttle quota if region server has available quota**
10230
10231 Support enable or disable exceed throttle quota. Exceed throttle quota means, user can over consume user/namespace/table quota if region server has additional available quota because other users don't consume at the same time.
10232 Use the following shell commands to enable/disable exceed throttle quota: enable\_exceed\_throttle\_quota
10233 disable\_exceed\_throttle\_quota
10234 There are two limits when enable exceed throttle quota:
10235 1. Must set at least one read and one write region server throttle quota;
10236 2. All region server throttle quotas must be in seconds time unit. Because once previous requests exceed their quota and consume region server quota, quota in other time units may be refilled in a long time, this may affect later requests.
10237
10238
10239 ---
10240
10241 * [HBASE-20587](https://issues.apache.org/jira/browse/HBASE-20587) | *Major* | **Replace Jackson with shaded thirdparty gson**
10242
10243 Remove jackson dependencies from most hbase modules except hbase-rest, use shaded gson instead. The output json will be a bit different since jackson can use getter/setter, but gson will always use the fields.
10244
10245
10246 ---
10247
10248 * [HBASE-21928](https://issues.apache.org/jira/browse/HBASE-21928) | *Major* | **Deprecated HConstants.META\_QOS**
10249
10250 Mark HConstants.META\_QOS as deprecated. It is for internal use only, which is the highest priority. You should not try to set a priority greater than or equal to this value, although it is no harm but also useless.
10251
10252
10253 ---
10254
10255 * [HBASE-17942](https://issues.apache.org/jira/browse/HBASE-17942) | *Major* | **Disable region splits and merges per table**
10256
10257 This patch adds the ability to disable split and/or merge for a table (By default, split and merge are enabled for a table).
10258
10259
10260 ---
10261
10262 * [HBASE-21636](https://issues.apache.org/jira/browse/HBASE-21636) | *Major* | **Enhance the shell scan command to support missing scanner specifications like ReadType, IsolationLevel etc.**
10263
10264 Allows shell to set Scan options previously not exposed. See additions as part of the scan help by typing following hbase shell:
10265
10266 hbase\> help 'scan'
10267
10268
10269 ---
10270
10271 * [HBASE-21201](https://issues.apache.org/jira/browse/HBASE-21201) | *Major* | **Support to run VerifyReplication MR tool without peerid**
10272
10273 We can specify peerQuorumAddress instead of peerId in VerifyReplication tool. So it no longer requires peerId to be setup when using this tool.
10274
10275 For example:
10276 hbase org.apache.hadoop.hbase.mapreduce.replication.VerifyReplication zk1,zk2,zk3:2181/hbase testTable
10277
10278
10279 ---
10280
10281 * [HBASE-21838](https://issues.apache.org/jira/browse/HBASE-21838) | *Major* | **Create a special ReplicationEndpoint just for verifying the WAL entries are fine**
10282
10283 Introduce a VerifyWALEntriesReplicationEndpoint which replicates nothing but only verifies if all the cells are valid.
10284 It can be used to capture bugs for writing WAL, as most times we will not read the WALs again after writing it if there are no region server crashes.
10285
10286
10287 ---
10288
10289 * [HBASE-21727](https://issues.apache.org/jira/browse/HBASE-21727) | *Minor* | **Simplify documentation around client timeout**
10290
10291 Deprecated HBaseConfiguration#getInt(Configuration, String, String, int) method and removed it from 3.0.0 version.
10292
10293
10294 ---
10295
10296 * [HBASE-21764](https://issues.apache.org/jira/browse/HBASE-21764) | *Major* | **Size of in-memory compaction thread pool should be configurable**
10297
10298 Introduced an new config key in this issue: hbase.regionserver.inmemory.compaction.pool.size. the default value would be 10.  you can configure this to set the pool size of in-memory compaction pool. Note that all memstores in one region server will share the same pool, so if you have many regions in one region server,  you need to set this larger to compact faster for better read performance.
10299
10300
10301 ---
10302
10303 * [HBASE-21684](https://issues.apache.org/jira/browse/HBASE-21684) | *Major* | **Throw DNRIOE when connection or rpc client is closed**
10304
10305 Make StoppedRpcClientException extend DoNotRetryIOException.
10306
10307
10308 ---
10309
10310 * [HBASE-21739](https://issues.apache.org/jira/browse/HBASE-21739) | *Major* | **Move grant/revoke from regionserver to master**
10311
10312 To implement user permission control in Precedure V2, move grant and revoke method from AccessController to master firstly.
10313 Mark AccessController#grant and AccessController#revoke as deprecated and please use Admin#grant and Admin#revoke instead.
10314
10315
10316 ---
10317
10318 * [HBASE-21791](https://issues.apache.org/jira/browse/HBASE-21791) | *Blocker* | **Upgrade thrift dependency to 0.12.0**
10319
10320 IMPORTANT: Due to security issues, all users who use hbase thrift should avoid using releases which do not have this fix.
10321
10322 The effect releases are:
10323 2.1.x: 2.1.2 and below
10324 2.0.x: 2.0.4 and below
10325 1.x: 1.4.x and below
10326
10327 If you are using the effect releases above, please consider upgrading to a newer release ASAP.
10328
10329
10330 ---
10331
10332 * [HBASE-21792](https://issues.apache.org/jira/browse/HBASE-21792) | *Major* | **Mark HTableMultiplexer as deprecated and remove it in 3.0.0**
10333
10334 HTableMultiplexer exposes the implementation class, and it is incomplete, so we mark it as deprecated and remove it in 3.0.0 release.
10335
10336 There is no direct replacement for HTableMultiplexer, please use BufferedMutator if you want to batch mutations to a table.
10337
10338
10339 ---
10340
10341 * [HBASE-21782](https://issues.apache.org/jira/browse/HBASE-21782) | *Major* | **LoadIncrementalHFiles should not be IA.Public**
10342
10343 Introduce a BulkLoadHFiles interface which is marked as IA.Public, for doing bulk load programmatically.
10344 Introduce a BulkLoadHFilesTool which extends BulkLoadHFiles, and is marked as IA.LimitedPrivate(TOOLS), for using from command line.
10345 The old LoadIncrementalHFiles is deprecated and will be removed in 3.0.0.
10346
10347
10348 ---
10349
10350 * [HBASE-21762](https://issues.apache.org/jira/browse/HBASE-21762) | *Major* | **Move some methods in ClusterConnection to Connection**
10351
10352 Move the two getHbck method from ClusterConnection to Connection, and mark the methods as IA.LimitedPrivate(HBCK), as ClusterConnection is IA.Private and should not be depended by HBCK2.
10353
10354 Add a clearRegionLocationCache method in Connection to clear the region location cache for all the tables. As in RegionLocator, most of the methods have a 'reload' parameter, which implicitly tells user that we have a region location cache, so adding a method to clear the cache is fine.
10355
10356
10357 ---
10358
10359 * [HBASE-21713](https://issues.apache.org/jira/browse/HBASE-21713) | *Major* | **Support set region server throttle quota**
10360
10361 Support set region server rpc throttle quota which represents the read/write ability of region servers and throttles when region server's total requests exceeding the limit.
10362
10363 Use the following shell command to set RS quota:
10364 set\_quota TYPE =\> THROTTLE, REGIONSERVER =\> 'all', THROTTLE\_TYPE =\> WRITE, LIMIT =\> '20000req/sec'
10365 set\_quota TYPE =\> THROTTLE, REGIONSERVER =\> 'all', LIMIT =\> NONE
10366 "all" represents the throttle quota of all region servers and setting specified region server quota isn't supported currently.
10367
10368
10369 ---
10370
10371 * [HBASE-21689](https://issues.apache.org/jira/browse/HBASE-21689) | *Minor* | **Make table/namespace specific current quota info available in shell(describe\_namespace & describe)**
10372
10373 In shell commands "describe\_namespace" and "describe", which are used to see the descriptors of the namespaces and tables respectively, quotas set on that particular namespace/table will also be printed along.
10374
10375
10376 ---
10377
10378 * [HBASE-17370](https://issues.apache.org/jira/browse/HBASE-17370) | *Major* | **Fix or provide shell scripts to drain and decommission region server**
10379
10380 Adds shell support for the following:
10381 - List decommissioned/draining region servers
10382 - Decommission a list of region servers, optionally offload corresponding regions
10383 - Recommission a region server, optionally load a list of passed regions
10384
10385
10386 ---
10387
10388 * [HBASE-21734](https://issues.apache.org/jira/browse/HBASE-21734) | *Major* | **Some optimization in FilterListWithOR**
10389
10390 After HBASE-21620, the filterListWithOR has been a bit slow because we need to merge each sub-filter's RC , while before HBASE-21620, we will skip many RC merging, but the logic was wrong. So here we choose another way to optimaze the performance: removing the KeyValueUtil#toNewKeyCell.
10391 Anoop Sam John suggested that the KeyValueUtil#toNewKeyCell can save some GC before because if we copy key part of cell into a single byte[], then the block the cell refering won't be refered by the filter list any more, the upper layer can GC the data block quickly. while after HBASE-21620, we will update the prevCellList for every encountered cell now, so the lifecycle of cell in prevCellList for FilterList will be quite shorter. so just use the cell ref for saving cpu.
10392 BTW, we removed all the arrays streams usage in filter list, because it's also quite time-consuming in our test.
10393
10394
10395 ---
10396
10397 * [HBASE-21738](https://issues.apache.org/jira/browse/HBASE-21738) | *Critical* | **Remove all the CSLM#size operation in our memstore because it's an quite time consuming.**
10398
10399 We found the memstore snapshotting would cost much time because of calling the time-consuming ConcurrentSkipListMap#Size, it would make the p999 latency spike happen. So in this issue, we remove all ConcurrentSkipListMap#size in memstore by counting the cellsCount in MemstoreSizeing. As the issue described, the p999 latency spike was mitigated.
10400
10401
10402 ---
10403
10404 * [HBASE-21034](https://issues.apache.org/jira/browse/HBASE-21034) | *Major* | **Add new throttle type: read/write capacity unit**
10405
10406 Provides a new throttle type: capacity unit. One read/write/request capacity unit represents that read/write/read+write up to 1K data. If data size is more than 1K, then consume additional capacity units.
10407
10408 Use shell command to set capacity unit(CU):
10409 set\_quota TYPE =\> THROTTLE, THROTTLE\_TYPE =\> WRITE, USER =\> 'u1', LIMIT =\> '10CU/sec'
10410
10411 Use the "hbase.quota.read.capacity.unit" property to set the data size of one read capacity unit in bytes, the default value is 1K. Use the "hbase.quota.write.capacity.unit" property to set the data size of one write capacity unit in bytes, the default value is 1K.
10412
10413
10414 ---
10415
10416 * [HBASE-21595](https://issues.apache.org/jira/browse/HBASE-21595) | *Minor* | **Print thread's information and stack traces when RS is aborting forcibly**
10417
10418 Does thread dump on stdout on abort.
10419
10420
10421 ---
10422
10423 * [HBASE-21732](https://issues.apache.org/jira/browse/HBASE-21732) | *Critical* | **Should call toUpperCase before using Enum.valueOf in some methods for ColumnFamilyDescriptor**
10424
10425 Now all the Enum configs in ColumnFamilyDescriptor can accept lower case config value.
10426
10427
10428 ---
10429
10430 * [HBASE-21712](https://issues.apache.org/jira/browse/HBASE-21712) | *Minor* | **Make submit-patch.py python3 compatible**
10431
10432 Python3 support was added to dev-support/submit-patch.py. To install newly required dependencies run \`pip install -r dev-support/python-requirements.txt\` command.
10433
10434
10435 ---
10436
10437 * [HBASE-21657](https://issues.apache.org/jira/browse/HBASE-21657) | *Major* | **PrivateCellUtil#estimatedSerializedSizeOf has been the bottleneck in 100% scan case.**
10438
10439 In HBASE-21657,  I simplified the path of estimatedSerialiedSize() & estimatedSerialiedSizeOfCell() by moving the general getSerializedSize()
10440 and heapSize() from ExtendedCell to Cell interface. The patch also included some other improvments:
10441
10442 1. For 99%  of case, our cells has no tags, so let the HFileScannerImpl just return the NoTagsByteBufferKeyValue if no tags, which means we can save
10443    lots of cpu time when sending no tags cell to rpc because can just return the length instead of getting the serialize size by caculating offset/length
10444    of each fields(row/cf/cq..)
10445 2. Move the subclass's getSerializedSize implementation from ExtendedCell to their own class, which mean we did not need to call ExtendedCell's
10446    getSerialiedSize() firstly, then forward to subclass's getSerializedSize(withTags).
10447 3. Give a estimated result arraylist size for avoiding the frequent list extension when in a big scan, now we estimate the array size as min(scan.rows, 512).
10448    it's also help a lot.
10449
10450 We gain almost ~40% throughput improvement in 100% scan case for branch-2 (cacheHitRatio~100%)[1], it's a good thing. While it's a incompatible change in
10451 some case, such as if the upstream user implemented their own Cells, although it's rare but can happen, then their compile will be error.
10452
10453
10454 ---
10455
10456 * [HBASE-21647](https://issues.apache.org/jira/browse/HBASE-21647) | *Major* | **Add status track for splitting WAL tasks**
10457
10458 Adds task monitor that shows ServerCrashProcedure progress in UI.
10459
10460
10461 ---
10462
10463 * [HBASE-21652](https://issues.apache.org/jira/browse/HBASE-21652) | *Major* | **Refactor ThriftServer making thrift2 server inherited from thrift1 server**
10464
10465 Before this issue, thrift1 server and thrift2 server are totally different servers. If a new feature is added to thrift1 server, thrfit2 server have to make the same change to support it(e.g. authorization). After this issue, thrift2 server is inherited from thrift1, thrift2 server now have all the features thrift1 server has(e.g http support, which thrift2 server doesn't have before).  The way to start thrift1 or thrift2 server remain the same after this issue.
10466
10467
10468 ---
10469
10470 * [HBASE-21661](https://issues.apache.org/jira/browse/HBASE-21661) | *Major* | **Provide Thrift2 implementation of Table/Admin**
10471
10472 ThriftAdmin/ThriftTable are implemented based on Thrift2. With ThriftAdmin/ThriftTable, People can use thrift2 protocol just like HTable/HBaseAdmin.
10473 Example of using ThriftConnection
10474 Configuration conf = HBaseConfiguration.create();
10475 conf.set(ClusterConnection.HBASE\_CLIENT\_CONNECTION\_IMPL,ThriftConnection.class.getName());
10476 Connection conn = ConnectionFactory.createConnection(conf);
10477 Table table = conn.getTable(tablename)
10478 It is just like a normal Connection, similar use experience with the default ConnectionImplementation
10479
10480
10481 ---
10482
10483 * [HBASE-21618](https://issues.apache.org/jira/browse/HBASE-21618) | *Critical* | **Scan with the same startRow(inclusive=true) and stopRow(inclusive=false) returns one result**
10484
10485 There was a bug when scan with the same startRow(inclusive=true) and stopRow(inclusive=false). The old incorrect behavior is return one result. After this fix, the new correct behavior is return nothing.
10486
10487
10488 ---
10489
10490 * [HBASE-21159](https://issues.apache.org/jira/browse/HBASE-21159) | *Major* | **Add shell command to switch throttle on or off**
10491
10492 Support enable or disable rpc throttle when hbase quota is enabled. If hbase quota is enabled, rpc throttle is enabled by default.  When disable rpc throttle, HBase will not throttle any request. Use the following commands to switch rpc throttle : enable\_rpc\_throttle / disable\_rpc\_throttle.
10493
10494
10495 ---
10496
10497 * [HBASE-21659](https://issues.apache.org/jira/browse/HBASE-21659) | *Minor* | **Avoid to load duplicate coprocessors in system config and table descriptor**
10498
10499 Add a new configuration "hbase.skip.load.duplicate.table.coprocessor". The default value is false to keep compatible with the old behavior. Config it true to skip load duplicate table coprocessor.
10500
10501
10502 ---
10503
10504 * [HBASE-21650](https://issues.apache.org/jira/browse/HBASE-21650) | *Major* | **Add DDL operation and some other miscellaneous to thrift2**
10505
10506 Added DDL operations and some other structure definition to thrift2. Methods added:
10507 create/modify/addColumnFamily/deleteColumnFamily/modifyColumnFamily/enable/disable/truncate/delete table
10508 create/modify/delete namespace
10509 get(list)TableDescriptor(s)/get(list)NamespaceDescirptor(s)
10510 tableExists/isTableEnabled/isTableDisabled/isTableAvailabe
10511 And some class definitions along with those methods
10512
10513
10514 ---
10515
10516 * [HBASE-21643](https://issues.apache.org/jira/browse/HBASE-21643) | *Major* | **Introduce two new region coprocessor method and deprecated postMutationBeforeWAL**
10517
10518 Deprecated region coprocessor postMutationBeforeWAL and introduce two new region coprocessor postIncrementBeforeWAL and postAppendBeforeWAL instead.
10519
10520
10521 ---
10522
10523 * [HBASE-21635](https://issues.apache.org/jira/browse/HBASE-21635) | *Major* | **Use maven enforcer to ban imports from illegal packages**
10524
10525 Use de.skuzzle.enforcer.restrict-imports-enforcer-rule extension for maven enforcer plugin to ban illegal imports at compile time. Now if you use illegal imports, for example, import com.google.common.\*, there will be a compile error, instead of a checkstyle warning.
10526
10527
10528 ---
10529
10530 * [HBASE-21401](https://issues.apache.org/jira/browse/HBASE-21401) | *Critical* | **Sanity check when constructing the KeyValue**
10531
10532 Add a sanity check when constructing KeyValue from a byte[]. we use the constructor when we're reading kv from socket or HFIle or WAL(replication). the santiy check isn't designed for discovering the bits corruption in network transferring or disk IO. It is designed to detect bugs inside HBase in advance. and HBASE-21459 indicated that there's extremely small performance loss for diff kinds of keyvalue.
10533
10534
10535 ---
10536
10537 * [HBASE-21554](https://issues.apache.org/jira/browse/HBASE-21554) | *Minor* | **Show replication endpoint classname for replication peer on master web UI**
10538
10539 The replication UI on master will show the replication endpoint classname.
10540
10541
10542 ---
10543
10544 * [HBASE-21549](https://issues.apache.org/jira/browse/HBASE-21549) | *Major* | **Add shell command for serial replication peer**
10545
10546 Add a SERIAL flag for add\_peer command to identifiy whether or not the replication peer is a serial replication peer. The default serial flag is false.
10547
10548
10549 ---
10550
10551 * [HBASE-21453](https://issues.apache.org/jira/browse/HBASE-21453) | *Major* | **Convert ReadOnlyZKClient to DEBUG instead of INFO**
10552
10553 Log level of ReadOnlyZKClient moved to debug.
10554
10555
10556 ---
10557
10558 * [HBASE-21283](https://issues.apache.org/jira/browse/HBASE-21283) | *Minor* | **Add new shell command 'rit' for listing regions in transition**
10559
10560 <!-- markdown -->
10561
10562 The HBase `shell` now includes a command to list regions currently in transition.
10563
10564 ```
10565 HBase Shell
10566 Use "help" to get list of supported commands.
10567 Use "exit" to quit this interactive shell.
10568 Version 1.5.0-SNAPSHOT, r9bb6d2fa8b760f16cd046657240ebd4ad91cb6de, Mon Oct  8 21:05:50 UTC 2018
10569
10570 hbase(main):001:0> help 'rit'
10571 List all regions in transition.
10572 Examples:
10573   hbase> rit
10574
10575 hbase(main):002:0> create ...
10576 0 row(s) in 2.5150 seconds
10577 => Hbase::Table - IntegrationTestBigLinkedList
10578
10579 hbase(main):003:0> rit
10580 0 row(s) in 0.0340 seconds
10581
10582 hbase(main):004:0> unassign '56f0c38c81ae453d19906ce156a2d6a1'
10583 0 row(s) in 0.0540 seconds
10584
10585 hbase(main):005:0> rit
10586 IntegrationTestBigLinkedList,L\xCC\xCC\xCC\xCC\xCC\xCC\xCB,1539117183224.56f0c38c81ae453d19906ce156a2d6a1. state=PENDING_CLOSE, ts=Tue Oct 09 20:33:34 UTC 2018 (0s ago), server=null
10587 1 row(s) in 0.0170 seconds
10588 ```
10589
10590
10591 ---
10592
10593 * [HBASE-21567](https://issues.apache.org/jira/browse/HBASE-21567) | *Major* | **Allow overriding configs starting up the shell**
10594
10595 Allow passing of -Dkey=value option to shell to override hbase-\* configuration: e.g.:
10596
10597 $ ./bin/hbase shell -Dhbase.zookeeper.quorum=ZK0.remote.cluster.example.org,ZK1.remote.cluster.example.org,ZK2.remote.cluster.example.org -Draining=false
10598 ...
10599 hbase(main):001:0\> @shell.hbase.configuration.get("hbase.zookeeper.quorum")
10600 =\> "ZK0.remote.cluster.example.org,ZK1.remote.cluster.example.org,ZK2.remote.cluster.example.org"
10601 hbase(main):002:0\> @shell.hbase.configuration.get("raining")
10602 =\> "false"
10603
10604
10605 ---
10606
10607 * [HBASE-21560](https://issues.apache.org/jira/browse/HBASE-21560) | *Major* | **Return a new TableDescriptor for MasterObserver#preModifyTable to allow coprocessor modify the TableDescriptor**
10608
10609 Incompatible change. Allow MasterObserver#preModifyTable to return a new TableDescriptor. And master will use this returned TableDescriptor to modify table.
10610
10611
10612 ---
10613
10614 * [HBASE-21551](https://issues.apache.org/jira/browse/HBASE-21551) | *Blocker* | **Memory leak when use scan with STREAM at server side**
10615
10616 <!-- markdown -->
10617 ### Summary
10618 HBase clusters will experience Region Server failures due to out of memory errors due to a leak given any of the following:
10619
10620 * User initiates Scan operations set to use the STREAM reading type
10621 * User initiates Scan operations set to use the default reading type that read more than 4 * the block size of column families involved in the scan (e.g. by default 4*64KiB)
10622 * Compactions run
10623
10624 ### Root cause
10625
10626 When there are long running scans the Region Server process attempts to optimize access by using a different API geared towards sequential access. Due to an error in HBASE-20704 for HBase 2.0+ the Region Server fails to release related resources when those scans finish. That same optimization path is always used for the HBase internal file compaction process.
10627
10628 ### Workaround
10629
10630 Impact for this error can be minimized by setting the config value “hbase.storescanner.pread.max.bytes” to MAX_INT to avoid the optimization for default user scans. Clients should also be checked to ensure they do not pass the STREAM read type to the Scan API. This will have a severe impact on performance for long scans.
10631
10632 Compactions always use this sequential optimized reading mechanism so downstream users will need to periodically restart Region Server roles after compactions have happened.
10633
10634
10635 ---
10636
10637 * [HBASE-21550](https://issues.apache.org/jira/browse/HBASE-21550) | *Major* | **Add a new method preCreateTableRegionInfos for MasterObserver which allows CPs to modify the TableDescriptor**
10638
10639 Add a new method preCreateTableRegionInfos for MasterObserver, which will be called before creating region infos for the given table,  before the preCreateTable method. It allows you to return a new TableDescritor to override the original one. Returns null or throws exception will stop the creation.
10640
10641
10642 ---
10643
10644 * [HBASE-21492](https://issues.apache.org/jira/browse/HBASE-21492) | *Critical* | **CellCodec Written To WAL Before It's Verified**
10645
10646 After HBASE-21492 the return type of WALCellCodec#getWALCellCodecClass has been changed from String to Class
10647
10648
10649 ---
10650
10651 * [HBASE-21387](https://issues.apache.org/jira/browse/HBASE-21387) | *Major* | **Race condition surrounding in progress snapshot handling in snapshot cache leads to loss of snapshot files**
10652
10653 To prevent race condition between in progress snapshot (performed by TakeSnapshotHandler) and HFileCleaner which results in data loss, this JIRA introduced mutual exclusion between taking snapshot and running HFileCleaner. That is, at any given moment, either some snapshot can be taken or, HFileCleaner checks hfiles which are not referenced, but not both can be running.
10654
10655
10656 ---
10657
10658 * [HBASE-21452](https://issues.apache.org/jira/browse/HBASE-21452) | *Major* | **Illegal character in hbase counters group name**
10659
10660 Changes group name of hbase metrics from "HBase Counters" to "HBaseCounters".
10661
10662
10663 ---
10664
10665 * [HBASE-21443](https://issues.apache.org/jira/browse/HBASE-21443) | *Major* | **[hbase-connectors] Purge hbase-\* modules from core now they've been moved to hbase-connectors**
10666
10667 Parent issue moved hbase-spark\* modules to hbase-connectors. This issue removes hbase-spark\* modules from hbase core repo.
10668
10669
10670 ---
10671
10672 * [HBASE-21430](https://issues.apache.org/jira/browse/HBASE-21430) | *Major* | **[hbase-connectors] Move hbase-spark\* modules to hbase-connectors repo**
10673
10674 hbase-spark\* modules have been cloned to https://github.com/apache/hbase-connectors All spark connector dev is to happen in that repo from here on out.
10675
10676 Let me file a subtask to remove hbase-spark\* modules from hbase core.
10677
10678
10679 ---
10680
10681 * [HBASE-21417](https://issues.apache.org/jira/browse/HBASE-21417) | *Critical* | **Pre commit build is broken due to surefire plugin crashes**
10682
10683 Add -Djdk.net.URLClassPath.disableClassPathURLCheck=true when executing surefire plugin.
10684
10685
10686 ---
10687
10688 * [HBASE-21191](https://issues.apache.org/jira/browse/HBASE-21191) | *Major* | **Add a holding-pattern if no assign for meta or namespace (Can happen if masterprocwals have been cleared).**
10689
10690 Puts master startup into holding pattern if meta is not assigned (previous it would exit). To make progress again, operator needs to inject an assign (Caveats and instruction can be found in HBASE-21035).
10691
10692
10693 ---
10694
10695 * [HBASE-21322](https://issues.apache.org/jira/browse/HBASE-21322) | *Critical* | **Add a scheduleServerCrashProcedure() API to HbckService**
10696
10697 Adds scheduleServerCrashProcedure to the HbckService.
10698
10699
10700 ---
10701
10702 * [HBASE-21325](https://issues.apache.org/jira/browse/HBASE-21325) | *Major* | **Force to terminate regionserver when abort hang in somewhere**
10703
10704 Add two new config hbase.regionserver.abort.timeout and hbase.regionserver.abort.timeout.task. If regionserver abort timeout, it will schedule an abort timeout task to run. The default abort task is SystemExitWhenAbortTimeout, which will force to terminate region server when abort timeout. And you can config a special abort timeout task by hbase.regionserver.abort.timeout.task.
10705
10706
10707 ---
10708
10709 * [HBASE-21215](https://issues.apache.org/jira/browse/HBASE-21215) | *Major* | **Figure how to invoke hbck2; make it easy to find**
10710
10711 Adds to bin/hbase means of invoking hbck2. Pass the new '-j' option on the 'hbck' command with a value of the full path to the HBCK2.jar.
10712
10713 E.g:
10714
10715 $ ./bin/hbase hbck -j ~/checkouts/hbase-operator-tools/hbase-hbck2/target/hbase-hbck2-1.0.0-SNAPSHOT.jar  setTableState x ENABLED
10716
10717
10718 ---
10719
10720 * [HBASE-21372](https://issues.apache.org/jira/browse/HBASE-21372) | *Major* | **Set hbase.assignment.maximum.attempts to Long.MAX**
10721
10722 Retry assigns 'forever' (or until an intervention such as a ServerCrashProcedure).
10723
10724 Previous retry was a maximum of ten times but on failure, handling was an indeterminate.
10725
10726
10727 ---
10728
10729 * [HBASE-21338](https://issues.apache.org/jira/browse/HBASE-21338) | *Major* | **[balancer] If balancer is an ill-fit for cluster size, it gives little indication**
10730
10731 The description claims the balancer not dynamically configurable but this is an error; it is http://hbase.apache.org/book.html#dyn\_config
10732
10733 Also, if balancer is seen to be cutting out too soon, try setting "hbase.master.balancer.stochastic.runMaxSteps" to true.
10734
10735 Adds cleaner logging around balancer start.
10736
10737
10738 ---
10739
10740 * [HBASE-21073](https://issues.apache.org/jira/browse/HBASE-21073) | *Major* | **"Maintenance mode" master**
10741
10742     Instead of being an ephemeral state set by hbck, maintenance mode is now
10743     an explicit toggle set by either configuration property or environment
10744     variable. In maintenance mode, master will host system tables and not
10745     assign any user-space tables to RSs. This gives operators the ability to
10746     affect repairs to meta table with fewer moving parts.
10747
10748
10749 ---
10750
10751 * [HBASE-21335](https://issues.apache.org/jira/browse/HBASE-21335) | *Critical* | **Change the default wait time of HBCK2 tool**
10752
10753 Changed waitTime parameter to lockWait on bypass. Changed default waitTime from 0 -- i.e. wait for ever -- to 1ms so if lock is held, we'll go past it and if override enforce bypass.
10754
10755
10756 ---
10757
10758 * [HBASE-21291](https://issues.apache.org/jira/browse/HBASE-21291) | *Major* | **Add a test for bypassing stuck state-machine procedures**
10759
10760 bypass will now throw an Exception if passed a lockWait \<= 0; i.e bypass will prevent an operator getting stuck on an entity lock waiting forever (lockWait == 0)
10761
10762
10763 ---
10764
10765 * [HBASE-21320](https://issues.apache.org/jira/browse/HBASE-21320) | *Major* | **[canary] Cleanup of usage and add commentary**
10766
10767 Cleans up usage and docs around Canary.  Does not change command-line args (though we should -- smile).
10768
10769
10770 ---
10771
10772 * [HBASE-21278](https://issues.apache.org/jira/browse/HBASE-21278) | *Critical* | **Do not rollback successful sub procedures when rolling back a procedure**
10773
10774 For the sub procedures which are successfully finished, do not do rollback. This is a change in rollback behavior.
10775
10776 State changes which are done by sub procedures should be handled by parent procedures when rolling back. For example, when rolling back a MergeTableProcedure, we will schedule new procedures to bring the offline regions online instead of rolling back the original procedures which off-lined the regions (in fact these procedures can not be rolled back...).
10777
10778
10779 ---
10780
10781 * [HBASE-21158](https://issues.apache.org/jira/browse/HBASE-21158) | *Critical* | **Empty qualifier cell should not be returned if it does not match QualifierFilter**
10782
10783 <!-- markdown -->
10784
10785 Scans that make use of `QualifierFilter` previously would erroneously return both columns with an empty qualifier along with those that matched. After this change that behavior has changed to only return those columns that match.
10786
10787
10788 ---
10789
10790 * [HBASE-21098](https://issues.apache.org/jira/browse/HBASE-21098) | *Major* | **Improve Snapshot Performance with Temporary Snapshot Directory when rootDir on S3**
10791
10792 It is recommended to place the working directory on-cluster on HDFS as doing so has shown a strong performance increase due to data locality. It is important to note that the working directory should not overlap with any existing directories as the working directory will be cleaned out during the snapshot process. Beyond that, any well-named directory on HDFS should be sufficient.
10793
10794
10795 ---
10796
10797 * [HBASE-21185](https://issues.apache.org/jira/browse/HBASE-21185) | *Minor* | **WALPrettyPrinter: Additional useful info to be printed by wal printer tool, for debugability purposes**
10798
10799 This adds two extra features to WALPrettyPrinter tool:
10800
10801 1) Output for each cell combined size of cell descriptors, plus the cell value itself, in a given WAL edit. This is printed on the results as "cell total size sum:" info by default;
10802
10803 2) An optional -g/--goto argument, that allows to seek straight to that specific WAL file position, then sequentially reading the WAL from that point towards its end;
10804
10805
10806 ---
10807
10808 * [HBASE-21287](https://issues.apache.org/jira/browse/HBASE-21287) | *Major* | **JVMClusterUtil Master initialization wait time not configurable**
10809
10810 Local HBase cluster (as used by unit tests) wait times on startup and initialization can be configured via \`hbase.master.start.timeout.localHBaseCluster\` and \`hbase.master.init.timeout.localHBaseCluster\`
10811
10812
10813 ---
10814
10815 * [HBASE-21280](https://issues.apache.org/jira/browse/HBASE-21280) | *Trivial* | **Add anchors for each heading in UI**
10816
10817 Adds anchors #tables, #tasks, etc.
10818
10819
10820 ---
10821
10822 * [HBASE-21232](https://issues.apache.org/jira/browse/HBASE-21232) | *Major* | **Show table state in Tables view on Master home page**
10823
10824 Add table state column to the tables panel
10825
10826
10827 ---
10828
10829 * [HBASE-21223](https://issues.apache.org/jira/browse/HBASE-21223) | *Critical* | **[amv2] Remove abort\_procedure from shell**
10830
10831 Removed the abort\_procedure command from shell -- dangerous -- and deprecated abortProcedure in Admin API.
10832
10833
10834 ---
10835
10836 * [HBASE-20636](https://issues.apache.org/jira/browse/HBASE-20636) | *Major* | **Introduce two bloom filter type : ROWPREFIX\_FIXED\_LENGTH and ROWPREFIX\_DELIMITED**
10837
10838 Add two bloom filter type : ROWPREFIX\_FIXED\_LENGTH and ROWPREFIX\_DELIMITED
10839 1. ROWPREFIX\_FIXED\_LENGTH: specify the length of the prefix
10840 2. ROWPREFIX\_DELIMITED: specify the delimiter of the prefix
10841 Need to specify parameters for these two types of bloomfilter, otherwise the table will fail to create
10842 Example:
10843 create 't1', {NAME =\> 'f1', BLOOMFILTER =\> 'ROWPREFIX\_FIXED\_LENGTH', CONFIGURATION =\> {'RowPrefixBloomFilter.prefix\_length' =\> '10'}}
10844 create 't1', {NAME =\> 'f1', BLOOMFILTER =\> 'ROWPREFIX\_DELIMITED', CONFIGURATION =\> {'RowPrefixDelimitedBloomFilter.delimiter' =\> '#'}}
10845
10846
10847 ---
10848
10849 * [HBASE-21156](https://issues.apache.org/jira/browse/HBASE-21156) | *Critical* | **[hbck2] Queue an assign of hbase:meta and bulk assign/unassign**
10850
10851 Adds 'raw' assigns/unassigns to the Hbck Service. Takes a list of encoded region names and bulk assigns/unassigns. Skirts Master 'state' check and does not invoke Coprocessors. For repair only.
10852
10853 Here is what HBCK2 usage looks like now:
10854
10855 {code}
10856 $ java -cp hbase-hbck2-1.0.0-SNAPSHOT.jar  org.apache.hbase.HBCK2
10857 usage: HBCK2 \<OPTIONS\> COMMAND [\<ARGS\>]
10858
10859 Options:
10860  -d,--debug                      run with debug output
10861  -h,--help                       output this help message
10862     --hbase.zookeeper.peerport   peerport of target hbase ensemble
10863     --hbase.zookeeper.quorum     ensemble of target hbase
10864     --zookeeper.znode.parent     parent znode of target hbase
10865
10866 Commands:
10867  setTableState \<TABLENAME\> \<STATE\>
10868    Possible table states: ENABLED, DISABLED, DISABLING, ENABLING
10869    To read current table state, in the hbase shell run:
10870      hbase\> get 'hbase:meta', '\<TABLENAME\>', 'table:state'
10871    A value of \\x08\\x00 == ENABLED, \\x08\\x01 == DISABLED, etc.
10872    An example making table name 'user' ENABLED:
10873      $ HBCK2 setTableState users ENABLED
10874    Returns whatever the previous table state was.
10875
10876  assign \<ENCODED\_REGIONNAME\> ...
10877    A 'raw' assign that can be used even during Master initialization.
10878    Skirts Coprocessors. Pass one or more encoded RegionNames:
10879    e.g. 1588230740 is hard-coded encoding for hbase:meta region and
10880    de00010733901a05f5a2a3a382e27dd4 is an example of what a random
10881    user-space encoded Region name looks like. For example:
10882      $ HBCK2 assign 1588230740 de00010733901a05f5a2a3a382e27dd4
10883    Returns the pid of the created AssignProcedure or -1 if none.
10884
10885  unassign \<ENCODED\_REGIONNAME\> ...
10886    A 'raw' unassign that can be used even during Master initialization.
10887    Skirts Coprocessors. Pass one or more encoded RegionNames:
10888    Skirts Coprocessors. Pass one or more encoded RegionNames:
10889    de00010733901a05f5a2a3a382e27dd4 is an example of what a random
10890    user-space encoded Region name looks like. For example:
10891      $ HBCK2 unassign 1588230740 de00010733901a05f5a2a3a382e27dd4
10892    Returns the pid of the created UnassignProcedure or -1 if none.
10893 {code}
10894
10895
10896 ---
10897
10898 * [HBASE-21021](https://issues.apache.org/jira/browse/HBASE-21021) | *Major* | **Result returned by Append operation should be ordered**
10899
10900 This change ensures Append operations are assembled into the expected order.
10901
10902
10903 ---
10904
10905 * [HBASE-21171](https://issues.apache.org/jira/browse/HBASE-21171) | *Major* | **[amv2] Tool to parse a directory of MasterProcWALs standalone**
10906
10907 Make it so can run the WAL parse and load system in isolation. Here is an example:
10908
10909 {code}$ HBASE\_OPTS=" -XX:+UnlockDiagnosticVMOptions -XX:+UnlockCommercialFeatures -XX:+FlightRecorder -XX:+DebugNonSafepoints" ./bin/hbase org.apache.hadoop.hbase.procedure2.store.wal.WALProcedureStore ~/big\_set\_of\_masterprocwals/
10910 {code}
10911
10912
10913 ---
10914
10915 * [HBASE-21107](https://issues.apache.org/jira/browse/HBASE-21107) | *Minor* | **add a metrics for netty direct memory**
10916
10917 Add a new nettyDirectMemoryUsage under server's ipc metrics to show direct memory usage for netty rpc server.
10918
10919
10920 ---
10921
10922 * [HBASE-21153](https://issues.apache.org/jira/browse/HBASE-21153) | *Major* | **Shaded client jars should always build in relevant phase to avoid confusion**
10923
10924 Client facing artifacts are now built whenever Maven is run through the "package" goal. Previously, the client facing artifacts would create placeholder jars that skipped repackaging HBase and third-party dependencies unless the "release" profile was active.
10925
10926 Build times may be noticeably longer depending on your build hardware. For example, the Jenkins worker nodes maintained by ASF Infra take ~14% longer to do a full packaging build. An example portability-focused personal laptop took ~25% longer.
10927
10928
10929 ---
10930
10931 * [HBASE-20942](https://issues.apache.org/jira/browse/HBASE-20942) | *Major* | **Improve RpcServer TRACE logging**
10932
10933 Allows configuration of the length of RPC messages printed to the log at TRACE level via "hbase.ipc.trace.param.size" in RpcServer.
10934
10935
10936 ---
10937
10938 * [HBASE-20649](https://issues.apache.org/jira/browse/HBASE-20649) | *Minor* | **Validate HFiles do not have PREFIX\_TREE DataBlockEncoding**
10939
10940 <!-- markdown -->
10941 Users who have previously made use of prefix tree encoding can now check that their existing HFiles no longer contain data that uses it with an additional preupgrade check command.
10942
10943 ```
10944 hbase pre-upgrade validate-hfile
10945 ```
10946
10947 Please see the "HFile Content validation" section of the ref guide's coverage of the pre-upgrade validator tool for usage details.
10948
10949
10950 ---
10951
10952 * [HBASE-20941](https://issues.apache.org/jira/browse/HBASE-20941) | *Major* | **Create and implement HbckService in master**
10953
10954 Adds an HBCK Service and a first method to force-change-in-table-state for use by an HBCK client effecting 'repair' to a malfunctioning HBase.
10955
10956
10957 ---
10958
10959 * [HBASE-21071](https://issues.apache.org/jira/browse/HBASE-21071) | *Major* | **HBaseTestingUtility::startMiniCluster() to use builder pattern**
10960
10961 Cleanup all the cluster start override combos in HBaseTestingUtility by adding a StartMiniClusterOption and Builder.
10962
10963
10964 ---
10965
10966 * [HBASE-21072](https://issues.apache.org/jira/browse/HBASE-21072) | *Major* | **Block out HBCK1 in hbase2**
10967
10968 Fence out hbase-1.x hbck1 instances. Stop them making state changes on an hbase-2.x cluster; they could do damage. We do this by writing the hbck1 lock file into place on hbase-2.x Master start-up.
10969
10970 To disable this new behavior, set hbase.write.hbck1.lock.file to false
10971
10972
10973 ---
10974
10975 * [HBASE-20881](https://issues.apache.org/jira/browse/HBASE-20881) | *Major* | **Introduce a region transition procedure to handle all the state transition for a region**
10976
10977 Introduced a new TransitRegionStateProcedure to replace the old AssignProcedure/UnassignProcedure/MoveRegionProcedure. In the old code, MRP will not be attached to RegionStateNode, so it can not be interrupted by ServerCrashProcedure, which introduces lots of tricky code to deal with races, and also causes lots of other difficulties on how to prevent scheduling redundant or even conflict procedures for a region.
10978
10979 And now TRSP is the only one procedure which can bring region online or offline. When you want to schedule one, you need to check whether there is already one attached to the RegionStateNode, under the lock of the RegionStateNode. If not just go ahead, and if there is one, then you should do something, for example, give up and fail directly, or tell the TRSP to give up(This is what SCP does). Since the check and attach are both under the lock of RSN, it will greatly reduce the possible races, and make the code much simpler.
10980
10981
10982 ---
10983
10984 * [HBASE-21012](https://issues.apache.org/jira/browse/HBASE-21012) | *Critical* | **Revert the change of serializing TimeRangeTracker**
10985
10986 HFiles generated by 2.0.0, 2.0.1, 2.1.0 are not forward compatible to 1.4.6-, 1.3.2.1-, 1.2.6.1-, and other inactive releases. Why HFile lose compatability is hbase in new versions (2.0.0, 2.0.1, 2.1.0) use protobuf to serialize/deserialize TimeRangeTracker (TRT) while old versions use DataInput/DataOutput. To solve this, We have to put HBASE-21012 to 2.x and put HBASE-21013 in 1.x. For more information, please check HBASE-21008.
10987
10988
10989 ---
10990
10991 * [HBASE-20965](https://issues.apache.org/jira/browse/HBASE-20965) | *Major* | **Separate region server report requests to new handlers**
10992
10993 After HBASE-20965, we can use MasterFifoRpcScheduler in master to separate RegionServerReport requests to indenpedent handler. To use this feature, please set "hbase.master.rpc.scheduler.factory.class" to
10994  "org.apache.hadoop.hbase.ipc.MasterFifoRpcScheduler". Use "hbase.master.server.report.handler.count" to set RegionServerReport handlers count, the default value is half of "hbase.regionserver.handler.count" value, but at least 1, and the other handlers count in master is "hbase.regionserver.handler.count" value minus RegionServerReport handlers count, but at least 1 too.
10995
10996
10997 ---
10998
10999 * [HBASE-20813](https://issues.apache.org/jira/browse/HBASE-20813) | *Minor* | **Remove RPC quotas when the associated table/Namespace is dropped off**
11000
11001 In previous releases, when a Space Quota was configured on a table or namespace and that table or namespace was deleted, the Space Quota was also deleted. This change improves the implementation so that the same is also done for RPC Quotas.
11002
11003
11004 ---
11005
11006 * [HBASE-20986](https://issues.apache.org/jira/browse/HBASE-20986) | *Major* | **Separate the config of block size when we do log splitting and write Hlog**
11007
11008 After HBASE-20986, we can set different value to block size of WAL and recovered edits. Both of their default value is 2 \* default HDFS blocksize. And hbase.regionserver.recoverededits.blocksize is for block size of recovered edits while hbase.regionserver.hlog.blocksize is for block size of WAL.
11009
11010
11011 ---
11012
11013 * [HBASE-20856](https://issues.apache.org/jira/browse/HBASE-20856) | *Minor* | **PITA having to set WAL provider in two places**
11014
11015 With this change if a WAL's meta provider (hbase.wal.meta\_provider) is not explicitly set, it now defaults to whatever hbase.wal.provider is set to. Previous, the two settings operated independently, each with its own default.
11016
11017 This change is operationally incompatible with previous HBase versions because the default WAL meta provider no longer defaults to AsyncFSWALProvider but to hbase.wal.provider.
11018
11019 The thought is that this is more in line with an operator's expectation, that a change in hbase.wal.provider is sufficient to change how WALs are written, especially given hbase.wal.meta\_provider is an obscure configuration and that the very idea that meta regions would have their own wal provider would likely come as a surprise.
11020
11021
11022 ---
11023
11024 * [HBASE-20538](https://issues.apache.org/jira/browse/HBASE-20538) | *Critical* | **Upgrade our hadoop versions to 2.7.7 and 3.0.3**
11025
11026 Update hadoop-two.version to 2.7.7 and hadoop-three.version to 3.0.3 due to a JDK issue which is solved by HADOOP-15473.
11027
11028
11029 ---
11030
11031 * [HBASE-20846](https://issues.apache.org/jira/browse/HBASE-20846) | *Major* | **Restore procedure locks when master restarts**
11032
11033 1. Make hasLock method final, and add a locked field in Procedure to record whether we have the lock. We will set it to true in doAcquireLock and to false in doReleaseLock. The sub procedures do not need to manage it any more.
11034
11035 2. Also added a locked field in the proto message. When storing, the field will be set according to the return value of hasLock. And when loading, there is a new field in Procedure called lockedWhenLoading. We will set it to true if the locked field in proto message is true.
11036
11037 3. The reason why we can not set the locked field directly to true by calling doAcquireLock is that, during initialization, most procedures need to wait until master is initialized. So the solution here is that, we introduced a new method called waitInitialized in Procedure, and move the wait master initialized related code from acquireLock to this method. And we added a restoreLock method to Procedure, if lockedWhenLoading is true, we will call the acquireLock to get the lock, but do not set locked to true. And later when we call doAcquireLock and pass the waitInitialized check, we will test lockedWhenLoading, if it is true, when we just set the locked field to true and return, without actually calling the acquireLock method since we have already called it once.
11038
11039
11040 ---
11041
11042 * [HBASE-20672](https://issues.apache.org/jira/browse/HBASE-20672) | *Minor* | **New metrics ReadRequestRate and WriteRequestRate**
11043
11044 Exposing 2 new metrics in HBase to provide ReadRequestRate and WriteRequestRate at region server level. These metrics give the rate of request handled by the region server and are reset after every monitoring interval.
11045
11046
11047 ---
11048
11049 * [HBASE-6028](https://issues.apache.org/jira/browse/HBASE-6028) | *Minor* | **Implement a cancel for in-progress compactions**
11050
11051 Added a new command to the shell to switch on/off compactions called "compaction\_switch". Disabling compactions will interrupt any currently ongoing compactions. This setting will be lost on restart of the server. Added the configuration hbase.regionserver.compaction.enabled so user can enable/disable compactions via hbase-site.xml.
11052
11053
11054 ---
11055
11056 * [HBASE-20884](https://issues.apache.org/jira/browse/HBASE-20884) | *Major* | **Replace usage of our Base64 implementation with java.util.Base64**
11057
11058 Class org.apache.hadoop.hbase.util.Base64 has been removed in it's entirety from HBase 2+. In HBase 1, unused methods have been removed from the class and the audience was changed from  Public to Private. This class was originally intended as an internal utility class that could be used externally but thinking since changed; these classes should not have been advertised as public to end-users.
11059
11060 This represents an incompatible change for users who relied on this implementation. An alternative implementation for affected clients is available at java.util.Base64 when using Java 8 or newer; be aware, it may encode/decode differently. For clients seeking to restore this specific implementation, it is available in the public domain for download at http://iharder.sourceforge.net/current/java/base64/
11061
11062
11063 ---
11064
11065 * [HBASE-20357](https://issues.apache.org/jira/browse/HBASE-20357) | *Major* | **AccessControlClient API Enhancement**
11066
11067 This enhances the AccessControlClient APIs to retrieve the permissions based on namespace, table name, family and qualifier for specific user. AccessControlClient can also validate a user whether allowed to perform specified operations on a particular table.
11068 Following APIs have been added,
11069 1) getUserPermissions(Connection connection, String tableRegex, byte[] columnFamily, byte[] columnQualifier, String userName)
11070          Scope of retrieving permission will be same as existing.
11071 2) hasPermission(onnection connection, String tableName, byte[] columnFamily, byte[] columnQualifier, String userName, Permission.Action... actions)
11072      Scope of validating user privilege,
11073            User can perform self check without any special privilege but ADMIN privilege will be required to perform check for other users.
11074            For example, suppose there are two users "userA" & "userB" then there can be below scenarios,
11075             a. When userA want to check whether userA have privilege to perform mentioned actions
11076                  userA don't need ADMIN privilege, as it's a self query.
11077             b. When userA want to check whether userB have privilege to perform mentioned actions,
11078                  userA must have ADMIN or superuser privilege, as it's trying to query for other user.
11079
11080
11081
11082 # HBASE  2.1.0 Release Notes
11083
11084 These release notes cover new developer and user-facing incompatibilities, important issues, features, and major improvements.
11085
11086
11087 ---
11088
11089 * [HBASE-20691](https://issues.apache.org/jira/browse/HBASE-20691) | *Blocker* | **Storage policy should allow deferring to HDFS**
11090
11091 After HBASE-20691 we have changed the default setting of hbase.wal.storage.policy from "HOT" back to "NONE" which means we defer the policy to HDFS. This fixes the problem of release 2.0.0 that the storage policy of WAL directory will defer to HDFS and may not be "HOT" even if you explicitly set hbase.wal.storage.policy to "HOT"
11092
11093
11094 ---
11095
11096 * [HBASE-20839](https://issues.apache.org/jira/browse/HBASE-20839) | *Blocker* | **Fallback to FSHLog if we can not instantiated AsyncFSWAL when user does not specify AsyncFSWAL explicitly**
11097
11098 As we hack into the internal of DFSClient when implementing AsyncFSWAL to get better performance, a patch release of hadoop can make it broken.
11099
11100 So now, if user does not specify a wal provider, then we will first try to use 'asyncfs', i.e, the AsyncFSWALProvider. If we fail due to some compatible issues, we will fallback to 'filesystem', i.e, FSHLog.
11101
11102
11103 ---
11104
11105 * [HBASE-20193](https://issues.apache.org/jira/browse/HBASE-20193) | *Critical* | **Basic Replication Web UI - Regionserver**
11106
11107 After HBASE-20193, we add a section to web ui to show the replication status of each wal group. There are 2 parts of this section, they both show the peerId, wal group and current replicating log of each replication source. And one is showing the information of replication log queue, i.e. size of current log, log queue size and replicating offset. The other one is showing the delay of replication, i.e. last shipped age and replication delay.
11108 If the offset shows -1 and replication delay is UNKNOWN, that means replication is not started. This may be caused by this peer is disabled or the replicationEndpoint is sleeping due to some reason.
11109
11110
11111 ---
11112
11113 * [HBASE-19997](https://issues.apache.org/jira/browse/HBASE-19997) | *Blocker* | **[rolling upgrade] 1.x =\> 2.x**
11114
11115 Now we have a 'basically work' solution for rolling upgrade from 1.4.x to 2.x. Please see the "Rolling Upgrade from 1.x to 2.x" section in ref guide for more details.
11116
11117
11118 ---
11119
11120 * [HBASE-20270](https://issues.apache.org/jira/browse/HBASE-20270) | *Major* | **Turn off command help that follows all errors in shell**
11121
11122 <!-- markdown -->
11123 The command help that followed all errors, before, is now no longer available. Erroneous command inputs would now just show error-texts followed by the shell command to try for seeing the help message. It looks like: For usage try 'help “create”’. Operators can copy-paste the command to get the help message.
11124
11125
11126 ---
11127
11128 * [HBASE-20194](https://issues.apache.org/jira/browse/HBASE-20194) | *Critical* | **Basic Replication WebUI - Master**
11129
11130 After HBASE-20194, we added 2 parts to master's web page.
11131 One is Peers that shows all replication peers and some of their configurations, like peer id, cluster key, state, bandwidth, and which namespace or table it will replicate.
11132 The other one is replication status of all regionservers, we added a tab to region servers division, then we can check the replication delay of all region servers for any peer. This table shows AgeOfLastShippedOp, SizeOfLogQueue and ReplicationLag for each regionserver and the table is sort by ReplicationLag in descending order. By this way we can easily find the problematic region server. If the replication delay is UNKNOWN, that means this walGroup doesn't start replicate yet and it may get disabled. ReplicationLag will update once this peer start replicate.
11133
11134
11135 ---
11136
11137 * [HBASE-18569](https://issues.apache.org/jira/browse/HBASE-18569) | *Major* | **Add prefetch support for async region locator**
11138
11139 Add prefetch support for async region locator. The default value is 10. Set 'hbase.client.locate.prefetch.limit' in hbase-site.xml if you want to use another value for it.
11140
11141
11142 ---
11143
11144 * [HBASE-20642](https://issues.apache.org/jira/browse/HBASE-20642) | *Major* | **IntegrationTestDDLMasterFailover throws 'InvalidFamilyOperationException**
11145
11146 This changes client-side nonce generation to use the same nonce for re-submissions of client RPC DDL operations.
11147
11148
11149 ---
11150
11151 * [HBASE-20708](https://issues.apache.org/jira/browse/HBASE-20708) | *Blocker* | **Remove the usage of RecoverMetaProcedure in master startup**
11152
11153 Introduce an InitMetaProcedure to initialize meta table for a new HBase deploy. Marked RecoverMetaProcedure deprecated and remove the usage of it in the current code base. We still need to keep it in place for compatibility. The code in RecoverMetaProcedure has been moved to ServerCrashProcedure, and SCP will always be enabled and we will rely on it to bring meta region online.
11154
11155 For more on the issue addressed by this commit, see the design doc for overview and plan: https://docs.google.com/document/d/1\_872oHzrhJq4ck7f6zmp1J--zMhsIFvXSZyX1Mxg5MA/edit#heading=h.xy1z4alsq7uy
11156
11157
11158 ---
11159
11160 * [HBASE-20334](https://issues.apache.org/jira/browse/HBASE-20334) | *Major* | **add a test that expressly uses both our shaded client and the one from hadoop 3**
11161
11162 <!-- markdown -->
11163
11164 HBase now includes a helper script that can be used to run a basic functionality test for a given HBase installation at in `dev_support`. The test can optionally be given an HBase client artifact to rely on and can optionally be given specific Hadoop client artifacts to use.
11165
11166 For usage information see `./dev-support/hbase_nightly_pseudo-distributed-test.sh --help`.
11167
11168 The project nightly tests now make use of this test to check running on top of Hadoop 2, Hadoop 3, and Hadoop 3 with shaded client artifacts.
11169
11170
11171 ---
11172
11173 * [HBASE-19735](https://issues.apache.org/jira/browse/HBASE-19735) | *Major* | **Create a minimal "client" tarball installation**
11174
11175 <!-- markdown -->
11176
11177 The HBase convenience binary artifacts now includes a client focused tarball that a) includes more docs and b) does not include scripts or jars only needed for running HBase cluster services.
11178
11179 The new artifact is made as a normal part of the `assembly:single` maven command.
11180
11181
11182 ---
11183
11184 * [HBASE-20615](https://issues.apache.org/jira/browse/HBASE-20615) | *Major* | **emphasize use of shaded client jars when they're present in an install**
11185
11186 <!-- markdown -->
11187
11188 HBase's built in scripts now rely on the downstream facing shaded artifacts where possible. In particular interest to downstream users, the `hbase classpath` and `hbase mapredcp` commands now return the relevant shaded client artifact and only those third paty jars needed to make use of them (e.g. slf4j-api, commons-logging, htrace, etc).
11189
11190 Downstream users should note that by default the `hbase classpath` command will treat having `hadoop` on the shell's PATH as an implicit request to include the output of the `hadoop classpath` command in the returned classpath. This long-existing behavior can be opted out of by setting the environment variable `HBASE_DISABLE_HADOOP_CLASSPATH_LOOKUP` to the value "true". For example: `HBASE_DISABLE_HADOOP_CLASSPATH_LOOKUP="true" bin/hbase classpath`.
11191
11192
11193 ---
11194
11195 * [HBASE-20333](https://issues.apache.org/jira/browse/HBASE-20333) | *Critical* | **break up shaded client into one with no Hadoop and one that's standalone**
11196
11197 <!-- markdown -->
11198
11199 Downstream users who need to use both HBase and Hadoop APIs should switch to relying on the new `hbase-shaded-client-byo-hadoop` artifact rather than the existing `hbase-shaded-client` artifact. The new artifact no longer includes and Hadoop classes.
11200
11201 It should work in combination with either the output of `hadoop classpath` or the Hadoop provided client-facing shaded artifacts in Hadoop 3+.
11202
11203
11204 ---
11205
11206 * [HBASE-20332](https://issues.apache.org/jira/browse/HBASE-20332) | *Critical* | **shaded mapreduce module shouldn't include hadoop**
11207
11208 <!-- markdown -->
11209
11210 The `hbase-shaded-mapreduce` artifact no longer include its own copy of Hadoop classes. Users who make use of the artifact via YARN should be able to get these classes from YARN's classpath without having to make any changes.
11211
11212
11213 ---
11214
11215 * [HBASE-20681](https://issues.apache.org/jira/browse/HBASE-20681) | *Major* | **IntegrationTestDriver fails after HADOOP-15406 due to missing hamcrest-core**
11216
11217 <!-- markdown -->
11218
11219 Users of our integration tests on Hadoop 3 can now add all needed dependencies by pointing at jars included in our binary convenience artifact.
11220
11221 Prior to this fix, downstream users on Hadoop 3 would need to get a copy of the Hamcrest v1.3 jar from elsewhere.
11222
11223
11224 ---
11225
11226 * [HBASE-19852](https://issues.apache.org/jira/browse/HBASE-19852) | *Major* | **HBase Thrift 1 server SPNEGO Improvements**
11227
11228 Adds two new properties for hbase-site.xml for THRIFT SPNEGO when in HTTP mode:
11229 \* hbase.thrift.spnego.keytab.file
11230 \* hbase.thrift.spnego.principal
11231
11232
11233 ---
11234
11235 * [HBASE-20590](https://issues.apache.org/jira/browse/HBASE-20590) | *Critical* | **REST Java client is not able to negotiate with the server in the secure mode**
11236
11237 Adds a negotiation logic between a secure java REST client and server. After this jira the Java REST client will start responding to the Negotiate challenge sent by the server. Adds RESTDemoClient which can be used to verify whether the secure Java REST client works against secure REST server or not.
11238
11239
11240 ---
11241
11242 * [HBASE-20634](https://issues.apache.org/jira/browse/HBASE-20634) | *Critical* | **Reopen region while server crash can cause the procedure to be stuck**
11243
11244 A second attempt at fixing HBASE-20173. Fixes unfinished keeping of server state inside AM (ONLINE=\>SPLITTING=\>OFFLINE=\>null). Concurrent unassigns look at server state to figure if they should wait on SCP to wake them up or not.
11245
11246
11247 ---
11248
11249 * [HBASE-20579](https://issues.apache.org/jira/browse/HBASE-20579) | *Minor* | **Improve snapshot manifest copy in ExportSnapshot**
11250
11251 This patch adds an FSUtil.copyFilesParallel() to help copy files in parallel, and it will return all the paths of directories and files traversed. Thus when we copy manifest in ExportSnapshot, we can copy reference files concurrently and use the paths it returns to help setOwner and setPermission.
11252 The size of thread pool is determined by the configuration snapshot.export.copy.references.threads, and its default value is the number of runtime available processors.
11253
11254
11255 ---
11256
11257 * [HBASE-18116](https://issues.apache.org/jira/browse/HBASE-18116) | *Major* | **Replication source in-memory accounting should not include bulk transfer hfiles**
11258
11259 Before this change we would incorrectly include the size of enqueued store files for bulk replication in the calculation for determining whether or not to rate limit the transfer of WAL edits. Because bulk replication uses a separate and asynchronous mechanism for file transfer this could incorrectly limit the batch sizes for WAL replication if bulk replication in progress, with negative impact on latency and throughput.
11260
11261
11262 ---
11263
11264 * [HBASE-20592](https://issues.apache.org/jira/browse/HBASE-20592) | *Minor* | **Create a tool to verify tables do not have prefix tree encoding**
11265
11266 PreUpgradeValidator tool with DataBlockEncoding validator was added to verify cluster is upgradable to HBase 2.
11267
11268
11269 ---
11270
11271 * [HBASE-20501](https://issues.apache.org/jira/browse/HBASE-20501) | *Blocker* | **Change the Hadoop minimum version to 2.7.1**
11272
11273 <!-- markdown -->
11274 HBase is no longer able to maintain compatibility with Apache Hadoop versions that are no longer receiving updates. This release raises the minimum supported version to Hadoop 2.7.1. Downstream users are strongly advised to upgrade to the latest Hadoop 2.7 maintenance release.
11275
11276 Downstream users of earlier HBase versions are similarly advised to upgrade to Hadoop 2.7.1+. When doing so, it is especially important to follow the guidance from [the HBase Reference Guide's Hadoop section](http://hbase.apache.org/book.html#hadoop) on replacing the Hadoop artifacts bundled with HBase.
11277
11278
11279 ---
11280
11281 * [HBASE-20601](https://issues.apache.org/jira/browse/HBASE-20601) | *Minor* | **Add multiPut support and other miscellaneous to PE**
11282
11283 1. Add multiPut support
11284 Set --multiPut=number to enable batchput(meanwhile, --autoflush need be set to false)
11285
11286 2. Add Connection Count support
11287 Added a new parameter connCount to PE. set --connCount=2 means all threads will share 2 connections.
11288 oneCon option and connCount option shouldn't be set at the same time.
11289
11290 3. Add avg RT and avg TPS/QPS statstic for all threads
11291
11292 4. Delete some redundant code
11293 Now RandomWriteTest is inherited from SequentialWrite.
11294
11295
11296 ---
11297
11298 * [HBASE-20544](https://issues.apache.org/jira/browse/HBASE-20544) | *Blocker* | **downstream HBaseTestingUtility fails with invalid port**
11299
11300 <!-- markdown -->
11301
11302 HBase now relies on an internal mechanism to determine when it is running a local hbase cluster meant for external interaction vs an encapsulated test. When created via the `HBaseTestingUtility`, ports for Master and RegionServer services and UIs will be set to random ports to allow for multiple parallel uses on a single machine. Normally when running a Standalone HBase Deployment (as described in the HBase Reference Guide) the ports will be picked according to the same defaults used in a full cluster set up. If you wish to instead use the random port assignment set `hbase.localcluster.assign.random.ports` to true.
11303
11304
11305 ---
11306
11307 * [HBASE-20004](https://issues.apache.org/jira/browse/HBASE-20004) | *Minor* | **Client is not able to execute REST queries in a secure cluster**
11308
11309 Added 'hbase.rest.http.allow.options.method' configuration property to allow user to decide whether Rest Server HTTP should allow OPTIONS method or not. By default it is enabled in HBase 2.1.0+ versions and in other versions it is disabled.
11310 Similarly 'hbase.thrift.http.allow.options.method' is added HBase 1.5, 2.1.0 and 3.0.0 versions. It is disabled by default.
11311
11312
11313 ---
11314
11315 * [HBASE-20327](https://issues.apache.org/jira/browse/HBASE-20327) | *Minor* | **When qualifier is not specified, append and incr operation do not work (shell)**
11316
11317 This change will enable users to perform append and increment operation with null qualifier via hbase-shell.
11318
11319
11320 ---
11321
11322 * [HBASE-18842](https://issues.apache.org/jira/browse/HBASE-18842) | *Minor* | **The hbase shell clone\_snaphost command returns bad error message**
11323
11324 <!-- markdown -->
11325
11326 When attempting to clone a snapshot but using a namespace that does not exist, the HBase shell will now correctly report the exception as caused by the passed namespace. Previously, the shell would report that the problem was an unknown namespace but it would claim the user provided table name was not found as a namespace. Both before and after this change the shell properly used the passed namespace to attempt to handle the request.
11327
11328
11329 ---
11330
11331 * [HBASE-20406](https://issues.apache.org/jira/browse/HBASE-20406) | *Major* | **HBase Thrift HTTP - Shouldn't handle TRACE/OPTIONS methods**
11332
11333 <!-- markdown -->
11334 When configured to do thrift-over-http, the HBase Thrift API Server no longer accepts the HTTP methods TRACE nor OPTIONS.
11335
11336
11337 ---
11338
11339 * [HBASE-20046](https://issues.apache.org/jira/browse/HBASE-20046) | *Major* | **Reconsider the implementation for serial replication**
11340
11341 Now in replication we can make sure the order of pushing logs is same as the order of requests from client. Set the serial flag to true for a replication peer to enable this feature.
11342
11343
11344 ---
11345
11346 * [HBASE-20159](https://issues.apache.org/jira/browse/HBASE-20159) | *Major* | **Support using separate ZK quorums for client**
11347
11348 After HBASE-20159 we allow client to use different ZK quorums by introducing three new properties: hbase.client.zookeeper.quorum and hbase.client.zookeeper.property.clientPort to specify client zookeeper properties (note that the combination of these two properties should be different from the server ZK quorums), and hbase.client.zookeeper.observer.mode to indicate whether the client ZK nodes are in observer mode (false by default)
11349
11350 HConstants.DEFAULT\_ZOOKEPER\_CLIENT\_PORT has been removed in HBase 3.0 and replaced by the correctly spelled DEFAULT\_ZOOKEEPER\_CLIENT\_PORT.
11351
11352
11353 ---
11354
11355 * [HBASE-20242](https://issues.apache.org/jira/browse/HBASE-20242) | *Major* | **The open sequence number will grow if we fail to open a region after writing the max sequence id file**
11356
11357 Now when opening a region, we will store the current max sequence id of the region to its max sequence id file instead of the 'next sequence id'. This could avoid the sequence id bumping when we fail to open a region, and also align to the behavior when we close a region.
11358
11359
11360 ---
11361
11362 * [HBASE-19024](https://issues.apache.org/jira/browse/HBASE-19024) | *Critical* | **Configurable default durability for synchronous WAL**
11363
11364 The default durability setting for the synchronous WAL is Durability.SYNC\_WAL, which triggers HDFS hflush() to flush edits to the datanodes. We also support Durability.FSYNC\_WAL, which instead triggers HDFS hsync() to flush \_and\_ fsync edits. This change introduces the new configuration setting "hbase.wal.hsync", defaulting to FALSE, that if set to TRUE changes the default durability setting for the synchronous WAL to  FSYNC\_WAL.
11365
11366
11367 ---
11368
11369 * [HBASE-19389](https://issues.apache.org/jira/browse/HBASE-19389) | *Critical* | **Limit concurrency of put with dense (hundreds) columns to prevent write handler exhausted**
11370
11371 After HBASE-19389 we introduced a RegionServer self-protection mechanism to prevent write handler getting exhausted by high concurrency put with dense columns, mainly through two new properties: hbase.region.store.parallel.put.limit.min.column.count to decide what kind of put (with how many columns within a single column family) to limit (100 by default) and hbase.region.store.parallel.put.limit to limit the concurrency (10 by default). There's another property for advanced user and please check source and javadoc of StoreHotnessProtector for more details.
11372
11373
11374 ---
11375
11376 * [HBASE-20148](https://issues.apache.org/jira/browse/HBASE-20148) | *Major* | **Make serial replication as a option for a peer instead of a table**
11377
11378 A new method setSerial has been added to the interface ReplicationPeerConfigBuilder which is marked as IA.Public. This interface is not supposed to be implemented by client code, but if you do, this will be an incompatible change as you need to add this method to your implementation too.
11379
11380
11381 ---
11382
11383 * [HBASE-19397](https://issues.apache.org/jira/browse/HBASE-19397) | *Major* | **Design  procedures for ReplicationManager to notify peer change event from master**
11384
11385 Introduce 5 procedures to do peer modifications:
11386 AddPeerProcedure
11387 RemovePeerProcedure
11388 UpdatePeerConfigProcedure
11389 EnablePeerProcedure
11390 DisablePeerProcedure
11391
11392 The procedures are all executed with the following stage:
11393 1. Call pre CP hook, if an exception is thrown then give up
11394 2. Check whether the operation is valid, if not then give up
11395 3. Update peer storage. Notice that if we have entered this stage, then we can not rollback any more.
11396 4. Schedule sub procedures to refresh the peer config on every RS.
11397 5. Do post cleanup if any.
11398 6. Call post CP hook. The exception thrown will be ignored since we have already done the work.
11399
11400 The procedure will hold an exclusive lock on the peer id, so now there is no concurrent modifications on a single peer.
11401
11402 And now it is guaranteed that once the procedure is done, the peer modification has already taken effect on all RSes.
11403
11404 Abstracte a storage layer for replication peer/queue manangement, and refactored the upper layer to remove zk related naming/code/comment.
11405
11406 Add pre/postExecuteProcedures CP hooks to RegionServerObserver, and add permission check for executeProcedures method which requires the caller to be system user or super user.
11407
11408 On rolling upgrade: just do not do any replication peer modifications during the rolling upgrading. There is no pb/layout changes on the peer/queue storage on zk.
11409 # HBASE  2.0.0 Release Notes
11410
11411
11412 These release notes cover new developer and user-facing incompatibilities, important issues, features, and major improvements.
11413
11414
11415 ---
11416
11417 * [HBASE-20464](https://issues.apache.org/jira/browse/HBASE-20464) | *Major* | **Disable IMC**
11418
11419 Change the default so that on creation of new tables, In-Memory Compaction BASIC is NOT enabled.
11420
11421 This change is in branch-2.0 only, not in branch-2.
11422
11423
11424 ---
11425
11426 * [HBASE-20276](https://issues.apache.org/jira/browse/HBASE-20276) | *Blocker* | **[shell] Revert shell REPL change and document**
11427
11428 <!-- markdown -->
11429
11430
11431
11432 The HBase shell now behaves as it did prior to the changes that started in HBASE-15965. Namely, some shell commands return values that may be further manipulated within the shell's IRB session.
11433
11434 The command line option `--return-values` is no longer acted on by the shell since it now always behaves as it did when passed this parameter. Passing the option results in a harmless warning about this change.
11435
11436 Users who wish to maintain the behavior seen in the 1.4.0-1.4.2 releases of the HBase shell should refer to the section _irbrc_ in the reference guide for how to configure their IRB session to avoid echoing expression results to the console.
11437
11438
11439 ---
11440
11441 * [HBASE-18792](https://issues.apache.org/jira/browse/HBASE-18792) | *Blocker* | **hbase-2 needs to defend against hbck operations**
11442
11443 As of HBase version 2.0, the hbck tool is significantly changed. In general, all Read-Only options are supported and can be be used safely. Most -fix/ -repair options are NOT supported. Please see usage below for details on which options are not supported:
11444
11445
11446 Usage: fsck [opts] {only tables}
11447  where [opts] are:
11448    -help Display help options (this)
11449    -details Display full report of all regions.
11450    -timelag \<timeInSeconds\>  Process only regions that  have not experienced any metadata updates in the last  \<timeInSeconds\> seconds.
11451    -sleepBeforeRerun \<timeInSeconds\> Sleep this many seconds before checking if the fix worked if run with -fix
11452    -summary Print only summary of the tables and status.
11453    -metaonly Only check the state of the hbase:meta table.
11454    -sidelineDir \<hdfs://\> HDFS path to backup existing meta.
11455    -boundaries Verify that regions boundaries are the same between META and store files.
11456    -exclusive Abort if another hbck is exclusive or fixing.
11457
11458   Datafile Repair options: (expert features, use with caution!)
11459    -checkCorruptHFiles     Check all Hfiles by opening them to make sure they are valid
11460    -sidelineCorruptHFiles  Quarantine corrupted HFiles.  implies -checkCorruptHFiles
11461
11462  Replication options
11463    -fixReplication   Deletes replication queues for removed peers
11464
11465   Metadata Repair options supported as of version 2.0: (expert features, use with caution!)
11466    -fixVersionFile   Try to fix missing hbase.version file in hdfs.
11467    -fixReferenceFiles  Try to offline lingering reference store files
11468    -fixHFileLinks  Try to offline lingering HFileLinks
11469    -noHdfsChecking   Don't load/check region info from HDFS. Assumes hbase:meta region info is good. Won't check/fix any HDFS issue, e.g. hole, orphan, or overlap
11470    -ignorePreCheckPermission  ignore filesystem permission pre-check
11471
11472 NOTE: Following options are NOT supported as of HBase version 2.0+.
11473
11474   UNSUPPORTED Metadata Repair options: (expert features, use with caution!)
11475    -fix              Try to fix region assignments.  This is for backwards compatiblity
11476    -fixAssignments   Try to fix region assignments.  Replaces the old -fix
11477    -fixMeta          Try to fix meta problems.  This assumes HDFS region info is good.
11478    -fixHdfsHoles     Try to fix region holes in hdfs.
11479    -fixHdfsOrphans   Try to fix region dirs with no .regioninfo file in hdfs
11480    -fixTableOrphans  Try to fix table dirs with no .tableinfo file in hdfs (online mode only)
11481    -fixHdfsOverlaps  Try to fix region overlaps in hdfs.
11482    -maxMerge \<n\>     When fixing region overlaps, allow at most \<n\> regions to merge. (n=5 by default)
11483    -sidelineBigOverlaps  When fixing region overlaps, allow to sideline big overlaps
11484    -maxOverlapsToSideline \<n\>  When fixing region overlaps, allow at most \<n\> regions to sideline per group. (n=2 by default)
11485    -fixSplitParents  Try to force offline split parents to be online.
11486    -removeParents    Try to offline and sideline lingering parents and keep daughter regions.
11487    -fixEmptyMetaCells  Try to fix hbase:meta entries not referencing any region (empty REGIONINFO\_QUALIFIER rows)
11488
11489   UNSUPPORTED Metadata Repair shortcuts
11490    -repair           Shortcut for -fixAssignments -fixMeta -fixHdfsHoles -fixHdfsOrphans -fixHdfsOverlaps -fixVersionFile -sidelineBigOverlaps -fixReferenceFiles-fixHFileLinks
11491    -repairHoles      Shortcut for -fixAssignments -fixMeta -fixHdfsHoles
11492
11493
11494 ---
11495
11496 * [HBASE-19994](https://issues.apache.org/jira/browse/HBASE-19994) | *Major* | **Create a new class for RPC throttling exception, make it retryable.**
11497
11498 A new RpcThrottlingException deprecates ThrottlingException. The new RpcThrottlingException is a retryable Exception that clients will retry when Rpc throttling quota is exceeded. The deprecated ThrottlingException is a nonretryable Exception.
11499
11500
11501 ---
11502
11503 * [HBASE-20224](https://issues.apache.org/jira/browse/HBASE-20224) | *Blocker* | **Web UI is broken in standalone mode**
11504
11505 Standalone webui was broken inadvertently by HBASE-20027.
11506
11507
11508 ---
11509
11510 * [HBASE-18784](https://issues.apache.org/jira/browse/HBASE-18784) | *Major* | **Use of filesystem that requires hflush / hsync / append / etc should query outputstream capabilities**
11511
11512 <!-- markdown -->
11513
11514
11515
11516 If HBase is run on top of Apache Hadoop libraries that support the needed APIs it will verify that underlying Filesystem implementations provide the needed durability mechanisms to safely operate. The needed APIs *should* be present in Hadoop 3 release and Hadoop 2 releases starting in the Hadoop 2.9 series. If the APIs are not available, HBase behaves as it has in previous releases (that is, it moves forward assuming such a check would pass).
11517
11518 Where this check fails, it is unsafe to rely on HBase in a production setting. In the event of process or node failure, the HBase RegionServer process may fail to have access to all the data it previously wrote to its write ahead log, resulting in data loss. In the event of process or node failure, the HBase master process may lose all or part of the write ahead log that it relies on for cluster management operations, leaving the cluster in an inconsistent state that we aren't sure it could recover from.
11519
11520 Notably, the LocalFileSystem implementation provided by Hadoop reports (accurately) via these new APIs that it can not provide the durability HBase needs to operate. As such, the current instructions for single-node HBase operation have been updated both with a) how to bypass this safety check and b) a strong warning about the dire consequences of doing so outside of a dev/test environment.
11521
11522
11523 ---
11524
11525 * [HBASE-20219](https://issues.apache.org/jira/browse/HBASE-20219) | *Critical* | **An error occurs when scanning with reversed=true and loadColumnFamiliesOnDemand=true**
11526
11527 Throws DoNotRetryIOException when you ask for a reverse scan loading adjacent column families on demand. Previous it threw IllegalStateException
11528
11529
11530 ---
11531
11532 * [HBASE-20358](https://issues.apache.org/jira/browse/HBASE-20358) | *Minor* | **Fix bin/hbase thrift usage text**
11533
11534 Cleanup usage message and command-line processing (no functional change).
11535
11536
11537 ---
11538
11539 * [HBASE-20182](https://issues.apache.org/jira/browse/HBASE-20182) | *Blocker* | **Can not locate region after split and merge**
11540
11541 Now if we hit a split parent when locating a region, we will skip to the next row and try again until the region does not contain our row. So there will be no RegionOfflineException for a split parent any more, instead, if the split children have not been onlined yet, i.e, we finally arrive at a region which does not contain our row, an IOException will be thrown.
11542
11543
11544 ---
11545
11546 * [HBASE-20149](https://issues.apache.org/jira/browse/HBASE-20149) | *Critical* | **Purge dev javadoc from bin tarball (or make a separate tarball of javadoc)**
11547
11548 We no longer include dev or dev test javadocs in our binary bundle. We still build them; they are just not included because they were half the size of the resultant tarball.
11549
11550 Here is our story on javadoc as of this commit:
11551
11552  \* apidocs - user facing main api javadocs. currently for a release line, published on website and linked from menu. included in the bin tarball
11553  \* devapidocs - hbase internal javadocs. currently for a release line, published on the website but not linked from the menu. no longer included in the bin tarball.
11554  \* testapidocs - user facing test scope api javadocs. currently for a release line, not published. included in the bin tarball.
11555  \* testdevapidocs - hbase internal test scope javadocs. currently for a release line, not published. no longer included in the bin tarball
11556
11557
11558 ---
11559
11560 * [HBASE-18828](https://issues.apache.org/jira/browse/HBASE-18828) | *Blocker* | **[2.0] Generate CHANGES.txt**
11561
11562 Moves us over to yetus releasedocmaker tooling generating CHANGES. CHANGES is not markdown (CHANGES.md) as opposed to CHANGES.txt. We've also added a new RELEASENOTES.md that lists JIRA release notes (courtesy of releasedocmaker).
11563
11564 CHANGES/RELEASENOTES are current as of now. Will need a 'freshening' when we cut the RC.
11565
11566
11567 ---
11568
11569 * [HBASE-14175](https://issues.apache.org/jira/browse/HBASE-14175) | *Critical* | **Adopt releasedocmaker for better generated release notes**
11570
11571 We will use yetus releasedocmaker to make our changes doc from here on out. A CHANGELOG.md will replace our current CHANGES.txt. Adjacent, we'll keep up a RELEASENOTES.md doc courtesy of releasedocmaker.
11572
11573 Over in HBASE-18828 is where we are working through steps for the RM integrating this new tooling.
11574
11575
11576 ---
11577
11578 * [HBASE-16499](https://issues.apache.org/jira/browse/HBASE-16499) | *Critical* | **slow replication for small HBase clusters**
11579
11580 Changed the default value for replication.source.ratio from 0.1 to 0.5. Which means now by default 50% of the total RegionServers in peer cluster(s) will participate in replication.
11581
11582
11583 ---
11584
11585 * [HBASE-16459](https://issues.apache.org/jira/browse/HBASE-16459) | *Trivial* | **Remove unused hbase shell --format option**
11586
11587 <!-- markdown -->
11588
11589
11590
11591
11592 The HBase `shell` command no longer recognizes the option `--format`. Previously this option only recognized the default value of 'console'. The default value is now always used.
11593
11594
11595 ---
11596
11597 * [HBASE-20259](https://issues.apache.org/jira/browse/HBASE-20259) | *Critical* | **Doc configs for in-memory-compaction and add detail to in-memory-compaction logging**
11598
11599 Disables in-memory compaction as default.
11600
11601 Adds logging of in-memory compaction configuration on creation.
11602
11603 Adds a chapter to the refguide on this new feature.
11604
11605
11606 ---
11607
11608 * [HBASE-20282](https://issues.apache.org/jira/browse/HBASE-20282) | *Major* | **Provide short name invocations for useful tools**
11609
11610 \`hbase regionsplitter\` is a new short invocation for \`hbase org.apache.hadoop.hbase.util.RegionSplitter\`
11611
11612
11613 ---
11614
11615 * [HBASE-20314](https://issues.apache.org/jira/browse/HBASE-20314) | *Major* | **Precommit build for master branch fails because of surefire fork fails**
11616
11617 Upgrade surefire plugin to 2.21.0.
11618
11619
11620 ---
11621
11622 * [HBASE-20130](https://issues.apache.org/jira/browse/HBASE-20130) | *Critical* | **Use defaults (16020 & 16030) as base ports when the RS is bound to localhost**
11623
11624 <!-- markdown -->
11625
11626
11627
11628 When region servers bind to localhost (mostly in pseudo distributed mode), default ports (16020 & 16030) are used as base ports. This will support up to 9 instances of region servers by default with `local-regionservers.sh` script. If additional instances are needed, see the reference guide on how to deploy with a different range using the environment variables `HBASE_RS_BASE_PORT` and `HBASE_RS_INFO_BASE_PORT`.
11629
11630
11631 ---
11632
11633 * [HBASE-20111](https://issues.apache.org/jira/browse/HBASE-20111) | *Critical* | **Able to split region explicitly even on shouldSplit return false from split policy**
11634
11635 When a split is requested on a Region, the RegionServer hosting that Region will now consult the configured SplitPolicy for that table when determining if a split of that Region is allowed. When a split is disallowed (due to the Region not being OPEN or the SplitPolicy denying the request), the operation will \*not\* be implicitly retried as it has previously done. Users will need to guard against and explicitly retry region split requests which are denied by the system.
11636
11637
11638 ---
11639
11640 * [HBASE-20223](https://issues.apache.org/jira/browse/HBASE-20223) | *Blocker* | **Use hbase-thirdparty 2.1.0**
11641
11642 Moves commons-cli and commons-collections4 into the HBase thirdparty shaded jar which means that these are no longer generally available for users on the classpath.
11643
11644
11645 ---
11646
11647 * [HBASE-19128](https://issues.apache.org/jira/browse/HBASE-19128) | *Major* | **Purge Distributed Log Replay from codebase, configurations, text; mark the feature as unsupported, broken.**
11648
11649 Removes Distributed Log Replay feature. Disable the feature before upgrading.
11650
11651
11652 ---
11653
11654 * [HBASE-19504](https://issues.apache.org/jira/browse/HBASE-19504) | *Major* | **Add TimeRange support into checkAndMutate**
11655
11656 1) checkAndMutate accept a TimeRange to query the specified cell
11657 2) remove writeToWAL flag from Region#checkAndMutate since it is useless (this is a incompatible change)
11658
11659
11660 ---
11661
11662 * [HBASE-20237](https://issues.apache.org/jira/browse/HBASE-20237) | *Critical* | **Put back getClosestRowBefore and throw UnknownProtocolException instead... for asynchbase client**
11663
11664 Throw UnknownProtocolException if a client connects and tries to invoke the old getClosestRowOrBefore method. Pre-hbase-1.0.0 or asynchbase do this instead of using its replacement, the reverse Scan.
11665
11666 getClosestRowOrBefore was implemented as a flag on Get. Before this patch though the flag was set, hbase2 were ignoring it. This made it look like a pre-1.0.0 client was 'working' but then it'd fail finding the appropriate Region for a client-specified row doing lookups into hbase:meta.
11667
11668
11669 ---
11670
11671 * [HBASE-20247](https://issues.apache.org/jira/browse/HBASE-20247) | *Major* | **Set version as 2.0.0 in branch-2.0 in prep for first RC**
11672
11673 Set version as 2.0.0 on branch-2.0.
11674
11675
11676 ---
11677
11678 * [HBASE-20090](https://issues.apache.org/jira/browse/HBASE-20090) | *Major* | **Properly handle Preconditions check failure in MemStoreFlusher$FlushHandler.run**
11679
11680 When there is concurrent region split, MemStoreFlusher may not find flushable region if the only candidate region left hasn't received writes (resulting in 0 data size).
11681 After this JIRA, such scenario wouldn't trigger Precondition assertion (replaced by an if statement to see whether there is any flushable region).
11682 If there is no flushable region, a DEBUG log would appear in region server log, saying "Above memory mark but there is no flushable region".
11683
11684
11685 ---
11686
11687 * [HBASE-19552](https://issues.apache.org/jira/browse/HBASE-19552) | *Major* | **update hbase to use new thirdparty libs**
11688
11689 hbase-thirdparty libs have moved to o.a.h.thirdparty offset. Netty shading system property is no longer necessary.
11690
11691
11692 ---
11693
11694 * [HBASE-20119](https://issues.apache.org/jira/browse/HBASE-20119) | *Minor* | **Introduce a pojo class to carry coprocessor information in order to make TableDescriptorBuilder accept multiple cp at once**
11695
11696 1) Make all methods in TableDescriptorBuilder be setter pattern.
11697 addCoprocessor -\> setCoprocessor
11698 addColumnFamily -\> setColumnFamily
11699 (addCoprocessor and addColumnFamily are still in branch-2 but they are marked as deprecated)
11700 2) add CoprocessorDescriptor to carry cp information
11701 3) add CoprocessorDescriptorBuilder to build CoprocessorDescriptor
11702 4) TD disallow user to set negative priority to coprocessor since parsing the negative value will cause a exception
11703
11704
11705 ---
11706
11707 * [HBASE-17165](https://issues.apache.org/jira/browse/HBASE-17165) | *Critical* | **Add retry to LoadIncrementalHFiles tool**
11708
11709 Adds retry to load of incremental hfiles. Pertinent key is HConstants.HBASE\_CLIENT\_RETRIES\_NUMBER. Default is HConstants.DEFAULT\_HBASE\_CLIENT\_RETRIES\_NUMBER.
11710
11711
11712 ---
11713
11714 * [HBASE-20108](https://issues.apache.org/jira/browse/HBASE-20108) | *Critical* | **\`hbase zkcli\` falls into a non-interactive prompt after HBASE-15199**
11715
11716 This issue fixes a runtime dependency issues where JLine is not made available on the classpath which causes the ZooKeeper CLI to appear non-interactive. JLine was being made available unintentionally via the JRuby jar file on the classpath for the HBase shell. While the JRuby jar is not always present, the fix made here was to selectively include the JLine dependency on the zkcli command's classpath.
11717
11718
11719 ---
11720
11721 * [HBASE-8770](https://issues.apache.org/jira/browse/HBASE-8770) | *Blocker* | **deletes and puts with the same ts should be resolved according to mvcc/seqNum**
11722
11723 This behavior is available as a new feature. See HBASE-15968 release note.
11724
11725 This issue is just about adding to the refguide documentation on the HBASE\_15968 feature.
11726
11727
11728 ---
11729
11730 * [HBASE-19114](https://issues.apache.org/jira/browse/HBASE-19114) | *Major* | **Split out o.a.h.h.zookeeper from hbase-server and hbase-client**
11731
11732 Splits out most of ZooKeeper related code into a separate new module: hbase-zookeeper.
11733 Also, renames some ZooKeeper related classes to follow a common naming pattern - "ZK" prefix - as compared to many different styles earlier.
11734
11735
11736 ---
11737
11738 * [HBASE-19437](https://issues.apache.org/jira/browse/HBASE-19437) | *Critical* | **Batch operation can't handle the null result for Append/Increment**
11739
11740 The result from server is changed from null to Result.EMPTY\_RESULT when Append/Increment operation can't retrieve any data from server,
11741
11742
11743 ---
11744
11745 * [HBASE-17448](https://issues.apache.org/jira/browse/HBASE-17448) | *Major* | **Export metrics from RecoverableZooKeeper**
11746
11747 Committed to master and branch-1
11748
11749
11750 ---
11751
11752 * [HBASE-19400](https://issues.apache.org/jira/browse/HBASE-19400) | *Major* | **Add missing security checks in MasterRpcServices**
11753
11754 Added ACL check to following Admin functions:
11755 enableCatalogJanitor, runCatalogJanitor, cleanerChoreSwitch, runCleanerChore, execProcedure, execProcedureWithReturn, normalize, normalizerSwitch, coprocessorService.
11756 When ACL is enabled, only those with ADMIN rights will be able to invoke these operations successfully.
11757
11758
11759 ---
11760
11761 * [HBASE-20048](https://issues.apache.org/jira/browse/HBASE-20048) | *Blocker* | **Revert serial replication feature**
11762
11763 Revert the serial replication feature from all branches. Plan to reimplement it soon and land onto 2.1 release line.
11764
11765
11766 ---
11767
11768 * [HBASE-19166](https://issues.apache.org/jira/browse/HBASE-19166) | *Blocker* | **AsyncProtobufLogWriter persists ProtobufLogWriter as class name for backward compatibility**
11769
11770 For backward compatibility, AsyncProtobufLogWriter uses "ProtobufLogWriter" as writer class name and SecureAsyncProtobufLogWriter uses "SecureProtobufLogWriter" as writer class name.
11771
11772
11773 ---
11774
11775 * [HBASE-18596](https://issues.apache.org/jira/browse/HBASE-18596) | *Blocker* | **[TEST] A hbase1 cluster should be able to replicate to a hbase2 cluster; verify**
11776
11777 Replication between versions verified as basically working. 0.98.25-SNAPSHOT to beta-2 hbase2 and a 1.2-ish version tried.
11778
11779
11780 ---
11781
11782 * [HBASE-20017](https://issues.apache.org/jira/browse/HBASE-20017) | *Blocker* | **BufferedMutatorImpl submit the same mutation repeatedly**
11783
11784 This change fixes multithreading issues in the implementation of BufferedMutator. BufferedMutator should not be used with 1.4 releases prior to 1.4.2.
11785
11786
11787 ---
11788
11789 * [HBASE-20032](https://issues.apache.org/jira/browse/HBASE-20032) | *Minor* | **Receving multiple warnings for missing reporting.plugins.plugin.version**
11790
11791 Add (latest) version elements missing from reporting plugins in top-level pom.
11792
11793
11794 ---
11795
11796 * [HBASE-19954](https://issues.apache.org/jira/browse/HBASE-19954) | *Major* | **Separate TestBlockReorder into individual tests to avoid ShutdownHook suppression error against hadoop3**
11797
11798 hadoop3 minidfscluster removes all shutdown handlers when the cluster goes down which made this test that does FS-stuff fail (Fix was to break up the test so each test method ran with an unadulterated FS).
11799
11800
11801 ---
11802
11803 * [HBASE-20014](https://issues.apache.org/jira/browse/HBASE-20014) | *Major* | **TestAdmin1 Times out**
11804
11805 Ups the overall test timeout from 10 minutes to 13minutes. 15minutes is the surefire timeout.
11806
11807
11808 ---
11809
11810 * [HBASE-20020](https://issues.apache.org/jira/browse/HBASE-20020) | *Critical* | **Make sure we throw DoNotRetryIOException when ConnectionImplementation is closed**
11811
11812 Add checkClosed to core Client methods. Avoid unnecessary retry.
11813
11814
11815 ---
11816
11817 * [HBASE-19978](https://issues.apache.org/jira/browse/HBASE-19978) | *Major* | **The keepalive logic is incomplete in ProcedureExecutor**
11818
11819 Completes keep-alive logic and then enables it; ProcedureExecutor Workers will spin up more threads when need settling back to the core count after the burst in demand has passed. Default keep-alive is one minute. Default core-count is CPUs/4 or 16, which ever is greater. Maximum is an arbitrary core-count \* 10 (a limit that should never be hit and if it is, there is something else very wrong).
11820
11821
11822 ---
11823
11824 * [HBASE-19950](https://issues.apache.org/jira/browse/HBASE-19950) | *Minor* | **Introduce a ColumnValueFilter**
11825
11826 ColumnValueFilter provides a way to fetch matched cells only by providing specified column, value and a comparator, which is different from SingleValueFilter, fetching an entire row as soon as a matched cell found.
11827
11828
11829 ---
11830
11831 * [HBASE-18294](https://issues.apache.org/jira/browse/HBASE-18294) | *Major* | **Reduce global heap pressure: flush based on heap occupancy**
11832
11833 A region is flushed if its memory component exceeds the region flush threshold.
11834 A flush policy decides which stores to flush by comparing the size of the store to a column-family-flush threshold.
11835 If the overall size of all memstores in the machine exceeds the bounds defined by the administrator (denoted global pressure) a region is selected and flushed.
11836 HBASE-18294 changes flush decisions to be based on heap-occupancy and not data (key-value) size, consistently across levels. This rolls back some of the changes by HBASE-16747. Specifically,
11837 (1) RSs, Regions and stores track their overall on-heap and off-heap occupancy,
11838 (2) A region is flushed when its on-heap+off-heap size exceeds the region flush threshold specified in hbase.hregion.memstore.flush.size,
11839 (3) The store to be flushed is chosen based on its on-heap+off-heap size
11840 (4) At the RS level, a flush is triggered when the overall on-heap exceeds the on-heap limit, or when the overall off-heap size exceeds the off-heap limit (low/high water marks).
11841
11842 Note that when the region flush size is set to XXmb a region flush may be triggered even before writing keys and values of size XX because the total heap occupancy of the region which includes additional metadata exceeded the threshold.
11843
11844
11845 ---
11846
11847 * [HBASE-19116](https://issues.apache.org/jira/browse/HBASE-19116) | *Critical* | **Currently the tail of hfiles with CellComparator\* classname makes it so hbase1 can't open hbase2 written hfiles; fix**
11848
11849 hbase-2.x sets KeyValue Comparators into the tail of hfiles rather than CellComparator, what it uses internally, just so hbase-1.x can continue to read hbase-2.x written hfiles.
11850
11851
11852 ---
11853
11854 * [HBASE-19948](https://issues.apache.org/jira/browse/HBASE-19948) | *Major* | **Since HBASE-19873, HBaseClassTestRule, Small/Medium/Large has different semantic**
11855
11856 In subtask, fixed doc and annotations to be more explicit that test timings are for the whole Test Fixture/Test Class/Test Suite NOT the test method only as we'd measuring up to this (tother subtasks untethered Categorization and test timeout such that all categories now have a ten minute timeout -- no test can run longer than ten minutes or it gets killed/timedout).
11857
11858
11859 ---
11860
11861 * [HBASE-16060](https://issues.apache.org/jira/browse/HBASE-16060) | *Blocker* | **1.x clients cannot access table state talking to 2.0 cluster**
11862
11863 By default, we mirror table state to zookeeper so hbase-1.x clients will work against an hbase-2 cluster (With this patch, hbase-1.x clients can do most Admin functions including table create; hbase-1.x clients can do all Table/DML against hbase-2 cluster).
11864
11865 Flag to disable mirroring is hbase.mirror.table.state.to.zookeeper; set it to false in Configuration.
11866
11867 Related, Master on startup will look to see if there are table state znodes left over by an hbase-1 instance. If any found, it will migrate the table state to hbase-2 setting the state into the hbase:meta table where table state is now kept. We will do this check on every Master start. Notion is that this will be overall beneficial with low impediment. To disable the migration check, set hbase.migrate.table.state.from.zookeeper to false.
11868
11869
11870 ---
11871
11872 * [HBASE-19900](https://issues.apache.org/jira/browse/HBASE-19900) | *Critical* | **Region-level exception destroy the result of batch**
11873
11874 This fix makes the following changes to how client handle the both of action result and region exception.
11875 1) honor the action result rather than region exception. If the action have both of true result and region exception, the action is fine as the exception is caused by other actions which are in the same region.
11876 2) honor the action exception rather than region exception. If the action have both of action exception and region exception, we deal with the action exception only. If we also handle the region exception for the same action, it will introduce the negative count of actions in progress. The AsyncRequestFuture#waitUntilDone will block forever.
11877
11878
11879 ---
11880
11881 * [HBASE-19841](https://issues.apache.org/jira/browse/HBASE-19841) | *Major* | **Tests against hadoop3 fail with StreamLacksCapabilityException**
11882
11883 HBaseTestingUtility now assumes that all clusters will use local storage until a MiniDFSCluster is started or assigned.
11884
11885
11886 ---
11887
11888 * [HBASE-19528](https://issues.apache.org/jira/browse/HBASE-19528) | *Major* | **Major Compaction Tool**
11889
11890 Tool allows you to compact a cluster with given concurrency of regionservers compacting at a given time.  If tool completes successfully everything requested for compaction will be compacted, regardless of region moves, splits and merges.
11891
11892
11893 ---
11894
11895 * [HBASE-19919](https://issues.apache.org/jira/browse/HBASE-19919) | *Major* | **Tidying up logging**
11896
11897 (I thought this change innocuous but I made work for a co-worker when I upped interval between log cleaner runs -- meant a smoke test failed because we were slow doing an expected cleanup).
11898
11899 Edit of log lines removing redundancy. Shorten thread names shown in log.  Made some log TRACE instead of DEBUG.  Capitalizations.
11900
11901 Upped log cleaner interval from every minute to every ten minutes. hbase.master.cleaner.interval
11902
11903 Lowered default count of threads started by Procedure Executor from count of CPUs to 1/4 of count of CPUs.
11904
11905
11906 ---
11907
11908 * [HBASE-19901](https://issues.apache.org/jira/browse/HBASE-19901) | *Major* | **Up yetus proclimit on nightlies**
11909
11910 Pass to yetus a dockermemlimit of 20G and a proclimit of 10000. Defaults are 4G and 1G respectively.
11911
11912
11913 ---
11914
11915 * [HBASE-19912](https://issues.apache.org/jira/browse/HBASE-19912) | *Minor* | **The flag "writeToWAL" of Region#checkAndRowMutate is useless**
11916
11917 Remove useless 'writeToWAL' flag of Region#checkAndRowMutate & related class
11918
11919
11920 ---
11921
11922 * [HBASE-19911](https://issues.apache.org/jira/browse/HBASE-19911) | *Major* | **Convert some tests from small to medium because they are timing out: TestNettyRpcServer, TestClientClusterStatus, TestCheckTestClasses**
11923
11924 Changed a few tests so they are medium sized rather than small size.
11925
11926 Also, upped the time we wait on small tests to 60seconds from 30seconds. Small tests are tests that run in 15seconds or less. What we changed was the timeout watcher. It is now more lax, more tolerant of dodgy infrastructure that might be running tests slowly.
11927
11928
11929 ---
11930
11931 * [HBASE-19892](https://issues.apache.org/jira/browse/HBASE-19892) | *Major* | **Checking 'patch attach' and yetus 0.7.0 and move to Yetus 0.7.0**
11932
11933 Moved our internal yetus reference from 0.6.0 to 0.7.0. Concurrently, I changed hadoopqa to run with 0.7.0 (by editing the config in jenkins).
11934
11935
11936 ---
11937
11938 * [HBASE-19873](https://issues.apache.org/jira/browse/HBASE-19873) | *Major* | **Add a CategoryBasedTimeout ClassRule for all UTs**
11939
11940 Along with @category -- small, medium, large -- all hbase tests must now carry a ClassRule as follows:
11941
11942 +  @ClassRule
11943 +  public static final HBaseClassTestRule CLASS\_RULE =
11944 +      HBaseClassTestRule.forClass(TestInterfaceAudienceAnnotations.class);
11945
11946 where the class changes by test.
11947
11948 Currently the classrule enforces timeout for the whole test suite -- i.e. if a SmallTest Category then all the tests in the TestSuite must complete inside 60seconds, the timeout we set on SmallTest Category test suite -- but is meant to be a repository for general, runtime, hbase test facility.
11949
11950
11951 ---
11952
11953 * [HBASE-19770](https://issues.apache.org/jira/browse/HBASE-19770) | *Critical* | **Add '--return-values' option to Shell to print return values of commands in interactive mode**
11954
11955 Introduces a new option to the HBase shell: -r, --return-values. When the shell is in "interactive" mode (default), the return value of shell commands are not returned to the user as they dirty the console output. For those who desire this functionality, the "--return-values" option restores the old functionality of the commands passing their return value to the user.
11956
11957
11958 ---
11959
11960 * [HBASE-15321](https://issues.apache.org/jira/browse/HBASE-15321) | *Major* | **Ability to open a HRegion from hdfs snapshot.**
11961
11962 HRegion.openReadOnlyFileSystemHRegion() provides the ability to open HRegion from a read-only hdfs snapshot.  Because hdfs snapshots are read-only, no cleanup happens when using this API.
11963
11964
11965 ---
11966
11967 * [HBASE-17513](https://issues.apache.org/jira/browse/HBASE-17513) | *Critical* | **Thrift Server 1 uses different QOP settings than RPC and Thrift Server 2 and can easily be misconfigured so there is no encryption when the operator expects it.**
11968
11969 This change fixes an issue where users could have unintentionally configured the HBase Thrift1 server to run without wire-encryption, when they believed they had configured the Thrift1 server to do so.
11970
11971
11972 ---
11973
11974 * [HBASE-19828](https://issues.apache.org/jira/browse/HBASE-19828) | *Major* | **Flakey TestRegionsOnMasterOptions.testRegionsOnAllServers**
11975
11976 Disables TestRegionsOnMasterOptions because Regions on Master does not work reliably; see HBASE-19831.
11977
11978
11979 ---
11980
11981 * [HBASE-18963](https://issues.apache.org/jira/browse/HBASE-18963) | *Major* | **Remove MultiRowMutationProcessor and implement mutateRows... methods using batchMutate()**
11982
11983 Modified HRegion.mutateRow() APIs to use batchMutate() instead of processRowsWithLocks() with MultiRowMutationProcessor. MultiRowMutationProcessor is removed to have single write path that uses batchMutate().
11984
11985
11986 ---
11987
11988 * [HBASE-19163](https://issues.apache.org/jira/browse/HBASE-19163) | *Major* | **"Maximum lock count exceeded" from region server's batch processing**
11989
11990 When there are many mutations against the same row in a batch, as each mutation will acquire a shared row lock, it will exceed the maximum shared lock count the java ReadWritelock supports (64k). Along with other optimization, the batch is divided into multiple possible minibatches. A new config is added to limit the maximum number of mutations in the minibatch.
11991
11992    \<property\>
11993     \<name\>hbase.regionserver.minibatch.size\</name\>
11994     \<value\>20000\</value\>
11995    \</property\>
11996 The default value is 20000.
11997
11998
11999 ---
12000
12001 * [HBASE-19739](https://issues.apache.org/jira/browse/HBASE-19739) | *Minor* | **Include thrift IDL files in HBase binary distribution**
12002
12003 Thrift IDLs are now shipped, bundled up in the respective hbase-\*thrift.jars (look for files ending in .thrift).
12004
12005
12006 ---
12007
12008 * [HBASE-11409](https://issues.apache.org/jira/browse/HBASE-11409) | *Major* | **Add more flexibility for input directory structure to LoadIncrementalHFiles**
12009
12010 Allows for users to bulk load entire tables from hdfs by specifying the parameter -loadTable.  This allows you to pass in a table level directory and have all regions column families bulk loaded, if you do not specify the -loadTable parameter LoadIncrementalHFiles will work as before. Note: you must have a pre-created table to run with -loadTable it will not create one for you.
12011
12012
12013 ---
12014
12015 * [HBASE-19769](https://issues.apache.org/jira/browse/HBASE-19769) | *Critical* | **IllegalAccessError on package-private Hadoop metrics2 classes in MapReduce jobs**
12016
12017 Client-side ZooKeeper metrics which were added to 2.0.0 alpha/beta releases cause issues when launching MapReduce jobs via {{yarn jar}} on the command line. This stems from ClassLoader separation issues that YARN implements. It was chosen that the easiest solution was to remove these ZooKeeper metrics entirely.
12018
12019
12020 ---
12021
12022 * [HBASE-19783](https://issues.apache.org/jira/browse/HBASE-19783) | *Minor* | **Change replication peer cluster key/endpoint from a not-null value to null is not allowed**
12023
12024 To reduce the confusing behavior, now when you call updatePeerConfig with empty ClusterKey or ReplicationEndpointImpl, but the value of field of the to-be-updated ReplicationPeerConfig is not null, we will throw exception instead of ignoring them.
12025
12026
12027 ---
12028
12029 * [HBASE-19483](https://issues.apache.org/jira/browse/HBASE-19483) | *Major* | **Add proper privilege check for rsgroup commands**
12030
12031 This JIRA aims at refactoring AccessController, using ACL as core library in CPs.
12032 1. Stripping out a public class AccessChecker from AccessController, using ACL as core library in CPs. AccessChecker don't have any dependency on anything CP related. Create it's instance from other CPS.
12033 2. Change the default value of hbase.security.authorization to false.
12034 3. Don't use CP hooks to check access in RSGroup. Use the access checker instance directly in functions of RSGroupAdminServiceImpl.
12035
12036
12037 ---
12038
12039 * [HBASE-19358](https://issues.apache.org/jira/browse/HBASE-19358) | *Major* | **Improve the stability of splitting log when do fail over**
12040
12041 After HBASE-19358 we introduced a new property hbase.split.writer.creation.bounded to limit the opening writers for each WALSplitter. If set to true, we won't open any writer for recovered.edits until the entries accumulated in memory reaching hbase.regionserver.hlog.splitlog.buffersize (which defaults at 128M) and will write and close the file in one go instead of keeping the writer open. It's false by default and we recommend to set it to true if your cluster has a high region load (like more than 300 regions per RS), especially when you observed obvious NN/HDFS slow down during hbase (single RS or cluster) failover.
12042
12043
12044 ---
12045
12046 * [HBASE-19651](https://issues.apache.org/jira/browse/HBASE-19651) | *Minor* | **Remove LimitInputStream**
12047
12048 HBase had copied from guava the file LmiitedInputStream. This commit removes the copied file in favor of (our internal, shaded) guava's ByteStreams.limit. Guava 14.0's LIS noted: "Use ByteStreams.limit(java.io.InputStream, long) instead. This class is scheduled to be removed in Guava release 15.0."
12049
12050
12051 ---
12052
12053 * [HBASE-19691](https://issues.apache.org/jira/browse/HBASE-19691) | *Critical* | **Do not require ADMIN permission for obtaining ClusterStatus**
12054
12055 This change reverts an unintentional requirement for global ADMIN permission to obtain cluster status from the active HMaster.
12056
12057
12058 ---
12059
12060 * [HBASE-19486](https://issues.apache.org/jira/browse/HBASE-19486) | *Major* | ** Periodically ensure records are not buffered too long by BufferedMutator**
12061
12062 The BufferedMutator now supports two settings that are used to ensure records do not stay too long in the buffer of a BufferedMutator. For periodically flushing the BufferedMutator there is now a "Timeout": "How old may the oldest record in the buffer be before we force a flush" and a "TimerTick": How often do we check if the timeout has been exceeded. Using these settings you can make the BufferedMutator automatically flush the write buffer if after the specified number of milliseconds no flush has occurred.
12063
12064 This is mainly useful in streaming scenarios (i.e. writing data into HBase using Apache Flink/Beam/Storm) where it is common (especially in a test/development situation) to see small unpredictable bursts of data that need to be written into HBase. When using the BufferedMutator till now the effect was that records would remain in the write buffer until the buffer was full or an explicit flush was triggered. In practice this would mean that the 'last few records' of a burst would remain in the write buffer until the next burst arrives filling the buffer to capacity and thus triggering a flush.
12065
12066
12067 ---
12068
12069 * [HBASE-19670](https://issues.apache.org/jira/browse/HBASE-19670) | *Major* | **Workaround: Purge User API building from branch-2 so can make a beta-1**
12070
12071 Disable filtering of User API based off yetus annotation done in doclet. See parent issue for build failure currently being worked on but not done in time for a beta-1.
12072
12073
12074 ---
12075
12076 * [HBASE-19282](https://issues.apache.org/jira/browse/HBASE-19282) | *Major* | **CellChunkMap Benchmarking and User Interface**
12077
12078 When MSLAB is in use (that is the default config) , we will always use the CellChunkMap indexing variant for in memory flushed Immutable segments. When MSLAB is turned off, we will use CellAraryMap. These can not be changed with any configs.  The in memory flush threshold been made to be default to 10% of region flush size. This can be turned using 'hbase.memstore.inmemoryflush.threshold.factor'.
12079
12080
12081 ---
12082
12083 * [HBASE-19628](https://issues.apache.org/jira/browse/HBASE-19628) | *Major* | **ByteBufferCell should extend ExtendedCell**
12084
12085 ByteBufferCell → ByteBufferExtendedCell
12086 MapReduceCell → MapReduceExtendedCell
12087 ByteBufferChunkCell → ByteBufferChunkKeyValue
12088 NoTagByteBufferChunkCell → NoTagByteBufferChunkKeyValue
12089 KeyOnlyByteBufferCell → KeyOnlyByteBufferExtendedCell
12090 TagRewriteByteBufferCell → TagRewriteByteBufferExtendedCell
12091 ValueAndTagRewriteByteBufferCell → ValueAndTagRewriteByteBufferExtendedCell
12092 EmptyByteBufferCell → EmptyByteBufferExtendedCell
12093 FirstOnRowByteBufferCell → FirstOnRowByteBufferExtendedCell
12094 LastOnRowByteBufferCell → LastOnRowByteBufferExtendedCell
12095 FirstOnRowColByteBufferCell → FirstOnRowColByteBufferExtendedCell
12096 FirstOnRowColTSByteBufferCell → FirstOnRowColTSByteBufferExtendedCell
12097 LastOnRowColByteBufferCell → LastOnRowColByteBufferCell
12098 OffheapDecodedCell → OffheapDecodedExtendedCell
12099
12100
12101 ---
12102
12103 * [HBASE-19576](https://issues.apache.org/jira/browse/HBASE-19576) | *Major* | **Introduce builder for ReplicationPeerConfig and make it immutable**
12104
12105 Add a ReplicationPeerConfigBuilder to create ReplicationPeerConfig and make ReplicationPeerConfig immutable. Meanwhile, deprecated set\* methods in ReplicationPeerConfig.
12106
12107
12108 ---
12109
12110 * [HBASE-10092](https://issues.apache.org/jira/browse/HBASE-10092) | *Critical* | **Move to slf4j**
12111
12112 We now have slf4j as our front-end. Be careful adding logging from here on out; make sure it slf4j.
12113
12114 From here on out, as us devs go, we need to convert log messages from being 'guarded' -- i.e. surrounded by if (LOG.isDebugEnabled...) -- to instead being parameterized log messages. e.g. the latter rather than the former in the below:
12115
12116 logger.debug("The new entry is "+entry+".");
12117 logger.debug("The new entry is {}.", entry);
12118
12119 See [1] for background on perf benefits.
12120
12121 Note, FATAL log level is not present in slf4j. It is noted as a Marker but won't show in logs as a LEVEL.
12122
12123 1.  https://www.slf4j.org/faq.html#logging\_performance
12124
12125
12126 ---
12127
12128 * [HBASE-19148](https://issues.apache.org/jira/browse/HBASE-19148) | *Blocker* | **Reevaluate default values of configurations**
12129
12130 Removed unused hbase.fs.tmp.dir from hbase-default.xml.
12131
12132 Upped hbase.master.fileSplitTimeout from 30s to 10minutes (suggested by production experience)
12133
12134 Added note that handler-count should be ~CPU count.
12135
12136 hbase.regionserver.logroll.multiplier has been changed from 0.95 to 0.5 AND the default block size has been doubled.
12137
12138 A few of the core configs are now dumped to the log on startup.
12139
12140
12141 ---
12142
12143 * [HBASE-19492](https://issues.apache.org/jira/browse/HBASE-19492) | *Major* | **Add EXCLUDE\_NAMESPACE and EXCLUDE\_TABLECFS support to replication peer config**
12144
12145 Add two new field:  EXCLUDE\_NAMESPACE and EXCLUDE\_TABLECFS to replication peer config.
12146
12147 If replicate\_all flag is true, it means all user tables will be replicated to peer cluster. Then allow config exclude namespaces or exclude table-cfs which can't be replicated to  peer cluster.
12148
12149 If replicate\_all flag is false, it means all user tables can't be replicated to peer cluster. Then allow to config namespaces or table-cfs which will be replicated to peer cluster.
12150
12151
12152 ---
12153
12154 * [HBASE-19494](https://issues.apache.org/jira/browse/HBASE-19494) | *Major* | **Create simple WALKey filter that can be plugged in on the Replication Sink**
12155
12156 Adds means of adding very basic filter on the sink side of replication. We already have a means of installing filter source-side, which is better place to filter edits before they are shipped over the network, but this facility is needed by hbase-indexer.
12157
12158 Set hbase.replication.sink.walentrysinkfilter with a no-param Constructor implementation. See test in patch for example.
12159
12160
12161 ---
12162
12163 * [HBASE-19112](https://issues.apache.org/jira/browse/HBASE-19112) | *Blocker* | **Suspect methods on Cell to be deprecated**
12164
12165 Adds method Cell#getType which returns enum describing Cell Type.
12166
12167 Deprecates the following Cell methods:
12168
12169  getTypeByte
12170  getSequenceId
12171  getTagsArray
12172  getTagsOffset
12173  getTagsLength
12174
12175 CPs trying to build cells should use RawCellBuilderFactory that supports  building cells with tags.
12176
12177
12178 ---
12179
12180 * [HBASE-14790](https://issues.apache.org/jira/browse/HBASE-14790) | *Major* | **Implement a new DFSOutputStream for logging WAL only**
12181
12182 Implement a FanOutOneBlockAsyncDFSOutput for writing WAL only, the WAL provider which uses this class is AsyncFSWALProvider.
12183
12184 It is based on netty, and will write to 3 DNs at the same time concurrently(fan-out) so generally it will lead to a lower latency. And it is also fail-fast, the stream will become unwritable immediately after there are any read/write errors, no pipeline recovery. You need to call recoverLease to force close the output for this case. And it only supports to write a file with a single block. For WAL this is a good behavior as we can always open a new file when the old one is broken. The performance analysis in HBASE-16890 shows that it has a better performance.
12185
12186 Behavior changes:
12187 1. As now we write to 3 DNs concurrently, according to the visibility guarantee of HDFS, the data will be available immediately when arriving at DN since all the DNs will be considered as the last one in pipeline. This means replication may read uncommitted data and replicate it to the remote cluster and cause data inconsistency. HBASE-14004 is used to solve the problem.
12188 2. There will be no sync failure. When the output is broken, we will open a new file and write all the unacked wal entries to the new file. This means that we may have duplicated entries in wal files. HBASE-14949 is used to solve this problem.
12189
12190
12191 ---
12192
12193 * [HBASE-15536](https://issues.apache.org/jira/browse/HBASE-15536) | *Critical* | **Make AsyncFSWAL as our default WAL**
12194
12195 Now the default WALProvider is AsyncFSWALProvider, i.e. 'asyncfs'.
12196 If you want to change back to use FSHLog, please add this in hbase-site.xml
12197 {code}
12198 \<property\>
12199 \<name\>hbase.wal.provider\</name\>
12200 \<value\>filesystem\</value\>
12201 \</property\>
12202 {code}
12203 If you want to use FSHLog with multiwal, please add this in hbase-site.xml
12204 {code}
12205 \<property\>
12206 \<name\>hbase.wal.regiongrouping.delegate.provider\</name\>
12207 \<value\>filesystem\</value\>
12208 \</property\>
12209 {code}
12210
12211 This patch also sets hbase.wal.async.use-shared-event-loop to false so WAL has its own netty event group.
12212
12213
12214 ---
12215
12216 * [HBASE-19462](https://issues.apache.org/jira/browse/HBASE-19462) | *Major* | **Deprecate all addImmutable methods in Put**
12217
12218 Deprecates Put#addImmutable as of release 2.0.0, this will be removed in HBase 3.0.0. Use {@link #add(Cell)} and {@link org.apache.hadoop.hbase.CellBuilder} instead
12219
12220
12221 ---
12222
12223 * [HBASE-19213](https://issues.apache.org/jira/browse/HBASE-19213) | *Minor* | **Align check and mutate operations in Table and AsyncTable**
12224
12225 In Table interface deprecate checkAndPut, checkAndDelete and checkAndMutate methods.
12226 Similarly to AsyncTable a new method was added to replace the deprecated ones: CheckAndMutateBuilder checkAndMutate(byte[] row, byte[] family) with CheckAndMutateBuilder interface which can be used to construct the checkAnd\*() operations.
12227
12228
12229 ---
12230
12231 * [HBASE-19134](https://issues.apache.org/jira/browse/HBASE-19134) | *Major* | **Make WALKey an Interface; expose Read-Only version to CPs**
12232
12233 Made WALKey an Interface and added a WALKeyImpl implementation. WALKey comes through to Coprocessors. WALKey is read-only.
12234
12235
12236 ---
12237
12238 * [HBASE-18169](https://issues.apache.org/jira/browse/HBASE-18169) | *Blocker* | **Coprocessor fix and cleanup before 2.0.0 release**
12239
12240 Refactor of Coprocessor API for hbase2. Purged methods that exposed too much of our internals. Other hooks were recast so they no longer took or returned internal classes; instead we pass Interfaces or read-only versions of implementations.
12241
12242 Here is some overview doc on changes in hbase2 for Coprocessors including detail on why the change was made:
12243 https://github.com/apache/hbase/blob/branch-2.0/dev-support/design-docs/Coprocessor\_Design\_Improvements-Use\_composition\_instead\_of\_inheritance-HBASE-17732.adoc
12244
12245
12246 ---
12247
12248 * [HBASE-19301](https://issues.apache.org/jira/browse/HBASE-19301) | *Major* | **Provide way for CPs to create short circuited connection with custom configurations**
12249
12250 Provided a way for the CP users to create a short circuitable connection with custom configs.
12251
12252 createConnection(Configuration) is added to MasterCoprocessorEnvironment, RegionServerCoprocessorEnvironment and RegionCoprocessorEnvironment.
12253
12254 The getConnection() method already available in these Env interfaces returns the cluster connection used by the server (which the server also uses) where as this new method will create a new connection on request. The difference from connection created using ConnectionFactory APIs is that this connection can short circuit the calls to same server avoiding the RPC paths. The connection will NOT be cached/maintained by server. That should be done the CPs.
12255
12256 Be careful creating Connections out of a Coprocessor. See the javadoc on these createConnection and getConnection.
12257
12258
12259 ---
12260
12261 * [HBASE-19357](https://issues.apache.org/jira/browse/HBASE-19357) | *Major* | **Bucket cache no longer L2 for LRU cache**
12262
12263 Removed cacheDataInL1 option for HCD
12264 BucketCache is no longer the L2 for LRU on heap cache. When BC is used, data blocks will be strictly on BC only where as index/bloom blocks are on LRU L1 cache.
12265 Config 'hbase.bucketcache.combinedcache.enabled' is removed. There is no way set combined mode = false. Means make BC as victim handler for LRU cache.
12266 This will be one more noticeable change when one uses BucketCache in File mode.  Then the system table's data block(Including the META table)  will be cached in Bucket Cache files only. Plain scan from META files alone test reveal that the throughput of file mode BC is almost half only.  But for META entries we have RegionLocation cache at client side connections. So this would not be a big concern in a real cluster usage. Will check more on this and probably fix even when we do tiered BucketCache.
12267
12268
12269 ---
12270
12271 * [HBASE-19430](https://issues.apache.org/jira/browse/HBASE-19430) | *Major* | **Remove the SettableTimestamp and SettableSequenceId**
12272
12273 All the cells which are used in server side are of ExtendedCell now.
12274
12275
12276 ---
12277
12278 * [HBASE-19295](https://issues.apache.org/jira/browse/HBASE-19295) | *Major* | **The Configuration returned by CPEnv should be read-only.**
12279
12280 CoprocessorEnvironment#getConfiguration returns a READ-ONLY Configuration. Attempts at altering the returned Configuration -- whether setting or adding resources -- will result in an IllegalStateException warning of the Read-only condition of the returned Configuration.
12281
12282
12283 ---
12284
12285 * [HBASE-19410](https://issues.apache.org/jira/browse/HBASE-19410) | *Major* | **Move zookeeper related UTs to hbase-zookeeper and mark them as ZKTests**
12286
12287 There is a new HBaseZKTestingUtility which can only start a mini zookeeper cluster. And we will publish sources for test-jar for all modules.
12288
12289
12290 ---
12291
12292 * [HBASE-19323](https://issues.apache.org/jira/browse/HBASE-19323) | *Major* | **Make netty engine default in hbase2**
12293
12294 NettyRpcServer is now our default RPC server replacing SimpleRpcServer.
12295
12296
12297 ---
12298
12299 * [HBASE-19426](https://issues.apache.org/jira/browse/HBASE-19426) | *Major* | **Move has() and setTimestamp() to Mutation**
12300
12301 Moves #has and #setTimestamp back up to Mutation from the subclass Put so available to other Mutation implementations.
12302
12303
12304 ---
12305
12306 * [HBASE-19384](https://issues.apache.org/jira/browse/HBASE-19384) | *Critical* | **Results returned by preAppend hook in a coprocessor are replaced with null from other coprocessor even on bypass**
12307
12308 When a coprocessor sets 'bypass', we will skip calling subsequent Coprocessors that may be stacked-up on the method invocation; e.g. if a prePut has three coprocessors hooked up, if the first coprocessor decides to set 'bypass', we will not call the two subsequent coprocessors (this is similar to the 'complete' functionality that was in hbase1, removed in hbase2).
12309
12310
12311 ---
12312
12313 * [HBASE-19408](https://issues.apache.org/jira/browse/HBASE-19408) | *Trivial* | **Remove WALActionsListener.Base**
12314
12315 1) remove the WALActionsListener.Base
12316 2) provide default method implementation to WALActionsListener
12317 The person who want to receive the notification of WAL events should implements the WALActionsListener rather than WALActionsListener.Base.
12318
12319
12320 ---
12321
12322 * [HBASE-19339](https://issues.apache.org/jira/browse/HBASE-19339) | *Critical* | **Eager policy results in the negative size of memstore**
12323
12324 Enable TestAcidGuaranteesWithEagerPolicy and TestAcidGuaranteesWithAdaptivePolicy
12325
12326
12327 ---
12328
12329 * [HBASE-19336](https://issues.apache.org/jira/browse/HBASE-19336) | *Major* | **Improve rsgroup to allow assign all tables within a specified namespace by only writing namespace**
12330
12331 Add two new shell cmd.
12332 move\_namespaces\_rsgroup is used to reassign tables of specified namespaces from one RegionServer group to another.
12333 move\_servers\_namespaces\_rsgroup is used to reassign regionServers and tables of specified namespaces from one group to another.
12334
12335
12336 ---
12337
12338 * [HBASE-19285](https://issues.apache.org/jira/browse/HBASE-19285) | *Critical* | **Add per-table latency histograms**
12339
12340 Per-RegionServer table latency histograms have been returned to HBase (after being removed due to impacting performance). These metrics are exposed via a new JMX bean "TableLatencies" with the typical naming conventions: namespace, table, and histogram component.
12341
12342
12343 ---
12344
12345 * [HBASE-19359](https://issues.apache.org/jira/browse/HBASE-19359) | *Major* | **Revisit the default config of hbase client retries number**
12346
12347 The default value of hbase.client.retries.number was 35. It is now 10.
12348 And for server side, the default hbase.client.serverside.retries.multiplier was 10. So the server side retries number was 35 \* 10 = 350. It is now 3.
12349
12350
12351 ---
12352
12353 * [HBASE-18090](https://issues.apache.org/jira/browse/HBASE-18090) | *Major* | **Improve TableSnapshotInputFormat to allow more multiple mappers per region**
12354
12355 In this task, we make it possible to run multiple mappers per region in the table snapshot. The following code is primary table snapshot mapper initializatio:
12356
12357 TableMapReduceUtil.initTableSnapshotMapperJob(
12358           snapshotName,                     // The name of the snapshot (of a table) to read from
12359           scan,                                      // Scan instance to control CF and attribute selection
12360           mapper,                                 // mapper
12361           outputKeyClass,                   // mapper output key
12362           outputValueClass,                // mapper output value
12363           job,                                       // The current job to adjust
12364           true,                                     // upload HBase jars and jars for any of the configured job classes via the distributed cache (tmpjars)
12365           restoreDir,                           // a temporary directory to copy the snapshot files into
12366 );
12367
12368 The job only run one map task per region in the table snapshot. With this feature, client can specify the desired num of mappers when init table snapshot mapper job：
12369
12370 TableMapReduceUtil.initTableSnapshotMapperJob(
12371           snapshotName,                     // The name of the snapshot (of a table) to read from
12372           scan,                                      // Scan instance to control CF and attribute selection
12373           mapper,                                 // mapper
12374           outputKeyClass,                   // mapper output key
12375           outputValueClass,                // mapper output value
12376           job,                                       // The current job to adjust
12377           true,                                     // upload HBase jars and jars for any of the configured job classes via the distributed cache (tmpjars)
12378           restoreDir,                           // a temporary directory to copy the snapshot files into
12379           splitAlgorithm,                     // splitAlgo algorithm to split, current split algorithms  support RegionSplitter.UniformSplit() and RegionSplitter.HexStringSplit()
12380           n                                         // how many input splits to generate per one region
12381 );
12382
12383
12384 ---
12385
12386 * [HBASE-19035](https://issues.apache.org/jira/browse/HBASE-19035) | *Major* | **Miss metrics when coprocessor use region scanner to read data**
12387
12388 1. Move read requests count to region level. Because RegionScanner is exposed to CP.
12389 2. Update write requests count in processRowsWithLocks.
12390 3. Remove requestRowActionCount in RSRpcServices. This metric can be computed by region's readRequestsCount and writeRequestsCount.
12391
12392
12393 ---
12394
12395 * [HBASE-19318](https://issues.apache.org/jira/browse/HBASE-19318) | *Critical* | **MasterRpcServices#getSecurityCapabilities explicitly checks for the HBase AccessController implementation**
12396
12397 Fixes an issue with loading customer coprocessor endpoint implementations inside of the HBase Master which breaks Apache Ranger.
12398
12399
12400 ---
12401
12402 * [HBASE-19092](https://issues.apache.org/jira/browse/HBASE-19092) | *Critical* | **Make Tag IA.LimitedPrivate and expose for CPs**
12403
12404 This JIRA aims at exposing Tags for Coprocessor usage.
12405 Tag interface is now exposed to Coprocessors and CPs can make use of this interface to create their own Tags.
12406 RawCell is a new interface that is a subtype of Cell and that is exposed to CPs. RawCell has the following APIs
12407
12408 List\<Tag\> getTags()
12409 Optional\<Tag\> getTag(byte type)
12410 byte[] cloneTags()
12411
12412 The above APIs helps to read tags from the Cell.
12413
12414 CellUtil#createCell(Cell cell, List\<Tag\> tags)
12415 CellUtil#createCell(Cell cell, byte[] tags)
12416 CellUtil#createCell(Cell cell, byte[] value, byte[] tags)
12417 are deprecated.
12418 If CPs want to create a cell with Tags they can use the RegionCoprocessorEnvironment#getCellBuilder() that returns an ExtendedCellBuilder.
12419 Using ExtendedCellBuilder the CP can create Cells with Tags. Other helper methods to work on Tags are available as static APIs in Tag interface.
12420
12421
12422 ---
12423
12424 * [HBASE-19266](https://issues.apache.org/jira/browse/HBASE-19266) | *Minor* | **TestAcidGuarantees should cover adaptive in-memory compaction**
12425
12426 separate the TestAcidGuarantees by the policy:
12427 1) NONE -\> TestAcidGuaranteesWithNoInMemCompaction
12428 2) BASIC -\> TestAcidGuaranteesWithBasicPolicy
12429 3) EAGER -\> TestAcidGuaranteesWithEagerPolicy
12430 4) ADAPTIVE -\> TestAcidGuaranteesWithAdaptivePolicy
12431
12432 TestAcidGuaranteesWithEagerPolicy and TestAcidGuaranteesWithAdaptivePolicy are disabled by default as the eager policy may cause the negative size of memstore.
12433
12434
12435 ---
12436
12437 * [HBASE-16868](https://issues.apache.org/jira/browse/HBASE-16868) | *Critical* | **Add a replicate\_all flag to avoid misuse the namespaces and table-cfs config of replication peer**
12438
12439 Add a replicate\_all flag to replication peer config. The default value is true, which means all user tables (REPLICATION\_SCOPE != 0 ) will be replicated to peer cluster.
12440
12441 How to config a peer from replicate all to only replicate special namespace/tablecfs?
12442 Step1. Add a new peer with no namespace/tablecfs config, the replicate\_all flag will be true automatically.
12443 Step2. User want only replicate some namespaces or tables, so set replicate\_all flag to false first.
12444 Step3. Add special namespaces or table-cfs config to the replication peer.
12445
12446 How to config a peer from replicate special namespace/tablecfs to replicate all?
12447 Step1. Add a new peer with special namespace/tablecfs config, the replicate\_all flag will be false automatically.
12448 Step2. User want replicate all user tables, so remove the special namespace/tablecfs config first.
12449 Step3. Set replicate\_all flag to true.
12450
12451 How to config replicate nothing?
12452 Set replicate\_all flag to false and no namespace/tablecfs config, then all tables cannot be replicated to peer cluster.
12453
12454
12455 ---
12456
12457 * [HBASE-19122](https://issues.apache.org/jira/browse/HBASE-19122) | *Critical* | **preCompact and preFlush can bypass by returning null scanner; shut it down**
12458
12459 Remove the ability to 'bypass' preFlush and preCompact by returning a null Scanner. Bypass is disallowed on these methods in hbase2.
12460
12461
12462 ---
12463
12464 * [HBASE-19200](https://issues.apache.org/jira/browse/HBASE-19200) | *Major* | **make hbase-client only depend on ZKAsyncRegistry and ZNodePaths**
12465
12466 ConnectionImplementation now uses asynchronous connections to zookeeper via ZKAsyncRegistry to get cluster id, master address, meta region location, etc.
12467 Since ZKAsyncRegistry uses curator framework, this change purges a lot of zookeeper dependencies in hbase-client.
12468 Now hbase-client only depends on only ZKAsyncRegistry, ZNodePaths and the newly introduced ZKMetadata.
12469
12470
12471 ---
12472
12473 * [HBASE-19311](https://issues.apache.org/jira/browse/HBASE-19311) | *Major* | **Promote TestAcidGuarantees to LargeTests and start mini cluster once to make it faster**
12474
12475 Introduce a AcidGuaranteesTestTool and expose as tool instead of TestAcidGuarantees. Now TestAcidGuarantees is just a UT.
12476
12477
12478 ---
12479
12480 * [HBASE-19293](https://issues.apache.org/jira/browse/HBASE-19293) | *Major* | **Support adding a new replication peer in disabled state**
12481
12482 Add a boolean parameter which means the new replication peer's state is enabled or disabled for Admin/AsyncAdmin's addReplicationPeer method. Meanwhile, you can use shell cmd to add a enabled/disabled replication peer. The STATE parameter is optional and the default state is enabled.
12483
12484 hbase\> add\_peer '1', CLUSTER\_KEY =\> "server1.cie.com:2181:/hbase", STATE =\> "ENABLED"
12485 hbase\> add\_peer '1', CLUSTER\_KEY =\> "server1.cie.com:2181:/hbase", STATE =\> "DISABLED"
12486
12487
12488 ---
12489
12490 * [HBASE-19123](https://issues.apache.org/jira/browse/HBASE-19123) | *Major* | **Purge 'complete' support from Coprocesor Observers**
12491
12492 This issue removes the 'complete' facility that was in ObserverContext. It is no longer possible for a Coprocessor to cut the chain-of-invocation and insist its response prevails.
12493
12494
12495 ---
12496
12497 * [HBASE-18911](https://issues.apache.org/jira/browse/HBASE-18911) | *Major* | **Unify Admin and AsyncAdmin's methods name**
12498
12499 Deprecated 4 methods for Admin interface.
12500 Deprecated compactRegionServer(ServerName, boolean). Use compactRegionServer(ServerName) and majorCompactcompactRegionServer(ServerName) instead.
12501 Deprecated getRegionLoad(ServerName) method. Use getRegionLoads(ServerName) instead.
12502 Deprecated getRegionLoad(ServerName, TableName) method. Use getRegionLoads(ServerName, TableName) instead.
12503 Deprecated getQuotaRetriever(QuotaFilter) instead. Use  getQuota(QuotaFilter) instead.
12504
12505 Add 7 methods for Admin interface.
12506 ServerName getMaster();
12507 Collection\<ServerName\> getBackupMasters();
12508 Collection\<ServerName\> getRegionServers();
12509 boolean splitSwitch(boolean enabled, boolean synchronous);
12510 boolean mergeSwitch(boolean enabled, boolean synchronous);
12511 boolean isSplitEnabled();
12512 boolean isMergeEnabled();
12513
12514
12515 ---
12516
12517 * [HBASE-18703](https://issues.apache.org/jira/browse/HBASE-18703) | *Critical* | **Inconsistent behavior for preBatchMutate in doMiniBatchMutate and processRowsWithLocks**
12518
12519 Two write paths Region.batchMutate() and Region.mutateRows() are unified and inconsistencies are resolved.
12520
12521
12522 ---
12523
12524 * [HBASE-18964](https://issues.apache.org/jira/browse/HBASE-18964) | *Major* | **Deprecate RowProcessor and processRowsWithLocks() APIs that take RowProcessor as an argument**
12525
12526 RowProcessor and Region#processRowsWithLocks() methods that take RowProcessor as an argument are deprecated. Use Coprocessors if you want to customize handling.
12527
12528
12529 ---
12530
12531 * [HBASE-19251](https://issues.apache.org/jira/browse/HBASE-19251) | *Major* | **Merge RawAsyncTable and AsyncTable**
12532
12533 Merge the RawAsyncTable and AsyncTable interfaces. Use generic to reflection the difference between the observer style scan API. For the implementation which does not have a user specified thread pool, the observer is AdvancedScanResultConsumer. For the implementation which needs a user specified thread pool, the observer is ScanResultConsumer.
12534
12535
12536 ---
12537
12538 * [HBASE-19262](https://issues.apache.org/jira/browse/HBASE-19262) | *Major* | **Revisit checkstyle rules**
12539
12540 Change the import order rule that now we should put the shaded import at bottom. Ignore the VisibilityModifier warnings for test code.
12541
12542
12543 ---
12544
12545 * [HBASE-19187](https://issues.apache.org/jira/browse/HBASE-19187) | *Minor* | **Remove option to create on heap bucket cache**
12546
12547 Removing the on heap Bucket cache feature.
12548 The config "hbase.bucketcache.ioengine" no longer support the 'heap' value.
12549 Its supported values now are 'offheap',  'file:\<path\>', 'files:\<path\>'  and 'mmap:\<path\>'
12550
12551
12552 ---
12553
12554 * [HBASE-12350](https://issues.apache.org/jira/browse/HBASE-12350) | *Minor* | **Backport error-prone build support to branch-1 and branch-2**
12555
12556 This change introduces compile time support for running the error-prone suite of static analyses. Enable with -PerrorProne on the Maven command line. Requires JDK 8 or higher. (Don't enable if building with JDK 7.)
12557
12558
12559 ---
12560
12561 * [HBASE-14350](https://issues.apache.org/jira/browse/HBASE-14350) | *Blocker* | **Procedure V2 Phase 2: Assignment Manager**
12562
12563 (Incomplete)
12564
12565 = Incompatbiles
12566
12567 == Coprocessor Incompatibilities
12568
12569 Split/Merge have moved to the Master; it runs them now. Means hooks around Split/Merge are now noops. To intercept Split/Merge phases, CPs need to intercept on MasterObserver.
12570
12571
12572 ---
12573
12574 * [HBASE-19189](https://issues.apache.org/jira/browse/HBASE-19189) | *Major* | **Ad-hoc test job for running a subset of tests lots of times**
12575
12576 <!-- markdown -->
12577
12578
12579 Folks can now test out tests on an arbitrary release branch. Head over to [builds.a.o job "HBase-adhoc-run-tests"](https://builds.apache.org/view/H-L/view/HBase/job/HBase-adhoc-run-tests/), then pick "Build with parameters".
12580 Tests are specified as just names e.g. TestLogRollingNoCluster. can also be a glob. e.g. TestHFile*
12581
12582
12583 ---
12584
12585 * [HBASE-19220](https://issues.apache.org/jira/browse/HBASE-19220) | *Major* | **Async tests time out talking to zk; 'clusterid came back null'**
12586
12587 Changed retries from 3 to 30 for zk initial connect for registry.
12588
12589
12590 ---
12591
12592 * [HBASE-19002](https://issues.apache.org/jira/browse/HBASE-19002) | *Minor* | **Introduce more examples to show how to intercept normal region operations**
12593
12594 With the change in Coprocessor APIs, the hbase-examples module has been updated to provide additional examples that show how to write Coprocessors against the new API.
12595
12596
12597 ---
12598
12599 * [HBASE-18961](https://issues.apache.org/jira/browse/HBASE-18961) | *Major* | **doMiniBatchMutate() is big, split it into smaller methods**
12600
12601 HRegion.batchMutate()/ doMiniBatchMutate() is refactored with aim to unify batchMutate() and mutateRows() code paths later. batchMutate() currently handles 2 types of batches: MutationBatchOperations and ReplayBatchOperations. Common base class BatchOperations is augmented with common methods which are overridden in derived classes as needed. doMiniBatchMutate() is implemented using common methods in base class BatchOperations.
12602
12603
12604 ---
12605
12606 * [HBASE-19103](https://issues.apache.org/jira/browse/HBASE-19103) | *Minor* | **Add BigDecimalComparator for filter**
12607
12608 If BigDecimal is stored as value, and you need to add a matched comparator to the value filter when scanning, a BigDecimalComparator can be used.
12609
12610
12611 ---
12612
12613 * [HBASE-19111](https://issues.apache.org/jira/browse/HBASE-19111) | *Critical* | **Add missing CellUtil#isPut(Cell) methods**
12614
12615 A new public API method was added to CellUtil "isPut(Cell)" for clients to use to determine if the Cell is for a Put operation.
12616
12617 Additionally, other CellUtil API calls which expose Cell-implementation were marked as deprecated and will be removed in a future version.
12618
12619
12620 ---
12621
12622 * [HBASE-19160](https://issues.apache.org/jira/browse/HBASE-19160) | *Critical* | **Re-expose CellComparator**
12623
12624 CellComparator is now InterfaceAudience.Public
12625
12626
12627 ---
12628
12629 * [HBASE-19131](https://issues.apache.org/jira/browse/HBASE-19131) | *Major* | **Add the ClusterStatus hook and cleanup other hooks which can be replaced by ClusterStatus hook**
12630
12631 1) Add preGetClusterStatus() and postGetClusterStatus() hooks
12632 2) add preGetClusterStatus() to access control check - an admin action
12633
12634
12635 ---
12636
12637 * [HBASE-19095](https://issues.apache.org/jira/browse/HBASE-19095) | *Major* | **Add CP hooks in RegionObserver for in memory compaction**
12638
12639 Add 4 methods in RegionObserver:
12640 preMemStoreCompaction
12641 preMemStoreCompactionCompactScannerOpen
12642 preMemStoreCompactionCompact
12643 postMemStoreCompaction
12644 preMemStoreCompaction and postMemStoreCompaction will always be called for all in memory compactions. Under eager mode, preMemStoreCompactionCompactScannerOpen will be called before opening store scanner to allow you changing the max versions and TTL, and preMemStoreCompactionCompact will be called after the creation to let you do wrapping.
12645
12646
12647 ---
12648
12649 * [HBASE-19152](https://issues.apache.org/jira/browse/HBASE-19152) | *Trivial* | **Update refguide 'how to build an RC' and the make\_rc.sh script**
12650
12651 The make\_rc.sh script can run an hbase2 build now generating tarballs and pushing up to maven repository. TODO: Sign and checksum, check tarball, push to apache dist.....
12652
12653
12654 ---
12655
12656 * [HBASE-19179](https://issues.apache.org/jira/browse/HBASE-19179) | *Critical* | **Remove hbase-prefix-tree**
12657
12658 Purged the hbase-prefix-tree module and all references from the code base.
12659
12660 prefix-tree data block encoding was a super cool experimental feature that saw some usage initially but has since languished. If interested in carrying this sweet facility forward, write the dev list and we'll restore this module.
12661
12662
12663 ---
12664
12665 * [HBASE-19176](https://issues.apache.org/jira/browse/HBASE-19176) | *Major* | **Remove hbase-native-client from branch-2**
12666
12667 Removed the hbase-native-client module from branch-2 (it is still in Master). It is not complete. Look for a finished C++ client in the near future. Will restore native client to branch-2 at that point.
12668
12669
12670 ---
12671
12672 * [HBASE-19144](https://issues.apache.org/jira/browse/HBASE-19144) | *Major* | **[RSgroups] Retry assignments in FAILED\_OPEN state when servers (re)join the cluster**
12673
12674 When regionserver placement groups (RSGroups) is active, as servers join the cluster the Master will attempt to reassign regions in FAILED\_OPEN state.
12675
12676
12677 ---
12678
12679 * [HBASE-18770](https://issues.apache.org/jira/browse/HBASE-18770) | *Critical* | **Remove bypass method in ObserverContext and implement the 'bypass' logic case by case**
12680
12681 Removes blanket bypass mechanism (Observer#bypass). Instead, a curated subset of methods are bypassable.
12682
12683     Changes Coprocessor ObserverContext 'bypass' semantic. We flip the
12684     default so bypass is NOT supported on Observer invocations; only a
12685     couple of preXXX methods in RegionObserver allow it: e.g.  preGet
12686     and prePut but not preFlush, etc. Everywhere else, we throw
12687     a Exception if a Coprocessor Observer tries to invoke bypass. Master
12688     Observers can no longer stop or change move, split, assign, create table, etc.
12689     preBatchMutate can no longer be bypassed (bypass the finer-grained
12690     prePut, preDelete, etc. instead)
12691
12692     Ditto on complete, the mechanism that allowed a Coprocessor
12693     rule that all subsequent Coprocessors are skipped in an
12694     invocation chain; now, complete is only available to
12695     bypassable methods (and Coprocessors will get an exception if
12696     they try to 'complete' when it is not allowed).
12697
12698     See javadoc for whether a Coprocessor Observer method supports
12699     'bypass'. If no mention, 'bypass' is NOT supported.
12700
12701 The below methods have been marked deprecated in hbase2. We would have liked to have removed them because they use IA.Private parameters but they are in use by CoreCoprocessors or are critical to downstreamers and we have no alternatives to provide currently.
12702
12703 @Deprecated public boolean prePrepareTimeStampForDeleteVersion(final Mutation mutation, final Cell kv, final byte[] byteNow, final Get get) throws IOException {
12704
12705 @Deprecated public boolean preWALRestore(final RegionInfo info, final WALKey logKey, final WALEdit logEdit) throws IOException {
12706
12707 @Deprecated public void postWALRestore(final RegionInfo info, final WALKey logKey, final WALEdit logEdit) throws IOException {
12708
12709 @Deprecated public DeleteTracker postInstantiateDeleteTracker(DeleteTracker result) throws IOException
12710
12711 Metrics are updated now even if the Coprocessor does a bypass; e.g. The put count is updated even if a Coprocessor bypasses the core put operation (We do it this way so no need for Coprocessors to have access to our core metrics system).
12712
12713
12714 ---
12715
12716 * [HBASE-19033](https://issues.apache.org/jira/browse/HBASE-19033) | *Blocker* | **Allow CP users to change versions and TTL before opening StoreScanner**
12717
12718 Add back the three methods without a return value:
12719 preFlushScannerOpen
12720 preCompactScannerOpen
12721 preStoreScannerOpen
12722
12723 Introduce a ScanOptions interface to let CP users change the max versions and TTL of a ScanInfo. It will be passed as a parameter in the three methods above.
12724
12725 Inntroduce a new example WriteHeavyIncrementObserver which convert increment to put and do aggregating when get. It uses the above three methods.
12726
12727
12728 ---
12729
12730 * [HBASE-19110](https://issues.apache.org/jira/browse/HBASE-19110) | *Minor* | **Add default for Server#isStopping & #getFileSystem**
12731
12732 Made defaults for Server#isStopping and Server#getFileSystem. Should have done this when I added them (lesson learned, was actually mentioned in a review).
12733
12734
12735 ---
12736
12737 * [HBASE-19047](https://issues.apache.org/jira/browse/HBASE-19047) | *Critical* | **CP exposed Scanner types should not extend Shipper**
12738
12739 RegionObserver#preScannerOpen signature changed
12740 RegionScanner preScannerOpen( ObserverContext\<RegionCoprocessorEnvironment\> c, Scan scan,  RegionScanner s)   -\>   void preScannerOpen( ObserverContext\<RegionCoprocessorEnvironment\> c, Scan scan)
12741 The pre hook can no longer return a RegionScanner instance.
12742
12743
12744 ---
12745
12746 * [HBASE-18995](https://issues.apache.org/jira/browse/HBASE-18995) | *Critical* | **Move methods that are for internal usage from CellUtil to Private util class**
12747
12748 Split CellUtil into public CellUtil and PrivateCellUtil for Internal use only.
12749
12750
12751 ---
12752
12753 * [HBASE-18906](https://issues.apache.org/jira/browse/HBASE-18906) | *Critical* | **Provide Region#waitForFlushes API**
12754
12755 Provided an API in Region (Exposed to CPs)
12756 boolean waitForFlushes(long timeout)
12757 This call will make the current thread to be waiting for all flushes in this region to be finished.  (Upto the time out time being specified). The boolean return value specify whether the flushes are really over or the time out being elapsed. Return false when timeout elapsed but flushes are not over or  true when flushes are over
12758
12759
12760 ---
12761
12762 * [HBASE-18905](https://issues.apache.org/jira/browse/HBASE-18905) | *Major* | **Allow CPs to request flush on Region and know the completion of the requested flush**
12763
12764 Add a FlushLifeCycleTracker which is similiar to CompactionLifeCycleTracker for tracking flush.
12765 Add a requestFlush method in Region interface to let CP users request flush on a region. The operation is asynchronous, you need to use the FlushLifeCycleTracker to track the flush.
12766 The difference with CompactionLifeCycleTracker is that, flush is per region so we do not use Store as a parameter of the methods. And also, notExecuted means the whole flush has not been executed, and afterExecution means the whole flush has been finished, so we do not have a separated completed method. A flush will be ended either by notExecuted or afterExecution.
12767
12768
12769 ---
12770
12771 * [HBASE-19048](https://issues.apache.org/jira/browse/HBASE-19048) | *Major* | **Cleanup MasterObserver hooks which takes IA private params**
12772
12773 Purged InterfaceAudience.Private parameters from methods in MasterObserver.
12774
12775 preAbortProcedure no longer takes a ProcedureExecutor.
12776
12777 postGetProcedures no longer takes a list of Procedures.
12778
12779 postGetLocks no longer takes a list of locks.
12780
12781 preRequestLock and postRequestLock no longer take lock type.
12782
12783 preLockHeartbeat and postLockHeartbeat no longer takes a lock procedure.
12784
12785 The implication is that that the Coprocessors that depended on these params have had to coarsen so for example, the AccessController can not do access per Procedure or Lock but rather, makes a judgement on the general access (You'll need to be ADMIN to see list of procedures and locks).
12786
12787
12788 ---
12789
12790 * [HBASE-18994](https://issues.apache.org/jira/browse/HBASE-18994) | *Major* | **Decide if META/System tables should use Compacting Memstore or Default Memstore**
12791
12792 Added a new config 'hbase.systemtables.compacting.memstore.type"  for the system tables. By default all the system tables will have 'NONE' as the type and so it will be using the default memstore by default.
12793 {code}
12794  \<property\>
12795     \<name\>hbase.systemtables.compacting.memstore.type\</name\>
12796     \<value\>NONE\</value\>
12797   \</property\>
12798 {code}
12799
12800
12801 ---
12802
12803 * [HBASE-19029](https://issues.apache.org/jira/browse/HBASE-19029) | *Critical* | **Align RPC timout methods in Table and AsyncTableBase**
12804
12805 Deprecate the following methods in Table:
12806 - int getRpcTimeout()
12807 - int getReadRpcTimeout()
12808 - int getWriteRpcTimeout()
12809 - int getOperationTimeout()
12810
12811 Add the following methods to Table:
12812 - long getRpcTimeout(TimeUnit)
12813 - long getReadRpcTimeout(TimeUnit)
12814 - long getWriteRpcTimeout(TimeUnit)
12815 - long getOperationTimeout(TimeUnit)
12816
12817 Add missing deprecation tag for long getRpcTimeout(TimeUnit unit) in AsyncTableBase
12818
12819
12820 ---
12821
12822 * [HBASE-18410](https://issues.apache.org/jira/browse/HBASE-18410) | *Major* | **FilterList  Improvement.**
12823
12824 In this task, we fixed all existing bugs in FilterList, and did the code refactor which ensured interface compatibility .
12825
12826 The primary bug  fixes are :
12827 1. For sub-filter in FilterList with MUST\_PASS\_ONE, if previous filterKeyValue() of sub-filter returns NEXT\_COL, we cannot make sure that the next cell will be the first cell in next column, because FilterList choose the minimal forward step among sub-filters, and it may return a SKIP. so here we add an extra check to ensure that the next cell will match preivous return code for sub-filters.
12828 2. Previous logic about transforming cell of FilterList is incorrect, we should set the previous transform result (rather than the given cell in question) as the initial vaule of transform cell before call filterKeyValue() of FilterList.
12829 3. Handle the ReturnCodes which the previous code did not handle.
12830
12831 About code refactor, we divided the FilterList into two separated sub-classes: FilterListWithOR and FilterListWithAND,  The FilterListWithOR has been optimised to choose the next minimal step to seek cell rather than SKIP cell one by one, and the FilterListWithAND  has been optimised to choose the next maximal key to seek among sub-filters in filter list. All in all, The code in FilterList is clean and easier to follow now.
12832
12833 Note that ReturnCode NEXT\_ROW has been redefined as skipping to next row in current family,   not to next row in all family. it’s more reasonable, because ReturnCode is a concept in store level, not in region level.
12834
12835 Another bug that needs attention is: filterAllRemaining() in FilterList with MUST\_PASS\_ONE  will now return false if the filter list is empty whereas earlier it used to return true for Operator.MUST\_PASS\_ONE.  it's more reasonable now.
12836
12837
12838 ---
12839
12840 * [HBASE-19077](https://issues.apache.org/jira/browse/HBASE-19077) | *Critical* | **Have Region\*CoprocessorEnvironment provide an ImmutableOnlineRegions**
12841
12842 Adds getOnlineRegions to the RegionCoprocessorEnvironment (Context) and ditto to RegionServerCoprocessorEnvironment. Allows Coprocessor get list of Regions online on the currently hosting RegionServer.
12843
12844
12845 ---
12846
12847 * [HBASE-19021](https://issues.apache.org/jira/browse/HBASE-19021) | *Critical* | **Restore a few important missing logics for balancer in 2.0**
12848
12849 Re-enabled 'hbase.master.loadbalance.bytable', default 'false'.
12850 Draining servers are removed from consideration by blancer.balanceCluster() call.
12851
12852
12853 ---
12854
12855 * [HBASE-19049](https://issues.apache.org/jira/browse/HBASE-19049) | *Major* | **Update kerby to 1.0.1 GA release**
12856
12857 HBase now relies on Kerby version 1.0.1 for its test environment. No downstream facing change is expected.
12858
12859
12860 ---
12861
12862 * [HBASE-16290](https://issues.apache.org/jira/browse/HBASE-16290) | *Major* | **Dump summary of callQueue content; can help debugging**
12863
12864 Patch to print summary of call queues by size and count. This is displayed on the debug dump page of region server UI
12865
12866
12867 ---
12868
12869 * [HBASE-18846](https://issues.apache.org/jira/browse/HBASE-18846) | *Major* | **Accommodate the hbase-indexer/lily/SEP consumer deploy-type**
12870
12871 Makes it so hbase-indexer/lily can move off dependence on internal APIs and instead move to public APIs.
12872
12873 Adds being able to disable near-all HRegionServer services. This along with an existing plugin mechanism which allows configuring the RegionServer to host an alternate Connection implementation, makes it so we can put up a cluster of hollowed-out HRegionServers purposed to pose as a Replication Sink for a source HBase Cluster (Users do not need to figure our RPC, our PB encodings, build a distributed service, etc.). In the alternate supplied Connection implementation, hbase-indexer would install its own code to catch the Replication.
12874
12875 Below and attached are sample hbase-server.xml files and alternate Connection implementations. To start up an HRegionServer as a sink, first make sure there is a ZooKeeper ensemble we can talk to. If none, just start one:
12876 {code}
12877 ./bin/hbase-daemon.sh start zookeeper
12878 {code}
12879
12880 To start up a single RegionServer, put in place the below sample hbase-site.xml and a derviative of the below IndexerConnection on the CLASSPATH, and then start the RegionServer:
12881 {code}
12882 ./bin/hbase-daemon.sh  start  org.apache.hadoop.hbase.regionserver.HRegionServer
12883 {code}
12884 Stdout and Stderr will go into files under configured logs directory. Browse to localhost:16030 to find webui (unless disabled).
12885
12886 DETAILS
12887
12888 This patch adds configuration to disable RegionServer internal Services, Managers, Caches, etc., starting up.
12889
12890 By default a RegionServer starts up an Admin and Client Service. To disable either or both, use the below booleans:
12891 {code}
12892 hbase.regionserver.admin.service
12893 hbase.regionserver.client.service
12894 {code}
12895
12896 Both default true.
12897
12898 To make a HRegionServer startup and stay up without expecting to communicate with a master, set the below boolean to false:
12899
12900 {code}
12901 hbase.masterless
12902 {code]
12903 Default is false.
12904
12905 h3. Sample hbase-site.xml that disables internal HRegionServer Services
12906 Below is an example hbase-site.xml that turns off most Services and that then installs an alternate Connection implementation, one that is nulled out in all regards except in being able to return a "Table" that can catch a Replication Stream in its {code}batch(List\<? extends Row\> actions, Object[] results){code} method. i.e. what the hbase-indexer wants. I also add the example alternate Connection implementation below (both of these files are also attached to this issue). Expects there to be an up and running zookeeper ensemble.
12907
12908 {code}
12909 \<configuration\>
12910   \<!-- This file is an example for hbase-indexer. It shuts down
12911        facility in the regionserver and interjects a special
12912        Connection implementation which is how hbase-indexer will
12913        receive the replication stream from source hbase cluster.
12914        See the class referenced in the config.
12915
12916        Most of the config in here is booleans set to off and
12917        setting values to zero so services doon't start. Some of
12918        the flags are new via this patch.
12919 --\>
12920   \<!--Need this for the RegionServer to come up standalone--\>
12921   \<property\>
12922     \<name\>hbase.cluster.distributed\</name\>
12923     \<value\>true\</value\>
12924   \</property\>
12925
12926   \<!--This is what you implement, a Connection that returns a Table that
12927        overrides the batch call. It is at this point you do your indexer inserts.
12928     --\>
12929   \<property\>
12930     \<name\>hbase.client.connection.impl\</name\>
12931     \<value\>org.apache.hadoop.hbase.client.IndexerConnection\</value\>
12932     \<description\>A customs connection implementation just so we can interject our
12933       own Table class, one that has an override for the batch call which receives
12934       the replication stream edits; i.e. it is called by the replication sink
12935       #replicateEntries method.\</description\>
12936   \</property\>
12937
12938   \<!--Set hbase.regionserver.info.port to -1 for no webui--\>
12939
12940   \<!--Below are configs to shut down unused services in hregionserver--\>
12941   \<property\>
12942     \<name\>hbase.regionserver.admin.service\</name\>
12943     \<value\>false\</value\>
12944     \<description\>Do NOT stand up an Admin Service Interface on RPC\</description\>
12945   \</property\>
12946   \<property\>
12947     \<name\>hbase.regionserver.client.service\</name\>
12948     \<value\>false\</value\>
12949     \<description\>Do NOT stand up a client-facing Service on RPC\</description\>
12950   \</property\>
12951   \<property\>
12952     \<name\>hbase.wal.provider\</name\>
12953     \<value\>org.apache.hadoop.hbase.wal.DisabledWALProvider\</value\>
12954     \<description\>Set WAL service to be the null WAL\</description\>
12955   \</property\>
12956   \<property\>
12957     \<name\>hbase.regionserver.workers\</name\>
12958     \<value\>false\</value\>
12959     \<description\>Turn off all background workers, log splitters, executors, etc.\</description\>
12960   \</property\>
12961   \<property\>
12962     \<name\>hfile.block.cache.size\</name\>
12963     \<value\>0.0001\</value\>
12964     \<description\>Turn off block cache completely\</description\>
12965   \</property\>
12966   \<property\>
12967     \<name\>hbase.mob.file.cache.size\</name\>
12968     \<value\>0\</value\>
12969     \<description\>Disable MOB cache.\</description\>
12970   \</property\>
12971   \<property\>
12972     \<name\>hbase.masterless\</name\>
12973     \<value\>true\</value\>
12974     \<description\>Do not expect Master in cluster.\</description\>
12975   \</property\>
12976   \<property\>
12977     \<name\>hbase.regionserver.metahandler.count\</name\>
12978     \<value\>1\</value\>
12979     \<description\>How many priority handlers to run; we probably need none.
12980     Default is 20 which is too much on a server like this.\</description\>
12981   \</property\>
12982   \<property\>
12983     \<name\>hbase.regionserver.replication.handler.count\</name\>
12984     \<value\>1\</value\>
12985     \<description\>How many replication handlers to run; we probably need none.
12986     Default is 3 which is too much on a server like this.\</description\>
12987   \</property\>
12988   \<property\>
12989     \<name\>hbase.regionserver.handler.count\</name\>
12990     \<value\>10\</value\>
12991     \<description\>How many default handlers to run; tie to # of CPUs.
12992     Default is 30 which is too much on a server like this.\</description\>
12993   \</property\>
12994   \<property\>
12995     \<name\>hbase.ipc.server.read.threadpool.size\</name\>
12996     \<value\>3\</value\>
12997     \<description\>How many Listener request reaaders to run; tie to a portion # of CPUs (1/4?).
12998     Default is 10 which is too much on a server like this.\</description\>
12999   \</property\>
13000 \</configuration\>
13001 {code}
13002
13003 h2. Sample Connection Implementation
13004 Has call-out for where an hbase-indexer would insert its capture code.
13005 {code}
13006 package org.apache.hadoop.hbase.client;
13007
13008 import com.google.protobuf.Descriptors;
13009 import com.google.protobuf.Message;
13010 import com.google.protobuf.Service;
13011 import com.google.protobuf.ServiceException;
13012 import org.apache.hadoop.conf.Configuration;
13013 import org.apache.hadoop.hbase.CompareOperator;
13014 import org.apache.hadoop.hbase.HTableDescriptor;
13015 import org.apache.hadoop.hbase.TableName;
13016 import org.apache.hadoop.hbase.client.coprocessor.Batch;
13017 import org.apache.hadoop.hbase.filter.CompareFilter;
13018 import org.apache.hadoop.hbase.ipc.CoprocessorRpcChannel;
13019 import org.apache.hadoop.hbase.security.User;
13020
13021 import java.io.IOException;
13022 import java.util.List;
13023 import java.util.Map;
13024 import java.util.concurrent.ExecutorService;
13025
13026
13027 /\*\*
13028  \* Sample class for hbase-indexer.
13029  \* DO NOT COMMIT TO HBASE CODEBASE!!!
13030  \* Overrides Connection just so we can return a Table that has the
13031  \* method that the replication sink calls, i.e. Table#batch.
13032  \* It is at this point that the hbase-indexer catches the replication
13033  \* stream so it can insert into the lucene index.
13034  \*/
13035 public class IndexerConnection implements Connection {
13036   private final Configuration conf;
13037   private final User user;
13038   private final ExecutorService pool;
13039   private volatile boolean closed = false;
13040
13041   public IndexerConnection(Configuration conf, ExecutorService pool, User user) throws IOException {
13042     this.conf = conf;
13043     this.user = user;
13044     this.pool = pool;
13045   }
13046
13047   @Override
13048   public void abort(String why, Throwable e) {}
13049
13050   @Override
13051   public boolean isAborted() {
13052     return false;
13053   }
13054
13055   @Override
13056   public Configuration getConfiguration() {
13057     return this.conf;
13058   }
13059
13060   @Override
13061   public BufferedMutator getBufferedMutator(TableName tableName) throws IOException {
13062     return null;
13063   }
13064
13065   @Override
13066   public BufferedMutator getBufferedMutator(BufferedMutatorParams params) throws IOException {
13067     return null;
13068   }
13069
13070   @Override
13071   public RegionLocator getRegionLocator(TableName tableName) throws IOException {
13072     return null;
13073   }
13074
13075   @Override
13076   public Admin getAdmin() throws IOException {
13077     return null;
13078   }
13079
13080   @Override
13081   public void close() throws IOException {
13082     if (!this.closed) this.closed = true;
13083   }
13084
13085   @Override
13086   public boolean isClosed() {
13087     return this.closed;
13088   }
13089
13090   @Override
13091   public TableBuilder getTableBuilder(final TableName tn, ExecutorService pool) {
13092     if (isClosed()) {
13093       throw new RuntimeException("IndexerConnection is closed.");
13094     }
13095     final Configuration passedInConfiguration = getConfiguration();
13096     return new TableBuilder() {
13097       @Override
13098       public TableBuilder setOperationTimeout(int timeout) {
13099         return null;
13100       }
13101
13102       @Override
13103       public TableBuilder setRpcTimeout(int timeout) {
13104         return null;
13105       }
13106
13107       @Override
13108       public TableBuilder setReadRpcTimeout(int timeout) {
13109         return null;
13110       }
13111
13112       @Override
13113       public TableBuilder setWriteRpcTimeout(int timeout) {
13114         return null;
13115       }
13116
13117       @Override
13118       public Table build() {
13119         return new Table() {
13120           private final Configuration conf = passedInConfiguration;
13121           private final TableName tableName = tn;
13122
13123           @Override
13124           public TableName getName() {
13125             return this.tableName;
13126           }
13127
13128           @Override
13129           public Configuration getConfiguration() {
13130             return this.conf;
13131           }
13132
13133           @Override
13134           public void batch(List\<? extends Row\> actions, Object[] results)
13135           throws IOException, InterruptedException {
13136             // Implementation goes here.
13137           }
13138
13139           @Override
13140           public HTableDescriptor getTableDescriptor() throws IOException {
13141             return null;
13142           }
13143
13144           @Override
13145           public TableDescriptor getDescriptor() throws IOException {
13146             return null;
13147           }
13148
13149           @Override
13150           public boolean exists(Get get) throws IOException {
13151             return false;
13152           }
13153
13154           @Override
13155           public boolean[] existsAll(List\<Get\> gets) throws IOException {
13156             return new boolean[0];
13157           }
13158
13159           @Override
13160           public \<R\> void batchCallback(List\<? extends Row\> actions, Object[] results, Batch.Callback\<R\> callback) throws IOException, InterruptedException {
13161
13162           }
13163
13164           @Override
13165           public Result get(Get get) throws IOException {
13166             return null;
13167           }
13168
13169           @Override
13170           public Result[] get(List\<Get\> gets) throws IOException {
13171             return new Result[0];
13172           }
13173
13174           @Override
13175           public ResultScanner getScanner(Scan scan) throws IOException {
13176             return null;
13177           }
13178
13179           @Override
13180           public ResultScanner getScanner(byte[] family) throws IOException {
13181             return null;
13182           }
13183
13184           @Override
13185           public ResultScanner getScanner(byte[] family, byte[] qualifier) throws IOException {
13186             return null;
13187           }
13188
13189           @Override
13190           public void put(Put put) throws IOException {
13191
13192           }
13193
13194           @Override
13195           public void put(List\<Put\> puts) throws IOException {
13196
13197           }
13198
13199           @Override
13200           public boolean checkAndPut(byte[] row, byte[] family, byte[] qualifier, byte[] value, Put put) throws IOException {
13201             return false;
13202           }
13203
13204           @Override
13205           public boolean checkAndPut(byte[] row, byte[] family, byte[] qualifier, CompareFilter.CompareOp compareOp, byte[] value, Put put) throws IOException {
13206             return false;
13207           }
13208
13209           @Override
13210           public boolean checkAndPut(byte[] row, byte[] family, byte[] qualifier, CompareOperator op, byte[] value, Put put) throws IOException {
13211             return false;
13212           }
13213
13214           @Override
13215           public void delete(Delete delete) throws IOException {
13216
13217           }
13218
13219           @Override
13220           public void delete(List\<Delete\> deletes) throws IOException {
13221
13222           }
13223
13224           @Override
13225           public boolean checkAndDelete(byte[] row, byte[] family, byte[] qualifier, byte[] value, Delete delete) throws IOException {
13226             return false;
13227           }
13228
13229           @Override
13230           public boolean checkAndDelete(byte[] row, byte[] family, byte[] qualifier, CompareFilter.CompareOp compareOp, byte[] value, Delete delete) throws IOException {
13231             return false;
13232           }
13233
13234           @Override
13235           public boolean checkAndDelete(byte[] row, byte[] family, byte[] qualifier, CompareOperator op, byte[] value, Delete delete) throws IOException {
13236             return false;
13237           }
13238
13239           @Override
13240           public void mutateRow(RowMutations rm) throws IOException {
13241
13242           }
13243
13244           @Override
13245           public Result append(Append append) throws IOException {
13246             return null;
13247           }
13248
13249           @Override
13250           public Result increment(Increment increment) throws IOException {
13251             return null;
13252           }
13253
13254           @Override
13255           public long incrementColumnValue(byte[] row, byte[] family, byte[] qualifier, long amount) throws IOException {
13256             return 0;
13257           }
13258
13259           @Override
13260           public long incrementColumnValue(byte[] row, byte[] family, byte[] qualifier, long amount, Durability durability) throws IOException {
13261             return 0;
13262           }
13263
13264           @Override
13265           public void close() throws IOException {
13266
13267           }
13268
13269           @Override
13270           public CoprocessorRpcChannel coprocessorService(byte[] row) {
13271             return null;
13272           }
13273
13274           @Override
13275           public \<T extends Service, R\> Map\<byte[], R\> coprocessorService(Class\<T\> service, byte[] startKey, byte[] endKey, Batch.Call\<T, R\> callable) throws ServiceException, Throwable {
13276             return null;
13277           }
13278
13279           @Override
13280           public \<T extends Service, R\> void coprocessorService(Class\<T\> service, byte[] startKey, byte[] endKey, Batch.Call\<T, R\> callable, Batch.Callback\<R\> callback) throws ServiceException, Throwable {
13281
13282           }
13283
13284           @Override
13285           public \<R extends Message\> Map\<byte[], R\> batchCoprocessorService(Descriptors.MethodDescriptor methodDescriptor, Message request, byte[] startKey, byte[] endKey, R responsePrototype) throws ServiceException, Throwable {
13286             return null;
13287           }
13288
13289           @Override
13290           public \<R extends Message\> void batchCoprocessorService(Descriptors.MethodDescriptor methodDescriptor, Message request, byte[] startKey, byte[] endKey, R responsePrototype, Batch.Callback\<R\> callback) throws ServiceException, Throwable {
13291
13292           }
13293
13294           @Override
13295           public boolean checkAndMutate(byte[] row, byte[] family, byte[] qualifier, CompareFilter.CompareOp compareOp, byte[] value, RowMutations mutation) throws IOException {
13296             return false;
13297           }
13298
13299           @Override
13300           public boolean checkAndMutate(byte[] row, byte[] family, byte[] qualifier, CompareOperator op, byte[] value, RowMutations mutation) throws IOException {
13301             return false;
13302           }
13303
13304           @Override
13305           public void setOperationTimeout(int operationTimeout) {
13306
13307           }
13308
13309           @Override
13310           public int getOperationTimeout() {
13311             return 0;
13312           }
13313
13314           @Override
13315           public int getRpcTimeout() {
13316             return 0;
13317           }
13318
13319           @Override
13320           public void setRpcTimeout(int rpcTimeout) {
13321
13322           }
13323
13324           @Override
13325           public int getReadRpcTimeout() {
13326             return 0;
13327           }
13328
13329           @Override
13330           public void setReadRpcTimeout(int readRpcTimeout) {
13331
13332           }
13333
13334           @Override
13335           public int getWriteRpcTimeout() {
13336             return 0;
13337           }
13338
13339           @Override
13340           public void setWriteRpcTimeout(int writeRpcTimeout) {
13341
13342           }
13343         };
13344       }
13345     };
13346   }
13347 }
13348 {code}
13349
13350
13351 ---
13352
13353 * [HBASE-18873](https://issues.apache.org/jira/browse/HBASE-18873) | *Critical* | **Hide protobufs in GlobalQuotaSettings**
13354
13355 GlobalQuotaSettings was introduced to avoid protocol-specific Java classes from leaking into API which is users may leverage. This class has a number of methods which return plain-Java-objects instead of these protocol-specific classes in an effort to better provide stability in the future.
13356
13357
13358 ---
13359
13360 * [HBASE-18893](https://issues.apache.org/jira/browse/HBASE-18893) | *Major* | **Remove Add/Modify/DeleteColumnFamilyProcedure in favor of using ModifyTableProcedure**
13361
13362 The RPC calls for Add/Modify/DeleteColumn have been removed and are now backed by ModifyTable functionality. The corresponding permissions in AccessController have been removed as well.
13363
13364 The shell already bypassed these RPCs and used ModifyTable directly, and thus would not be getting these permission checks, this change brings the rest of the RPC inline with that.
13365
13366 Coprocessor hooks for pre/post Add/Modify/DeleteColumn have likewise been removed. Coprocessors needing to take special actions on schema change should instead process ModifyTable events (which they should have been doing already, but it was easy for developers to miss this nuance).
13367
13368
13369 ---
13370
13371 * [HBASE-16338](https://issues.apache.org/jira/browse/HBASE-16338) | *Major* | **update jackson to 2.y**
13372
13373 HBase has upgraded from Jackson 1 to Jackson 2. JSON output should not have changed and this should not be user facing, but server classpaths should be adjusted accordingly.
13374
13375
13376 ---
13377
13378 * [HBASE-19051](https://issues.apache.org/jira/browse/HBASE-19051) | *Minor* | **Add new split algorithm for num string**
13379
13380 Add new split algorithm DecimalStringSplit，row are decimal-encoded long values in the range "00000000" =\> "99999999" .
13381 create 't1','f', { NUMREGIONS =\> 10 , SPLITALGO =\> 'DecimalStringSplit' }
13382 The split point will be 10000000,20000000,...,90000000
13383
13384
13385 ---
13386
13387 * [HBASE-19067](https://issues.apache.org/jira/browse/HBASE-19067) | *Major* | **Do not expose getHDFSBlockDistribution in StoreFile**
13388
13389 Removed CP exposed StoreFile#getHDFSBlockDistribution
13390
13391
13392 ---
13393
13394 * [HBASE-18989](https://issues.apache.org/jira/browse/HBASE-18989) | *Major* | **Polish the compaction related CP hooks**
13395
13396 Add two new methods in CompactionLifeCycleTracker.
13397 The notExecuted method will be called if the selectCompaction failed or space quota limitation reached.
13398 The completed method will be called after all the requested compactions are finished. The compaction scheduling is pre Store so if you request compaction on a region it may lead to multiple compactions.
13399 Remove the User parameter in Region.requestCompaction methods as it is useless for CP users.
13400 Add a boolean parameter to indicate whether you want to do a major compaction. And so that the triggerMajorCompaction method is removed.
13401 Remove the getCompactionProgress method in Store interface.
13402 Add a UT to confirm that CompactionLifeCycleTracker works correctly, and it also shows how to use CompactionLifeCycleTracker to wait for the completion of a compaction.
13403
13404
13405 ---
13406
13407 * [HBASE-19046](https://issues.apache.org/jira/browse/HBASE-19046) | *Major* | **RegionObserver#postCompactSelection  Avoid passing shaded ImmutableList param**
13408
13409 RegionObserver#postCompactSelection signature is changed.
13410 Arg type org.apache.hadoop.hbase.shaded.com.google.common.collect.ImmutableList is replaced with java.util.List
13411
13412
13413 ---
13414
13415 * [HBASE-19043](https://issues.apache.org/jira/browse/HBASE-19043) | *Major* | **Purge TableWrapper and CoprocessorHConnnection**
13416
13417 Removes getTable from the CoprocessorEnvrionment Interface and from the BaseEnvironment implementation. Also removes TableWrapper and CoprocessorHConnection, two classes that were used by BaseEnvironment to keep a tag on Tables created by Coprocessors that BaseEnvironment might close them out on #shutdown.
13418
13419 Long after these classes and methods were added, in HBase 1.0.0, we moved to a mode where management of Tables was shifted from HBase to the Client; the Client is to manage lifecycle. Table also became a (relatively) lightweight construct so folks are used to getting a Table instance, using it, and then immediately closing it when done.
13420
13421 Coprocessors should do the same in hbase2.0.0.
13422
13423 CoprocessorHConnection short-circuited RPC. This feature has since been integrated into Server Connections; when they create a Connection, they get one that will short-circuit if the request is to a localhost so no need of CoprocessorHConnection any more.
13424
13425 Coprocessors get the Server Connection when they ask for a Connection from their \*CoprocessorEnvironment.
13426
13427
13428 ---
13429
13430 * [HBASE-19014](https://issues.apache.org/jira/browse/HBASE-19014) | *Major* | **surefire fails; When writing xml report stdout/stderr ... No such file or directory**
13431
13432 Running tests with a wildcard selector, i.e.{{-Dtest=org.apache.hadoop.hbase.server.\*}} no longer works.
13433
13434
13435 ---
13436
13437 * [HBASE-10367](https://issues.apache.org/jira/browse/HBASE-10367) | *Major* | **RegionServer graceful stop / decommissioning**
13438
13439 Added three top level Admin APIs to help decommissioning and graceful stop of region servers.
13440
13441   /\*\*
13442    \* Mark region server(s) as decommissioned to prevent additional regions from getting
13443    \* assigned to them. Optionally unload the regions on the servers. If there are multiple servers
13444    \* to be decommissioned, decommissioning them at the same time can prevent wasteful region
13445    \* movements. Region unloading is asynchronous.
13446    \* @param servers The list of servers to decommission.
13447    \* @param offload True to offload the regions from the decommissioned servers
13448    \*/
13449   void decommissionRegionServers(List\<ServerName\> servers, boolean offload) throws IOException;
13450
13451   /\*\*
13452    \* List region servers marked as decommissioned, which can not be assigned regions.
13453    \* @return List of decommissioned region servers.
13454    \*/
13455   List\<ServerName\> listDecommissionedRegionServers() throws IOException;
13456
13457   /\*\*
13458    \* Remove decommission marker from a region server to allow regions assignments.
13459    \* Load regions onto the server if a list of regions is given. Region loading is
13460    \* asynchronous.
13461    \* @param server The server to recommission.
13462    \* @param encodedRegionNames Regions to load onto the server.
13463    \*/
13464   void recommissionRegionServer(ServerName server, List\<byte[]\> encodedRegionNames)  throws IOException;
13465
13466
13467 ---
13468
13469 * [HBASE-19042](https://issues.apache.org/jira/browse/HBASE-19042) | *Blocker* | **Oracle Java 8u144 downloader broken in precommit check**
13470
13471 Precommit switched from Oracle JDK 8 to OpenJDK-8.
13472
13473
13474 ---
13475
13476 * [HBASE-18945](https://issues.apache.org/jira/browse/HBASE-18945) | *Major* | **Make a IA.LimitedPrivate interface for CellComparator**
13477
13478 CellCompartor has been added as an interface with IA.LimitedPrivate. It has the following methods
13479 #int compare(Cell leftCell, Cell rightCell);
13480 #int compareRows(Cell leftCell, Cell rightCell)
13481 #int compareRows(Cell cell, byte[] bytes, int offset, int length)
13482 #int compareWithoutRow(Cell leftCell, Cell rightCell)
13483 #int compareFamilies(Cell leftCell, Cell rightCell
13484 #int compareQualifiers(Cell leftCell, Cell rightCell)
13485 #int compareTimestamps(Cell leftCell, Cell rightCell)
13486 #int compareTimestamps(long leftCellts, long rightCellts)
13487
13488 This is exposed to CPs and CPs can make use of the above methods to do comparisons on the cells.
13489 For internal usage we have CellComparatorImpl and it has static references to COMPARATOR and META\_CELL\_COMPARATOR.
13490 So when a region or store is initialized we should use one of the above comparator. For META table we need the META\_CELL\_COMPARATOR and all other table's  regions/stores will use the COMPARTOR.
13491 While writing the comparator name in FixedFileTrailer of the Hfile we have now ensured that this rename of CellComparator.COMPARATOR/CellComparator.META\_CELL\_COMPARATOR to CellComparatorImpl.COMPARATOR/CellComparatorImpl.META\_CELL\_COMPARATOR is handled.
13492
13493 CellUtils is an util method that provides lot of APIs that helps to do compare, matching functionalities between two cells, or with a cell and a corrpesponding byte[] etc. Some of the APIs are internally used which will be cleaned up in a follow on JIRA HBASE-18995.
13494
13495
13496 ---
13497
13498 * [HBASE-19001](https://issues.apache.org/jira/browse/HBASE-19001) | *Major* | **Remove the hooks in RegionObserver which are designed to construct a StoreScanner which is marked as IA.Private**
13499
13500 These methods are removed:
13501 KeyValueScanner preStoreScannerOpen(ObserverContext\<RegionCoprocessorEnvironment\> c,
13502       Store store, Scan scan, NavigableSet\<byte[]\> targetCols, KeyValueScanner s, long readPt)
13503       throws IOException;
13504 InternalScanner preFlushScannerOpen(ObserverContext\<RegionCoprocessorEnvironment\> c,
13505       Store store, List\<KeyValueScanner\> scanners, InternalScanner s, long readPoint)
13506       throws IOException;
13507 InternalScanner preCompactScannerOpen(ObserverContext\<RegionCoprocessorEnvironment\> c,
13508       Store store, List\<? extends KeyValueScanner\> scanners, ScanType scanType, long earliestPutTs,
13509       InternalScanner s, CompactionLifeCycleTracker tracker, CompactionRequest request,
13510       long readPoint) throws IOException;
13511
13512 For flush and compaction, CP users are expected to wrap the InternalScanner in preFlush/preCompact. And for normal region operation, just use preGetOp/preScannerOpen to modify the Get/Scan object.
13513
13514 This method in Region interface is also removed as we do not need to use read point in CP hooks anymore:
13515 long getReadPoint(IsolationLevel isolationLevel);
13516
13517
13518 ---
13519
13520 * [HBASE-18350](https://issues.apache.org/jira/browse/HBASE-18350) | *Blocker* | **RSGroups are broken under AMv2**
13521
13522 Moves RSGroup on to AMv2. Reenables disabled RSGroups tests.
13523
13524
13525 ---
13526
13527 * [HBASE-18960](https://issues.apache.org/jira/browse/HBASE-18960) | *Major* | **A few bug fixes and minor improvements around batchMutate()**
13528
13529 All operations for which further processing is skipped by preBatchMutate coprocessor hook are treated as SUCCESS instead of FAILED.
13530
13531
13532 ---
13533
13534 * [HBASE-14247](https://issues.apache.org/jira/browse/HBASE-14247) | *Critical* | **Separate the old WALs into different regionserver directories**
13535
13536 Add a new config hbase.separate.oldlogdir.by.regionserver. The default value is false. If this config is true, the old wal dir will be separated by regionservers. This will change the oldWALs layout. The oldWALs is used by replication. So if a cluster didn't use replication, it can be rolling upgrade (upgrade this config from false to true) directly. If a cluster use replication, the oldWALs will be not found when layout changed. So the cluster need rolling upgrade twice. Firstly, only rolling cluster to use new version code. Secondly rolling the config from false to true. Because the cluster already rolling to new version code, so it can find the oldWALs in the new dir layout.
13537
13538
13539 ---
13540
13541 * [HBASE-18954](https://issues.apache.org/jira/browse/HBASE-18954) | *Major* | **Make \*CoprocessorHost classes private**
13542
13543 - Make CoprocessorHost and its implementations InterfaceAudience.Private
13544 - Configurations from "CoprocessorHost" have been moved to new "CoprocessorConfigurations" class.
13545
13546
13547 ---
13548
13549 * [HBASE-15410](https://issues.apache.org/jira/browse/HBASE-15410) | *Major* | **Utilize the max seek value when all Filters in MUST\_PASS\_ALL FilterList return SEEK\_NEXT\_USING\_HINT**
13550
13551 This optimization, targeting SEEK\_NEXT\_USING\_HINT return values, utilizes the max seek value and is transparent to Filters.
13552
13553
13554 ---
13555
13556 * [HBASE-18747](https://issues.apache.org/jira/browse/HBASE-18747) | *Critical* | **Introduce new example and helper classes to tell CP users how to do filtering on scanners**
13557
13558 Modify ZooKeeperScanPolicyObserver in hbase-examples to show how to do filtering in the CP hooks of flush and compaction in hbase-2.0.
13559
13560
13561 ---
13562
13563 * [HBASE-18108](https://issues.apache.org/jira/browse/HBASE-18108) | *Blocker* | **Procedure WALs are archived but not cleaned; fix**
13564
13565 The archived Procedure WALs are moved to \<hbase\_root\>/oldWALs/masterProcedureWALs
13566 directory. TimeToLiveProcedureWALCleaner class was added which regularly cleans the Procedure WAL files from there.
13567
13568 The TimeToLiveProcedureWALCleaner is added to hbase.master.logcleaner.plugins configuration value.
13569
13570 A new config parameter is added: hbase.master.procedurewalcleaner.ttl, which specifies how long a Procedure WAL should stay in the archive directory.
13571
13572
13573 ---
13574
13575 * [HBASE-18183](https://issues.apache.org/jira/browse/HBASE-18183) | *Major* | **Region interface cleanup for CP expose**
13576
13577 Below methods are removed from CP exposed Region interface
13578 getOpenSeqNum
13579 getOldestSeqIdOfStore
13580 isLoadingCfsOnDemandDefault
13581 getReadpoint
13582 updateReadRequestsCount
13583 updateWriteRequestsCount
13584 getRegionServicesForStores
13585 getMetrics
13586 getHDFSBlocksDistribution
13587 releaseRowLocks
13588 batchReplay
13589 get(Get get, boolean withCoprocessor, long nonceGroup, long nonce)
13590 bulkLoadHFiles
13591 execService
13592 registerService
13593 checkFamilies
13594 checkTimestamps
13595 prepareDelete
13596 prepareDeleteTimestamps
13597 updateCellTimestamps
13598 flush
13599 compact
13600 waitForFlushesAndCompactions
13601 waitForFlushes
13602
13603 Change signature of below methods by dropping params 'nonceGroup', 'nonce'
13604 append(Append append, long nonceGroup, long nonce)
13605 batchMutate(Mutation[] mutations, long nonceGroup, long nonce)
13606 increment(Increment increment, long nonceGroup, long nonce)
13607
13608
13609 ---
13610
13611 * [HBASE-18949](https://issues.apache.org/jira/browse/HBASE-18949) | *Major* | **Remove the CompactionRequest parameter in preCompactSelection**
13612
13613 Remove the CompactionRequest parameter in preCompactSelection as we do not have a CompactionRequest at that time.
13614
13615
13616 ---
13617
13618 * [HBASE-18909](https://issues.apache.org/jira/browse/HBASE-18909) | *Major* | **Deprecate Admin's methods which used String regex**
13619
13620 Pushed to master and branch-2. Thanks all for reviewing.
13621
13622
13623 ---
13624
13625 * [HBASE-18931](https://issues.apache.org/jira/browse/HBASE-18931) | *Major* | **Make ObserverContext an interface and remove private/testing methods**
13626
13627 Changes ObserverContext from a class to an interface and hides away constructor, testing functions and other internal-only functions in the implementation class.
13628
13629
13630 ---
13631
13632 * [HBASE-18878](https://issues.apache.org/jira/browse/HBASE-18878) | *Major* | **Use Optional\<T\> return types when T can be null**
13633
13634 **WARNING: No release note provided for this change.**
13635
13636
13637 ---
13638
13639 * [HBASE-18649](https://issues.apache.org/jira/browse/HBASE-18649) | *Major* | **Deprecate KV Usage in MR to move to Cells in 3.0**
13640
13641 All the mappers and reducers output type will be now of MapReduceCell type. No more KeyValue type. How ever in branch-2 for compatibility we have allowed the older interfaces/classes that work with KeyValue to stay in the code base but they have been marked as deprecated.
13642 The following interfaces/classes have been deprecated in branch-2
13643 Import#KeyValueWritableComparablePartitioner
13644 Import#KeyValueWritableComparator
13645 Import#KeyValueWritableComparable
13646 Import#KeyValueReducer
13647 Import#KeyValueSortImporter
13648 Import#KeyValueImporter
13649 KeyValueSortReducer
13650 KeyValueSerialization
13651 WALPlayer#WALKeyValueMapper
13652
13653 So any existing MR jobs that is using the above public interfaces/classes will continue to work in branch-2 and the expected output value type of those mappers and reducers can continue to be KeyValue type.
13654
13655 In branch-3 the mappers and reducers output will only expect MapReduceCell as the type and will no longer work with KeyValue type.
13656 The new public classes/interfaces added for branch-3 and in branch-2 are
13657 CellSerialization
13658 CellSortReducer
13659 Import#CellWritableComparablePartitioner
13660 Import#CellWritableComparable
13661 Import#CellWritableComparator
13662 Import#CellReducer
13663 Import#CellSortImporter
13664 Import#CellImporter
13665 WALPlayer#WALCellMapper
13666
13667
13668 ---
13669
13670 * [HBASE-18897](https://issues.apache.org/jira/browse/HBASE-18897) | *Major* | **Substitute MemStore for Memstore**
13671
13672 The changes of IA.Public/IA.LimitedPrivate classes are shown below:
13673 HTableDescriptor class
13674 \* boolean hasRegionMemstoreReplication()
13675 + boolean hasRegionMemStoreReplication()
13676 \* HTableDescriptor setRegionMemstoreReplication(boolean)
13677 + HTableDescriptor setRegionMemStoreReplication(boolean)
13678
13679 RegionLoadStats class
13680 \* int getMemstoreLoad()
13681 + int getMemStoreLoad()
13682
13683 ServerLoad class
13684 \* int getMemstoreSizeInMB()
13685 + int getMemStoreSizeMB()
13686
13687 Region class
13688 - long getMemstoreSize()
13689 + long getMemStoreSize()
13690
13691 Store class
13692 - MemstoreSize getMemStoreSize()
13693 + MemStoreSize getMemStoreSize()
13694 - MemstoreSize getFlushableSize()
13695 + MemStoreSize getFlushableSize()
13696 - MemstoreSize getSnapshotSize()
13697 + MemStoreSize getSnapshotSize()
13698
13699 StoreFile class
13700 - long getMaxMemstoreTS()
13701 + long getMaxMemStoreTS()
13702
13703
13704 ---
13705
13706 * [HBASE-18010](https://issues.apache.org/jira/browse/HBASE-18010) | *Major* | **Connect CellChunkMap to be used for flattening in CompactingMemStore**
13707
13708 The CellChunkMap is very dense index for Memstore ImmutableSegment and the only one that can be taken off-heap. However, CellChunkMap works on-heap as well. The coding of the entire flow of working with CellChunkMap is not yet finished, thus CellChunkMap is disabled for usage so far. The continuation is done under HBASE-18232.
13709
13710
13711 ---
13712
13713 * [HBASE-18883](https://issues.apache.org/jira/browse/HBASE-18883) | *Major* | **Upgrade to Curator 4.0**
13714
13715 Curator version has been updated from 2.x to 4.0 (running in ZK 3.4 compatibility mode).
13716
13717 Users who experience classpath issues due to version conflicts are recommended to use either the hbase-shaded-client or hbase-shaded-mapreduce artifacts.
13718
13719
13720 ---
13721
13722 * [HBASE-13844](https://issues.apache.org/jira/browse/HBASE-13844) | *Minor* | **Move static helper methods from KeyValue into CellUtils**
13723
13724 Move KeyValue.parseColumn() to CellUtil
13725
13726
13727 ---
13728
13729 * [HBASE-18839](https://issues.apache.org/jira/browse/HBASE-18839) | *Major* | **Apply RegionInfo to code base**
13730
13731 The incompatible changes of IA.Public/LimitedPrivate classes are shown below.
13732 + new method
13733 - removed method
13734 \* deprecated method
13735 -------------------------------------
13736 HRegionLocation class
13737 + RegionInfo getRegion()
13738 \* HRegionInfo getRegionInfo()
13739
13740 AsyncAdmin class
13741 + CompletableFuture\<List\<RegionInfo\>\> getOnlineRegions(ServerName serverName);
13742 - CompletableFuture\<List\<HRegionInfo\>\> getOnlineRegions(ServerName serverName);
13743 + CompletableFuture\<List\<RegionInfo\>\> getTableRegions(TableName tableName);
13744 - CompletableFuture\<List\<HRegionInfo\>\> getTableRegions(TableName tableName);
13745
13746 HBaseTestingUtility class
13747 - Table createTable(HTableDescriptor htd, byte[][] families, Configuration c)
13748 - Table createTable(HTableDescriptor htd, byte[][] families, byte[][] splitKeys, Configuration c)
13749 - Table createTable(HTableDescriptor htd, byte[][] splitRows)
13750 - void modifyTableSync(Admin admin, HTableDescriptor desc)
13751 - HRegion createLocalHRegion(HTableDescriptor desc, byte [] startKey, byte [] endKey)
13752 - HRegion createLocalHRegion(HRegionInfo info, HTableDescriptor desc)
13753 - HRegion createLocalHRegion(HRegionInfo info, TableDescriptor desc)
13754 + HRegion createLocalHRegion(RegionInfo info, TableDescriptor desc)
13755 - HRegion createLocalHRegion(HRegionInfo info, HTableDescriptor desc, WAL wal)
13756 - HRegion createLocalHRegion(HRegionInfo info, TableDescriptor desc, WAL wal)
13757 + HRegion createLocalHRegion(RegionInfo info, TableDescriptor desc, WAL wal)
13758 - List\<HRegionInfo\> createMultiRegionsInMeta(final Configuration conf,final TableDescriptor htd, byte [][] startKeys)
13759 + List\<HRegionInfo\> createMultiRegionsInMeta(final Configuration conf,final TableDescriptor htd, byte [][] startKeys)
13760 - WAL createWal(final Configuration conf, final Path rootDir, final HRegionInfo hri)
13761 + WAL createWal(final Configuration conf, final Path rootDir, final RegionInfo hri)
13762 - HRegion createRegionAndWAL(final HRegionInfo info, final Path rootDir,final Configuration conf, final HTableDescriptor htd)
13763 - HRegion createRegionAndWAL(final HRegionInfo info, final Path rootDir, final Configuration conf, final TableDescriptor htd)
13764 + HRegion createRegionAndWAL(final RegionInfo info, final Path rootDir, final Configuration conf, final TableDescriptor htd)
13765 - HRegion createRegionAndWAL(final HRegionInfo info, final Path rootDir, final Configuration conf, final HTableDescriptor htd, boolean initialize)
13766 + HRegion createRegionAndWAL(final RegionInfo info, final Path rootDir, final Configuration conf, final HTableDescriptor htd, boolean initialize)
13767 - boolean assignRegion(final HRegionInfo regionInfo)
13768 + boolean assignRegion(final RegionInfo regionInfo)
13769 - void moveRegionAndWait(HRegionInfo destRegion, ServerName destServer)
13770 + void moveRegionAndWait(RegionInfo destRegion, ServerName destServer)
13771 - int createPreSplitLoadTestTable(Configuration conf, HTableDescriptor desc, HColumnDescriptor hcd)
13772 - int createPreSplitLoadTestTable(Configuration conf, HTableDescriptor desc, HColumnDescriptor hcd, int numRegionsPerServer)
13773 - int createPreSplitLoadTestTable(Configuration conf, HTableDescriptor desc, HColumnDescriptor[] hcds, int numRegionsPerServer)
13774 - HRegion createTestRegion(String tableName, HColumnDescriptor cd)
13775
13776 WALEdit class
13777 - WALEdit createFlushWALEdit(HRegionInfo hri, FlushDescriptor f)
13778 + WALEdit createFlushWALEdit(RegionInfo hri, FlushDescriptor f)
13779 - WALEdit createRegionEventWALEdit(HRegionInfo hri,RegionEventDescriptor regionEventDesc)
13780 + WALEdit createRegionEventWALEdit(RegionInfo hri,RegionEventDescriptor regionEventDesc)
13781 - WALEdit createCompaction(final HRegionInfo hri, final CompactionDescriptor c)
13782 + WALEdit createCompaction(final RegionInfo hri, final CompactionDescriptor c)
13783 - byte[] getRowForRegion(HRegionInfo hri)
13784 + byte[] getRowForRegion(RegionInfo hri)
13785 - WALEdit createBulkLoadEvent(HRegionInfo hri, WALProtos.BulkLoadDescriptor bulkLoadDescriptor)
13786 + - WALEdit createBulkLoadEvent(RegionInfo hri, WALProtos.BulkLoadDescriptor bulkLoadDescriptor)
13787
13788 RegionScanner class
13789 - HRegionInfo getRegionInfo();
13790 + RegionInfo getRegionInfo();
13791
13792 RegionPlan class
13793 - RegionPlan(final HRegionInfo hri, ServerName source, ServerName dest)
13794 + RegionPlan(final RegionInfo hri, ServerName source, ServerName dest)
13795
13796 Region class
13797 - HRegionInfo getRegionInfo();
13798 + RegionInfo getRegionInfo();
13799
13800 TableSnapshotInputFormat.TableSnapshotRegionSplit class
13801 \* HRegionInfo getRegionInfo()
13802 + RegionInfo getRegion()
13803
13804 RawAsyncTable.CoprocessorCallback class
13805 - void onRegionComplete(HRegionInfo region, R resp)
13806 + void onRegionComplete(RegionInfo region, R resp)
13807 - void onRegionError(RegionInfo region, Throwable error);
13808 + void onRegionError(HRegionInfo region, Throwable error);
13809
13810
13811 ---
13812
13813 * [HBASE-18826](https://issues.apache.org/jira/browse/HBASE-18826) | *Major* | **Use HStore instead of Store in our own code base and remove unnecessary methods in Store interface**
13814
13815 **WARNING: No release note provided for this change.**
13816
13817
13818 ---
13819
13820 * [HBASE-17732](https://issues.apache.org/jira/browse/HBASE-17732) | *Critical* | **Coprocessor Design Improvements**
13821
13822 We are moving from Inheritence
13823 - Observer \*is\* Coprocessor
13824 - FooService \*is\* CoprocessorService
13825 To Composition
13826 - Coprocessor \*has\* Observer
13827 - Coprocessor \*has\* Service
13828 ------------------------------------------------------
13829 Summary
13830 ------------------------------------------------------
13831 - Adds four new interfaces - MasterCoprocessor, RegionCoprocessor, RegionServierCoprocessor,
13832   WALCoprocessor
13833 - These new \*Coprocessor interfaces have a get\*Observer() function for each observer type
13834   supported by them.
13835 - Added Coprocessor#getService() to base interface. All extending \*Coprocessor interfaces will
13836   get it from the base interface.
13837 - Added BulkLoadObserver hooks to RegionCoprocessorHost instad of SecureBulkLoadManager doing its
13838   own trickery.
13839 - CoprocessorHost#find\*() fuctions: Too many testing hooks digging into CP internals.
13840   Deleted if can, else marked @VisibleForTesting.
13841 ------------------------------------------------------
13842 Backward Compatibility
13843 ------------------------------------------------------
13844 - Old coprocessors implementing \*Observer won't get loaded (no backward compatibility guarantees).
13845 - Third party coprocessors only implementing Coprocessor will not get loaded (just like Observers).
13846 - Old coprocessors implementing CoprocessorService (for master/region host)
13847   /SingletonCoprocessorService (for RegionServer host) will continue to work with 2.0.
13848 - Added test to ensure backward compatibility of CoprocessorService/SingletonCoprocessorService
13849 - Note that if a coprocessor implements both observer and service in same class, its service
13850   component will continue to work but it's observer component won't work.
13851
13852
13853 ---
13854
13855 * [HBASE-18298](https://issues.apache.org/jira/browse/HBASE-18298) | *Critical* | **RegionServerServices Interface cleanup for CP expose**
13856
13857 We used to pass the RegionServerServices (RSS) which gave Coprocesosrs (CP) all sort of access to internal Server machinery. We now only allows the CP a subset of the RSS in the form of the CPRSS Interface. Particulars:
13858
13859 Removed method getRegionServerServices from CP exposed RegionCoprocessorEnvironment and RegionServerCoprocessorEnvironment and replaced with getCoprocessorRegionServerServices. This returns a new interface CoprocessorRegionServerServices which is only a subset of RegionServerServices. With that below methods are no longer exposed for CPs
13860 WAL getWAL(HRegionInfo regionInfo)
13861 List\<WAL\> getWALs()
13862 FlushRequester getFlushRequester()
13863 RegionServerAccounting getRegionServerAccounting()
13864 RegionServerRpcQuotaManager getRegionServerRpcQuotaManager()
13865 SecureBulkLoadManager getSecureBulkLoadManager()
13866 RegionServerSpaceQuotaManager getRegionServerSpaceQuotaManager()
13867 void postOpenDeployTasks(final PostOpenDeployContext context)
13868 void postOpenDeployTasks(final Region r)
13869 boolean reportRegionStateTransition(final RegionStateTransitionContext context)
13870 boolean reportRegionStateTransition(TransitionCode code, long openSeqNum, HRegionInfo... hris)
13871 boolean reportRegionStateTransition(TransitionCode code, HRegionInfo... hris)
13872 RpcServerInterface getRpcServer()
13873 ConcurrentMap\<byte[], Boolean\> getRegionsInTransitionInRS()
13874 Leases getLeases()
13875 ExecutorService getExecutorService()
13876 Map\<String, Region\> getRecoveringRegions()
13877 public ServerNonceManager getNonceManager()
13878 boolean registerService(Service service)
13879 HeapMemoryManager getHeapMemoryManager()
13880 double getCompactionPressure()
13881 ThroughputController getFlushThroughputController()
13882 double getFlushPressure()
13883 MetricsRegionServer getMetrics()
13884 EntityLock regionLock(List\<HRegionInfo\> regionInfos, String description, Abortable abort)
13885 void unassign(byte[] regionName)
13886 Configuration getConfiguration()
13887 ZooKeeperWatcher getZooKeeper()
13888 ClusterConnection getClusterConnection()
13889 MetaTableLocator getMetaTableLocator()
13890 CoordinatedStateManager getCoordinatedStateManager()
13891 ChoreService getChoreService()
13892 void stop(String why)
13893 void abort(String why, Throwable e)
13894 boolean isAborted()
13895 void updateRegionFavoredNodesMapping(String encodedRegionName, List\<ServerName\> favoredNodes)
13896 InetSocketAddress[] getFavoredNodesForRegion(String encodedRegionName)
13897 void addToOnlineRegions(Region region)
13898 boolean removeFromOnlineRegions(final Region r, ServerName destination)
13899
13900 Also 3 methods name have been changed
13901 List\<Region\> getOnlineRegions(TableName tableName) -\> List\<Region\> getRegions(TableName tableName)
13902 List\<Region\> getOnlineRegions() -\> List\<Region\> getRegions()
13903 Region getFromOnlineRegions(final String encodedRegionName) -\> Region getRegion(final String encodedRegionName)
13904
13905
13906 ---
13907
13908 * [HBASE-16769](https://issues.apache.org/jira/browse/HBASE-16769) | *Blocker* | **Deprecate/remove PB references from MasterObserver and RegionServerObserver**
13909
13910 Signature of below methods in MasterObserver changed and instead of org.apache.hadoop.hbase.shaded.protobuf.generated.SnapshotDescription param, we will be passing org.apache.hadoop.hbase.client.SnapshotDescription
13911 preListSnapshot
13912 postListSnapshot
13913 preSnapshot
13914 postSnapshot
13915 preCloneSnapshot
13916 postCloneSnapshot
13917 preRestoreSnapshot
13918 postRestoreSnapshot
13919 preDeleteSnapshot
13920 postDeleteSnapshot
13921
13922 Also changed signature of RegionServerObserver#preReplicateLogEntries and preReplicateLogEntries by removing params List\<org.apache.hadoop.hbase.shaded.protobuf.generated.AdminProtos.WALEntry\>, org.apache.hadoop.hbase.CellScanner
13923
13924
13925 ---
13926
13927 * [HBASE-18859](https://issues.apache.org/jira/browse/HBASE-18859) | *Major* | **Purge PB from BulkLoadObserver**
13928
13929 No longer pass the protobuf request to prePrepareBulkLoad and preCleanupBulkLoad in BulkLoadObserver as part of our effort to purge protobuf from our Coprocessor API Interface (if you need to read the Table and RegionInfo, pull it from the passed in RegionCoprocessorEnvironment ObserverContext).
13930
13931
13932 ---
13933
13934 * [HBASE-18731](https://issues.apache.org/jira/browse/HBASE-18731) | *Major* | **[compat 1-2] Mark protected methods of QuotaSettings that touch Protobuf internals as IA.Private**
13935
13936 The following methods in QuotaSettings were annotated InterfaceAudience.Private; they are for internal use only in hbase-2.0.0
13937
13938 buildSetQuotaRequestProto(final QuotaSettings settings)
13939 setupSetQuotaRequest(SetQuotaRequest.Builder builder)
13940
13941 Note that there were versions of these methods in HBase 1.y that used classes in the {{org.apache.hadoop.hbase.protobuf.generated}} package. That package no longer exists as a part of our cleanup of protobufs from our public facing API and the related methods have been removed.
13942
13943
13944 ---
13945
13946 * [HBASE-18825](https://issues.apache.org/jira/browse/HBASE-18825) | *Major* | **Use HStoreFile instead of StoreFile in our own code base and remove unnecessary methods in StoreFile interface**
13947
13948 Cleanup the StoreFile interface.
13949
13950 The metadata keys are moved to HStoreFile.
13951
13952 These methods are removed:
13953 CacheConfig getCacheConf();
13954 byte[] getMetadataValue(byte[] key);
13955 boolean isCompactedAway();
13956 boolean isReferencedInReads();
13957 void initReader() throws IOException;
13958 StoreFileScanner getPreadScanner(boolean cacheBlocks, long readPt, long scannerOrder, boolean canOptimizeForNonNullColumn);
13959 StoreFileScanner getStreamScanner(boolean canUseDropBehind, boolean cacheBlocks, boolean isCompaction, long readPt, long scannerOrder, boolean canOptimizeForNonNullColumn) throws IOException;
13960 StoreFileReader getReader();
13961 void closeReader(boolean evictOnClose) throws IOException;
13962 void markCompactedAway();
13963 void deleteReader() throws IOException;
13964
13965 Notice that these methods are still available in HStoreFile.
13966
13967 And the return value of getFirstKey and getLastKey are changed from Cell to Optional\<Cell\> to better indicate that they may not be available.
13968
13969
13970 ---
13971
13972 * [HBASE-18786](https://issues.apache.org/jira/browse/HBASE-18786) | *Major* | **FileNotFoundException should not be silently handled for primary region replicas**
13973
13974 FileNotFoundException opening a StoreFile in a primary replica now causes a RegionServer to crash out where before it would be ignored (or optionally handled via close/reopen).
13975
13976
13977 ---
13978
13979 * [HBASE-10504](https://issues.apache.org/jira/browse/HBASE-10504) | *Blocker* | **Define Replication Interface**
13980
13981 Adds a new plugin point ReplicationEndpoint. ReplicationSource, internal to hbase, tails the WAL and calls registered ReplicationEndpoints. ReplicationEndpoint implementations are responsible for actually shipping the edits to the other (hbase or non-hbase) cluster. ReplicationEndpoint can be defined per peer. Default inter-cluster replication works without any changes (lily etc should still work). ReplicationEndpoints have various facility including means for filtering out WAL edits source-side before they can be shipped to remote peers.
13982
13983
13984 ---
13985
13986 * [HBASE-18142](https://issues.apache.org/jira/browse/HBASE-18142) | *Major* | **Deletion of a cell deletes the previous versions too**
13987
13988 Now, delete.rb won't delete all versions of the specified column. It only delete the specified version (if user assigns a timestamp) or the latest version (default behavior)
13989
13990
13991 ---
13992
13993 * [HBASE-18446](https://issues.apache.org/jira/browse/HBASE-18446) | *Critical* | **Mark StoreFileScanner/StoreFileReader as IA.LimitedPrivate(Phoenix)**
13994
13995 Mark StoreFileScanner and StoreFileReader as IA.LimitPrivate(Phoenix).
13996 Deprecated the preStoreFileReaderOpen and postStoreFileReaderOpen method in RegionObserver to indicate that these methods are only supposed to be used by Phoenix.
13997
13998
13999 ---
14000
14001 * [HBASE-18798](https://issues.apache.org/jira/browse/HBASE-18798) | *Major* | **Remove the unused methods in RegionServerObserver**
14002
14003 Remove the following APIs from RegionServerObserver:
14004 # preRollBackMerge
14005 # postRollBackMerge
14006 # preMergeCommit
14007 # postMergeCommit
14008 # postMerge
14009 # preMerge
14010
14011
14012 ---
14013
14014 * [HBASE-18831](https://issues.apache.org/jira/browse/HBASE-18831) | *Major* | **Add explicit dependency on javax.el**
14015
14016 Specify an explicit version for javax.el. Without it we rely on repository cached metadata of which a prevalent version seems to list all versions between b01 and b08 but finishes with a b08-jbossorg which is in the jboss repo, a repo most of us do not list in our poms.
14017
14018
14019 ---
14020
14021 * [HBASE-17980](https://issues.apache.org/jira/browse/HBASE-17980) | *Major* | **Any HRegionInfo we give out should be immutable**
14022
14023 Provide alternate user-facing API that takes a RegionInfo Interface instead of a HRegionInfo; the old HRegionInfo methods have been deprecated in 2.0.0 and will be removed in 3.0.0.
14024
14025
14026 ---
14027
14028 * [HBASE-14004](https://issues.apache.org/jira/browse/HBASE-14004) | *Critical* | **[Replication] Inconsistency between Memstore and WAL may result in data in remote cluster that is not in the origin**
14029
14030 Now when replicating a wal file which is still opened for write, we will get its committed length from the WAL instance in the same RS to prevent replicating uncommit WALEdit.
14031
14032 This is very important if you use AsyncFSWAL, as we use fan-out in AsyncFSWAL. The data written to DN will be visible immediately as all DNs think it is the end of a pipeline, although the client has not received an ack, and also NN may truncate the file if the client crashes at the same time.
14033
14034
14035 ---
14036
14037 * [HBASE-18819](https://issues.apache.org/jira/browse/HBASE-18819) | *Major* | **Set version number to 2.0.0-alpha3 from 2.0.0-alpha3-SNAPSHOT**
14038
14039 Set version on branch-2 to be 2.0.0-alpha3 as part of RC making.
14040
14041
14042 ---
14043
14044 * [HBASE-18683](https://issues.apache.org/jira/browse/HBASE-18683) | *Major* | **Upgrade hbase to commons-math 3**
14045
14046 Moved on to commons-math3. Removed commons-math2.
14047
14048
14049 ---
14050
14051 * [HBASE-18453](https://issues.apache.org/jira/browse/HBASE-18453) | *Major* | **CompactionRequest should not be exposed to user directly**
14052
14053 Introduce a CompactionLifeCycleTracker to let the CP users know when the compaction starts and ends. CompactionRequest is marked as IA.Private and should be used in CP implementation any more.
14054
14055
14056 ---
14057
14058 * [HBASE-18794](https://issues.apache.org/jira/browse/HBASE-18794) | *Major* | **Remove deprecated methods in MasterObserver**
14059
14060 The removed APIs are shown below.
14061 # preCreateTableHandler
14062 # postCreateTableHandler
14063 # preDeleteTableHandler
14064 # postDeleteTableHandler
14065 # preTruncateTableHandler
14066 # postTruncateTableHandler
14067 # preModifyTableHandler
14068 # postModifyTableHandler
14069 # preAddColumn
14070 # postAddColumn
14071 # preAddColumnHandler
14072 # postAddColumnHandler
14073 # preModifyColumn
14074 # postModifyColumn
14075 # preModifyColumnHandler
14076 # postModifyColumnHandler
14077 # preDeleteColumn
14078 # postDeleteColumn
14079 # preDeleteColumnHandler
14080 # postDeleteColumnHandler
14081 # preEnableTableHandler
14082 # postEnableTableHandler
14083 # preDisableTableHandler
14084 # postDisableTableHandler
14085 # preDispatchMerge
14086 # postDispatchMerge
14087
14088
14089 ---
14090
14091 * [HBASE-14998](https://issues.apache.org/jira/browse/HBASE-14998) | *Blocker* | **Unify synchronous and asynchronous methods in Admin and cleanup**
14092
14093  \* Deprecates getAlterStatus. Everywhere else we talk of 'modify' rather
14094        'alter' and should use Future returned from async instead.
14095  \* isTableAvailable(TableName, byte [][]) has been deprecated to be
14096        removed; use the overrie instead. This is a weird method.
14097  \* Changed listTableDescriptor to getDescriptor.
14098  \* Renamed other like methods to have same pattern (deprecating the old):
14099         balancer =\> balance
14100         setBalancerRunning =\> balancerSwitch
14101         setNormalizerRunning =\> normalizerSwitch
14102         enableCatalogJanitor =\> catalogJanitorSwitch
14103         setCleanerChoreRunning =\> cleanerChoreSwitch
14104         setSplitOrMergeEnabled =\> splitOrMergeEnabledSwitch
14105
14106  \* Renamed (with deprecation of old) runCatalogScan =\> runCatalogJanitor.
14107  \* Reviewed generated javadoc and made some edits; purged reference to
14108        hbase issues from our API, fixed param names, etc.
14109  \* Made all the enable services methods have same pattern.
14110  \* Renamed takeSnapshotAsync as snapshotAsync (with deprecation of old)
14111  \* Renamed execProcedureWithRet as execProcedureWithReturn (with
14112        deprecation)
14113
14114
14115 ---
14116
14117 * [HBASE-18723](https://issues.apache.org/jira/browse/HBASE-18723) | *Major* | **[pom cleanup] Do a pass with dependency:analyze; remove unused and explicity list the dependencies we exploit**
14118
14119 Purged a bunch of dependencies included but unused. Added reference to dependencies we do use but did not list (transitively included). Purged all but junit from parent pom dependency set and did explicit include in modules instead; not all modules need mockito, etc. Still work to do: grey area around hadoop and its transitive includes need cleanup still to make the  dependency:analyze runs clean. Also figure how to purge junit from parent dependency list.
14120
14121
14122 ---
14123
14124 * [HBASE-17823](https://issues.apache.org/jira/browse/HBASE-17823) | *Major* | **Migrate to Apache Yetus Audience Annotations**
14125
14126 HBase now uses stability and audience annotations sourced from Apache Yetus, instead of the custom annotations that were previously in place.
14127
14128
14129 ---
14130
14131 * [HBASE-18793](https://issues.apache.org/jira/browse/HBASE-18793) | *Major* | **Remove deprecated methods in RegionObserver**
14132
14133 These deprecated methods are removed from RegionObserver:
14134 InternalScanner preFlushScannerOpen(ObserverContext, Store, List, InternalScanner) throws IOException;
14135 void preCompactSelection(ObserverContext, Store, List) throws IOException;
14136 void postCompactSelection(ObserverContext, Store, ImmutableList);
14137 InternalScanner preCompact(ObserverContext, Store, InternalScanner, ScanType) throws IOException;
14138 InternalScanner preCompactScannerOpen(ObserverContext, Store, List, ScanType, long, InternalScanner, CompactionRequest) throws IOException;
14139 InternalScanner preCompactScannerOpen( ObserverContext, Store store, List, ScanType, long, InternalScanner) throws IOException;
14140 void preSplit(ObserverContext) throws IOException;
14141 void preSplit(ObserverContext, byte[]) throws IOException;
14142 void postSplit(ObserverContext, Region, Region) throws IOException;
14143 void preSplitBeforePONR(ObserverContext, byte[], List) throws IOException;
14144 void preSplitAfterPONR(ObserverContext) throws IOException;
14145 void preRollBackSplit(ObserverContext) throws IOException;
14146 void postRollBackSplit(ObserverContext) throws IOException;
14147 void postCompleteSplit(ObserverContext) throws IOException;
14148 long preIncrementColumnValue(ObserverContext, byte[], byte[], byte[], long, boolean) throws IOException;
14149 long postIncrementColumnValue(ObserverContextc, byte[], byte[], byte[], long, boolean, long) throws IOException;
14150 KeyValueScanner preStoreScannerOpen(ObserverContext, Store, Scan, NavigableSet, KeyValueScanner) throws IOException;
14151 boolean postScannerFilterRow(ObserverContext, InternalScanner, byte[], int, short, boolean) throws IOException;
14152 boolean postBulkLoadHFile(ObserverContext, List, boolean) throws IOException;
14153
14154 And this method is also removed since we never call it in our code base:
14155 InternalScanner preFlushScannerOpen(ObserverContext, Store, KeyValueScanner, InternalScanner, long) throws IOException;
14156
14157 The deprecated annotation is removed for these two methods as they are still being used:
14158 void preFlush(ObserverContext) throws IOException;
14159 void postFlush(ObserverContextc) throws IOException;
14160
14161
14162 ---
14163
14164 * [HBASE-18733](https://issues.apache.org/jira/browse/HBASE-18733) | *Major* | **[compat 1-2] Hide WALKey**
14165
14166 WALKey, @InterfaceAudience.LimitedPrivate(HBaseInterfaceAudience.REPLICATION), changed a bunch for 2.0.0. See below. We figured it ok hiding it since it should be internals anyway -- only we should be making them.
14167
14168
14169 ---
14170
14171 * [HBASE-13271](https://issues.apache.org/jira/browse/HBASE-13271) | *Critical* | **Table#puts(List\<Put\>) operation is indeterminate; needs fixing**
14172
14173 Adds more spec on how Get, Delete, and Put work and how they differ to help the user.
14174
14175
14176 ---
14177
14178 * [HBASE-16479](https://issues.apache.org/jira/browse/HBASE-16479) | *Major* | **Move WALEdit from hbase.regionserver.wal package to hbase.wal package**
14179
14180 Incompatible move of WALEdit class from regionserver.wal to wal. Effects @InterfaceAudience.LimitedPrivate({ HBaseInterfaceAudience.REPLICATION,
14181     HBaseInterfaceAudience.COPROC })
14182
14183 (
14184
14185
14186 ---
14187
14188 * [HBASE-10240](https://issues.apache.org/jira/browse/HBASE-10240) | *Critical* | **Remove 0.94-\>0.96 migration code**
14189
14190 Purge 0.94=\>0.96 deprecated, migration code. This means that if you are on 0.94 and wish to go to hbase 2.0, you must first migrate to a version of hbase that is \>= 0.96.
14191
14192
14193 ---
14194
14195 * [HBASE-18783](https://issues.apache.org/jira/browse/HBASE-18783) | *Minor* | **Declare the builder of ClusterStatus as IA.Private, and remove the Writables from ClusterStatus**
14196
14197 **WARNING: No release note provided for this change.**
14198
14199
14200 ---
14201
14202 * [HBASE-18106](https://issues.apache.org/jira/browse/HBASE-18106) | *Critical* | **Redo ProcedureInfo and LockInfo**
14203
14204 Admin.listProcedures and Admin.listLocks were renamed to getProcedures and getLocks (listProcedures was added to hbase 1.2). This change was done in an incompatible way -- we just yanked listProcedures (Because Admin Interface is not compatible with hbase1).
14205
14206     Main changes:
14207     - ProcedureInfo and LockInfo were removed, we use JSON instead of them
14208     - Procedure and LockedResource are their server side equivalent
14209     - Procedure protobuf state\_data became obsolate, it is only kept for
14210       reading previously written WAL
14211     - Procedure protobuf contains a state\_message field, which stores the internal
14212       state messages (Any type instead of bytes)
14213     - Procedure.serializeStateData and deserializeStateData were changed slightly
14214     - Procedures internal states are available on client side
14215     - Procedures are displayed on web UI and in shell in the following jruby format:
14216       { ID =\> '1', PARENT\_ID = '-1', PARAMETERS =\> [ ..extra state information.. ] }
14217
14218
14219 ---
14220
14221 * [HBASE-18621](https://issues.apache.org/jira/browse/HBASE-18621) | *Major* | **Refactor ClusterOptions before applying to code base**
14222
14223 Provide a new way to get desired ClusterStatus with a set of ClusterStatus.Option, such that the response back to client can be limited.
14224 Note that, the constructor way to new a ClusterStatus will be no longer support after 2.0.0,  and use ClusterStatus.Builder instead.
14225
14226
14227 ---
14228
14229 * [HBASE-18780](https://issues.apache.org/jira/browse/HBASE-18780) | *Minor* | **Remove HLogPrettyPrinter and hlog command**
14230
14231 **WARNING: No release note provided for this change.**
14232
14233
14234 ---
14235
14236 * [HBASE-14997](https://issues.apache.org/jira/browse/HBASE-14997) | *Critical* | **Move compareOp and Comparators out of filter to client package**
14237
14238 Deprecate checkAnd\* APIs that take the filter CompareOp. Added new overrides that take a generic CompareOperator instead. CompareOperator will be used by checkAnd\* in Table API and by filters going forward.
14239
14240 Other nice improvements suggested by this issue have been moved out to HBASE-18774.
14241
14242
14243 ---
14244
14245 * [HBASE-17972](https://issues.apache.org/jira/browse/HBASE-17972) | *Minor* | **Remove mergePool from CompactSplitThread**
14246
14247 After this jira, mergePool will be permanently removed from CompactSplitThread.
14248
14249
14250 ---
14251
14252 * [HBASE-18704](https://issues.apache.org/jira/browse/HBASE-18704) | *Major* | **Upgrade hbase to commons-collections 4**
14253
14254 **WARNING: No release note provided for this change.**
14255
14256
14257 ---
14258
14259 * [HBASE-18697](https://issues.apache.org/jira/browse/HBASE-18697) | *Major* | **Need a shaded hbase-mapreduce module**
14260
14261 Replaces hbase-shaded-server-\<version\>.jar with hbase-shaded-mapreduce-\<version\>.jar.
14262
14263
14264 ---
14265
14266 * [HBASE-15607](https://issues.apache.org/jira/browse/HBASE-15607) | *Blocker* | **Remove PB references from Admin for 2.0**
14267
14268 All the references to Protos in Admin.java have been removed and replaced with respective POJO classes.
14269 The references to Protos that were removed are
14270 AdminProtos.GetRegionInfoResponse,
14271 HBaseProtos.SnapshotDescription, HBaseProtos.SnapshotDescription.Type,
14272  MasterProtos.SnapshotResponse.
14273 CompactionType, CompactionState and MasterSwitchType Enums have been moved out of Admin.java to standalone Enums.
14274
14275
14276 ---
14277
14278 * [HBASE-18674](https://issues.apache.org/jira/browse/HBASE-18674) | *Major* | **upgrade hbase to commons-lang3**
14279
14280 Move to commons-lang3 from common-lang (check it out!... Nice lib...Some nice utility)
14281
14282
14283 ---
14284
14285 * [HBASE-18736](https://issues.apache.org/jira/browse/HBASE-18736) | *Major* | **Cleanup the HTD/HCD for Admin**
14286
14287 Changed the passed arguments from HTD/HCD to TD/CFD for Admin.
14288
14289
14290 ---
14291
14292 * [HBASE-18699](https://issues.apache.org/jira/browse/HBASE-18699) | *Major* | **Copy LoadIncrementalHFiles to another package and mark the old one as deprecated**
14293
14294 Introduce a new o.a.h.h.tool.LoadIncrementalHFiles. The old o.a.h.h.mapreduce.LoadIncrementalHFiles is deprecated and will be removed in 3.0.0.
14295
14296
14297 ---
14298
14299 * [HBASE-18739](https://issues.apache.org/jira/browse/HBASE-18739) | *Major* | **Make all TimeRange Constructors InterfaceAudience Private.**
14300
14301 All constructors have already been deprecated. This change makes them InterfaceAudience Private.
14302
14303
14304 ---
14305
14306 * [HBASE-18675](https://issues.apache.org/jira/browse/HBASE-18675) | *Minor* | **Making {max,min}SessionTimeout configurable for MiniZooKeeperCluster**
14307
14308 <!-- markdown -->
14309
14310
14311 Standalone clusters and minicluster instances can now configure the session timeout for our embedded ZooKeeper quorum using `hbase.zookeeper.property.minSessionTimeout` and `hbase.zookeeper.property.maxSessionTimeout`.
14312
14313
14314 ---
14315
14316 * [HBASE-15806](https://issues.apache.org/jira/browse/HBASE-15806) | *Critical* | **An endpoint-based export tool**
14317
14318 org.apache.hadoop.hbase.coprocessor.Export
14319 Instructs HBase to dump the contents of table to HDFS in a sequence file
14320 + replaces MR by endpoint (see org.apache.hadoop.hbase.mapreduce.Export)
14321 + no large data to be transfered between hbase server and client
14322 + same command line as org.apache.hadoop.hbase.mapreduce.Export
14323 - user needs to alter table for deploying ExportEndpoint
14324 - user needs to adjust the endpoint timeout for dumping large data
14325 - user needs to get the EXECUTE permission
14326
14327
14328 ---
14329
14330 * [HBASE-18577](https://issues.apache.org/jira/browse/HBASE-18577) | *Critical* | **shaded client includes several non-relocated third party dependencies**
14331
14332 <!-- markdown -->
14333
14334
14335 The HBase shaded artifacts (hbase-shaded-client and hbase-shaded-server) no longer contain several non-relocated third party dependency classes that were mistakenly included. Downstream users who relied on these classes being present will need to add a runtime dependency onto an appropriate third party artifact.
14336
14337 Previously, we erroneously packaged several third party libs without relocating them. In some cases these libraries have now been relocated; in some cases they are no longer included at all.
14338
14339 Includes:
14340
14341 * jaxb
14342 * jetty
14343 * jersey
14344 * codahale metrics (HBase 1.4+ only)
14345 * commons-crypto
14346 * jets3t
14347 * junit
14348 * curator (HBase 1.4+)
14349 * netty 3 (HBase 1.1)
14350 * mokito-junit4 (HBase 1.1)
14351
14352 There is now testing to ensure that the shaded artifacts only contain expected relocated content. It can be run via `mvn -Dtest=noUnitTests -pl hbase-shaded/hbase-shaded-check-invariants -am -Prelease verify`.
14353
14354 For version 2.0+ this patch removes hadoop-mapreduce-client-core from the set of dependencies included for the hbase-client and hbase-shaded-client artifacts.
14355
14356 For 2.0+, the slf4j-log4j12 dependency is now optional for both shaded artifacts.
14357
14358
14359 ---
14360
14361 * [HBASE-14745](https://issues.apache.org/jira/browse/HBASE-14745) | *Blocker* | **Shade the last few dependencies in hbase-shaded-client**
14362
14363 Previously some dependencies in hbase-shaded-client were still leaking into the un-shaded namespace. This should now be fixed.
14364
14365 Additionally the rat checking on generated intermediate files from shading should be skipped.
14366
14367
14368 ---
14369
14370 * [HBASE-18665](https://issues.apache.org/jira/browse/HBASE-18665) | *Critical* | **ReversedScannerCallable invokes getRegionLocations incorrectly**
14371
14372 Performing reverse scan on tables used the meta cache incorrectly and fetched data from meta table every time. This fix solves this issue and which results in performance improvement for reverse scans.
14373
14374
14375 ---
14376
14377 * [HBASE-3935](https://issues.apache.org/jira/browse/HBASE-3935) | *Major* | **HServerLoad.storefileIndexSizeMB should be changed to storefileIndexSizeKB**
14378
14379 This patch removed the storefile\_index\_size\_MB in protobuf. It will cause the value of storefile\_index\_size\_MB is zero if user still use hbase-client 1.x.
14380
14381
14382 ---
14383
14384 * [HBASE-18640](https://issues.apache.org/jira/browse/HBASE-18640) | *Major* | **Move mapreduce out of hbase-server into separate hbase-mapreduce module**
14385
14386 - Moves all org.apache.hadoop.hbase.mapreduce.\* (except LoadIncrementalHFiles) and org.apache.hadoop.hbase.mapred.\* classes from hbase-server module to new hbase-mapreduce module.
14387 - Also moves following tools from hbase-server module to hbase-mapreduce module: CompactionTool, ExportSnapshot, PerformanceEvaluation, LoadTestTool
14388 - Very minor breakages in  LoadTestTool(LimitedPrivate HBaseInterfaceAudience.TOOLS)
14389
14390
14391 ---
14392
14393 * [HBASE-18519](https://issues.apache.org/jira/browse/HBASE-18519) | *Major* | **Use builder pattern to create cell**
14394
14395 Introduce the CellBuilder helper.
14396 1) Using CellBuilderFactory to get CellBuilder for creating cell with row,
14397     column, qualifier, type, and value.
14398 2) For internal use, the ExtendedCellBuilder, which is created by ExtendedCellBuilderFactory, is able to build cell with extra fields - sequence id and tags -
14399
14400
14401 ---
14402
14403 * [HBASE-18448](https://issues.apache.org/jira/browse/HBASE-18448) | *Minor* | **EndPoint example  for refreshing HFiles for stores**
14404
14405 Adds a new RefreshHFiles Coprocessor Endpoint example. Includes client and serverside-endpoint that iterates region Stores to call #refreshStoreFiles.
14406
14407
14408 ---
14409
14410 * [HBASE-18658](https://issues.apache.org/jira/browse/HBASE-18658) | *Major* | **Purge hokey hbase Service implementation; use (internal) Guava Service instead**
14411
14412 Removed hbase Service class. It was not fully-formed. Now Guava is relocated, use its Service instead internally; it has nice implementation facility too in AbstractService.
14413
14414
14415 ---
14416
14417 * [HBASE-15982](https://issues.apache.org/jira/browse/HBASE-15982) | *Blocker* | **Interface ReplicationEndpoint extends Guava's Service**
14418
14419     Breaking change to our ReplicationEndpoint and BaseReplicationEndpoint.
14420
14421     ReplicationEndpoint implemented Guava 0.12 Service. An abstract
14422     subclass, BaseReplicationEndpoint, provided default implementations
14423     and facility, among other things, by extending Guava's
14424     AbstractService class.
14425
14426     Both of these HBase classes were marked LimitedPrivate for
14427     REPLICATION so these classes were semi-public and made it so
14428     Guava 0.12 was part of our API.
14429
14430     Having Guava in our API was a mistake. It anchors us and the
14431     implementation of the Interface to Guava 0.12. This is untenable
14432     given Guava changes and that the Service Interface in particular
14433     has had extensive revamp and improvement done. We can't hold to
14434     the Guava Interface. It changed. We can't stay on Guava 0.12;
14435     implementors and others on our CLASSPATH won't abide being stuck
14436     on an old Guava.
14437
14438     So we make breaking changes. The unhitching of our Interface
14439     from Guava could only be done in a breaking manner. It undoes the
14440     LimitedPrivate on BaseReplicationEndpoint while keeping it for the RE
14441     Interface. It means consumers will have to copy/paste the
14442     AbstractService-based BRE into their own codebase also supplying their
14443     own Guava; HBase no longer 'supplies' this (our Guava usage has
14444     been internalized, relocated).
14445
14446     This patch then adds into RE the basic methods RE needs of the old
14447     Guava Service rather than return a Service to start/stop only to go
14448     back to the RE instance to do actual work. A few method names had to
14449     be changed so could make implementations with Guava Service internally
14450     and not have RE method names and types clash). Semantics remained the
14451     same otherwise. For example startAsync and stopAsync in Guava are start
14452     and stop in RE.
14453
14454
14455 ---
14456
14457 * [HBASE-18347](https://issues.apache.org/jira/browse/HBASE-18347) | *Major* | **Implement a BufferedMutator for async client**
14458
14459 Introduce an AsyncBufferedMutator for batching requests to HBase for a single table.
14460
14461 Use AsyncConnection.getBufferedMutator method to get an AsyncBufferedMutator instance.
14462
14463
14464 ---
14465
14466 * [HBASE-18546](https://issues.apache.org/jira/browse/HBASE-18546) | *Critical* | **Always overwrite the TS for Append/Increment unless no existing cells are found**
14467
14468 If there is no existing cell in submitting Append/Increment, the custom ts won't be overridden. By contrast, the cell's ts will always be overridden by server.
14469
14470
14471 ---
14472
14473 * [HBASE-18224](https://issues.apache.org/jira/browse/HBASE-18224) | *Critical* | **Upgrade jetty**
14474
14475 Moved from Jetty 9.3.x to 9.4.x.
14476
14477 Jetty returns more correct HTTP code when Header is too long, 431 instead of 413, and it requires more threads to start up (made default 16 instead of 10).
14478
14479
14480 ---
14481
14482 * [HBASE-17442](https://issues.apache.org/jira/browse/HBASE-17442) | *Critical* | **Move most of the replication related classes from hbase-client to hbase-replication package**
14483
14484 Move replication implementation's classes from hbase-client to hbase-replication package.
14485
14486
14487 ---
14488
14489 * [HBASE-18653](https://issues.apache.org/jira/browse/HBASE-18653) | *Major* | **Undo hbase2 check against \< hadoop2.6.x; i.e. implement agreed drop of hadoop 2.4 and 2.5 support in hbase2**
14490
14491 Change the yetus profile for branch-2 so it no longer runs hadoop 2.4.x and 2.5.x build checks.
14492
14493
14494 ---
14495
14496 * [HBASE-18630](https://issues.apache.org/jira/browse/HBASE-18630) | *Major* | **Prune dependencies; as is branch-2 has duplicates**
14497
14498 Removed doubled instances of javax.inject and commons-beanutils where the versions were close.
14499
14500 Other instances of 'double' includes have different groupids so wary pruning especially when transitive includes (hadoop or jetty et al.)
14501
14502
14503 ---
14504
14505 * [HBASE-18631](https://issues.apache.org/jira/browse/HBASE-18631) | *Minor* | **Allow configuration of ChaosMonkey properties via hbase-site**
14506
14507 This change invalidates the need for a separate Java properties file to configure the ChaosMonkey included with HBase. These properties can be provided directly in hbase-site.xml. If configuration in provided in both locations, the Java properties file takes precendence.
14508
14509
14510 ---
14511
14512 * [HBASE-18489](https://issues.apache.org/jira/browse/HBASE-18489) | *Major* | **Expose scan cursor in RawScanResultConsumer**
14513
14514 Add a 'cursor' method which returns an 'Optional\<Cursor\>' in 'RawScanResultConsumer.ScanController'. You can use this method to obtain the scan cursor if available.
14515
14516
14517 ---
14518
14519 * [HBASE-18511](https://issues.apache.org/jira/browse/HBASE-18511) | *Blocker* | **Default no regions on master**
14520
14521 Changes the configuration hbase.balancer.tablesOnMaster from list of table names that the can carry (with 'none' meaning no tables on the master) to instead be a boolean that is set to true if master carries tables/regions and false if it does not. If true, the master acts like any regionserver.
14522
14523 If false, then the master carries no tables. This is the default for hbase-2.0.0.
14524
14525 Another boolean configuration, hbase.balancer.tablesOnMaster.systemTablesOnly, when set to true, enables hbase.balancer.tablesOnMaster and makes it so the master hosts system tables exclusively (the long-time deploy mode of master branch and branch-2 up until this commit).
14526
14527 UPDATE: This is broke. See HBASE-19785.
14528 UPDATE2: Master carrying Regions does not work reliably, see HBASE-19828.
14529
14530 See HBASE-19831, the issue to fix regions on Master
14531
14532 The change of hbase.balancer.tablesOnMaster from String list to boolean and
14533 the addition of a simple boolean to enable system-tables on Master was done
14534 to constrain what operators might ask for via this master configuration.
14535 Stipulating what tables are bound to the Master server verges into
14536 regionserver grouping territory, a more robust means of specifying table
14537 and server combinations. Operators should use this latter if they want
14538 layouts more exotic than those supplied by the provided booleans.
14539
14540
14541 ---
14542
14543 * [HBASE-18553](https://issues.apache.org/jira/browse/HBASE-18553) | *Major* | **Expose scan cursor for asynchronous scanner**
14544
14545 The ResultScanner which is gotten from an AsyncTable will also return cursor results if Scan.isNeedCursorResult is true.
14546
14547
14548 ---
14549
14550 * [HBASE-18598](https://issues.apache.org/jira/browse/HBASE-18598) | *Minor* | **AsyncNonMetaRegionLocator use FIFO algorithm to get a candidate locate request**
14551
14552 Introduce FIFO algorithm to get a candidate locate request for AsyncNonMetaRegionLocator.
14553
14554
14555 ---
14556
14557 * [HBASE-18533](https://issues.apache.org/jira/browse/HBASE-18533) | *Major* | **Expose BucketCache values to be configured**
14558
14559 This patch exposes configuration for Bucketcache. These configs are very similar to those for the LRU cache, but are described below:
14560
14561 "hbase.bucketcache.single.factor"; /\*\* Single access bucket size \*/
14562 "hbase.bucketcache.multi.factor"; /\*\* Multiple access bucket size \*/
14563 "hbase.bucketcache.memory.factor"; /\*\* In-memory bucket size \*/
14564 "hbase.bucketcache.extrafreefactor"; /\*\* Free this floating point factor of extra blocks when evicting. For example free the number of blocks requested \* (1 + extraFreeFactor) \*/
14565 "hbase.bucketcache.acceptfactor"; /\*\* Acceptable size of cache (no evictions if size \< acceptable) \*/
14566 "hbase.bucketcache.minfactor"; /\*\* Minimum threshold of cache (when evicting, evict until size \< min) \*/
14567
14568
14569 ---
14570
14571 * [HBASE-18528](https://issues.apache.org/jira/browse/HBASE-18528) | *Critical* | **DON'T allow user to modify the passed table/column descriptor**
14572
14573 **WARNING: No release note provided for this change.**
14574
14575
14576 ---
14577
14578 * [HBASE-18271](https://issues.apache.org/jira/browse/HBASE-18271) | *Blocker* | **Shade netty**
14579
14580 Depend on hbase-thirdparty for our netty instead of directly relying on netty-all. netty is relocated in hbase-thirdparty from io.netty to org.apache.hadoop.hbase.shaded.io.netty. One kink is that netty bundles an .so. Its files also are relocated. So netty can find the .so content, need to specify on command-line a system property telling netty about the shading.
14581
14582 The .so trick is from
14583              https://stackoverflow.com/questions/33825743/rename-files-inside-a-jar-using-some-maven-plugin
14584
14585 In essence we need the below defined whenever we run tests or deploy:
14586
14587 -Dorg.apache.hadoop.hbase.shaded.io.netty.packagePrefix=org.apache.hadoop.hbase.shaded.
14588
14589 (The trailing '.' is required)
14590
14591 See toward the end of this issue for how to pass config: https://github.com/netty/netty/issues/6665
14592
14593 The system property has been added to bin/hbase. If starting hbase with other than bin/hbase, add this system property (at least on linux).
14594
14595 For devs, going forward, do not reference io.netty. Reference org.apache.hadoop.hbase.io.netty instead. Here is sample:
14596
14597 {code}
14598 -import io.netty.channel.Channel;
14599 -import io.netty.channel.EventLoop;
14600 +import org.apache.hadoop.hbase.shaded.io.netty.channel.Channel;
14601 +import org.apache.hadoop.hbase.shaded.io.netty.channel.EventLoop;
14602 {code}
14603
14604
14605 ---
14606
14607 * [HBASE-15511](https://issues.apache.org/jira/browse/HBASE-15511) | *Major* | **ClusterStatus should be able to return responses by scope**
14608
14609 Provide a new way to get desired ClusterStatus with a set of ClusterStatus.Option, such that the response back to client can be limited.
14610 Note that, the constructor way to new a ClusterStatus will be no longer support after 2.0.0,  and use ClusterStatus.Builder instead.
14611
14612
14613 ---
14614
14615 * [HBASE-18551](https://issues.apache.org/jira/browse/HBASE-18551) | *Major* | **[AMv2] UnassignProcedure and crashed regionservers**
14616
14617 Unassign will not proceed if it is unable to talk to the remote server. Now it will expire the server it is unable to communicate with and then wait until it is signaled by ServerCrashProcedure that the server's logs have been split. Only then will judge the unassign successful.
14618
14619 We do this because a subsequent assign lacking the crashed server context might open a region w/o first splitting logs.
14620
14621
14622 ---
14623
14624 * [HBASE-18469](https://issues.apache.org/jira/browse/HBASE-18469) | *Critical* | **Correct  RegionServer metric of  totalRequestCount**
14625
14626 In HBASE-18469 we introduced a new RegionServer metrics in name of "totalRowActionRequestCount" which counts in all row actions and equals to the sum of "readRequestCount" and "writeRequestCount". Meantime, we have changed "totalRequestCount" to count only once for multi request, while previously we will count in action number of the request. As a result, existing monitoring system on totalRequestCount will still work but see a smaller value, and we strongly recommend to change to use the new metrics to monitor server load.
14627
14628
14629 ---
14630
14631 * [HBASE-18500](https://issues.apache.org/jira/browse/HBASE-18500) | *Major* | **Performance issue: Don't use BufferedMutator for HTable's put method**
14632
14633 Remove the deprecated method get/setWriteBufferSize from Table and remove writeBufferSize from TableBuilder. Remove the BufferedMutatorImpl from HTable.
14634
14635
14636 ---
14637
14638 * [HBASE-18387](https://issues.apache.org/jira/browse/HBASE-18387) | *Minor* | **[Thrift] Make principal configurable in DemoClient.java**
14639
14640 This change allows the demonstration Thrift client to customize the server principal used by the Thrift server for instances secured with Kerberos.
14641
14642
14643 ---
14644
14645 * [HBASE-17125](https://issues.apache.org/jira/browse/HBASE-17125) | *Critical* | **Inconsistent result when use filter to read data**
14646
14647 Marked Scan and Get's setMaxVersions() and setMaxVersions(int) as deprecated. They are easy to misunderstand with column family's max versions, so use readAllVersions() and readVersions(int) instead.
14648
14649
14650 ---
14651
14652 * [HBASE-18492](https://issues.apache.org/jira/browse/HBASE-18492) | *Major* | **[AMv2] Embed code for selecting highest versioned region server for system table regions in AssignmentManager.processAssignQueue()**
14653
14654 Favors new servers over older versions when assigning system table regions (more to follow in this area; i.e. changes in the AM itself).
14655
14656
14657 ---
14658
14659 * [HBASE-18517](https://issues.apache.org/jira/browse/HBASE-18517) | *Major* | **limit max log message width in log4j**
14660
14661 Sets a log length max of 1000 characters.
14662
14663
14664 ---
14665
14666 * [HBASE-18502](https://issues.apache.org/jira/browse/HBASE-18502) | *Critical* | **Change MasterObserver to use TableDescriptor and ColumnFamilyDescriptor**
14667
14668 The methods which change to use TableDescriptor/ColumnFamilyDescriptor are shown below.
14669 + preCreateTable( ObserverContext,TableDescriptor, HRegionInfo[])
14670 + postCreateTable(ObserverContext ,TableDescriptor, HRegionInfo[])
14671 + preCreateTableAction(ObserverContext, TableDescriptor,HRegionInfo[])
14672 + postCompletedCreateTableAction(ObserverContext,TableDescriptor,HRegionInfo[])
14673 + preModifyTable(ObserverContext,TableName, TableDescriptor)
14674 + postModifyTable(ObserverContext,TableName, TableDescriptor)
14675 + preModifyTableAction( ObserverContext,TableName,TableDescriptor)
14676 + postCompletedModifyTableAction( ObserverContext,TableName,TableDescriptor)
14677 + preAddColumnFamily(ObserverContext,TableName, ColumnFamilyDescriptor)
14678 + postAddColumnFamily(ObserverContext,TableName, ColumnFamilyDescriptor)
14679 + preAddColumnFamilyAction(ObserverContext,TableName,ColumnFamilyDescriptor)
14680 + postCompletedAddColumnFamilyAction(ObserverContext,TableName, ColumnFamilyDescriptor)
14681 + preModifyColumnFamily(ObserverContext,TableName, ColumnFamilyDescriptor)
14682 + preModifyColumnFamilyAction(ObserverContext\<MasterCoprocessorEnvironment,TableName,ColumnFamilyDescriptor)
14683 + postCompletedModifyColumnFamilyAction(ObserverContext\<MasterCoprocessorEnvironment\>,TableName,ColumnFamilyDescriptor)
14684 + preCloneSnapshot(ObserverContext\<MasterCoprocessorEnvironment\>,SnapshotDescription,TableDescriptor)
14685 + postCloneSnapshot(ObserverContext\<MasterCoprocessorEnvironment\>,SnapshotDescription,TableDescripto)
14686 + preRestoreSnapshot(ObserverContext\<MasterCoprocessorEnvironment,SnapshotDescription,TableDescriptor)
14687 + postRestoreSnapshot(ObserverContext\<MasterCoprocessorEnvironment,SnapshotDescription,TableDescriptor)
14688 + preGetTableDescriptors(ObserverContext\<MasterCoprocessorEnvironment\>,List\<TableName\>, List\<TableDescriptor\>,String)
14689 + postGetTableDescriptors(ObserverContext\<MasterCoprocessorEnvironment\>,List\<TableName\>, List\<TableDescriptor\>,String)
14690 + preGetTableNames(ObserverContext\<MasterCoprocessorEnvironment\>,List\<TableDescriptor\>, String)
14691 + postGetTableNames(ObserverContext\<MasterCoprocessorEnvironment\>,List\<TableDescriptor\>, String)
14692
14693
14694 ---
14695
14696 * [HBASE-18520](https://issues.apache.org/jira/browse/HBASE-18520) | *Minor* | **Add jmx value to determine true Master Start time**
14697
14698 This JIRA adds a JMX value to track when the Master has finished initializing.
14699 The jmx config is 'masterFinishedInitializationTime' and details the time in millis that the Master is fully usable and ready to serve requests.
14700
14701
14702 ---
14703
14704 * [HBASE-17056](https://issues.apache.org/jira/browse/HBASE-17056) | *Critical* | **Remove checked in PB generated files**
14705
14706 Purge all checked in generated protobuf files (30MB). Generate protobuf files inline with the build. Remove checked-in and patched protobuf. Get it from new hbase-thirdparty instead.
14707
14708 Side-effect: Our protobuf went from 3.1.0 to 3.3.1.
14709
14710 Build does not take noticeably longer (still about 2.5 minutes to do a mvn clean install -DskipTests).
14711
14712 IDEs will probably require a mvn build first else they'll complain about missing (generated) files.
14713
14714
14715 ---
14716
14717 * [HBASE-18374](https://issues.apache.org/jira/browse/HBASE-18374) | *Major* | **RegionServer Metrics improvements**
14718
14719 This change adds the latency metrics checkAndPut, checkAndDelete, putBatch and deleteBatch . Also the previous regionserver "mutate" latency metrics are renamed to "put" metrics. Batch metrics capture the latency of the entire batch containing put/delete whereas put/delete metrics capture latency per operation. Note this change will break existing monitoring based on regionserver "mutate" latency metric.
14720
14721
14722 ---
14723
14724 * [HBASE-18023](https://issues.apache.org/jira/browse/HBASE-18023) | *Minor* | **Log multi-\* requests for more than threshold number of rows**
14725
14726 HBASE-18023 introduces a warning message in the RegionServer log when an RPC is received from a client that has more than 5000 "actions" (where an "action" is a collection of mutations for a specific row) in a single RPC. Misbehaving clients who send large RPCs to RegionServers can be malicious, causing temporary pauses via garbage collection or denial of service via crashes. The threshold of 5000 actions per RPC is defined by the property "hbase.rpc.rows.warning.threshold" in hbase-site.xml.
14727
14728
14729 ---
14730
14731 * [HBASE-15968](https://issues.apache.org/jira/browse/HBASE-15968) | *Major* | **New behavior of versions considering mvcc and ts rather than ts only**
14732
14733 This issue resolved two long-term issues in HBase:
14734 Puts may be masked by a delete before them.
14735 Major compactions change query results.
14736
14737 This issue offer a new behavior to fix this issue with a little performance reduction. Set NEW\_VERSION\_BEHAVIOR to true to enable this feature in CF level. See HBASE-15968 for details.
14738 Note if you enable this feature, the order of Mutations matters. But replication will disorder the entries by default. So you have to enable serial replication if you have slave clusters. See HBASE-9465 for details.
14739
14740
14741 ---
14742
14743 * [HBASE-18107](https://issues.apache.org/jira/browse/HBASE-18107) | *Major* | **[AMv2] Remove DispatchMergingRegionsRequest & DispatchMergingRegions**
14744
14745 Removes merge region code added into branch-2 but that was not needed after all. Branch-2 replaced dispatchMergingRegions with MergeTableRegionsProcedure.
14746
14747 Removed:
14748
14749 # dispatchMergingRegions from Connection (was superceded long ago in branch-1).
14750 # mergeRegions from RsRpcServices (was not used).
14751
14752
14753 ---
14754
14755 * [HBASE-15816](https://issues.apache.org/jira/browse/HBASE-15816) | *Major* | **Provide client with ability to set priority on Operations**
14756
14757 Added setPriority(int priority) API to Put, Delete, Increment, Append, Get and Scan pojos.  So for all these ops, the user can provide a custom priority level.
14758
14759
14760 ---
14761
14762 * [HBASE-18430](https://issues.apache.org/jira/browse/HBASE-18430) | *Major* | **Typo in "contributing to documentation" page**
14763
14764 Pushed to {{master}}. Thanks, Coral! Congratulations on your first Apache HBase commit!
14765
14766
14767 ---
14768
14769 * [HBASE-17908](https://issues.apache.org/jira/browse/HBASE-17908) | *Critical* | **Upgrade guava**
14770
14771 Use relocated guava 22.0 gotten from the new hbase-thirdparty ancillary project.
14772
14773 Incompatible change. ReplicationEndpoint and subclasses extend guava Service which changed pretty radically between 12.0 and 22.0. Change is kosher because implementations are marked audience private. Still, this will likely cause grief for the likes of the downstream lily indexer.
14774
14775
14776 ---
14777
14778 * [HBASE-16993](https://issues.apache.org/jira/browse/HBASE-16993) | *Major* | **BucketCache throw java.io.IOException: Invalid HFile block magic when configuring hbase.bucketcache.bucket.sizes**
14779
14780 Any value for hbase.bucketcache.bucket.sizes  configuration to be multiple of 256.  If that is not the case, instantiation of L2 Bucket cache itself will fail throwing IllegalArgumentException.
14781
14782
14783 ---
14784
14785 * [HBASE-16090](https://issues.apache.org/jira/browse/HBASE-16090) | *Major* | **ResultScanner is not closed in SyncTable#finishRemainingHashRanges()**
14786
14787 pushed to 1.3 and 1.2. SyncTable was introduced in 1.2, so skipping 1.1.
14788
14789
14790 ---
14791
14792 * [HBASE-18332](https://issues.apache.org/jira/browse/HBASE-18332) | *Minor* | **Upgrade asciidoctor-maven-plugin**
14793
14794 Committed to master and branch-2. Thanks!
14795
14796
14797 ---
14798
14799 * [HBASE-18161](https://issues.apache.org/jira/browse/HBASE-18161) | *Minor* | **Incremental Load support for Multiple-Table HFileOutputFormat**
14800
14801 In order to use this feature, a user must
14802 1. Register their tables when configuring their job
14803  2. Create a composite key of the tablename and original rowkey to send as the mapper output key.
14804
14805   To register their tables (and configure their job for incremental load into multiple tables), a user must call the static MultiHFileOutputFormat.configureIncrementalLoad function to register the HBase tables that will be ingested into.
14806
14807 To create the composite key, a helper function MultiHFileOutputFormat2.createCompositeKey should be called with the destination tablename and rowkey as arguments, and the result should be output as the mapper key.
14808
14809  Before this JIRA, for HFileOutputFormat2 a configuration for the storage policy was set per Column Family. This was set manually by the user. In this JIRA, this is unchanged when using HFileOutputFormat2. However, when specifically using MultiHFileOutputFormat2, the user now has to manually set the prefix by creating a composite of the table name and the column family. The user can create the new composite value by calling MultiHFileOutputFormat2.createCompositeKey with the tablename and column family as arguments.
14810
14811 Changes added through this JIRA are backwards compatible with existing HFileOutputFormat2 apis and functionality.
14812
14813 The configuration parameter "hbase.mapreduce.hfileoutputformat.table.name" is now a REQUIRED parameter though it is normally set automatically when configureIncrementalLoad method is called within HFileOutputFormat2
14814
14815
14816 ---
14817
14818 * [HBASE-18229](https://issues.apache.org/jira/browse/HBASE-18229) | *Critical* | **create new Async Split API to embrace AM v2**
14819
14820 A new splitRegionAsync() API is added in client. The existing splitRegion()  and split() API will call the new API so client does not have to change its code.
14821
14822 Move HBaseAdmin.splitXXX() logic to master, client splitXXX() API now go to master directly instead of going to RegionServer first.
14823
14824 Also added splitSync() API
14825
14826
14827 ---
14828
14829 * [HBASE-18339](https://issues.apache.org/jira/browse/HBASE-18339) | *Major* | **Update test-patch to use hadoop 3.0.0-alpha4**
14830
14831 HBase now defaults to Apache Hadoop 3.0.0-alpha4 when the Hadoop 3 profile is active.
14832
14833
14834 ---
14835
14836 * [HBASE-18267](https://issues.apache.org/jira/browse/HBASE-18267) | *Major* | **The result from the postAppend is ignored**
14837
14838 **WARNING: No release note provided for this change.**
14839
14840
14841 ---
14842
14843 * [HBASE-18307](https://issues.apache.org/jira/browse/HBASE-18307) | *Major* | **Share the same EventLoopGroup for NettyRpcServer, NettyRpcClient and AsyncFSWALProvider at RS side**
14844
14845 There are two configuration name changes as the event loop configs will not only effect rpc server but be shared by different components in the same RS instance.
14846
14847 'hbase.rpc.server.nativetransport' -\> 'hbase.netty.nativetransport'
14848
14849 'hbase.netty.rpc.server.worker.count' -\> 'hbase.netty.worker.count'
14850
14851
14852 ---
14853
14854 * [HBASE-18241](https://issues.apache.org/jira/browse/HBASE-18241) | *Critical* | **Change client.Table, client.Admin, Region, Store, and HBaseTestingUtility to not use HTableDescriptor or HColumnDescriptor**
14855
14856 - : removed API
14857 + : new API
14858 \* : deprecated API
14859 ---------------------------
14860 Region class
14861 - HTableDescriptor getTableDesc()
14862 +TableDescriptor getTableDescriptor()
14863
14864 Store class
14865 - HColumnDescriptor getFamily()
14866 + ColumnFamilyDescriptor getColumnFamilyDescriptor()
14867
14868 Table class
14869 \* HTableDescriptor getTableDescriptor()
14870 + TableDescriptor getDescriptor()\|
14871
14872 \*Admin class\*
14873 \* HTableDescriptor getTableDescriptor(TableName)
14874 + List\<TableDescriptor\> listTableDescriptor(TableName)\|
14875 \* HTableDescriptor[] getTableDescriptors(List\<String\>)
14876 \* HTableDescriptor[] getTableDescriptorsByTableName(List\<TableName\>)
14877 + List\<TableDescriptor\> listTableDescriptors(List\<TableName\>)
14878 \* HTableDescriptor[] listTables()
14879 + List\<TableDescriptor\> listTableDescriptors()
14880 \* HTableDescriptor[] listTables(Pattern)
14881 + List\<TableDescriptor\> listTableDescriptors(Pattern)
14882 \* HTableDescriptor[] listTables(String)
14883 + List\<TableDescriptor\> listTableDescriptors(String)
14884 \* HTableDescriptor[] listTables(Pattern, boolean)
14885 + List\<TableDescriptor\> listTableDescriptors(Pattern, boolean)
14886 \* HTableDescriptor[] listTables(String, boolean)
14887 + List\<TableDescriptor\> listTableDescriptors(String, boolean)
14888 \* HTableDescriptor[] deleteTables(String)
14889 \* HTableDescriptor[] deleteTables(Pattern)
14890 \* HTableDescriptor[] enableTables(String)
14891 \* HTableDescriptor[] enableTables(Pattern)
14892 \* HTableDescriptor[] disableTables(String)
14893 \* HTableDescriptor[] disableTables(Pattern)
14894 \* void modifyTable(TableName, HTableDescriptor)
14895 + void modifyTable(TableDescriptor)
14896 \* void modifyTableAsync(TableName, HTableDescriptor)
14897 + void modifyTableAsync(TableDescriptor)
14898 \* HTableDescriptor[] listTableDescriptorsByNamespace(String)
14899 + List\<TableDescriptor\> listTableDescriptorsByNamespace(byte[])
14900 \* void createTable(HTableDescriptor)
14901 + void createTable(TableDescriptor)
14902 \* void createTable(HTableDescriptor, byte[], byte[], int)
14903 + void createTable({color:red}TableDescriptor, byte[], byte[], int)
14904 \* void createTable(HTableDescriptor, byte[][])
14905 + void createTable(TableDescriptor, byte[][])
14906 \* Future\<Void\> createTableAsync(HTableDescriptor, byte[][])
14907 + Future\<Void\> createTableAsync(TableDescriptor, byte[][])
14908
14909 \*HBaseTestingUtility class\*
14910 \* Table createTable(HTableDescriptor, byte[][], Configuration)
14911 + Table createTable(TableDescriptor, byte[][], Configuration)
14912 \* Table createTable(HTableDescriptor, byte[][], byte[][], Configuration)
14913 + Table createTable(TableDescriptor, byte[][], byte[][], Configuration)
14914 \* public Table createTable(HTableDescriptor, byte[][])
14915 + public Table createTable(TableDescriptor, byte[][])
14916 \* void modifyTableSync(Admin, HTableDescriptor)
14917 + void modifyTableSync(Admin, TableDescriptor)
14918 \* HRegion createLocalHRegion(HTableDescriptor, byte [], byte [])
14919 + HRegion createLocalHRegion(TableDescriptor, byte [], byte [])
14920 \* HRegion createLocalHRegion(HRegionInf, HTableDescriptor)
14921 + HRegion createLocalHRegion(HRegionInf, TableDescriptor)
14922 \* HRegion createLocalHRegion(HRegionInfo, HTableDescriptor, WAL)
14923 + HRegion createLocalHRegion(HRegionInfo, TableDescriptor, WAL)
14924 \* List createMultiRegionsInMeta(final Configuration, HTableDescriptor, byte [][])
14925 + List createMultiRegionsInMeta(final Configuration, TableDescriptor, byte [][])
14926 \* HRegion createRegionAndWAL(HRegionInfo, Path, Configuration, HTableDescriptor)
14927 + HRegion createRegionAndWAL(HRegionInfo, Path, Configuration, TableDescriptor)
14928 \* HRegion createRegionAndWAL(HRegionInfo, Pat, Configuration, HTableDescriptor, boolean)
14929 + HRegion createRegionAndWAL(HRegionInfo, Pat, Configuration, TableDescriptor, boolean)
14930 \* int createPreSplitLoadTestTable(Configuration,HTableDescriptor, HColumnDescriptor)
14931 + int createPreSplitLoadTestTable(Configuration,TableDescriptor, ColumnFamilyDescriptor)
14932 \* int createPreSplitLoadTestTable(Configuration, HTableDescriptor, HColumnDescriptor, int)
14933 + int createPreSplitLoadTestTable(Configuration, TableDescriptor, ColumnFamilyDescriptor, int)
14934 \* int createPreSplitLoadTestTable(Configuration, HTableDescriptor, HColumnDescriptor[], int)
14935 + int createPreSplitLoadTestTable(Configuration, TableDescriptor, ColumnFamilyDescriptor[], int)
14936 \* int createPreSplitLoadTestTable(Configuration,HTableDescriptor, HColumnDescriptor[],SplitAlgorithm, int)
14937 + int createPreSplitLoadTestTable(Configuration,TableDescriptor, ColumnFamilyDescriptor[],SplitAlgorithm, int)
14938 \* HRegion createTestRegion(String, HColumnDescriptor)
14939 + HRegion createTestRegion(String, ColumnFamilyDescriptor)
14940
14941
14942 ---
14943
14944 * [HBASE-18083](https://issues.apache.org/jira/browse/HBASE-18083) | *Major* | **Make large/small file clean thread number configurable in HFileCleaner**
14945
14946 After HBASE-18083 we could configure HFileCleaner to use multiple threads for large/small (archived) hfile cleaning with hbase.regionserver.hfilecleaner.large.thread.count and hbase.regionserver.hfilecleaner.small.thread.count, both default to 1. These properties support online configuration change.
14947
14948
14949 ---
14950
14951 * [HBASE-17931](https://issues.apache.org/jira/browse/HBASE-17931) | *Blocker* | **Assign system tables to servers with highest version**
14952
14953 We usually keep compatibility between old client and new server so we can do rolling upgrade, HBase cluster first, then HBase client. But we don't guarantee new client can access old server.
14954 In an HBase cluster, we have system tables and region servers will access these tables so for servers they are also an HBase client. So if the system tables are in region servers with lower version we may get trouble because region servers with higher version may can not access them.
14955 After this patch, we will move all system regions to region servers with highest version. So when we do a rolling upgrade across two major or minor versions, we should ALWAYS UPGRADE MASTER FIRST and then upgrade region servers. The new master will handle system tables correctly.
14956
14957
14958 ---
14959
14960 * [HBASE-6581](https://issues.apache.org/jira/browse/HBASE-6581) | *Major* | **Build with hadoop.profile=3.0**
14961
14962 Make us build against hadoop trunk (3.0)
14963
14964
14965 ---
14966
14967 * [HBASE-16120](https://issues.apache.org/jira/browse/HBASE-16120) | *Minor* | **Add shell test for truncate\_preserve**
14968
14969 Add unit tests for truncate\_preserve
14970
14971
14972 ---
14973
14974 * [HBASE-18240](https://issues.apache.org/jira/browse/HBASE-18240) | *Major* | **Add hbase-thirdparty, a project with hbase utility including an hbase-shaded-thirdparty module with guava, netty, etc.**
14975
14976 Adds a new project, hbase-thirdparty, at https://git-wip-us.apache.org/repos/asf/hbase-thirdparty used by core hbase. GroupID org.apache.hbase.thirdparty. Version 1.0.0.
14977
14978 This project packages relocated third-party libraries used by Apache HBase such as protobuf, guava, and netty among others. HBase core depends on it.
14979
14980 It has threre submodules, one to patch and then relocate (shade) protobuf, and one to do messy .so renaming (netty). The remainder module relocates a bundle of other (unpatched) libs used by hbase. This latter set includes protobuf-util, netty-all, gson, and guava.
14981
14982 All shading is done using the same relocation offset of org.apache.hadoop.hbase.shaded; we add this prefix to the relocated thirdparty library class names.
14983
14984 See the pom.xml in hbase-thirdparty for the explicit version of each third-party lib included (of note, we update out internal protobuf from 3.1.0 to 3.3.1).
14985
14986
14987 ---
14988
14989 * [HBASE-15943](https://issues.apache.org/jira/browse/HBASE-15943) | *Major* | **Add page displaying JVM process metrics**
14990
14991 Adds new "Process Metrics' tab along the top which leads to new page that dumps mbean -- mostly jvm -- metrics
14992
14993
14994 ---
14995
14996 * [HBASE-14902](https://issues.apache.org/jira/browse/HBASE-14902) | *Major* | **Revert some of the stringency recently introduced by checkstyle tightening**
14997
14998 Changes the checkstyle so that on a continuation line for javadoc, instead of default four spaces, instead now it is two spaces. Also one line statements as in if (true) x =1; now pass checkstyle.
14999
15000
15001 ---
15002
15003 * [HBASE-17110](https://issues.apache.org/jira/browse/HBASE-17110) | *Major* | **Improve SimpleLoadBalancer to always take server-level balance into account**
15004
15005 After HBASE-17110 the bytable strategy for SimpleLoadBalancer will also take server level balance into account
15006
15007
15008 ---
15009
15010 * [HBASE-17928](https://issues.apache.org/jira/browse/HBASE-17928) | *Major* | **Shell tool to clear compaction queues**
15011
15012 Adds clear\_compaction\_queues to the hbase shell.
15013 {code}
15014   Clear compaction queues on a regionserver.
15015   The queue\_name contains short and long.
15016   short is shortCompactions's queue,long is longCompactions's queue.
15017
15018   Examples:
15019   hbase\> clear\_compaction\_queues 'host187.example.com,60020'
15020   hbase\> clear\_compaction\_queues 'host187.example.com,60020','long'
15021   hbase\> clear\_compaction\_queues 'host187.example.com,60020', ['long','short']
15022 {code}
15023
15024
15025 ---
15026
15027 * [HBASE-18164](https://issues.apache.org/jira/browse/HBASE-18164) | *Critical* | **Much faster locality cost function and candidate generator**
15028
15029 New locality cost function and candidate generator that use caching and incremental computation to allow the stochastic load balancer to consider ~20x more cluster configurations for big clusters.
15030
15031
15032 ---
15033
15034 * [HBASE-18226](https://issues.apache.org/jira/browse/HBASE-18226) | *Major* | **Disable reverse DNS lookup at HMaster and use the hostname provided by RegionServer**
15035
15036 The following config is added by this JIRA:
15037
15038 hbase.regionserver.hostname.disable.master.reversedns
15039
15040 This config is for experts: don't set its value unless you really know what you are doing.
15041 When set to true, regionserver will use the current node hostname for the servername and HMaster will skip reverse DNS lookup and use the hostname sent by regionserver instead. Note that this config and hbase.regionserver.hostname are mutually exclusive. See https://issues.apache.org/jira/browse/HBASE-18226 for more details.
15042
15043 Caution: please make sure rolling upgrade succeeds before turning on this feature.
15044
15045
15046 ---
15047
15048 * [HBASE-16242](https://issues.apache.org/jira/browse/HBASE-16242) | *Major* | **Upgrade Avro to 1.7.7**
15049
15050 Apache HBase now specifies that version 1.7.7 of the Apache Avro library should be pulled in by maven and included in the convenience binary tarball.
15051
15052
15053 ---
15054
15055 * [HBASE-18213](https://issues.apache.org/jira/browse/HBASE-18213) | *Major* | **Add documentation about the new async client**
15056
15057 Add documentation for async client in section '66. Client' in ref guide.
15058
15059
15060 ---
15061
15062 * [HBASE-17008](https://issues.apache.org/jira/browse/HBASE-17008) | *Critical* | **Examples to make AsyncClient go down easy**
15063
15064 Add two examples for async client. AsyncClientExample is a simple example to show you how to use AsyncTable. HttpProxyExample is an example for advance user to show you how to use RawAsyncTable to write a fully asynchronous HTTP proxy server. There is no extra thread pool, all operations are executed inside netty's event loop.
15065
15066
15067 ---
15068
15069 * [HBASE-18200](https://issues.apache.org/jira/browse/HBASE-18200) | *Major* | **Set hadoop check versions for branch-2 and branch-2.x in pre commit**
15070
15071 Allow setting different hadoop check versions for branch-2 and branch-2.x when running pre commit check.
15072
15073
15074 ---
15075
15076 * [HBASE-18187](https://issues.apache.org/jira/browse/HBASE-18187) | *Major* | **Release hbase-2.0.0-alpha1**
15077
15078 Pushed the release. For detail: http://apache-hbase.679495.n3.nabble.com/ANNOUNCE-Apache-HBase-2-0-0-alpha-1-is-now-available-for-download-td4088484.html
15079
15080
15081 ---
15082
15083 * [HBASE-18137](https://issues.apache.org/jira/browse/HBASE-18137) | *Critical* | **Replication gets stuck for empty WALs**
15084
15085 0-length WAL files can potentially cause the replication queue to get stuck.  A new config "replication.source.eof.autorecovery" has been added: if set to true (default is false), the 0-length WAL file will be skipped after 1) the max number of retries has been hit, and 2) there are more WAL files in the queue.  The risk of enabling this is that there is a chance the 0-length WAL file actually has some data (e.g. block went missing and will come back once a datanode is recovered).
15086
15087
15088 ---
15089
15090 * [HBASE-18192](https://issues.apache.org/jira/browse/HBASE-18192) | *Blocker* | **Replication drops recovered queues on region server shutdown**
15091
15092 If a region server that is processing recovered queue for another previously dead region server is gracefully shut down, it can drop the recovered queue under certain conditions. Running without this fix on a 1.2+ release means possibility of continuing data loss in replication, irrespective of which WALProvider is used.
15093 If a single WAL group (or DefaultWALProvider) is used, running without this fix will always cause dataloss in replication whenever a region server processing recovered queues is gracefully shutdown.
15094
15095
15096 ---
15097
15098 * [HBASE-18109](https://issues.apache.org/jira/browse/HBASE-18109) | *Critical* | **Assign system tables first (priority)**
15099
15100 Adds a sort of procedures before submission so system tables are queued first (which will help ensure they go out first). This should be good enough along w/ existing scheduling mechanisms to ensure system/meta are assigned first (See reasoning below). Open new issue if insufficient.
15101
15102
15103 ---
15104
15105 * [HBASE-18008](https://issues.apache.org/jira/browse/HBASE-18008) | *Major* | **Any HColumnDescriptor we give out should be immutable**
15106
15107 1) The HColumnDescriptor got from Admin, AsyncAdmin, and Table is immutable.
15108 2) HColumnDescriptor have been marked as "Deprecated" and user should substituted
15109      ColumnFamilyDescriptor for HColumnDescriptor.
15110 3) ColumnFamilyDescriptor is constructed through ColumnFamilyDescriptorBuilder and it contains all of the read-only methods from HColumnDescriptor
15111 4) The value to which the IS\_MOB/MOB\_THRESHOLD is mapped is stored as String rather than Boolean/Long. The MOB is an new feature to 2.0 so this change should be acceptable
15112
15113
15114 ---
15115
15116 * [HBASE-18149](https://issues.apache.org/jira/browse/HBASE-18149) | *Major* | **The setting rules for table-scope attributes and family-scope attributes should keep consistent**
15117
15118 If the table-scope attributes value is false, you need not to enclose 'false' in single quotation.Both COMPACTION\_ENABLED =\> false and COMPACTION\_ENABLED =\> 'false' will take effect
15119
15120
15121 ---
15122
15123 * [HBASE-17849](https://issues.apache.org/jira/browse/HBASE-17849) | *Major* | **PE tool random read is not totally random**
15124
15125 When randomRead and randomSeekScan is used with PE tool, now we allow using both --size and --rows. The --size specifies the total size of the data (the range) on which the reads should be performed and --rows specifies the number of rows to be read by each client with in that range.
15126
15127
15128 ---
15129
15130 * [HBASE-15576](https://issues.apache.org/jira/browse/HBASE-15576) | *Major* | **Scanning cursor to prevent blocking long time on ResultScanner.next()**
15131
15132 If you don't like scanning being blocked too long because of heartbeat and partial result, you can use Scan#setNeedCursorResult(true) to get a special result within scanning timeout setting time which will tell you where row the server is scanning. See its javadoc for more details.
15133
15134
15135 ---
15136
15137 * [HBASE-16549](https://issues.apache.org/jira/browse/HBASE-16549) | *Major* | **Procedure v2 - Add new AM metrics**
15138
15139 Following AMv2 procedures are modified to override onSubmit(), onFinish() hooks provided by HBASE-17888 to do
15140 metrics calculations when procedures are submitted and finshed:
15141 \* AssignProcedure
15142 \* UnassignProcedure
15143 \* MergeTableRegionProcedure
15144 \* SplitTableRegionProcedure
15145 \* ServerCrashProcedure
15146
15147 Following metrics is collected for each of the above procedure during lifetime of a process:
15148 \* Total number of requests submitted for a type of procedure
15149 \* Histogram of runtime in milliseconds for successfully completed procedures
15150 \* Total number of failed procedures
15151
15152 As we are moving away from Hadoop's metric2, hbase-metrics-api module is used for newly added metrics.
15153
15154
15155 ---
15156
15157 * [HBASE-9393](https://issues.apache.org/jira/browse/HBASE-9393) | *Critical* | **Hbase does not closing a closed socket resulting in many CLOSE\_WAIT**
15158
15159 To handle this issue client need to have Hadoop client 2.6.4 or 2.7.0+ Hadoop version as CanUnBuffer interface which was added as part of HDFS-7694 is available in only those versions.
15160
15161
15162 ---
15163
15164 * [HBASE-18038](https://issues.apache.org/jira/browse/HBASE-18038) | *Critical* | **Rename StoreFile to HStoreFile and add a StoreFile interface for CP**
15165
15166 StoreFile is now changed to an interface. This is an incompatible change. The coprocessors which implement RegionObserver may need to modify their code.
15167
15168
15169 ---
15170
15171 * [HBASE-16196](https://issues.apache.org/jira/browse/HBASE-16196) | *Critical* | **Update jruby to a newer version.**
15172
15173 The bundled JRuby 1.6.8 has been updated to version 9.1.9.0. The represents a change from Ruby 1.8 to Ruby 2.3.3, which introduces non-compatible language changes for user scripts.
15174
15175 This JRuby version update required an update to joni-2.1.11 and jcodings-1.0.18, used for regular expression matching, as well as several transitive dependency updates that should not be user-visible.
15176
15177
15178 ---
15179
15180 * [HBASE-14614](https://issues.apache.org/jira/browse/HBASE-14614) | *Major* | **Procedure v2: Core Assignment Manager**
15181
15182 Replaces the AssignmentManager with a new procedurev2-based AssignmentManager
15183
15184 h1. AMv2
15185 Puts AssignmentManager up on top of the ProcedureV2 state machine with persistence engine. Each assignment atom is now a Procedure implementation; e.g. an AssignProcedure and an UnassignProcedure. Molecules of aggregated Procedures are used to do more involved assignment steps: e.g. the move region procedure is made of an Unassign followed by an Assign subprocedure.
15186
15187 AMv2 is 1500 lines. Old AM was near 4000. Functionality has been moved out to Procedures. In-memory states of regions and servers has been cleaned up stored in new RegionStates implementation. RegionStateStore takes care of publishing final region state out to the hbase:meta table.
15188
15189 New RemoteProcedureDispatcher/RSProcedureDispatcher runs the Procedure-based assignments ‘remotely’. Knows about ‘servers’. Does aggregation of assignments by time on a time/count basis so can send procedures in batches rather than one per RPC. Procedure status comes back on the back of the RegionServer heartbeat reporting online regions. The response is passed to the AMv2 to ‘process’. It will check against the in-memory state. If there is a mismatch, it fences out the RegionServer on the assumption that something went wrong on the RS side.Timeouts trigger retries. The Procedure machine ensures only one operation at a time on any one region/table using locking and smarts about what is serial and what can be run concurrently.
15190
15191 New accounting of RegionServer version will be used running rolling restarts.
15192
15193 ‘States’ -- OPENING, CLOSING, etc. -- are now in-memory in-the-master only serialized out to the ProcedureV2 WAL. They are no longer persisted to ZooKeeper.
15194
15195 h2. Assign Detail
15196 The Assign starts by pushing the "assign" operation to the AssignmentManager and then will go into a “waiting" state. The AM will batch the "assign" requests and ask the Balancer where to put the region (the various policies will be respected: retain, round-robin, random). Once the AM and the balancer have found a place for the region, the procedure will be resumed and an "open region" request will be placed in the Remote Dispatcher queue, and the procedure once again will go into a "waiting state".  The Remote Dispatcher will batch the various requests for that server and they will be sent to the RS for execution. The RS will complete the open operation by calling master.reportRegionStateTransition(). The AM will intercept the transition report, and notify the procedure. The procedure will finish the assignment by publishing to new state on hbase:meta or it will retry the assignment.
15197
15198 h3. Unassign Detail
15199  The Unassign starts by placing a "close region" request in the Remote Dispatcher queue, and the procedure will then go into a "waiting state". The Remote Dispatcher will batch the various requests for that server and they will be sent to the RS for execution. The RS will complete the open operation by calling master.reportRegionStateTransition(). The AM will intercept the transition report, and notify the procedure. The procedure will finish the unassign by publishing its new state on meta or it will retry the unassign.
15200
15201 h1. New Configs
15202  \* "hbase.procedure.remote.dispatcher.threadpool.size" defaults 128
15203  \* "hbase.procedure.remote.dispatcher.delay.msec" default 150ms
15204  \* "hbase.procedure.remote.dispatcher.max.queue.size" with default 32
15205  \* "hbase.regionserver.rpc.startup.waittime" with default 60 seconds.
15206 h1. TODO
15207 As of this writing.
15208
15209 Put up a model diagram.
15210
15211  \* Handle region migration
15212  \* Handle meta assignment first
15213  \* Handle sys table assignment first (e.g. acl, namespace)
15214  \* Handle table priorities
15215  \* Do we report same AM metrics as we used too? We do it all in here now.
15216
15217 INCOMPATIBLE
15218 A known incompatible is that because splits and merges are now run from the master, Coprocessors that used to watch for merge/split from a RegionObserver now no longer work; to watch split/merges, you need to have an observer on the Master instead.
15219
15220
15221 ---
15222
15223 * [HBASE-3462](https://issues.apache.org/jira/browse/HBASE-3462) | *Major* | **Fix table.jsp in regards to splitting a region/table with an optional splitkey**
15224
15225 UI pages for splitting/merging now operate by taking a row key prefix from the user rather than a full region name.
15226
15227
15228 ---
15229
15230 * [HBASE-18129](https://issues.apache.org/jira/browse/HBASE-18129) | *Major* | **truncate\_preserve fails when the truncate method doesn't exists on the master**
15231
15232 The command truncate\_preserve will be fine when the truncate method doesn't exist on the master
15233
15234
15235 ---
15236
15237 * [HBASE-18122](https://issues.apache.org/jira/browse/HBASE-18122) | *Major* | **Scanner id should include ServerName of region server**
15238
15239 The scanner id is not from 1 anymore.
15240 The first 32 bits are MurmurHash32 of ServerName string "host,port,ts". The ServerName contains both host, port, and start timestamp so it can prevent collision. The lowest 32bit is generated by atomic int.
15241
15242
15243 ---
15244
15245 * [HBASE-17997](https://issues.apache.org/jira/browse/HBASE-17997) | *Major* | **In dev environment, add jruby-complete jar to classpath only when jruby is needed**
15246
15247 When JRUBY\_HOME is specified, if the command is "hbase shell" or "hbase org.jruby.Main", CLASSPATH and HBASE\_OPTS will be updated according to JRUBY\_HOME specified
15248 \* Jar under JRUBY\_HOME is added to CLASSPATH
15249 \* The following will be added into HBASE\_OPTS
15250
15251 -Djruby.home=$JRUBY\_HOME -Djruby.lib=$JRUBY\_HOME/lib
15252
15253
15254 That is, as long as JRUBY\_HOME is specified, JRUBY\_HOME specified will take precedence.
15255 \* In dev env, the jar recorded in cached\_classpath\_jruby.txt will be ignored
15256 \* In non dev env, jruby-complete jar packaged with HBase will be ignored
15257
15258
15259 ---
15260
15261 * [HBASE-15616](https://issues.apache.org/jira/browse/HBASE-15616) | *Major* | **Allow null qualifier for all table operations**
15262
15263 After this issue, all table operations will support null qualifier, such as put/get/scan/increment/append/checkAndMutate/checkAndPut/checkAndDelete.
15264
15265
15266 ---
15267
15268 * [HBASE-18035](https://issues.apache.org/jira/browse/HBASE-18035) | *Critical* | **Meta replica does not give any primaryOperationTimeout to primary meta region**
15269
15270 When a client is configured to use meta replica, it sends scan request to all meta replicas almost at the same time. Since meta replica contains stale data, if result from one of replica comes back first, the client may get wrong region locations. To fix this, "hbase.client.meta.replica.scan.timeout" is introduced, a client will always send to primary meta region first, wait the configured timeout for reply. If no result is received, it will send request to replica meta regions. The unit for "hbase.client.meta.replica.scan.timeout"  is microsecond, the default value is 1000000 (1 second).
15271
15272
15273 ---
15274
15275 * [HBASE-11013](https://issues.apache.org/jira/browse/HBASE-11013) | *Major* | **Clone Snapshots on Secure Cluster Should provide option to apply Retained User Permissions**
15276
15277 While creating a snapshot, it will save permissions of the original table into .snapshotinfo file(Backward compatibility) , which is in the snapshot root directory.  For clone\_snapshot/restore\_snapshot command, we provide an additional option( RESTORE\_ACL) to decide whether we will grant permissons of the origin table to the newly created table.
15278
15279
15280 ---
15281
15282 * [HBASE-18018](https://issues.apache.org/jira/browse/HBASE-18018) | *Major* | **Support abort for all procedures by default**
15283
15284 The default behavior for abort() method of StateMachineProcedure class is changed to support aborting all procedures irrespective of if procedure supports rollback or not.
15285
15286
15287 ---
15288
15289 * [HBASE-16851](https://issues.apache.org/jira/browse/HBASE-16851) | *Major* | **User-facing documentation for the In-Memory Compaction feature**
15290
15291 Two blog posts on Apache HBase blog: user manual and programmer manual.
15292 Ref. guide draft published: https://docs.google.com/document/d/1Xi1jh\_30NKnjE3wSR-XF5JQixtyT6H\_CdFTaVi78LKw/edit
15293
15294
15295 ---
15296
15297 * [HBASE-17343](https://issues.apache.org/jira/browse/HBASE-17343) | *Blocker* | **Make Compacting Memstore default in 2.0 with BASIC as the default type**
15298
15299  This JIRA changes the default MemStore to be CompactingMemStore instead of DefaultMemStore. In-memory compaction of CompactingMemStore demonstrated sizable improvement in HBase’s write amplification and read/write performance.
15300
15301 CompactingMemStore achieves these gains through smart use of RAM. The algorithm periodically re-organizes the in-memory data in efficient data structures and reduces redundancies. The  HBase server’s memory footprint therefore periodically expands and contracts. The outcome is longer lifetime of data in memory, less I/O, and overall faster performance. More details about the algorithm and its use appear in the Apache HBase Blog: https://blogs.apache.org/hbase/
15302
15303 How To Use:
15304 The in-memory compaction level can be configured both globally and per column family. The supported levels are none (DefaultMemStore), basic, and eager.
15305
15306 By default, all tables apply basic in-memory compaction. This global configuration can be overridden in hbase-site.xml, as follows:
15307
15308 \<property\>
15309  \<name\>hbase.hregion.compacting.memstore.type\</name\>
15310  \<value\>\<none\|basic\|eager\>\</value\>
15311  \</property\>
15312
15313 The level can also be configured in the HBase shell per column family, as follows:
15314
15315 create ‘\<tablename\>’,
15316 {NAME =\> ‘\<cfname\>’, IN\_MEMORY\_COMPACTION =\> ‘\<NONE\|BASIC\|EAGER\>’}
15317
15318
15319 ---
15320
15321 * [HBASE-17786](https://issues.apache.org/jira/browse/HBASE-17786) | *Major* | **Create LoadBalancer perf-tests (test balancer algorithm decoupled from workload)**
15322
15323 $ bin/hbase org.apache.hadoop.hbase.master.balancer.LoadBalancerPerformanceEvaluation -help
15324 usage: hbase org.apache.hadoop.hbase.master.balancer.LoadBalancerPerformanceEvaluation \<options\>
15325 Options:
15326  -regions \<arg\>         Number of regions to consider by load balancer. Default: 1000000
15327  -servers \<arg\>         Number of servers to consider by load balancer. Default: 1000
15328  -load\_balancer \<arg\>   Type of Load Balancer to use. Default:
15329                         org.apache.hadoop.hbase.master.balancer.StochasticLoadBalancer
15330
15331
15332 ---
15333
15334 * [HBASE-17887](https://issues.apache.org/jira/browse/HBASE-17887) | *Blocker* | **Row-level consistency is broken for read**
15335
15336 Now we pass on list of memstoreScanners to the StoreScanner along with the new files to ensure that the StoreScanner sees the latest memstore after flush.
15337
15338
15339 ---
15340
15341 * [HBASE-15296](https://issues.apache.org/jira/browse/HBASE-15296) | *Major* | **Break out writer and reader from StoreFile**
15342
15343 \<!-- mardown --\>
15344 Refactor that breaks out StoreFile Reader and Writer inner classes as StoreFileReader and StoreFileWriter.
15345
15346 NOTE! Changes RegionObserver Coprocessor Interface so incompatible change (Discussed on dev list in thread "[Note breaking change on RegionObserver in hbase-2.0.0](https://s.apache.org/hbase-dev-note-about-HBASE-15296)"
15347
15348
15349 ---
15350
15351 * [HBASE-15199](https://issues.apache.org/jira/browse/HBASE-15199) | *Critical* | **Move jruby jar so only on hbase-shell module classpath; currently globally available**
15352
15353 The JRuby jar is no longer automatically included in classpaths for HBase server processes nor clients. It is still included in the classpath for the HBase shell and for invocations of org.jruby.Main, which should cover HBase provided support scripts.
15354
15355
15356 ---
15357
15358 * [HBASE-18009](https://issues.apache.org/jira/browse/HBASE-18009) | *Major* | **Move RpcServer.Call to a separated file**
15359
15360 The return value of CallRunner.getCall is changed so this is an incompatible change as CallRunner is declared as IA.LimitedPrivate. CallRunner is declared as IS.Evolving so we do not break the rule. And we still keep the getCall method to reduce the impact to user code.
15361
15362
15363 ---
15364
15365 * [HBASE-14925](https://issues.apache.org/jira/browse/HBASE-14925) | *Major* | **Develop HBase shell command/tool to list table's region info through command line**
15366
15367 Added a shell command 'list\_regions' for displaying the table's region info through command line.
15368
15369         List all regions for a particular table as an array and also filter them by server name (optional) as prefix
15370         and maximum locality (optional). By default, it will return all the regions for the table with any locality.
15371         The command displays server name, region name, start key, end key, size of the region in MB, number of requests
15372         and the locality. The information can be projected out via an array as third parameter. By default all these information
15373         is displayed. Possible array values are SERVER\_NAME, REGION\_NAME, START\_KEY, END\_KEY, SIZE, REQ and LOCALITY. Values
15374         are not case sensitive. If you don't want to filter by server name, pass an empty hash / string as shown below.
15375
15376         Examples:
15377         hbase\> list\_regions 'table\_name'
15378         hbase\> list\_regions 'table\_name', 'server\_name'
15379         hbase\> list\_regions 'table\_name', {SERVER\_NAME =\> 'server\_name', LOCALITY\_THRESHOLD =\> 0.8}
15380         hbase\> list\_regions 'table\_name', {SERVER\_NAME =\> 'server\_name', LOCALITY\_THRESHOLD =\> 0.8}, ['SERVER\_NAME']
15381         hbase\> list\_regions 'table\_name', {}, ['SERVER\_NAME', 'start\_key']
15382         hbase\> list\_regions 'table\_name', '', ['SERVER\_NAME', 'start\_key']
15383
15384
15385 ---
15386
15387 * [HBASE-17471](https://issues.apache.org/jira/browse/HBASE-17471) | *Critical* | **Region Seqid will be out of order in WAL if using mvccPreAssign**
15388
15389 MVCCPreAssign is added by HBASE-16698, but pre-assign mvcc is only used in put/delete path. Other write paths like increment/append still assign mvcc in ringbuffer's consumer thread. If put and increment are used parallel. Then seqid in WAL may not increase monotonically. Disorder in wals will lead to data loss.This patch bring all mvcc/seqid event in wal.append, and synchronize wal append and mvcc acquirement. No disorder in wal will happen. Performance test shows no regression with this patch.
15390
15391
15392 ---
15393
15394 * [HBASE-16466](https://issues.apache.org/jira/browse/HBASE-16466) | *Major* | **HBase snapshots support in VerifyReplication tool to reduce load on live HBase cluster with large tables**
15395
15396 Support for snapshots in VerifyReplication tool i.e. verifyrep can compare source table snapshot against peer table snapshot which reduces load on RS by reading data from HDFS directly using Snapshot scanners.
15397 Instead of comparing against live tables whose state changes due to writes and compactions its better to compare HBase  snapshots which are immutable in nature.
15398
15399
15400 ---
15401
15402 * [HBASE-17263](https://issues.apache.org/jira/browse/HBASE-17263) | *Major* | **  Netty based rpc server impl**
15403
15404 A new RPC server based on Netty4 which can improve random read (get) performance. By default, it is off. To use this feature, please set “hbase.rpc.server.impl" to “org.apache.hadoop.hbase.ipc.NettyRpcServer”.
15405
15406 In one deploy, doubled the throughput and lowered the latency significantly: see https://www.slideshare.net/HBaseCon/lift-the-ceiling-of-hbase-throughputs?qid=597ee2fa-8125-4faa-bb3b-2bf1ba9ccafb&v=&b=&from\_search=6
15407
15408
15409 ---
15410
15411 * [HBASE-17957](https://issues.apache.org/jira/browse/HBASE-17957) | *Minor* | ** Custom metrics of replicate endpoints don't prepend "source." to global metrics**
15412
15413 Global custom metrics names follow the "source.metricsName" format.
15414
15415
15416 ---
15417
15418 * [HBASE-17757](https://issues.apache.org/jira/browse/HBASE-17757) | *Major* | **Unify blocksize after encoding to decrease memory fragment**
15419
15420 Blocksize is set in columnfamily's atrributes. It is used to control block sizes when generating blocks. But, it doesn't take encoding into count. If you set encoding to blocks, after encoding, the block size varies. Since blocks will be cached in memory after encoding (default), it will cause memory fragment if using blockcache, or decrease the pool efficiency if using bucketCache. This issue introduced a new config named 'hbase.writer.unified.encoded.blocksize.ratio'. The default value of this config is 1, meaning doing nothing. If this value is set to a smaller value like 0.5, and the blocksize is set to 64KB(default value of blocksize). It will unify the blocksize after encoding to 64KB \* 0.5 = 32KB. Unified blocksize will releaf the memory problems mentioned above.
15421
15422
15423 ---
15424
15425 * [HBASE-14286](https://issues.apache.org/jira/browse/HBASE-14286) | *Trivial* | **Correct typo in argument name for WALSplitter.writeRegionSequenceIdFile**
15426
15427 HBASE-14286 Correct typo in argument name for WALSplitter.writeRegionSequenceIdFile
15428
15429
15430 ---
15431
15432 * [HBASE-17817](https://issues.apache.org/jira/browse/HBASE-17817) | *Major* | **Make Regionservers log which tables it removed coprocessors from when aborting**
15433
15434 Add table name to exception logging when a coprocessor is removed from a table by the region server
15435
15436
15437 ---
15438
15439 * [HBASE-17877](https://issues.apache.org/jira/browse/HBASE-17877) | *Major* | **Improve HBase's byte[] comparator**
15440
15441 updated the lexicographic byte array comparator to use a slightly more optimized version similar to the one available in the guava library that compares only the first index where left[index] != right[index]. The comparator also returns the diff directly instead of mapping it to -1, 0, +1 range as was being done in the earlier version. We have seen significant performance gains, calculated in terms of throughput (ops/ms) with these changes ranging from approx 20% for smaller byte arrays upto 200 bytes and almost 100% for large byte array sizes that are in few KB's. We benchmarked with upto 16KB arrays and the general trend indicates that the performance improvement increases as the size of the byte array increases.
15442
15443
15444 ---
15445
15446 * [HBASE-9899](https://issues.apache.org/jira/browse/HBASE-9899) | *Major* | **for idempotent operation dups, return the result instead of throwing conflict exception**
15447
15448 Non-idempotent operations (increment/append/checkAndPut/...) may throw OperationConflictException even though the increment/append succeeded. For example (client rpc retries number set to 3):
15449
15450 1. first increment rpc request success
15451 2. client timeout and send second rpc request, but nonce is same and save in server. The server found that it has already succeed, so return a OperationConflictException to make sure that increment operation only be applied once in server.
15452
15453 This patch will solve this problem by read the previous result when receive a duplicate rpc request.
15454 1. Store the mvcc to OperationContext. When first rpc request succeed, store the mvcc for this operation nonce.
15455 2. When there are duplicate rpc request, convert to read result by the mvcc.
15456
15457
15458 ---
15459
15460 * [HBASE-15583](https://issues.apache.org/jira/browse/HBASE-15583) | *Minor* | **Any HTableDescriptor we give out should be immutable**
15461
15462 # The HTD got from Admin, AsyncAdmin, and Table is immutable.
15463 # DEFERRED\_LOG\_FLUSH is removed.
15464 # cleanup the deprecated construction of HTD
15465
15466
15467 ---
15468
15469 * [HBASE-17956](https://issues.apache.org/jira/browse/HBASE-17956) | *Major* | **Raw scan should ignore TTL**
15470
15471 Now raw scan can also read expired cells.
15472
15473
15474 ---
15475
15476 * [HBASE-15143](https://issues.apache.org/jira/browse/HBASE-15143) | *Minor* | **Procedure v2 - Web UI displaying queues**
15477
15478 Adds a new Admin#listLocks, a panel on the procedures page to list procedure locks, and a list\_locks command to the shell. Use it to see current state of procedure locking in Master process.
15479
15480
15481 ---
15482
15483 * [HBASE-17514](https://issues.apache.org/jira/browse/HBASE-17514) | *Minor* | **Warn when Thrift Server 1 is configured for proxy users but not the HTTP transport**
15484
15485 If users of the Thrift 1 Server enable proxy user support without enabling the prerequisite HTTP transport, we now log a WARN message about the mismatch.
15486
15487
15488 ---
15489
15490 * [HBASE-17914](https://issues.apache.org/jira/browse/HBASE-17914) | *Major* | **Create a new reader instead of cloning a new StoreFile when compaction**
15491
15492 StoreFile.createReader method is gone. Call initReader and then getReader instead.
15493
15494
15495 ---
15496
15497 * [HBASE-16477](https://issues.apache.org/jira/browse/HBASE-16477) | *Major* | **Remove Writable interface and related code from WALEdit/WALKey**
15498
15499 Removes the Writables, and related code from WALEdit class. HBase-2.0 will not be able to read WAL files written with 0.94.x and before.
15500
15501
15502 ---
15503
15504 * [HBASE-17858](https://issues.apache.org/jira/browse/HBASE-17858) | *Major* | **Update refguide about the IS annotation if necessary**
15505
15506 Updated refguide to tell users that IS annotation is only valid for IA.LimitedPrivate classes.
15507
15508
15509 ---
15510
15511 * [HBASE-17857](https://issues.apache.org/jira/browse/HBASE-17857) | *Major* | **Remove IS annotations from IA.Public classes**
15512
15513 Now we do not have InterfaceStability annotations for IA,Public API. The stability of these classes will follow the rule of 'Semantic Versioning'.
15514
15515
15516 ---
15517
15518 * [HBASE-17215](https://issues.apache.org/jira/browse/HBASE-17215) | *Major* | **Separate small/large file delete threads in HFileCleaner to accelerate archived hfile cleanup speed**
15519
15520 After HBASE-17215 we change to use two threads for (archived) hfile cleaning. The size throttling for large/small files could be set through "hbase.regionserver.thread.hfilecleaner.throttle" and default to 67108864 (64M). It supports online configuration change, just find the active master address through zookeeper dump and use it in update\_config command, e.g. update\_config 'hbasem1.et2.tbsite.net,60100,1488038696741'
15521
15522
15523 ---
15524
15525 * [HBASE-16780](https://issues.apache.org/jira/browse/HBASE-16780) | *Critical* | **Since move to protobuf3.1, Cells are limited to 64MB where previous they had no limit**
15526
15527 Upgrade internal pb to 3.2 from 3.1. 3.2 has fix for 64MB limit.
15528
15529
15530 ---
15531
15532 * [HBASE-17287](https://issues.apache.org/jira/browse/HBASE-17287) | *Blocker* | **Master becomes a zombie if filesystem object closes**
15533
15534 If filesystem is not available during log split, abort master server.
15535
15536
15537 ---
15538
15539 * [HBASE-17765](https://issues.apache.org/jira/browse/HBASE-17765) | *Major* | **Reviving the merge possibility in the CompactingMemStore**
15540
15541 Reviving the merge of the compacting pipeline: making the limit on the number of the segments in the pipeline configurable and adding the merge test.
15542
15543 In order to customize the pipeline size limit change the value of the "hbase.hregion.compacting.pipeline.segments.limit" in the hbase-site.xml
15544
15545 Value 1 means to merge the segments on any flush-in-memory. Value higher than 16 means no merge.
15546
15547
15548 ---
15549
15550 * [HBASE-13395](https://issues.apache.org/jira/browse/HBASE-13395) | *Major* | **Remove HTableInterface**
15551
15552 HTableInterface was deprecated in 0.21.0 and is removed in 2.0.0. Use org.apache.hadoop.hbase.client.Table instead.
15553
15554
15555 ---
15556
15557 * [HBASE-17595](https://issues.apache.org/jira/browse/HBASE-17595) | *Critical* | **Add partial result support for small/limited scan**
15558
15559 Now small scan and limited scan could also return partial results.
15560
15561
15562 ---
15563
15564 * [HBASE-16014](https://issues.apache.org/jira/browse/HBASE-16014) | *Major* | **Get and Put constructor argument lists are divergent**
15565
15566 Add 2 constructors fot API Get
15567 1. Get(byte[], int, int)
15568 2. Get(ByteBuffer)
15569
15570
15571 ---
15572
15573 * [HBASE-17584](https://issues.apache.org/jira/browse/HBASE-17584) | *Major* | **Expose ScanMetrics with ResultScanner rather than Scan**
15574
15575 Now you can use ResultScanner.getScanMetrics to get the scan metrics at any time during the scan operation. The old Scan.getScanMetrics is deprecated and still work, but if you use ResultScanner.getScanMetrics to get the scan metrics and reset it, then the metrics published to the Scan instaince will be messed up.
15576
15577
15578 ---
15579
15580 * [HBASE-17802](https://issues.apache.org/jira/browse/HBASE-17802) | *Major* | **Add note that minor versions can add methods to Interfaces**
15581
15582 Update our semver section to include a note on our allowing ourselves the right to add methods to an Interface over a minor version as agreed to up on the dev list:  "If a Client implements an HBase Interface, a recompile MAY be required upgrading to a newer minor version (See release notes for warning about incompatible changes). All effort will be made to provide a default implementation so this case should not arise."
15583
15584
15585 ---
15586
15587 * [HBASE-17426](https://issues.apache.org/jira/browse/HBASE-17426) | *Major* | **Inconsistent environment variable names for enabling JMX**
15588
15589 In bin/hbase-config.sh,
15590 if value for HBASE\_JMX\_BASE is empty, keep current behavior.
15591 if HBASE\_JMX\_OPTS is not empty, keep current behavior.
15592 otherwise use the value of HBASE\_JMX\_BASE
15593
15594
15595 ---
15596
15597 * [HBASE-17740](https://issues.apache.org/jira/browse/HBASE-17740) | *Critical* | **Correct the semantic of batch and partial for async client**
15598
15599 Now async client has the same semantic with sync client for batch and partial.
15600 '''
15601 Now setBatch doesn't mean setAllowPartialResult(true)
15602 If user setBatch(5) and rpc returns 3+5+5+5+3 cells, we should return 5+5+5+5+1 to user.
15603 '''
15604
15605 Also a minor API change:
15606 Result#createCompleteResult(List\<Result\>) is changed to Result#createCompleteResult(Iterable\<Result\>).
15607
15608
15609 ---
15610
15611 * [HBASE-17746](https://issues.apache.org/jira/browse/HBASE-17746) | *Major* | **TestSimpleRpcScheduler.testCoDelScheduling is broken**
15612
15613 The executor for CoDel is changed to FastPathBalancedQueueRpcExecutor
15614
15615
15616 ---
15617
15618 * [HBASE-17712](https://issues.apache.org/jira/browse/HBASE-17712) | *Major* | **Remove/Simplify the logic of RegionScannerImpl.handleFileNotFound**
15619
15620 Add a config named 'hbase.hregion.unassign.for.fnfe'. It is used to control whether to reopen a region when hitting FileNotFoundException. The default value is true.
15621
15622
15623 ---
15624
15625 * [HBASE-15941](https://issues.apache.org/jira/browse/HBASE-15941) | *Major* | **HBCK repair should not unsplit healthy splitted region**
15626
15627 A new option -removeParents is now available that will remove an old parent when two valid daughters for that parent exist and -fixHdfsOverlaps is used. If there is an issue trying to remove the parent from META or sidelining the parent from HDFS we will fallback to do a regular merge. For now this option only works when the overlap group consists only of 3 regions (a parent, daughter A and daughter B)
15628
15629
15630 ---
15631
15632 * [HBASE-17737](https://issues.apache.org/jira/browse/HBASE-17737) | *Major* | **Thrift2 proxy should support scan timeRange per column family**
15633
15634 Thrift2 proxy supports scan timeRange per column family
15635
15636
15637 ---
15638
15639 * [HBASE-17718](https://issues.apache.org/jira/browse/HBASE-17718) | *Major* | **Difference between RS's servername and its ephemeral node cause SSH stop working**
15640
15641 Fix our accidentally registering a RegionServer's ephermal znode BEFORE we checked in with the master.
15642
15643
15644 ---
15645
15646 * [HBASE-17717](https://issues.apache.org/jira/browse/HBASE-17717) | *Critical* | **Incorrect ZK ACL set for HBase superuser**
15647
15648 In previous versions of HBase, the system intended to set a ZooKeeper ACL on all "sensitive" ZNodes for the user specified in the hbase.superuser configuration property. Unfortunately, the ACL was malformed which resulted in the hbase.superuser being unable to access the sensitive ZNodes that HBase creates. This JIRA issue fixes this bug. HBase will automatically correct the ACLs on start so users do not need to manually correct the ACLs.
15649
15650
15651 ---
15652
15653 * [HBASE-17716](https://issues.apache.org/jira/browse/HBASE-17716) | *Minor* | **Formalize Scan Metric names**
15654
15655 HBASE-17716 breaks compatibility of ServerSideScanMetrics by changing public field names, and the issue is fixed through HBASE-17886
15656
15657
15658 ---
15659
15660 * [HBASE-15484](https://issues.apache.org/jira/browse/HBASE-15484) | *Blocker* | **Correct the semantic of batch and partial**
15661
15662 Now setBatch doesn't mean setAllowPartialResult(true)
15663 If user setBatch(5) and rpc returns 3+5+5+5+3 cells, we should return 5+5+5+5+1 to user.
15664 Scan#setBatch is helpful in paging queries, if you just want to prevent OOM at client, use setAllowPartialResults(true) is better.
15665 We deprecated isPartial and use mayHaveMoreCellsInRow. If it returns false, current Result must be the last one of this row.
15666
15667
15668 ---
15669
15670 * [HBASE-17312](https://issues.apache.org/jira/browse/HBASE-17312) | *Major* | **[JDK8] Use default method for Observer Coprocessors**
15671
15672 Deletes BaseMasterAndRegionObserver, BaseMasterObserver, BaseRegionObserver, BaseRegionServerObserver and BaseWALObserver.
15673 Their corresponding interface classes now use JDK8's 'default' keyword to provide empty/no-op implementations so that:
15674 1. Derived class don't break when more coprocessor hooks are added in future.
15675 2. Derived classes don't have to redundantly override functions they don't care about with empty implementations.
15676
15677 Earlier, BaseXXXObserver classes provided these exact two benefits, but with 'default' keyword in JDK8, they are not needed anymore.
15678
15679 To fix the breakages because of this change, simply change "Foo extends BaseXXXObserver" to "Foo implements XXXObserver".
15680
15681
15682 ---
15683
15684 * [HBASE-17647](https://issues.apache.org/jira/browse/HBASE-17647) | *Major* | **OffheapKeyValue#heapSize() implementation is wrong**
15685
15686 **WARNING: No release note provided for this change.**
15687
15688
15689 ---
15690
15691 * [HBASE-13718](https://issues.apache.org/jira/browse/HBASE-13718) | *Minor* | **Add a pretty printed table description to the table detail page of HBase's master**
15692
15693 <!-- markdown -->
15694
15695
15696 The table information page in the Master UI now includes a schema section that describes the column families defined for that table as well as any column family specific properties that are set.
15697
15698
15699 ---
15700
15701 * [HBASE-17472](https://issues.apache.org/jira/browse/HBASE-17472) | *Major* | **Correct the semantic of  permission grant**
15702
15703 Before this patch, later granted permissions will override previous granted permissions, and previous granted permissions LOST. this issue re-define grant semantic: for master branch, later granted permissions will merge with previous granted permissions.  for branch-1.4, grant keep override behavior for compatibility purpose, and a grant with mergeExistingPermission flag provided.
15704
15705
15706 ---
15707
15708 * [HBASE-17583](https://issues.apache.org/jira/browse/HBASE-17583) | *Major* | **Add inclusive/exclusive support for startRow and endRow of scan for sync client**
15709
15710 Now you can include/exlude the startRow and stopRow for a scan. And the new methods to specify startRow and stopRow are withStartRow and withStopRow. The old methods to specify startRow and Row(include constructors) are marked as deprecated as in the old time if startRow and stopRow are equal then we will consider it as a get scan and include the stopRow implicitly. This is strange after we can set inclusiveness explicitly so we add new methods and depredate the old methods. The deprecated methods will be removed in the future.
15711
15712
15713 ---
15714
15715 * [HBASE-9702](https://issues.apache.org/jira/browse/HBASE-9702) | *Major* | **Change unittests that use "table" or "testtable" to use method names.**
15716
15717 Changes all tests to use the TestName JUnit Rule everywhere rather than hardcode table/region/store names.
15718
15719
15720 ---
15721
15722 * [HBASE-17280](https://issues.apache.org/jira/browse/HBASE-17280) | *Minor* | **Add mechanism to control hbase cleaner behavior**
15723
15724 The HBase cleaner chore process cleans up old WAL files and archived HFiles. Cleaner operation can affect query performance when running heavy workloads, so disable the cleaner during peak hours. The cleaner has the following HBase shell commands:
15725
15726 - cleaner\_chore\_enabled: Queries whether cleaner chore is enabled/ disabled.
15727 - cleaner\_chore\_run: Manually runs the cleaner to remove files.
15728 - cleaner\_chore\_switch: enables or disables the cleaner and returns the previous state of the cleaner. For example, cleaner-switch true enables the cleaner.
15729
15730 Following APIs are added in Admin:
15731 - setCleanerChoreRunning(boolean on): Enable/Disable the cleaner chore
15732 - runCleanerChore(): Ask for cleaner chore to run
15733 - isCleanerChoreEnabled(): Query whether cleaner chore is enabled/ disabled.
15734
15735
15736 ---
15737
15738 * [HBASE-17599](https://issues.apache.org/jira/browse/HBASE-17599) | *Major* | **Use mayHaveMoreCellsInRow instead of isPartial**
15739
15740 The word 'isPartial' is ambiguous so we introduce a new method 'mayHaveMoreCellsInRow' to replace it. And the old meaning of 'isPartial' is not the same with 'mayHaveMoreCellsInRow' as for batched scan, if the number of returned cells equals to the batch, isPartial will be false. After this change the meaning of 'isPartial' will be same with 'mayHaveMoreCellsInRow'. This is an incompatible change but it is not likely to break a lot of things as for batched scan the old 'isPartial' is just a redundant information, i.e, if the number of returned cells reaches the batch limit. You have already know the number of returned cells and the value of batch.
15741
15742
15743 ---
15744
15745 * [HBASE-17437](https://issues.apache.org/jira/browse/HBASE-17437) | *Major* | **Support specifying a WAL directory outside of the root directory**
15746
15747 This patch adds support for specifying a WAL directory outside of the HBase root directory.
15748
15749 Multiple configuration variables were added to accomplish this:
15750 hbase.wal.dir: used to configure where the root WAL directory is located. Could be on a different FileSystem than the root directory. WAL directory can not be set to a subdirectory of the root directory. The default value of this is the root directory if unset.
15751
15752 hbase.rootdir.perms: Configures FileSystem permissions to set on the root directory. This is '700' by default.
15753
15754 hbase.wal.dir.perms: Configures FileSystem permissions to set on the WAL directory FileSystem. This is '700' by default.
15755
15756
15757 ---
15758
15759 * [HBASE-17350](https://issues.apache.org/jira/browse/HBASE-17350) | *Critical* | **Fixup of regionserver group-based assignment**
15760
15761 A few bug fixes and tweaks to the fsgroup feature.
15762
15763 Renamed shell command move\_rsgroup\_servers as move\_servers\_rsgroup
15764 Renamed shell comand move\_rsgroup\_tables as move\_tables\_rsgroup
15765
15766 Made the 'default' group more 'dynamic'; i.e. dead servers no longer show in the 'default' group.
15767
15768
15769 ---
15770
15771 * [HBASE-17578](https://issues.apache.org/jira/browse/HBASE-17578) | *Major* | **Thrift per-method metrics should still update in the case of exceptions**
15772
15773 In prior versions, the HBase Thrift handlers failed to increment per-method metrics when an exception was encountered.  These metrics will now always be incremented, whether an exception is encountered or not.  This change also adds exception-type metrics, similar to those exposed in regionservers, for individual exceptions which are received by the Thrift handlers.
15774
15775
15776 ---
15777
15778 * [HBASE-17508](https://issues.apache.org/jira/browse/HBASE-17508) | *Major* | **Unify the implementation of small scan and regular scan for sync client**
15779
15780 Now the scan.setSmall method is deprecated. Consider using scan.setLimit and scan.setReadType in the future. And we will open scanner lazily when you call scanner.next. This is an incompatible change which delays the table existence check and permission check.
15781
15782
15783 ---
15784
15785 * [HBASE-16981](https://issues.apache.org/jira/browse/HBASE-16981) | *Major* | **Expand Mob Compaction Partition policy from daily to weekly, monthly**
15786
15787 Mob compaction partition policy can be set by
15788 hbase\> create 't1', {NAME =\> 'f1', IS\_MOB =\> true, MOB\_THRESHOLD =\> 1000000, MOB\_COMPACT\_PARTITION\_POLICY =\> 'weekly'}
15789
15790 or
15791
15792 hbase\> alter 't1', {NAME =\> 'f1', IS\_MOB =\> true, MOB\_THRESHOLD =\> 1000000, MOB\_COMPACT\_PARTITION\_POLICY =\> 'monthly'}
15793
15794 Available MOB\_COMPACT\_PARTITION\_POLICY options are "daily", "weekly" and "monthly", the default is "daily".
15795
15796 When it is "weekly" policy, the mob compaction will try to compact files within one calendar week into one for a specific partition, similar for "daily" and "monthly".
15797
15798 With "weekly" policy, one mob file normally is compacted twice during its lifetime (that is first on daily basis and then all such daily based compacted files belonging to a week at the weekly interval), for one region, there normally are 52 files for one year. With "Monthly" policy, one mob file normally is compacted 3 times during its lifetime (First daily and then weekly followed by monthly at end of every month) and normally there are 12 files for one year.
15799
15800
15801 ---
15802
15803 * [HBASE-17197](https://issues.apache.org/jira/browse/HBASE-17197) | *Major* | **hfile does not work in 2.0**
15804
15805 The -f argument is no longer required specifying target file; just pass the file as an argument.
15806
15807
15808 ---
15809
15810 * [HBASE-16812](https://issues.apache.org/jira/browse/HBASE-16812) | *Minor* | **Clean up the locks in MOB**
15811
15812 In MOB-enabled column family, the lock in the major compaction is removed. All the delete markers are retained in the major compaction, and a MOB reference tag is appended to each of the retained delete markers.
15813
15814
15815 ---
15816
15817 * [HBASE-12894](https://issues.apache.org/jira/browse/HBASE-12894) | *Critical* | **Upgrade Jetty to 9.2.6**
15818
15819 Upgrades Jetty to 9.x from 6.x (Jetty9 is in different namespace from Jetty6). Also updated Jersey to 2.x and Servlet to 3.x.
15820
15821
15822 ---
15823
15824 * [HBASE-17566](https://issues.apache.org/jira/browse/HBASE-17566) | *Major* | **Jetty upgrade fixes**
15825
15826 Fix inability at finding static content post push of parent issue moving us to jetty9.
15827
15828
15829 ---
15830
15831 * [HBASE-9774](https://issues.apache.org/jira/browse/HBASE-9774) | *Major* | **HBase native metrics and metric collection for coprocessors**
15832
15833 This issue adds two new modules, hbase-metrics and hbase-metrics-api which define and implement the "new" metric system used internally within HBase. These two modules (and some other code in hbase-hadoop2-compat) module are referred as "HBase metrics framework" which is HBase-specific and independent of any other metrics library (including Hadoop metrics2 and dropwizards metrics).
15834
15835 HBase Metrics API (hbase-metrics-api) contains the interface that HBase exposes internally and to third party code (including coprocessors). It is a thin
15836 abstraction over the actual implementation for backwards compatibility guarantees. The metrics API in this hbase-metrics-api module is inspired by the Dropwizard metrics 3.1 API, however, the API is completely independent.
15837
15838 hbase-metrics module contains implementation of the "HBase Metrics API", including MetricRegistry, Counter, Histogram, etc. These are highly concurrent implementations of the Metric interfaces. Metrics in HBase are grouped into different sets (like WAL, RPC, RegionServer, etc). Each group of metrics should be tracked via a MetricRegistry specific to that group.
15839
15840 Historically, HBase has been using Hadoop's Metrics2 framework [3] for collecting and reporting the metrics internally. However, due to the difficultly of dealing with the Metrics2 framework, HBase is moving away from Hadoop's metrics implementation to its custom implementation. The move will happen incrementally, and during the time, both Hadoop Metrics2-based metrics and hbase-metrics module based classes will be in the source code. All new implementations for metrics SHOULD use the new API and framework.
15841
15842 This jira also introduces the metrics API to coprocessor implementations. Coprocessor writes can export custom metrics using the API and have those collected via metrics2 sinks, as well as exported via JMX in regionserver metrics.
15843
15844 More documentation available at: hbase-metrics-api/README.txt
15845
15846
15847 ---
15848
15849 * [HBASE-17491](https://issues.apache.org/jira/browse/HBASE-17491) | *Major* | **Remove all setters from HTable interface and introduce a TableBuilder to build Table instance**
15850
15851 After HBASE-17491 all setter methods in HTable are marked as deprecated, moved into TableBuilder, and will be removed later.
15852
15853
15854 ---
15855
15856 * [HBASE-17067](https://issues.apache.org/jira/browse/HBASE-17067) | *Major* | **Procedure v2 - remove tryAcquire\*Lock and use wait/wake to make framework event based**
15857
15858 Make the framework more 'lively'; undo 'suspend' notion in Procedure, rely on eventing mechanism instead. Lets us remove no longer needed synchronizations. Framework can now do more ops per second.
15859
15860
15861 ---
15862
15863 * [HBASE-16698](https://issues.apache.org/jira/browse/HBASE-16698) | *Major* | **Performance issue: handlers stuck waiting for CountDownLatch inside WALKey#getWriteEntry under high writing workload**
15864
15865 Assign sequenceid to an edit before we go on the ringbuffer; undoes contention on WALKey latch. Adds a new config "hbase.hregion.mvcc.preassign" which defaults to true: i.e. this speedup is enabled.
15866
15867 User could set this per-table level, like:
15868 create 'table',{NAME=\>'f1',CONFIGURATION=\>{'hbase.hregion.mvcc.preassign'=\>'false'}}
15869
15870
15871 ---
15872
15873 * [HBASE-17488](https://issues.apache.org/jira/browse/HBASE-17488) | *Trivial* | **WALEdit should be lazily instantiated**
15874
15875 prevent creating unused objects in the WALEdit's construction.
15876 +If the cp#preBatchMutate returns true, the WALEdit is useless. So we should create the WALEdit after step 2.
15877 +The cells came from cp should be counted because they are added into the WALEdit . The use case is the local index of phoenix
15878 +If the mutation contains the SKIP\_WAL property, its cells aren't added into the WALEdit. So these cells shouldn't be counted.
15879
15880
15881 ---
15882
15883 * [HBASE-16831](https://issues.apache.org/jira/browse/HBASE-16831) | *Minor* | **Procedure V2 - Remove org.apache.hadoop.hbase.zookeeper.lock**
15884
15885 Purges code that did zk-hosted locks for table ops (we do procedure-based locks now)
15886
15887
15888 ---
15889
15890 * [HBASE-16867](https://issues.apache.org/jira/browse/HBASE-16867) | *Major* | **Procedure V2 - Check ACLs for remote HBaseLock**
15891
15892 Add checking ACL when taking locks.
15893
15894
15895 ---
15896
15897 * [HBASE-16786](https://issues.apache.org/jira/browse/HBASE-16786) | *Major* | **Procedure V2 - Move ZK-lock's uses to Procedure framework locks (LockProcedure)**
15898
15899 Move locking to be procedure (Pv2) rather than zookeeper based. All locking moved over to new infrastructure including MOBing locking.
15900
15901
15902 ---
15903
15904 * [HBASE-17470](https://issues.apache.org/jira/browse/HBASE-17470) | *Major* | **Remove merge region code from region server**
15905
15906 In 1.x branches, Admin.mergeRegions calls MASTER via dispatchMergingRegions RPC; when executing dispatchMergingRegions RPC, MASTER calls RS via MergeRegions to complete the merge in RS-side.
15907
15908 With HBASE-16119, the merge logic moves to master-side.  This JIRA cleans up unused RPCs (dispatchMergingRegions and MergeRegions) , removes dangerous tools such as Merge and HMerge, and deletes unused RegionServer-side merge region logic in 2.0 release.
15909
15910
15911 ---
15912
15913 * [HBASE-16744](https://issues.apache.org/jira/browse/HBASE-16744) | *Major* | **Procedure V2 - Lock procedures to allow clients to acquire locks on tables/namespaces/regions**
15914
15915  Lock for HBase Entity either a Table, a Namespace, or Regions.
15916
15917 These are remote locks which live on master, and need periodic heartbeats to keep them alive. (Once we request the lock, internally an heartbeat thread will be started). If master doesn't receive the heartbeat in time, it'll release the lock and make it available to other users.
15918
15919 Use {@link LockServiceClient} to build instances. Then call {@link #requestLock()}. {@link #requestLock} will contact master to queue the lock and start the heartbeat thread which will check lock's status periodically and once the lock is acquired, it will send the heartbeats to the master.
15920
15921 Use {@link #await} or {@link #await(long, TimeUnit)} to wait for the lock to be acquired. Always call {@link #unlock()} irrespective of whether lock was acquired or not. If the lock was acquired, it'll be released. If it was not acquired, it is possible that master grants the lock in future and the heartbeat thread keeps it alive forever by sending heartbeats. Calling {@link #unlock()} will stop the heartbeat thread and cancel the lock queued on master.
15922
15923 There are 4 ways in which these remote locks may be released/can be lost:
15924   \* Call {@link #unlock}.
15925   \* Lock times out on master: Can happen because of network issues, GC pauses, etc. Worker thread will call the given abortable as soon as it detects such a situation. Fail to contact master: If worker thread can not contact mater and thus fails to send heartbeat before the timeout expires, it assumes that lock is lost and calls the
15926  \*     abortable.
15927 Worker thread is interrupted.
15928
15929 Use example:
15930
15931  EntityLock lock = lockServiceClient.\*Lock(...., "exampled lock", abortable);
15932   lock.requestLock();
15933   ....
15934    ....can do other initializations here since lock is 'asynchronous'...
15935  ....
15936  if (lock.await(timeout)) {
15937     ....logic requiring mutual exclusion
15938   }
15939    lock.unlock();
15940
15941
15942 ---
15943
15944 * [HBASE-14061](https://issues.apache.org/jira/browse/HBASE-14061) | *Major* | **Support CF-level Storage Policy**
15945
15946 After HBASE-14061 we support to set storage policy for HFile through "hbase.hstore.block.storage.policy" configuration, and we support CF-level setting to override the settings from configuration file. Currently supported storage policies include ALL\_SSD/ONE\_SSD/HOT/WARM/COLD, refer to http://hadoop.apache.org/docs/r2.6.0/hadoop-project-dist/hadoop-hdfs/ArchivalStorage.html for more details
15947
15948 For example, to create a table with two families: "cf1" with "ALL\_SSD" storage policy and "cf2" with "ONE\_SSD", we could use below command in hbase shell:
15949 create 'table',{NAME=\>'f1',STORAGE\_POLICY=\>'ALL\_SSD'},{NAME=\>'f2',STORAGE\_POLICY=\>'ONE\_SSD'}
15950
15951 We could also set the configuration in table attribute like all other configurations:
15952 create 'table',{NAME=\>'f1',CONFIGURATION=\>{'hbase.hstore.block.storage.policy'=\>'ONE\_SSD'}}
15953
15954
15955 ---
15956
15957 * [HBASE-17337](https://issues.apache.org/jira/browse/HBASE-17337) | *Major* | **list replication peers request should be routed through master**
15958
15959 List replication peers request will be roughed through master.
15960
15961
15962 ---
15963
15964 * [HBASE-15172](https://issues.apache.org/jira/browse/HBASE-15172) | *Major* | **Support setting storage policy in bulkload**
15965
15966 After HBASE-15172/HBASE-19016 we could set storage policy through "hbase.hstore.block.storage.policy" property for bulkload, or "hbase.hstore.block.storage.policy.\<family\_name\>" for a specified family. Supported storage policy includes: ALL\_SSD, ONE\_SSD, HOT, WARM, COLD, etc.
15967
15968
15969 ---
15970
15971 * [HBASE-17336](https://issues.apache.org/jira/browse/HBASE-17336) | *Major* | **get/update replication peer config requests should be routed through master**
15972
15973 Get/update replication peer config requests will be routed through master.
15974
15975
15976 ---
15977
15978 * [HBASE-17320](https://issues.apache.org/jira/browse/HBASE-17320) | *Major* | **Add inclusive/exclusive support for startRow and endRow of scan**
15979
15980 Now you can specific the inclusive of startRow and stopRow for a scan using the new methods withStartRow(byte[] startRow, boolean inclusive) and withStopRow(byte[] stopRow, boolean inclusive). The old setStartRow and setStopRow methods, and the constructors are marked as deprecated because of an strange behavior that we will include the stopRow implicitly if startRow equals to stopRow. This is used to support get scan in the old time. Use withStartRow and withStopRow instead.
15981
15982 For developers, the ConnectionUtils.createClosestRowBefore is also marked as deprecated as the row returned by this method is only very very close to the current row, not closest. Avoid using this method in the future.
15983
15984
15985 ---
15986
15987 * [HBASE-17314](https://issues.apache.org/jira/browse/HBASE-17314) | *Major* | **Limit total buffered size for all replication sources**
15988
15989 Add a conf "replication.total.buffer.quota" to limit total size of buffered entries in all replication peers. It will prevent server getting OOM if there are many peers. Default value is 256MB.
15990
15991
15992 ---
15993
15994 * [HBASE-17174](https://issues.apache.org/jira/browse/HBASE-17174) | *Minor* | **Refactor the AsyncProcess, BufferedMutatorImpl, and HTable**
15995
15996 + cleanup some unused code
15997 + allow being able to share pool between BufferedMutatorImpl
15998 + setting "hbase.client.request.controller.impl" to the name of the alternate RequestController (traffic control) implementation class in Configuration
15999 + The default RequestController implementation is SimpleRequestController
16000 + setting "hbase.client.log.detail.period.ms" to call logger on a period when waiting for tasks to complete
16001
16002
16003 ---
16004
16005 * [HBASE-17335](https://issues.apache.org/jira/browse/HBASE-17335) | *Major* | **enable/disable replication peer requests should be routed through master**
16006
16007 Enable/Disable replication peer requests will be routed through master.
16008
16009
16010 ---
16011
16012 * [HBASE-5401](https://issues.apache.org/jira/browse/HBASE-5401) | *Major* | **PerformanceEvaluation generates 10x the number of expected mappers**
16013
16014 Changes how many tasks PE runs when clients are mapreduce. Now tasks == client count. Previous we hardcoded ten tasks per client instance.
16015
16016
16017 ---
16018
16019 * [HBASE-11392](https://issues.apache.org/jira/browse/HBASE-11392) | *Critical* | **add/remove peer requests should be routed through master**
16020
16021 Add/Remove replication peer requests will be routed through master. And make ReplicationAdmin as Deprecated.
16022
16023
16024 ---
16025
16026 * [HBASE-15924](https://issues.apache.org/jira/browse/HBASE-15924) | *Major* | **Enhance hbase services autorestart capability to hbase-daemon.sh**
16027
16028 Now one can start hbase services with enabled "autostart/autorestart" feature in controlled fashion with the help of "--autostart-window-size" to define the window period and the "--autostart-window-retry-limit" to define the number of times the hbase services have to be restarted upon being killed/terminated abnormally within the provided window perioid.
16029
16030 The following cases are supported with "autostart/autorestart":
16031
16032 a) --autostart-window-size=0 and --autostart-window-retry-limit=0, indicates infinite window size and no retry limit
16033 b) not providing the args, will default to a)
16034 c) --autostart-window-size=0 and --autostart-window-retry-limit=\<positive value\> indicates the autostart process to bail out if the retry limit exceeds irrespective of window period
16035 d) --autostart-window-size=\<x\> and --autostart-window-retry-limit=\<y\> indicates the autostart process to bail out if the retry limit "y" is exceeded for the last window period "x".
16036
16037
16038 ---
16039
16040 * [HBASE-17331](https://issues.apache.org/jira/browse/HBASE-17331) | *Minor* | **Avoid busy waiting in ThrottledInputStream**
16041
16042 For each read(), old ThrottledInputStream sleeps/wakes/checks for many times for controlling the throughput. After this patch, ThrottledInputStream sleeps/wakes/checks only once. So we can reduce CPU usage.
16043
16044
16045 ---
16046
16047 * [HBASE-17296](https://issues.apache.org/jira/browse/HBASE-17296) | *Major* | **Provide per peer throttling for replication**
16048
16049 Provide per peer throttling for replication. Add the bandwidth upper limit to ReplicationPeerConfig and a new shell cmd set\_peer\_bandwidth to update the bandwidth in need.
16050
16051
16052 ---
16053
16054 * [HBASE-17277](https://issues.apache.org/jira/browse/HBASE-17277) | *Major* | **Allow alternate BufferedMutator implementation**
16055
16056 Specify the name of an alternate BufferedMutator implementation by either:
16057
16058  \* Setting "hbase.client.bufferedmutator.classname" to the name of the alternate implementation class in Configuration
16059  \* Or, by setting BufferedMutatorParams#implementationClassName and passing the amended BufferedMutatorParams when calling Connection#getBufferedMutator.
16060
16061
16062 ---
16063
16064 * [HBASE-17294](https://issues.apache.org/jira/browse/HBASE-17294) | *Major* | **External Configuration for Memory Compaction**
16065
16066 This patch provides a single external knob to control memstore compaction. It also inmemory compaction with BASIC policy as our default (AFTERWORD: inmemory compaction as default was undone in HBASE-17333 because of test failures; will be reenabled in later, dedicated issue)
16067
16068 Possible memstore compaction policies are:
16069 (1) None - no memory compaction, when size threshold is exceeded data is flushed to disk
16070 (2) Basic policy applies optimizations which modify the index to a more compacted representation. This is beneficial in all access patterns. The smaller the cells are the greater the benefit of this policy. This is the default policy.
16071 (3) Eager - in addition to compacting the index representation as the basic policy, eager policy eliminates duplication while the data is still in memory (much like the on-disk compaction does after the data is flushed to disk). This policy is most useful for applications with high data churn or small working sets.
16072
16073 Memory compaction policeman be set at the column family level at table creation time:
16074 {code}
16075 create ‘\<tablename\>’,
16076    {NAME =\> ‘\<cfname\>’,
16077     IN\_MEMORY\_COMPACTION =\> ‘\<NONE\|BASIC\|EAGER\>’}
16078 {code}
16079 or as a property at the global configuration level by setting the property in hbase-site.xml, with BASIC being the default value:
16080 {code}
16081 \<property\>
16082         \<name\>hbase.hregion.compacting.memstore.type\</name\>
16083         \<value\>\<NONE\|BASIC\|EAGER\>\</value\>
16084 \</property\>
16085 {code}
16086 The values used in this property can change as memstore compaction policies evolve over time.
16087
16088
16089 ---
16090
16091 * [HBASE-16336](https://issues.apache.org/jira/browse/HBASE-16336) | *Major* | **Removing peers seems to be leaving spare queues**
16092
16093 Add a ReplicationZKNodeCleaner periodically check and delete the useless replication queue zk node belong to the peer which is not exist.
16094
16095
16096 ---
16097
16098 * [HBASE-17272](https://issues.apache.org/jira/browse/HBASE-17272) | *Major* | **Doc how to run Standalone HBase over an HDFS instance; all daemons in one JVM but persisting to an HDFS instance**
16099
16100 Adds section at http://hbase.apache.org/book.html#standalone.over.hdfs on how to make standalone persist to an hdfs instance (where standalone is all daemons in the one jvm).
16101
16102
16103 ---
16104
16105 * [HBASE-16700](https://issues.apache.org/jira/browse/HBASE-16700) | *Minor* | **Allow for coprocessor whitelisting**
16106
16107 Provides ability to restrict table coprocessors based on HDFS path whitelist. (Particularly useful for allowing Phoenix coprocessors but not arbitrary user created coprocessors.)
16108
16109
16110 ---
16111
16112 * [HBASE-17221](https://issues.apache.org/jira/browse/HBASE-17221) | *Major* | **Abstract out an interface for RpcServer.Call**
16113
16114 Provide an interface RpcCall on the server side.
16115 RpcServer.Call now is marked as @InterfaceAudience.Private, and implements the interface RpcCall,
16116
16117
16118 ---
16119
16120 * [HBASE-16119](https://issues.apache.org/jira/browse/HBASE-16119) | *Major* | **Procedure v2 - Reimplement merge**
16121
16122 The merge region logic is controlled by master in 2.0.0 (in 1.x, the core merge region logic is in the region server side).  The coprocessors related to merge region in RS-side would be no-op in 2.0.0 and later release.  Therefore, this is an incompatible change.  Users needs to move the CP logic to new master CP and registers them.
16123
16124 A new mergeRegionsAsync() API is added in client.  The existing mergeRegions() API will call the new API so client does not have to change its code.
16125
16126
16127 ---
16128
16129 * [HBASE-17112](https://issues.apache.org/jira/browse/HBASE-17112) | *Major* | **Prevent setting timestamp of delta operations the same as previous value's**
16130
16131 Before this issue, two concurrent Increments/Appends done in same millisecond or RS's clock going back will result in two results have same TS, which is not friendly to versioning and will get wrong result in slave cluster if the replication is disordered.
16132 After this issue, the result of Increment/Append will always have an incremental TS. There is no any inconsistent in replication for these operations. But there is a rare case that if there is a Delete in same millisecond, the later result can not be masked by this Delete. This can be fixed after we have new semantics that previous Delete will never mask later Put even its timestamp is higher.
16133
16134
16135 ---
16136
16137 * [HBASE-17181](https://issues.apache.org/jira/browse/HBASE-17181) | *Minor* | **Let HBase thrift2 support TThreadedSelectorServer**
16138
16139 Add TThreadedSelectorServer support for HBase Thrift2
16140
16141
16142 ---
16143
16144 * [HBASE-17178](https://issues.apache.org/jira/browse/HBASE-17178) | *Major* | **Add region balance throttling**
16145
16146 Add region balance throttling. Master execute every region balance plan per balance interval, which is equals to divide max balancing time by the size of region balance plan. And Introduce a new config hbase.master.balancer.maxRitPercent to protect availability. If config this to 0.01, then the max percent of regions in transition is 1% when balancing. Then the cluster's availability is at least 99% when balancing.
16147
16148
16149 ---
16150
16151 * [HBASE-15786](https://issues.apache.org/jira/browse/HBASE-15786) | *Major* | **Create DBB backed MSLAB pool**
16152
16153 Added a new config hbase.regionserver.offheap.global.memstore.size using which one can specify the global off heap limit that all memstores can use.  When this config is in MSLAB should be turned ON and we will use the entire size for the MSLAB pool. It will make off heap chunks and pool then. It will behave as if we are working with off heap memstores.  When this config is having a valid value and MSLAB is turned OFF, the system will just ignore the offheap size and continue to use global max heap space % for memstores and work with on heap memstores.
16154
16155
16156 ---
16157
16158 * [HBASE-17132](https://issues.apache.org/jira/browse/HBASE-17132) | *Major* | **Cleanup deprecated code for WAL**
16159
16160 Remove HLogKey and related classes and methods. Remove SequenceFile based log reader and writer. WALObserver and RegionObserver are changed so this is an incompatible change.
16161
16162
16163 ---
16164
16165 * [HBASE-16169](https://issues.apache.org/jira/browse/HBASE-16169) | *Major* | **Make RegionSizeCalculator scalable**
16166
16167 Added couple of API's to Admin.java:
16168
16169 Returns region load map of all regions hosted on a region server
16170 Map\<byte[], RegionLoad\> getRegionLoad(ServerName sn) throws IOException;
16171
16172 Returns region load map of all regions of a table hosted on a region server
16173 Map\<byte[], RegionLoad\> getRegionLoad(ServerName sn, TableName tableName) throws IOException
16174
16175 Added an API to region server:
16176
16177 public GetRegionLoadResponse getRegionLoad(RpcController controller,
16178     GetRegionLoadRequest request) throws ServiceException;
16179
16180 Primary intention is to use this API for RegionSizeCalculator and not rely on Master for ClusterStatus. On large clusters, ClusterStatus() can take a long time. IfMaster is down/busy, then some of the jobs timeout/fail. Other possible uses:
16181 1. If there is a lighter version of GetClusterStatus API (i.e without the ServerLoad for each RS), then custom maintenance tools can be better. In current world ClusterStatus is heavy. With the new APIs, each API's payload is smaller and distributed. So custom tools can call getRegionLoad() when needed, it will be more accurate. This helps with large clusters. For tools that don't need RegionLoad, the lighter version of API is fine enough.
16182 2. Another use case is a tool like RSTop - since we can see selective metrics at RegionLevel (possibly even deltas between each RPC to the server).
16183
16184
16185 ---
16186
16187 * [HBASE-15788](https://issues.apache.org/jira/browse/HBASE-15788) | *Major* | **Use Offheap ByteBuffers from BufferPool to read RPC requests.**
16188
16189 Using the ByteBuffers from ByteBufferPool to read the request bytes at server.  When the size of the request is smaller than 1/6th size of a BB in the pool, we will not use that but read into an on demand created, proper sized on heap ByteBuffer.
16190
16191
16192 ---
16193
16194 * [HBASE-17046](https://issues.apache.org/jira/browse/HBASE-17046) | *Major* | **Add 1.1 doc to hbase.apache.org**
16195
16196 Adds a 1.1. item to our 'Documentation and API' tab. Gives access to 1.1 APIs, XRef, etc.
16197
16198
16199 ---
16200
16201 * [HBASE-16962](https://issues.apache.org/jira/browse/HBASE-16962) | *Major* | **Add readPoint to preCompactScannerOpen() and preFlushScannerOpen() API**
16202
16203 The following RegionObserver methods are deprecated
16204
16205 InternalScanner preFlushScannerOpen(final ObserverContext\<RegionCoprocessorEnvironment\> c,
16206     final Store store, final KeyValueScanner memstoreScanner, final InternalScanner s)
16207     throws IOException;
16208
16209 InternalScanner preCompactScannerOpen(final ObserverContext\<RegionCoprocessorEnvironment\> c,
16210     final Store store, List\<? extends KeyValueScanner\> scanners, final ScanType scanType,
16211     final long earliestPutTs, final InternalScanner s, CompactionRequest request)
16212
16213 Instead, use the following methods:
16214
16215 InternalScanner preFlushScannerOpen(final ObserverContext\<RegionCoprocessorEnvironment\> c,
16216     final Store store, final KeyValueScanner memstoreScanner, final InternalScanner s,
16217     final long readPoint) throws IOException;
16218
16219 InternalScanner preCompactScannerOpen(final ObserverContext\<RegionCoprocessorEnvironment\> c,
16220     final Store store, List\<? extends KeyValueScanner\> scanners, final ScanType scanType,
16221     final long earliestPutTs, final InternalScanner s, final CompactionRequest request,
16222     final long readPoint) throws IOException
16223
16224
16225 ---
16226
16227 * [HBASE-17017](https://issues.apache.org/jira/browse/HBASE-17017) | *Major* | **Remove the current per-region latency histogram metrics**
16228
16229 Removes per-region level (get size, get time, scan size and scan time histogram) metrics that was exposed before. Per-region histogram metrics with 1000+ regions causes millions of objects to be allocated on heap. The patch introduces getCount and scanCount as counters rather than histograms. Other per-region level metrics are kept as they are.
16230
16231
16232 ---
16233
16234 * [HBASE-16955](https://issues.apache.org/jira/browse/HBASE-16955) | *Major* | **Fixup precommit protoc check to do new distributed protos and pb 3.1.0 build**
16235
16236 Test that environment no longer has to have protoc (2.5 and 3.1) available. Needed small adjustment in yetus protoc build but otherwise all works.
16237
16238
16239 ---
16240
16241 * [HBASE-17050](https://issues.apache.org/jira/browse/HBASE-17050) | *Minor* | **Upgrade Apache CLI version from 1.2 to 1.3.1**
16242
16243 Upgrade Apache CLI version from 1.2 to 1.3.1.
16244
16245 These are few good/important changes included in this update:
16246 - HelpFormatter now prints command-line options in the same order as they
16247   have been added. Fixes CLI-212.
16248 - Standard help text now shows mandatory arguments also for the first
16249   option. Fixes CLI-186.
16250 - A new parser is available: DefaultParser. It combines the features of the
16251   GnuParser and the PosixParser. It also provides additional features like
16252   partial matching for the long options, and long options without separator
16253   (i.e like the JVM memory settings: -Xmx512m). This new parser deprecates
16254   the previous ones. Fixes CLI-161,CLI-167,CLI-181.
16255
16256 For full list of changes:
16257   https://commons.apache.org/proper/commons-cli/changes-report.html#a1.3
16258
16259
16260 ---
16261
16262 * [HBASE-15513](https://issues.apache.org/jira/browse/HBASE-15513) | *Major* | **hbase.hregion.memstore.chunkpool.maxsize is 0.0 by default**
16263
16264 MSLAB chunk pool is on by default in hbase-2.0.0.
16265
16266
16267 ---
16268
16269 * [HBASE-16972](https://issues.apache.org/jira/browse/HBASE-16972) | *Major* | **Log more details for Scan#next request when responseTooSlow**
16270
16271 **WARNING: No release note provided for this change.**
16272
16273
16274 ---
16275
16276 * [HBASE-17014](https://issues.apache.org/jira/browse/HBASE-17014) | *Minor* | **Add clearly marked starting and shutdown log messages for all services.**
16277
16278 Delimit START, STOP, and ABORT messages with '\*\*\*\*\*' so denote.
16279
16280
16281 ---
16282
16283 * [HBASE-16765](https://issues.apache.org/jira/browse/HBASE-16765) | *Critical* | **New SteppingRegionSplitPolicy, avoid too aggressive spread of regions for small tables.**
16284
16285 Introduces a new split policy: SteppingSplitPolicy
16286 This will use a simple step function to split a region at (by default) 2  xflushSize when no other region of the same table is seen on the region server, or max-file-size when one or more other regions of the same table is seen.
16287
16288 In HBase 2.0 this is going to be the default. In previous versions it can be configured.
16289
16290
16291 ---
16292
16293 * [HBASE-16608](https://issues.apache.org/jira/browse/HBASE-16608) | *Major* | **Introducing the ability to merge ImmutableSegments without copy-compaction or SQM usage**
16294
16295 The index-compation and data-compaction variants of CompactingMemStore are introduced. In both types the active (mutable) segment is periodically flushed-in-memory and is added as immutable segment in the compaction pipeline. The CompactingMemStore of index-compaction type is merging all immutable segments of the compacting pipeline into one. The merging of N segments is explained below. The CompactingMemStore of data-compaction type is compacting all immutable segments of the compacting pipeline into one. After the merge/compaction the old segments in the compacting pipeline are replaced with one new.
16296
16297 Before explaining the process of merging N old segments into new one, note that segment structure includes ordered index that allows traversing the cells data efficiently. The merge is copying the ordered indexes of the old segments into one ordered index of new segment. No data is copied, no cells are filtered. Alternatively, in the process of compacting N old segments into new one, both data and index are copied. The old cells are filtered, meaning upon compaction unused versions of the cells are not copied so the new segment has less data then all old ones.
16298
16299 This issue introduces only the merging ability and simplifies the user intervention for switching between types. The previous CompactingMemStore structure was added by HBASE-16420 and HBASE-16421. The future refinements of the policy or merging/compacting will come in HBASE-16417.
16300
16301 In order to create a table with CompactingMemStore as a MemStore one should use:
16302 create ‘\<tablename\>’, {NAME =\> ‘\<cfname\>’, IN\_MEMORY\_COMPACTION =\> true}
16303 IN\_MEMORY\_COMPACTION default is false, so table created as following will have the known DefaultMemStore as a MemStore.
16304 create ‘\<tablename\>’, {NAME =\> ‘\<cfname\>’}
16305
16306 The default type of CompactingMemStore is index-compaction. In order to change it to data-compaction one should add to the hbase-site.xml
16307 \<property\>
16308     \<name\>hbase.hregion.compacting.memstore.type\</name\>
16309     \<value\>data-compaction\</value\>
16310   \</property\>
16311
16312 in addition to creating the table as following
16313 create ‘\<tablename\>’, {NAME =\> ‘\<cfname\>’, IN\_MEMORY\_COMPACTION =\> true}
16314
16315
16316 ---
16317
16318 * [HBASE-16747](https://issues.apache.org/jira/browse/HBASE-16747) | *Major* | **Track memstore data size and heap overhead separately**
16319
16320 Marking it as incompatible change as there is a change in behavior for region flush decision. The default flush size of 128 MB per region was tracked against both actual data bytes size + overhead of these cells in memstore memory (Overhead because of Cell java objects and CSLM entry).  As part of this jira we will keep track of cell data size only in region level.  So 128 MB flush size means, 128 MB of cell data bytes (key+ value+..)
16321
16322 Globally we will track cell data size and heap overhead separately and will consider both for forced flushes. We will not allow over consume of heap memory by all memstore. This is as old case. Only tracking way is changed.
16323
16324
16325 ---
16326
16327 * [HBASE-16974](https://issues.apache.org/jira/browse/HBASE-16974) | *Minor* | **Update os-maven-plugin to 1.4.1.final+ for building shade file on RHEL/CentOS**
16328
16329 Upgrade os-maven-plugin mvn extension which figures the os we are running on from 1.4 to 1.5.
16330
16331
16332 ---
16333
16334 * [HBASE-16952](https://issues.apache.org/jira/browse/HBASE-16952) | *Major* | **Replace hadoop-maven-plugins with protobuf-maven-plugin for building protos**
16335
16336 Simplifies .proto manipulations. One step only now -- no need to keep pom.xml listing up to date with the protobuf protos directory content -- and no need to preinstall protoc; mvn does it all for you now.
16337
16338
16339 ---
16340
16341 * [HBASE-14551](https://issues.apache.org/jira/browse/HBASE-14551) | *Minor* | **Procedure v2 - Reimplement split**
16342
16343 Moved the Split Region logic to Master and most of split region coprocessor is in master now.  Need to change dependency such as Phoenix.
16344
16345
16346 ---
16347
16348 * [HBASE-15789](https://issues.apache.org/jira/browse/HBASE-15789) | *Major* | **PB related changes to work with offheap**
16349
16350 This issue adds a patch to our checked in internal, shaded protobuf, but it also adds a general means of apply patches to our version of protobuf. Patches found in the new src/main/patches directory are all applied as the last task when you run a build with the -Pcompile-protobuf profile under the hbase-protocol-shaded module. This commit also includes our first patch to protobuf; it adds ByteInput to mimic pb3.1's ByteOutput (src/main/patches/HBASE-15789\_V2.patch attached here).
16351
16352
16353 ---
16354
16355 * [HBASE-16930](https://issues.apache.org/jira/browse/HBASE-16930) | *Major* | **AssignmentManager#checkWals() function can recur infinitely**
16356
16357 Fixed potential infinite recursion in AssignmentManager.checkWals().
16358
16359
16360 ---
16361
16362 * [HBASE-16463](https://issues.apache.org/jira/browse/HBASE-16463) | *Major* | **Improve transparent table/CF encryption with Commons Crypto**
16363
16364 Improve transparent table/CF encryption with Commons Crypto. The change introduces a new optional CryptoCipherProvider (CommonsCryptoAES) for transparent table/CF encryption. And the encryption performance would be accelerated by hardware in modern CPU (AES-NI). This feature could be enabled by updating the configuration "hbase.crypto.cipherprovider" to "org.apache.hadoop.hbase.io.crypto.CryptoCipherProvider" in hbase-site.xml. For detailed information about transparent table/CF encryption including configuration examples see the Security section of the HBase manual.
16365
16366
16367 ---
16368
16369 * [HBASE-16414](https://issues.apache.org/jira/browse/HBASE-16414) | *Major* | **Improve performance for RPC encryption with Apache Common Crypto**
16370
16371 With the security RPC and encryption enabled, introduce Apache Commons Crypto to do the encryption/decryption which supports both supports both JCE Cipher and OpenSSL Cipher. Adds new configs "hbase.rpc.crypto.encryption.aes.enabled" which defaults to false, and "hbase.rpc.crypto.encryption.aes.cipher.class" which defaults to "org.apache.commons.crypto.cipher.JceCipher" to support JCE Cipher, it also can be set as "org.apache.hadoop.crypto.OpensslCipher" to support Openssl Cipher.
16372
16373
16374 ---
16375
16376 * [HBASE-16721](https://issues.apache.org/jira/browse/HBASE-16721) | *Critical* | **Concurrency issue in WAL unflushed seqId tracking**
16377
16378 Fixed a bug in sequenceId tracking for the WALs that caused WAL files to accumulate without being deleted due to a rare race condition.
16379
16380
16381 ---
16382
16383 * [HBASE-16834](https://issues.apache.org/jira/browse/HBASE-16834) | *Major* | **Add AsyncConnection support for ConnectionFactory**
16384
16385 Add createAsyncConnection method to ConnectionFactory for creating AsyncConnection. The default implementation is org.apache.hadoop.hbase.client.AsyncConnectionImpl. You can use 'hbase.client.async.connection.impl' to plug in your own AsyncConnection implementation.
16386
16387
16388 ---
16389
16390 * [HBASE-16729](https://issues.apache.org/jira/browse/HBASE-16729) | *Trivial* | **Define the behavior of (default) empty FilterList**
16391
16392 Empty filter list will behave as when there is no filter added. This change is a behavioral change for those who rely on Empty filter list.
16393
16394
16395 ---
16396
16397 * [HBASE-16799](https://issues.apache.org/jira/browse/HBASE-16799) | *Major* | **CP exposed Store should not expose unwanted APIs**
16398
16399 Below APIs from CP exposed Store interface are removed
16400 upsert(Iterable\<Cell\> cells, long readpoint)
16401 add(Cell cell)
16402 add(Iterable\<Cell\> cells)
16403 replayCompactionMarker(CompactionDescriptor compaction, boolean pickCompactionFiles,  boolean removeFiles)
16404 assertBulkLoadHFileOk(Path srcPath)
16405 bulkLoadHFile(String srcPathStr, long sequenceId)
16406 bulkLoadHFile(StoreFileInfo fileInfo)
16407
16408
16409 ---
16410
16411 * [HBASE-15921](https://issues.apache.org/jira/browse/HBASE-15921) | *Major* | **Add first AsyncTable impl and create TableImpl based on it**
16412
16413 Add AsyncConnection, AsyncTable and AsyncTableRegionLocator. Now the AsyncTable only support get, put and delete. And the implementation of AsyncTableRegionLocator is synchronous actually.
16414
16415
16416 ---
16417
16418 * [HBASE-16664](https://issues.apache.org/jira/browse/HBASE-16664) | *Major* | **Timeout logic in AsyncProcess is broken**
16419
16420 This issue fix three bugs:
16421 1.  rpcTimeout configuration not work for one rpc call in AP
16422 2.  operationTimeout configuration not work for multi-request (batch, put) in AP
16423 3.  setRpcTimeout and setOperationTimeout in HTable is not worked for AP and BufferedMutator.
16424
16425
16426 ---
16427
16428 * [HBASE-16661](https://issues.apache.org/jira/browse/HBASE-16661) | *Minor* | **Add last major compaction age to per-region metrics**
16429
16430 This adds a new per-region metric named "lastMajorCompactionAge" for tracking time since the last major compaction ran on a given region.  If a major compaction has never run, the age will be equal to the current timestamp.
16431
16432
16433 ---
16434
16435 * [HBASE-16117](https://issues.apache.org/jira/browse/HBASE-16117) | *Major* | **Fix Connection leak in mapred.TableOutputFormat**
16436
16437 (This change will be irrelevant after HBASE-16774 lands).
16438 There is a subtle change with error handling when a connection is not able to connect to ZK.  Attempts to create a connection when ZK is not up will now fail immediately instead of silently creating and then failing on a subsequent HBaseAdmin call.
16439
16440
16441 ---
16442
16443 * [HBASE-15984](https://issues.apache.org/jira/browse/HBASE-15984) | *Critical* | **Given failure to parse a given WAL that was closed cleanly, replay the WAL.**
16444
16445 In some particular deployments, the Replication code believes it has
16446 reached EOF for a WAL prior to successfully parsing all bytes known to
16447 exist in a cleanly closed file.
16448
16449 If an EOF is detected due to parsing or other errors while there are still unparsed bytes before the end-of-file trailer, we now reset the WAL to the very beginning and attempt a clean read-through. Because we will retry these failures indefinitely, two additional changes are made to help with diagnostics:
16450
16451 \* On each retry attempt, a log message like the below will be emitted at the WARN level:
16452
16453       Processing end of WAL file '{}'. At position {}, which is too far away
16454       from reported file length {}. Restarting WAL reading (see HBASE-15983
16455       for details).
16456
16457 \*  additional metrics measure the use of this recovery mechanism. they are described in the reference guide.
16458
16459
16460 ---
16461
16462 * [HBASE-16753](https://issues.apache.org/jira/browse/HBASE-16753) | *Minor* | **There is a mismatch between suggested Java version in hbase-env.sh**
16463
16464 Updates the comments and default values in a few scripts and docs to reflect our Java 1.8+ requirement.
16465
16466
16467 ---
16468
16469 * [HBASE-16567](https://issues.apache.org/jira/browse/HBASE-16567) | *Critical* | **Upgrade to protobuf-3.1.x**
16470
16471 Core is now up on protobuf 3.1.0 (Coprocessor Endpoints and REST are still on protobuf 2.5.0).
16472
16473
16474 ---
16475
16476 * [HBASE-15638](https://issues.apache.org/jira/browse/HBASE-15638) | *Critical* | **Shade protobuf**
16477
16478 Shade/relocate and include the protobuf we use internally. See protobuf chapter in the refguide for more on how we protobuf in hbase-.2.0.0 and going forward.
16479
16480 See https://docs.google.com/document/d/1H4NgLXQ9Y9KejwobddCqaVMEDCGbyDcXtdF5iAfDIEk/edit# for how we arrived at this approach.
16481
16482 See http://mail-archives.apache.org/mod\_mbox/hbase-dev/201610.mbox/%3C07850EDD-7230-431B-9AB0-C5C91B105EEC%40gmail.com%3E for discussion around merging this change and of how we might revert if an alternative to this awkward patch presents itself; e.g. an hadoop with CLASSPATH isolation (and means of dealing with Sparks use of protobuf 2.5.0, etc.)
16483
16484
16485 ---
16486
16487 * [HBASE-16264](https://issues.apache.org/jira/browse/HBASE-16264) | *Critical* | **Figure how to deal with endpoints and shaded pb**
16488
16489 Shade/relocate the protobuf hbase uses internally. All core now refers to new module added in this patch, hbase-protocol-shaded. Coprocessor Endpoints carry-on with references to the original hbase-protocol module. See new chapter in book on protobufs on how-to going forward.
16490
16491
16492 ---
16493
16494 * [HBASE-16672](https://issues.apache.org/jira/browse/HBASE-16672) | *Major* | **Add option for bulk load to always copy hfile(s) instead of renaming**
16495
16496 This issue adds a config, always.copy.files, to LoadIncrementalHFiles.
16497 When set to true, source hfiles would be copied. Meaning source hfiles would be kept after bulk load is done.
16498 Default value is false.
16499
16500
16501 ---
16502
16503 * [HBASE-16660](https://issues.apache.org/jira/browse/HBASE-16660) | *Critical* | **ArrayIndexOutOfBounds during the majorCompactionCheck in DateTieredCompaction**
16504
16505 "Please do not use DateTieredCompaction with Major Compaction unless you have a version with this. Otherwise your cluster will not compact any store files and you can end up running out of file descriptors." @churro morales
16506
16507
16508 ---
16509
16510 * [HBASE-16257](https://issues.apache.org/jira/browse/HBASE-16257) | *Blocker* | **Move staging dir to be under hbase root dir**
16511
16512 The HBase property 'hbase.bulkload.staging.dir' is deprecated and is ignored from HBase 2.0.  It will defaults to hbase.rootdir/staging automatically with the correct permissions.
16513
16514
16515 ---
16516
16517 * [HBASE-16650](https://issues.apache.org/jira/browse/HBASE-16650) | *Major* | **Wrong usage of BlockCache eviction stat for heap memory tuning**
16518
16519 Changed tracking of evictedBlocks count NOT to include evictions of blocks for a removed HFile. HFiles gets removed after compaction
16520
16521
16522 ---
16523
16524 * [HBASE-16294](https://issues.apache.org/jira/browse/HBASE-16294) | *Minor* | **hbck reporting "No HDFS region dir found" for replicas**
16525
16526 Fixed warning error message displayed for region directory not found for non-default/ non-primary replicas in hbck
16527
16528
16529 ---
16530
16531 * [HBASE-16540](https://issues.apache.org/jira/browse/HBASE-16540) | *Major* | **Scan should do additional validation on start and stop row**
16532
16533 Scan#setStartRow() and Scan#setStopRow() now validate the argument passed for each row key.  If the length of the byte[] passed exceeds Short.MAX\_VALUE, an IllegalArgumentException will be thrown.
16534
16535
16536 ---
16537
16538 * [HBASE-7612](https://issues.apache.org/jira/browse/HBASE-7612) | *Trivial* | **[JDK8] Replace use of high-scale-lib counters with intrinsic facilities**
16539
16540 org.apache.hadoop.hbase.util.Counter is deprecated now and will be removed in 3.0. Use LongAdder instead.
16541
16542
16543 ---
16544
16545 * [HBASE-16447](https://issues.apache.org/jira/browse/HBASE-16447) | *Critical* | **Replication by namespaces config in peer**
16546
16547 Support replication by namespaces config in peer.
16548 1. Set a namespace in peer config means that all tables in this namespace will be replicated.
16549 2. If the namespaces config is null, then the table-cfs config decide which table's edit can be replicated. If the table-cfs config is null, then the namespaces config decide which table's edit can be replicated.
16550 3. If you already have set a namespace in the peer config, then you can't set any table of this namespace to the peer config. If you already have set a table in the peer config, then you can't set this table's namespace to the peer config.
16551
16552
16553 ---
16554
16555 * [HBASE-16598](https://issues.apache.org/jira/browse/HBASE-16598) | *Major* | **Enable zookeeper useMulti always and clean up in HBase code**
16556
16557 Deprecate the configuration property 'hbase.zookeeper.useMulti'.
16558 useMulti will always be enabled. ZooKeeper 3.4.x and newer is required.
16559
16560 Internal:
16561
16562 The ZKUtil#multiOrSequential(ZooKeeperWatcher zkw, List\<ZKUtilOp\> ops, boolean runSequentialOnMultiFailure) will not check 'hbase.zookeeper.useMulti' anymore, and will always use multi.
16563 It can still fall back to sequential operations if:
16564
16565 RunSequentialOnMultiFailure is true
16566 On calling multi, we get a ZooKeeper exception that can be handled by a sequential call.
16567
16568
16569 ---
16570
16571 * [HBASE-16388](https://issues.apache.org/jira/browse/HBASE-16388) | *Major* | **Prevent client threads being blocked by only one slow region server**
16572
16573 Add a new configuration, hbase.client.perserver.requests.threshold, to limit the max number of concurrent request to one region server. If the user still create new request after reaching the limit, client will throw ServerTooBusyException and do not send the request to the server. This is a client side feature and can prevent client's threads being blocked by one slow region server resulting in the availability of client is much lower than the availability of region servers.
16574
16575 For completeness, here extract on new config from hbase-default.xml:
16576
16577 Property: hbase.client.perserver.requests.threshold
16578 Default: 2147483647
16579 Description: The max number of concurrent pending requests for one server in all client threads (process level). Exceeding requests will be thrown ServerTooBusyException immediately to prevent user's threads being occupied and blocked by only one slow region server. If you use a fix number of threads to access HBase in a synchronous way, set this to a suitable value which is  related to the number of threads will help you. See https://issues.apache.org/jira/browse/HBASE-16388 for details.
16580
16581
16582 ---
16583
16584 * [HBASE-15297](https://issues.apache.org/jira/browse/HBASE-15297) | *Minor* | **error message is wrong when a wrong namspace is specified in grant in hbase shell**
16585
16586 The security admin instance available within the HBase shell now returns "false" from the namespace\_exists? method for non-existent namespaces rather than raising a wrapped NamespaceNotFoundException.
16587
16588 As a side effect, when the "grant" and "revoke" commands in the HBase shell are invoked with a non-existent namespace the resulting error message now properly refers to said namespace rather than to the user.
16589
16590
16591 ---
16592
16593 * [HBASE-16086](https://issues.apache.org/jira/browse/HBASE-16086) | *Major* | **TableCfWALEntryFilter and ScopeWALEntryFilter should not redundantly iterate over cells.**
16594
16595 push to branch-1.3+
16596
16597
16598 ---
16599
16600 * [HBASE-16340](https://issues.apache.org/jira/browse/HBASE-16340) | *Critical* | **ensure no Xerces jars included**
16601
16602 HBase no longer includes Xerces implementation jars that were previously included via transitive dependencies. Downstream users relying on HBase for these artifacts will need to update their dependencies.
16603
16604
16605 ---
16606
16607 * [HBASE-16213](https://issues.apache.org/jira/browse/HBASE-16213) | *Major* | **A new HFileBlock structure for fast random get**
16608
16609 HBASE-16213 introduced a new DataBlockEncoding in name of ROW\_INDEX\_V1, which could improve random read (get) performance especially when the average record size (key-value size per row) is small. To use this feature, please set DATA\_BLOCK\_ENCODING to ROW\_INDEX\_V1 for CF of newly created table, or change existing CF with below command:
16610 alter 'table\_name',{NAME =\> 'cf', DATA\_BLOCK\_ENCODING =\> 'ROW\_INDEX\_V1'}.
16611
16612 Please note that if we turn this DBE on, HFile block will be bigger than NONE encoding because it adds some meta infos for binary search:
16613 /\*\*
16614  \* Store cells following every row's start offset, so we can binary search to a row's cells.
16615  \*
16616  \* Format:
16617  \* flat cells
16618  \* integer: number of rows
16619  \* integer: row0's offset
16620  \* integer: row1's offset
16621  \* ....
16622  \* integer: dataSize
16623  \*
16624 \*/
16625
16626 Seek in row when random reading is one of the main consumers of CPU. This helps. See slide #7 here https://www.slideshare.net/HBaseCon/lift-the-ceiling-of-hbase-throughputs?qid=597ee2fa-8125-4faa-bb3b-2bf1ba9ccafb&v=&b=&from\_search=6
16627
16628
16629 ---
16630
16631 * [HBASE-16409](https://issues.apache.org/jira/browse/HBASE-16409) | *Minor* | **Row key for bad row should be properly delimited in VerifyReplication**
16632
16633 --delimiter= option is added to verifyrep.
16634 The delimiter would wrap bad rows in log output.
16635
16636
16637 ---
16638
16639 * [HBASE-14921](https://issues.apache.org/jira/browse/HBASE-14921) | *Major* | **Inmemory Compaction Optimizations; Segment Structure**
16640
16641 A long, working issue that discussed Segment formats introducing CellArrayMap (delivered as the patch attached to this issue) and CellChunkMap (to be delivered later in HBASE-16421 but see patch v02 for an embryonic form named CellBlockSerialized); when to copy Segment data (and when not too); and then what to include at flush time (the suffix Segment or all Segments). Designs that evolved as discussion went on are attached. Outstanding issues turned up here, not including a CellChunkMap implementation, are listed below but are to be addressed in follow-ons (See HBASE-16417):
16642
16643 1. The flattening without compaction is causing many small segments in pipeline, and they are not flushed all together.
16644 2. The issue of compaction prediction cost.
16645
16646
16647 ---
16648
16649 * [HBASE-16450](https://issues.apache.org/jira/browse/HBASE-16450) | *Major* | **Shell tool to dump replication queues**
16650
16651 New tool to dump existing replication peers, configurations and queues when using HBase Replication. The tool provides two flags:
16652
16653  --distributed  This flag will poll each RS for information about the replication queues being processed on this RS.
16654 By default this is not enabled and the information about the replication queues and configuration will be obtained from ZooKeeper.
16655  --hdfs   When --distributed is used, this flag will attempt to calculate the total size of the WAL files used by the replication queues. Since its possible that multiple peers can be configured this value can be overestimated.
16656
16657
16658 ---
16659
16660 * [HBASE-16422](https://issues.apache.org/jira/browse/HBASE-16422) | *Major* | **Tighten our guarantees on compatibility across patch versions**
16661
16662 Adds below change to our compat guarantees:
16663
16664 {code}
16665 -\* Example: A user using a newly deprecated api does not need to modify application code with hbase api calls until the next major version.
16666  10 +\* New APIs introduced in a patch version will only be added in a source compatible way footnote:[See 'Source Compatibility' https://blogs.oracle.com/darcy/entry/kinds\_of\_compatibility]: i.e.     code that implements public APIs will continue to compile.
16667 {code}
16668
16669
16670 ---
16671
16672 * [HBASE-7621](https://issues.apache.org/jira/browse/HBASE-7621) | *Major* | **REST client (RemoteHTable) doesn't support binary row keys**
16673
16674 RemoteHTable now supports binary row keys with any character or byte by properly encoding request URLs. This is a both a behavioral change from earlier versions and an important fix for protocol correctness.
16675
16676
16677 ---
16678
16679 * [HBASE-12721](https://issues.apache.org/jira/browse/HBASE-12721) | *Major* | **Create Docker container cluster infrastructure to enable better testing**
16680
16681 Downstream users wishing to test HBase in a "distributed" fashion (multiple "nodes" running as separate containers on the same host) can now do so in an automated fashion while leveraging Docker for process isolation via the clusterdock project.
16682
16683 For details see the README.md in the dev-support/apache\_hbase\_topology folder.
16684
16685
16686 ---
16687
16688 * [HBASE-16267](https://issues.apache.org/jira/browse/HBASE-16267) | *Critical* | **Remove commons-httpclient dependency from hbase-rest module**
16689
16690 This issue upgrades httpclient to 4.5.2 and httpcore to 4.4.4 which are the versions used by hadoop-2.
16691 This is to handle the following CVE's.
16692
16693 https://web.nvd.nist.gov/view/vuln/detail?vulnId=CVE-2015-5262 : http/conn/ssl/SSLConnectionSocketFactory.java in Apache HttpComponents HttpClient before 4.3.6 ignores the http.socket.timeout configuration setting during an SSL handshake, which allows remote attackers to cause a denial of service (HTTPS call hang) via unspecified vectors.
16694
16695 https://web.nvd.nist.gov/view/vuln/detail?vulnId=CVE-2012-6153
16696 https://web.nvd.nist.gov/view/vuln/detail?vulnId=CVE-2012-5783
16697 Apache Commons HttpClient 3.x, as used in Amazon Flexible Payments Service (FPS) merchant Java SDK and other products, does not verify that the server hostname matches a domain name in the subject's Common Name (CN) or subjectAltName field of the X.509 certificate, which allows man-in-the-middle attackers to spoof SSL servers via an arbitrary valid certificate.
16698
16699 Downstream users who are exposed to commons-httpclient via the HBase classpath will have to similarly update their dependency.
16700
16701
16702 ---
16703
16704 * [HBASE-16308](https://issues.apache.org/jira/browse/HBASE-16308) | *Major* | **Contain protobuf references**
16705
16706 Undo protobuf references through the codebase so protobuf references are contained rather than spread about the codebase. For example, moved protobuff-ing up into the various Callables rather than repeat on each method invocation cleaning up boilerplate around rpc calls. Having a few protobuf reference locations only simplifies the parent issue shading project.
16707
16708
16709 ---
16710
16711 * [HBASE-16321](https://issues.apache.org/jira/browse/HBASE-16321) | *Blocker* | **Ensure findbugs jsr305 jar isn't present**
16712
16713 HBase now ensures the jsr305 implementation from the findbugs project is not included in its binary artifacts or the compile / runtime dependencies of its user facing modules. Downstream users that rely on this jar will need to update their dependencies.
16714
16715
16716 ---
16717
16718 * [HBASE-8386](https://issues.apache.org/jira/browse/HBASE-8386) | *Major* | **deprecate TableMapReduce.addDependencyJars(Configuration, class\<?\> ...)**
16719
16720 The MapReduce helper function \`TableMapReduce.addDependencyJars(Configuration, class\<?\> ...)\` has been deprecated since it is easy to use incorrectly. Most users should rely on addDependencyJars(Job) instead.
16721
16722
16723 ---
16724
16725 * [HBASE-16287](https://issues.apache.org/jira/browse/HBASE-16287) | *Major* | **LruBlockCache size should not exceed acceptableSize too many**
16726
16727 In order to avoid blockcache size exceed acceptable size too much, we add one configuration "hbase.lru.blockcache.hard.capacity.limit.factor" to decide whether the block could be put into LruBlockCache or not.  This factor defaults to 1.2
16728 If blockcache size \>= factor\*acceptableSize, we will reject the block into cache.
16729
16730
16731 ---
16732
16733 * [HBASE-16355](https://issues.apache.org/jira/browse/HBASE-16355) | *Major* | **hbase-client dependency on hbase-common test-jar should be test scope**
16734
16735 The HBase client artifact previously incorrectly included the hbase-common test jar as a runtime dependency. With this change, that dependency has been moved to test scope. Downstream users are not expected to be impacted, unless they relied on the transitive dependency for these HBase internal test classes.
16736
16737
16738 ---
16739
16740 * [HBASE-16317](https://issues.apache.org/jira/browse/HBASE-16317) | *Blocker* | **revert all ESAPI changes**
16741
16742 This issue reverts fixes designed to prevent malicious content from rendering in HBase's UIs. Specifically, these changes shipped in 1.1.4+ and 1.2.0+. They were removed due to licensing issues discovered in the dependencies they introduced. Their implementation and those dependencies have been removed from HBase! Removal of these dependencies is against the strict definition of our version compatibility guidelines. However, inclusion of non-Apache approved licenses cannot be tolerated. Implementation of these fixes using an Apache-appropriate means is tracked in HBASE-16328.
16743
16744
16745 ---
16746
16747 * [HBASE-16288](https://issues.apache.org/jira/browse/HBASE-16288) | *Critical* | **HFile intermediate block level indexes might recurse forever creating multi TB files**
16748
16749 A new hfile configuration "hfile.index.block.min.entries" which defaults to 16 determines how many entries the hfile index block can have at least. The configuration which determines how large the index block can be at max (hfile.index.block.max.size) is ignored as long as we have fewer than hfile.index.block.min.entries entries. This ensures that multi-level index does not build up with too many levels.
16750
16751
16752 ---
16753
16754 * [HBASE-16186](https://issues.apache.org/jira/browse/HBASE-16186) | *Major* | **Fix AssignmentManager MBean name**
16755
16756 The AssignmentManager MBean was named AssignmentManger (note misspelling). This patch fixed the misspelling.
16757
16758
16759 ---
16760
16761 * [HBASE-16289](https://issues.apache.org/jira/browse/HBASE-16289) | *Critical* | **AsyncProcess stuck messages need to print region/server**
16762
16763 Adds logging of region and server. Helpful debugging. Logging now looks like this:
16764 {code}
16765 2016-06-23 17:07:18,759 INFO  [Thread-1] client.AsyncProcess$AsyncRequestFutureImpl(1601): #1, waiting for 1  actions to finish on table: DUMMY\_TABLE
16766 2016-06-23 17:07:18,759 INFO  [Thread-1] client.AsyncProcess(1720): Left over 1 task(s) are processed on server(s): [s1:1,1,1]
16767 2016-06-23 17:07:18,759 INFO  [Thread-1] client.AsyncProcess(1728): Regions against which left over task(s) are processed: [DUMMY\_TABLE,DUMMY\_BYTES\_1,1.3fd12ea80b4df621fb15497ba75f7368.,DUMMY\_TABLE,DUMMY\_BYTES\_2,2.924207e242e313d2e5491c625e0a296e.]
16768 {code}
16769
16770
16771 ---
16772
16773 * [HBASE-14743](https://issues.apache.org/jira/browse/HBASE-14743) | *Minor* | **Add metrics around HeapMemoryManager**
16774
16775 A memory metrics reveals situations happened in both MemStores and BlockCache in RegionServer. Through this metrics, users/operators can know
16776 1). Current size of MemStores and BlockCache in bytes.
16777 2). Occurrence for Memstore minor and major flush. (named unblocked flush and blocked flush respectively, shown in histogram)
16778 3). Dynamic changes in size between MemStores and BlockCache. (with Increase/Decrease as prefix, shown in histogram). And a counter for no changes, named DoNothingCounter.
16779 4). Occurrence for memory usage alarm (used more than 95% by default) in RegionServer. (named AboveHeapOccupancyLowWatermarkCounter)
16780
16781
16782 ---
16783
16784 * [HBASE-13701](https://issues.apache.org/jira/browse/HBASE-13701) | *Major* | **Consolidate SecureBulkLoadEndpoint into HBase core as default for bulk load**
16785
16786 SecureBulkLoadEndpoint  has been integrated into HBase core as default bulk load mechanism. It is no longer needed to install it as a coprocessor endpoint.
16787 The new server is backward compatible, accommodating non-secure old client and secure old client requesting SecureBulkLoadEndpoint service.
16788 SecureBulkLoadEndpoint is deprecated. The backward compatibility support may be removed in future releases.
16789
16790
16791 ---
16792
16793 * [HBASE-16244](https://issues.apache.org/jira/browse/HBASE-16244) | *Major* | **LocalHBaseCluster start timeout should be configurable**
16794
16795 When LocalHBaseCluster is started from the command line the Master would give up after 30 seconds due to a hardcoded timeout meant for unit tests. This change allows the timeout to be configured via hbase-site as well as sets it to 5 minutes when LocalHBaseCluster is started from the command line.
16796
16797
16798 ---
16799
16800 * [HBASE-16052](https://issues.apache.org/jira/browse/HBASE-16052) | *Major* | **Improve HBaseFsck Scalability**
16801
16802 HBASE-16052 improves the performance and scalability of HBaseFsck, especially for large clusters with a small number of large tables.
16803
16804 Searching for lingering reference files is now a multi-threaded operation.  Loading HDFS region directory information is now multi-threaded at the region-level instead of the table-level to maximize concurrency.  A performance bug in HBaseFsck that resulted in redundant I/O and RPCs was fixed by introducing a FileStatusFilter that filters FileStatus objects directly.
16805
16806
16807 ---
16808
16809 * [HBASE-16144](https://issues.apache.org/jira/browse/HBASE-16144) | *Major* | **Replication queue's lock will live forever if RS acquiring the lock has died prematurely**
16810
16811 If zk based replication queue is used and useMulti is false, we will schedule a chore to clean up the orphan replication queue lock on zk.
16812
16813
16814 ---
16815
16816 * [HBASE-3727](https://issues.apache.org/jira/browse/HBASE-3727) | *Minor* | **MultiHFileOutputFormat**
16817
16818 MultiHFileOutputFormat support output of HFiles from multiple tables. It will output directories and hfiles as follow,
16819      --table1
16820        --family1
16821        --family2
16822          --Hfiles
16823      --table2
16824        --family3
16825          --hfiles
16826        --family4
16827
16828 family directory and its hfiles match the output of HFileOutputFormat2
16829
16830
16831 ---
16832
16833 * [HBASE-16231](https://issues.apache.org/jira/browse/HBASE-16231) | *Major* | **Integration tests should support client keytab login for secure clusters**
16834
16835 Prior to this change, the integration test clients (IntegrationTest\*) relied on the Kerberos credential cache for authentication against secured clusters.  This could lead to the tests failing due to authentication failures when the tickets in the credential cache expired.  With this change, the integration test clients will make use of the configuration properties for "hbase.client.keytab.file" and "hbase.client.kerberos.principal", when available.  This will perform a login from the configured keytab file and automatically refresh the credentials in the background for the process lifetime.
16836
16837
16838 ---
16839
16840 * [HBASE-13823](https://issues.apache.org/jira/browse/HBASE-13823) | *Major* | **Procedure V2: unnecessaery operations on AssignmentManager#recoverTableInDisablingState() and recoverTableInEnablingState()**
16841
16842 For cluster upgraded from 1.0.x or older releases, master startup would not continue the in-progress enable/disable table process.  If orphaned znode with ENABLING/DISABLING state exists in the cluster, run hbck or manually fix the issue.
16843
16844 For new cluster or cluster upgraded from 1.1.x and newer release, there is no issue to worry about.
16845
16846
16847 ---
16848
16849 * [HBASE-16095](https://issues.apache.org/jira/browse/HBASE-16095) | *Major* | **Add priority to TableDescriptor and priority region open thread pool**
16850
16851 Adds a PRIORITY property to the HTableDescriptor. PRIORITY should be in the same range as the RpcScheduler defines it (HConstants.XXX\_QOS).
16852
16853 Table priorities are only used for region opening for now. There can be other uses later (like RpcScheduling).
16854
16855 Regions of high priority tables (priority \>= than HIGH\_QOS) are opened from a different thread pool than the regular region open thread pool. However, table priorities are not used as a global order for region assigning or opening.
16856
16857
16858 ---
16859
16860 * [HBASE-16081](https://issues.apache.org/jira/browse/HBASE-16081) | *Blocker* | **Replication remove\_peer gets stuck and blocks WAL rolling**
16861
16862 When a replication endpoint is sent a shutdown request by the replication source in situations like removing a peer, we now try to gracefully shut it down by draining the items already sent for replication to the peer cluster. If the drain does not complete in the specified time (hbase.rpc.timeout \* replication.source.maxterminationmultiplier), the regionserver is aborted to avoid blocking the WAL roll.
16863
16864
16865 ---
16866
16867 * [HBASE-16087](https://issues.apache.org/jira/browse/HBASE-16087) | *Major* | **Replication shouldn't start on a master if if only hosts system tables**
16868
16869 Masters will no longer start any replication threads if they are hosting only system tables.
16870
16871 In order to change this add something to the config for tables on master that doesn't start with "hbase:" ( Replicating system tables is something that's currently unsupported and can open up security holes, so do this at your own peril)
16872
16873
16874 ---
16875
16876 * [HBASE-14548](https://issues.apache.org/jira/browse/HBASE-14548) | *Major* | **Expand how table coprocessor jar and dependency path can be specified**
16877
16878 Allow a directory containing the jars or some wildcards to be specified, such as: hdfs://namenode:port/user/hadoop-user/
16879 or
16880 hdfs://namenode:port/user/hadoop-user/\*.jar
16881
16882 Please note that if a directory is specified, all jar files(.jar) directly in the directory are added, but it does not search files in the subtree rooted in the directory.
16883 Do not contain any wildcard if you would like to specify a directory.
16884
16885
16886 ---
16887
16888 * [HBASE-15925](https://issues.apache.org/jira/browse/HBASE-15925) | *Blocker* | **compat-module maven variable not evaluated**
16889
16890 Downstream users of HBase dependencies that do not properly activate Maven profiles should now see a correct transitive dependency on the default hadoop-compatibility-module.
16891
16892
16893 ---
16894
16895 * [HBASE-16140](https://issues.apache.org/jira/browse/HBASE-16140) | *Major* | **bump owasp.esapi from 2.1.0 to 2.1.0.1**
16896
16897 The dependency owasp.esapi had a compatible change from 2.1.0 to 2.1.0.1. As a result, the transitive dependency commons-fileupload had a change from 1.2 to 1.3.1, which has some minor class changes that impact binary compatibility. Interested users should check the release notes of commons-fileupload to see if any of the incompatible changes impact them.
16898
16899 http://commons.apache.org/proper/commons-fileupload/changes-report.html
16900
16901
16902 ---
16903
16904 * [HBASE-16147](https://issues.apache.org/jira/browse/HBASE-16147) | *Major* | **Shell command for getting compaction state**
16905
16906 compaction\_state shell command would return compaction state in String form:
16907 NONE, MINOR, MAJOR, MAJOR\_AND\_MINOR
16908
16909
16910 ---
16911
16912 * [HBASE-14878](https://issues.apache.org/jira/browse/HBASE-14878) | *Major* | **maven archetype: client application with shaded jars**
16913
16914 Adds new hbase-shaded-client archetype; also corrects an omission found in hbase-archetypes/README.md in the section headed "How to add a new archetype".
16915
16916
16917 ---
16918
16919 * [HBASE-14877](https://issues.apache.org/jira/browse/HBASE-14877) | *Major* | **maven archetype: client application**
16920
16921 This patch introduces a new infrastructure for creation and maintenance of Maven archetypes in the context of the hbase project, and it also introduces the first archetype, which end-users may utilize to generate a simple hbase-client dependent project.
16922
16923 NOTE that this patch should introduce two new WARNINGs ("Using platform encoding ... to copy filtered resources") into the hbase install process. These warnings are hard-wired into the maven-archetype-plugin:create-from-project goal. See hbase/hbase-archetypes/README.md, footnote [6] for details.
16924
16925 After applying the patch, see hbase/hbase-archetypes/README.md for details regarding the new archetype infrastructure introduced by this patch. (The README text is also conveniently positioned at the top of the patch itself.)
16926
16927 Here is the opening paragraph of the README.md file:
16928 =================
16929 The hbase-archetypes subproject of hbase provides an infrastructure for creation and maintenance of Maven archetypes pertinent to HBase. Upon deployment to the archetype catalog of the central Maven repository, these archetypes may be used by end-user developers to autogenerate completely configured Maven projects (including fully-functioning sample code) through invocation of the archetype:generate goal of the maven-archetype-plugin.
16930 ========
16931 The README.md file also contains several paragraphs under the heading, "Notes for contributors and committers to the HBase project", which explains the layout of 'hbase-archetypes', and how archetypes are created and installed into the local Maven repository, ready for deployment to the central Maven repository. It also outlines how new archetypes may be developed and added to the collection in the future.
16932
16933
16934 ---
16935
16936 * [HBASE-15977](https://issues.apache.org/jira/browse/HBASE-15977) | *Major* | **Failed variable substitution on home page**
16937
16938 Done. Thanks, Dima, Andrew!
16939
16940
16941 ---
16942
16943 * [HBASE-5291](https://issues.apache.org/jira/browse/HBASE-5291) | *Major* | **Add Kerberos HTTP SPNEGO authentication support to HBase web consoles**
16944
16945 HBase Web UIs can be secured from general public access using SPNEGO to require a valid Kerberos ticket.
16946
16947 Setting 'hbase.security.authentication.ui' to 'kerberos' in hbase-site.xml is a global switch to have all Web UIs allow only authenticated clients via Kerberos. 'hbase.security.authentication.spnego.kerberos.principal' and 'hbase.security.authentication.spnego.kerberos.keytab' are two other required properties in hbase-site.xml, the Kerberos principal and keytab to use for the server to use to log in. The primary in the Kerberos principal must be 'HTTP' as required by the SPNEGO mechanism, e.g. 'HTTP/host.domain.com@DOMAIN.COM'.
16948
16949
16950 ---
16951
16952 * [HBASE-15950](https://issues.apache.org/jira/browse/HBASE-15950) | *Major* | **Fix memstore size estimates to be more tighter**
16953
16954 The estimates of heap usage by the memstore objects (KeyValue, object and array header sizes, etc) have been made more accurate for heap sizes up to 32G (using CompressedOops), resulting in them dropping by 10-50% in practice. This also results in less number of flushes and compactions due to "fatter" flushes. YMMV. As a result, the actual heap usage of the memstore before being flushed may increase by up to 100%. If configured memory limits for the region server had been tuned based on observed usage, this change could result in worse GC behavior or even OutOfMemory errors. Set the environment property (not hbase-site.xml) "hbase.memorylayout.use.unsafe" to false to disable.
16955
16956
16957 ---
16958
16959 * [HBASE-16023](https://issues.apache.org/jira/browse/HBASE-16023) | *Major* | **Fastpath for the FIFO rpcscheduler**
16960
16961 Adds a 'fastpath' when using the default FIFO rpc scheduler ('fifo'). Does direct handoff from Reader thread to Handler if there is one ready and willing. Will shine best when high random read workload (YCSB workloadc for instance)
16962
16963
16964 ---
16965
16966 * [HBASE-15971](https://issues.apache.org/jira/browse/HBASE-15971) | *Critical* | **Regression: Random Read/WorkloadC slower in 1.x than 0.98**
16967
16968 Change the default rpc scheduler from 'deadline' to 'fifo' instead so it is the same as in branch 0.98. 'deadline' was of questionable benefit but with a high cost scheduling. To re-enable 'deadline', set hbase.ipc.server.callqueue.type to 'deadline' in your hbase-site.xml.
16969
16970
16971 ---
16972
16973 * [HBASE-15525](https://issues.apache.org/jira/browse/HBASE-15525) | *Critical* | **OutOfMemory could occur when using BoundedByteBufferPool during RPC bursts**
16974
16975 Added a new ByteBufferPool which pools N ByteBuffers. By default it makes off heap ByteBuffers when getBuffer() is called. The size of each buffer defaults to 64KB. This can be configured using 'hbase.ipc.server.reservoir.initial.buffer.size'.   The max number of buffers which can be pooled defaults to twice the number of handler threads in RS. This can be configured with key 'hbase.ipc.server.reservoir.initial.max'.  While responding to read requests and client support Codec, we will create CellBlocks and directly return it as PB payload. For making this block, we will use N ByteBuffers from pool as per the total size of the response cells. The default size of 64 KB for the buffer is inline with the number of bytes written to RPC layer in one short.(That is also 64KB).  When at point of time, the calle not able to get a free buffer from the pool (it returns null then), it will make on heap Buffer of same size (as that of Buffers in pool) and use that to create cell block.
16976
16977
16978 ---
16979
16980 * [HBASE-15994](https://issues.apache.org/jira/browse/HBASE-15994) | *Major* | **Allow selection of RpcSchedulers**
16981
16982 Adds a FifoRpcSchedulerFactory so you can try the FifoRpcScheduler by setting  "hbase.region.server.rpc.scheduler.factory.class"
16983
16984
16985 ---
16986
16987 * [HBASE-15989](https://issues.apache.org/jira/browse/HBASE-15989) | *Major* | **Remove hbase.online.schema.update.enable**
16988
16989 Removes the "hbase.online.schema.update.enable" property.
16990 from now, every operation that alter the schema (e.g. modifyTable, addFamily, removeFamily, ...) will use the online schema update. there is no need to disable/enable the table.
16991
16992
16993 ---
16994
16995 * [HBASE-15981](https://issues.apache.org/jira/browse/HBASE-15981) | *Minor* | **Stripe and Date-tiered compactions inaccurately suggest disabling table in docs**
16996
16997 Removes reference to disabling table in docs for stripe and date-tiered compactions
16998
16999
17000 ---
17001
17002 * [HBASE-15931](https://issues.apache.org/jira/browse/HBASE-15931) | *Critical* | **Add log for long-running tasks in AsyncProcess**
17003
17004 After HBASE-15931, we will log more details for long-running tasks in AsyncProcess#waitForMaximumCurrentTasks every 10 seconds, including:
17005 1. Table name will be included in the tasks status log
17006 2. On which regionserver(s) the tasks are runnning will be logged when less than hbase.client.threshold.log.details tasks left, by default 10.
17007 3. Against which regions the tasks are running will be logged when less than 2 tasks left.
17008
17009
17010 ---
17011
17012 * [HBASE-15907](https://issues.apache.org/jira/browse/HBASE-15907) | *Major* | **Missing documentation of create table split options**
17013
17014 documentation changes only - added section to Shell tricks and cross reference from region splitting section
17015
17016
17017 ---
17018
17019 * [HBASE-15915](https://issues.apache.org/jira/browse/HBASE-15915) | *Major* | **Set timeouts on hanging tests**
17020
17021 Use @ClassRule to set timeout on test case level (instead of @Rule which sets timeout for the test methods). CategoryBasedTimeout.forClass(..) determines the timeout value based on category annotation (small/medium/large) on the test case.
17022
17023
17024 ---
17025
17026 * [HBASE-15875](https://issues.apache.org/jira/browse/HBASE-15875) | *Major* | **Remove HTable references and HTableInterface**
17027
17028 **WARNING: No release note provided for this change.**
17029
17030
17031 ---
17032
17033 * [HBASE-15610](https://issues.apache.org/jira/browse/HBASE-15610) | *Blocker* | **Remove deprecated HConnection for 2.0 thus removing all PB references for 2.0**
17034
17035 **WARNING: No release note provided for this change.**
17036
17037
17038 ---
17039
17040 * [HBASE-15890](https://issues.apache.org/jira/browse/HBASE-15890) | *Major* | **Allow thrift to set/unset "cacheBlocks" for Scans**
17041
17042 Adds cacheBlocks to Scan
17043
17044
17045 ---
17046
17047 * [HBASE-15876](https://issues.apache.org/jira/browse/HBASE-15876) | *Blocker* | **Remove doBulkLoad(Path hfofDir, final HTable table) though it has not been through a full deprecation cycle**
17048
17049 Removes a doBulkLoad method though it has not been through a full deprecation cycle (but it is 'damaged' because it has a parameter that has been properly deprecated). Use the alternative {code}public void doBulkLoad(Path hfofDir, final Admin admin, Table table, RegionLocator regionLocator){code}
17050
17051 See http://mail-archives.apache.org/mod\_mbox/hbase-dev/201605.mbox/%3CCAMUu0w-ZiLoLBLO3D76=n3AjUr=VMtTUeYA28weLHYeq8+e3bQ@mail.gmail.com%3E for NOTICE on this 'premature' removal.
17052
17053
17054 ---
17055
17056 * [HBASE-15228](https://issues.apache.org/jira/browse/HBASE-15228) | *Major* | **Add the methods to RegionObserver to trigger start/complete restoring WALs**
17057
17058 Added two hooks around WAL restore.
17059 preReplayWALs(final ObserverContext\<? extends RegionCoprocessorEnvironment\> ctx,  HRegionInfo info, Path edits)
17060 and
17061 postReplayWALs(final ObserverContext\<? extends RegionCoprocessorEnvironment\> ctx,  HRegionInfo info, Path edits)
17062
17063 Will be called at start and end of restore of a WAL file.
17064 The other hook around WAL restore (preWALRestore ) will be called before restore of every entry within the WAL file.
17065
17066
17067 ---
17068
17069 * [HBASE-15856](https://issues.apache.org/jira/browse/HBASE-15856) | *Critical* | **Cached Connection instances can wind up with addresses never resolved**
17070
17071 During periods where DNS resolution was not available or not working correctly, we could previously cache unresolved hostnames forever, in some cases preventing further connections to these hosts even when DNS service was restored.  With this change, unresolved hostnames will no longer be cached, and will instead throw an UnknownHostException during connection setup.
17072
17073
17074 ---
17075
17076 * [HBASE-15593](https://issues.apache.org/jira/browse/HBASE-15593) | *Major* | **Time limit of scanning should be offered by client**
17077
17078 Add a new configuration: hbase.ipc.min.client.request.timeout
17079 Minimum allowable timeout (in milliseconds) in rpc request's header. This configuration exists to prevent the rpc service regarding this request as timeout immediately.
17080
17081
17082 ---
17083
17084 * [HBASE-15784](https://issues.apache.org/jira/browse/HBASE-15784) | *Major* | **Misuse core/maxPoolSize of LinkedBlockingQueue in ThreadPoolExecutor**
17085
17086 The core pool size and max pool size of ThreadPoolExecutor should be the same when LinkedBlockingQueue is used. Thus the configurations hbase.hconnection.threads.max, hbase.hconnection.meta.lookup.threads.max, hbase.region.replica.replication.threads.max and hbase.multihconnection.threads.max are used as the number of the core threads, and the related configurations \*.thread.core are not used any more.
17087
17088
17089 ---
17090
17091 * [HBASE-15651](https://issues.apache.org/jira/browse/HBASE-15651) | *Major* | **Add report-flakies.py to use jenkins api to get failing tests**
17092
17093 To find recent set of flakies, run the script added by this patch. Run it to get usage information passing -h:
17094
17095 {code}
17096 $ ./dev-support/report-flakies.py -h
17097 {code}
17098
17099 If you get the below:
17100
17101 {code}
17102 $ python ./dev-support/report-flakies.py
17103 Traceback (most recent call last):
17104   File "./dev-support/report-flakies.py", line 25, in \<module\>
17105     import requests
17106 ImportError: No module named requests
17107 {code}
17108
17109 ... install the requests module:
17110
17111 {code}
17112 $ sudo pip install requests
17113 {code}
17114
17115
17116 ---
17117
17118 * [HBASE-15780](https://issues.apache.org/jira/browse/HBASE-15780) | *Critical* | **Expose AuthUtil as IA.Public**
17119
17120 Downstream users with long lived applications that need to communicate with secure HBase instances can now rely on the AuthUtil class to handle authenticating via keytab.
17121
17122 For more information, see the javadoc for the org.apache.hadoop.hbase.AuthUtil class.
17123
17124
17125 ---
17126
17127 * [HBASE-15811](https://issues.apache.org/jira/browse/HBASE-15811) | *Blocker* | **Batch Get after batch Put does not fetch all Cells**
17128
17129 We were not waiting on all executors in a batch to complete which meant a read-your-own-writes could sometimes fail -- especially if client is loaded; i.e. putting to multiple machines in a cluster. The test for no-more-executors was damaged by the 0.99/0.98.4 fix "HBASE-11403 Fix race conditions around Object#notify"
17130
17131
17132 ---
17133
17134 * [HBASE-15801](https://issues.apache.org/jira/browse/HBASE-15801) | *Major* | **Upgrade checkstyle for all branches**
17135
17136 All active branches now use maven-checkstyle-plugin 2.17 and checkstyle 6.18.
17137
17138
17139 ---
17140
17141 * [HBASE-15236](https://issues.apache.org/jira/browse/HBASE-15236) | *Major* | **Inconsistent cell reads over multiple bulk-loaded HFiles**
17142
17143 This jira fixes that following bug:
17144 During bulkloading, if there are multiple hfiles corresponding to same region, and if they have same timestamps (which may have been set using importtsv.timestamp) and duplicate keys across them, then get and scan may return values coming from different hfiles.
17145
17146
17147 ---
17148
17149 * [HBASE-15740](https://issues.apache.org/jira/browse/HBASE-15740) | *Major* | **Replication source.shippedKBs metric is undercounting because it is in KB**
17150
17151 Removed Replication source.shippedKBs metric in favor of source.shippedBytes
17152
17153
17154 ---
17155
17156 * [HBASE-15773](https://issues.apache.org/jira/browse/HBASE-15773) | *Major* | **CellCounter improvements**
17157
17158 The CellCounter map reduce job now supports additional configuration options on the Scan instance it creates, using the org.apache.hadoop.hbase.mapreduce.TableInputFormat defined property names.  For a full list of the options, run ./hbase org.apache.hadoop.hbase.mapreduce.CellCounter with no arguments.
17159
17160 CellCounter also no longer creates job counters for per-rowkey and per-rowkey/qualifier cell counts.  For most tables, these counters would cause the job to fail due to mapreduce job counter limits.
17161
17162
17163 ---
17164
17165 * [HBASE-15759](https://issues.apache.org/jira/browse/HBASE-15759) | *Minor* | **RegionObserver.preStoreScannerOpen() doesn't have acces to current readpoint**
17166
17167 The following RegionObserver method is deprecated and would no longer be called in hbase 2.0:
17168
17169   public KeyValueScanner preStoreScannerOpen(final ObserverContext\<RegionCoprocessorEnvironment\> c,
17170       final Store store, final Scan scan, final NavigableSet\<byte[]\> targetCols,
17171       final KeyValueScanner s) throws IOException {
17172
17173 Instead, override this method:
17174
17175   public KeyValueScanner preStoreScannerOpen(final ObserverContext\<RegionCoprocessorEnvironment\> c,
17176       final Store store, final Scan scan, final NavigableSet\<byte[]\> targetCols,
17177       final KeyValueScanner s, final long readPt) throws IOException {
17178
17179
17180 ---
17181
17182 * [HBASE-15743](https://issues.apache.org/jira/browse/HBASE-15743) | *Major* | **Add Transparent Data Encryption support for FanOutOneBlockAsyncDFSOutput**
17183
17184 Now the AsyncFSWAL can write data to a encryption zone on HDFS.
17185
17186
17187 ---
17188
17189 * [HBASE-15767](https://issues.apache.org/jira/browse/HBASE-15767) | *Major* | **Upgrade httpclient dependency**
17190
17191 HBase now relies on version 4.3.6 of the Apache Commons HTTPClient library. Downstream users who are exposed to it via the HBase classpath will have to similarly update their dependency.
17192
17193
17194 ---
17195
17196 * [HBASE-15575](https://issues.apache.org/jira/browse/HBASE-15575) | *Minor* | **Rename table DDL \*Handler methods in MasterObserver to more meaningful names**
17197
17198 **WARNING: No release note provided for this change.**
17199
17200
17201 ---
17202
17203 * [HBASE-15720](https://issues.apache.org/jira/browse/HBASE-15720) | *Major* | **Print row locks at the debug dump page**
17204
17205 Adds a section to the debug dump page listing current row locks held.
17206
17207
17208 ---
17209
17210 * [HBASE-15703](https://issues.apache.org/jira/browse/HBASE-15703) | *Critical* | **Deadline scheduler needs to return to the client info about skipped calls, not just drop them**
17211
17212 With previous deadline mode of RPC scheduling (the implementation in SimpleRpcScheduler, which is basically a FIFO except that long-running scans are de-prioritized) and FIFO-based RPC scheduler clients are getting CallQueueTooBigException when RPC call queue is full.
17213
17214 With this patch and when hbase.ipc.server.callqueue.type property is set to "codel" mode, clients will also be getting CallDroppedException, which means that the request was discarded by the server as it considers itself to be overloaded and starts to drop requests to avoid going down under the load. The clients will retry upon receiving this exception. It doesn't clear MetaCache with region locations.
17215
17216
17217 ---
17218
17219 * [HBASE-15281](https://issues.apache.org/jira/browse/HBASE-15281) | *Major* | **Allow the FileSystem inside HFileSystem to be wrapped**
17220
17221 This patch adds new configuration property - hbase.fs.wrapper. If provided, it should be fully qualified class name of the class used as a pluggable wrapper for HFileSystem. This may be useful for specific debugging/tracing needs.
17222
17223
17224 ---
17225
17226 * [HBASE-15551](https://issues.apache.org/jira/browse/HBASE-15551) | *Minor* | **Make call queue too big exception use servername**
17227
17228 Fixes issue when CallQueueTooBig exception returned to the client could print useless address info (like 0.0.0.0) if RPC server is listening on something other than the host name, making troubleshooting inconvenient.
17229
17230
17231 ---
17232
17233 * [HBASE-15711](https://issues.apache.org/jira/browse/HBASE-15711) | *Major* | **Add client side property to allow logging details for batch errors**
17234
17235 In HBASE-15711 a new client side property hbase.client.log.batcherrors.details is introduced to allow logging full stacktrace of exceptions for batch error. It's disabled by default and set the property to true will enable it.
17236
17237
17238 ---
17239
17240 * [HBASE-15686](https://issues.apache.org/jira/browse/HBASE-15686) | *Major* | **Add override mechanism for the exempt classes when dynamically loading table coprocessor**
17241
17242 New coprocessor table descriptor attribute, hbase.coprocessor.classloader.included.classes, is added.
17243 User can specify class name prefixes (semicolon separated) which should be loaded by CoprocessorClassLoader through this attribute using the following syntax:
17244 {code}
17245   hbase\> alter 't1',    'coprocessor'=\>'hdfs:///foo.jar\|com.foo.FooRegionObserver\|1001\|arg1=1,arg2=2'
17246 {code}
17247
17248
17249 ---
17250
17251 * [HBASE-15645](https://issues.apache.org/jira/browse/HBASE-15645) | *Critical* | **hbase.rpc.timeout is not used in operations of HTable**
17252
17253 Fixes regression where hbase.rpc.timeout configuration was ignored in branch-1.0+
17254
17255 Adds new methods setOperationTimeout, getOperationTimeout, setRpcTimeout, and getRpcTimeout to Table. In branch-1.3+ they are public interfaces and in 1.0-1.2 they are labeled as @InterfaceAudience.Private.
17256
17257 Adds hbase.client.operation.timeout to hbase-default.xml with default of 1200000
17258
17259
17260 ---
17261
17262 * [HBASE-15477](https://issues.apache.org/jira/browse/HBASE-15477) | *Major* | **Do not save 'next block header' when we cache hfileblocks**
17263
17264 Fix over-persisting in blockcache; no longer save the block PLUS the header of the next block (33 bytes) when writing the cache.
17265
17266 Also removes support for hfileblock v1; hfile block v1 was used writing hfile v1. hfile v1 was the default in hbase before hbase-0.92. hbase.96 would not start unless all v1 hfiles had been compacted out of the cluster.
17267
17268
17269 ---
17270
17271 * [HBASE-15628](https://issues.apache.org/jira/browse/HBASE-15628) | *Major* | **Implement an AsyncOutputStream which can work with any FileSystem implementation**
17272
17273 Introduce an AsyncFSOutput interface which is an abstraction of the original FanOutOneBlockAsyncDFSOutput. Now you can create AsyncFSOutput on any FileSystem using the method AsyncFSOutputHelper.createOutput. The returned AsyncFSOutput will be FanOutOneBlockAsyncDFSOutput if the given FileSystem is a DistributedFileSystem.
17274
17275
17276 ---
17277
17278 * [HBASE-15392](https://issues.apache.org/jira/browse/HBASE-15392) | *Major* | **Single Cell Get reads two HFileBlocks**
17279
17280 When an explicit Get with a one or more columns specified, we at a minimum, were overseeking, reading until we tripped over the next row, regardless, and only then returning. If the next row was in-block, we'd just do too much seeking but if the next row was in the next (or in the next block beyond that), we would keep seeking and loading blocks until we found the next row before we'd return.
17281
17282 There remains one case where we will still 'overread'. It is when the row end aligns with the end of the block. In this case we will load the next block just to find that there are no more cells in the current row. See HBASE-15457.
17283
17284
17285 ---
17286
17287 * [HBASE-15671](https://issues.apache.org/jira/browse/HBASE-15671) | *Major* | **Add per-table metrics on memstore, storefile and regionsize**
17288
17289 Adds storeFileSize, memstoreSize and tableSize to the per-table metrics.
17290
17291
17292 ---
17293
17294 * [HBASE-15366](https://issues.apache.org/jira/browse/HBASE-15366) | *Major* | **Add doc, trace-level logging, and test around hfileblock**
17295
17296 No functional change. Added javadoc, comments, and extra trace-level logging to make clear what is happening around the reading and caching of hfile blocks.
17297
17298
17299 ---
17300
17301 * [HBASE-15368](https://issues.apache.org/jira/browse/HBASE-15368) | *Major* | **Add pluggable window support**
17302
17303 Use 'hbase.hstore.compaction.date.tiered.window.factory.class' to specify the window implementation you like for date tiered compaction. Now the only and default implementation is org.apache.hadoop.hbase.regionserver.compactions.ExponentialCompactionWindowFactory.
17304
17305 {code}
17306 \<property\>
17307 \<name\>hbase.hstore.compaction.date.tiered.window.factory.class\</name\>
17308 \<value\>org.apache.hadoop.hbase.regionserver.compactions.ExponentialCompactionWindowFactory\</value\>
17309 \</property\>
17310 \<property\>
17311 {code}
17312
17313
17314 ---
17315
17316 * [HBASE-15518](https://issues.apache.org/jira/browse/HBASE-15518) | *Major* | **Add Per-Table metrics back**
17317
17318 Adds per-table metrics aggregated from per-region metrics in region server metrics. New metrics are available under JMX section "Hadoop:service=HBase,name=RegionServer,sub=Tables" and they are available via hadoop metrics2 collectors.
17319
17320
17321 ---
17322
17323 * [HBASE-15640](https://issues.apache.org/jira/browse/HBASE-15640) | *Major* | **L1 cache doesn't give fair warning that it is showing partial stats only when it hits limit**
17324
17325 The blockcache UI tab would stop refreshing at 100k blocks (configurable, see "hbase.ui.blockcache.by.file.max"), which isn't very many blocks when doing a big cache, giving a misleading picture of the content of L1 and/or L2 cache. Up the default limit to 1M blocks (UI takes a while but just a few seconds counting over 1M blocks).
17326
17327 Also, when beyond the limit give the user a noticeable WARNING in the UI.
17328
17329
17330 ---
17331
17332 * [HBASE-15386](https://issues.apache.org/jira/browse/HBASE-15386) | *Major* | **PREFETCH\_BLOCKS\_ON\_OPEN in HColumnDescriptor is ignored**
17333
17334 This was a non-issue. The PREFETCH\_... flag actually works. While here though made the following additions.
17335
17336 Changes the prefetch TRACE-level loggings to include the word 'Prefetch' in them so you know what they are about.
17337
17338 Changes the cryptic logging of the CacheConfig#toString to have some preamble saying why and what column family is responsible (helps figure what is going on)
17339
17340 Add test that verifies setting flag on HColumnDescriptor actually works.
17341
17342
17343 ---
17344
17345 * [HBASE-13372](https://issues.apache.org/jira/browse/HBASE-13372) | *Major* | **Unit tests for SplitTransaction and RegionMergeTransaction listeners**
17346
17347 HBASE-13372 Add unit tests for SplitTransaction and RegionMergeTransaction listeners
17348
17349
17350 ---
17351
17352 * [HBASE-15187](https://issues.apache.org/jira/browse/HBASE-15187) | *Major* | **Integrate CSRF prevention filter to REST gateway**
17353
17354 Protection against CSRF attack can be turned on with config parameter, hbase.rest.csrf.enabled - default value is false.
17355
17356 The custom header to be sent can be changed via config parameter, hbase.rest.csrf.custom.header whose default value is "X-XSRF-HEADER".
17357
17358 Config parameter, hbase.rest.csrf.methods.to.ignore , controls which HTTP methods are not associated with customer header check.
17359
17360 Config parameter, hbase.rest-csrf.browser-useragents-regex , is a comma-separated list of regular expressions used to match against an HTTP request's User-Agent header when protection against cross-site request forgery (CSRF) is enabled for REST server by setting hbase.rest.csrf.enabled to true.
17361
17362 The implementation came from hadoop/hadoop-common-project/hadoop-common/src/main/java/org/apache/hadoop/security/http/RestCsrfPreventionFilter.java
17363
17364 We should periodically update the RestCsrfPreventionFilter.java in hbase codebase to include fixes to the hadoop implementation.
17365
17366
17367 ---
17368
17369 * [HBASE-15481](https://issues.apache.org/jira/browse/HBASE-15481) | *Trivial* | **Add pre/post roll to WALObserver**
17370
17371 <!-- markdown -->
17372
17373
17374 WALObserver coprocessors now can receive notifications of WAL rolling via the new methods `preWALRoll` and `postWALRoll`.
17375
17376 This change is incompatible due to the addition of these methods to the `WALObserver` interface. Downstream users are encouraged to instead extend the `BaseWALObserver` class, which remains compatible through this change.
17377
17378
17379 ---
17380
17381 * [HBASE-15507](https://issues.apache.org/jira/browse/HBASE-15507) | *Major* | **Online modification of enabled ReplicationPeerConfig**
17382
17383 Added update\_peer\_config to the HBase shell and ReplicationAdmin, and provided a callback for custom replication endpoints to be notified of changes to their configuration and peer data
17384
17385
17386 ---
17387
17388 * [HBASE-15537](https://issues.apache.org/jira/browse/HBASE-15537) | *Major* | **Make multi WAL work with WALs other than FSHLog**
17389
17390 Add the delegate config for multiwal back. Now you can use 'hbase.wal.regiongrouping.delegate.provider' to specify the wal provider you want to use for multiwal. For example:
17391 {code}
17392 \<property\>
17393 \<name\>hbase.wal.regiongrouping.delegate.provider\</name\>
17394 \<value\>asyncfs\</value\>
17395 \</property\>
17396 {code}
17397 And the default value is filesystem which is the alias of DefaultWALProvider, i.e., the FSHLog.
17398
17399
17400 ---
17401
17402 * [HBASE-15400](https://issues.apache.org/jira/browse/HBASE-15400) | *Major* | **Use DateTieredCompactor for Date Tiered Compaction**
17403
17404 With this patch combined with HBASE-15389, when we compact, we can output multiple files along the current window boundaries. There are two use cases:
17405 1. Major compaction: We want to output date tiered store files with data older than max age archived in trunks of the window size on the higher tier. Once a window is old enough, we don't combine the windows to promote to the next tier any further. So files in these windows retain the same timespan as they were minor-compacted last time, which is the window size of the highest tier. Major compaction will touch these files and we want to maintain the same layout. This way, TTL and archiving will be simpler and more efficient.
17406 2. Bulk load files and the old file generated by major compaction before upgrading to DTCP.
17407
17408 This will change the way to enable date tiered compaction.
17409 To turn it on:
17410 hbase.hstore.engine.class: org.apache.hadoop.hbase.regionserver.DateTieredStoreEngine
17411
17412 With tiered compaction all servers in the cluster will promote windows to higher tier at the same time, so using a compaction throttle is recommended:
17413 hbase.regionserver.throughput.controller:org.apache.hadoop.hbase.regionserver.compactions.PressureAwareCompactionThroughputController
17414 hbase.hstore.compaction.throughput.higher.bound and hbase.hstore.compaction.throughput.lower.bound need to be set for desired throughput range as uncompressed rates.
17415
17416 Because there will most likely be more store files around, we need to adjust the configuration so that flush won't be blocked and compaction will be properly throttled:
17417 hbase.hstore.blockingStoreFiles: change to 50 if using all default parameters when turning on date tiered compaction. Use 1.5~2 x projected file count if changing the parameters, Projected file count = windows per tier x tier count + incoming window min + files older than max age
17418
17419 Because major compaction is turned on now, we also need to adjust the configuration for max file to compact according to the larger file count:
17420 hbase.hstore.compaction.max: set to the same number as hbase.hstore.blockingStoreFiles.
17421
17422 For more details, please refer to the design spec at https://docs.google.com/document/d/1\_AmlNb2N8Us1xICsTeGDLKIqL6T-oHoRLZ323MG\_uy8/edit#
17423
17424
17425 ---
17426
17427 * [HBASE-15592](https://issues.apache.org/jira/browse/HBASE-15592) | *Major* | **Print Procedure WAL content**
17428
17429 Use hbase org.apache.hadoop.hbase.procedure2.store.wal.ProcedureWALPrettyPrinter
17430 to print the content of a Procedure WAL.
17431 e.g.
17432 hbase org.apache.hadoop.hbase.procedure2.store.wal.ProcedureWALPrettyPrinter -f /hbase/MasterProcWALs/state-00000000000000002571.log
17433
17434
17435 ---
17436
17437 * [HBASE-15396](https://issues.apache.org/jira/browse/HBASE-15396) | *Minor* | **Enhance mapreduce.TableSplit to add encoded region name**
17438
17439 To aid troubleshooting of MapReduce job that rely on the HBase provided input format, splits now include the encoded region name they cover.
17440
17441
17442 ---
17443
17444 * [HBASE-15568](https://issues.apache.org/jira/browse/HBASE-15568) | *Major* | **Procedure V2 - Remove CreateTableHandler in HBase Apache 2.0 release**
17445
17446 **WARNING: No release note provided for this change.**
17447
17448
17449 ---
17450
17451 * [HBASE-15521](https://issues.apache.org/jira/browse/HBASE-15521) | *Major* | **Procedure V2 - RestoreSnapshot and CloneSnapshot**
17452
17453 **WARNING: No release note provided for this change.**
17454
17455
17456 ---
17457
17458 * [HBASE-15538](https://issues.apache.org/jira/browse/HBASE-15538) | *Major* | **Implement secure async protobuf wal writer**
17459
17460 Add the following config in hbase-site.xml if you want to use secure protobuf wal writer together with AsyncFSWAL
17461 {code}
17462 \<property\>
17463 \<name\>hbase.regionserver.hlog.async.writer.impl\</name\>
17464 \<value\>org.apache.hadoop.hbase.regionserver.wal.SecureAsyncProtobufLogWriter\</value\>
17465 \</property\>
17466 \<property\>
17467 {code}
17468
17469
17470 ---
17471
17472 * [HBASE-11393](https://issues.apache.org/jira/browse/HBASE-11393) | *Major* | **Replication TableCfs should be a PB object rather than a string**
17473
17474 **WARNING: No release note provided for this change.**
17475
17476
17477 ---
17478
17479 * [HBASE-15265](https://issues.apache.org/jira/browse/HBASE-15265) | *Major* | **Implement an asynchronous FSHLog**
17480
17481 To enable, set the WALProvider as follows:
17482
17483 {code}
17484 \<property\>
17485 \<name\>hbase.wal.provider\</name\>
17486 \<value\>asyncfs\</value\>
17487 \</property\>
17488 \<property\>
17489 {code}
17490
17491 To check which provider is active, look for the log line:
17492
17493 LOG.info("Instantiating WALProvider of type " + clazz);
17494
17495
17496 ---
17497
17498 * [HBASE-14256](https://issues.apache.org/jira/browse/HBASE-14256) | *Major* | **Flush task message may be confusing when region is recovered**
17499
17500 HBASE-14256 Correct confusing flush task message
17501
17502
17503 ---
17504
17505 * [HBASE-15212](https://issues.apache.org/jira/browse/HBASE-15212) | *Major* | **RPCServer should enforce max request size**
17506
17507 Adds a configuration parameter "hbase.ipc.max.request.size" which defaults to 256MB to protect the server against very large incoming RPC requests. All requests larger than this size will be immediately rejected before allocating any resources (memory allocation, etc).
17508
17509
17510 ---
17511
17512 * [HBASE-15412](https://issues.apache.org/jira/browse/HBASE-15412) | *Major* | **Add average region size metric**
17513
17514 Adds a new metric for called "averageRegionSize" that is emitted as a regionserver metric. Metric description:
17515 Average region size over the region server including memstore and storefile sizes
17516
17517
17518 ---
17519
17520 * [HBASE-15479](https://issues.apache.org/jira/browse/HBASE-15479) | *Major* | **No more garbage or beware of autoboxing**
17521
17522 This fix decreases client's memory allocation during writes by more than 50%.
17523
17524
17525 ---
17526
17527 * [HBASE-15322](https://issues.apache.org/jira/browse/HBASE-15322) | *Critical* | **Operations using Unsafe path broken for platforms not having sun.misc.Unsafe**
17528
17529 **WARNING: No release note provided for this change.**
17530
17531
17532 ---
17533
17534 * [HBASE-12940](https://issues.apache.org/jira/browse/HBASE-12940) | *Major* | **Expose listPeerConfigs and getPeerConfig to the HBase shell**
17535
17536 Adds get\_peer\_config and list\_peer\_configs to the hbase shell.
17537
17538
17539 ---
17540
17541 * [HBASE-15430](https://issues.apache.org/jira/browse/HBASE-15430) | *Critical* | **Failed taking snapshot - Manifest proto-message too large**
17542
17543 Failed taking snapshot - Manifest proto-message too large. add property ("snapshot.manifest.size.limit")  to change max size of proto-message
17544
17545
17546 ---
17547
17548 * [HBASE-15323](https://issues.apache.org/jira/browse/HBASE-15323) | *Major* | **Hbase Rest CheckAndDeleteAPi should be able to delete more cells**
17549
17550 Fixed an issue in REST server checkAndDelete operation where the remaining cells other than the to-be-checked column are also applied in the Delete operation. Also fixed an issue in RemoteHTable where the Delete object was not passed correctly to the REST server side.
17551
17552
17553 ---
17554
17555 * [HBASE-15377](https://issues.apache.org/jira/browse/HBASE-15377) | *Major* | **Per-RS Get metric is time based, per-region metric is size-based**
17556
17557 Per-region metrics related to Get histograms are changed from being response size based into being latency based similar to the per-regionserver metrics of the same name.
17558
17559 Added GetSize histogram metrics at the per-regionserver and per-region level for the response sizes.
17560
17561
17562 ---
17563
17564 * [HBASE-6721](https://issues.apache.org/jira/browse/HBASE-6721) | *Major* | **RegionServer Group based Assignment**
17565
17566 [ADVANCED USERS ONLY] This patch adds a new experimental module hbase-rsgroup. It is an advanced feature for partitioning regionservers into distinctive groups for strict isolation, and should only be used by users who are sophisticated enough to understand the full implications and have a sufficient background in managing HBase clusters.
17567
17568 RSGroups can be defined and managed with shell commands or corresponding Java APIs. A server can be added to a group with hostname and port pair, and tables can be moved to this group so that only regionservers in the same rsgroup can host the regions of the table. RegionServers and tables can only belong to 1 group at a time. By default, all tables and regionservers belong to the "default" group. System tables can also be put into a group using the regular APIs. A custom balancer implementation tracks assignments per rsgroup and makes sure to move regions to the relevant regionservers in that group. The group information is stored in a regular HBase table, and a zookeeper-based read-only cache is used at the cluster bootstrap time.
17569
17570 To enable, add the following to your hbase-site.xml and restart your Master:
17571
17572
17573  \<property\>
17574    \<name\>hbase.coprocessor.master.classes\</name\>
17575    \<value\>org.apache.hadoop.hbase.rsgroup.RSGroupAdminEndpoint\</value\>
17576  \</property\>
17577  \<property\>
17578    \<name\>hbase.master.loadbalancer.class\</name\>
17579    \<value\>org.apache.hadoop.hbase.rsgroup.RSGroupBasedLoadBalancer\</value\>
17580  \</property\>
17581
17582
17583 Then use the shell 'rsgroup' commands to create and manipulate regionserver groups: e.g. to add a group and then add a server to it, do as follows:
17584
17585  hbase(main):008:0\> add\_rsgroup 'my\_group'
17586  Took 0.5610 seconds
17587
17588 This adds a group to the 'hbase:rsgroup' system table. Add a server (hostname + port) to the group using the 'move\_rsgroup\_servers' command as follows:
17589
17590  hbase(main):010:0\> move\_rsgroup\_servers 'my\_group',['k.att.net:51129']
17591
17592
17593 ---
17594
17595 * [HBASE-15435](https://issues.apache.org/jira/browse/HBASE-15435) | *Major* | **Add WAL (in bytes) written metric**
17596
17597 Adds a new metric named "writtenBytes" as a per-regionserver metric. Metric Description:
17598 Size (in bytes) of the data written to the WAL.
17599
17600
17601 ---
17602
17603 * [HBASE-13963](https://issues.apache.org/jira/browse/HBASE-13963) | *Critical* | **avoid leaking jdk.tools**
17604
17605 HBase now ensures that the JDK tools jar used during the build process is not exposed to downstream clients as a transitive dependency of hbase-annotations.
17606
17607 If you need to have the JDK tools jar in your classpath, you should add a system dependency on it. See the hbase-annotations pom for an example of the necessary pom additions.
17608
17609
17610 ---
17611
17612 * [HBASE-15271](https://issues.apache.org/jira/browse/HBASE-15271) | *Major* | **Spark Bulk Load: Need to write HFiles to tmp location then rename to protect from Spark Executor Failures**
17613
17614 When using the bulk load helper provided by the hbase-spark module, output files will now be written into temporary files and only made available when the executor has successfully completed.
17615
17616 Previously, failed executors would leave their files in place in a way that would be picked up by a bulk load command. This caused retried failures to include spurious copies of some cells.
17617
17618
17619 ---
17620
17621 * [HBASE-15364](https://issues.apache.org/jira/browse/HBASE-15364) | *Major* | **Fix unescaped \< characters in Javadoc**
17622
17623 HBASE-15364 Fix unescaped \< and \> characters in Javadoc
17624
17625
17626 ---
17627
17628 * [HBASE-15243](https://issues.apache.org/jira/browse/HBASE-15243) | *Major* | **Utilize the lowest seek value when all Filters in MUST\_PASS\_ONE FilterList return SEEK\_NEXT\_USING\_HINT**
17629
17630 When all filters in a MUST\_PASS\_ONE FilterList return a SEEK\_USING\_NEXT\_HINT code, we return SEEK\_NEXT\_USING\_HINT from the FilterList#filterKeyValue() to utilize the lowest seek value.
17631
17632
17633 ---
17634
17635 * [HBASE-15354](https://issues.apache.org/jira/browse/HBASE-15354) | *Major* | **Use same criteria for clearing meta cache for all operations**
17636
17637 This patch fixes some issues when MetaCache (region location cache) gets unnecessarily dropped on the client.
17638
17639 On master branch we now in RegionServerCallable and RegionServerAdminCallable pass the actual exception down to Connection#updateCachedLocation, so we could check there if the exception is "meta-clearing" or not.
17640
17641 on branch-1, branch-1.2 and branch 1.3 we now check if the exception is meta-clearing or not in AsyncProcess (this check was there on master, but not on earlier branches)
17642
17643
17644 ---
17645
17646 * [HBASE-15376](https://issues.apache.org/jira/browse/HBASE-15376) | *Major* | **ScanNext metric is size-based while every other per-operation metric is time based**
17647
17648 Removed ScanNext histogram metrics as regionserver level and per-region level metrics since the semantics is not compatible with other similar metrics (size histogram vs latency histogram).
17649
17650 Instead, this patch adds ScanTime and ScanSize histogram metrics at the regionserver and per-region level.
17651
17652
17653 ---
17654
17655 * [HBASE-15338](https://issues.apache.org/jira/browse/HBASE-15338) | *Minor* | **Add a option to disable the data block cache for testing the performance of underlying file system**
17656
17657 Add a new config: hbase.block.data.cacheonread, which is a global switch for caching data blocks on read. The default value of this switch is true, and data blocks will be cached on read if the block cache is enabled for the family and cacheBlocks flag is set to be true for get and scan operations. If this global switch is set to false, data blocks won't be cached even if the block cache is enabled for the family and the cacheBlocks flag of Gets or Scans are sets as true. Bloom blocks and index blocks are always be cached if the block cache of the regionserver is enabled. One usage of this switch is for the performance tests for the extreme case that  the cache for data blocks all missed and all data blocks are read from underlying file system.
17658
17659
17660 ---
17661
17662 * [HBASE-15136](https://issues.apache.org/jira/browse/HBASE-15136) | *Critical* | **Explore different queuing behaviors while busy**
17663
17664 Previously RPC request scheduler in HBase had 2 modes in could operate in:
17665
17666  - simple FIFO
17667  - "partial" deadline, where deadline constraints are only imposed on long-running scan requests.
17668
17669 This patch adds new type of scheduler to HBase, based on the research around controlled delay (CoDel) algorithm [1], used in networking to combat bufferbloat, as well as some analysis on generalizing it to generic request queues [2]. The purpose of that work is to prevent long standing call queues caused by discrepancy between request rate and available throughput, caused by kernel/disk IO/networking stalls.
17670
17671 New RPC scheduler could be enabled by setting hbase.ipc.server.callqueue.type=codel in configuration. Several additional params allow to configure algorithm behavior -
17672
17673 hbase.ipc.server.callqueue.codel.target.delay
17674 hbase.ipc.server.callqueue.codel.interval
17675 hbase.ipc.server.callqueue.codel.lifo.threshold
17676
17677 [1] Controlling Queue Delay / A modern AQM is just one piece of the solution to bufferbloat. http://queue.acm.org/detail.cfm?id=2209336
17678 [2] Fail at Scale / Reliability in the face of rapid change. http://queue.acm.org/detail.cfm?id=2839461
17679
17680
17681 ---
17682
17683 * [HBASE-15181](https://issues.apache.org/jira/browse/HBASE-15181) | *Major* | **A simple implementation of date based tiered compaction**
17684
17685 Date tiered compaction policy is a date-aware store file layout that is beneficial for time-range scans for time-series data.
17686
17687 When it performs well:
17688
17689     reads for limited time ranges, especially scans of recent data
17690
17691 When it doesn't perform as well:
17692
17693     random gets without a time range
17694     frequent deletes and updates
17695     out of order data writes, especially writes with timestamps in the future
17696     bulk loads of historical data
17697
17698 Recommended configuration:
17699 To turn on Date Tiered Compaction (It is not recommended to turn on for the whole cluster because that will put meta table on it too and random get on meta table will be impacted):
17700 hbase.hstore.compaction.compaction.policy: org.apache.hadoop.hbase.regionserver.compactions.DateTieredCompactionPolicy
17701
17702 Parameters for Date Tiered Compaction:
17703 hbase.hstore.compaction.date.tiered.max.storefile.age.millis: Files with max-timestamp smaller than this will no longer be compacted.Default at Long.MAX\_VALUE.
17704 hbase.hstore.compaction.date.tiered.base.window.millis: base window size in milliseconds. Default at 6 hours.
17705 hbase.hstore.compaction.date.tiered.windows.per.tier: number of windows per tier. Default at 4.
17706 hbase.hstore.compaction.date.tiered.incoming.window.min: minimal number of files to compact in the incoming window. Set it to expected number of files in the window to avoid wasteful compaction. Default at 6.
17707 hbase.hstore.compaction.date.tiered.window.policy.class: the policy to select store files within the same time window. It doesn’t apply to the incoming window. Default at exploring compaction. This is to avoid wasteful compaction.
17708
17709 With tiered compaction all servers in the cluster will promote windows to higher tier at the same time, so using a compaction throttle is recommended:
17710 hbase.regionserver.throughput.controller:org.apache.hadoop.hbase.regionserver.compactions.PressureAwareCompactionThroughputController
17711
17712 Because there will most likely be more store files around, we need to adjust the configuration so that flush won't be blocked and compaction will be properly throttled:
17713 hbase.hstore.blockingStoreFiles: change to 50 if using all default parameters when turning on date tiered compaction. Use 1.5~2 x projected file count if changing the parameters, Projected file count = windows per tier x tier count + incoming window min + files older than max age
17714
17715 For more details, please refer to the design spec at https://docs.google.com/document/d/1\_AmlNb2N8Us1xICsTeGDLKIqL6T-oHoRLZ323MG\_uy8/edit#
17716
17717
17718 ---
17719
17720 * [HBASE-15290](https://issues.apache.org/jira/browse/HBASE-15290) | *Major* | **Hbase Rest CheckAndAPI should save other cells along with compared cell**
17721
17722 Fixed an issue in REST server checkAndPut operation where the remaining cells other than the to-be-checked column are also applied in the put operation .
17723
17724
17725 ---
17726
17727 * [HBASE-15264](https://issues.apache.org/jira/browse/HBASE-15264) | *Major* | **Implement a fan out HDFS OutputStream**
17728
17729 Implement a fan-out asynchronous DFSOutputStream for implementing new WAL writer.
17730
17731
17732 ---
17733
17734 * [HBASE-13259](https://issues.apache.org/jira/browse/HBASE-13259) | *Critical* | **mmap() based BucketCache IOEngine**
17735
17736 mmap() based bucket cache can be configured by specifying the property
17737 {code}
17738 \<property\>
17739   \<name\>hbase.bucketcache.ioengine\</name\>
17740   \<value\> mmap://filepath \</value\>
17741 \</property\>
17742 {code}
17743 This mode of bucket cache is ideal when your file based bucket cache size is lesser than then available RAM. When the cache is bigger than the available RAM then the kernel page faults will make this cache perform lesser particularly in case of scans.
17744
17745
17746 ---
17747
17748 * [HBASE-11927](https://issues.apache.org/jira/browse/HBASE-11927) | *Major* | **Use Native Hadoop Library for HFile checksum (And flip default from CRC32 to CRC32C)**
17749
17750 Checksumming is cpu intensive. HBase computes additional checksums for HFiles (hdfs does checksums too) and stores them inline with file data. During reading, these checksums are verified to ensure data is not corrupted. This patch tries to use Hadoop Native Library for checksum computation, if it’s available, otherwise falls back to standard Java libraries. Instructions to load NHL in HBase can be found here (http://hbase.apache.org/book.html#hadoop.native.lib).
17751
17752 Default checksum algorithm has been changed from CRC32 to CRC32C primarily because of two reasons: 1) CRC32C has better error detection properties, and 2) New Intel processors have a dedicated instruction for crc32c computation (SSE4.2 instruction set)\*. This change is fully backward compatible. Also, users should not see any differences except decrease in cpu usage. To keep old settings, set configuration ‘hbase.hstore.checksum.algorithm’ to ‘CRC32’.
17753
17754 \* On linux, run 'cat /proc/cpuinfo’ and look for sse4\_2 in list of flags to see if your processor supports SSE4.2.
17755
17756
17757 ---
17758
17759 * [HBASE-15219](https://issues.apache.org/jira/browse/HBASE-15219) | *Critical* | **Canary tool does not return non-zero exit code when one of regions is in stuck state**
17760
17761 A new flag is added for Canary tool: -treatFailureAsError
17762 When this flag is specified, read / write failure would result in Canary tool exit code of 5.
17763
17764
17765 ---
17766
17767 * [HBASE-14949](https://issues.apache.org/jira/browse/HBASE-14949) | *Major* | **Resolve name conflict when splitting if there are duplicated WAL entries**
17768
17769 Now we can write duplicated WAL entries into different WAL files. This feature is required by the replication consistency fix and new implementation of WAL writer.
17770
17771
17772 ---
17773
17774 * [HBASE-15100](https://issues.apache.org/jira/browse/HBASE-15100) | *Blocker* | **Master WALProcs still never clean up**
17775
17776 The constructor for o.a.h.hbase.ProcedureInfo was mistakenly labeled IA.Public in previous releases and has now changed to IA.Private. Downstream users are safe to consume ProcedureInfo objects returned from HBase public interfaces, but should not expect to be able to reliably create new instances themselves.
17777
17778 The method ProcedureInfo.setNonceKey has been removed, because it should not have been exposed to clients.
17779
17780
17781 ---
17782
17783 * [HBASE-14355](https://issues.apache.org/jira/browse/HBASE-14355) | *Major* | **Scan different TimeRange for each column family**
17784
17785 Adds being able to Scan each column family with a different time range. Adds new methods setColumnFamilyTimeRange and getColumnFamilyTimeRange to Scan.
17786
17787
17788 ---
17789
17790 * [HBASE-14460](https://issues.apache.org/jira/browse/HBASE-14460) | *Critical* | **[Perf Regression] Merge of MVCC and SequenceId (HBASE-8763) slowed Increments, CheckAndPuts, batch operations**
17791
17792 This release note tries to tell the general story. Dive into sub-tasks for more specific release noting.
17793
17794 Increments, appends, checkAnd\* have been slow since hbase-.1.0.0. The unification of mvcc and sequence id done by HBASE-8763 was responsible.
17795
17796 A ‘fast-path’ workaround was added by HBASE-15031 “Fix merge of MVCC and SequenceID performance regression in branch-1.0 for Increments”. It became available in 1.0.3 and 1.1.3. To enable the fast path, set "hbase.increment.fast.but.narrow.consistency" and then rolling restart. The workaround was for increments only (appends, checkAndPut, etc., were not addressed. See HBASE-15031 release note for more detail).
17797
17798 Subsequently, the regression was properly identified and fixed in HBASE-15213 and the fix applied to branch-1.0 and branch-1.1. As it happens, hbase-1.2.0 does not suffer from the performance regression (though the thought was that it did -- and so it got the fast-path patch too via HBASE-15092) nor does the master branch. HBASE-15213 identified that HBASE-12751 (as a side effect) had cured the regression.
17799
17800 hbase-1.0.4 (if it is ever released -- 1.0 has been end-of-lifed) and hbase-1.1.4 will have the HBASE-15213 fix.  If you are suffering from the increment regression and you are on 1.0.3 or 1.1.3, you can enable the work around to get back your increment performance but you should upgrade.
17801
17802
17803 ---
17804
17805 * [HBASE-15046](https://issues.apache.org/jira/browse/HBASE-15046) | *Major* | **Perf test doing all mutation steps under row lock**
17806
17807 In here we perf tested a realignment of the write pipeline and mvcc handling.  Thought was that this work was a predicate for a general fix of HBASE-14460 (turns out, realignment of write path was not needed to fix the increment perf regression). The perf testing here made it so we were able to simplify writing. HBASE-15158 was just committed. This work is done.
17808
17809
17810 ---
17811
17812 * [HBASE-15158](https://issues.apache.org/jira/browse/HBASE-15158) | *Major* | **Change order in which we do write pipeline operations; do all under row locks!**
17813
17814 Changed the write pipeline order; made it more rational, easier-to-reason-about doing all updates to WA, MemStore, and mvcc while read/write rowlock is held where before we'd release after WAL append and then do sync and mvcc.
17815
17816
17817 ---
17818
17819 * [HBASE-15157](https://issues.apache.org/jira/browse/HBASE-15157) | *Major* | **Add \*PerformanceTest for Append, CheckAnd\***
17820
17821 Add append, increment, checkAndMutate, checkAndPut, and checkAndDelete tests to PerformanceEvaluation tool. Below are excerpts from new usage from PE:
17822
17823 ....
17824 Command:
17825  append          Append on each row; clients overlap on keyspace so some concurrent operations
17826  checkAndDelete  CheckAndDelete on each row; clients overlap on keyspace so some concurrent operations
17827  checkAndMutate  CheckAndMutate on each row; clients overlap on keyspace so some concurrent operations
17828  checkAndPut     CheckAndPut on each row; clients overlap on keyspace so some concurrent operations
17829  filterScan      Run scan test using a filter to find a specific row based on it's value (make sure to use --rows=20)
17830  increment       Increment on each row; clients overlap on keyspace so some concurrent operations
17831  randomRead      Run random read test
17832 ....
17833 Examples:
17834 ...
17835  To run 10 clients doing increments over ten rows:
17836  $ bin/hbase org.apache.hadoop.hbase.PerformanceEvaluation --rows=10 --nomapred increment 10
17837
17838 Removed IncrementPerformanceTest. It is not as configurable as the additions made here.
17839
17840
17841 ---
17842
17843 * [HBASE-15218](https://issues.apache.org/jira/browse/HBASE-15218) | *Blocker* | **On RS crash and replay of WAL, loosing all Tags in Cells**
17844
17845 This issue fixes
17846 - In case of normal WAL (Not encrypted) we were loosing all cell tags on WAL replay after an RS crash
17847 - In case of encrypted WAL we were not even persisting Cell tags in WAL.  Tags from all unflushed (to HFile) Cells will get lost even after WAL replay recovery is done.
17848
17849 As we use tags for Cell level security, this fixes 2 security issues
17850  - Cell level visibility labels security breach . Making a visibility restricted cell global readable
17851  - Cell level ACL availability issue.  A user who is cell level authorized to read this cell can not read it. It is a data loss for him.
17852
17853
17854 ---
17855
17856 * [HBASE-15129](https://issues.apache.org/jira/browse/HBASE-15129) | *Major* | **Set default value for hbase.fs.tmp.dir rather than fully depend on hbase-default.xml**
17857
17858 Before HBASE-15129, if somehow hbase-default.xml is not on classpath, default values for hbase.fs.tmp.dir and hbase.bulkload.staging.dir are left empty. After HBASE-15129,  default values of both properties are set to "/user/\<user.name\>/hbase-staging".
17859
17860
17861 ---
17862
17863 * [HBASE-14969](https://issues.apache.org/jira/browse/HBASE-14969) | *Major* | **Add throughput controller for flush**
17864
17865 Adds means of throttling flush throughput. By default there is no limit; we use NoLimitThroughputController. An alternative controller, PressureAwareFlushThroughputController, allows specifying throughput bounds. A new simple factor, flush pressure, influences throughput. See PressureAwareFlushThroughputController.java class for detail.
17866
17867
17868 ---
17869
17870 * [HBASE-11425](https://issues.apache.org/jira/browse/HBASE-11425) | *Major* | **Cell/DBB end-to-end on the read-path**
17871
17872 For E2E off heaped read path, first of all there should be an off heap backed BucketCache(BC). Configure 'hbase.bucketcache.ioengine' to offheap in hbase-site.xml. Also specify the total capacity of the BC using hbase.bucketcache.size config.  Please remember to adjust value of 'HBASE\_OFFHEAPSIZE' in hbase-env.sh as per this capacity. Here-by we specify the max possible off-heap memory allocation for the RS java process. So this should be bigger than the off-heap BC size. Please keep in mind that there is no default for hbase.bucketcache.ioengine which means the BC is turned OFF by default.
17873
17874 Next thing to tune is the ByteBuffer pool in the RPC server side. The buffers from this pool will be used to accumulate the cell bytes and create a result cell block to send back to the client side. 'hbase.ipc.server.reservoir.enabled' can be used to turn this pool ON or OFF. By default this pool is ON and available. HBase will create off heap ByteBuffers and pool them. Please make sure not to turn this OFF if you want E2E off heaping in read path. If this pool is turned off, the server will create temp buffers on heap to accumulate the cell bytes and make a result cell block. This can impact the GC on a highly read loaded server.  The user can tune this pool with respect to how many buffers are in the pool and what should be the size of each ByteBuffer.
17875 Use the config 'hbase.ipc.server.reservoir.initial.buffer.size' to tune each of the buffer sizes. Defaults is 64 KB.
17876
17877 When the read pattern is a random row read and each of the rows are smaller in size compared to this 64 KB, try reducing this. When the result size is larger than one ByteBuffer size, the server will try to grab more than one buffer and make a result cell block out of these.  When the pool is running out of buffers, the server will end up creating temporary on-heap buffers.
17878
17879 The maximum number of ByteBuffers in the pool can be tuned using the config 'hbase.ipc.server.reservoir.initial.max'. Its value defaults to 64 \* region server handlers configured (See the config 'hbase.regionserver.handler.count'). The math is such that by default we consider 2 MB as the result cell block size per read result and each handler will be handling a read. For 2 MB size, we need 32 buffers each of size 64 KB (See default buffer size in pool).  So per handler 32 ByteBuffers(BB). We allocate twice this size as the max BBs count such that one handler can be creating the response and handing it to the RPC Responder thread and then handling a new request creating a new response cell block (using pooled buffers). Even if the responder could not send back the first TCP reply immediately, our count should allow that we should still have enough buffers in our pool without having to make temporary buffers on the heap.  Again for smaller sized random row reads, tune this max count. There are lazily created buffers and the count is the max count to be pooled.
17880
17881 The setting for HBASE\_OFFHEAPSIZE in hbase-env.sh should consider this off heap buffer pool at the RPC side also.  We need to config this max off heap size for RS as a bit higher than the sum of this max pool size and the off heap cache size. The TCP layer will also need to create direct bytebuffers for TCP communication. Also the DFS client will need some off-heap to do its workings especially if short-circuit reads are configured. Allocating an extra of 1 - 2 GB for the max direct memory size has worked in tests.
17882
17883 If you still see GC issues even after making E2E read path off heap, look for issues in the appropriate buffer pool. Check the below RS log with INFO level:
17884
17885   "Pool already reached its max capacity : XXX and no free buffers now. Consider increasing the value for 'hbase.ipc.server.reservoir.initial.max' ?"
17886
17887 If you are using co processors and refer the Cells in the read results, DO NOT store reference to these Cells out of the scope of the CP hook methods. Some times the CPs need store info about the cell (Like its row key) for considering in the next CP hook call etc. For such cases, pls clone the required fields of the entire Cell as per the use cases.  [ See CellUtil#cloneXXX(Cell) APIs ]
17888
17889
17890 ---
17891
17892 * [HBASE-15145](https://issues.apache.org/jira/browse/HBASE-15145) | *Major* | **HBCK and Replication should authenticate to zookepeer using server principal**
17893
17894 Added a new command line argument: --auth-as-server to enable authenticating to ZooKeeper as the HBase Server principal. This is required for secure clusters for doing replication operations like add\_peer, list\_peers, etc until HBASE-11392 is fixed. This advanced option can also be used for manually fixing secure znodes.
17895
17896 Commands can now be invoked like:
17897 hbase --auth-as-server shell
17898 hbase --auth-as-server zkcli
17899
17900 HBCK in secure setup also needs to authenticate to ZK using servers principals.This is turned on by default (no need to pass additional argument).
17901
17902 When authenticating as server, HBASE\_SERVER\_JAAS\_OPTS is concatenated to HBASE\_OPTS if defined in hbase-env.sh. Otherwise, HBASE\_REGIONSERVER\_OPTS is concatenated.
17903
17904
17905 ---
17906
17907 * [HBASE-15125](https://issues.apache.org/jira/browse/HBASE-15125) | *Major* | **HBaseFsck's adoptHdfsOrphan function creates region with wrong end key boundary**
17908
17909 **WARNING: No release note provided for this change.**
17910
17911
17912 ---
17913
17914 * [HBASE-13082](https://issues.apache.org/jira/browse/HBASE-13082) | *Major* | **Coarsen StoreScanner locks to RegionScanner**
17915
17916 After this JIRA we will not be doing any scanner reset after compaction during a course of a scan. The files that were compacted will still be continued to be used in the scan process. The compacted files will be archived by a background thread that runs every 2 mins by default only when there are no active scanners on those comapcted files. The above duration can be controlled using the knob 'hbase.hfile.compactions.cleaner.interval'.
17917
17918
17919 ---
17920
17921 * [HBASE-14865](https://issues.apache.org/jira/browse/HBASE-14865) | *Major* | **Support passing multiple QOPs to SaslClient/Server via hbase.rpc.protection**
17922
17923 With this patch, hbase.rpc.protection can now take multiple comma-separate QOP values. Accepted QOP values remain unchanged and are 'authentication', 'integrity', and 'privacy'. Server or client can use this configuration to specify their preference (in decreasing order) while negotiating QOP.
17924 This feature can be used to upgrade or downgrade QOP in an online cluster without compromising availability (i.e. taking cluster offline). For e.g. to change qop from A to B, typical steps would be:
17925 "A" --\> "B,A" --\> rolling restart --\> "B" --\> rolling restart
17926
17927 Sidenote: Based on experimentation, server's choice is given higher preference than client's choice. i.e. if server's choices are "A,B,C" and client's choices are "B,C,A", both A and B are acceptable, but A is chosen.
17928
17929
17930 ---
17931
17932 * [HBASE-15098](https://issues.apache.org/jira/browse/HBASE-15098) | *Blocker* | **Normalizer switch in configuration is not used**
17933
17934 The config parameter, hbase.normalizer.enabled, has been dropped since it is not used in the code base.
17935
17936
17937 ---
17938
17939 * [HBASE-15111](https://issues.apache.org/jira/browse/HBASE-15111) | *Trivial* | **"hbase version" should write to stdout**
17940
17941 The \`hbase version\` command now outputs directly to stdout rather than to a logger. This change allows the version information to be output consistently regardless of logger configuration. Naturally, this also means the command output ignores all logger configuration. Furthermore, the move from loggers to direct output changes the output of the command to omit metadata commonly included in logger ouput such as a timestamp, log level, and logger name.
17942
17943
17944 ---
17945
17946 * [HBASE-15027](https://issues.apache.org/jira/browse/HBASE-15027) | *Major* | **Refactor the way the CompactedHFileDischarger threads are created**
17947
17948 The property 'hbase.hfile.compactions.discharger.interval' has been renamed to 'hbase.hfile.compaction.discharger.interval' that describes the interval after which the compaction discharger chore service should run.
17949 The property 'hbase.hfile.compaction.discharger.thread.count' describes the thread count that does the compaction discharge work.
17950 The CompactedHFilesDischarger is a chore service now started as part of the RegionServer and this chore service iterates over all the onlineRegions in that RS and uses the RegionServer's executor service to launch a set of threads that does this job of compaction files clean up.
17951
17952
17953 ---
17954
17955 * [HBASE-14468](https://issues.apache.org/jira/browse/HBASE-14468) | *Major* | **Compaction improvements: FIFO compaction policy**
17956
17957 FIFO compaction policy selects only files which have all cells expired. The column family MUST have non-default TTL.
17958 Essentially, FIFO compactor does only one job: collects expired store files.
17959
17960 Because we do not do any real compaction, we do not use CPU and IO (disk and network), we do not evict hot data from a block cache. The result: improved throughput and latency both write and read.
17961 See: https://github.com/facebook/rocksdb/wiki/FIFO-compaction-style
17962
17963
17964 ---
17965
17966 * [HBASE-14888](https://issues.apache.org/jira/browse/HBASE-14888) | *Major* | **ClusterSchema: Add Namespace Operations**
17967
17968 This patch changes the semantic around namespace create/delete/modify when coprocessor asks that the invocation be by-passed. Previous the by-pass was done silently -- the method would just return with no indication as to whether by-pass route had been taken or not.  This patch adds throwing of a BypassCoprocessorException which is thrown if we have been asked to bypass a call.
17969
17970 The bypass facility has been in place since hbase 1.0.0 when namespace creation/deletion, etc.., was originally added in HBASE-8408 (HBASE-15071 is about addressing bypass handling in a general way)
17971
17972
17973 ---
17974
17975 * [HBASE-15018](https://issues.apache.org/jira/browse/HBASE-15018) | *Major* | **Inconsistent way of handling TimeoutException in the rpc client implementations**
17976
17977 When using the new AsyncRpcClient introduced in HBase 1.1.0 (HBASE-12684), time outs now result in an IOException wrapped around a CallTimeoutException instead of a bare CallTimeoutException. This change makes the AsyncRpcClient behave the same as the default HBase 1.y RPC client implementation.
17978
17979
17980 ---
17981
17982 * [HBASE-14796](https://issues.apache.org/jira/browse/HBASE-14796) | *Minor* | **Enhance the Gets in the connector**
17983
17984 spark.hbase.bulkGetSize  in HBaseSparkConf is for grouping bulkGet, and default value is 1000.
17985
17986
17987 ---
17988
17989 * [HBASE-14976](https://issues.apache.org/jira/browse/HBASE-14976) | *Minor* | **Add RPC call queues to the web ui**
17990
17991 Adds column displaying current aggregated call queues size in region server queues tab UI.
17992
17993
17994 ---
17995
17996 * [HBASE-14822](https://issues.apache.org/jira/browse/HBASE-14822) | *Major* | **Renewing leases of scanners doesn't work**
17997
17998 And 1.1, 1.0, and 0.98.
17999
18000
18001 ---
18002
18003 * [HBASE-14205](https://issues.apache.org/jira/browse/HBASE-14205) | *Critical* | **RegionCoprocessorHost System.nanoTime() performance bottleneck**
18004
18005 **WARNING: No release note provided for this change.**
18006
18007
18008 ---
18009
18010 * [HBASE-14978](https://issues.apache.org/jira/browse/HBASE-14978) | *Blocker* | **Don't allow Multi to retain too many blocks**
18011
18012 Limiting the amount of memory resident for any one request allows the server to handle concurrent requests smoothly. To this end we added the ability to limit the size of responses to a multi request. That worked well however it correctly represent the amount of memory resident. So this issue adds on a an approximation of the number of blocks held for a request.
18013
18014 All clients before 1.2.0 will not get this multi request chunking based upon blocks kept. All clients 1.2.0 and after will.
18015
18016
18017 ---
18018
18019 * [HBASE-14951](https://issues.apache.org/jira/browse/HBASE-14951) | *Minor* | **Make hbase.regionserver.maxlogs obsolete**
18020
18021 Rolling WAL events across a cluster can be highly correlated, hence flushing memstores, hence triggering minor compactions, that can be promoted to major ones. These events are highly correlated in time if there is a balanced write-load on the regions in a table. Default value for maximum WAL files (\* hbase.regionserver.maxlogs\*), which controls WAL rolling events - 32 is too small for many modern deployments.
18022 Now we calculate this value dynamically (if not defined by user), using the following formula:
18023
18024 maxLogs = Math.max( 32, HBASE\_HEAP\_SIZE \* memstoreRatio \* 2/ LogRollSize), where
18025
18026 memstoreRatio is \*hbase.regionserver.global.memstore.size\*
18027 LogRollSize is maximum WAL file size (default 0.95 \* HDFS block size)
18028
18029 We need to make sure that we avoid fully or minimize events when RS has to flush memstores prematurely only because it reached artificial limit of hbase.regionserver.maxlogs, this is why we put this 2 x multiplier in equation, this gives us maximum WAL capacity of 2 x RS memstore-size.
18030
18031 Runaway WAL files.
18032
18033 The default log rolling period (1h) allows to accumulate up to 2 X Memstore Size data in a WAL. For heap size - 32G and all other default setting, this gives ~ 26GB of data. Under heavy write load, the number of WAL files can increase dramatically. RegionServer LogRoller will be archiving old WALs periodically. User has three options, either override default hbase.regionserver.maxlogs or override default hbase.regionserver.logroll.period (decrease), or both to control runaway WALs.
18034
18035 For system with bursty write load,  the hbase.regionserver.logroll.period can be decreased to lower value. In this case the maximum number of wal files will be defined by the total size of memstore (unflushed data), not by the hbase.regionserver.maxlogs. But for majority of applications there will be no issues with defaults. Data will be flushed periodically from memstore, the LogRoller will archive old wal files and the system will never reach the new defaults for hbase.regionserver.maxlogs, unless the system is under extreme load for prolonged period of time, but in this case, decreasing hbase.regionserver.logroll.period allows us to control runaway wal files.
18036
18037 The following table gives the new default maximum log files values for several different Region Server heap sizes:
18038
18039 heap    memstore perc   maxLogs
18040 1G              40%                             32
18041 2G              40%                             32
18042 10G             40%                             80
18043 20G             40%                             160
18044 32G             40%                             256
18045
18046
18047 ---
18048
18049 * [HBASE-14984](https://issues.apache.org/jira/browse/HBASE-14984) | *Major* | **Allow memcached block cache to set optimze to false**
18050
18051 Setting hbase.cache.memcached.spy.optimze to true will allow the spy memcached client to try and optimize for the number of requests outstanding. This can increase throughput but can also increase variance for request times.
18052
18053 Setting it to true will help when round trip times are longer.
18054 Setting it to false ( the default ) will help ensure a more even distribution of response times.
18055
18056
18057 ---
18058
18059 * [HBASE-14534](https://issues.apache.org/jira/browse/HBASE-14534) | *Minor* | **Bump yammer/coda/dropwizard metrics dependency version**
18060
18061 Updated yammer metrics to version 3.1.2 (now it's been renamed to dropwizard). API has changed quite a bit, consult https://dropwizard.github.io/metrics/3.1.0/manual/core/ for additional information.
18062
18063 Note that among other things, in yammer 2.2.0 histograms were by default created in non-biased mode (uniform sampling), while in 3.1.0 histograms created via MetricsRegistry.histogram(...) are by default exponentially decayed. This shouldn't affect end users, though.
18064
18065
18066 ---
18067
18068 * [HBASE-14960](https://issues.apache.org/jira/browse/HBASE-14960) | *Major* | **Fallback to using default RPCControllerFactory if class cannot be loaded**
18069
18070 If the configured RPC controller factory (via hbase.rpc.controllerfactory.class) cannot be found in the classpath or loaded, we fall back to using the default RPC controller factory in HBase.
18071
18072
18073 ---
18074
18075 * [HBASE-14946](https://issues.apache.org/jira/browse/HBASE-14946) | *Critical* | **Don't allow multi's to over run the max result size.**
18076
18077 The HBase region server will now send a chunk of get responses to a client if the total response size is too large. This will only be done for clients 1.2.0 and beyond. Older clients by default will have the old behavior.
18078
18079 This patch is for the case where the basic flow is like this:
18080
18081 I want to get a single column from lots of rows. So I create a list of gets. Then I send them to table.get(List\<Get\>). If the regions for that table are spread out then those requests get chunked out to all the region servers. No one regionserver gets too many. However if one region server contains lots of regions for that table then a multi action can contain lots of gets. No single get is too onerous. However the regionserver won't return until every get is complete. So if there are thousands of gets that are sent in one multi then the regionserver can retain lots of data in one thread.
18082
18083
18084 ---
18085
18086 * [HBASE-14906](https://issues.apache.org/jira/browse/HBASE-14906) | *Major* | **Improvements on FlushLargeStoresPolicy**
18087
18088 In HBASE-14906 we use "hbase.hregion.memstore.flush.size/column\_family\_number" as the default threshold for memstore flush instead of the fixed value through "hbase.hregion.percolumnfamilyflush.size.lower.bound" property, which makes  the default threshold more flexible to various use case. We also introduce a new property in name of "hbase.hregion.percolumnfamilyflush.size.lower.bound.min" with 16M as the default value to avoid small flush in cases like hundreds of column families.
18089
18090 After this change setting "hbase.hregion.percolumnfamilyflush.size.lower.bound" in hbase-site.xml won't take effect anymore, but expert users could still set this property in table descriptor to override the default value just as before
18091
18092
18093 ---
18094
18095 * [HBASE-14769](https://issues.apache.org/jira/browse/HBASE-14769) | *Major* | **Remove unused functions and duplicate javadocs from HBaseAdmin**
18096
18097 - Removes functions from HBaseAdmin which require table name parameter as either byte[] or String. Use their counterparts which take TableName instead.
18098 - Removes redundant javadocs from HBaseAdmin as they will be automatically inherited from Admin interface.
18099 - HBaseAdmin is marked Audience.private so it should have been straight forward okay to remove the functions. But HBaseTestingUtility, which is marked Audience.public had a public function returning its instance, which moved this decision into gray area. Discussing in the community, it was decided that it would be okay to do so in this particular case.
18100
18101
18102 ---
18103
18104 * [HBASE-13153](https://issues.apache.org/jira/browse/HBASE-13153) | *Major* | **Bulk Loaded HFile Replication**
18105
18106 This enhances the HBase replication to support replication of bulk loaded data. This is configurable, by default it is set to false which means it will not replicate the bulk loaded data to its peer(s). To enable it set "hbase.replication.bulkload.enabled" to true.
18107
18108 Following are the additional configurations added for this enhancement,
18109  a. hbase.replication.cluster.id - This is manadatory to configure in cluster where replication for bulk loaded data is enabled. A source cluster is uniquely identified by sink cluster using this id. This should be configured in the source cluster configuration file for all the RS.
18110  b. hbase.replication.conf.dir - This represents the directory where all the active cluster's file system client configurations are defined in subfolders corresponding to their respective replication cluster id in peer cluster. This should be configured in the peer cluster configuration file for all the RS. Default is HBASE\_CONF\_DIR.
18111  c. hbase.replication.source.fs.conf.provider - This represents the class which provides the source cluster file system client configuration to peer cluster. This should be configured in the peer cluster configuration file for all the RS. Default is org.apache.hadoop.hbase.replication.regionserver.DefaultSourceFSConfigurationProvider
18112
18113  For example: If source cluster FS client configurations are copied in peer cluster under directory /home/user/dc1/ then  hbase.replication.cluster.id should be configured as dc1 and hbase.replication.conf.dir as /home/user
18114
18115 Note:
18116  a. Any modification to source cluster FS client configuration files in peer cluster side replication configuration directory then it needs to restart all its peer(s) cluster RS with default hbase.replication.source.fs.conf.provider.
18117  b. Only 'xml' type files will be loaded by the default hbase.replication.source.fs.conf.provider.
18118
18119 As part of this we have made following changes to LoadIncrementalHFiles class which is marked as Public and Stable class,
18120  a. Raised the visibility scope of LoadQueueItem class from package private to public.
18121  b. Added a new method loadHFileQueue, which loads the queue of LoadQueueItem into the table as per the region keys provided.
18122
18123
18124 ---
18125
18126 * [HBASE-7171](https://issues.apache.org/jira/browse/HBASE-7171) | *Major* | **Initial web UI for region/memstore/storefiles details**
18127
18128 HBASE-7171 adds 2 new pages to the region server Web UI to ease debugging and provide greater insight into the physical data layout.
18129
18130 Region names in UI table listing all regions (on the RS status page) are now hyperlinks leading to region detail page which shows some aggregate memstore information (currently just memory used) along with the list of all Store Files (HFiles) in the region. Names of Store Files are also hyperlinks leading to Store File detail page, which currently runs 'hbase hfile' command behind the scene and displays statistics about store file.
18131
18132
18133 ---
18134
18135 * [HBASE-14655](https://issues.apache.org/jira/browse/HBASE-14655) | *Blocker* | **Narrow the scope of doAs() calls to region observer notifications for compaction**
18136
18137 Region observer notifications w.r.t. compaction request are now audited with request user through proper scope of doAs() calls.
18138
18139
18140 ---
18141
18142 * [HBASE-14631](https://issues.apache.org/jira/browse/HBASE-14631) | *Blocker* | **Region merge request should be audited with request user through proper scope of doAs() calls to region observer notifications**
18143
18144 Region observer notifications w.r.t. merge request are now audited with request user through proper scope of doAs() calls.
18145
18146
18147 ---
18148
18149 * [HBASE-14605](https://issues.apache.org/jira/browse/HBASE-14605) | *Blocker* | **Split fails due to 'No valid credentials' error when SecureBulkLoadEndpoint#start tries to access hdfs**
18150
18151 When split is requested by non-super user, split related notifications for Coprocessor are executed using the login of the request user.
18152 Previously the notifications were carried out as super user.
18153
18154
18155 ---
18156
18157 * [HBASE-14926](https://issues.apache.org/jira/browse/HBASE-14926) | *Major* | **Hung ThriftServer; no timeout on read from client; if client crashes, worker thread gets stuck reading**
18158
18159 Adds a timeout to server read from clients. Adds new configs hbase.thrift.server.socket.read.timeout for setting read timeout on server socket in milliseconds. Default is 60000;
18160
18161
18162 ---
18163
18164 * [HBASE-14825](https://issues.apache.org/jira/browse/HBASE-14825) | *Minor* | **HBase Ref Guide corrections of typos/misspellings**
18165
18166 Corrections to content of "book.html", which is pulled from various \*.adoc files and \*.xml files.
18167 -- corrects typos/misspellings
18168 -- corrects incorrectly formatted links
18169
18170
18171 ---
18172
18173 * [HBASE-14821](https://issues.apache.org/jira/browse/HBASE-14821) | *Major* | **CopyTable should allow overriding more config properties for peer cluster**
18174
18175 Configuration properties for org.apache.hadoop.hbase.mapreduce.TableOutputFormat can now be overridden by prefixing the property keys with "hbase.mapred.output.".  When the configuration is applied to TableOutputFormat, these entries will be rewritten with the prefix removed -- ie. "hbase.mapred.output.hbase.security.authentication" becomes "hbase.security.authentication".  This can be useful when directing output to a peer cluster with different security configuration, for example.
18176
18177
18178 ---
18179
18180 * [HBASE-14799](https://issues.apache.org/jira/browse/HBASE-14799) | *Critical* | **Commons-collections object deserialization remote command execution vulnerability**
18181
18182 This issue resolves a potential security vulnerability. For all versions we update our commons-collections dependency to the release that fixes the reported vulnerability in that library. In 0.98 we additionally disable by default a feature of code carried from 0.94 for backwards compatibility that is not needed.
18183
18184
18185 ---
18186
18187 * [HBASE-12751](https://issues.apache.org/jira/browse/HBASE-12751) | *Major* | **Allow RowLock to be reader writer**
18188
18189 Locks on row are now reader/writer rather than exclusive.
18190
18191 Moves sequenceid out of HRegion and into MVCC class; MVCC is now in charge. A WAL append is still stamped in same way (we pass MVCC context in a few places where we previously we did not).
18192
18193 MVCC methods cleaned up. Make a bit more sense now. Less of them.
18194
18195 Simplifies our update of MemStore/WAL. Now we update memstore AFTER we add to WAL (but before we sync). This fixes possible dataloss when two edits came in with same coordinates; we could order the edits in memstore differently to how they arrived in the WAL.
18196
18197 Marked as an incompatible change because it breaks Distributed Log Replay, a feature we'd determined already was unreliable and to be removed.
18198
18199
18200 ---
18201
18202 * [HBASE-14793](https://issues.apache.org/jira/browse/HBASE-14793) | *Major* | **Allow limiting size of block into L1 block cache.**
18203
18204 Very large blocks can fragment the heap and cause bad issues for the garbage collector, especially the G1GC. Now there is a maximum size that a block can be and still stick in the LruBlockCache. That size defaults to 16mb but can be controlled by changing "hbase.lru.max.block.size"
18205
18206
18207 ---
18208
18209 * [HBASE-14387](https://issues.apache.org/jira/browse/HBASE-14387) | *Major* | **Compaction improvements: Maximum off-peak compaction size**
18210
18211 New configuration option: hbase.hstore.compaction.max.size.offpeak - maximum selection size eligible for minor compaction during off peak hours.
18212 hbase.hstore.compaction.max.size - this is default maximum if no off-peak hours are defined or if no maximum off-peak maximum size is defined.
18213
18214
18215 ---
18216
18217 * [HBASE-12822](https://issues.apache.org/jira/browse/HBASE-12822) | *Minor* | **Option for Unloading regions through region\_mover.rb without Acknowledging**
18218
18219 Incorporated in HBASE-13014.
18220
18221
18222 ---
18223
18224 * [HBASE-14700](https://issues.apache.org/jira/browse/HBASE-14700) | *Major* | **Support a "permissive" mode for secure clusters to allow "simple" auth clients**
18225
18226 Secure HBase now supports a permissive mode to allow mixed secure and insecure clients.  This allows clients to be incrementally migrated over to a secure configuration.  To enable clients to continue to connect using SIMPLE authentication when the cluster is configured for security, set "hbase.ipc.server.fallback-to-simple-auth-allowed" equal to "true" in hbase-site.xml.  NOTE: This setting should ONLY be used as a temporary measure while converting clients over to secure authentication.  It MUST BE DISABLED for secure operation.
18227
18228
18229 ---
18230
18231 * [HBASE-14257](https://issues.apache.org/jira/browse/HBASE-14257) | *Major* | **Periodic flusher only handles hbase:meta, not other system tables**
18232
18233 Memstore periodic flusher used to flush META table every 5 minutes but not any other system tables. This jira extends it to flush all system tables within this time period.
18234
18235
18236 ---
18237
18238 * [HBASE-14658](https://issues.apache.org/jira/browse/HBASE-14658) | *Major* | **Allow loading a MonkeyFactory by class name**
18239
18240 You can specify one of the predefined set of Monkeys when you run Integration Tests by passing the -m\|--monkey arguments on the command line; e.g -m CALM or -m SLOW\_DETERMINISTIC
18241
18242 This patch  makes it so you can pass the name of a class as the monkey to run: e.g. -m org.example.KingKong
18243
18244
18245 ---
18246
18247 * [HBASE-14521](https://issues.apache.org/jira/browse/HBASE-14521) | *Major* | **Unify the semantic of hbase.client.retries.number**
18248
18249 After this change, hbase.client.reties.number universally means the number of retry which is one less than total tries number,  for both non-batch operations like get/scan/increment etc. which uses RpcRetryingCallerImpl#callWithRetries to submit the call or batch operations like put through AsyncProcess#submit.
18250
18251 Note that previously this property means total tries number for puts, so please adjust the setting of its value if necessary. Please also be cautious when setting it to zero since retry is necessary for client cache update when region move happens.
18252
18253
18254 ---
18255
18256 * [HBASE-13819](https://issues.apache.org/jira/browse/HBASE-13819) | *Major* | **Make RPC layer CellBlock buffer a DirectByteBuffer**
18257
18258 For master branch(2.0 version), the BoundedByteBufferPool always create Direct (off heap) ByteBuffers and return that.
18259 For branch-1(1.3 version), byte default the buffers returned will be off heap. This can be changed to return on heap ByteBuffers by configuring 'hbase.ipc.server.reservoir.direct.buffer' to false.
18260
18261
18262 ---
18263
18264 * [HBASE-14517](https://issues.apache.org/jira/browse/HBASE-14517) | *Minor* | **Show regionserver's version in master status page**
18265
18266 Adds server version to the listing of regionservers on the master home page.
18267
18268 if a cluster where the versions deviate, at the bottom of the 'Version' column on the master home page listing of 'Region Servers', you will see a note in red that says something like: 'Total:10              9 nodes with inconsistent version'
18269
18270
18271 ---
18272
18273 * [HBASE-12911](https://issues.apache.org/jira/browse/HBASE-12911) | *Major* | **Client-side metrics**
18274
18275 Introduces collection and reporting of various client-perceived metrics. Metrics are exposed via JMX under "org.apache.hadoop.hbase.client.MetricsConnection". Metrics are scoped according to connection instance, so multiple connection objects (ie, to different clusters) will report their metrics separately. Metrics are disabled by default, must be enabled by configuring "hbase.client.metrics.enable=true".
18276
18277
18278 ---
18279
18280 * [HBASE-14529](https://issues.apache.org/jira/browse/HBASE-14529) | *Major* | **Respond to SIGHUP to reload config**
18281
18282 HBase daemons can now be signaled to reload their config by sending SIGHUP to the java process. Not all config parameters can be reloaded.
18283
18284 In order for this new feature to work the hbase-daemon.sh script was changed to use disown rather than nohup. Functionally this shouldn't change anything but the processes will have a different parent when being run from a connected login shell.
18285
18286
18287 ---
18288
18289 * [HBASE-14502](https://issues.apache.org/jira/browse/HBASE-14502) | *Major* | **Purge use of jmock and remove as dependency**
18290
18291 HBASE-14502 Purge use of jmock and remove as dependency
18292
18293
18294 ---
18295
18296 * [HBASE-14544](https://issues.apache.org/jira/browse/HBASE-14544) | *Major* | **Allow HConnectionImpl to not refresh the dns on errors**
18297
18298 By setting hbase.resolve.hostnames.on.failure to false you can reduce the number of dns name resolutions that a client will do. However if machines leave and come back with different ip's the changes will not be noticed by the clients. So only set hbase.resolve.hostnames.on.failure to false if your cluster dns is not changing while clients are connected.
18299
18300
18301 ---
18302
18303 * [HBASE-14367](https://issues.apache.org/jira/browse/HBASE-14367) | *Major* | **Add normalization support to shell**
18304
18305 This patch adds shell support for region normalizer (see HBASE-13103).
18306
18307 3 commands have been added to hbase shell 'tools' command group (modeled on how the balancer works):
18308
18309  - 'normalizer\_enabled' checks whether region normalizer is turned on
18310  - 'normalizer\_switch' allows user to turn normalizer on and off
18311  - 'normalize' runs region normalizer if it's turned on.
18312
18313 Also 'alter' command has been extended to allow user to enable/disable region normalization per table (disabled by default). Use it as
18314
18315 alter 'testtable', {NORMALIZATION\_MODE =\> 'true'}
18316
18317 Here is the help for the normalize command:
18318
18319 {code}
18320 hbase(main):008:0\> help 'normalize'
18321 Trigger region normalizer for all tables which have NORMALIZATION\_MODE flag set. Returns true
18322  if normalizer ran successfully, false otherwise. Note that this command has no effect
18323  if region normalizer is disabled (make sure it's turned on using 'normalizer\_switch' command).
18324
18325  Examples:
18326
18327    hbase\> normalize
18328 {code}
18329
18330
18331 ---
18332
18333 * [HBASE-14475](https://issues.apache.org/jira/browse/HBASE-14475) | *Major* | **Region split requests are always audited with "hbase" user rather than request user**
18334
18335 Region observer notifications w.r.t. split request are now audited with request user through proper scope of doAs() calls.
18336
18337
18338 ---
18339
18340 * [HBASE-14230](https://issues.apache.org/jira/browse/HBASE-14230) | *Minor* | **replace reflection in FSHlog with HdfsDataOutputStream#getCurrentBlockReplication()**
18341
18342 Remove calling getNumCurrentReplicas on HdfsDataOutputStream via reflection. getNumCurrentReplicas showed up in hadoop 1+ and hadoop 0.2x. In hadoop-2 it was deprecated.
18343
18344
18345 ---
18346
18347 * [HBASE-14495](https://issues.apache.org/jira/browse/HBASE-14495) | *Major* | **TestHRegion#testFlushCacheWhileScanning goes zombie**
18348
18349 The WAL append was changed by HBASE-12751. Every append now sets a latch on an edit. The latch needs to be cleared or else the WAL will hang. The original failures in TestHRegion turned up 'holes' where we were failing to throw the latch if we skipped out early because we were interrupted. Other 'holes' were found where we had mocked up a WAL so the latch would just stay in place.  Futher holes were found appending WAL markers... here we were skipping the mvcc completely for a few edits.  A clean up of WALUtils made all markers take the same code paths.
18350
18351
18352 ---
18353
18354 * [HBASE-14280](https://issues.apache.org/jira/browse/HBASE-14280) | *Minor* | **Bulk Upload from HA cluster to remote HA hbase cluster fails**
18355
18356 Patch will effectively work with Hadoop version 2.6 or greater with a launch of "internal.nameservices".
18357 There will be no change in versions older than 2.6.
18358
18359
18360 ---
18361
18362 * [HBASE-14334](https://issues.apache.org/jira/browse/HBASE-14334) | *Major* | **Move Memcached block cache in to it's own optional module.**
18363
18364 Move external block cache to it's own module. This  will reduce dependencies for people who use hbase-server.
18365 Currently Memcached is the reference implementation for external block cache. External block caches allow HBase to take advantage of other more complex caches that can live longer than the HBase regionserver process and are not necessarily tied to a single computer
18366     life time. However external block caches add in extra operational overhead.
18367
18368
18369 ---
18370
18371 * [HBASE-14433](https://issues.apache.org/jira/browse/HBASE-14433) | *Major* | **Set down the client executor core thread count from 256 in tests**
18372
18373 Tests run with client executors that have core thread count of 4 and a keepalive of 3 seconds. They used to default to 256 core threads and 60 seconds  for keepalive.
18374
18375
18376 ---
18377
18378 * [HBASE-14400](https://issues.apache.org/jira/browse/HBASE-14400) | *Critical* | **Fix HBase RPC protection documentation**
18379
18380 To use rpc protection in HBase, set the value of 'hbase.rpc.protection' to:
18381 'authentication' : simple authentication using kerberos
18382 'integrity' : authentication and integrity
18383 'privacy' : authentication and confidentiality
18384
18385 Earlier, HBase reference guide erroneously mentioned in some places to set the value to 'auth-conf'. This patch fixes the guide and adds temporary support for erroneously recommended values.
18386
18387
18388 ---
18389
18390 * [HBASE-14306](https://issues.apache.org/jira/browse/HBASE-14306) | *Major* | **Refine RegionGroupingProvider: fix issues and make it more scalable**
18391
18392 In HBASE-14306 we've changed default strategy of RegionGroupingProvider from "identify" to "bounded", so it's required to explicitly set "hbase.wal.regiongrouping.strategy" to "identify" if user still wants to use one WAL per region
18393
18394 Please also notice that in the new framework there will be one WAL per group, and the region-group mapping is decided by RegionGroupingStrategy. Accordingly, we've removed BoundedRegionGroupingProvider and added BoundedRegionGroupingStrategy as a replacement. If you already have a customized class for hbase.wal.regiongrouping.strategy, please check the new logic and make updates if necessary.
18395
18396
18397 ---
18398
18399 * [HBASE-6617](https://issues.apache.org/jira/browse/HBASE-6617) | *Major* | **ReplicationSourceManager should be able to track multiple WAL paths**
18400
18401 ReplicationSourceManager now could track multiple wal paths. Notice that although most changes are internal and all metrics names remain the same, signature of below methods in MetricsSource are changed:
18402
18403 1. refreshAgeOfLastShippedOp now requires a String parameter which indicates the wal group id of the reporter
18404 2. setAgeOfLastShippedOp also adds a String parameter for wal group id
18405
18406
18407 ---
18408
18409 * [HBASE-14314](https://issues.apache.org/jira/browse/HBASE-14314) | *Major* | **Metrics for block cache should take region replicas into account**
18410
18411 The following metrics for primary region replica are added:
18412
18413 blockCacheHitCountPrimary
18414 blockCacheMissCountPrimary
18415 blockCacheEvictionCountPrimary
18416
18417
18418 ---
18419
18420 * [HBASE-14317](https://issues.apache.org/jira/browse/HBASE-14317) | *Blocker* | **Stuck FSHLog: bad disk (HDFS-8960) and can't roll WAL**
18421
18422 Tighten up WAL-use semantic.
18423
18424 1. If an append or a sync throws an exception, all subsequent attempts at using the log will also throw this same exception. The WAL is now a lame-duck until you roll it.
18425 2. If a successful append, and then we fail to sync the append, this is a fatal exception. The container must abort to replay the WAL logs even though we have told the client that the appends failed.
18426
18427 The above rules have been applied laxly up to this; it used to be possible to get a good sync to go in over the top of a failed append. This has been fixed in this patch.
18428
18429 Also fixed a hang in the WAL subsystem if a request to pause the write pipeline took on a failed sync. before the roll requests sync got scheduled.
18430
18431
18432 TODO: Revisit our WAL system. HBASE-12751 helps rationalize our write pipeline. In particular, it manages sequenceid inside mvcc which should make it so we can purge mechanism that writes empty, unflushed appends just to get the next sequenceid... problematic when WAL goes lame-duck. Lets get it in.
18433 TODO: A successful append followed by a failed sync probably only needs us replace the WAL (if we have signalled the client that the appends failed). Bummer is that replicating, these last appends might make it to the sink cluster or get replayed during recovery. HBase should keep its own WAL length? Or sequenceid of last successful sync should be passed when doing recovery and replication?
18434
18435
18436 ---
18437
18438 * [HBASE-14261](https://issues.apache.org/jira/browse/HBASE-14261) | *Major* | **Enhance Chaos Monkey framework by adding zookeeper and datanode fault injections.**
18439
18440 This change augments existing chaos monkey framework with actions for restarting underlying zookeeper quorum and hdfs nodes of distributed hbase cluster. One assumption made while creating zk actions are that zookeper ensemble is an independent external service and won't be managed by hbase cluster.  For these actions to work as expected, the following parameters need to be configured appropriately.
18441
18442 {code}
18443 \<property\>
18444   \<name\>hbase.it.clustermanager.hadoop.home\</name\>
18445   \<value\>$HADOOP\_HOME\</value\>
18446 \</property\>
18447 \<property\>
18448   \<name\>hbase.it.clustermanager.zookeeper.home\</name\>
18449   \<value\>$ZOOKEEPER\_HOME\</value\>
18450 \</property\>
18451 \<property\>
18452   \<name\>hbase.it.clustermanager.hbase.user\</name\>
18453   \<value\>hbase\</value\>
18454 \</property\>
18455 \<property\>
18456   \<name\>hbase.it.clustermanager.hadoop.hdfs.user\</name\>
18457   \<value\>hdfs\</value\>
18458 \</property\>
18459 \<property\>
18460   \<name\>hbase.it.clustermanager.zookeeper.user\</name\>
18461   \<value\>zookeeper\</value\>
18462 \</property\>
18463 {code}
18464
18465 The service user related configurations are newly introduced since in prod/test environments each service is managed by different user. Once the above parameters are configured properly, you can start using them as needed. An example usage for invoking these new actions is:
18466
18467 {{./hbase org.apache.hadoop.hbase.IntegrationTestAcidGuarantees -m serverAndDependenciesKilling}}
18468
18469
18470 ---
18471
18472 * [HBASE-14309](https://issues.apache.org/jira/browse/HBASE-14309) | *Major* | **Allow load balancer to operate when there is region in transition by adding force flag**
18473
18474 This issue adds boolean parameter, force, to 'balancer' command so that admin can force region balancing even when there is region (other than hbase:meta) in transition - assuming RIT being transient.
18475 If hbase:meta is in transition, balancer command returns false.
18476
18477 WARNING: For experts only. Forcing a balance may do more damage than repair when assignment is confused
18478 Note: enclose the force parameter in double quotes
18479
18480
18481 ---
18482
18483 * [HBASE-14313](https://issues.apache.org/jira/browse/HBASE-14313) | *Critical* | **After a Connection sees ConnectionClosingException it never recovers**
18484
18485 HConnection could get stuck when talking to a host that went down and then returned. This has been fixed by closing the connection in all paths.
18486
18487
18488 ---
18489
18490 * [HBASE-13339](https://issues.apache.org/jira/browse/HBASE-13339) | *Blocker* | **Update default Hadoop version to latest for master**
18491
18492 Master/2.0.0 now builds on the latest stable hadoop by default.
18493
18494
18495 ---
18496
18497 * [HBASE-14224](https://issues.apache.org/jira/browse/HBASE-14224) | *Critical* | **Fix coprocessor handling of duplicate classes**
18498
18499 Prevent Coprocessors being doubly-loaded; a particular coprocessor can only be loaded once.
18500
18501
18502 ---
18503
18504 * [HBASE-13127](https://issues.apache.org/jira/browse/HBASE-13127) | *Major* | **Add timeouts on all tests so less zombie sightings**
18505
18506 Use junit facility to impose timeout on test. Use test category to chose which timeout to apply: small tests timeout after 30 seconds, medium tests after 180 seconds, and large tests after ten minutes.
18507
18508 Updated junit version from 4.11 to 4.12. 4.12 has support for feature used here.
18509
18510 Add this at the head of your junit4 class to add a category-based timeout:
18511
18512 {code}
18513 @Rule public final TestRule timeout =   CategoryBasedTimeout.builder().withTimeout(this.getClass()).
18514       withLookingForStuckThread(true).build();
18515 {code}
18516
18517 For example:
18518
18519
18520 ---
18521
18522 * [HBASE-14148](https://issues.apache.org/jira/browse/HBASE-14148) | *Major* | **Web UI Framable Page**
18523
18524 Security fix: Adds protection from clickjacking using X-Frame-Options header.
18525 This will prevent use of HBase UI in frames. To disable this feature, set the configuration 'hbase.http.filter.xframeoptions.mode' to 'ALLOW' (default is 'DENY').
18526
18527
18528 ---
18529
18530 * [HBASE-10844](https://issues.apache.org/jira/browse/HBASE-10844) | *Major* | **Coprocessor failure during batchmutation leaves the memstore datastructs in an inconsistent state**
18531
18532 Promotes an -ea assert to logged FATAL and RS abort when memstore is found to be in an inconsistent state.
18533
18534
18535 ---
18536
18537 * [HBASE-13966](https://issues.apache.org/jira/browse/HBASE-13966) | *Minor* | **Limit column width in table.jsp**
18538
18539 Wraps region, start key, end key columns if too long.
18540
18541
18542 ---
18543
18544 * [HBASE-13706](https://issues.apache.org/jira/browse/HBASE-13706) | *Minor* | **CoprocessorClassLoader should not exempt Hive classes**
18545
18546 Starting from HBase 2.0, CoprocessorClassLoader will not exempt hadoop classes or zookeeper classes.  This means that if the custom coprocessor jar contains hadoop or zookeeper packages and classes, they will be loaded by the CoprocessorClassLoader.  Only hbase packages and classes  are exempted from the CoprocessorClassLoader. They (and their dependencies) are loaded by the parent server class loader.
18547
18548
18549 ---
18550
18551 * [HBASE-14054](https://issues.apache.org/jira/browse/HBASE-14054) | *Major* | **Acknowledged writes may get lost if regionserver clock is set backwards**
18552
18553 In {{checkAndPut}} write path use max(max timestamp for the row, System.currentTimeMillis()) in the, instead of blindly taking System.currentTimeMillis() to ensure that checkAndPut() cannot do writes which is already eclipsed. This is similar to what has been done in HBASE-12449 for increment and append.
18554
18555
18556 ---
18557
18558 * [HBASE-13985](https://issues.apache.org/jira/browse/HBASE-13985) | *Minor* | **Add configuration to skip validating HFile format when bulk loading**
18559
18560 A new config, hbase.loadincremental.validate.hfile , is introduced - default to true
18561 When set to false, checking hfile format is skipped during bulkloading.
18562
18563
18564 ---
18565
18566 * [HBASE-14201](https://issues.apache.org/jira/browse/HBASE-14201) | *Major* | **hbck should not take a lock unless fixing errors**
18567
18568 HBCK no longer takes a lock until there are changes to the cluster being made.
18569
18570 The old behavior can be achieved by passing the -exclusive flag.
18571
18572
18573 ---
18574
18575 * [HBASE-14081](https://issues.apache.org/jira/browse/HBASE-14081) | *Minor* | **(outdated) references to SVN/trunk in documentation**
18576
18577 HBASE-14081 Remove (outdated) references to SVN/trunk from documentation
18578
18579
18580 ---
18581
18582 * [HBASE-13865](https://issues.apache.org/jira/browse/HBASE-13865) | *Trivial* | **Increase the default value for hbase.hregion.memstore.block.multipler from 2 to 4 (part 2)**
18583
18584 Increase default hbase.hregion.memstore.block.multiplier from 2 to 4 in the code to match the default value in the config files.
18585
18586
18587 ---
18588
18589 * [HBASE-12295](https://issues.apache.org/jira/browse/HBASE-12295) | *Major* | **Prevent block eviction under us if reads are in progress from the BBs**
18590
18591 We try to delay the eviction of the block till the cellblocks are formed at the Rpc layer. A simple reference counting mechanism is introduced when ever a block is accessed from the Bucket cache.  Once a scanner completes using a block the reference count is decremented.  The eviction of the block happens only when the reference count of that block is 0.
18592 We also introduce a concept of ShareableMemory based on the type of blocks we create from the Block cache. The blocks from the ByteBufferIOEngine directly refer to the buckets in offheap and such blocks are marked SHARED memory type. The blocks from LRU, HDFS and file mode of Bucket cache are all marked EXCLUSIVE because these blocks have their own exclusive memory.
18593 For the CP case, any cell coming out of SHARED memory block is copied before returning the results, because CPs can use the results as its state so that eviction cannot corrupt the results.
18594
18595
18596 ---
18597
18598 * [HBASE-11339](https://issues.apache.org/jira/browse/HBASE-11339) | *Major* | **HBase MOB**
18599
18600 The Moderate Object Storage (MOB) feature (HBASE-11339[1]) is modified I/O and compaction path that allows individual moderately sized values (100KB-10MB) to be stored in a way that write amplification is reduced when compared to the normal I/O path. MOB is defined in the column family and it is almost isolated with other components, the features and performance cannot be effected in normal columns.
18601
18602 For more details on how to use the feature please consult the HBase Reference Guide
18603
18604
18605 ---
18606
18607 * [HBASE-13954](https://issues.apache.org/jira/browse/HBASE-13954) | *Major* | **Remove HTableInterface#getRowOrBefore related server side code**
18608
18609 Removed Table#getRowOrBefore, Region#getClosestRowBefore, Store#getRowKeyAtOrBefore, RemoteHTable#getRowOrBefore apis and Thrift support for getRowOrBefore.
18610 Also removed two coprocessor hooks preGetClosestRowBefore and postGetClosestRowBefore.
18611 User using this api can instead use reverse scan something like below,
18612 {code}
18613  Scan scan = new Scan(row);
18614   scan.setSmall(true);
18615   scan.setCaching(1);
18616   scan.setReversed(true);
18617   scan.addFamily(family);
18618 {code}
18619 pass this scan object to the scanner and retrieve the first Result from scanner output.
18620
18621
18622 ---
18623
18624 * [HBASE-12296](https://issues.apache.org/jira/browse/HBASE-12296) | *Major* | **Filters should work with ByteBufferedCell**
18625
18626 Change to support offheaping.
18627
18628 Incompatible change for filters ColumnPrefixFilter and MultipleColumnPrefixFilter
18629
18630 Changes parameters to filterColumn so takes a Cell rather than a byte [].
18631
18632 hbase-client-1.2.7-SNAPSHOT.jar, ColumnPrefixFilter.class
18633 package org.apache.hadoop.hbase.filter
18634 ColumnPrefixFilter.filterColumn ( byte[ ] buffer, int qualifierOffset, int qualifierLength )  :  Filter.ReturnCode
18635 org/apache/hadoop/hbase/filter/ColumnPrefixFilter.filterColumn:([BII)Lorg/apache/hadoop/hbase/filter/Filter$ReturnCode;
18636
18637 Ditto for filterColumnValue in SingleColumnValueFilter. Takes a Cell instead of byte array.
18638
18639
18640 ---
18641
18642 * [HBASE-14045](https://issues.apache.org/jira/browse/HBASE-14045) | *Major* | **Bumping thrift version to 0.9.2.**
18643
18644 This changes upgrades thrift dependency of HBase to 0.9.2. Though this doesn't break any HBase compatibility promises, it might impact any downstream projects that share thrift dependency with HBase.
18645
18646
18647 ---
18648
18649 * [HBASE-14027](https://issues.apache.org/jira/browse/HBASE-14027) | *Major* | **Clean up netty dependencies**
18650
18651 HBase's convenience binary artifact no longer contains the netty 3.2.4 jar . This jar was not directly used by HBase, but may have been relied on by downstream applications.
18652
18653
18654 ---
18655
18656 * [HBASE-7782](https://issues.apache.org/jira/browse/HBASE-7782) | *Minor* | **HBaseTestingUtility.truncateTable() not acting like CLI**
18657
18658 HBaseTestingUtility now uses the truncate API added in HBASE-8332 so that calls to HBTU.truncateTable will behave like the shell command: effectively dropping the table and recreating a new one with the same split points.
18659
18660 Previously, HBTU.truncateTable instead issued deletes for all the data already in the table. If you wish to maintain the same behavior, you should use the newly added HBTU.deleteTableData method.
18661
18662
18663 ---
18664
18665 * [HBASE-14047](https://issues.apache.org/jira/browse/HBASE-14047) | *Major* | **Cleanup deprecated APIs from Cell class**
18666
18667 The following API from Cell (which were deprecated since past few major versions) are removed now.
18668 getRow
18669 getFamily
18670 getQualifier
18671 getValue
18672 getMvccVersion
18673 The above apis can be replaced with their respective CellUtil#cloneXXX (allocates a copy) or Cell#getXXXArray (essentially just returns a pointer) based on the use case.
18674
18675
18676 ---
18677
18678 * [HBASE-14029](https://issues.apache.org/jira/browse/HBASE-14029) | *Major* | **getting started for standalone still references hadoop-version-specific binary artifacts**
18679
18680 HBASE-14029 Correct documentation for Hadoop version specific artifacts
18681
18682
18683 ---
18684
18685 * [HBASE-13849](https://issues.apache.org/jira/browse/HBASE-13849) | *Major* | **Remove restore and clone snapshot from the WebUI**
18686
18687 The HBase master status web page no longer allows operators to clone snapshots nor restore snapshots.
18688
18689
18690 ---
18691
18692 * [HBASE-13646](https://issues.apache.org/jira/browse/HBASE-13646) | *Major* | **HRegion#execService should not try to build incomplete messages**
18693
18694 When RegionServerCoprocessors throw an exception we will no longer attempt to build an incomplete RPC response message. Instead, the response message will be null.
18695
18696
18697 ---
18698
18699 * [HBASE-13639](https://issues.apache.org/jira/browse/HBASE-13639) | *Major* | **SyncTable - rsync for HBase tables**
18700
18701 Tool to sync two tables that tries to send the differences only like rsync.
18702
18703 Adds two new MapReduce jobs, SyncTable and HashTable. See usage for these jobs on how to use. See design doc for generally overview: https://docs.google.com/document/d/1-2c9kJEWNrXf5V4q\_wBcoIXfdchN7Pxvxv1IO6PW0-U/edit
18704
18705 From comments below, "It can be challenging to run against a table getting live writes, if those writes are updates/overwrites. In general, you can run it against a time range to ignore new writes, but if those writes update existing cells, then the time range scan may or may not see older versions of those cells depending on whether major compaction has happened, which may be different in remote clusters."
18706
18707
18708 ---
18709
18710 * [HBASE-13895](https://issues.apache.org/jira/browse/HBASE-13895) | *Critical* | **DATALOSS: Region assigned before WAL replay when abort**
18711
18712 If the master went to assign a region concurrent with a RegionServer abort, the returned RegionServerAbortedException was being handled as though the region had been cleanly offlined so assign was allowed proceed. If the region was opened in its new location before WAL replay completion, the replayed edits were ignored, worst case, or were later played over the top of edits that had come in since open and so susceptible to overwrite. In either case, DATALOSS.
18713
18714
18715 ---
18716
18717 * [HBASE-13983](https://issues.apache.org/jira/browse/HBASE-13983) | *Minor* | **Doc how the oddball HTable methods getStartKey, getEndKey, etc. will be removed in 2.0.0**
18718
18719 Adds extra doc on getStartKeys, getEndKeys, and getStartEndKeys in HTable explaining that they will be removed in 2.0.0 (these methods did not get the proper full major version deprecation cycle).
18720
18721 In this issue, we actually also remove these methods in master/2.0.0 branch.
18722
18723
18724 ---
18725
18726 * [HBASE-13747](https://issues.apache.org/jira/browse/HBASE-13747) | *Critical* | **Promote Java 8 to "yes" in support matrix**
18727
18728 Java 8 is considered supported and tested as of HBase 1.2+
18729
18730
18731 ---
18732
18733 * [HBASE-13959](https://issues.apache.org/jira/browse/HBASE-13959) | *Critical* | **Region splitting uses a single thread in most common cases**
18734
18735 The performance of region splitting has been improved by using a thread pool to split the store files concurrently. Prior to this change, the store files were always split sequentially in a single thread, so a region with multiple store files ended up taking several seconds. The thread pool is sized dynamically with the aim of getting maximum concurrency, without exceeding the number of cores available for HBase Java process. A lower limit for the thread pool can be explicitly set using the property hbase.regionserver.region.split.threads.max.
18736
18737
18738 ---
18739
18740 * [HBASE-13930](https://issues.apache.org/jira/browse/HBASE-13930) | *Major* | **Exclude Findbugs packages from shaded jars**
18741
18742 Exclude Findbugs packages from shaded jars
18743
18744
18745 ---
18746
18747 * [HBASE-13214](https://issues.apache.org/jira/browse/HBASE-13214) | *Major* | **Remove deprecated and unused methods from HTable class**
18748
18749 **WARNING: No release note provided for this change.**
18750
18751
18752 ---
18753
18754 * [HBASE-13869](https://issues.apache.org/jira/browse/HBASE-13869) | *Trivial* | **Fix typo in HBase book**
18755
18756 Fix typo in HBase book
18757
18758
18759 ---
18760
18761 * [HBASE-13938](https://issues.apache.org/jira/browse/HBASE-13938) | *Major* | **Deletes done during the region merge transaction may get eclipsed**
18762
18763 Use the master's timestamp when sending hbase:meta edits on region merge to ensure proper ordering of new region addition and old region deletes.
18764
18765
18766 ---
18767
18768 * [HBASE-13898](https://issues.apache.org/jira/browse/HBASE-13898) | *Minor* | **correct additional javadoc failures under java 8**
18769
18770 Correct Javadoc generation errors
18771
18772
18773 ---
18774
18775 * [HBASE-13103](https://issues.apache.org/jira/browse/HBASE-13103) | *Major* | **[ergonomics] add region size balancing as a feature of master**
18776
18777 This patch adds optional ability for HMaster to normalize regions in size (disabled by default, change hbase.normalizer.enabled property to true to turn it on). If enabled, HMaster periodically (every 30 minutes by default) monitors tables for which normalization is enabled in table configuration and performs splits/merges as seems appropriate. Users may implement their own normalization strategies by implementing RegionNormalizer interface and configuring it in hbase-site.xml.
18778
18779
18780 ---
18781
18782 * [HBASE-13900](https://issues.apache.org/jira/browse/HBASE-13900) | *Minor* | **duplicate methods between ProtobufMagic and ProtobufUtil**
18783
18784 Use ProtobufMagic methods in ProtobufUtil
18785
18786
18787 ---
18788
18789 * [HBASE-13843](https://issues.apache.org/jira/browse/HBASE-13843) | *Trivial* | **Fix internal constant text in ReplicationManager.java**
18790
18791 In previous versions of HBase, the ReplicationAdmin utility erroneously used the string key "columnFamlyName" when listing replicated column families. It now uses the corrected spelling of "columnFamilyName" (note the added "i").
18792
18793 Downstream code that parsed the replication entries returned from listReplicated will need to be updated to use the new key. Previously compiled code that relied on the static CFNAME member of ReplicationAdmin will need to be recompiled in order to see the updated value.
18794
18795
18796 ---
18797
18798 * [HBASE-13886](https://issues.apache.org/jira/browse/HBASE-13886) | *Major* | **Return empty value when the mob file is corrupt instead of throwing exceptions**
18799
18800 By default the Get/Scan will throw Exception when it is not able to find a mob cell because the mob file is missing/corrupted. This jira adds a facility to continue scan/get and get other cells with mob cell value as empty. Set an attribute MobConstants.EMPTY\_VALUE\_ON\_MOBCELL\_MISS = true in Scan/Get for getting this behaviour
18801
18802
18803 ---
18804
18805 * [HBASE-13686](https://issues.apache.org/jira/browse/HBASE-13686) | *Major* | **Fail to limit rate in RateLimiter**
18806
18807 As per this jira contribution. We now support two kinds of RateLimiter.
18808 1) org.apache.hadoop.hbase.quotas.AverageIntervalRateLimiter : This limiter will refill resources at every TimeUnit/resources interval.
18809 Example: For a limiter configured with 10resources/second, then 1resource will be refilled after every 100ms.
18810
18811 2) org.apache.hadoop.hbase.quotas.FixedIntervalRateLimiter: This limiter will refill resources only after a given fixed interval of time.
18812
18813 Client can configure anyone of this rate limiter for the cluster by setting the value for the property "hbase.quota.rate.limiter" in the hbase-site.xml. org.apache.hadoop.hbase.quotas.AverageIntervalRateLimiter is the default value.
18814 Note: Client needs to restart the cluster for the configuration to take into effect.
18815
18816
18817 ---
18818
18819 * [HBASE-13816](https://issues.apache.org/jira/browse/HBASE-13816) | *Major* | **Build shaded modules only in release profile**
18820
18821 hbase-shaded-client and hbase-shaded-server modules will not build the actual jars unless -Prelease is supplied in mvn.
18822
18823
18824 ---
18825
18826 * [HBASE-13754](https://issues.apache.org/jira/browse/HBASE-13754) | *Major* | **Allow non KeyValue Cell types also to oswrite**
18827
18828 This jira has removed the already deprecated method
18829 KeyValue#oswrite(final KeyValue kv, final OutputStream out)
18830
18831
18832 ---
18833
18834 * [HBASE-13375](https://issues.apache.org/jira/browse/HBASE-13375) | *Major* | **Provide HBase superuser higher priority over other users in the RPC handling**
18835
18836 This JIRA modifies the signature of PriorityFunction#getPriority() method to also take request user as a parameter; all RPC requests sent by super users (as determined by cluster configuration) are executed with Admin QoS.
18837
18838
18839 ---
18840
18841 * [HBASE-5980](https://issues.apache.org/jira/browse/HBASE-5980) | *Minor* | **Scanner responses from RS should include metrics on rows/KVs filtered**
18842
18843 Adds scan metrics to the result. In the shell, set the ALL\_METRICS attribute to true on your scan to see dump of metrics after results (see the scan help for examples).
18844
18845 If you would prefer to see only a subset of the metrics, the METRICS array can be defined to include the names of only the metrics you care about.
18846
18847
18848 ---
18849
18850 * [HBASE-13698](https://issues.apache.org/jira/browse/HBASE-13698) | *Major* | **Add RegionLocator methods to Thrift2 proxy.**
18851
18852 Added getRegionLocation and getAllRegionLocations to the thrift2 interface.
18853
18854
18855 ---
18856
18857 * [HBASE-13636](https://issues.apache.org/jira/browse/HBASE-13636) | *Major* | **Remove deprecation for HBASE-4072 (Reading of zoo.cfg)**
18858
18859 Purge support for parsing zookeepers zoo.cfg deprecated since hbase-0.96.0
18860
18861
18862 ---
18863
18864 * [HBASE-13071](https://issues.apache.org/jira/browse/HBASE-13071) | *Major* | **Hbase Streaming Scan Feature**
18865
18866 MOTIVATION
18867
18868 A pipelined scan API is introduced for speeding up applications that combine massive data traversal with compute-intensive processing. Traditional HBase scans save network trips through prefetching the data to the client side cache. However, they prefetch synchronously: the fetch request to regionserver is invoked only when the entire cache is consumed. This leads to a stop-and-wait access pattern, in which the client stalls until the next chunk of data is fetched. Applications that do significant processing can benefit from background data prefetching, which eliminates this bottleneck. The pipelined scan implementation overlaps the cache population at the client side with application processing. Namely, it issues a new scan RPC when the iteration retrieves 50% of the cache. If the application processing (that is, the time between invocations of next()) is substantial, the new chunk of data will be available before the previous one is exhausted, and the client will not experience any delay. Ideally, the prefetch and the processing times should be balanced.
18869
18870 API AND CONFIGURATION
18871
18872 Asynchronous scanning can be configured either globally for all tables and scans, or on per-scan basis via a new Scan class API.
18873
18874 Configuration in hbase-site.xml: hbase.client.scanner.async.prefetch, default false:
18875
18876  \<property\>
18877    \<name\>hbase.client.scanner.async.prefetch\</name\>
18878    \<value\>true\</value\>
18879  \</property\>
18880
18881 API - Scan#setAsyncPrefetch(boolean)
18882
18883       Scan scan = new Scan();
18884       scan.setCaching(1000);
18885       scan.setMaxResultSize(BIG\_SIZE);
18886       scan.setAsyncPrefetch(true);
18887         ...
18888       ResultScanner scanner = table.getScanner(scan);
18889
18890 IMPLEMENTATION NOTES
18891
18892 Pipelined scan is implemented by a new ClientAsyncPrefetchScanner class, which is fully API-compatible with the synchronous ClientSimpleScanner. ClientAsyncPrefetchScanner is not instantiated in case of small (Scan#setSmall) and reversed (Scan#setReversed) scanners. The application is responsible for setting the prefetch size in a way that the prefetch time and the processing times are balanced. Note that due to double buffering, the client side cache can use twice as much memory as the synchronous scanner.
18893
18894 Generally, this feature will put more load on the server (higher fetch rate -- which is the whole point).  Also, YMMV.
18895
18896
18897 ---
18898
18899 * [HBASE-13533](https://issues.apache.org/jira/browse/HBASE-13533) | *Trivial* | **section on configuring ~/.m2/settings.xml has no anchor**
18900
18901 Correct setting.xml anchor in book
18902
18903
18904 ---
18905
18906 * [HBASE-13625](https://issues.apache.org/jira/browse/HBASE-13625) | *Major* | **Use HDFS for HFileOutputFormat2 partitioner's path**
18907
18908 Introduces a new config hbase.fs.tmp.dir which is a directory in HDFS (or default file system) to use as a staging directory for HFileOutputFormat2. This is also used as the default for hbase.bulkload.staging.dir
18909
18910
18911 ---
18912
18913 * [HBASE-10800](https://issues.apache.org/jira/browse/HBASE-10800) | *Major* | **Use CellComparator instead of KVComparator**
18914
18915 From 2.0 branch onwards KVComparator and its subclasses MetaComparator, RawBytesComparator are all deprecated.
18916 All the comparators are moved to CellComparator.  MetaCellComparator, a subclass of CellComparator, will be used to compare hbase:meta cells.
18917 Previously exposed static instances KeyValue.COMPARATOR, KeyValue.META\_COMPARATOR and KeyValue.RAW\_COMPARATOR are deprecated instead use CellComparator.COMPARATOR and CellComparator.META\_COMPARATOR.
18918 Also note that there will be no RawBytesComparator.  Where ever we need to compare raw bytes use Bytes.BYTES\_RAWCOMPARATOR.
18919 CellComparator will always operate on cells and its components, abstracting the fact that a cell can be backed by a single byte[] as opposed to how KVComparators were working.
18920
18921
18922 ---
18923
18924 * [HBASE-13333](https://issues.apache.org/jira/browse/HBASE-13333) | *Major* | **Renew Scanner Lease without advancing the RegionScanner**
18925
18926 Adds a renewLease call to ClientScanner
18927
18928
18929 ---
18930
18931 * [HBASE-13564](https://issues.apache.org/jira/browse/HBASE-13564) | *Major* | **Master MBeans are not published**
18932
18933 To use the coprocessor-based JMX implementation provided by HBase for Master.
18934 Add below property in hbase-site.xml file:
18935
18936 \<property\>
18937   \<name\>hbase.coprocessor.master.classes\</name\>
18938   \<value\>org.apache.hadoop.hbase.JMXListener\</value\>
18939 \</property\>
18940
18941 NOTE: DO NOT set \`com.sun.management.jmxremote.port\` for Java VM at the same time.
18942
18943 By default, the JMX listens on TCP port 10101 for Master, we can further configure the port using below properties:
18944
18945 \<property\>
18946   \<name\>master.rmi.registry.port\</name\>
18947   \<value\>61110\</value\>
18948 \</property\>
18949 \<property\>
18950   \<name\>master.rmi.connector.port\</name\>
18951   \<value\>61120\</value\>
18952 \</property\>
18953 ----
18954
18955 The registry port can be shared with connector port in most cases, so you only need to configure master.rmi.registry.port.
18956 However if you want to use SSL communication, the 2 ports must be configured to different values.
18957
18958
18959 ---
18960
18961 * [HBASE-13537](https://issues.apache.org/jira/browse/HBASE-13537) | *Major* | **Procedure V2 - Change the admin interface for async operations to return Future (incompatible with branch-1.x)**
18962
18963 As we made changes to return types in asynchronous methods of Admin API, this change is going to break binary compatibility. The source compatibility is kept intact though. The applications running against this change needs to be recompiled to keep things working.
18964
18965
18966 ---
18967
18968 * [HBASE-13517](https://issues.apache.org/jira/browse/HBASE-13517) | *Major* | **Publish a client artifact with shaded dependencies**
18969
18970 HBase now provides added convenience artifacts that shade most dependencies. These jars hbase-shaded-client and hbase-shaded-server are meant to be used when dependency conflicts can not be solved any other way. The normal jars hbase-client and hbase-server should still be preferred when possible.
18971
18972 Do not use hbase-shaded-server or hbase-shaded-client inside of a co-processor as bad things will happen.
18973
18974
18975 ---
18976
18977 * [HBASE-13149](https://issues.apache.org/jira/browse/HBASE-13149) | *Blocker* | **HBase MR is broken on Hadoop 2.5+ Yarn**
18978
18979 In HBase 1.1.0 and above we have upgraded the version of Jackson dependencies (jackson-core-asl, jackson-mapper-asl, jackson-jaxrs and jackson-xc) from 1.8.8 to 1.9.13. This is to follow the upgrade to Jackson 1.9.13 in Hadoop 2.5 and above which causes Jackson class incompatibility for HBase as reported in HBASE-13149.  Refer to HADOOP-10104 and YARN-2092 for additional information. Jackson1.9.13 is not completely backward compatible with the prior version 1.8.8 used in HBase. See the Compatibility reports attached in HBASE-13149 and http://svn.codehaus.org/jackson/trunk/release-notes/VERSION for more information.
18980
18981 This upgrade does not have direct impact on HBase users and HBase applications in most cases. In the rare case where your HBase application uses Jackson directly AND your application has compatibility issue with Jackson 1.9.13, you can do the following to mitigate the problem.
18982
18983 1. If you are on Hadoop 2.5 or above, and your HBase application involves running Yarn jobs, we recommend you update your application to use Jackson 1.9.13. You may be able to explore classpath isolation options (e.g. HADOOP-10893) or have your own classpath isolation strategy that works for you, but the general recommendation is that you upgrade to Jackson 1.9.13.
18984 2. You may choose to continue using Jackson 1.8.8 and not to use Jackson 1.9.13 in your classpath.  You can also choose to replace the Jackson 1.9.13 jars in $HBASE\_HOME/lib with 1.8.8 jars.  It can work for you in the following cases:
18985 a) You are on a Hadoop version earlier than Hadoop 2.5,  or
18986 b) You are on Hadoop 2.5 or above, but your HBase application does not involve running Yarn jobs.
18987 3. You may experiment with further isolation using the shaded jars introduced with 1.1.0 via HBASE-13517.
18988
18989 Note that it may not be tested or guaranteed that using Jackson 1.8.8 in $HBASE\_HOME/lib will work in future HBase releases.
18990 It is recommended that your HBase application matches the Jackson version provided in HBase.
18991
18992 In HBase 0.98.x and HBase 1.0.x, we have NOT upgraded the version of Jackson dependencies. If you are on Hadoop 2.5 or above, and your HBase application involves running Yarn jobs, you may encounter Jackson class incomparability issue, as reported in HBASE-13149.
18993
18994 You can do the following to mitigate the problem:
18995 1. Use 'hadoop jar' command to run your HBase jobs.
18996 2. Explore classpath isolation options (e.g. HADOOP-10893) or have your own classpath isolation strategy that works for you.
18997 3. You can also choose to replace the Jackson 1.8.8 jars in $HBASE\_HOME/lib with 1.9.13 jars from your Hadoop lib directory. We have tested HBase 0.98 with Jackson 1.9.13.
18998
18999
19000 ---
19001
19002 * [HBASE-13481](https://issues.apache.org/jira/browse/HBASE-13481) | *Major* | **Master should respect master (old) DNS/bind related configurations**
19003
19004 Master now honors configuration options as was before 1.0.0 releases:
19005 hbase.master.ipc.address
19006 hbase.master.dns.interface
19007 hbase.master.dns.nameserver
19008 hbase.master.info.bindAddress
19009 This jira also adds hbase.master.hostname parameter as an extension to HBASE-12954.
19010
19011
19012 ---
19013
19014 * [HBASE-13090](https://issues.apache.org/jira/browse/HBASE-13090) | *Major* | **Progress heartbeats for long running scanners**
19015
19016 Previously, there was no way to enforce a time limit on scan RPC requests. The server would receive a scan RPC request and take as much time as it needed to accumulate enough results to reach a limit or exhaust the region. The problem with this approach was that, in the case of a very selective scan, the processing of the scan could take too long and cause timeouts client side.
19017
19018 With this fix, the server will now enforce a time limit on the execution of scan RPC requests. When a scan RPC request arrives to the server, a time limit is calculated to be half of whichever timeout value is more restictive between the configurations ("hbase.client.scanner.timeout.period" and "hbase.rpc.timeout"). When the time limit is reached, the server will return whatever results it has accumulated up to that point. The results may be empty.
19019
19020 To ensure that timeout checks do not occur too often (which would hurt the performance of scans), the configuration "hbase.cells.scanned.per.heartbeat.check" has been introduced. This configuration controls how often System.currentTimeMillis() is called to update the progress towards the time limit. Currently, the default value of this configuration value is 10000. Specifying a smaller value will provide a tighter bound on the time limit, but may hurt scan performance due to the higher frequency of calls to System.currentTimeMillis().
19021
19022 Protobuf models for ScanRequest and ScanResponse have been updated so that heartbeat support can be communicated. Support for heartbeat messages is specified in the request sent to the server via ScanRequest.Builder#setClientHandlesHeartbeats. Only when the server sees that ScanRequest#getClientHandlesHeartbeats() is true will it send heartbeat messages back to the client. A response is marked as a heartbeat message via the boolean flag ScanResponse#getHeartbeatMessage
19023
19024
19025 ---
19026
19027 * [HBASE-13307](https://issues.apache.org/jira/browse/HBASE-13307) | *Major* | **Making methods under ScannerV2#next inlineable, faster**
19028
19029 Made methods smaller under Scanner#next so inlinable and compilable (was getting 'too big to compile' from hotspot). Use of unsafe to parse shorts rather than use BB#getShort... faster, etc.
19030
19031
19032 ---
19033
19034 * [HBASE-13453](https://issues.apache.org/jira/browse/HBASE-13453) | *Critical* | **Master should not bind to region server ports**
19035
19036 In 1.0.x, master by default binds to the region server ports (both rpc and info). This change brings back the usage of old master rpc and info ports in 1.1+ and master (2.0) branches. The motivation for this change is to ease the life of the user so that he does not need to do anything to bring up a RS on the same host and also to make the migration from 0.98 to 1.1  hassle free.  However, the users going from 1.0 to 1.1 would see the change in the master ports.
19037
19038
19039 ---
19040
19041 * [HBASE-13419](https://issues.apache.org/jira/browse/HBASE-13419) | *Major* | **Thrift gateway should propagate text from exception causes.**
19042
19043 Compose thrift exception text from the text of the entire cause chain of the underlying exception.
19044
19045
19046 ---
19047
19048 * [HBASE-13275](https://issues.apache.org/jira/browse/HBASE-13275) | *Major* | **Setting hbase.security.authorization to false does not disable authorization**
19049
19050 Prior to this change the configuration setting 'hbase.security.authorization' had no effect if security coprocessor were installed. The act of installing the security coprocessors was assumed to indicate active authorizaton was desired and required. Now it is possible to install the security coprocessors yet have them operate in a passive state with active authorization disabled by setting 'hbase.security.authorization' to false. This can be useful but is probably not what you want. For more information, consult the Security section of the HBase online manual.
19051
19052 'hbase.security.authorization' defaults to true for backwards comptatible behavior.
19053
19054
19055 ---
19056
19057 * [HBASE-13118](https://issues.apache.org/jira/browse/HBASE-13118) | *Major* | **[PE] Add being able to write many columns**
19058
19059 Adds a --columns option to PE so you can write more than one column (changes default qualifier from 'data' to '0').
19060
19061
19062 ---
19063
19064 * [HBASE-13270](https://issues.apache.org/jira/browse/HBASE-13270) | *Major* | **Setter for Result#getStats is #addResults; confusing!**
19065
19066 Deprecates Result#addResults in favor of Result#setStatistics
19067
19068
19069 ---
19070
19071 * [HBASE-13362](https://issues.apache.org/jira/browse/HBASE-13362) | *Major* | **Set max result size from client only (like scanner caching).**
19072
19073 This introduces a new config option: hbase.server.scanner.max.result.size
19074 This setting enforces a maximum result size (in bytes), when reached the server will return the results is has so far.
19075 This is a safety setting and should be kept large. The default is inifinite in 0.98 and 1.0.x and 100mb in 1.1 and later.
19076
19077 Use hbase.client.scanner.max.result.size instead to enforce practical chunk sizes of a few mb (defaults to 2mb)
19078
19079
19080 ---
19081
19082 * [HBASE-11544](https://issues.apache.org/jira/browse/HBASE-11544) | *Critical* | **[Ergonomics] hbase.client.scanner.caching is dogged and will try to return batch even if it means OOME**
19083
19084 Results returned from RPC calls may now be returned as partials
19085
19086 When is a Result marked as a partial?
19087 When the server must stop the scan because the max size limit has been reached. Means that the LAST Result returned within the ScanResult's Result array may be marked as a partial if the scan's max size limit caused it to stop in the middle of a row.
19088
19089 Incompatible Change: The return type of InternalScanners#next and RegionScanners#nextRaw has been changed to NextState from boolean
19090 The previous boolean return value can be accessed via NextState#hasMoreValues()
19091 Provides more context as to what happened inside the scanner
19092
19093 Scan caching default has been changed to Integer.Max\_Value
19094 This value works together with the new maxResultSize value from HBASE-12976 (defaults to 2MB)
19095 Results returned from server on basis of size rather than number of rows
19096 Provides better use of network since row size varies amongst tables
19097
19098 Protobuf models have changed for Result, ScanRequest, and ScanResponse to support new partial Results
19099
19100 Partial Results should be invisible to application layer unless Scan#setAllowPartials is set
19101
19102 Scan#setAllowPartials has been added to allow the application to request to see the partial Results returned by the server rather than have the ClientScanner form the complete Result prior to returning it to the application
19103
19104 To disable the use of partial Results on the server, set ScanRequest.Builder#setClientHandlesPartials() to be false in the ScanRequest issued to server
19105
19106 Partial Results should allow the server to return large rows in parts rather than accumulate all the cells for that particular row and run out of memory
19107
19108
19109 ---
19110
19111 * [HBASE-11864](https://issues.apache.org/jira/browse/HBASE-11864) | *Minor* | **Enhance HLogPrettyPrinter to print information from WAL Header**
19112
19113 Enhance WALPrettyPrinter to print information (writer classnames and cell codec classname) from WAL Header
19114
19115
19116 ---
19117
19118 * [HBASE-13289](https://issues.apache.org/jira/browse/HBASE-13289) | *Major* | **typo in splitSuccessCount  metric**
19119
19120 In hbase 1.0.0, 0.98.10, 0.98.10.1, 0.98.11, and 0.98.12 'splitSuccessCount' was misspelled as 'splitSuccessCounnt'
19121
19122
19123 ---
19124
19125 * [HBASE-12990](https://issues.apache.org/jira/browse/HBASE-12990) | *Major* | **MetaScanner should be replaced by MetaTableAccessor**
19126
19127 Removes MetaScanner. Use MetaTableAccessor instead.
19128
19129
19130 ---
19131
19132 * [HBASE-13373](https://issues.apache.org/jira/browse/HBASE-13373) | *Major* | **Squash HFileReaderV3 together with HFileReaderV2 and AbstractHFileReader; ditto for Scanners and BlockReader, etc.**
19133
19134 Marking as incompatible change. Requires hfiles be major version \>= 2 and \>= minor version 3.  Version 3 files are enabled by default in 1.0.  0.98 writes version 2 minor version 3.  You cannot go to 1.0 from anything before 0.98.
19135
19136
19137 ---
19138
19139 * [HBASE-13252](https://issues.apache.org/jira/browse/HBASE-13252) | *Major* | **Get rid of managed connections and connection caching**
19140
19141 For a long time, HBase supported 2 types of connections - managed, which were cached and closed automatically when not needed, and unmanaged, where user is responsible for closing the connections by calling #close() on them.
19142
19143 The concept of managed connections in HBase (deprecated before) has now been extinguished completely, and now all callers are responsible for managing the lifecycle of connections they acquire.
19144
19145
19146 ---
19147
19148 * [HBASE-12954](https://issues.apache.org/jira/browse/HBASE-12954) | *Minor* | **Ability impaired using HBase on multihomed hosts**
19149
19150 The following config is added by this JIRA:
19151
19152 hbase.regionserver.hostname
19153
19154 This config is for experts: don't set its value unless you really know what you are doing.
19155 When set to a non-empty value, this represents the (external facing) hostname for the underlying server.
19156 See https://issues.apache.org/jira/browse/HBASE-12954 for details.
19157
19158 Caution: please make sure rolling upgrade succeeds before turning on this feature.
19159
19160
19161 ---
19162
19163 * [HBASE-13187](https://issues.apache.org/jira/browse/HBASE-13187) | *Critical* | **Add ITBLL that exercises per CF flush**
19164
19165 Pass the -D flag generator.multiple.columnfamilies on the command-line if you want the generator to write three column families rather than the default one. When set, we will write the usual 'meta' column family and use it checking linked-list is wholesome but we will also write a 'tiny' column family and a 'big' column family to provoke uneven flushing; good for testing the flush-by-columnfamily feature.
19166
19167
19168 ---
19169
19170 * [HBASE-13361](https://issues.apache.org/jira/browse/HBASE-13361) | *Minor* | **Remove or undeprecate {get\|set}ScannerCaching in HTable**
19171
19172 Removed getScannerCaching and setScannerCaching from Table
19173
19174
19175 ---
19176
19177 * [HBASE-10728](https://issues.apache.org/jira/browse/HBASE-10728) | *Major* | **get\_counter value is never used.**
19178
19179 for 0.98 and 1.0 changes are compatible (due to mitigation by HBASE-13433):
19180
19181 \* The "get\_counter" command no longer requires a dummy 4th argument. Downstream users are encouraged to migrate code to not pass this argument because it will result in an error for HBase 1.1+.
19182 \* The "incr" command now outputs the current value of the counter to stdout.
19183 ex:
19184 {code}
19185 jruby-1.6.8 :005 \> incr 'counter\_example', 'r1', 'cf1:foo', 10
19186 COUNTER VALUE = 1772
19187 0 row(s) in 0.1180 seconds
19188 {code}
19189
19190 for 1.1+ changes are incompatible:
19191
19192 \* The "get\_counter" command no longer accepts a dummy 4th argument. Downstream users will need to update their code to not pass this argument.
19193 ex:
19194 {code}
19195 jruby-1.6.8 :006 \> get\_counter 'counter\_example', 'r1', 'cf1:foo'
19196 COUNTER VALUE = 1772
19197
19198 {code}
19199 \* The "incr" command now outputs the current value of the counter to stdout.
19200 ex:
19201 {code}
19202 jruby-1.6.8 :005 \> incr 'counter\_example', 'r1', 'cf1:foo', 10
19203 COUNTER VALUE = 1772
19204 0 row(s) in 0.1180 seconds
19205 {code}
19206
19207
19208 ---
19209
19210 * [HBASE-13170](https://issues.apache.org/jira/browse/HBASE-13170) | *Major* | **Allow block cache to be external**
19211
19212 HBase can use memcached as an external block cache. To use this change your config to set hbase.blockcache.use.external to true and hbase.cache.memcached.servers to contain the list of memcached servers to use.
19213
19214
19215 ---
19216
19217 * [HBASE-13316](https://issues.apache.org/jira/browse/HBASE-13316) | *Minor* | **Reduce the downtime on planned moves of regions**
19218
19219 When issuing an Admin.move command the RegionServer that receive the region will try and open the StoreFiles of that region to prime the block cache with index blocks.
19220
19221
19222 ---
19223
19224 * [HBASE-13298](https://issues.apache.org/jira/browse/HBASE-13298) | *Critical* | **Clarify if Table.{set\|get}WriteBufferSize() is deprecated or not**
19225
19226 Deprecate said methods. They were mistakenly included in Table Interface.
19227
19228
19229 ---
19230
19231 * [HBASE-13248](https://issues.apache.org/jira/browse/HBASE-13248) | *Major* | **Make HConnectionImplementation top-level class.**
19232
19233 **WARNING: No release note provided for this change.**
19234
19235
19236 ---
19237
19238 * [HBASE-13331](https://issues.apache.org/jira/browse/HBASE-13331) | *Blocker* | **Exceptions from DFS client can cause CatalogJanitor to delete referenced files**
19239
19240 Fixes an issue where files from a split region that were still referenced were erroneously deleted leading to data loss.
19241
19242
19243 ---
19244
19245 * [HBASE-13273](https://issues.apache.org/jira/browse/HBASE-13273) | *Major* | **Make Result.EMPTY\_RESULT read-only; currently it can be modified**
19246
19247 The Result.EMPTY\_RESULT object is now immutable. In previous releases, the object could be modified by a caller to no longer be empty. Code that relies on this behavior will now receive an UnsupportedOperationException.
19248
19249
19250 ---
19251
19252 * [HBASE-12867](https://issues.apache.org/jira/browse/HBASE-12867) | *Major* | **Shell does not support custom replication endpoint specification**
19253
19254 Adds support to add\_peer in hbase shell to add a custom replication endpoint from HBASE-12254.
19255
19256
19257 ---
19258
19259 * [HBASE-13198](https://issues.apache.org/jira/browse/HBASE-13198) | *Major* | **Remove HConnectionManager**
19260
19261 **WARNING: No release note provided for this change.**
19262
19263
19264 ---
19265
19266 * [HBASE-12586](https://issues.apache.org/jira/browse/HBASE-12586) | *Major* | **Task 6 & 7 from HBASE-9117,  delete all public HTable constructors and delete ConnectionManager#{delete,get}Connection**
19267
19268 HTable class has been marked as private API before, and now it's no longer directly instantiable from client code (all public constructors have been removed). All clients should use Connection#getTable() and Connection#getRegionLocator() when appropriate to obtain Table and RegionLocator implementations to work with.
19269
19270
19271 ---
19272
19273 * [HBASE-13171](https://issues.apache.org/jira/browse/HBASE-13171) | *Minor* | **Change AccessControlClient methods to accept connection object to reduce setup time.**
19274
19275 **WARNING: No release note provided for this change.**
19276
19277
19278 ---
19279
19280 * [HBASE-12706](https://issues.apache.org/jira/browse/HBASE-12706) | *Critical* | **Support multiple port numbers in ZK quorum string**
19281
19282 hbase.zookeeper.quorum configuration now allows servers together with client ports consistent with the way Zookeeper java client accepts the quorum string. In this case, using hbase.zookeeper.clientPort is not needed. eg.  hbase.zookeeper.quorum=myserver1:2181,myserver2:20000,myserver3:31111
19283
19284
19285 ---
19286
19287 * [HBASE-13142](https://issues.apache.org/jira/browse/HBASE-13142) | *Major* | **[PERF] Reuse the IPCUtil#buildCellBlock buffer**
19288
19289 Adds buffer reuse sending Cell results. It is on by default and should not need configuration. Improves GC profile and ups throughput. The benefit gets better the larger the row size returned.
19290
19291 The buffer reservoir is bounded at a maximum count after which we will start logging at WARN level that the reservoir is running at capacity (returned buffers will be discarded and not added back to the reservoir pool). Default maximum is twice the handler count: i.e. 2 \* hbase.regionserver.handler.count. This should be more than enough. Set the maximum with the new configuration: hbase.ipc.server.reservoir.max
19292
19293 The reservoir will not cache buffers in excess of hbase.ipc.server.reservoir.max.buffer.size  The default is 10MB. This means that if a row is very large, then we will allocate a buffer of the average size that is currently in the pool and we will then resize it till we can accommodate the return. These resizes are expensive. The resultant buffer will be used and then discarded.
19294
19295 To check how the reservoir is doing, enable trace level logging for a few seconds on a regionserver. You can do this from the regionserver UI. See 'Log Level'. Set org.apache.hadoop.hbase.io.BoundedByteBufferPool to TRACE. The BoundedByteBufferPool will spew report to the log. Disable the TRACE level and then check the log. You'll see allocation rate, size of pool, size of buffers in pool, etc.
19296
19297
19298 ---
19299
19300 * [HBASE-13012](https://issues.apache.org/jira/browse/HBASE-13012) | *Major* | **Add shell commands to trigger the mob file compactor**
19301
19302 This adds two new shell commands -- compact\_mob and major\_compact\_mob to the hbase shell.
19303
19304 Run compaction on a mob enabled column family or all mob enabled column families within a table
19305           Examples:
19306           Compact a column family within a table:
19307           hbase\> compact\_mob 't1', 'c1'
19308           Compact all mob enabled column families
19309           hbase\> compact\_mob 't1'
19310
19311 Run major compaction on a mob enabled column family or all mob enabled column families within a table
19312           Examples:
19313           Compact a column family within a table:
19314           hbase\> major\_compact\_mob 't1', 'c1'
19315           Compact all mob enabled column families within a table
19316           hbase\> major\_compact\_mob 't1'
19317
19318
19319 ---
19320
19321 * [HBASE-12869](https://issues.apache.org/jira/browse/HBASE-12869) | *Major* | **Add a REST API implementation of the ClusterManager interface**
19322
19323 Adds an implementation of ClusterManager to control REST API-managed HBase clusters.
19324
19325
19326 ---
19327
19328 * [HBASE-13047](https://issues.apache.org/jira/browse/HBASE-13047) | *Trivial* | **Add "HBase Configuration" link missing on the table details pages**
19329
19330 Add a '/conf' link to UI
19331
19332
19333 ---
19334
19335 * [HBASE-13044](https://issues.apache.org/jira/browse/HBASE-13044) | *Minor* | **Configuration option for disabling coprocessor loading**
19336
19337 This change adds two new configuration options:
19338 - "hbase.coprocessor.enabled" controls globally if any coprocessors will be loaded. Set to "false" to disable. Defaults to "true" for compatibility with previous releases.
19339 - "hbase.coprocessor.user.enabled" controls if any user (aka table) coprocessors will be loaded. Set to "false" to disable. Defaults to "true" for compatibility with previous releases.
19340
19341
19342 ---
19343
19344 * [HBASE-12961](https://issues.apache.org/jira/browse/HBASE-12961) | *Minor* | **Negative values in read and write region server metrics**
19345
19346 Change read and write request count in ServerLoad from int to long
19347
19348
19349 ---
19350
19351 * [HBASE-7332](https://issues.apache.org/jira/browse/HBASE-7332) | *Minor* | **[webui] HMaster webui should display the number of regions a table has.**
19352
19353 Adds counts for various regions states to the table listing on main page. See attached screenshot.
19354
19355
19356 ---
19357
19358 * [HBASE-8329](https://issues.apache.org/jira/browse/HBASE-8329) | *Major* | **Limit compaction speed**
19359
19360 Adds compaction throughput limit mechanism(the word "throttle" is already used when choosing compaction thread pool, so use a different word here to avoid ambiguity). Default is org.apache.hadoop.hbase.regionserver.compactions.PressureAwareCompactionThroughputController, will limit throughput as follow:
19361 1. In off peak hours, use a fixed limitation "hbase.hstore.compaction.throughput.offpeak" (default is Long.MAX\_VALUE which means no limitation).
19362 2. In normal hours, the limitation is tuned between "hbase.hstore.compaction.throughput.lower.bound"(default 10MB/sec) and "hbase.hstore.compaction.throughput.higher.bound"(default 20MB/sec), using the formula "lower + (higer - lower) \* param" where param is in range [0.0, 1.0] and calculate based on store files count on this regionserver.
19363 3. If some stores have too many store files(storefilesCount \> blockingFileCount), then there is no limitation no matter peak or off peak.
19364 You can set "hbase.regionserver.throughput.controller" to org.apache.hadoop.hbase.regionserver.throttle.NoLimitThroughputController to disable throughput controlling.
19365 And we have implemented ConfigurationObserver which means you can change all configurations above and do not need to restart cluster.
19366
19367 The throttle is on by default in hbase-2.0.0. There is no limit in hbase-1.x.
19368
19369
19370 ---
19371
19372 * [HBASE-6778](https://issues.apache.org/jira/browse/HBASE-6778) | *Major* | **Deprecate Chore; its a thread per task when we should have one thread to do all tasks**
19373
19374 Corresponding usages for new ScheduledChore vs. Deprecated Chore:
19375 Chore.interrupt() -\> ScheduledChore.cancel(mayInterruptWhileRunning = true)
19376 Threads.setDaemonThreadRunning(Chore) -\> ChoreService.scheduleChore(ScheduledChore)
19377 Chore.isAlive -\> ScheduledChore.isScheduled()
19378 Chore.getSleeper().skipSleepCycle() -\> ScheduledChore.triggerNow()
19379
19380
19381 ---
19382
19383 * [HBASE-11574](https://issues.apache.org/jira/browse/HBASE-11574) | *Major* | **hbase:meta's regions can be replicated**
19384
19385 On the server side, set hbase.meta.replica.count to the number of replicas of meta that you want to have in the cluster (defaults to 1). hbase.regionserver. meta.storefile.refresh.period should be set to a non-zero number in milliseconds - something like 30000 (defaults to 0).
19386 On the client/user side, set hbase.meta.replicas.use to true.
19387
19388
19389 ---
19390
19391 * [HBASE-12808](https://issues.apache.org/jira/browse/HBASE-12808) | *Major* | **Use Java API Compliance Checker for binary/source compatibility**
19392
19393 Adds a dev-support/check\_compatibility.sh script for comparing versions. Run the script to see usage.
19394
19395
19396 ---
19397
19398 * [HBASE-12684](https://issues.apache.org/jira/browse/HBASE-12684) | *Major* | **Add new AsyncRpcClient**
19399
19400 Retrofit a new, netty-based rpc transport on the client. This client is slightly slower if little contention given the extra tier or so that netty adds and that we block on a Future waiting on the call to finish.  This client opens the way for HBase having a native Async API.
19401
19402 This client is on by default in master branch (2.0 hbase). It is off in branch-1.0 (hbase-1.1.x).  To enable it, set "hbase.rpc.client.impl" to "org.apache.hadoop.hbase.ipc.AsyncRpcClient"
19403
19404
19405 ---
19406
19407 * [HBASE-8410](https://issues.apache.org/jira/browse/HBASE-8410) | *Major* | **Basic quota support for namespaces**
19408
19409 Namespace auditor provides basic quota support for namespaces in terms of number of tables and number of regions. In order to use namespace quotas, quota support must be enabled by setting
19410 "hbase.quota.enabled" property to true in hbase-site.xml file.
19411
19412 The users can add quota information to namespace, while creating new namespaces or by altering existing ones.
19413
19414 Examples:
19415 1. create\_namespace 'ns1', {'hbase.namespace.quota.maxregions'=\>'10'}
19416 2. create\_namespace 'ns2', {'hbase.namespace.quota.maxtables'=\>'2','hbase.namespace.quota.maxregions'=\>'5'}
19417 3. alter\_namespace 'ns3', {METHOD =\> 'set', 'hbase.namespace.quota.maxtables'=\>'5','hbase.namespace.quota.maxregions'=\>'25'}
19418
19419 The quotas can be modified/added to namespace at any point of time. To remove quotas, the following command can be used:
19420
19421 alter\_namespace 'ns3', {METHOD =\> 'unset', NAME =\> 'hbase.namespace.quota.maxtables'}
19422 alter\_namespace 'ns3', {METHOD =\> 'unset', NAME =\> 'hbase.namespace.quota.maxregions'}
19423
19424
19425 ---
19426
19427 * [HBASE-12902](https://issues.apache.org/jira/browse/HBASE-12902) | *Major* | **Post-asciidoc conversion fix-ups**
19428
19429 Pushed to master. Shout if there are any issues.
19430
19431
19432 ---
19433
19434 * [HBASE-12848](https://issues.apache.org/jira/browse/HBASE-12848) | *Major* | **Utilize Flash storage for WAL**
19435
19436 For users on a version of Hadoop that supports tiered storage policies (i.e. Apache Hadoop 2.6.0+), HBase now allows users to opt-in to having the write ahead log placed on the SSD tier. Users on earlier versions of Hadoop will be unable to take advantage of this feature.
19437
19438 Use of tiered storage is controlled by a new RegionServer config, hbase.wal.storage.policy. It defaults to the value 'NONE', which will rely on HDFS defaults for a policy decision.
19439
19440 User can specify ONE\_SSD or ALL\_SSD as the value:
19441 ONE\_SSD: place only one replica of WAL files in SSD and the remaining in default storage
19442 ALL\_SSD: all replica for WAL files are placed on SSD
19443
19444 See [the HDFS docs on storage policy\|http://hadoop.apache.org/docs/r2.6.0/hadoop-project-dist/hadoop-hdfs/ArchivalStorage.html]
19445
19446
19447 ---
19448
19449 * [HBASE-11144](https://issues.apache.org/jira/browse/HBASE-11144) | *Major* | **Filter to support scanning multiple row key ranges**
19450
19451 MultiRowRangeFilter is a filter to support scanning multiple row key ranges. If the number of the ranges is small, using multiple scans can also do the same thing and can work well. But when the number of ranges are quite big (e.g. millions), use the MultiRowRangeFilter will be nice. In this filter, the ranges will be sorted and merged, so users do not have to take care of ranges are not continuous. And if users are using something like rest, thrift or pig to access the data the filter might be the practical solution.
19452
19453
19454 ---
19455
19456 * [HBASE-12268](https://issues.apache.org/jira/browse/HBASE-12268) | *Major* | **Add support for Scan.setRowPrefixFilter to shell**
19457
19458 Added new option, ROWPREFIXFILTER, to the scan command in the HBase shell to easily scan for a specific row prefix.
19459
19460
19461 ---
19462
19463 * [HBASE-12775](https://issues.apache.org/jira/browse/HBASE-12775) | *Major* | **CompressionTest ate my HFile (sigh!)**
19464
19465 CompressionTest will now abort when the target path exists.
19466
19467
19468 ---
19469
19470 * [HBASE-12695](https://issues.apache.org/jira/browse/HBASE-12695) | *Critical* | **JDK 1.8 compilation broken**
19471
19472 Use the -Pjavac maven profile in order to compile HBase using the compiler provided by the JDK instead of the default error-prone compiler plugin. This is useful for now if you are building HBase with JDK 1.8 or a JDK that doesn't support error-prone.
19473
19474
19475 ---
19476
19477 * [HBASE-10201](https://issues.apache.org/jira/browse/HBASE-10201) | *Major* | **Port 'Make flush decisions per column family' to trunk**
19478
19479 Adds new flushing policy mechanism. Default, org.apache.hadoop.hbase.regionserver.FlushLargeStoresPolicy, will try to avoid flushing out the small column families in a region, those whose memstores are \< hbase.hregion.percolumnfamilyflush.size.lower.bound. To restore the old behavior of flushes writing out all column families, set hbase.regionserver.flush.policy to org.apache.hadoop.hbase.regionserver.FlushAllStoresPolicy either in hbase-default.xml or on a per-table basis by setting the policy to use with HTableDescriptor.getFlushPolicyClassName().
19480
19481
19482 ---
19483
19484 * [HBASE-12559](https://issues.apache.org/jira/browse/HBASE-12559) | *Major* | **Provide LoadBalancer with online configuration capability**
19485
19486 updateConfiguration(ServerName server) method of Admin now updates config for HMaster as well.
19487 Specifically, config update would be taken by load balancer.
19488
19489
19490 ---
19491
19492 * [HBASE-10378](https://issues.apache.org/jira/browse/HBASE-10378) | *Major* | **Divide HLog interface into User and Implementor specific interfaces**
19493
19494 HBase internals for the write ahead log have been refactored. Advanced users of HBase should be aware of the following changes.
19495
19496 Public Audience
19497   - The Admin API for asking a region server to roll WAL files has changed from a synchronous command that returns a set of regions the WAL implementation would like flushed into an asynchronous command that returns nothing. Older clients relying on the former behavior will still be able to interact with newer servers, but the response body will always contain an empty list of regions to flush.
19498   - The shell command "hlog\_roll" has been deprecated. Operators should use the "wal\_roll" command instead. This command is subject to the changes described above for the Admin API to roll WAL files.
19499   - The command for analyzing write ahead logs has been renamed from 'hlog' to 'wal'. The old usage is deprecated and will be removed in a future version.
19500   - Some utility methods in the HBaseTesetingUtility related to testing write-ahead-logs were changed in incompatible ways. No functionality has been removed, but method names and arguments have changed. See the HBaseTestingUtility javadoc for details.
19501   - The WALPlayer utility has deprecated the configuration keys used for advanced customization. Users should switch to the updated configuration keys. See the usage information on the WALPlayer tool for details.
19502   - The HLogInputFormat utility class for processing logs with MapReduce has been deprecated and will be removed in a future version. Users should switch to the WALInputFormat.
19503   - The labeling of server metrics on the region server status pages changed. Previously, the number of backing files for the write ahead log was labeled 'Num. HLog Files'. If you wish to see this statistic now, please look for the label 'Num. WAL Files.'  If you rely on JMX for these metrics, their location has not changed.
19504
19505 LimitedPrivate(COPROC) Audience, LimitedPrivate(PHOENIX)
19506   - The RegionObserver API has been updated. The changes are both binary and source backwards compatible for coprocessors that use the BaseRegionObserver class. For those that implement RegionObserver directly the changes are binary backwards compatible. Depending on the internals of future HBase versions, coprocessors using the deprecated API may not see all WAL related events. Users are strongly encouraged to update their use of the API; see the RegionObserver javadoc for details.
19507   - Classes related to reading WAL entries (ReaderBase, ProtobufLogReader, SequenceFileLogReader) have changed in a backwards incompatible way. Users who referenced HLog.Reader directly or HLog.Entry will have to update. These changes do not impact compatibility with extant wal files.
19508   - The WALObserver API has been updated. The changes are both binary and source backwards compatible for coprocessors that use the BaseWALObserver class. For those that implement WALObserver directly the changes are binary backwards compatible. Depending on the internals of future HBase versions, coprocessors using the deprecated API may not see all WAL related events. Users are strongly encouraged to update their use of the API; see the WALObserver javadoc for details.
19509  - The WALCoprocessorEnvironment  has changed in a backwards incompatible way. WALObserver coprocessors that relied on retrieving an object representing the write ahead log instance will have to be updated.
19510
19511 LimitedPrivate(REPLICATION) Audience
19512  - The WALEntryFilter API has changed in a backwards incompatible way. Implementers will have to be updated.
19513  - The ReplicationEndpoint.ReplicateContext API has changed in a backwards incompatible way. Implementers who use this interface will have to be updated. These changes do not impact wire compatibility for replicating between clusters.
19514  - The HLogKey API is deprecated in favor of the WALKey API. Additionally, the HLogKey API has changed in a backwards incompatible way by changing from implementing WriteableComparable\<HLogKey\> to implementing Writeable and Comparable\<WALKey\>.
19515
19516
19517 ---
19518
19519 * [HBASE-11683](https://issues.apache.org/jira/browse/HBASE-11683) | *Major* | **Metrics for MOB**
19520
19521 Adds new mob related metrics:
19522
19523 mobCompactedIntoMobCellsCount
19524 mobCompactedIntoMobCellsSize
19525 mobCompactedFromMobCellsCount
19526 mobCompactedFromMobCellsSize
19527 mobFlushCount
19528 mobFlushedCellsCount
19529 mobFlushedCellsSize
19530 mobScanCellsCount
19531 mobScanCellsSize
19532 mobFileCacheAccessCount
19533 mobFileCacheMissCount
19534 mobFileCacheHitPercent
19535 mobFileCacheEvictedCount
19536 mobFileCacheCount
19537
19538
19539 ---
19540
19541 * [HBASE-11912](https://issues.apache.org/jira/browse/HBASE-11912) | *Major* | **Catch some bad practices at compile time with error-prone**
19542
19543 Errors from error-prone will fail the build in the compile phase. Warnings look like Javac warnings and are counted as such by test-patch etc
19544
19545
19546 ---
19547
19548 * [HBASE-12220](https://issues.apache.org/jira/browse/HBASE-12220) | *Major* | **Add hedgedReads and hedgedReadWins metrics**
19549
19550 Adds metrics hedgedReads and hedgedReadWins counts.
19551
19552
19553 ---
19554
19555 * [HBASE-6290](https://issues.apache.org/jira/browse/HBASE-6290) | *Minor* | **Add a function a mark a server as dead and start the recovery the process**
19556
19557 Adds a script to mark a server as dead.
19558
19559 Usage: considerAsDead.sh --hostname serverName
19560
19561
19562 ---
19563
19564 * [HBASE-12111](https://issues.apache.org/jira/browse/HBASE-12111) | *Major* | **Remove deprecated APIs from Mutation(s)**
19565
19566 Removed the below from hbase-2 (were deprecated on release of hbase-1.0.0)
19567
19568 Mutation setWriteToWAL(boolean)
19569 boolean getWriteToWAL()
19570 Mutation setFamilyMap(NavigableMap\<byte [], List\<KeyValue\>\>)
19571 NavigableMap\<byte [], List\<KeyValue\>\> getFamilyMap()
19572
19573
19574 ---
19575
19576 * [HBASE-12084](https://issues.apache.org/jira/browse/HBASE-12084) | *Major* | **Remove deprecated APIs from Result**
19577
19578 The below KeyValue based APIs are removed from Result
19579 KeyValue[] raw()
19580 List\<KeyValue\> list()
19581 List\<KeyValue\> getColumn(byte [] family, byte [] qualifier)
19582 KeyValue getColumnLatest(byte [] family, byte [] qualifier)
19583 KeyValue getColumnLatest(byte [] family, int foffset, int flength, byte [] qualifier, int qoffset, int qlength)
19584
19585 They are replaced with
19586 Cell[] rawCells()
19587 List\<Cell\> listCells()
19588 List\<Cell\> getColumnCells(byte [] family, byte [] qualifier)
19589 Cell getColumnLatestCell(byte [] family, byte [] qualifier)
19590 Cell getColumnLatestCell(byte [] family, int foffset, int flength, byte [] qualifier, int qoffset, int qlength)
19591 respectively
19592
19593 Also the constructors which were taking KeyValues also removed
19594 Result(KeyValue [] cells)
19595 Result(List\<KeyValue\> kvs)
19596
19597
19598 ---
19599
19600 * [HBASE-12048](https://issues.apache.org/jira/browse/HBASE-12048) | *Major* | **Remove deprecated APIs from Filter**
19601
19602 The following APIs are removed from Filter
19603 KeyValue transform(KeyValue)
19604 KeyValue getNextKeyHint(KeyValue)
19605 and replaced with
19606 Cell transformCell(Cell)
19607 Cell getNextCellHint(Cell)
19608 respectively.
19609 If a custom Filter implementation have overridden any of these methods, we will no longer call them. User has to change the custom Filter to override cell based methods as shown above
19610
19611
19612 ---
19613
19614 * [HBASE-7767](https://issues.apache.org/jira/browse/HBASE-7767) | *Major* | **Get rid of ZKTable, and table enable/disable state in ZK**
19615
19616 Keeps table enabled/disabled state in HDFS rather than up in ZooKeeper.  Auto-migrates any existing zk state.
19617
19618
19619 ---
19620
19621 * [HBASE-11911](https://issues.apache.org/jira/browse/HBASE-11911) | *Major* | **Break up tests into more fine grained categories**
19622
19623 Adds new test categories besides the class smalltests, mediumtests, and largetests.  Adds:
19624
19625 ClientTests
19626 CoprocessorTests
19627 FilterTests
19628 FlakeyTests
19629 IOTests
19630 MapReduceTests
19631 MasterTests
19632 MiscTests
19633 RegionServerTests
19634 ReplicationTests
19635 RestTests
19636 SecurityTests
19637 VerySlowMapReduceTests
19638 VerySlowRegionServerTests
19639
19640 See description for examples on how to use them.
19641
19642
19643 ---
19644
19645 * [HBASE-11658](https://issues.apache.org/jira/browse/HBASE-11658) | *Major* | **Piped commands to hbase shell should return non-zero if shell command failed.**
19646
19647 Adds a noninteractive mode (-n or --noninteractive) to the hbase shell that exits with a non-zero error code on failed or invalid shell command executions, and exits with a zero error code upon successful execution.
19648
19649
19650 ---
19651
19652 * [HBASE-11640](https://issues.apache.org/jira/browse/HBASE-11640) | *Major* | **Add syntax highlighting support to HBase Ref Guide programlistings**
19653
19654 This got committed, so I guess it is safe to resolve it?
19655
19656
19657 ---
19658
19659 * [HBASE-11606](https://issues.apache.org/jira/browse/HBASE-11606) | *Minor* | **Enable ZK-less region assignment by default**
19660
19661 By default, we don't use ZK for region assignment now. To fall back to the old way, you can set hbase.assignment.usezk to true.
19662
19663
19664 ---
19665
19666 * [HBASE-3135](https://issues.apache.org/jira/browse/HBASE-3135) | *Major* | **Make our MR jobs implement Tool and use ToolRunner so can do -D trickery, etc.**
19667
19668 All MR jobs implement Tool Interface, http://hadoop.apache.org/docs/current/api/org/apache/hadoop/util/Tool.html, so now you can pass properties on command line with the -D flag, etc.
19669
19670
19671 ---
19672
19673 * [HBASE-11556](https://issues.apache.org/jira/browse/HBASE-11556) | *Major* | **Move HTablePool to hbase-thrift module.**
19674
19675 HTablePool was deprecated in 0.98.1 but was still present and usable by apps built against versions before HBase 2.0.  It has been moved and is not intended to be used by user applications, and is now an internal part of the thrift2 proxy server only.
19676
19677
19678 ---
19679
19680 * [HBASE-11548](https://issues.apache.org/jira/browse/HBASE-11548) | *Trivial* | **[PE] Add 'cycling' test N times and unit tests for size/zipf/valueSize calculations**
19681
19682 Adds --cycles=N argument.
19683
19684
19685 ---
19686
19687 * [HBASE-11344](https://issues.apache.org/jira/browse/HBASE-11344) | *Major* | **Hide row keys and such from the web UIs**
19688
19689 Configure "hbase.display.keys" to false (default: true) in the master/regionservers if the row-keys should be hidden in the webUIs (like in the webUI for table details).
19690
19691
19692 ---
19693
19694 * [HBASE-6580](https://issues.apache.org/jira/browse/HBASE-6580) | *Major* | **Deprecate HTablePool in favor of HConnection.getTable(...)**
19695
19696 This issue introduces a few new APIs:
19697 \* HConnectionManager:
19698 {code}
19699     public static HConnection createConnection(Configuration conf)
19700     public static HConnection createConnection(Configuration conf, ExecutorService pool)
19701 {code}
19702 \* HConnection:
19703 {code}
19704     public HTableInterface getTable(String tableName) throws IOException
19705     public HTableInterface getTable(byte[] tableName) throws IOException
19706     public HTableInterface getTable(String tableName, ExecutorService pool) throws IOException
19707     public HTableInterface getTable(byte[] tableName, ExecutorService pool) throws IOException
19708 {code}
19709
19710 By default HConnectionImplementation will create an ExecutorService when needed. The ExecutorService can optionally passed be passed in.
19711 HTableInterfaces are retrieved from the HConnection. By default the HConnection's ExecutorService is used, but optionally that can be overridden for each HTable.
19712
19713
19714 ---
19715
19716 * [HBASE-8450](https://issues.apache.org/jira/browse/HBASE-8450) | *Critical* | **Update hbase-default.xml and general recommendations to better suit current hw, h2, experience, etc.**
19717
19718 Changed defaults:
19719
19720 + max versions now 1 instead of 3
19721 + row blooms on by default (except on .META. table)
19722 + handlers 30 instead of 10
19723 + upped memstore lower limit from .35 to .38
19724 + zookeeper timeout default is 90seconds instead of 180
19725 + client pause is 100ms instead of 1000ms
19726 + retries are now 20 instead of 10 (so overall we still wait same amount of time)
19727 + bulkload retries is 10 instead of infinite
19728 + major compactions are now once a week instead of once every 24 hours; they are staggered so all regionservers do not start compacting at the same time
19729 + blockingstorefiles is 10 instead of 7
19730 + block cache is 0.4 instead of 0.25
19731 + Previous, default for hbase.rootdir was /tmp/hbase-${user.name}.  Now it is ${java.io.tmpdir}/hbase-${user.name} which is usually the same location but may not be (on macos, it points to /var/tmp....).
19732
19733
19734 ---
19735
19736 * [HBASE-4072](https://issues.apache.org/jira/browse/HBASE-4072) | *Major* | **Deprecate/disable and remove support for reading ZooKeeper zoo.cfg files from the classpath**
19737
19738 The Apache ZooKeeper config file zoo.cfg will no longer be read when instantiating a HBaseConfiguration object, as it causes various inconsistency issues. Instead, users have to specify all HBase-relevant ZooKeeper properties in the hbase-site.xml using the various "hbase.zookeeper" prefixed properties. For example, specify "hbase.zookeeper.quorum" to provide a ZK quorum server list.
19739
19740 To enable zoo.cfg reading, for which support may be removed in a future release, set the property "hbase.config.read.zookeeper.config" to true in the hbase-site.xml at the client and servers like so:
19741
19742 \<property\>
19743   \<name\>hbase.config.read.zookeeper.config\</name\>
19744   \<value\>true\</value\>
19745   \<description\>
19746         Set to true to allow HBaseConfiguration to read the
19747         zoo.cfg file for ZooKeeper properties. Switching this to true
19748         is not recommended, since the functionality of reading ZK
19749         properties from a zoo.cfg file has been deprecated.
19750   \</description\>
19751 \</property\>
19752
19753
19754