RELEASENOTES.md

   1 # RELEASENOTES
   2
   3 <!---
   4 # Licensed to the Apache Software Foundation (ASF) under one
   5 # or more contributor license agreements.  See the NOTICE file
   6 # distributed with this work for additional information
   7 # regarding copyright ownership.  The ASF licenses this file
   8 # to you under the Apache License, Version 2.0 (the
   9 # "License"); you may not use this file except in compliance
  10 # with the License.  You may obtain a copy of the License at
  11 #
  12 #     http://www.apache.org/licenses/LICENSE-2.0
  13 #
  14 # Unless required by applicable law or agreed to in writing, software
  15 # distributed under the License is distributed on an "AS IS" BASIS,
  16 # WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
  17 # See the License for the specific language governing permissions and
  18 # limitations under the License.
  19
  20 # Be careful doing manual edits in this file. Do not change format
  21 # of release header or remove the below marker. This file is generated.
  22 # DO NOT REMOVE THIS MARKER; FOR INTERPOLATING CHANGES!-->
  23 # HBASE  2.4.10 Release Notes
  24
  25 These release notes cover new developer and user-facing incompatibilities, important issues, features, and major improvements.
  26
  27
  28 ---
  29
  30 * [HBASE-26742](https://issues.apache.org/jira/browse/HBASE-26742) | *Major* | **Comparator of NOT\_EQUAL NULL is invalid for checkAndMutate**
  31
  32 The semantics of checkAndPut for null(or empty) value comparator is changed, the old match is always true.
  33 But we should consider that  EQUAL or NOT\_EQUAL for null check is a common usage, so the semantics of checkAndPut for matching null is correct now.
  34 There is rare use of LESS or GREATER null, so keep the semantics for them.
  35
  36
  37 ---
  38
  39 * [HBASE-26688](https://issues.apache.org/jira/browse/HBASE-26688) | *Major* | **Threads shared EMPTY\_RESULT may lead to unexpected client job down.**
  40
  41 Result#advance with empty cell list will always return false but not raise NoSuchElementException when called multiple times.
  42 This is a behavior change so it is an 'incompatible change', but since it will not introduce any compile error and the old behavior is 'broken', so we also fix it for current release branches.
  43
  44
  45 ---
  46
  47 * [HBASE-26469](https://issues.apache.org/jira/browse/HBASE-26469) | *Critical* | **correct HBase shell exit behavior to match code passed to exit**
  48
  49 <!-- markdown -->
  50 User input handling has been refactored to make use of IRB sessions directly and the HBase shell attempts to ensure user provided calls to exit are able to convey failure and success.
  51
  52 Those scripting use of the HBase shell should be aware that the exit code may have changed:
  53     * a 0 code, or no code, passed to a call to exit from stdin in non-interactive mode will now exit cleanly. in prior versions this would have exited with an error and non-zero exit code. (note that in HBase 2.4.x this call will still result in a non-zero exit code)
  54     * for other combinations of passing in an initialization script or reading from stdin with using the non-interactive flag, the exit code being 0 or non-0 should now line up with releases prior to 2.4, which is a change in behavior compared to versions 2.4.0 - 2.4.9.
  55
  56 Please see the issue details for a table of expected exit codes.
  57
  58
  59 ---
  60
  61 * [HBASE-26631](https://issues.apache.org/jira/browse/HBASE-26631) | *Major* | **Upgrade junit to 4.13.2**
  62
  63 Upgrade junit to 4.13.2 for addressing CVE-2020-15250.
  64
  65
  66
  67 # HBASE  2.4.9 Release Notes
  68
  69 These release notes cover new developer and user-facing incompatibilities, important issues, features, and major improvements.
  70
  71
  72 ---
  73
  74 * [HBASE-26542](https://issues.apache.org/jira/browse/HBASE-26542) | *Minor* | **Apply a \`package\` to test protobuf files**
  75
  76 The protobuf structures used in test are all now scoped by the package name \`hbase.test.pb\`.
  77
  78
  79 ---
  80
  81 * [HBASE-26512](https://issues.apache.org/jira/browse/HBASE-26512) | *Major* | **Make timestamp format configurable in HBase shell scan output**
  82
  83 HBASE-23930 changed the formatting of the timestamp attribute on each Cell as displayed by the HBase shell to be formatted as an ISO-8601 string rather that milliseconds since the epoch. Some users may have logic which expects the timestamp to be displayed as milliseconds since the epoch. This change introduces the configuration property hbase.shell.timestamp.format.epoch which controls whether the shell will print an ISO-8601 formatted timestamp (the default "false") or milliseconds since the epoch ("true").
  84
  85
  86
  87 # HBASE  2.4.8 Release Notes
  88
  89 These release notes cover new developer and user-facing incompatibilities, important issues, features, and major improvements.
  90
  91
  92 ---
  93
  94 * [HBASE-26362](https://issues.apache.org/jira/browse/HBASE-26362) | *Major* | **Upload mvn site artifacts for nightly build to nightlies**
  95
  96 Now we will upload the site artifacts to nightlies for nightly build as well as pre commit build.
  97
  98
  99 ---
 100
 101 * [HBASE-26329](https://issues.apache.org/jira/browse/HBASE-26329) | *Major* | **Upgrade commons-io to 2.11.0**
 102
 103 Upgraded commons-io to 2.11.0.
 104
 105
 106 ---
 107
 108 * [HBASE-26186](https://issues.apache.org/jira/browse/HBASE-26186) | *Major* | **jenkins script for caching artifacts should verify cached file before relying on it**
 109
 110 Add a '--verify-tar-gz' option to cache-apache-project-artifact.sh for verifying whether the cached file can be parsed as a gzipped tarball.
 111 Use this option in our nightly job to avoid failures on broken cached hadoop tarballs.
 112
 113
 114 ---
 115
 116 * [HBASE-26339](https://issues.apache.org/jira/browse/HBASE-26339) | *Major* | **SshPublisher will skip uploading artifacts if the build is failure**
 117
 118 Now we will mark build as unstable instead of failure when the yetus script returns error. This is used to solve the problem that the SshPublisher jenkins plugin will skip uploading artifacts if the build is marked as failure. In fact, the test output will be more important when there are UT failures.
 119
 120
 121
 122 # HBASE  2.4.7 Release Notes
 123
 124 These release notes cover new developer and user-facing incompatibilities, important issues, features, and major improvements.
 125
 126
 127 ---
 128
 129 * [HBASE-26274](https://issues.apache.org/jira/browse/HBASE-26274) | *Major* | **Create an option to reintroduce BlockCache to mapreduce job**
 130
 131 Introduce \`hfile.onheap.block.cache.fixed.size\` and default to disable. When using ClientSideRegionScanner, it will be enabled with a fixed size for caching INDEX/LEAF\_INDEX block when a client, e.g. snapshot scanner, scans the entire HFile and does not need to seek/reseek to index block multiple times.
 132
 133
 134 ---
 135
 136 * [HBASE-26270](https://issues.apache.org/jira/browse/HBASE-26270) | *Minor* | **Provide getConfiguration method for Region and Store interface**
 137
 138 Provide 'getReadOnlyConfiguration' method for Store and Region interface
 139
 140
 141 ---
 142
 143 * [HBASE-26273](https://issues.apache.org/jira/browse/HBASE-26273) | *Major* | **TableSnapshotInputFormat/TableSnapshotInputFormatImpl should use ReadType.STREAM for scanning HFiles**
 144
 145 HBase's MapReduce API which can operate over HBase snapshots will now default to using ReadType.STREAM instead of ReadType.DEFAULT (which is PREAD) as a result of this change. HBase developers expect that STREAM will perform significantly better for average Snapshot-based batch jobs. Users can restore the previous functionality (using PREAD) by updating their code to explicitly set a value of \`ReadType.PREAD\` on the \`Scan\` object they provide to TableSnapshotInputFormat, or by setting the configuration property "hbase.TableSnapshotInputFormat.scanner.readtype" to "PREAD" in hbase-site.xml.
 146
 147
 148 ---
 149
 150 * [HBASE-26276](https://issues.apache.org/jira/browse/HBASE-26276) | *Major* | **Allow HashTable/SyncTable to perform rawScan when comparing cells**
 151
 152 Added --rawScan option to HashTable job, which allows HashTable/SyncTable to perform raw scans. If this property is omitted, it defaults to false. When used together with --versions set to a high value, SyncTable will fabricate delete markers to all old versions still hanging (not cleaned yet by major compaction), avoiding the inconsistencies reported in HBASE-21596.
 153
 154
 155
 156 # HBASE  2.4.6 Release Notes
 157
 158 These release notes cover new developer and user-facing incompatibilities, important issues, features, and major improvements.
 159
 160
 161 ---
 162
 163 * [HBASE-26204](https://issues.apache.org/jira/browse/HBASE-26204) | *Major* | **VerifyReplication should obtain token for peerQuorumAddress too**
 164
 165 VerifyReplication obtains tokens even if the peer quorum parameter is used. VerifyReplication with peer quorum can be used for secure clusters also.
 166
 167
 168 ---
 169
 170 * [HBASE-24652](https://issues.apache.org/jira/browse/HBASE-24652) | *Minor* | **master-status UI make date type fields sortable**
 171
 172 Makes RegionServer 'Start time' sortable in the Master UI
 173
 174
 175 ---
 176
 177 * [HBASE-26200](https://issues.apache.org/jira/browse/HBASE-26200) | *Major* | **Undo 'HBASE-25165 Change 'State time' in UI so sorts (#2508)' in favor of HBASE-24652**
 178
 179 Undid showing RegionServer 'Start time' in ISO-8601 format. Revert.
 180
 181
 182 ---
 183
 184 * [HBASE-6908](https://issues.apache.org/jira/browse/HBASE-6908) | *Major* | **Pluggable Call BlockingQueue for HBaseServer**
 185
 186 Can pass in a FQCN to load as the call queue implementation.
 187
 188 Standardized arguments to the constructor are the max queue length, the PriorityFunction, and the Configuration.
 189
 190 PluggableBlockingQueue abstract class provided to help guide the correct constructor signature.
 191
 192 Hard fails with PluggableRpcQueueNotFound if the class fails to load as a BlockingQueue\<CallRunner\>
 193
 194 Upstreaming on behalf of Hubspot, we are interested in defining our own custom RPC queue and don't want to get involved in necessarily upstreaming internal requirements/iterations.
 195
 196
 197 ---
 198
 199 * [HBASE-26196](https://issues.apache.org/jira/browse/HBASE-26196) | *Major* | **Support configuration override for remote cluster of HFileOutputFormat locality sensitive**
 200
 201 Allow any configuration for the remote cluster in HFileOutputFormat2 that could be useful the different configuration from the job's configuration is necessary to connect the remote cluster, for instance, non-secure vs secure.
 202
 203
 204 ---
 205
 206 * [HBASE-26160](https://issues.apache.org/jira/browse/HBASE-26160) | *Minor* | **Configurable disallowlist for live editing of loglevels**
 207
 208 Adds a new hbase.ui.logLevels.readonly.loggers config which takes a comma-separated list of logger names. Similar to log4j configurations, the logger names can be prefixes or a full logger name. The log level of read only loggers cannot be changed via the logLevel UI or setlevel CLI. This is useful for securing sensitive loggers, such as the SecurityLogger used for audit logs.
 209
 210
 211 ---
 212
 213 * [HBASE-26154](https://issues.apache.org/jira/browse/HBASE-26154) | *Minor* | **Provide exception metric for quota exceeded and throttling**
 214
 215 Adds "exceptions.quotaExceeded" and "exceptions.rpcThrottling" to HBase server and Thrift server metrics.
 216
 217
 218 ---
 219
 220 * [HBASE-26146](https://issues.apache.org/jira/browse/HBASE-26146) | *Minor* | **Allow custom opts for hbck in hbase bin**
 221
 222 Adds HBASE\_HBCK\_OPTS environment variable to bin/hbase for passing extra options to hbck/hbck2. Defaults to HBASE\_SERVER\_JAAS\_OPTS if specified, or HBASE\_REGIONSERVER\_OPTS.
 223
 224
 225
 226 # HBASE  2.4.5 Release Notes
 227
 228 These release notes cover new developer and user-facing incompatibilities, important issues, features, and major improvements.
 229
 230
 231 ---
 232
 233 * [HBASE-26088](https://issues.apache.org/jira/browse/HBASE-26088) | *Critical* | **conn.getBufferedMutator(tableName) leaks thread executors and other problems**
 234
 235 The API doc for Connection#getBufferedMutator(TableName) and Connection#getBufferedMutator(BufferedMutatorParams) mentioned that when user dont pass a ThreadPool to be used, we use the ThreadPool in the Connection.  But in reality, we were creating new ThreadPool in such cases.
 236
 237 We are keeping the behaviour of code as is but corrected the Javadoc and also a bug of not closing this new pool while Closing the BufferedMutator.
 238
 239
 240 ---
 241
 242 * [HBASE-25986](https://issues.apache.org/jira/browse/HBASE-25986) | *Minor* | **Expose the NORMALIZARION\_ENABLED table descriptor through a property in hbase-site**
 243
 244 New config: hbase.table.normalization.enabled
 245
 246 Default value: false
 247
 248 Description: This config is used to set default behaviour of normalizer at table level. To override this at table level one can set NORMALIZATION\_ENABLED at table descriptor level and that property will be honored. Of course, this property at table level can only work if normalizer is enabled at cluster level using "normalizer\_switch true" command.
 249
 250
 251 ---
 252
 253 * [HBASE-22923](https://issues.apache.org/jira/browse/HBASE-22923) | *Major* | **hbase:meta is assigned to localhost when we downgrade the hbase version**
 254
 255 Introduced new config: hbase.min.version.move.system.tables
 256
 257 When the operator uses this configuration option, any version between
 258 the current cluster version and the value of "hbase.min.version.move.system.tables"
 259 does not trigger any auto-region movement. Auto-region movement here
 260 refers to auto-migration of system table regions to newer server versions.
 261 It is assumed that the configured range of versions does not require special
 262 handling of moving system table regions to higher versioned RegionServer.
 263 This auto-migration is done by AssignmentManager#checkIfShouldMoveSystemRegionAsync().
 264 Example: Let's assume the cluster is on version 1.4.0 and we have
 265 set "hbase.min.version.move.system.tables" as "2.0.0". Now if we upgrade
 266 one RegionServer on 1.4.0 cluster to 1.6.0 (\< 2.0.0), then AssignmentManager will
 267 not move hbase:meta, hbase:namespace and other system table regions
 268 to newly brought up RegionServer 1.6.0 as part of auto-migration.
 269 However, if we upgrade one RegionServer on 1.4.0 cluster to 2.2.0 (\> 2.0.0),
 270 then AssignmentManager will move all system table regions to newly brought
 271 up RegionServer 2.2.0 as part of auto-migration done by
 272 AssignmentManager#checkIfShouldMoveSystemRegionAsync().
 273
 274 Overall, assuming we have system RSGroup where we keep HBase system tables, if we use
 275 config "hbase.min.version.move.system.tables" with value x.y.z then while upgrading cluster to
 276 version greater than or equal to x.y.z, the first RegionServer that we upgrade must
 277 belong to system RSGroup only.
 278
 279
 280 ---
 281
 282 * [HBASE-25902](https://issues.apache.org/jira/browse/HBASE-25902) | *Critical* | **Add missing CFs in meta during HBase 1 to 2.3+ Upgrade**
 283
 284 While upgrading cluster from 1.x to 2.3+ versions, after the active master is done setting it's status as 'Initialized', it attempts to add 'table' and 'repl\_barrier' CFs in meta. Once CFs are added successfully, master is aborted with PleaseRestartMasterException because master has missed certain initialization events (e.g ClusterSchemaService is not initialized and tableStateManager fails to migrate table states from ZK to meta due to missing CFs). Subsequent active master initialization is expected to be smooth.
 285 In the presence of multi masters, when one of them becomes active for the first time after upgrading to HBase 2.3+, it is aborted after fixing CFs in meta and one of the other backup masters will take over and become active soon. Hence, overall this is expected to be smooth upgrade if we have backup masters configured. If not, operator is expected to restart same master again manually.
 286
 287
 288 ---
 289
 290 * [HBASE-25877](https://issues.apache.org/jira/browse/HBASE-25877) | *Major* | **Add access  check for compactionSwitch**
 291
 292 Now calling RSRpcService.compactionSwitch, i.e, Admin.compactionSwitch at client side, requires ADMIN permission.
 293 This is an incompatible change but it is also a bug, as we should not allow any users to disable compaction on a regionserver, so we apply this to all active branches.
 294
 295
 296 ---
 297
 298 * [HBASE-25984](https://issues.apache.org/jira/browse/HBASE-25984) | *Critical* | **FSHLog WAL lockup with sync future reuse [RS deadlock]**
 299
 300 Fixes a WAL lockup issue due to premature reuse of the sync futures by the WAL consumers. The lockup causes the WAL system to hang resulting in blocked appends and syncs thus holding up the RPC handlers from progressing. Only workaround without this fix is to force abort the region server.
 301
 302
 303 ---
 304
 305 * [HBASE-25993](https://issues.apache.org/jira/browse/HBASE-25993) | *Major* | **Make excluded SSL cipher suites configurable for all Web UIs**
 306
 307 Add "ssl.server.exclude.cipher.list" configuration to excluded cipher suites for the http server started by the InfoServer.
 308
 309
 310 ---
 311
 312 * [HBASE-25969](https://issues.apache.org/jira/browse/HBASE-25969) | *Major* | **Cleanup netty-all transitive includes**
 313
 314 We have an (old) netty-all in our produced artifacts. It is transitively included from hadoop. It is needed by MiniMRCluster referenced from a few MR tests in hbase. This commit adds netty-all excludes everywhere else but where tests will fail unless the transitive is allowed through. TODO: move MR and/or MR tests out of hbase core.
 315
 316
 317
 318 # HBASE  2.4.4 Release Notes
 319
 320 These release notes cover new developer and user-facing incompatibilities, important issues, features, and major improvements.
 321
 322
 323 ---
 324
 325 * [HBASE-25963](https://issues.apache.org/jira/browse/HBASE-25963) | *Major* | **HBaseCluster should be marked as IA.Public**
 326
 327 Change HBaseCluster to IA.Public as its sub class MiniHBaseCluster is IA.Public.
 328
 329
 330
 331 # HBASE  2.4.3 Release Notes
 332
 333 These release notes cover new developer and user-facing incompatibilities, important issues, features, and major improvements.
 334
 335
 336 ---
 337
 338 * [HBASE-25766](https://issues.apache.org/jira/browse/HBASE-25766) | *Major* | **Introduce RegionSplitRestriction that restricts the pattern of the split point**
 339
 340 After HBASE-25766, we can specify a split restriction, "KeyPrefix" or "DelimitedKeyPrefix", to a table with the "hbase.regionserver.region.split\_restriction.type" property. The "KeyPrefix" split restriction groups rows by a prefix of the row-key. And the "DelimitedKeyPrefix" split restriction groups rows by a prefix of the row-key with a delimiter.
 341
 342 For example:
 343 \`\`\`
 344 # Create a table with a "KeyPrefix" split restriction, where the prefix length is 2 bytes
 345 hbase\> create 'tbl1', 'fam', {CONFIGURATION =\> {'hbase.regionserver.region.split\_restriction.type' =\> 'KeyPrefix', 'hbase.regionserver.region.split\_restriction.prefix\_length' =\> '2'}}
 346
 347 # Create a table with a "DelimitedKeyPrefix" split restriction, where the delimiter is a comma (,)
 348 hbase\> create 'tbl2', 'fam', {CONFIGURATION =\> {'hbase.regionserver.region.split\_restriction.type' =\> 'DelimitedKeyPrefix', 'hbase.regionserver.region.split\_restriction.delimiter' =\> ','}}
 349 \`\`\`
 350
 351 Instead of specifying a split restriction to a table directly, we can also set the properties in hbase-site.xml. In this case, the specified split restriction is applied for all the tables.
 352
 353 Note that the split restriction is also applied to a user-specified split point so that we don't allow users to break the restriction, which is different behavior from the existing KeyPrefixRegionSplitPolicy and DelimitedKeyPrefixRegionSplitPolicy.
 354
 355
 356 ---
 357
 358 * [HBASE-25775](https://issues.apache.org/jira/browse/HBASE-25775) | *Major* | **Use a special balancer to deal with maintenance mode**
 359
 360 Introduced a MaintenanceLoadBalancer to be used only under maintenance mode. Typically you should not use it as your balancer implementation.
 361
 362
 363 ---
 364
 365 * [HBASE-25767](https://issues.apache.org/jira/browse/HBASE-25767) | *Major* | **CandidateGenerator.getRandomIterationOrder is too slow on large cluster**
 366
 367 In the actual implementation classes of CandidateGenerator, now we just random select a start point and then iterate sequentially, instead of using the old way, where we will create a big array to hold all the integers in [0, num\_regions\_in\_cluster), shuffle the array, and then iterate on the array.
 368 The new implementation is 'random' enough as every time we just select one candidate. The problem for the old implementation is that, it will create an array every time when we want to get a candidate, if we have tens of thousands regions, we will create an array with tens of thousands length everytime, which causes big GC pressure and slow down the balancer execution.
 369
 370
 371 ---
 372
 373 * [HBASE-25734](https://issues.apache.org/jira/browse/HBASE-25734) | *Minor* | **Backport HBASE-24305 to branch-2.4**
 374
 375 The following method was added to ServerName
 376
 377 - #valueOf(Address, long)
 378
 379
 380 ---
 381
 382 * [HBASE-25199](https://issues.apache.org/jira/browse/HBASE-25199) | *Minor* | **Remove HStore#getStoreHomedir**
 383
 384 Moved the following methods from HStore to HRegionFileSystem
 385
 386 - #getStoreHomedir(Path, RegionInfo, byte[])
 387 - #getStoreHomedir(Path, String, byte[])
 388
 389
 390 ---
 391
 392 * [HBASE-25685](https://issues.apache.org/jira/browse/HBASE-25685) | *Major* | **asyncprofiler2.0 no longer supports svg; wants html**
 393
 394 If asyncprofiler 1.x, all is good. If asyncprofiler 2.x and it is hbase-2.3.x or hbase-2.4.x, add '?output=html' to get flamegraphs from the profiler.
 395
 396 Otherwise, if hbase-2.5+ and asyncprofiler2, all works. If asyncprofiler1 and hbase-2.5+, you may have to add '?output=svg' to the query.
 397
 398
 399 ---
 400
 401 * [HBASE-25518](https://issues.apache.org/jira/browse/HBASE-25518) | *Major* | **Support separate child regions to different region servers**
 402
 403 Config key for enable/disable automatically separate child regions to different region servers in the procedure of split regions. One child will be kept to the server where parent region is on, and the other child will be assigned to a random server.
 404
 405 hbase.master.auto.separate.child.regions.after.split.enabled
 406
 407 Default setting is false/off.
 408
 409
 410 ---
 411
 412 * [HBASE-25374](https://issues.apache.org/jira/browse/HBASE-25374) | *Minor* | **Make REST Client connection and socket time out configurable**
 413
 414 Configuration parameter to set rest client connection timeout
 415
 416 "hbase.rest.client.conn.timeout" Default is 2 \* 1000
 417
 418 "hbase.rest.client.socket.timeout" Default of 30 \* 1000
 419
 420
 421 ---
 422
 423 * [HBASE-25587](https://issues.apache.org/jira/browse/HBASE-25587) | *Major* | **[hbck2] Schedule SCP for all unknown servers**
 424
 425 Adds scheduleSCPsForUnknownServers to Hbck Service.
 426
 427
 428 ---
 429
 430 * [HBASE-25636](https://issues.apache.org/jira/browse/HBASE-25636) | *Minor* | **Expose HBCK report as metrics**
 431
 432 Expose HBCK repost results in metrics, includes: "orphanRegionsOnRS", "orphanRegionsOnFS", "inconsistentRegions", "holes", "overlaps", "unknownServerRegions" and "emptyRegionInfoRegions".
 433
 434
 435 ---
 436
 437 * [HBASE-24305](https://issues.apache.org/jira/browse/HBASE-24305) | *Minor* | **Handle deprecations in ServerName**
 438
 439 The following methods were removed or made private from ServerName (due to HBASE-17624):
 440
 441 - getHostNameMinusDomain(String): Was made private without a replacement.
 442 - parseHostname(String): Use #valueOf(String) instead.
 443 - parsePort(String): Use #valueOf(String) instead.
 444 - parseStartcode(String): Use #valueOf(String) instead.
 445 - getServerName(String, int, long): Was made private. Use #valueOf(String, int, long) instead.
 446 - getServerName(String, long): Use #valueOf(String, long) instead.
 447 - getHostAndPort(): Use #getAddress() instead.
 448 - getServerStartcodeFromServerName(String): Use instance of ServerName to pull out start code)
 449 - getServerNameLessStartCode(String): Use #getAddress() instead.
 450
 451
 452
 453 # HBASE  2.4.2 Release Notes
 454
 455 These release notes cover new developer and user-facing incompatibilities, important issues, features, and major improvements.
 456
 457
 458 ---
 459
 460 * [HBASE-25492](https://issues.apache.org/jira/browse/HBASE-25492) | *Major* | **Create table with rsgroup info in branch-2**
 461
 462 HBASE-25492 added a new interface in TableDescriptor which allows user to define RSGroup name while creating or modifying a table.
 463
 464
 465 ---
 466
 467 * [HBASE-25460](https://issues.apache.org/jira/browse/HBASE-25460) | *Major* | **Expose drainingServers as cluster metric**
 468
 469 Exposed new jmx metrics: "draininigRegionServers" and "numDrainingRegionServers" to provide "comma separated names for regionservers that are put in draining mode" and "num of such regionservers" respectively.
 470
 471
 472 ---
 473
 474 * [HBASE-25615](https://issues.apache.org/jira/browse/HBASE-25615) | *Major* | **Upgrade java version in pre commit docker file**
 475
 476 jdk8u232-b09 -\> jdk8u282-b08
 477 jdk-11.0.6\_10 -\> jdk-11.0.10\_9
 478
 479
 480 ---
 481
 482 * [HBASE-23887](https://issues.apache.org/jira/browse/HBASE-23887) | *Major* | **New L1 cache : AdaptiveLRU**
 483
 484 Introduced new L1 cache: AdaptiveLRU. This is supposed to provide better performance than default LRU cache.
 485 Set config key "hfile.block.cache.policy" to "AdaptiveLRU" in hbase-site in order to start using this new cache.
 486
 487
 488
 489 # HBASE  2.4.1 Release Notes
 490
 491 These release notes cover new developer and user-facing incompatibilities, important issues, features, and major improvements.
 492
 493
 494 ---
 495
 496 * [HBASE-25449](https://issues.apache.org/jira/browse/HBASE-25449) | *Major* | **'dfs.client.read.shortcircuit' should not be set in hbase-default.xml**
 497
 498 The presence of HDFS short-circuit read configuration properties in hbase-default.xml inadvertently causes short-circuit reads to not happen inside of RegionServers, despite short-circuit reads being enabled in hdfs-site.xml.
 499
 500
 501 ---
 502
 503 * [HBASE-25333](https://issues.apache.org/jira/browse/HBASE-25333) | *Major* | **Add maven enforcer rule to ban VisibleForTesting imports**
 504
 505 Ban the imports of guava VisiableForTesting, which means you should not use this annotation in HBase any more.
 506 For IA.Public and IA.LimitedPrivate classes, typically you should not expose any test related fields/methods there, and if you want to hide something, use IA.Private on the specific fields/methods.
 507 For IA.Private classes, if you want to expose something only for tests, use the RestrictedApi annotation from error prone, where it could cause a compilation error if someone break the rule in the future.
 508
 509
 510 ---
 511
 512 * [HBASE-25441](https://issues.apache.org/jira/browse/HBASE-25441) | *Critical* | **add security check for some APIs in RSRpcServices**
 513
 514 RsRpcServices APIs that can be accessed only through Admin rights:
 515 - stopServer
 516 - updateFavoredNodes
 517 - updateConfiguration
 518 - clearRegionBlockCache
 519 - clearSlowLogsResponses
 520
 521
 522 ---
 523
 524 * [HBASE-25432](https://issues.apache.org/jira/browse/HBASE-25432) | *Blocker* | **we should add security checks for setTableStateInMeta and fixMeta**
 525
 526 setTableStateInMeta and fixMeta can be accessed only through Admin rights
 527
 528
 529 ---
 530
 531 * [HBASE-25318](https://issues.apache.org/jira/browse/HBASE-25318) | *Minor* | **Configure where IntegrationTestImportTsv generates HFiles**
 532
 533 Added IntegrationTestImportTsv.generatedHFileFolder configuration property to override the default location in IntegrationTestImportTsv. Useful for running the integration test when HDFS Transparent Encryption is enabled.
 534
 535
 536 ---
 537
 538 * [HBASE-25456](https://issues.apache.org/jira/browse/HBASE-25456) | *Critical* | **setRegionStateInMeta need security check**
 539
 540 setRegionStateInMeta can be accessed only through Admin rights
 541
 542
 543
 544 # HBASE  2.4.0 Release Notes
 545
 546 These release notes cover new developer and user-facing incompatibilities, important issues, features, and major improvements.
 547
 548
 549 ---
 550
 551 * [HBASE-25127](https://issues.apache.org/jira/browse/HBASE-25127) | *Major* | **Enhance PerformanceEvaluation to profile meta replica performance.**
 552
 553 Three new commands are added to PE:
 554
 555 metaWrite, metaRandomRead and cleanMeta.
 556
 557 Usage example:
 558 hbase pe  --rows=100000 metaWrite  1
 559 hbase pe  --nomapreduce --rows=100000 metaRandomRead  32
 560 hbase pe  --rows=100000 cleanMeta 1
 561
 562 metaWrite and cleanMeta should be run with only 1 thread and the same number of rows so all the rows inserted will be cleaned up properly.
 563
 564 metaRandomRead can be run with multiple threads. The rows option should set to within the range of rows inserted by metaWrite
 565
 566
 567 ---
 568
 569 * [HBASE-25237](https://issues.apache.org/jira/browse/HBASE-25237) | *Major* | **'hbase master stop' shuts down the cluster, not the master only**
 570
 571 \`hbase master stop\` should shutdown only master by default.
 572 1. Help added to \`hbase master stop\`:
 573 To stop cluster, use \`stop-hbase.sh\` or \`hbase master stop --shutDownCluster\`
 574
 575 2. Help added to \`stop-hbase.sh\`:
 576 stop-hbase.sh can only be used for shutting down entire cluster. To shut down (HMaster\|HRegionServer) use hbase-daemon.sh stop (master\|regionserver)
 577
 578
 579 ---
 580
 581 * [HBASE-25242](https://issues.apache.org/jira/browse/HBASE-25242) | *Critical* | **Add Increment/Append support to RowMutations**
 582
 583 After HBASE-25242, we can add Increment/Append operations to RowMutations and perform those operations atomically in a single row.
 584 HBASE-25242 includes an API change where the mutateRow() API returns a Result object to get the result of the Increment/Append operations.
 585
 586
 587 ---
 588
 589 * [HBASE-25263](https://issues.apache.org/jira/browse/HBASE-25263) | *Major* | **Change encryption key generation algorithm used in the HBase shell**
 590
 591 Since the backward-compatible change we introduced in HBASE-25263,  we use the more secure PBKDF2WithHmacSHA384  key generation algorithm (instead of PBKDF2WithHmacSHA1) to generate a secret key for HFile / WalFile encryption, when the user is defining a string encryption key in the hbase shell.
 592
 593
 594 ---
 595
 596 * [HBASE-24268](https://issues.apache.org/jira/browse/HBASE-24268) | *Minor* | **REST and Thrift server do not handle the "doAs" parameter case insensitively**
 597
 598 This change allows the REST and Thrift servers to handle the "doAs" parameter case-insensitively, which is deemed as correct per the "specification" provided by the Hadoop community.
 599
 600
 601 ---
 602
 603 * [HBASE-25278](https://issues.apache.org/jira/browse/HBASE-25278) | *Minor* | **Add option to toggle CACHE\_BLOCKS in count.rb**
 604
 605 A new option, CACHE\_BLOCKS, was added to the \`count\` shell command which will force the data for a table to be loaded into the block cache. By default, the \`count\` command will not cache any blocks. This option can serve as a means to for a table's data to be loaded into block cache on demand. See the help message on the count shell command for usage details.
 606
 607
 608 ---
 609
 610 * [HBASE-18070](https://issues.apache.org/jira/browse/HBASE-18070) | *Critical* | **Enable memstore replication for meta replica**
 611
 612 "Async WAL Replication" [1] was added by HBASE-11183 "Timeline Consistent region replicas - Phase 2 design" but only for user-space tables. This feature adds "Async WAL Replication" for the hbase:meta table.  It also adds a client 'LoadBalance' mode that has reads go to replicas first and to the primary only on fail so as to shed read load from the primary to alleviate \*hotspotting\* on the hbase:meta Region.
 613
 614 Configuration is as it was for the user-space 'Async WAL Replication'. See [2] and [3] for details on how to enable.
 615
 616 1. http://hbase.apache.org/book.html#async.wal.replication
 617 2. http://hbase.apache.org/book.html#async.wal.replication.meta
 618 3. http://hbase.apache.org/book.html#\_async\_wal\_replication\_for\_meta\_table\_as\_of\_hbase\_2\_4\_0
 619
 620
 621 ---
 622
 623 * [HBASE-25126](https://issues.apache.org/jira/browse/HBASE-25126) | *Major* | **Add load balance logic in hbase-client to distribute read load over meta replica regions.**
 624
 625 See parent issue, HBASE-18070, release notes for how to enable.
 626
 627
 628 ---
 629
 630 * [HBASE-25026](https://issues.apache.org/jira/browse/HBASE-25026) | *Minor* | **Create a metric to track full region scans RPCs**
 631
 632 Adds a new metric where we collect the number of full region scan requests at the RPC layer. This will be collected under "name" : "Hadoop:service=HBase,name=RegionServer,sub=Server"
 633
 634
 635 ---
 636
 637 * [HBASE-25253](https://issues.apache.org/jira/browse/HBASE-25253) | *Major* | **Deprecated master carrys regions related methods and configs**
 638
 639 Since 2.4.0, deprecated all master carrys regions related methods(LoadBalancer,BaseLoadBalancer,ZNodeClearer) and configs(hbase.balancer.tablesOnMaster, hbase.balancer.tablesOnMaster.systemTablesOnly), they will be removed in 3.0.0.
 640
 641
 642 ---
 643
 644 * [HBASE-20598](https://issues.apache.org/jira/browse/HBASE-20598) | *Major* | **Upgrade to JRuby 9.2**
 645
 646 <!-- markdown -->
 647 The HBase shell now relies on JRuby 9.2. This is a new major version change for JRuby. The most significant change is Ruby compatibility changed from Ruby 2.3 to Ruby 2.5. For more detailed changes please see [the JRuby release announcement for the start of the 9.2 series](https://www.jruby.org/2018/05/24/jruby-9-2-0-0.html) as well as the [general release announcement page for updates since that version](https://www.jruby.org/news).
 648
 649 The runtime dependency versions present on the server side classpath for the Joni (now 2.1.31) and JCodings (now 1.0.55) libraries have also been updated to match those found in the JRuby version shipped with HBase. These version changes are maintenance releases and should be backwards compatible when updated in tandem.
 650
 651
 652 ---
 653
 654 * [HBASE-25181](https://issues.apache.org/jira/browse/HBASE-25181) | *Major* | **Add options for disabling column family encryption and choosing hash algorithm for wrapped encryption keys.**
 655
 656 <!-- markdown -->
 657 This change adds options for disabling column family encryption and choosing hash algorithm for wrapped encryption keys. Changes are done such that defaults will keep the same behavior prior to this issue.
 658
 659 Prior to this change HBase always used the MD5 hash algorithm to store a hash for encryption keys. This hash is needed to verify the secret key of the subject. (e.g. making sure that the same secrey key is used during encrypted HFile read and write). The MD5 algorithm is considered weak, and can not be used in some (e.g. FIPS compliant) clusters. Having a configurable hash enables us to use newer and more secure hash algorithms like SHA-384 or SHA-512 (which are FIPS compliant).
 660
 661 The hash is set via the configuration option `hbase.crypto.key.hash.algorithm`. It should be set to a JDK `MessageDigest` algorithm like "MD5", "SHA-256" or "SHA-384". The default is "MD5" for backward compatibility.
 662
 663 Alternatively, clusters which rely on an encryption at rest mechanism outside of HBase (e.g. those offered by HDFS) and wish to ensure HBase's encryption at rest system is inactive can set `hbase.crypto.enabled` to `false`.
 664
 665
 666 ---
 667
 668 * [HBASE-25238](https://issues.apache.org/jira/browse/HBASE-25238) | *Critical* | **Upgrading HBase from 2.2.0 to 2.3.x fails because of “Message missing required fields: state”**
 669
 670 Fixes master procedure store migration issues going from 2.0.x to 2.2.x and/or 2.3.x. Also fixes failed heartbeat parse during rolling upgrade from 2.0.x. to 2.3.x.
 671
 672
 673 ---
 674
 675 * [HBASE-25234](https://issues.apache.org/jira/browse/HBASE-25234) | *Major* | **[Upgrade]Incompatibility in reading RS report from 2.1 RS when Master is upgraded to a version containing HBASE-21406**
 676
 677 Fixes so auto-migration of master procedure store works again going from 2.0.x =\> 2.2+. Also make it so heartbeats work when rolling upgrading from 2.0.x =\> 2.3+.
 678
 679
 680 ---
 681
 682 * [HBASE-25212](https://issues.apache.org/jira/browse/HBASE-25212) | *Major* | **Optionally abort requests in progress after deciding a region should close**
 683
 684 If hbase.regionserver.close.wait.abort is set to true, interrupt RPC handler threads holding the region close lock.
 685
 686 Until requests in progress can be aborted, wait on the region close lock for a configurable interval (specified by hbase.regionserver.close.wait.time.ms, default 60000 (1 minute)). If we have failed to acquire the close lock after this interval elapses, if allowed (also specified by hbase.regionserver.close.wait.abort), abort the regionserver.
 687
 688 We will attempt to interrupt any running handlers every hbase.regionserver.close.wait.interval.ms (default 10000 (10 seconds)) until either the close lock is acquired or we reach the maximum wait time.
 689
 690
 691 ---
 692
 693 * [HBASE-25167](https://issues.apache.org/jira/browse/HBASE-25167) | *Major* | **Normalizer support for hot config reloading**
 694
 695 <!-- markdown -->
 696 This patch adds [dynamic configuration](https://hbase.apache.org/book.html#dyn_config) support for the following configuration keys related to the normalizer:
 697 * hbase.normalizer.throughput.max_bytes_per_sec
 698 * hbase.normalizer.split.enabled
 699 * hbase.normalizer.merge.enabled
 700 * hbase.normalizer.min.region.count
 701 * hbase.normalizer.merge.min_region_age.days
 702 * hbase.normalizer.merge.min_region_size.mb
 703
 704
 705 ---
 706
 707 * [HBASE-25224](https://issues.apache.org/jira/browse/HBASE-25224) | *Major* | **Maximize sleep for checking meta and namespace regions availability**
 708
 709 Changed the max sleep time during meta and namespace regions availability check to be 60 sec. Previously there was no such cap
 710
 711
 712 ---
 713
 714 * [HBASE-24628](https://issues.apache.org/jira/browse/HBASE-24628) | *Major* | **Region normalizer now respects a rate limit**
 715
 716 <!-- markdown -->
 717 Introduces a new configuration, `hbase.normalizer.throughput.max_bytes_per_sec`, for specifying a limit on the throughput of actions executed by the normalizer. Note that while this configuration value is in bytes, the minimum honored valued is `1,000,000`, or `1m`. Supports values configured using the human-readable suffixes honored by [`Configuration.getLongBytes`](https://hadoop.apache.org/docs/current/api/org/apache/hadoop/conf/Configuration.html#getLongBytes-java.lang.String-long-)
 718
 719
 720 ---
 721
 722 * [HBASE-14067](https://issues.apache.org/jira/browse/HBASE-14067) | *Major* | **bundle ruby files for hbase shell into a jar.**
 723
 724 <!-- markdown -->
 725 The `hbase-shell` artifact now contains the ruby files that implement the hbase shell. There should be no downstream impact for users of the shell that rely on the `hbase shell` command.
 726
 727 Folks that wish to include the HBase ruby classes defined for the shell in their own JRuby scripts should add the `hbase-shell.jar` file to their classpath rather than add `${HBASE_HOME}/lib/ruby` to their load paths.
 728
 729
 730 ---
 731
 732 * [HBASE-24875](https://issues.apache.org/jira/browse/HBASE-24875) | *Major* | **Remove the force param for unassign since it dose not take effect any more**
 733
 734 <!-- markdown -->
 735 The "force" flag to various unassign commands (java api, shell, etc) has been ignored since HBase 2. As of this change the methods that take it are now deprecated. Downstream users should stop passing/using this flag.
 736
 737 The Admin and AsyncAdmin Java APIs will have the deprecated version of the unassign method with a force flag removed in HBase 4. Callers can safely continue to use the deprecated API until then; the internal implementation just calls the new method.
 738
 739 The MasterObserver coprocessor API deprecates the `preUnassign` and `postUnassign` methods that include the force parameter and replaces them with versions that omit this parameter. The deprecated methods will be removed from the API in HBase 3. Until then downstream coprocessor implementations can safely continue to *just* implement the deprecated method if they wish; the replacement methods provide a default implementation that calls the deprecated method with force set to `false`.
 740
 741
 742 ---
 743
 744 * [HBASE-25099](https://issues.apache.org/jira/browse/HBASE-25099) | *Major* | **Change meta replica count by altering meta table descriptor**
 745
 746 Now you can change the region replication config for meta table by altering meta table.
 747 The old "hbase.meta.replica.count" is deprecated and will be removed in 4.0.0. But if it is set, we will still honor it, which means, when master restart, if we find out that the value of 'hbase.meta.replica.count' is different with the region replication config of meta table, we will schedule an alter table operation to change the region replication config to the value you configured for 'hbase.meta.replica.count'.
 748
 749
 750 ---
 751
 752 * [HBASE-23834](https://issues.apache.org/jira/browse/HBASE-23834) | *Major* | **HBase fails to run on Hadoop 3.3.0/3.2.2/3.1.4 due to jetty version mismatch**
 753
 754 Use shaded json and jersey in HBase.
 755 Ban the imports of unshaded json and jersey in code.
 756
 757
 758 ---
 759
 760 * [HBASE-25163](https://issues.apache.org/jira/browse/HBASE-25163) | *Major* | **Increase the timeout value for nightly jobs**
 761
 762 Increase timeout value for nightly jobs to 16 hours since the new build machines are dedicated to hbase project, so we are allowed to use it all the time.
 763
 764
 765 ---
 766
 767 * [HBASE-22976](https://issues.apache.org/jira/browse/HBASE-22976) | *Major* | **[HBCK2] Add RecoveredEditsPlayer**
 768
 769 WALPlayer can replay the content of recovered.edits directories.
 770
 771 Side-effect is that WAL filename timestamp is now factored when setting start/end times for WALInputFormat; i.e. wal.start.time and wal.end.time values on a job context. Previous we looked at wal.end.time only. Now we consider wal.start.time too. If a file has a name outside of wal.start.time\<-\>wal.end.time, it'll be by-passed. This change-in-behavior will make it easier on operator crafting timestamp filters processing WALs.
 772
 773
 774 ---
 775
 776 * [HBASE-25165](https://issues.apache.org/jira/browse/HBASE-25165) | *Minor* | **Change 'State time' in UI so sorts**
 777
 778 Start time on the Master UI is now displayed using ISO8601 format instead of java Date#toString().
 779
 780
 781 ---
 782
 783 * [HBASE-25124](https://issues.apache.org/jira/browse/HBASE-25124) | *Major* | **Support changing region replica count without disabling table**
 784
 785 Now you do not need to disable a table before changing its 'region replication' property.
 786 If you are decreasing the replica count, the excess region replicas will be closed before reopening other replicas.
 787 If you are increasing the replica count, the new region replicas will be opened after reopening the existing replicas.
 788
 789
 790 ---
 791
 792 * [HBASE-25154](https://issues.apache.org/jira/browse/HBASE-25154) | *Major* | **Set java.io.tmpdir to project build directory to avoid writing std\*deferred files to /tmp**
 793
 794 Change the java.io.tmpdir to project.build.directory in surefire-maven-plugin, to avoid writing std\*deferred files to /tmp which may blow up the /tmp disk on our jenkins build node.
 795
 796
 797 ---
 798
 799 * [HBASE-25055](https://issues.apache.org/jira/browse/HBASE-25055) | *Major* | **Add ReplicationSource for meta WALs; add enable/disable when hbase:meta assigned to RS**
 800
 801 Set hbase.region.replica.replication.catalog.enabled to enable async WAL Replication for hbase:meta region replicas. Its off by default.
 802
 803 Defaults to the RegionReadReplicaEndpoint.class shipping edits -- set hbase.region.replica.catalog.replication to target a different endpoint implementation.
 804
 805
 806 ---
 807
 808 * [HBASE-25109](https://issues.apache.org/jira/browse/HBASE-25109) | *Major* | **Add MR Counters to WALPlayer; currently hard to tell if it is doing anything**
 809
 810 Adds a WALPlayer to MR Counter output:
 811
 812         org.apache.hadoop.hbase.mapreduce.WALPlayer$Counter
 813                 CELLS\_READ=89574
 814                 CELLS\_WRITTEN=89572
 815                 DELETES=64
 816                 PUTS=5305
 817                 WALEDITS=4375
 818
 819
 820 ---
 821
 822 * [HBASE-24896](https://issues.apache.org/jira/browse/HBASE-24896) | *Major* | **'Stuck' in static initialization creating RegionInfo instance**
 823
 824 1. Untangle RegionInfo, RegionInfoBuilder, and MutableRegionInfo static
 825 initializations.
 826 2. Undo static initializing references from RegionInfo to RegionInfoBuilder.
 827 3. Mark RegionInfo#UNDEFINED IA.Private and deprecated;
 828 it is for internal use only and likely to be removed in HBase4. (sub-task HBASE-24918)
 829 4. Move MutableRegionInfo from inner-class of
 830 RegionInfoBuilder to be (package private) standalone. (sub-task HBASE-24918)
 831
 832
 833 ---
 834
 835 * [HBASE-24956](https://issues.apache.org/jira/browse/HBASE-24956) | *Major* | **ConnectionManager#locateRegionInMeta waits for user region lock indefinitely.**
 836
 837 <!-- markdown -->
 838
 839 Without this fix there are situations in which locateRegionInMeta() on a client is not bound by a timeout. This happens because of a global lock whose acquisition was not under any lock scope. This affects client facing API calls that rely on this method to locate a table region in meta. This fix brings the lock acquisition under the scope of "hbase.client.meta.operation.timeout" and that guarantees a bounded wait time.
 840
 841
 842 ---
 843
 844 * [HBASE-24764](https://issues.apache.org/jira/browse/HBASE-24764) | *Minor* | **Add support of adding base peer configs via hbase-site.xml for all replication peers.**
 845
 846 <!-- markdown -->
 847
 848 Adds a new configuration parameter "hbase.replication.peer.base.config" which accepts a semi-colon separated key=CSV pairs (example: k1=v1;k2=v2_1,v3...). When this configuration is set on the server side, these kv pairs are added to every peer configuration if not already set. Peer specific configuration overrides have precedence over the above default configuration. This is useful in cases when some configuration has to be set for all the peers by default and one does not want to add to every peer definition.
 849
 850
 851 ---
 852
 853 * [HBASE-24994](https://issues.apache.org/jira/browse/HBASE-24994) | *Minor* | **Add hedgedReadOpsInCurThread metric**
 854
 855 Expose Hadoop hedgedReadOpsInCurThread metric to HBase.
 856 This metric counts the number of times the hedged reads service executor rejected a read task, falling back to the current thread.
 857 This will help determine the proper size of the thread pool (dfs.client.hedged.read.threadpool.size).
 858
 859
 860 ---
 861
 862 * [HBASE-24776](https://issues.apache.org/jira/browse/HBASE-24776) | *Major* | **[hbtop] Support Batch mode**
 863
 864 HBASE-24776 added the following command line parameters to hbtop:
 865 \| Argument \| Description \|
 866 \|---\|---\|
 867 \| -n,--numberOfIterations \<arg\> \| The number of iterations \|
 868 \| -O,--outputFieldNames \| Print each of the available field names on a separate line, then quit \|
 869 \| -f,--fields \<arg\> \| Show only the given fields. Specify comma separated fields to show multiple fields \|
 870 \| -s,--sortField \<arg\> \| The initial sort field. You can prepend a \`+' or \`-' to the field name to also override the sort direction. A leading \`+' will force sorting high to low, whereas a \`-' will ensure a low to high ordering \|
 871 \| -i,--filters \<arg\> \| The initial filters. Specify comma separated filters to set multiple filters \|
 872 \| -b,--batchMode \| Starts hbtop in Batch mode, which could be useful for sending output from hbtop to other programs or to a file. In this mode, hbtop will not accept input and runs until the iterations limit you've set with the \`-n' command-line option or until killed \|
 873
 874
 875 ---
 876
 877 * [HBASE-24602](https://issues.apache.org/jira/browse/HBASE-24602) | *Major* | **Add Increment and Append support to CheckAndMutate**
 878
 879 Summary of the change of HBASE-24602:
 880 - Add \`build(Increment)\` and \`build(Append)\` methods to the \`Builder\` class of the \`CheckAndMutate\` class. After this change, we can perform checkAndIncrement/Append operations as follows:
 881 \`\`\`
 882 // Build a CheckAndMutate object with a Increment object
 883 CheckAndMutate checkAndMutate = CheckAndMutate.newBuilder(row)
 884   .ifEquals(family, qualifier, value)
 885   .build(increment);
 886
 887 // Perform a CheckAndIncrement operation
 888 CheckAndMutateResult checkAndMutateResult = table.checkAndMutate(checkAndMutate);
 889
 890 // Get whether or not the CheckAndIncrement operation is successful
 891 boolean success = checkAndMutateResult.isSuccess();
 892
 893 // Get the result of the increment operation
 894 Result result = checkAndMutateResult.getResult();
 895 \`\`\`
 896 - After this change, \`HRegion.batchMutate()\` is used for increment/append operations.
 897 - As the side effect of the above change, the following coprocessor methods of RegionObserver are called when increment/append operations are performed:
 898   - preBatchMutate()
 899   - postBatchMutate()
 900   - postBatchMutateIndispensably()
 901
 902
 903 ---
 904
 905 * [HBASE-24694](https://issues.apache.org/jira/browse/HBASE-24694) | *Major* | **Support flush a single column family of table**
 906
 907 Adds option for the flush command to flush all stores from the specified column family only, among all regions of the given table (stores from other column families on this table would not get flushed).
 908
 909
 910 ---
 911
 912 * [HBASE-24625](https://issues.apache.org/jira/browse/HBASE-24625) | *Critical* | **AsyncFSWAL.getLogFileSizeIfBeingWritten does not return the expected synced file length.**
 913
 914 We add a method getSyncedLength in  WALProvider.WriterBase interface for  WALFileLengthProvider used for replication, considering the case if we use  AsyncFSWAL,we write to 3 DNs concurrently,according to the visibility guarantee of HDFS, the data will be available immediately
 915 when arriving at DN since all the DNs will be considered as the last one in pipeline.This means replication may read uncommitted data and replicate it to the remote cluster and cause data inconsistency.The method WriterBase#getLength may return length which just in hdfs client buffer and not successfully synced to HDFS, so we use this method WriterBase#getSyncedLength to return the length successfully synced to HDFS and replication thread could only read writing WAL file limited by this length.
 916 see also HBASE-14004 and this document for more details:
 917 https://docs.google.com/document/d/11AyWtGhItQs6vsLRIx32PwTxmBY3libXwGXI25obVEY/edit#
 918
 919 Before this patch, replication may read uncommitted data and replicate it to the slave cluster and cause data inconsistency between master and slave cluster, we could use FSHLog instead of AsyncFSWAL  to reduce probability of inconsistency without this patch applied.
 920
 921
 922 ---
 923
 924 * [HBASE-24779](https://issues.apache.org/jira/browse/HBASE-24779) | *Minor* | **Improve insight into replication WAL readers hung on checkQuota**
 925
 926 New metrics are exposed, on the global source, for replication which indicate the "WAL entry buffer" that was introduced in HBASE-15995. When this usage reaches the limit, that RegionServer will cease to read more data for the sake of trying to replicate it. This usage (and limit) is local to each RegionServer is shared across all peers being handled by that RegionServer.
 927
 928
 929 ---
 930
 931 * [HBASE-24404](https://issues.apache.org/jira/browse/HBASE-24404) | *Major* | **Support flush a single column family of region**
 932
 933 This adds an extra "flush" command option that allows for specifying an individual family to have its store flushed.
 934
 935 Usage:
 936 flush 'REGIONNAME','FAMILYNAME'
 937 flush 'ENCODED\_REGIONNAME','FAMILYNAME'
 938
 939
 940 ---
 941
 942 * [HBASE-24805](https://issues.apache.org/jira/browse/HBASE-24805) | *Major* | **HBaseTestingUtility.getConnection should be threadsafe**
 943
 944 <!-- markdown -->
 945 Users of `HBaseTestingUtility` can now safely call the `getConnection` method from multiple threads.
 946
 947 As a consequence of refactoring to improve the thread safety of the HBase testing classes, the protected `conf` member of the  `HBaseCommonTestingUtility` class has been marked final. Downstream users who extend from the class hierarchy rooted at this class will need to pass the Configuration instance they want used to their super constructor rather than overwriting the instance variable.
 948
 949
 950 ---
 951
 952 * [HBASE-24767](https://issues.apache.org/jira/browse/HBASE-24767) | *Major* | **Change default to false for HBASE-15519 per-user metrics**
 953
 954 Disables per-user metrics. They were enabled by default for the first time in hbase-2.3.0 but they need some work before they can be on all the time (See HBASE-15519)
 955
 956
 957 ---
 958
 959 * [HBASE-24704](https://issues.apache.org/jira/browse/HBASE-24704) | *Major* | **Make the Table Schema easier to view even there are multiple families**
 960
 961 Improve the layout of column family from vertical to horizontal in table UI.
 962
 963
 964 ---
 965
 966 * [HBASE-11686](https://issues.apache.org/jira/browse/HBASE-11686) | *Minor* | **Shell code should create a binding / irb workspace instead of polluting the root namespace**
 967
 968 In shell, all HBase constants and commands have been moved out of the top-level and into an IRB Workspace. Piped stdin and scripts passed by name to the shell will be evaluated within this workspace. If you absolutely need the top-level definitions, use the new compatibility flag, ie. hbase shell --top-level-defs or hbase shell --top-level-defs script2run.rb.
 969
 970
 971 ---
 972
 973 * [HBASE-24632](https://issues.apache.org/jira/browse/HBASE-24632) | *Major* | **Enable procedure-based log splitting as default in hbase3**
 974
 975 Enables procedure-based distributed WAL splitting as default (HBASE-20610). To use 'classic' zk-coordinated splitting instead, set 'hbase.split.wal.zk.coordinated' to 'true'.
 976
 977
 978 ---
 979
 980 * [HBASE-24698](https://issues.apache.org/jira/browse/HBASE-24698) | *Major* | **Turn OFF Canary WebUI as default**
 981
 982 Flips default for 'HBASE-23994 Add WebUI to Canary' The UI defaulted to on at port 16050. This JIRA changes it so new UI is off by default.
 983
 984 To enable the UI, set property 'hbase.canary.info.port' to the port you want the UI to use.
 985
 986
 987 ---
 988
 989 * [HBASE-24650](https://issues.apache.org/jira/browse/HBASE-24650) | *Major* | **Change the return types of the new checkAndMutate methods introduced in HBASE-8458**
 990
 991 HBASE-24650 introduced CheckAndMutateResult class and changed the return type of checkAndMutate methods to this class in order to support CheckAndMutate with Increment/Append. CheckAndMutateResult class has two fields, one is \*success\* that indicates whether the operation is successful or not, and the other one is \*result\* that's the result of the operation and is used for  CheckAndMutate with Increment/Append.
 992
 993 The new APIs for the Table interface:
 994 \`\`\`
 995 /\*\*
 996  \* checkAndMutate that atomically checks if a row matches the specified condition. If it does,
 997  \* it performs the specified action.
 998  \*
 999  \* @param checkAndMutate The CheckAndMutate object.
1000  \* @return A CheckAndMutateResult object that represents the result for the CheckAndMutate.
1001  \* @throws IOException if a remote or network exception occurs.
1002  \*/
1003 default CheckAndMutateResult checkAndMutate(CheckAndMutate checkAndMutate) throws IOException {
1004   return checkAndMutate(Collections.singletonList(checkAndMutate)).get(0);
1005 }
1006
1007 /\*\*
1008  \* Batch version of checkAndMutate. The specified CheckAndMutates are batched only in the sense
1009  \* that they are sent to a RS in one RPC, but each CheckAndMutate operation is still executed
1010  \* atomically (and thus, each may fail independently of others).
1011  \*
1012  \* @param checkAndMutates The list of CheckAndMutate.
1013  \* @return A list of CheckAndMutateResult objects that represents the result for each
1014  \*   CheckAndMutate.
1015  \* @throws IOException if a remote or network exception occurs.
1016  \*/
1017 default List\<CheckAndMutateResult\> checkAndMutate(List\<CheckAndMutate\> checkAndMutates)
1018   throws IOException {
1019   throw new NotImplementedException("Add an implementation!");
1020 }
1021 {code}
1022
1023 The new APIs for the AsyncTable interface:
1024 {code}
1025 /\*\*
1026  \* checkAndMutate that atomically checks if a row matches the specified condition. If it does,
1027  \* it performs the specified action.
1028  \*
1029  \* @param checkAndMutate The CheckAndMutate object.
1030  \* @return A {@link CompletableFuture}s that represent the result for the CheckAndMutate.
1031  \*/
1032 CompletableFuture\<CheckAndMutateResult\> checkAndMutate(CheckAndMutate checkAndMutate);
1033
1034 /\*\*
1035  \* Batch version of checkAndMutate. The specified CheckAndMutates are batched only in the sense
1036  \* that they are sent to a RS in one RPC, but each CheckAndMutate operation is still executed
1037  \* atomically (and thus, each may fail independently of others).
1038  \*
1039  \* @param checkAndMutates The list of CheckAndMutate.
1040  \* @return A list of {@link CompletableFuture}s that represent the result for each
1041  \*   CheckAndMutate.
1042  \*/
1043 List\<CompletableFuture\<CheckAndMutateResult\>\> checkAndMutate(
1044   List\<CheckAndMutate\> checkAndMutates);
1045
1046 /\*\*
1047  \* A simple version of batch checkAndMutate. It will fail if there are any failures.
1048  \*
1049  \* @param checkAndMutates The list of rows to apply.
1050  \* @return A {@link CompletableFuture} that wrapper the result list.
1051  \*/
1052 default CompletableFuture\<List\<CheckAndMutateResult\>\> checkAndMutateAll(
1053   List\<CheckAndMutate\> checkAndMutates) {
1054   return allOf(checkAndMutate(checkAndMutates));
1055 }
1056 \`\`\`
1057
1058
1059 ---
1060
1061 * [HBASE-24671](https://issues.apache.org/jira/browse/HBASE-24671) | *Major* | **Add excludefile and designatedfile options to graceful\_stop.sh**
1062
1063 Add excludefile and designatedfile options to graceful\_stop.sh.
1064
1065 Designated file with \<hostname:port\> per line as unload targets.
1066
1067 Exclude file should have \<hostname:port\> per line. We do not unload regions to hostnames given in exclude file.
1068
1069 Here is a simple example using graceful\_stop.sh with designatedfile option:
1070 ./bin/graceful\_stop.sh --maxthreads 4 --designatedfile /path/designatedfile hostname
1071 The usage of the excludefile option is the same as the above.
1072
1073
1074 ---
1075
1076 * [HBASE-24560](https://issues.apache.org/jira/browse/HBASE-24560) | *Major* | **Add a new option of designatedfile in RegionMover**
1077
1078 Add a new option "designatedfile" in RegionMover.
1079
1080 If designated file is present with some contents, we will unload regions to hostnames provided in designated file.
1081
1082 Designated file should have 'host:port' per line.
1083
1084
1085 ---
1086
1087 * [HBASE-24289](https://issues.apache.org/jira/browse/HBASE-24289) | *Major* | **Heterogeneous Storage for Date Tiered Compaction**
1088
1089 Enhance DateTieredCompaction to support HDFS storage policy within one class family.
1090 # First you need enable DTCP.
1091 To turn on Date Tiered Compaction (It is not recommended to turn on for the whole cluster because that will put meta table on it too and random get on meta table will be impacted):
1092 hbase.hstore.compaction.compaction.policy=org.apache.hadoop.hbase.regionserver.compactions.DateTieredCompactionPolicy
1093 ## Parameters for Date Tiered Compaction:
1094 hbase.hstore.compaction.date.tiered.max.storefile.age.millis: Files with max-timestamp smaller than this will no longer be compacted.Default at Long.MAX\_VALUE.
1095 hbase.hstore.compaction.date.tiered.base.window.millis: base window size in milliseconds. Default at 6 hours.
1096 hbase.hstore.compaction.date.tiered.windows.per.tier: number of windows per tier. Default at 4.
1097 hbase.hstore.compaction.date.tiered.incoming.window.min: minimal number of files to compact in the incoming window. Set it to expected number of files in the window to avoid wasteful compaction. Default at 6.
1098
1099 # Then enable HDTCP(Heterogeneous Date Tiered Compaction) as follow example configurations:
1100 hbase.hstore.compaction.date.tiered.storage.policy.enable=true
1101 hbase.hstore.compaction.date.tiered.hot.window.age.millis=3600000
1102 hbase.hstore.compaction.date.tiered.hot.window.storage.policy=ALL\_SSD
1103 hbase.hstore.compaction.date.tiered.warm.window.age.millis=20600000
1104 hbase.hstore.compaction.date.tiered.warm.window.storage.policy=ONE\_SSD
1105 hbase.hstore.compaction.date.tiered.cold.window.storage.policy=HOT
1106 ## It is better to enable WAL and flushing HFile storage policy with HDTCP. You can tune follow settings as well:
1107 hbase.wal.storage.policy=ALL\_SSD
1108 create 'table',{NAME=\>'f1',CONFIGURATION=\>{'hbase.hstore.block.storage.policy'=\>'ALL\_SSD'}}
1109
1110 # Disable HDTCP as follow:
1111 hbase.hstore.compaction.date.tiered.storage.policy.enable=false
1112
1113
1114 ---
1115
1116 * [HBASE-24648](https://issues.apache.org/jira/browse/HBASE-24648) | *Major* | **Remove the legacy 'forceSplit' related code at region server side**
1117
1118 Add a canSplit method to RegionSplitPolicy to determine whether we can split a region. Usually it is not related to RegionSplitPolicy so in the default implementation, it will test whether region is available and does not have reference file, but in DisabledRegionSplitPolicy, we will always return false.
1119
1120
1121 ---
1122
1123 * [HBASE-24382](https://issues.apache.org/jira/browse/HBASE-24382) | *Major* | **Flush partial stores of region filtered by seqId when archive wal due to too many wals**
1124
1125 Change the flush level from region to store when there are too many wals, benefit from this we can reduce unnessary flush tasks and small hfiles.
1126
1127
1128 ---
1129
1130 * [HBASE-24038](https://issues.apache.org/jira/browse/HBASE-24038) | *Major* | **Add a metric to show the locality of ssd in table.jsp**
1131
1132 Add a metric to show the locality of ssd in table.jsp, and move the locality related metrics to a new tab named localities.
1133
1134
1135 ---
1136
1137 * [HBASE-8458](https://issues.apache.org/jira/browse/HBASE-8458) | *Major* | **Support for batch version of checkAndMutate()**
1138
1139 HBASE-8458 introduced CheckAndMutate class that's used to perform CheckAndMutate operations. Use the builder class to instantiate a CheckAndMutate object. This builder class is fluent style APIs, the code are like:
1140 \`\`\`
1141 // A CheckAndMutate operation where do the specified action if the column (specified by the
1142 family and the qualifier) of the row equals to the specified value
1143 CheckAndMutate checkAndMutate = CheckAndMutate.newBuilder(row)
1144   .ifEquals(family, qualifier, value)
1145   .build(put);
1146
1147 // A CheckAndMutate operation where do the specified action if the column (specified by the
1148 // family and the qualifier) of the row doesn't exist
1149 CheckAndMutate checkAndMutate = CheckAndMutate.newBuilder(row)
1150   .ifNotExists(family, qualifier)
1151   .build(put);
1152
1153 // A CheckAndMutate operation where do the specified action if the row matches the filter
1154 CheckAndMutate checkAndMutate = CheckAndMutate.newBuilder(row)
1155   .ifMatches(filter)
1156   .build(delete);
1157 \`\`\`
1158
1159 And This added new checkAndMutate APIs to the Table and AsyncTable interfaces, and deprecated the old checkAndMutate APIs. The example code for the new APIs are as follows:
1160 \`\`\`
1161 Table table = ...;
1162
1163 CheckAndMutate checkAndMutate = ...;
1164
1165 // Perform the checkAndMutate operation
1166 boolean success = table.checkAndMutate(checkAndMutate);
1167
1168 CheckAndMutate checkAndMutate1 = ...;
1169 CheckAndMutate checkAndMutate2 = ...;
1170
1171 // Batch version
1172 List\<Boolean\> successList = table.checkAndMutate(Arrays.asList(checkAndMutate1, checkAndMutate2));
1173 \`\`\`
1174
1175 This also has Protocol Buffers level changes. Old clients without this patch will work against new servers with this patch. However, new clients will break against old servers without this patch for checkAndMutate with RM and mutateRow. So, for rolling upgrade, we will need to upgrade servers first, and then roll out the new clients.
1176
1177
1178 ---
1179
1180 * [HBASE-24471](https://issues.apache.org/jira/browse/HBASE-24471) | *Major* | **The way we bootstrap meta table is confusing**
1181
1182 Move all the meta initialization code in MasterFileSystem and HRegionServer to InitMetaProcedure. Add a new step for InitMetaProcedure called INIT\_META\_WRITE\_FS\_LAYOUT to place the moved code.
1183
1184 This is an incompatible change, but should not have much impact. InitMetaProcedure will only be executed once when bootstraping a fresh new cluster, so typically this will not effect rolling upgrading. And even if you hit this problem, as long as InitMetaProcedure has not been finished, we can make sure that there is no user data in the cluster, you can just clean up the cluster and try again. There will be no data loss.
1185
1186
1187 ---
1188
1189 * [HBASE-24017](https://issues.apache.org/jira/browse/HBASE-24017) | *Major* | **Turn down flakey rerun rate on all but hot branches**
1190
1191 Changed master, branch-2, and branch-2.1 to twice a day.
1192 Left branch-2.3, branch-2.2, and branch-1 at every 4 hours.
1193 Changed branch-1.4 and branch-1.3 to @daily (1.3 was running every hour).
1194
1195
1196
1197 # HBASE  2.3.0 Release Notes
1198
1199 These release notes cover new developer and user-facing incompatibilities, important issues, features, and major improvements.
1200
1201
1202 ---
1203
1204 * [HBASE-24603](https://issues.apache.org/jira/browse/HBASE-24603) | *Critical* | **Zookeeper sync() call is async**
1205
1206 <!-- markdown -->
1207
1208 Fixes a couple of bugs in ZooKeeper interaction. Firstly, zk sync() call that is used to sync the lagging followers with leader so that the client sees a consistent snapshot state was actually asynchronous under the hood. We make it synchronous for correctness. Second, zookeeper events are now processed in a separate thread rather than doing it in the thread context of zookeeper client connection. This decoupling frees up client connection quickly and avoids deadlocks.
1209
1210
1211 ---
1212
1213 * [HBASE-24631](https://issues.apache.org/jira/browse/HBASE-24631) | *Major* | **Loosen Dockerfile pinned package versions of the "debian-revision"**
1214
1215 <!-- markdown -->
1216 Update our package version numbers throughout the Dockerfiles to be pinned to their epic:upstream-version components only. Previously we'd specify the full debian package version number, including the debian-revision. This lead to instability as debian packaging details changed.
1217 See also [man deb-version](http://manpages.ubuntu.com/manpages/xenial/en/man5/deb-version.5.html)
1218
1219
1220 ---
1221
1222 * [HBASE-24205](https://issues.apache.org/jira/browse/HBASE-24205) | *Major* | **Create metric to know the number of reads that happens from memstore**
1223
1224 Adds a new metric where we collect the number of read requests (tracked per row) whether the row was fetched completely from memstore or it was pulled from files  and memstore.
1225 The metric is now collected under the mbean for Tables and under the mbean for regions.
1226 Under table mbean ie.-
1227 'name": "Hadoop:service=HBase,name=RegionServer,sub=Tables'
1228 The new metrics will be listed as
1229 {code}
1230     "Namespace\_default\_table\_t3\_columnfamily\_f1\_metric\_memstoreOnlyRowReadsCount": 5,
1231  "Namespace\_default\_table\_t3\_columnfamily\_f1\_metric\_mixedRowReadsCount": 1,
1232 {code}
1233 Where the format is Namespace\_\<namespacename\>\_table\_\<tableName\>\_columnfamily\_\<columnfamilyname\>\_metric\_memstoreOnlyRowReadsCount
1234 Namespace\_\<namespacename\>\_table\_\<tableName\>\_columnfamily\_\<columnfamilyname\>\_metric\_mixedRowReadsCount
1235 {code}
1236
1237 The same one under the region ie.
1238 "name": "Hadoop:service=HBase,name=RegionServer,sub=Regions",
1239 comes as
1240 {code}
1241    "Namespace\_default\_table\_t3\_region\_75a7846f4ac4a2805071a855f7d0dbdc\_store\_f1\_metric\_memstoreOnlyRowReadsCount": 5,
1242     "Namespace\_default\_table\_t3\_region\_75a7846f4ac4a2805071a855f7d0dbdc\_store\_f1\_metric\_mixedRowReadsCount": 1,
1243 {code}
1244 where
1245 Namespace\_\<namespacename\_table\_\<tableName\>\_region\_\<regionName\>\_store\_\<storeName\>\_metric\_memstoreOnlyRowReadsCount
1246 Namespace\_\<namespacename\_table\_\<tableName\>\_region\_\<regionName\>\_store\_\<storeName\>\_metric\_mixedRowReadsCount
1247 This is also an aggregate against every store the number of reads that happened purely from the memstore or it was a  mixed read that happened from memstore and file.
1248
1249
1250 ---
1251
1252 * [HBASE-21773](https://issues.apache.org/jira/browse/HBASE-21773) | *Critical* | **rowcounter utility should respond to pleas for help**
1253
1254 This adds [-h\|-help] options to rowcounter. Passing either -h or -help will print rowcounter guide as below:
1255
1256 $hbase rowcounter -h
1257
1258 usage: hbase rowcounter \<tablename\> [options] [\<column1\> \<column2\>...]
1259 Options:
1260     --starttime=\<arg\>       starting time filter to start counting rows from.
1261     --endtime=\<arg\>         end time filter limit, to only count rows up to this timestamp.
1262     --range=\<arg\>           [startKey],[endKey][;[startKey],[endKey]...]]
1263     --expectedCount=\<arg\>   expected number of rows to be count.
1264 For performance, consider the following configuration properties:
1265 -Dhbase.client.scanner.caching=100
1266 -Dmapreduce.map.speculative=false
1267
1268
1269 ---
1270
1271 * [HBASE-24217](https://issues.apache.org/jira/browse/HBASE-24217) | *Major* | **Add hadoop 3.2.x support**
1272
1273 CI coverage has been extended to include Hadoop 3.2.x for HBase 2.2+.
1274
1275
1276 ---
1277
1278 * [HBASE-23055](https://issues.apache.org/jira/browse/HBASE-23055) | *Major* | **Alter hbase:meta**
1279
1280 Adds being able to edit hbase:meta table schema. For example,
1281
1282 hbase(main):006:0\> alter 'hbase:meta', {NAME =\> 'info', DATA\_BLOCK\_ENCODING =\> 'ROW\_INDEX\_V1'}
1283 Updating all regions with the new schema...
1284 All regions updated.
1285 Done.
1286 Took 1.2138 seconds
1287
1288 You can even add columnfamilies. Howevert, you cannot delete any of the core hbase:meta column families such as 'info' and 'table'.
1289
1290
1291 ---
1292
1293 * [HBASE-15161](https://issues.apache.org/jira/browse/HBASE-15161) | *Major* | **Umbrella: Miscellaneous improvements from production usage**
1294
1295 This ticket summarizes significant improvements and expansion to the metrics surface area. Interested users should review the individual sub-tasks.
1296
1297
1298 ---
1299
1300 * [HBASE-24545](https://issues.apache.org/jira/browse/HBASE-24545) | *Major* | **Add backoff to SCP check on WAL split completion**
1301
1302 Adds backoff in ServerCrashProcedure wait on WAL split to complete if large backlog of files to split (Its possible to avoid SCP blocking, waiting on WALs to split if you use procedure-based splitting --  set 'hbase.split.wal.zk.coordinated' to false to enable procedure based wal splitting.)
1303
1304
1305 ---
1306
1307 * [HBASE-24524](https://issues.apache.org/jira/browse/HBASE-24524) | *Minor* | **SyncTable logging improvements**
1308
1309 Notice this has changed log level for mismatching row keys, originally those were being logged at INFO level, now it's logged at DEBUG level. This is consistent with the logging of mismatching cells. Also, for missing row keys, it now logs row key values in human readable format, making it more meaningful for operators troubleshooting mismatches.
1310
1311
1312 ---
1313
1314 * [HBASE-24359](https://issues.apache.org/jira/browse/HBASE-24359) | *Major* | **Optionally ignore edits for deleted CFs for replication.**
1315
1316 Introduce a new config hbase.replication.drop.on.deleted.columnfamily, default is false. When config to true, the replication will drop the edits for columnfamily that has been deleted from the replication source and target.
1317
1318
1319 ---
1320
1321 * [HBASE-24418](https://issues.apache.org/jira/browse/HBASE-24418) | *Major* | **Consolidate Normalizer implementations**
1322
1323 <!-- markdown -->
1324 This change extends the Normalizer with a handful of new configurations. The configuration points supported are:
1325 * `hbase.normalizer.split.enabled` Whether to split a region as part of normalization. Default: `true`.
1326 * `hbase.normalizer.merge.enabled` Whether to merge a region as part of normalization. Default `true`.
1327 * `hbase.normalizer.min.region.count` The minimum number of regions in a table to consider it for merge normalization. Default: 3.
1328 * `hbase.normalizer.merge.min_region_age.days` The minimum age for a region to be considered for a merge, in days. Default: 3.
1329 * `hbase.normalizer.merge.min_region_size.mb` The minimum size for a region to be considered for a merge, in whole MBs. Default: 1.
1330
1331
1332 ---
1333
1334 * [HBASE-24309](https://issues.apache.org/jira/browse/HBASE-24309) | *Major* | **Avoid introducing log4j and slf4j-log4j dependencies for modules other than hbase-assembly**
1335
1336 Add a hbase-logging module, put the log4j related code in this module only so other modules do not need to depend on log4j at compile scope. See the comments of Log4jUtils and InternalLog4jUtils for more details.
1337
1338 Add a log4j.properties to the test jar of hbase-logging module, so for other sub modules we just need to depend on the test jar of hbase-logging module at test scope to output the log to console, without placing a log4j.properties in the test resources as they all (almost) have the same content. And this test module will not be included in the assembly tarball so it will not mess up the binary distribution.
1339
1340 Ban direct commons-logging dependency, and ban commons-logging and log4j imports in non-test code, to avoid mess up the downstream users logging framework. In hbase-logging module we do need to use log4j classes and the trick is to use full class name.
1341
1342 Add jcl-over-slf4j and jul-to-slf4j dependencies, as some of our dependencies use jcl or jul as logging framework, we should also redirect their log message to slf4j.
1343
1344
1345 ---
1346
1347 * [HBASE-21406](https://issues.apache.org/jira/browse/HBASE-21406) | *Minor* | **"status 'replication'" should not show SINK if the cluster does not act as sink**
1348
1349 Added new metric to differentiate sink startup time from last OP applied time.
1350
1351 Original behaviour was to always set startup time to TimestampsOfLastAppliedOp, and always show it on "status 'replication'" command, regardless if the sink ever applied any OP.
1352
1353 This was confusing, specially for scenarios where cluster was just acting as source, the output could lead to wrong interpretations about sink not applying edits or replication being stuck.
1354
1355 With the new metric, we now compare the two metrics values, assuming that if both are the same, there's never been any OP shipped to the given sink, so output would reflect it more clearly, to something as for example:
1356
1357 SINK: TimeStampStarted=Thu Dec 06 23:59:47 GMT 2018, Waiting for OPs...
1358
1359
1360 ---
1361
1362 * [HBASE-24132](https://issues.apache.org/jira/browse/HBASE-24132) | *Major* | **Upgrade to Apache ZooKeeper 3.5.7**
1363
1364 <!-- markdown -->
1365 HBase ships ZooKeeper 3.5.x. Was the EOL'd 3.4.x. 3.5.x client can talk to 3.4.x ensemble.
1366
1367 The ZooKeeper project has built a [FAQ](https://cwiki.apache.org/confluence/display/ZOOKEEPER/Upgrade+FAQ) that documents known issues and work-arounds when upgrading existing deployments.
1368
1369
1370 ---
1371
1372 * [HBASE-22287](https://issues.apache.org/jira/browse/HBASE-22287) | *Major* | **inifinite retries on failed server in RSProcedureDispatcher**
1373
1374 Add backoff. Avoid retrying every 100ms.
1375
1376
1377 ---
1378
1379 * [HBASE-24425](https://issues.apache.org/jira/browse/HBASE-24425) | *Major* | **Run hbck\_chore\_run and catalogjanitor\_run on draw of 'HBCK Report' page**
1380
1381 Runs 'catalogjanitor\_run' and 'hbck\_chore\_run' inline with the loading of the 'HBCK Report' page.
1382
1383 Pass '?cache=true' to skip inline invocation of 'catalogjanitor\_run' and 'hbck\_chore\_run' drawing the page.
1384
1385
1386 ---
1387
1388 * [HBASE-24408](https://issues.apache.org/jira/browse/HBASE-24408) | *Blocker* | **Introduce a general 'local region' to store data on master**
1389
1390 Introduced a general 'local region' at master side to store the procedure data, etc.
1391
1392 The hfile of this region will be stored on the root fs while the wal will be stored on the wal fs. This issue supercedes part of the code for HBASE-23326, as now we store the data in 'MasterData' directory instead of 'MasterProcs'.
1393
1394 The old hfiles will be moved to the global hfile archived directory with the suffix $-masterlocalhfile-$. The wal files will be moved to the global old wal directory with the suffix $masterlocalwal$. The TimeToLiveMasterLocalStoreHFileCleaner and TimeToLiveMasterLocalStoreWALCleaner are configured by default for cleaning the old hfiles and wal files, and the default TTLs are both 7 days.
1395
1396
1397 ---
1398
1399 * [HBASE-24115](https://issues.apache.org/jira/browse/HBASE-24115) | *Major* | **Relocate test-only REST "client" from src/ to test/ and mark Private**
1400
1401 Relocate test-only REST RemoteHTable and RemoteAdmin from src/ to test/. And mark them as InterfaceAudience.Private.
1402
1403
1404 ---
1405
1406 * [HBASE-23938](https://issues.apache.org/jira/browse/HBASE-23938) | *Major* | **Replicate slow/large RPC calls to HDFS**
1407
1408 Config key: hbase.regionserver.slowlog.systable.enabled
1409 Default value: false
1410
1411 This config can be enabled if hbase.regionserver.slowlog.buffer.enabled is already enabled. While hbase.regionserver.slowlog.buffer.enabled ensures that any slow/large RPC logs with complete details are written to ring buffer available at each RegionServer, hbase.regionserver.slowlog.systable.enabled would ensure that all such logs are also persisted in new system table hbase:slowlog.
1412 Operator can scan hbase:slowlog with filters to retrieve specific attribute matching records and this table would be useful to capture historical performance of slowness of RPC calls with detailed analysis.
1413
1414 hbase:slowlog consists of single ColumnFamily info. info consists of multiple qualifiers similar to the attributes available to query as part of Admin API: get\_slowlog\_responses.
1415
1416 One example of a row from hbase:slowlog scan result (Attached a sample screenshot in the Jira) :
1417
1418  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:call\_details, timestamp=2020-05-16T14:59:58.764Z, value=Scan(org.apache.hadoop.hbase.shaded.protobuf.generated.ClientProtos$ScanRequest)
1419  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:client\_address, timestamp=2020-05-16T14:59:58.764Z, value=172.20.10.2:57348
1420  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:method\_name, timestamp=2020-05-16T14:59:58.764Z, value=Scan
1421  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:param, timestamp=2020-05-16T14:59:58.764Z, value=region { type: REGION\_NAME value: "cluster\_test,cccccccc,1589635796466.aa45e1571d533f5ed0bb31cdccaaf9cf." } scan { a
1422                                                              ttribute { name: "\_isolationlevel\_" value: "\\x5C000" } start\_row: "cccccccc" time\_range { from: 0 to: 9223372036854775807 } max\_versions: 1 cache\_blocks: true max\_result\_size: 2
1423                                                              097152 caching: 2147483647 include\_stop\_row: false } number\_of\_rows: 2147483647 close\_scanner: false client\_handles\_partials: true client\_handles\_heartbeats: true track\_scan\_met
1424                                                              rics: false
1425  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:processing\_time, timestamp=2020-05-16T14:59:58.764Z, value=24
1426  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:queue\_time, timestamp=2020-05-16T14:59:58.764Z, value=0
1427  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:region\_name, timestamp=2020-05-16T14:59:58.764Z, value=cluster\_test,cccccccc,1589635796466.aa45e1571d533f5ed0bb31cdccaaf9cf.
1428  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:response\_size, timestamp=2020-05-16T14:59:58.764Z, value=211227
1429  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:server\_class, timestamp=2020-05-16T14:59:58.764Z, value=HRegionServer
1430  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:start\_time, timestamp=2020-05-16T14:59:58.764Z, value=1589640743932
1431  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:type, timestamp=2020-05-16T14:59:58.764Z, value=ALL
1432  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:username, timestamp=2020-05-16T14:59:58.764Z, value=vjasani
1433
1434
1435 ---
1436
1437 * [HBASE-24271](https://issues.apache.org/jira/browse/HBASE-24271) | *Major* | **Set values in \`conf/hbase-site.xml\` that enable running on \`LocalFileSystem\` out of the box**
1438
1439 <!-- markdown -->
1440 HBASE-24271 makes changes the the default `conf/hbase-site.xml` such that `bin/hbase` will run directly out of the binary tarball or a compiled source tree without any configuration modifications vs. Hadoop 2.8+. This changes our long-standing history of shipping no configured values in `conf/hbase-site.xml`, so existing processes that assume this file is empty of configuration properties may require attention.
1441
1442
1443 ---
1444
1445 * [HBASE-24310](https://issues.apache.org/jira/browse/HBASE-24310) | *Major* | **Use Slf4jRequestLog for hbase-http**
1446
1447 Use Slf4jRequestLog instead of the log4j HttpRequestLogAppender in HttpServer.
1448
1449 The request log is disabled by default in conf/log4j.properties by the following lines:
1450
1451 # Disable request log by default, you can enable this by changing the appender
1452 log4j.category.http.requests=INFO,NullAppender
1453 log4j.additivity.http.requests=false
1454
1455 Change the 'NullAppender' to what ever you want if you want to enable request log.
1456
1457 Notice that, the logger name for master status http server is 'http.requests.master', and for region server it is 'http.requests.regionserver'
1458
1459
1460 ---
1461
1462 * [HBASE-24335](https://issues.apache.org/jira/browse/HBASE-24335) | *Major* | **Support deleteall with ts but without column in shell mode**
1463
1464 Use a empty string to represent no column specified for deleteall in shell mode.
1465 useage:
1466 deleteall 'test','r1','',12345
1467 deleteall 'test', {ROWPREFIXFILTER =\> 'prefix'}, '', 12345
1468
1469
1470 ---
1471
1472 * [HBASE-24304](https://issues.apache.org/jira/browse/HBASE-24304) | *Major* | **Separate a hbase-asyncfs module**
1473
1474 Added a new hbase-asyncfs module to hold the asynchronous dfs output stream implementation for implementing WAL.
1475
1476
1477 ---
1478
1479 * [HBASE-22710](https://issues.apache.org/jira/browse/HBASE-22710) | *Major* | **Wrong result in one case of scan that use  raw and versions and filter together**
1480
1481 Make the logic of the versions chosen more reasonable for raw scan, to avoid lose result when using filter.
1482
1483
1484 ---
1485
1486 * [HBASE-24285](https://issues.apache.org/jira/browse/HBASE-24285) | *Major* | **Move to hbase-thirdparty-3.3.0**
1487
1488 Moved to hbase-thirdparty 3.3.0.
1489
1490
1491 ---
1492
1493 * [HBASE-24252](https://issues.apache.org/jira/browse/HBASE-24252) | *Major* | **Implement proxyuser/doAs mechanism for hbase-http**
1494
1495 This feature enables the HBase Web UI's to accept a 'proxyuser' via the HTTP Request's query string. When the parameter \`hbase.security.authentication.spnego.kerberos.proxyuser.enable\` is set to \`true\` in hbase-site.xml (default is \`false\`), the HBase UI will attempt to impersonate the user specified by the query parameter "doAs". This query parameter is checked case-insensitively. When this option is not provided, the user who executed the request is the "real" user and there is no ability to execute impersonation against the WebUI.
1496
1497 For example, if the user "bob" with Kerberos credentials executes a request against the WebUI with this feature enabled and a query string which includes \`doAs=alice\`, the HBase UI will treat this request as executed as \`alice\`, not \`bob\`.
1498
1499 The standard Hadoop proxyuser configuration properties to limit users who may impersonate others apply to this change (e.g. to enable \`bob\` to impersonate \`alice\`). See the Hadoop documentation for more information on how to configure these proxyuser rules.
1500
1501
1502 ---
1503
1504 * [HBASE-24143](https://issues.apache.org/jira/browse/HBASE-24143) | *Major* | **[JDK11] Switch default garbage collector from CMS**
1505
1506 <!-- markdown -->
1507 `bin/hbase` will now dynamically select a Garbage Collector implementation based on the detected JVM version. JDKs 8,9,10 use `-XX:+UseConcMarkSweepGC`, while JDK11+ use `-XX:+UseG1GC`.
1508
1509 Notice a slight compatibility change. Previously, the garbage collector choice would always be appended to a user-provided value for `HBASE_OPTS`. As of this change, this setting will only be applied when `HBASE_OPTS` is unset. That means that operators who provide a value for this variable will now need to also specify the collector. This is especially important for those on JDK8, where the vm default GC is not the recommended ConcMarkSweep.
1510
1511
1512 ---
1513
1514 * [HBASE-24024](https://issues.apache.org/jira/browse/HBASE-24024) | *Major* | **Optionally reject multi() requests with very high no of rows**
1515
1516 New Config: hbase.rpc.rows.size.threshold.reject
1517 -----------------------------------------------------------------------
1518
1519 Default value: false
1520 Description:
1521 If value is true, RegionServer will abort batch requests of Put/Delete with number of rows in a batch operation exceeding threshold defined by value of config: hbase.rpc.rows.warning.threshold.
1522
1523
1524 ---
1525
1526 * [HBASE-24139](https://issues.apache.org/jira/browse/HBASE-24139) | *Critical* | **Balancer should avoid leaving idle region servers**
1527
1528 StochasticLoadBalancer functional improvement:
1529
1530 StochasticLoadBalancer would rebalance the cluster if there are any idle RegionServers in the cluster (RegionServer having no region), while other RegionServers have at least 1 region available.
1531
1532
1533 ---
1534
1535 * [HBASE-24196](https://issues.apache.org/jira/browse/HBASE-24196) | *Major* | **[Shell] Add rename rsgroup command in hbase shell**
1536
1537 user or admin can now use
1538 hbase shell \> rename\_rsgroup 'oldname', 'newname'
1539 to rename rsgroup.
1540
1541
1542 ---
1543
1544 * [HBASE-24218](https://issues.apache.org/jira/browse/HBASE-24218) | *Major* | **Add hadoop 3.2.x in hadoop check**
1545
1546 Add hadoop-3.2.0 and hadoop-3.2.1 in hadoop check and when '--quick-hadoopcheck' we will only check hadoop-3.2.1.
1547
1548 Notice that, for aligning the personality scripts across all the active branches, we will commit the patch to all active branches, but the hadoop-3.2.x support in hadoopcheck is only applied to branch-2.2+.
1549
1550
1551 ---
1552
1553 * [HBASE-23829](https://issues.apache.org/jira/browse/HBASE-23829) | *Major* | **Get \`-PrunSmallTests\` passing on JDK11**
1554
1555 \`-PrunSmallTests\` now pass on JDK11 when using \`-Phadoop.profile=3.0\`.
1556
1557
1558 ---
1559
1560 * [HBASE-24185](https://issues.apache.org/jira/browse/HBASE-24185) | *Major* | **Junit tests do not behave well with System.exit or Runtime.halt or JVM exits in general.**
1561
1562 Tests that fail because a process -- RegionServer or Master -- called System.exit, will now instead throw an exception.
1563
1564
1565 ---
1566
1567 * [HBASE-24072](https://issues.apache.org/jira/browse/HBASE-24072) | *Major* | **Nightlies reporting OutOfMemoryError: unable to create new native thread**
1568
1569 Hadoop hosts have had their ulimit -u raised from 10000 to 30000 (per user, by INFRA). The Docker build container has had its limit raised from 10000 to 12500.
1570
1571
1572 ---
1573
1574 * [HBASE-24112](https://issues.apache.org/jira/browse/HBASE-24112) | *Major* | **[RSGroup] Support renaming rsgroup**
1575
1576 Support RSGroup renaming in core codebase. New API Admin#renameRSGroup(String, String) is introduced in 3.0.0.
1577
1578
1579 ---
1580
1581 * [HBASE-23994](https://issues.apache.org/jira/browse/HBASE-23994) | *Trivial* | ** Add WebUI to Canary**
1582
1583 <!-- markdown -->
1584 The Canary tool now offers a WebUI when run in `region` mode (the default mode). It is enabled by default, and by default, it binds to `0.0.0.0:16050`. This can be overridden by setting `hbase.canary.info.bindAddress` and `hbase.canary.info.port`. To disable entirely, set the port to `-1`.
1585
1586
1587 ---
1588
1589 * [HBASE-23779](https://issues.apache.org/jira/browse/HBASE-23779) | *Major* | **Up the default fork count to make builds complete faster; make count relative to CPU count**
1590
1591 Pass --threads=2 building on jenkins. It shortens nightly build times by about ~25%.
1592
1593 It works by running module build/test in parallel when dependencies allow. Upping the forkcount beyond the pom default of 0.25C would have us broach our CPU budget on jenkins when two modules are running in parallel (2 modules at 0.25% of CPU each makes 0.5C and on jenkins, hadoop nodes run two jenkins executors per host).  Higher forkcounts also seems to threaten build stability.
1594
1595 For running tests locally, to go faster, up fork count.
1596
1597 $ x="0.5C"  ;  mvn --threads=2  -Dsurefire.firstPartForkCount=$x -Dsurefire.secondPartForkCount=$x test -PrunAllTests
1598
1599 You could up the x from 0.5C to 1.0C but YMMV (On overcommitted hardware, tests start bombing out pretty soon after startup). You could try upping thread count but on occasion are likely to overcommit hardware.
1600
1601
1602 ---
1603
1604 * [HBASE-24126](https://issues.apache.org/jira/browse/HBASE-24126) | *Major* | **Up the container nproc uplimit from 10000 to 12500**
1605
1606 Start docker with upped ulimit for nproc passing '--ulimit nproc=12500'. It was 10000, the default, but made it 12500. Then, set PROC\_LIMIT in hbase-personality so when yetus runs, it is w/ the new 12500 value.
1607
1608
1609 ---
1610
1611 * [HBASE-24150](https://issues.apache.org/jira/browse/HBASE-24150) | *Major* | **Allow module tests run in parallel**
1612
1613 Pass -T2 to mvn. Makes it so we do two modules-at-a-time dependencies willing. Helps speed build and testing. Doubles the resource usage when running modules in parallel.
1614
1615
1616 ---
1617
1618 * [HBASE-24121](https://issues.apache.org/jira/browse/HBASE-24121) | *Major* | **[Authorization] ServiceAuthorizationManager isn't dynamically updatable. And it should be.**
1619
1620 Master & RegionService now support refresh policy authorization defined in hbase-policy.xml without restarting service. To refresh policy, please execute hbase shell command: update\_config or update\_config\_all after policy file updated and synced on all nodes.
1621
1622
1623 ---
1624
1625 * [HBASE-24099](https://issues.apache.org/jira/browse/HBASE-24099) | *Major* | **Use a fair ReentrantReadWriteLock for the region close lock**
1626
1627 This change modifies the default acquisition policy for the region's close lock in order to prevent observed starvation of close requests. The new boolean configuration parameter 'hbase.regionserver.fair.region.close.lock' controls the lock acquisition policy: if true, the lock is created in fair mode (default); if false, the lock is created in nonfair mode (the old default).
1628
1629
1630 ---
1631
1632 * [HBASE-23153](https://issues.apache.org/jira/browse/HBASE-23153) | *Major* | **PrimaryRegionCountSkewCostFunction SLB function should implement CostFunction#isNeeded**
1633
1634 <!-- markdown -->
1635 The `PrimaryRegionCountSkewCostFunction` for the `StochasticLoadBalancer` is only needed when the read replicas feature is enabled. With this change, that function now properly indicates that it is not needed when the read replica feature is off.
1636
1637 If this improvement is not available, operators with clusters that are not using the read replica feature should manually disable it by setting `hbase.master.balancer.stochastic.primaryRegionCountCost` to `0.0` in hbase-site.xml for all HBase Masters.
1638
1639
1640 ---
1641
1642 * [HBASE-24055](https://issues.apache.org/jira/browse/HBASE-24055) | *Major* | **Make AsyncFSWAL can run on EC cluster**
1643
1644 Now AsyncFSWAL can also be used against the directory which has EC enabled. Need to make sure you also make use of the hadoop 3.x client as the option is only available in hadoop 3.x.
1645
1646
1647 ---
1648
1649 * [HBASE-24113](https://issues.apache.org/jira/browse/HBASE-24113) | *Major* | **Upgrade the maven we use from 3.5.4 to 3.6.3 in nightlies**
1650
1651 Branches-2.3+ use maven 3.5.3 building. Older branches use 3.5.4 still.
1652
1653
1654 ---
1655
1656 * [HBASE-24122](https://issues.apache.org/jira/browse/HBASE-24122) | *Major* | **Change machine ulimit-l to ulimit-a so dumps full ulimit rather than just 'max locked memory'**
1657
1658 Our 'Build Artifacts' have a machine directory under which we emit vitals on the host the build was run on. We used to emit the result of 'ulimit -l' as a file named 'ulimit-l'. This has been hijacked to instead emit result of running 'ulimit -a' which includes stat on ulimit -l.
1659
1660
1661 ---
1662
1663 * [HBASE-23678](https://issues.apache.org/jira/browse/HBASE-23678) | *Major* | **Literate builder API for version management in schema**
1664
1665 ColumnFamilyDescriptor new builder API:
1666
1667     /\*\*
1668      \* Retain all versions for a given TTL(retentionInterval), and then only a specific number
1669      \* of versions(versionAfterInterval) after that interval elapses.
1670      \*
1671      \* @param retentionInterval Retain all versions for this interval
1672      \* @param versionAfterInterval Retain no of versions to retain after retentionInterval
1673      \*/
1674     public ModifyableColumnFamilyDescriptor setVersionsWithTimeToLive(
1675         final int retentionInterval, final int versionAfterInterval)
1676
1677
1678 ---
1679
1680 * [HBASE-24050](https://issues.apache.org/jira/browse/HBASE-24050) | *Major* | **Deprecated PBType on all 2.x branches**
1681
1682 org.apache.hadoop.hbase.types.PBType is marked as deprecated without any replacement. It will be moved to hbase-example module and marked as IA.Private in 3.0.0. This is a mistake as it should not be part of our public API. Users who depend on this class should just copy the code your own code base.
1683
1684
1685 ---
1686
1687 * [HBASE-8868](https://issues.apache.org/jira/browse/HBASE-8868) | *Minor* | **add metric to report client shortcircuit reads**
1688
1689 Expose file system level read metrics for RegionServer.
1690
1691 If the HBase RS runs on top of HDFS, calculate the aggregation of
1692 ReadStatistics of each HdfsFileInputStream. These metrics include:
1693 (1) total number of bytes read from HDFS.
1694 (2) total number of bytes read from local DataNode.
1695 (3) total number of bytes read locally through short-circuit read.
1696 (4) total number of bytes read locally through zero-copy read.
1697
1698 Because HDFS ReadStatistics is calculated per input stream, it is not
1699 feasible to update the aggregated number in real time. Instead, the
1700 metrics are updated when an input stream is closed.
1701
1702
1703 ---
1704
1705 * [HBASE-24032](https://issues.apache.org/jira/browse/HBASE-24032) | *Major* | **[RSGroup] Assign created tables to respective rsgroup automatically instead of manual operations**
1706
1707 Admin can determine which tables go to which rsgroup by script  (setting hbase.rsgroup.table.mapping.script with local filystem path) on Master side which aims to lighten the burden of admin operations.  Note, since HBase 3+, rsgroup can be specified in TableDescriptor as well, if clients specify this, master will skip the determination from script.
1708
1709 Here is a simple example of script:
1710 {code}
1711 # Input consists of two string, 1st is the namespace of the table, 2nd is the table name of the table
1712 #!/bin/bash
1713 namespace=$1
1714 tablename=$2
1715 if [[ $namespace == test ]]; then
1716   echo test
1717 elif [[ $tablename == \*foo\* ]]; then
1718   echo other
1719 else
1720   echo default
1721 fi
1722 {code}
1723
1724
1725 ---
1726
1727 * [HBASE-23993](https://issues.apache.org/jira/browse/HBASE-23993) | *Major* | **Use loopback for zk standalone server in minizkcluster**
1728
1729 MiniZKCluster now puts up its standalone node listening on loopback/127.0.0.1 rather than "localhost".
1730
1731
1732 ---
1733
1734 * [HBASE-23986](https://issues.apache.org/jira/browse/HBASE-23986) | *Major* | **Bump hadoop-two.version to 2.10.0 on master and branch-2**
1735
1736 Bumped hadoop-two.version to 2.10.0, which means we will drop the support for hadoop-2.8.x and hadoop-2.9.x.
1737
1738
1739 ---
1740
1741 * [HBASE-23930](https://issues.apache.org/jira/browse/HBASE-23930) | *Minor* | **Shell should attempt to format \`timestamp\` attributes as ISO-8601**
1742
1743 Change timestamp display to be ISO8601 when toString on Cell and outputting in shell....
1744
1745 User used to see....
1746
1747   column=table:state, timestamp=1583967620343 .....
1748
1749 ... but now sees:
1750
1751   column=table:state, timestamp=2020-03-11T23:00:20.343Z ....
1752
1753
1754 ---
1755
1756 * [HBASE-22827](https://issues.apache.org/jira/browse/HBASE-22827) | *Major* | **Expose multi-region merge in shell and Admin API**
1757
1758 merge\_region shell command can now be used to merge more than 2 regions as well. It takes a list of regions as comma separated values or as an array of regions, and not just 2 regions. The full regionnames and encoded regionnames are continued to be accepted.
1759
1760
1761 ---
1762
1763 * [HBASE-23767](https://issues.apache.org/jira/browse/HBASE-23767) | *Major* | **Add JDK11 compilation and unit test support to Github precommit**
1764
1765 Rebuild our Dockerfile with support for multiple JDK versions. Use multiple stages in the Jenkinsfile instead of yetus's multijdk because of YETUS-953. Run those multiple stages in parallel to speed up results.
1766
1767 Note that multiple stages means multiple Yetus invocations means multiple comments on the PreCommit. This should become more obvious to users once we can make use of GitHub Checks API, HBASE-23902.
1768
1769
1770 ---
1771
1772 * [HBASE-22978](https://issues.apache.org/jira/browse/HBASE-22978) | *Minor* | **Online slow response log**
1773
1774 get\_slowlog\_responses and clear\_slowlog\_responses are used to retrieve and clear slow RPC logs from RingBuffer maintained by RegionServers.
1775
1776 New Admin APIs:
1777 1.   List\<SlowLogRecord\> getSlowLogResponses(final Set\<ServerName\> serverNames,
1778       final SlowLogQueryFilter slowLogQueryFilter) throws IOException;
1779
1780 2.   List\<Boolean\> clearSlowLogResponses(final Set\<ServerName\> serverNames)
1781       throws IOException;
1782
1783 Configs:
1784
1785 1. hbase.regionserver.slowlog.ringbuffer.size:
1786 Default size of ringbuffer to be maintained by each RegionServer in order to store online slowlog responses. This is an in-memory ring buffer of requests that were judged to be too slow in addition to the responseTooSlow logging. The in-memory representation would be complete. For more details, please look into Doc Section: Get Slow Response Log from shell
1787
1788 Default
1789 256
1790
1791 2. hbase.regionserver.slowlog.buffer.enabled:
1792 Indicates whether RegionServers have ring buffer running for storing Online Slow logs in FIFO manner with limited entries. The size of the ring buffer is indicated by config: hbase.regionserver.slowlog.ringbuffer.size The default value is false, turn this on and get latest slowlog responses with complete data.
1793
1794 Default
1795 false
1796
1797
1798 For more details, please look into "Get Slow Response Log from shell" section from HBase book.
1799
1800
1801 ---
1802
1803 * [HBASE-23926](https://issues.apache.org/jira/browse/HBASE-23926) | *Major* | **[Flakey Tests] Down the flakies re-run ferocity; it makes for too many fails.**
1804
1805 Down the flakey re-rerun fork count from 1.0C -- i.e. a fork per CPU -- to 0.25C. On a recent run, the machine had 16 cores. 0.25 is 4 cores. We'd hardcoded fork count at 3 previous to changes made by parent.
1806
1807
1808 ---
1809
1810 * [HBASE-23146](https://issues.apache.org/jira/browse/HBASE-23146) | *Major* | **Support CheckAndMutate with multiple conditions**
1811
1812 Add a checkAndMutate(row, filter) method in the AsyncTable interface and the Table interface.
1813
1814 This method atomically checks if the row matches the specified filter. If it does, it adds the Put/Delete/RowMutations.
1815
1816 This is a fluent style API, the code is like:
1817
1818 For Table interface:
1819 {code}
1820 table.checkAndMutate(row, filter).thenPut(put);
1821 {code}
1822
1823 For AsyncTable interface:
1824 {code}
1825 table.checkAndMutate(row, filter).thenPut(put)
1826     .thenAccept(succ -\> {
1827       if (succ) {
1828         System.out.println("Check and put succeeded");
1829       } else {
1830         System.out.println("Check and put failed");
1831       }
1832     });
1833 {code}
1834
1835
1836 ---
1837
1838 * [HBASE-23874](https://issues.apache.org/jira/browse/HBASE-23874) | *Minor* | **Move Jira-attached file precommit definition from script in Jenkins config to dev-support**
1839
1840 The Jira Precommit job (https://builds.apache.org/job/PreCommit-HBASE-Build/) will now look for a file within the source tree (dev-support/jenkins\_precommit\_jira\_yetus.sh) instead of depending on a script section embedded in the job.
1841
1842
1843 ---
1844
1845 * [HBASE-23865](https://issues.apache.org/jira/browse/HBASE-23865) | *Major* | **Up flakey history from 5 to 10**
1846
1847 Changed flakey list reporting to show 5 rather than 10 items. Also changed the second and first part fort counts to be 1C rather than hardcoded 3.
1848
1849
1850 ---
1851
1852 * [HBASE-23554](https://issues.apache.org/jira/browse/HBASE-23554) | *Major* | **Encoded regionname to regionname utility**
1853
1854     Adds shell command regioninfo:
1855
1856       hbase(main):001:0\>  regioninfo '0e6aa5c19ae2b2627649dc7708ce27d0'
1857       {ENCODED =\> 0e6aa5c19ae2b2627649dc7708ce27d0, NAME =\> 'TestTable,,1575941375972.0e6aa5c19ae2b2627649dc7708ce27d0.', STARTKEY =\> '', ENDKEY =\> '00000000000000000000299441'}
1858       Took 0.4737 seconds
1859
1860
1861 ---
1862
1863 * [HBASE-23350](https://issues.apache.org/jira/browse/HBASE-23350) | *Major* | **Make compaction files cacheonWrite configurable based on threshold**
1864
1865 This JIRA adds a new configuration - \`hbase.rs.cachecompactedblocksonwrite.threshold\`. This configuration is the maximum total size (in bytes) of the compacted files below which the configuration \`hbase.rs.cachecompactedblocksonwrite\` is honoured. If the total size of the compacted fies exceeds this threshold, even when \`hbase.rs.cachecompactedblocksonwrite\` is enabled, the data blocks are not cached. Caching index and bloom blocks is not affected by this configuration (user configuration is always honoured).
1866
1867 Default value of this configuration is Long.MAX\_VALUE. This means whatever the total size of the compacted files, it wil be cached.
1868
1869
1870 ---
1871
1872 * [HBASE-17115](https://issues.apache.org/jira/browse/HBASE-17115) | *Major* | **HMaster/HRegion Info Server does not honour admin.acl**
1873
1874 Implements authorization for the HBase Web UI by limiting access to certain endpoints which could be used to extract sensitive information from HBase.
1875
1876 Access to these restricted endpoints can be limited to a group of administrators, identified either by a list of users (hbase.security.authentication.spnego.admin.users) or by a list of groups
1877 (hbase.security.authentication.spnego.admin.groups).  By default, neither of these values are set which will preserve backwards compatibility (allowing all authenticated users to access all endpoints).
1878
1879 Further, users who have sensitive information in the HBase service configuration can set hbase.security.authentication.ui.config.protected to true which will treat the configuration endpoint as a protected, admin-only resource. By default, all authenticated users may access the configuration endpoint.
1880
1881
1882 ---
1883
1884 * [HBASE-23647](https://issues.apache.org/jira/browse/HBASE-23647) | *Major* | **Make MasterRegistry the default registry impl**
1885
1886 <!-- markdown -->
1887 Enables master based registry as the default registry used by clients to fetch connection metadata.
1888 Refer to the section "Master Registry" in the client documentation for more details and advantages
1889 of this implementation over the default Zookeeper based registry.
1890
1891 Configuration parameter that controls the registry in use: `hbase.client.registry.impl`
1892
1893 Where to set this: HBase client configuration (hbase-site.xml)
1894
1895 Possible values:
1896 - `org.apache.hadoop.hbase.client.ZKConnectionRegistry` (For ZK based registry implementation)
1897 - `org.apache.hadoop.hbase.client.MasterRegistry` (New, for master based registry implementation)
1898
1899 Notes on defaults:
1900
1901 - For v3.0.0 and later, MasterRegistry is the default registry
1902 - For all releases in 2.x line, ZK based registry is the default.
1903
1904 This feature has been back ported to 2.3.0 and later releases. MasterRegistry can be enabled by setting the following client configuration.
1905
1906 ```
1907 <property>
1908   <name>hbase.client.registry.impl</name>
1909   <value>org.apache.hadoop.hbase.client.MasterRegistry</value>
1910 </property>
1911 ```
1912
1913
1914 ---
1915
1916 * [HBASE-23069](https://issues.apache.org/jira/browse/HBASE-23069) | *Critical* | **periodic dependency bump for Sep 2019**
1917
1918 caffeine: 2.6.2 =\> 2.8.1
1919 commons-codec: 1.10 =\> 1.13
1920 commons-io: 2.5 =\> 2.6
1921 disrupter: 3.3.6 =\> 3.4.2
1922 httpcore: 4.4.6 =\> 4.4.13
1923 jackson: 2.9.10 =\> 2.10.1
1924 jackson.databind: 2.9.10.1 =\> 2.10.1
1925 jetty: 9.3.27.v20190418 =\> 9.3.28.v20191105
1926 protobuf.plugin: 0.5.0 =\> 0.6.1
1927 zookeeper: 3.4.10 =\> 3.4.14
1928 slf4j: 1.7.25 =\> 1.7.30
1929 rat: 0.12 =\> 0.13
1930 asciidoctor: 1.5.5 =\> 1.5.8
1931 asciidoctor.pdf: 1.5.0-alpha.15 =\> 1.5.0-rc.2
1932 error-prone: 2.3.3 =\> 2.3.4
1933
1934
1935 ---
1936
1937 * [HBASE-23686](https://issues.apache.org/jira/browse/HBASE-23686) | *Major* | **Revert binary incompatible change and remove reflection**
1938
1939 - Reverts a binary incompatible binary change for ByteRangeUtils
1940 - Usage of reflection inside CommonFSUtils removed
1941
1942
1943 ---
1944
1945 * [HBASE-23347](https://issues.apache.org/jira/browse/HBASE-23347) | *Major* | **Pluggable RPC authentication**
1946
1947 This change introduces an internal abstraction layer which allows for new SASL-based authentication mechanisms to be used inside HBase services. All existing SASL-based authentication mechanism were ported to the new abstraction, making no external change in runtime semantics, client API, or RPC serialization format.
1948
1949 Developers familiar with extending HBase can implement authentication mechanism beyond simple Kerberos and DelegationTokens which authenticate HBase users against some other user database. HBase service authentication (Master to/from RegionServer) continue to operate solely over Kerberos.
1950
1951
1952 ---
1953
1954 * [HBASE-23156](https://issues.apache.org/jira/browse/HBASE-23156) | *Major* | **start-hbase.sh failed with ClassNotFoundException when build with hadoop3**
1955
1956 Introduce a new hbase-assembly/src/main/assembly/hadoop-three-compat.xml for build with hadoop 3.x.
1957
1958
1959 ---
1960
1961 * [HBASE-23680](https://issues.apache.org/jira/browse/HBASE-23680) | *Major* | **RegionProcedureStore missing cleaning of hfile archive**
1962
1963 Add a new config to hbase-default.xml
1964
1965   \<property\>
1966     \<name\>hbase.procedure.store.region.hfilecleaner.plugins\</name\>
1967     \<value\>org.apache.hadoop.hbase.master.cleaner.TimeToLiveHFileCleaner\</value\>
1968     \<description\>A comma-separated list of BaseHFileCleanerDelegate invoked by
1969     the RegionProcedureStore HFileCleaner service. These HFiles cleaners are
1970     called in order, so put the cleaner that prunes the most files in front. To
1971     implement your own BaseHFileCleanerDelegate, just put it in HBase's classpath
1972     and add the fully qualified class name here. Always add the above
1973     default hfile cleaners in the list as they will be overwritten in
1974     hbase-site.xml.\</description\>
1975   \</property\>
1976
1977 It will share the same TTL with other HFileCleaners. And you can also implement your own cleaner and change this property to enable it.
1978
1979
1980 ---
1981
1982 * [HBASE-23675](https://issues.apache.org/jira/browse/HBASE-23675) | *Minor* | **Move to Apache parent POM version 22**
1983
1984 Updated parent pom to Apache version 22.
1985
1986
1987 ---
1988
1989 * [HBASE-23679](https://issues.apache.org/jira/browse/HBASE-23679) | *Critical* | **FileSystem instance leaks due to bulk loads with Kerberos enabled**
1990
1991 This issues fixes an issue with Bulk Loading on installations with Kerberos enabled and more than a single RegionServer. When multiple tables are involved in hosting a table's regions which are being bulk-loaded into, all but the RegionServer hosting the table's first Region will "leak" one DistributedFileSystem object onto the heap, never freeing that memory. Eventually, with enough bulk loads, this will create a situation for RegionServers where they have no free heap space and will either spend all time in JVM GC, lose their ZK session, or crash with an OutOfMemoryError.
1992
1993 The only mitigation for this issue is to periodically restart RegionServers. All earlier versions of HBase 2.x are subject to this issue (2.0.x, \<=2.1.8, \<=2.2.3)
1994
1995
1996 ---
1997
1998 * [HBASE-23286](https://issues.apache.org/jira/browse/HBASE-23286) | *Major* | **Improve MTTR: Split WAL to HFile**
1999
2000 Add a new feature to improve MTTR which have 3 steps to failover:
2001 1. Read WAL and write HFile to region’s column family’s recovered.hfiles directory.
2002 2. Open region.
2003 3. Bulkload the recovered.hfiles for every column family.
2004
2005 Compared to DLS(distributed log split), this feature will reduce region open time significantly.
2006
2007 Config hbase.wal.split.to.hfile to true to enable this featue.
2008
2009
2010 ---
2011
2012 * [HBASE-23619](https://issues.apache.org/jira/browse/HBASE-23619) | *Trivial* | **Use built-in formatting for logging in hbase-zookeeper**
2013
2014 Changed the logging in hbase-zookeeper to use built-in formatting
2015
2016
2017 ---
2018
2019 * [HBASE-23628](https://issues.apache.org/jira/browse/HBASE-23628) | *Minor* | **Replace Apache Commons Digest Base64 with JDK8 Base64**
2020
2021 From the PR:
2022
2023 "Yes. The two create the same output... I just wrote a small test suite to increase my confidence on that. I generated many tens of millions of random byte patterns and compared the output of the two algorithms. They came back identical every time.
2024
2025 "Just in case any inquiring minds would like to know, there is no longer an encoding required when generating the strings. The JDK implementation specifically specifies that strings returned are StandardCharsets.ISO\_8859\_1. This does not change anything because UTF8 and ISO\_8859 overlap for the limited character set (64 characters) the encoding uses."
2026
2027
2028 ---
2029
2030 * [HBASE-23651](https://issues.apache.org/jira/browse/HBASE-23651) | *Major* | **Region balance throttling can be disabled**
2031
2032 Set hbase.balancer.max.balancing to a int value which \<=0 will disable region balance throttling.
2033
2034
2035 ---
2036
2037 * [HBASE-23588](https://issues.apache.org/jira/browse/HBASE-23588) | *Major* | **Cache index blocks and bloom blocks on write if CacheCompactedBlocksOnWrite is enabled**
2038
2039 If cacheOnWrite is enabled during flush or compaction, index and bloom blocks(with data blocks) would be automatically cached during write.
2040
2041
2042 ---
2043
2044 * [HBASE-23369](https://issues.apache.org/jira/browse/HBASE-23369) | *Major* | **Auto-close 'unknown' Regions reported as OPEN on RegionServers**
2045
2046 If a RegionServer reports a Region as OPEN in disagreement with Master's status on the Region, the Master now tells the RegionServer to silently close the Region.
2047
2048
2049 ---
2050
2051 * [HBASE-23596](https://issues.apache.org/jira/browse/HBASE-23596) | *Major* | **HBCKServerCrashProcedure can double assign**
2052
2053 Makes it so the recently added HBCKServerCrashProcedure -- the SCP that gets invoked when an operator schedules an SCP via hbck2 scheduleRecoveries command -- now works the same as SCP EXCEPT if master knows nothing of the scheduled servername. In this latter case, HBCKSCP will do a full scan of hbase:meta looking for instances of the passed servername. If any found it will attempt cleanup of hbase:meta references by reassigning any found OPEN or OPENING and by closing any in CLOSING state.
2054
2055 Used to fix instances of what the 'HBCK Report' page shows as 'Unknown Servers'.
2056
2057
2058 ---
2059
2060 * [HBASE-23624](https://issues.apache.org/jira/browse/HBASE-23624) | *Major* | **Add a tool to dump the procedure info in HFile**
2061
2062 Use ./hbase org.apache.hadoop.hbase.procedure2.store.region.HFileProcedurePrettyPrinter to run the tool.
2063
2064
2065 ---
2066
2067 * [HBASE-23590](https://issues.apache.org/jira/browse/HBASE-23590) | *Major* | **Update maxStoreFileRefCount to maxCompactedStoreFileRefCount**
2068
2069 RegionsRecoveryChore introduced as part of HBASE-22460 tries to reopen regions based on config: hbase.regions.recovery.store.file.ref.count.
2070 Region reopen needs to take into consideration all compacted away store files that belong to the region and not store files(non-compacted).
2071
2072 Fixed this bug as part of this Jira.
2073 Updated description for corresponding configs:
2074
2075 1. hbase.master.regions.recovery.check.interval :
2076
2077 Regions Recovery Chore interval in milliseconds. This chore keeps running at this interval to find all regions with configurable max store file ref count and reopens them. Defaults to 20 mins
2078
2079 2. hbase.regions.recovery.store.file.ref.count :
2080
2081 Very large number of ref count on a compacted store file indicates that it is a ref leak on that object(compacted store file). Such files can not be removed after it is invalidated via compaction. Only way to recover in such scenario is to reopen the region which can release all resources, like the refcount, leases, etc. This config represents Store files Ref Count threshold value considered for reopening regions. Any region with compacted store files ref count \> this value would be eligible for reopening by master. Here, we get the max refCount among all refCounts on all compacted away store files that belong to a particular region. Default value -1 indicates this feature is turned off. Only positive integer value should be provided to enable this feature.
2082
2083
2084 ---
2085
2086 * [HBASE-23618](https://issues.apache.org/jira/browse/HBASE-23618) | *Major* | **Add a tool to dump procedure info in the WAL file**
2087
2088 Use ./hbase org.apache.hadoop.hbase.procedure2.store.region.WALProcedurePrettyPrinter to run the tool.
2089
2090
2091 ---
2092
2093 * [HBASE-23617](https://issues.apache.org/jira/browse/HBASE-23617) | *Major* | **Add a stress test tool for region based procedure store**
2094
2095 Use ./hbase org.apache.hadoop.hbase.procedure2.store.region.RegionProcedureStorePerformanceEvaluation to run the tool.
2096
2097
2098 ---
2099
2100 * [HBASE-23326](https://issues.apache.org/jira/browse/HBASE-23326) | *Critical* | **Implement a ProcedureStore which stores procedures in a HRegion**
2101
2102 Use a region based procedure store to replace the old customized WAL based procedure store. The procedure data migration is done automatically during upgrading. After upgrading, the MasterProcWALs directory will be deleted and a new MasterProc directory will be created. And notice that a region will still write WAL so we still have WAL files and they will be moved to the oldWALs directory. The file name is mostly like a normal WAL file, and the only difference is that it is ended with "$masterproc$".
2103
2104
2105 ---
2106
2107 * [HBASE-23320](https://issues.apache.org/jira/browse/HBASE-23320) | *Major* | **Upgrade surefire plugin to 3.0.0-M4**
2108
2109 Bumped surefire plugin to 3.0.0-M4
2110
2111
2112 ---
2113
2114 * [HBASE-20461](https://issues.apache.org/jira/browse/HBASE-20461) | *Major* | **Implement fsync for AsyncFSWAL**
2115
2116 Now AsyncFSWAL also supports Durability.FSYNC\_WAL.
2117
2118
2119 ---
2120
2121 * [HBASE-23066](https://issues.apache.org/jira/browse/HBASE-23066) | *Minor* | **Create a config that forces to cache blocks on compaction**
2122
2123 The configuration 'hbase.rs.cacheblocksonwrite' was used to enable caching the blocks on write. But purposefully we were not caching the blocks when we do compaction (since it may be very aggressive) as the caching happens as and when the writer completes a block.
2124 In cloud environments since they have bigger sized caches - though they try to enable 'hbase.rs.prefetchblocksonopen' (non - aggressive way of caching the blocks proactively on reader creation) it does not help them because it takes time to cache the compacted blocks.
2125 This feature creates a new configuration  'hbase.rs.cachecompactedblocksonwrite' which when set to 'true' will enable the blocks created out of compaction.
2126 Remember that since it is aggressive caching the user should be having enough cache space - if not it may lead to other active blocks getting evicted.
2127 From the shell this can be enabled by using the option per Column Family also by using the below format
2128 {code}
2129 create 't1', 'f1', {NUMREGIONS =\> 15, SPLITALGO =\> 'HexStringSplit', CONFIGURATION =\> {'hbase.rs.cachecompactedblocksonwrite' =\> 'true'}}
2130 {code}
2131
2132
2133 ---
2134
2135 * [HBASE-23239](https://issues.apache.org/jira/browse/HBASE-23239) | *Major* | **Reporting on status of backing MOB files from client-facing cells**
2136
2137 <!-- markdown -->
2138
2139 Users of the MOB feature can now use the `mobrefs` utility to get statistics about data in the MOB system and verify the health of backing files on HDFS.
2140
2141 ```
2142 HADOOP_CLASSPATH=/etc/hbase/conf:$(hbase mapredcp) yarn jar \
2143     /some/path/to/hbase-shaded-mapreduce.jar mobrefs mobrefs-report-output some_table foo
2144 ```
2145
2146 See javadocs of the class `MobRefReporter` for more details.
2147
2148 the reference guide has added some information about MOB internals and troubleshooting.
2149
2150
2151 ---
2152
2153 * [HBASE-23549](https://issues.apache.org/jira/browse/HBASE-23549) | *Minor* | **Document steps to disable MOB for a column family**
2154
2155 The reference guide now includes a walk through of disabling the MOB feature if needed while maintaining availability.
2156
2157
2158 ---
2159
2160 * [HBASE-23582](https://issues.apache.org/jira/browse/HBASE-23582) | *Minor* | **Unbalanced braces in string representation of table descriptor**
2161
2162 Fixed unbalanced braces in string representation within HBase shell
2163
2164
2165 ---
2166
2167 * [HBASE-23293](https://issues.apache.org/jira/browse/HBASE-23293) | *Minor* | **[REPLICATION] make ship edits timeout configurable**
2168
2169 The default rpc timeout for ReplicationSourceShipper#shipEdits is 60s, when bulkload replication enabled, timeout exception may be occurred.
2170 Now we can conf the timeout value through replication.source.shipedits.timeout, and it’s adaptive.
2171
2172
2173 ---
2174
2175 * [HBASE-23312](https://issues.apache.org/jira/browse/HBASE-23312) | *Major* | **HBase Thrift SPNEGO configs (HBASE-19852) should be backwards compatible**
2176
2177 The newer HBase Thrift SPNEGO configs should not be required. The hbase.thrift.spnego.keytab.file and hbase.thrift.spnego.principal configs will fall back to the hbase.thrift.keytab.file and hbase.thrift.kerberos.principal original configs. The older configs will log a deprecation warning. It is preferred to new the newer SPNEGO configurations.
2178
2179
2180 ---
2181
2182 * [HBASE-22969](https://issues.apache.org/jira/browse/HBASE-22969) | *Minor* | **A new binary component comparator(BinaryComponentComparator) to perform comparison of arbitrary length and position**
2183
2184 With BinaryComponentCompartor applications will be able to design diverse and powerful set of filters for rows and columns. See https://issues.apache.org/jira/browse/HBASE-22969 for example. In general, the comparator can be used with any filter taking ByteArrayComparable. As of now, following filters take ByteArrayComparable:
2185
2186 1. RowFilter
2187 2. ValueFilter
2188 3. QualifierFilter
2189 4. FamilyFilter
2190 5. ColumnValueFilter
2191
2192
2193 ---
2194
2195 * [HBASE-23234](https://issues.apache.org/jira/browse/HBASE-23234) | *Major* | **Provide .editorconfig based on checkstyle configuration**
2196
2197 Adds a .editorconfig file with configurations populated by IntelliJ, based on our checkstyle configuration. There's lots of IntelliJ-specific configs in here that I assume are not replicated to Eclipse or Netbeans users. Any devs using those tools should push whatever updates they see fit, but please start with the checkstyle configs as the origin of truth.
2198
2199
2200 ---
2201
2202 * [HBASE-23322](https://issues.apache.org/jira/browse/HBASE-23322) | *Minor* | **[hbck2] Simplification on HBCKSCP scheduling**
2203
2204 An hbck2 scheduleRecoveries will run a subclass of ServerCrashProcedure which asks Master what Regions were on the dead Server but it will also do a hbase:meta table scan to see if any vestiges of the old Server remain (for the case where an SCP failed mid-point leaving references in place or where Master and hbase:meta deviated in accounting).
2205
2206
2207 ---
2208
2209 * [HBASE-23321](https://issues.apache.org/jira/browse/HBASE-23321) | *Minor* | **[hbck2] fixHoles of fixMeta doesn't update in-memory state**
2210
2211 If holes in hbase:meta, hbck2 fixMeta now will update Master in-memory state so you do not need to restart master just so you can assign the new hole-bridging regions.
2212
2213
2214 ---
2215
2216 * [HBASE-23282](https://issues.apache.org/jira/browse/HBASE-23282) | *Major* | **HBCKServerCrashProcedure for 'Unknown Servers'**
2217
2218 hbck2 scheduleRecoveries will now run a SCP that also looks in hbase:meta for any references to the scheduled server -- not just consult Master in-memory state -- just in case vestiges of the server are leftover in hbase:meta
2219
2220
2221 ---
2222
2223 * [HBASE-19450](https://issues.apache.org/jira/browse/HBASE-19450) | *Minor* | **Add log about average execution time for ScheduledChore**
2224
2225 <!-- markdown -->
2226 HBase internal chores now log a moving average of how long execution of each chore takes at `INFO` level for the logger `org.apache.hadoop.hbase.ScheduledChore`.
2227
2228 Such messages will happen at most once per five minutes.
2229
2230
2231 ---
2232
2233 * [HBASE-23250](https://issues.apache.org/jira/browse/HBASE-23250) | *Minor* | **Log message about CleanerChore delegate initialization should be at INFO**
2234
2235 CleanerChore delegate initialization is now logged at INFO level instead of DEBUG
2236
2237
2238 ---
2239
2240 * [HBASE-23243](https://issues.apache.org/jira/browse/HBASE-23243) | *Major* | **[pv2] Filter out SUCCESS procedures; on decent-sized cluster, plethora overwhelms problems**
2241
2242 The 'Procedures & Locks' tab in Master UI only displays problematic Procedures now (RUNNABLE, WAITING-TIMEOUT, etc.). It no longer notes procedures whose state is SUCCESS.
2243
2244
2245 ---
2246
2247 * [HBASE-23227](https://issues.apache.org/jira/browse/HBASE-23227) | *Blocker* | **Upgrade jackson-databind to 2.9.10.1 to avoid recent CVEs**
2248
2249 <!-- markdown -->
2250
2251 the Apache HBase REST Proxy now uses Jackson Databind version 2.9.10.1 to address the following CVEs
2252
2253   - CVE-2019-16942
2254   - CVE-2019-16943
2255
2256 Users of prior releases with Jackson Databind 2.9.10 are advised to either upgrade to this release or to upgrade their local Jackson Databind jar directly.
2257
2258
2259 ---
2260
2261 * [HBASE-23222](https://issues.apache.org/jira/browse/HBASE-23222) | *Critical* | **Better logging and mitigation for MOB compaction failures**
2262
2263 <!-- markdown -->
2264
2265 The MOB compaction process in the HBase Master now logs more about its activity.
2266
2267 In the event that you run into the problems described in HBASE-22075, there is a new HFileCleanerDelegate that will stop all removal of MOB hfiles from the archive area. It can be configured by adding `org.apache.hadoop.hbase.mob.ManualMobMaintHFileCleaner` to the list configured for `hbase.master.hfilecleaner.plugins`. This new cleaner delegate will cause your archive area to grow unbounded; you will have to manually prune files which may be prohibitively complex. Consider if your use case will allow you to mitigate by disabling mob compactions instead.
2268
2269 Caveats:
2270 * Be sure the list of cleaner delegates still includes the default cleaners you will likely need: ttl, snapshot, and hlink.
2271 * Be mindful that if you enable this cleaner delegate then there will be *no* automated process for removing these mob hfiles. You should see a single region per table in `%hbase_root%/archive` that accumulates files over time. You will have to determine which of these files are safe or not to remove.
2272 * You should list this cleaner delegate after the snapshot and hlink delegates so that you can enable sufficient logging to determine when an archived mob hfile is needed by those subsystems. When set to `TRACE` logging, the CleanerChore logger will include archive retention decision justifications.
2273 * If your use case creates a large number of uniquely named tables, this new delegate will cause memory pressure on the master.
2274
2275
2276 ---
2277
2278 * [HBASE-15519](https://issues.apache.org/jira/browse/HBASE-15519) | *Major* | **Add per-user metrics**
2279
2280 Adds per-user metrics for reads/writes to each RegionServer. These metrics are exported by default. hbase.regionserver.user.metrics.enabled can be used to disable the feature if desired for any reason.
2281
2282
2283 ---
2284
2285 * [HBASE-22460](https://issues.apache.org/jira/browse/HBASE-22460) | *Minor* | **Reopen a region if store reader references may have leaked**
2286
2287 Leaked store files can not be removed even after it is invalidated via compaction. A reasonable mitigation for a reader reference leak would be a fast reopen of the region on the same server.
2288
2289 Configs:
2290
2291 1. hbase.master.regions.recovery.check.interval :
2292
2293 Regions Recovery Chore interval in milliseconds. This chore keeps running at this interval to find all regions with configurable max store file ref count and reopens them. Defaults to 20 mins
2294
2295 2. hbase.regions.recovery.store.file.ref.count :
2296
2297 This config represents Store files Ref Count threshold value considered for reopening regions. Any region with store files ref count \> this value would be eligible for reopening by master. Default value -1 indicates this feature is turned off. Only positive integer value should be provided to enable this feature.
2298
2299
2300 ---
2301
2302 * [HBASE-23172](https://issues.apache.org/jira/browse/HBASE-23172) | *Minor* | **HBase Canary region success count metrics reflect column family successes, not region successes**
2303
2304 Added a comment to make clear that read/write success counts are tallying column family success counts, not region success counts.
2305
2306 Additionally, the region read and write latencies previously only stored the latencies of the last column family of the region reads/writes. This has been fixed by using a map of each region to a list of read and write latency values.
2307
2308
2309 ---
2310
2311 * [HBASE-23177](https://issues.apache.org/jira/browse/HBASE-23177) | *Major* | **If fail to open reference because FNFE, make it plain it is a Reference**
2312
2313 Changes the message on the FNFE exception thrown when the file a Reference points to is missing; the message now includes detail on Reference as well as pointed-to file so can connect how FNFE relates to region open.
2314
2315
2316 ---
2317
2318 * [HBASE-20626](https://issues.apache.org/jira/browse/HBASE-20626) | *Major* | **Change the value of "Requests Per Second" on WEBUI**
2319
2320 Use 'totalRowActionRequestCount' to calculate QPS on web UI.
2321
2322
2323 ---
2324
2325 * [HBASE-22874](https://issues.apache.org/jira/browse/HBASE-22874) | *Critical* | **Define a public interface for Canary and move existing implementation to LimitedPrivate**
2326
2327 <!-- markdown -->
2328 Downstream users who wish to programmatically check the health of their HBase cluster may now rely on a public interface derived from the previously private implementation of the canary cli tool. The interface is named `Canary` and can be found in the user facing javadocs.
2329
2330 Downstream users who previously relied on the invoking the canary via the Java classname (either on the command line or programmatically) will need to change how they do so because the non-public implementation has moved.
2331
2332
2333 ---
2334
2335 * [HBASE-23035](https://issues.apache.org/jira/browse/HBASE-23035) | *Major* | **Retain region to the last RegionServer make the failover slower**
2336
2337 Since 2.0.0，when one regionserver crashed and back online again, AssignmentManager will retain the region locations and try assign the regions to this regionserver(same host:port with the crashed one) again. But for 1.x.x, the behavior is round-robin assignment for the regions belong to the crashed regionserver. This jira change the "retain" assignment to round-robin assignment, which is same with 1.x.x version. This change will make the failover faster and improve availability.
2338
2339
2340 ---
2341
2342 * [HBASE-23046](https://issues.apache.org/jira/browse/HBASE-23046) | *Minor* | **Remove compatibility case from truncate command**
2343
2344 Remove backward compatibility from \`truncate\` and \`truncate\_preserve\` shell commands. This means that these commands from HBase Clients are not compatible with pre-0.99 HBase clusters.
2345
2346
2347 ---
2348
2349 * [HBASE-23040](https://issues.apache.org/jira/browse/HBASE-23040) | *Minor* | **region mover gives NullPointerException instead of saying a host isn't in the cluster**
2350
2351 giving the region mover "unload" command a region server name that isn't recognized by the cluster results in a "I don't know about that host" message instead of a NPE.
2352
2353 set log level to DEBUG if you'd like the region mover to log the set of region server names it got back from the cluster.
2354
2355
2356 ---
2357
2358 * [HBASE-21874](https://issues.apache.org/jira/browse/HBASE-21874) | *Major* | **Bucket cache on Persistent memory**
2359
2360 Added a new IOEngine type for Bucket cache ie Persistent memory. In order to use BC over pmem configure IOEngine as
2361 \<property\>
2362     \<name\>hbase.bucketcache.ioengine\</name\>
2363     \<value\> pmem:///path in persistent memory \</value\>
2364   \</property\>
2365
2366
2367 ---
2368
2369 * [HBASE-22760](https://issues.apache.org/jira/browse/HBASE-22760) | *Major* | **Stop/Resume Snapshot Auto-Cleanup activity with shell command**
2370
2371 By default, snapshot auto cleanup based on TTL would be enabled for any new cluster. At any point in time, if snapshot cleanup is supposed to be stopped due to some snapshot restore activity or any other reason, it is advisable to disable it using shell command:
2372 hbase\> snapshot\_cleanup\_switch false
2373
2374 We can re-enable it using:
2375 hbase\> snapshot\_cleanup\_switch true
2376
2377 We can query whether snapshot auto cleanup is enabled for cluster using:
2378 hbase\> snapshot\_cleanup\_enabled
2379
2380
2381 ---
2382
2383 * [HBASE-22796](https://issues.apache.org/jira/browse/HBASE-22796) | *Major* | **[HBCK2] Add fix of overlaps to fixMeta hbck Service**
2384
2385 Adds fix of overlaps to the fixMeta hbck service method. Uses the bulk-merge facility. Merges a max of 10 at a time. Set hbase.master.metafixer.max.merge.count to higher if you want to do more than 10 in the one go.
2386
2387
2388 ---
2389
2390 * [HBASE-21745](https://issues.apache.org/jira/browse/HBASE-21745) | *Critical* | **Make HBCK2 be able to fix issues other than region assignment**
2391
2392 This issue adds via its subtasks:
2393
2394  \* An 'HBCK Report' page to the Master UI added by HBASE-22527+HBASE-22709+HBASE-22723+ (since 2.1.6, 2.2.1, 2.3.0). Lists consistency or anomalies found via new hbase:meta consistency checking extensions added to CatalogJanitor (holes, overlaps, bad servers) and by a new 'HBCK chore' that runs at a lesser periodicity that will note filesystem orphans and overlaps as well as the following conditions:
2395  \*\* Master thought this region opened, but no regionserver reported it.
2396  \*\* Master thought this region opened on Server1, but regionserver reported Server2
2397  \*\* More than one regionservers reported opened this region
2398  Both chores can be triggered from the shell to regenerate ‘new’ reports.
2399  \* Means of scheduling a ServerCrashProcedure (HBASE-21393).
2400  \* An ‘offline’ hbase:meta rebuild (HBASE-22680).
2401  \* Offline replace of hbase.version and hbase.id
2402  \* Documentation on how to use completebulkload tool to ‘adopt’ orphaned data found by new HBCK2 ‘filesystem’ check (see below) and ‘HBCK chore’ (HBASE-22859)
2403  \* A ‘holes’ and ‘overlaps’ fix that runs in the master that uses new bulk-merge facility to collapse many overlaps in the one go.
2404  \* hbase-operator-tools HBCK2 client tool got a bunch of additions:
2405  \*\* A specialized 'fix' for the case where operators ran old hbck 'offlinemeta' repair and destroyed their hbase:meta; it ties together holes in meta with orphaned data in the fs (HBASE-22567)
2406  \*\* A ‘filesystem’ command that reports on orphan data as well as bad references and hlinks with a ‘fix’ for the latter two options (based on hbck1 facility updated).
2407  \*\* Adds back the ‘replication’ fix facility from hbck1 (HBASE-22717)
2408
2409 The compound result is that hbck2 is now in excess of hbck1 abilities. The provided functionality is disaggregated as per the hbck2 philosophy of providing 'plumbing' rather than 'porcelain' so there is work to do still adding fix-it playbooks, scripting across outages, and automation.
2410
2411
2412 ---
2413
2414 * [HBASE-22802](https://issues.apache.org/jira/browse/HBASE-22802) | *Major* | **Avoid temp ByteBuffer allocation in FileIOEngine#read**
2415
2416 HBASE-21879 introduces a utility class (org.apache.hadoop.hbase.io.ByteBuffAllocator) used for allocating/freeing ByteBuffers from/to NIO ByteBuffer pool, when BucketCache enabled with file or mmap engine, we will use this ByteBuffer pool to avoid temp ByteBuffer allocation a lot.
2417
2418
2419 ---
2420
2421 * [HBASE-11062](https://issues.apache.org/jira/browse/HBASE-11062) | *Major* | **hbtop**
2422
2423 Introduces hbtop that's a real-time monitoring tool for HBase like Unix's top command. See the ref guide for the details: https://hbase.apache.org/book.html#hbtop
2424
2425
2426 ---
2427
2428 * [HBASE-21879](https://issues.apache.org/jira/browse/HBASE-21879) | *Major* | **Read HFile's block to ByteBuffer directly instead of to byte for reducing young gc purpose**
2429
2430 Before this issue, read path was 100% offheap when block is in the BucketCache. But if a cache miss, then the RS needs to read the block via an on-heap API which causes high young-GC pressure.
2431
2432 This issue adds reading the block via offheap even if reading the block from filesystem directly.  It requires hadoop version(\>=2.9.3) but can also work with older hadoop versions (all works but we continue to read block onheap). It also requires HBASE-21946 which is not yet in place as of this writing/hbase-2.3.0.
2433
2434 We have written a careful doc about the implementation, performance and practice here: https://docs.google.com/document/d/1xSy9axGxafoH-Qc17zbD2Bd--rWjjI00xTWQZ8ZwI\_E/edit#heading=h.nch5d72p27ex
2435
2436
2437 ---
2438
2439 * [HBASE-22618](https://issues.apache.org/jira/browse/HBASE-22618) | *Major* | **added the possibility to load custom cost functions**
2440
2441 <!-- markdown -->
2442 Extends `StochasticLoadBalancer` to support user-provided cost function. These are loaded in addition to the default set of cost functions. Custom function implementations must extend `StochasticLoadBalancer$CostFunction`. Enable any additional functions by placing them on the master class path and configuring `hbase.master.balancer.stochastic.additionalCostFunctions` with a comma-separated list of fully-qualified class names.
2443
2444
2445 ---
2446
2447 * [HBASE-22867](https://issues.apache.org/jira/browse/HBASE-22867) | *Critical* | **The ForkJoinPool in CleanerChore will spawn thousands of threads in our cluster with thousands table**
2448
2449 Replace the ForkJoinPool in CleanerChore by ThreadPoolExecutor which can limit the spawn thread size and avoid  the master GC frequently.  The replacement is an internal implementation in CleanerChore,  so no config key change, the upstream users can just upgrade the hbase master without any other change.
2450
2451
2452 ---
2453
2454 * [HBASE-22810](https://issues.apache.org/jira/browse/HBASE-22810) | *Major* | **Initialize an separate ThreadPoolExecutor for taking/restoring snapshot**
2455
2456 Introduced a new config key for the snapshot taking/restoring operations at master side:  hbase.master.executor.snapshot.threads, its default value is 3.  means we can have 3 snapshot operations running at the same time.
2457
2458
2459 ---
2460
2461 * [HBASE-22863](https://issues.apache.org/jira/browse/HBASE-22863) | *Major* | **Avoid Jackson versions and dependencies with known CVEs**
2462
2463 1. Stopped exposing vulnerable Jackson1 dependencies so that downstreamers would not pull it in from HBase.
2464 2. However, since Hadoop requires some Jackson1 dependencies, put vulnerable Jackson mapper at test scope in some HBase modules and hence, HBase tarball created by hbase-assembly contains Jackson1 mapper jar in lib. Still, downsteam applications can't pull in Jackson1 from HBase.
2465
2466
2467 ---
2468
2469 * [HBASE-22841](https://issues.apache.org/jira/browse/HBASE-22841) | *Major* | **TimeRange's factory functions do not support ranges, only \`allTime\` and \`at\`**
2470
2471 Add serveral API in TimeRange class for avoiding using the deprecated TimeRange constructor:
2472 \* TimeRange#from: Represents the time interval [minStamp, Long.MAX\_VALUE)
2473 \* TimeRange#until: Represents the time interval [0, maxStamp)
2474 \* TimeRange#between: Represents the time interval [minStamp, maxStamp)
2475
2476
2477 ---
2478
2479 * [HBASE-22833](https://issues.apache.org/jira/browse/HBASE-22833) | *Minor* | **MultiRowRangeFilter should provide a method for creating a filter which is functionally equivalent to multiple prefix filters**
2480
2481 Provide a public method in MultiRowRangeFilter class to speed the requirement of filtering with multiple row prefixes, it will expand the row prefixes as multiple rowkey ranges by MultiRowRangeFilter, it's more efficient.
2482 {code}
2483 public MultiRowRangeFilter(byte[][] rowKeyPrefixes);
2484 {code}
2485
2486
2487 ---
2488
2489 * [HBASE-22856](https://issues.apache.org/jira/browse/HBASE-22856) | *Major* | **HBASE-Find-Flaky-Tests fails with pip error**
2490
2491 Update the base docker image to ubuntu 18.04 for the find flaky tests jenkins job.
2492
2493
2494 ---
2495
2496 * [HBASE-22771](https://issues.apache.org/jira/browse/HBASE-22771) | *Major* | **[HBCK2] fixMeta method and server-side support**
2497
2498 Adds a fixMeta method to hbck Service. Fixes holes in hbase:meta. Follow-up to fix overlaps. See HBASE-22567 also.
2499
2500 Follow-on is adding a client-side to hbase-operator-tools that can exploit this new addition (HBASE-22825)
2501
2502
2503 ---
2504
2505 * [HBASE-22777](https://issues.apache.org/jira/browse/HBASE-22777) | *Major* | **Add a multi-region merge (for fixing overlaps, etc.)**
2506
2507 Changes merge so you can merge more than two regions at a time.  Currently only available inside HBase. HBASE-22827, a follow-on, is about exposing the facility in the Admin API (and then via the shell).
2508
2509
2510 ---
2511
2512 * [HBASE-15666](https://issues.apache.org/jira/browse/HBASE-15666) | *Critical* | **shaded dependencies for hbase-testing-util**
2513
2514 New shaded artifact for testing: hbase-shaded-testing-util.
2515
2516
2517 ---
2518
2519 * [HBASE-22776](https://issues.apache.org/jira/browse/HBASE-22776) | *Major* | **Rename config names in user scan snapshot feature**
2520
2521 After HBASE-22776, the steps to config user scan snapshot feature is as followings:
2522 1. Check HDFS configuration
2523 2. Add master coprocessor:
2524     hbase.coprocessor.master.classes=
2525     “org.apache.hadoop.hbase.security.access.AccessController,
2526 org.apache.hadoop.hbase.security.access.SnapshotScannerHDFSAclController”
2527 3. Enable this feature:
2528     hbase.acl.sync.to.hdfs.enable=true
2529 4. Modify table scheme to enable this feature for a table:
2530     alter 't1', CONFIGURATION =\> {'hbase.acl.sync.to.hdfs.enable' =\> 'true'}
2531
2532
2533 ---
2534
2535 * [HBASE-22539](https://issues.apache.org/jira/browse/HBASE-22539) | *Blocker* | **WAL corruption due to early DBBs re-use when Durability.ASYNC\_WAL is used**
2536
2537 We found a critical bug which can lead to WAL corruption when Durability.ASYNC\_WAL is used. The reason is that we release a ByteBuffer before actually persist the content into WAL file.
2538
2539 The problem maybe lead to several errors, for example, ArrayIndexOfOutBounds when replaying WAL. This is because that the ByteBuffer is reused by others.
2540
2541 ERROR org.apache.hadoop.hbase.executor.EventHandler: Caught throwable while processing event RS\_LOG\_REPLAY
2542 java.lang.ArrayIndexOutOfBoundsException: 18056
2543         at org.apache.hadoop.hbase.KeyValue.getFamilyLength(KeyValue.java:1365)
2544         at org.apache.hadoop.hbase.KeyValue.getFamilyLength(KeyValue.java:1358)
2545         at org.apache.hadoop.hbase.PrivateCellUtil.matchingFamily(PrivateCellUtil.java:735)
2546         at org.apache.hadoop.hbase.CellUtil.matchingFamily(CellUtil.java:816)
2547         at org.apache.hadoop.hbase.wal.WALEdit.isMetaEditFamily(WALEdit.java:143)
2548         at org.apache.hadoop.hbase.wal.WALEdit.isMetaEdit(WALEdit.java:148)
2549         at org.apache.hadoop.hbase.wal.WALSplitter.splitLogFile(WALSplitter.java:297)
2550         at org.apache.hadoop.hbase.wal.WALSplitter.splitLogFile(WALSplitter.java:195)
2551         at org.apache.hadoop.hbase.regionserver.SplitLogWorker$1.exec(SplitLogWorker.java:100)
2552
2553 And may even cause segmentation fault and crash the JVM directly. You will see a hs\_err\_pidXXX.log file and usually the problem is SIGSEGV. This is usually because that the ByteBuffer has already been returned to the OS and used for other purpose.
2554
2555 The problem has been reported several times in the past and this time Wellington Ramos Chevreuil provided the full logs and deeply analyzed the logs so we can find the root cause. And Lijin Bin figured out that the problem may only happen when Durability.ASYNC\_WAL is used. Thanks to them.
2556
2557 The problem only effects the 2.x releases, all users are highly recommand to upgrade to a release which has this fix in, especially that if you use Durability.ASYNC\_WAL.
2558
2559
2560 ---
2561
2562 * [HBASE-22737](https://issues.apache.org/jira/browse/HBASE-22737) | *Major* | **Add a new admin method and shell cmd to trigger the hbck chore to run**
2563
2564 Add a new method runHbckChore in Hbck interface and a new shell cmd hbck\_chore\_run to request HBCK chore to run at master side.
2565
2566
2567 ---
2568
2569 * [HBASE-22741](https://issues.apache.org/jira/browse/HBASE-22741) | *Major* | **Show catalogjanitor consistency complaints in new 'HBCK Report' page**
2570
2571 Adds a "CatalogJanitor hbase:meta Consistency Issues" section to the new 'HBCK Report' page added by HBASE-22709. This section is empty unless the most recent CatalogJanitor scan turned up problems. If so, will show table of issues found.
2572
2573
2574 ---
2575
2576 * [HBASE-22723](https://issues.apache.org/jira/browse/HBASE-22723) | *Major* | **Have CatalogJanitor report holes and overlaps; i.e. problems it sees when doing its regular scan of hbase:meta**
2577
2578 When CatalogJanitor runs, it now checks for holes, overlaps, empty info:regioninfo columns and bad servers. Dumps findings into log. Follow-up adds report to new 'HBCK Report' linked off the Master UI.
2579
2580 NOTE: All features but the badserver check made it into branch-2.1 and branch-2.0 backports.
2581
2582
2583 ---
2584
2585 * [HBASE-22714](https://issues.apache.org/jira/browse/HBASE-22714) | *Trivial* | **BuffferedMutatorParams opertationTimeOut() is misspelt**
2586
2587 The misspelled BufferedMutatorParams.opertationTimeout method has been marked as deprecated, and will be removed in 4.0.0. Please use the BufferedMutatorParams.operationTimeout method instead.
2588
2589
2590 ---
2591
2592 * [HBASE-22580](https://issues.apache.org/jira/browse/HBASE-22580) | *Major* | **Add a table attribute to make user scan snapshot feature configurable for table**
2593
2594 If a table user scan snapshots of the table, please config the following table scheme attribute to make granted users' ACLs are added to hfiles:
2595 alter 't1', CONFIGURATION =\> {'hbase.user.scan.snapshot.enable' =\> 'true'}
2596
2597
2598 ---
2599
2600 * [HBASE-22709](https://issues.apache.org/jira/browse/HBASE-22709) | *Major* | **Add a chore thread in master to do hbck checking and display results in 'HBCK Report' page**
2601
2602 1. Add a new chore thread in master to do hbck checking
2603 2. Add a new web ui "HBCK Report" page to display checking results.
2604
2605 This feature is enabled by default. And the hbck chore run per 60 minutes by default. You can config "hbase.master.hbck.checker.interval" to a value lesser than or equal to 0 for disabling the chore.
2606
2607 Notice: the config "hbase.master.hbck.checker.interval" was renamed to "hbase.master.hbck.chore.interval" in HBASE-22737.
2608
2609
2610 ---
2611
2612 * [HBASE-22578](https://issues.apache.org/jira/browse/HBASE-22578) | *Major* | **HFileCleaner should not delete empty ns/table directories used for user san snapshot feature**
2613
2614 The HFileCleaner will clean the empty directories under archive, but if enable user scan snaphot feature, the user ACLs are set at there directories, so please config the following cleaner to make the directories with user ACLs not be cleaned:
2615 hbase.master.hfilecleaner.plugins=org.apache.hadoop.hbase.security.access.SnapshotScannerHDFSAclCleaner
2616
2617
2618 ---
2619
2620 * [HBASE-22722](https://issues.apache.org/jira/browse/HBASE-22722) | *Blocker* | **Upgrade jackson databind dependencies to 2.9.9.1**
2621
2622 Upgrade jackson databind dependency to 2.9.9.1 due to CVEs
2623
2624 https://nvd.nist.gov/vuln/detail/CVE-2019-12814
2625
2626 https://nvd.nist.gov/vuln/detail/CVE-2019-12384
2627
2628
2629 ---
2630
2631 * [HBASE-22527](https://issues.apache.org/jira/browse/HBASE-22527) | *Major* | **[hbck2] Add a master web ui to show the problematic regions**
2632
2633 Add a new master web UI to show the potentially problematic opened regions. There are three case:
2634 1. Master thought this region opened, but no regionserver reported it.
2635 2. Master thought this region opened on Server1, but regionserver reported Server2
2636 3. More than one regionservers reported opened this region
2637
2638
2639 ---
2640
2641 * [HBASE-22648](https://issues.apache.org/jira/browse/HBASE-22648) | *Minor* | **Snapshot TTL**
2642
2643 Feature: Take a Snapshot With TTL for auto-cleanup
2644
2645 Attribute:
2646 1. TTL
2647      - Specify TTL in sec while creating snapshot. e.g. snapshot 'mytable', 'snapshot1234', {TTL =\> 86400}  (snapshot to be auto-cleaned after 24 hr)
2648
2649 Configs:
2650 1. Default Snapshot TTL:
2651      - FOREVER by default
2652      - User specified Default TTL(sec) with config: hbase.master.snapshot.ttl
2653
2654 2. If Snapshot cleanup is supposed to be stopped due to some snapshot restore activity, disable it with config:
2655      - hbase.master.cleaner.snapshot.disable: "true"
2656     With this config, HMaster needs restart just like any other hbase-site config.
2657
2658
2659 For more details, see the section "Take a Snapshot With TTL" in the HBase Reference Guide.
2660
2661
2662 ---
2663
2664 * [HBASE-22610](https://issues.apache.org/jira/browse/HBASE-22610) | *Trivial* | **[BucketCache] Rename "hbase.offheapcache.minblocksize"**
2665
2666 The config point "hbase.offheapcache.minblocksize" was wrong and is now deprecated. The new config point is "hbase.blockcache.minblocksize".
2667
2668
2669 ---
2670
2671 * [HBASE-22690](https://issues.apache.org/jira/browse/HBASE-22690) | *Major* | **Deprecate / Remove OfflineMetaRepair in hbase-2+**
2672
2673 OfflineMetaRepair is no longer supported in HBase-2+. Please refer to https://hbase.apache.org/book.html#HBCK2
2674
2675 This tool is deprecated in 2.x and will be removed in 3.0.
2676
2677
2678 ---
2679
2680 * [HBASE-22673](https://issues.apache.org/jira/browse/HBASE-22673) | *Major* | **Avoid to expose protobuf stuff in Hbck interface**
2681
2682 Mark the Hbck#scheduleServerCrashProcedure(List\<HBaseProtos.ServerName\> serverNames) as deprecated. Use Hbck#scheduleServerCrashProcedures(List\<ServerName\> serverNames) instead.
2683
2684
2685 ---
2686
2687 * [HBASE-22617](https://issues.apache.org/jira/browse/HBASE-22617) | *Blocker* | **Recovered WAL directories not getting cleaned up**
2688
2689 In HBASE-20734 we moved the recovered.edits onto the wal file system but when constructing the directory we missed the BASE\_NAMESPACE\_DIR('data'). So when using the default config, you will find that there are lots of new directories at the same level with the 'data' directory.
2690
2691 In this issue, we add the BASE\_NAMESPACE\_DIR back, and also try our best to clean up the wrong directories. But we can only clean up the region level directories, so if you want a clean fs layout on HDFS you still need to manually delete the empty directories at the same level with 'data'.
2692
2693 The effect versions are 2.2.0, 2.1.[1-5], 1.4.[8-10], 1.3.[3-5].
2694
2695
2696 ---
2697
2698 * [HBASE-21995](https://issues.apache.org/jira/browse/HBASE-21995) | *Major* | **Add a coprocessor to set HDFS ACL for hbase granted user**
2699
2700 Add a coprocessor to set HDFS acls to make hbase granted users with READ permission have the access to scan snapshots.
2701 To use this feature, please make sure the HDFS config is set:
2702 dfs.namenode.acls.enabled=true
2703 fs.permissions.umask-mode=027
2704
2705 and set the HBase config:
2706 hbase.coprocessor.master.classes="org.apache.hadoop.hbase.security.access.AccessController,org.apache.hadoop.hbase.security.access.SnapshotScannerHDFSAclController"
2707 hbase.user.scan.snapshot.enable=true
2708
2709
2710 ---
2711
2712 * [HBASE-22596](https://issues.apache.org/jira/browse/HBASE-22596) | *Minor* | **[Chore] Separate the execution period between CompactionChecker and PeriodicMemStoreFlusher**
2713
2714 hbase.regionserver.compaction.check.period is used for controlling how often the compaction checker runs. If unset, will use hbase.server.thread.wakefrequency as default value.
2715
2716 hbase.regionserver.flush.check.period is used for controlling how ofter the flush checker runs. If unset, will use hbase.server.thread.wakefrequency as default value.
2717
2718
2719 ---
2720
2721 * [HBASE-22588](https://issues.apache.org/jira/browse/HBASE-22588) | *Major* | **Upgrade jaxws-ri dependency to 2.3.2**
2722
2723 <!-- markdown -->
2724
2725 When run with JDK11 HBase now uses more recent version of the jaxws reference implementation (v2.3.2).
2726
2727
2728 ---
2729
2730 * [HBASE-21536](https://issues.apache.org/jira/browse/HBASE-21536) | *Trivial* | **Fix completebulkload usage instructions**
2731
2732 Added completebulkload short name for BulkLoadHFilesTool to bin/hbase.
2733
2734
2735 ---
2736
2737 * [HBASE-22500](https://issues.apache.org/jira/browse/HBASE-22500) | *Blocker* | **Modify pom and jenkins jobs for hadoop versions**
2738
2739 Change the default hadoop-3 version to 3.1.2. Drop the support for the releases which are effected by CVE-2018-8029, see this email https://lists.apache.org/thread.html/3d6831c3893cd27b6850aea2feff7d536888286d588e703c6ffd2e82@%3Cuser.hadoop.apache.org%3E
2740
2741
2742 ---
2743
2744 * [HBASE-22459](https://issues.apache.org/jira/browse/HBASE-22459) | *Minor* | **Expose store reader reference count**
2745
2746 This change exposes the aggregate count of store reader references for a given store as 'storeRefCount' in region metrics and ClusterStatus.
2747
2748
2749 ---
2750
2751 * [HBASE-22469](https://issues.apache.org/jira/browse/HBASE-22469) | *Minor* | **replace md5 checksum in saveVersion script with sha512 for hbase version information**
2752
2753 The HBase "source checksum" now uses SHA512 instead of MD5.
2754
2755
2756 ---
2757
2758 * [HBASE-22148](https://issues.apache.org/jira/browse/HBASE-22148) | *Blocker* | **Provide an alternative to CellUtil.setTimestamp**
2759
2760 <!-- markdown -->
2761
2762 The `CellUtil.setTimestamp` method changes to be an API with audience `LimitedPrivate(COPROC)` in HBase 3.0. With that designation the API should remain stable within a given minor release line, but may change between minor releases.
2763
2764 Previously, this method was deprecated in HBase 2.0 for removal in HBase 3.0. Deprecation messages in HBase 2.y releases have been updated to indicate the expected API audience change.
2765
2766
2767 ---
2768
2769 * [HBASE-20782](https://issues.apache.org/jira/browse/HBASE-20782) | *Minor* | **Fix duplication of TestServletFilter.access**
2770
2771 The access method was used to the HttpServerFunctionalTest class as a common place.
2772
2773
2774 ---
2775
2776 * [HBASE-21991](https://issues.apache.org/jira/browse/HBASE-21991) | *Major* | **Fix MetaMetrics issues - [Race condition, Faulty remove logic], few improvements**
2777
2778 The class LossyCounting was unintentionally marked Public but was never intended to be part of our public API. This oversight has been corrected and LossyCounting is now marked as Private and going forward may be subject to additional breaking changes or removal without notice. If you have taken a dependency on this class we recommend cloning it locally into your project before upgrading to this release.
2779
2780
2781 ---
2782
2783 * [HBASE-22226](https://issues.apache.org/jira/browse/HBASE-22226) | *Trivial* | **Incorrect level for headings in asciidoc**
2784
2785 Warnings for level headings are corrected in the book for the HBase Incompatibilities section.
2786
2787
2788 ---
2789
2790 * [HBASE-20970](https://issues.apache.org/jira/browse/HBASE-20970) | *Major* | **Update hadoop check versions for hadoop3 in hbase-personality**
2791
2792 Add hadoop 3.0.3, 3.1.1 3.1.2 in our hadoop check jobs.
2793
2794
2795 ---
2796
2797 * [HBASE-21784](https://issues.apache.org/jira/browse/HBASE-21784) | *Major* | **Dump replication queue should show list of wal files ordered chronologically**
2798
2799 The DumpReplicationQueues tool will now list replication queues sorted in chronological order.
2800
2801
2802 ---
2803
2804 * [HBASE-21048](https://issues.apache.org/jira/browse/HBASE-21048) | *Major* | **Get LogLevel is not working from console in secure environment**
2805
2806 Support get\|set LogLevel in secure(kerberized) environment.
2807
2808
2809 ---
2810
2811 * [HBASE-22384](https://issues.apache.org/jira/browse/HBASE-22384) | *Minor* | **Formatting issues in administration section of book**
2812
2813 Fixes a formatting issue in the administration section of the book, where listing indentation were a little bit off.
2814
2815
2816 ---
2817
2818 * [HBASE-22377](https://issues.apache.org/jira/browse/HBASE-22377) | *Major* | **Provide API to check the existence of a namespace which does not require ADMIN permissions**
2819
2820 This change adds the new method listNamespaces to the Admin interface, which can be used to retrieve a list of the namespaces present in the schema as an unprivileged operation. Formerly the only available method for accomplishing this was listNamespaceDescriptors, which requires GLOBAL CREATE or ADMIN permissions.
2821
2822
2823 ---
2824
2825 * [HBASE-22399](https://issues.apache.org/jira/browse/HBASE-22399) | *Major* | **Change default hadoop-two.version to 2.8.x and remove the 2.7.x hadoop checks**
2826
2827 Now the default hadoop-two.version has been changed to 2.8.5, and all hadoop versions before 2.8.2(exclude) will not be supported any more.
2828
2829
2830 ---
2831
2832 * [HBASE-22392](https://issues.apache.org/jira/browse/HBASE-22392) | *Trivial* | **Remove extra/useless +**
2833
2834 Removed extra + in HRegion, HStore and LoadIncrementalHFiles for branch-2 and HRegion and HStore for branch-1.
2835
2836
2837 ---
2838
2839 * [HBASE-20494](https://issues.apache.org/jira/browse/HBASE-20494) | *Major* | **Upgrade com.yammer.metrics dependency**
2840
2841 Updated metrics core from 3.2.1 to 3.2.6.
2842
2843
2844 ---
2845
2846 * [HBASE-22358](https://issues.apache.org/jira/browse/HBASE-22358) | *Minor* | **Change rubocop configuration for method length**
2847
2848 The rubocop definition for the maximum method length was set to 75.
2849
2850
2851 ---
2852
2853 * [HBASE-22379](https://issues.apache.org/jira/browse/HBASE-22379) | *Minor* | **Fix Markdown for "Voting on Release Candidates" in book**
2854
2855 Fixes the formatting of the "Voting on Release Candidates" to actually show the quote and code formatting of the RAT check.
2856
2857
2858 ---
2859
2860 * [HBASE-20851](https://issues.apache.org/jira/browse/HBASE-20851) | *Minor* | **Change rubocop config for max line length of 100**
2861
2862 The rubocop configuration in the hbase-shell module now allows a line length with 100 characters, instead of 80 as before. For everything before 2.1.5 this change introduces rubocop itself.
2863
2864
2865 ---
2866
2867 * [HBASE-22301](https://issues.apache.org/jira/browse/HBASE-22301) | *Minor* | **Consider rolling the WAL if the HDFS write pipeline is slow**
2868
2869 This change adds new conditions for rolling the WAL for when syncs on the HDFS writer pipeline are perceived to be slow.
2870
2871 As before the configuration parameter hbase.regionserver.wal.slowsync.ms sets the slow sync warning threshold.
2872
2873 If we encounter hbase.regionserver.wal.slowsync.roll.threshold number of slow syncs (default 100) within the interval defined by hbase.regionserver.wal.slowsync.roll.interval.ms (default 1 minute), we will request a WAL roll.
2874
2875 Or, if the time for any sync exceeds the threshold set by hbase.regionserver.wal.roll.on.sync.ms (default 10 seconds) we will request a WAL roll immediately.
2876
2877 Operators can monitor how often these new thresholds result in a WAL roll by looking at newly added metrics to the WAL related metric group:
2878 \* slowSyncRollRequest - How many times a roll was requested due to sync too slow on the write pipeline.
2879
2880 Additionally, as a part of this change there are also additional metrics for existing reasons for a WAL roll:
2881 \* errorRollRequest - How many times a roll was requested due to I/O or other errors.
2882 \* sizeRollRequest - How many times a roll was requested due to file size roll threshold.
2883
2884
2885 ---
2886
2887 * [HBASE-21883](https://issues.apache.org/jira/browse/HBASE-21883) | *Minor* | **Enhancements to Major Compaction tool**
2888
2889 MajorCompactorTTL Tool allows to compact all regions in a table that have been TTLed out. This saves space on DFS and is useful for tables which are similar to time series data. This is typically scheduled to run frequently (say via cron) to cleanup old data on an ongoing basis.
2890
2891 RSGroupMajorCompactionTTL tool is similar to MajorCompactorTTL but runs at a region server group level. If multiple tables in an rsgroup are similar to time-series data, then it runs a single command to clean them up. As more tables are added/removed from rsgroup, it's easy to have a single command to take care of all of them.
2892
2893
2894 ---
2895
2896 * [HBASE-22054](https://issues.apache.org/jira/browse/HBASE-22054) | *Minor* | **Space Quota: Compaction is not working for super user in case of NO\_WRITES\_COMPACTIONS**
2897
2898 This change allows the system and superusers to initiate compactions, even when a space quota violation policy disallows compactions from happening. The original intent behind disallowing of compactions was to prevent end-user compactions from creating undue I/O load, not disallowing \*any\* compaction in the system.
2899
2900
2901 ---
2902
2903 * [HBASE-22083](https://issues.apache.org/jira/browse/HBASE-22083) | *Minor* | **move eclipse specific configs into a profile**
2904
2905 <!-- markdown -->
2906 Maven project integration for Eclipse has been isolated into a maven profile to ensure it only is active when in an Eclipse project.
2907
2908 Things should continue to behave the same for Eclipse users. If something should go wrong folks should manually activate the `eclipse-specific` profile.
2909
2910
2911 ---
2912
2913 * [HBASE-22307](https://issues.apache.org/jira/browse/HBASE-22307) | *Major* | **Deprecated Preemptive Fail Fast**
2914
2915 Deprecated Preemptive Fail Fast related constants in HConstants, the support of this feature will be removed in 3.0.0 so use these constants will have no effect for 3.0.0+ releases. And the constants will be kept till 4.0.0.
2916
2917 Users can use 'hbase.client.perserver.requests.threshold' to control the number of concurrent requests to the same region server. Please see the release note of HBASE-16388 for more details.
2918
2919
2920 ---
2921
2922 * [HBASE-22292](https://issues.apache.org/jira/browse/HBASE-22292) | *Blocker* | **PreemptiveFastFailInterceptor clean repeatedFailuresMap issue**
2923
2924 Adds new configuration hbase.client.failure.map.cleanup.interval which defaults to ten minutes.
2925
2926
2927 ---
2928
2929 * [HBASE-19222](https://issues.apache.org/jira/browse/HBASE-19222) | *Major* | **update jruby to 9.1.17.0**
2930
2931 <!-- markdown -->
2932
2933 The default version of JRuby shipped with HBase has been updated to the JRuby 9.1.17.0 release.
2934
2935 For details on changes see [the release notes for JRuby 9.1.17.0](https://www.jruby.org/2018/04/23/jruby-9-1-17-0)
2936
2937
2938 ---
2939
2940 * [HBASE-22279](https://issues.apache.org/jira/browse/HBASE-22279) | *Major* | **Add a getRegionLocator method in Table/AsyncTable interface**
2941
2942 Add below method in Table interface:
2943
2944 RegionLocator getRegionLocator() throws IOException;
2945
2946 Add below methods in AsyncTable interface:
2947
2948 AsyncTableRegionLocator getRegionLocator();
2949 CompletableFuture\<TableDescriptor\> getDescriptor();
2950
2951
2952 ---
2953
2954 * [HBASE-15560](https://issues.apache.org/jira/browse/HBASE-15560) | *Major* | **TinyLFU-based BlockCache**
2955
2956 LruBlockCache uses the Segmented LRU (SLRU) policy to capture frequency and recency of the working set. It achieves concurrency by using an O(n) background thread to prioritize the entries and evict. Accessing an entry is O(1) by a hash table lookup, recording its logical access time, and setting a frequency flag. A write is performed in O(1) time by updating the hash table and triggering an async eviction thread. This provides ideal concurrency and minimizes the latencies by penalizing the thread instead of the caller. However the policy does not age the frequencies and may not be resilient to various workload patterns.
2957
2958 This change introduces a new L1 policy, TinyLfuBlockCache, which records the frequency in a counting sketch, ages periodically by halving the counters, and orders entries by SLRU. An entry is discarded by comparing the frequency of the new arrival to the SLRU's victim, and keeping the one with the highest frequency. This allows the operations to be performed in O(1) time and, though the use of a compact sketch, a much larger history is retained beyond the current working set. In a variety of real world traces the policy had near optimal hit rates.
2959
2960 New configuration variable hfile.block.cache.policy sets the eviction policy for the L1 block cache. The default is "LRU" (LruBlockCache). Set to "TinyLFU" to use TinyLfuBlockCache instead.
2961
2962
2963 ---
2964
2965 * [HBASE-22178](https://issues.apache.org/jira/browse/HBASE-22178) | *Major* | **Introduce a createTableAsync with TableDescriptor method in Admin**
2966
2967 Introduced
2968
2969 Future\<Void\> createTableAsync(TableDescriptor);
2970
2971
2972 ---
2973
2974 * [HBASE-22108](https://issues.apache.org/jira/browse/HBASE-22108) | *Major* | **Avoid passing null in Admin methods**
2975
2976 Introduced these methods:
2977 void move(byte[]);
2978 void move(byte[], ServerName);
2979 Future\<Void\> splitRegionAsync(byte[]);
2980
2981 These methods are deprecated:
2982 void move(byte[], byte[])
2983
2984
2985 ---
2986
2987 * [HBASE-22152](https://issues.apache.org/jira/browse/HBASE-22152) | *Major* | **Create a jenkins file for yetus to processing GitHub PR**
2988
2989 Add a new jenkins file for running pre commit check for GitHub PR.
2990
2991
2992 ---
2993
2994 * [HBASE-22007](https://issues.apache.org/jira/browse/HBASE-22007) | *Major* | **Add restoreSnapshot and cloneSnapshot with acl methods in AsyncAdmin**
2995
2996 Add cloneSnapshot/restoreSnapshot with acl methods in AsyncAdmin.
2997
2998
2999 ---
3000
3001 * [HBASE-22123](https://issues.apache.org/jira/browse/HBASE-22123) | *Minor* | **REST gateway reports Insufficient permissions exceptions as 404 Not Found**
3002
3003 When insufficient permissions, you now get:
3004
3005 HTTP/1.1 403 Forbidden
3006
3007 on the HTTP side, and in the message
3008
3009 Forbidden
3010 org.apache.hadoop.hbase.security.AccessDeniedException: org.apache.hadoop.hbase.security.AccessDeniedException: Insufficient permissions for user ‘myuser',action: get, tableName:mytable, family:cf.
3011 at org.apache.ranger.authorization.hbase.RangerAuthorizationCoprocessor.authorizeAccess(RangerAuthorizationCoprocessor.java:547)
3012 and the rest of the ADE stack
3013
3014
3015 ---
3016
3017 * [HBASE-22100](https://issues.apache.org/jira/browse/HBASE-22100) | *Minor* | **False positive for error prone warnings in pre commit job**
3018
3019 Now we will sort the javac WARNING/ERROR before generating diff in pre-commit so we can get a stable output for the error prone. The downside is that we just sort the output lexicographically so the line number will also be sorted lexicographically, which is a bit strange to human.
3020
3021
3022 ---
3023
3024 * [HBASE-22057](https://issues.apache.org/jira/browse/HBASE-22057) | *Major* | **Impose upper-bound on size of ZK ops sent in a single multi()**
3025
3026 Exposes a new configuration property "zookeeper.multi.max.size" which dictates the maximum size of deletes that HBase will make to ZooKeeper in a single RPC. This property defaults to 1MB, which should fall beneath the default ZooKeeper limit of 2MB, controlled by "jute.maxbuffer".
3027
3028
3029 ---
3030
3031 * [HBASE-22052](https://issues.apache.org/jira/browse/HBASE-22052) | *Major* | **pom cleaning; filter out jersey-core in hadoop2 to match hadoop3 and remove redunant version specifications**
3032
3033 <!-- markdown -->
3034 Fixed awkward dependency issue that prevented site building.
3035
3036 #### note specific to HBase 2.1.4
3037 HBase 2.1.4 shipped with an early version of this fix that incorrectly altered the libraries included in our binary assembly for using Apache Hadoop 2.7 (the current build default Hadoop version for 2.1.z). For folks running out of the box against a Hadoop 2.7 cluster (or folks who skip the installation step of [replacing the bundled Hadoop libraries](http://hbase.apache.org/book.html#hadoop)) this will result in a failure at Region Server startup due to a missing class definition. e.g.:
3038 ```
3039 2019-03-27 09:02:05,779 ERROR [main] regionserver.HRegionServer: Failed construction RegionServer
3040 java.lang.NoClassDefFoundError: org/apache/htrace/SamplerBuilder
3041         at org.apache.hadoop.hdfs.DFSClient.<init>(DFSClient.java:644)
3042         at org.apache.hadoop.hdfs.DFSClient.<init>(DFSClient.java:628)
3043         at org.apache.hadoop.hdfs.DistributedFileSystem.initialize(DistributedFileSystem.java:149)
3044         at org.apache.hadoop.fs.FileSystem.createFileSystem(FileSystem.java:2667)
3045         at org.apache.hadoop.fs.FileSystem.access$200(FileSystem.java:93)
3046         at org.apache.hadoop.fs.FileSystem$Cache.getInternal(FileSystem.java:2701)
3047         at org.apache.hadoop.fs.FileSystem$Cache.get(FileSystem.java:2683)
3048         at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:372)
3049         at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:171)
3050         at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:356)
3051         at org.apache.hadoop.fs.Path.getFileSystem(Path.java:295)
3052         at org.apache.hadoop.hbase.util.CommonFSUtils.getRootDir(CommonFSUtils.java:362)
3053         at org.apache.hadoop.hbase.util.CommonFSUtils.isValidWALRootDir(CommonFSUtils.java:411)
3054         at org.apache.hadoop.hbase.util.CommonFSUtils.getWALRootDir(CommonFSUtils.java:387)
3055         at org.apache.hadoop.hbase.regionserver.HRegionServer.initializeFileSystem(HRegionServer.java:704)
3056         at org.apache.hadoop.hbase.regionserver.HRegionServer.<init>(HRegionServer.java:613)
3057         at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
3058         at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:62)
3059         at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)
3060         at java.lang.reflect.Constructor.newInstance(Constructor.java:423)
3061         at org.apache.hadoop.hbase.regionserver.HRegionServer.constructRegionServer(HRegionServer.java:3029)
3062         at org.apache.hadoop.hbase.regionserver.HRegionServerCommandLine.start(HRegionServerCommandLine.java:63)
3063         at org.apache.hadoop.hbase.regionserver.HRegionServerCommandLine.run(HRegionServerCommandLine.java:87)
3064         at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
3065         at org.apache.hadoop.hbase.util.ServerCommandLine.doMain(ServerCommandLine.java:149)
3066         at org.apache.hadoop.hbase.regionserver.HRegionServer.main(HRegionServer.java:3047)
3067 Caused by: java.lang.ClassNotFoundException: org.apache.htrace.SamplerBuilder
3068         at java.net.URLClassLoader.findClass(URLClassLoader.java:381)
3069         at java.lang.ClassLoader.loadClass(ClassLoader.java:424)
3070         at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:349)
3071         at java.lang.ClassLoader.loadClass(ClassLoader.java:357)
3072         ... 26 more
3073
3074 ```
3075
3076 Workaround via any _one_ of the following:
3077 * If you are running against a Hadoop cluster that is 2.8+, ensure you replace the Hadoop libaries in the default binary assembly with those for your version.
3078 * If you are running against a Hadoop cluster that is 2.8+, build the binary assembly from the source release while specifying your Hadoop version.
3079 * If you are running against a Hadoop cluster that is a supported 2.7 release, ensure the `hadoop` executable is in the `PATH` seen at Region Server startup and that you are not using the `HBASE_DISABLE_HADOOP_CLASSPATH_LOOKUP` bypass.
3080 * For any supported Hadoop version, manually make the Apache HTrace artifact `htrace-core-3.1.0-incubating.jar` available to all Region Servers via the HBASE_CLASSPATH environment variable.
3081 * For any supported Hadoop version, manually make the Apache HTrace artifact `htrace-core-3.1.0-incubating.jar` available to all Region Servers by copying it into the directory `${HBASE_HOME}/lib/client-facing-thirdparty/`.
3082
3083
3084 ---
3085
3086 * [HBASE-22065](https://issues.apache.org/jira/browse/HBASE-22065) | *Major* | **Add listTableDescriptors(List\<TableName\>) method in AsyncAdmin**
3087
3088 Add a listTableDescriptors(List\<TableName\>) method in the AsyncAdmin interface, to align with the Admin interface.
3089
3090
3091 ---
3092
3093 * [HBASE-22063](https://issues.apache.org/jira/browse/HBASE-22063) | *Major* | **Deprecated Admin.deleteSnapshot(byte[])**
3094
3095 Deprecate Admin.deleteSnapshot(byte[]), please use the String version instead.
3096
3097
3098 ---
3099
3100 * [HBASE-22040](https://issues.apache.org/jira/browse/HBASE-22040) | *Major* | **Add mergeRegionsAsync with a List of region names method in AsyncAdmin**
3101
3102 Add a mergeRegionsAsync(byte[][], boolean) method in the AsyncAdmin interface.
3103
3104 Instead of using assert, now we will throw IllegalArgumentException when you want to merge less than 2 regions at client side. And also, at master side, instead of using assert, now we will throw DoNotRetryIOException if you want merge more than 2 regions, since we only support merging two regions at once for now.
3105
3106
3107 ---
3108
3109 * [HBASE-22039](https://issues.apache.org/jira/browse/HBASE-22039) | *Major* | **Should add the synchronous parameter for the XXXSwitch method in AsyncAdmin**
3110
3111 Add drainXXX parameter for balancerSwitch/splitSwitch/mergeSwitch methods in the AsyncAdmin interface, which has the same meaning with the synchronous parameter for these methods in the Admin interface.
3112
3113
3114 ---
3115
3116 * [HBASE-22044](https://issues.apache.org/jira/browse/HBASE-22044) | *Major* | **ByteBufferUtils should not be IA.Public API**
3117
3118 <!-- markdown -->
3119
3120 As of HBase 3.0, the ByteBufferUtils class is now marked as a Private API for internal project use only. Downstream users are advised that it no longer has any compatibility promises across releases.
3121
3122 As of earlier HBase release lines the class is now marked as deprecated to call attention to this planned transition.
3123
3124
3125 ---
3126
3127 * [HBASE-21810](https://issues.apache.org/jira/browse/HBASE-21810) | *Major* | **bulkload  support set hfile compression on client**
3128
3129 bulkload (HFileOutputFormat2)  support config the compression on client ,you can set the job configuration "hbase.mapreduce.hfileoutputformat.compression"  override the auto-detection of the target table's compression
3130
3131
3132 ---
3133
3134 * [HBASE-22001](https://issues.apache.org/jira/browse/HBASE-22001) | *Major* | **Polish the Admin interface**
3135
3136 Add a cloneSnapshotAsync method with restoreAcl parameter.
3137 Deprecated restoreSnapshotAsync method as it just ignores the failsafe configuration.
3138 Make snapshotAsync method returns a Future\<Void\>.
3139 Deprecated the snapshot related methods which take a 'byte[]' as the snapshot name.
3140 Use default methods to reduce the code base for implementation classes.
3141
3142
3143 ---
3144
3145 * [HBASE-22000](https://issues.apache.org/jira/browse/HBASE-22000) | *Major* | **Deprecated isTableAvailable with splitKeys**
3146
3147 Deprecated AsyncTable.isTableAvailable(TableName, byte[][]).
3148
3149
3150 ---
3151
3152 * [HBASE-21871](https://issues.apache.org/jira/browse/HBASE-21871) | *Major* | **Support to specify a peer table name in VerifyReplication tool**
3153
3154 After HBASE-21871, we can specify a peer table name with --peerTableName in VerifyReplication tool like the following:
3155 hbase org.apache.hadoop.hbase.mapreduce.replication.VerifyReplication --peerTableName=peerTable 5 TestTable
3156
3157 In addition, we can compare any 2 tables in any remote clusters with specifying both peerId and --peerTableName.
3158
3159 For example:
3160 hbase org.apache.hadoop.hbase.mapreduce.replication.VerifyReplication --peerTableName=peerTable zk1,zk2,zk3:2181/hbase TestTable
3161
3162
3163 ---
3164
3165 * [HBASE-15728](https://issues.apache.org/jira/browse/HBASE-15728) | *Major* | **Add remaining per-table region / store / flush / compaction related metrics**
3166
3167 Adds below flush, split, and compaction metrics
3168
3169  +  // split related metrics
3170  +  private MutableFastCounter splitRequest;
3171  +  private MutableFastCounter splitSuccess;
3172  +  private MetricHistogram splitTimeHisto;
3173  +
3174  +  // flush related metrics
3175  +  private MetricHistogram flushTimeHisto;
3176  +  private MetricHistogram flushMemstoreSizeHisto;
3177  +  private MetricHistogram flushOutputSizeHisto;
3178  +  private MutableFastCounter flushedMemstoreBytes;
3179  +  private MutableFastCounter flushedOutputBytes;
3180  +
3181  +  // compaction related metrics
3182  +  private MetricHistogram compactionTimeHisto;
3183  +  private MetricHistogram compactionInputFileCountHisto;
3184  +  private MetricHistogram compactionInputSizeHisto;
3185  +  private MetricHistogram compactionOutputFileCountHisto;
3186  +  private MetricHistogram compactionOutputSizeHisto;
3187  +  private MutableFastCounter compactedInputBytes;
3188  +  private MutableFastCounter compactedOutputBytes;
3189  +
3190  +  private MetricHistogram majorCompactionTimeHisto;
3191  +  private MetricHistogram majorCompactionInputFileCountHisto;
3192  +  private MetricHistogram majorCompactionInputSizeHisto;
3193  +  private MetricHistogram majorCompactionOutputFileCountHisto;
3194  +  private MetricHistogram majorCompactionOutputSizeHisto;
3195  +  private MutableFastCounter majorCompactedInputBytes;
3196  +  private MutableFastCounter majorCompactedOutputBytes;
3197
3198
3199 ---
3200
3201 * [HBASE-21481](https://issues.apache.org/jira/browse/HBASE-21481) | *Major* | **[acl] Superuser's permissions should not be granted or revoked by any non-su global admin**
3202
3203 HBASE-21481 improves the quality of access control, by strengthening the protection of super users's privileges.
3204
3205
3206 ---
3207
3208 * [HBASE-21082](https://issues.apache.org/jira/browse/HBASE-21082) | *Critical* | **Reimplement assign/unassign related procedure metrics**
3209
3210 Now we have four types of RIT procedure metrics, assign, unassign, move, reopen. The meaning of assign/unassign is changed, as we will not increase the unassign metric and then the assign metric when moving a region.
3211 Also introduced two new procedure metrics, open and close, which are used to track the open/close region calls to region server. We may send open/close multiple times to finish a RIT since we may retry multiple times.
3212
3213
3214 ---
3215
3216 * [HBASE-20724](https://issues.apache.org/jira/browse/HBASE-20724) | *Critical* | **Sometimes some compacted storefiles are still opened after region failover**
3217
3218 Problem: This is an old problem since HBASE-2231. The compaction event marker was only writed to WAL. But after flush, the WAL may be archived, which means an useful compaction event marker be deleted, too. So the compacted store files cannot be archived when region open and replay WAL.
3219
3220 Solution: After this jira, the compaction event tracker will be writed to HFile. When region open and load store files, read the compaction evnet tracker from HFile and archive the compacted store files which still exist.
3221
3222
3223 ---
3224
3225 * [HBASE-21820](https://issues.apache.org/jira/browse/HBASE-21820) | *Major* | **Implement CLUSTER quota scope**
3226
3227 HBase contains two quota scopes: MACHINE and CLUSTER. Before this patch, set quota operations did not expose scope option to client api and use MACHINE as default, CLUSTER scope can not be set and used.
3228 Shell commands are as follows:
3229 set\_quota, TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec'
3230
3231 This issue implements CLUSTER scope in a simple way: For user, namespace, user over namespace quota, use [ClusterLimit / RSNum] as machine limit. For table and user over table quota, use [ClusterLimit / TotalTableRegionNum \* MachineTableRegionNum] as machine limit.
3232 After this patch, user can set CLUSTER scope quota, but MACHINE is still default if user ignore scope.
3233 Shell commands are as follows:
3234 set\_quota, TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec'
3235 set\_quota, TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec', SCOPE =\> MACHINE
3236 set\_quota, TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec', SCOPE =\> CLUSTER
3237
3238
3239 ---
3240
3241 * [HBASE-21057](https://issues.apache.org/jira/browse/HBASE-21057) | *Minor* | **upgrade to latest spotbugs**
3242
3243 Change spotbugs version to 3.1.11.
3244
3245
3246 ---
3247
3248 * [HBASE-21505](https://issues.apache.org/jira/browse/HBASE-21505) | *Major* | **Several inconsistencies on information reported for Replication Sources by hbase shell status 'replication' command.**
3249
3250 This modifies "status 'replication'" output, fixing inconsistencies on the reporting times and ages of last shipped edits, as well as wrong calculation of replication lags.
3251
3252 It also introduces additional info for each recovery queue, which was not accounted by this command before.
3253
3254 The new output for "status 'replication'" command is explained in details below:
3255 a) Source started, target stopped, no edits arrived on source yet:
3256 ...
3257  SOURCE: PeerID=1
3258          Normal Queue: 1
3259            No Ops shipped since last restart, SizeOfLogQueue=1, No edits for this source since it started, Replication Lag=0
3260 ...
3261 b) Source started, target stopped, add edit on source:
3262 ...
3263 Normal Queue: 1
3264            No Ops shipped since last restart, SizeOfLogQueue=1, TimeStampOfLastArrivedInSource=Wed Nov 21 07:21:00 GMT 2018, Replication Lag=2459
3265 ...
3266 c) Source started, target stopped, edit added on source, restart source:
3267 ...
3268 SOURCE: PeerID=1
3269          Normal Queue: 1
3270            No Ops shipped since last restart, SizeOfLogQueue=1, No edits for this source since it started, Replication Lag=0
3271          Recovered Queue: 1-hbase01.home,16020,1542784524057
3272            No Ops shipped since last restart, SizeOfLogQueue=1, TimeStampOfLastArrivedInSource=Wed Nov 21 07:23:00 GMT 2018, Replication Lag=201495
3273 ...
3274 d) Source started, target stopped, add edit on source, restart source, add another edit on source:
3275 ...
3276 SOURCE: PeerID=1
3277          Normal Queue: 1
3278            No Ops shipped since last restart, SizeOfLogQueue=1, TimeStampOfLastArrivedInSource=Wed Nov 21 07:02:28 GMT 2018, Replication Lag=6349
3279          Recovered Queue: 1-hbase01.home,16020,1542782758742
3280            No Ops shipped since last restart, SizeOfLogQueue=0, TimeStampOfLastArrivedInSource=Wed Nov 21 06:53:05 GMT 2018, Replication Lag=569394
3281 ...
3282 e) Source started, target stopped, add edit on source, restart source, add another edit on source, start target:
3283 ...
3284        SOURCE: PeerID=1
3285          Normal Queue: 1
3286            AgeOfLastShippedOp=30000, TimeStampOfLastShippedOp=Wed Nov 21 07:07:58 GMT 2018, SizeOfLogQueue=1, TimeStampOfLastArrivedInSource=Wed Nov 21 07:02:28 GMT 2018, Replication Lag=0
3287 ...
3288 f) Source started, target stopped, add edit on source, restart source, restart target:
3289 ...
3290 SOURCE: PeerID=1
3291          Normal Queue: 1
3292            No Ops shipped since last restart, SizeOfLogQueue=1, No edits for this source since it started, Replication Lag=0
3293 ...
3294
3295
3296 ---
3297
3298 * [HBASE-21922](https://issues.apache.org/jira/browse/HBASE-21922) | *Major* | **BloomContext#sanityCheck may failed when use ROWPREFIX\_DELIMITED bloom filter**
3299
3300 Remove bloom filter type ROWPREFIX\_DELIMITED. May add it back when find a better solution.
3301
3302
3303 ---
3304
3305 * [HBASE-21783](https://issues.apache.org/jira/browse/HBASE-21783) | *Major* | **Support exceed user/table/ns throttle quota if region server has available quota**
3306
3307 Support enable or disable exceed throttle quota. Exceed throttle quota means, user can over consume user/namespace/table quota if region server has additional available quota because other users don't consume at the same time.
3308 Use the following shell commands to enable/disable exceed throttle quota: enable\_exceed\_throttle\_quota
3309 disable\_exceed\_throttle\_quota
3310 There are two limits when enable exceed throttle quota:
3311 1. Must set at least one read and one write region server throttle quota;
3312 2. All region server throttle quotas must be in seconds time unit. Because once previous requests exceed their quota and consume region server quota, quota in other time units may be refilled in a long time, this may affect later requests.
3313
3314
3315 ---
3316
3317 * [HBASE-20587](https://issues.apache.org/jira/browse/HBASE-20587) | *Major* | **Replace Jackson with shaded thirdparty gson**
3318
3319 Remove jackson dependencies from most hbase modules except hbase-rest, use shaded gson instead. The output json will be a bit different since jackson can use getter/setter, but gson will always use the fields.
3320
3321
3322 ---
3323
3324 * [HBASE-21928](https://issues.apache.org/jira/browse/HBASE-21928) | *Major* | **Deprecated HConstants.META\_QOS**
3325
3326 Mark HConstants.META\_QOS as deprecated. It is for internal use only, which is the highest priority. You should not try to set a priority greater than or equal to this value, although it is no harm but also useless.
3327
3328
3329 ---
3330
3331 * [HBASE-17942](https://issues.apache.org/jira/browse/HBASE-17942) | *Major* | **Disable region splits and merges per table**
3332
3333 This patch adds the ability to disable split and/or merge for a table (By default, split and merge are enabled for a table).
3334
3335
3336 ---
3337
3338 * [HBASE-21636](https://issues.apache.org/jira/browse/HBASE-21636) | *Major* | **Enhance the shell scan command to support missing scanner specifications like ReadType, IsolationLevel etc.**
3339
3340 Allows shell to set Scan options previously not exposed. See additions as part of the scan help by typing following hbase shell:
3341
3342 hbase\> help 'scan'
3343
3344
3345 ---
3346
3347 * [HBASE-21201](https://issues.apache.org/jira/browse/HBASE-21201) | *Major* | **Support to run VerifyReplication MR tool without peerid**
3348
3349 We can specify peerQuorumAddress instead of peerId in VerifyReplication tool. So it no longer requires peerId to be setup when using this tool.
3350
3351 For example:
3352 hbase org.apache.hadoop.hbase.mapreduce.replication.VerifyReplication zk1,zk2,zk3:2181/hbase testTable
3353
3354
3355 ---
3356
3357 * [HBASE-21838](https://issues.apache.org/jira/browse/HBASE-21838) | *Major* | **Create a special ReplicationEndpoint just for verifying the WAL entries are fine**
3358
3359 Introduce a VerifyWALEntriesReplicationEndpoint which replicates nothing but only verifies if all the cells are valid.
3360 It can be used to capture bugs for writing WAL, as most times we will not read the WALs again after writing it if there are no region server crashes.
3361
3362
3363 ---
3364
3365 * [HBASE-21764](https://issues.apache.org/jira/browse/HBASE-21764) | *Major* | **Size of in-memory compaction thread pool should be configurable**
3366
3367 Introduced an new config key in this issue: hbase.regionserver.inmemory.compaction.pool.size. the default value would be 10.  you can configure this to set the pool size of in-memory compaction pool. Note that all memstores in one region server will share the same pool, so if you have many regions in one region server,  you need to set this larger to compact faster for better read performance.
3368
3369
3370 ---
3371
3372 * [HBASE-21684](https://issues.apache.org/jira/browse/HBASE-21684) | *Major* | **Throw DNRIOE when connection or rpc client is closed**
3373
3374 Make StoppedRpcClientException extend DoNotRetryIOException.
3375
3376
3377 ---
3378
3379 * [HBASE-21739](https://issues.apache.org/jira/browse/HBASE-21739) | *Major* | **Move grant/revoke from regionserver to master**
3380
3381 To implement user permission control in Precedure V2, move grant and revoke method from AccessController to master firstly.
3382 Mark AccessController#grant and AccessController#revoke as deprecated and please use Admin#grant and Admin#revoke instead.
3383
3384
3385 ---
3386
3387 * [HBASE-21791](https://issues.apache.org/jira/browse/HBASE-21791) | *Blocker* | **Upgrade thrift dependency to 0.12.0**
3388
3389 IMPORTANT: Due to security issues, all users who use hbase thrift should avoid using releases which do not have this fix.
3390
3391 The effect releases are:
3392 2.1.x: 2.1.2 and below
3393 2.0.x: 2.0.4 and below
3394 1.x: 1.4.x and below
3395
3396 If you are using the effect releases above, please consider upgrading to a newer release ASAP.
3397
3398
3399 ---
3400
3401 * [HBASE-20894](https://issues.apache.org/jira/browse/HBASE-20894) | *Major* | **Move BucketCache from java serialization to protobuf**
3402
3403 For users who have configured hbase.bucketcache.ioengine with either the file:, files:, or mmap: prefix, and configured it to be persistent via the hbase.bucketcache.persistent.path property, the serialization format of the bucket cache has changed between versions. The old state will not be read during startup, and there is currently no migration path. The impact is expected to be minimal, however, since the cache will rebuild over time as access patterns dictate.
3404
3405
3406
3407 # HBASE  2.3.0 Release Notes
3408
3409 These release notes cover new developer and user-facing incompatibilities, important issues, features, and major improvements.
3410
3411
3412 ---
3413
3414 * [HBASE-24603](https://issues.apache.org/jira/browse/HBASE-24603) | *Critical* | **Zookeeper sync() call is async**
3415
3416 <!-- markdown -->
3417
3418 Fixes a couple of bugs in ZooKeeper interaction. Firstly, zk sync() call that is used to sync the lagging followers with leader so that the client sees a consistent snapshot state was actually asynchronous under the hood. We make it synchronous for correctness. Second, zookeeper events are now processed in a separate thread rather than doing it in the thread context of zookeeper client connection. This decoupling frees up client connection quickly and avoids deadlocks.
3419
3420
3421 ---
3422
3423 * [HBASE-24631](https://issues.apache.org/jira/browse/HBASE-24631) | *Major* | **Loosen Dockerfile pinned package versions of the "debian-revision"**
3424
3425 <!-- markdown -->
3426 Update our package version numbers throughout the Dockerfiles to be pinned to their epic:upstream-version components only. Previously we'd specify the full debian package version number, including the debian-revision. This lead to instability as debian packaging details changed.
3427 See also [man deb-version](http://manpages.ubuntu.com/manpages/xenial/en/man5/deb-version.5.html)
3428
3429
3430 ---
3431
3432 * [HBASE-24205](https://issues.apache.org/jira/browse/HBASE-24205) | *Major* | **Create metric to know the number of reads that happens from memstore**
3433
3434 Adds a new metric where we collect the number of read requests (tracked per row) whether the row was fetched completely from memstore or it was pulled from files  and memstore.
3435 The metric is now collected under the mbean for Tables and under the mbean for regions.
3436 Under table mbean ie.-
3437 'name": "Hadoop:service=HBase,name=RegionServer,sub=Tables'
3438 The new metrics will be listed as
3439 {code}
3440     "Namespace\_default\_table\_t3\_columnfamily\_f1\_metric\_memstoreOnlyRowReadsCount": 5,
3441  "Namespace\_default\_table\_t3\_columnfamily\_f1\_metric\_mixedRowReadsCount": 1,
3442 {code}
3443 Where the format is Namespace\_\<namespacename\>\_table\_\<tableName\>\_columnfamily\_\<columnfamilyname\>\_metric\_memstoreOnlyRowReadsCount
3444 Namespace\_\<namespacename\>\_table\_\<tableName\>\_columnfamily\_\<columnfamilyname\>\_metric\_mixedRowReadsCount
3445 {code}
3446
3447 The same one under the region ie.
3448 "name": "Hadoop:service=HBase,name=RegionServer,sub=Regions",
3449 comes as
3450 {code}
3451    "Namespace\_default\_table\_t3\_region\_75a7846f4ac4a2805071a855f7d0dbdc\_store\_f1\_metric\_memstoreOnlyRowReadsCount": 5,
3452     "Namespace\_default\_table\_t3\_region\_75a7846f4ac4a2805071a855f7d0dbdc\_store\_f1\_metric\_mixedRowReadsCount": 1,
3453 {code}
3454 where
3455 Namespace\_\<namespacename\_table\_\<tableName\>\_region\_\<regionName\>\_store\_\<storeName\>\_metric\_memstoreOnlyRowReadsCount
3456 Namespace\_\<namespacename\_table\_\<tableName\>\_region\_\<regionName\>\_store\_\<storeName\>\_metric\_mixedRowReadsCount
3457 This is also an aggregate against every store the number of reads that happened purely from the memstore or it was a  mixed read that happened from memstore and file.
3458
3459
3460 ---
3461
3462 * [HBASE-21773](https://issues.apache.org/jira/browse/HBASE-21773) | *Critical* | **rowcounter utility should respond to pleas for help**
3463
3464 This adds [-h\|-help] options to rowcounter. Passing either -h or -help will print rowcounter guide as below:
3465
3466 $hbase rowcounter -h
3467
3468 usage: hbase rowcounter \<tablename\> [options] [\<column1\> \<column2\>...]
3469 Options:
3470     --starttime=\<arg\>       starting time filter to start counting rows from.
3471     --endtime=\<arg\>         end time filter limit, to only count rows up to this timestamp.
3472     --range=\<arg\>           [startKey],[endKey][;[startKey],[endKey]...]]
3473     --expectedCount=\<arg\>   expected number of rows to be count.
3474 For performance, consider the following configuration properties:
3475 -Dhbase.client.scanner.caching=100
3476 -Dmapreduce.map.speculative=false
3477
3478
3479 ---
3480
3481 * [HBASE-24217](https://issues.apache.org/jira/browse/HBASE-24217) | *Major* | **Add hadoop 3.2.x support**
3482
3483 CI coverage has been extended to include Hadoop 3.2.x for HBase 2.2+.
3484
3485
3486 ---
3487
3488 * [HBASE-23055](https://issues.apache.org/jira/browse/HBASE-23055) | *Major* | **Alter hbase:meta**
3489
3490 Adds being able to edit hbase:meta table schema. For example,
3491
3492 hbase(main):006:0\> alter 'hbase:meta', {NAME =\> 'info', DATA\_BLOCK\_ENCODING =\> 'ROW\_INDEX\_V1'}
3493 Updating all regions with the new schema...
3494 All regions updated.
3495 Done.
3496 Took 1.2138 seconds
3497
3498 You can even add columnfamilies. Howevert, you cannot delete any of the core hbase:meta column families such as 'info' and 'table'.
3499
3500
3501 ---
3502
3503 * [HBASE-15161](https://issues.apache.org/jira/browse/HBASE-15161) | *Major* | **Umbrella: Miscellaneous improvements from production usage**
3504
3505 This ticket summarizes significant improvements and expansion to the metrics surface area. Interested users should review the individual sub-tasks.
3506
3507
3508 ---
3509
3510 * [HBASE-24545](https://issues.apache.org/jira/browse/HBASE-24545) | *Major* | **Add backoff to SCP check on WAL split completion**
3511
3512 Adds backoff in ServerCrashProcedure wait on WAL split to complete if large backlog of files to split (Its possible to avoid SCP blocking, waiting on WALs to split if you use procedure-based splitting --  set 'hbase.split.wal.zk.coordinated' to false to enable procedure based wal splitting.)
3513
3514
3515 ---
3516
3517 * [HBASE-24524](https://issues.apache.org/jira/browse/HBASE-24524) | *Minor* | **SyncTable logging improvements**
3518
3519 Notice this has changed log level for mismatching row keys, originally those were being logged at INFO level, now it's logged at DEBUG level. This is consistent with the logging of mismatching cells. Also, for missing row keys, it now logs row key values in human readable format, making it more meaningful for operators troubleshooting mismatches.
3520
3521
3522 ---
3523
3524 * [HBASE-24359](https://issues.apache.org/jira/browse/HBASE-24359) | *Major* | **Optionally ignore edits for deleted CFs for replication.**
3525
3526 Introduce a new config hbase.replication.drop.on.deleted.columnfamily, default is false. When config to true, the replication will drop the edits for columnfamily that has been deleted from the replication source and target.
3527
3528
3529 ---
3530
3531 * [HBASE-24418](https://issues.apache.org/jira/browse/HBASE-24418) | *Major* | **Consolidate Normalizer implementations**
3532
3533 <!-- markdown -->
3534 This change extends the Normalizer with a handful of new configurations. The configuration points supported are:
3535 * `hbase.normalizer.split.enabled` Whether to split a region as part of normalization. Default: `true`.
3536 * `hbase.normalizer.merge.enabled` Whether to merge a region as part of normalization. Default `true`.
3537 * `hbase.normalizer.min.region.count` The minimum number of regions in a table to consider it for merge normalization. Default: 3.
3538 * `hbase.normalizer.merge.min_region_age.days` The minimum age for a region to be considered for a merge, in days. Default: 3.
3539 * `hbase.normalizer.merge.min_region_size.mb` The minimum size for a region to be considered for a merge, in whole MBs. Default: 1.
3540
3541
3542 ---
3543
3544 * [HBASE-24309](https://issues.apache.org/jira/browse/HBASE-24309) | *Major* | **Avoid introducing log4j and slf4j-log4j dependencies for modules other than hbase-assembly**
3545
3546 Add a hbase-logging module, put the log4j related code in this module only so other modules do not need to depend on log4j at compile scope. See the comments of Log4jUtils and InternalLog4jUtils for more details.
3547
3548 Add a log4j.properties to the test jar of hbase-logging module, so for other sub modules we just need to depend on the test jar of hbase-logging module at test scope to output the log to console, without placing a log4j.properties in the test resources as they all (almost) have the same content. And this test module will not be included in the assembly tarball so it will not mess up the binary distribution.
3549
3550 Ban direct commons-logging dependency, and ban commons-logging and log4j imports in non-test code, to avoid mess up the downstream users logging framework. In hbase-logging module we do need to use log4j classes and the trick is to use full class name.
3551
3552 Add jcl-over-slf4j and jul-to-slf4j dependencies, as some of our dependencies use jcl or jul as logging framework, we should also redirect their log message to slf4j.
3553
3554
3555 ---
3556
3557 * [HBASE-21406](https://issues.apache.org/jira/browse/HBASE-21406) | *Minor* | **"status 'replication'" should not show SINK if the cluster does not act as sink**
3558
3559 Added new metric to differentiate sink startup time from last OP applied time.
3560
3561 Original behaviour was to always set startup time to TimestampsOfLastAppliedOp, and always show it on "status 'replication'" command, regardless if the sink ever applied any OP.
3562
3563 This was confusing, specially for scenarios where cluster was just acting as source, the output could lead to wrong interpretations about sink not applying edits or replication being stuck.
3564
3565 With the new metric, we now compare the two metrics values, assuming that if both are the same, there's never been any OP shipped to the given sink, so output would reflect it more clearly, to something as for example:
3566
3567 SINK: TimeStampStarted=Thu Dec 06 23:59:47 GMT 2018, Waiting for OPs...
3568
3569
3570 ---
3571
3572 * [HBASE-24132](https://issues.apache.org/jira/browse/HBASE-24132) | *Major* | **Upgrade to Apache ZooKeeper 3.5.7**
3573
3574 <!-- markdown -->
3575 HBase ships ZooKeeper 3.5.x. Was the EOL'd 3.4.x. 3.5.x client can talk to 3.4.x ensemble.
3576
3577 The ZooKeeper project has built a [FAQ](https://cwiki.apache.org/confluence/display/ZOOKEEPER/Upgrade+FAQ) that documents known issues and work-arounds when upgrading existing deployments.
3578
3579
3580 ---
3581
3582 * [HBASE-22287](https://issues.apache.org/jira/browse/HBASE-22287) | *Major* | **inifinite retries on failed server in RSProcedureDispatcher**
3583
3584 Add backoff. Avoid retrying every 100ms.
3585
3586
3587 ---
3588
3589 * [HBASE-24425](https://issues.apache.org/jira/browse/HBASE-24425) | *Major* | **Run hbck\_chore\_run and catalogjanitor\_run on draw of 'HBCK Report' page**
3590
3591 Runs 'catalogjanitor\_run' and 'hbck\_chore\_run' inline with the loading of the 'HBCK Report' page.
3592
3593 Pass '?cache=true' to skip inline invocation of 'catalogjanitor\_run' and 'hbck\_chore\_run' drawing the page.
3594
3595
3596 ---
3597
3598 * [HBASE-24408](https://issues.apache.org/jira/browse/HBASE-24408) | *Blocker* | **Introduce a general 'local region' to store data on master**
3599
3600 Introduced a general 'local region' at master side to store the procedure data, etc.
3601
3602 The hfile of this region will be stored on the root fs while the wal will be stored on the wal fs. This issue supercedes part of the code for HBASE-23326, as now we store the data in 'MasterData' directory instead of 'MasterProcs'.
3603
3604 The old hfiles will be moved to the global hfile archived directory with the suffix $-masterlocalhfile-$. The wal files will be moved to the global old wal directory with the suffix $masterlocalwal$. The TimeToLiveMasterLocalStoreHFileCleaner and TimeToLiveMasterLocalStoreWALCleaner are configured by default for cleaning the old hfiles and wal files, and the default TTLs are both 7 days.
3605
3606
3607 ---
3608
3609 * [HBASE-24115](https://issues.apache.org/jira/browse/HBASE-24115) | *Major* | **Relocate test-only REST "client" from src/ to test/ and mark Private**
3610
3611 Relocate test-only REST RemoteHTable and RemoteAdmin from src/ to test/. And mark them as InterfaceAudience.Private.
3612
3613
3614 ---
3615
3616 * [HBASE-23938](https://issues.apache.org/jira/browse/HBASE-23938) | *Major* | **Replicate slow/large RPC calls to HDFS**
3617
3618 Config key: hbase.regionserver.slowlog.systable.enabled
3619 Default value: false
3620
3621 This config can be enabled if hbase.regionserver.slowlog.buffer.enabled is already enabled. While hbase.regionserver.slowlog.buffer.enabled ensures that any slow/large RPC logs with complete details are written to ring buffer available at each RegionServer, hbase.regionserver.slowlog.systable.enabled would ensure that all such logs are also persisted in new system table hbase:slowlog.
3622 Operator can scan hbase:slowlog with filters to retrieve specific attribute matching records and this table would be useful to capture historical performance of slowness of RPC calls with detailed analysis.
3623
3624 hbase:slowlog consists of single ColumnFamily info. info consists of multiple qualifiers similar to the attributes available to query as part of Admin API: get\_slowlog\_responses.
3625
3626 One example of a row from hbase:slowlog scan result (Attached a sample screenshot in the Jira) :
3627
3628  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:call\_details, timestamp=2020-05-16T14:59:58.764Z, value=Scan(org.apache.hadoop.hbase.shaded.protobuf.generated.ClientProtos$ScanRequest)
3629  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:client\_address, timestamp=2020-05-16T14:59:58.764Z, value=172.20.10.2:57348
3630  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:method\_name, timestamp=2020-05-16T14:59:58.764Z, value=Scan
3631  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:param, timestamp=2020-05-16T14:59:58.764Z, value=region { type: REGION\_NAME value: "cluster\_test,cccccccc,1589635796466.aa45e1571d533f5ed0bb31cdccaaf9cf." } scan { a
3632                                                              ttribute { name: "\_isolationlevel\_" value: "\\x5C000" } start\_row: "cccccccc" time\_range { from: 0 to: 9223372036854775807 } max\_versions: 1 cache\_blocks: true max\_result\_size: 2
3633                                                              097152 caching: 2147483647 include\_stop\_row: false } number\_of\_rows: 2147483647 close\_scanner: false client\_handles\_partials: true client\_handles\_heartbeats: true track\_scan\_met
3634                                                              rics: false
3635  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:processing\_time, timestamp=2020-05-16T14:59:58.764Z, value=24
3636  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:queue\_time, timestamp=2020-05-16T14:59:58.764Z, value=0
3637  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:region\_name, timestamp=2020-05-16T14:59:58.764Z, value=cluster\_test,cccccccc,1589635796466.aa45e1571d533f5ed0bb31cdccaaf9cf.
3638  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:response\_size, timestamp=2020-05-16T14:59:58.764Z, value=211227
3639  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:server\_class, timestamp=2020-05-16T14:59:58.764Z, value=HRegionServer
3640  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:start\_time, timestamp=2020-05-16T14:59:58.764Z, value=1589640743932
3641  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:type, timestamp=2020-05-16T14:59:58.764Z, value=ALL
3642  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:username, timestamp=2020-05-16T14:59:58.764Z, value=vjasani
3643
3644
3645 ---
3646
3647 * [HBASE-24271](https://issues.apache.org/jira/browse/HBASE-24271) | *Major* | **Set values in \`conf/hbase-site.xml\` that enable running on \`LocalFileSystem\` out of the box**
3648
3649 <!-- markdown -->
3650 HBASE-24271 makes changes the the default `conf/hbase-site.xml` such that `bin/hbase` will run directly out of the binary tarball or a compiled source tree without any configuration modifications vs. Hadoop 2.8+. This changes our long-standing history of shipping no configured values in `conf/hbase-site.xml`, so existing processes that assume this file is empty of configuration properties may require attention.
3651
3652
3653 ---
3654
3655 * [HBASE-24310](https://issues.apache.org/jira/browse/HBASE-24310) | *Major* | **Use Slf4jRequestLog for hbase-http**
3656
3657 Use Slf4jRequestLog instead of the log4j HttpRequestLogAppender in HttpServer.
3658
3659 The request log is disabled by default in conf/log4j.properties by the following lines:
3660
3661 # Disable request log by default, you can enable this by changing the appender
3662 log4j.category.http.requests=INFO,NullAppender
3663 log4j.additivity.http.requests=false
3664
3665 Change the 'NullAppender' to what ever you want if you want to enable request log.
3666
3667 Notice that, the logger name for master status http server is 'http.requests.master', and for region server it is 'http.requests.regionserver'
3668
3669
3670 ---
3671
3672 * [HBASE-24335](https://issues.apache.org/jira/browse/HBASE-24335) | *Major* | **Support deleteall with ts but without column in shell mode**
3673
3674 Use a empty string to represent no column specified for deleteall in shell mode.
3675 useage:
3676 deleteall 'test','r1','',12345
3677 deleteall 'test', {ROWPREFIXFILTER =\> 'prefix'}, '', 12345
3678
3679
3680 ---
3681
3682 * [HBASE-24304](https://issues.apache.org/jira/browse/HBASE-24304) | *Major* | **Separate a hbase-asyncfs module**
3683
3684 Added a new hbase-asyncfs module to hold the asynchronous dfs output stream implementation for implementing WAL.
3685
3686
3687 ---
3688
3689 * [HBASE-22710](https://issues.apache.org/jira/browse/HBASE-22710) | *Major* | **Wrong result in one case of scan that use  raw and versions and filter together**
3690
3691 Make the logic of the versions chosen more reasonable for raw scan, to avoid lose result when using filter.
3692
3693
3694 ---
3695
3696 * [HBASE-24285](https://issues.apache.org/jira/browse/HBASE-24285) | *Major* | **Move to hbase-thirdparty-3.3.0**
3697
3698 Moved to hbase-thirdparty 3.3.0.
3699
3700
3701 ---
3702
3703 * [HBASE-24252](https://issues.apache.org/jira/browse/HBASE-24252) | *Major* | **Implement proxyuser/doAs mechanism for hbase-http**
3704
3705 This feature enables the HBase Web UI's to accept a 'proxyuser' via the HTTP Request's query string. When the parameter \`hbase.security.authentication.spnego.kerberos.proxyuser.enable\` is set to \`true\` in hbase-site.xml (default is \`false\`), the HBase UI will attempt to impersonate the user specified by the query parameter "doAs". This query parameter is checked case-insensitively. When this option is not provided, the user who executed the request is the "real" user and there is no ability to execute impersonation against the WebUI.
3706
3707 For example, if the user "bob" with Kerberos credentials executes a request against the WebUI with this feature enabled and a query string which includes \`doAs=alice\`, the HBase UI will treat this request as executed as \`alice\`, not \`bob\`.
3708
3709 The standard Hadoop proxyuser configuration properties to limit users who may impersonate others apply to this change (e.g. to enable \`bob\` to impersonate \`alice\`). See the Hadoop documentation for more information on how to configure these proxyuser rules.
3710
3711
3712 ---
3713
3714 * [HBASE-24143](https://issues.apache.org/jira/browse/HBASE-24143) | *Major* | **[JDK11] Switch default garbage collector from CMS**
3715
3716 <!-- markdown -->
3717 `bin/hbase` will now dynamically select a Garbage Collector implementation based on the detected JVM version. JDKs 8,9,10 use `-XX:+UseConcMarkSweepGC`, while JDK11+ use `-XX:+UseG1GC`.
3718
3719 Notice a slight compatibility change. Previously, the garbage collector choice would always be appended to a user-provided value for `HBASE_OPTS`. As of this change, this setting will only be applied when `HBASE_OPTS` is unset. That means that operators who provide a value for this variable will now need to also specify the collector. This is especially important for those on JDK8, where the vm default GC is not the recommended ConcMarkSweep.
3720
3721
3722 ---
3723
3724 * [HBASE-24024](https://issues.apache.org/jira/browse/HBASE-24024) | *Major* | **Optionally reject multi() requests with very high no of rows**
3725
3726 New Config: hbase.rpc.rows.size.threshold.reject
3727 -----------------------------------------------------------------------
3728
3729 Default value: false
3730 Description:
3731 If value is true, RegionServer will abort batch requests of Put/Delete with number of rows in a batch operation exceeding threshold defined by value of config: hbase.rpc.rows.warning.threshold.
3732
3733
3734 ---
3735
3736 * [HBASE-24139](https://issues.apache.org/jira/browse/HBASE-24139) | *Critical* | **Balancer should avoid leaving idle region servers**
3737
3738 StochasticLoadBalancer functional improvement:
3739
3740 StochasticLoadBalancer would rebalance the cluster if there are any idle RegionServers in the cluster (RegionServer having no region), while other RegionServers have at least 1 region available.
3741
3742
3743 ---
3744
3745 * [HBASE-24196](https://issues.apache.org/jira/browse/HBASE-24196) | *Major* | **[Shell] Add rename rsgroup command in hbase shell**
3746
3747 user or admin can now use
3748 hbase shell \> rename\_rsgroup 'oldname', 'newname'
3749 to rename rsgroup.
3750
3751
3752 ---
3753
3754 * [HBASE-24218](https://issues.apache.org/jira/browse/HBASE-24218) | *Major* | **Add hadoop 3.2.x in hadoop check**
3755
3756 Add hadoop-3.2.0 and hadoop-3.2.1 in hadoop check and when '--quick-hadoopcheck' we will only check hadoop-3.2.1.
3757
3758 Notice that, for aligning the personality scripts across all the active branches, we will commit the patch to all active branches, but the hadoop-3.2.x support in hadoopcheck is only applied to branch-2.2+.
3759
3760
3761 ---
3762
3763 * [HBASE-23829](https://issues.apache.org/jira/browse/HBASE-23829) | *Major* | **Get \`-PrunSmallTests\` passing on JDK11**
3764
3765 \`-PrunSmallTests\` now pass on JDK11 when using \`-Phadoop.profile=3.0\`.
3766
3767
3768 ---
3769
3770 * [HBASE-24185](https://issues.apache.org/jira/browse/HBASE-24185) | *Major* | **Junit tests do not behave well with System.exit or Runtime.halt or JVM exits in general.**
3771
3772 Tests that fail because a process -- RegionServer or Master -- called System.exit, will now instead throw an exception.
3773
3774
3775 ---
3776
3777 * [HBASE-24072](https://issues.apache.org/jira/browse/HBASE-24072) | *Major* | **Nightlies reporting OutOfMemoryError: unable to create new native thread**
3778
3779 Hadoop hosts have had their ulimit -u raised from 10000 to 30000 (per user, by INFRA). The Docker build container has had its limit raised from 10000 to 12500.
3780
3781
3782 ---
3783
3784 * [HBASE-24112](https://issues.apache.org/jira/browse/HBASE-24112) | *Major* | **[RSGroup] Support renaming rsgroup**
3785
3786 Support RSGroup renaming in core codebase. New API Admin#renameRSGroup(String, String) is introduced in 3.0.0.
3787
3788
3789 ---
3790
3791 * [HBASE-23994](https://issues.apache.org/jira/browse/HBASE-23994) | *Trivial* | ** Add WebUI to Canary**
3792
3793 <!-- markdown -->
3794 The Canary tool now offers a WebUI when run in `region` mode (the default mode). It is enabled by default, and by default, it binds to `0.0.0.0:16050`. This can be overridden by setting `hbase.canary.info.bindAddress` and `hbase.canary.info.port`. To disable entirely, set the port to `-1`.
3795
3796
3797 ---
3798
3799 * [HBASE-23779](https://issues.apache.org/jira/browse/HBASE-23779) | *Major* | **Up the default fork count to make builds complete faster; make count relative to CPU count**
3800
3801 Pass --threads=2 building on jenkins. It shortens nightly build times by about ~25%.
3802
3803 It works by running module build/test in parallel when dependencies allow. Upping the forkcount beyond the pom default of 0.25C would have us broach our CPU budget on jenkins when two modules are running in parallel (2 modules at 0.25% of CPU each makes 0.5C and on jenkins, hadoop nodes run two jenkins executors per host).  Higher forkcounts also seems to threaten build stability.
3804
3805 For running tests locally, to go faster, up fork count.
3806
3807 $ x="0.5C"  ;  mvn --threads=2  -Dsurefire.firstPartForkCount=$x -Dsurefire.secondPartForkCount=$x test -PrunAllTests
3808
3809 You could up the x from 0.5C to 1.0C but YMMV (On overcommitted hardware, tests start bombing out pretty soon after startup). You could try upping thread count but on occasion are likely to overcommit hardware.
3810
3811
3812 ---
3813
3814 * [HBASE-24126](https://issues.apache.org/jira/browse/HBASE-24126) | *Major* | **Up the container nproc uplimit from 10000 to 12500**
3815
3816 Start docker with upped ulimit for nproc passing '--ulimit nproc=12500'. It was 10000, the default, but made it 12500. Then, set PROC\_LIMIT in hbase-personality so when yetus runs, it is w/ the new 12500 value.
3817
3818
3819 ---
3820
3821 * [HBASE-24150](https://issues.apache.org/jira/browse/HBASE-24150) | *Major* | **Allow module tests run in parallel**
3822
3823 Pass -T2 to mvn. Makes it so we do two modules-at-a-time dependencies willing. Helps speed build and testing. Doubles the resource usage when running modules in parallel.
3824
3825
3826 ---
3827
3828 * [HBASE-24121](https://issues.apache.org/jira/browse/HBASE-24121) | *Major* | **[Authorization] ServiceAuthorizationManager isn't dynamically updatable. And it should be.**
3829
3830 Master & RegionService now support refresh policy authorization defined in hbase-policy.xml without restarting service. To refresh policy, please execute hbase shell command: update\_config or update\_config\_all after policy file updated and synced on all nodes.
3831
3832
3833 ---
3834
3835 * [HBASE-24099](https://issues.apache.org/jira/browse/HBASE-24099) | *Major* | **Use a fair ReentrantReadWriteLock for the region close lock**
3836
3837 This change modifies the default acquisition policy for the region's close lock in order to prevent observed starvation of close requests. The new boolean configuration parameter 'hbase.regionserver.fair.region.close.lock' controls the lock acquisition policy: if true, the lock is created in fair mode (default); if false, the lock is created in nonfair mode (the old default).
3838
3839
3840 ---
3841
3842 * [HBASE-23153](https://issues.apache.org/jira/browse/HBASE-23153) | *Major* | **PrimaryRegionCountSkewCostFunction SLB function should implement CostFunction#isNeeded**
3843
3844 <!-- markdown -->
3845 The `PrimaryRegionCountSkewCostFunction` for the `StochasticLoadBalancer` is only needed when the read replicas feature is enabled. With this change, that function now properly indicates that it is not needed when the read replica feature is off.
3846
3847 If this improvement is not available, operators with clusters that are not using the read replica feature should manually disable it by setting `hbase.master.balancer.stochastic.primaryRegionCountCost` to `0.0` in hbase-site.xml for all HBase Masters.
3848
3849
3850 ---
3851
3852 * [HBASE-24055](https://issues.apache.org/jira/browse/HBASE-24055) | *Major* | **Make AsyncFSWAL can run on EC cluster**
3853
3854 Now AsyncFSWAL can also be used against the directory which has EC enabled. Need to make sure you also make use of the hadoop 3.x client as the option is only available in hadoop 3.x.
3855
3856
3857 ---
3858
3859 * [HBASE-24113](https://issues.apache.org/jira/browse/HBASE-24113) | *Major* | **Upgrade the maven we use from 3.5.4 to 3.6.3 in nightlies**
3860
3861 Branches-2.3+ use maven 3.5.3 building. Older branches use 3.5.4 still.
3862
3863
3864 ---
3865
3866 * [HBASE-24122](https://issues.apache.org/jira/browse/HBASE-24122) | *Major* | **Change machine ulimit-l to ulimit-a so dumps full ulimit rather than just 'max locked memory'**
3867
3868 Our 'Build Artifacts' have a machine directory under which we emit vitals on the host the build was run on. We used to emit the result of 'ulimit -l' as a file named 'ulimit-l'. This has been hijacked to instead emit result of running 'ulimit -a' which includes stat on ulimit -l.
3869
3870
3871 ---
3872
3873 * [HBASE-23678](https://issues.apache.org/jira/browse/HBASE-23678) | *Major* | **Literate builder API for version management in schema**
3874
3875 ColumnFamilyDescriptor new builder API:
3876
3877     /\*\*
3878      \* Retain all versions for a given TTL(retentionInterval), and then only a specific number
3879      \* of versions(versionAfterInterval) after that interval elapses.
3880      \*
3881      \* @param retentionInterval Retain all versions for this interval
3882      \* @param versionAfterInterval Retain no of versions to retain after retentionInterval
3883      \*/
3884     public ModifyableColumnFamilyDescriptor setVersionsWithTimeToLive(
3885         final int retentionInterval, final int versionAfterInterval)
3886
3887
3888 ---
3889
3890 * [HBASE-24050](https://issues.apache.org/jira/browse/HBASE-24050) | *Major* | **Deprecated PBType on all 2.x branches**
3891
3892 org.apache.hadoop.hbase.types.PBType is marked as deprecated without any replacement. It will be moved to hbase-example module and marked as IA.Private in 3.0.0. This is a mistake as it should not be part of our public API. Users who depend on this class should just copy the code your own code base.
3893
3894
3895 ---
3896
3897 * [HBASE-8868](https://issues.apache.org/jira/browse/HBASE-8868) | *Minor* | **add metric to report client shortcircuit reads**
3898
3899 Expose file system level read metrics for RegionServer.
3900
3901 If the HBase RS runs on top of HDFS, calculate the aggregation of
3902 ReadStatistics of each HdfsFileInputStream. These metrics include:
3903 (1) total number of bytes read from HDFS.
3904 (2) total number of bytes read from local DataNode.
3905 (3) total number of bytes read locally through short-circuit read.
3906 (4) total number of bytes read locally through zero-copy read.
3907
3908 Because HDFS ReadStatistics is calculated per input stream, it is not
3909 feasible to update the aggregated number in real time. Instead, the
3910 metrics are updated when an input stream is closed.
3911
3912
3913 ---
3914
3915 * [HBASE-24032](https://issues.apache.org/jira/browse/HBASE-24032) | *Major* | **[RSGroup] Assign created tables to respective rsgroup automatically instead of manual operations**
3916
3917 Admin can determine which tables go to which rsgroup by script  (setting hbase.rsgroup.table.mapping.script with local filystem path) on Master side which aims to lighten the burden of admin operations.  Note, since HBase 3+, rsgroup can be specified in TableDescriptor as well, if clients specify this, master will skip the determination from script.
3918
3919 Here is a simple example of script:
3920 {code}
3921 # Input consists of two string, 1st is the namespace of the table, 2nd is the table name of the table
3922 #!/bin/bash
3923 namespace=$1
3924 tablename=$2
3925 if [[ $namespace == test ]]; then
3926   echo test
3927 elif [[ $tablename == \*foo\* ]]; then
3928   echo other
3929 else
3930   echo default
3931 fi
3932 {code}
3933
3934
3935 ---
3936
3937 * [HBASE-23993](https://issues.apache.org/jira/browse/HBASE-23993) | *Major* | **Use loopback for zk standalone server in minizkcluster**
3938
3939 MiniZKCluster now puts up its standalone node listening on loopback/127.0.0.1 rather than "localhost".
3940
3941
3942 ---
3943
3944 * [HBASE-23986](https://issues.apache.org/jira/browse/HBASE-23986) | *Major* | **Bump hadoop-two.version to 2.10.0 on master and branch-2**
3945
3946 Bumped hadoop-two.version to 2.10.0, which means we will drop the support for hadoop-2.8.x and hadoop-2.9.x.
3947
3948
3949 ---
3950
3951 * [HBASE-23930](https://issues.apache.org/jira/browse/HBASE-23930) | *Minor* | **Shell should attempt to format \`timestamp\` attributes as ISO-8601**
3952
3953 Change timestamp display to be ISO8601 when toString on Cell and outputting in shell....
3954
3955 User used to see....
3956
3957   column=table:state, timestamp=1583967620343 .....
3958
3959 ... but now sees:
3960
3961   column=table:state, timestamp=2020-03-11T23:00:20.343Z ....
3962
3963
3964 ---
3965
3966 * [HBASE-22827](https://issues.apache.org/jira/browse/HBASE-22827) | *Major* | **Expose multi-region merge in shell and Admin API**
3967
3968 merge\_region shell command can now be used to merge more than 2 regions as well. It takes a list of regions as comma separated values or as an array of regions, and not just 2 regions. The full regionnames and encoded regionnames are continued to be accepted.
3969
3970
3971 ---
3972
3973 * [HBASE-23767](https://issues.apache.org/jira/browse/HBASE-23767) | *Major* | **Add JDK11 compilation and unit test support to Github precommit**
3974
3975 Rebuild our Dockerfile with support for multiple JDK versions. Use multiple stages in the Jenkinsfile instead of yetus's multijdk because of YETUS-953. Run those multiple stages in parallel to speed up results.
3976
3977 Note that multiple stages means multiple Yetus invocations means multiple comments on the PreCommit. This should become more obvious to users once we can make use of GitHub Checks API, HBASE-23902.
3978
3979
3980 ---
3981
3982 * [HBASE-22978](https://issues.apache.org/jira/browse/HBASE-22978) | *Minor* | **Online slow response log**
3983
3984 get\_slowlog\_responses and clear\_slowlog\_responses are used to retrieve and clear slow RPC logs from RingBuffer maintained by RegionServers.
3985
3986 New Admin APIs:
3987 1.   List\<SlowLogRecord\> getSlowLogResponses(final Set\<ServerName\> serverNames,
3988       final SlowLogQueryFilter slowLogQueryFilter) throws IOException;
3989
3990 2.   List\<Boolean\> clearSlowLogResponses(final Set\<ServerName\> serverNames)
3991       throws IOException;
3992
3993 Configs:
3994
3995 1. hbase.regionserver.slowlog.ringbuffer.size:
3996 Default size of ringbuffer to be maintained by each RegionServer in order to store online slowlog responses. This is an in-memory ring buffer of requests that were judged to be too slow in addition to the responseTooSlow logging. The in-memory representation would be complete. For more details, please look into Doc Section: Get Slow Response Log from shell
3997
3998 Default
3999 256
4000
4001 2. hbase.regionserver.slowlog.buffer.enabled:
4002 Indicates whether RegionServers have ring buffer running for storing Online Slow logs in FIFO manner with limited entries. The size of the ring buffer is indicated by config: hbase.regionserver.slowlog.ringbuffer.size The default value is false, turn this on and get latest slowlog responses with complete data.
4003
4004 Default
4005 false
4006
4007
4008 For more details, please look into "Get Slow Response Log from shell" section from HBase book.
4009
4010
4011 ---
4012
4013 * [HBASE-23926](https://issues.apache.org/jira/browse/HBASE-23926) | *Major* | **[Flakey Tests] Down the flakies re-run ferocity; it makes for too many fails.**
4014
4015 Down the flakey re-rerun fork count from 1.0C -- i.e. a fork per CPU -- to 0.25C. On a recent run, the machine had 16 cores. 0.25 is 4 cores. We'd hardcoded fork count at 3 previous to changes made by parent.
4016
4017
4018 ---
4019
4020 * [HBASE-23146](https://issues.apache.org/jira/browse/HBASE-23146) | *Major* | **Support CheckAndMutate with multiple conditions**
4021
4022 Add a checkAndMutate(row, filter) method in the AsyncTable interface and the Table interface.
4023
4024 This method atomically checks if the row matches the specified filter. If it does, it adds the Put/Delete/RowMutations.
4025
4026 This is a fluent style API, the code is like:
4027
4028 For Table interface:
4029 {code}
4030 table.checkAndMutate(row, filter).thenPut(put);
4031 {code}
4032
4033 For AsyncTable interface:
4034 {code}
4035 table.checkAndMutate(row, filter).thenPut(put)
4036     .thenAccept(succ -\> {
4037       if (succ) {
4038         System.out.println("Check and put succeeded");
4039       } else {
4040         System.out.println("Check and put failed");
4041       }
4042     });
4043 {code}
4044
4045
4046 ---
4047
4048 * [HBASE-23874](https://issues.apache.org/jira/browse/HBASE-23874) | *Minor* | **Move Jira-attached file precommit definition from script in Jenkins config to dev-support**
4049
4050 The Jira Precommit job (https://builds.apache.org/job/PreCommit-HBASE-Build/) will now look for a file within the source tree (dev-support/jenkins\_precommit\_jira\_yetus.sh) instead of depending on a script section embedded in the job.
4051
4052
4053 ---
4054
4055 * [HBASE-23865](https://issues.apache.org/jira/browse/HBASE-23865) | *Major* | **Up flakey history from 5 to 10**
4056
4057 Changed flakey list reporting to show 5 rather than 10 items. Also changed the second and first part fort counts to be 1C rather than hardcoded 3.
4058
4059
4060 ---
4061
4062 * [HBASE-23554](https://issues.apache.org/jira/browse/HBASE-23554) | *Major* | **Encoded regionname to regionname utility**
4063
4064     Adds shell command regioninfo:
4065
4066       hbase(main):001:0\>  regioninfo '0e6aa5c19ae2b2627649dc7708ce27d0'
4067       {ENCODED =\> 0e6aa5c19ae2b2627649dc7708ce27d0, NAME =\> 'TestTable,,1575941375972.0e6aa5c19ae2b2627649dc7708ce27d0.', STARTKEY =\> '', ENDKEY =\> '00000000000000000000299441'}
4068       Took 0.4737 seconds
4069
4070
4071 ---
4072
4073 * [HBASE-23350](https://issues.apache.org/jira/browse/HBASE-23350) | *Major* | **Make compaction files cacheonWrite configurable based on threshold**
4074
4075 This JIRA adds a new configuration - \`hbase.rs.cachecompactedblocksonwrite.threshold\`. This configuration is the maximum total size (in bytes) of the compacted files below which the configuration \`hbase.rs.cachecompactedblocksonwrite\` is honoured. If the total size of the compacted fies exceeds this threshold, even when \`hbase.rs.cachecompactedblocksonwrite\` is enabled, the data blocks are not cached. Caching index and bloom blocks is not affected by this configuration (user configuration is always honoured).
4076
4077 Default value of this configuration is Long.MAX\_VALUE. This means whatever the total size of the compacted files, it wil be cached.
4078
4079
4080 ---
4081
4082 * [HBASE-17115](https://issues.apache.org/jira/browse/HBASE-17115) | *Major* | **HMaster/HRegion Info Server does not honour admin.acl**
4083
4084 Implements authorization for the HBase Web UI by limiting access to certain endpoints which could be used to extract sensitive information from HBase.
4085
4086 Access to these restricted endpoints can be limited to a group of administrators, identified either by a list of users (hbase.security.authentication.spnego.admin.users) or by a list of groups
4087 (hbase.security.authentication.spnego.admin.groups).  By default, neither of these values are set which will preserve backwards compatibility (allowing all authenticated users to access all endpoints).
4088
4089 Further, users who have sensitive information in the HBase service configuration can set hbase.security.authentication.ui.config.protected to true which will treat the configuration endpoint as a protected, admin-only resource. By default, all authenticated users may access the configuration endpoint.
4090
4091
4092 ---
4093
4094 * [HBASE-23647](https://issues.apache.org/jira/browse/HBASE-23647) | *Major* | **Make MasterRegistry the default registry impl**
4095
4096 <!-- markdown -->
4097 Enables master based registry as the default registry used by clients to fetch connection metadata.
4098 Refer to the section "Master Registry" in the client documentation for more details and advantages
4099 of this implementation over the default Zookeeper based registry.
4100
4101 Configuration parameter that controls the registry in use: `hbase.client.registry.impl`
4102
4103 Where to set this: HBase client configuration (hbase-site.xml)
4104
4105 Possible values:
4106 - `org.apache.hadoop.hbase.client.ZKConnectionRegistry` (For ZK based registry implementation)
4107 - `org.apache.hadoop.hbase.client.MasterRegistry` (New, for master based registry implementation)
4108
4109 Notes on defaults:
4110
4111 - For v3.0.0 and later, MasterRegistry is the default registry
4112 - For all releases in 2.x line, ZK based registry is the default.
4113
4114 This feature has been back ported to 2.3.0 and later releases. MasterRegistry can be enabled by setting the following client configuration.
4115
4116 ```
4117 <property>
4118   <name>hbase.client.registry.impl</name>
4119   <value>org.apache.hadoop.hbase.client.MasterRegistry</value>
4120 </property>
4121 ```
4122
4123
4124 ---
4125
4126 * [HBASE-23069](https://issues.apache.org/jira/browse/HBASE-23069) | *Critical* | **periodic dependency bump for Sep 2019**
4127
4128 caffeine: 2.6.2 =\> 2.8.1
4129 commons-codec: 1.10 =\> 1.13
4130 commons-io: 2.5 =\> 2.6
4131 disrupter: 3.3.6 =\> 3.4.2
4132 httpcore: 4.4.6 =\> 4.4.13
4133 jackson: 2.9.10 =\> 2.10.1
4134 jackson.databind: 2.9.10.1 =\> 2.10.1
4135 jetty: 9.3.27.v20190418 =\> 9.3.28.v20191105
4136 protobuf.plugin: 0.5.0 =\> 0.6.1
4137 zookeeper: 3.4.10 =\> 3.4.14
4138 slf4j: 1.7.25 =\> 1.7.30
4139 rat: 0.12 =\> 0.13
4140 asciidoctor: 1.5.5 =\> 1.5.8
4141 asciidoctor.pdf: 1.5.0-alpha.15 =\> 1.5.0-rc.2
4142 error-prone: 2.3.3 =\> 2.3.4
4143
4144
4145 ---
4146
4147 * [HBASE-23686](https://issues.apache.org/jira/browse/HBASE-23686) | *Major* | **Revert binary incompatible change and remove reflection**
4148
4149 - Reverts a binary incompatible binary change for ByteRangeUtils
4150 - Usage of reflection inside CommonFSUtils removed
4151
4152
4153 ---
4154
4155 * [HBASE-23347](https://issues.apache.org/jira/browse/HBASE-23347) | *Major* | **Pluggable RPC authentication**
4156
4157 This change introduces an internal abstraction layer which allows for new SASL-based authentication mechanisms to be used inside HBase services. All existing SASL-based authentication mechanism were ported to the new abstraction, making no external change in runtime semantics, client API, or RPC serialization format.
4158
4159 Developers familiar with extending HBase can implement authentication mechanism beyond simple Kerberos and DelegationTokens which authenticate HBase users against some other user database. HBase service authentication (Master to/from RegionServer) continue to operate solely over Kerberos.
4160
4161
4162 ---
4163
4164 * [HBASE-23156](https://issues.apache.org/jira/browse/HBASE-23156) | *Major* | **start-hbase.sh failed with ClassNotFoundException when build with hadoop3**
4165
4166 Introduce a new hbase-assembly/src/main/assembly/hadoop-three-compat.xml for build with hadoop 3.x.
4167
4168
4169 ---
4170
4171 * [HBASE-23680](https://issues.apache.org/jira/browse/HBASE-23680) | *Major* | **RegionProcedureStore missing cleaning of hfile archive**
4172
4173 Add a new config to hbase-default.xml
4174
4175   \<property\>
4176     \<name\>hbase.procedure.store.region.hfilecleaner.plugins\</name\>
4177     \<value\>org.apache.hadoop.hbase.master.cleaner.TimeToLiveHFileCleaner\</value\>
4178     \<description\>A comma-separated list of BaseHFileCleanerDelegate invoked by
4179     the RegionProcedureStore HFileCleaner service. These HFiles cleaners are
4180     called in order, so put the cleaner that prunes the most files in front. To
4181     implement your own BaseHFileCleanerDelegate, just put it in HBase's classpath
4182     and add the fully qualified class name here. Always add the above
4183     default hfile cleaners in the list as they will be overwritten in
4184     hbase-site.xml.\</description\>
4185   \</property\>
4186
4187 It will share the same TTL with other HFileCleaners. And you can also implement your own cleaner and change this property to enable it.
4188
4189
4190 ---
4191
4192 * [HBASE-23675](https://issues.apache.org/jira/browse/HBASE-23675) | *Minor* | **Move to Apache parent POM version 22**
4193
4194 Updated parent pom to Apache version 22.
4195
4196
4197 ---
4198
4199 * [HBASE-23679](https://issues.apache.org/jira/browse/HBASE-23679) | *Critical* | **FileSystem instance leaks due to bulk loads with Kerberos enabled**
4200
4201 This issues fixes an issue with Bulk Loading on installations with Kerberos enabled and more than a single RegionServer. When multiple tables are involved in hosting a table's regions which are being bulk-loaded into, all but the RegionServer hosting the table's first Region will "leak" one DistributedFileSystem object onto the heap, never freeing that memory. Eventually, with enough bulk loads, this will create a situation for RegionServers where they have no free heap space and will either spend all time in JVM GC, lose their ZK session, or crash with an OutOfMemoryError.
4202
4203 The only mitigation for this issue is to periodically restart RegionServers. All earlier versions of HBase 2.x are subject to this issue (2.0.x, \<=2.1.8, \<=2.2.3)
4204
4205
4206 ---
4207
4208 * [HBASE-23286](https://issues.apache.org/jira/browse/HBASE-23286) | *Major* | **Improve MTTR: Split WAL to HFile**
4209
4210 Add a new feature to improve MTTR which have 3 steps to failover:
4211 1. Read WAL and write HFile to region’s column family’s recovered.hfiles directory.
4212 2. Open region.
4213 3. Bulkload the recovered.hfiles for every column family.
4214
4215 Compared to DLS(distributed log split), this feature will reduce region open time significantly.
4216
4217 Config hbase.wal.split.to.hfile to true to enable this featue.
4218
4219
4220 ---
4221
4222 * [HBASE-23619](https://issues.apache.org/jira/browse/HBASE-23619) | *Trivial* | **Use built-in formatting for logging in hbase-zookeeper**
4223
4224 Changed the logging in hbase-zookeeper to use built-in formatting
4225
4226
4227 ---
4228
4229 * [HBASE-23628](https://issues.apache.org/jira/browse/HBASE-23628) | *Minor* | **Replace Apache Commons Digest Base64 with JDK8 Base64**
4230
4231 From the PR:
4232
4233 "Yes. The two create the same output... I just wrote a small test suite to increase my confidence on that. I generated many tens of millions of random byte patterns and compared the output of the two algorithms. They came back identical every time.
4234
4235 "Just in case any inquiring minds would like to know, there is no longer an encoding required when generating the strings. The JDK implementation specifically specifies that strings returned are StandardCharsets.ISO\_8859\_1. This does not change anything because UTF8 and ISO\_8859 overlap for the limited character set (64 characters) the encoding uses."
4236
4237
4238 ---
4239
4240 * [HBASE-23651](https://issues.apache.org/jira/browse/HBASE-23651) | *Major* | **Region balance throttling can be disabled**
4241
4242 Set hbase.balancer.max.balancing to a int value which \<=0 will disable region balance throttling.
4243
4244
4245 ---
4246
4247 * [HBASE-23588](https://issues.apache.org/jira/browse/HBASE-23588) | *Major* | **Cache index blocks and bloom blocks on write if CacheCompactedBlocksOnWrite is enabled**
4248
4249 If cacheOnWrite is enabled during flush or compaction, index and bloom blocks(with data blocks) would be automatically cached during write.
4250
4251
4252 ---
4253
4254 * [HBASE-23369](https://issues.apache.org/jira/browse/HBASE-23369) | *Major* | **Auto-close 'unknown' Regions reported as OPEN on RegionServers**
4255
4256 If a RegionServer reports a Region as OPEN in disagreement with Master's status on the Region, the Master now tells the RegionServer to silently close the Region.
4257
4258
4259 ---
4260
4261 * [HBASE-23596](https://issues.apache.org/jira/browse/HBASE-23596) | *Major* | **HBCKServerCrashProcedure can double assign**
4262
4263 Makes it so the recently added HBCKServerCrashProcedure -- the SCP that gets invoked when an operator schedules an SCP via hbck2 scheduleRecoveries command -- now works the same as SCP EXCEPT if master knows nothing of the scheduled servername. In this latter case, HBCKSCP will do a full scan of hbase:meta looking for instances of the passed servername. If any found it will attempt cleanup of hbase:meta references by reassigning any found OPEN or OPENING and by closing any in CLOSING state.
4264
4265 Used to fix instances of what the 'HBCK Report' page shows as 'Unknown Servers'.
4266
4267
4268 ---
4269
4270 * [HBASE-23624](https://issues.apache.org/jira/browse/HBASE-23624) | *Major* | **Add a tool to dump the procedure info in HFile**
4271
4272 Use ./hbase org.apache.hadoop.hbase.procedure2.store.region.HFileProcedurePrettyPrinter to run the tool.
4273
4274
4275 ---
4276
4277 * [HBASE-23590](https://issues.apache.org/jira/browse/HBASE-23590) | *Major* | **Update maxStoreFileRefCount to maxCompactedStoreFileRefCount**
4278
4279 RegionsRecoveryChore introduced as part of HBASE-22460 tries to reopen regions based on config: hbase.regions.recovery.store.file.ref.count.
4280 Region reopen needs to take into consideration all compacted away store files that belong to the region and not store files(non-compacted).
4281
4282 Fixed this bug as part of this Jira.
4283 Updated description for corresponding configs:
4284
4285 1. hbase.master.regions.recovery.check.interval :
4286
4287 Regions Recovery Chore interval in milliseconds. This chore keeps running at this interval to find all regions with configurable max store file ref count and reopens them. Defaults to 20 mins
4288
4289 2. hbase.regions.recovery.store.file.ref.count :
4290
4291 Very large number of ref count on a compacted store file indicates that it is a ref leak on that object(compacted store file). Such files can not be removed after it is invalidated via compaction. Only way to recover in such scenario is to reopen the region which can release all resources, like the refcount, leases, etc. This config represents Store files Ref Count threshold value considered for reopening regions. Any region with compacted store files ref count \> this value would be eligible for reopening by master. Here, we get the max refCount among all refCounts on all compacted away store files that belong to a particular region. Default value -1 indicates this feature is turned off. Only positive integer value should be provided to enable this feature.
4292
4293
4294 ---
4295
4296 * [HBASE-23618](https://issues.apache.org/jira/browse/HBASE-23618) | *Major* | **Add a tool to dump procedure info in the WAL file**
4297
4298 Use ./hbase org.apache.hadoop.hbase.procedure2.store.region.WALProcedurePrettyPrinter to run the tool.
4299
4300
4301 ---
4302
4303 * [HBASE-23617](https://issues.apache.org/jira/browse/HBASE-23617) | *Major* | **Add a stress test tool for region based procedure store**
4304
4305 Use ./hbase org.apache.hadoop.hbase.procedure2.store.region.RegionProcedureStorePerformanceEvaluation to run the tool.
4306
4307
4308 ---
4309
4310 * [HBASE-23326](https://issues.apache.org/jira/browse/HBASE-23326) | *Critical* | **Implement a ProcedureStore which stores procedures in a HRegion**
4311
4312 Use a region based procedure store to replace the old customized WAL based procedure store. The procedure data migration is done automatically during upgrading. After upgrading, the MasterProcWALs directory will be deleted and a new MasterProc directory will be created. And notice that a region will still write WAL so we still have WAL files and they will be moved to the oldWALs directory. The file name is mostly like a normal WAL file, and the only difference is that it is ended with "$masterproc$".
4313
4314
4315 ---
4316
4317 * [HBASE-23320](https://issues.apache.org/jira/browse/HBASE-23320) | *Major* | **Upgrade surefire plugin to 3.0.0-M4**
4318
4319 Bumped surefire plugin to 3.0.0-M4
4320
4321
4322 ---
4323
4324 * [HBASE-20461](https://issues.apache.org/jira/browse/HBASE-20461) | *Major* | **Implement fsync for AsyncFSWAL**
4325
4326 Now AsyncFSWAL also supports Durability.FSYNC\_WAL.
4327
4328
4329 ---
4330
4331 * [HBASE-23066](https://issues.apache.org/jira/browse/HBASE-23066) | *Minor* | **Create a config that forces to cache blocks on compaction**
4332
4333 The configuration 'hbase.rs.cacheblocksonwrite' was used to enable caching the blocks on write. But purposefully we were not caching the blocks when we do compaction (since it may be very aggressive) as the caching happens as and when the writer completes a block.
4334 In cloud environments since they have bigger sized caches - though they try to enable 'hbase.rs.prefetchblocksonopen' (non - aggressive way of caching the blocks proactively on reader creation) it does not help them because it takes time to cache the compacted blocks.
4335 This feature creates a new configuration  'hbase.rs.cachecompactedblocksonwrite' which when set to 'true' will enable the blocks created out of compaction.
4336 Remember that since it is aggressive caching the user should be having enough cache space - if not it may lead to other active blocks getting evicted.
4337 From the shell this can be enabled by using the option per Column Family also by using the below format
4338 {code}
4339 create 't1', 'f1', {NUMREGIONS =\> 15, SPLITALGO =\> 'HexStringSplit', CONFIGURATION =\> {'hbase.rs.cachecompactedblocksonwrite' =\> 'true'}}
4340 {code}
4341
4342
4343 ---
4344
4345 * [HBASE-23239](https://issues.apache.org/jira/browse/HBASE-23239) | *Major* | **Reporting on status of backing MOB files from client-facing cells**
4346
4347 <!-- markdown -->
4348
4349 Users of the MOB feature can now use the `mobrefs` utility to get statistics about data in the MOB system and verify the health of backing files on HDFS.
4350
4351 ```
4352 HADOOP_CLASSPATH=/etc/hbase/conf:$(hbase mapredcp) yarn jar \
4353     /some/path/to/hbase-shaded-mapreduce.jar mobrefs mobrefs-report-output some_table foo
4354 ```
4355
4356 See javadocs of the class `MobRefReporter` for more details.
4357
4358 the reference guide has added some information about MOB internals and troubleshooting.
4359
4360
4361 ---
4362
4363 * [HBASE-23549](https://issues.apache.org/jira/browse/HBASE-23549) | *Minor* | **Document steps to disable MOB for a column family**
4364
4365 The reference guide now includes a walk through of disabling the MOB feature if needed while maintaining availability.
4366
4367
4368 ---
4369
4370 * [HBASE-23582](https://issues.apache.org/jira/browse/HBASE-23582) | *Minor* | **Unbalanced braces in string representation of table descriptor**
4371
4372 Fixed unbalanced braces in string representation within HBase shell
4373
4374
4375 ---
4376
4377 * [HBASE-23293](https://issues.apache.org/jira/browse/HBASE-23293) | *Minor* | **[REPLICATION] make ship edits timeout configurable**
4378
4379 The default rpc timeout for ReplicationSourceShipper#shipEdits is 60s, when bulkload replication enabled, timeout exception may be occurred.
4380 Now we can conf the timeout value through replication.source.shipedits.timeout, and it’s adaptive.
4381
4382
4383 ---
4384
4385 * [HBASE-23312](https://issues.apache.org/jira/browse/HBASE-23312) | *Major* | **HBase Thrift SPNEGO configs (HBASE-19852) should be backwards compatible**
4386
4387 The newer HBase Thrift SPNEGO configs should not be required. The hbase.thrift.spnego.keytab.file and hbase.thrift.spnego.principal configs will fall back to the hbase.thrift.keytab.file and hbase.thrift.kerberos.principal original configs. The older configs will log a deprecation warning. It is preferred to new the newer SPNEGO configurations.
4388
4389
4390 ---
4391
4392 * [HBASE-22969](https://issues.apache.org/jira/browse/HBASE-22969) | *Minor* | **A new binary component comparator(BinaryComponentComparator) to perform comparison of arbitrary length and position**
4393
4394 With BinaryComponentCompartor applications will be able to design diverse and powerful set of filters for rows and columns. See https://issues.apache.org/jira/browse/HBASE-22969 for example. In general, the comparator can be used with any filter taking ByteArrayComparable. As of now, following filters take ByteArrayComparable:
4395
4396 1. RowFilter
4397 2. ValueFilter
4398 3. QualifierFilter
4399 4. FamilyFilter
4400 5. ColumnValueFilter
4401
4402
4403 ---
4404
4405 * [HBASE-23234](https://issues.apache.org/jira/browse/HBASE-23234) | *Major* | **Provide .editorconfig based on checkstyle configuration**
4406
4407 Adds a .editorconfig file with configurations populated by IntelliJ, based on our checkstyle configuration. There's lots of IntelliJ-specific configs in here that I assume are not replicated to Eclipse or Netbeans users. Any devs using those tools should push whatever updates they see fit, but please start with the checkstyle configs as the origin of truth.
4408
4409
4410 ---
4411
4412 * [HBASE-23322](https://issues.apache.org/jira/browse/HBASE-23322) | *Minor* | **[hbck2] Simplification on HBCKSCP scheduling**
4413
4414 An hbck2 scheduleRecoveries will run a subclass of ServerCrashProcedure which asks Master what Regions were on the dead Server but it will also do a hbase:meta table scan to see if any vestiges of the old Server remain (for the case where an SCP failed mid-point leaving references in place or where Master and hbase:meta deviated in accounting).
4415
4416
4417 ---
4418
4419 * [HBASE-23321](https://issues.apache.org/jira/browse/HBASE-23321) | *Minor* | **[hbck2] fixHoles of fixMeta doesn't update in-memory state**
4420
4421 If holes in hbase:meta, hbck2 fixMeta now will update Master in-memory state so you do not need to restart master just so you can assign the new hole-bridging regions.
4422
4423
4424 ---
4425
4426 * [HBASE-23282](https://issues.apache.org/jira/browse/HBASE-23282) | *Major* | **HBCKServerCrashProcedure for 'Unknown Servers'**
4427
4428 hbck2 scheduleRecoveries will now run a SCP that also looks in hbase:meta for any references to the scheduled server -- not just consult Master in-memory state -- just in case vestiges of the server are leftover in hbase:meta
4429
4430
4431 ---
4432
4433 * [HBASE-19450](https://issues.apache.org/jira/browse/HBASE-19450) | *Minor* | **Add log about average execution time for ScheduledChore**
4434
4435 <!-- markdown -->
4436 HBase internal chores now log a moving average of how long execution of each chore takes at `INFO` level for the logger `org.apache.hadoop.hbase.ScheduledChore`.
4437
4438 Such messages will happen at most once per five minutes.
4439
4440
4441 ---
4442
4443 * [HBASE-23250](https://issues.apache.org/jira/browse/HBASE-23250) | *Minor* | **Log message about CleanerChore delegate initialization should be at INFO**
4444
4445 CleanerChore delegate initialization is now logged at INFO level instead of DEBUG
4446
4447
4448 ---
4449
4450 * [HBASE-23243](https://issues.apache.org/jira/browse/HBASE-23243) | *Major* | **[pv2] Filter out SUCCESS procedures; on decent-sized cluster, plethora overwhelms problems**
4451
4452 The 'Procedures & Locks' tab in Master UI only displays problematic Procedures now (RUNNABLE, WAITING-TIMEOUT, etc.). It no longer notes procedures whose state is SUCCESS.
4453
4454
4455 ---
4456
4457 * [HBASE-23227](https://issues.apache.org/jira/browse/HBASE-23227) | *Blocker* | **Upgrade jackson-databind to 2.9.10.1 to avoid recent CVEs**
4458
4459 <!-- markdown -->
4460
4461 the Apache HBase REST Proxy now uses Jackson Databind version 2.9.10.1 to address the following CVEs
4462
4463   - CVE-2019-16942
4464   - CVE-2019-16943
4465
4466 Users of prior releases with Jackson Databind 2.9.10 are advised to either upgrade to this release or to upgrade their local Jackson Databind jar directly.
4467
4468
4469 ---
4470
4471 * [HBASE-23222](https://issues.apache.org/jira/browse/HBASE-23222) | *Critical* | **Better logging and mitigation for MOB compaction failures**
4472
4473 <!-- markdown -->
4474
4475 The MOB compaction process in the HBase Master now logs more about its activity.
4476
4477 In the event that you run into the problems described in HBASE-22075, there is a new HFileCleanerDelegate that will stop all removal of MOB hfiles from the archive area. It can be configured by adding `org.apache.hadoop.hbase.mob.ManualMobMaintHFileCleaner` to the list configured for `hbase.master.hfilecleaner.plugins`. This new cleaner delegate will cause your archive area to grow unbounded; you will have to manually prune files which may be prohibitively complex. Consider if your use case will allow you to mitigate by disabling mob compactions instead.
4478
4479 Caveats:
4480 * Be sure the list of cleaner delegates still includes the default cleaners you will likely need: ttl, snapshot, and hlink.
4481 * Be mindful that if you enable this cleaner delegate then there will be *no* automated process for removing these mob hfiles. You should see a single region per table in `%hbase_root%/archive` that accumulates files over time. You will have to determine which of these files are safe or not to remove.
4482 * You should list this cleaner delegate after the snapshot and hlink delegates so that you can enable sufficient logging to determine when an archived mob hfile is needed by those subsystems. When set to `TRACE` logging, the CleanerChore logger will include archive retention decision justifications.
4483 * If your use case creates a large number of uniquely named tables, this new delegate will cause memory pressure on the master.
4484
4485
4486 ---
4487
4488 * [HBASE-15519](https://issues.apache.org/jira/browse/HBASE-15519) | *Major* | **Add per-user metrics**
4489
4490 Adds per-user metrics for reads/writes to each RegionServer. These metrics are exported by default. hbase.regionserver.user.metrics.enabled can be used to disable the feature if desired for any reason.
4491
4492
4493 ---
4494
4495 * [HBASE-22460](https://issues.apache.org/jira/browse/HBASE-22460) | *Minor* | **Reopen a region if store reader references may have leaked**
4496
4497 Leaked store files can not be removed even after it is invalidated via compaction. A reasonable mitigation for a reader reference leak would be a fast reopen of the region on the same server.
4498
4499 Configs:
4500
4501 1. hbase.master.regions.recovery.check.interval :
4502
4503 Regions Recovery Chore interval in milliseconds. This chore keeps running at this interval to find all regions with configurable max store file ref count and reopens them. Defaults to 20 mins
4504
4505 2. hbase.regions.recovery.store.file.ref.count :
4506
4507 This config represents Store files Ref Count threshold value considered for reopening regions. Any region with store files ref count \> this value would be eligible for reopening by master. Default value -1 indicates this feature is turned off. Only positive integer value should be provided to enable this feature.
4508
4509
4510 ---
4511
4512 * [HBASE-23172](https://issues.apache.org/jira/browse/HBASE-23172) | *Minor* | **HBase Canary region success count metrics reflect column family successes, not region successes**
4513
4514 Added a comment to make clear that read/write success counts are tallying column family success counts, not region success counts.
4515
4516 Additionally, the region read and write latencies previously only stored the latencies of the last column family of the region reads/writes. This has been fixed by using a map of each region to a list of read and write latency values.
4517
4518
4519 ---
4520
4521 * [HBASE-23177](https://issues.apache.org/jira/browse/HBASE-23177) | *Major* | **If fail to open reference because FNFE, make it plain it is a Reference**
4522
4523 Changes the message on the FNFE exception thrown when the file a Reference points to is missing; the message now includes detail on Reference as well as pointed-to file so can connect how FNFE relates to region open.
4524
4525
4526 ---
4527
4528 * [HBASE-20626](https://issues.apache.org/jira/browse/HBASE-20626) | *Major* | **Change the value of "Requests Per Second" on WEBUI**
4529
4530 Use 'totalRowActionRequestCount' to calculate QPS on web UI.
4531
4532
4533 ---
4534
4535 * [HBASE-22874](https://issues.apache.org/jira/browse/HBASE-22874) | *Critical* | **Define a public interface for Canary and move existing implementation to LimitedPrivate**
4536
4537 <!-- markdown -->
4538 Downstream users who wish to programmatically check the health of their HBase cluster may now rely on a public interface derived from the previously private implementation of the canary cli tool. The interface is named `Canary` and can be found in the user facing javadocs.
4539
4540 Downstream users who previously relied on the invoking the canary via the Java classname (either on the command line or programmatically) will need to change how they do so because the non-public implementation has moved.
4541
4542
4543 ---
4544
4545 * [HBASE-23035](https://issues.apache.org/jira/browse/HBASE-23035) | *Major* | **Retain region to the last RegionServer make the failover slower**
4546
4547 Since 2.0.0，when one regionserver crashed and back online again, AssignmentManager will retain the region locations and try assign the regions to this regionserver(same host:port with the crashed one) again. But for 1.x.x, the behavior is round-robin assignment for the regions belong to the crashed regionserver. This jira change the "retain" assignment to round-robin assignment, which is same with 1.x.x version. This change will make the failover faster and improve availability.
4548
4549
4550 ---
4551
4552 * [HBASE-23046](https://issues.apache.org/jira/browse/HBASE-23046) | *Minor* | **Remove compatibility case from truncate command**
4553
4554 Remove backward compatibility from \`truncate\` and \`truncate\_preserve\` shell commands. This means that these commands from HBase Clients are not compatible with pre-0.99 HBase clusters.
4555
4556
4557 ---
4558
4559 * [HBASE-23040](https://issues.apache.org/jira/browse/HBASE-23040) | *Minor* | **region mover gives NullPointerException instead of saying a host isn't in the cluster**
4560
4561 giving the region mover "unload" command a region server name that isn't recognized by the cluster results in a "I don't know about that host" message instead of a NPE.
4562
4563 set log level to DEBUG if you'd like the region mover to log the set of region server names it got back from the cluster.
4564
4565
4566 ---
4567
4568 * [HBASE-21874](https://issues.apache.org/jira/browse/HBASE-21874) | *Major* | **Bucket cache on Persistent memory**
4569
4570 Added a new IOEngine type for Bucket cache ie Persistent memory. In order to use BC over pmem configure IOEngine as
4571 \<property\>
4572     \<name\>hbase.bucketcache.ioengine\</name\>
4573     \<value\> pmem:///path in persistent memory \</value\>
4574   \</property\>
4575
4576
4577 ---
4578
4579 * [HBASE-22760](https://issues.apache.org/jira/browse/HBASE-22760) | *Major* | **Stop/Resume Snapshot Auto-Cleanup activity with shell command**
4580
4581 By default, snapshot auto cleanup based on TTL would be enabled for any new cluster. At any point in time, if snapshot cleanup is supposed to be stopped due to some snapshot restore activity or any other reason, it is advisable to disable it using shell command:
4582 hbase\> snapshot\_cleanup\_switch false
4583
4584 We can re-enable it using:
4585 hbase\> snapshot\_cleanup\_switch true
4586
4587 We can query whether snapshot auto cleanup is enabled for cluster using:
4588 hbase\> snapshot\_cleanup\_enabled
4589
4590
4591 ---
4592
4593 * [HBASE-22796](https://issues.apache.org/jira/browse/HBASE-22796) | *Major* | **[HBCK2] Add fix of overlaps to fixMeta hbck Service**
4594
4595 Adds fix of overlaps to the fixMeta hbck service method. Uses the bulk-merge facility. Merges a max of 10 at a time. Set hbase.master.metafixer.max.merge.count to higher if you want to do more than 10 in the one go.
4596
4597
4598 ---
4599
4600 * [HBASE-21745](https://issues.apache.org/jira/browse/HBASE-21745) | *Critical* | **Make HBCK2 be able to fix issues other than region assignment**
4601
4602 This issue adds via its subtasks:
4603
4604  \* An 'HBCK Report' page to the Master UI added by HBASE-22527+HBASE-22709+HBASE-22723+ (since 2.1.6, 2.2.1, 2.3.0). Lists consistency or anomalies found via new hbase:meta consistency checking extensions added to CatalogJanitor (holes, overlaps, bad servers) and by a new 'HBCK chore' that runs at a lesser periodicity that will note filesystem orphans and overlaps as well as the following conditions:
4605  \*\* Master thought this region opened, but no regionserver reported it.
4606  \*\* Master thought this region opened on Server1, but regionserver reported Server2
4607  \*\* More than one regionservers reported opened this region
4608  Both chores can be triggered from the shell to regenerate ‘new’ reports.
4609  \* Means of scheduling a ServerCrashProcedure (HBASE-21393).
4610  \* An ‘offline’ hbase:meta rebuild (HBASE-22680).
4611  \* Offline replace of hbase.version and hbase.id
4612  \* Documentation on how to use completebulkload tool to ‘adopt’ orphaned data found by new HBCK2 ‘filesystem’ check (see below) and ‘HBCK chore’ (HBASE-22859)
4613  \* A ‘holes’ and ‘overlaps’ fix that runs in the master that uses new bulk-merge facility to collapse many overlaps in the one go.
4614  \* hbase-operator-tools HBCK2 client tool got a bunch of additions:
4615  \*\* A specialized 'fix' for the case where operators ran old hbck 'offlinemeta' repair and destroyed their hbase:meta; it ties together holes in meta with orphaned data in the fs (HBASE-22567)
4616  \*\* A ‘filesystem’ command that reports on orphan data as well as bad references and hlinks with a ‘fix’ for the latter two options (based on hbck1 facility updated).
4617  \*\* Adds back the ‘replication’ fix facility from hbck1 (HBASE-22717)
4618
4619 The compound result is that hbck2 is now in excess of hbck1 abilities. The provided functionality is disaggregated as per the hbck2 philosophy of providing 'plumbing' rather than 'porcelain' so there is work to do still adding fix-it playbooks, scripting across outages, and automation.
4620
4621
4622 ---
4623
4624 * [HBASE-22802](https://issues.apache.org/jira/browse/HBASE-22802) | *Major* | **Avoid temp ByteBuffer allocation in FileIOEngine#read**
4625
4626 HBASE-21879 introduces a utility class (org.apache.hadoop.hbase.io.ByteBuffAllocator) used for allocating/freeing ByteBuffers from/to NIO ByteBuffer pool, when BucketCache enabled with file or mmap engine, we will use this ByteBuffer pool to avoid temp ByteBuffer allocation a lot.
4627
4628
4629 ---
4630
4631 * [HBASE-11062](https://issues.apache.org/jira/browse/HBASE-11062) | *Major* | **hbtop**
4632
4633 Introduces hbtop that's a real-time monitoring tool for HBase like Unix's top command. See the ref guide for the details: https://hbase.apache.org/book.html#hbtop
4634
4635
4636 ---
4637
4638 * [HBASE-21879](https://issues.apache.org/jira/browse/HBASE-21879) | *Major* | **Read HFile's block to ByteBuffer directly instead of to byte for reducing young gc purpose**
4639
4640 Before this issue, read path was 100% offheap when block is in the BucketCache. But if a cache miss, then the RS needs to read the block via an on-heap API which causes high young-GC pressure.
4641
4642 This issue adds reading the block via offheap even if reading the block from filesystem directly.  It requires hadoop version(\>=2.9.3) but can also work with older hadoop versions (all works but we continue to read block onheap). It also requires HBASE-21946 which is not yet in place as of this writing/hbase-2.3.0.
4643
4644 We have written a careful doc about the implementation, performance and practice here: https://docs.google.com/document/d/1xSy9axGxafoH-Qc17zbD2Bd--rWjjI00xTWQZ8ZwI\_E/edit#heading=h.nch5d72p27ex
4645
4646
4647 ---
4648
4649 * [HBASE-22618](https://issues.apache.org/jira/browse/HBASE-22618) | *Major* | **added the possibility to load custom cost functions**
4650
4651 <!-- markdown -->
4652 Extends `StochasticLoadBalancer` to support user-provided cost function. These are loaded in addition to the default set of cost functions. Custom function implementations must extend `StochasticLoadBalancer$CostFunction`. Enable any additional functions by placing them on the master class path and configuring `hbase.master.balancer.stochastic.additionalCostFunctions` with a comma-separated list of fully-qualified class names.
4653
4654
4655 ---
4656
4657 * [HBASE-22867](https://issues.apache.org/jira/browse/HBASE-22867) | *Critical* | **The ForkJoinPool in CleanerChore will spawn thousands of threads in our cluster with thousands table**
4658
4659 Replace the ForkJoinPool in CleanerChore by ThreadPoolExecutor which can limit the spawn thread size and avoid  the master GC frequently.  The replacement is an internal implementation in CleanerChore,  so no config key change, the upstream users can just upgrade the hbase master without any other change.
4660
4661
4662 ---
4663
4664 * [HBASE-22810](https://issues.apache.org/jira/browse/HBASE-22810) | *Major* | **Initialize an separate ThreadPoolExecutor for taking/restoring snapshot**
4665
4666 Introduced a new config key for the snapshot taking/restoring operations at master side:  hbase.master.executor.snapshot.threads, its default value is 3.  means we can have 3 snapshot operations running at the same time.
4667
4668
4669 ---
4670
4671 * [HBASE-22863](https://issues.apache.org/jira/browse/HBASE-22863) | *Major* | **Avoid Jackson versions and dependencies with known CVEs**
4672
4673 1. Stopped exposing vulnerable Jackson1 dependencies so that downstreamers would not pull it in from HBase.
4674 2. However, since Hadoop requires some Jackson1 dependencies, put vulnerable Jackson mapper at test scope in some HBase modules and hence, HBase tarball created by hbase-assembly contains Jackson1 mapper jar in lib. Still, downsteam applications can't pull in Jackson1 from HBase.
4675
4676
4677 ---
4678
4679 * [HBASE-22841](https://issues.apache.org/jira/browse/HBASE-22841) | *Major* | **TimeRange's factory functions do not support ranges, only \`allTime\` and \`at\`**
4680
4681 Add serveral API in TimeRange class for avoiding using the deprecated TimeRange constructor:
4682 \* TimeRange#from: Represents the time interval [minStamp, Long.MAX\_VALUE)
4683 \* TimeRange#until: Represents the time interval [0, maxStamp)
4684 \* TimeRange#between: Represents the time interval [minStamp, maxStamp)
4685
4686
4687 ---
4688
4689 * [HBASE-22833](https://issues.apache.org/jira/browse/HBASE-22833) | *Minor* | **MultiRowRangeFilter should provide a method for creating a filter which is functionally equivalent to multiple prefix filters**
4690
4691 Provide a public method in MultiRowRangeFilter class to speed the requirement of filtering with multiple row prefixes, it will expand the row prefixes as multiple rowkey ranges by MultiRowRangeFilter, it's more efficient.
4692 {code}
4693 public MultiRowRangeFilter(byte[][] rowKeyPrefixes);
4694 {code}
4695
4696
4697 ---
4698
4699 * [HBASE-22856](https://issues.apache.org/jira/browse/HBASE-22856) | *Major* | **HBASE-Find-Flaky-Tests fails with pip error**
4700
4701 Update the base docker image to ubuntu 18.04 for the find flaky tests jenkins job.
4702
4703
4704 ---
4705
4706 * [HBASE-22771](https://issues.apache.org/jira/browse/HBASE-22771) | *Major* | **[HBCK2] fixMeta method and server-side support**
4707
4708 Adds a fixMeta method to hbck Service. Fixes holes in hbase:meta. Follow-up to fix overlaps. See HBASE-22567 also.
4709
4710 Follow-on is adding a client-side to hbase-operator-tools that can exploit this new addition (HBASE-22825)
4711
4712
4713 ---
4714
4715 * [HBASE-22777](https://issues.apache.org/jira/browse/HBASE-22777) | *Major* | **Add a multi-region merge (for fixing overlaps, etc.)**
4716
4717 Changes merge so you can merge more than two regions at a time.  Currently only available inside HBase. HBASE-22827, a follow-on, is about exposing the facility in the Admin API (and then via the shell).
4718
4719
4720 ---
4721
4722 * [HBASE-15666](https://issues.apache.org/jira/browse/HBASE-15666) | *Critical* | **shaded dependencies for hbase-testing-util**
4723
4724 New shaded artifact for testing: hbase-shaded-testing-util.
4725
4726
4727 ---
4728
4729 * [HBASE-22776](https://issues.apache.org/jira/browse/HBASE-22776) | *Major* | **Rename config names in user scan snapshot feature**
4730
4731 After HBASE-22776, the steps to config user scan snapshot feature is as followings:
4732 1. Check HDFS configuration
4733 2. Add master coprocessor:
4734     hbase.coprocessor.master.classes=
4735     “org.apache.hadoop.hbase.security.access.AccessController,
4736 org.apache.hadoop.hbase.security.access.SnapshotScannerHDFSAclController”
4737 3. Enable this feature:
4738     hbase.acl.sync.to.hdfs.enable=true
4739 4. Modify table scheme to enable this feature for a table:
4740     alter 't1', CONFIGURATION =\> {'hbase.acl.sync.to.hdfs.enable' =\> 'true'}
4741
4742
4743 ---
4744
4745 * [HBASE-22539](https://issues.apache.org/jira/browse/HBASE-22539) | *Blocker* | **WAL corruption due to early DBBs re-use when Durability.ASYNC\_WAL is used**
4746
4747 We found a critical bug which can lead to WAL corruption when Durability.ASYNC\_WAL is used. The reason is that we release a ByteBuffer before actually persist the content into WAL file.
4748
4749 The problem maybe lead to several errors, for example, ArrayIndexOfOutBounds when replaying WAL. This is because that the ByteBuffer is reused by others.
4750
4751 ERROR org.apache.hadoop.hbase.executor.EventHandler: Caught throwable while processing event RS\_LOG\_REPLAY
4752 java.lang.ArrayIndexOutOfBoundsException: 18056
4753         at org.apache.hadoop.hbase.KeyValue.getFamilyLength(KeyValue.java:1365)
4754         at org.apache.hadoop.hbase.KeyValue.getFamilyLength(KeyValue.java:1358)
4755         at org.apache.hadoop.hbase.PrivateCellUtil.matchingFamily(PrivateCellUtil.java:735)
4756         at org.apache.hadoop.hbase.CellUtil.matchingFamily(CellUtil.java:816)
4757         at org.apache.hadoop.hbase.wal.WALEdit.isMetaEditFamily(WALEdit.java:143)
4758         at org.apache.hadoop.hbase.wal.WALEdit.isMetaEdit(WALEdit.java:148)
4759         at org.apache.hadoop.hbase.wal.WALSplitter.splitLogFile(WALSplitter.java:297)
4760         at org.apache.hadoop.hbase.wal.WALSplitter.splitLogFile(WALSplitter.java:195)
4761         at org.apache.hadoop.hbase.regionserver.SplitLogWorker$1.exec(SplitLogWorker.java:100)
4762
4763 And may even cause segmentation fault and crash the JVM directly. You will see a hs\_err\_pidXXX.log file and usually the problem is SIGSEGV. This is usually because that the ByteBuffer has already been returned to the OS and used for other purpose.
4764
4765 The problem has been reported several times in the past and this time Wellington Ramos Chevreuil provided the full logs and deeply analyzed the logs so we can find the root cause. And Lijin Bin figured out that the problem may only happen when Durability.ASYNC\_WAL is used. Thanks to them.
4766
4767 The problem only effects the 2.x releases, all users are highly recommand to upgrade to a release which has this fix in, especially that if you use Durability.ASYNC\_WAL.
4768
4769
4770 ---
4771
4772 * [HBASE-22737](https://issues.apache.org/jira/browse/HBASE-22737) | *Major* | **Add a new admin method and shell cmd to trigger the hbck chore to run**
4773
4774 Add a new method runHbckChore in Hbck interface and a new shell cmd hbck\_chore\_run to request HBCK chore to run at master side.
4775
4776
4777 ---
4778
4779 * [HBASE-22741](https://issues.apache.org/jira/browse/HBASE-22741) | *Major* | **Show catalogjanitor consistency complaints in new 'HBCK Report' page**
4780
4781 Adds a "CatalogJanitor hbase:meta Consistency Issues" section to the new 'HBCK Report' page added by HBASE-22709. This section is empty unless the most recent CatalogJanitor scan turned up problems. If so, will show table of issues found.
4782
4783
4784 ---
4785
4786 * [HBASE-22723](https://issues.apache.org/jira/browse/HBASE-22723) | *Major* | **Have CatalogJanitor report holes and overlaps; i.e. problems it sees when doing its regular scan of hbase:meta**
4787
4788 When CatalogJanitor runs, it now checks for holes, overlaps, empty info:regioninfo columns and bad servers. Dumps findings into log. Follow-up adds report to new 'HBCK Report' linked off the Master UI.
4789
4790 NOTE: All features but the badserver check made it into branch-2.1 and branch-2.0 backports.
4791
4792
4793 ---
4794
4795 * [HBASE-22714](https://issues.apache.org/jira/browse/HBASE-22714) | *Trivial* | **BuffferedMutatorParams opertationTimeOut() is misspelt**
4796
4797 The misspelled BufferedMutatorParams.opertationTimeout method has been marked as deprecated, and will be removed in 4.0.0. Please use the BufferedMutatorParams.operationTimeout method instead.
4798
4799
4800 ---
4801
4802 * [HBASE-22580](https://issues.apache.org/jira/browse/HBASE-22580) | *Major* | **Add a table attribute to make user scan snapshot feature configurable for table**
4803
4804 If a table user scan snapshots of the table, please config the following table scheme attribute to make granted users' ACLs are added to hfiles:
4805 alter 't1', CONFIGURATION =\> {'hbase.user.scan.snapshot.enable' =\> 'true'}
4806
4807
4808 ---
4809
4810 * [HBASE-22709](https://issues.apache.org/jira/browse/HBASE-22709) | *Major* | **Add a chore thread in master to do hbck checking and display results in 'HBCK Report' page**
4811
4812 1. Add a new chore thread in master to do hbck checking
4813 2. Add a new web ui "HBCK Report" page to display checking results.
4814
4815 This feature is enabled by default. And the hbck chore run per 60 minutes by default. You can config "hbase.master.hbck.checker.interval" to a value lesser than or equal to 0 for disabling the chore.
4816
4817 Notice: the config "hbase.master.hbck.checker.interval" was renamed to "hbase.master.hbck.chore.interval" in HBASE-22737.
4818
4819
4820 ---
4821
4822 * [HBASE-22578](https://issues.apache.org/jira/browse/HBASE-22578) | *Major* | **HFileCleaner should not delete empty ns/table directories used for user san snapshot feature**
4823
4824 The HFileCleaner will clean the empty directories under archive, but if enable user scan snaphot feature, the user ACLs are set at there directories, so please config the following cleaner to make the directories with user ACLs not be cleaned:
4825 hbase.master.hfilecleaner.plugins=org.apache.hadoop.hbase.security.access.SnapshotScannerHDFSAclCleaner
4826
4827
4828 ---
4829
4830 * [HBASE-22722](https://issues.apache.org/jira/browse/HBASE-22722) | *Blocker* | **Upgrade jackson databind dependencies to 2.9.9.1**
4831
4832 Upgrade jackson databind dependency to 2.9.9.1 due to CVEs
4833
4834 https://nvd.nist.gov/vuln/detail/CVE-2019-12814
4835
4836 https://nvd.nist.gov/vuln/detail/CVE-2019-12384
4837
4838
4839 ---
4840
4841 * [HBASE-22527](https://issues.apache.org/jira/browse/HBASE-22527) | *Major* | **[hbck2] Add a master web ui to show the problematic regions**
4842
4843 Add a new master web UI to show the potentially problematic opened regions. There are three case:
4844 1. Master thought this region opened, but no regionserver reported it.
4845 2. Master thought this region opened on Server1, but regionserver reported Server2
4846 3. More than one regionservers reported opened this region
4847
4848
4849 ---
4850
4851 * [HBASE-22648](https://issues.apache.org/jira/browse/HBASE-22648) | *Minor* | **Snapshot TTL**
4852
4853 Feature: Take a Snapshot With TTL for auto-cleanup
4854
4855 Attribute:
4856 1. TTL
4857      - Specify TTL in sec while creating snapshot. e.g. snapshot 'mytable', 'snapshot1234', {TTL =\> 86400}  (snapshot to be auto-cleaned after 24 hr)
4858
4859 Configs:
4860 1. Default Snapshot TTL:
4861      - FOREVER by default
4862      - User specified Default TTL(sec) with config: hbase.master.snapshot.ttl
4863
4864 2. If Snapshot cleanup is supposed to be stopped due to some snapshot restore activity, disable it with config:
4865      - hbase.master.cleaner.snapshot.disable: "true"
4866     With this config, HMaster needs restart just like any other hbase-site config.
4867
4868
4869 For more details, see the section "Take a Snapshot With TTL" in the HBase Reference Guide.
4870
4871
4872 ---
4873
4874 * [HBASE-22610](https://issues.apache.org/jira/browse/HBASE-22610) | *Trivial* | **[BucketCache] Rename "hbase.offheapcache.minblocksize"**
4875
4876 The config point "hbase.offheapcache.minblocksize" was wrong and is now deprecated. The new config point is "hbase.blockcache.minblocksize".
4877
4878
4879 ---
4880
4881 * [HBASE-22690](https://issues.apache.org/jira/browse/HBASE-22690) | *Major* | **Deprecate / Remove OfflineMetaRepair in hbase-2+**
4882
4883 OfflineMetaRepair is no longer supported in HBase-2+. Please refer to https://hbase.apache.org/book.html#HBCK2
4884
4885 This tool is deprecated in 2.x and will be removed in 3.0.
4886
4887
4888 ---
4889
4890 * [HBASE-22673](https://issues.apache.org/jira/browse/HBASE-22673) | *Major* | **Avoid to expose protobuf stuff in Hbck interface**
4891
4892 Mark the Hbck#scheduleServerCrashProcedure(List\<HBaseProtos.ServerName\> serverNames) as deprecated. Use Hbck#scheduleServerCrashProcedures(List\<ServerName\> serverNames) instead.
4893
4894
4895 ---
4896
4897 * [HBASE-22617](https://issues.apache.org/jira/browse/HBASE-22617) | *Blocker* | **Recovered WAL directories not getting cleaned up**
4898
4899 In HBASE-20734 we moved the recovered.edits onto the wal file system but when constructing the directory we missed the BASE\_NAMESPACE\_DIR('data'). So when using the default config, you will find that there are lots of new directories at the same level with the 'data' directory.
4900
4901 In this issue, we add the BASE\_NAMESPACE\_DIR back, and also try our best to clean up the wrong directories. But we can only clean up the region level directories, so if you want a clean fs layout on HDFS you still need to manually delete the empty directories at the same level with 'data'.
4902
4903 The effect versions are 2.2.0, 2.1.[1-5], 1.4.[8-10], 1.3.[3-5].
4904
4905
4906 ---
4907
4908 * [HBASE-21995](https://issues.apache.org/jira/browse/HBASE-21995) | *Major* | **Add a coprocessor to set HDFS ACL for hbase granted user**
4909
4910 Add a coprocessor to set HDFS acls to make hbase granted users with READ permission have the access to scan snapshots.
4911 To use this feature, please make sure the HDFS config is set:
4912 dfs.namenode.acls.enabled=true
4913 fs.permissions.umask-mode=027
4914
4915 and set the HBase config:
4916 hbase.coprocessor.master.classes="org.apache.hadoop.hbase.security.access.AccessController,org.apache.hadoop.hbase.security.access.SnapshotScannerHDFSAclController"
4917 hbase.user.scan.snapshot.enable=true
4918
4919
4920 ---
4921
4922 * [HBASE-22596](https://issues.apache.org/jira/browse/HBASE-22596) | *Minor* | **[Chore] Separate the execution period between CompactionChecker and PeriodicMemStoreFlusher**
4923
4924 hbase.regionserver.compaction.check.period is used for controlling how often the compaction checker runs. If unset, will use hbase.server.thread.wakefrequency as default value.
4925
4926 hbase.regionserver.flush.check.period is used for controlling how ofter the flush checker runs. If unset, will use hbase.server.thread.wakefrequency as default value.
4927
4928
4929 ---
4930
4931 * [HBASE-22588](https://issues.apache.org/jira/browse/HBASE-22588) | *Major* | **Upgrade jaxws-ri dependency to 2.3.2**
4932
4933 <!-- markdown -->
4934
4935 When run with JDK11 HBase now uses more recent version of the jaxws reference implementation (v2.3.2).
4936
4937
4938 ---
4939
4940 * [HBASE-21536](https://issues.apache.org/jira/browse/HBASE-21536) | *Trivial* | **Fix completebulkload usage instructions**
4941
4942 Added completebulkload short name for BulkLoadHFilesTool to bin/hbase.
4943
4944
4945 ---
4946
4947 * [HBASE-22500](https://issues.apache.org/jira/browse/HBASE-22500) | *Blocker* | **Modify pom and jenkins jobs for hadoop versions**
4948
4949 Change the default hadoop-3 version to 3.1.2. Drop the support for the releases which are effected by CVE-2018-8029, see this email https://lists.apache.org/thread.html/3d6831c3893cd27b6850aea2feff7d536888286d588e703c6ffd2e82@%3Cuser.hadoop.apache.org%3E
4950
4951
4952 ---
4953
4954 * [HBASE-22459](https://issues.apache.org/jira/browse/HBASE-22459) | *Minor* | **Expose store reader reference count**
4955
4956 This change exposes the aggregate count of store reader references for a given store as 'storeRefCount' in region metrics and ClusterStatus.
4957
4958
4959 ---
4960
4961 * [HBASE-22469](https://issues.apache.org/jira/browse/HBASE-22469) | *Minor* | **replace md5 checksum in saveVersion script with sha512 for hbase version information**
4962
4963 The HBase "source checksum" now uses SHA512 instead of MD5.
4964
4965
4966 ---
4967
4968 * [HBASE-22148](https://issues.apache.org/jira/browse/HBASE-22148) | *Blocker* | **Provide an alternative to CellUtil.setTimestamp**
4969
4970 <!-- markdown -->
4971
4972 The `CellUtil.setTimestamp` method changes to be an API with audience `LimitedPrivate(COPROC)` in HBase 3.0. With that designation the API should remain stable within a given minor release line, but may change between minor releases.
4973
4974 Previously, this method was deprecated in HBase 2.0 for removal in HBase 3.0. Deprecation messages in HBase 2.y releases have been updated to indicate the expected API audience change.
4975
4976
4977 ---
4978
4979 * [HBASE-20782](https://issues.apache.org/jira/browse/HBASE-20782) | *Minor* | **Fix duplication of TestServletFilter.access**
4980
4981 The access method was used to the HttpServerFunctionalTest class as a common place.
4982
4983
4984 ---
4985
4986 * [HBASE-21991](https://issues.apache.org/jira/browse/HBASE-21991) | *Major* | **Fix MetaMetrics issues - [Race condition, Faulty remove logic], few improvements**
4987
4988 The class LossyCounting was unintentionally marked Public but was never intended to be part of our public API. This oversight has been corrected and LossyCounting is now marked as Private and going forward may be subject to additional breaking changes or removal without notice. If you have taken a dependency on this class we recommend cloning it locally into your project before upgrading to this release.
4989
4990
4991 ---
4992
4993 * [HBASE-22226](https://issues.apache.org/jira/browse/HBASE-22226) | *Trivial* | **Incorrect level for headings in asciidoc**
4994
4995 Warnings for level headings are corrected in the book for the HBase Incompatibilities section.
4996
4997
4998 ---
4999
5000 * [HBASE-20970](https://issues.apache.org/jira/browse/HBASE-20970) | *Major* | **Update hadoop check versions for hadoop3 in hbase-personality**
5001
5002 Add hadoop 3.0.3, 3.1.1 3.1.2 in our hadoop check jobs.
5003
5004
5005 ---
5006
5007 * [HBASE-21784](https://issues.apache.org/jira/browse/HBASE-21784) | *Major* | **Dump replication queue should show list of wal files ordered chronologically**
5008
5009 The DumpReplicationQueues tool will now list replication queues sorted in chronological order.
5010
5011
5012 ---
5013
5014 * [HBASE-21048](https://issues.apache.org/jira/browse/HBASE-21048) | *Major* | **Get LogLevel is not working from console in secure environment**
5015
5016 Support get\|set LogLevel in secure(kerberized) environment.
5017
5018
5019 ---
5020
5021 * [HBASE-22384](https://issues.apache.org/jira/browse/HBASE-22384) | *Minor* | **Formatting issues in administration section of book**
5022
5023 Fixes a formatting issue in the administration section of the book, where listing indentation were a little bit off.
5024
5025
5026 ---
5027
5028 * [HBASE-22377](https://issues.apache.org/jira/browse/HBASE-22377) | *Major* | **Provide API to check the existence of a namespace which does not require ADMIN permissions**
5029
5030 This change adds the new method listNamespaces to the Admin interface, which can be used to retrieve a list of the namespaces present in the schema as an unprivileged operation. Formerly the only available method for accomplishing this was listNamespaceDescriptors, which requires GLOBAL CREATE or ADMIN permissions.
5031
5032
5033 ---
5034
5035 * [HBASE-22399](https://issues.apache.org/jira/browse/HBASE-22399) | *Major* | **Change default hadoop-two.version to 2.8.x and remove the 2.7.x hadoop checks**
5036
5037 Now the default hadoop-two.version has been changed to 2.8.5, and all hadoop versions before 2.8.2(exclude) will not be supported any more.
5038
5039
5040 ---
5041
5042 * [HBASE-22392](https://issues.apache.org/jira/browse/HBASE-22392) | *Trivial* | **Remove extra/useless +**
5043
5044 Removed extra + in HRegion, HStore and LoadIncrementalHFiles for branch-2 and HRegion and HStore for branch-1.
5045
5046
5047 ---
5048
5049 * [HBASE-20494](https://issues.apache.org/jira/browse/HBASE-20494) | *Major* | **Upgrade com.yammer.metrics dependency**
5050
5051 Updated metrics core from 3.2.1 to 3.2.6.
5052
5053
5054 ---
5055
5056 * [HBASE-22358](https://issues.apache.org/jira/browse/HBASE-22358) | *Minor* | **Change rubocop configuration for method length**
5057
5058 The rubocop definition for the maximum method length was set to 75.
5059
5060
5061 ---
5062
5063 * [HBASE-22379](https://issues.apache.org/jira/browse/HBASE-22379) | *Minor* | **Fix Markdown for "Voting on Release Candidates" in book**
5064
5065 Fixes the formatting of the "Voting on Release Candidates" to actually show the quote and code formatting of the RAT check.
5066
5067
5068 ---
5069
5070 * [HBASE-20851](https://issues.apache.org/jira/browse/HBASE-20851) | *Minor* | **Change rubocop config for max line length of 100**
5071
5072 The rubocop configuration in the hbase-shell module now allows a line length with 100 characters, instead of 80 as before. For everything before 2.1.5 this change introduces rubocop itself.
5073
5074
5075 ---
5076
5077 * [HBASE-22301](https://issues.apache.org/jira/browse/HBASE-22301) | *Minor* | **Consider rolling the WAL if the HDFS write pipeline is slow**
5078
5079 This change adds new conditions for rolling the WAL for when syncs on the HDFS writer pipeline are perceived to be slow.
5080
5081 As before the configuration parameter hbase.regionserver.wal.slowsync.ms sets the slow sync warning threshold.
5082
5083 If we encounter hbase.regionserver.wal.slowsync.roll.threshold number of slow syncs (default 100) within the interval defined by hbase.regionserver.wal.slowsync.roll.interval.ms (default 1 minute), we will request a WAL roll.
5084
5085 Or, if the time for any sync exceeds the threshold set by hbase.regionserver.wal.roll.on.sync.ms (default 10 seconds) we will request a WAL roll immediately.
5086
5087 Operators can monitor how often these new thresholds result in a WAL roll by looking at newly added metrics to the WAL related metric group:
5088 \* slowSyncRollRequest - How many times a roll was requested due to sync too slow on the write pipeline.
5089
5090 Additionally, as a part of this change there are also additional metrics for existing reasons for a WAL roll:
5091 \* errorRollRequest - How many times a roll was requested due to I/O or other errors.
5092 \* sizeRollRequest - How many times a roll was requested due to file size roll threshold.
5093
5094
5095 ---
5096
5097 * [HBASE-21883](https://issues.apache.org/jira/browse/HBASE-21883) | *Minor* | **Enhancements to Major Compaction tool**
5098
5099 MajorCompactorTTL Tool allows to compact all regions in a table that have been TTLed out. This saves space on DFS and is useful for tables which are similar to time series data. This is typically scheduled to run frequently (say via cron) to cleanup old data on an ongoing basis.
5100
5101 RSGroupMajorCompactionTTL tool is similar to MajorCompactorTTL but runs at a region server group level. If multiple tables in an rsgroup are similar to time-series data, then it runs a single command to clean them up. As more tables are added/removed from rsgroup, it's easy to have a single command to take care of all of them.
5102
5103
5104 ---
5105
5106 * [HBASE-22054](https://issues.apache.org/jira/browse/HBASE-22054) | *Minor* | **Space Quota: Compaction is not working for super user in case of NO\_WRITES\_COMPACTIONS**
5107
5108 This change allows the system and superusers to initiate compactions, even when a space quota violation policy disallows compactions from happening. The original intent behind disallowing of compactions was to prevent end-user compactions from creating undue I/O load, not disallowing \*any\* compaction in the system.
5109
5110
5111 ---
5112
5113 * [HBASE-22083](https://issues.apache.org/jira/browse/HBASE-22083) | *Minor* | **move eclipse specific configs into a profile**
5114
5115 <!-- markdown -->
5116 Maven project integration for Eclipse has been isolated into a maven profile to ensure it only is active when in an Eclipse project.
5117
5118 Things should continue to behave the same for Eclipse users. If something should go wrong folks should manually activate the `eclipse-specific` profile.
5119
5120
5121 ---
5122
5123 * [HBASE-22307](https://issues.apache.org/jira/browse/HBASE-22307) | *Major* | **Deprecated Preemptive Fail Fast**
5124
5125 Deprecated Preemptive Fail Fast related constants in HConstants, the support of this feature will be removed in 3.0.0 so use these constants will have no effect for 3.0.0+ releases. And the constants will be kept till 4.0.0.
5126
5127 Users can use 'hbase.client.perserver.requests.threshold' to control the number of concurrent requests to the same region server. Please see the release note of HBASE-16388 for more details.
5128
5129
5130 ---
5131
5132 * [HBASE-22292](https://issues.apache.org/jira/browse/HBASE-22292) | *Blocker* | **PreemptiveFastFailInterceptor clean repeatedFailuresMap issue**
5133
5134 Adds new configuration hbase.client.failure.map.cleanup.interval which defaults to ten minutes.
5135
5136
5137 ---
5138
5139 * [HBASE-19222](https://issues.apache.org/jira/browse/HBASE-19222) | *Major* | **update jruby to 9.1.17.0**
5140
5141 <!-- markdown -->
5142
5143 The default version of JRuby shipped with HBase has been updated to the JRuby 9.1.17.0 release.
5144
5145 For details on changes see [the release notes for JRuby 9.1.17.0](https://www.jruby.org/2018/04/23/jruby-9-1-17-0)
5146
5147
5148 ---
5149
5150 * [HBASE-22279](https://issues.apache.org/jira/browse/HBASE-22279) | *Major* | **Add a getRegionLocator method in Table/AsyncTable interface**
5151
5152 Add below method in Table interface:
5153
5154 RegionLocator getRegionLocator() throws IOException;
5155
5156 Add below methods in AsyncTable interface:
5157
5158 AsyncTableRegionLocator getRegionLocator();
5159 CompletableFuture\<TableDescriptor\> getDescriptor();
5160
5161
5162 ---
5163
5164 * [HBASE-15560](https://issues.apache.org/jira/browse/HBASE-15560) | *Major* | **TinyLFU-based BlockCache**
5165
5166 LruBlockCache uses the Segmented LRU (SLRU) policy to capture frequency and recency of the working set. It achieves concurrency by using an O(n) background thread to prioritize the entries and evict. Accessing an entry is O(1) by a hash table lookup, recording its logical access time, and setting a frequency flag. A write is performed in O(1) time by updating the hash table and triggering an async eviction thread. This provides ideal concurrency and minimizes the latencies by penalizing the thread instead of the caller. However the policy does not age the frequencies and may not be resilient to various workload patterns.
5167
5168 This change introduces a new L1 policy, TinyLfuBlockCache, which records the frequency in a counting sketch, ages periodically by halving the counters, and orders entries by SLRU. An entry is discarded by comparing the frequency of the new arrival to the SLRU's victim, and keeping the one with the highest frequency. This allows the operations to be performed in O(1) time and, though the use of a compact sketch, a much larger history is retained beyond the current working set. In a variety of real world traces the policy had near optimal hit rates.
5169
5170 New configuration variable hfile.block.cache.policy sets the eviction policy for the L1 block cache. The default is "LRU" (LruBlockCache). Set to "TinyLFU" to use TinyLfuBlockCache instead.
5171
5172
5173 ---
5174
5175 * [HBASE-22178](https://issues.apache.org/jira/browse/HBASE-22178) | *Major* | **Introduce a createTableAsync with TableDescriptor method in Admin**
5176
5177 Introduced
5178
5179 Future\<Void\> createTableAsync(TableDescriptor);
5180
5181
5182 ---
5183
5184 * [HBASE-22108](https://issues.apache.org/jira/browse/HBASE-22108) | *Major* | **Avoid passing null in Admin methods**
5185
5186 Introduced these methods:
5187 void move(byte[]);
5188 void move(byte[], ServerName);
5189 Future\<Void\> splitRegionAsync(byte[]);
5190
5191 These methods are deprecated:
5192 void move(byte[], byte[])
5193
5194
5195 ---
5196
5197 * [HBASE-22152](https://issues.apache.org/jira/browse/HBASE-22152) | *Major* | **Create a jenkins file for yetus to processing GitHub PR**
5198
5199 Add a new jenkins file for running pre commit check for GitHub PR.
5200
5201
5202 ---
5203
5204 * [HBASE-22007](https://issues.apache.org/jira/browse/HBASE-22007) | *Major* | **Add restoreSnapshot and cloneSnapshot with acl methods in AsyncAdmin**
5205
5206 Add cloneSnapshot/restoreSnapshot with acl methods in AsyncAdmin.
5207
5208
5209 ---
5210
5211 * [HBASE-22123](https://issues.apache.org/jira/browse/HBASE-22123) | *Minor* | **REST gateway reports Insufficient permissions exceptions as 404 Not Found**
5212
5213 When insufficient permissions, you now get:
5214
5215 HTTP/1.1 403 Forbidden
5216
5217 on the HTTP side, and in the message
5218
5219 Forbidden
5220 org.apache.hadoop.hbase.security.AccessDeniedException: org.apache.hadoop.hbase.security.AccessDeniedException: Insufficient permissions for user ‘myuser',action: get, tableName:mytable, family:cf.
5221 at org.apache.ranger.authorization.hbase.RangerAuthorizationCoprocessor.authorizeAccess(RangerAuthorizationCoprocessor.java:547)
5222 and the rest of the ADE stack
5223
5224
5225 ---
5226
5227 * [HBASE-22100](https://issues.apache.org/jira/browse/HBASE-22100) | *Minor* | **False positive for error prone warnings in pre commit job**
5228
5229 Now we will sort the javac WARNING/ERROR before generating diff in pre-commit so we can get a stable output for the error prone. The downside is that we just sort the output lexicographically so the line number will also be sorted lexicographically, which is a bit strange to human.
5230
5231
5232 ---
5233
5234 * [HBASE-22057](https://issues.apache.org/jira/browse/HBASE-22057) | *Major* | **Impose upper-bound on size of ZK ops sent in a single multi()**
5235
5236 Exposes a new configuration property "zookeeper.multi.max.size" which dictates the maximum size of deletes that HBase will make to ZooKeeper in a single RPC. This property defaults to 1MB, which should fall beneath the default ZooKeeper limit of 2MB, controlled by "jute.maxbuffer".
5237
5238
5239 ---
5240
5241 * [HBASE-22052](https://issues.apache.org/jira/browse/HBASE-22052) | *Major* | **pom cleaning; filter out jersey-core in hadoop2 to match hadoop3 and remove redunant version specifications**
5242
5243 <!-- markdown -->
5244 Fixed awkward dependency issue that prevented site building.
5245
5246 #### note specific to HBase 2.1.4
5247 HBase 2.1.4 shipped with an early version of this fix that incorrectly altered the libraries included in our binary assembly for using Apache Hadoop 2.7 (the current build default Hadoop version for 2.1.z). For folks running out of the box against a Hadoop 2.7 cluster (or folks who skip the installation step of [replacing the bundled Hadoop libraries](http://hbase.apache.org/book.html#hadoop)) this will result in a failure at Region Server startup due to a missing class definition. e.g.:
5248 ```
5249 2019-03-27 09:02:05,779 ERROR [main] regionserver.HRegionServer: Failed construction RegionServer
5250 java.lang.NoClassDefFoundError: org/apache/htrace/SamplerBuilder
5251         at org.apache.hadoop.hdfs.DFSClient.<init>(DFSClient.java:644)
5252         at org.apache.hadoop.hdfs.DFSClient.<init>(DFSClient.java:628)
5253         at org.apache.hadoop.hdfs.DistributedFileSystem.initialize(DistributedFileSystem.java:149)
5254         at org.apache.hadoop.fs.FileSystem.createFileSystem(FileSystem.java:2667)
5255         at org.apache.hadoop.fs.FileSystem.access$200(FileSystem.java:93)
5256         at org.apache.hadoop.fs.FileSystem$Cache.getInternal(FileSystem.java:2701)
5257         at org.apache.hadoop.fs.FileSystem$Cache.get(FileSystem.java:2683)
5258         at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:372)
5259         at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:171)
5260         at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:356)
5261         at org.apache.hadoop.fs.Path.getFileSystem(Path.java:295)
5262         at org.apache.hadoop.hbase.util.CommonFSUtils.getRootDir(CommonFSUtils.java:362)
5263         at org.apache.hadoop.hbase.util.CommonFSUtils.isValidWALRootDir(CommonFSUtils.java:411)
5264         at org.apache.hadoop.hbase.util.CommonFSUtils.getWALRootDir(CommonFSUtils.java:387)
5265         at org.apache.hadoop.hbase.regionserver.HRegionServer.initializeFileSystem(HRegionServer.java:704)
5266         at org.apache.hadoop.hbase.regionserver.HRegionServer.<init>(HRegionServer.java:613)
5267         at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
5268         at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:62)
5269         at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)
5270         at java.lang.reflect.Constructor.newInstance(Constructor.java:423)
5271         at org.apache.hadoop.hbase.regionserver.HRegionServer.constructRegionServer(HRegionServer.java:3029)
5272         at org.apache.hadoop.hbase.regionserver.HRegionServerCommandLine.start(HRegionServerCommandLine.java:63)
5273         at org.apache.hadoop.hbase.regionserver.HRegionServerCommandLine.run(HRegionServerCommandLine.java:87)
5274         at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
5275         at org.apache.hadoop.hbase.util.ServerCommandLine.doMain(ServerCommandLine.java:149)
5276         at org.apache.hadoop.hbase.regionserver.HRegionServer.main(HRegionServer.java:3047)
5277 Caused by: java.lang.ClassNotFoundException: org.apache.htrace.SamplerBuilder
5278         at java.net.URLClassLoader.findClass(URLClassLoader.java:381)
5279         at java.lang.ClassLoader.loadClass(ClassLoader.java:424)
5280         at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:349)
5281         at java.lang.ClassLoader.loadClass(ClassLoader.java:357)
5282         ... 26 more
5283
5284 ```
5285
5286 Workaround via any _one_ of the following:
5287 * If you are running against a Hadoop cluster that is 2.8+, ensure you replace the Hadoop libaries in the default binary assembly with those for your version.
5288 * If you are running against a Hadoop cluster that is 2.8+, build the binary assembly from the source release while specifying your Hadoop version.
5289 * If you are running against a Hadoop cluster that is a supported 2.7 release, ensure the `hadoop` executable is in the `PATH` seen at Region Server startup and that you are not using the `HBASE_DISABLE_HADOOP_CLASSPATH_LOOKUP` bypass.
5290 * For any supported Hadoop version, manually make the Apache HTrace artifact `htrace-core-3.1.0-incubating.jar` available to all Region Servers via the HBASE_CLASSPATH environment variable.
5291 * For any supported Hadoop version, manually make the Apache HTrace artifact `htrace-core-3.1.0-incubating.jar` available to all Region Servers by copying it into the directory `${HBASE_HOME}/lib/client-facing-thirdparty/`.
5292
5293
5294 ---
5295
5296 * [HBASE-22065](https://issues.apache.org/jira/browse/HBASE-22065) | *Major* | **Add listTableDescriptors(List\<TableName\>) method in AsyncAdmin**
5297
5298 Add a listTableDescriptors(List\<TableName\>) method in the AsyncAdmin interface, to align with the Admin interface.
5299
5300
5301 ---
5302
5303 * [HBASE-22063](https://issues.apache.org/jira/browse/HBASE-22063) | *Major* | **Deprecated Admin.deleteSnapshot(byte[])**
5304
5305 Deprecate Admin.deleteSnapshot(byte[]), please use the String version instead.
5306
5307
5308 ---
5309
5310 * [HBASE-22040](https://issues.apache.org/jira/browse/HBASE-22040) | *Major* | **Add mergeRegionsAsync with a List of region names method in AsyncAdmin**
5311
5312 Add a mergeRegionsAsync(byte[][], boolean) method in the AsyncAdmin interface.
5313
5314 Instead of using assert, now we will throw IllegalArgumentException when you want to merge less than 2 regions at client side. And also, at master side, instead of using assert, now we will throw DoNotRetryIOException if you want merge more than 2 regions, since we only support merging two regions at once for now.
5315
5316
5317 ---
5318
5319 * [HBASE-22039](https://issues.apache.org/jira/browse/HBASE-22039) | *Major* | **Should add the synchronous parameter for the XXXSwitch method in AsyncAdmin**
5320
5321 Add drainXXX parameter for balancerSwitch/splitSwitch/mergeSwitch methods in the AsyncAdmin interface, which has the same meaning with the synchronous parameter for these methods in the Admin interface.
5322
5323
5324 ---
5325
5326 * [HBASE-22044](https://issues.apache.org/jira/browse/HBASE-22044) | *Major* | **ByteBufferUtils should not be IA.Public API**
5327
5328 <!-- markdown -->
5329
5330 As of HBase 3.0, the ByteBufferUtils class is now marked as a Private API for internal project use only. Downstream users are advised that it no longer has any compatibility promises across releases.
5331
5332 As of earlier HBase release lines the class is now marked as deprecated to call attention to this planned transition.
5333
5334
5335 ---
5336
5337 * [HBASE-21810](https://issues.apache.org/jira/browse/HBASE-21810) | *Major* | **bulkload  support set hfile compression on client**
5338
5339 bulkload (HFileOutputFormat2)  support config the compression on client ,you can set the job configuration "hbase.mapreduce.hfileoutputformat.compression"  override the auto-detection of the target table's compression
5340
5341
5342 ---
5343
5344 * [HBASE-22001](https://issues.apache.org/jira/browse/HBASE-22001) | *Major* | **Polish the Admin interface**
5345
5346 Add a cloneSnapshotAsync method with restoreAcl parameter.
5347 Deprecated restoreSnapshotAsync method as it just ignores the failsafe configuration.
5348 Make snapshotAsync method returns a Future\<Void\>.
5349 Deprecated the snapshot related methods which take a 'byte[]' as the snapshot name.
5350 Use default methods to reduce the code base for implementation classes.
5351
5352
5353 ---
5354
5355 * [HBASE-22000](https://issues.apache.org/jira/browse/HBASE-22000) | *Major* | **Deprecated isTableAvailable with splitKeys**
5356
5357 Deprecated AsyncTable.isTableAvailable(TableName, byte[][]).
5358
5359
5360 ---
5361
5362 * [HBASE-21871](https://issues.apache.org/jira/browse/HBASE-21871) | *Major* | **Support to specify a peer table name in VerifyReplication tool**
5363
5364 After HBASE-21871, we can specify a peer table name with --peerTableName in VerifyReplication tool like the following:
5365 hbase org.apache.hadoop.hbase.mapreduce.replication.VerifyReplication --peerTableName=peerTable 5 TestTable
5366
5367 In addition, we can compare any 2 tables in any remote clusters with specifying both peerId and --peerTableName.
5368
5369 For example:
5370 hbase org.apache.hadoop.hbase.mapreduce.replication.VerifyReplication --peerTableName=peerTable zk1,zk2,zk3:2181/hbase TestTable
5371
5372
5373 ---
5374
5375 * [HBASE-15728](https://issues.apache.org/jira/browse/HBASE-15728) | *Major* | **Add remaining per-table region / store / flush / compaction related metrics**
5376
5377 Adds below flush, split, and compaction metrics
5378
5379  +  // split related metrics
5380  +  private MutableFastCounter splitRequest;
5381  +  private MutableFastCounter splitSuccess;
5382  +  private MetricHistogram splitTimeHisto;
5383  +
5384  +  // flush related metrics
5385  +  private MetricHistogram flushTimeHisto;
5386  +  private MetricHistogram flushMemstoreSizeHisto;
5387  +  private MetricHistogram flushOutputSizeHisto;
5388  +  private MutableFastCounter flushedMemstoreBytes;
5389  +  private MutableFastCounter flushedOutputBytes;
5390  +
5391  +  // compaction related metrics
5392  +  private MetricHistogram compactionTimeHisto;
5393  +  private MetricHistogram compactionInputFileCountHisto;
5394  +  private MetricHistogram compactionInputSizeHisto;
5395  +  private MetricHistogram compactionOutputFileCountHisto;
5396  +  private MetricHistogram compactionOutputSizeHisto;
5397  +  private MutableFastCounter compactedInputBytes;
5398  +  private MutableFastCounter compactedOutputBytes;
5399  +
5400  +  private MetricHistogram majorCompactionTimeHisto;
5401  +  private MetricHistogram majorCompactionInputFileCountHisto;
5402  +  private MetricHistogram majorCompactionInputSizeHisto;
5403  +  private MetricHistogram majorCompactionOutputFileCountHisto;
5404  +  private MetricHistogram majorCompactionOutputSizeHisto;
5405  +  private MutableFastCounter majorCompactedInputBytes;
5406  +  private MutableFastCounter majorCompactedOutputBytes;
5407
5408
5409 ---
5410
5411 * [HBASE-21481](https://issues.apache.org/jira/browse/HBASE-21481) | *Major* | **[acl] Superuser's permissions should not be granted or revoked by any non-su global admin**
5412
5413 HBASE-21481 improves the quality of access control, by strengthening the protection of super users's privileges.
5414
5415
5416 ---
5417
5418 * [HBASE-21082](https://issues.apache.org/jira/browse/HBASE-21082) | *Critical* | **Reimplement assign/unassign related procedure metrics**
5419
5420 Now we have four types of RIT procedure metrics, assign, unassign, move, reopen. The meaning of assign/unassign is changed, as we will not increase the unassign metric and then the assign metric when moving a region.
5421 Also introduced two new procedure metrics, open and close, which are used to track the open/close region calls to region server. We may send open/close multiple times to finish a RIT since we may retry multiple times.
5422
5423
5424 ---
5425
5426 * [HBASE-20724](https://issues.apache.org/jira/browse/HBASE-20724) | *Critical* | **Sometimes some compacted storefiles are still opened after region failover**
5427
5428 Problem: This is an old problem since HBASE-2231. The compaction event marker was only writed to WAL. But after flush, the WAL may be archived, which means an useful compaction event marker be deleted, too. So the compacted store files cannot be archived when region open and replay WAL.
5429
5430 Solution: After this jira, the compaction event tracker will be writed to HFile. When region open and load store files, read the compaction evnet tracker from HFile and archive the compacted store files which still exist.
5431
5432
5433 ---
5434
5435 * [HBASE-21820](https://issues.apache.org/jira/browse/HBASE-21820) | *Major* | **Implement CLUSTER quota scope**
5436
5437 HBase contains two quota scopes: MACHINE and CLUSTER. Before this patch, set quota operations did not expose scope option to client api and use MACHINE as default, CLUSTER scope can not be set and used.
5438 Shell commands are as follows:
5439 set\_quota, TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec'
5440
5441 This issue implements CLUSTER scope in a simple way: For user, namespace, user over namespace quota, use [ClusterLimit / RSNum] as machine limit. For table and user over table quota, use [ClusterLimit / TotalTableRegionNum \* MachineTableRegionNum] as machine limit.
5442 After this patch, user can set CLUSTER scope quota, but MACHINE is still default if user ignore scope.
5443 Shell commands are as follows:
5444 set\_quota, TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec'
5445 set\_quota, TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec', SCOPE =\> MACHINE
5446 set\_quota, TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec', SCOPE =\> CLUSTER
5447
5448
5449 ---
5450
5451 * [HBASE-21057](https://issues.apache.org/jira/browse/HBASE-21057) | *Minor* | **upgrade to latest spotbugs**
5452
5453 Change spotbugs version to 3.1.11.
5454
5455
5456 ---
5457
5458 * [HBASE-21505](https://issues.apache.org/jira/browse/HBASE-21505) | *Major* | **Several inconsistencies on information reported for Replication Sources by hbase shell status 'replication' command.**
5459
5460 This modifies "status 'replication'" output, fixing inconsistencies on the reporting times and ages of last shipped edits, as well as wrong calculation of replication lags.
5461
5462 It also introduces additional info for each recovery queue, which was not accounted by this command before.
5463
5464 The new output for "status 'replication'" command is explained in details below:
5465 a) Source started, target stopped, no edits arrived on source yet:
5466 ...
5467  SOURCE: PeerID=1
5468          Normal Queue: 1
5469            No Ops shipped since last restart, SizeOfLogQueue=1, No edits for this source since it started, Replication Lag=0
5470 ...
5471 b) Source started, target stopped, add edit on source:
5472 ...
5473 Normal Queue: 1
5474            No Ops shipped since last restart, SizeOfLogQueue=1, TimeStampOfLastArrivedInSource=Wed Nov 21 07:21:00 GMT 2018, Replication Lag=2459
5475 ...
5476 c) Source started, target stopped, edit added on source, restart source:
5477 ...
5478 SOURCE: PeerID=1
5479          Normal Queue: 1
5480            No Ops shipped since last restart, SizeOfLogQueue=1, No edits for this source since it started, Replication Lag=0
5481          Recovered Queue: 1-hbase01.home,16020,1542784524057
5482            No Ops shipped since last restart, SizeOfLogQueue=1, TimeStampOfLastArrivedInSource=Wed Nov 21 07:23:00 GMT 2018, Replication Lag=201495
5483 ...
5484 d) Source started, target stopped, add edit on source, restart source, add another edit on source:
5485 ...
5486 SOURCE: PeerID=1
5487          Normal Queue: 1
5488            No Ops shipped since last restart, SizeOfLogQueue=1, TimeStampOfLastArrivedInSource=Wed Nov 21 07:02:28 GMT 2018, Replication Lag=6349
5489          Recovered Queue: 1-hbase01.home,16020,1542782758742
5490            No Ops shipped since last restart, SizeOfLogQueue=0, TimeStampOfLastArrivedInSource=Wed Nov 21 06:53:05 GMT 2018, Replication Lag=569394
5491 ...
5492 e) Source started, target stopped, add edit on source, restart source, add another edit on source, start target:
5493 ...
5494        SOURCE: PeerID=1
5495          Normal Queue: 1
5496            AgeOfLastShippedOp=30000, TimeStampOfLastShippedOp=Wed Nov 21 07:07:58 GMT 2018, SizeOfLogQueue=1, TimeStampOfLastArrivedInSource=Wed Nov 21 07:02:28 GMT 2018, Replication Lag=0
5497 ...
5498 f) Source started, target stopped, add edit on source, restart source, restart target:
5499 ...
5500 SOURCE: PeerID=1
5501          Normal Queue: 1
5502            No Ops shipped since last restart, SizeOfLogQueue=1, No edits for this source since it started, Replication Lag=0
5503 ...
5504
5505
5506 ---
5507
5508 * [HBASE-21922](https://issues.apache.org/jira/browse/HBASE-21922) | *Major* | **BloomContext#sanityCheck may failed when use ROWPREFIX\_DELIMITED bloom filter**
5509
5510 Remove bloom filter type ROWPREFIX\_DELIMITED. May add it back when find a better solution.
5511
5512
5513 ---
5514
5515 * [HBASE-21783](https://issues.apache.org/jira/browse/HBASE-21783) | *Major* | **Support exceed user/table/ns throttle quota if region server has available quota**
5516
5517 Support enable or disable exceed throttle quota. Exceed throttle quota means, user can over consume user/namespace/table quota if region server has additional available quota because other users don't consume at the same time.
5518 Use the following shell commands to enable/disable exceed throttle quota: enable\_exceed\_throttle\_quota
5519 disable\_exceed\_throttle\_quota
5520 There are two limits when enable exceed throttle quota:
5521 1. Must set at least one read and one write region server throttle quota;
5522 2. All region server throttle quotas must be in seconds time unit. Because once previous requests exceed their quota and consume region server quota, quota in other time units may be refilled in a long time, this may affect later requests.
5523
5524
5525 ---
5526
5527 * [HBASE-20587](https://issues.apache.org/jira/browse/HBASE-20587) | *Major* | **Replace Jackson with shaded thirdparty gson**
5528
5529 Remove jackson dependencies from most hbase modules except hbase-rest, use shaded gson instead. The output json will be a bit different since jackson can use getter/setter, but gson will always use the fields.
5530
5531
5532 ---
5533
5534 * [HBASE-21928](https://issues.apache.org/jira/browse/HBASE-21928) | *Major* | **Deprecated HConstants.META\_QOS**
5535
5536 Mark HConstants.META\_QOS as deprecated. It is for internal use only, which is the highest priority. You should not try to set a priority greater than or equal to this value, although it is no harm but also useless.
5537
5538
5539 ---
5540
5541 * [HBASE-17942](https://issues.apache.org/jira/browse/HBASE-17942) | *Major* | **Disable region splits and merges per table**
5542
5543 This patch adds the ability to disable split and/or merge for a table (By default, split and merge are enabled for a table).
5544
5545
5546 ---
5547
5548 * [HBASE-21636](https://issues.apache.org/jira/browse/HBASE-21636) | *Major* | **Enhance the shell scan command to support missing scanner specifications like ReadType, IsolationLevel etc.**
5549
5550 Allows shell to set Scan options previously not exposed. See additions as part of the scan help by typing following hbase shell:
5551
5552 hbase\> help 'scan'
5553
5554
5555 ---
5556
5557 * [HBASE-21201](https://issues.apache.org/jira/browse/HBASE-21201) | *Major* | **Support to run VerifyReplication MR tool without peerid**
5558
5559 We can specify peerQuorumAddress instead of peerId in VerifyReplication tool. So it no longer requires peerId to be setup when using this tool.
5560
5561 For example:
5562 hbase org.apache.hadoop.hbase.mapreduce.replication.VerifyReplication zk1,zk2,zk3:2181/hbase testTable
5563
5564
5565 ---
5566
5567 * [HBASE-21838](https://issues.apache.org/jira/browse/HBASE-21838) | *Major* | **Create a special ReplicationEndpoint just for verifying the WAL entries are fine**
5568
5569 Introduce a VerifyWALEntriesReplicationEndpoint which replicates nothing but only verifies if all the cells are valid.
5570 It can be used to capture bugs for writing WAL, as most times we will not read the WALs again after writing it if there are no region server crashes.
5571
5572
5573 ---
5574
5575 * [HBASE-21764](https://issues.apache.org/jira/browse/HBASE-21764) | *Major* | **Size of in-memory compaction thread pool should be configurable**
5576
5577 Introduced an new config key in this issue: hbase.regionserver.inmemory.compaction.pool.size. the default value would be 10.  you can configure this to set the pool size of in-memory compaction pool. Note that all memstores in one region server will share the same pool, so if you have many regions in one region server,  you need to set this larger to compact faster for better read performance.
5578
5579
5580 ---
5581
5582 * [HBASE-21684](https://issues.apache.org/jira/browse/HBASE-21684) | *Major* | **Throw DNRIOE when connection or rpc client is closed**
5583
5584 Make StoppedRpcClientException extend DoNotRetryIOException.
5585
5586
5587 ---
5588
5589 * [HBASE-21739](https://issues.apache.org/jira/browse/HBASE-21739) | *Major* | **Move grant/revoke from regionserver to master**
5590
5591 To implement user permission control in Precedure V2, move grant and revoke method from AccessController to master firstly.
5592 Mark AccessController#grant and AccessController#revoke as deprecated and please use Admin#grant and Admin#revoke instead.
5593
5594
5595 ---
5596
5597 * [HBASE-21791](https://issues.apache.org/jira/browse/HBASE-21791) | *Blocker* | **Upgrade thrift dependency to 0.12.0**
5598
5599 IMPORTANT: Due to security issues, all users who use hbase thrift should avoid using releases which do not have this fix.
5600
5601 The effect releases are:
5602 2.1.x: 2.1.2 and below
5603 2.0.x: 2.0.4 and below
5604 1.x: 1.4.x and below
5605
5606 If you are using the effect releases above, please consider upgrading to a newer release ASAP.
5607
5608
5609 ---
5610
5611 * [HBASE-20894](https://issues.apache.org/jira/browse/HBASE-20894) | *Major* | **Move BucketCache from java serialization to protobuf**
5612
5613 For users who have configured hbase.bucketcache.ioengine with either the file:, files:, or mmap: prefix, and configured it to be persistent via the hbase.bucketcache.persistent.path property, the serialization format of the bucket cache has changed between versions. The old state will not be read during startup, and there is currently no migration path. The impact is expected to be minimal, however, since the cache will rebuild over time as access patterns dictate.
5614
5615
5616
5617 # HBASE  2.3.0 Release Notes
5618
5619 These release notes cover new developer and user-facing incompatibilities, important issues, features, and major improvements.
5620
5621
5622 ---
5623
5624 * [HBASE-24631](https://issues.apache.org/jira/browse/HBASE-24631) | *Major* | **Loosen Dockerfile pinned package versions of the "debian-revision"**
5625
5626 <!-- markdown -->
5627 Update our package version numbers throughout the Dockerfiles to be pinned to their epic:upstream-version components only. Previously we'd specify the full debian package version number, including the debian-revision. This lead to instability as debian packaging details changed.
5628 See also [man deb-version](http://manpages.ubuntu.com/manpages/xenial/en/man5/deb-version.5.html)
5629
5630
5631 ---
5632
5633 * [HBASE-24205](https://issues.apache.org/jira/browse/HBASE-24205) | *Major* | **Create metric to know the number of reads that happens from memstore**
5634
5635 Adds a new metric where we collect the number of read requests (tracked per row) whether the row was fetched completely from memstore or it was pulled from files  and memstore.
5636 The metric is now collected under the mbean for Tables and under the mbean for regions.
5637 Under table mbean ie.-
5638 'name": "Hadoop:service=HBase,name=RegionServer,sub=Tables'
5639 The new metrics will be listed as
5640 {code}
5641     "Namespace\_default\_table\_t3\_columnfamily\_f1\_metric\_memstoreOnlyRowReadsCount": 5,
5642  "Namespace\_default\_table\_t3\_columnfamily\_f1\_metric\_mixedRowReadsCount": 1,
5643 {code}
5644 Where the format is Namespace\_\<namespacename\>\_table\_\<tableName\>\_columnfamily\_\<columnfamilyname\>\_metric\_memstoreOnlyRowReadsCount
5645 Namespace\_\<namespacename\>\_table\_\<tableName\>\_columnfamily\_\<columnfamilyname\>\_metric\_mixedRowReadsCount
5646 {code}
5647
5648 The same one under the region ie.
5649 "name": "Hadoop:service=HBase,name=RegionServer,sub=Regions",
5650 comes as
5651 {code}
5652    "Namespace\_default\_table\_t3\_region\_75a7846f4ac4a2805071a855f7d0dbdc\_store\_f1\_metric\_memstoreOnlyRowReadsCount": 5,
5653     "Namespace\_default\_table\_t3\_region\_75a7846f4ac4a2805071a855f7d0dbdc\_store\_f1\_metric\_mixedRowReadsCount": 1,
5654 {code}
5655 where
5656 Namespace\_\<namespacename\_table\_\<tableName\>\_region\_\<regionName\>\_store\_\<storeName\>\_metric\_memstoreOnlyRowReadsCount
5657 Namespace\_\<namespacename\_table\_\<tableName\>\_region\_\<regionName\>\_store\_\<storeName\>\_metric\_mixedRowReadsCount
5658 This is also an aggregate against every store the number of reads that happened purely from the memstore or it was a  mixed read that happened from memstore and file.
5659
5660
5661 ---
5662
5663 * [HBASE-21773](https://issues.apache.org/jira/browse/HBASE-21773) | *Critical* | **rowcounter utility should respond to pleas for help**
5664
5665 This adds [-h\|-help] options to rowcounter. Passing either -h or -help will print rowcounter guide as below:
5666
5667 $hbase rowcounter -h
5668
5669 usage: hbase rowcounter \<tablename\> [options] [\<column1\> \<column2\>...]
5670 Options:
5671     --starttime=\<arg\>       starting time filter to start counting rows from.
5672     --endtime=\<arg\>         end time filter limit, to only count rows up to this timestamp.
5673     --range=\<arg\>           [startKey],[endKey][;[startKey],[endKey]...]]
5674     --expectedCount=\<arg\>   expected number of rows to be count.
5675 For performance, consider the following configuration properties:
5676 -Dhbase.client.scanner.caching=100
5677 -Dmapreduce.map.speculative=false
5678
5679
5680 ---
5681
5682 * [HBASE-24217](https://issues.apache.org/jira/browse/HBASE-24217) | *Major* | **Add hadoop 3.2.x support**
5683
5684 CI coverage has been extended to include Hadoop 3.2.x for HBase 2.2+.
5685
5686
5687 ---
5688
5689 * [HBASE-23055](https://issues.apache.org/jira/browse/HBASE-23055) | *Major* | **Alter hbase:meta**
5690
5691 Adds being able to edit hbase:meta table schema. For example,
5692
5693 hbase(main):006:0\> alter 'hbase:meta', {NAME =\> 'info', DATA\_BLOCK\_ENCODING =\> 'ROW\_INDEX\_V1'}
5694 Updating all regions with the new schema...
5695 All regions updated.
5696 Done.
5697 Took 1.2138 seconds
5698
5699 You can even add columnfamilies. Howevert, you cannot delete any of the core hbase:meta column families such as 'info' and 'table'.
5700
5701
5702 ---
5703
5704 * [HBASE-15161](https://issues.apache.org/jira/browse/HBASE-15161) | *Major* | **Umbrella: Miscellaneous improvements from production usage**
5705
5706 This ticket summarizes significant improvements and expansion to the metrics surface area. Interested users should review the individual sub-tasks.
5707
5708
5709 ---
5710
5711 * [HBASE-24545](https://issues.apache.org/jira/browse/HBASE-24545) | *Major* | **Add backoff to SCP check on WAL split completion**
5712
5713 Adds backoff in ServerCrashProcedure wait on WAL split to complete if large backlog of files to split (Its possible to avoid SCP blocking, waiting on WALs to split if you use procedure-based splitting --  set 'hbase.split.wal.zk.coordinated' to false to enable procedure based wal splitting.)
5714
5715
5716 ---
5717
5718 * [HBASE-24524](https://issues.apache.org/jira/browse/HBASE-24524) | *Minor* | **SyncTable logging improvements**
5719
5720 Notice this has changed log level for mismatching row keys, originally those were being logged at INFO level, now it's logged at DEBUG level. This is consistent with the logging of mismatching cells. Also, for missing row keys, it now logs row key values in human readable format, making it more meaningful for operators troubleshooting mismatches.
5721
5722
5723 ---
5724
5725 * [HBASE-24359](https://issues.apache.org/jira/browse/HBASE-24359) | *Major* | **Optionally ignore edits for deleted CFs for replication.**
5726
5727 Introduce a new config hbase.replication.drop.on.deleted.columnfamily, default is false. When config to true, the replication will drop the edits for columnfamily that has been deleted from the replication source and target.
5728
5729
5730 ---
5731
5732 * [HBASE-24418](https://issues.apache.org/jira/browse/HBASE-24418) | *Major* | **Consolidate Normalizer implementations**
5733
5734 <!-- markdown -->
5735 This change extends the Normalizer with a handful of new configurations. The configuration points supported are:
5736 * `hbase.normalizer.split.enabled` Whether to split a region as part of normalization. Default: `true`.
5737 * `hbase.normalizer.merge.enabled` Whether to merge a region as part of normalization. Default `true`.
5738 * `hbase.normalizer.min.region.count` The minimum number of regions in a table to consider it for merge normalization. Default: 3.
5739 * `hbase.normalizer.merge.min_region_age.days` The minimum age for a region to be considered for a merge, in days. Default: 3.
5740 * `hbase.normalizer.merge.min_region_size.mb` The minimum size for a region to be considered for a merge, in whole MBs. Default: 1.
5741
5742
5743 ---
5744
5745 * [HBASE-24309](https://issues.apache.org/jira/browse/HBASE-24309) | *Major* | **Avoid introducing log4j and slf4j-log4j dependencies for modules other than hbase-assembly**
5746
5747 Add a hbase-logging module, put the log4j related code in this module only so other modules do not need to depend on log4j at compile scope. See the comments of Log4jUtils and InternalLog4jUtils for more details.
5748
5749 Add a log4j.properties to the test jar of hbase-logging module, so for other sub modules we just need to depend on the test jar of hbase-logging module at test scope to output the log to console, without placing a log4j.properties in the test resources as they all (almost) have the same content. And this test module will not be included in the assembly tarball so it will not mess up the binary distribution.
5750
5751 Ban direct commons-logging dependency, and ban commons-logging and log4j imports in non-test code, to avoid mess up the downstream users logging framework. In hbase-logging module we do need to use log4j classes and the trick is to use full class name.
5752
5753 Add jcl-over-slf4j and jul-to-slf4j dependencies, as some of our dependencies use jcl or jul as logging framework, we should also redirect their log message to slf4j.
5754
5755
5756 ---
5757
5758 * [HBASE-21406](https://issues.apache.org/jira/browse/HBASE-21406) | *Minor* | **"status 'replication'" should not show SINK if the cluster does not act as sink**
5759
5760 Added new metric to differentiate sink startup time from last OP applied time.
5761
5762 Original behaviour was to always set startup time to TimestampsOfLastAppliedOp, and always show it on "status 'replication'" command, regardless if the sink ever applied any OP.
5763
5764 This was confusing, specially for scenarios where cluster was just acting as source, the output could lead to wrong interpretations about sink not applying edits or replication being stuck.
5765
5766 With the new metric, we now compare the two metrics values, assuming that if both are the same, there's never been any OP shipped to the given sink, so output would reflect it more clearly, to something as for example:
5767
5768 SINK: TimeStampStarted=Thu Dec 06 23:59:47 GMT 2018, Waiting for OPs...
5769
5770
5771 ---
5772
5773 * [HBASE-24132](https://issues.apache.org/jira/browse/HBASE-24132) | *Major* | **Upgrade to Apache ZooKeeper 3.5.7**
5774
5775 <!-- markdown -->
5776 HBase ships ZooKeeper 3.5.x. Was the EOL'd 3.4.x. 3.5.x client can talk to 3.4.x ensemble.
5777
5778 The ZooKeeper project has built a [FAQ](https://cwiki.apache.org/confluence/display/ZOOKEEPER/Upgrade+FAQ) that documents known issues and work-arounds when upgrading existing deployments.
5779
5780
5781 ---
5782
5783 * [HBASE-22287](https://issues.apache.org/jira/browse/HBASE-22287) | *Major* | **inifinite retries on failed server in RSProcedureDispatcher**
5784
5785 Add backoff. Avoid retrying every 100ms.
5786
5787
5788 ---
5789
5790 * [HBASE-24425](https://issues.apache.org/jira/browse/HBASE-24425) | *Major* | **Run hbck\_chore\_run and catalogjanitor\_run on draw of 'HBCK Report' page**
5791
5792 Runs 'catalogjanitor\_run' and 'hbck\_chore\_run' inline with the loading of the 'HBCK Report' page.
5793
5794 Pass '?cache=true' to skip inline invocation of 'catalogjanitor\_run' and 'hbck\_chore\_run' drawing the page.
5795
5796
5797 ---
5798
5799 * [HBASE-24408](https://issues.apache.org/jira/browse/HBASE-24408) | *Blocker* | **Introduce a general 'local region' to store data on master**
5800
5801 Introduced a general 'local region' at master side to store the procedure data, etc.
5802
5803 The hfile of this region will be stored on the root fs while the wal will be stored on the wal fs. This issue supercedes part of the code for HBASE-23326, as now we store the data in 'MasterData' directory instead of 'MasterProcs'.
5804
5805 The old hfiles will be moved to the global hfile archived directory with the suffix $-masterlocalhfile-$. The wal files will be moved to the global old wal directory with the suffix $masterlocalwal$. The TimeToLiveMasterLocalStoreHFileCleaner and TimeToLiveMasterLocalStoreWALCleaner are configured by default for cleaning the old hfiles and wal files, and the default TTLs are both 7 days.
5806
5807
5808 ---
5809
5810 * [HBASE-24115](https://issues.apache.org/jira/browse/HBASE-24115) | *Major* | **Relocate test-only REST "client" from src/ to test/ and mark Private**
5811
5812 Relocate test-only REST RemoteHTable and RemoteAdmin from src/ to test/. And mark them as InterfaceAudience.Private.
5813
5814
5815 ---
5816
5817 * [HBASE-23938](https://issues.apache.org/jira/browse/HBASE-23938) | *Major* | **Replicate slow/large RPC calls to HDFS**
5818
5819 Config key: hbase.regionserver.slowlog.systable.enabled
5820 Default value: false
5821
5822 This config can be enabled if hbase.regionserver.slowlog.buffer.enabled is already enabled. While hbase.regionserver.slowlog.buffer.enabled ensures that any slow/large RPC logs with complete details are written to ring buffer available at each RegionServer, hbase.regionserver.slowlog.systable.enabled would ensure that all such logs are also persisted in new system table hbase:slowlog.
5823 Operator can scan hbase:slowlog with filters to retrieve specific attribute matching records and this table would be useful to capture historical performance of slowness of RPC calls with detailed analysis.
5824
5825 hbase:slowlog consists of single ColumnFamily info. info consists of multiple qualifiers similar to the attributes available to query as part of Admin API: get\_slowlog\_responses.
5826
5827 One example of a row from hbase:slowlog scan result (Attached a sample screenshot in the Jira) :
5828
5829  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:call\_details, timestamp=2020-05-16T14:59:58.764Z, value=Scan(org.apache.hadoop.hbase.shaded.protobuf.generated.ClientProtos$ScanRequest)
5830  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:client\_address, timestamp=2020-05-16T14:59:58.764Z, value=172.20.10.2:57348
5831  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:method\_name, timestamp=2020-05-16T14:59:58.764Z, value=Scan
5832  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:param, timestamp=2020-05-16T14:59:58.764Z, value=region { type: REGION\_NAME value: "cluster\_test,cccccccc,1589635796466.aa45e1571d533f5ed0bb31cdccaaf9cf." } scan { a
5833                                                              ttribute { name: "\_isolationlevel\_" value: "\\x5C000" } start\_row: "cccccccc" time\_range { from: 0 to: 9223372036854775807 } max\_versions: 1 cache\_blocks: true max\_result\_size: 2
5834                                                              097152 caching: 2147483647 include\_stop\_row: false } number\_of\_rows: 2147483647 close\_scanner: false client\_handles\_partials: true client\_handles\_heartbeats: true track\_scan\_met
5835                                                              rics: false
5836  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:processing\_time, timestamp=2020-05-16T14:59:58.764Z, value=24
5837  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:queue\_time, timestamp=2020-05-16T14:59:58.764Z, value=0
5838  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:region\_name, timestamp=2020-05-16T14:59:58.764Z, value=cluster\_test,cccccccc,1589635796466.aa45e1571d533f5ed0bb31cdccaaf9cf.
5839  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:response\_size, timestamp=2020-05-16T14:59:58.764Z, value=211227
5840  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:server\_class, timestamp=2020-05-16T14:59:58.764Z, value=HRegionServer
5841  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:start\_time, timestamp=2020-05-16T14:59:58.764Z, value=1589640743932
5842  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:type, timestamp=2020-05-16T14:59:58.764Z, value=ALL
5843  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:username, timestamp=2020-05-16T14:59:58.764Z, value=vjasani
5844
5845
5846 ---
5847
5848 * [HBASE-24271](https://issues.apache.org/jira/browse/HBASE-24271) | *Major* | **Set values in \`conf/hbase-site.xml\` that enable running on \`LocalFileSystem\` out of the box**
5849
5850 <!-- markdown -->
5851 HBASE-24271 makes changes the the default `conf/hbase-site.xml` such that `bin/hbase` will run directly out of the binary tarball or a compiled source tree without any configuration modifications vs. Hadoop 2.8+. This changes our long-standing history of shipping no configured values in `conf/hbase-site.xml`, so existing processes that assume this file is empty of configuration properties may require attention.
5852
5853
5854 ---
5855
5856 * [HBASE-24310](https://issues.apache.org/jira/browse/HBASE-24310) | *Major* | **Use Slf4jRequestLog for hbase-http**
5857
5858 Use Slf4jRequestLog instead of the log4j HttpRequestLogAppender in HttpServer.
5859
5860 The request log is disabled by default in conf/log4j.properties by the following lines:
5861
5862 # Disable request log by default, you can enable this by changing the appender
5863 log4j.category.http.requests=INFO,NullAppender
5864 log4j.additivity.http.requests=false
5865
5866 Change the 'NullAppender' to what ever you want if you want to enable request log.
5867
5868 Notice that, the logger name for master status http server is 'http.requests.master', and for region server it is 'http.requests.regionserver'
5869
5870
5871 ---
5872
5873 * [HBASE-24335](https://issues.apache.org/jira/browse/HBASE-24335) | *Major* | **Support deleteall with ts but without column in shell mode**
5874
5875 Use a empty string to represent no column specified for deleteall in shell mode.
5876 useage:
5877 deleteall 'test','r1','',12345
5878 deleteall 'test', {ROWPREFIXFILTER =\> 'prefix'}, '', 12345
5879
5880
5881 ---
5882
5883 * [HBASE-24304](https://issues.apache.org/jira/browse/HBASE-24304) | *Major* | **Separate a hbase-asyncfs module**
5884
5885 Added a new hbase-asyncfs module to hold the asynchronous dfs output stream implementation for implementing WAL.
5886
5887
5888 ---
5889
5890 * [HBASE-22710](https://issues.apache.org/jira/browse/HBASE-22710) | *Major* | **Wrong result in one case of scan that use  raw and versions and filter together**
5891
5892 Make the logic of the versions chosen more reasonable for raw scan, to avoid lose result when using filter.
5893
5894
5895 ---
5896
5897 * [HBASE-24285](https://issues.apache.org/jira/browse/HBASE-24285) | *Major* | **Move to hbase-thirdparty-3.3.0**
5898
5899 Moved to hbase-thirdparty 3.3.0.
5900
5901
5902 ---
5903
5904 * [HBASE-24252](https://issues.apache.org/jira/browse/HBASE-24252) | *Major* | **Implement proxyuser/doAs mechanism for hbase-http**
5905
5906 This feature enables the HBase Web UI's to accept a 'proxyuser' via the HTTP Request's query string. When the parameter \`hbase.security.authentication.spnego.kerberos.proxyuser.enable\` is set to \`true\` in hbase-site.xml (default is \`false\`), the HBase UI will attempt to impersonate the user specified by the query parameter "doAs". This query parameter is checked case-insensitively. When this option is not provided, the user who executed the request is the "real" user and there is no ability to execute impersonation against the WebUI.
5907
5908 For example, if the user "bob" with Kerberos credentials executes a request against the WebUI with this feature enabled and a query string which includes \`doAs=alice\`, the HBase UI will treat this request as executed as \`alice\`, not \`bob\`.
5909
5910 The standard Hadoop proxyuser configuration properties to limit users who may impersonate others apply to this change (e.g. to enable \`bob\` to impersonate \`alice\`). See the Hadoop documentation for more information on how to configure these proxyuser rules.
5911
5912
5913 ---
5914
5915 * [HBASE-24143](https://issues.apache.org/jira/browse/HBASE-24143) | *Major* | **[JDK11] Switch default garbage collector from CMS**
5916
5917 <!-- markdown -->
5918 `bin/hbase` will now dynamically select a Garbage Collector implementation based on the detected JVM version. JDKs 8,9,10 use `-XX:+UseConcMarkSweepGC`, while JDK11+ use `-XX:+UseG1GC`.
5919
5920 Notice a slight compatibility change. Previously, the garbage collector choice would always be appended to a user-provided value for `HBASE_OPTS`. As of this change, this setting will only be applied when `HBASE_OPTS` is unset. That means that operators who provide a value for this variable will now need to also specify the collector. This is especially important for those on JDK8, where the vm default GC is not the recommended ConcMarkSweep.
5921
5922
5923 ---
5924
5925 * [HBASE-24024](https://issues.apache.org/jira/browse/HBASE-24024) | *Major* | **Optionally reject multi() requests with very high no of rows**
5926
5927 New Config: hbase.rpc.rows.size.threshold.reject
5928 -----------------------------------------------------------------------
5929
5930 Default value: false
5931 Description:
5932 If value is true, RegionServer will abort batch requests of Put/Delete with number of rows in a batch operation exceeding threshold defined by value of config: hbase.rpc.rows.warning.threshold.
5933
5934
5935 ---
5936
5937 * [HBASE-24139](https://issues.apache.org/jira/browse/HBASE-24139) | *Critical* | **Balancer should avoid leaving idle region servers**
5938
5939 StochasticLoadBalancer functional improvement:
5940
5941 StochasticLoadBalancer would rebalance the cluster if there are any idle RegionServers in the cluster (RegionServer having no region), while other RegionServers have at least 1 region available.
5942
5943
5944 ---
5945
5946 * [HBASE-24196](https://issues.apache.org/jira/browse/HBASE-24196) | *Major* | **[Shell] Add rename rsgroup command in hbase shell**
5947
5948 user or admin can now use
5949 hbase shell \> rename\_rsgroup 'oldname', 'newname'
5950 to rename rsgroup.
5951
5952
5953 ---
5954
5955 * [HBASE-24218](https://issues.apache.org/jira/browse/HBASE-24218) | *Major* | **Add hadoop 3.2.x in hadoop check**
5956
5957 Add hadoop-3.2.0 and hadoop-3.2.1 in hadoop check and when '--quick-hadoopcheck' we will only check hadoop-3.2.1.
5958
5959 Notice that, for aligning the personality scripts across all the active branches, we will commit the patch to all active branches, but the hadoop-3.2.x support in hadoopcheck is only applied to branch-2.2+.
5960
5961
5962 ---
5963
5964 * [HBASE-23829](https://issues.apache.org/jira/browse/HBASE-23829) | *Major* | **Get \`-PrunSmallTests\` passing on JDK11**
5965
5966 \`-PrunSmallTests\` now pass on JDK11 when using \`-Phadoop.profile=3.0\`.
5967
5968
5969 ---
5970
5971 * [HBASE-24185](https://issues.apache.org/jira/browse/HBASE-24185) | *Major* | **Junit tests do not behave well with System.exit or Runtime.halt or JVM exits in general.**
5972
5973 Tests that fail because a process -- RegionServer or Master -- called System.exit, will now instead throw an exception.
5974
5975
5976 ---
5977
5978 * [HBASE-24072](https://issues.apache.org/jira/browse/HBASE-24072) | *Major* | **Nightlies reporting OutOfMemoryError: unable to create new native thread**
5979
5980 Hadoop hosts have had their ulimit -u raised from 10000 to 30000 (per user, by INFRA). The Docker build container has had its limit raised from 10000 to 12500.
5981
5982
5983 ---
5984
5985 * [HBASE-24112](https://issues.apache.org/jira/browse/HBASE-24112) | *Major* | **[RSGroup] Support renaming rsgroup**
5986
5987 Support RSGroup renaming in core codebase. New API Admin#renameRSGroup(String, String) is introduced in 3.0.0.
5988
5989
5990 ---
5991
5992 * [HBASE-23994](https://issues.apache.org/jira/browse/HBASE-23994) | *Trivial* | ** Add WebUI to Canary**
5993
5994 <!-- markdown -->
5995 The Canary tool now offers a WebUI when run in `region` mode (the default mode). It is enabled by default, and by default, it binds to `0.0.0.0:16050`. This can be overridden by setting `hbase.canary.info.bindAddress` and `hbase.canary.info.port`. To disable entirely, set the port to `-1`.
5996
5997
5998 ---
5999
6000 * [HBASE-23779](https://issues.apache.org/jira/browse/HBASE-23779) | *Major* | **Up the default fork count to make builds complete faster; make count relative to CPU count**
6001
6002 Pass --threads=2 building on jenkins. It shortens nightly build times by about ~25%.
6003
6004 It works by running module build/test in parallel when dependencies allow. Upping the forkcount beyond the pom default of 0.25C would have us broach our CPU budget on jenkins when two modules are running in parallel (2 modules at 0.25% of CPU each makes 0.5C and on jenkins, hadoop nodes run two jenkins executors per host).  Higher forkcounts also seems to threaten build stability.
6005
6006 For running tests locally, to go faster, up fork count.
6007
6008 $ x="0.5C"  ;  mvn --threads=2  -Dsurefire.firstPartForkCount=$x -Dsurefire.secondPartForkCount=$x test -PrunAllTests
6009
6010 You could up the x from 0.5C to 1.0C but YMMV (On overcommitted hardware, tests start bombing out pretty soon after startup). You could try upping thread count but on occasion are likely to overcommit hardware.
6011
6012
6013 ---
6014
6015 * [HBASE-24126](https://issues.apache.org/jira/browse/HBASE-24126) | *Major* | **Up the container nproc uplimit from 10000 to 12500**
6016
6017 Start docker with upped ulimit for nproc passing '--ulimit nproc=12500'. It was 10000, the default, but made it 12500. Then, set PROC\_LIMIT in hbase-personality so when yetus runs, it is w/ the new 12500 value.
6018
6019
6020 ---
6021
6022 * [HBASE-24150](https://issues.apache.org/jira/browse/HBASE-24150) | *Major* | **Allow module tests run in parallel**
6023
6024 Pass -T2 to mvn. Makes it so we do two modules-at-a-time dependencies willing. Helps speed build and testing. Doubles the resource usage when running modules in parallel.
6025
6026
6027 ---
6028
6029 * [HBASE-24121](https://issues.apache.org/jira/browse/HBASE-24121) | *Major* | **[Authorization] ServiceAuthorizationManager isn't dynamically updatable. And it should be.**
6030
6031 Master & RegionService now support refresh policy authorization defined in hbase-policy.xml without restarting service. To refresh policy, please execute hbase shell command: update\_config or update\_config\_all after policy file updated and synced on all nodes.
6032
6033
6034 ---
6035
6036 * [HBASE-24099](https://issues.apache.org/jira/browse/HBASE-24099) | *Major* | **Use a fair ReentrantReadWriteLock for the region close lock**
6037
6038 This change modifies the default acquisition policy for the region's close lock in order to prevent observed starvation of close requests. The new boolean configuration parameter 'hbase.regionserver.fair.region.close.lock' controls the lock acquisition policy: if true, the lock is created in fair mode (default); if false, the lock is created in nonfair mode (the old default).
6039
6040
6041 ---
6042
6043 * [HBASE-23153](https://issues.apache.org/jira/browse/HBASE-23153) | *Major* | **PrimaryRegionCountSkewCostFunction SLB function should implement CostFunction#isNeeded**
6044
6045 <!-- markdown -->
6046 The `PrimaryRegionCountSkewCostFunction` for the `StochasticLoadBalancer` is only needed when the read replicas feature is enabled. With this change, that function now properly indicates that it is not needed when the read replica feature is off.
6047
6048 If this improvement is not available, operators with clusters that are not using the read replica feature should manually disable it by setting `hbase.master.balancer.stochastic.primaryRegionCountCost` to `0.0` in hbase-site.xml for all HBase Masters.
6049
6050
6051 ---
6052
6053 * [HBASE-24055](https://issues.apache.org/jira/browse/HBASE-24055) | *Major* | **Make AsyncFSWAL can run on EC cluster**
6054
6055 Now AsyncFSWAL can also be used against the directory which has EC enabled. Need to make sure you also make use of the hadoop 3.x client as the option is only available in hadoop 3.x.
6056
6057
6058 ---
6059
6060 * [HBASE-24113](https://issues.apache.org/jira/browse/HBASE-24113) | *Major* | **Upgrade the maven we use from 3.5.4 to 3.6.3 in nightlies**
6061
6062 Branches-2.3+ use maven 3.5.3 building. Older branches use 3.5.4 still.
6063
6064
6065 ---
6066
6067 * [HBASE-24122](https://issues.apache.org/jira/browse/HBASE-24122) | *Major* | **Change machine ulimit-l to ulimit-a so dumps full ulimit rather than just 'max locked memory'**
6068
6069 Our 'Build Artifacts' have a machine directory under which we emit vitals on the host the build was run on. We used to emit the result of 'ulimit -l' as a file named 'ulimit-l'. This has been hijacked to instead emit result of running 'ulimit -a' which includes stat on ulimit -l.
6070
6071
6072 ---
6073
6074 * [HBASE-23678](https://issues.apache.org/jira/browse/HBASE-23678) | *Major* | **Literate builder API for version management in schema**
6075
6076 ColumnFamilyDescriptor new builder API:
6077
6078     /\*\*
6079      \* Retain all versions for a given TTL(retentionInterval), and then only a specific number
6080      \* of versions(versionAfterInterval) after that interval elapses.
6081      \*
6082      \* @param retentionInterval Retain all versions for this interval
6083      \* @param versionAfterInterval Retain no of versions to retain after retentionInterval
6084      \*/
6085     public ModifyableColumnFamilyDescriptor setVersionsWithTimeToLive(
6086         final int retentionInterval, final int versionAfterInterval)
6087
6088
6089 ---
6090
6091 * [HBASE-24050](https://issues.apache.org/jira/browse/HBASE-24050) | *Major* | **Deprecated PBType on all 2.x branches**
6092
6093 org.apache.hadoop.hbase.types.PBType is marked as deprecated without any replacement. It will be moved to hbase-example module and marked as IA.Private in 3.0.0. This is a mistake as it should not be part of our public API. Users who depend on this class should just copy the code your own code base.
6094
6095
6096 ---
6097
6098 * [HBASE-8868](https://issues.apache.org/jira/browse/HBASE-8868) | *Minor* | **add metric to report client shortcircuit reads**
6099
6100 Expose file system level read metrics for RegionServer.
6101
6102 If the HBase RS runs on top of HDFS, calculate the aggregation of
6103 ReadStatistics of each HdfsFileInputStream. These metrics include:
6104 (1) total number of bytes read from HDFS.
6105 (2) total number of bytes read from local DataNode.
6106 (3) total number of bytes read locally through short-circuit read.
6107 (4) total number of bytes read locally through zero-copy read.
6108
6109 Because HDFS ReadStatistics is calculated per input stream, it is not
6110 feasible to update the aggregated number in real time. Instead, the
6111 metrics are updated when an input stream is closed.
6112
6113
6114 ---
6115
6116 * [HBASE-24032](https://issues.apache.org/jira/browse/HBASE-24032) | *Major* | **[RSGroup] Assign created tables to respective rsgroup automatically instead of manual operations**
6117
6118 Admin can determine which tables go to which rsgroup by script  (setting hbase.rsgroup.table.mapping.script with local filystem path) on Master side which aims to lighten the burden of admin operations.  Note, since HBase 3+, rsgroup can be specified in TableDescriptor as well, if clients specify this, master will skip the determination from script.
6119
6120 Here is a simple example of script:
6121 {code}
6122 # Input consists of two string, 1st is the namespace of the table, 2nd is the table name of the table
6123 #!/bin/bash
6124 namespace=$1
6125 tablename=$2
6126 if [[ $namespace == test ]]; then
6127   echo test
6128 elif [[ $tablename == \*foo\* ]]; then
6129   echo other
6130 else
6131   echo default
6132 fi
6133 {code}
6134
6135
6136 ---
6137
6138 * [HBASE-23993](https://issues.apache.org/jira/browse/HBASE-23993) | *Major* | **Use loopback for zk standalone server in minizkcluster**
6139
6140 MiniZKCluster now puts up its standalone node listening on loopback/127.0.0.1 rather than "localhost".
6141
6142
6143 ---
6144
6145 * [HBASE-23986](https://issues.apache.org/jira/browse/HBASE-23986) | *Major* | **Bump hadoop-two.version to 2.10.0 on master and branch-2**
6146
6147 Bumped hadoop-two.version to 2.10.0, which means we will drop the support for hadoop-2.8.x and hadoop-2.9.x.
6148
6149
6150 ---
6151
6152 * [HBASE-23930](https://issues.apache.org/jira/browse/HBASE-23930) | *Minor* | **Shell should attempt to format \`timestamp\` attributes as ISO-8601**
6153
6154 Change timestamp display to be ISO8601 when toString on Cell and outputting in shell....
6155
6156 User used to see....
6157
6158   column=table:state, timestamp=1583967620343 .....
6159
6160 ... but now sees:
6161
6162   column=table:state, timestamp=2020-03-11T23:00:20.343Z ....
6163
6164
6165 ---
6166
6167 * [HBASE-22827](https://issues.apache.org/jira/browse/HBASE-22827) | *Major* | **Expose multi-region merge in shell and Admin API**
6168
6169 merge\_region shell command can now be used to merge more than 2 regions as well. It takes a list of regions as comma separated values or as an array of regions, and not just 2 regions. The full regionnames and encoded regionnames are continued to be accepted.
6170
6171
6172 ---
6173
6174 * [HBASE-23767](https://issues.apache.org/jira/browse/HBASE-23767) | *Major* | **Add JDK11 compilation and unit test support to Github precommit**
6175
6176 Rebuild our Dockerfile with support for multiple JDK versions. Use multiple stages in the Jenkinsfile instead of yetus's multijdk because of YETUS-953. Run those multiple stages in parallel to speed up results.
6177
6178 Note that multiple stages means multiple Yetus invocations means multiple comments on the PreCommit. This should become more obvious to users once we can make use of GitHub Checks API, HBASE-23902.
6179
6180
6181 ---
6182
6183 * [HBASE-22978](https://issues.apache.org/jira/browse/HBASE-22978) | *Minor* | **Online slow response log**
6184
6185 get\_slowlog\_responses and clear\_slowlog\_responses are used to retrieve and clear slow RPC logs from RingBuffer maintained by RegionServers.
6186
6187 New Admin APIs:
6188 1.   List\<SlowLogRecord\> getSlowLogResponses(final Set\<ServerName\> serverNames,
6189       final SlowLogQueryFilter slowLogQueryFilter) throws IOException;
6190
6191 2.   List\<Boolean\> clearSlowLogResponses(final Set\<ServerName\> serverNames)
6192       throws IOException;
6193
6194 Configs:
6195
6196 1. hbase.regionserver.slowlog.ringbuffer.size:
6197 Default size of ringbuffer to be maintained by each RegionServer in order to store online slowlog responses. This is an in-memory ring buffer of requests that were judged to be too slow in addition to the responseTooSlow logging. The in-memory representation would be complete. For more details, please look into Doc Section: Get Slow Response Log from shell
6198
6199 Default
6200 256
6201
6202 2. hbase.regionserver.slowlog.buffer.enabled:
6203 Indicates whether RegionServers have ring buffer running for storing Online Slow logs in FIFO manner with limited entries. The size of the ring buffer is indicated by config: hbase.regionserver.slowlog.ringbuffer.size The default value is false, turn this on and get latest slowlog responses with complete data.
6204
6205 Default
6206 false
6207
6208
6209 For more details, please look into "Get Slow Response Log from shell" section from HBase book.
6210
6211
6212 ---
6213
6214 * [HBASE-23926](https://issues.apache.org/jira/browse/HBASE-23926) | *Major* | **[Flakey Tests] Down the flakies re-run ferocity; it makes for too many fails.**
6215
6216 Down the flakey re-rerun fork count from 1.0C -- i.e. a fork per CPU -- to 0.25C. On a recent run, the machine had 16 cores. 0.25 is 4 cores. We'd hardcoded fork count at 3 previous to changes made by parent.
6217
6218
6219 ---
6220
6221 * [HBASE-23146](https://issues.apache.org/jira/browse/HBASE-23146) | *Major* | **Support CheckAndMutate with multiple conditions**
6222
6223 Add a checkAndMutate(row, filter) method in the AsyncTable interface and the Table interface.
6224
6225 This method atomically checks if the row matches the specified filter. If it does, it adds the Put/Delete/RowMutations.
6226
6227 This is a fluent style API, the code is like:
6228
6229 For Table interface:
6230 {code}
6231 table.checkAndMutate(row, filter).thenPut(put);
6232 {code}
6233
6234 For AsyncTable interface:
6235 {code}
6236 table.checkAndMutate(row, filter).thenPut(put)
6237     .thenAccept(succ -\> {
6238       if (succ) {
6239         System.out.println("Check and put succeeded");
6240       } else {
6241         System.out.println("Check and put failed");
6242       }
6243     });
6244 {code}
6245
6246
6247 ---
6248
6249 * [HBASE-23874](https://issues.apache.org/jira/browse/HBASE-23874) | *Minor* | **Move Jira-attached file precommit definition from script in Jenkins config to dev-support**
6250
6251 The Jira Precommit job (https://builds.apache.org/job/PreCommit-HBASE-Build/) will now look for a file within the source tree (dev-support/jenkins\_precommit\_jira\_yetus.sh) instead of depending on a script section embedded in the job.
6252
6253
6254 ---
6255
6256 * [HBASE-23865](https://issues.apache.org/jira/browse/HBASE-23865) | *Major* | **Up flakey history from 5 to 10**
6257
6258 Changed flakey list reporting to show 5 rather than 10 items. Also changed the second and first part fort counts to be 1C rather than hardcoded 3.
6259
6260
6261 ---
6262
6263 * [HBASE-23554](https://issues.apache.org/jira/browse/HBASE-23554) | *Major* | **Encoded regionname to regionname utility**
6264
6265     Adds shell command regioninfo:
6266
6267       hbase(main):001:0\>  regioninfo '0e6aa5c19ae2b2627649dc7708ce27d0'
6268       {ENCODED =\> 0e6aa5c19ae2b2627649dc7708ce27d0, NAME =\> 'TestTable,,1575941375972.0e6aa5c19ae2b2627649dc7708ce27d0.', STARTKEY =\> '', ENDKEY =\> '00000000000000000000299441'}
6269       Took 0.4737 seconds
6270
6271
6272 ---
6273
6274 * [HBASE-23350](https://issues.apache.org/jira/browse/HBASE-23350) | *Major* | **Make compaction files cacheonWrite configurable based on threshold**
6275
6276 This JIRA adds a new configuration - \`hbase.rs.cachecompactedblocksonwrite.threshold\`. This configuration is the maximum total size (in bytes) of the compacted files below which the configuration \`hbase.rs.cachecompactedblocksonwrite\` is honoured. If the total size of the compacted fies exceeds this threshold, even when \`hbase.rs.cachecompactedblocksonwrite\` is enabled, the data blocks are not cached. Caching index and bloom blocks is not affected by this configuration (user configuration is always honoured).
6277
6278 Default value of this configuration is Long.MAX\_VALUE. This means whatever the total size of the compacted files, it wil be cached.
6279
6280
6281 ---
6282
6283 * [HBASE-17115](https://issues.apache.org/jira/browse/HBASE-17115) | *Major* | **HMaster/HRegion Info Server does not honour admin.acl**
6284
6285 Implements authorization for the HBase Web UI by limiting access to certain endpoints which could be used to extract sensitive information from HBase.
6286
6287 Access to these restricted endpoints can be limited to a group of administrators, identified either by a list of users (hbase.security.authentication.spnego.admin.users) or by a list of groups
6288 (hbase.security.authentication.spnego.admin.groups).  By default, neither of these values are set which will preserve backwards compatibility (allowing all authenticated users to access all endpoints).
6289
6290 Further, users who have sensitive information in the HBase service configuration can set hbase.security.authentication.ui.config.protected to true which will treat the configuration endpoint as a protected, admin-only resource. By default, all authenticated users may access the configuration endpoint.
6291
6292
6293 ---
6294
6295 * [HBASE-23647](https://issues.apache.org/jira/browse/HBASE-23647) | *Major* | **Make MasterRegistry the default registry impl**
6296
6297 <!-- markdown -->
6298 Enables master based registry as the default registry used by clients to fetch connection metadata.
6299 Refer to the section "Master Registry" in the client documentation for more details and advantages
6300 of this implementation over the default Zookeeper based registry.
6301
6302 Configuration parameter that controls the registry in use: `hbase.client.registry.impl`
6303
6304 Where to set this: HBase client configuration (hbase-site.xml)
6305
6306 Possible values:
6307 - `org.apache.hadoop.hbase.client.ZKConnectionRegistry` (For ZK based registry implementation)
6308 - `org.apache.hadoop.hbase.client.MasterRegistry` (New, for master based registry implementation)
6309
6310 Notes on defaults:
6311
6312 - For v3.0.0 and later, MasterRegistry is the default registry
6313 - For all releases in 2.x line, ZK based registry is the default.
6314
6315 This feature has been back ported to 2.3.0 and later releases. MasterRegistry can be enabled by setting the following client configuration.
6316
6317 ```
6318 <property>
6319   <name>hbase.client.registry.impl</name>
6320   <value>org.apache.hadoop.hbase.client.MasterRegistry</value>
6321 </property>
6322 ```
6323
6324
6325 ---
6326
6327 * [HBASE-23069](https://issues.apache.org/jira/browse/HBASE-23069) | *Critical* | **periodic dependency bump for Sep 2019**
6328
6329 caffeine: 2.6.2 =\> 2.8.1
6330 commons-codec: 1.10 =\> 1.13
6331 commons-io: 2.5 =\> 2.6
6332 disrupter: 3.3.6 =\> 3.4.2
6333 httpcore: 4.4.6 =\> 4.4.13
6334 jackson: 2.9.10 =\> 2.10.1
6335 jackson.databind: 2.9.10.1 =\> 2.10.1
6336 jetty: 9.3.27.v20190418 =\> 9.3.28.v20191105
6337 protobuf.plugin: 0.5.0 =\> 0.6.1
6338 zookeeper: 3.4.10 =\> 3.4.14
6339 slf4j: 1.7.25 =\> 1.7.30
6340 rat: 0.12 =\> 0.13
6341 asciidoctor: 1.5.5 =\> 1.5.8
6342 asciidoctor.pdf: 1.5.0-alpha.15 =\> 1.5.0-rc.2
6343 error-prone: 2.3.3 =\> 2.3.4
6344
6345
6346 ---
6347
6348 * [HBASE-23686](https://issues.apache.org/jira/browse/HBASE-23686) | *Major* | **Revert binary incompatible change and remove reflection**
6349
6350 - Reverts a binary incompatible binary change for ByteRangeUtils
6351 - Usage of reflection inside CommonFSUtils removed
6352
6353
6354 ---
6355
6356 * [HBASE-23347](https://issues.apache.org/jira/browse/HBASE-23347) | *Major* | **Pluggable RPC authentication**
6357
6358 This change introduces an internal abstraction layer which allows for new SASL-based authentication mechanisms to be used inside HBase services. All existing SASL-based authentication mechanism were ported to the new abstraction, making no external change in runtime semantics, client API, or RPC serialization format.
6359
6360 Developers familiar with extending HBase can implement authentication mechanism beyond simple Kerberos and DelegationTokens which authenticate HBase users against some other user database. HBase service authentication (Master to/from RegionServer) continue to operate solely over Kerberos.
6361
6362
6363 ---
6364
6365 * [HBASE-23156](https://issues.apache.org/jira/browse/HBASE-23156) | *Major* | **start-hbase.sh failed with ClassNotFoundException when build with hadoop3**
6366
6367 Introduce a new hbase-assembly/src/main/assembly/hadoop-three-compat.xml for build with hadoop 3.x.
6368
6369
6370 ---
6371
6372 * [HBASE-23680](https://issues.apache.org/jira/browse/HBASE-23680) | *Major* | **RegionProcedureStore missing cleaning of hfile archive**
6373
6374 Add a new config to hbase-default.xml
6375
6376   \<property\>
6377     \<name\>hbase.procedure.store.region.hfilecleaner.plugins\</name\>
6378     \<value\>org.apache.hadoop.hbase.master.cleaner.TimeToLiveHFileCleaner\</value\>
6379     \<description\>A comma-separated list of BaseHFileCleanerDelegate invoked by
6380     the RegionProcedureStore HFileCleaner service. These HFiles cleaners are
6381     called in order, so put the cleaner that prunes the most files in front. To
6382     implement your own BaseHFileCleanerDelegate, just put it in HBase's classpath
6383     and add the fully qualified class name here. Always add the above
6384     default hfile cleaners in the list as they will be overwritten in
6385     hbase-site.xml.\</description\>
6386   \</property\>
6387
6388 It will share the same TTL with other HFileCleaners. And you can also implement your own cleaner and change this property to enable it.
6389
6390
6391 ---
6392
6393 * [HBASE-23675](https://issues.apache.org/jira/browse/HBASE-23675) | *Minor* | **Move to Apache parent POM version 22**
6394
6395 Updated parent pom to Apache version 22.
6396
6397
6398 ---
6399
6400 * [HBASE-23679](https://issues.apache.org/jira/browse/HBASE-23679) | *Critical* | **FileSystem instance leaks due to bulk loads with Kerberos enabled**
6401
6402 This issues fixes an issue with Bulk Loading on installations with Kerberos enabled and more than a single RegionServer. When multiple tables are involved in hosting a table's regions which are being bulk-loaded into, all but the RegionServer hosting the table's first Region will "leak" one DistributedFileSystem object onto the heap, never freeing that memory. Eventually, with enough bulk loads, this will create a situation for RegionServers where they have no free heap space and will either spend all time in JVM GC, lose their ZK session, or crash with an OutOfMemoryError.
6403
6404 The only mitigation for this issue is to periodically restart RegionServers. All earlier versions of HBase 2.x are subject to this issue (2.0.x, \<=2.1.8, \<=2.2.3)
6405
6406
6407 ---
6408
6409 * [HBASE-23286](https://issues.apache.org/jira/browse/HBASE-23286) | *Major* | **Improve MTTR: Split WAL to HFile**
6410
6411 Add a new feature to improve MTTR which have 3 steps to failover:
6412 1. Read WAL and write HFile to region’s column family’s recovered.hfiles directory.
6413 2. Open region.
6414 3. Bulkload the recovered.hfiles for every column family.
6415
6416 Compared to DLS(distributed log split), this feature will reduce region open time significantly.
6417
6418 Config hbase.wal.split.to.hfile to true to enable this featue.
6419
6420
6421 ---
6422
6423 * [HBASE-23619](https://issues.apache.org/jira/browse/HBASE-23619) | *Trivial* | **Use built-in formatting for logging in hbase-zookeeper**
6424
6425 Changed the logging in hbase-zookeeper to use built-in formatting
6426
6427
6428 ---
6429
6430 * [HBASE-23628](https://issues.apache.org/jira/browse/HBASE-23628) | *Minor* | **Replace Apache Commons Digest Base64 with JDK8 Base64**
6431
6432 From the PR:
6433
6434 "Yes. The two create the same output... I just wrote a small test suite to increase my confidence on that. I generated many tens of millions of random byte patterns and compared the output of the two algorithms. They came back identical every time.
6435
6436 "Just in case any inquiring minds would like to know, there is no longer an encoding required when generating the strings. The JDK implementation specifically specifies that strings returned are StandardCharsets.ISO\_8859\_1. This does not change anything because UTF8 and ISO\_8859 overlap for the limited character set (64 characters) the encoding uses."
6437
6438
6439 ---
6440
6441 * [HBASE-23651](https://issues.apache.org/jira/browse/HBASE-23651) | *Major* | **Region balance throttling can be disabled**
6442
6443 Set hbase.balancer.max.balancing to a int value which \<=0 will disable region balance throttling.
6444
6445
6446 ---
6447
6448 * [HBASE-23588](https://issues.apache.org/jira/browse/HBASE-23588) | *Major* | **Cache index blocks and bloom blocks on write if CacheCompactedBlocksOnWrite is enabled**
6449
6450 If cacheOnWrite is enabled during flush or compaction, index and bloom blocks(with data blocks) would be automatically cached during write.
6451
6452
6453 ---
6454
6455 * [HBASE-23369](https://issues.apache.org/jira/browse/HBASE-23369) | *Major* | **Auto-close 'unknown' Regions reported as OPEN on RegionServers**
6456
6457 If a RegionServer reports a Region as OPEN in disagreement with Master's status on the Region, the Master now tells the RegionServer to silently close the Region.
6458
6459
6460 ---
6461
6462 * [HBASE-23596](https://issues.apache.org/jira/browse/HBASE-23596) | *Major* | **HBCKServerCrashProcedure can double assign**
6463
6464 Makes it so the recently added HBCKServerCrashProcedure -- the SCP that gets invoked when an operator schedules an SCP via hbck2 scheduleRecoveries command -- now works the same as SCP EXCEPT if master knows nothing of the scheduled servername. In this latter case, HBCKSCP will do a full scan of hbase:meta looking for instances of the passed servername. If any found it will attempt cleanup of hbase:meta references by reassigning any found OPEN or OPENING and by closing any in CLOSING state.
6465
6466 Used to fix instances of what the 'HBCK Report' page shows as 'Unknown Servers'.
6467
6468
6469 ---
6470
6471 * [HBASE-23624](https://issues.apache.org/jira/browse/HBASE-23624) | *Major* | **Add a tool to dump the procedure info in HFile**
6472
6473 Use ./hbase org.apache.hadoop.hbase.procedure2.store.region.HFileProcedurePrettyPrinter to run the tool.
6474
6475
6476 ---
6477
6478 * [HBASE-23590](https://issues.apache.org/jira/browse/HBASE-23590) | *Major* | **Update maxStoreFileRefCount to maxCompactedStoreFileRefCount**
6479
6480 RegionsRecoveryChore introduced as part of HBASE-22460 tries to reopen regions based on config: hbase.regions.recovery.store.file.ref.count.
6481 Region reopen needs to take into consideration all compacted away store files that belong to the region and not store files(non-compacted).
6482
6483 Fixed this bug as part of this Jira.
6484 Updated description for corresponding configs:
6485
6486 1. hbase.master.regions.recovery.check.interval :
6487
6488 Regions Recovery Chore interval in milliseconds. This chore keeps running at this interval to find all regions with configurable max store file ref count and reopens them. Defaults to 20 mins
6489
6490 2. hbase.regions.recovery.store.file.ref.count :
6491
6492 Very large number of ref count on a compacted store file indicates that it is a ref leak on that object(compacted store file). Such files can not be removed after it is invalidated via compaction. Only way to recover in such scenario is to reopen the region which can release all resources, like the refcount, leases, etc. This config represents Store files Ref Count threshold value considered for reopening regions. Any region with compacted store files ref count \> this value would be eligible for reopening by master. Here, we get the max refCount among all refCounts on all compacted away store files that belong to a particular region. Default value -1 indicates this feature is turned off. Only positive integer value should be provided to enable this feature.
6493
6494
6495 ---
6496
6497 * [HBASE-23618](https://issues.apache.org/jira/browse/HBASE-23618) | *Major* | **Add a tool to dump procedure info in the WAL file**
6498
6499 Use ./hbase org.apache.hadoop.hbase.procedure2.store.region.WALProcedurePrettyPrinter to run the tool.
6500
6501
6502 ---
6503
6504 * [HBASE-23617](https://issues.apache.org/jira/browse/HBASE-23617) | *Major* | **Add a stress test tool for region based procedure store**
6505
6506 Use ./hbase org.apache.hadoop.hbase.procedure2.store.region.RegionProcedureStorePerformanceEvaluation to run the tool.
6507
6508
6509 ---
6510
6511 * [HBASE-23326](https://issues.apache.org/jira/browse/HBASE-23326) | *Critical* | **Implement a ProcedureStore which stores procedures in a HRegion**
6512
6513 Use a region based procedure store to replace the old customized WAL based procedure store. The procedure data migration is done automatically during upgrading. After upgrading, the MasterProcWALs directory will be deleted and a new MasterProc directory will be created. And notice that a region will still write WAL so we still have WAL files and they will be moved to the oldWALs directory. The file name is mostly like a normal WAL file, and the only difference is that it is ended with "$masterproc$".
6514
6515
6516 ---
6517
6518 * [HBASE-23320](https://issues.apache.org/jira/browse/HBASE-23320) | *Major* | **Upgrade surefire plugin to 3.0.0-M4**
6519
6520 Bumped surefire plugin to 3.0.0-M4
6521
6522
6523 ---
6524
6525 * [HBASE-20461](https://issues.apache.org/jira/browse/HBASE-20461) | *Major* | **Implement fsync for AsyncFSWAL**
6526
6527 Now AsyncFSWAL also supports Durability.FSYNC\_WAL.
6528
6529
6530 ---
6531
6532 * [HBASE-23066](https://issues.apache.org/jira/browse/HBASE-23066) | *Minor* | **Create a config that forces to cache blocks on compaction**
6533
6534 The configuration 'hbase.rs.cacheblocksonwrite' was used to enable caching the blocks on write. But purposefully we were not caching the blocks when we do compaction (since it may be very aggressive) as the caching happens as and when the writer completes a block.
6535 In cloud environments since they have bigger sized caches - though they try to enable 'hbase.rs.prefetchblocksonopen' (non - aggressive way of caching the blocks proactively on reader creation) it does not help them because it takes time to cache the compacted blocks.
6536 This feature creates a new configuration  'hbase.rs.cachecompactedblocksonwrite' which when set to 'true' will enable the blocks created out of compaction.
6537 Remember that since it is aggressive caching the user should be having enough cache space - if not it may lead to other active blocks getting evicted.
6538 From the shell this can be enabled by using the option per Column Family also by using the below format
6539 {code}
6540 create 't1', 'f1', {NUMREGIONS =\> 15, SPLITALGO =\> 'HexStringSplit', CONFIGURATION =\> {'hbase.rs.cachecompactedblocksonwrite' =\> 'true'}}
6541 {code}
6542
6543
6544 ---
6545
6546 * [HBASE-23239](https://issues.apache.org/jira/browse/HBASE-23239) | *Major* | **Reporting on status of backing MOB files from client-facing cells**
6547
6548 <!-- markdown -->
6549
6550 Users of the MOB feature can now use the `mobrefs` utility to get statistics about data in the MOB system and verify the health of backing files on HDFS.
6551
6552 ```
6553 HADOOP_CLASSPATH=/etc/hbase/conf:$(hbase mapredcp) yarn jar \
6554     /some/path/to/hbase-shaded-mapreduce.jar mobrefs mobrefs-report-output some_table foo
6555 ```
6556
6557 See javadocs of the class `MobRefReporter` for more details.
6558
6559 the reference guide has added some information about MOB internals and troubleshooting.
6560
6561
6562 ---
6563
6564 * [HBASE-23549](https://issues.apache.org/jira/browse/HBASE-23549) | *Minor* | **Document steps to disable MOB for a column family**
6565
6566 The reference guide now includes a walk through of disabling the MOB feature if needed while maintaining availability.
6567
6568
6569 ---
6570
6571 * [HBASE-23582](https://issues.apache.org/jira/browse/HBASE-23582) | *Minor* | **Unbalanced braces in string representation of table descriptor**
6572
6573 Fixed unbalanced braces in string representation within HBase shell
6574
6575
6576 ---
6577
6578 * [HBASE-23293](https://issues.apache.org/jira/browse/HBASE-23293) | *Minor* | **[REPLICATION] make ship edits timeout configurable**
6579
6580 The default rpc timeout for ReplicationSourceShipper#shipEdits is 60s, when bulkload replication enabled, timeout exception may be occurred.
6581 Now we can conf the timeout value through replication.source.shipedits.timeout, and it’s adaptive.
6582
6583
6584 ---
6585
6586 * [HBASE-23312](https://issues.apache.org/jira/browse/HBASE-23312) | *Major* | **HBase Thrift SPNEGO configs (HBASE-19852) should be backwards compatible**
6587
6588 The newer HBase Thrift SPNEGO configs should not be required. The hbase.thrift.spnego.keytab.file and hbase.thrift.spnego.principal configs will fall back to the hbase.thrift.keytab.file and hbase.thrift.kerberos.principal original configs. The older configs will log a deprecation warning. It is preferred to new the newer SPNEGO configurations.
6589
6590
6591 ---
6592
6593 * [HBASE-22969](https://issues.apache.org/jira/browse/HBASE-22969) | *Minor* | **A new binary component comparator(BinaryComponentComparator) to perform comparison of arbitrary length and position**
6594
6595 With BinaryComponentCompartor applications will be able to design diverse and powerful set of filters for rows and columns. See https://issues.apache.org/jira/browse/HBASE-22969 for example. In general, the comparator can be used with any filter taking ByteArrayComparable. As of now, following filters take ByteArrayComparable:
6596
6597 1. RowFilter
6598 2. ValueFilter
6599 3. QualifierFilter
6600 4. FamilyFilter
6601 5. ColumnValueFilter
6602
6603
6604 ---
6605
6606 * [HBASE-23234](https://issues.apache.org/jira/browse/HBASE-23234) | *Major* | **Provide .editorconfig based on checkstyle configuration**
6607
6608 Adds a .editorconfig file with configurations populated by IntelliJ, based on our checkstyle configuration. There's lots of IntelliJ-specific configs in here that I assume are not replicated to Eclipse or Netbeans users. Any devs using those tools should push whatever updates they see fit, but please start with the checkstyle configs as the origin of truth.
6609
6610
6611 ---
6612
6613 * [HBASE-23322](https://issues.apache.org/jira/browse/HBASE-23322) | *Minor* | **[hbck2] Simplification on HBCKSCP scheduling**
6614
6615 An hbck2 scheduleRecoveries will run a subclass of ServerCrashProcedure which asks Master what Regions were on the dead Server but it will also do a hbase:meta table scan to see if any vestiges of the old Server remain (for the case where an SCP failed mid-point leaving references in place or where Master and hbase:meta deviated in accounting).
6616
6617
6618 ---
6619
6620 * [HBASE-23321](https://issues.apache.org/jira/browse/HBASE-23321) | *Minor* | **[hbck2] fixHoles of fixMeta doesn't update in-memory state**
6621
6622 If holes in hbase:meta, hbck2 fixMeta now will update Master in-memory state so you do not need to restart master just so you can assign the new hole-bridging regions.
6623
6624
6625 ---
6626
6627 * [HBASE-23282](https://issues.apache.org/jira/browse/HBASE-23282) | *Major* | **HBCKServerCrashProcedure for 'Unknown Servers'**
6628
6629 hbck2 scheduleRecoveries will now run a SCP that also looks in hbase:meta for any references to the scheduled server -- not just consult Master in-memory state -- just in case vestiges of the server are leftover in hbase:meta
6630
6631
6632 ---
6633
6634 * [HBASE-19450](https://issues.apache.org/jira/browse/HBASE-19450) | *Minor* | **Add log about average execution time for ScheduledChore**
6635
6636 <!-- markdown -->
6637 HBase internal chores now log a moving average of how long execution of each chore takes at `INFO` level for the logger `org.apache.hadoop.hbase.ScheduledChore`.
6638
6639 Such messages will happen at most once per five minutes.
6640
6641
6642 ---
6643
6644 * [HBASE-23250](https://issues.apache.org/jira/browse/HBASE-23250) | *Minor* | **Log message about CleanerChore delegate initialization should be at INFO**
6645
6646 CleanerChore delegate initialization is now logged at INFO level instead of DEBUG
6647
6648
6649 ---
6650
6651 * [HBASE-23243](https://issues.apache.org/jira/browse/HBASE-23243) | *Major* | **[pv2] Filter out SUCCESS procedures; on decent-sized cluster, plethora overwhelms problems**
6652
6653 The 'Procedures & Locks' tab in Master UI only displays problematic Procedures now (RUNNABLE, WAITING-TIMEOUT, etc.). It no longer notes procedures whose state is SUCCESS.
6654
6655
6656 ---
6657
6658 * [HBASE-23227](https://issues.apache.org/jira/browse/HBASE-23227) | *Blocker* | **Upgrade jackson-databind to 2.9.10.1 to avoid recent CVEs**
6659
6660 <!-- markdown -->
6661
6662 the Apache HBase REST Proxy now uses Jackson Databind version 2.9.10.1 to address the following CVEs
6663
6664   - CVE-2019-16942
6665   - CVE-2019-16943
6666
6667 Users of prior releases with Jackson Databind 2.9.10 are advised to either upgrade to this release or to upgrade their local Jackson Databind jar directly.
6668
6669
6670 ---
6671
6672 * [HBASE-23222](https://issues.apache.org/jira/browse/HBASE-23222) | *Critical* | **Better logging and mitigation for MOB compaction failures**
6673
6674 <!-- markdown -->
6675
6676 The MOB compaction process in the HBase Master now logs more about its activity.
6677
6678 In the event that you run into the problems described in HBASE-22075, there is a new HFileCleanerDelegate that will stop all removal of MOB hfiles from the archive area. It can be configured by adding `org.apache.hadoop.hbase.mob.ManualMobMaintHFileCleaner` to the list configured for `hbase.master.hfilecleaner.plugins`. This new cleaner delegate will cause your archive area to grow unbounded; you will have to manually prune files which may be prohibitively complex. Consider if your use case will allow you to mitigate by disabling mob compactions instead.
6679
6680 Caveats:
6681 * Be sure the list of cleaner delegates still includes the default cleaners you will likely need: ttl, snapshot, and hlink.
6682 * Be mindful that if you enable this cleaner delegate then there will be *no* automated process for removing these mob hfiles. You should see a single region per table in `%hbase_root%/archive` that accumulates files over time. You will have to determine which of these files are safe or not to remove.
6683 * You should list this cleaner delegate after the snapshot and hlink delegates so that you can enable sufficient logging to determine when an archived mob hfile is needed by those subsystems. When set to `TRACE` logging, the CleanerChore logger will include archive retention decision justifications.
6684 * If your use case creates a large number of uniquely named tables, this new delegate will cause memory pressure on the master.
6685
6686
6687 ---
6688
6689 * [HBASE-15519](https://issues.apache.org/jira/browse/HBASE-15519) | *Major* | **Add per-user metrics**
6690
6691 Adds per-user metrics for reads/writes to each RegionServer. These metrics are exported by default. hbase.regionserver.user.metrics.enabled can be used to disable the feature if desired for any reason.
6692
6693
6694 ---
6695
6696 * [HBASE-22460](https://issues.apache.org/jira/browse/HBASE-22460) | *Minor* | **Reopen a region if store reader references may have leaked**
6697
6698 Leaked store files can not be removed even after it is invalidated via compaction. A reasonable mitigation for a reader reference leak would be a fast reopen of the region on the same server.
6699
6700 Configs:
6701
6702 1. hbase.master.regions.recovery.check.interval :
6703
6704 Regions Recovery Chore interval in milliseconds. This chore keeps running at this interval to find all regions with configurable max store file ref count and reopens them. Defaults to 20 mins
6705
6706 2. hbase.regions.recovery.store.file.ref.count :
6707
6708 This config represents Store files Ref Count threshold value considered for reopening regions. Any region with store files ref count \> this value would be eligible for reopening by master. Default value -1 indicates this feature is turned off. Only positive integer value should be provided to enable this feature.
6709
6710
6711 ---
6712
6713 * [HBASE-23172](https://issues.apache.org/jira/browse/HBASE-23172) | *Minor* | **HBase Canary region success count metrics reflect column family successes, not region successes**
6714
6715 Added a comment to make clear that read/write success counts are tallying column family success counts, not region success counts.
6716
6717 Additionally, the region read and write latencies previously only stored the latencies of the last column family of the region reads/writes. This has been fixed by using a map of each region to a list of read and write latency values.
6718
6719
6720 ---
6721
6722 * [HBASE-23177](https://issues.apache.org/jira/browse/HBASE-23177) | *Major* | **If fail to open reference because FNFE, make it plain it is a Reference**
6723
6724 Changes the message on the FNFE exception thrown when the file a Reference points to is missing; the message now includes detail on Reference as well as pointed-to file so can connect how FNFE relates to region open.
6725
6726
6727 ---
6728
6729 * [HBASE-20626](https://issues.apache.org/jira/browse/HBASE-20626) | *Major* | **Change the value of "Requests Per Second" on WEBUI**
6730
6731 Use 'totalRowActionRequestCount' to calculate QPS on web UI.
6732
6733
6734 ---
6735
6736 * [HBASE-22874](https://issues.apache.org/jira/browse/HBASE-22874) | *Critical* | **Define a public interface for Canary and move existing implementation to LimitedPrivate**
6737
6738 <!-- markdown -->
6739 Downstream users who wish to programmatically check the health of their HBase cluster may now rely on a public interface derived from the previously private implementation of the canary cli tool. The interface is named `Canary` and can be found in the user facing javadocs.
6740
6741 Downstream users who previously relied on the invoking the canary via the Java classname (either on the command line or programmatically) will need to change how they do so because the non-public implementation has moved.
6742
6743
6744 ---
6745
6746 * [HBASE-23035](https://issues.apache.org/jira/browse/HBASE-23035) | *Major* | **Retain region to the last RegionServer make the failover slower**
6747
6748 Since 2.0.0，when one regionserver crashed and back online again, AssignmentManager will retain the region locations and try assign the regions to this regionserver(same host:port with the crashed one) again. But for 1.x.x, the behavior is round-robin assignment for the regions belong to the crashed regionserver. This jira change the "retain" assignment to round-robin assignment, which is same with 1.x.x version. This change will make the failover faster and improve availability.
6749
6750
6751 ---
6752
6753 * [HBASE-23046](https://issues.apache.org/jira/browse/HBASE-23046) | *Minor* | **Remove compatibility case from truncate command**
6754
6755 Remove backward compatibility from \`truncate\` and \`truncate\_preserve\` shell commands. This means that these commands from HBase Clients are not compatible with pre-0.99 HBase clusters.
6756
6757
6758 ---
6759
6760 * [HBASE-23040](https://issues.apache.org/jira/browse/HBASE-23040) | *Minor* | **region mover gives NullPointerException instead of saying a host isn't in the cluster**
6761
6762 giving the region mover "unload" command a region server name that isn't recognized by the cluster results in a "I don't know about that host" message instead of a NPE.
6763
6764 set log level to DEBUG if you'd like the region mover to log the set of region server names it got back from the cluster.
6765
6766
6767 ---
6768
6769 * [HBASE-21874](https://issues.apache.org/jira/browse/HBASE-21874) | *Major* | **Bucket cache on Persistent memory**
6770
6771 Added a new IOEngine type for Bucket cache ie Persistent memory. In order to use BC over pmem configure IOEngine as
6772 \<property\>
6773     \<name\>hbase.bucketcache.ioengine\</name\>
6774     \<value\> pmem:///path in persistent memory \</value\>
6775   \</property\>
6776
6777
6778 ---
6779
6780 * [HBASE-22760](https://issues.apache.org/jira/browse/HBASE-22760) | *Major* | **Stop/Resume Snapshot Auto-Cleanup activity with shell command**
6781
6782 By default, snapshot auto cleanup based on TTL would be enabled for any new cluster. At any point in time, if snapshot cleanup is supposed to be stopped due to some snapshot restore activity or any other reason, it is advisable to disable it using shell command:
6783 hbase\> snapshot\_cleanup\_switch false
6784
6785 We can re-enable it using:
6786 hbase\> snapshot\_cleanup\_switch true
6787
6788 We can query whether snapshot auto cleanup is enabled for cluster using:
6789 hbase\> snapshot\_cleanup\_enabled
6790
6791
6792 ---
6793
6794 * [HBASE-22796](https://issues.apache.org/jira/browse/HBASE-22796) | *Major* | **[HBCK2] Add fix of overlaps to fixMeta hbck Service**
6795
6796 Adds fix of overlaps to the fixMeta hbck service method. Uses the bulk-merge facility. Merges a max of 10 at a time. Set hbase.master.metafixer.max.merge.count to higher if you want to do more than 10 in the one go.
6797
6798
6799 ---
6800
6801 * [HBASE-21745](https://issues.apache.org/jira/browse/HBASE-21745) | *Critical* | **Make HBCK2 be able to fix issues other than region assignment**
6802
6803 This issue adds via its subtasks:
6804
6805  \* An 'HBCK Report' page to the Master UI added by HBASE-22527+HBASE-22709+HBASE-22723+ (since 2.1.6, 2.2.1, 2.3.0). Lists consistency or anomalies found via new hbase:meta consistency checking extensions added to CatalogJanitor (holes, overlaps, bad servers) and by a new 'HBCK chore' that runs at a lesser periodicity that will note filesystem orphans and overlaps as well as the following conditions:
6806  \*\* Master thought this region opened, but no regionserver reported it.
6807  \*\* Master thought this region opened on Server1, but regionserver reported Server2
6808  \*\* More than one regionservers reported opened this region
6809  Both chores can be triggered from the shell to regenerate ‘new’ reports.
6810  \* Means of scheduling a ServerCrashProcedure (HBASE-21393).
6811  \* An ‘offline’ hbase:meta rebuild (HBASE-22680).
6812  \* Offline replace of hbase.version and hbase.id
6813  \* Documentation on how to use completebulkload tool to ‘adopt’ orphaned data found by new HBCK2 ‘filesystem’ check (see below) and ‘HBCK chore’ (HBASE-22859)
6814  \* A ‘holes’ and ‘overlaps’ fix that runs in the master that uses new bulk-merge facility to collapse many overlaps in the one go.
6815  \* hbase-operator-tools HBCK2 client tool got a bunch of additions:
6816  \*\* A specialized 'fix' for the case where operators ran old hbck 'offlinemeta' repair and destroyed their hbase:meta; it ties together holes in meta with orphaned data in the fs (HBASE-22567)
6817  \*\* A ‘filesystem’ command that reports on orphan data as well as bad references and hlinks with a ‘fix’ for the latter two options (based on hbck1 facility updated).
6818  \*\* Adds back the ‘replication’ fix facility from hbck1 (HBASE-22717)
6819
6820 The compound result is that hbck2 is now in excess of hbck1 abilities. The provided functionality is disaggregated as per the hbck2 philosophy of providing 'plumbing' rather than 'porcelain' so there is work to do still adding fix-it playbooks, scripting across outages, and automation.
6821
6822
6823 ---
6824
6825 * [HBASE-22802](https://issues.apache.org/jira/browse/HBASE-22802) | *Major* | **Avoid temp ByteBuffer allocation in FileIOEngine#read**
6826
6827 HBASE-21879 introduces a utility class (org.apache.hadoop.hbase.io.ByteBuffAllocator) used for allocating/freeing ByteBuffers from/to NIO ByteBuffer pool, when BucketCache enabled with file or mmap engine, we will use this ByteBuffer pool to avoid temp ByteBuffer allocation a lot.
6828
6829
6830 ---
6831
6832 * [HBASE-11062](https://issues.apache.org/jira/browse/HBASE-11062) | *Major* | **hbtop**
6833
6834 Introduces hbtop that's a real-time monitoring tool for HBase like Unix's top command. See the ref guide for the details: https://hbase.apache.org/book.html#hbtop
6835
6836
6837 ---
6838
6839 * [HBASE-21879](https://issues.apache.org/jira/browse/HBASE-21879) | *Major* | **Read HFile's block to ByteBuffer directly instead of to byte for reducing young gc purpose**
6840
6841 Before this issue, read path was 100% offheap when block is in the BucketCache. But if a cache miss, then the RS needs to read the block via an on-heap API which causes high young-GC pressure.
6842
6843 This issue adds reading the block via offheap even if reading the block from filesystem directly.  It requires hadoop version(\>=2.9.3) but can also work with older hadoop versions (all works but we continue to read block onheap). It also requires HBASE-21946 which is not yet in place as of this writing/hbase-2.3.0.
6844
6845 We have written a careful doc about the implementation, performance and practice here: https://docs.google.com/document/d/1xSy9axGxafoH-Qc17zbD2Bd--rWjjI00xTWQZ8ZwI\_E/edit#heading=h.nch5d72p27ex
6846
6847
6848 ---
6849
6850 * [HBASE-22618](https://issues.apache.org/jira/browse/HBASE-22618) | *Major* | **added the possibility to load custom cost functions**
6851
6852 <!-- markdown -->
6853 Extends `StochasticLoadBalancer` to support user-provided cost function. These are loaded in addition to the default set of cost functions. Custom function implementations must extend `StochasticLoadBalancer$CostFunction`. Enable any additional functions by placing them on the master class path and configuring `hbase.master.balancer.stochastic.additionalCostFunctions` with a comma-separated list of fully-qualified class names.
6854
6855
6856 ---
6857
6858 * [HBASE-22867](https://issues.apache.org/jira/browse/HBASE-22867) | *Critical* | **The ForkJoinPool in CleanerChore will spawn thousands of threads in our cluster with thousands table**
6859
6860 Replace the ForkJoinPool in CleanerChore by ThreadPoolExecutor which can limit the spawn thread size and avoid  the master GC frequently.  The replacement is an internal implementation in CleanerChore,  so no config key change, the upstream users can just upgrade the hbase master without any other change.
6861
6862
6863 ---
6864
6865 * [HBASE-22810](https://issues.apache.org/jira/browse/HBASE-22810) | *Major* | **Initialize an separate ThreadPoolExecutor for taking/restoring snapshot**
6866
6867 Introduced a new config key for the snapshot taking/restoring operations at master side:  hbase.master.executor.snapshot.threads, its default value is 3.  means we can have 3 snapshot operations running at the same time.
6868
6869
6870 ---
6871
6872 * [HBASE-22863](https://issues.apache.org/jira/browse/HBASE-22863) | *Major* | **Avoid Jackson versions and dependencies with known CVEs**
6873
6874 1. Stopped exposing vulnerable Jackson1 dependencies so that downstreamers would not pull it in from HBase.
6875 2. However, since Hadoop requires some Jackson1 dependencies, put vulnerable Jackson mapper at test scope in some HBase modules and hence, HBase tarball created by hbase-assembly contains Jackson1 mapper jar in lib. Still, downsteam applications can't pull in Jackson1 from HBase.
6876
6877
6878 ---
6879
6880 * [HBASE-22841](https://issues.apache.org/jira/browse/HBASE-22841) | *Major* | **TimeRange's factory functions do not support ranges, only \`allTime\` and \`at\`**
6881
6882 Add serveral API in TimeRange class for avoiding using the deprecated TimeRange constructor:
6883 \* TimeRange#from: Represents the time interval [minStamp, Long.MAX\_VALUE)
6884 \* TimeRange#until: Represents the time interval [0, maxStamp)
6885 \* TimeRange#between: Represents the time interval [minStamp, maxStamp)
6886
6887
6888 ---
6889
6890 * [HBASE-22833](https://issues.apache.org/jira/browse/HBASE-22833) | *Minor* | **MultiRowRangeFilter should provide a method for creating a filter which is functionally equivalent to multiple prefix filters**
6891
6892 Provide a public method in MultiRowRangeFilter class to speed the requirement of filtering with multiple row prefixes, it will expand the row prefixes as multiple rowkey ranges by MultiRowRangeFilter, it's more efficient.
6893 {code}
6894 public MultiRowRangeFilter(byte[][] rowKeyPrefixes);
6895 {code}
6896
6897
6898 ---
6899
6900 * [HBASE-22856](https://issues.apache.org/jira/browse/HBASE-22856) | *Major* | **HBASE-Find-Flaky-Tests fails with pip error**
6901
6902 Update the base docker image to ubuntu 18.04 for the find flaky tests jenkins job.
6903
6904
6905 ---
6906
6907 * [HBASE-22771](https://issues.apache.org/jira/browse/HBASE-22771) | *Major* | **[HBCK2] fixMeta method and server-side support**
6908
6909 Adds a fixMeta method to hbck Service. Fixes holes in hbase:meta. Follow-up to fix overlaps. See HBASE-22567 also.
6910
6911 Follow-on is adding a client-side to hbase-operator-tools that can exploit this new addition (HBASE-22825)
6912
6913
6914 ---
6915
6916 * [HBASE-22777](https://issues.apache.org/jira/browse/HBASE-22777) | *Major* | **Add a multi-region merge (for fixing overlaps, etc.)**
6917
6918 Changes merge so you can merge more than two regions at a time.  Currently only available inside HBase. HBASE-22827, a follow-on, is about exposing the facility in the Admin API (and then via the shell).
6919
6920
6921 ---
6922
6923 * [HBASE-15666](https://issues.apache.org/jira/browse/HBASE-15666) | *Critical* | **shaded dependencies for hbase-testing-util**
6924
6925 New shaded artifact for testing: hbase-shaded-testing-util.
6926
6927
6928 ---
6929
6930 * [HBASE-22776](https://issues.apache.org/jira/browse/HBASE-22776) | *Major* | **Rename config names in user scan snapshot feature**
6931
6932 After HBASE-22776, the steps to config user scan snapshot feature is as followings:
6933 1. Check HDFS configuration
6934 2. Add master coprocessor:
6935     hbase.coprocessor.master.classes=
6936     “org.apache.hadoop.hbase.security.access.AccessController,
6937 org.apache.hadoop.hbase.security.access.SnapshotScannerHDFSAclController”
6938 3. Enable this feature:
6939     hbase.acl.sync.to.hdfs.enable=true
6940 4. Modify table scheme to enable this feature for a table:
6941     alter 't1', CONFIGURATION =\> {'hbase.acl.sync.to.hdfs.enable' =\> 'true'}
6942
6943
6944 ---
6945
6946 * [HBASE-22539](https://issues.apache.org/jira/browse/HBASE-22539) | *Blocker* | **WAL corruption due to early DBBs re-use when Durability.ASYNC\_WAL is used**
6947
6948 We found a critical bug which can lead to WAL corruption when Durability.ASYNC\_WAL is used. The reason is that we release a ByteBuffer before actually persist the content into WAL file.
6949
6950 The problem maybe lead to several errors, for example, ArrayIndexOfOutBounds when replaying WAL. This is because that the ByteBuffer is reused by others.
6951
6952 ERROR org.apache.hadoop.hbase.executor.EventHandler: Caught throwable while processing event RS\_LOG\_REPLAY
6953 java.lang.ArrayIndexOutOfBoundsException: 18056
6954         at org.apache.hadoop.hbase.KeyValue.getFamilyLength(KeyValue.java:1365)
6955         at org.apache.hadoop.hbase.KeyValue.getFamilyLength(KeyValue.java:1358)
6956         at org.apache.hadoop.hbase.PrivateCellUtil.matchingFamily(PrivateCellUtil.java:735)
6957         at org.apache.hadoop.hbase.CellUtil.matchingFamily(CellUtil.java:816)
6958         at org.apache.hadoop.hbase.wal.WALEdit.isMetaEditFamily(WALEdit.java:143)
6959         at org.apache.hadoop.hbase.wal.WALEdit.isMetaEdit(WALEdit.java:148)
6960         at org.apache.hadoop.hbase.wal.WALSplitter.splitLogFile(WALSplitter.java:297)
6961         at org.apache.hadoop.hbase.wal.WALSplitter.splitLogFile(WALSplitter.java:195)
6962         at org.apache.hadoop.hbase.regionserver.SplitLogWorker$1.exec(SplitLogWorker.java:100)
6963
6964 And may even cause segmentation fault and crash the JVM directly. You will see a hs\_err\_pidXXX.log file and usually the problem is SIGSEGV. This is usually because that the ByteBuffer has already been returned to the OS and used for other purpose.
6965
6966 The problem has been reported several times in the past and this time Wellington Ramos Chevreuil provided the full logs and deeply analyzed the logs so we can find the root cause. And Lijin Bin figured out that the problem may only happen when Durability.ASYNC\_WAL is used. Thanks to them.
6967
6968 The problem only effects the 2.x releases, all users are highly recommand to upgrade to a release which has this fix in, especially that if you use Durability.ASYNC\_WAL.
6969
6970
6971 ---
6972
6973 * [HBASE-22737](https://issues.apache.org/jira/browse/HBASE-22737) | *Major* | **Add a new admin method and shell cmd to trigger the hbck chore to run**
6974
6975 Add a new method runHbckChore in Hbck interface and a new shell cmd hbck\_chore\_run to request HBCK chore to run at master side.
6976
6977
6978 ---
6979
6980 * [HBASE-22741](https://issues.apache.org/jira/browse/HBASE-22741) | *Major* | **Show catalogjanitor consistency complaints in new 'HBCK Report' page**
6981
6982 Adds a "CatalogJanitor hbase:meta Consistency Issues" section to the new 'HBCK Report' page added by HBASE-22709. This section is empty unless the most recent CatalogJanitor scan turned up problems. If so, will show table of issues found.
6983
6984
6985 ---
6986
6987 * [HBASE-22723](https://issues.apache.org/jira/browse/HBASE-22723) | *Major* | **Have CatalogJanitor report holes and overlaps; i.e. problems it sees when doing its regular scan of hbase:meta**
6988
6989 When CatalogJanitor runs, it now checks for holes, overlaps, empty info:regioninfo columns and bad servers. Dumps findings into log. Follow-up adds report to new 'HBCK Report' linked off the Master UI.
6990
6991 NOTE: All features but the badserver check made it into branch-2.1 and branch-2.0 backports.
6992
6993
6994 ---
6995
6996 * [HBASE-22714](https://issues.apache.org/jira/browse/HBASE-22714) | *Trivial* | **BuffferedMutatorParams opertationTimeOut() is misspelt**
6997
6998 The misspelled BufferedMutatorParams.opertationTimeout method has been marked as deprecated, and will be removed in 4.0.0. Please use the BufferedMutatorParams.operationTimeout method instead.
6999
7000
7001 ---
7002
7003 * [HBASE-22580](https://issues.apache.org/jira/browse/HBASE-22580) | *Major* | **Add a table attribute to make user scan snapshot feature configurable for table**
7004
7005 If a table user scan snapshots of the table, please config the following table scheme attribute to make granted users' ACLs are added to hfiles:
7006 alter 't1', CONFIGURATION =\> {'hbase.user.scan.snapshot.enable' =\> 'true'}
7007
7008
7009 ---
7010
7011 * [HBASE-22709](https://issues.apache.org/jira/browse/HBASE-22709) | *Major* | **Add a chore thread in master to do hbck checking and display results in 'HBCK Report' page**
7012
7013 1. Add a new chore thread in master to do hbck checking
7014 2. Add a new web ui "HBCK Report" page to display checking results.
7015
7016 This feature is enabled by default. And the hbck chore run per 60 minutes by default. You can config "hbase.master.hbck.checker.interval" to a value lesser than or equal to 0 for disabling the chore.
7017
7018 Notice: the config "hbase.master.hbck.checker.interval" was renamed to "hbase.master.hbck.chore.interval" in HBASE-22737.
7019
7020
7021 ---
7022
7023 * [HBASE-22578](https://issues.apache.org/jira/browse/HBASE-22578) | *Major* | **HFileCleaner should not delete empty ns/table directories used for user san snapshot feature**
7024
7025 The HFileCleaner will clean the empty directories under archive, but if enable user scan snaphot feature, the user ACLs are set at there directories, so please config the following cleaner to make the directories with user ACLs not be cleaned:
7026 hbase.master.hfilecleaner.plugins=org.apache.hadoop.hbase.security.access.SnapshotScannerHDFSAclCleaner
7027
7028
7029 ---
7030
7031 * [HBASE-22722](https://issues.apache.org/jira/browse/HBASE-22722) | *Blocker* | **Upgrade jackson databind dependencies to 2.9.9.1**
7032
7033 Upgrade jackson databind dependency to 2.9.9.1 due to CVEs
7034
7035 https://nvd.nist.gov/vuln/detail/CVE-2019-12814
7036
7037 https://nvd.nist.gov/vuln/detail/CVE-2019-12384
7038
7039
7040 ---
7041
7042 * [HBASE-22527](https://issues.apache.org/jira/browse/HBASE-22527) | *Major* | **[hbck2] Add a master web ui to show the problematic regions**
7043
7044 Add a new master web UI to show the potentially problematic opened regions. There are three case:
7045 1. Master thought this region opened, but no regionserver reported it.
7046 2. Master thought this region opened on Server1, but regionserver reported Server2
7047 3. More than one regionservers reported opened this region
7048
7049
7050 ---
7051
7052 * [HBASE-22648](https://issues.apache.org/jira/browse/HBASE-22648) | *Minor* | **Snapshot TTL**
7053
7054 Feature: Take a Snapshot With TTL for auto-cleanup
7055
7056 Attribute:
7057 1. TTL
7058      - Specify TTL in sec while creating snapshot. e.g. snapshot 'mytable', 'snapshot1234', {TTL =\> 86400}  (snapshot to be auto-cleaned after 24 hr)
7059
7060 Configs:
7061 1. Default Snapshot TTL:
7062      - FOREVER by default
7063      - User specified Default TTL(sec) with config: hbase.master.snapshot.ttl
7064
7065 2. If Snapshot cleanup is supposed to be stopped due to some snapshot restore activity, disable it with config:
7066      - hbase.master.cleaner.snapshot.disable: "true"
7067     With this config, HMaster needs restart just like any other hbase-site config.
7068
7069
7070 For more details, see the section "Take a Snapshot With TTL" in the HBase Reference Guide.
7071
7072
7073 ---
7074
7075 * [HBASE-22610](https://issues.apache.org/jira/browse/HBASE-22610) | *Trivial* | **[BucketCache] Rename "hbase.offheapcache.minblocksize"**
7076
7077 The config point "hbase.offheapcache.minblocksize" was wrong and is now deprecated. The new config point is "hbase.blockcache.minblocksize".
7078
7079
7080 ---
7081
7082 * [HBASE-22690](https://issues.apache.org/jira/browse/HBASE-22690) | *Major* | **Deprecate / Remove OfflineMetaRepair in hbase-2+**
7083
7084 OfflineMetaRepair is no longer supported in HBase-2+. Please refer to https://hbase.apache.org/book.html#HBCK2
7085
7086 This tool is deprecated in 2.x and will be removed in 3.0.
7087
7088
7089 ---
7090
7091 * [HBASE-22673](https://issues.apache.org/jira/browse/HBASE-22673) | *Major* | **Avoid to expose protobuf stuff in Hbck interface**
7092
7093 Mark the Hbck#scheduleServerCrashProcedure(List\<HBaseProtos.ServerName\> serverNames) as deprecated. Use Hbck#scheduleServerCrashProcedures(List\<ServerName\> serverNames) instead.
7094
7095
7096 ---
7097
7098 * [HBASE-22617](https://issues.apache.org/jira/browse/HBASE-22617) | *Blocker* | **Recovered WAL directories not getting cleaned up**
7099
7100 In HBASE-20734 we moved the recovered.edits onto the wal file system but when constructing the directory we missed the BASE\_NAMESPACE\_DIR('data'). So when using the default config, you will find that there are lots of new directories at the same level with the 'data' directory.
7101
7102 In this issue, we add the BASE\_NAMESPACE\_DIR back, and also try our best to clean up the wrong directories. But we can only clean up the region level directories, so if you want a clean fs layout on HDFS you still need to manually delete the empty directories at the same level with 'data'.
7103
7104 The effect versions are 2.2.0, 2.1.[1-5], 1.4.[8-10], 1.3.[3-5].
7105
7106
7107 ---
7108
7109 * [HBASE-21995](https://issues.apache.org/jira/browse/HBASE-21995) | *Major* | **Add a coprocessor to set HDFS ACL for hbase granted user**
7110
7111 Add a coprocessor to set HDFS acls to make hbase granted users with READ permission have the access to scan snapshots.
7112 To use this feature, please make sure the HDFS config is set:
7113 dfs.namenode.acls.enabled=true
7114 fs.permissions.umask-mode=027
7115
7116 and set the HBase config:
7117 hbase.coprocessor.master.classes="org.apache.hadoop.hbase.security.access.AccessController,org.apache.hadoop.hbase.security.access.SnapshotScannerHDFSAclController"
7118 hbase.user.scan.snapshot.enable=true
7119
7120
7121 ---
7122
7123 * [HBASE-22596](https://issues.apache.org/jira/browse/HBASE-22596) | *Minor* | **[Chore] Separate the execution period between CompactionChecker and PeriodicMemStoreFlusher**
7124
7125 hbase.regionserver.compaction.check.period is used for controlling how often the compaction checker runs. If unset, will use hbase.server.thread.wakefrequency as default value.
7126
7127 hbase.regionserver.flush.check.period is used for controlling how ofter the flush checker runs. If unset, will use hbase.server.thread.wakefrequency as default value.
7128
7129
7130 ---
7131
7132 * [HBASE-22588](https://issues.apache.org/jira/browse/HBASE-22588) | *Major* | **Upgrade jaxws-ri dependency to 2.3.2**
7133
7134 <!-- markdown -->
7135
7136 When run with JDK11 HBase now uses more recent version of the jaxws reference implementation (v2.3.2).
7137
7138
7139 ---
7140
7141 * [HBASE-21536](https://issues.apache.org/jira/browse/HBASE-21536) | *Trivial* | **Fix completebulkload usage instructions**
7142
7143 Added completebulkload short name for BulkLoadHFilesTool to bin/hbase.
7144
7145
7146 ---
7147
7148 * [HBASE-22500](https://issues.apache.org/jira/browse/HBASE-22500) | *Blocker* | **Modify pom and jenkins jobs for hadoop versions**
7149
7150 Change the default hadoop-3 version to 3.1.2. Drop the support for the releases which are effected by CVE-2018-8029, see this email https://lists.apache.org/thread.html/3d6831c3893cd27b6850aea2feff7d536888286d588e703c6ffd2e82@%3Cuser.hadoop.apache.org%3E
7151
7152
7153 ---
7154
7155 * [HBASE-22459](https://issues.apache.org/jira/browse/HBASE-22459) | *Minor* | **Expose store reader reference count**
7156
7157 This change exposes the aggregate count of store reader references for a given store as 'storeRefCount' in region metrics and ClusterStatus.
7158
7159
7160 ---
7161
7162 * [HBASE-22469](https://issues.apache.org/jira/browse/HBASE-22469) | *Minor* | **replace md5 checksum in saveVersion script with sha512 for hbase version information**
7163
7164 The HBase "source checksum" now uses SHA512 instead of MD5.
7165
7166
7167 ---
7168
7169 * [HBASE-22148](https://issues.apache.org/jira/browse/HBASE-22148) | *Blocker* | **Provide an alternative to CellUtil.setTimestamp**
7170
7171 <!-- markdown -->
7172
7173 The `CellUtil.setTimestamp` method changes to be an API with audience `LimitedPrivate(COPROC)` in HBase 3.0. With that designation the API should remain stable within a given minor release line, but may change between minor releases.
7174
7175 Previously, this method was deprecated in HBase 2.0 for removal in HBase 3.0. Deprecation messages in HBase 2.y releases have been updated to indicate the expected API audience change.
7176
7177
7178 ---
7179
7180 * [HBASE-20782](https://issues.apache.org/jira/browse/HBASE-20782) | *Minor* | **Fix duplication of TestServletFilter.access**
7181
7182 The access method was used to the HttpServerFunctionalTest class as a common place.
7183
7184
7185 ---
7186
7187 * [HBASE-21991](https://issues.apache.org/jira/browse/HBASE-21991) | *Major* | **Fix MetaMetrics issues - [Race condition, Faulty remove logic], few improvements**
7188
7189 The class LossyCounting was unintentionally marked Public but was never intended to be part of our public API. This oversight has been corrected and LossyCounting is now marked as Private and going forward may be subject to additional breaking changes or removal without notice. If you have taken a dependency on this class we recommend cloning it locally into your project before upgrading to this release.
7190
7191
7192 ---
7193
7194 * [HBASE-22226](https://issues.apache.org/jira/browse/HBASE-22226) | *Trivial* | **Incorrect level for headings in asciidoc**
7195
7196 Warnings for level headings are corrected in the book for the HBase Incompatibilities section.
7197
7198
7199 ---
7200
7201 * [HBASE-20970](https://issues.apache.org/jira/browse/HBASE-20970) | *Major* | **Update hadoop check versions for hadoop3 in hbase-personality**
7202
7203 Add hadoop 3.0.3, 3.1.1 3.1.2 in our hadoop check jobs.
7204
7205
7206 ---
7207
7208 * [HBASE-21784](https://issues.apache.org/jira/browse/HBASE-21784) | *Major* | **Dump replication queue should show list of wal files ordered chronologically**
7209
7210 The DumpReplicationQueues tool will now list replication queues sorted in chronological order.
7211
7212
7213 ---
7214
7215 * [HBASE-21048](https://issues.apache.org/jira/browse/HBASE-21048) | *Major* | **Get LogLevel is not working from console in secure environment**
7216
7217 Support get\|set LogLevel in secure(kerberized) environment.
7218
7219
7220 ---
7221
7222 * [HBASE-22384](https://issues.apache.org/jira/browse/HBASE-22384) | *Minor* | **Formatting issues in administration section of book**
7223
7224 Fixes a formatting issue in the administration section of the book, where listing indentation were a little bit off.
7225
7226
7227 ---
7228
7229 * [HBASE-22377](https://issues.apache.org/jira/browse/HBASE-22377) | *Major* | **Provide API to check the existence of a namespace which does not require ADMIN permissions**
7230
7231 This change adds the new method listNamespaces to the Admin interface, which can be used to retrieve a list of the namespaces present in the schema as an unprivileged operation. Formerly the only available method for accomplishing this was listNamespaceDescriptors, which requires GLOBAL CREATE or ADMIN permissions.
7232
7233
7234 ---
7235
7236 * [HBASE-22399](https://issues.apache.org/jira/browse/HBASE-22399) | *Major* | **Change default hadoop-two.version to 2.8.x and remove the 2.7.x hadoop checks**
7237
7238 Now the default hadoop-two.version has been changed to 2.8.5, and all hadoop versions before 2.8.2(exclude) will not be supported any more.
7239
7240
7241 ---
7242
7243 * [HBASE-22392](https://issues.apache.org/jira/browse/HBASE-22392) | *Trivial* | **Remove extra/useless +**
7244
7245 Removed extra + in HRegion, HStore and LoadIncrementalHFiles for branch-2 and HRegion and HStore for branch-1.
7246
7247
7248 ---
7249
7250 * [HBASE-20494](https://issues.apache.org/jira/browse/HBASE-20494) | *Major* | **Upgrade com.yammer.metrics dependency**
7251
7252 Updated metrics core from 3.2.1 to 3.2.6.
7253
7254
7255 ---
7256
7257 * [HBASE-22358](https://issues.apache.org/jira/browse/HBASE-22358) | *Minor* | **Change rubocop configuration for method length**
7258
7259 The rubocop definition for the maximum method length was set to 75.
7260
7261
7262 ---
7263
7264 * [HBASE-22379](https://issues.apache.org/jira/browse/HBASE-22379) | *Minor* | **Fix Markdown for "Voting on Release Candidates" in book**
7265
7266 Fixes the formatting of the "Voting on Release Candidates" to actually show the quote and code formatting of the RAT check.
7267
7268
7269 ---
7270
7271 * [HBASE-20851](https://issues.apache.org/jira/browse/HBASE-20851) | *Minor* | **Change rubocop config for max line length of 100**
7272
7273 The rubocop configuration in the hbase-shell module now allows a line length with 100 characters, instead of 80 as before. For everything before 2.1.5 this change introduces rubocop itself.
7274
7275
7276 ---
7277
7278 * [HBASE-22301](https://issues.apache.org/jira/browse/HBASE-22301) | *Minor* | **Consider rolling the WAL if the HDFS write pipeline is slow**
7279
7280 This change adds new conditions for rolling the WAL for when syncs on the HDFS writer pipeline are perceived to be slow.
7281
7282 As before the configuration parameter hbase.regionserver.wal.slowsync.ms sets the slow sync warning threshold.
7283
7284 If we encounter hbase.regionserver.wal.slowsync.roll.threshold number of slow syncs (default 100) within the interval defined by hbase.regionserver.wal.slowsync.roll.interval.ms (default 1 minute), we will request a WAL roll.
7285
7286 Or, if the time for any sync exceeds the threshold set by hbase.regionserver.wal.roll.on.sync.ms (default 10 seconds) we will request a WAL roll immediately.
7287
7288 Operators can monitor how often these new thresholds result in a WAL roll by looking at newly added metrics to the WAL related metric group:
7289 \* slowSyncRollRequest - How many times a roll was requested due to sync too slow on the write pipeline.
7290
7291 Additionally, as a part of this change there are also additional metrics for existing reasons for a WAL roll:
7292 \* errorRollRequest - How many times a roll was requested due to I/O or other errors.
7293 \* sizeRollRequest - How many times a roll was requested due to file size roll threshold.
7294
7295
7296 ---
7297
7298 * [HBASE-21883](https://issues.apache.org/jira/browse/HBASE-21883) | *Minor* | **Enhancements to Major Compaction tool**
7299
7300 MajorCompactorTTL Tool allows to compact all regions in a table that have been TTLed out. This saves space on DFS and is useful for tables which are similar to time series data. This is typically scheduled to run frequently (say via cron) to cleanup old data on an ongoing basis.
7301
7302 RSGroupMajorCompactionTTL tool is similar to MajorCompactorTTL but runs at a region server group level. If multiple tables in an rsgroup are similar to time-series data, then it runs a single command to clean them up. As more tables are added/removed from rsgroup, it's easy to have a single command to take care of all of them.
7303
7304
7305 ---
7306
7307 * [HBASE-22054](https://issues.apache.org/jira/browse/HBASE-22054) | *Minor* | **Space Quota: Compaction is not working for super user in case of NO\_WRITES\_COMPACTIONS**
7308
7309 This change allows the system and superusers to initiate compactions, even when a space quota violation policy disallows compactions from happening. The original intent behind disallowing of compactions was to prevent end-user compactions from creating undue I/O load, not disallowing \*any\* compaction in the system.
7310
7311
7312 ---
7313
7314 * [HBASE-22083](https://issues.apache.org/jira/browse/HBASE-22083) | *Minor* | **move eclipse specific configs into a profile**
7315
7316 <!-- markdown -->
7317 Maven project integration for Eclipse has been isolated into a maven profile to ensure it only is active when in an Eclipse project.
7318
7319 Things should continue to behave the same for Eclipse users. If something should go wrong folks should manually activate the `eclipse-specific` profile.
7320
7321
7322 ---
7323
7324 * [HBASE-22307](https://issues.apache.org/jira/browse/HBASE-22307) | *Major* | **Deprecated Preemptive Fail Fast**
7325
7326 Deprecated Preemptive Fail Fast related constants in HConstants, the support of this feature will be removed in 3.0.0 so use these constants will have no effect for 3.0.0+ releases. And the constants will be kept till 4.0.0.
7327
7328 Users can use 'hbase.client.perserver.requests.threshold' to control the number of concurrent requests to the same region server. Please see the release note of HBASE-16388 for more details.
7329
7330
7331 ---
7332
7333 * [HBASE-22292](https://issues.apache.org/jira/browse/HBASE-22292) | *Blocker* | **PreemptiveFastFailInterceptor clean repeatedFailuresMap issue**
7334
7335 Adds new configuration hbase.client.failure.map.cleanup.interval which defaults to ten minutes.
7336
7337
7338 ---
7339
7340 * [HBASE-19222](https://issues.apache.org/jira/browse/HBASE-19222) | *Major* | **update jruby to 9.1.17.0**
7341
7342 <!-- markdown -->
7343
7344 The default version of JRuby shipped with HBase has been updated to the JRuby 9.1.17.0 release.
7345
7346 For details on changes see [the release notes for JRuby 9.1.17.0](https://www.jruby.org/2018/04/23/jruby-9-1-17-0)
7347
7348
7349 ---
7350
7351 * [HBASE-22279](https://issues.apache.org/jira/browse/HBASE-22279) | *Major* | **Add a getRegionLocator method in Table/AsyncTable interface**
7352
7353 Add below method in Table interface:
7354
7355 RegionLocator getRegionLocator() throws IOException;
7356
7357 Add below methods in AsyncTable interface:
7358
7359 AsyncTableRegionLocator getRegionLocator();
7360 CompletableFuture\<TableDescriptor\> getDescriptor();
7361
7362
7363 ---
7364
7365 * [HBASE-15560](https://issues.apache.org/jira/browse/HBASE-15560) | *Major* | **TinyLFU-based BlockCache**
7366
7367 LruBlockCache uses the Segmented LRU (SLRU) policy to capture frequency and recency of the working set. It achieves concurrency by using an O(n) background thread to prioritize the entries and evict. Accessing an entry is O(1) by a hash table lookup, recording its logical access time, and setting a frequency flag. A write is performed in O(1) time by updating the hash table and triggering an async eviction thread. This provides ideal concurrency and minimizes the latencies by penalizing the thread instead of the caller. However the policy does not age the frequencies and may not be resilient to various workload patterns.
7368
7369 This change introduces a new L1 policy, TinyLfuBlockCache, which records the frequency in a counting sketch, ages periodically by halving the counters, and orders entries by SLRU. An entry is discarded by comparing the frequency of the new arrival to the SLRU's victim, and keeping the one with the highest frequency. This allows the operations to be performed in O(1) time and, though the use of a compact sketch, a much larger history is retained beyond the current working set. In a variety of real world traces the policy had near optimal hit rates.
7370
7371 New configuration variable hfile.block.cache.policy sets the eviction policy for the L1 block cache. The default is "LRU" (LruBlockCache). Set to "TinyLFU" to use TinyLfuBlockCache instead.
7372
7373
7374 ---
7375
7376 * [HBASE-22178](https://issues.apache.org/jira/browse/HBASE-22178) | *Major* | **Introduce a createTableAsync with TableDescriptor method in Admin**
7377
7378 Introduced
7379
7380 Future\<Void\> createTableAsync(TableDescriptor);
7381
7382
7383 ---
7384
7385 * [HBASE-22108](https://issues.apache.org/jira/browse/HBASE-22108) | *Major* | **Avoid passing null in Admin methods**
7386
7387 Introduced these methods:
7388 void move(byte[]);
7389 void move(byte[], ServerName);
7390 Future\<Void\> splitRegionAsync(byte[]);
7391
7392 These methods are deprecated:
7393 void move(byte[], byte[])
7394
7395
7396 ---
7397
7398 * [HBASE-22152](https://issues.apache.org/jira/browse/HBASE-22152) | *Major* | **Create a jenkins file for yetus to processing GitHub PR**
7399
7400 Add a new jenkins file for running pre commit check for GitHub PR.
7401
7402
7403 ---
7404
7405 * [HBASE-22007](https://issues.apache.org/jira/browse/HBASE-22007) | *Major* | **Add restoreSnapshot and cloneSnapshot with acl methods in AsyncAdmin**
7406
7407 Add cloneSnapshot/restoreSnapshot with acl methods in AsyncAdmin.
7408
7409
7410 ---
7411
7412 * [HBASE-22123](https://issues.apache.org/jira/browse/HBASE-22123) | *Minor* | **REST gateway reports Insufficient permissions exceptions as 404 Not Found**
7413
7414 When insufficient permissions, you now get:
7415
7416 HTTP/1.1 403 Forbidden
7417
7418 on the HTTP side, and in the message
7419
7420 Forbidden
7421 org.apache.hadoop.hbase.security.AccessDeniedException: org.apache.hadoop.hbase.security.AccessDeniedException: Insufficient permissions for user ‘myuser',action: get, tableName:mytable, family:cf.
7422 at org.apache.ranger.authorization.hbase.RangerAuthorizationCoprocessor.authorizeAccess(RangerAuthorizationCoprocessor.java:547)
7423 and the rest of the ADE stack
7424
7425
7426 ---
7427
7428 * [HBASE-22100](https://issues.apache.org/jira/browse/HBASE-22100) | *Minor* | **False positive for error prone warnings in pre commit job**
7429
7430 Now we will sort the javac WARNING/ERROR before generating diff in pre-commit so we can get a stable output for the error prone. The downside is that we just sort the output lexicographically so the line number will also be sorted lexicographically, which is a bit strange to human.
7431
7432
7433 ---
7434
7435 * [HBASE-22057](https://issues.apache.org/jira/browse/HBASE-22057) | *Major* | **Impose upper-bound on size of ZK ops sent in a single multi()**
7436
7437 Exposes a new configuration property "zookeeper.multi.max.size" which dictates the maximum size of deletes that HBase will make to ZooKeeper in a single RPC. This property defaults to 1MB, which should fall beneath the default ZooKeeper limit of 2MB, controlled by "jute.maxbuffer".
7438
7439
7440 ---
7441
7442 * [HBASE-22052](https://issues.apache.org/jira/browse/HBASE-22052) | *Major* | **pom cleaning; filter out jersey-core in hadoop2 to match hadoop3 and remove redunant version specifications**
7443
7444 <!-- markdown -->
7445 Fixed awkward dependency issue that prevented site building.
7446
7447 #### note specific to HBase 2.1.4
7448 HBase 2.1.4 shipped with an early version of this fix that incorrectly altered the libraries included in our binary assembly for using Apache Hadoop 2.7 (the current build default Hadoop version for 2.1.z). For folks running out of the box against a Hadoop 2.7 cluster (or folks who skip the installation step of [replacing the bundled Hadoop libraries](http://hbase.apache.org/book.html#hadoop)) this will result in a failure at Region Server startup due to a missing class definition. e.g.:
7449 ```
7450 2019-03-27 09:02:05,779 ERROR [main] regionserver.HRegionServer: Failed construction RegionServer
7451 java.lang.NoClassDefFoundError: org/apache/htrace/SamplerBuilder
7452         at org.apache.hadoop.hdfs.DFSClient.<init>(DFSClient.java:644)
7453         at org.apache.hadoop.hdfs.DFSClient.<init>(DFSClient.java:628)
7454         at org.apache.hadoop.hdfs.DistributedFileSystem.initialize(DistributedFileSystem.java:149)
7455         at org.apache.hadoop.fs.FileSystem.createFileSystem(FileSystem.java:2667)
7456         at org.apache.hadoop.fs.FileSystem.access$200(FileSystem.java:93)
7457         at org.apache.hadoop.fs.FileSystem$Cache.getInternal(FileSystem.java:2701)
7458         at org.apache.hadoop.fs.FileSystem$Cache.get(FileSystem.java:2683)
7459         at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:372)
7460         at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:171)
7461         at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:356)
7462         at org.apache.hadoop.fs.Path.getFileSystem(Path.java:295)
7463         at org.apache.hadoop.hbase.util.CommonFSUtils.getRootDir(CommonFSUtils.java:362)
7464         at org.apache.hadoop.hbase.util.CommonFSUtils.isValidWALRootDir(CommonFSUtils.java:411)
7465         at org.apache.hadoop.hbase.util.CommonFSUtils.getWALRootDir(CommonFSUtils.java:387)
7466         at org.apache.hadoop.hbase.regionserver.HRegionServer.initializeFileSystem(HRegionServer.java:704)
7467         at org.apache.hadoop.hbase.regionserver.HRegionServer.<init>(HRegionServer.java:613)
7468         at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
7469         at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:62)
7470         at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)
7471         at java.lang.reflect.Constructor.newInstance(Constructor.java:423)
7472         at org.apache.hadoop.hbase.regionserver.HRegionServer.constructRegionServer(HRegionServer.java:3029)
7473         at org.apache.hadoop.hbase.regionserver.HRegionServerCommandLine.start(HRegionServerCommandLine.java:63)
7474         at org.apache.hadoop.hbase.regionserver.HRegionServerCommandLine.run(HRegionServerCommandLine.java:87)
7475         at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
7476         at org.apache.hadoop.hbase.util.ServerCommandLine.doMain(ServerCommandLine.java:149)
7477         at org.apache.hadoop.hbase.regionserver.HRegionServer.main(HRegionServer.java:3047)
7478 Caused by: java.lang.ClassNotFoundException: org.apache.htrace.SamplerBuilder
7479         at java.net.URLClassLoader.findClass(URLClassLoader.java:381)
7480         at java.lang.ClassLoader.loadClass(ClassLoader.java:424)
7481         at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:349)
7482         at java.lang.ClassLoader.loadClass(ClassLoader.java:357)
7483         ... 26 more
7484
7485 ```
7486
7487 Workaround via any _one_ of the following:
7488 * If you are running against a Hadoop cluster that is 2.8+, ensure you replace the Hadoop libaries in the default binary assembly with those for your version.
7489 * If you are running against a Hadoop cluster that is 2.8+, build the binary assembly from the source release while specifying your Hadoop version.
7490 * If you are running against a Hadoop cluster that is a supported 2.7 release, ensure the `hadoop` executable is in the `PATH` seen at Region Server startup and that you are not using the `HBASE_DISABLE_HADOOP_CLASSPATH_LOOKUP` bypass.
7491 * For any supported Hadoop version, manually make the Apache HTrace artifact `htrace-core-3.1.0-incubating.jar` available to all Region Servers via the HBASE_CLASSPATH environment variable.
7492 * For any supported Hadoop version, manually make the Apache HTrace artifact `htrace-core-3.1.0-incubating.jar` available to all Region Servers by copying it into the directory `${HBASE_HOME}/lib/client-facing-thirdparty/`.
7493
7494
7495 ---
7496
7497 * [HBASE-22065](https://issues.apache.org/jira/browse/HBASE-22065) | *Major* | **Add listTableDescriptors(List\<TableName\>) method in AsyncAdmin**
7498
7499 Add a listTableDescriptors(List\<TableName\>) method in the AsyncAdmin interface, to align with the Admin interface.
7500
7501
7502 ---
7503
7504 * [HBASE-22063](https://issues.apache.org/jira/browse/HBASE-22063) | *Major* | **Deprecated Admin.deleteSnapshot(byte[])**
7505
7506 Deprecate Admin.deleteSnapshot(byte[]), please use the String version instead.
7507
7508
7509 ---
7510
7511 * [HBASE-22040](https://issues.apache.org/jira/browse/HBASE-22040) | *Major* | **Add mergeRegionsAsync with a List of region names method in AsyncAdmin**
7512
7513 Add a mergeRegionsAsync(byte[][], boolean) method in the AsyncAdmin interface.
7514
7515 Instead of using assert, now we will throw IllegalArgumentException when you want to merge less than 2 regions at client side. And also, at master side, instead of using assert, now we will throw DoNotRetryIOException if you want merge more than 2 regions, since we only support merging two regions at once for now.
7516
7517
7518 ---
7519
7520 * [HBASE-22039](https://issues.apache.org/jira/browse/HBASE-22039) | *Major* | **Should add the synchronous parameter for the XXXSwitch method in AsyncAdmin**
7521
7522 Add drainXXX parameter for balancerSwitch/splitSwitch/mergeSwitch methods in the AsyncAdmin interface, which has the same meaning with the synchronous parameter for these methods in the Admin interface.
7523
7524
7525 ---
7526
7527 * [HBASE-22044](https://issues.apache.org/jira/browse/HBASE-22044) | *Major* | **ByteBufferUtils should not be IA.Public API**
7528
7529 <!-- markdown -->
7530
7531 As of HBase 3.0, the ByteBufferUtils class is now marked as a Private API for internal project use only. Downstream users are advised that it no longer has any compatibility promises across releases.
7532
7533 As of earlier HBase release lines the class is now marked as deprecated to call attention to this planned transition.
7534
7535
7536 ---
7537
7538 * [HBASE-21810](https://issues.apache.org/jira/browse/HBASE-21810) | *Major* | **bulkload  support set hfile compression on client**
7539
7540 bulkload (HFileOutputFormat2)  support config the compression on client ,you can set the job configuration "hbase.mapreduce.hfileoutputformat.compression"  override the auto-detection of the target table's compression
7541
7542
7543 ---
7544
7545 * [HBASE-22001](https://issues.apache.org/jira/browse/HBASE-22001) | *Major* | **Polish the Admin interface**
7546
7547 Add a cloneSnapshotAsync method with restoreAcl parameter.
7548 Deprecated restoreSnapshotAsync method as it just ignores the failsafe configuration.
7549 Make snapshotAsync method returns a Future\<Void\>.
7550 Deprecated the snapshot related methods which take a 'byte[]' as the snapshot name.
7551 Use default methods to reduce the code base for implementation classes.
7552
7553
7554 ---
7555
7556 * [HBASE-22000](https://issues.apache.org/jira/browse/HBASE-22000) | *Major* | **Deprecated isTableAvailable with splitKeys**
7557
7558 Deprecated AsyncTable.isTableAvailable(TableName, byte[][]).
7559
7560
7561 ---
7562
7563 * [HBASE-21871](https://issues.apache.org/jira/browse/HBASE-21871) | *Major* | **Support to specify a peer table name in VerifyReplication tool**
7564
7565 After HBASE-21871, we can specify a peer table name with --peerTableName in VerifyReplication tool like the following:
7566 hbase org.apache.hadoop.hbase.mapreduce.replication.VerifyReplication --peerTableName=peerTable 5 TestTable
7567
7568 In addition, we can compare any 2 tables in any remote clusters with specifying both peerId and --peerTableName.
7569
7570 For example:
7571 hbase org.apache.hadoop.hbase.mapreduce.replication.VerifyReplication --peerTableName=peerTable zk1,zk2,zk3:2181/hbase TestTable
7572
7573
7574 ---
7575
7576 * [HBASE-15728](https://issues.apache.org/jira/browse/HBASE-15728) | *Major* | **Add remaining per-table region / store / flush / compaction related metrics**
7577
7578 Adds below flush, split, and compaction metrics
7579
7580  +  // split related metrics
7581  +  private MutableFastCounter splitRequest;
7582  +  private MutableFastCounter splitSuccess;
7583  +  private MetricHistogram splitTimeHisto;
7584  +
7585  +  // flush related metrics
7586  +  private MetricHistogram flushTimeHisto;
7587  +  private MetricHistogram flushMemstoreSizeHisto;
7588  +  private MetricHistogram flushOutputSizeHisto;
7589  +  private MutableFastCounter flushedMemstoreBytes;
7590  +  private MutableFastCounter flushedOutputBytes;
7591  +
7592  +  // compaction related metrics
7593  +  private MetricHistogram compactionTimeHisto;
7594  +  private MetricHistogram compactionInputFileCountHisto;
7595  +  private MetricHistogram compactionInputSizeHisto;
7596  +  private MetricHistogram compactionOutputFileCountHisto;
7597  +  private MetricHistogram compactionOutputSizeHisto;
7598  +  private MutableFastCounter compactedInputBytes;
7599  +  private MutableFastCounter compactedOutputBytes;
7600  +
7601  +  private MetricHistogram majorCompactionTimeHisto;
7602  +  private MetricHistogram majorCompactionInputFileCountHisto;
7603  +  private MetricHistogram majorCompactionInputSizeHisto;
7604  +  private MetricHistogram majorCompactionOutputFileCountHisto;
7605  +  private MetricHistogram majorCompactionOutputSizeHisto;
7606  +  private MutableFastCounter majorCompactedInputBytes;
7607  +  private MutableFastCounter majorCompactedOutputBytes;
7608
7609
7610 ---
7611
7612 * [HBASE-21481](https://issues.apache.org/jira/browse/HBASE-21481) | *Major* | **[acl] Superuser's permissions should not be granted or revoked by any non-su global admin**
7613
7614 HBASE-21481 improves the quality of access control, by strengthening the protection of super users's privileges.
7615
7616
7617 ---
7618
7619 * [HBASE-21082](https://issues.apache.org/jira/browse/HBASE-21082) | *Critical* | **Reimplement assign/unassign related procedure metrics**
7620
7621 Now we have four types of RIT procedure metrics, assign, unassign, move, reopen. The meaning of assign/unassign is changed, as we will not increase the unassign metric and then the assign metric when moving a region.
7622 Also introduced two new procedure metrics, open and close, which are used to track the open/close region calls to region server. We may send open/close multiple times to finish a RIT since we may retry multiple times.
7623
7624
7625 ---
7626
7627 * [HBASE-20724](https://issues.apache.org/jira/browse/HBASE-20724) | *Critical* | **Sometimes some compacted storefiles are still opened after region failover**
7628
7629 Problem: This is an old problem since HBASE-2231. The compaction event marker was only writed to WAL. But after flush, the WAL may be archived, which means an useful compaction event marker be deleted, too. So the compacted store files cannot be archived when region open and replay WAL.
7630
7631 Solution: After this jira, the compaction event tracker will be writed to HFile. When region open and load store files, read the compaction evnet tracker from HFile and archive the compacted store files which still exist.
7632
7633
7634 ---
7635
7636 * [HBASE-21820](https://issues.apache.org/jira/browse/HBASE-21820) | *Major* | **Implement CLUSTER quota scope**
7637
7638 HBase contains two quota scopes: MACHINE and CLUSTER. Before this patch, set quota operations did not expose scope option to client api and use MACHINE as default, CLUSTER scope can not be set and used.
7639 Shell commands are as follows:
7640 set\_quota, TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec'
7641
7642 This issue implements CLUSTER scope in a simple way: For user, namespace, user over namespace quota, use [ClusterLimit / RSNum] as machine limit. For table and user over table quota, use [ClusterLimit / TotalTableRegionNum \* MachineTableRegionNum] as machine limit.
7643 After this patch, user can set CLUSTER scope quota, but MACHINE is still default if user ignore scope.
7644 Shell commands are as follows:
7645 set\_quota, TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec'
7646 set\_quota, TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec', SCOPE =\> MACHINE
7647 set\_quota, TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec', SCOPE =\> CLUSTER
7648
7649
7650 ---
7651
7652 * [HBASE-21057](https://issues.apache.org/jira/browse/HBASE-21057) | *Minor* | **upgrade to latest spotbugs**
7653
7654 Change spotbugs version to 3.1.11.
7655
7656
7657 ---
7658
7659 * [HBASE-21505](https://issues.apache.org/jira/browse/HBASE-21505) | *Major* | **Several inconsistencies on information reported for Replication Sources by hbase shell status 'replication' command.**
7660
7661 This modifies "status 'replication'" output, fixing inconsistencies on the reporting times and ages of last shipped edits, as well as wrong calculation of replication lags.
7662
7663 It also introduces additional info for each recovery queue, which was not accounted by this command before.
7664
7665 The new output for "status 'replication'" command is explained in details below:
7666 a) Source started, target stopped, no edits arrived on source yet:
7667 ...
7668  SOURCE: PeerID=1
7669          Normal Queue: 1
7670            No Ops shipped since last restart, SizeOfLogQueue=1, No edits for this source since it started, Replication Lag=0
7671 ...
7672 b) Source started, target stopped, add edit on source:
7673 ...
7674 Normal Queue: 1
7675            No Ops shipped since last restart, SizeOfLogQueue=1, TimeStampOfLastArrivedInSource=Wed Nov 21 07:21:00 GMT 2018, Replication Lag=2459
7676 ...
7677 c) Source started, target stopped, edit added on source, restart source:
7678 ...
7679 SOURCE: PeerID=1
7680          Normal Queue: 1
7681            No Ops shipped since last restart, SizeOfLogQueue=1, No edits for this source since it started, Replication Lag=0
7682          Recovered Queue: 1-hbase01.home,16020,1542784524057
7683            No Ops shipped since last restart, SizeOfLogQueue=1, TimeStampOfLastArrivedInSource=Wed Nov 21 07:23:00 GMT 2018, Replication Lag=201495
7684 ...
7685 d) Source started, target stopped, add edit on source, restart source, add another edit on source:
7686 ...
7687 SOURCE: PeerID=1
7688          Normal Queue: 1
7689            No Ops shipped since last restart, SizeOfLogQueue=1, TimeStampOfLastArrivedInSource=Wed Nov 21 07:02:28 GMT 2018, Replication Lag=6349
7690          Recovered Queue: 1-hbase01.home,16020,1542782758742
7691            No Ops shipped since last restart, SizeOfLogQueue=0, TimeStampOfLastArrivedInSource=Wed Nov 21 06:53:05 GMT 2018, Replication Lag=569394
7692 ...
7693 e) Source started, target stopped, add edit on source, restart source, add another edit on source, start target:
7694 ...
7695        SOURCE: PeerID=1
7696          Normal Queue: 1
7697            AgeOfLastShippedOp=30000, TimeStampOfLastShippedOp=Wed Nov 21 07:07:58 GMT 2018, SizeOfLogQueue=1, TimeStampOfLastArrivedInSource=Wed Nov 21 07:02:28 GMT 2018, Replication Lag=0
7698 ...
7699 f) Source started, target stopped, add edit on source, restart source, restart target:
7700 ...
7701 SOURCE: PeerID=1
7702          Normal Queue: 1
7703            No Ops shipped since last restart, SizeOfLogQueue=1, No edits for this source since it started, Replication Lag=0
7704 ...
7705
7706
7707 ---
7708
7709 * [HBASE-21922](https://issues.apache.org/jira/browse/HBASE-21922) | *Major* | **BloomContext#sanityCheck may failed when use ROWPREFIX\_DELIMITED bloom filter**
7710
7711 Remove bloom filter type ROWPREFIX\_DELIMITED. May add it back when find a better solution.
7712
7713
7714 ---
7715
7716 * [HBASE-21783](https://issues.apache.org/jira/browse/HBASE-21783) | *Major* | **Support exceed user/table/ns throttle quota if region server has available quota**
7717
7718 Support enable or disable exceed throttle quota. Exceed throttle quota means, user can over consume user/namespace/table quota if region server has additional available quota because other users don't consume at the same time.
7719 Use the following shell commands to enable/disable exceed throttle quota: enable\_exceed\_throttle\_quota
7720 disable\_exceed\_throttle\_quota
7721 There are two limits when enable exceed throttle quota:
7722 1. Must set at least one read and one write region server throttle quota;
7723 2. All region server throttle quotas must be in seconds time unit. Because once previous requests exceed their quota and consume region server quota, quota in other time units may be refilled in a long time, this may affect later requests.
7724
7725
7726 ---
7727
7728 * [HBASE-20587](https://issues.apache.org/jira/browse/HBASE-20587) | *Major* | **Replace Jackson with shaded thirdparty gson**
7729
7730 Remove jackson dependencies from most hbase modules except hbase-rest, use shaded gson instead. The output json will be a bit different since jackson can use getter/setter, but gson will always use the fields.
7731
7732
7733 ---
7734
7735 * [HBASE-21928](https://issues.apache.org/jira/browse/HBASE-21928) | *Major* | **Deprecated HConstants.META\_QOS**
7736
7737 Mark HConstants.META\_QOS as deprecated. It is for internal use only, which is the highest priority. You should not try to set a priority greater than or equal to this value, although it is no harm but also useless.
7738
7739
7740 ---
7741
7742 * [HBASE-17942](https://issues.apache.org/jira/browse/HBASE-17942) | *Major* | **Disable region splits and merges per table**
7743
7744 This patch adds the ability to disable split and/or merge for a table (By default, split and merge are enabled for a table).
7745
7746
7747 ---
7748
7749 * [HBASE-21636](https://issues.apache.org/jira/browse/HBASE-21636) | *Major* | **Enhance the shell scan command to support missing scanner specifications like ReadType, IsolationLevel etc.**
7750
7751 Allows shell to set Scan options previously not exposed. See additions as part of the scan help by typing following hbase shell:
7752
7753 hbase\> help 'scan'
7754
7755
7756 ---
7757
7758 * [HBASE-21201](https://issues.apache.org/jira/browse/HBASE-21201) | *Major* | **Support to run VerifyReplication MR tool without peerid**
7759
7760 We can specify peerQuorumAddress instead of peerId in VerifyReplication tool. So it no longer requires peerId to be setup when using this tool.
7761
7762 For example:
7763 hbase org.apache.hadoop.hbase.mapreduce.replication.VerifyReplication zk1,zk2,zk3:2181/hbase testTable
7764
7765
7766 ---
7767
7768 * [HBASE-21838](https://issues.apache.org/jira/browse/HBASE-21838) | *Major* | **Create a special ReplicationEndpoint just for verifying the WAL entries are fine**
7769
7770 Introduce a VerifyWALEntriesReplicationEndpoint which replicates nothing but only verifies if all the cells are valid.
7771 It can be used to capture bugs for writing WAL, as most times we will not read the WALs again after writing it if there are no region server crashes.
7772
7773
7774 ---
7775
7776 * [HBASE-21764](https://issues.apache.org/jira/browse/HBASE-21764) | *Major* | **Size of in-memory compaction thread pool should be configurable**
7777
7778 Introduced an new config key in this issue: hbase.regionserver.inmemory.compaction.pool.size. the default value would be 10.  you can configure this to set the pool size of in-memory compaction pool. Note that all memstores in one region server will share the same pool, so if you have many regions in one region server,  you need to set this larger to compact faster for better read performance.
7779
7780
7781 ---
7782
7783 * [HBASE-21684](https://issues.apache.org/jira/browse/HBASE-21684) | *Major* | **Throw DNRIOE when connection or rpc client is closed**
7784
7785 Make StoppedRpcClientException extend DoNotRetryIOException.
7786
7787
7788 ---
7789
7790 * [HBASE-21739](https://issues.apache.org/jira/browse/HBASE-21739) | *Major* | **Move grant/revoke from regionserver to master**
7791
7792 To implement user permission control in Precedure V2, move grant and revoke method from AccessController to master firstly.
7793 Mark AccessController#grant and AccessController#revoke as deprecated and please use Admin#grant and Admin#revoke instead.
7794
7795
7796 ---
7797
7798 * [HBASE-21791](https://issues.apache.org/jira/browse/HBASE-21791) | *Blocker* | **Upgrade thrift dependency to 0.12.0**
7799
7800 IMPORTANT: Due to security issues, all users who use hbase thrift should avoid using releases which do not have this fix.
7801
7802 The effect releases are:
7803 2.1.x: 2.1.2 and below
7804 2.0.x: 2.0.4 and below
7805 1.x: 1.4.x and below
7806
7807 If you are using the effect releases above, please consider upgrading to a newer release ASAP.
7808
7809
7810 ---
7811
7812 * [HBASE-20894](https://issues.apache.org/jira/browse/HBASE-20894) | *Major* | **Move BucketCache from java serialization to protobuf**
7813
7814 For users who have configured hbase.bucketcache.ioengine with either the file:, files:, or mmap: prefix, and configured it to be persistent via the hbase.bucketcache.persistent.path property, the serialization format of the bucket cache has changed between versions. The old state will not be read during startup, and there is currently no migration path. The impact is expected to be minimal, however, since the cache will rebuild over time as access patterns dictate.
7815
7816
7817
7818 # HBASE  2.3.0 Release Notes
7819
7820 These release notes cover new developer and user-facing incompatibilities, important issues, features, and major improvements.
7821
7822
7823 ---
7824
7825 * [HBASE-24545](https://issues.apache.org/jira/browse/HBASE-24545) | *Major* | **Add backoff to SCP check on WAL split completion**
7826
7827 Adds backoff in ServerCrashProcedure wait on WAL split to complete if large backlog of files to split (Its possible to avoid SCP blocking, waiting on WALs to split if you use procedure-based splitting --  set 'hbase.split.wal.zk.coordinated' to false to enable procedure based wal splitting.)
7828
7829
7830 ---
7831
7832 * [HBASE-24524](https://issues.apache.org/jira/browse/HBASE-24524) | *Minor* | **SyncTable logging improvements**
7833
7834 Notice this has changed log level for mismatching row keys, originally those were being logged at INFO level, now it's logged at DEBUG level. This is consistent with the logging of mismatching cells. Also, for missing row keys, it now logs row key values in human readable format, making it more meaningful for operators troubleshooting mismatches.
7835
7836
7837 ---
7838
7839 * [HBASE-24359](https://issues.apache.org/jira/browse/HBASE-24359) | *Major* | **Optionally ignore edits for deleted CFs for replication.**
7840
7841 Introduce a new config hbase.replication.drop.on.deleted.columnfamily, default is false. When config to true, the replication will drop the edits for columnfamily that has been deleted from the replication source and target.
7842
7843
7844 ---
7845
7846 * [HBASE-24418](https://issues.apache.org/jira/browse/HBASE-24418) | *Major* | **Consolidate Normalizer implementations**
7847
7848 <!-- markdown -->
7849 This change extends the Normalizer with a handful of new configurations. The configuration points supported are:
7850 * `hbase.normalizer.split.enabled` Whether to split a region as part of normalization. Default: `true`.
7851 * `hbase.normalizer.merge.enabled` Whether to merge a region as part of normalization. Default `true`.
7852 * `hbase.normalizer.min.region.count` The minimum number of regions in a table to consider it for merge normalization. Default: 3.
7853 * `hbase.normalizer.merge.min_region_age.days` The minimum age for a region to be considered for a merge, in days. Default: 3.
7854 * `hbase.normalizer.merge.min_region_size.mb` The minimum size for a region to be considered for a merge, in whole MBs. Default: 1.
7855
7856
7857 ---
7858
7859 * [HBASE-24309](https://issues.apache.org/jira/browse/HBASE-24309) | *Major* | **Avoid introducing log4j and slf4j-log4j dependencies for modules other than hbase-assembly**
7860
7861 Add a hbase-logging module, put the log4j related code in this module only so other modules do not need to depend on log4j at compile scope. See the comments of Log4jUtils and InternalLog4jUtils for more details.
7862
7863 Add a log4j.properties to the test jar of hbase-logging module, so for other sub modules we just need to depend on the test jar of hbase-logging module at test scope to output the log to console, without placing a log4j.properties in the test resources as they all (almost) have the same content. And this test module will not be included in the assembly tarball so it will not mess up the binary distribution.
7864
7865 Ban direct commons-logging dependency, and ban commons-logging and log4j imports in non-test code, to avoid mess up the downstream users logging framework. In hbase-logging module we do need to use log4j classes and the trick is to use full class name.
7866
7867 Add jcl-over-slf4j and jul-to-slf4j dependencies, as some of our dependencies use jcl or jul as logging framework, we should also redirect their log message to slf4j.
7868
7869
7870 ---
7871
7872 * [HBASE-21406](https://issues.apache.org/jira/browse/HBASE-21406) | *Minor* | **"status 'replication'" should not show SINK if the cluster does not act as sink**
7873
7874 Added new metric to differentiate sink startup time from last OP applied time.
7875
7876 Original behaviour was to always set startup time to TimestampsOfLastAppliedOp, and always show it on "status 'replication'" command, regardless if the sink ever applied any OP.
7877
7878 This was confusing, specially for scenarios where cluster was just acting as source, the output could lead to wrong interpretations about sink not applying edits or replication being stuck.
7879
7880 With the new metric, we now compare the two metrics values, assuming that if both are the same, there's never been any OP shipped to the given sink, so output would reflect it more clearly, to something as for example:
7881
7882 SINK: TimeStampStarted=Thu Dec 06 23:59:47 GMT 2018, Waiting for OPs...
7883
7884
7885 ---
7886
7887 * [HBASE-24132](https://issues.apache.org/jira/browse/HBASE-24132) | *Major* | **Upgrade to Apache ZooKeeper 3.5.7**
7888
7889 <!-- markdown -->
7890 HBase ships ZooKeeper 3.5.x. Was the EOL'd 3.4.x. 3.5.x client can talk to 3.4.x ensemble.
7891
7892 The ZooKeeper project has built a [FAQ](https://cwiki.apache.org/confluence/display/ZOOKEEPER/Upgrade+FAQ) that documents known issues and work-arounds when upgrading existing deployments.
7893
7894
7895 ---
7896
7897 * [HBASE-22287](https://issues.apache.org/jira/browse/HBASE-22287) | *Major* | **inifinite retries on failed server in RSProcedureDispatcher**
7898
7899 Add backoff. Avoid retrying every 100ms.
7900
7901
7902 ---
7903
7904 * [HBASE-24425](https://issues.apache.org/jira/browse/HBASE-24425) | *Major* | **Run hbck\_chore\_run and catalogjanitor\_run on draw of 'HBCK Report' page**
7905
7906 Runs 'catalogjanitor\_run' and 'hbck\_chore\_run' inline with the loading of the 'HBCK Report' page.
7907
7908 Pass '?cache=true' to skip inline invocation of 'catalogjanitor\_run' and 'hbck\_chore\_run' drawing the page.
7909
7910
7911 ---
7912
7913 * [HBASE-24408](https://issues.apache.org/jira/browse/HBASE-24408) | *Blocker* | **Introduce a general 'local region' to store data on master**
7914
7915 Introduced a general 'local region' at master side to store the procedure data, etc.
7916
7917 The hfile of this region will be stored on the root fs while the wal will be stored on the wal fs. This issue supercedes part of the code for HBASE-23326, as now we store the data in 'MasterData' directory instead of 'MasterProcs'.
7918
7919 The old hfiles will be moved to the global hfile archived directory with the suffix $-masterlocalhfile-$. The wal files will be moved to the global old wal directory with the suffix $masterlocalwal$. The TimeToLiveMasterLocalStoreHFileCleaner and TimeToLiveMasterLocalStoreWALCleaner are configured by default for cleaning the old hfiles and wal files, and the default TTLs are both 7 days.
7920
7921
7922 ---
7923
7924 * [HBASE-24115](https://issues.apache.org/jira/browse/HBASE-24115) | *Major* | **Relocate test-only REST "client" from src/ to test/ and mark Private**
7925
7926 Relocate test-only REST RemoteHTable and RemoteAdmin from src/ to test/. And mark them as InterfaceAudience.Private.
7927
7928
7929 ---
7930
7931 * [HBASE-23938](https://issues.apache.org/jira/browse/HBASE-23938) | *Major* | **Replicate slow/large RPC calls to HDFS**
7932
7933 Config key: hbase.regionserver.slowlog.systable.enabled
7934 Default value: false
7935
7936 This config can be enabled if hbase.regionserver.slowlog.buffer.enabled is already enabled. While hbase.regionserver.slowlog.buffer.enabled ensures that any slow/large RPC logs with complete details are written to ring buffer available at each RegionServer, hbase.regionserver.slowlog.systable.enabled would ensure that all such logs are also persisted in new system table hbase:slowlog.
7937 Operator can scan hbase:slowlog with filters to retrieve specific attribute matching records and this table would be useful to capture historical performance of slowness of RPC calls with detailed analysis.
7938
7939 hbase:slowlog consists of single ColumnFamily info. info consists of multiple qualifiers similar to the attributes available to query as part of Admin API: get\_slowlog\_responses.
7940
7941 One example of a row from hbase:slowlog scan result (Attached a sample screenshot in the Jira) :
7942
7943  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:call\_details, timestamp=2020-05-16T14:59:58.764Z, value=Scan(org.apache.hadoop.hbase.shaded.protobuf.generated.ClientProtos$ScanRequest)
7944  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:client\_address, timestamp=2020-05-16T14:59:58.764Z, value=172.20.10.2:57348
7945  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:method\_name, timestamp=2020-05-16T14:59:58.764Z, value=Scan
7946  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:param, timestamp=2020-05-16T14:59:58.764Z, value=region { type: REGION\_NAME value: "cluster\_test,cccccccc,1589635796466.aa45e1571d533f5ed0bb31cdccaaf9cf." } scan { a
7947                                                              ttribute { name: "\_isolationlevel\_" value: "\\x5C000" } start\_row: "cccccccc" time\_range { from: 0 to: 9223372036854775807 } max\_versions: 1 cache\_blocks: true max\_result\_size: 2
7948                                                              097152 caching: 2147483647 include\_stop\_row: false } number\_of\_rows: 2147483647 close\_scanner: false client\_handles\_partials: true client\_handles\_heartbeats: true track\_scan\_met
7949                                                              rics: false
7950  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:processing\_time, timestamp=2020-05-16T14:59:58.764Z, value=24
7951  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:queue\_time, timestamp=2020-05-16T14:59:58.764Z, value=0
7952  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:region\_name, timestamp=2020-05-16T14:59:58.764Z, value=cluster\_test,cccccccc,1589635796466.aa45e1571d533f5ed0bb31cdccaaf9cf.
7953  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:response\_size, timestamp=2020-05-16T14:59:58.764Z, value=211227
7954  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:server\_class, timestamp=2020-05-16T14:59:58.764Z, value=HRegionServer
7955  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:start\_time, timestamp=2020-05-16T14:59:58.764Z, value=1589640743932
7956  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:type, timestamp=2020-05-16T14:59:58.764Z, value=ALL
7957  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:username, timestamp=2020-05-16T14:59:58.764Z, value=vjasani
7958
7959
7960 ---
7961
7962 * [HBASE-24271](https://issues.apache.org/jira/browse/HBASE-24271) | *Major* | **Set values in \`conf/hbase-site.xml\` that enable running on \`LocalFileSystem\` out of the box**
7963
7964 <!-- markdown -->
7965 HBASE-24271 makes changes the the default `conf/hbase-site.xml` such that `bin/hbase` will run directly out of the binary tarball or a compiled source tree without any configuration modifications vs. Hadoop 2.8+. This changes our long-standing history of shipping no configured values in `conf/hbase-site.xml`, so existing processes that assume this file is empty of configuration properties may require attention.
7966
7967
7968 ---
7969
7970 * [HBASE-24310](https://issues.apache.org/jira/browse/HBASE-24310) | *Major* | **Use Slf4jRequestLog for hbase-http**
7971
7972 Use Slf4jRequestLog instead of the log4j HttpRequestLogAppender in HttpServer.
7973
7974 The request log is disabled by default in conf/log4j.properties by the following lines:
7975
7976 # Disable request log by default, you can enable this by changing the appender
7977 log4j.category.http.requests=INFO,NullAppender
7978 log4j.additivity.http.requests=false
7979
7980 Change the 'NullAppender' to what ever you want if you want to enable request log.
7981
7982 Notice that, the logger name for master status http server is 'http.requests.master', and for region server it is 'http.requests.regionserver'
7983
7984
7985 ---
7986
7987 * [HBASE-24335](https://issues.apache.org/jira/browse/HBASE-24335) | *Major* | **Support deleteall with ts but without column in shell mode**
7988
7989 Use a empty string to represent no column specified for deleteall in shell mode.
7990 useage:
7991 deleteall 'test','r1','',12345
7992 deleteall 'test', {ROWPREFIXFILTER =\> 'prefix'}, '', 12345
7993
7994
7995 ---
7996
7997 * [HBASE-24304](https://issues.apache.org/jira/browse/HBASE-24304) | *Major* | **Separate a hbase-asyncfs module**
7998
7999 Added a new hbase-asyncfs module to hold the asynchronous dfs output stream implementation for implementing WAL.
8000
8001
8002 ---
8003
8004 * [HBASE-22710](https://issues.apache.org/jira/browse/HBASE-22710) | *Major* | **Wrong result in one case of scan that use  raw and versions and filter together**
8005
8006 Make the logic of the versions chosen more reasonable for raw scan, to avoid lose result when using filter.
8007
8008
8009 ---
8010
8011 * [HBASE-24285](https://issues.apache.org/jira/browse/HBASE-24285) | *Major* | **Move to hbase-thirdparty-3.3.0**
8012
8013 Moved to hbase-thirdparty 3.3.0.
8014
8015
8016 ---
8017
8018 * [HBASE-24252](https://issues.apache.org/jira/browse/HBASE-24252) | *Major* | **Implement proxyuser/doAs mechanism for hbase-http**
8019
8020 This feature enables the HBase Web UI's to accept a 'proxyuser' via the HTTP Request's query string. When the parameter \`hbase.security.authentication.spnego.kerberos.proxyuser.enable\` is set to \`true\` in hbase-site.xml (default is \`false\`), the HBase UI will attempt to impersonate the user specified by the query parameter "doAs". This query parameter is checked case-insensitively. When this option is not provided, the user who executed the request is the "real" user and there is no ability to execute impersonation against the WebUI.
8021
8022 For example, if the user "bob" with Kerberos credentials executes a request against the WebUI with this feature enabled and a query string which includes \`doAs=alice\`, the HBase UI will treat this request as executed as \`alice\`, not \`bob\`.
8023
8024 The standard Hadoop proxyuser configuration properties to limit users who may impersonate others apply to this change (e.g. to enable \`bob\` to impersonate \`alice\`). See the Hadoop documentation for more information on how to configure these proxyuser rules.
8025
8026
8027 ---
8028
8029 * [HBASE-24143](https://issues.apache.org/jira/browse/HBASE-24143) | *Major* | **[JDK11] Switch default garbage collector from CMS**
8030
8031 <!-- markdown -->
8032 `bin/hbase` will now dynamically select a Garbage Collector implementation based on the detected JVM version. JDKs 8,9,10 use `-XX:+UseConcMarkSweepGC`, while JDK11+ use `-XX:+UseG1GC`.
8033
8034 Notice a slight compatibility change. Previously, the garbage collector choice would always be appended to a user-provided value for `HBASE_OPTS`. As of this change, this setting will only be applied when `HBASE_OPTS` is unset. That means that operators who provide a value for this variable will now need to also specify the collector. This is especially important for those on JDK8, where the vm default GC is not the recommended ConcMarkSweep.
8035
8036
8037 ---
8038
8039 * [HBASE-24024](https://issues.apache.org/jira/browse/HBASE-24024) | *Major* | **Optionally reject multi() requests with very high no of rows**
8040
8041 New Config: hbase.rpc.rows.size.threshold.reject
8042 -----------------------------------------------------------------------
8043
8044 Default value: false
8045 Description:
8046 If value is true, RegionServer will abort batch requests of Put/Delete with number of rows in a batch operation exceeding threshold defined by value of config: hbase.rpc.rows.warning.threshold.
8047
8048
8049 ---
8050
8051 * [HBASE-24139](https://issues.apache.org/jira/browse/HBASE-24139) | *Critical* | **Balancer should avoid leaving idle region servers**
8052
8053 StochasticLoadBalancer functional improvement:
8054
8055 StochasticLoadBalancer would rebalance the cluster if there are any idle RegionServers in the cluster (RegionServer having no region), while other RegionServers have at least 1 region available.
8056
8057
8058 ---
8059
8060 * [HBASE-24196](https://issues.apache.org/jira/browse/HBASE-24196) | *Major* | **[Shell] Add rename rsgroup command in hbase shell**
8061
8062 user or admin can now use
8063 hbase shell \> rename\_rsgroup 'oldname', 'newname'
8064 to rename rsgroup.
8065
8066
8067 ---
8068
8069 * [HBASE-24218](https://issues.apache.org/jira/browse/HBASE-24218) | *Major* | **Add hadoop 3.2.x in hadoop check**
8070
8071 Add hadoop-3.2.0 and hadoop-3.2.1 in hadoop check and when '--quick-hadoopcheck' we will only check hadoop-3.2.1.
8072
8073 Notice that, for aligning the personality scripts across all the active branches, we will commit the patch to all active branches, but the hadoop-3.2.x support in hadoopcheck is only applied to branch-2.2+.
8074
8075
8076 ---
8077
8078 * [HBASE-23829](https://issues.apache.org/jira/browse/HBASE-23829) | *Major* | **Get \`-PrunSmallTests\` passing on JDK11**
8079
8080 \`-PrunSmallTests\` now pass on JDK11 when using \`-Phadoop.profile=3.0\`.
8081
8082
8083 ---
8084
8085 * [HBASE-24185](https://issues.apache.org/jira/browse/HBASE-24185) | *Major* | **Junit tests do not behave well with System.exit or Runtime.halt or JVM exits in general.**
8086
8087 Tests that fail because a process -- RegionServer or Master -- called System.exit, will now instead throw an exception.
8088
8089
8090 ---
8091
8092 * [HBASE-24072](https://issues.apache.org/jira/browse/HBASE-24072) | *Major* | **Nightlies reporting OutOfMemoryError: unable to create new native thread**
8093
8094 Hadoop hosts have had their ulimit -u raised from 10000 to 30000 (per user, by INFRA). The Docker build container has had its limit raised from 10000 to 12500.
8095
8096
8097 ---
8098
8099 * [HBASE-24112](https://issues.apache.org/jira/browse/HBASE-24112) | *Major* | **[RSGroup] Support renaming rsgroup**
8100
8101 Support RSGroup renaming in core codebase. New API Admin#renameRSGroup(String, String) is introduced in 3.0.0.
8102
8103
8104 ---
8105
8106 * [HBASE-23994](https://issues.apache.org/jira/browse/HBASE-23994) | *Trivial* | ** Add WebUI to Canary**
8107
8108 <!-- markdown -->
8109 The Canary tool now offers a WebUI when run in `region` mode (the default mode). It is enabled by default, and by default, it binds to `0.0.0.0:16050`. This can be overridden by setting `hbase.canary.info.bindAddress` and `hbase.canary.info.port`. To disable entirely, set the port to `-1`.
8110
8111
8112 ---
8113
8114 * [HBASE-23779](https://issues.apache.org/jira/browse/HBASE-23779) | *Major* | **Up the default fork count to make builds complete faster; make count relative to CPU count**
8115
8116 Pass --threads=2 building on jenkins. It shortens nightly build times by about ~25%.
8117
8118 It works by running module build/test in parallel when dependencies allow. Upping the forkcount beyond the pom default of 0.25C would have us broach our CPU budget on jenkins when two modules are running in parallel (2 modules at 0.25% of CPU each makes 0.5C and on jenkins, hadoop nodes run two jenkins executors per host).  Higher forkcounts also seems to threaten build stability.
8119
8120 For running tests locally, to go faster, up fork count.
8121
8122 $ x="0.5C"  ;  mvn --threads=2  -Dsurefire.firstPartForkCount=$x -Dsurefire.secondPartForkCount=$x test -PrunAllTests
8123
8124 You could up the x from 0.5C to 1.0C but YMMV (On overcommitted hardware, tests start bombing out pretty soon after startup). You could try upping thread count but on occasion are likely to overcommit hardware.
8125
8126
8127 ---
8128
8129 * [HBASE-24126](https://issues.apache.org/jira/browse/HBASE-24126) | *Major* | **Up the container nproc uplimit from 10000 to 12500**
8130
8131 Start docker with upped ulimit for nproc passing '--ulimit nproc=12500'. It was 10000, the default, but made it 12500. Then, set PROC\_LIMIT in hbase-personality so when yetus runs, it is w/ the new 12500 value.
8132
8133
8134 ---
8135
8136 * [HBASE-24150](https://issues.apache.org/jira/browse/HBASE-24150) | *Major* | **Allow module tests run in parallel**
8137
8138 Pass -T2 to mvn. Makes it so we do two modules-at-a-time dependencies willing. Helps speed build and testing. Doubles the resource usage when running modules in parallel.
8139
8140
8141 ---
8142
8143 * [HBASE-24121](https://issues.apache.org/jira/browse/HBASE-24121) | *Major* | **[Authorization] ServiceAuthorizationManager isn't dynamically updatable. And it should be.**
8144
8145 Master & RegionService now support refresh policy authorization defined in hbase-policy.xml without restarting service. To refresh policy, please execute hbase shell command: update\_config or update\_config\_all after policy file updated and synced on all nodes.
8146
8147
8148 ---
8149
8150 * [HBASE-24099](https://issues.apache.org/jira/browse/HBASE-24099) | *Major* | **Use a fair ReentrantReadWriteLock for the region close lock**
8151
8152 This change modifies the default acquisition policy for the region's close lock in order to prevent observed starvation of close requests. The new boolean configuration parameter 'hbase.regionserver.fair.region.close.lock' controls the lock acquisition policy: if true, the lock is created in fair mode (default); if false, the lock is created in nonfair mode (the old default).
8153
8154
8155 ---
8156
8157 * [HBASE-23153](https://issues.apache.org/jira/browse/HBASE-23153) | *Major* | **PrimaryRegionCountSkewCostFunction SLB function should implement CostFunction#isNeeded**
8158
8159 <!-- markdown -->
8160 The `PrimaryRegionCountSkewCostFunction` for the `StochasticLoadBalancer` is only needed when the read replicas feature is enabled. With this change, that function now properly indicates that it is not needed when the read replica feature is off.
8161
8162 If this improvement is not available, operators with clusters that are not using the read replica feature should manually disable it by setting `hbase.master.balancer.stochastic.primaryRegionCountCost` to `0.0` in hbase-site.xml for all HBase Masters.
8163
8164
8165 ---
8166
8167 * [HBASE-24055](https://issues.apache.org/jira/browse/HBASE-24055) | *Major* | **Make AsyncFSWAL can run on EC cluster**
8168
8169 Now AsyncFSWAL can also be used against the directory which has EC enabled. Need to make sure you also make use of the hadoop 3.x client as the option is only available in hadoop 3.x.
8170
8171
8172 ---
8173
8174 * [HBASE-24113](https://issues.apache.org/jira/browse/HBASE-24113) | *Major* | **Upgrade the maven we use from 3.5.4 to 3.6.3 in nightlies**
8175
8176 Branches-2.3+ use maven 3.5.3 building. Older branches use 3.5.4 still.
8177
8178
8179 ---
8180
8181 * [HBASE-24122](https://issues.apache.org/jira/browse/HBASE-24122) | *Major* | **Change machine ulimit-l to ulimit-a so dumps full ulimit rather than just 'max locked memory'**
8182
8183 Our 'Build Artifacts' have a machine directory under which we emit vitals on the host the build was run on. We used to emit the result of 'ulimit -l' as a file named 'ulimit-l'. This has been hijacked to instead emit result of running 'ulimit -a' which includes stat on ulimit -l.
8184
8185
8186 ---
8187
8188 * [HBASE-23678](https://issues.apache.org/jira/browse/HBASE-23678) | *Major* | **Literate builder API for version management in schema**
8189
8190 ColumnFamilyDescriptor new builder API:
8191
8192     /\*\*
8193      \* Retain all versions for a given TTL(retentionInterval), and then only a specific number
8194      \* of versions(versionAfterInterval) after that interval elapses.
8195      \*
8196      \* @param retentionInterval Retain all versions for this interval
8197      \* @param versionAfterInterval Retain no of versions to retain after retentionInterval
8198      \*/
8199     public ModifyableColumnFamilyDescriptor setVersionsWithTimeToLive(
8200         final int retentionInterval, final int versionAfterInterval)
8201
8202
8203 ---
8204
8205 * [HBASE-24050](https://issues.apache.org/jira/browse/HBASE-24050) | *Major* | **Deprecated PBType on all 2.x branches**
8206
8207 org.apache.hadoop.hbase.types.PBType is marked as deprecated without any replacement. It will be moved to hbase-example module and marked as IA.Private in 3.0.0. This is a mistake as it should not be part of our public API. Users who depend on this class should just copy the code your own code base.
8208
8209
8210 ---
8211
8212 * [HBASE-8868](https://issues.apache.org/jira/browse/HBASE-8868) | *Minor* | **add metric to report client shortcircuit reads**
8213
8214 Expose file system level read metrics for RegionServer.
8215
8216 If the HBase RS runs on top of HDFS, calculate the aggregation of
8217 ReadStatistics of each HdfsFileInputStream. These metrics include:
8218 (1) total number of bytes read from HDFS.
8219 (2) total number of bytes read from local DataNode.
8220 (3) total number of bytes read locally through short-circuit read.
8221 (4) total number of bytes read locally through zero-copy read.
8222
8223 Because HDFS ReadStatistics is calculated per input stream, it is not
8224 feasible to update the aggregated number in real time. Instead, the
8225 metrics are updated when an input stream is closed.
8226
8227
8228 ---
8229
8230 * [HBASE-24032](https://issues.apache.org/jira/browse/HBASE-24032) | *Major* | **[RSGroup] Assign created tables to respective rsgroup automatically instead of manual operations**
8231
8232 Admin can determine which tables go to which rsgroup by script  (setting hbase.rsgroup.table.mapping.script with local filystem path) on Master side which aims to lighten the burden of admin operations.  Note, since HBase 3+, rsgroup can be specified in TableDescriptor as well, if clients specify this, master will skip the determination from script.
8233
8234 Here is a simple example of script:
8235 {code}
8236 # Input consists of two string, 1st is the namespace of the table, 2nd is the table name of the table
8237 #!/bin/bash
8238 namespace=$1
8239 tablename=$2
8240 if [[ $namespace == test ]]; then
8241   echo test
8242 elif [[ $tablename == \*foo\* ]]; then
8243   echo other
8244 else
8245   echo default
8246 fi
8247 {code}
8248
8249
8250 ---
8251
8252 * [HBASE-23993](https://issues.apache.org/jira/browse/HBASE-23993) | *Major* | **Use loopback for zk standalone server in minizkcluster**
8253
8254 MiniZKCluster now puts up its standalone node listening on loopback/127.0.0.1 rather than "localhost".
8255
8256
8257 ---
8258
8259 * [HBASE-23986](https://issues.apache.org/jira/browse/HBASE-23986) | *Major* | **Bump hadoop-two.version to 2.10.0 on master and branch-2**
8260
8261 Bumped hadoop-two.version to 2.10.0, which means we will drop the support for hadoop-2.8.x and hadoop-2.9.x.
8262
8263
8264 ---
8265
8266 * [HBASE-23930](https://issues.apache.org/jira/browse/HBASE-23930) | *Minor* | **Shell should attempt to format \`timestamp\` attributes as ISO-8601**
8267
8268 Change timestamp display to be ISO8601 when toString on Cell and outputting in shell....
8269
8270 User used to see....
8271
8272   column=table:state, timestamp=1583967620343 .....
8273
8274 ... but now sees:
8275
8276   column=table:state, timestamp=2020-03-11T23:00:20.343Z ....
8277
8278
8279 ---
8280
8281 * [HBASE-22827](https://issues.apache.org/jira/browse/HBASE-22827) | *Major* | **Expose multi-region merge in shell and Admin API**
8282
8283 merge\_region shell command can now be used to merge more than 2 regions as well. It takes a list of regions as comma separated values or as an array of regions, and not just 2 regions. The full regionnames and encoded regionnames are continued to be accepted.
8284
8285
8286 ---
8287
8288 * [HBASE-23767](https://issues.apache.org/jira/browse/HBASE-23767) | *Major* | **Add JDK11 compilation and unit test support to Github precommit**
8289
8290 Rebuild our Dockerfile with support for multiple JDK versions. Use multiple stages in the Jenkinsfile instead of yetus's multijdk because of YETUS-953. Run those multiple stages in parallel to speed up results.
8291
8292 Note that multiple stages means multiple Yetus invocations means multiple comments on the PreCommit. This should become more obvious to users once we can make use of GitHub Checks API, HBASE-23902.
8293
8294
8295 ---
8296
8297 * [HBASE-22978](https://issues.apache.org/jira/browse/HBASE-22978) | *Minor* | **Online slow response log**
8298
8299 get\_slowlog\_responses and clear\_slowlog\_responses are used to retrieve and clear slow RPC logs from RingBuffer maintained by RegionServers.
8300
8301 New Admin APIs:
8302 1.   List\<SlowLogRecord\> getSlowLogResponses(final Set\<ServerName\> serverNames,
8303       final SlowLogQueryFilter slowLogQueryFilter) throws IOException;
8304
8305 2.   List\<Boolean\> clearSlowLogResponses(final Set\<ServerName\> serverNames)
8306       throws IOException;
8307
8308 Configs:
8309
8310 1. hbase.regionserver.slowlog.ringbuffer.size:
8311 Default size of ringbuffer to be maintained by each RegionServer in order to store online slowlog responses. This is an in-memory ring buffer of requests that were judged to be too slow in addition to the responseTooSlow logging. The in-memory representation would be complete. For more details, please look into Doc Section: Get Slow Response Log from shell
8312
8313 Default
8314 256
8315
8316 2. hbase.regionserver.slowlog.buffer.enabled:
8317 Indicates whether RegionServers have ring buffer running for storing Online Slow logs in FIFO manner with limited entries. The size of the ring buffer is indicated by config: hbase.regionserver.slowlog.ringbuffer.size The default value is false, turn this on and get latest slowlog responses with complete data.
8318
8319 Default
8320 false
8321
8322
8323 For more details, please look into "Get Slow Response Log from shell" section from HBase book.
8324
8325
8326 ---
8327
8328 * [HBASE-23926](https://issues.apache.org/jira/browse/HBASE-23926) | *Major* | **[Flakey Tests] Down the flakies re-run ferocity; it makes for too many fails.**
8329
8330 Down the flakey re-rerun fork count from 1.0C -- i.e. a fork per CPU -- to 0.25C. On a recent run, the machine had 16 cores. 0.25 is 4 cores. We'd hardcoded fork count at 3 previous to changes made by parent.
8331
8332
8333 ---
8334
8335 * [HBASE-23146](https://issues.apache.org/jira/browse/HBASE-23146) | *Major* | **Support CheckAndMutate with multiple conditions**
8336
8337 Add a checkAndMutate(row, filter) method in the AsyncTable interface and the Table interface.
8338
8339 This method atomically checks if the row matches the specified filter. If it does, it adds the Put/Delete/RowMutations.
8340
8341 This is a fluent style API, the code is like:
8342
8343 For Table interface:
8344 {code}
8345 table.checkAndMutate(row, filter).thenPut(put);
8346 {code}
8347
8348 For AsyncTable interface:
8349 {code}
8350 table.checkAndMutate(row, filter).thenPut(put)
8351     .thenAccept(succ -\> {
8352       if (succ) {
8353         System.out.println("Check and put succeeded");
8354       } else {
8355         System.out.println("Check and put failed");
8356       }
8357     });
8358 {code}
8359
8360
8361 ---
8362
8363 * [HBASE-23874](https://issues.apache.org/jira/browse/HBASE-23874) | *Minor* | **Move Jira-attached file precommit definition from script in Jenkins config to dev-support**
8364
8365 The Jira Precommit job (https://builds.apache.org/job/PreCommit-HBASE-Build/) will now look for a file within the source tree (dev-support/jenkins\_precommit\_jira\_yetus.sh) instead of depending on a script section embedded in the job.
8366
8367
8368 ---
8369
8370 * [HBASE-23865](https://issues.apache.org/jira/browse/HBASE-23865) | *Major* | **Up flakey history from 5 to 10**
8371
8372 Changed flakey list reporting to show 5 rather than 10 items. Also changed the second and first part fort counts to be 1C rather than hardcoded 3.
8373
8374
8375 ---
8376
8377 * [HBASE-23554](https://issues.apache.org/jira/browse/HBASE-23554) | *Major* | **Encoded regionname to regionname utility**
8378
8379     Adds shell command regioninfo:
8380
8381       hbase(main):001:0\>  regioninfo '0e6aa5c19ae2b2627649dc7708ce27d0'
8382       {ENCODED =\> 0e6aa5c19ae2b2627649dc7708ce27d0, NAME =\> 'TestTable,,1575941375972.0e6aa5c19ae2b2627649dc7708ce27d0.', STARTKEY =\> '', ENDKEY =\> '00000000000000000000299441'}
8383       Took 0.4737 seconds
8384
8385
8386 ---
8387
8388 * [HBASE-23350](https://issues.apache.org/jira/browse/HBASE-23350) | *Major* | **Make compaction files cacheonWrite configurable based on threshold**
8389
8390 This JIRA adds a new configuration - \`hbase.rs.cachecompactedblocksonwrite.threshold\`. This configuration is the maximum total size (in bytes) of the compacted files below which the configuration \`hbase.rs.cachecompactedblocksonwrite\` is honoured. If the total size of the compacted fies exceeds this threshold, even when \`hbase.rs.cachecompactedblocksonwrite\` is enabled, the data blocks are not cached. Caching index and bloom blocks is not affected by this configuration (user configuration is always honoured).
8391
8392 Default value of this configuration is Long.MAX\_VALUE. This means whatever the total size of the compacted files, it wil be cached.
8393
8394
8395 ---
8396
8397 * [HBASE-17115](https://issues.apache.org/jira/browse/HBASE-17115) | *Major* | **HMaster/HRegion Info Server does not honour admin.acl**
8398
8399 Implements authorization for the HBase Web UI by limiting access to certain endpoints which could be used to extract sensitive information from HBase.
8400
8401 Access to these restricted endpoints can be limited to a group of administrators, identified either by a list of users (hbase.security.authentication.spnego.admin.users) or by a list of groups
8402 (hbase.security.authentication.spnego.admin.groups).  By default, neither of these values are set which will preserve backwards compatibility (allowing all authenticated users to access all endpoints).
8403
8404 Further, users who have sensitive information in the HBase service configuration can set hbase.security.authentication.ui.config.protected to true which will treat the configuration endpoint as a protected, admin-only resource. By default, all authenticated users may access the configuration endpoint.
8405
8406
8407 ---
8408
8409 * [HBASE-23647](https://issues.apache.org/jira/browse/HBASE-23647) | *Major* | **Make MasterRegistry the default registry impl**
8410
8411 <!-- markdown -->
8412 Enables master based registry as the default registry used by clients to fetch connection metadata.
8413 Refer to the section "Master Registry" in the client documentation for more details and advantages
8414 of this implementation over the default Zookeeper based registry.
8415
8416 Configuration parameter that controls the registry in use: `hbase.client.registry.impl`
8417
8418 Where to set this: HBase client configuration (hbase-site.xml)
8419
8420 Possible values:
8421 - `org.apache.hadoop.hbase.client.ZKConnectionRegistry` (For ZK based registry implementation)
8422 - `org.apache.hadoop.hbase.client.MasterRegistry` (New, for master based registry implementation)
8423
8424 Notes on defaults:
8425
8426 - For v3.0.0 and later, MasterRegistry is the default registry
8427 - For all releases in 2.x line, ZK based registry is the default.
8428
8429 This feature has been back ported to 2.3.0 and later releases. MasterRegistry can be enabled by setting the following client configuration.
8430
8431 ```
8432 <property>
8433   <name>hbase.client.registry.impl</name>
8434   <value>org.apache.hadoop.hbase.client.MasterRegistry</value>
8435 </property>
8436 ```
8437
8438
8439 ---
8440
8441 * [HBASE-23069](https://issues.apache.org/jira/browse/HBASE-23069) | *Critical* | **periodic dependency bump for Sep 2019**
8442
8443 caffeine: 2.6.2 =\> 2.8.1
8444 commons-codec: 1.10 =\> 1.13
8445 commons-io: 2.5 =\> 2.6
8446 disrupter: 3.3.6 =\> 3.4.2
8447 httpcore: 4.4.6 =\> 4.4.13
8448 jackson: 2.9.10 =\> 2.10.1
8449 jackson.databind: 2.9.10.1 =\> 2.10.1
8450 jetty: 9.3.27.v20190418 =\> 9.3.28.v20191105
8451 protobuf.plugin: 0.5.0 =\> 0.6.1
8452 zookeeper: 3.4.10 =\> 3.4.14
8453 slf4j: 1.7.25 =\> 1.7.30
8454 rat: 0.12 =\> 0.13
8455 asciidoctor: 1.5.5 =\> 1.5.8
8456 asciidoctor.pdf: 1.5.0-alpha.15 =\> 1.5.0-rc.2
8457 error-prone: 2.3.3 =\> 2.3.4
8458
8459
8460 ---
8461
8462 * [HBASE-23686](https://issues.apache.org/jira/browse/HBASE-23686) | *Major* | **Revert binary incompatible change and remove reflection**
8463
8464 - Reverts a binary incompatible binary change for ByteRangeUtils
8465 - Usage of reflection inside CommonFSUtils removed
8466
8467
8468 ---
8469
8470 * [HBASE-23055](https://issues.apache.org/jira/browse/HBASE-23055) | *Major* | **Alter hbase:meta**
8471
8472 Adds being able to edit hbase:meta table schema. For example,
8473
8474 hbase(main):006:0\> alter 'hbase:meta', {NAME =\> 'info', DATA\_BLOCK\_ENCODING =\> 'ROW\_INDEX\_V1'}
8475 Updating all regions with the new schema...
8476 All regions updated.
8477 Done.
8478 Took 1.2138 seconds
8479
8480 You can even add columnfamilies. Howevert, you cannot delete any of the core hbase:meta column families such as 'info' and 'table'.
8481
8482
8483 ---
8484
8485 * [HBASE-23347](https://issues.apache.org/jira/browse/HBASE-23347) | *Major* | **Pluggable RPC authentication**
8486
8487 This change introduces an internal abstraction layer which allows for new SASL-based authentication mechanisms to be used inside HBase services. All existing SASL-based authentication mechanism were ported to the new abstraction, making no external change in runtime semantics, client API, or RPC serialization format.
8488
8489 Developers familiar with extending HBase can implement authentication mechanism beyond simple Kerberos and DelegationTokens which authenticate HBase users against some other user database. HBase service authentication (Master to/from RegionServer) continue to operate solely over Kerberos.
8490
8491
8492 ---
8493
8494 * [HBASE-23156](https://issues.apache.org/jira/browse/HBASE-23156) | *Major* | **start-hbase.sh failed with ClassNotFoundException when build with hadoop3**
8495
8496 Introduce a new hbase-assembly/src/main/assembly/hadoop-three-compat.xml for build with hadoop 3.x.
8497
8498
8499 ---
8500
8501 * [HBASE-23680](https://issues.apache.org/jira/browse/HBASE-23680) | *Major* | **RegionProcedureStore missing cleaning of hfile archive**
8502
8503 Add a new config to hbase-default.xml
8504
8505   \<property\>
8506     \<name\>hbase.procedure.store.region.hfilecleaner.plugins\</name\>
8507     \<value\>org.apache.hadoop.hbase.master.cleaner.TimeToLiveHFileCleaner\</value\>
8508     \<description\>A comma-separated list of BaseHFileCleanerDelegate invoked by
8509     the RegionProcedureStore HFileCleaner service. These HFiles cleaners are
8510     called in order, so put the cleaner that prunes the most files in front. To
8511     implement your own BaseHFileCleanerDelegate, just put it in HBase's classpath
8512     and add the fully qualified class name here. Always add the above
8513     default hfile cleaners in the list as they will be overwritten in
8514     hbase-site.xml.\</description\>
8515   \</property\>
8516
8517 It will share the same TTL with other HFileCleaners. And you can also implement your own cleaner and change this property to enable it.
8518
8519
8520 ---
8521
8522 * [HBASE-23675](https://issues.apache.org/jira/browse/HBASE-23675) | *Minor* | **Move to Apache parent POM version 22**
8523
8524 Updated parent pom to Apache version 22.
8525
8526
8527 ---
8528
8529 * [HBASE-23679](https://issues.apache.org/jira/browse/HBASE-23679) | *Critical* | **FileSystem instance leaks due to bulk loads with Kerberos enabled**
8530
8531 This issues fixes an issue with Bulk Loading on installations with Kerberos enabled and more than a single RegionServer. When multiple tables are involved in hosting a table's regions which are being bulk-loaded into, all but the RegionServer hosting the table's first Region will "leak" one DistributedFileSystem object onto the heap, never freeing that memory. Eventually, with enough bulk loads, this will create a situation for RegionServers where they have no free heap space and will either spend all time in JVM GC, lose their ZK session, or crash with an OutOfMemoryError.
8532
8533 The only mitigation for this issue is to periodically restart RegionServers. All earlier versions of HBase 2.x are subject to this issue (2.0.x, \<=2.1.8, \<=2.2.3)
8534
8535
8536 ---
8537
8538 * [HBASE-23286](https://issues.apache.org/jira/browse/HBASE-23286) | *Major* | **Improve MTTR: Split WAL to HFile**
8539
8540 Add a new feature to improve MTTR which have 3 steps to failover:
8541 1. Read WAL and write HFile to region’s column family’s recovered.hfiles directory.
8542 2. Open region.
8543 3. Bulkload the recovered.hfiles for every column family.
8544
8545 Compared to DLS(distributed log split), this feature will reduce region open time significantly.
8546
8547 Config hbase.wal.split.to.hfile to true to enable this featue.
8548
8549
8550 ---
8551
8552 * [HBASE-23619](https://issues.apache.org/jira/browse/HBASE-23619) | *Trivial* | **Use built-in formatting for logging in hbase-zookeeper**
8553
8554 Changed the logging in hbase-zookeeper to use built-in formatting
8555
8556
8557 ---
8558
8559 * [HBASE-23628](https://issues.apache.org/jira/browse/HBASE-23628) | *Minor* | **Replace Apache Commons Digest Base64 with JDK8 Base64**
8560
8561 From the PR:
8562
8563 "Yes. The two create the same output... I just wrote a small test suite to increase my confidence on that. I generated many tens of millions of random byte patterns and compared the output of the two algorithms. They came back identical every time.
8564
8565 "Just in case any inquiring minds would like to know, there is no longer an encoding required when generating the strings. The JDK implementation specifically specifies that strings returned are StandardCharsets.ISO\_8859\_1. This does not change anything because UTF8 and ISO\_8859 overlap for the limited character set (64 characters) the encoding uses."
8566
8567
8568 ---
8569
8570 * [HBASE-23651](https://issues.apache.org/jira/browse/HBASE-23651) | *Major* | **Region balance throttling can be disabled**
8571
8572 Set hbase.balancer.max.balancing to a int value which \<=0 will disable region balance throttling.
8573
8574
8575 ---
8576
8577 * [HBASE-23588](https://issues.apache.org/jira/browse/HBASE-23588) | *Major* | **Cache index blocks and bloom blocks on write if CacheCompactedBlocksOnWrite is enabled**
8578
8579 If cacheOnWrite is enabled during flush or compaction, index and bloom blocks(with data blocks) would be automatically cached during write.
8580
8581
8582 ---
8583
8584 * [HBASE-23369](https://issues.apache.org/jira/browse/HBASE-23369) | *Major* | **Auto-close 'unknown' Regions reported as OPEN on RegionServers**
8585
8586 If a RegionServer reports a Region as OPEN in disagreement with Master's status on the Region, the Master now tells the RegionServer to silently close the Region.
8587
8588
8589 ---
8590
8591 * [HBASE-23596](https://issues.apache.org/jira/browse/HBASE-23596) | *Major* | **HBCKServerCrashProcedure can double assign**
8592
8593 Makes it so the recently added HBCKServerCrashProcedure -- the SCP that gets invoked when an operator schedules an SCP via hbck2 scheduleRecoveries command -- now works the same as SCP EXCEPT if master knows nothing of the scheduled servername. In this latter case, HBCKSCP will do a full scan of hbase:meta looking for instances of the passed servername. If any found it will attempt cleanup of hbase:meta references by reassigning any found OPEN or OPENING and by closing any in CLOSING state.
8594
8595 Used to fix instances of what the 'HBCK Report' page shows as 'Unknown Servers'.
8596
8597
8598 ---
8599
8600 * [HBASE-23624](https://issues.apache.org/jira/browse/HBASE-23624) | *Major* | **Add a tool to dump the procedure info in HFile**
8601
8602 Use ./hbase org.apache.hadoop.hbase.procedure2.store.region.HFileProcedurePrettyPrinter to run the tool.
8603
8604
8605 ---
8606
8607 * [HBASE-23590](https://issues.apache.org/jira/browse/HBASE-23590) | *Major* | **Update maxStoreFileRefCount to maxCompactedStoreFileRefCount**
8608
8609 RegionsRecoveryChore introduced as part of HBASE-22460 tries to reopen regions based on config: hbase.regions.recovery.store.file.ref.count.
8610 Region reopen needs to take into consideration all compacted away store files that belong to the region and not store files(non-compacted).
8611
8612 Fixed this bug as part of this Jira.
8613 Updated description for corresponding configs:
8614
8615 1. hbase.master.regions.recovery.check.interval :
8616
8617 Regions Recovery Chore interval in milliseconds. This chore keeps running at this interval to find all regions with configurable max store file ref count and reopens them. Defaults to 20 mins
8618
8619 2. hbase.regions.recovery.store.file.ref.count :
8620
8621 Very large number of ref count on a compacted store file indicates that it is a ref leak on that object(compacted store file). Such files can not be removed after it is invalidated via compaction. Only way to recover in such scenario is to reopen the region which can release all resources, like the refcount, leases, etc. This config represents Store files Ref Count threshold value considered for reopening regions. Any region with compacted store files ref count \> this value would be eligible for reopening by master. Here, we get the max refCount among all refCounts on all compacted away store files that belong to a particular region. Default value -1 indicates this feature is turned off. Only positive integer value should be provided to enable this feature.
8622
8623
8624 ---
8625
8626 * [HBASE-23618](https://issues.apache.org/jira/browse/HBASE-23618) | *Major* | **Add a tool to dump procedure info in the WAL file**
8627
8628 Use ./hbase org.apache.hadoop.hbase.procedure2.store.region.WALProcedurePrettyPrinter to run the tool.
8629
8630
8631 ---
8632
8633 * [HBASE-23617](https://issues.apache.org/jira/browse/HBASE-23617) | *Major* | **Add a stress test tool for region based procedure store**
8634
8635 Use ./hbase org.apache.hadoop.hbase.procedure2.store.region.RegionProcedureStorePerformanceEvaluation to run the tool.
8636
8637
8638 ---
8639
8640 * [HBASE-23326](https://issues.apache.org/jira/browse/HBASE-23326) | *Critical* | **Implement a ProcedureStore which stores procedures in a HRegion**
8641
8642 Use a region based procedure store to replace the old customized WAL based procedure store. The procedure data migration is done automatically during upgrading. After upgrading, the MasterProcWALs directory will be deleted and a new MasterProc directory will be created. And notice that a region will still write WAL so we still have WAL files and they will be moved to the oldWALs directory. The file name is mostly like a normal WAL file, and the only difference is that it is ended with "$masterproc$".
8643
8644
8645 ---
8646
8647 * [HBASE-23320](https://issues.apache.org/jira/browse/HBASE-23320) | *Major* | **Upgrade surefire plugin to 3.0.0-M4**
8648
8649 Bumped surefire plugin to 3.0.0-M4
8650
8651
8652 ---
8653
8654 * [HBASE-20461](https://issues.apache.org/jira/browse/HBASE-20461) | *Major* | **Implement fsync for AsyncFSWAL**
8655
8656 Now AsyncFSWAL also supports Durability.FSYNC\_WAL.
8657
8658
8659 ---
8660
8661 * [HBASE-23066](https://issues.apache.org/jira/browse/HBASE-23066) | *Minor* | **Create a config that forces to cache blocks on compaction**
8662
8663 The configuration 'hbase.rs.cacheblocksonwrite' was used to enable caching the blocks on write. But purposefully we were not caching the blocks when we do compaction (since it may be very aggressive) as the caching happens as and when the writer completes a block.
8664 In cloud environments since they have bigger sized caches - though they try to enable 'hbase.rs.prefetchblocksonopen' (non - aggressive way of caching the blocks proactively on reader creation) it does not help them because it takes time to cache the compacted blocks.
8665 This feature creates a new configuration  'hbase.rs.cachecompactedblocksonwrite' which when set to 'true' will enable the blocks created out of compaction.
8666 Remember that since it is aggressive caching the user should be having enough cache space - if not it may lead to other active blocks getting evicted.
8667 From the shell this can be enabled by using the option per Column Family also by using the below format
8668 {code}
8669 create 't1', 'f1', {NUMREGIONS =\> 15, SPLITALGO =\> 'HexStringSplit', CONFIGURATION =\> {'hbase.rs.cachecompactedblocksonwrite' =\> 'true'}}
8670 {code}
8671
8672
8673 ---
8674
8675 * [HBASE-23239](https://issues.apache.org/jira/browse/HBASE-23239) | *Major* | **Reporting on status of backing MOB files from client-facing cells**
8676
8677 <!-- markdown -->
8678
8679 Users of the MOB feature can now use the `mobrefs` utility to get statistics about data in the MOB system and verify the health of backing files on HDFS.
8680
8681 ```
8682 HADOOP_CLASSPATH=/etc/hbase/conf:$(hbase mapredcp) yarn jar \
8683     /some/path/to/hbase-shaded-mapreduce.jar mobrefs mobrefs-report-output some_table foo
8684 ```
8685
8686 See javadocs of the class `MobRefReporter` for more details.
8687
8688 the reference guide has added some information about MOB internals and troubleshooting.
8689
8690
8691 ---
8692
8693 * [HBASE-23549](https://issues.apache.org/jira/browse/HBASE-23549) | *Minor* | **Document steps to disable MOB for a column family**
8694
8695 The reference guide now includes a walk through of disabling the MOB feature if needed while maintaining availability.
8696
8697
8698 ---
8699
8700 * [HBASE-23582](https://issues.apache.org/jira/browse/HBASE-23582) | *Minor* | **Unbalanced braces in string representation of table descriptor**
8701
8702 Fixed unbalanced braces in string representation within HBase shell
8703
8704
8705 ---
8706
8707 * [HBASE-23293](https://issues.apache.org/jira/browse/HBASE-23293) | *Minor* | **[REPLICATION] make ship edits timeout configurable**
8708
8709 The default rpc timeout for ReplicationSourceShipper#shipEdits is 60s, when bulkload replication enabled, timeout exception may be occurred.
8710 Now we can conf the timeout value through replication.source.shipedits.timeout, and it’s adaptive.
8711
8712
8713 ---
8714
8715 * [HBASE-23312](https://issues.apache.org/jira/browse/HBASE-23312) | *Major* | **HBase Thrift SPNEGO configs (HBASE-19852) should be backwards compatible**
8716
8717 The newer HBase Thrift SPNEGO configs should not be required. The hbase.thrift.spnego.keytab.file and hbase.thrift.spnego.principal configs will fall back to the hbase.thrift.keytab.file and hbase.thrift.kerberos.principal original configs. The older configs will log a deprecation warning. It is preferred to new the newer SPNEGO configurations.
8718
8719
8720 ---
8721
8722 * [HBASE-22969](https://issues.apache.org/jira/browse/HBASE-22969) | *Minor* | **A new binary component comparator(BinaryComponentComparator) to perform comparison of arbitrary length and position**
8723
8724 With BinaryComponentCompartor applications will be able to design diverse and powerful set of filters for rows and columns. See https://issues.apache.org/jira/browse/HBASE-22969 for example. In general, the comparator can be used with any filter taking ByteArrayComparable. As of now, following filters take ByteArrayComparable:
8725
8726 1. RowFilter
8727 2. ValueFilter
8728 3. QualifierFilter
8729 4. FamilyFilter
8730 5. ColumnValueFilter
8731
8732
8733 ---
8734
8735 * [HBASE-23234](https://issues.apache.org/jira/browse/HBASE-23234) | *Major* | **Provide .editorconfig based on checkstyle configuration**
8736
8737 Adds a .editorconfig file with configurations populated by IntelliJ, based on our checkstyle configuration. There's lots of IntelliJ-specific configs in here that I assume are not replicated to Eclipse or Netbeans users. Any devs using those tools should push whatever updates they see fit, but please start with the checkstyle configs as the origin of truth.
8738
8739
8740 ---
8741
8742 * [HBASE-23322](https://issues.apache.org/jira/browse/HBASE-23322) | *Minor* | **[hbck2] Simplification on HBCKSCP scheduling**
8743
8744 An hbck2 scheduleRecoveries will run a subclass of ServerCrashProcedure which asks Master what Regions were on the dead Server but it will also do a hbase:meta table scan to see if any vestiges of the old Server remain (for the case where an SCP failed mid-point leaving references in place or where Master and hbase:meta deviated in accounting).
8745
8746
8747 ---
8748
8749 * [HBASE-23321](https://issues.apache.org/jira/browse/HBASE-23321) | *Minor* | **[hbck2] fixHoles of fixMeta doesn't update in-memory state**
8750
8751 If holes in hbase:meta, hbck2 fixMeta now will update Master in-memory state so you do not need to restart master just so you can assign the new hole-bridging regions.
8752
8753
8754 ---
8755
8756 * [HBASE-23282](https://issues.apache.org/jira/browse/HBASE-23282) | *Major* | **HBCKServerCrashProcedure for 'Unknown Servers'**
8757
8758 hbck2 scheduleRecoveries will now run a SCP that also looks in hbase:meta for any references to the scheduled server -- not just consult Master in-memory state -- just in case vestiges of the server are leftover in hbase:meta
8759
8760
8761 ---
8762
8763 * [HBASE-19450](https://issues.apache.org/jira/browse/HBASE-19450) | *Minor* | **Add log about average execution time for ScheduledChore**
8764
8765 <!-- markdown -->
8766 HBase internal chores now log a moving average of how long execution of each chore takes at `INFO` level for the logger `org.apache.hadoop.hbase.ScheduledChore`.
8767
8768 Such messages will happen at most once per five minutes.
8769
8770
8771 ---
8772
8773 * [HBASE-23250](https://issues.apache.org/jira/browse/HBASE-23250) | *Minor* | **Log message about CleanerChore delegate initialization should be at INFO**
8774
8775 CleanerChore delegate initialization is now logged at INFO level instead of DEBUG
8776
8777
8778 ---
8779
8780 * [HBASE-23243](https://issues.apache.org/jira/browse/HBASE-23243) | *Major* | **[pv2] Filter out SUCCESS procedures; on decent-sized cluster, plethora overwhelms problems**
8781
8782 The 'Procedures & Locks' tab in Master UI only displays problematic Procedures now (RUNNABLE, WAITING-TIMEOUT, etc.). It no longer notes procedures whose state is SUCCESS.
8783
8784
8785 ---
8786
8787 * [HBASE-23227](https://issues.apache.org/jira/browse/HBASE-23227) | *Blocker* | **Upgrade jackson-databind to 2.9.10.1 to avoid recent CVEs**
8788
8789 <!-- markdown -->
8790
8791 the Apache HBase REST Proxy now uses Jackson Databind version 2.9.10.1 to address the following CVEs
8792
8793   - CVE-2019-16942
8794   - CVE-2019-16943
8795
8796 Users of prior releases with Jackson Databind 2.9.10 are advised to either upgrade to this release or to upgrade their local Jackson Databind jar directly.
8797
8798
8799 ---
8800
8801 * [HBASE-23222](https://issues.apache.org/jira/browse/HBASE-23222) | *Critical* | **Better logging and mitigation for MOB compaction failures**
8802
8803 <!-- markdown -->
8804
8805 The MOB compaction process in the HBase Master now logs more about its activity.
8806
8807 In the event that you run into the problems described in HBASE-22075, there is a new HFileCleanerDelegate that will stop all removal of MOB hfiles from the archive area. It can be configured by adding `org.apache.hadoop.hbase.mob.ManualMobMaintHFileCleaner` to the list configured for `hbase.master.hfilecleaner.plugins`. This new cleaner delegate will cause your archive area to grow unbounded; you will have to manually prune files which may be prohibitively complex. Consider if your use case will allow you to mitigate by disabling mob compactions instead.
8808
8809 Caveats:
8810 * Be sure the list of cleaner delegates still includes the default cleaners you will likely need: ttl, snapshot, and hlink.
8811 * Be mindful that if you enable this cleaner delegate then there will be *no* automated process for removing these mob hfiles. You should see a single region per table in `%hbase_root%/archive` that accumulates files over time. You will have to determine which of these files are safe or not to remove.
8812 * You should list this cleaner delegate after the snapshot and hlink delegates so that you can enable sufficient logging to determine when an archived mob hfile is needed by those subsystems. When set to `TRACE` logging, the CleanerChore logger will include archive retention decision justifications.
8813 * If your use case creates a large number of uniquely named tables, this new delegate will cause memory pressure on the master.
8814
8815
8816 ---
8817
8818 * [HBASE-15519](https://issues.apache.org/jira/browse/HBASE-15519) | *Major* | **Add per-user metrics**
8819
8820 Adds per-user metrics for reads/writes to each RegionServer. These metrics are exported by default. hbase.regionserver.user.metrics.enabled can be used to disable the feature if desired for any reason.
8821
8822
8823 ---
8824
8825 * [HBASE-22460](https://issues.apache.org/jira/browse/HBASE-22460) | *Minor* | **Reopen a region if store reader references may have leaked**
8826
8827 Leaked store files can not be removed even after it is invalidated via compaction. A reasonable mitigation for a reader reference leak would be a fast reopen of the region on the same server.
8828
8829 Configs:
8830
8831 1. hbase.master.regions.recovery.check.interval :
8832
8833 Regions Recovery Chore interval in milliseconds. This chore keeps running at this interval to find all regions with configurable max store file ref count and reopens them. Defaults to 20 mins
8834
8835 2. hbase.regions.recovery.store.file.ref.count :
8836
8837 This config represents Store files Ref Count threshold value considered for reopening regions. Any region with store files ref count \> this value would be eligible for reopening by master. Default value -1 indicates this feature is turned off. Only positive integer value should be provided to enable this feature.
8838
8839
8840 ---
8841
8842 * [HBASE-23172](https://issues.apache.org/jira/browse/HBASE-23172) | *Minor* | **HBase Canary region success count metrics reflect column family successes, not region successes**
8843
8844 Added a comment to make clear that read/write success counts are tallying column family success counts, not region success counts.
8845
8846 Additionally, the region read and write latencies previously only stored the latencies of the last column family of the region reads/writes. This has been fixed by using a map of each region to a list of read and write latency values.
8847
8848
8849 ---
8850
8851 * [HBASE-23177](https://issues.apache.org/jira/browse/HBASE-23177) | *Major* | **If fail to open reference because FNFE, make it plain it is a Reference**
8852
8853 Changes the message on the FNFE exception thrown when the file a Reference points to is missing; the message now includes detail on Reference as well as pointed-to file so can connect how FNFE relates to region open.
8854
8855
8856 ---
8857
8858 * [HBASE-20626](https://issues.apache.org/jira/browse/HBASE-20626) | *Major* | **Change the value of "Requests Per Second" on WEBUI**
8859
8860 Use 'totalRowActionRequestCount' to calculate QPS on web UI.
8861
8862
8863 ---
8864
8865 * [HBASE-22874](https://issues.apache.org/jira/browse/HBASE-22874) | *Critical* | **Define a public interface for Canary and move existing implementation to LimitedPrivate**
8866
8867 <!-- markdown -->
8868 Downstream users who wish to programmatically check the health of their HBase cluster may now rely on a public interface derived from the previously private implementation of the canary cli tool. The interface is named `Canary` and can be found in the user facing javadocs.
8869
8870 Downstream users who previously relied on the invoking the canary via the Java classname (either on the command line or programmatically) will need to change how they do so because the non-public implementation has moved.
8871
8872
8873 ---
8874
8875 * [HBASE-23035](https://issues.apache.org/jira/browse/HBASE-23035) | *Major* | **Retain region to the last RegionServer make the failover slower**
8876
8877 Since 2.0.0，when one regionserver crashed and back online again, AssignmentManager will retain the region locations and try assign the regions to this regionserver(same host:port with the crashed one) again. But for 1.x.x, the behavior is round-robin assignment for the regions belong to the crashed regionserver. This jira change the "retain" assignment to round-robin assignment, which is same with 1.x.x version. This change will make the failover faster and improve availability.
8878
8879
8880 ---
8881
8882 * [HBASE-23046](https://issues.apache.org/jira/browse/HBASE-23046) | *Minor* | **Remove compatibility case from truncate command**
8883
8884 Remove backward compatibility from \`truncate\` and \`truncate\_preserve\` shell commands. This means that these commands from HBase Clients are not compatible with pre-0.99 HBase clusters.
8885
8886
8887 ---
8888
8889 * [HBASE-23040](https://issues.apache.org/jira/browse/HBASE-23040) | *Minor* | **region mover gives NullPointerException instead of saying a host isn't in the cluster**
8890
8891 giving the region mover "unload" command a region server name that isn't recognized by the cluster results in a "I don't know about that host" message instead of a NPE.
8892
8893 set log level to DEBUG if you'd like the region mover to log the set of region server names it got back from the cluster.
8894
8895
8896 ---
8897
8898 * [HBASE-21874](https://issues.apache.org/jira/browse/HBASE-21874) | *Major* | **Bucket cache on Persistent memory**
8899
8900 Added a new IOEngine type for Bucket cache ie Persistent memory. In order to use BC over pmem configure IOEngine as
8901 \<property\>
8902     \<name\>hbase.bucketcache.ioengine\</name\>
8903     \<value\> pmem:///path in persistent memory \</value\>
8904   \</property\>
8905
8906
8907 ---
8908
8909 * [HBASE-22760](https://issues.apache.org/jira/browse/HBASE-22760) | *Major* | **Stop/Resume Snapshot Auto-Cleanup activity with shell command**
8910
8911 By default, snapshot auto cleanup based on TTL would be enabled for any new cluster. At any point in time, if snapshot cleanup is supposed to be stopped due to some snapshot restore activity or any other reason, it is advisable to disable it using shell command:
8912 hbase\> snapshot\_cleanup\_switch false
8913
8914 We can re-enable it using:
8915 hbase\> snapshot\_cleanup\_switch true
8916
8917 We can query whether snapshot auto cleanup is enabled for cluster using:
8918 hbase\> snapshot\_cleanup\_enabled
8919
8920
8921 ---
8922
8923 * [HBASE-22796](https://issues.apache.org/jira/browse/HBASE-22796) | *Major* | **[HBCK2] Add fix of overlaps to fixMeta hbck Service**
8924
8925 Adds fix of overlaps to the fixMeta hbck service method. Uses the bulk-merge facility. Merges a max of 10 at a time. Set hbase.master.metafixer.max.merge.count to higher if you want to do more than 10 in the one go.
8926
8927
8928 ---
8929
8930 * [HBASE-21745](https://issues.apache.org/jira/browse/HBASE-21745) | *Critical* | **Make HBCK2 be able to fix issues other than region assignment**
8931
8932 This issue adds via its subtasks:
8933
8934  \* An 'HBCK Report' page to the Master UI added by HBASE-22527+HBASE-22709+HBASE-22723+ (since 2.1.6, 2.2.1, 2.3.0). Lists consistency or anomalies found via new hbase:meta consistency checking extensions added to CatalogJanitor (holes, overlaps, bad servers) and by a new 'HBCK chore' that runs at a lesser periodicity that will note filesystem orphans and overlaps as well as the following conditions:
8935  \*\* Master thought this region opened, but no regionserver reported it.
8936  \*\* Master thought this region opened on Server1, but regionserver reported Server2
8937  \*\* More than one regionservers reported opened this region
8938  Both chores can be triggered from the shell to regenerate ‘new’ reports.
8939  \* Means of scheduling a ServerCrashProcedure (HBASE-21393).
8940  \* An ‘offline’ hbase:meta rebuild (HBASE-22680).
8941  \* Offline replace of hbase.version and hbase.id
8942  \* Documentation on how to use completebulkload tool to ‘adopt’ orphaned data found by new HBCK2 ‘filesystem’ check (see below) and ‘HBCK chore’ (HBASE-22859)
8943  \* A ‘holes’ and ‘overlaps’ fix that runs in the master that uses new bulk-merge facility to collapse many overlaps in the one go.
8944  \* hbase-operator-tools HBCK2 client tool got a bunch of additions:
8945  \*\* A specialized 'fix' for the case where operators ran old hbck 'offlinemeta' repair and destroyed their hbase:meta; it ties together holes in meta with orphaned data in the fs (HBASE-22567)
8946  \*\* A ‘filesystem’ command that reports on orphan data as well as bad references and hlinks with a ‘fix’ for the latter two options (based on hbck1 facility updated).
8947  \*\* Adds back the ‘replication’ fix facility from hbck1 (HBASE-22717)
8948
8949 The compound result is that hbck2 is now in excess of hbck1 abilities. The provided functionality is disaggregated as per the hbck2 philosophy of providing 'plumbing' rather than 'porcelain' so there is work to do still adding fix-it playbooks, scripting across outages, and automation.
8950
8951
8952 ---
8953
8954 * [HBASE-22802](https://issues.apache.org/jira/browse/HBASE-22802) | *Major* | **Avoid temp ByteBuffer allocation in FileIOEngine#read**
8955
8956 HBASE-21879 introduces a utility class (org.apache.hadoop.hbase.io.ByteBuffAllocator) used for allocating/freeing ByteBuffers from/to NIO ByteBuffer pool, when BucketCache enabled with file or mmap engine, we will use this ByteBuffer pool to avoid temp ByteBuffer allocation a lot.
8957
8958
8959 ---
8960
8961 * [HBASE-11062](https://issues.apache.org/jira/browse/HBASE-11062) | *Major* | **hbtop**
8962
8963 Introduces hbtop that's a real-time monitoring tool for HBase like Unix's top command. See the ref guide for the details: https://hbase.apache.org/book.html#hbtop
8964
8965
8966 ---
8967
8968 * [HBASE-21879](https://issues.apache.org/jira/browse/HBASE-21879) | *Major* | **Read HFile's block to ByteBuffer directly instead of to byte for reducing young gc purpose**
8969
8970 Before this issue, we've made the read path 100% offheap when block hit the BucketCache 100%, but if the cache missed then RS need to read the block by on-heap API, which would cause high young GC pressure.
8971 This issue will read the block by offheap even if reading the block from filesystem directly, it have some requirement for hadoop version(\>=2.9.3) but can also works with older hadoop version(means still works fine but will read block onheap). We have written a careful doc about the implementation, performance and practice here: https://docs.google.com/document/d/1xSy9axGxafoH-Qc17zbD2Bd--rWjjI00xTWQZ8ZwI\_E/edit#heading=h.nch5d72p27ex, for more details please read it.
8972
8973
8974 ---
8975
8976 * [HBASE-22618](https://issues.apache.org/jira/browse/HBASE-22618) | *Major* | **added the possibility to load custom cost functions**
8977
8978 <!-- markdown -->
8979 Extends `StochasticLoadBalancer` to support user-provided cost function. These are loaded in addition to the default set of cost functions. Custom function implementations must extend `StochasticLoadBalancer$CostFunction`. Enable any additional functions by placing them on the master class path and configuring `hbase.master.balancer.stochastic.additionalCostFunctions` with a comma-separated list of fully-qualified class names.
8980
8981
8982 ---
8983
8984 * [HBASE-22867](https://issues.apache.org/jira/browse/HBASE-22867) | *Critical* | **The ForkJoinPool in CleanerChore will spawn thousands of threads in our cluster with thousands table**
8985
8986 Replace the ForkJoinPool in CleanerChore by ThreadPoolExecutor which can limit the spawn thread size and avoid  the master GC frequently.  The replacement is an internal implementation in CleanerChore,  so no config key change, the upstream users can just upgrade the hbase master without any other change.
8987
8988
8989 ---
8990
8991 * [HBASE-22810](https://issues.apache.org/jira/browse/HBASE-22810) | *Major* | **Initialize an separate ThreadPoolExecutor for taking/restoring snapshot**
8992
8993 Introduced a new config key for the snapshot taking/restoring operations at master side:  hbase.master.executor.snapshot.threads, its default value is 3.  means we can have 3 snapshot operations running at the same time.
8994
8995
8996 ---
8997
8998 * [HBASE-22863](https://issues.apache.org/jira/browse/HBASE-22863) | *Major* | **Avoid Jackson versions and dependencies with known CVEs**
8999
9000 1. Stopped exposing vulnerable Jackson1 dependencies so that downstreamers would not pull it in from HBase.
9001 2. However, since Hadoop requires some Jackson1 dependencies, put vulnerable Jackson mapper at test scope in some HBase modules and hence, HBase tarball created by hbase-assembly contains Jackson1 mapper jar in lib. Still, downsteam applications can't pull in Jackson1 from HBase.
9002
9003
9004 ---
9005
9006 * [HBASE-22841](https://issues.apache.org/jira/browse/HBASE-22841) | *Major* | **TimeRange's factory functions do not support ranges, only \`allTime\` and \`at\`**
9007
9008 Add serveral API in TimeRange class for avoiding using the deprecated TimeRange constructor:
9009 \* TimeRange#from: Represents the time interval [minStamp, Long.MAX\_VALUE)
9010 \* TimeRange#until: Represents the time interval [0, maxStamp)
9011 \* TimeRange#between: Represents the time interval [minStamp, maxStamp)
9012
9013
9014 ---
9015
9016 * [HBASE-22833](https://issues.apache.org/jira/browse/HBASE-22833) | *Minor* | **MultiRowRangeFilter should provide a method for creating a filter which is functionally equivalent to multiple prefix filters**
9017
9018 Provide a public method in MultiRowRangeFilter class to speed the requirement of filtering with multiple row prefixes, it will expand the row prefixes as multiple rowkey ranges by MultiRowRangeFilter, it's more efficient.
9019 {code}
9020 public MultiRowRangeFilter(byte[][] rowKeyPrefixes);
9021 {code}
9022
9023
9024 ---
9025
9026 * [HBASE-22856](https://issues.apache.org/jira/browse/HBASE-22856) | *Major* | **HBASE-Find-Flaky-Tests fails with pip error**
9027
9028 Update the base docker image to ubuntu 18.04 for the find flaky tests jenkins job.
9029
9030
9031 ---
9032
9033 * [HBASE-22771](https://issues.apache.org/jira/browse/HBASE-22771) | *Major* | **[HBCK2] fixMeta method and server-side support**
9034
9035 Adds a fixMeta method to hbck Service. Fixes holes in hbase:meta. Follow-up to fix overlaps. See HBASE-22567 also.
9036
9037 Follow-on is adding a client-side to hbase-operator-tools that can exploit this new addition (HBASE-22825)
9038
9039
9040 ---
9041
9042 * [HBASE-22777](https://issues.apache.org/jira/browse/HBASE-22777) | *Major* | **Add a multi-region merge (for fixing overlaps, etc.)**
9043
9044 Changes merge so you can merge more than two regions at a time.  Currently only available inside HBase. HBASE-22827, a follow-on, is about exposing the facility in the Admin API (and then via the shell).
9045
9046
9047 ---
9048
9049 * [HBASE-15666](https://issues.apache.org/jira/browse/HBASE-15666) | *Critical* | **shaded dependencies for hbase-testing-util**
9050
9051 New shaded artifact for testing: hbase-shaded-testing-util.
9052
9053
9054 ---
9055
9056 * [HBASE-22776](https://issues.apache.org/jira/browse/HBASE-22776) | *Major* | **Rename config names in user scan snapshot feature**
9057
9058 After HBASE-22776, the steps to config user scan snapshot feature is as followings:
9059 1. Check HDFS configuration
9060 2. Add master coprocessor:
9061     hbase.coprocessor.master.classes=
9062     “org.apache.hadoop.hbase.security.access.AccessController,
9063 org.apache.hadoop.hbase.security.access.SnapshotScannerHDFSAclController”
9064 3. Enable this feature:
9065     hbase.acl.sync.to.hdfs.enable=true
9066 4. Modify table scheme to enable this feature for a table:
9067     alter 't1', CONFIGURATION =\> {'hbase.acl.sync.to.hdfs.enable' =\> 'true'}
9068
9069
9070 ---
9071
9072 * [HBASE-22539](https://issues.apache.org/jira/browse/HBASE-22539) | *Blocker* | **WAL corruption due to early DBBs re-use when Durability.ASYNC\_WAL is used**
9073
9074 We found a critical bug which can lead to WAL corruption when Durability.ASYNC\_WAL is used. The reason is that we release a ByteBuffer before actually persist the content into WAL file.
9075
9076 The problem maybe lead to several errors, for example, ArrayIndexOfOutBounds when replaying WAL. This is because that the ByteBuffer is reused by others.
9077
9078 ERROR org.apache.hadoop.hbase.executor.EventHandler: Caught throwable while processing event RS\_LOG\_REPLAY
9079 java.lang.ArrayIndexOutOfBoundsException: 18056
9080         at org.apache.hadoop.hbase.KeyValue.getFamilyLength(KeyValue.java:1365)
9081         at org.apache.hadoop.hbase.KeyValue.getFamilyLength(KeyValue.java:1358)
9082         at org.apache.hadoop.hbase.PrivateCellUtil.matchingFamily(PrivateCellUtil.java:735)
9083         at org.apache.hadoop.hbase.CellUtil.matchingFamily(CellUtil.java:816)
9084         at org.apache.hadoop.hbase.wal.WALEdit.isMetaEditFamily(WALEdit.java:143)
9085         at org.apache.hadoop.hbase.wal.WALEdit.isMetaEdit(WALEdit.java:148)
9086         at org.apache.hadoop.hbase.wal.WALSplitter.splitLogFile(WALSplitter.java:297)
9087         at org.apache.hadoop.hbase.wal.WALSplitter.splitLogFile(WALSplitter.java:195)
9088         at org.apache.hadoop.hbase.regionserver.SplitLogWorker$1.exec(SplitLogWorker.java:100)
9089
9090 And may even cause segmentation fault and crash the JVM directly. You will see a hs\_err\_pidXXX.log file and usually the problem is SIGSEGV. This is usually because that the ByteBuffer has already been returned to the OS and used for other purpose.
9091
9092 The problem has been reported several times in the past and this time Wellington Ramos Chevreuil provided the full logs and deeply analyzed the logs so we can find the root cause. And Lijin Bin figured out that the problem may only happen when Durability.ASYNC\_WAL is used. Thanks to them.
9093
9094 The problem only effects the 2.x releases, all users are highly recommand to upgrade to a release which has this fix in, especially that if you use Durability.ASYNC\_WAL.
9095
9096
9097 ---
9098
9099 * [HBASE-22737](https://issues.apache.org/jira/browse/HBASE-22737) | *Major* | **Add a new admin method and shell cmd to trigger the hbck chore to run**
9100
9101 Add a new method runHbckChore in Hbck interface and a new shell cmd hbck\_chore\_run to request HBCK chore to run at master side.
9102
9103
9104 ---
9105
9106 * [HBASE-22741](https://issues.apache.org/jira/browse/HBASE-22741) | *Major* | **Show catalogjanitor consistency complaints in new 'HBCK Report' page**
9107
9108 Adds a "CatalogJanitor hbase:meta Consistency Issues" section to the new 'HBCK Report' page added by HBASE-22709. This section is empty unless the most recent CatalogJanitor scan turned up problems. If so, will show table of issues found.
9109
9110
9111 ---
9112
9113 * [HBASE-22723](https://issues.apache.org/jira/browse/HBASE-22723) | *Major* | **Have CatalogJanitor report holes and overlaps; i.e. problems it sees when doing its regular scan of hbase:meta**
9114
9115 When CatalogJanitor runs, it now checks for holes, overlaps, empty info:regioninfo columns and bad servers. Dumps findings into log. Follow-up adds report to new 'HBCK Report' linked off the Master UI.
9116
9117 NOTE: All features but the badserver check made it into branch-2.1 and branch-2.0 backports.
9118
9119
9120 ---
9121
9122 * [HBASE-22714](https://issues.apache.org/jira/browse/HBASE-22714) | *Trivial* | **BuffferedMutatorParams opertationTimeOut() is misspelt**
9123
9124 The misspelled BufferedMutatorParams.opertationTimeout method has been marked as deprecated, and will be removed in 4.0.0. Please use the BufferedMutatorParams.operationTimeout method instead.
9125
9126
9127 ---
9128
9129 * [HBASE-22580](https://issues.apache.org/jira/browse/HBASE-22580) | *Major* | **Add a table attribute to make user scan snapshot feature configurable for table**
9130
9131 If a table user scan snapshots of the table, please config the following table scheme attribute to make granted users' ACLs are added to hfiles:
9132 alter 't1', CONFIGURATION =\> {'hbase.user.scan.snapshot.enable' =\> 'true'}
9133
9134
9135 ---
9136
9137 * [HBASE-22709](https://issues.apache.org/jira/browse/HBASE-22709) | *Major* | **Add a chore thread in master to do hbck checking and display results in 'HBCK Report' page**
9138
9139 1. Add a new chore thread in master to do hbck checking
9140 2. Add a new web ui "HBCK Report" page to display checking results.
9141
9142 This feature is enabled by default. And the hbck chore run per 60 minutes by default. You can config "hbase.master.hbck.checker.interval" to a value lesser than or equal to 0 for disabling the chore.
9143
9144 Notice: the config "hbase.master.hbck.checker.interval" was renamed to "hbase.master.hbck.chore.interval" in HBASE-22737.
9145
9146
9147 ---
9148
9149 * [HBASE-21773](https://issues.apache.org/jira/browse/HBASE-21773) | *Critical* | **rowcounter utility should respond to pleas for help**
9150
9151 This adds [-h\|-help] options to rowcounter. Passing either -h or -help will print rowcounter guide as below:
9152
9153 $hbase rowcounter -h
9154
9155 usage: hbase rowcounter \<tablename\> [options] [\<column1\> \<column2\>...]
9156 Options:
9157     --starttime=\<arg\>       starting time filter to start counting rows from.
9158     --endtime=\<arg\>         end time filter limit, to only count rows up to this timestamp.
9159     --range=\<arg\>           [startKey],[endKey][;[startKey],[endKey]...]]
9160     --expectedCount=\<arg\>   expected number of rows to be count.
9161 For performance, consider the following configuration properties:
9162 -Dhbase.client.scanner.caching=100
9163 -Dmapreduce.map.speculative=false
9164
9165
9166 ---
9167
9168 * [HBASE-22578](https://issues.apache.org/jira/browse/HBASE-22578) | *Major* | **HFileCleaner should not delete empty ns/table directories used for user san snapshot feature**
9169
9170 The HFileCleaner will clean the empty directories under archive, but if enable user scan snaphot feature, the user ACLs are set at there directories, so please config the following cleaner to make the directories with user ACLs not be cleaned:
9171 hbase.master.hfilecleaner.plugins=org.apache.hadoop.hbase.security.access.SnapshotScannerHDFSAclCleaner
9172
9173
9174 ---
9175
9176 * [HBASE-22722](https://issues.apache.org/jira/browse/HBASE-22722) | *Blocker* | **Upgrade jackson databind dependencies to 2.9.9.1**
9177
9178 Upgrade jackson databind dependency to 2.9.9.1 due to CVEs
9179
9180 https://nvd.nist.gov/vuln/detail/CVE-2019-12814
9181
9182 https://nvd.nist.gov/vuln/detail/CVE-2019-12384
9183
9184
9185 ---
9186
9187 * [HBASE-22527](https://issues.apache.org/jira/browse/HBASE-22527) | *Major* | **[hbck2] Add a master web ui to show the problematic regions**
9188
9189 Add a new master web UI to show the potentially problematic opened regions. There are three case:
9190 1. Master thought this region opened, but no regionserver reported it.
9191 2. Master thought this region opened on Server1, but regionserver reported Server2
9192 3. More than one regionservers reported opened this region
9193
9194
9195 ---
9196
9197 * [HBASE-22648](https://issues.apache.org/jira/browse/HBASE-22648) | *Minor* | **Snapshot TTL**
9198
9199 Feature: Take a Snapshot With TTL for auto-cleanup
9200
9201 Attribute:
9202 1. TTL
9203      - Specify TTL in sec while creating snapshot. e.g. snapshot 'mytable', 'snapshot1234', {TTL =\> 86400}  (snapshot to be auto-cleaned after 24 hr)
9204
9205 Configs:
9206 1. Default Snapshot TTL:
9207      - FOREVER by default
9208      - User specified Default TTL(sec) with config: hbase.master.snapshot.ttl
9209
9210 2. If Snapshot cleanup is supposed to be stopped due to some snapshot restore activity, disable it with config:
9211      - hbase.master.cleaner.snapshot.disable: "true"
9212     With this config, HMaster needs restart just like any other hbase-site config.
9213
9214
9215 For more details, see the section "Take a Snapshot With TTL" in the HBase Reference Guide.
9216
9217
9218 ---
9219
9220 * [HBASE-22610](https://issues.apache.org/jira/browse/HBASE-22610) | *Trivial* | **[BucketCache] Rename "hbase.offheapcache.minblocksize"**
9221
9222 The config point "hbase.offheapcache.minblocksize" was wrong and is now deprecated. The new config point is "hbase.blockcache.minblocksize".
9223
9224
9225 ---
9226
9227 * [HBASE-22690](https://issues.apache.org/jira/browse/HBASE-22690) | *Major* | **Deprecate / Remove OfflineMetaRepair in hbase-2+**
9228
9229 OfflineMetaRepair is no longer supported in HBase-2+. Please refer to https://hbase.apache.org/book.html#HBCK2
9230
9231 This tool is deprecated in 2.x and will be removed in 3.0.
9232
9233
9234 ---
9235
9236 * [HBASE-22673](https://issues.apache.org/jira/browse/HBASE-22673) | *Major* | **Avoid to expose protobuf stuff in Hbck interface**
9237
9238 Mark the Hbck#scheduleServerCrashProcedure(List\<HBaseProtos.ServerName\> serverNames) as deprecated. Use Hbck#scheduleServerCrashProcedures(List\<ServerName\> serverNames) instead.
9239
9240
9241 ---
9242
9243 * [HBASE-22617](https://issues.apache.org/jira/browse/HBASE-22617) | *Blocker* | **Recovered WAL directories not getting cleaned up**
9244
9245 In HBASE-20734 we moved the recovered.edits onto the wal file system but when constructing the directory we missed the BASE\_NAMESPACE\_DIR('data'). So when using the default config, you will find that there are lots of new directories at the same level with the 'data' directory.
9246
9247 In this issue, we add the BASE\_NAMESPACE\_DIR back, and also try our best to clean up the wrong directories. But we can only clean up the region level directories, so if you want a clean fs layout on HDFS you still need to manually delete the empty directories at the same level with 'data'.
9248
9249 The effect versions are 2.2.0, 2.1.[1-5], 1.4.[8-10], 1.3.[3-5].
9250
9251
9252 ---
9253
9254 * [HBASE-21995](https://issues.apache.org/jira/browse/HBASE-21995) | *Major* | **Add a coprocessor to set HDFS ACL for hbase granted user**
9255
9256 Add a coprocessor to set HDFS acls to make hbase granted users with READ permission have the access to scan snapshots.
9257 To use this feature, please make sure the HDFS config is set:
9258 dfs.namenode.acls.enabled=true
9259 fs.permissions.umask-mode=027
9260
9261 and set the HBase config:
9262 hbase.coprocessor.master.classes="org.apache.hadoop.hbase.security.access.AccessController,org.apache.hadoop.hbase.security.access.SnapshotScannerHDFSAclController"
9263 hbase.user.scan.snapshot.enable=true
9264
9265
9266 ---
9267
9268 * [HBASE-22596](https://issues.apache.org/jira/browse/HBASE-22596) | *Minor* | **[Chore] Separate the execution period between CompactionChecker and PeriodicMemStoreFlusher**
9269
9270 hbase.regionserver.compaction.check.period is used for controlling how often the compaction checker runs. If unset, will use hbase.server.thread.wakefrequency as default value.
9271
9272 hbase.regionserver.flush.check.period is used for controlling how ofter the flush checker runs. If unset, will use hbase.server.thread.wakefrequency as default value.
9273
9274
9275 ---
9276
9277 * [HBASE-22588](https://issues.apache.org/jira/browse/HBASE-22588) | *Major* | **Upgrade jaxws-ri dependency to 2.3.2**
9278
9279 <!-- markdown -->
9280
9281 When run with JDK11 HBase now uses more recent version of the jaxws reference implementation (v2.3.2).
9282
9283
9284 ---
9285
9286 * [HBASE-21536](https://issues.apache.org/jira/browse/HBASE-21536) | *Trivial* | **Fix completebulkload usage instructions**
9287
9288 Added completebulkload short name for BulkLoadHFilesTool to bin/hbase.
9289
9290
9291 ---
9292
9293 * [HBASE-22500](https://issues.apache.org/jira/browse/HBASE-22500) | *Blocker* | **Modify pom and jenkins jobs for hadoop versions**
9294
9295 Change the default hadoop-3 version to 3.1.2. Drop the support for the releases which are effected by CVE-2018-8029, see this email https://lists.apache.org/thread.html/3d6831c3893cd27b6850aea2feff7d536888286d588e703c6ffd2e82@%3Cuser.hadoop.apache.org%3E
9296
9297
9298 ---
9299
9300 * [HBASE-22459](https://issues.apache.org/jira/browse/HBASE-22459) | *Minor* | **Expose store reader reference count**
9301
9302 This change exposes the aggregate count of store reader references for a given store as 'storeRefCount' in region metrics and ClusterStatus.
9303
9304
9305 ---
9306
9307 * [HBASE-22469](https://issues.apache.org/jira/browse/HBASE-22469) | *Minor* | **replace md5 checksum in saveVersion script with sha512 for hbase version information**
9308
9309 The HBase "source checksum" now uses SHA512 instead of MD5.
9310
9311
9312 ---
9313
9314 * [HBASE-22148](https://issues.apache.org/jira/browse/HBASE-22148) | *Blocker* | **Provide an alternative to CellUtil.setTimestamp**
9315
9316 <!-- markdown -->
9317
9318 The `CellUtil.setTimestamp` method changes to be an API with audience `LimitedPrivate(COPROC)` in HBase 3.0. With that designation the API should remain stable within a given minor release line, but may change between minor releases.
9319
9320 Previously, this method was deprecated in HBase 2.0 for removal in HBase 3.0. Deprecation messages in HBase 2.y releases have been updated to indicate the expected API audience change.
9321
9322
9323 ---
9324
9325 * [HBASE-20782](https://issues.apache.org/jira/browse/HBASE-20782) | *Minor* | **Fix duplication of TestServletFilter.access**
9326
9327 The access method was used to the HttpServerFunctionalTest class as a common place.
9328
9329
9330 ---
9331
9332 * [HBASE-21991](https://issues.apache.org/jira/browse/HBASE-21991) | *Major* | **Fix MetaMetrics issues - [Race condition, Faulty remove logic], few improvements**
9333
9334 The class LossyCounting was unintentionally marked Public but was never intended to be part of our public API. This oversight has been corrected and LossyCounting is now marked as Private and going forward may be subject to additional breaking changes or removal without notice. If you have taken a dependency on this class we recommend cloning it locally into your project before upgrading to this release.
9335
9336
9337 ---
9338
9339 * [HBASE-22226](https://issues.apache.org/jira/browse/HBASE-22226) | *Trivial* | **Incorrect level for headings in asciidoc**
9340
9341 Warnings for level headings are corrected in the book for the HBase Incompatibilities section.
9342
9343
9344 ---
9345
9346 * [HBASE-20970](https://issues.apache.org/jira/browse/HBASE-20970) | *Major* | **Update hadoop check versions for hadoop3 in hbase-personality**
9347
9348 Add hadoop 3.0.3, 3.1.1 3.1.2 in our hadoop check jobs.
9349
9350
9351 ---
9352
9353 * [HBASE-21784](https://issues.apache.org/jira/browse/HBASE-21784) | *Major* | **Dump replication queue should show list of wal files ordered chronologically**
9354
9355 The DumpReplicationQueues tool will now list replication queues sorted in chronological order.
9356
9357
9358 ---
9359
9360 * [HBASE-21048](https://issues.apache.org/jira/browse/HBASE-21048) | *Major* | **Get LogLevel is not working from console in secure environment**
9361
9362 Support get\|set LogLevel in secure(kerberized) environment.
9363
9364
9365 ---
9366
9367 * [HBASE-22384](https://issues.apache.org/jira/browse/HBASE-22384) | *Minor* | **Formatting issues in administration section of book**
9368
9369 Fixes a formatting issue in the administration section of the book, where listing indentation were a little bit off.
9370
9371
9372 ---
9373
9374 * [HBASE-22377](https://issues.apache.org/jira/browse/HBASE-22377) | *Major* | **Provide API to check the existence of a namespace which does not require ADMIN permissions**
9375
9376 This change adds the new method listNamespaces to the Admin interface, which can be used to retrieve a list of the namespaces present in the schema as an unprivileged operation. Formerly the only available method for accomplishing this was listNamespaceDescriptors, which requires GLOBAL CREATE or ADMIN permissions.
9377
9378
9379 ---
9380
9381 * [HBASE-22399](https://issues.apache.org/jira/browse/HBASE-22399) | *Major* | **Change default hadoop-two.version to 2.8.x and remove the 2.7.x hadoop checks**
9382
9383 Now the default hadoop-two.version has been changed to 2.8.5, and all hadoop versions before 2.8.2(exclude) will not be supported any more.
9384
9385
9386 ---
9387
9388 * [HBASE-22392](https://issues.apache.org/jira/browse/HBASE-22392) | *Trivial* | **Remove extra/useless +**
9389
9390 Removed extra + in HRegion, HStore and LoadIncrementalHFiles for branch-2 and HRegion and HStore for branch-1.
9391
9392
9393 ---
9394
9395 * [HBASE-20494](https://issues.apache.org/jira/browse/HBASE-20494) | *Major* | **Upgrade com.yammer.metrics dependency**
9396
9397 Updated metrics core from 3.2.1 to 3.2.6.
9398
9399
9400 ---
9401
9402 * [HBASE-22358](https://issues.apache.org/jira/browse/HBASE-22358) | *Minor* | **Change rubocop configuration for method length**
9403
9404 The rubocop definition for the maximum method length was set to 75.
9405
9406
9407 ---
9408
9409 * [HBASE-22379](https://issues.apache.org/jira/browse/HBASE-22379) | *Minor* | **Fix Markdown for "Voting on Release Candidates" in book**
9410
9411 Fixes the formatting of the "Voting on Release Candidates" to actually show the quote and code formatting of the RAT check.
9412
9413
9414 ---
9415
9416 * [HBASE-20851](https://issues.apache.org/jira/browse/HBASE-20851) | *Minor* | **Change rubocop config for max line length of 100**
9417
9418 The rubocop configuration in the hbase-shell module now allows a line length with 100 characters, instead of 80 as before. For everything before 2.1.5 this change introduces rubocop itself.
9419
9420
9421 ---
9422
9423 * [HBASE-22301](https://issues.apache.org/jira/browse/HBASE-22301) | *Minor* | **Consider rolling the WAL if the HDFS write pipeline is slow**
9424
9425 This change adds new conditions for rolling the WAL for when syncs on the HDFS writer pipeline are perceived to be slow.
9426
9427 As before the configuration parameter hbase.regionserver.wal.slowsync.ms sets the slow sync warning threshold.
9428
9429 If we encounter hbase.regionserver.wal.slowsync.roll.threshold number of slow syncs (default 100) within the interval defined by hbase.regionserver.wal.slowsync.roll.interval.ms (default 1 minute), we will request a WAL roll.
9430
9431 Or, if the time for any sync exceeds the threshold set by hbase.regionserver.wal.roll.on.sync.ms (default 10 seconds) we will request a WAL roll immediately.
9432
9433 Operators can monitor how often these new thresholds result in a WAL roll by looking at newly added metrics to the WAL related metric group:
9434 \* slowSyncRollRequest - How many times a roll was requested due to sync too slow on the write pipeline.
9435
9436 Additionally, as a part of this change there are also additional metrics for existing reasons for a WAL roll:
9437 \* errorRollRequest - How many times a roll was requested due to I/O or other errors.
9438 \* sizeRollRequest - How many times a roll was requested due to file size roll threshold.
9439
9440
9441 ---
9442
9443 * [HBASE-21883](https://issues.apache.org/jira/browse/HBASE-21883) | *Minor* | **Enhancements to Major Compaction tool**
9444
9445 MajorCompactorTTL Tool allows to compact all regions in a table that have been TTLed out. This saves space on DFS and is useful for tables which are similar to time series data. This is typically scheduled to run frequently (say via cron) to cleanup old data on an ongoing basis.
9446
9447 RSGroupMajorCompactionTTL tool is similar to MajorCompactorTTL but runs at a region server group level. If multiple tables in an rsgroup are similar to time-series data, then it runs a single command to clean them up. As more tables are added/removed from rsgroup, it's easy to have a single command to take care of all of them.
9448
9449
9450 ---
9451
9452 * [HBASE-22054](https://issues.apache.org/jira/browse/HBASE-22054) | *Minor* | **Space Quota: Compaction is not working for super user in case of NO\_WRITES\_COMPACTIONS**
9453
9454 This change allows the system and superusers to initiate compactions, even when a space quota violation policy disallows compactions from happening. The original intent behind disallowing of compactions was to prevent end-user compactions from creating undue I/O load, not disallowing \*any\* compaction in the system.
9455
9456
9457 ---
9458
9459 * [HBASE-22083](https://issues.apache.org/jira/browse/HBASE-22083) | *Minor* | **move eclipse specific configs into a profile**
9460
9461 <!-- markdown -->
9462 Maven project integration for Eclipse has been isolated into a maven profile to ensure it only is active when in an Eclipse project.
9463
9464 Things should continue to behave the same for Eclipse users. If something should go wrong folks should manually activate the `eclipse-specific` profile.
9465
9466
9467 ---
9468
9469 * [HBASE-22307](https://issues.apache.org/jira/browse/HBASE-22307) | *Major* | **Deprecated Preemptive Fail Fast**
9470
9471 Deprecated Preemptive Fail Fast related constants in HConstants, the support of this feature will be removed in 3.0.0 so use these constants will have no effect for 3.0.0+ releases. And the constants will be kept till 4.0.0.
9472
9473 Users can use 'hbase.client.perserver.requests.threshold' to control the number of concurrent requests to the same region server. Please see the release note of HBASE-16388 for more details.
9474
9475
9476 ---
9477
9478 * [HBASE-22292](https://issues.apache.org/jira/browse/HBASE-22292) | *Blocker* | **PreemptiveFastFailInterceptor clean repeatedFailuresMap issue**
9479
9480 Adds new configuration hbase.client.failure.map.cleanup.interval which defaults to ten minutes.
9481
9482
9483 ---
9484
9485 * [HBASE-19222](https://issues.apache.org/jira/browse/HBASE-19222) | *Major* | **update jruby to 9.1.17.0**
9486
9487 <!-- markdown -->
9488
9489 The default version of JRuby shipped with HBase has been updated to the JRuby 9.1.17.0 release.
9490
9491 For details on changes see [the release notes for JRuby 9.1.17.0](https://www.jruby.org/2018/04/23/jruby-9-1-17-0)
9492
9493
9494 ---
9495
9496 * [HBASE-22279](https://issues.apache.org/jira/browse/HBASE-22279) | *Major* | **Add a getRegionLocator method in Table/AsyncTable interface**
9497
9498 Add below method in Table interface:
9499
9500 RegionLocator getRegionLocator() throws IOException;
9501
9502 Add below methods in AsyncTable interface:
9503
9504 AsyncTableRegionLocator getRegionLocator();
9505 CompletableFuture\<TableDescriptor\> getDescriptor();
9506
9507
9508 ---
9509
9510 * [HBASE-15560](https://issues.apache.org/jira/browse/HBASE-15560) | *Major* | **TinyLFU-based BlockCache**
9511
9512 LruBlockCache uses the Segmented LRU (SLRU) policy to capture frequency and recency of the working set. It achieves concurrency by using an O(n) background thread to prioritize the entries and evict. Accessing an entry is O(1) by a hash table lookup, recording its logical access time, and setting a frequency flag. A write is performed in O(1) time by updating the hash table and triggering an async eviction thread. This provides ideal concurrency and minimizes the latencies by penalizing the thread instead of the caller. However the policy does not age the frequencies and may not be resilient to various workload patterns.
9513
9514 This change introduces a new L1 policy, TinyLfuBlockCache, which records the frequency in a counting sketch, ages periodically by halving the counters, and orders entries by SLRU. An entry is discarded by comparing the frequency of the new arrival to the SLRU's victim, and keeping the one with the highest frequency. This allows the operations to be performed in O(1) time and, though the use of a compact sketch, a much larger history is retained beyond the current working set. In a variety of real world traces the policy had near optimal hit rates.
9515
9516 New configuration variable hfile.block.cache.policy sets the eviction policy for the L1 block cache. The default is "LRU" (LruBlockCache). Set to "TinyLFU" to use TinyLfuBlockCache instead.
9517
9518
9519 ---
9520
9521 * [HBASE-22178](https://issues.apache.org/jira/browse/HBASE-22178) | *Major* | **Introduce a createTableAsync with TableDescriptor method in Admin**
9522
9523 Introduced
9524
9525 Future\<Void\> createTableAsync(TableDescriptor);
9526
9527
9528 ---
9529
9530 * [HBASE-22108](https://issues.apache.org/jira/browse/HBASE-22108) | *Major* | **Avoid passing null in Admin methods**
9531
9532 Introduced these methods:
9533 void move(byte[]);
9534 void move(byte[], ServerName);
9535 Future\<Void\> splitRegionAsync(byte[]);
9536
9537 These methods are deprecated:
9538 void move(byte[], byte[])
9539
9540
9541 ---
9542
9543 * [HBASE-22152](https://issues.apache.org/jira/browse/HBASE-22152) | *Major* | **Create a jenkins file for yetus to processing GitHub PR**
9544
9545 Add a new jenkins file for running pre commit check for GitHub PR.
9546
9547
9548 ---
9549
9550 * [HBASE-22007](https://issues.apache.org/jira/browse/HBASE-22007) | *Major* | **Add restoreSnapshot and cloneSnapshot with acl methods in AsyncAdmin**
9551
9552 Add cloneSnapshot/restoreSnapshot with acl methods in AsyncAdmin.
9553
9554
9555 ---
9556
9557 * [HBASE-22123](https://issues.apache.org/jira/browse/HBASE-22123) | *Minor* | **REST gateway reports Insufficient permissions exceptions as 404 Not Found**
9558
9559 When insufficient permissions, you now get:
9560
9561 HTTP/1.1 403 Forbidden
9562
9563 on the HTTP side, and in the message
9564
9565 Forbidden
9566 org.apache.hadoop.hbase.security.AccessDeniedException: org.apache.hadoop.hbase.security.AccessDeniedException: Insufficient permissions for user ‘myuser',action: get, tableName:mytable, family:cf.
9567 at org.apache.ranger.authorization.hbase.RangerAuthorizationCoprocessor.authorizeAccess(RangerAuthorizationCoprocessor.java:547)
9568 and the rest of the ADE stack
9569
9570
9571 ---
9572
9573 * [HBASE-22100](https://issues.apache.org/jira/browse/HBASE-22100) | *Minor* | **False positive for error prone warnings in pre commit job**
9574
9575 Now we will sort the javac WARNING/ERROR before generating diff in pre-commit so we can get a stable output for the error prone. The downside is that we just sort the output lexicographically so the line number will also be sorted lexicographically, which is a bit strange to human.
9576
9577
9578 ---
9579
9580 * [HBASE-22057](https://issues.apache.org/jira/browse/HBASE-22057) | *Major* | **Impose upper-bound on size of ZK ops sent in a single multi()**
9581
9582 Exposes a new configuration property "zookeeper.multi.max.size" which dictates the maximum size of deletes that HBase will make to ZooKeeper in a single RPC. This property defaults to 1MB, which should fall beneath the default ZooKeeper limit of 2MB, controlled by "jute.maxbuffer".
9583
9584
9585 ---
9586
9587 * [HBASE-22052](https://issues.apache.org/jira/browse/HBASE-22052) | *Major* | **pom cleaning; filter out jersey-core in hadoop2 to match hadoop3 and remove redunant version specifications**
9588
9589 <!-- markdown -->
9590 Fixed awkward dependency issue that prevented site building.
9591
9592 #### note specific to HBase 2.1.4
9593 HBase 2.1.4 shipped with an early version of this fix that incorrectly altered the libraries included in our binary assembly for using Apache Hadoop 2.7 (the current build default Hadoop version for 2.1.z). For folks running out of the box against a Hadoop 2.7 cluster (or folks who skip the installation step of [replacing the bundled Hadoop libraries](http://hbase.apache.org/book.html#hadoop)) this will result in a failure at Region Server startup due to a missing class definition. e.g.:
9594 ```
9595 2019-03-27 09:02:05,779 ERROR [main] regionserver.HRegionServer: Failed construction RegionServer
9596 java.lang.NoClassDefFoundError: org/apache/htrace/SamplerBuilder
9597         at org.apache.hadoop.hdfs.DFSClient.<init>(DFSClient.java:644)
9598         at org.apache.hadoop.hdfs.DFSClient.<init>(DFSClient.java:628)
9599         at org.apache.hadoop.hdfs.DistributedFileSystem.initialize(DistributedFileSystem.java:149)
9600         at org.apache.hadoop.fs.FileSystem.createFileSystem(FileSystem.java:2667)
9601         at org.apache.hadoop.fs.FileSystem.access$200(FileSystem.java:93)
9602         at org.apache.hadoop.fs.FileSystem$Cache.getInternal(FileSystem.java:2701)
9603         at org.apache.hadoop.fs.FileSystem$Cache.get(FileSystem.java:2683)
9604         at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:372)
9605         at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:171)
9606         at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:356)
9607         at org.apache.hadoop.fs.Path.getFileSystem(Path.java:295)
9608         at org.apache.hadoop.hbase.util.CommonFSUtils.getRootDir(CommonFSUtils.java:362)
9609         at org.apache.hadoop.hbase.util.CommonFSUtils.isValidWALRootDir(CommonFSUtils.java:411)
9610         at org.apache.hadoop.hbase.util.CommonFSUtils.getWALRootDir(CommonFSUtils.java:387)
9611         at org.apache.hadoop.hbase.regionserver.HRegionServer.initializeFileSystem(HRegionServer.java:704)
9612         at org.apache.hadoop.hbase.regionserver.HRegionServer.<init>(HRegionServer.java:613)
9613         at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
9614         at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:62)
9615         at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)
9616         at java.lang.reflect.Constructor.newInstance(Constructor.java:423)
9617         at org.apache.hadoop.hbase.regionserver.HRegionServer.constructRegionServer(HRegionServer.java:3029)
9618         at org.apache.hadoop.hbase.regionserver.HRegionServerCommandLine.start(HRegionServerCommandLine.java:63)
9619         at org.apache.hadoop.hbase.regionserver.HRegionServerCommandLine.run(HRegionServerCommandLine.java:87)
9620         at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
9621         at org.apache.hadoop.hbase.util.ServerCommandLine.doMain(ServerCommandLine.java:149)
9622         at org.apache.hadoop.hbase.regionserver.HRegionServer.main(HRegionServer.java:3047)
9623 Caused by: java.lang.ClassNotFoundException: org.apache.htrace.SamplerBuilder
9624         at java.net.URLClassLoader.findClass(URLClassLoader.java:381)
9625         at java.lang.ClassLoader.loadClass(ClassLoader.java:424)
9626         at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:349)
9627         at java.lang.ClassLoader.loadClass(ClassLoader.java:357)
9628         ... 26 more
9629
9630 ```
9631
9632 Workaround via any _one_ of the following:
9633 * If you are running against a Hadoop cluster that is 2.8+, ensure you replace the Hadoop libaries in the default binary assembly with those for your version.
9634 * If you are running against a Hadoop cluster that is 2.8+, build the binary assembly from the source release while specifying your Hadoop version.
9635 * If you are running against a Hadoop cluster that is a supported 2.7 release, ensure the `hadoop` executable is in the `PATH` seen at Region Server startup and that you are not using the `HBASE_DISABLE_HADOOP_CLASSPATH_LOOKUP` bypass.
9636 * For any supported Hadoop version, manually make the Apache HTrace artifact `htrace-core-3.1.0-incubating.jar` available to all Region Servers via the HBASE_CLASSPATH environment variable.
9637 * For any supported Hadoop version, manually make the Apache HTrace artifact `htrace-core-3.1.0-incubating.jar` available to all Region Servers by copying it into the directory `${HBASE_HOME}/lib/client-facing-thirdparty/`.
9638
9639
9640 ---
9641
9642 * [HBASE-22065](https://issues.apache.org/jira/browse/HBASE-22065) | *Major* | **Add listTableDescriptors(List\<TableName\>) method in AsyncAdmin**
9643
9644 Add a listTableDescriptors(List\<TableName\>) method in the AsyncAdmin interface, to align with the Admin interface.
9645
9646
9647 ---
9648
9649 * [HBASE-22063](https://issues.apache.org/jira/browse/HBASE-22063) | *Major* | **Deprecated Admin.deleteSnapshot(byte[])**
9650
9651 Deprecate Admin.deleteSnapshot(byte[]), please use the String version instead.
9652
9653
9654 ---
9655
9656 * [HBASE-22040](https://issues.apache.org/jira/browse/HBASE-22040) | *Major* | **Add mergeRegionsAsync with a List of region names method in AsyncAdmin**
9657
9658 Add a mergeRegionsAsync(byte[][], boolean) method in the AsyncAdmin interface.
9659
9660 Instead of using assert, now we will throw IllegalArgumentException when you want to merge less than 2 regions at client side. And also, at master side, instead of using assert, now we will throw DoNotRetryIOException if you want merge more than 2 regions, since we only support merging two regions at once for now.
9661
9662
9663 ---
9664
9665 * [HBASE-22039](https://issues.apache.org/jira/browse/HBASE-22039) | *Major* | **Should add the synchronous parameter for the XXXSwitch method in AsyncAdmin**
9666
9667 Add drainXXX parameter for balancerSwitch/splitSwitch/mergeSwitch methods in the AsyncAdmin interface, which has the same meaning with the synchronous parameter for these methods in the Admin interface.
9668
9669
9670 ---
9671
9672 * [HBASE-22044](https://issues.apache.org/jira/browse/HBASE-22044) | *Major* | **ByteBufferUtils should not be IA.Public API**
9673
9674 <!-- markdown -->
9675
9676 As of HBase 3.0, the ByteBufferUtils class is now marked as a Private API for internal project use only. Downstream users are advised that it no longer has any compatibility promises across releases.
9677
9678 As of earlier HBase release lines the class is now marked as deprecated to call attention to this planned transition.
9679
9680
9681 ---
9682
9683 * [HBASE-21810](https://issues.apache.org/jira/browse/HBASE-21810) | *Major* | **bulkload  support set hfile compression on client**
9684
9685 bulkload (HFileOutputFormat2)  support config the compression on client ,you can set the job configuration "hbase.mapreduce.hfileoutputformat.compression"  override the auto-detection of the target table's compression
9686
9687
9688 ---
9689
9690 * [HBASE-22001](https://issues.apache.org/jira/browse/HBASE-22001) | *Major* | **Polish the Admin interface**
9691
9692 Add a cloneSnapshotAsync method with restoreAcl parameter.
9693 Deprecated restoreSnapshotAsync method as it just ignores the failsafe configuration.
9694 Make snapshotAsync method returns a Future\<Void\>.
9695 Deprecated the snapshot related methods which take a 'byte[]' as the snapshot name.
9696 Use default methods to reduce the code base for implementation classes.
9697
9698
9699 ---
9700
9701 * [HBASE-22000](https://issues.apache.org/jira/browse/HBASE-22000) | *Major* | **Deprecated isTableAvailable with splitKeys**
9702
9703 Deprecated AsyncTable.isTableAvailable(TableName, byte[][]).
9704
9705
9706 ---
9707
9708 * [HBASE-21871](https://issues.apache.org/jira/browse/HBASE-21871) | *Major* | **Support to specify a peer table name in VerifyReplication tool**
9709
9710 After HBASE-21871, we can specify a peer table name with --peerTableName in VerifyReplication tool like the following:
9711 hbase org.apache.hadoop.hbase.mapreduce.replication.VerifyReplication --peerTableName=peerTable 5 TestTable
9712
9713 In addition, we can compare any 2 tables in any remote clusters with specifying both peerId and --peerTableName.
9714
9715 For example:
9716 hbase org.apache.hadoop.hbase.mapreduce.replication.VerifyReplication --peerTableName=peerTable zk1,zk2,zk3:2181/hbase TestTable
9717
9718
9719 ---
9720
9721 * [HBASE-15728](https://issues.apache.org/jira/browse/HBASE-15728) | *Major* | **Add remaining per-table region / store / flush / compaction related metrics**
9722
9723 Adds below flush, split, and compaction metrics
9724
9725  +  // split related metrics
9726  +  private MutableFastCounter splitRequest;
9727  +  private MutableFastCounter splitSuccess;
9728  +  private MetricHistogram splitTimeHisto;
9729  +
9730  +  // flush related metrics
9731  +  private MetricHistogram flushTimeHisto;
9732  +  private MetricHistogram flushMemstoreSizeHisto;
9733  +  private MetricHistogram flushOutputSizeHisto;
9734  +  private MutableFastCounter flushedMemstoreBytes;
9735  +  private MutableFastCounter flushedOutputBytes;
9736  +
9737  +  // compaction related metrics
9738  +  private MetricHistogram compactionTimeHisto;
9739  +  private MetricHistogram compactionInputFileCountHisto;
9740  +  private MetricHistogram compactionInputSizeHisto;
9741  +  private MetricHistogram compactionOutputFileCountHisto;
9742  +  private MetricHistogram compactionOutputSizeHisto;
9743  +  private MutableFastCounter compactedInputBytes;
9744  +  private MutableFastCounter compactedOutputBytes;
9745  +
9746  +  private MetricHistogram majorCompactionTimeHisto;
9747  +  private MetricHistogram majorCompactionInputFileCountHisto;
9748  +  private MetricHistogram majorCompactionInputSizeHisto;
9749  +  private MetricHistogram majorCompactionOutputFileCountHisto;
9750  +  private MetricHistogram majorCompactionOutputSizeHisto;
9751  +  private MutableFastCounter majorCompactedInputBytes;
9752  +  private MutableFastCounter majorCompactedOutputBytes;
9753
9754
9755 ---
9756
9757 * [HBASE-21481](https://issues.apache.org/jira/browse/HBASE-21481) | *Major* | **[acl] Superuser's permissions should not be granted or revoked by any non-su global admin**
9758
9759 HBASE-21481 improves the quality of access control, by strengthening the protection of super users's privileges.
9760
9761
9762 ---
9763
9764 * [HBASE-21082](https://issues.apache.org/jira/browse/HBASE-21082) | *Critical* | **Reimplement assign/unassign related procedure metrics**
9765
9766 Now we have four types of RIT procedure metrics, assign, unassign, move, reopen. The meaning of assign/unassign is changed, as we will not increase the unassign metric and then the assign metric when moving a region.
9767 Also introduced two new procedure metrics, open and close, which are used to track the open/close region calls to region server. We may send open/close multiple times to finish a RIT since we may retry multiple times.
9768
9769
9770 ---
9771
9772 * [HBASE-20724](https://issues.apache.org/jira/browse/HBASE-20724) | *Critical* | **Sometimes some compacted storefiles are still opened after region failover**
9773
9774 Problem: This is an old problem since HBASE-2231. The compaction event marker was only writed to WAL. But after flush, the WAL may be archived, which means an useful compaction event marker be deleted, too. So the compacted store files cannot be archived when region open and replay WAL.
9775
9776 Solution: After this jira, the compaction event tracker will be writed to HFile. When region open and load store files, read the compaction evnet tracker from HFile and archive the compacted store files which still exist.
9777
9778
9779 ---
9780
9781 * [HBASE-21820](https://issues.apache.org/jira/browse/HBASE-21820) | *Major* | **Implement CLUSTER quota scope**
9782
9783 HBase contains two quota scopes: MACHINE and CLUSTER. Before this patch, set quota operations did not expose scope option to client api and use MACHINE as default, CLUSTER scope can not be set and used.
9784 Shell commands are as follows:
9785 set\_quota, TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec'
9786
9787 This issue implements CLUSTER scope in a simple way: For user, namespace, user over namespace quota, use [ClusterLimit / RSNum] as machine limit. For table and user over table quota, use [ClusterLimit / TotalTableRegionNum \* MachineTableRegionNum] as machine limit.
9788 After this patch, user can set CLUSTER scope quota, but MACHINE is still default if user ignore scope.
9789 Shell commands are as follows:
9790 set\_quota, TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec'
9791 set\_quota, TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec', SCOPE =\> MACHINE
9792 set\_quota, TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec', SCOPE =\> CLUSTER
9793
9794
9795 ---
9796
9797 * [HBASE-21057](https://issues.apache.org/jira/browse/HBASE-21057) | *Minor* | **upgrade to latest spotbugs**
9798
9799 Change spotbugs version to 3.1.11.
9800
9801
9802 ---
9803
9804 * [HBASE-21505](https://issues.apache.org/jira/browse/HBASE-21505) | *Major* | **Several inconsistencies on information reported for Replication Sources by hbase shell status 'replication' command.**
9805
9806 This modifies "status 'replication'" output, fixing inconsistencies on the reporting times and ages of last shipped edits, as well as wrong calculation of replication lags.
9807
9808 It also introduces additional info for each recovery queue, which was not accounted by this command before.
9809
9810 The new output for "status 'replication'" command is explained in details below:
9811 a) Source started, target stopped, no edits arrived on source yet:
9812 ...
9813  SOURCE: PeerID=1
9814          Normal Queue: 1
9815            No Ops shipped since last restart, SizeOfLogQueue=1, No edits for this source since it started, Replication Lag=0
9816 ...
9817 b) Source started, target stopped, add edit on source:
9818 ...
9819 Normal Queue: 1
9820            No Ops shipped since last restart, SizeOfLogQueue=1, TimeStampOfLastArrivedInSource=Wed Nov 21 07:21:00 GMT 2018, Replication Lag=2459
9821 ...
9822 c) Source started, target stopped, edit added on source, restart source:
9823 ...
9824 SOURCE: PeerID=1
9825          Normal Queue: 1
9826            No Ops shipped since last restart, SizeOfLogQueue=1, No edits for this source since it started, Replication Lag=0
9827          Recovered Queue: 1-hbase01.home,16020,1542784524057
9828            No Ops shipped since last restart, SizeOfLogQueue=1, TimeStampOfLastArrivedInSource=Wed Nov 21 07:23:00 GMT 2018, Replication Lag=201495
9829 ...
9830 d) Source started, target stopped, add edit on source, restart source, add another edit on source:
9831 ...
9832 SOURCE: PeerID=1
9833          Normal Queue: 1
9834            No Ops shipped since last restart, SizeOfLogQueue=1, TimeStampOfLastArrivedInSource=Wed Nov 21 07:02:28 GMT 2018, Replication Lag=6349
9835          Recovered Queue: 1-hbase01.home,16020,1542782758742
9836            No Ops shipped since last restart, SizeOfLogQueue=0, TimeStampOfLastArrivedInSource=Wed Nov 21 06:53:05 GMT 2018, Replication Lag=569394
9837 ...
9838 e) Source started, target stopped, add edit on source, restart source, add another edit on source, start target:
9839 ...
9840        SOURCE: PeerID=1
9841          Normal Queue: 1
9842            AgeOfLastShippedOp=30000, TimeStampOfLastShippedOp=Wed Nov 21 07:07:58 GMT 2018, SizeOfLogQueue=1, TimeStampOfLastArrivedInSource=Wed Nov 21 07:02:28 GMT 2018, Replication Lag=0
9843 ...
9844 f) Source started, target stopped, add edit on source, restart source, restart target:
9845 ...
9846 SOURCE: PeerID=1
9847          Normal Queue: 1
9848            No Ops shipped since last restart, SizeOfLogQueue=1, No edits for this source since it started, Replication Lag=0
9849 ...
9850
9851
9852 ---
9853
9854 * [HBASE-21922](https://issues.apache.org/jira/browse/HBASE-21922) | *Major* | **BloomContext#sanityCheck may failed when use ROWPREFIX\_DELIMITED bloom filter**
9855
9856 Remove bloom filter type ROWPREFIX\_DELIMITED. May add it back when find a better solution.
9857
9858
9859 ---
9860
9861 * [HBASE-21783](https://issues.apache.org/jira/browse/HBASE-21783) | *Major* | **Support exceed user/table/ns throttle quota if region server has available quota**
9862
9863 Support enable or disable exceed throttle quota. Exceed throttle quota means, user can over consume user/namespace/table quota if region server has additional available quota because other users don't consume at the same time.
9864 Use the following shell commands to enable/disable exceed throttle quota: enable\_exceed\_throttle\_quota
9865 disable\_exceed\_throttle\_quota
9866 There are two limits when enable exceed throttle quota:
9867 1. Must set at least one read and one write region server throttle quota;
9868 2. All region server throttle quotas must be in seconds time unit. Because once previous requests exceed their quota and consume region server quota, quota in other time units may be refilled in a long time, this may affect later requests.
9869
9870
9871 ---
9872
9873 * [HBASE-20587](https://issues.apache.org/jira/browse/HBASE-20587) | *Major* | **Replace Jackson with shaded thirdparty gson**
9874
9875 Remove jackson dependencies from most hbase modules except hbase-rest, use shaded gson instead. The output json will be a bit different since jackson can use getter/setter, but gson will always use the fields.
9876
9877
9878 ---
9879
9880 * [HBASE-21928](https://issues.apache.org/jira/browse/HBASE-21928) | *Major* | **Deprecated HConstants.META\_QOS**
9881
9882 Mark HConstants.META\_QOS as deprecated. It is for internal use only, which is the highest priority. You should not try to set a priority greater than or equal to this value, although it is no harm but also useless.
9883
9884
9885 ---
9886
9887 * [HBASE-17942](https://issues.apache.org/jira/browse/HBASE-17942) | *Major* | **Disable region splits and merges per table**
9888
9889 This patch adds the ability to disable split and/or merge for a table (By default, split and merge are enabled for a table).
9890
9891
9892 ---
9893
9894 * [HBASE-21636](https://issues.apache.org/jira/browse/HBASE-21636) | *Major* | **Enhance the shell scan command to support missing scanner specifications like ReadType, IsolationLevel etc.**
9895
9896 Allows shell to set Scan options previously not exposed. See additions as part of the scan help by typing following hbase shell:
9897
9898 hbase\> help 'scan'
9899
9900
9901 ---
9902
9903 * [HBASE-21201](https://issues.apache.org/jira/browse/HBASE-21201) | *Major* | **Support to run VerifyReplication MR tool without peerid**
9904
9905 We can specify peerQuorumAddress instead of peerId in VerifyReplication tool. So it no longer requires peerId to be setup when using this tool.
9906
9907 For example:
9908 hbase org.apache.hadoop.hbase.mapreduce.replication.VerifyReplication zk1,zk2,zk3:2181/hbase testTable
9909
9910
9911 ---
9912
9913 * [HBASE-21838](https://issues.apache.org/jira/browse/HBASE-21838) | *Major* | **Create a special ReplicationEndpoint just for verifying the WAL entries are fine**
9914
9915 Introduce a VerifyWALEntriesReplicationEndpoint which replicates nothing but only verifies if all the cells are valid.
9916 It can be used to capture bugs for writing WAL, as most times we will not read the WALs again after writing it if there are no region server crashes.
9917
9918
9919 ---
9920
9921 * [HBASE-21764](https://issues.apache.org/jira/browse/HBASE-21764) | *Major* | **Size of in-memory compaction thread pool should be configurable**
9922
9923 Introduced an new config key in this issue: hbase.regionserver.inmemory.compaction.pool.size. the default value would be 10.  you can configure this to set the pool size of in-memory compaction pool. Note that all memstores in one region server will share the same pool, so if you have many regions in one region server,  you need to set this larger to compact faster for better read performance.
9924
9925
9926 ---
9927
9928 * [HBASE-21684](https://issues.apache.org/jira/browse/HBASE-21684) | *Major* | **Throw DNRIOE when connection or rpc client is closed**
9929
9930 Make StoppedRpcClientException extend DoNotRetryIOException.
9931
9932
9933 ---
9934
9935 * [HBASE-21739](https://issues.apache.org/jira/browse/HBASE-21739) | *Major* | **Move grant/revoke from regionserver to master**
9936
9937 To implement user permission control in Precedure V2, move grant and revoke method from AccessController to master firstly.
9938 Mark AccessController#grant and AccessController#revoke as deprecated and please use Admin#grant and Admin#revoke instead.
9939
9940
9941 ---
9942
9943 * [HBASE-21791](https://issues.apache.org/jira/browse/HBASE-21791) | *Blocker* | **Upgrade thrift dependency to 0.12.0**
9944
9945 IMPORTANT: Due to security issues, all users who use hbase thrift should avoid using releases which do not have this fix.
9946
9947 The effect releases are:
9948 2.1.x: 2.1.2 and below
9949 2.0.x: 2.0.4 and below
9950 1.x: 1.4.x and below
9951
9952 If you are using the effect releases above, please consider upgrading to a newer release ASAP.
9953
9954
9955 ---
9956
9957 * [HBASE-20894](https://issues.apache.org/jira/browse/HBASE-20894) | *Major* | **Move BucketCache from java serialization to protobuf**
9958
9959 For users who have configured hbase.bucketcache.ioengine with either the file:, files:, or mmap: prefix, and configured it to be persistent via the hbase.bucketcache.persistent.path property, the serialization format of the bucket cache has changed between versions. The old state will not be read during startup, and there is currently no migration path. The impact is expected to be minimal, however, since the cache will rebuild over time as access patterns dictate.
9960
9961
9962
9963
9964 # HBASE  2.2.0 Release Notes
9965
9966 These release notes cover new developer and user-facing incompatibilities, important issues, features, and major improvements.
9967
9968
9969 ---
9970
9971 * [HBASE-21970](https://issues.apache.org/jira/browse/HBASE-21970) | *Major* | **Document that how to upgrade from 2.0 or 2.1 to 2.2+**
9972
9973 See the document http://hbase.apache.org/book.html#upgrade2.2 about how to upgrade from 2.0 or 2.1 to 2.2+.
9974
9975 HBase 2.2+ uses a new Procedure form assiging/unassigning/moving Regions. It does not process HBase 2.1 and 2.0's Unassign/Assign Procedure types. Upgrade requires that we first drain the Master Procedure Store of old style Procedures before starting the new 2.2 Master. So you need to make sure that before you kill the old version (2.0 or 2.1) Master, there is no region in transition. And once the new version (2.2+) Master is up, you can rolling upgrade RegionServers one by one.
9976
9977 And there is a more safer way if you are running 2.1.1+ or 2.0.3+ cluster. It need four steps to upgrade Master.
9978
9979 1. Shutdown both active and standby Masters (Your cluster will continue to server reads and writes without interruption).
9980 2. Set the property hbase.procedure.upgrade-to-2-2 to true in hbase-site.xml for the Master, and start only one Master, still using the 2.1.1+ (or 2.0.3+) version.
9981 3. Wait until the Master quits. Confirm that there is a 'READY TO ROLLING UPGRADE' message in the Master log as the cause of the shutdown. The Procedure Store is now empty.
9982 4. Start new Masters with the new 2.2+ version.
9983
9984 Then you can rolling upgrade RegionServers one by one. See HBASE-21075 for more details.
9985
9986
9987 ---
9988
9989 * [HBASE-21536](https://issues.apache.org/jira/browse/HBASE-21536) | *Trivial* | **Fix completebulkload usage instructions**
9990
9991 Added completebulkload short name for BulkLoadHFilesTool to bin/hbase.
9992
9993
9994 ---
9995
9996 * [HBASE-22500](https://issues.apache.org/jira/browse/HBASE-22500) | *Blocker* | **Modify pom and jenkins jobs for hadoop versions**
9997
9998 Change the default hadoop-3 version to 3.1.2. Drop the support for the releases which are effected by CVE-2018-8029, see this email https://lists.apache.org/thread.html/3d6831c3893cd27b6850aea2feff7d536888286d588e703c6ffd2e82@%3Cuser.hadoop.apache.org%3E
9999
10000
10001 ---
10002
10003 * [HBASE-22148](https://issues.apache.org/jira/browse/HBASE-22148) | *Blocker* | **Provide an alternative to CellUtil.setTimestamp**
10004
10005 <!-- markdown -->
10006
10007 The `CellUtil.setTimestamp` method changes to be an API with audience `LimitedPrivate(COPROC)` in HBase 3.0. With that designation the API should remain stable within a given minor release line, but may change between minor releases.
10008
10009 Previously, this method was deprecated in HBase 2.0 for removal in HBase 3.0. Deprecation messages in HBase 2.y releases have been updated to indicate the expected API audience change.
10010
10011
10012 ---
10013
10014 * [HBASE-21991](https://issues.apache.org/jira/browse/HBASE-21991) | *Major* | **Fix MetaMetrics issues - [Race condition, Faulty remove logic], few improvements**
10015
10016 The class LossyCounting was unintentionally marked Public but was never intended to be part of our public API. This oversight has been corrected and LossyCounting is now marked as Private and going forward may be subject to additional breaking changes or removal without notice. If you have taken a dependency on this class we recommend cloning it locally into your project before upgrading to this release.
10017
10018
10019 ---
10020
10021 * [HBASE-22226](https://issues.apache.org/jira/browse/HBASE-22226) | *Trivial* | **Incorrect level for headings in asciidoc**
10022
10023 Warnings for level headings are corrected in the book for the HBase Incompatibilities section.
10024
10025
10026 ---
10027
10028 * [HBASE-20970](https://issues.apache.org/jira/browse/HBASE-20970) | *Major* | **Update hadoop check versions for hadoop3 in hbase-personality**
10029
10030 Add hadoop 3.0.3, 3.1.1 3.1.2 in our hadoop check jobs.
10031
10032
10033 ---
10034
10035 * [HBASE-21784](https://issues.apache.org/jira/browse/HBASE-21784) | *Major* | **Dump replication queue should show list of wal files ordered chronologically**
10036
10037 The DumpReplicationQueues tool will now list replication queues sorted in chronological order.
10038
10039
10040 ---
10041
10042 * [HBASE-22384](https://issues.apache.org/jira/browse/HBASE-22384) | *Minor* | **Formatting issues in administration section of book**
10043
10044 Fixes a formatting issue in the administration section of the book, where listing indentation were a little bit off.
10045
10046
10047 ---
10048
10049 * [HBASE-22399](https://issues.apache.org/jira/browse/HBASE-22399) | *Major* | **Change default hadoop-two.version to 2.8.x and remove the 2.7.x hadoop checks**
10050
10051 Now the default hadoop-two.version has been changed to 2.8.5, and all hadoop versions before 2.8.2(exclude) will not be supported any more.
10052
10053
10054 ---
10055
10056 * [HBASE-22392](https://issues.apache.org/jira/browse/HBASE-22392) | *Trivial* | **Remove extra/useless +**
10057
10058 Removed extra + in HRegion, HStore and LoadIncrementalHFiles for branch-2 and HRegion and HStore for branch-1.
10059
10060
10061 ---
10062
10063 * [HBASE-20494](https://issues.apache.org/jira/browse/HBASE-20494) | *Major* | **Upgrade com.yammer.metrics dependency**
10064
10065 Updated metrics core from 3.2.1 to 3.2.6.
10066
10067
10068 ---
10069
10070 * [HBASE-22358](https://issues.apache.org/jira/browse/HBASE-22358) | *Minor* | **Change rubocop configuration for method length**
10071
10072 The rubocop definition for the maximum method length was set to 75.
10073
10074
10075 ---
10076
10077 * [HBASE-22379](https://issues.apache.org/jira/browse/HBASE-22379) | *Minor* | **Fix Markdown for "Voting on Release Candidates" in book**
10078
10079 Fixes the formatting of the "Voting on Release Candidates" to actually show the quote and code formatting of the RAT check.
10080
10081
10082 ---
10083
10084 * [HBASE-20851](https://issues.apache.org/jira/browse/HBASE-20851) | *Minor* | **Change rubocop config for max line length of 100**
10085
10086 The rubocop configuration in the hbase-shell module now allows a line length with 100 characters, instead of 80 as before. For everything before 2.1.5 this change introduces rubocop itself.
10087
10088
10089 ---
10090
10091 * [HBASE-22054](https://issues.apache.org/jira/browse/HBASE-22054) | *Minor* | **Space Quota: Compaction is not working for super user in case of NO\_WRITES\_COMPACTIONS**
10092
10093 This change allows the system and superusers to initiate compactions, even when a space quota violation policy disallows compactions from happening. The original intent behind disallowing of compactions was to prevent end-user compactions from creating undue I/O load, not disallowing \*any\* compaction in the system.
10094
10095
10096 ---
10097
10098 * [HBASE-22292](https://issues.apache.org/jira/browse/HBASE-22292) | *Blocker* | **PreemptiveFastFailInterceptor clean repeatedFailuresMap issue**
10099
10100 Adds new configuration hbase.client.failure.map.cleanup.interval which defaults to ten minutes.
10101
10102
10103 ---
10104
10105 * [HBASE-22155](https://issues.apache.org/jira/browse/HBASE-22155) | *Major* | **Move 2.2.0 on to hbase-thirdparty-2.2.0**
10106
10107  Updates libs used internally by hbase via hbase-thirdparty as follows:
10108
10109  gson 2.8.1 -\\\> 2.8.5
10110  guava 22.0 -\\\> 27.1-jre
10111  pb 3.5.1 -\\\> 3.7.0
10112  netty 4.1.17 -\\\> 4.1.34
10113  commons-collections4 4.1 -\\\> 4.3
10114
10115
10116 ---
10117
10118 * [HBASE-22178](https://issues.apache.org/jira/browse/HBASE-22178) | *Major* | **Introduce a createTableAsync with TableDescriptor method in Admin**
10119
10120 Introduced
10121
10122 Future\<Void\> createTableAsync(TableDescriptor);
10123
10124
10125 ---
10126
10127 * [HBASE-22108](https://issues.apache.org/jira/browse/HBASE-22108) | *Major* | **Avoid passing null in Admin methods**
10128
10129 Introduced these methods:
10130 void move(byte[]);
10131 void move(byte[], ServerName);
10132 Future\<Void\> splitRegionAsync(byte[]);
10133
10134 These methods are deprecated:
10135 void move(byte[], byte[])
10136
10137
10138 ---
10139
10140 * [HBASE-22152](https://issues.apache.org/jira/browse/HBASE-22152) | *Major* | **Create a jenkins file for yetus to processing GitHub PR**
10141
10142 Add a new jenkins file for running pre commit check for GitHub PR.
10143
10144
10145 ---
10146
10147 * [HBASE-22007](https://issues.apache.org/jira/browse/HBASE-22007) | *Major* | **Add restoreSnapshot and cloneSnapshot with acl methods in AsyncAdmin**
10148
10149 Add cloneSnapshot/restoreSnapshot with acl methods in AsyncAdmin.
10150
10151
10152 ---
10153
10154 * [HBASE-22123](https://issues.apache.org/jira/browse/HBASE-22123) | *Minor* | **REST gateway reports Insufficient permissions exceptions as 404 Not Found**
10155
10156 When insufficient permissions, you now get:
10157
10158 HTTP/1.1 403 Forbidden
10159
10160 on the HTTP side, and in the message
10161
10162 Forbidden
10163 org.apache.hadoop.hbase.security.AccessDeniedException: org.apache.hadoop.hbase.security.AccessDeniedException: Insufficient permissions for user ‘myuser',action: get, tableName:mytable, family:cf.
10164 at org.apache.ranger.authorization.hbase.RangerAuthorizationCoprocessor.authorizeAccess(RangerAuthorizationCoprocessor.java:547)
10165 and the rest of the ADE stack
10166
10167
10168 ---
10169
10170 * [HBASE-22100](https://issues.apache.org/jira/browse/HBASE-22100) | *Minor* | **False positive for error prone warnings in pre commit job**
10171
10172 Now we will sort the javac WARNING/ERROR before generating diff in pre-commit so we can get a stable output for the error prone. The downside is that we just sort the output lexicographically so the line number will also be sorted lexicographically, which is a bit strange to human.
10173
10174
10175 ---
10176
10177 * [HBASE-22057](https://issues.apache.org/jira/browse/HBASE-22057) | *Major* | **Impose upper-bound on size of ZK ops sent in a single multi()**
10178
10179 Exposes a new configuration property "zookeeper.multi.max.size" which dictates the maximum size of deletes that HBase will make to ZooKeeper in a single RPC. This property defaults to 1MB, which should fall beneath the default ZooKeeper limit of 2MB, controlled by "jute.maxbuffer".
10180
10181
10182 ---
10183
10184 * [HBASE-22052](https://issues.apache.org/jira/browse/HBASE-22052) | *Major* | **pom cleaning; filter out jersey-core in hadoop2 to match hadoop3 and remove redunant version specifications**
10185
10186 <!-- markdown -->
10187 Fixed awkward dependency issue that prevented site building.
10188
10189 #### note specific to HBase 2.1.4
10190 HBase 2.1.4 shipped with an early version of this fix that incorrectly altered the libraries included in our binary assembly for using Apache Hadoop 2.7 (the current build default Hadoop version for 2.1.z). For folks running out of the box against a Hadoop 2.7 cluster (or folks who skip the installation step of [replacing the bundled Hadoop libraries](http://hbase.apache.org/book.html#hadoop)) this will result in a failure at Region Server startup due to a missing class definition. e.g.:
10191 ```
10192 2019-03-27 09:02:05,779 ERROR [main] regionserver.HRegionServer: Failed construction RegionServer
10193 java.lang.NoClassDefFoundError: org/apache/htrace/SamplerBuilder
10194         at org.apache.hadoop.hdfs.DFSClient.<init>(DFSClient.java:644)
10195         at org.apache.hadoop.hdfs.DFSClient.<init>(DFSClient.java:628)
10196         at org.apache.hadoop.hdfs.DistributedFileSystem.initialize(DistributedFileSystem.java:149)
10197         at org.apache.hadoop.fs.FileSystem.createFileSystem(FileSystem.java:2667)
10198         at org.apache.hadoop.fs.FileSystem.access$200(FileSystem.java:93)
10199         at org.apache.hadoop.fs.FileSystem$Cache.getInternal(FileSystem.java:2701)
10200         at org.apache.hadoop.fs.FileSystem$Cache.get(FileSystem.java:2683)
10201         at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:372)
10202         at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:171)
10203         at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:356)
10204         at org.apache.hadoop.fs.Path.getFileSystem(Path.java:295)
10205         at org.apache.hadoop.hbase.util.CommonFSUtils.getRootDir(CommonFSUtils.java:362)
10206         at org.apache.hadoop.hbase.util.CommonFSUtils.isValidWALRootDir(CommonFSUtils.java:411)
10207         at org.apache.hadoop.hbase.util.CommonFSUtils.getWALRootDir(CommonFSUtils.java:387)
10208         at org.apache.hadoop.hbase.regionserver.HRegionServer.initializeFileSystem(HRegionServer.java:704)
10209         at org.apache.hadoop.hbase.regionserver.HRegionServer.<init>(HRegionServer.java:613)
10210         at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
10211         at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:62)
10212         at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)
10213         at java.lang.reflect.Constructor.newInstance(Constructor.java:423)
10214         at org.apache.hadoop.hbase.regionserver.HRegionServer.constructRegionServer(HRegionServer.java:3029)
10215         at org.apache.hadoop.hbase.regionserver.HRegionServerCommandLine.start(HRegionServerCommandLine.java:63)
10216         at org.apache.hadoop.hbase.regionserver.HRegionServerCommandLine.run(HRegionServerCommandLine.java:87)
10217         at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
10218         at org.apache.hadoop.hbase.util.ServerCommandLine.doMain(ServerCommandLine.java:149)
10219         at org.apache.hadoop.hbase.regionserver.HRegionServer.main(HRegionServer.java:3047)
10220 Caused by: java.lang.ClassNotFoundException: org.apache.htrace.SamplerBuilder
10221         at java.net.URLClassLoader.findClass(URLClassLoader.java:381)
10222         at java.lang.ClassLoader.loadClass(ClassLoader.java:424)
10223         at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:349)
10224         at java.lang.ClassLoader.loadClass(ClassLoader.java:357)
10225         ... 26 more
10226
10227 ```
10228
10229 Workaround via any _one_ of the following:
10230 * If you are running against a Hadoop cluster that is 2.8+, ensure you replace the Hadoop libaries in the default binary assembly with those for your version.
10231 * If you are running against a Hadoop cluster that is 2.8+, build the binary assembly from the source release while specifying your Hadoop version.
10232 * If you are running against a Hadoop cluster that is a supported 2.7 release, ensure the `hadoop` executable is in the `PATH` seen at Region Server startup and that you are not using the `HBASE_DISABLE_HADOOP_CLASSPATH_LOOKUP` bypass.
10233 * For any supported Hadoop version, manually make the Apache HTrace artifact `htrace-core-3.1.0-incubating.jar` available to all Region Servers via the HBASE_CLASSPATH environment variable.
10234 * For any supported Hadoop version, manually make the Apache HTrace artifact `htrace-core-3.1.0-incubating.jar` available to all Region Servers by copying it into the directory `${HBASE_HOME}/lib/client-facing-thirdparty/`.
10235
10236
10237 ---
10238
10239 * [HBASE-22065](https://issues.apache.org/jira/browse/HBASE-22065) | *Major* | **Add listTableDescriptors(List\<TableName\>) method in AsyncAdmin**
10240
10241 Add a listTableDescriptors(List\<TableName\>) method in the AsyncAdmin interface, to align with the Admin interface.
10242
10243
10244 ---
10245
10246 * [HBASE-22040](https://issues.apache.org/jira/browse/HBASE-22040) | *Major* | **Add mergeRegionsAsync with a List of region names method in AsyncAdmin**
10247
10248 Add a mergeRegionsAsync(byte[][], boolean) method in the AsyncAdmin interface.
10249
10250 Instead of using assert, now we will throw IllegalArgumentException when you want to merge less than 2 regions at client side. And also, at master side, instead of using assert, now we will throw DoNotRetryIOException if you want merge more than 2 regions, since we only support merging two regions at once for now.
10251
10252
10253 ---
10254
10255 * [HBASE-22039](https://issues.apache.org/jira/browse/HBASE-22039) | *Major* | **Should add the synchronous parameter for the XXXSwitch method in AsyncAdmin**
10256
10257 Add drainXXX parameter for balancerSwitch/splitSwitch/mergeSwitch methods in the AsyncAdmin interface, which has the same meaning with the synchronous parameter for these methods in the Admin interface.
10258
10259
10260 ---
10261
10262 * [HBASE-21810](https://issues.apache.org/jira/browse/HBASE-21810) | *Major* | **bulkload  support set hfile compression on client**
10263
10264 bulkload (HFileOutputFormat2)  support config the compression on client ,you can set the job configuration "hbase.mapreduce.hfileoutputformat.compression"  override the auto-detection of the target table's compression
10265
10266
10267 ---
10268
10269 * [HBASE-22000](https://issues.apache.org/jira/browse/HBASE-22000) | *Major* | **Deprecated isTableAvailable with splitKeys**
10270
10271 Deprecated AsyncTable.isTableAvailable(TableName, byte[][]).
10272
10273
10274 ---
10275
10276 * [HBASE-21871](https://issues.apache.org/jira/browse/HBASE-21871) | *Major* | **Support to specify a peer table name in VerifyReplication tool**
10277
10278 After HBASE-21871, we can specify a peer table name with --peerTableName in VerifyReplication tool like the following:
10279 hbase org.apache.hadoop.hbase.mapreduce.replication.VerifyReplication --peerTableName=peerTable 5 TestTable
10280
10281 In addition, we can compare any 2 tables in any remote clusters with specifying both peerId and --peerTableName.
10282
10283 For example:
10284 hbase org.apache.hadoop.hbase.mapreduce.replication.VerifyReplication --peerTableName=peerTable zk1,zk2,zk3:2181/hbase TestTable
10285
10286
10287 ---
10288
10289 * [HBASE-15728](https://issues.apache.org/jira/browse/HBASE-15728) | *Major* | **Add remaining per-table region / store / flush / compaction related metrics**
10290
10291 Adds below flush, split, and compaction metrics
10292
10293  +  // split related metrics
10294  +  private MutableFastCounter splitRequest;
10295  +  private MutableFastCounter splitSuccess;
10296  +  private MetricHistogram splitTimeHisto;
10297  +
10298  +  // flush related metrics
10299  +  private MetricHistogram flushTimeHisto;
10300  +  private MetricHistogram flushMemstoreSizeHisto;
10301  +  private MetricHistogram flushOutputSizeHisto;
10302  +  private MutableFastCounter flushedMemstoreBytes;
10303  +  private MutableFastCounter flushedOutputBytes;
10304  +
10305  +  // compaction related metrics
10306  +  private MetricHistogram compactionTimeHisto;
10307  +  private MetricHistogram compactionInputFileCountHisto;
10308  +  private MetricHistogram compactionInputSizeHisto;
10309  +  private MetricHistogram compactionOutputFileCountHisto;
10310  +  private MetricHistogram compactionOutputSizeHisto;
10311  +  private MutableFastCounter compactedInputBytes;
10312  +  private MutableFastCounter compactedOutputBytes;
10313  +
10314  +  private MetricHistogram majorCompactionTimeHisto;
10315  +  private MetricHistogram majorCompactionInputFileCountHisto;
10316  +  private MetricHistogram majorCompactionInputSizeHisto;
10317  +  private MetricHistogram majorCompactionOutputFileCountHisto;
10318  +  private MetricHistogram majorCompactionOutputSizeHisto;
10319  +  private MutableFastCounter majorCompactedInputBytes;
10320  +  private MutableFastCounter majorCompactedOutputBytes;
10321
10322
10323 ---
10324
10325 * [HBASE-20886](https://issues.apache.org/jira/browse/HBASE-20886) | *Critical* | **[Auth] Support keytab login in hbase client**
10326
10327 From 2.2.0, hbase supports client login via keytab. To use this feature, client should specify \`hbase.client.keytab.file\` and \`hbase.client.keytab.principal\` in hbase-site.xml, then the connection will contain the needed credentials which be renewed periodically to communicate with kerberized hbase cluster.
10328
10329
10330 ---
10331
10332 * [HBASE-21410](https://issues.apache.org/jira/browse/HBASE-21410) | *Major* | **A helper page that help find all problematic regions and procedures**
10333
10334 After HBASE-21410, we add a helper page to Master UI. This helper page is mainly to help HBase operator quickly found all regions and pids that are get stuck.
10335 There are 2 entries to get in this page.
10336 One is showing in the Regions in Transition section, it made "num region(s) in transition" a link that you can click and check all regions in transition and their related procedure IDs.
10337 The other one is showing in the table details section, it made the number of CLOSING or OPENING regions a link, which you can click and check regions and related procedure IDs of CLOSING or OPENING regions of a certain table.
10338 In this helper page, not only you can see all regions and related procedures, there are 2 buttons at the top which will show these regions or procedure IDs in text format. This is mainly aim to help operator to easily copy and paste all problematic procedure IDs and encoded region names to HBCK2's command line, by which we HBase operator can bypass these procedures or assign these regions.
10339
10340
10341 ---
10342
10343 * [HBASE-21588](https://issues.apache.org/jira/browse/HBASE-21588) | *Major* | **Procedure v2 wal splitting implementation**
10344
10345 After HBASE-21588, we introduce a new way to do WAL splitting coordination by procedure framework. This can simplify the process of WAL splitting and no need to connect zookeeper any more.
10346 During ServerCrashProcedure, it will create a SplitWALProcedure for each WAL that need to split. Then each SplitWALProcedure will spawn a SplitWALRemoteProcedure to send the request to regionserver.
10347 At the RegionServer side, whole process is handled by SplitWALCallable. It split the WAL and return the result to master.
10348 According to my test, this patch has a better performance as the number of WALs that need to split increase. And it can relieve the pressure on zookeeper.
10349
10350
10351 ---
10352
10353 * [HBASE-20734](https://issues.apache.org/jira/browse/HBASE-20734) | *Major* | **Colocate recovered edits directory with hbase.wal.dir**
10354
10355 Previously the recovered.edits directory was under the root directory. This JIRA moves the recovered.edits directory to be under the hbase.wal.dir if set. It also adds a check for any recovered.edits found under the root directory for backwards compatibility. This gives improvements when a faster media(like SSD) or more local FileSystem is used for the hbase.wal.dir than the root dir.
10356
10357
10358 ---
10359
10360 * [HBASE-20401](https://issues.apache.org/jira/browse/HBASE-20401) | *Minor* | **Make \`MAX\_WAIT\` and \`waitIfNotFinished\` in CleanerContext configurable**
10361
10362 When oldwals (and hfile) cleaner cleans stale wals (and hfiles), it will periodically check and wait the clean results from filesystem, the total wait time will be no more than a max time.
10363
10364 The periodically wait and check configurations are hbase.oldwals.cleaner.thread.check.interval.msec (default is 500 ms) and hbase.regionserver.hfilecleaner.thread.check.interval.msec (default is 1000 ms).
10365
10366 Meanwhile, The max time configurations are hbase.oldwals.cleaner.thread.timeout.msec and hbase.regionserver.hfilecleaner.thread.timeout.msec, they are set to 60 seconds by default.
10367
10368 All support dynamic configuration.
10369
10370 e.g. in the oldwals cleaning scenario, one may consider tuning hbase.oldwals.cleaner.thread.timeout.msec and hbase.oldwals.cleaner.thread.check.interval.msec
10371
10372 1. While deleting a oldwal never complete (strange but possible), then delete file task needs to wait for a max of 60 seconds. Here, 60 seconds might be too long, or the opposite way is to increase more than 60 seconds in the use cases of slow file delete.
10373 2. The check and wait of a file delete is set to default in the period of 500 milliseconds, one might want to tune this checking period to a short interval to check more frequently or to a longer interval to avoid checking too often to manage their delete file task checking period (the longer interval may be use to avoid checking too fast while using a high latency storage).
10374
10375
10376 ---
10377
10378 * [HBASE-21481](https://issues.apache.org/jira/browse/HBASE-21481) | *Major* | **[acl] Superuser's permissions should not be granted or revoked by any non-su global admin**
10379
10380 HBASE-21481 improves the quality of access control, by strengthening the protection of super users's privileges.
10381
10382
10383 ---
10384
10385 * [HBASE-21082](https://issues.apache.org/jira/browse/HBASE-21082) | *Critical* | **Reimplement assign/unassign related procedure metrics**
10386
10387 Now we have four types of RIT procedure metrics, assign, unassign, move, reopen. The meaning of assign/unassign is changed, as we will not increase the unassign metric and then the assign metric when moving a region.
10388 Also introduced two new procedure metrics, open and close, which are used to track the open/close region calls to region server. We may send open/close multiple times to finish a RIT since we may retry multiple times.
10389
10390
10391 ---
10392
10393 * [HBASE-20724](https://issues.apache.org/jira/browse/HBASE-20724) | *Critical* | **Sometimes some compacted storefiles are still opened after region failover**
10394
10395 Problem: This is an old problem since HBASE-2231. The compaction event marker was only writed to WAL. But after flush, the WAL may be archived, which means an useful compaction event marker be deleted, too. So the compacted store files cannot be archived when region open and replay WAL.
10396
10397 Solution: After this jira, the compaction event tracker will be writed to HFile. When region open and load store files, read the compaction evnet tracker from HFile and archive the compacted store files which still exist.
10398
10399
10400 ---
10401
10402 * [HBASE-21820](https://issues.apache.org/jira/browse/HBASE-21820) | *Major* | **Implement CLUSTER quota scope**
10403
10404 HBase contains two quota scopes: MACHINE and CLUSTER. Before this patch, set quota operations did not expose scope option to client api and use MACHINE as default, CLUSTER scope can not be set and used.
10405 Shell commands are as follows:
10406 set\_quota, TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec'
10407
10408 This issue implements CLUSTER scope in a simple way: For user, namespace, user over namespace quota, use [ClusterLimit / RSNum] as machine limit. For table and user over table quota, use [ClusterLimit / TotalTableRegionNum \* MachineTableRegionNum] as machine limit.
10409 After this patch, user can set CLUSTER scope quota, but MACHINE is still default if user ignore scope.
10410 Shell commands are as follows:
10411 set\_quota, TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec'
10412 set\_quota, TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec', SCOPE =\> MACHINE
10413 set\_quota, TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec', SCOPE =\> CLUSTER
10414
10415
10416 ---
10417
10418 * [HBASE-21057](https://issues.apache.org/jira/browse/HBASE-21057) | *Minor* | **upgrade to latest spotbugs**
10419
10420 Change spotbugs version to 3.1.11.
10421
10422
10423 ---
10424
10425 * [HBASE-21922](https://issues.apache.org/jira/browse/HBASE-21922) | *Major* | **BloomContext#sanityCheck may failed when use ROWPREFIX\_DELIMITED bloom filter**
10426
10427 Remove bloom filter type ROWPREFIX\_DELIMITED. May add it back when find a better solution.
10428
10429
10430 ---
10431
10432 * [HBASE-21783](https://issues.apache.org/jira/browse/HBASE-21783) | *Major* | **Support exceed user/table/ns throttle quota if region server has available quota**
10433
10434 Support enable or disable exceed throttle quota. Exceed throttle quota means, user can over consume user/namespace/table quota if region server has additional available quota because other users don't consume at the same time.
10435 Use the following shell commands to enable/disable exceed throttle quota: enable\_exceed\_throttle\_quota
10436 disable\_exceed\_throttle\_quota
10437 There are two limits when enable exceed throttle quota:
10438 1. Must set at least one read and one write region server throttle quota;
10439 2. All region server throttle quotas must be in seconds time unit. Because once previous requests exceed their quota and consume region server quota, quota in other time units may be refilled in a long time, this may affect later requests.
10440
10441
10442 ---
10443
10444 * [HBASE-20587](https://issues.apache.org/jira/browse/HBASE-20587) | *Major* | **Replace Jackson with shaded thirdparty gson**
10445
10446 Remove jackson dependencies from most hbase modules except hbase-rest, use shaded gson instead. The output json will be a bit different since jackson can use getter/setter, but gson will always use the fields.
10447
10448
10449 ---
10450
10451 * [HBASE-21928](https://issues.apache.org/jira/browse/HBASE-21928) | *Major* | **Deprecated HConstants.META\_QOS**
10452
10453 Mark HConstants.META\_QOS as deprecated. It is for internal use only, which is the highest priority. You should not try to set a priority greater than or equal to this value, although it is no harm but also useless.
10454
10455
10456 ---
10457
10458 * [HBASE-17942](https://issues.apache.org/jira/browse/HBASE-17942) | *Major* | **Disable region splits and merges per table**
10459
10460 This patch adds the ability to disable split and/or merge for a table (By default, split and merge are enabled for a table).
10461
10462
10463 ---
10464
10465 * [HBASE-21636](https://issues.apache.org/jira/browse/HBASE-21636) | *Major* | **Enhance the shell scan command to support missing scanner specifications like ReadType, IsolationLevel etc.**
10466
10467 Allows shell to set Scan options previously not exposed. See additions as part of the scan help by typing following hbase shell:
10468
10469 hbase\> help 'scan'
10470
10471
10472 ---
10473
10474 * [HBASE-21201](https://issues.apache.org/jira/browse/HBASE-21201) | *Major* | **Support to run VerifyReplication MR tool without peerid**
10475
10476 We can specify peerQuorumAddress instead of peerId in VerifyReplication tool. So it no longer requires peerId to be setup when using this tool.
10477
10478 For example:
10479 hbase org.apache.hadoop.hbase.mapreduce.replication.VerifyReplication zk1,zk2,zk3:2181/hbase testTable
10480
10481
10482 ---
10483
10484 * [HBASE-21838](https://issues.apache.org/jira/browse/HBASE-21838) | *Major* | **Create a special ReplicationEndpoint just for verifying the WAL entries are fine**
10485
10486 Introduce a VerifyWALEntriesReplicationEndpoint which replicates nothing but only verifies if all the cells are valid.
10487 It can be used to capture bugs for writing WAL, as most times we will not read the WALs again after writing it if there are no region server crashes.
10488
10489
10490 ---
10491
10492 * [HBASE-21727](https://issues.apache.org/jira/browse/HBASE-21727) | *Minor* | **Simplify documentation around client timeout**
10493
10494 Deprecated HBaseConfiguration#getInt(Configuration, String, String, int) method and removed it from 3.0.0 version.
10495
10496
10497 ---
10498
10499 * [HBASE-21764](https://issues.apache.org/jira/browse/HBASE-21764) | *Major* | **Size of in-memory compaction thread pool should be configurable**
10500
10501 Introduced an new config key in this issue: hbase.regionserver.inmemory.compaction.pool.size. the default value would be 10.  you can configure this to set the pool size of in-memory compaction pool. Note that all memstores in one region server will share the same pool, so if you have many regions in one region server,  you need to set this larger to compact faster for better read performance.
10502
10503
10504 ---
10505
10506 * [HBASE-21684](https://issues.apache.org/jira/browse/HBASE-21684) | *Major* | **Throw DNRIOE when connection or rpc client is closed**
10507
10508 Make StoppedRpcClientException extend DoNotRetryIOException.
10509
10510
10511 ---
10512
10513 * [HBASE-21739](https://issues.apache.org/jira/browse/HBASE-21739) | *Major* | **Move grant/revoke from regionserver to master**
10514
10515 To implement user permission control in Precedure V2, move grant and revoke method from AccessController to master firstly.
10516 Mark AccessController#grant and AccessController#revoke as deprecated and please use Admin#grant and Admin#revoke instead.
10517
10518
10519 ---
10520
10521 * [HBASE-21791](https://issues.apache.org/jira/browse/HBASE-21791) | *Blocker* | **Upgrade thrift dependency to 0.12.0**
10522
10523 IMPORTANT: Due to security issues, all users who use hbase thrift should avoid using releases which do not have this fix.
10524
10525 The effect releases are:
10526 2.1.x: 2.1.2 and below
10527 2.0.x: 2.0.4 and below
10528 1.x: 1.4.x and below
10529
10530 If you are using the effect releases above, please consider upgrading to a newer release ASAP.
10531
10532
10533 ---
10534
10535 * [HBASE-21792](https://issues.apache.org/jira/browse/HBASE-21792) | *Major* | **Mark HTableMultiplexer as deprecated and remove it in 3.0.0**
10536
10537 HTableMultiplexer exposes the implementation class, and it is incomplete, so we mark it as deprecated and remove it in 3.0.0 release.
10538
10539 There is no direct replacement for HTableMultiplexer, please use BufferedMutator if you want to batch mutations to a table.
10540
10541
10542 ---
10543
10544 * [HBASE-21782](https://issues.apache.org/jira/browse/HBASE-21782) | *Major* | **LoadIncrementalHFiles should not be IA.Public**
10545
10546 Introduce a BulkLoadHFiles interface which is marked as IA.Public, for doing bulk load programmatically.
10547 Introduce a BulkLoadHFilesTool which extends BulkLoadHFiles, and is marked as IA.LimitedPrivate(TOOLS), for using from command line.
10548 The old LoadIncrementalHFiles is deprecated and will be removed in 3.0.0.
10549
10550
10551 ---
10552
10553 * [HBASE-21762](https://issues.apache.org/jira/browse/HBASE-21762) | *Major* | **Move some methods in ClusterConnection to Connection**
10554
10555 Move the two getHbck method from ClusterConnection to Connection, and mark the methods as IA.LimitedPrivate(HBCK), as ClusterConnection is IA.Private and should not be depended by HBCK2.
10556
10557 Add a clearRegionLocationCache method in Connection to clear the region location cache for all the tables. As in RegionLocator, most of the methods have a 'reload' parameter, which implicitly tells user that we have a region location cache, so adding a method to clear the cache is fine.
10558
10559
10560 ---
10561
10562 * [HBASE-21713](https://issues.apache.org/jira/browse/HBASE-21713) | *Major* | **Support set region server throttle quota**
10563
10564 Support set region server rpc throttle quota which represents the read/write ability of region servers and throttles when region server's total requests exceeding the limit.
10565
10566 Use the following shell command to set RS quota:
10567 set\_quota TYPE =\> THROTTLE, REGIONSERVER =\> 'all', THROTTLE\_TYPE =\> WRITE, LIMIT =\> '20000req/sec'
10568 set\_quota TYPE =\> THROTTLE, REGIONSERVER =\> 'all', LIMIT =\> NONE
10569 "all" represents the throttle quota of all region servers and setting specified region server quota isn't supported currently.
10570
10571
10572 ---
10573
10574 * [HBASE-21689](https://issues.apache.org/jira/browse/HBASE-21689) | *Minor* | **Make table/namespace specific current quota info available in shell(describe\_namespace & describe)**
10575
10576 In shell commands "describe\_namespace" and "describe", which are used to see the descriptors of the namespaces and tables respectively, quotas set on that particular namespace/table will also be printed along.
10577
10578
10579 ---
10580
10581 * [HBASE-17370](https://issues.apache.org/jira/browse/HBASE-17370) | *Major* | **Fix or provide shell scripts to drain and decommission region server**
10582
10583 Adds shell support for the following:
10584 - List decommissioned/draining region servers
10585 - Decommission a list of region servers, optionally offload corresponding regions
10586 - Recommission a region server, optionally load a list of passed regions
10587
10588
10589 ---
10590
10591 * [HBASE-21734](https://issues.apache.org/jira/browse/HBASE-21734) | *Major* | **Some optimization in FilterListWithOR**
10592
10593 After HBASE-21620, the filterListWithOR has been a bit slow because we need to merge each sub-filter's RC , while before HBASE-21620, we will skip many RC merging, but the logic was wrong. So here we choose another way to optimaze the performance: removing the KeyValueUtil#toNewKeyCell.
10594 Anoop Sam John suggested that the KeyValueUtil#toNewKeyCell can save some GC before because if we copy key part of cell into a single byte[], then the block the cell refering won't be refered by the filter list any more, the upper layer can GC the data block quickly. while after HBASE-21620, we will update the prevCellList for every encountered cell now, so the lifecycle of cell in prevCellList for FilterList will be quite shorter. so just use the cell ref for saving cpu.
10595 BTW, we removed all the arrays streams usage in filter list, because it's also quite time-consuming in our test.
10596
10597
10598 ---
10599
10600 * [HBASE-21738](https://issues.apache.org/jira/browse/HBASE-21738) | *Critical* | **Remove all the CSLM#size operation in our memstore because it's an quite time consuming.**
10601
10602 We found the memstore snapshotting would cost much time because of calling the time-consuming ConcurrentSkipListMap#Size, it would make the p999 latency spike happen. So in this issue, we remove all ConcurrentSkipListMap#size in memstore by counting the cellsCount in MemstoreSizeing. As the issue described, the p999 latency spike was mitigated.
10603
10604
10605 ---
10606
10607 * [HBASE-21034](https://issues.apache.org/jira/browse/HBASE-21034) | *Major* | **Add new throttle type: read/write capacity unit**
10608
10609 Provides a new throttle type: capacity unit. One read/write/request capacity unit represents that read/write/read+write up to 1K data. If data size is more than 1K, then consume additional capacity units.
10610
10611 Use shell command to set capacity unit(CU):
10612 set\_quota TYPE =\> THROTTLE, THROTTLE\_TYPE =\> WRITE, USER =\> 'u1', LIMIT =\> '10CU/sec'
10613
10614 Use the "hbase.quota.read.capacity.unit" property to set the data size of one read capacity unit in bytes, the default value is 1K. Use the "hbase.quota.write.capacity.unit" property to set the data size of one write capacity unit in bytes, the default value is 1K.
10615
10616
10617 ---
10618
10619 * [HBASE-21595](https://issues.apache.org/jira/browse/HBASE-21595) | *Minor* | **Print thread's information and stack traces when RS is aborting forcibly**
10620
10621 Does thread dump on stdout on abort.
10622
10623
10624 ---
10625
10626 * [HBASE-21732](https://issues.apache.org/jira/browse/HBASE-21732) | *Critical* | **Should call toUpperCase before using Enum.valueOf in some methods for ColumnFamilyDescriptor**
10627
10628 Now all the Enum configs in ColumnFamilyDescriptor can accept lower case config value.
10629
10630
10631 ---
10632
10633 * [HBASE-21712](https://issues.apache.org/jira/browse/HBASE-21712) | *Minor* | **Make submit-patch.py python3 compatible**
10634
10635 Python3 support was added to dev-support/submit-patch.py. To install newly required dependencies run \`pip install -r dev-support/python-requirements.txt\` command.
10636
10637
10638 ---
10639
10640 * [HBASE-21657](https://issues.apache.org/jira/browse/HBASE-21657) | *Major* | **PrivateCellUtil#estimatedSerializedSizeOf has been the bottleneck in 100% scan case.**
10641
10642 In HBASE-21657,  I simplified the path of estimatedSerialiedSize() & estimatedSerialiedSizeOfCell() by moving the general getSerializedSize()
10643 and heapSize() from ExtendedCell to Cell interface. The patch also included some other improvments:
10644
10645 1. For 99%  of case, our cells has no tags, so let the HFileScannerImpl just return the NoTagsByteBufferKeyValue if no tags, which means we can save
10646    lots of cpu time when sending no tags cell to rpc because can just return the length instead of getting the serialize size by caculating offset/length
10647    of each fields(row/cf/cq..)
10648 2. Move the subclass's getSerializedSize implementation from ExtendedCell to their own class, which mean we did not need to call ExtendedCell's
10649    getSerialiedSize() firstly, then forward to subclass's getSerializedSize(withTags).
10650 3. Give a estimated result arraylist size for avoiding the frequent list extension when in a big scan, now we estimate the array size as min(scan.rows, 512).
10651    it's also help a lot.
10652
10653 We gain almost ~40% throughput improvement in 100% scan case for branch-2 (cacheHitRatio~100%)[1], it's a good thing. While it's a incompatible change in
10654 some case, such as if the upstream user implemented their own Cells, although it's rare but can happen, then their compile will be error.
10655
10656
10657 ---
10658
10659 * [HBASE-21647](https://issues.apache.org/jira/browse/HBASE-21647) | *Major* | **Add status track for splitting WAL tasks**
10660
10661 Adds task monitor that shows ServerCrashProcedure progress in UI.
10662
10663
10664 ---
10665
10666 * [HBASE-21652](https://issues.apache.org/jira/browse/HBASE-21652) | *Major* | **Refactor ThriftServer making thrift2 server inherited from thrift1 server**
10667
10668 Before this issue, thrift1 server and thrift2 server are totally different servers. If a new feature is added to thrift1 server, thrfit2 server have to make the same change to support it(e.g. authorization). After this issue, thrift2 server is inherited from thrift1, thrift2 server now have all the features thrift1 server has(e.g http support, which thrift2 server doesn't have before).  The way to start thrift1 or thrift2 server remain the same after this issue.
10669
10670
10671 ---
10672
10673 * [HBASE-21661](https://issues.apache.org/jira/browse/HBASE-21661) | *Major* | **Provide Thrift2 implementation of Table/Admin**
10674
10675 ThriftAdmin/ThriftTable are implemented based on Thrift2. With ThriftAdmin/ThriftTable, People can use thrift2 protocol just like HTable/HBaseAdmin.
10676 Example of using ThriftConnection
10677 Configuration conf = HBaseConfiguration.create();
10678 conf.set(ClusterConnection.HBASE\_CLIENT\_CONNECTION\_IMPL,ThriftConnection.class.getName());
10679 Connection conn = ConnectionFactory.createConnection(conf);
10680 Table table = conn.getTable(tablename)
10681 It is just like a normal Connection, similar use experience with the default ConnectionImplementation
10682
10683
10684 ---
10685
10686 * [HBASE-21618](https://issues.apache.org/jira/browse/HBASE-21618) | *Critical* | **Scan with the same startRow(inclusive=true) and stopRow(inclusive=false) returns one result**
10687
10688 There was a bug when scan with the same startRow(inclusive=true) and stopRow(inclusive=false). The old incorrect behavior is return one result. After this fix, the new correct behavior is return nothing.
10689
10690
10691 ---
10692
10693 * [HBASE-21159](https://issues.apache.org/jira/browse/HBASE-21159) | *Major* | **Add shell command to switch throttle on or off**
10694
10695 Support enable or disable rpc throttle when hbase quota is enabled. If hbase quota is enabled, rpc throttle is enabled by default.  When disable rpc throttle, HBase will not throttle any request. Use the following commands to switch rpc throttle : enable\_rpc\_throttle / disable\_rpc\_throttle.
10696
10697
10698 ---
10699
10700 * [HBASE-21659](https://issues.apache.org/jira/browse/HBASE-21659) | *Minor* | **Avoid to load duplicate coprocessors in system config and table descriptor**
10701
10702 Add a new configuration "hbase.skip.load.duplicate.table.coprocessor". The default value is false to keep compatible with the old behavior. Config it true to skip load duplicate table coprocessor.
10703
10704
10705 ---
10706
10707 * [HBASE-21650](https://issues.apache.org/jira/browse/HBASE-21650) | *Major* | **Add DDL operation and some other miscellaneous to thrift2**
10708
10709 Added DDL operations and some other structure definition to thrift2. Methods added:
10710 create/modify/addColumnFamily/deleteColumnFamily/modifyColumnFamily/enable/disable/truncate/delete table
10711 create/modify/delete namespace
10712 get(list)TableDescriptor(s)/get(list)NamespaceDescirptor(s)
10713 tableExists/isTableEnabled/isTableDisabled/isTableAvailabe
10714 And some class definitions along with those methods
10715
10716
10717 ---
10718
10719 * [HBASE-21643](https://issues.apache.org/jira/browse/HBASE-21643) | *Major* | **Introduce two new region coprocessor method and deprecated postMutationBeforeWAL**
10720
10721 Deprecated region coprocessor postMutationBeforeWAL and introduce two new region coprocessor postIncrementBeforeWAL and postAppendBeforeWAL instead.
10722
10723
10724 ---
10725
10726 * [HBASE-21635](https://issues.apache.org/jira/browse/HBASE-21635) | *Major* | **Use maven enforcer to ban imports from illegal packages**
10727
10728 Use de.skuzzle.enforcer.restrict-imports-enforcer-rule extension for maven enforcer plugin to ban illegal imports at compile time. Now if you use illegal imports, for example, import com.google.common.\*, there will be a compile error, instead of a checkstyle warning.
10729
10730
10731 ---
10732
10733 * [HBASE-21401](https://issues.apache.org/jira/browse/HBASE-21401) | *Critical* | **Sanity check when constructing the KeyValue**
10734
10735 Add a sanity check when constructing KeyValue from a byte[]. we use the constructor when we're reading kv from socket or HFIle or WAL(replication). the santiy check isn't designed for discovering the bits corruption in network transferring or disk IO. It is designed to detect bugs inside HBase in advance. and HBASE-21459 indicated that there's extremely small performance loss for diff kinds of keyvalue.
10736
10737
10738 ---
10739
10740 * [HBASE-21554](https://issues.apache.org/jira/browse/HBASE-21554) | *Minor* | **Show replication endpoint classname for replication peer on master web UI**
10741
10742 The replication UI on master will show the replication endpoint classname.
10743
10744
10745 ---
10746
10747 * [HBASE-21549](https://issues.apache.org/jira/browse/HBASE-21549) | *Major* | **Add shell command for serial replication peer**
10748
10749 Add a SERIAL flag for add\_peer command to identifiy whether or not the replication peer is a serial replication peer. The default serial flag is false.
10750
10751
10752 ---
10753
10754 * [HBASE-21453](https://issues.apache.org/jira/browse/HBASE-21453) | *Major* | **Convert ReadOnlyZKClient to DEBUG instead of INFO**
10755
10756 Log level of ReadOnlyZKClient moved to debug.
10757
10758
10759 ---
10760
10761 * [HBASE-21283](https://issues.apache.org/jira/browse/HBASE-21283) | *Minor* | **Add new shell command 'rit' for listing regions in transition**
10762
10763 <!-- markdown -->
10764
10765 The HBase `shell` now includes a command to list regions currently in transition.
10766
10767 ```
10768 HBase Shell
10769 Use "help" to get list of supported commands.
10770 Use "exit" to quit this interactive shell.
10771 Version 1.5.0-SNAPSHOT, r9bb6d2fa8b760f16cd046657240ebd4ad91cb6de, Mon Oct  8 21:05:50 UTC 2018
10772
10773 hbase(main):001:0> help 'rit'
10774 List all regions in transition.
10775 Examples:
10776   hbase> rit
10777
10778 hbase(main):002:0> create ...
10779 0 row(s) in 2.5150 seconds
10780 => Hbase::Table - IntegrationTestBigLinkedList
10781
10782 hbase(main):003:0> rit
10783 0 row(s) in 0.0340 seconds
10784
10785 hbase(main):004:0> unassign '56f0c38c81ae453d19906ce156a2d6a1'
10786 0 row(s) in 0.0540 seconds
10787
10788 hbase(main):005:0> rit
10789 IntegrationTestBigLinkedList,L\xCC\xCC\xCC\xCC\xCC\xCC\xCB,1539117183224.56f0c38c81ae453d19906ce156a2d6a1. state=PENDING_CLOSE, ts=Tue Oct 09 20:33:34 UTC 2018 (0s ago), server=null
10790 1 row(s) in 0.0170 seconds
10791 ```
10792
10793
10794 ---
10795
10796 * [HBASE-21567](https://issues.apache.org/jira/browse/HBASE-21567) | *Major* | **Allow overriding configs starting up the shell**
10797
10798 Allow passing of -Dkey=value option to shell to override hbase-\* configuration: e.g.:
10799
10800 $ ./bin/hbase shell -Dhbase.zookeeper.quorum=ZK0.remote.cluster.example.org,ZK1.remote.cluster.example.org,ZK2.remote.cluster.example.org -Draining=false
10801 ...
10802 hbase(main):001:0\> @shell.hbase.configuration.get("hbase.zookeeper.quorum")
10803 =\> "ZK0.remote.cluster.example.org,ZK1.remote.cluster.example.org,ZK2.remote.cluster.example.org"
10804 hbase(main):002:0\> @shell.hbase.configuration.get("raining")
10805 =\> "false"
10806
10807
10808 ---
10809
10810 * [HBASE-21560](https://issues.apache.org/jira/browse/HBASE-21560) | *Major* | **Return a new TableDescriptor for MasterObserver#preModifyTable to allow coprocessor modify the TableDescriptor**
10811
10812 Incompatible change. Allow MasterObserver#preModifyTable to return a new TableDescriptor. And master will use this returned TableDescriptor to modify table.
10813
10814
10815 ---
10816
10817 * [HBASE-21551](https://issues.apache.org/jira/browse/HBASE-21551) | *Blocker* | **Memory leak when use scan with STREAM at server side**
10818
10819 <!-- markdown -->
10820 ### Summary
10821 HBase clusters will experience Region Server failures due to out of memory errors due to a leak given any of the following:
10822
10823 * User initiates Scan operations set to use the STREAM reading type
10824 * User initiates Scan operations set to use the default reading type that read more than 4 * the block size of column families involved in the scan (e.g. by default 4*64KiB)
10825 * Compactions run
10826
10827 ### Root cause
10828
10829 When there are long running scans the Region Server process attempts to optimize access by using a different API geared towards sequential access. Due to an error in HBASE-20704 for HBase 2.0+ the Region Server fails to release related resources when those scans finish. That same optimization path is always used for the HBase internal file compaction process.
10830
10831 ### Workaround
10832
10833 Impact for this error can be minimized by setting the config value “hbase.storescanner.pread.max.bytes” to MAX_INT to avoid the optimization for default user scans. Clients should also be checked to ensure they do not pass the STREAM read type to the Scan API. This will have a severe impact on performance for long scans.
10834
10835 Compactions always use this sequential optimized reading mechanism so downstream users will need to periodically restart Region Server roles after compactions have happened.
10836
10837
10838 ---
10839
10840 * [HBASE-21550](https://issues.apache.org/jira/browse/HBASE-21550) | *Major* | **Add a new method preCreateTableRegionInfos for MasterObserver which allows CPs to modify the TableDescriptor**
10841
10842 Add a new method preCreateTableRegionInfos for MasterObserver, which will be called before creating region infos for the given table,  before the preCreateTable method. It allows you to return a new TableDescritor to override the original one. Returns null or throws exception will stop the creation.
10843
10844
10845 ---
10846
10847 * [HBASE-21492](https://issues.apache.org/jira/browse/HBASE-21492) | *Critical* | **CellCodec Written To WAL Before It's Verified**
10848
10849 After HBASE-21492 the return type of WALCellCodec#getWALCellCodecClass has been changed from String to Class
10850
10851
10852 ---
10853
10854 * [HBASE-21387](https://issues.apache.org/jira/browse/HBASE-21387) | *Major* | **Race condition surrounding in progress snapshot handling in snapshot cache leads to loss of snapshot files**
10855
10856 To prevent race condition between in progress snapshot (performed by TakeSnapshotHandler) and HFileCleaner which results in data loss, this JIRA introduced mutual exclusion between taking snapshot and running HFileCleaner. That is, at any given moment, either some snapshot can be taken or, HFileCleaner checks hfiles which are not referenced, but not both can be running.
10857
10858
10859 ---
10860
10861 * [HBASE-21452](https://issues.apache.org/jira/browse/HBASE-21452) | *Major* | **Illegal character in hbase counters group name**
10862
10863 Changes group name of hbase metrics from "HBase Counters" to "HBaseCounters".
10864
10865
10866 ---
10867
10868 * [HBASE-21443](https://issues.apache.org/jira/browse/HBASE-21443) | *Major* | **[hbase-connectors] Purge hbase-\* modules from core now they've been moved to hbase-connectors**
10869
10870 Parent issue moved hbase-spark\* modules to hbase-connectors. This issue removes hbase-spark\* modules from hbase core repo.
10871
10872
10873 ---
10874
10875 * [HBASE-21430](https://issues.apache.org/jira/browse/HBASE-21430) | *Major* | **[hbase-connectors] Move hbase-spark\* modules to hbase-connectors repo**
10876
10877 hbase-spark\* modules have been cloned to https://github.com/apache/hbase-connectors All spark connector dev is to happen in that repo from here on out.
10878
10879 Let me file a subtask to remove hbase-spark\* modules from hbase core.
10880
10881
10882 ---
10883
10884 * [HBASE-21417](https://issues.apache.org/jira/browse/HBASE-21417) | *Critical* | **Pre commit build is broken due to surefire plugin crashes**
10885
10886 Add -Djdk.net.URLClassPath.disableClassPathURLCheck=true when executing surefire plugin.
10887
10888
10889 ---
10890
10891 * [HBASE-21191](https://issues.apache.org/jira/browse/HBASE-21191) | *Major* | **Add a holding-pattern if no assign for meta or namespace (Can happen if masterprocwals have been cleared).**
10892
10893 Puts master startup into holding pattern if meta is not assigned (previous it would exit). To make progress again, operator needs to inject an assign (Caveats and instruction can be found in HBASE-21035).
10894
10895
10896 ---
10897
10898 * [HBASE-21322](https://issues.apache.org/jira/browse/HBASE-21322) | *Critical* | **Add a scheduleServerCrashProcedure() API to HbckService**
10899
10900 Adds scheduleServerCrashProcedure to the HbckService.
10901
10902
10903 ---
10904
10905 * [HBASE-21325](https://issues.apache.org/jira/browse/HBASE-21325) | *Major* | **Force to terminate regionserver when abort hang in somewhere**
10906
10907 Add two new config hbase.regionserver.abort.timeout and hbase.regionserver.abort.timeout.task. If regionserver abort timeout, it will schedule an abort timeout task to run. The default abort task is SystemExitWhenAbortTimeout, which will force to terminate region server when abort timeout. And you can config a special abort timeout task by hbase.regionserver.abort.timeout.task.
10908
10909
10910 ---
10911
10912 * [HBASE-21215](https://issues.apache.org/jira/browse/HBASE-21215) | *Major* | **Figure how to invoke hbck2; make it easy to find**
10913
10914 Adds to bin/hbase means of invoking hbck2. Pass the new '-j' option on the 'hbck' command with a value of the full path to the HBCK2.jar.
10915
10916 E.g:
10917
10918 $ ./bin/hbase hbck -j ~/checkouts/hbase-operator-tools/hbase-hbck2/target/hbase-hbck2-1.0.0-SNAPSHOT.jar  setTableState x ENABLED
10919
10920
10921 ---
10922
10923 * [HBASE-21372](https://issues.apache.org/jira/browse/HBASE-21372) | *Major* | **Set hbase.assignment.maximum.attempts to Long.MAX**
10924
10925 Retry assigns 'forever' (or until an intervention such as a ServerCrashProcedure).
10926
10927 Previous retry was a maximum of ten times but on failure, handling was an indeterminate.
10928
10929
10930 ---
10931
10932 * [HBASE-21338](https://issues.apache.org/jira/browse/HBASE-21338) | *Major* | **[balancer] If balancer is an ill-fit for cluster size, it gives little indication**
10933
10934 The description claims the balancer not dynamically configurable but this is an error; it is http://hbase.apache.org/book.html#dyn\_config
10935
10936 Also, if balancer is seen to be cutting out too soon, try setting "hbase.master.balancer.stochastic.runMaxSteps" to true.
10937
10938 Adds cleaner logging around balancer start.
10939
10940
10941 ---
10942
10943 * [HBASE-21073](https://issues.apache.org/jira/browse/HBASE-21073) | *Major* | **"Maintenance mode" master**
10944
10945     Instead of being an ephemeral state set by hbck, maintenance mode is now
10946     an explicit toggle set by either configuration property or environment
10947     variable. In maintenance mode, master will host system tables and not
10948     assign any user-space tables to RSs. This gives operators the ability to
10949     affect repairs to meta table with fewer moving parts.
10950
10951
10952 ---
10953
10954 * [HBASE-21335](https://issues.apache.org/jira/browse/HBASE-21335) | *Critical* | **Change the default wait time of HBCK2 tool**
10955
10956 Changed waitTime parameter to lockWait on bypass. Changed default waitTime from 0 -- i.e. wait for ever -- to 1ms so if lock is held, we'll go past it and if override enforce bypass.
10957
10958
10959 ---
10960
10961 * [HBASE-21291](https://issues.apache.org/jira/browse/HBASE-21291) | *Major* | **Add a test for bypassing stuck state-machine procedures**
10962
10963 bypass will now throw an Exception if passed a lockWait \<= 0; i.e bypass will prevent an operator getting stuck on an entity lock waiting forever (lockWait == 0)
10964
10965
10966 ---
10967
10968 * [HBASE-21320](https://issues.apache.org/jira/browse/HBASE-21320) | *Major* | **[canary] Cleanup of usage and add commentary**
10969
10970 Cleans up usage and docs around Canary.  Does not change command-line args (though we should -- smile).
10971
10972
10973 ---
10974
10975 * [HBASE-21278](https://issues.apache.org/jira/browse/HBASE-21278) | *Critical* | **Do not rollback successful sub procedures when rolling back a procedure**
10976
10977 For the sub procedures which are successfully finished, do not do rollback. This is a change in rollback behavior.
10978
10979 State changes which are done by sub procedures should be handled by parent procedures when rolling back. For example, when rolling back a MergeTableProcedure, we will schedule new procedures to bring the offline regions online instead of rolling back the original procedures which off-lined the regions (in fact these procedures can not be rolled back...).
10980
10981
10982 ---
10983
10984 * [HBASE-21158](https://issues.apache.org/jira/browse/HBASE-21158) | *Critical* | **Empty qualifier cell should not be returned if it does not match QualifierFilter**
10985
10986 <!-- markdown -->
10987
10988 Scans that make use of `QualifierFilter` previously would erroneously return both columns with an empty qualifier along with those that matched. After this change that behavior has changed to only return those columns that match.
10989
10990
10991 ---
10992
10993 * [HBASE-21098](https://issues.apache.org/jira/browse/HBASE-21098) | *Major* | **Improve Snapshot Performance with Temporary Snapshot Directory when rootDir on S3**
10994
10995 It is recommended to place the working directory on-cluster on HDFS as doing so has shown a strong performance increase due to data locality. It is important to note that the working directory should not overlap with any existing directories as the working directory will be cleaned out during the snapshot process. Beyond that, any well-named directory on HDFS should be sufficient.
10996
10997
10998 ---
10999
11000 * [HBASE-21185](https://issues.apache.org/jira/browse/HBASE-21185) | *Minor* | **WALPrettyPrinter: Additional useful info to be printed by wal printer tool, for debugability purposes**
11001
11002 This adds two extra features to WALPrettyPrinter tool:
11003
11004 1) Output for each cell combined size of cell descriptors, plus the cell value itself, in a given WAL edit. This is printed on the results as "cell total size sum:" info by default;
11005
11006 2) An optional -g/--goto argument, that allows to seek straight to that specific WAL file position, then sequentially reading the WAL from that point towards its end;
11007
11008
11009 ---
11010
11011 * [HBASE-21287](https://issues.apache.org/jira/browse/HBASE-21287) | *Major* | **JVMClusterUtil Master initialization wait time not configurable**
11012
11013 Local HBase cluster (as used by unit tests) wait times on startup and initialization can be configured via \`hbase.master.start.timeout.localHBaseCluster\` and \`hbase.master.init.timeout.localHBaseCluster\`
11014
11015
11016 ---
11017
11018 * [HBASE-21280](https://issues.apache.org/jira/browse/HBASE-21280) | *Trivial* | **Add anchors for each heading in UI**
11019
11020 Adds anchors #tables, #tasks, etc.
11021
11022
11023 ---
11024
11025 * [HBASE-21232](https://issues.apache.org/jira/browse/HBASE-21232) | *Major* | **Show table state in Tables view on Master home page**
11026
11027 Add table state column to the tables panel
11028
11029
11030 ---
11031
11032 * [HBASE-21223](https://issues.apache.org/jira/browse/HBASE-21223) | *Critical* | **[amv2] Remove abort\_procedure from shell**
11033
11034 Removed the abort\_procedure command from shell -- dangerous -- and deprecated abortProcedure in Admin API.
11035
11036
11037 ---
11038
11039 * [HBASE-20636](https://issues.apache.org/jira/browse/HBASE-20636) | *Major* | **Introduce two bloom filter type : ROWPREFIX\_FIXED\_LENGTH and ROWPREFIX\_DELIMITED**
11040
11041 Add two bloom filter type : ROWPREFIX\_FIXED\_LENGTH and ROWPREFIX\_DELIMITED
11042 1. ROWPREFIX\_FIXED\_LENGTH: specify the length of the prefix
11043 2. ROWPREFIX\_DELIMITED: specify the delimiter of the prefix
11044 Need to specify parameters for these two types of bloomfilter, otherwise the table will fail to create
11045 Example:
11046 create 't1', {NAME =\> 'f1', BLOOMFILTER =\> 'ROWPREFIX\_FIXED\_LENGTH', CONFIGURATION =\> {'RowPrefixBloomFilter.prefix\_length' =\> '10'}}
11047 create 't1', {NAME =\> 'f1', BLOOMFILTER =\> 'ROWPREFIX\_DELIMITED', CONFIGURATION =\> {'RowPrefixDelimitedBloomFilter.delimiter' =\> '#'}}
11048
11049
11050 ---
11051
11052 * [HBASE-21156](https://issues.apache.org/jira/browse/HBASE-21156) | *Critical* | **[hbck2] Queue an assign of hbase:meta and bulk assign/unassign**
11053
11054 Adds 'raw' assigns/unassigns to the Hbck Service. Takes a list of encoded region names and bulk assigns/unassigns. Skirts Master 'state' check and does not invoke Coprocessors. For repair only.
11055
11056 Here is what HBCK2 usage looks like now:
11057
11058 {code}
11059 $ java -cp hbase-hbck2-1.0.0-SNAPSHOT.jar  org.apache.hbase.HBCK2
11060 usage: HBCK2 \<OPTIONS\> COMMAND [\<ARGS\>]
11061
11062 Options:
11063  -d,--debug                      run with debug output
11064  -h,--help                       output this help message
11065     --hbase.zookeeper.peerport   peerport of target hbase ensemble
11066     --hbase.zookeeper.quorum     ensemble of target hbase
11067     --zookeeper.znode.parent     parent znode of target hbase
11068
11069 Commands:
11070  setTableState \<TABLENAME\> \<STATE\>
11071    Possible table states: ENABLED, DISABLED, DISABLING, ENABLING
11072    To read current table state, in the hbase shell run:
11073      hbase\> get 'hbase:meta', '\<TABLENAME\>', 'table:state'
11074    A value of \\x08\\x00 == ENABLED, \\x08\\x01 == DISABLED, etc.
11075    An example making table name 'user' ENABLED:
11076      $ HBCK2 setTableState users ENABLED
11077    Returns whatever the previous table state was.
11078
11079  assign \<ENCODED\_REGIONNAME\> ...
11080    A 'raw' assign that can be used even during Master initialization.
11081    Skirts Coprocessors. Pass one or more encoded RegionNames:
11082    e.g. 1588230740 is hard-coded encoding for hbase:meta region and
11083    de00010733901a05f5a2a3a382e27dd4 is an example of what a random
11084    user-space encoded Region name looks like. For example:
11085      $ HBCK2 assign 1588230740 de00010733901a05f5a2a3a382e27dd4
11086    Returns the pid of the created AssignProcedure or -1 if none.
11087
11088  unassign \<ENCODED\_REGIONNAME\> ...
11089    A 'raw' unassign that can be used even during Master initialization.
11090    Skirts Coprocessors. Pass one or more encoded RegionNames:
11091    Skirts Coprocessors. Pass one or more encoded RegionNames:
11092    de00010733901a05f5a2a3a382e27dd4 is an example of what a random
11093    user-space encoded Region name looks like. For example:
11094      $ HBCK2 unassign 1588230740 de00010733901a05f5a2a3a382e27dd4
11095    Returns the pid of the created UnassignProcedure or -1 if none.
11096 {code}
11097
11098
11099 ---
11100
11101 * [HBASE-21021](https://issues.apache.org/jira/browse/HBASE-21021) | *Major* | **Result returned by Append operation should be ordered**
11102
11103 This change ensures Append operations are assembled into the expected order.
11104
11105
11106 ---
11107
11108 * [HBASE-21171](https://issues.apache.org/jira/browse/HBASE-21171) | *Major* | **[amv2] Tool to parse a directory of MasterProcWALs standalone**
11109
11110 Make it so can run the WAL parse and load system in isolation. Here is an example:
11111
11112 {code}$ HBASE\_OPTS=" -XX:+UnlockDiagnosticVMOptions -XX:+UnlockCommercialFeatures -XX:+FlightRecorder -XX:+DebugNonSafepoints" ./bin/hbase org.apache.hadoop.hbase.procedure2.store.wal.WALProcedureStore ~/big\_set\_of\_masterprocwals/
11113 {code}
11114
11115
11116 ---
11117
11118 * [HBASE-21107](https://issues.apache.org/jira/browse/HBASE-21107) | *Minor* | **add a metrics for netty direct memory**
11119
11120 Add a new nettyDirectMemoryUsage under server's ipc metrics to show direct memory usage for netty rpc server.
11121
11122
11123 ---
11124
11125 * [HBASE-21153](https://issues.apache.org/jira/browse/HBASE-21153) | *Major* | **Shaded client jars should always build in relevant phase to avoid confusion**
11126
11127 Client facing artifacts are now built whenever Maven is run through the "package" goal. Previously, the client facing artifacts would create placeholder jars that skipped repackaging HBase and third-party dependencies unless the "release" profile was active.
11128
11129 Build times may be noticeably longer depending on your build hardware. For example, the Jenkins worker nodes maintained by ASF Infra take ~14% longer to do a full packaging build. An example portability-focused personal laptop took ~25% longer.
11130
11131
11132 ---
11133
11134 * [HBASE-20942](https://issues.apache.org/jira/browse/HBASE-20942) | *Major* | **Improve RpcServer TRACE logging**
11135
11136 Allows configuration of the length of RPC messages printed to the log at TRACE level via "hbase.ipc.trace.param.size" in RpcServer.
11137
11138
11139 ---
11140
11141 * [HBASE-20649](https://issues.apache.org/jira/browse/HBASE-20649) | *Minor* | **Validate HFiles do not have PREFIX\_TREE DataBlockEncoding**
11142
11143 <!-- markdown -->
11144 Users who have previously made use of prefix tree encoding can now check that their existing HFiles no longer contain data that uses it with an additional preupgrade check command.
11145
11146 ```
11147 hbase pre-upgrade validate-hfile
11148 ```
11149
11150 Please see the "HFile Content validation" section of the ref guide's coverage of the pre-upgrade validator tool for usage details.
11151
11152
11153 ---
11154
11155 * [HBASE-20941](https://issues.apache.org/jira/browse/HBASE-20941) | *Major* | **Create and implement HbckService in master**
11156
11157 Adds an HBCK Service and a first method to force-change-in-table-state for use by an HBCK client effecting 'repair' to a malfunctioning HBase.
11158
11159
11160 ---
11161
11162 * [HBASE-21071](https://issues.apache.org/jira/browse/HBASE-21071) | *Major* | **HBaseTestingUtility::startMiniCluster() to use builder pattern**
11163
11164 Cleanup all the cluster start override combos in HBaseTestingUtility by adding a StartMiniClusterOption and Builder.
11165
11166
11167 ---
11168
11169 * [HBASE-21072](https://issues.apache.org/jira/browse/HBASE-21072) | *Major* | **Block out HBCK1 in hbase2**
11170
11171 Fence out hbase-1.x hbck1 instances. Stop them making state changes on an hbase-2.x cluster; they could do damage. We do this by writing the hbck1 lock file into place on hbase-2.x Master start-up.
11172
11173 To disable this new behavior, set hbase.write.hbck1.lock.file to false
11174
11175
11176 ---
11177
11178 * [HBASE-20881](https://issues.apache.org/jira/browse/HBASE-20881) | *Major* | **Introduce a region transition procedure to handle all the state transition for a region**
11179
11180 Introduced a new TransitRegionStateProcedure to replace the old AssignProcedure/UnassignProcedure/MoveRegionProcedure. In the old code, MRP will not be attached to RegionStateNode, so it can not be interrupted by ServerCrashProcedure, which introduces lots of tricky code to deal with races, and also causes lots of other difficulties on how to prevent scheduling redundant or even conflict procedures for a region.
11181
11182 And now TRSP is the only one procedure which can bring region online or offline. When you want to schedule one, you need to check whether there is already one attached to the RegionStateNode, under the lock of the RegionStateNode. If not just go ahead, and if there is one, then you should do something, for example, give up and fail directly, or tell the TRSP to give up(This is what SCP does). Since the check and attach are both under the lock of RSN, it will greatly reduce the possible races, and make the code much simpler.
11183
11184
11185 ---
11186
11187 * [HBASE-21012](https://issues.apache.org/jira/browse/HBASE-21012) | *Critical* | **Revert the change of serializing TimeRangeTracker**
11188
11189 HFiles generated by 2.0.0, 2.0.1, 2.1.0 are not forward compatible to 1.4.6-, 1.3.2.1-, 1.2.6.1-, and other inactive releases. Why HFile lose compatability is hbase in new versions (2.0.0, 2.0.1, 2.1.0) use protobuf to serialize/deserialize TimeRangeTracker (TRT) while old versions use DataInput/DataOutput. To solve this, We have to put HBASE-21012 to 2.x and put HBASE-21013 in 1.x. For more information, please check HBASE-21008.
11190
11191
11192 ---
11193
11194 * [HBASE-20965](https://issues.apache.org/jira/browse/HBASE-20965) | *Major* | **Separate region server report requests to new handlers**
11195
11196 After HBASE-20965, we can use MasterFifoRpcScheduler in master to separate RegionServerReport requests to indenpedent handler. To use this feature, please set "hbase.master.rpc.scheduler.factory.class" to
11197  "org.apache.hadoop.hbase.ipc.MasterFifoRpcScheduler". Use "hbase.master.server.report.handler.count" to set RegionServerReport handlers count, the default value is half of "hbase.regionserver.handler.count" value, but at least 1, and the other handlers count in master is "hbase.regionserver.handler.count" value minus RegionServerReport handlers count, but at least 1 too.
11198
11199
11200 ---
11201
11202 * [HBASE-20813](https://issues.apache.org/jira/browse/HBASE-20813) | *Minor* | **Remove RPC quotas when the associated table/Namespace is dropped off**
11203
11204 In previous releases, when a Space Quota was configured on a table or namespace and that table or namespace was deleted, the Space Quota was also deleted. This change improves the implementation so that the same is also done for RPC Quotas.
11205
11206
11207 ---
11208
11209 * [HBASE-20986](https://issues.apache.org/jira/browse/HBASE-20986) | *Major* | **Separate the config of block size when we do log splitting and write Hlog**
11210
11211 After HBASE-20986, we can set different value to block size of WAL and recovered edits. Both of their default value is 2 \* default HDFS blocksize. And hbase.regionserver.recoverededits.blocksize is for block size of recovered edits while hbase.regionserver.hlog.blocksize is for block size of WAL.
11212
11213
11214 ---
11215
11216 * [HBASE-20856](https://issues.apache.org/jira/browse/HBASE-20856) | *Minor* | **PITA having to set WAL provider in two places**
11217
11218 With this change if a WAL's meta provider (hbase.wal.meta\_provider) is not explicitly set, it now defaults to whatever hbase.wal.provider is set to. Previous, the two settings operated independently, each with its own default.
11219
11220 This change is operationally incompatible with previous HBase versions because the default WAL meta provider no longer defaults to AsyncFSWALProvider but to hbase.wal.provider.
11221
11222 The thought is that this is more in line with an operator's expectation, that a change in hbase.wal.provider is sufficient to change how WALs are written, especially given hbase.wal.meta\_provider is an obscure configuration and that the very idea that meta regions would have their own wal provider would likely come as a surprise.
11223
11224
11225 ---
11226
11227 * [HBASE-20538](https://issues.apache.org/jira/browse/HBASE-20538) | *Critical* | **Upgrade our hadoop versions to 2.7.7 and 3.0.3**
11228
11229 Update hadoop-two.version to 2.7.7 and hadoop-three.version to 3.0.3 due to a JDK issue which is solved by HADOOP-15473.
11230
11231
11232 ---
11233
11234 * [HBASE-20846](https://issues.apache.org/jira/browse/HBASE-20846) | *Major* | **Restore procedure locks when master restarts**
11235
11236 1. Make hasLock method final, and add a locked field in Procedure to record whether we have the lock. We will set it to true in doAcquireLock and to false in doReleaseLock. The sub procedures do not need to manage it any more.
11237
11238 2. Also added a locked field in the proto message. When storing, the field will be set according to the return value of hasLock. And when loading, there is a new field in Procedure called lockedWhenLoading. We will set it to true if the locked field in proto message is true.
11239
11240 3. The reason why we can not set the locked field directly to true by calling doAcquireLock is that, during initialization, most procedures need to wait until master is initialized. So the solution here is that, we introduced a new method called waitInitialized in Procedure, and move the wait master initialized related code from acquireLock to this method. And we added a restoreLock method to Procedure, if lockedWhenLoading is true, we will call the acquireLock to get the lock, but do not set locked to true. And later when we call doAcquireLock and pass the waitInitialized check, we will test lockedWhenLoading, if it is true, when we just set the locked field to true and return, without actually calling the acquireLock method since we have already called it once.
11241
11242
11243 ---
11244
11245 * [HBASE-20672](https://issues.apache.org/jira/browse/HBASE-20672) | *Minor* | **New metrics ReadRequestRate and WriteRequestRate**
11246
11247 Exposing 2 new metrics in HBase to provide ReadRequestRate and WriteRequestRate at region server level. These metrics give the rate of request handled by the region server and are reset after every monitoring interval.
11248
11249
11250 ---
11251
11252 * [HBASE-6028](https://issues.apache.org/jira/browse/HBASE-6028) | *Minor* | **Implement a cancel for in-progress compactions**
11253
11254 Added a new command to the shell to switch on/off compactions called "compaction\_switch". Disabling compactions will interrupt any currently ongoing compactions. This setting will be lost on restart of the server. Added the configuration hbase.regionserver.compaction.enabled so user can enable/disable compactions via hbase-site.xml.
11255
11256
11257 ---
11258
11259 * [HBASE-20884](https://issues.apache.org/jira/browse/HBASE-20884) | *Major* | **Replace usage of our Base64 implementation with java.util.Base64**
11260
11261 Class org.apache.hadoop.hbase.util.Base64 has been removed in it's entirety from HBase 2+. In HBase 1, unused methods have been removed from the class and the audience was changed from  Public to Private. This class was originally intended as an internal utility class that could be used externally but thinking since changed; these classes should not have been advertised as public to end-users.
11262
11263 This represents an incompatible change for users who relied on this implementation. An alternative implementation for affected clients is available at java.util.Base64 when using Java 8 or newer; be aware, it may encode/decode differently. For clients seeking to restore this specific implementation, it is available in the public domain for download at http://iharder.sourceforge.net/current/java/base64/
11264
11265
11266 ---
11267
11268 * [HBASE-20357](https://issues.apache.org/jira/browse/HBASE-20357) | *Major* | **AccessControlClient API Enhancement**
11269
11270 This enhances the AccessControlClient APIs to retrieve the permissions based on namespace, table name, family and qualifier for specific user. AccessControlClient can also validate a user whether allowed to perform specified operations on a particular table.
11271 Following APIs have been added,
11272 1) getUserPermissions(Connection connection, String tableRegex, byte[] columnFamily, byte[] columnQualifier, String userName)
11273          Scope of retrieving permission will be same as existing.
11274 2) hasPermission(onnection connection, String tableName, byte[] columnFamily, byte[] columnQualifier, String userName, Permission.Action... actions)
11275      Scope of validating user privilege,
11276            User can perform self check without any special privilege but ADMIN privilege will be required to perform check for other users.
11277            For example, suppose there are two users "userA" & "userB" then there can be below scenarios,
11278             a. When userA want to check whether userA have privilege to perform mentioned actions
11279                  userA don't need ADMIN privilege, as it's a self query.
11280             b. When userA want to check whether userB have privilege to perform mentioned actions,
11281                  userA must have ADMIN or superuser privilege, as it's trying to query for other user.
11282
11283
11284
11285 # HBASE  2.1.0 Release Notes
11286
11287 These release notes cover new developer and user-facing incompatibilities, important issues, features, and major improvements.
11288
11289
11290 ---
11291
11292 * [HBASE-20691](https://issues.apache.org/jira/browse/HBASE-20691) | *Blocker* | **Storage policy should allow deferring to HDFS**
11293
11294 After HBASE-20691 we have changed the default setting of hbase.wal.storage.policy from "HOT" back to "NONE" which means we defer the policy to HDFS. This fixes the problem of release 2.0.0 that the storage policy of WAL directory will defer to HDFS and may not be "HOT" even if you explicitly set hbase.wal.storage.policy to "HOT"
11295
11296
11297 ---
11298
11299 * [HBASE-20839](https://issues.apache.org/jira/browse/HBASE-20839) | *Blocker* | **Fallback to FSHLog if we can not instantiated AsyncFSWAL when user does not specify AsyncFSWAL explicitly**
11300
11301 As we hack into the internal of DFSClient when implementing AsyncFSWAL to get better performance, a patch release of hadoop can make it broken.
11302
11303 So now, if user does not specify a wal provider, then we will first try to use 'asyncfs', i.e, the AsyncFSWALProvider. If we fail due to some compatible issues, we will fallback to 'filesystem', i.e, FSHLog.
11304
11305
11306 ---
11307
11308 * [HBASE-20193](https://issues.apache.org/jira/browse/HBASE-20193) | *Critical* | **Basic Replication Web UI - Regionserver**
11309
11310 After HBASE-20193, we add a section to web ui to show the replication status of each wal group. There are 2 parts of this section, they both show the peerId, wal group and current replicating log of each replication source. And one is showing the information of replication log queue, i.e. size of current log, log queue size and replicating offset. The other one is showing the delay of replication, i.e. last shipped age and replication delay.
11311 If the offset shows -1 and replication delay is UNKNOWN, that means replication is not started. This may be caused by this peer is disabled or the replicationEndpoint is sleeping due to some reason.
11312
11313
11314 ---
11315
11316 * [HBASE-19997](https://issues.apache.org/jira/browse/HBASE-19997) | *Blocker* | **[rolling upgrade] 1.x =\> 2.x**
11317
11318 Now we have a 'basically work' solution for rolling upgrade from 1.4.x to 2.x. Please see the "Rolling Upgrade from 1.x to 2.x" section in ref guide for more details.
11319
11320
11321 ---
11322
11323 * [HBASE-20270](https://issues.apache.org/jira/browse/HBASE-20270) | *Major* | **Turn off command help that follows all errors in shell**
11324
11325 <!-- markdown -->
11326 The command help that followed all errors, before, is now no longer available. Erroneous command inputs would now just show error-texts followed by the shell command to try for seeing the help message. It looks like: For usage try 'help “create”’. Operators can copy-paste the command to get the help message.
11327
11328
11329 ---
11330
11331 * [HBASE-20194](https://issues.apache.org/jira/browse/HBASE-20194) | *Critical* | **Basic Replication WebUI - Master**
11332
11333 After HBASE-20194, we added 2 parts to master's web page.
11334 One is Peers that shows all replication peers and some of their configurations, like peer id, cluster key, state, bandwidth, and which namespace or table it will replicate.
11335 The other one is replication status of all regionservers, we added a tab to region servers division, then we can check the replication delay of all region servers for any peer. This table shows AgeOfLastShippedOp, SizeOfLogQueue and ReplicationLag for each regionserver and the table is sort by ReplicationLag in descending order. By this way we can easily find the problematic region server. If the replication delay is UNKNOWN, that means this walGroup doesn't start replicate yet and it may get disabled. ReplicationLag will update once this peer start replicate.
11336
11337
11338 ---
11339
11340 * [HBASE-18569](https://issues.apache.org/jira/browse/HBASE-18569) | *Major* | **Add prefetch support for async region locator**
11341
11342 Add prefetch support for async region locator. The default value is 10. Set 'hbase.client.locate.prefetch.limit' in hbase-site.xml if you want to use another value for it.
11343
11344
11345 ---
11346
11347 * [HBASE-20642](https://issues.apache.org/jira/browse/HBASE-20642) | *Major* | **IntegrationTestDDLMasterFailover throws 'InvalidFamilyOperationException**
11348
11349 This changes client-side nonce generation to use the same nonce for re-submissions of client RPC DDL operations.
11350
11351
11352 ---
11353
11354 * [HBASE-20708](https://issues.apache.org/jira/browse/HBASE-20708) | *Blocker* | **Remove the usage of RecoverMetaProcedure in master startup**
11355
11356 Introduce an InitMetaProcedure to initialize meta table for a new HBase deploy. Marked RecoverMetaProcedure deprecated and remove the usage of it in the current code base. We still need to keep it in place for compatibility. The code in RecoverMetaProcedure has been moved to ServerCrashProcedure, and SCP will always be enabled and we will rely on it to bring meta region online.
11357
11358 For more on the issue addressed by this commit, see the design doc for overview and plan: https://docs.google.com/document/d/1\_872oHzrhJq4ck7f6zmp1J--zMhsIFvXSZyX1Mxg5MA/edit#heading=h.xy1z4alsq7uy
11359
11360
11361 ---
11362
11363 * [HBASE-20334](https://issues.apache.org/jira/browse/HBASE-20334) | *Major* | **add a test that expressly uses both our shaded client and the one from hadoop 3**
11364
11365 <!-- markdown -->
11366
11367 HBase now includes a helper script that can be used to run a basic functionality test for a given HBase installation at in `dev_support`. The test can optionally be given an HBase client artifact to rely on and can optionally be given specific Hadoop client artifacts to use.
11368
11369 For usage information see `./dev-support/hbase_nightly_pseudo-distributed-test.sh --help`.
11370
11371 The project nightly tests now make use of this test to check running on top of Hadoop 2, Hadoop 3, and Hadoop 3 with shaded client artifacts.
11372
11373
11374 ---
11375
11376 * [HBASE-19735](https://issues.apache.org/jira/browse/HBASE-19735) | *Major* | **Create a minimal "client" tarball installation**
11377
11378 <!-- markdown -->
11379
11380 The HBase convenience binary artifacts now includes a client focused tarball that a) includes more docs and b) does not include scripts or jars only needed for running HBase cluster services.
11381
11382 The new artifact is made as a normal part of the `assembly:single` maven command.
11383
11384
11385 ---
11386
11387 * [HBASE-20615](https://issues.apache.org/jira/browse/HBASE-20615) | *Major* | **emphasize use of shaded client jars when they're present in an install**
11388
11389 <!-- markdown -->
11390
11391 HBase's built in scripts now rely on the downstream facing shaded artifacts where possible. In particular interest to downstream users, the `hbase classpath` and `hbase mapredcp` commands now return the relevant shaded client artifact and only those third paty jars needed to make use of them (e.g. slf4j-api, commons-logging, htrace, etc).
11392
11393 Downstream users should note that by default the `hbase classpath` command will treat having `hadoop` on the shell's PATH as an implicit request to include the output of the `hadoop classpath` command in the returned classpath. This long-existing behavior can be opted out of by setting the environment variable `HBASE_DISABLE_HADOOP_CLASSPATH_LOOKUP` to the value "true". For example: `HBASE_DISABLE_HADOOP_CLASSPATH_LOOKUP="true" bin/hbase classpath`.
11394
11395
11396 ---
11397
11398 * [HBASE-20333](https://issues.apache.org/jira/browse/HBASE-20333) | *Critical* | **break up shaded client into one with no Hadoop and one that's standalone**
11399
11400 <!-- markdown -->
11401
11402 Downstream users who need to use both HBase and Hadoop APIs should switch to relying on the new `hbase-shaded-client-byo-hadoop` artifact rather than the existing `hbase-shaded-client` artifact. The new artifact no longer includes and Hadoop classes.
11403
11404 It should work in combination with either the output of `hadoop classpath` or the Hadoop provided client-facing shaded artifacts in Hadoop 3+.
11405
11406
11407 ---
11408
11409 * [HBASE-20332](https://issues.apache.org/jira/browse/HBASE-20332) | *Critical* | **shaded mapreduce module shouldn't include hadoop**
11410
11411 <!-- markdown -->
11412
11413 The `hbase-shaded-mapreduce` artifact no longer include its own copy of Hadoop classes. Users who make use of the artifact via YARN should be able to get these classes from YARN's classpath without having to make any changes.
11414
11415
11416 ---
11417
11418 * [HBASE-20681](https://issues.apache.org/jira/browse/HBASE-20681) | *Major* | **IntegrationTestDriver fails after HADOOP-15406 due to missing hamcrest-core**
11419
11420 <!-- markdown -->
11421
11422 Users of our integration tests on Hadoop 3 can now add all needed dependencies by pointing at jars included in our binary convenience artifact.
11423
11424 Prior to this fix, downstream users on Hadoop 3 would need to get a copy of the Hamcrest v1.3 jar from elsewhere.
11425
11426
11427 ---
11428
11429 * [HBASE-19852](https://issues.apache.org/jira/browse/HBASE-19852) | *Major* | **HBase Thrift 1 server SPNEGO Improvements**
11430
11431 Adds two new properties for hbase-site.xml for THRIFT SPNEGO when in HTTP mode:
11432 \* hbase.thrift.spnego.keytab.file
11433 \* hbase.thrift.spnego.principal
11434
11435
11436 ---
11437
11438 * [HBASE-20590](https://issues.apache.org/jira/browse/HBASE-20590) | *Critical* | **REST Java client is not able to negotiate with the server in the secure mode**
11439
11440 Adds a negotiation logic between a secure java REST client and server. After this jira the Java REST client will start responding to the Negotiate challenge sent by the server. Adds RESTDemoClient which can be used to verify whether the secure Java REST client works against secure REST server or not.
11441
11442
11443 ---
11444
11445 * [HBASE-20634](https://issues.apache.org/jira/browse/HBASE-20634) | *Critical* | **Reopen region while server crash can cause the procedure to be stuck**
11446
11447 A second attempt at fixing HBASE-20173. Fixes unfinished keeping of server state inside AM (ONLINE=\>SPLITTING=\>OFFLINE=\>null). Concurrent unassigns look at server state to figure if they should wait on SCP to wake them up or not.
11448
11449
11450 ---
11451
11452 * [HBASE-20579](https://issues.apache.org/jira/browse/HBASE-20579) | *Minor* | **Improve snapshot manifest copy in ExportSnapshot**
11453
11454 This patch adds an FSUtil.copyFilesParallel() to help copy files in parallel, and it will return all the paths of directories and files traversed. Thus when we copy manifest in ExportSnapshot, we can copy reference files concurrently and use the paths it returns to help setOwner and setPermission.
11455 The size of thread pool is determined by the configuration snapshot.export.copy.references.threads, and its default value is the number of runtime available processors.
11456
11457
11458 ---
11459
11460 * [HBASE-18116](https://issues.apache.org/jira/browse/HBASE-18116) | *Major* | **Replication source in-memory accounting should not include bulk transfer hfiles**
11461
11462 Before this change we would incorrectly include the size of enqueued store files for bulk replication in the calculation for determining whether or not to rate limit the transfer of WAL edits. Because bulk replication uses a separate and asynchronous mechanism for file transfer this could incorrectly limit the batch sizes for WAL replication if bulk replication in progress, with negative impact on latency and throughput.
11463
11464
11465 ---
11466
11467 * [HBASE-20592](https://issues.apache.org/jira/browse/HBASE-20592) | *Minor* | **Create a tool to verify tables do not have prefix tree encoding**
11468
11469 PreUpgradeValidator tool with DataBlockEncoding validator was added to verify cluster is upgradable to HBase 2.
11470
11471
11472 ---
11473
11474 * [HBASE-20501](https://issues.apache.org/jira/browse/HBASE-20501) | *Blocker* | **Change the Hadoop minimum version to 2.7.1**
11475
11476 <!-- markdown -->
11477 HBase is no longer able to maintain compatibility with Apache Hadoop versions that are no longer receiving updates. This release raises the minimum supported version to Hadoop 2.7.1. Downstream users are strongly advised to upgrade to the latest Hadoop 2.7 maintenance release.
11478
11479 Downstream users of earlier HBase versions are similarly advised to upgrade to Hadoop 2.7.1+. When doing so, it is especially important to follow the guidance from [the HBase Reference Guide's Hadoop section](http://hbase.apache.org/book.html#hadoop) on replacing the Hadoop artifacts bundled with HBase.
11480
11481
11482 ---
11483
11484 * [HBASE-20601](https://issues.apache.org/jira/browse/HBASE-20601) | *Minor* | **Add multiPut support and other miscellaneous to PE**
11485
11486 1. Add multiPut support
11487 Set --multiPut=number to enable batchput(meanwhile, --autoflush need be set to false)
11488
11489 2. Add Connection Count support
11490 Added a new parameter connCount to PE. set --connCount=2 means all threads will share 2 connections.
11491 oneCon option and connCount option shouldn't be set at the same time.
11492
11493 3. Add avg RT and avg TPS/QPS statstic for all threads
11494
11495 4. Delete some redundant code
11496 Now RandomWriteTest is inherited from SequentialWrite.
11497
11498
11499 ---
11500
11501 * [HBASE-20544](https://issues.apache.org/jira/browse/HBASE-20544) | *Blocker* | **downstream HBaseTestingUtility fails with invalid port**
11502
11503 <!-- markdown -->
11504
11505 HBase now relies on an internal mechanism to determine when it is running a local hbase cluster meant for external interaction vs an encapsulated test. When created via the `HBaseTestingUtility`, ports for Master and RegionServer services and UIs will be set to random ports to allow for multiple parallel uses on a single machine. Normally when running a Standalone HBase Deployment (as described in the HBase Reference Guide) the ports will be picked according to the same defaults used in a full cluster set up. If you wish to instead use the random port assignment set `hbase.localcluster.assign.random.ports` to true.
11506
11507
11508 ---
11509
11510 * [HBASE-20004](https://issues.apache.org/jira/browse/HBASE-20004) | *Minor* | **Client is not able to execute REST queries in a secure cluster**
11511
11512 Added 'hbase.rest.http.allow.options.method' configuration property to allow user to decide whether Rest Server HTTP should allow OPTIONS method or not. By default it is enabled in HBase 2.1.0+ versions and in other versions it is disabled.
11513 Similarly 'hbase.thrift.http.allow.options.method' is added HBase 1.5, 2.1.0 and 3.0.0 versions. It is disabled by default.
11514
11515
11516 ---
11517
11518 * [HBASE-20327](https://issues.apache.org/jira/browse/HBASE-20327) | *Minor* | **When qualifier is not specified, append and incr operation do not work (shell)**
11519
11520 This change will enable users to perform append and increment operation with null qualifier via hbase-shell.
11521
11522
11523 ---
11524
11525 * [HBASE-18842](https://issues.apache.org/jira/browse/HBASE-18842) | *Minor* | **The hbase shell clone\_snaphost command returns bad error message**
11526
11527 <!-- markdown -->
11528
11529 When attempting to clone a snapshot but using a namespace that does not exist, the HBase shell will now correctly report the exception as caused by the passed namespace. Previously, the shell would report that the problem was an unknown namespace but it would claim the user provided table name was not found as a namespace. Both before and after this change the shell properly used the passed namespace to attempt to handle the request.
11530
11531
11532 ---
11533
11534 * [HBASE-20406](https://issues.apache.org/jira/browse/HBASE-20406) | *Major* | **HBase Thrift HTTP - Shouldn't handle TRACE/OPTIONS methods**
11535
11536 <!-- markdown -->
11537 When configured to do thrift-over-http, the HBase Thrift API Server no longer accepts the HTTP methods TRACE nor OPTIONS.
11538
11539
11540 ---
11541
11542 * [HBASE-20046](https://issues.apache.org/jira/browse/HBASE-20046) | *Major* | **Reconsider the implementation for serial replication**
11543
11544 Now in replication we can make sure the order of pushing logs is same as the order of requests from client. Set the serial flag to true for a replication peer to enable this feature.
11545
11546
11547 ---
11548
11549 * [HBASE-20159](https://issues.apache.org/jira/browse/HBASE-20159) | *Major* | **Support using separate ZK quorums for client**
11550
11551 After HBASE-20159 we allow client to use different ZK quorums by introducing three new properties: hbase.client.zookeeper.quorum and hbase.client.zookeeper.property.clientPort to specify client zookeeper properties (note that the combination of these two properties should be different from the server ZK quorums), and hbase.client.zookeeper.observer.mode to indicate whether the client ZK nodes are in observer mode (false by default)
11552
11553 HConstants.DEFAULT\_ZOOKEPER\_CLIENT\_PORT has been removed in HBase 3.0 and replaced by the correctly spelled DEFAULT\_ZOOKEEPER\_CLIENT\_PORT.
11554
11555
11556 ---
11557
11558 * [HBASE-20242](https://issues.apache.org/jira/browse/HBASE-20242) | *Major* | **The open sequence number will grow if we fail to open a region after writing the max sequence id file**
11559
11560 Now when opening a region, we will store the current max sequence id of the region to its max sequence id file instead of the 'next sequence id'. This could avoid the sequence id bumping when we fail to open a region, and also align to the behavior when we close a region.
11561
11562
11563 ---
11564
11565 * [HBASE-19024](https://issues.apache.org/jira/browse/HBASE-19024) | *Critical* | **Configurable default durability for synchronous WAL**
11566
11567 The default durability setting for the synchronous WAL is Durability.SYNC\_WAL, which triggers HDFS hflush() to flush edits to the datanodes. We also support Durability.FSYNC\_WAL, which instead triggers HDFS hsync() to flush \_and\_ fsync edits. This change introduces the new configuration setting "hbase.wal.hsync", defaulting to FALSE, that if set to TRUE changes the default durability setting for the synchronous WAL to  FSYNC\_WAL.
11568
11569
11570 ---
11571
11572 * [HBASE-19389](https://issues.apache.org/jira/browse/HBASE-19389) | *Critical* | **Limit concurrency of put with dense (hundreds) columns to prevent write handler exhausted**
11573
11574 After HBASE-19389 we introduced a RegionServer self-protection mechanism to prevent write handler getting exhausted by high concurrency put with dense columns, mainly through two new properties: hbase.region.store.parallel.put.limit.min.column.count to decide what kind of put (with how many columns within a single column family) to limit (100 by default) and hbase.region.store.parallel.put.limit to limit the concurrency (10 by default). There's another property for advanced user and please check source and javadoc of StoreHotnessProtector for more details.
11575
11576
11577 ---
11578
11579 * [HBASE-20148](https://issues.apache.org/jira/browse/HBASE-20148) | *Major* | **Make serial replication as a option for a peer instead of a table**
11580
11581 A new method setSerial has been added to the interface ReplicationPeerConfigBuilder which is marked as IA.Public. This interface is not supposed to be implemented by client code, but if you do, this will be an incompatible change as you need to add this method to your implementation too.
11582
11583
11584 ---
11585
11586 * [HBASE-19397](https://issues.apache.org/jira/browse/HBASE-19397) | *Major* | **Design  procedures for ReplicationManager to notify peer change event from master**
11587
11588 Introduce 5 procedures to do peer modifications:
11589 AddPeerProcedure
11590 RemovePeerProcedure
11591 UpdatePeerConfigProcedure
11592 EnablePeerProcedure
11593 DisablePeerProcedure
11594
11595 The procedures are all executed with the following stage:
11596 1. Call pre CP hook, if an exception is thrown then give up
11597 2. Check whether the operation is valid, if not then give up
11598 3. Update peer storage. Notice that if we have entered this stage, then we can not rollback any more.
11599 4. Schedule sub procedures to refresh the peer config on every RS.
11600 5. Do post cleanup if any.
11601 6. Call post CP hook. The exception thrown will be ignored since we have already done the work.
11602
11603 The procedure will hold an exclusive lock on the peer id, so now there is no concurrent modifications on a single peer.
11604
11605 And now it is guaranteed that once the procedure is done, the peer modification has already taken effect on all RSes.
11606
11607 Abstracte a storage layer for replication peer/queue manangement, and refactored the upper layer to remove zk related naming/code/comment.
11608
11609 Add pre/postExecuteProcedures CP hooks to RegionServerObserver, and add permission check for executeProcedures method which requires the caller to be system user or super user.
11610
11611 On rolling upgrade: just do not do any replication peer modifications during the rolling upgrading. There is no pb/layout changes on the peer/queue storage on zk.
11612 # HBASE  2.0.0 Release Notes
11613
11614
11615 These release notes cover new developer and user-facing incompatibilities, important issues, features, and major improvements.
11616
11617
11618 ---
11619
11620 * [HBASE-20464](https://issues.apache.org/jira/browse/HBASE-20464) | *Major* | **Disable IMC**
11621
11622 Change the default so that on creation of new tables, In-Memory Compaction BASIC is NOT enabled.
11623
11624 This change is in branch-2.0 only, not in branch-2.
11625
11626
11627 ---
11628
11629 * [HBASE-20276](https://issues.apache.org/jira/browse/HBASE-20276) | *Blocker* | **[shell] Revert shell REPL change and document**
11630
11631 <!-- markdown -->
11632
11633
11634
11635 The HBase shell now behaves as it did prior to the changes that started in HBASE-15965. Namely, some shell commands return values that may be further manipulated within the shell's IRB session.
11636
11637 The command line option `--return-values` is no longer acted on by the shell since it now always behaves as it did when passed this parameter. Passing the option results in a harmless warning about this change.
11638
11639 Users who wish to maintain the behavior seen in the 1.4.0-1.4.2 releases of the HBase shell should refer to the section _irbrc_ in the reference guide for how to configure their IRB session to avoid echoing expression results to the console.
11640
11641
11642 ---
11643
11644 * [HBASE-18792](https://issues.apache.org/jira/browse/HBASE-18792) | *Blocker* | **hbase-2 needs to defend against hbck operations**
11645
11646 As of HBase version 2.0, the hbck tool is significantly changed. In general, all Read-Only options are supported and can be be used safely. Most -fix/ -repair options are NOT supported. Please see usage below for details on which options are not supported:
11647
11648
11649 Usage: fsck [opts] {only tables}
11650  where [opts] are:
11651    -help Display help options (this)
11652    -details Display full report of all regions.
11653    -timelag \<timeInSeconds\>  Process only regions that  have not experienced any metadata updates in the last  \<timeInSeconds\> seconds.
11654    -sleepBeforeRerun \<timeInSeconds\> Sleep this many seconds before checking if the fix worked if run with -fix
11655    -summary Print only summary of the tables and status.
11656    -metaonly Only check the state of the hbase:meta table.
11657    -sidelineDir \<hdfs://\> HDFS path to backup existing meta.
11658    -boundaries Verify that regions boundaries are the same between META and store files.
11659    -exclusive Abort if another hbck is exclusive or fixing.
11660
11661   Datafile Repair options: (expert features, use with caution!)
11662    -checkCorruptHFiles     Check all Hfiles by opening them to make sure they are valid
11663    -sidelineCorruptHFiles  Quarantine corrupted HFiles.  implies -checkCorruptHFiles
11664
11665  Replication options
11666    -fixReplication   Deletes replication queues for removed peers
11667
11668   Metadata Repair options supported as of version 2.0: (expert features, use with caution!)
11669    -fixVersionFile   Try to fix missing hbase.version file in hdfs.
11670    -fixReferenceFiles  Try to offline lingering reference store files
11671    -fixHFileLinks  Try to offline lingering HFileLinks
11672    -noHdfsChecking   Don't load/check region info from HDFS. Assumes hbase:meta region info is good. Won't check/fix any HDFS issue, e.g. hole, orphan, or overlap
11673    -ignorePreCheckPermission  ignore filesystem permission pre-check
11674
11675 NOTE: Following options are NOT supported as of HBase version 2.0+.
11676
11677   UNSUPPORTED Metadata Repair options: (expert features, use with caution!)
11678    -fix              Try to fix region assignments.  This is for backwards compatiblity
11679    -fixAssignments   Try to fix region assignments.  Replaces the old -fix
11680    -fixMeta          Try to fix meta problems.  This assumes HDFS region info is good.
11681    -fixHdfsHoles     Try to fix region holes in hdfs.
11682    -fixHdfsOrphans   Try to fix region dirs with no .regioninfo file in hdfs
11683    -fixTableOrphans  Try to fix table dirs with no .tableinfo file in hdfs (online mode only)
11684    -fixHdfsOverlaps  Try to fix region overlaps in hdfs.
11685    -maxMerge \<n\>     When fixing region overlaps, allow at most \<n\> regions to merge. (n=5 by default)
11686    -sidelineBigOverlaps  When fixing region overlaps, allow to sideline big overlaps
11687    -maxOverlapsToSideline \<n\>  When fixing region overlaps, allow at most \<n\> regions to sideline per group. (n=2 by default)
11688    -fixSplitParents  Try to force offline split parents to be online.
11689    -removeParents    Try to offline and sideline lingering parents and keep daughter regions.
11690    -fixEmptyMetaCells  Try to fix hbase:meta entries not referencing any region (empty REGIONINFO\_QUALIFIER rows)
11691
11692   UNSUPPORTED Metadata Repair shortcuts
11693    -repair           Shortcut for -fixAssignments -fixMeta -fixHdfsHoles -fixHdfsOrphans -fixHdfsOverlaps -fixVersionFile -sidelineBigOverlaps -fixReferenceFiles-fixHFileLinks
11694    -repairHoles      Shortcut for -fixAssignments -fixMeta -fixHdfsHoles
11695
11696
11697 ---
11698
11699 * [HBASE-19994](https://issues.apache.org/jira/browse/HBASE-19994) | *Major* | **Create a new class for RPC throttling exception, make it retryable.**
11700
11701 A new RpcThrottlingException deprecates ThrottlingException. The new RpcThrottlingException is a retryable Exception that clients will retry when Rpc throttling quota is exceeded. The deprecated ThrottlingException is a nonretryable Exception.
11702
11703
11704 ---
11705
11706 * [HBASE-20224](https://issues.apache.org/jira/browse/HBASE-20224) | *Blocker* | **Web UI is broken in standalone mode**
11707
11708 Standalone webui was broken inadvertently by HBASE-20027.
11709
11710
11711 ---
11712
11713 * [HBASE-18784](https://issues.apache.org/jira/browse/HBASE-18784) | *Major* | **Use of filesystem that requires hflush / hsync / append / etc should query outputstream capabilities**
11714
11715 <!-- markdown -->
11716
11717
11718
11719 If HBase is run on top of Apache Hadoop libraries that support the needed APIs it will verify that underlying Filesystem implementations provide the needed durability mechanisms to safely operate. The needed APIs *should* be present in Hadoop 3 release and Hadoop 2 releases starting in the Hadoop 2.9 series. If the APIs are not available, HBase behaves as it has in previous releases (that is, it moves forward assuming such a check would pass).
11720
11721 Where this check fails, it is unsafe to rely on HBase in a production setting. In the event of process or node failure, the HBase RegionServer process may fail to have access to all the data it previously wrote to its write ahead log, resulting in data loss. In the event of process or node failure, the HBase master process may lose all or part of the write ahead log that it relies on for cluster management operations, leaving the cluster in an inconsistent state that we aren't sure it could recover from.
11722
11723 Notably, the LocalFileSystem implementation provided by Hadoop reports (accurately) via these new APIs that it can not provide the durability HBase needs to operate. As such, the current instructions for single-node HBase operation have been updated both with a) how to bypass this safety check and b) a strong warning about the dire consequences of doing so outside of a dev/test environment.
11724
11725
11726 ---
11727
11728 * [HBASE-20219](https://issues.apache.org/jira/browse/HBASE-20219) | *Critical* | **An error occurs when scanning with reversed=true and loadColumnFamiliesOnDemand=true**
11729
11730 Throws DoNotRetryIOException when you ask for a reverse scan loading adjacent column families on demand. Previous it threw IllegalStateException
11731
11732
11733 ---
11734
11735 * [HBASE-20358](https://issues.apache.org/jira/browse/HBASE-20358) | *Minor* | **Fix bin/hbase thrift usage text**
11736
11737 Cleanup usage message and command-line processing (no functional change).
11738
11739
11740 ---
11741
11742 * [HBASE-20182](https://issues.apache.org/jira/browse/HBASE-20182) | *Blocker* | **Can not locate region after split and merge**
11743
11744 Now if we hit a split parent when locating a region, we will skip to the next row and try again until the region does not contain our row. So there will be no RegionOfflineException for a split parent any more, instead, if the split children have not been onlined yet, i.e, we finally arrive at a region which does not contain our row, an IOException will be thrown.
11745
11746
11747 ---
11748
11749 * [HBASE-20149](https://issues.apache.org/jira/browse/HBASE-20149) | *Critical* | **Purge dev javadoc from bin tarball (or make a separate tarball of javadoc)**
11750
11751 We no longer include dev or dev test javadocs in our binary bundle. We still build them; they are just not included because they were half the size of the resultant tarball.
11752
11753 Here is our story on javadoc as of this commit:
11754
11755  \* apidocs - user facing main api javadocs. currently for a release line, published on website and linked from menu. included in the bin tarball
11756  \* devapidocs - hbase internal javadocs. currently for a release line, published on the website but not linked from the menu. no longer included in the bin tarball.
11757  \* testapidocs - user facing test scope api javadocs. currently for a release line, not published. included in the bin tarball.
11758  \* testdevapidocs - hbase internal test scope javadocs. currently for a release line, not published. no longer included in the bin tarball
11759
11760
11761 ---
11762
11763 * [HBASE-18828](https://issues.apache.org/jira/browse/HBASE-18828) | *Blocker* | **[2.0] Generate CHANGES.txt**
11764
11765 Moves us over to yetus releasedocmaker tooling generating CHANGES. CHANGES is not markdown (CHANGES.md) as opposed to CHANGES.txt. We've also added a new RELEASENOTES.md that lists JIRA release notes (courtesy of releasedocmaker).
11766
11767 CHANGES/RELEASENOTES are current as of now. Will need a 'freshening' when we cut the RC.
11768
11769
11770 ---
11771
11772 * [HBASE-14175](https://issues.apache.org/jira/browse/HBASE-14175) | *Critical* | **Adopt releasedocmaker for better generated release notes**
11773
11774 We will use yetus releasedocmaker to make our changes doc from here on out. A CHANGELOG.md will replace our current CHANGES.txt. Adjacent, we'll keep up a RELEASENOTES.md doc courtesy of releasedocmaker.
11775
11776 Over in HBASE-18828 is where we are working through steps for the RM integrating this new tooling.
11777
11778
11779 ---
11780
11781 * [HBASE-16499](https://issues.apache.org/jira/browse/HBASE-16499) | *Critical* | **slow replication for small HBase clusters**
11782
11783 Changed the default value for replication.source.ratio from 0.1 to 0.5. Which means now by default 50% of the total RegionServers in peer cluster(s) will participate in replication.
11784
11785
11786 ---
11787
11788 * [HBASE-16459](https://issues.apache.org/jira/browse/HBASE-16459) | *Trivial* | **Remove unused hbase shell --format option**
11789
11790 <!-- markdown -->
11791
11792
11793
11794
11795 The HBase `shell` command no longer recognizes the option `--format`. Previously this option only recognized the default value of 'console'. The default value is now always used.
11796
11797
11798 ---
11799
11800 * [HBASE-20259](https://issues.apache.org/jira/browse/HBASE-20259) | *Critical* | **Doc configs for in-memory-compaction and add detail to in-memory-compaction logging**
11801
11802 Disables in-memory compaction as default.
11803
11804 Adds logging of in-memory compaction configuration on creation.
11805
11806 Adds a chapter to the refguide on this new feature.
11807
11808
11809 ---
11810
11811 * [HBASE-20282](https://issues.apache.org/jira/browse/HBASE-20282) | *Major* | **Provide short name invocations for useful tools**
11812
11813 \`hbase regionsplitter\` is a new short invocation for \`hbase org.apache.hadoop.hbase.util.RegionSplitter\`
11814
11815
11816 ---
11817
11818 * [HBASE-20314](https://issues.apache.org/jira/browse/HBASE-20314) | *Major* | **Precommit build for master branch fails because of surefire fork fails**
11819
11820 Upgrade surefire plugin to 2.21.0.
11821
11822
11823 ---
11824
11825 * [HBASE-20130](https://issues.apache.org/jira/browse/HBASE-20130) | *Critical* | **Use defaults (16020 & 16030) as base ports when the RS is bound to localhost**
11826
11827 <!-- markdown -->
11828
11829
11830
11831 When region servers bind to localhost (mostly in pseudo distributed mode), default ports (16020 & 16030) are used as base ports. This will support up to 9 instances of region servers by default with `local-regionservers.sh` script. If additional instances are needed, see the reference guide on how to deploy with a different range using the environment variables `HBASE_RS_BASE_PORT` and `HBASE_RS_INFO_BASE_PORT`.
11832
11833
11834 ---
11835
11836 * [HBASE-20111](https://issues.apache.org/jira/browse/HBASE-20111) | *Critical* | **Able to split region explicitly even on shouldSplit return false from split policy**
11837
11838 When a split is requested on a Region, the RegionServer hosting that Region will now consult the configured SplitPolicy for that table when determining if a split of that Region is allowed. When a split is disallowed (due to the Region not being OPEN or the SplitPolicy denying the request), the operation will \*not\* be implicitly retried as it has previously done. Users will need to guard against and explicitly retry region split requests which are denied by the system.
11839
11840
11841 ---
11842
11843 * [HBASE-20223](https://issues.apache.org/jira/browse/HBASE-20223) | *Blocker* | **Use hbase-thirdparty 2.1.0**
11844
11845 Moves commons-cli and commons-collections4 into the HBase thirdparty shaded jar which means that these are no longer generally available for users on the classpath.
11846
11847
11848 ---
11849
11850 * [HBASE-19128](https://issues.apache.org/jira/browse/HBASE-19128) | *Major* | **Purge Distributed Log Replay from codebase, configurations, text; mark the feature as unsupported, broken.**
11851
11852 Removes Distributed Log Replay feature. Disable the feature before upgrading.
11853
11854
11855 ---
11856
11857 * [HBASE-19504](https://issues.apache.org/jira/browse/HBASE-19504) | *Major* | **Add TimeRange support into checkAndMutate**
11858
11859 1) checkAndMutate accept a TimeRange to query the specified cell
11860 2) remove writeToWAL flag from Region#checkAndMutate since it is useless (this is a incompatible change)
11861
11862
11863 ---
11864
11865 * [HBASE-20237](https://issues.apache.org/jira/browse/HBASE-20237) | *Critical* | **Put back getClosestRowBefore and throw UnknownProtocolException instead... for asynchbase client**
11866
11867 Throw UnknownProtocolException if a client connects and tries to invoke the old getClosestRowOrBefore method. Pre-hbase-1.0.0 or asynchbase do this instead of using its replacement, the reverse Scan.
11868
11869 getClosestRowOrBefore was implemented as a flag on Get. Before this patch though the flag was set, hbase2 were ignoring it. This made it look like a pre-1.0.0 client was 'working' but then it'd fail finding the appropriate Region for a client-specified row doing lookups into hbase:meta.
11870
11871
11872 ---
11873
11874 * [HBASE-20247](https://issues.apache.org/jira/browse/HBASE-20247) | *Major* | **Set version as 2.0.0 in branch-2.0 in prep for first RC**
11875
11876 Set version as 2.0.0 on branch-2.0.
11877
11878
11879 ---
11880
11881 * [HBASE-20090](https://issues.apache.org/jira/browse/HBASE-20090) | *Major* | **Properly handle Preconditions check failure in MemStoreFlusher$FlushHandler.run**
11882
11883 When there is concurrent region split, MemStoreFlusher may not find flushable region if the only candidate region left hasn't received writes (resulting in 0 data size).
11884 After this JIRA, such scenario wouldn't trigger Precondition assertion (replaced by an if statement to see whether there is any flushable region).
11885 If there is no flushable region, a DEBUG log would appear in region server log, saying "Above memory mark but there is no flushable region".
11886
11887
11888 ---
11889
11890 * [HBASE-19552](https://issues.apache.org/jira/browse/HBASE-19552) | *Major* | **update hbase to use new thirdparty libs**
11891
11892 hbase-thirdparty libs have moved to o.a.h.thirdparty offset. Netty shading system property is no longer necessary.
11893
11894
11895 ---
11896
11897 * [HBASE-20119](https://issues.apache.org/jira/browse/HBASE-20119) | *Minor* | **Introduce a pojo class to carry coprocessor information in order to make TableDescriptorBuilder accept multiple cp at once**
11898
11899 1) Make all methods in TableDescriptorBuilder be setter pattern.
11900 addCoprocessor -\> setCoprocessor
11901 addColumnFamily -\> setColumnFamily
11902 (addCoprocessor and addColumnFamily are still in branch-2 but they are marked as deprecated)
11903 2) add CoprocessorDescriptor to carry cp information
11904 3) add CoprocessorDescriptorBuilder to build CoprocessorDescriptor
11905 4) TD disallow user to set negative priority to coprocessor since parsing the negative value will cause a exception
11906
11907
11908 ---
11909
11910 * [HBASE-17165](https://issues.apache.org/jira/browse/HBASE-17165) | *Critical* | **Add retry to LoadIncrementalHFiles tool**
11911
11912 Adds retry to load of incremental hfiles. Pertinent key is HConstants.HBASE\_CLIENT\_RETRIES\_NUMBER. Default is HConstants.DEFAULT\_HBASE\_CLIENT\_RETRIES\_NUMBER.
11913
11914
11915 ---
11916
11917 * [HBASE-20108](https://issues.apache.org/jira/browse/HBASE-20108) | *Critical* | **\`hbase zkcli\` falls into a non-interactive prompt after HBASE-15199**
11918
11919 This issue fixes a runtime dependency issues where JLine is not made available on the classpath which causes the ZooKeeper CLI to appear non-interactive. JLine was being made available unintentionally via the JRuby jar file on the classpath for the HBase shell. While the JRuby jar is not always present, the fix made here was to selectively include the JLine dependency on the zkcli command's classpath.
11920
11921
11922 ---
11923
11924 * [HBASE-8770](https://issues.apache.org/jira/browse/HBASE-8770) | *Blocker* | **deletes and puts with the same ts should be resolved according to mvcc/seqNum**
11925
11926 This behavior is available as a new feature. See HBASE-15968 release note.
11927
11928 This issue is just about adding to the refguide documentation on the HBASE\_15968 feature.
11929
11930
11931 ---
11932
11933 * [HBASE-19114](https://issues.apache.org/jira/browse/HBASE-19114) | *Major* | **Split out o.a.h.h.zookeeper from hbase-server and hbase-client**
11934
11935 Splits out most of ZooKeeper related code into a separate new module: hbase-zookeeper.
11936 Also, renames some ZooKeeper related classes to follow a common naming pattern - "ZK" prefix - as compared to many different styles earlier.
11937
11938
11939 ---
11940
11941 * [HBASE-19437](https://issues.apache.org/jira/browse/HBASE-19437) | *Critical* | **Batch operation can't handle the null result for Append/Increment**
11942
11943 The result from server is changed from null to Result.EMPTY\_RESULT when Append/Increment operation can't retrieve any data from server,
11944
11945
11946 ---
11947
11948 * [HBASE-17448](https://issues.apache.org/jira/browse/HBASE-17448) | *Major* | **Export metrics from RecoverableZooKeeper**
11949
11950 Committed to master and branch-1
11951
11952
11953 ---
11954
11955 * [HBASE-19400](https://issues.apache.org/jira/browse/HBASE-19400) | *Major* | **Add missing security checks in MasterRpcServices**
11956
11957 Added ACL check to following Admin functions:
11958 enableCatalogJanitor, runCatalogJanitor, cleanerChoreSwitch, runCleanerChore, execProcedure, execProcedureWithReturn, normalize, normalizerSwitch, coprocessorService.
11959 When ACL is enabled, only those with ADMIN rights will be able to invoke these operations successfully.
11960
11961
11962 ---
11963
11964 * [HBASE-20048](https://issues.apache.org/jira/browse/HBASE-20048) | *Blocker* | **Revert serial replication feature**
11965
11966 Revert the serial replication feature from all branches. Plan to reimplement it soon and land onto 2.1 release line.
11967
11968
11969 ---
11970
11971 * [HBASE-19166](https://issues.apache.org/jira/browse/HBASE-19166) | *Blocker* | **AsyncProtobufLogWriter persists ProtobufLogWriter as class name for backward compatibility**
11972
11973 For backward compatibility, AsyncProtobufLogWriter uses "ProtobufLogWriter" as writer class name and SecureAsyncProtobufLogWriter uses "SecureProtobufLogWriter" as writer class name.
11974
11975
11976 ---
11977
11978 * [HBASE-18596](https://issues.apache.org/jira/browse/HBASE-18596) | *Blocker* | **[TEST] A hbase1 cluster should be able to replicate to a hbase2 cluster; verify**
11979
11980 Replication between versions verified as basically working. 0.98.25-SNAPSHOT to beta-2 hbase2 and a 1.2-ish version tried.
11981
11982
11983 ---
11984
11985 * [HBASE-20017](https://issues.apache.org/jira/browse/HBASE-20017) | *Blocker* | **BufferedMutatorImpl submit the same mutation repeatedly**
11986
11987 This change fixes multithreading issues in the implementation of BufferedMutator. BufferedMutator should not be used with 1.4 releases prior to 1.4.2.
11988
11989
11990 ---
11991
11992 * [HBASE-20032](https://issues.apache.org/jira/browse/HBASE-20032) | *Minor* | **Receving multiple warnings for missing reporting.plugins.plugin.version**
11993
11994 Add (latest) version elements missing from reporting plugins in top-level pom.
11995
11996
11997 ---
11998
11999 * [HBASE-19954](https://issues.apache.org/jira/browse/HBASE-19954) | *Major* | **Separate TestBlockReorder into individual tests to avoid ShutdownHook suppression error against hadoop3**
12000
12001 hadoop3 minidfscluster removes all shutdown handlers when the cluster goes down which made this test that does FS-stuff fail (Fix was to break up the test so each test method ran with an unadulterated FS).
12002
12003
12004 ---
12005
12006 * [HBASE-20014](https://issues.apache.org/jira/browse/HBASE-20014) | *Major* | **TestAdmin1 Times out**
12007
12008 Ups the overall test timeout from 10 minutes to 13minutes. 15minutes is the surefire timeout.
12009
12010
12011 ---
12012
12013 * [HBASE-20020](https://issues.apache.org/jira/browse/HBASE-20020) | *Critical* | **Make sure we throw DoNotRetryIOException when ConnectionImplementation is closed**
12014
12015 Add checkClosed to core Client methods. Avoid unnecessary retry.
12016
12017
12018 ---
12019
12020 * [HBASE-19978](https://issues.apache.org/jira/browse/HBASE-19978) | *Major* | **The keepalive logic is incomplete in ProcedureExecutor**
12021
12022 Completes keep-alive logic and then enables it; ProcedureExecutor Workers will spin up more threads when need settling back to the core count after the burst in demand has passed. Default keep-alive is one minute. Default core-count is CPUs/4 or 16, which ever is greater. Maximum is an arbitrary core-count \* 10 (a limit that should never be hit and if it is, there is something else very wrong).
12023
12024
12025 ---
12026
12027 * [HBASE-19950](https://issues.apache.org/jira/browse/HBASE-19950) | *Minor* | **Introduce a ColumnValueFilter**
12028
12029 ColumnValueFilter provides a way to fetch matched cells only by providing specified column, value and a comparator, which is different from SingleValueFilter, fetching an entire row as soon as a matched cell found.
12030
12031
12032 ---
12033
12034 * [HBASE-18294](https://issues.apache.org/jira/browse/HBASE-18294) | *Major* | **Reduce global heap pressure: flush based on heap occupancy**
12035
12036 A region is flushed if its memory component exceeds the region flush threshold.
12037 A flush policy decides which stores to flush by comparing the size of the store to a column-family-flush threshold.
12038 If the overall size of all memstores in the machine exceeds the bounds defined by the administrator (denoted global pressure) a region is selected and flushed.
12039 HBASE-18294 changes flush decisions to be based on heap-occupancy and not data (key-value) size, consistently across levels. This rolls back some of the changes by HBASE-16747. Specifically,
12040 (1) RSs, Regions and stores track their overall on-heap and off-heap occupancy,
12041 (2) A region is flushed when its on-heap+off-heap size exceeds the region flush threshold specified in hbase.hregion.memstore.flush.size,
12042 (3) The store to be flushed is chosen based on its on-heap+off-heap size
12043 (4) At the RS level, a flush is triggered when the overall on-heap exceeds the on-heap limit, or when the overall off-heap size exceeds the off-heap limit (low/high water marks).
12044
12045 Note that when the region flush size is set to XXmb a region flush may be triggered even before writing keys and values of size XX because the total heap occupancy of the region which includes additional metadata exceeded the threshold.
12046
12047
12048 ---
12049
12050 * [HBASE-19116](https://issues.apache.org/jira/browse/HBASE-19116) | *Critical* | **Currently the tail of hfiles with CellComparator\* classname makes it so hbase1 can't open hbase2 written hfiles; fix**
12051
12052 hbase-2.x sets KeyValue Comparators into the tail of hfiles rather than CellComparator, what it uses internally, just so hbase-1.x can continue to read hbase-2.x written hfiles.
12053
12054
12055 ---
12056
12057 * [HBASE-19948](https://issues.apache.org/jira/browse/HBASE-19948) | *Major* | **Since HBASE-19873, HBaseClassTestRule, Small/Medium/Large has different semantic**
12058
12059 In subtask, fixed doc and annotations to be more explicit that test timings are for the whole Test Fixture/Test Class/Test Suite NOT the test method only as we'd measuring up to this (tother subtasks untethered Categorization and test timeout such that all categories now have a ten minute timeout -- no test can run longer than ten minutes or it gets killed/timedout).
12060
12061
12062 ---
12063
12064 * [HBASE-16060](https://issues.apache.org/jira/browse/HBASE-16060) | *Blocker* | **1.x clients cannot access table state talking to 2.0 cluster**
12065
12066 By default, we mirror table state to zookeeper so hbase-1.x clients will work against an hbase-2 cluster (With this patch, hbase-1.x clients can do most Admin functions including table create; hbase-1.x clients can do all Table/DML against hbase-2 cluster).
12067
12068 Flag to disable mirroring is hbase.mirror.table.state.to.zookeeper; set it to false in Configuration.
12069
12070 Related, Master on startup will look to see if there are table state znodes left over by an hbase-1 instance. If any found, it will migrate the table state to hbase-2 setting the state into the hbase:meta table where table state is now kept. We will do this check on every Master start. Notion is that this will be overall beneficial with low impediment. To disable the migration check, set hbase.migrate.table.state.from.zookeeper to false.
12071
12072
12073 ---
12074
12075 * [HBASE-19900](https://issues.apache.org/jira/browse/HBASE-19900) | *Critical* | **Region-level exception destroy the result of batch**
12076
12077 This fix makes the following changes to how client handle the both of action result and region exception.
12078 1) honor the action result rather than region exception. If the action have both of true result and region exception, the action is fine as the exception is caused by other actions which are in the same region.
12079 2) honor the action exception rather than region exception. If the action have both of action exception and region exception, we deal with the action exception only. If we also handle the region exception for the same action, it will introduce the negative count of actions in progress. The AsyncRequestFuture#waitUntilDone will block forever.
12080
12081
12082 ---
12083
12084 * [HBASE-19841](https://issues.apache.org/jira/browse/HBASE-19841) | *Major* | **Tests against hadoop3 fail with StreamLacksCapabilityException**
12085
12086 HBaseTestingUtility now assumes that all clusters will use local storage until a MiniDFSCluster is started or assigned.
12087
12088
12089 ---
12090
12091 * [HBASE-19528](https://issues.apache.org/jira/browse/HBASE-19528) | *Major* | **Major Compaction Tool**
12092
12093 Tool allows you to compact a cluster with given concurrency of regionservers compacting at a given time.  If tool completes successfully everything requested for compaction will be compacted, regardless of region moves, splits and merges.
12094
12095
12096 ---
12097
12098 * [HBASE-19919](https://issues.apache.org/jira/browse/HBASE-19919) | *Major* | **Tidying up logging**
12099
12100 (I thought this change innocuous but I made work for a co-worker when I upped interval between log cleaner runs -- meant a smoke test failed because we were slow doing an expected cleanup).
12101
12102 Edit of log lines removing redundancy. Shorten thread names shown in log.  Made some log TRACE instead of DEBUG.  Capitalizations.
12103
12104 Upped log cleaner interval from every minute to every ten minutes. hbase.master.cleaner.interval
12105
12106 Lowered default count of threads started by Procedure Executor from count of CPUs to 1/4 of count of CPUs.
12107
12108
12109 ---
12110
12111 * [HBASE-19901](https://issues.apache.org/jira/browse/HBASE-19901) | *Major* | **Up yetus proclimit on nightlies**
12112
12113 Pass to yetus a dockermemlimit of 20G and a proclimit of 10000. Defaults are 4G and 1G respectively.
12114
12115
12116 ---
12117
12118 * [HBASE-19912](https://issues.apache.org/jira/browse/HBASE-19912) | *Minor* | **The flag "writeToWAL" of Region#checkAndRowMutate is useless**
12119
12120 Remove useless 'writeToWAL' flag of Region#checkAndRowMutate & related class
12121
12122
12123 ---
12124
12125 * [HBASE-19911](https://issues.apache.org/jira/browse/HBASE-19911) | *Major* | **Convert some tests from small to medium because they are timing out: TestNettyRpcServer, TestClientClusterStatus, TestCheckTestClasses**
12126
12127 Changed a few tests so they are medium sized rather than small size.
12128
12129 Also, upped the time we wait on small tests to 60seconds from 30seconds. Small tests are tests that run in 15seconds or less. What we changed was the timeout watcher. It is now more lax, more tolerant of dodgy infrastructure that might be running tests slowly.
12130
12131
12132 ---
12133
12134 * [HBASE-19892](https://issues.apache.org/jira/browse/HBASE-19892) | *Major* | **Checking 'patch attach' and yetus 0.7.0 and move to Yetus 0.7.0**
12135
12136 Moved our internal yetus reference from 0.6.0 to 0.7.0. Concurrently, I changed hadoopqa to run with 0.7.0 (by editing the config in jenkins).
12137
12138
12139 ---
12140
12141 * [HBASE-19873](https://issues.apache.org/jira/browse/HBASE-19873) | *Major* | **Add a CategoryBasedTimeout ClassRule for all UTs**
12142
12143 Along with @category -- small, medium, large -- all hbase tests must now carry a ClassRule as follows:
12144
12145 +  @ClassRule
12146 +  public static final HBaseClassTestRule CLASS\_RULE =
12147 +      HBaseClassTestRule.forClass(TestInterfaceAudienceAnnotations.class);
12148
12149 where the class changes by test.
12150
12151 Currently the classrule enforces timeout for the whole test suite -- i.e. if a SmallTest Category then all the tests in the TestSuite must complete inside 60seconds, the timeout we set on SmallTest Category test suite -- but is meant to be a repository for general, runtime, hbase test facility.
12152
12153
12154 ---
12155
12156 * [HBASE-19770](https://issues.apache.org/jira/browse/HBASE-19770) | *Critical* | **Add '--return-values' option to Shell to print return values of commands in interactive mode**
12157
12158 Introduces a new option to the HBase shell: -r, --return-values. When the shell is in "interactive" mode (default), the return value of shell commands are not returned to the user as they dirty the console output. For those who desire this functionality, the "--return-values" option restores the old functionality of the commands passing their return value to the user.
12159
12160
12161 ---
12162
12163 * [HBASE-15321](https://issues.apache.org/jira/browse/HBASE-15321) | *Major* | **Ability to open a HRegion from hdfs snapshot.**
12164
12165 HRegion.openReadOnlyFileSystemHRegion() provides the ability to open HRegion from a read-only hdfs snapshot.  Because hdfs snapshots are read-only, no cleanup happens when using this API.
12166
12167
12168 ---
12169
12170 * [HBASE-17513](https://issues.apache.org/jira/browse/HBASE-17513) | *Critical* | **Thrift Server 1 uses different QOP settings than RPC and Thrift Server 2 and can easily be misconfigured so there is no encryption when the operator expects it.**
12171
12172 This change fixes an issue where users could have unintentionally configured the HBase Thrift1 server to run without wire-encryption, when they believed they had configured the Thrift1 server to do so.
12173
12174
12175 ---
12176
12177 * [HBASE-19828](https://issues.apache.org/jira/browse/HBASE-19828) | *Major* | **Flakey TestRegionsOnMasterOptions.testRegionsOnAllServers**
12178
12179 Disables TestRegionsOnMasterOptions because Regions on Master does not work reliably; see HBASE-19831.
12180
12181
12182 ---
12183
12184 * [HBASE-18963](https://issues.apache.org/jira/browse/HBASE-18963) | *Major* | **Remove MultiRowMutationProcessor and implement mutateRows... methods using batchMutate()**
12185
12186 Modified HRegion.mutateRow() APIs to use batchMutate() instead of processRowsWithLocks() with MultiRowMutationProcessor. MultiRowMutationProcessor is removed to have single write path that uses batchMutate().
12187
12188
12189 ---
12190
12191 * [HBASE-19163](https://issues.apache.org/jira/browse/HBASE-19163) | *Major* | **"Maximum lock count exceeded" from region server's batch processing**
12192
12193 When there are many mutations against the same row in a batch, as each mutation will acquire a shared row lock, it will exceed the maximum shared lock count the java ReadWritelock supports (64k). Along with other optimization, the batch is divided into multiple possible minibatches. A new config is added to limit the maximum number of mutations in the minibatch.
12194
12195    \<property\>
12196     \<name\>hbase.regionserver.minibatch.size\</name\>
12197     \<value\>20000\</value\>
12198    \</property\>
12199 The default value is 20000.
12200
12201
12202 ---
12203
12204 * [HBASE-19739](https://issues.apache.org/jira/browse/HBASE-19739) | *Minor* | **Include thrift IDL files in HBase binary distribution**
12205
12206 Thrift IDLs are now shipped, bundled up in the respective hbase-\*thrift.jars (look for files ending in .thrift).
12207
12208
12209 ---
12210
12211 * [HBASE-11409](https://issues.apache.org/jira/browse/HBASE-11409) | *Major* | **Add more flexibility for input directory structure to LoadIncrementalHFiles**
12212
12213 Allows for users to bulk load entire tables from hdfs by specifying the parameter -loadTable.  This allows you to pass in a table level directory and have all regions column families bulk loaded, if you do not specify the -loadTable parameter LoadIncrementalHFiles will work as before. Note: you must have a pre-created table to run with -loadTable it will not create one for you.
12214
12215
12216 ---
12217
12218 * [HBASE-19769](https://issues.apache.org/jira/browse/HBASE-19769) | *Critical* | **IllegalAccessError on package-private Hadoop metrics2 classes in MapReduce jobs**
12219
12220 Client-side ZooKeeper metrics which were added to 2.0.0 alpha/beta releases cause issues when launching MapReduce jobs via {{yarn jar}} on the command line. This stems from ClassLoader separation issues that YARN implements. It was chosen that the easiest solution was to remove these ZooKeeper metrics entirely.
12221
12222
12223 ---
12224
12225 * [HBASE-19783](https://issues.apache.org/jira/browse/HBASE-19783) | *Minor* | **Change replication peer cluster key/endpoint from a not-null value to null is not allowed**
12226
12227 To reduce the confusing behavior, now when you call updatePeerConfig with empty ClusterKey or ReplicationEndpointImpl, but the value of field of the to-be-updated ReplicationPeerConfig is not null, we will throw exception instead of ignoring them.
12228
12229
12230 ---
12231
12232 * [HBASE-19483](https://issues.apache.org/jira/browse/HBASE-19483) | *Major* | **Add proper privilege check for rsgroup commands**
12233
12234 This JIRA aims at refactoring AccessController, using ACL as core library in CPs.
12235 1. Stripping out a public class AccessChecker from AccessController, using ACL as core library in CPs. AccessChecker don't have any dependency on anything CP related. Create it's instance from other CPS.
12236 2. Change the default value of hbase.security.authorization to false.
12237 3. Don't use CP hooks to check access in RSGroup. Use the access checker instance directly in functions of RSGroupAdminServiceImpl.
12238
12239
12240 ---
12241
12242 * [HBASE-19358](https://issues.apache.org/jira/browse/HBASE-19358) | *Major* | **Improve the stability of splitting log when do fail over**
12243
12244 After HBASE-19358 we introduced a new property hbase.split.writer.creation.bounded to limit the opening writers for each WALSplitter. If set to true, we won't open any writer for recovered.edits until the entries accumulated in memory reaching hbase.regionserver.hlog.splitlog.buffersize (which defaults at 128M) and will write and close the file in one go instead of keeping the writer open. It's false by default and we recommend to set it to true if your cluster has a high region load (like more than 300 regions per RS), especially when you observed obvious NN/HDFS slow down during hbase (single RS or cluster) failover.
12245
12246
12247 ---
12248
12249 * [HBASE-19651](https://issues.apache.org/jira/browse/HBASE-19651) | *Minor* | **Remove LimitInputStream**
12250
12251 HBase had copied from guava the file LmiitedInputStream. This commit removes the copied file in favor of (our internal, shaded) guava's ByteStreams.limit. Guava 14.0's LIS noted: "Use ByteStreams.limit(java.io.InputStream, long) instead. This class is scheduled to be removed in Guava release 15.0."
12252
12253
12254 ---
12255
12256 * [HBASE-19691](https://issues.apache.org/jira/browse/HBASE-19691) | *Critical* | **Do not require ADMIN permission for obtaining ClusterStatus**
12257
12258 This change reverts an unintentional requirement for global ADMIN permission to obtain cluster status from the active HMaster.
12259
12260
12261 ---
12262
12263 * [HBASE-19486](https://issues.apache.org/jira/browse/HBASE-19486) | *Major* | ** Periodically ensure records are not buffered too long by BufferedMutator**
12264
12265 The BufferedMutator now supports two settings that are used to ensure records do not stay too long in the buffer of a BufferedMutator. For periodically flushing the BufferedMutator there is now a "Timeout": "How old may the oldest record in the buffer be before we force a flush" and a "TimerTick": How often do we check if the timeout has been exceeded. Using these settings you can make the BufferedMutator automatically flush the write buffer if after the specified number of milliseconds no flush has occurred.
12266
12267 This is mainly useful in streaming scenarios (i.e. writing data into HBase using Apache Flink/Beam/Storm) where it is common (especially in a test/development situation) to see small unpredictable bursts of data that need to be written into HBase. When using the BufferedMutator till now the effect was that records would remain in the write buffer until the buffer was full or an explicit flush was triggered. In practice this would mean that the 'last few records' of a burst would remain in the write buffer until the next burst arrives filling the buffer to capacity and thus triggering a flush.
12268
12269
12270 ---
12271
12272 * [HBASE-19670](https://issues.apache.org/jira/browse/HBASE-19670) | *Major* | **Workaround: Purge User API building from branch-2 so can make a beta-1**
12273
12274 Disable filtering of User API based off yetus annotation done in doclet. See parent issue for build failure currently being worked on but not done in time for a beta-1.
12275
12276
12277 ---
12278
12279 * [HBASE-19282](https://issues.apache.org/jira/browse/HBASE-19282) | *Major* | **CellChunkMap Benchmarking and User Interface**
12280
12281 When MSLAB is in use (that is the default config) , we will always use the CellChunkMap indexing variant for in memory flushed Immutable segments. When MSLAB is turned off, we will use CellAraryMap. These can not be changed with any configs.  The in memory flush threshold been made to be default to 10% of region flush size. This can be turned using 'hbase.memstore.inmemoryflush.threshold.factor'.
12282
12283
12284 ---
12285
12286 * [HBASE-19628](https://issues.apache.org/jira/browse/HBASE-19628) | *Major* | **ByteBufferCell should extend ExtendedCell**
12287
12288 ByteBufferCell → ByteBufferExtendedCell
12289 MapReduceCell → MapReduceExtendedCell
12290 ByteBufferChunkCell → ByteBufferChunkKeyValue
12291 NoTagByteBufferChunkCell → NoTagByteBufferChunkKeyValue
12292 KeyOnlyByteBufferCell → KeyOnlyByteBufferExtendedCell
12293 TagRewriteByteBufferCell → TagRewriteByteBufferExtendedCell
12294 ValueAndTagRewriteByteBufferCell → ValueAndTagRewriteByteBufferExtendedCell
12295 EmptyByteBufferCell → EmptyByteBufferExtendedCell
12296 FirstOnRowByteBufferCell → FirstOnRowByteBufferExtendedCell
12297 LastOnRowByteBufferCell → LastOnRowByteBufferExtendedCell
12298 FirstOnRowColByteBufferCell → FirstOnRowColByteBufferExtendedCell
12299 FirstOnRowColTSByteBufferCell → FirstOnRowColTSByteBufferExtendedCell
12300 LastOnRowColByteBufferCell → LastOnRowColByteBufferCell
12301 OffheapDecodedCell → OffheapDecodedExtendedCell
12302
12303
12304 ---
12305
12306 * [HBASE-19576](https://issues.apache.org/jira/browse/HBASE-19576) | *Major* | **Introduce builder for ReplicationPeerConfig and make it immutable**
12307
12308 Add a ReplicationPeerConfigBuilder to create ReplicationPeerConfig and make ReplicationPeerConfig immutable. Meanwhile, deprecated set\* methods in ReplicationPeerConfig.
12309
12310
12311 ---
12312
12313 * [HBASE-10092](https://issues.apache.org/jira/browse/HBASE-10092) | *Critical* | **Move to slf4j**
12314
12315 We now have slf4j as our front-end. Be careful adding logging from here on out; make sure it slf4j.
12316
12317 From here on out, as us devs go, we need to convert log messages from being 'guarded' -- i.e. surrounded by if (LOG.isDebugEnabled...) -- to instead being parameterized log messages. e.g. the latter rather than the former in the below:
12318
12319 logger.debug("The new entry is "+entry+".");
12320 logger.debug("The new entry is {}.", entry);
12321
12322 See [1] for background on perf benefits.
12323
12324 Note, FATAL log level is not present in slf4j. It is noted as a Marker but won't show in logs as a LEVEL.
12325
12326 1.  https://www.slf4j.org/faq.html#logging\_performance
12327
12328
12329 ---
12330
12331 * [HBASE-19148](https://issues.apache.org/jira/browse/HBASE-19148) | *Blocker* | **Reevaluate default values of configurations**
12332
12333 Removed unused hbase.fs.tmp.dir from hbase-default.xml.
12334
12335 Upped hbase.master.fileSplitTimeout from 30s to 10minutes (suggested by production experience)
12336
12337 Added note that handler-count should be ~CPU count.
12338
12339 hbase.regionserver.logroll.multiplier has been changed from 0.95 to 0.5 AND the default block size has been doubled.
12340
12341 A few of the core configs are now dumped to the log on startup.
12342
12343
12344 ---
12345
12346 * [HBASE-19492](https://issues.apache.org/jira/browse/HBASE-19492) | *Major* | **Add EXCLUDE\_NAMESPACE and EXCLUDE\_TABLECFS support to replication peer config**
12347
12348 Add two new field:  EXCLUDE\_NAMESPACE and EXCLUDE\_TABLECFS to replication peer config.
12349
12350 If replicate\_all flag is true, it means all user tables will be replicated to peer cluster. Then allow config exclude namespaces or exclude table-cfs which can't be replicated to  peer cluster.
12351
12352 If replicate\_all flag is false, it means all user tables can't be replicated to peer cluster. Then allow to config namespaces or table-cfs which will be replicated to peer cluster.
12353
12354
12355 ---
12356
12357 * [HBASE-19494](https://issues.apache.org/jira/browse/HBASE-19494) | *Major* | **Create simple WALKey filter that can be plugged in on the Replication Sink**
12358
12359 Adds means of adding very basic filter on the sink side of replication. We already have a means of installing filter source-side, which is better place to filter edits before they are shipped over the network, but this facility is needed by hbase-indexer.
12360
12361 Set hbase.replication.sink.walentrysinkfilter with a no-param Constructor implementation. See test in patch for example.
12362
12363
12364 ---
12365
12366 * [HBASE-19112](https://issues.apache.org/jira/browse/HBASE-19112) | *Blocker* | **Suspect methods on Cell to be deprecated**
12367
12368 Adds method Cell#getType which returns enum describing Cell Type.
12369
12370 Deprecates the following Cell methods:
12371
12372  getTypeByte
12373  getSequenceId
12374  getTagsArray
12375  getTagsOffset
12376  getTagsLength
12377
12378 CPs trying to build cells should use RawCellBuilderFactory that supports  building cells with tags.
12379
12380
12381 ---
12382
12383 * [HBASE-14790](https://issues.apache.org/jira/browse/HBASE-14790) | *Major* | **Implement a new DFSOutputStream for logging WAL only**
12384
12385 Implement a FanOutOneBlockAsyncDFSOutput for writing WAL only, the WAL provider which uses this class is AsyncFSWALProvider.
12386
12387 It is based on netty, and will write to 3 DNs at the same time concurrently(fan-out) so generally it will lead to a lower latency. And it is also fail-fast, the stream will become unwritable immediately after there are any read/write errors, no pipeline recovery. You need to call recoverLease to force close the output for this case. And it only supports to write a file with a single block. For WAL this is a good behavior as we can always open a new file when the old one is broken. The performance analysis in HBASE-16890 shows that it has a better performance.
12388
12389 Behavior changes:
12390 1. As now we write to 3 DNs concurrently, according to the visibility guarantee of HDFS, the data will be available immediately when arriving at DN since all the DNs will be considered as the last one in pipeline. This means replication may read uncommitted data and replicate it to the remote cluster and cause data inconsistency. HBASE-14004 is used to solve the problem.
12391 2. There will be no sync failure. When the output is broken, we will open a new file and write all the unacked wal entries to the new file. This means that we may have duplicated entries in wal files. HBASE-14949 is used to solve this problem.
12392
12393
12394 ---
12395
12396 * [HBASE-15536](https://issues.apache.org/jira/browse/HBASE-15536) | *Critical* | **Make AsyncFSWAL as our default WAL**
12397
12398 Now the default WALProvider is AsyncFSWALProvider, i.e. 'asyncfs'.
12399 If you want to change back to use FSHLog, please add this in hbase-site.xml
12400 {code}
12401 \<property\>
12402 \<name\>hbase.wal.provider\</name\>
12403 \<value\>filesystem\</value\>
12404 \</property\>
12405 {code}
12406 If you want to use FSHLog with multiwal, please add this in hbase-site.xml
12407 {code}
12408 \<property\>
12409 \<name\>hbase.wal.regiongrouping.delegate.provider\</name\>
12410 \<value\>filesystem\</value\>
12411 \</property\>
12412 {code}
12413
12414 This patch also sets hbase.wal.async.use-shared-event-loop to false so WAL has its own netty event group.
12415
12416
12417 ---
12418
12419 * [HBASE-19462](https://issues.apache.org/jira/browse/HBASE-19462) | *Major* | **Deprecate all addImmutable methods in Put**
12420
12421 Deprecates Put#addImmutable as of release 2.0.0, this will be removed in HBase 3.0.0. Use {@link #add(Cell)} and {@link org.apache.hadoop.hbase.CellBuilder} instead
12422
12423
12424 ---
12425
12426 * [HBASE-19213](https://issues.apache.org/jira/browse/HBASE-19213) | *Minor* | **Align check and mutate operations in Table and AsyncTable**
12427
12428 In Table interface deprecate checkAndPut, checkAndDelete and checkAndMutate methods.
12429 Similarly to AsyncTable a new method was added to replace the deprecated ones: CheckAndMutateBuilder checkAndMutate(byte[] row, byte[] family) with CheckAndMutateBuilder interface which can be used to construct the checkAnd\*() operations.
12430
12431
12432 ---
12433
12434 * [HBASE-19134](https://issues.apache.org/jira/browse/HBASE-19134) | *Major* | **Make WALKey an Interface; expose Read-Only version to CPs**
12435
12436 Made WALKey an Interface and added a WALKeyImpl implementation. WALKey comes through to Coprocessors. WALKey is read-only.
12437
12438
12439 ---
12440
12441 * [HBASE-18169](https://issues.apache.org/jira/browse/HBASE-18169) | *Blocker* | **Coprocessor fix and cleanup before 2.0.0 release**
12442
12443 Refactor of Coprocessor API for hbase2. Purged methods that exposed too much of our internals. Other hooks were recast so they no longer took or returned internal classes; instead we pass Interfaces or read-only versions of implementations.
12444
12445 Here is some overview doc on changes in hbase2 for Coprocessors including detail on why the change was made:
12446 https://github.com/apache/hbase/blob/branch-2.0/dev-support/design-docs/Coprocessor\_Design\_Improvements-Use\_composition\_instead\_of\_inheritance-HBASE-17732.adoc
12447
12448
12449 ---
12450
12451 * [HBASE-19301](https://issues.apache.org/jira/browse/HBASE-19301) | *Major* | **Provide way for CPs to create short circuited connection with custom configurations**
12452
12453 Provided a way for the CP users to create a short circuitable connection with custom configs.
12454
12455 createConnection(Configuration) is added to MasterCoprocessorEnvironment, RegionServerCoprocessorEnvironment and RegionCoprocessorEnvironment.
12456
12457 The getConnection() method already available in these Env interfaces returns the cluster connection used by the server (which the server also uses) where as this new method will create a new connection on request. The difference from connection created using ConnectionFactory APIs is that this connection can short circuit the calls to same server avoiding the RPC paths. The connection will NOT be cached/maintained by server. That should be done the CPs.
12458
12459 Be careful creating Connections out of a Coprocessor. See the javadoc on these createConnection and getConnection.
12460
12461
12462 ---
12463
12464 * [HBASE-19357](https://issues.apache.org/jira/browse/HBASE-19357) | *Major* | **Bucket cache no longer L2 for LRU cache**
12465
12466 Removed cacheDataInL1 option for HCD
12467 BucketCache is no longer the L2 for LRU on heap cache. When BC is used, data blocks will be strictly on BC only where as index/bloom blocks are on LRU L1 cache.
12468 Config 'hbase.bucketcache.combinedcache.enabled' is removed. There is no way set combined mode = false. Means make BC as victim handler for LRU cache.
12469 This will be one more noticeable change when one uses BucketCache in File mode.  Then the system table's data block(Including the META table)  will be cached in Bucket Cache files only. Plain scan from META files alone test reveal that the throughput of file mode BC is almost half only.  But for META entries we have RegionLocation cache at client side connections. So this would not be a big concern in a real cluster usage. Will check more on this and probably fix even when we do tiered BucketCache.
12470
12471
12472 ---
12473
12474 * [HBASE-19430](https://issues.apache.org/jira/browse/HBASE-19430) | *Major* | **Remove the SettableTimestamp and SettableSequenceId**
12475
12476 All the cells which are used in server side are of ExtendedCell now.
12477
12478
12479 ---
12480
12481 * [HBASE-19295](https://issues.apache.org/jira/browse/HBASE-19295) | *Major* | **The Configuration returned by CPEnv should be read-only.**
12482
12483 CoprocessorEnvironment#getConfiguration returns a READ-ONLY Configuration. Attempts at altering the returned Configuration -- whether setting or adding resources -- will result in an IllegalStateException warning of the Read-only condition of the returned Configuration.
12484
12485
12486 ---
12487
12488 * [HBASE-19410](https://issues.apache.org/jira/browse/HBASE-19410) | *Major* | **Move zookeeper related UTs to hbase-zookeeper and mark them as ZKTests**
12489
12490 There is a new HBaseZKTestingUtility which can only start a mini zookeeper cluster. And we will publish sources for test-jar for all modules.
12491
12492
12493 ---
12494
12495 * [HBASE-19323](https://issues.apache.org/jira/browse/HBASE-19323) | *Major* | **Make netty engine default in hbase2**
12496
12497 NettyRpcServer is now our default RPC server replacing SimpleRpcServer.
12498
12499
12500 ---
12501
12502 * [HBASE-19426](https://issues.apache.org/jira/browse/HBASE-19426) | *Major* | **Move has() and setTimestamp() to Mutation**
12503
12504 Moves #has and #setTimestamp back up to Mutation from the subclass Put so available to other Mutation implementations.
12505
12506
12507 ---
12508
12509 * [HBASE-19384](https://issues.apache.org/jira/browse/HBASE-19384) | *Critical* | **Results returned by preAppend hook in a coprocessor are replaced with null from other coprocessor even on bypass**
12510
12511 When a coprocessor sets 'bypass', we will skip calling subsequent Coprocessors that may be stacked-up on the method invocation; e.g. if a prePut has three coprocessors hooked up, if the first coprocessor decides to set 'bypass', we will not call the two subsequent coprocessors (this is similar to the 'complete' functionality that was in hbase1, removed in hbase2).
12512
12513
12514 ---
12515
12516 * [HBASE-19408](https://issues.apache.org/jira/browse/HBASE-19408) | *Trivial* | **Remove WALActionsListener.Base**
12517
12518 1) remove the WALActionsListener.Base
12519 2) provide default method implementation to WALActionsListener
12520 The person who want to receive the notification of WAL events should implements the WALActionsListener rather than WALActionsListener.Base.
12521
12522
12523 ---
12524
12525 * [HBASE-19339](https://issues.apache.org/jira/browse/HBASE-19339) | *Critical* | **Eager policy results in the negative size of memstore**
12526
12527 Enable TestAcidGuaranteesWithEagerPolicy and TestAcidGuaranteesWithAdaptivePolicy
12528
12529
12530 ---
12531
12532 * [HBASE-19336](https://issues.apache.org/jira/browse/HBASE-19336) | *Major* | **Improve rsgroup to allow assign all tables within a specified namespace by only writing namespace**
12533
12534 Add two new shell cmd.
12535 move\_namespaces\_rsgroup is used to reassign tables of specified namespaces from one RegionServer group to another.
12536 move\_servers\_namespaces\_rsgroup is used to reassign regionServers and tables of specified namespaces from one group to another.
12537
12538
12539 ---
12540
12541 * [HBASE-19285](https://issues.apache.org/jira/browse/HBASE-19285) | *Critical* | **Add per-table latency histograms**
12542
12543 Per-RegionServer table latency histograms have been returned to HBase (after being removed due to impacting performance). These metrics are exposed via a new JMX bean "TableLatencies" with the typical naming conventions: namespace, table, and histogram component.
12544
12545
12546 ---
12547
12548 * [HBASE-19359](https://issues.apache.org/jira/browse/HBASE-19359) | *Major* | **Revisit the default config of hbase client retries number**
12549
12550 The default value of hbase.client.retries.number was 35. It is now 10.
12551 And for server side, the default hbase.client.serverside.retries.multiplier was 10. So the server side retries number was 35 \* 10 = 350. It is now 3.
12552
12553
12554 ---
12555
12556 * [HBASE-18090](https://issues.apache.org/jira/browse/HBASE-18090) | *Major* | **Improve TableSnapshotInputFormat to allow more multiple mappers per region**
12557
12558 In this task, we make it possible to run multiple mappers per region in the table snapshot. The following code is primary table snapshot mapper initializatio:
12559
12560 TableMapReduceUtil.initTableSnapshotMapperJob(
12561           snapshotName,                     // The name of the snapshot (of a table) to read from
12562           scan,                                      // Scan instance to control CF and attribute selection
12563           mapper,                                 // mapper
12564           outputKeyClass,                   // mapper output key
12565           outputValueClass,                // mapper output value
12566           job,                                       // The current job to adjust
12567           true,                                     // upload HBase jars and jars for any of the configured job classes via the distributed cache (tmpjars)
12568           restoreDir,                           // a temporary directory to copy the snapshot files into
12569 );
12570
12571 The job only run one map task per region in the table snapshot. With this feature, client can specify the desired num of mappers when init table snapshot mapper job：
12572
12573 TableMapReduceUtil.initTableSnapshotMapperJob(
12574           snapshotName,                     // The name of the snapshot (of a table) to read from
12575           scan,                                      // Scan instance to control CF and attribute selection
12576           mapper,                                 // mapper
12577           outputKeyClass,                   // mapper output key
12578           outputValueClass,                // mapper output value
12579           job,                                       // The current job to adjust
12580           true,                                     // upload HBase jars and jars for any of the configured job classes via the distributed cache (tmpjars)
12581           restoreDir,                           // a temporary directory to copy the snapshot files into
12582           splitAlgorithm,                     // splitAlgo algorithm to split, current split algorithms  support RegionSplitter.UniformSplit() and RegionSplitter.HexStringSplit()
12583           n                                         // how many input splits to generate per one region
12584 );
12585
12586
12587 ---
12588
12589 * [HBASE-19035](https://issues.apache.org/jira/browse/HBASE-19035) | *Major* | **Miss metrics when coprocessor use region scanner to read data**
12590
12591 1. Move read requests count to region level. Because RegionScanner is exposed to CP.
12592 2. Update write requests count in processRowsWithLocks.
12593 3. Remove requestRowActionCount in RSRpcServices. This metric can be computed by region's readRequestsCount and writeRequestsCount.
12594
12595
12596 ---
12597
12598 * [HBASE-19318](https://issues.apache.org/jira/browse/HBASE-19318) | *Critical* | **MasterRpcServices#getSecurityCapabilities explicitly checks for the HBase AccessController implementation**
12599
12600 Fixes an issue with loading customer coprocessor endpoint implementations inside of the HBase Master which breaks Apache Ranger.
12601
12602
12603 ---
12604
12605 * [HBASE-19092](https://issues.apache.org/jira/browse/HBASE-19092) | *Critical* | **Make Tag IA.LimitedPrivate and expose for CPs**
12606
12607 This JIRA aims at exposing Tags for Coprocessor usage.
12608 Tag interface is now exposed to Coprocessors and CPs can make use of this interface to create their own Tags.
12609 RawCell is a new interface that is a subtype of Cell and that is exposed to CPs. RawCell has the following APIs
12610
12611 List\<Tag\> getTags()
12612 Optional\<Tag\> getTag(byte type)
12613 byte[] cloneTags()
12614
12615 The above APIs helps to read tags from the Cell.
12616
12617 CellUtil#createCell(Cell cell, List\<Tag\> tags)
12618 CellUtil#createCell(Cell cell, byte[] tags)
12619 CellUtil#createCell(Cell cell, byte[] value, byte[] tags)
12620 are deprecated.
12621 If CPs want to create a cell with Tags they can use the RegionCoprocessorEnvironment#getCellBuilder() that returns an ExtendedCellBuilder.
12622 Using ExtendedCellBuilder the CP can create Cells with Tags. Other helper methods to work on Tags are available as static APIs in Tag interface.
12623
12624
12625 ---
12626
12627 * [HBASE-19266](https://issues.apache.org/jira/browse/HBASE-19266) | *Minor* | **TestAcidGuarantees should cover adaptive in-memory compaction**
12628
12629 separate the TestAcidGuarantees by the policy:
12630 1) NONE -\> TestAcidGuaranteesWithNoInMemCompaction
12631 2) BASIC -\> TestAcidGuaranteesWithBasicPolicy
12632 3) EAGER -\> TestAcidGuaranteesWithEagerPolicy
12633 4) ADAPTIVE -\> TestAcidGuaranteesWithAdaptivePolicy
12634
12635 TestAcidGuaranteesWithEagerPolicy and TestAcidGuaranteesWithAdaptivePolicy are disabled by default as the eager policy may cause the negative size of memstore.
12636
12637
12638 ---
12639
12640 * [HBASE-16868](https://issues.apache.org/jira/browse/HBASE-16868) | *Critical* | **Add a replicate\_all flag to avoid misuse the namespaces and table-cfs config of replication peer**
12641
12642 Add a replicate\_all flag to replication peer config. The default value is true, which means all user tables (REPLICATION\_SCOPE != 0 ) will be replicated to peer cluster.
12643
12644 How to config a peer from replicate all to only replicate special namespace/tablecfs?
12645 Step1. Add a new peer with no namespace/tablecfs config, the replicate\_all flag will be true automatically.
12646 Step2. User want only replicate some namespaces or tables, so set replicate\_all flag to false first.
12647 Step3. Add special namespaces or table-cfs config to the replication peer.
12648
12649 How to config a peer from replicate special namespace/tablecfs to replicate all?
12650 Step1. Add a new peer with special namespace/tablecfs config, the replicate\_all flag will be false automatically.
12651 Step2. User want replicate all user tables, so remove the special namespace/tablecfs config first.
12652 Step3. Set replicate\_all flag to true.
12653
12654 How to config replicate nothing?
12655 Set replicate\_all flag to false and no namespace/tablecfs config, then all tables cannot be replicated to peer cluster.
12656
12657
12658 ---
12659
12660 * [HBASE-19122](https://issues.apache.org/jira/browse/HBASE-19122) | *Critical* | **preCompact and preFlush can bypass by returning null scanner; shut it down**
12661
12662 Remove the ability to 'bypass' preFlush and preCompact by returning a null Scanner. Bypass is disallowed on these methods in hbase2.
12663
12664
12665 ---
12666
12667 * [HBASE-19200](https://issues.apache.org/jira/browse/HBASE-19200) | *Major* | **make hbase-client only depend on ZKAsyncRegistry and ZNodePaths**
12668
12669 ConnectionImplementation now uses asynchronous connections to zookeeper via ZKAsyncRegistry to get cluster id, master address, meta region location, etc.
12670 Since ZKAsyncRegistry uses curator framework, this change purges a lot of zookeeper dependencies in hbase-client.
12671 Now hbase-client only depends on only ZKAsyncRegistry, ZNodePaths and the newly introduced ZKMetadata.
12672
12673
12674 ---
12675
12676 * [HBASE-19311](https://issues.apache.org/jira/browse/HBASE-19311) | *Major* | **Promote TestAcidGuarantees to LargeTests and start mini cluster once to make it faster**
12677
12678 Introduce a AcidGuaranteesTestTool and expose as tool instead of TestAcidGuarantees. Now TestAcidGuarantees is just a UT.
12679
12680
12681 ---
12682
12683 * [HBASE-19293](https://issues.apache.org/jira/browse/HBASE-19293) | *Major* | **Support adding a new replication peer in disabled state**
12684
12685 Add a boolean parameter which means the new replication peer's state is enabled or disabled for Admin/AsyncAdmin's addReplicationPeer method. Meanwhile, you can use shell cmd to add a enabled/disabled replication peer. The STATE parameter is optional and the default state is enabled.
12686
12687 hbase\> add\_peer '1', CLUSTER\_KEY =\> "server1.cie.com:2181:/hbase", STATE =\> "ENABLED"
12688 hbase\> add\_peer '1', CLUSTER\_KEY =\> "server1.cie.com:2181:/hbase", STATE =\> "DISABLED"
12689
12690
12691 ---
12692
12693 * [HBASE-19123](https://issues.apache.org/jira/browse/HBASE-19123) | *Major* | **Purge 'complete' support from Coprocesor Observers**
12694
12695 This issue removes the 'complete' facility that was in ObserverContext. It is no longer possible for a Coprocessor to cut the chain-of-invocation and insist its response prevails.
12696
12697
12698 ---
12699
12700 * [HBASE-18911](https://issues.apache.org/jira/browse/HBASE-18911) | *Major* | **Unify Admin and AsyncAdmin's methods name**
12701
12702 Deprecated 4 methods for Admin interface.
12703 Deprecated compactRegionServer(ServerName, boolean). Use compactRegionServer(ServerName) and majorCompactcompactRegionServer(ServerName) instead.
12704 Deprecated getRegionLoad(ServerName) method. Use getRegionLoads(ServerName) instead.
12705 Deprecated getRegionLoad(ServerName, TableName) method. Use getRegionLoads(ServerName, TableName) instead.
12706 Deprecated getQuotaRetriever(QuotaFilter) instead. Use  getQuota(QuotaFilter) instead.
12707
12708 Add 7 methods for Admin interface.
12709 ServerName getMaster();
12710 Collection\<ServerName\> getBackupMasters();
12711 Collection\<ServerName\> getRegionServers();
12712 boolean splitSwitch(boolean enabled, boolean synchronous);
12713 boolean mergeSwitch(boolean enabled, boolean synchronous);
12714 boolean isSplitEnabled();
12715 boolean isMergeEnabled();
12716
12717
12718 ---
12719
12720 * [HBASE-18703](https://issues.apache.org/jira/browse/HBASE-18703) | *Critical* | **Inconsistent behavior for preBatchMutate in doMiniBatchMutate and processRowsWithLocks**
12721
12722 Two write paths Region.batchMutate() and Region.mutateRows() are unified and inconsistencies are resolved.
12723
12724
12725 ---
12726
12727 * [HBASE-18964](https://issues.apache.org/jira/browse/HBASE-18964) | *Major* | **Deprecate RowProcessor and processRowsWithLocks() APIs that take RowProcessor as an argument**
12728
12729 RowProcessor and Region#processRowsWithLocks() methods that take RowProcessor as an argument are deprecated. Use Coprocessors if you want to customize handling.
12730
12731
12732 ---
12733
12734 * [HBASE-19251](https://issues.apache.org/jira/browse/HBASE-19251) | *Major* | **Merge RawAsyncTable and AsyncTable**
12735
12736 Merge the RawAsyncTable and AsyncTable interfaces. Use generic to reflection the difference between the observer style scan API. For the implementation which does not have a user specified thread pool, the observer is AdvancedScanResultConsumer. For the implementation which needs a user specified thread pool, the observer is ScanResultConsumer.
12737
12738
12739 ---
12740
12741 * [HBASE-19262](https://issues.apache.org/jira/browse/HBASE-19262) | *Major* | **Revisit checkstyle rules**
12742
12743 Change the import order rule that now we should put the shaded import at bottom. Ignore the VisibilityModifier warnings for test code.
12744
12745
12746 ---
12747
12748 * [HBASE-19187](https://issues.apache.org/jira/browse/HBASE-19187) | *Minor* | **Remove option to create on heap bucket cache**
12749
12750 Removing the on heap Bucket cache feature.
12751 The config "hbase.bucketcache.ioengine" no longer support the 'heap' value.
12752 Its supported values now are 'offheap',  'file:\<path\>', 'files:\<path\>'  and 'mmap:\<path\>'
12753
12754
12755 ---
12756
12757 * [HBASE-12350](https://issues.apache.org/jira/browse/HBASE-12350) | *Minor* | **Backport error-prone build support to branch-1 and branch-2**
12758
12759 This change introduces compile time support for running the error-prone suite of static analyses. Enable with -PerrorProne on the Maven command line. Requires JDK 8 or higher. (Don't enable if building with JDK 7.)
12760
12761
12762 ---
12763
12764 * [HBASE-14350](https://issues.apache.org/jira/browse/HBASE-14350) | *Blocker* | **Procedure V2 Phase 2: Assignment Manager**
12765
12766 (Incomplete)
12767
12768 = Incompatbiles
12769
12770 == Coprocessor Incompatibilities
12771
12772 Split/Merge have moved to the Master; it runs them now. Means hooks around Split/Merge are now noops. To intercept Split/Merge phases, CPs need to intercept on MasterObserver.
12773
12774
12775 ---
12776
12777 * [HBASE-19189](https://issues.apache.org/jira/browse/HBASE-19189) | *Major* | **Ad-hoc test job for running a subset of tests lots of times**
12778
12779 <!-- markdown -->
12780
12781
12782 Folks can now test out tests on an arbitrary release branch. Head over to [builds.a.o job "HBase-adhoc-run-tests"](https://builds.apache.org/view/H-L/view/HBase/job/HBase-adhoc-run-tests/), then pick "Build with parameters".
12783 Tests are specified as just names e.g. TestLogRollingNoCluster. can also be a glob. e.g. TestHFile*
12784
12785
12786 ---
12787
12788 * [HBASE-19220](https://issues.apache.org/jira/browse/HBASE-19220) | *Major* | **Async tests time out talking to zk; 'clusterid came back null'**
12789
12790 Changed retries from 3 to 30 for zk initial connect for registry.
12791
12792
12793 ---
12794
12795 * [HBASE-19002](https://issues.apache.org/jira/browse/HBASE-19002) | *Minor* | **Introduce more examples to show how to intercept normal region operations**
12796
12797 With the change in Coprocessor APIs, the hbase-examples module has been updated to provide additional examples that show how to write Coprocessors against the new API.
12798
12799
12800 ---
12801
12802 * [HBASE-18961](https://issues.apache.org/jira/browse/HBASE-18961) | *Major* | **doMiniBatchMutate() is big, split it into smaller methods**
12803
12804 HRegion.batchMutate()/ doMiniBatchMutate() is refactored with aim to unify batchMutate() and mutateRows() code paths later. batchMutate() currently handles 2 types of batches: MutationBatchOperations and ReplayBatchOperations. Common base class BatchOperations is augmented with common methods which are overridden in derived classes as needed. doMiniBatchMutate() is implemented using common methods in base class BatchOperations.
12805
12806
12807 ---
12808
12809 * [HBASE-19103](https://issues.apache.org/jira/browse/HBASE-19103) | *Minor* | **Add BigDecimalComparator for filter**
12810
12811 If BigDecimal is stored as value, and you need to add a matched comparator to the value filter when scanning, a BigDecimalComparator can be used.
12812
12813
12814 ---
12815
12816 * [HBASE-19111](https://issues.apache.org/jira/browse/HBASE-19111) | *Critical* | **Add missing CellUtil#isPut(Cell) methods**
12817
12818 A new public API method was added to CellUtil "isPut(Cell)" for clients to use to determine if the Cell is for a Put operation.
12819
12820 Additionally, other CellUtil API calls which expose Cell-implementation were marked as deprecated and will be removed in a future version.
12821
12822
12823 ---
12824
12825 * [HBASE-19160](https://issues.apache.org/jira/browse/HBASE-19160) | *Critical* | **Re-expose CellComparator**
12826
12827 CellComparator is now InterfaceAudience.Public
12828
12829
12830 ---
12831
12832 * [HBASE-19131](https://issues.apache.org/jira/browse/HBASE-19131) | *Major* | **Add the ClusterStatus hook and cleanup other hooks which can be replaced by ClusterStatus hook**
12833
12834 1) Add preGetClusterStatus() and postGetClusterStatus() hooks
12835 2) add preGetClusterStatus() to access control check - an admin action
12836
12837
12838 ---
12839
12840 * [HBASE-19095](https://issues.apache.org/jira/browse/HBASE-19095) | *Major* | **Add CP hooks in RegionObserver for in memory compaction**
12841
12842 Add 4 methods in RegionObserver:
12843 preMemStoreCompaction
12844 preMemStoreCompactionCompactScannerOpen
12845 preMemStoreCompactionCompact
12846 postMemStoreCompaction
12847 preMemStoreCompaction and postMemStoreCompaction will always be called for all in memory compactions. Under eager mode, preMemStoreCompactionCompactScannerOpen will be called before opening store scanner to allow you changing the max versions and TTL, and preMemStoreCompactionCompact will be called after the creation to let you do wrapping.
12848
12849
12850 ---
12851
12852 * [HBASE-19152](https://issues.apache.org/jira/browse/HBASE-19152) | *Trivial* | **Update refguide 'how to build an RC' and the make\_rc.sh script**
12853
12854 The make\_rc.sh script can run an hbase2 build now generating tarballs and pushing up to maven repository. TODO: Sign and checksum, check tarball, push to apache dist.....
12855
12856
12857 ---
12858
12859 * [HBASE-19179](https://issues.apache.org/jira/browse/HBASE-19179) | *Critical* | **Remove hbase-prefix-tree**
12860
12861 Purged the hbase-prefix-tree module and all references from the code base.
12862
12863 prefix-tree data block encoding was a super cool experimental feature that saw some usage initially but has since languished. If interested in carrying this sweet facility forward, write the dev list and we'll restore this module.
12864
12865
12866 ---
12867
12868 * [HBASE-19176](https://issues.apache.org/jira/browse/HBASE-19176) | *Major* | **Remove hbase-native-client from branch-2**
12869
12870 Removed the hbase-native-client module from branch-2 (it is still in Master). It is not complete. Look for a finished C++ client in the near future. Will restore native client to branch-2 at that point.
12871
12872
12873 ---
12874
12875 * [HBASE-19144](https://issues.apache.org/jira/browse/HBASE-19144) | *Major* | **[RSgroups] Retry assignments in FAILED\_OPEN state when servers (re)join the cluster**
12876
12877 When regionserver placement groups (RSGroups) is active, as servers join the cluster the Master will attempt to reassign regions in FAILED\_OPEN state.
12878
12879
12880 ---
12881
12882 * [HBASE-18770](https://issues.apache.org/jira/browse/HBASE-18770) | *Critical* | **Remove bypass method in ObserverContext and implement the 'bypass' logic case by case**
12883
12884 Removes blanket bypass mechanism (Observer#bypass). Instead, a curated subset of methods are bypassable.
12885
12886     Changes Coprocessor ObserverContext 'bypass' semantic. We flip the
12887     default so bypass is NOT supported on Observer invocations; only a
12888     couple of preXXX methods in RegionObserver allow it: e.g.  preGet
12889     and prePut but not preFlush, etc. Everywhere else, we throw
12890     a Exception if a Coprocessor Observer tries to invoke bypass. Master
12891     Observers can no longer stop or change move, split, assign, create table, etc.
12892     preBatchMutate can no longer be bypassed (bypass the finer-grained
12893     prePut, preDelete, etc. instead)
12894
12895     Ditto on complete, the mechanism that allowed a Coprocessor
12896     rule that all subsequent Coprocessors are skipped in an
12897     invocation chain; now, complete is only available to
12898     bypassable methods (and Coprocessors will get an exception if
12899     they try to 'complete' when it is not allowed).
12900
12901     See javadoc for whether a Coprocessor Observer method supports
12902     'bypass'. If no mention, 'bypass' is NOT supported.
12903
12904 The below methods have been marked deprecated in hbase2. We would have liked to have removed them because they use IA.Private parameters but they are in use by CoreCoprocessors or are critical to downstreamers and we have no alternatives to provide currently.
12905
12906 @Deprecated public boolean prePrepareTimeStampForDeleteVersion(final Mutation mutation, final Cell kv, final byte[] byteNow, final Get get) throws IOException {
12907
12908 @Deprecated public boolean preWALRestore(final RegionInfo info, final WALKey logKey, final WALEdit logEdit) throws IOException {
12909
12910 @Deprecated public void postWALRestore(final RegionInfo info, final WALKey logKey, final WALEdit logEdit) throws IOException {
12911
12912 @Deprecated public DeleteTracker postInstantiateDeleteTracker(DeleteTracker result) throws IOException
12913
12914 Metrics are updated now even if the Coprocessor does a bypass; e.g. The put count is updated even if a Coprocessor bypasses the core put operation (We do it this way so no need for Coprocessors to have access to our core metrics system).
12915
12916
12917 ---
12918
12919 * [HBASE-19033](https://issues.apache.org/jira/browse/HBASE-19033) | *Blocker* | **Allow CP users to change versions and TTL before opening StoreScanner**
12920
12921 Add back the three methods without a return value:
12922 preFlushScannerOpen
12923 preCompactScannerOpen
12924 preStoreScannerOpen
12925
12926 Introduce a ScanOptions interface to let CP users change the max versions and TTL of a ScanInfo. It will be passed as a parameter in the three methods above.
12927
12928 Inntroduce a new example WriteHeavyIncrementObserver which convert increment to put and do aggregating when get. It uses the above three methods.
12929
12930
12931 ---
12932
12933 * [HBASE-19110](https://issues.apache.org/jira/browse/HBASE-19110) | *Minor* | **Add default for Server#isStopping & #getFileSystem**
12934
12935 Made defaults for Server#isStopping and Server#getFileSystem. Should have done this when I added them (lesson learned, was actually mentioned in a review).
12936
12937
12938 ---
12939
12940 * [HBASE-19047](https://issues.apache.org/jira/browse/HBASE-19047) | *Critical* | **CP exposed Scanner types should not extend Shipper**
12941
12942 RegionObserver#preScannerOpen signature changed
12943 RegionScanner preScannerOpen( ObserverContext\<RegionCoprocessorEnvironment\> c, Scan scan,  RegionScanner s)   -\>   void preScannerOpen( ObserverContext\<RegionCoprocessorEnvironment\> c, Scan scan)
12944 The pre hook can no longer return a RegionScanner instance.
12945
12946
12947 ---
12948
12949 * [HBASE-18995](https://issues.apache.org/jira/browse/HBASE-18995) | *Critical* | **Move methods that are for internal usage from CellUtil to Private util class**
12950
12951 Split CellUtil into public CellUtil and PrivateCellUtil for Internal use only.
12952
12953
12954 ---
12955
12956 * [HBASE-18906](https://issues.apache.org/jira/browse/HBASE-18906) | *Critical* | **Provide Region#waitForFlushes API**
12957
12958 Provided an API in Region (Exposed to CPs)
12959 boolean waitForFlushes(long timeout)
12960 This call will make the current thread to be waiting for all flushes in this region to be finished.  (Upto the time out time being specified). The boolean return value specify whether the flushes are really over or the time out being elapsed. Return false when timeout elapsed but flushes are not over or  true when flushes are over
12961
12962
12963 ---
12964
12965 * [HBASE-18905](https://issues.apache.org/jira/browse/HBASE-18905) | *Major* | **Allow CPs to request flush on Region and know the completion of the requested flush**
12966
12967 Add a FlushLifeCycleTracker which is similiar to CompactionLifeCycleTracker for tracking flush.
12968 Add a requestFlush method in Region interface to let CP users request flush on a region. The operation is asynchronous, you need to use the FlushLifeCycleTracker to track the flush.
12969 The difference with CompactionLifeCycleTracker is that, flush is per region so we do not use Store as a parameter of the methods. And also, notExecuted means the whole flush has not been executed, and afterExecution means the whole flush has been finished, so we do not have a separated completed method. A flush will be ended either by notExecuted or afterExecution.
12970
12971
12972 ---
12973
12974 * [HBASE-19048](https://issues.apache.org/jira/browse/HBASE-19048) | *Major* | **Cleanup MasterObserver hooks which takes IA private params**
12975
12976 Purged InterfaceAudience.Private parameters from methods in MasterObserver.
12977
12978 preAbortProcedure no longer takes a ProcedureExecutor.
12979
12980 postGetProcedures no longer takes a list of Procedures.
12981
12982 postGetLocks no longer takes a list of locks.
12983
12984 preRequestLock and postRequestLock no longer take lock type.
12985
12986 preLockHeartbeat and postLockHeartbeat no longer takes a lock procedure.
12987
12988 The implication is that that the Coprocessors that depended on these params have had to coarsen so for example, the AccessController can not do access per Procedure or Lock but rather, makes a judgement on the general access (You'll need to be ADMIN to see list of procedures and locks).
12989
12990
12991 ---
12992
12993 * [HBASE-18994](https://issues.apache.org/jira/browse/HBASE-18994) | *Major* | **Decide if META/System tables should use Compacting Memstore or Default Memstore**
12994
12995 Added a new config 'hbase.systemtables.compacting.memstore.type"  for the system tables. By default all the system tables will have 'NONE' as the type and so it will be using the default memstore by default.
12996 {code}
12997  \<property\>
12998     \<name\>hbase.systemtables.compacting.memstore.type\</name\>
12999     \<value\>NONE\</value\>
13000   \</property\>
13001 {code}
13002
13003
13004 ---
13005
13006 * [HBASE-19029](https://issues.apache.org/jira/browse/HBASE-19029) | *Critical* | **Align RPC timout methods in Table and AsyncTableBase**
13007
13008 Deprecate the following methods in Table:
13009 - int getRpcTimeout()
13010 - int getReadRpcTimeout()
13011 - int getWriteRpcTimeout()
13012 - int getOperationTimeout()
13013
13014 Add the following methods to Table:
13015 - long getRpcTimeout(TimeUnit)
13016 - long getReadRpcTimeout(TimeUnit)
13017 - long getWriteRpcTimeout(TimeUnit)
13018 - long getOperationTimeout(TimeUnit)
13019
13020 Add missing deprecation tag for long getRpcTimeout(TimeUnit unit) in AsyncTableBase
13021
13022
13023 ---
13024
13025 * [HBASE-18410](https://issues.apache.org/jira/browse/HBASE-18410) | *Major* | **FilterList  Improvement.**
13026
13027 In this task, we fixed all existing bugs in FilterList, and did the code refactor which ensured interface compatibility .
13028
13029 The primary bug  fixes are :
13030 1. For sub-filter in FilterList with MUST\_PASS\_ONE, if previous filterKeyValue() of sub-filter returns NEXT\_COL, we cannot make sure that the next cell will be the first cell in next column, because FilterList choose the minimal forward step among sub-filters, and it may return a SKIP. so here we add an extra check to ensure that the next cell will match preivous return code for sub-filters.
13031 2. Previous logic about transforming cell of FilterList is incorrect, we should set the previous transform result (rather than the given cell in question) as the initial vaule of transform cell before call filterKeyValue() of FilterList.
13032 3. Handle the ReturnCodes which the previous code did not handle.
13033
13034 About code refactor, we divided the FilterList into two separated sub-classes: FilterListWithOR and FilterListWithAND,  The FilterListWithOR has been optimised to choose the next minimal step to seek cell rather than SKIP cell one by one, and the FilterListWithAND  has been optimised to choose the next maximal key to seek among sub-filters in filter list. All in all, The code in FilterList is clean and easier to follow now.
13035
13036 Note that ReturnCode NEXT\_ROW has been redefined as skipping to next row in current family,   not to next row in all family. it’s more reasonable, because ReturnCode is a concept in store level, not in region level.
13037
13038 Another bug that needs attention is: filterAllRemaining() in FilterList with MUST\_PASS\_ONE  will now return false if the filter list is empty whereas earlier it used to return true for Operator.MUST\_PASS\_ONE.  it's more reasonable now.
13039
13040
13041 ---
13042
13043 * [HBASE-19077](https://issues.apache.org/jira/browse/HBASE-19077) | *Critical* | **Have Region\*CoprocessorEnvironment provide an ImmutableOnlineRegions**
13044
13045 Adds getOnlineRegions to the RegionCoprocessorEnvironment (Context) and ditto to RegionServerCoprocessorEnvironment. Allows Coprocessor get list of Regions online on the currently hosting RegionServer.
13046
13047
13048 ---
13049
13050 * [HBASE-19021](https://issues.apache.org/jira/browse/HBASE-19021) | *Critical* | **Restore a few important missing logics for balancer in 2.0**
13051
13052 Re-enabled 'hbase.master.loadbalance.bytable', default 'false'.
13053 Draining servers are removed from consideration by blancer.balanceCluster() call.
13054
13055
13056 ---
13057
13058 * [HBASE-19049](https://issues.apache.org/jira/browse/HBASE-19049) | *Major* | **Update kerby to 1.0.1 GA release**
13059
13060 HBase now relies on Kerby version 1.0.1 for its test environment. No downstream facing change is expected.
13061
13062
13063 ---
13064
13065 * [HBASE-16290](https://issues.apache.org/jira/browse/HBASE-16290) | *Major* | **Dump summary of callQueue content; can help debugging**
13066
13067 Patch to print summary of call queues by size and count. This is displayed on the debug dump page of region server UI
13068
13069
13070 ---
13071
13072 * [HBASE-18846](https://issues.apache.org/jira/browse/HBASE-18846) | *Major* | **Accommodate the hbase-indexer/lily/SEP consumer deploy-type**
13073
13074 Makes it so hbase-indexer/lily can move off dependence on internal APIs and instead move to public APIs.
13075
13076 Adds being able to disable near-all HRegionServer services. This along with an existing plugin mechanism which allows configuring the RegionServer to host an alternate Connection implementation, makes it so we can put up a cluster of hollowed-out HRegionServers purposed to pose as a Replication Sink for a source HBase Cluster (Users do not need to figure our RPC, our PB encodings, build a distributed service, etc.). In the alternate supplied Connection implementation, hbase-indexer would install its own code to catch the Replication.
13077
13078 Below and attached are sample hbase-server.xml files and alternate Connection implementations. To start up an HRegionServer as a sink, first make sure there is a ZooKeeper ensemble we can talk to. If none, just start one:
13079 {code}
13080 ./bin/hbase-daemon.sh start zookeeper
13081 {code}
13082
13083 To start up a single RegionServer, put in place the below sample hbase-site.xml and a derviative of the below IndexerConnection on the CLASSPATH, and then start the RegionServer:
13084 {code}
13085 ./bin/hbase-daemon.sh  start  org.apache.hadoop.hbase.regionserver.HRegionServer
13086 {code}
13087 Stdout and Stderr will go into files under configured logs directory. Browse to localhost:16030 to find webui (unless disabled).
13088
13089 DETAILS
13090
13091 This patch adds configuration to disable RegionServer internal Services, Managers, Caches, etc., starting up.
13092
13093 By default a RegionServer starts up an Admin and Client Service. To disable either or both, use the below booleans:
13094 {code}
13095 hbase.regionserver.admin.service
13096 hbase.regionserver.client.service
13097 {code}
13098
13099 Both default true.
13100
13101 To make a HRegionServer startup and stay up without expecting to communicate with a master, set the below boolean to false:
13102
13103 {code}
13104 hbase.masterless
13105 {code]
13106 Default is false.
13107
13108 h3. Sample hbase-site.xml that disables internal HRegionServer Services
13109 Below is an example hbase-site.xml that turns off most Services and that then installs an alternate Connection implementation, one that is nulled out in all regards except in being able to return a "Table" that can catch a Replication Stream in its {code}batch(List\<? extends Row\> actions, Object[] results){code} method. i.e. what the hbase-indexer wants. I also add the example alternate Connection implementation below (both of these files are also attached to this issue). Expects there to be an up and running zookeeper ensemble.
13110
13111 {code}
13112 \<configuration\>
13113   \<!-- This file is an example for hbase-indexer. It shuts down
13114        facility in the regionserver and interjects a special
13115        Connection implementation which is how hbase-indexer will
13116        receive the replication stream from source hbase cluster.
13117        See the class referenced in the config.
13118
13119        Most of the config in here is booleans set to off and
13120        setting values to zero so services doon't start. Some of
13121        the flags are new via this patch.
13122 --\>
13123   \<!--Need this for the RegionServer to come up standalone--\>
13124   \<property\>
13125     \<name\>hbase.cluster.distributed\</name\>
13126     \<value\>true\</value\>
13127   \</property\>
13128
13129   \<!--This is what you implement, a Connection that returns a Table that
13130        overrides the batch call. It is at this point you do your indexer inserts.
13131     --\>
13132   \<property\>
13133     \<name\>hbase.client.connection.impl\</name\>
13134     \<value\>org.apache.hadoop.hbase.client.IndexerConnection\</value\>
13135     \<description\>A customs connection implementation just so we can interject our
13136       own Table class, one that has an override for the batch call which receives
13137       the replication stream edits; i.e. it is called by the replication sink
13138       #replicateEntries method.\</description\>
13139   \</property\>
13140
13141   \<!--Set hbase.regionserver.info.port to -1 for no webui--\>
13142
13143   \<!--Below are configs to shut down unused services in hregionserver--\>
13144   \<property\>
13145     \<name\>hbase.regionserver.admin.service\</name\>
13146     \<value\>false\</value\>
13147     \<description\>Do NOT stand up an Admin Service Interface on RPC\</description\>
13148   \</property\>
13149   \<property\>
13150     \<name\>hbase.regionserver.client.service\</name\>
13151     \<value\>false\</value\>
13152     \<description\>Do NOT stand up a client-facing Service on RPC\</description\>
13153   \</property\>
13154   \<property\>
13155     \<name\>hbase.wal.provider\</name\>
13156     \<value\>org.apache.hadoop.hbase.wal.DisabledWALProvider\</value\>
13157     \<description\>Set WAL service to be the null WAL\</description\>
13158   \</property\>
13159   \<property\>
13160     \<name\>hbase.regionserver.workers\</name\>
13161     \<value\>false\</value\>
13162     \<description\>Turn off all background workers, log splitters, executors, etc.\</description\>
13163   \</property\>
13164   \<property\>
13165     \<name\>hfile.block.cache.size\</name\>
13166     \<value\>0.0001\</value\>
13167     \<description\>Turn off block cache completely\</description\>
13168   \</property\>
13169   \<property\>
13170     \<name\>hbase.mob.file.cache.size\</name\>
13171     \<value\>0\</value\>
13172     \<description\>Disable MOB cache.\</description\>
13173   \</property\>
13174   \<property\>
13175     \<name\>hbase.masterless\</name\>
13176     \<value\>true\</value\>
13177     \<description\>Do not expect Master in cluster.\</description\>
13178   \</property\>
13179   \<property\>
13180     \<name\>hbase.regionserver.metahandler.count\</name\>
13181     \<value\>1\</value\>
13182     \<description\>How many priority handlers to run; we probably need none.
13183     Default is 20 which is too much on a server like this.\</description\>
13184   \</property\>
13185   \<property\>
13186     \<name\>hbase.regionserver.replication.handler.count\</name\>
13187     \<value\>1\</value\>
13188     \<description\>How many replication handlers to run; we probably need none.
13189     Default is 3 which is too much on a server like this.\</description\>
13190   \</property\>
13191   \<property\>
13192     \<name\>hbase.regionserver.handler.count\</name\>
13193     \<value\>10\</value\>
13194     \<description\>How many default handlers to run; tie to # of CPUs.
13195     Default is 30 which is too much on a server like this.\</description\>
13196   \</property\>
13197   \<property\>
13198     \<name\>hbase.ipc.server.read.threadpool.size\</name\>
13199     \<value\>3\</value\>
13200     \<description\>How many Listener request reaaders to run; tie to a portion # of CPUs (1/4?).
13201     Default is 10 which is too much on a server like this.\</description\>
13202   \</property\>
13203 \</configuration\>
13204 {code}
13205
13206 h2. Sample Connection Implementation
13207 Has call-out for where an hbase-indexer would insert its capture code.
13208 {code}
13209 package org.apache.hadoop.hbase.client;
13210
13211 import com.google.protobuf.Descriptors;
13212 import com.google.protobuf.Message;
13213 import com.google.protobuf.Service;
13214 import com.google.protobuf.ServiceException;
13215 import org.apache.hadoop.conf.Configuration;
13216 import org.apache.hadoop.hbase.CompareOperator;
13217 import org.apache.hadoop.hbase.HTableDescriptor;
13218 import org.apache.hadoop.hbase.TableName;
13219 import org.apache.hadoop.hbase.client.coprocessor.Batch;
13220 import org.apache.hadoop.hbase.filter.CompareFilter;
13221 import org.apache.hadoop.hbase.ipc.CoprocessorRpcChannel;
13222 import org.apache.hadoop.hbase.security.User;
13223
13224 import java.io.IOException;
13225 import java.util.List;
13226 import java.util.Map;
13227 import java.util.concurrent.ExecutorService;
13228
13229
13230 /\*\*
13231  \* Sample class for hbase-indexer.
13232  \* DO NOT COMMIT TO HBASE CODEBASE!!!
13233  \* Overrides Connection just so we can return a Table that has the
13234  \* method that the replication sink calls, i.e. Table#batch.
13235  \* It is at this point that the hbase-indexer catches the replication
13236  \* stream so it can insert into the lucene index.
13237  \*/
13238 public class IndexerConnection implements Connection {
13239   private final Configuration conf;
13240   private final User user;
13241   private final ExecutorService pool;
13242   private volatile boolean closed = false;
13243
13244   public IndexerConnection(Configuration conf, ExecutorService pool, User user) throws IOException {
13245     this.conf = conf;
13246     this.user = user;
13247     this.pool = pool;
13248   }
13249
13250   @Override
13251   public void abort(String why, Throwable e) {}
13252
13253   @Override
13254   public boolean isAborted() {
13255     return false;
13256   }
13257
13258   @Override
13259   public Configuration getConfiguration() {
13260     return this.conf;
13261   }
13262
13263   @Override
13264   public BufferedMutator getBufferedMutator(TableName tableName) throws IOException {
13265     return null;
13266   }
13267
13268   @Override
13269   public BufferedMutator getBufferedMutator(BufferedMutatorParams params) throws IOException {
13270     return null;
13271   }
13272
13273   @Override
13274   public RegionLocator getRegionLocator(TableName tableName) throws IOException {
13275     return null;
13276   }
13277
13278   @Override
13279   public Admin getAdmin() throws IOException {
13280     return null;
13281   }
13282
13283   @Override
13284   public void close() throws IOException {
13285     if (!this.closed) this.closed = true;
13286   }
13287
13288   @Override
13289   public boolean isClosed() {
13290     return this.closed;
13291   }
13292
13293   @Override
13294   public TableBuilder getTableBuilder(final TableName tn, ExecutorService pool) {
13295     if (isClosed()) {
13296       throw new RuntimeException("IndexerConnection is closed.");
13297     }
13298     final Configuration passedInConfiguration = getConfiguration();
13299     return new TableBuilder() {
13300       @Override
13301       public TableBuilder setOperationTimeout(int timeout) {
13302         return null;
13303       }
13304
13305       @Override
13306       public TableBuilder setRpcTimeout(int timeout) {
13307         return null;
13308       }
13309
13310       @Override
13311       public TableBuilder setReadRpcTimeout(int timeout) {
13312         return null;
13313       }
13314
13315       @Override
13316       public TableBuilder setWriteRpcTimeout(int timeout) {
13317         return null;
13318       }
13319
13320       @Override
13321       public Table build() {
13322         return new Table() {
13323           private final Configuration conf = passedInConfiguration;
13324           private final TableName tableName = tn;
13325
13326           @Override
13327           public TableName getName() {
13328             return this.tableName;
13329           }
13330
13331           @Override
13332           public Configuration getConfiguration() {
13333             return this.conf;
13334           }
13335
13336           @Override
13337           public void batch(List\<? extends Row\> actions, Object[] results)
13338           throws IOException, InterruptedException {
13339             // Implementation goes here.
13340           }
13341
13342           @Override
13343           public HTableDescriptor getTableDescriptor() throws IOException {
13344             return null;
13345           }
13346
13347           @Override
13348           public TableDescriptor getDescriptor() throws IOException {
13349             return null;
13350           }
13351
13352           @Override
13353           public boolean exists(Get get) throws IOException {
13354             return false;
13355           }
13356
13357           @Override
13358           public boolean[] existsAll(List\<Get\> gets) throws IOException {
13359             return new boolean[0];
13360           }
13361
13362           @Override
13363           public \<R\> void batchCallback(List\<? extends Row\> actions, Object[] results, Batch.Callback\<R\> callback) throws IOException, InterruptedException {
13364
13365           }
13366
13367           @Override
13368           public Result get(Get get) throws IOException {
13369             return null;
13370           }
13371
13372           @Override
13373           public Result[] get(List\<Get\> gets) throws IOException {
13374             return new Result[0];
13375           }
13376
13377           @Override
13378           public ResultScanner getScanner(Scan scan) throws IOException {
13379             return null;
13380           }
13381
13382           @Override
13383           public ResultScanner getScanner(byte[] family) throws IOException {
13384             return null;
13385           }
13386
13387           @Override
13388           public ResultScanner getScanner(byte[] family, byte[] qualifier) throws IOException {
13389             return null;
13390           }
13391
13392           @Override
13393           public void put(Put put) throws IOException {
13394
13395           }
13396
13397           @Override
13398           public void put(List\<Put\> puts) throws IOException {
13399
13400           }
13401
13402           @Override
13403           public boolean checkAndPut(byte[] row, byte[] family, byte[] qualifier, byte[] value, Put put) throws IOException {
13404             return false;
13405           }
13406
13407           @Override
13408           public boolean checkAndPut(byte[] row, byte[] family, byte[] qualifier, CompareFilter.CompareOp compareOp, byte[] value, Put put) throws IOException {
13409             return false;
13410           }
13411
13412           @Override
13413           public boolean checkAndPut(byte[] row, byte[] family, byte[] qualifier, CompareOperator op, byte[] value, Put put) throws IOException {
13414             return false;
13415           }
13416
13417           @Override
13418           public void delete(Delete delete) throws IOException {
13419
13420           }
13421
13422           @Override
13423           public void delete(List\<Delete\> deletes) throws IOException {
13424
13425           }
13426
13427           @Override
13428           public boolean checkAndDelete(byte[] row, byte[] family, byte[] qualifier, byte[] value, Delete delete) throws IOException {
13429             return false;
13430           }
13431
13432           @Override
13433           public boolean checkAndDelete(byte[] row, byte[] family, byte[] qualifier, CompareFilter.CompareOp compareOp, byte[] value, Delete delete) throws IOException {
13434             return false;
13435           }
13436
13437           @Override
13438           public boolean checkAndDelete(byte[] row, byte[] family, byte[] qualifier, CompareOperator op, byte[] value, Delete delete) throws IOException {
13439             return false;
13440           }
13441
13442           @Override
13443           public void mutateRow(RowMutations rm) throws IOException {
13444
13445           }
13446
13447           @Override
13448           public Result append(Append append) throws IOException {
13449             return null;
13450           }
13451
13452           @Override
13453           public Result increment(Increment increment) throws IOException {
13454             return null;
13455           }
13456
13457           @Override
13458           public long incrementColumnValue(byte[] row, byte[] family, byte[] qualifier, long amount) throws IOException {
13459             return 0;
13460           }
13461
13462           @Override
13463           public long incrementColumnValue(byte[] row, byte[] family, byte[] qualifier, long amount, Durability durability) throws IOException {
13464             return 0;
13465           }
13466
13467           @Override
13468           public void close() throws IOException {
13469
13470           }
13471
13472           @Override
13473           public CoprocessorRpcChannel coprocessorService(byte[] row) {
13474             return null;
13475           }
13476
13477           @Override
13478           public \<T extends Service, R\> Map\<byte[], R\> coprocessorService(Class\<T\> service, byte[] startKey, byte[] endKey, Batch.Call\<T, R\> callable) throws ServiceException, Throwable {
13479             return null;
13480           }
13481
13482           @Override
13483           public \<T extends Service, R\> void coprocessorService(Class\<T\> service, byte[] startKey, byte[] endKey, Batch.Call\<T, R\> callable, Batch.Callback\<R\> callback) throws ServiceException, Throwable {
13484
13485           }
13486
13487           @Override
13488           public \<R extends Message\> Map\<byte[], R\> batchCoprocessorService(Descriptors.MethodDescriptor methodDescriptor, Message request, byte[] startKey, byte[] endKey, R responsePrototype) throws ServiceException, Throwable {
13489             return null;
13490           }
13491
13492           @Override
13493           public \<R extends Message\> void batchCoprocessorService(Descriptors.MethodDescriptor methodDescriptor, Message request, byte[] startKey, byte[] endKey, R responsePrototype, Batch.Callback\<R\> callback) throws ServiceException, Throwable {
13494
13495           }
13496
13497           @Override
13498           public boolean checkAndMutate(byte[] row, byte[] family, byte[] qualifier, CompareFilter.CompareOp compareOp, byte[] value, RowMutations mutation) throws IOException {
13499             return false;
13500           }
13501
13502           @Override
13503           public boolean checkAndMutate(byte[] row, byte[] family, byte[] qualifier, CompareOperator op, byte[] value, RowMutations mutation) throws IOException {
13504             return false;
13505           }
13506
13507           @Override
13508           public void setOperationTimeout(int operationTimeout) {
13509
13510           }
13511
13512           @Override
13513           public int getOperationTimeout() {
13514             return 0;
13515           }
13516
13517           @Override
13518           public int getRpcTimeout() {
13519             return 0;
13520           }
13521
13522           @Override
13523           public void setRpcTimeout(int rpcTimeout) {
13524
13525           }
13526
13527           @Override
13528           public int getReadRpcTimeout() {
13529             return 0;
13530           }
13531
13532           @Override
13533           public void setReadRpcTimeout(int readRpcTimeout) {
13534
13535           }
13536
13537           @Override
13538           public int getWriteRpcTimeout() {
13539             return 0;
13540           }
13541
13542           @Override
13543           public void setWriteRpcTimeout(int writeRpcTimeout) {
13544
13545           }
13546         };
13547       }
13548     };
13549   }
13550 }
13551 {code}
13552
13553
13554 ---
13555
13556 * [HBASE-18873](https://issues.apache.org/jira/browse/HBASE-18873) | *Critical* | **Hide protobufs in GlobalQuotaSettings**
13557
13558 GlobalQuotaSettings was introduced to avoid protocol-specific Java classes from leaking into API which is users may leverage. This class has a number of methods which return plain-Java-objects instead of these protocol-specific classes in an effort to better provide stability in the future.
13559
13560
13561 ---
13562
13563 * [HBASE-18893](https://issues.apache.org/jira/browse/HBASE-18893) | *Major* | **Remove Add/Modify/DeleteColumnFamilyProcedure in favor of using ModifyTableProcedure**
13564
13565 The RPC calls for Add/Modify/DeleteColumn have been removed and are now backed by ModifyTable functionality. The corresponding permissions in AccessController have been removed as well.
13566
13567 The shell already bypassed these RPCs and used ModifyTable directly, and thus would not be getting these permission checks, this change brings the rest of the RPC inline with that.
13568
13569 Coprocessor hooks for pre/post Add/Modify/DeleteColumn have likewise been removed. Coprocessors needing to take special actions on schema change should instead process ModifyTable events (which they should have been doing already, but it was easy for developers to miss this nuance).
13570
13571
13572 ---
13573
13574 * [HBASE-16338](https://issues.apache.org/jira/browse/HBASE-16338) | *Major* | **update jackson to 2.y**
13575
13576 HBase has upgraded from Jackson 1 to Jackson 2. JSON output should not have changed and this should not be user facing, but server classpaths should be adjusted accordingly.
13577
13578
13579 ---
13580
13581 * [HBASE-19051](https://issues.apache.org/jira/browse/HBASE-19051) | *Minor* | **Add new split algorithm for num string**
13582
13583 Add new split algorithm DecimalStringSplit，row are decimal-encoded long values in the range "00000000" =\> "99999999" .
13584 create 't1','f', { NUMREGIONS =\> 10 , SPLITALGO =\> 'DecimalStringSplit' }
13585 The split point will be 10000000,20000000,...,90000000
13586
13587
13588 ---
13589
13590 * [HBASE-19067](https://issues.apache.org/jira/browse/HBASE-19067) | *Major* | **Do not expose getHDFSBlockDistribution in StoreFile**
13591
13592 Removed CP exposed StoreFile#getHDFSBlockDistribution
13593
13594
13595 ---
13596
13597 * [HBASE-18989](https://issues.apache.org/jira/browse/HBASE-18989) | *Major* | **Polish the compaction related CP hooks**
13598
13599 Add two new methods in CompactionLifeCycleTracker.
13600 The notExecuted method will be called if the selectCompaction failed or space quota limitation reached.
13601 The completed method will be called after all the requested compactions are finished. The compaction scheduling is pre Store so if you request compaction on a region it may lead to multiple compactions.
13602 Remove the User parameter in Region.requestCompaction methods as it is useless for CP users.
13603 Add a boolean parameter to indicate whether you want to do a major compaction. And so that the triggerMajorCompaction method is removed.
13604 Remove the getCompactionProgress method in Store interface.
13605 Add a UT to confirm that CompactionLifeCycleTracker works correctly, and it also shows how to use CompactionLifeCycleTracker to wait for the completion of a compaction.
13606
13607
13608 ---
13609
13610 * [HBASE-19046](https://issues.apache.org/jira/browse/HBASE-19046) | *Major* | **RegionObserver#postCompactSelection  Avoid passing shaded ImmutableList param**
13611
13612 RegionObserver#postCompactSelection signature is changed.
13613 Arg type org.apache.hadoop.hbase.shaded.com.google.common.collect.ImmutableList is replaced with java.util.List
13614
13615
13616 ---
13617
13618 * [HBASE-19043](https://issues.apache.org/jira/browse/HBASE-19043) | *Major* | **Purge TableWrapper and CoprocessorHConnnection**
13619
13620 Removes getTable from the CoprocessorEnvrionment Interface and from the BaseEnvironment implementation. Also removes TableWrapper and CoprocessorHConnection, two classes that were used by BaseEnvironment to keep a tag on Tables created by Coprocessors that BaseEnvironment might close them out on #shutdown.
13621
13622 Long after these classes and methods were added, in HBase 1.0.0, we moved to a mode where management of Tables was shifted from HBase to the Client; the Client is to manage lifecycle. Table also became a (relatively) lightweight construct so folks are used to getting a Table instance, using it, and then immediately closing it when done.
13623
13624 Coprocessors should do the same in hbase2.0.0.
13625
13626 CoprocessorHConnection short-circuited RPC. This feature has since been integrated into Server Connections; when they create a Connection, they get one that will short-circuit if the request is to a localhost so no need of CoprocessorHConnection any more.
13627
13628 Coprocessors get the Server Connection when they ask for a Connection from their \*CoprocessorEnvironment.
13629
13630
13631 ---
13632
13633 * [HBASE-19014](https://issues.apache.org/jira/browse/HBASE-19014) | *Major* | **surefire fails; When writing xml report stdout/stderr ... No such file or directory**
13634
13635 Running tests with a wildcard selector, i.e.{{-Dtest=org.apache.hadoop.hbase.server.\*}} no longer works.
13636
13637
13638 ---
13639
13640 * [HBASE-10367](https://issues.apache.org/jira/browse/HBASE-10367) | *Major* | **RegionServer graceful stop / decommissioning**
13641
13642 Added three top level Admin APIs to help decommissioning and graceful stop of region servers.
13643
13644   /\*\*
13645    \* Mark region server(s) as decommissioned to prevent additional regions from getting
13646    \* assigned to them. Optionally unload the regions on the servers. If there are multiple servers
13647    \* to be decommissioned, decommissioning them at the same time can prevent wasteful region
13648    \* movements. Region unloading is asynchronous.
13649    \* @param servers The list of servers to decommission.
13650    \* @param offload True to offload the regions from the decommissioned servers
13651    \*/
13652   void decommissionRegionServers(List\<ServerName\> servers, boolean offload) throws IOException;
13653
13654   /\*\*
13655    \* List region servers marked as decommissioned, which can not be assigned regions.
13656    \* @return List of decommissioned region servers.
13657    \*/
13658   List\<ServerName\> listDecommissionedRegionServers() throws IOException;
13659
13660   /\*\*
13661    \* Remove decommission marker from a region server to allow regions assignments.
13662    \* Load regions onto the server if a list of regions is given. Region loading is
13663    \* asynchronous.
13664    \* @param server The server to recommission.
13665    \* @param encodedRegionNames Regions to load onto the server.
13666    \*/
13667   void recommissionRegionServer(ServerName server, List\<byte[]\> encodedRegionNames)  throws IOException;
13668
13669
13670 ---
13671
13672 * [HBASE-19042](https://issues.apache.org/jira/browse/HBASE-19042) | *Blocker* | **Oracle Java 8u144 downloader broken in precommit check**
13673
13674 Precommit switched from Oracle JDK 8 to OpenJDK-8.
13675
13676
13677 ---
13678
13679 * [HBASE-18945](https://issues.apache.org/jira/browse/HBASE-18945) | *Major* | **Make a IA.LimitedPrivate interface for CellComparator**
13680
13681 CellCompartor has been added as an interface with IA.LimitedPrivate. It has the following methods
13682 #int compare(Cell leftCell, Cell rightCell);
13683 #int compareRows(Cell leftCell, Cell rightCell)
13684 #int compareRows(Cell cell, byte[] bytes, int offset, int length)
13685 #int compareWithoutRow(Cell leftCell, Cell rightCell)
13686 #int compareFamilies(Cell leftCell, Cell rightCell
13687 #int compareQualifiers(Cell leftCell, Cell rightCell)
13688 #int compareTimestamps(Cell leftCell, Cell rightCell)
13689 #int compareTimestamps(long leftCellts, long rightCellts)
13690
13691 This is exposed to CPs and CPs can make use of the above methods to do comparisons on the cells.
13692 For internal usage we have CellComparatorImpl and it has static references to COMPARATOR and META\_CELL\_COMPARATOR.
13693 So when a region or store is initialized we should use one of the above comparator. For META table we need the META\_CELL\_COMPARATOR and all other table's  regions/stores will use the COMPARTOR.
13694 While writing the comparator name in FixedFileTrailer of the Hfile we have now ensured that this rename of CellComparator.COMPARATOR/CellComparator.META\_CELL\_COMPARATOR to CellComparatorImpl.COMPARATOR/CellComparatorImpl.META\_CELL\_COMPARATOR is handled.
13695
13696 CellUtils is an util method that provides lot of APIs that helps to do compare, matching functionalities between two cells, or with a cell and a corrpesponding byte[] etc. Some of the APIs are internally used which will be cleaned up in a follow on JIRA HBASE-18995.
13697
13698
13699 ---
13700
13701 * [HBASE-19001](https://issues.apache.org/jira/browse/HBASE-19001) | *Major* | **Remove the hooks in RegionObserver which are designed to construct a StoreScanner which is marked as IA.Private**
13702
13703 These methods are removed:
13704 KeyValueScanner preStoreScannerOpen(ObserverContext\<RegionCoprocessorEnvironment\> c,
13705       Store store, Scan scan, NavigableSet\<byte[]\> targetCols, KeyValueScanner s, long readPt)
13706       throws IOException;
13707 InternalScanner preFlushScannerOpen(ObserverContext\<RegionCoprocessorEnvironment\> c,
13708       Store store, List\<KeyValueScanner\> scanners, InternalScanner s, long readPoint)
13709       throws IOException;
13710 InternalScanner preCompactScannerOpen(ObserverContext\<RegionCoprocessorEnvironment\> c,
13711       Store store, List\<? extends KeyValueScanner\> scanners, ScanType scanType, long earliestPutTs,
13712       InternalScanner s, CompactionLifeCycleTracker tracker, CompactionRequest request,
13713       long readPoint) throws IOException;
13714
13715 For flush and compaction, CP users are expected to wrap the InternalScanner in preFlush/preCompact. And for normal region operation, just use preGetOp/preScannerOpen to modify the Get/Scan object.
13716
13717 This method in Region interface is also removed as we do not need to use read point in CP hooks anymore:
13718 long getReadPoint(IsolationLevel isolationLevel);
13719
13720
13721 ---
13722
13723 * [HBASE-18350](https://issues.apache.org/jira/browse/HBASE-18350) | *Blocker* | **RSGroups are broken under AMv2**
13724
13725 Moves RSGroup on to AMv2. Reenables disabled RSGroups tests.
13726
13727
13728 ---
13729
13730 * [HBASE-18960](https://issues.apache.org/jira/browse/HBASE-18960) | *Major* | **A few bug fixes and minor improvements around batchMutate()**
13731
13732 All operations for which further processing is skipped by preBatchMutate coprocessor hook are treated as SUCCESS instead of FAILED.
13733
13734
13735 ---
13736
13737 * [HBASE-14247](https://issues.apache.org/jira/browse/HBASE-14247) | *Critical* | **Separate the old WALs into different regionserver directories**
13738
13739 Add a new config hbase.separate.oldlogdir.by.regionserver. The default value is false. If this config is true, the old wal dir will be separated by regionservers. This will change the oldWALs layout. The oldWALs is used by replication. So if a cluster didn't use replication, it can be rolling upgrade (upgrade this config from false to true) directly. If a cluster use replication, the oldWALs will be not found when layout changed. So the cluster need rolling upgrade twice. Firstly, only rolling cluster to use new version code. Secondly rolling the config from false to true. Because the cluster already rolling to new version code, so it can find the oldWALs in the new dir layout.
13740
13741
13742 ---
13743
13744 * [HBASE-18954](https://issues.apache.org/jira/browse/HBASE-18954) | *Major* | **Make \*CoprocessorHost classes private**
13745
13746 - Make CoprocessorHost and its implementations InterfaceAudience.Private
13747 - Configurations from "CoprocessorHost" have been moved to new "CoprocessorConfigurations" class.
13748
13749
13750 ---
13751
13752 * [HBASE-15410](https://issues.apache.org/jira/browse/HBASE-15410) | *Major* | **Utilize the max seek value when all Filters in MUST\_PASS\_ALL FilterList return SEEK\_NEXT\_USING\_HINT**
13753
13754 This optimization, targeting SEEK\_NEXT\_USING\_HINT return values, utilizes the max seek value and is transparent to Filters.
13755
13756
13757 ---
13758
13759 * [HBASE-18747](https://issues.apache.org/jira/browse/HBASE-18747) | *Critical* | **Introduce new example and helper classes to tell CP users how to do filtering on scanners**
13760
13761 Modify ZooKeeperScanPolicyObserver in hbase-examples to show how to do filtering in the CP hooks of flush and compaction in hbase-2.0.
13762
13763
13764 ---
13765
13766 * [HBASE-18108](https://issues.apache.org/jira/browse/HBASE-18108) | *Blocker* | **Procedure WALs are archived but not cleaned; fix**
13767
13768 The archived Procedure WALs are moved to \<hbase\_root\>/oldWALs/masterProcedureWALs
13769 directory. TimeToLiveProcedureWALCleaner class was added which regularly cleans the Procedure WAL files from there.
13770
13771 The TimeToLiveProcedureWALCleaner is added to hbase.master.logcleaner.plugins configuration value.
13772
13773 A new config parameter is added: hbase.master.procedurewalcleaner.ttl, which specifies how long a Procedure WAL should stay in the archive directory.
13774
13775
13776 ---
13777
13778 * [HBASE-18183](https://issues.apache.org/jira/browse/HBASE-18183) | *Major* | **Region interface cleanup for CP expose**
13779
13780 Below methods are removed from CP exposed Region interface
13781 getOpenSeqNum
13782 getOldestSeqIdOfStore
13783 isLoadingCfsOnDemandDefault
13784 getReadpoint
13785 updateReadRequestsCount
13786 updateWriteRequestsCount
13787 getRegionServicesForStores
13788 getMetrics
13789 getHDFSBlocksDistribution
13790 releaseRowLocks
13791 batchReplay
13792 get(Get get, boolean withCoprocessor, long nonceGroup, long nonce)
13793 bulkLoadHFiles
13794 execService
13795 registerService
13796 checkFamilies
13797 checkTimestamps
13798 prepareDelete
13799 prepareDeleteTimestamps
13800 updateCellTimestamps
13801 flush
13802 compact
13803 waitForFlushesAndCompactions
13804 waitForFlushes
13805
13806 Change signature of below methods by dropping params 'nonceGroup', 'nonce'
13807 append(Append append, long nonceGroup, long nonce)
13808 batchMutate(Mutation[] mutations, long nonceGroup, long nonce)
13809 increment(Increment increment, long nonceGroup, long nonce)
13810
13811
13812 ---
13813
13814 * [HBASE-18949](https://issues.apache.org/jira/browse/HBASE-18949) | *Major* | **Remove the CompactionRequest parameter in preCompactSelection**
13815
13816 Remove the CompactionRequest parameter in preCompactSelection as we do not have a CompactionRequest at that time.
13817
13818
13819 ---
13820
13821 * [HBASE-18909](https://issues.apache.org/jira/browse/HBASE-18909) | *Major* | **Deprecate Admin's methods which used String regex**
13822
13823 Pushed to master and branch-2. Thanks all for reviewing.
13824
13825
13826 ---
13827
13828 * [HBASE-18931](https://issues.apache.org/jira/browse/HBASE-18931) | *Major* | **Make ObserverContext an interface and remove private/testing methods**
13829
13830 Changes ObserverContext from a class to an interface and hides away constructor, testing functions and other internal-only functions in the implementation class.
13831
13832
13833 ---
13834
13835 * [HBASE-18878](https://issues.apache.org/jira/browse/HBASE-18878) | *Major* | **Use Optional\<T\> return types when T can be null**
13836
13837 **WARNING: No release note provided for this change.**
13838
13839
13840 ---
13841
13842 * [HBASE-18649](https://issues.apache.org/jira/browse/HBASE-18649) | *Major* | **Deprecate KV Usage in MR to move to Cells in 3.0**
13843
13844 All the mappers and reducers output type will be now of MapReduceCell type. No more KeyValue type. How ever in branch-2 for compatibility we have allowed the older interfaces/classes that work with KeyValue to stay in the code base but they have been marked as deprecated.
13845 The following interfaces/classes have been deprecated in branch-2
13846 Import#KeyValueWritableComparablePartitioner
13847 Import#KeyValueWritableComparator
13848 Import#KeyValueWritableComparable
13849 Import#KeyValueReducer
13850 Import#KeyValueSortImporter
13851 Import#KeyValueImporter
13852 KeyValueSortReducer
13853 KeyValueSerialization
13854 WALPlayer#WALKeyValueMapper
13855
13856 So any existing MR jobs that is using the above public interfaces/classes will continue to work in branch-2 and the expected output value type of those mappers and reducers can continue to be KeyValue type.
13857
13858 In branch-3 the mappers and reducers output will only expect MapReduceCell as the type and will no longer work with KeyValue type.
13859 The new public classes/interfaces added for branch-3 and in branch-2 are
13860 CellSerialization
13861 CellSortReducer
13862 Import#CellWritableComparablePartitioner
13863 Import#CellWritableComparable
13864 Import#CellWritableComparator
13865 Import#CellReducer
13866 Import#CellSortImporter
13867 Import#CellImporter
13868 WALPlayer#WALCellMapper
13869
13870
13871 ---
13872
13873 * [HBASE-18897](https://issues.apache.org/jira/browse/HBASE-18897) | *Major* | **Substitute MemStore for Memstore**
13874
13875 The changes of IA.Public/IA.LimitedPrivate classes are shown below:
13876 HTableDescriptor class
13877 \* boolean hasRegionMemstoreReplication()
13878 + boolean hasRegionMemStoreReplication()
13879 \* HTableDescriptor setRegionMemstoreReplication(boolean)
13880 + HTableDescriptor setRegionMemStoreReplication(boolean)
13881
13882 RegionLoadStats class
13883 \* int getMemstoreLoad()
13884 + int getMemStoreLoad()
13885
13886 ServerLoad class
13887 \* int getMemstoreSizeInMB()
13888 + int getMemStoreSizeMB()
13889
13890 Region class
13891 - long getMemstoreSize()
13892 + long getMemStoreSize()
13893
13894 Store class
13895 - MemstoreSize getMemStoreSize()
13896 + MemStoreSize getMemStoreSize()
13897 - MemstoreSize getFlushableSize()
13898 + MemStoreSize getFlushableSize()
13899 - MemstoreSize getSnapshotSize()
13900 + MemStoreSize getSnapshotSize()
13901
13902 StoreFile class
13903 - long getMaxMemstoreTS()
13904 + long getMaxMemStoreTS()
13905
13906
13907 ---
13908
13909 * [HBASE-18010](https://issues.apache.org/jira/browse/HBASE-18010) | *Major* | **Connect CellChunkMap to be used for flattening in CompactingMemStore**
13910
13911 The CellChunkMap is very dense index for Memstore ImmutableSegment and the only one that can be taken off-heap. However, CellChunkMap works on-heap as well. The coding of the entire flow of working with CellChunkMap is not yet finished, thus CellChunkMap is disabled for usage so far. The continuation is done under HBASE-18232.
13912
13913
13914 ---
13915
13916 * [HBASE-18883](https://issues.apache.org/jira/browse/HBASE-18883) | *Major* | **Upgrade to Curator 4.0**
13917
13918 Curator version has been updated from 2.x to 4.0 (running in ZK 3.4 compatibility mode).
13919
13920 Users who experience classpath issues due to version conflicts are recommended to use either the hbase-shaded-client or hbase-shaded-mapreduce artifacts.
13921
13922
13923 ---
13924
13925 * [HBASE-13844](https://issues.apache.org/jira/browse/HBASE-13844) | *Minor* | **Move static helper methods from KeyValue into CellUtils**
13926
13927 Move KeyValue.parseColumn() to CellUtil
13928
13929
13930 ---
13931
13932 * [HBASE-18839](https://issues.apache.org/jira/browse/HBASE-18839) | *Major* | **Apply RegionInfo to code base**
13933
13934 The incompatible changes of IA.Public/LimitedPrivate classes are shown below.
13935 + new method
13936 - removed method
13937 \* deprecated method
13938 -------------------------------------
13939 HRegionLocation class
13940 + RegionInfo getRegion()
13941 \* HRegionInfo getRegionInfo()
13942
13943 AsyncAdmin class
13944 + CompletableFuture\<List\<RegionInfo\>\> getOnlineRegions(ServerName serverName);
13945 - CompletableFuture\<List\<HRegionInfo\>\> getOnlineRegions(ServerName serverName);
13946 + CompletableFuture\<List\<RegionInfo\>\> getTableRegions(TableName tableName);
13947 - CompletableFuture\<List\<HRegionInfo\>\> getTableRegions(TableName tableName);
13948
13949 HBaseTestingUtility class
13950 - Table createTable(HTableDescriptor htd, byte[][] families, Configuration c)
13951 - Table createTable(HTableDescriptor htd, byte[][] families, byte[][] splitKeys, Configuration c)
13952 - Table createTable(HTableDescriptor htd, byte[][] splitRows)
13953 - void modifyTableSync(Admin admin, HTableDescriptor desc)
13954 - HRegion createLocalHRegion(HTableDescriptor desc, byte [] startKey, byte [] endKey)
13955 - HRegion createLocalHRegion(HRegionInfo info, HTableDescriptor desc)
13956 - HRegion createLocalHRegion(HRegionInfo info, TableDescriptor desc)
13957 + HRegion createLocalHRegion(RegionInfo info, TableDescriptor desc)
13958 - HRegion createLocalHRegion(HRegionInfo info, HTableDescriptor desc, WAL wal)
13959 - HRegion createLocalHRegion(HRegionInfo info, TableDescriptor desc, WAL wal)
13960 + HRegion createLocalHRegion(RegionInfo info, TableDescriptor desc, WAL wal)
13961 - List\<HRegionInfo\> createMultiRegionsInMeta(final Configuration conf,final TableDescriptor htd, byte [][] startKeys)
13962 + List\<HRegionInfo\> createMultiRegionsInMeta(final Configuration conf,final TableDescriptor htd, byte [][] startKeys)
13963 - WAL createWal(final Configuration conf, final Path rootDir, final HRegionInfo hri)
13964 + WAL createWal(final Configuration conf, final Path rootDir, final RegionInfo hri)
13965 - HRegion createRegionAndWAL(final HRegionInfo info, final Path rootDir,final Configuration conf, final HTableDescriptor htd)
13966 - HRegion createRegionAndWAL(final HRegionInfo info, final Path rootDir, final Configuration conf, final TableDescriptor htd)
13967 + HRegion createRegionAndWAL(final RegionInfo info, final Path rootDir, final Configuration conf, final TableDescriptor htd)
13968 - HRegion createRegionAndWAL(final HRegionInfo info, final Path rootDir, final Configuration conf, final HTableDescriptor htd, boolean initialize)
13969 + HRegion createRegionAndWAL(final RegionInfo info, final Path rootDir, final Configuration conf, final HTableDescriptor htd, boolean initialize)
13970 - boolean assignRegion(final HRegionInfo regionInfo)
13971 + boolean assignRegion(final RegionInfo regionInfo)
13972 - void moveRegionAndWait(HRegionInfo destRegion, ServerName destServer)
13973 + void moveRegionAndWait(RegionInfo destRegion, ServerName destServer)
13974 - int createPreSplitLoadTestTable(Configuration conf, HTableDescriptor desc, HColumnDescriptor hcd)
13975 - int createPreSplitLoadTestTable(Configuration conf, HTableDescriptor desc, HColumnDescriptor hcd, int numRegionsPerServer)
13976 - int createPreSplitLoadTestTable(Configuration conf, HTableDescriptor desc, HColumnDescriptor[] hcds, int numRegionsPerServer)
13977 - HRegion createTestRegion(String tableName, HColumnDescriptor cd)
13978
13979 WALEdit class
13980 - WALEdit createFlushWALEdit(HRegionInfo hri, FlushDescriptor f)
13981 + WALEdit createFlushWALEdit(RegionInfo hri, FlushDescriptor f)
13982 - WALEdit createRegionEventWALEdit(HRegionInfo hri,RegionEventDescriptor regionEventDesc)
13983 + WALEdit createRegionEventWALEdit(RegionInfo hri,RegionEventDescriptor regionEventDesc)
13984 - WALEdit createCompaction(final HRegionInfo hri, final CompactionDescriptor c)
13985 + WALEdit createCompaction(final RegionInfo hri, final CompactionDescriptor c)
13986 - byte[] getRowForRegion(HRegionInfo hri)
13987 + byte[] getRowForRegion(RegionInfo hri)
13988 - WALEdit createBulkLoadEvent(HRegionInfo hri, WALProtos.BulkLoadDescriptor bulkLoadDescriptor)
13989 + - WALEdit createBulkLoadEvent(RegionInfo hri, WALProtos.BulkLoadDescriptor bulkLoadDescriptor)
13990
13991 RegionScanner class
13992 - HRegionInfo getRegionInfo();
13993 + RegionInfo getRegionInfo();
13994
13995 RegionPlan class
13996 - RegionPlan(final HRegionInfo hri, ServerName source, ServerName dest)
13997 + RegionPlan(final RegionInfo hri, ServerName source, ServerName dest)
13998
13999 Region class
14000 - HRegionInfo getRegionInfo();
14001 + RegionInfo getRegionInfo();
14002
14003 TableSnapshotInputFormat.TableSnapshotRegionSplit class
14004 \* HRegionInfo getRegionInfo()
14005 + RegionInfo getRegion()
14006
14007 RawAsyncTable.CoprocessorCallback class
14008 - void onRegionComplete(HRegionInfo region, R resp)
14009 + void onRegionComplete(RegionInfo region, R resp)
14010 - void onRegionError(RegionInfo region, Throwable error);
14011 + void onRegionError(HRegionInfo region, Throwable error);
14012
14013
14014 ---
14015
14016 * [HBASE-18826](https://issues.apache.org/jira/browse/HBASE-18826) | *Major* | **Use HStore instead of Store in our own code base and remove unnecessary methods in Store interface**
14017
14018 **WARNING: No release note provided for this change.**
14019
14020
14021 ---
14022
14023 * [HBASE-17732](https://issues.apache.org/jira/browse/HBASE-17732) | *Critical* | **Coprocessor Design Improvements**
14024
14025 We are moving from Inheritence
14026 - Observer \*is\* Coprocessor
14027 - FooService \*is\* CoprocessorService
14028 To Composition
14029 - Coprocessor \*has\* Observer
14030 - Coprocessor \*has\* Service
14031 ------------------------------------------------------
14032 Summary
14033 ------------------------------------------------------
14034 - Adds four new interfaces - MasterCoprocessor, RegionCoprocessor, RegionServierCoprocessor,
14035   WALCoprocessor
14036 - These new \*Coprocessor interfaces have a get\*Observer() function for each observer type
14037   supported by them.
14038 - Added Coprocessor#getService() to base interface. All extending \*Coprocessor interfaces will
14039   get it from the base interface.
14040 - Added BulkLoadObserver hooks to RegionCoprocessorHost instad of SecureBulkLoadManager doing its
14041   own trickery.
14042 - CoprocessorHost#find\*() fuctions: Too many testing hooks digging into CP internals.
14043   Deleted if can, else marked @VisibleForTesting.
14044 ------------------------------------------------------
14045 Backward Compatibility
14046 ------------------------------------------------------
14047 - Old coprocessors implementing \*Observer won't get loaded (no backward compatibility guarantees).
14048 - Third party coprocessors only implementing Coprocessor will not get loaded (just like Observers).
14049 - Old coprocessors implementing CoprocessorService (for master/region host)
14050   /SingletonCoprocessorService (for RegionServer host) will continue to work with 2.0.
14051 - Added test to ensure backward compatibility of CoprocessorService/SingletonCoprocessorService
14052 - Note that if a coprocessor implements both observer and service in same class, its service
14053   component will continue to work but it's observer component won't work.
14054
14055
14056 ---
14057
14058 * [HBASE-18298](https://issues.apache.org/jira/browse/HBASE-18298) | *Critical* | **RegionServerServices Interface cleanup for CP expose**
14059
14060 We used to pass the RegionServerServices (RSS) which gave Coprocesosrs (CP) all sort of access to internal Server machinery. We now only allows the CP a subset of the RSS in the form of the CPRSS Interface. Particulars:
14061
14062 Removed method getRegionServerServices from CP exposed RegionCoprocessorEnvironment and RegionServerCoprocessorEnvironment and replaced with getCoprocessorRegionServerServices. This returns a new interface CoprocessorRegionServerServices which is only a subset of RegionServerServices. With that below methods are no longer exposed for CPs
14063 WAL getWAL(HRegionInfo regionInfo)
14064 List\<WAL\> getWALs()
14065 FlushRequester getFlushRequester()
14066 RegionServerAccounting getRegionServerAccounting()
14067 RegionServerRpcQuotaManager getRegionServerRpcQuotaManager()
14068 SecureBulkLoadManager getSecureBulkLoadManager()
14069 RegionServerSpaceQuotaManager getRegionServerSpaceQuotaManager()
14070 void postOpenDeployTasks(final PostOpenDeployContext context)
14071 void postOpenDeployTasks(final Region r)
14072 boolean reportRegionStateTransition(final RegionStateTransitionContext context)
14073 boolean reportRegionStateTransition(TransitionCode code, long openSeqNum, HRegionInfo... hris)
14074 boolean reportRegionStateTransition(TransitionCode code, HRegionInfo... hris)
14075 RpcServerInterface getRpcServer()
14076 ConcurrentMap\<byte[], Boolean\> getRegionsInTransitionInRS()
14077 Leases getLeases()
14078 ExecutorService getExecutorService()
14079 Map\<String, Region\> getRecoveringRegions()
14080 public ServerNonceManager getNonceManager()
14081 boolean registerService(Service service)
14082 HeapMemoryManager getHeapMemoryManager()
14083 double getCompactionPressure()
14084 ThroughputController getFlushThroughputController()
14085 double getFlushPressure()
14086 MetricsRegionServer getMetrics()
14087 EntityLock regionLock(List\<HRegionInfo\> regionInfos, String description, Abortable abort)
14088 void unassign(byte[] regionName)
14089 Configuration getConfiguration()
14090 ZooKeeperWatcher getZooKeeper()
14091 ClusterConnection getClusterConnection()
14092 MetaTableLocator getMetaTableLocator()
14093 CoordinatedStateManager getCoordinatedStateManager()
14094 ChoreService getChoreService()
14095 void stop(String why)
14096 void abort(String why, Throwable e)
14097 boolean isAborted()
14098 void updateRegionFavoredNodesMapping(String encodedRegionName, List\<ServerName\> favoredNodes)
14099 InetSocketAddress[] getFavoredNodesForRegion(String encodedRegionName)
14100 void addToOnlineRegions(Region region)
14101 boolean removeFromOnlineRegions(final Region r, ServerName destination)
14102
14103 Also 3 methods name have been changed
14104 List\<Region\> getOnlineRegions(TableName tableName) -\> List\<Region\> getRegions(TableName tableName)
14105 List\<Region\> getOnlineRegions() -\> List\<Region\> getRegions()
14106 Region getFromOnlineRegions(final String encodedRegionName) -\> Region getRegion(final String encodedRegionName)
14107
14108
14109 ---
14110
14111 * [HBASE-16769](https://issues.apache.org/jira/browse/HBASE-16769) | *Blocker* | **Deprecate/remove PB references from MasterObserver and RegionServerObserver**
14112
14113 Signature of below methods in MasterObserver changed and instead of org.apache.hadoop.hbase.shaded.protobuf.generated.SnapshotDescription param, we will be passing org.apache.hadoop.hbase.client.SnapshotDescription
14114 preListSnapshot
14115 postListSnapshot
14116 preSnapshot
14117 postSnapshot
14118 preCloneSnapshot
14119 postCloneSnapshot
14120 preRestoreSnapshot
14121 postRestoreSnapshot
14122 preDeleteSnapshot
14123 postDeleteSnapshot
14124
14125 Also changed signature of RegionServerObserver#preReplicateLogEntries and preReplicateLogEntries by removing params List\<org.apache.hadoop.hbase.shaded.protobuf.generated.AdminProtos.WALEntry\>, org.apache.hadoop.hbase.CellScanner
14126
14127
14128 ---
14129
14130 * [HBASE-18859](https://issues.apache.org/jira/browse/HBASE-18859) | *Major* | **Purge PB from BulkLoadObserver**
14131
14132 No longer pass the protobuf request to prePrepareBulkLoad and preCleanupBulkLoad in BulkLoadObserver as part of our effort to purge protobuf from our Coprocessor API Interface (if you need to read the Table and RegionInfo, pull it from the passed in RegionCoprocessorEnvironment ObserverContext).
14133
14134
14135 ---
14136
14137 * [HBASE-18731](https://issues.apache.org/jira/browse/HBASE-18731) | *Major* | **[compat 1-2] Mark protected methods of QuotaSettings that touch Protobuf internals as IA.Private**
14138
14139 The following methods in QuotaSettings were annotated InterfaceAudience.Private; they are for internal use only in hbase-2.0.0
14140
14141 buildSetQuotaRequestProto(final QuotaSettings settings)
14142 setupSetQuotaRequest(SetQuotaRequest.Builder builder)
14143
14144 Note that there were versions of these methods in HBase 1.y that used classes in the {{org.apache.hadoop.hbase.protobuf.generated}} package. That package no longer exists as a part of our cleanup of protobufs from our public facing API and the related methods have been removed.
14145
14146
14147 ---
14148
14149 * [HBASE-18825](https://issues.apache.org/jira/browse/HBASE-18825) | *Major* | **Use HStoreFile instead of StoreFile in our own code base and remove unnecessary methods in StoreFile interface**
14150
14151 Cleanup the StoreFile interface.
14152
14153 The metadata keys are moved to HStoreFile.
14154
14155 These methods are removed:
14156 CacheConfig getCacheConf();
14157 byte[] getMetadataValue(byte[] key);
14158 boolean isCompactedAway();
14159 boolean isReferencedInReads();
14160 void initReader() throws IOException;
14161 StoreFileScanner getPreadScanner(boolean cacheBlocks, long readPt, long scannerOrder, boolean canOptimizeForNonNullColumn);
14162 StoreFileScanner getStreamScanner(boolean canUseDropBehind, boolean cacheBlocks, boolean isCompaction, long readPt, long scannerOrder, boolean canOptimizeForNonNullColumn) throws IOException;
14163 StoreFileReader getReader();
14164 void closeReader(boolean evictOnClose) throws IOException;
14165 void markCompactedAway();
14166 void deleteReader() throws IOException;
14167
14168 Notice that these methods are still available in HStoreFile.
14169
14170 And the return value of getFirstKey and getLastKey are changed from Cell to Optional\<Cell\> to better indicate that they may not be available.
14171
14172
14173 ---
14174
14175 * [HBASE-18786](https://issues.apache.org/jira/browse/HBASE-18786) | *Major* | **FileNotFoundException should not be silently handled for primary region replicas**
14176
14177 FileNotFoundException opening a StoreFile in a primary replica now causes a RegionServer to crash out where before it would be ignored (or optionally handled via close/reopen).
14178
14179
14180 ---
14181
14182 * [HBASE-10504](https://issues.apache.org/jira/browse/HBASE-10504) | *Blocker* | **Define Replication Interface**
14183
14184 Adds a new plugin point ReplicationEndpoint. ReplicationSource, internal to hbase, tails the WAL and calls registered ReplicationEndpoints. ReplicationEndpoint implementations are responsible for actually shipping the edits to the other (hbase or non-hbase) cluster. ReplicationEndpoint can be defined per peer. Default inter-cluster replication works without any changes (lily etc should still work). ReplicationEndpoints have various facility including means for filtering out WAL edits source-side before they can be shipped to remote peers.
14185
14186
14187 ---
14188
14189 * [HBASE-18142](https://issues.apache.org/jira/browse/HBASE-18142) | *Major* | **Deletion of a cell deletes the previous versions too**
14190
14191 Now, delete.rb won't delete all versions of the specified column. It only delete the specified version (if user assigns a timestamp) or the latest version (default behavior)
14192
14193
14194 ---
14195
14196 * [HBASE-18446](https://issues.apache.org/jira/browse/HBASE-18446) | *Critical* | **Mark StoreFileScanner/StoreFileReader as IA.LimitedPrivate(Phoenix)**
14197
14198 Mark StoreFileScanner and StoreFileReader as IA.LimitPrivate(Phoenix).
14199 Deprecated the preStoreFileReaderOpen and postStoreFileReaderOpen method in RegionObserver to indicate that these methods are only supposed to be used by Phoenix.
14200
14201
14202 ---
14203
14204 * [HBASE-18798](https://issues.apache.org/jira/browse/HBASE-18798) | *Major* | **Remove the unused methods in RegionServerObserver**
14205
14206 Remove the following APIs from RegionServerObserver:
14207 # preRollBackMerge
14208 # postRollBackMerge
14209 # preMergeCommit
14210 # postMergeCommit
14211 # postMerge
14212 # preMerge
14213
14214
14215 ---
14216
14217 * [HBASE-18831](https://issues.apache.org/jira/browse/HBASE-18831) | *Major* | **Add explicit dependency on javax.el**
14218
14219 Specify an explicit version for javax.el. Without it we rely on repository cached metadata of which a prevalent version seems to list all versions between b01 and b08 but finishes with a b08-jbossorg which is in the jboss repo, a repo most of us do not list in our poms.
14220
14221
14222 ---
14223
14224 * [HBASE-17980](https://issues.apache.org/jira/browse/HBASE-17980) | *Major* | **Any HRegionInfo we give out should be immutable**
14225
14226 Provide alternate user-facing API that takes a RegionInfo Interface instead of a HRegionInfo; the old HRegionInfo methods have been deprecated in 2.0.0 and will be removed in 3.0.0.
14227
14228
14229 ---
14230
14231 * [HBASE-14004](https://issues.apache.org/jira/browse/HBASE-14004) | *Critical* | **[Replication] Inconsistency between Memstore and WAL may result in data in remote cluster that is not in the origin**
14232
14233 Now when replicating a wal file which is still opened for write, we will get its committed length from the WAL instance in the same RS to prevent replicating uncommit WALEdit.
14234
14235 This is very important if you use AsyncFSWAL, as we use fan-out in AsyncFSWAL. The data written to DN will be visible immediately as all DNs think it is the end of a pipeline, although the client has not received an ack, and also NN may truncate the file if the client crashes at the same time.
14236
14237
14238 ---
14239
14240 * [HBASE-18819](https://issues.apache.org/jira/browse/HBASE-18819) | *Major* | **Set version number to 2.0.0-alpha3 from 2.0.0-alpha3-SNAPSHOT**
14241
14242 Set version on branch-2 to be 2.0.0-alpha3 as part of RC making.
14243
14244
14245 ---
14246
14247 * [HBASE-18683](https://issues.apache.org/jira/browse/HBASE-18683) | *Major* | **Upgrade hbase to commons-math 3**
14248
14249 Moved on to commons-math3. Removed commons-math2.
14250
14251
14252 ---
14253
14254 * [HBASE-18453](https://issues.apache.org/jira/browse/HBASE-18453) | *Major* | **CompactionRequest should not be exposed to user directly**
14255
14256 Introduce a CompactionLifeCycleTracker to let the CP users know when the compaction starts and ends. CompactionRequest is marked as IA.Private and should be used in CP implementation any more.
14257
14258
14259 ---
14260
14261 * [HBASE-18794](https://issues.apache.org/jira/browse/HBASE-18794) | *Major* | **Remove deprecated methods in MasterObserver**
14262
14263 The removed APIs are shown below.
14264 # preCreateTableHandler
14265 # postCreateTableHandler
14266 # preDeleteTableHandler
14267 # postDeleteTableHandler
14268 # preTruncateTableHandler
14269 # postTruncateTableHandler
14270 # preModifyTableHandler
14271 # postModifyTableHandler
14272 # preAddColumn
14273 # postAddColumn
14274 # preAddColumnHandler
14275 # postAddColumnHandler
14276 # preModifyColumn
14277 # postModifyColumn
14278 # preModifyColumnHandler
14279 # postModifyColumnHandler
14280 # preDeleteColumn
14281 # postDeleteColumn
14282 # preDeleteColumnHandler
14283 # postDeleteColumnHandler
14284 # preEnableTableHandler
14285 # postEnableTableHandler
14286 # preDisableTableHandler
14287 # postDisableTableHandler
14288 # preDispatchMerge
14289 # postDispatchMerge
14290
14291
14292 ---
14293
14294 * [HBASE-14998](https://issues.apache.org/jira/browse/HBASE-14998) | *Blocker* | **Unify synchronous and asynchronous methods in Admin and cleanup**
14295
14296  \* Deprecates getAlterStatus. Everywhere else we talk of 'modify' rather
14297        'alter' and should use Future returned from async instead.
14298  \* isTableAvailable(TableName, byte [][]) has been deprecated to be
14299        removed; use the overrie instead. This is a weird method.
14300  \* Changed listTableDescriptor to getDescriptor.
14301  \* Renamed other like methods to have same pattern (deprecating the old):
14302         balancer =\> balance
14303         setBalancerRunning =\> balancerSwitch
14304         setNormalizerRunning =\> normalizerSwitch
14305         enableCatalogJanitor =\> catalogJanitorSwitch
14306         setCleanerChoreRunning =\> cleanerChoreSwitch
14307         setSplitOrMergeEnabled =\> splitOrMergeEnabledSwitch
14308
14309  \* Renamed (with deprecation of old) runCatalogScan =\> runCatalogJanitor.
14310  \* Reviewed generated javadoc and made some edits; purged reference to
14311        hbase issues from our API, fixed param names, etc.
14312  \* Made all the enable services methods have same pattern.
14313  \* Renamed takeSnapshotAsync as snapshotAsync (with deprecation of old)
14314  \* Renamed execProcedureWithRet as execProcedureWithReturn (with
14315        deprecation)
14316
14317
14318 ---
14319
14320 * [HBASE-18723](https://issues.apache.org/jira/browse/HBASE-18723) | *Major* | **[pom cleanup] Do a pass with dependency:analyze; remove unused and explicity list the dependencies we exploit**
14321
14322 Purged a bunch of dependencies included but unused. Added reference to dependencies we do use but did not list (transitively included). Purged all but junit from parent pom dependency set and did explicit include in modules instead; not all modules need mockito, etc. Still work to do: grey area around hadoop and its transitive includes need cleanup still to make the  dependency:analyze runs clean. Also figure how to purge junit from parent dependency list.
14323
14324
14325 ---
14326
14327 * [HBASE-17823](https://issues.apache.org/jira/browse/HBASE-17823) | *Major* | **Migrate to Apache Yetus Audience Annotations**
14328
14329 HBase now uses stability and audience annotations sourced from Apache Yetus, instead of the custom annotations that were previously in place.
14330
14331
14332 ---
14333
14334 * [HBASE-18793](https://issues.apache.org/jira/browse/HBASE-18793) | *Major* | **Remove deprecated methods in RegionObserver**
14335
14336 These deprecated methods are removed from RegionObserver:
14337 InternalScanner preFlushScannerOpen(ObserverContext, Store, List, InternalScanner) throws IOException;
14338 void preCompactSelection(ObserverContext, Store, List) throws IOException;
14339 void postCompactSelection(ObserverContext, Store, ImmutableList);
14340 InternalScanner preCompact(ObserverContext, Store, InternalScanner, ScanType) throws IOException;
14341 InternalScanner preCompactScannerOpen(ObserverContext, Store, List, ScanType, long, InternalScanner, CompactionRequest) throws IOException;
14342 InternalScanner preCompactScannerOpen( ObserverContext, Store store, List, ScanType, long, InternalScanner) throws IOException;
14343 void preSplit(ObserverContext) throws IOException;
14344 void preSplit(ObserverContext, byte[]) throws IOException;
14345 void postSplit(ObserverContext, Region, Region) throws IOException;
14346 void preSplitBeforePONR(ObserverContext, byte[], List) throws IOException;
14347 void preSplitAfterPONR(ObserverContext) throws IOException;
14348 void preRollBackSplit(ObserverContext) throws IOException;
14349 void postRollBackSplit(ObserverContext) throws IOException;
14350 void postCompleteSplit(ObserverContext) throws IOException;
14351 long preIncrementColumnValue(ObserverContext, byte[], byte[], byte[], long, boolean) throws IOException;
14352 long postIncrementColumnValue(ObserverContextc, byte[], byte[], byte[], long, boolean, long) throws IOException;
14353 KeyValueScanner preStoreScannerOpen(ObserverContext, Store, Scan, NavigableSet, KeyValueScanner) throws IOException;
14354 boolean postScannerFilterRow(ObserverContext, InternalScanner, byte[], int, short, boolean) throws IOException;
14355 boolean postBulkLoadHFile(ObserverContext, List, boolean) throws IOException;
14356
14357 And this method is also removed since we never call it in our code base:
14358 InternalScanner preFlushScannerOpen(ObserverContext, Store, KeyValueScanner, InternalScanner, long) throws IOException;
14359
14360 The deprecated annotation is removed for these two methods as they are still being used:
14361 void preFlush(ObserverContext) throws IOException;
14362 void postFlush(ObserverContextc) throws IOException;
14363
14364
14365 ---
14366
14367 * [HBASE-18733](https://issues.apache.org/jira/browse/HBASE-18733) | *Major* | **[compat 1-2] Hide WALKey**
14368
14369 WALKey, @InterfaceAudience.LimitedPrivate(HBaseInterfaceAudience.REPLICATION), changed a bunch for 2.0.0. See below. We figured it ok hiding it since it should be internals anyway -- only we should be making them.
14370
14371
14372 ---
14373
14374 * [HBASE-13271](https://issues.apache.org/jira/browse/HBASE-13271) | *Critical* | **Table#puts(List\<Put\>) operation is indeterminate; needs fixing**
14375
14376 Adds more spec on how Get, Delete, and Put work and how they differ to help the user.
14377
14378
14379 ---
14380
14381 * [HBASE-16479](https://issues.apache.org/jira/browse/HBASE-16479) | *Major* | **Move WALEdit from hbase.regionserver.wal package to hbase.wal package**
14382
14383 Incompatible move of WALEdit class from regionserver.wal to wal. Effects @InterfaceAudience.LimitedPrivate({ HBaseInterfaceAudience.REPLICATION,
14384     HBaseInterfaceAudience.COPROC })
14385
14386 (
14387
14388
14389 ---
14390
14391 * [HBASE-10240](https://issues.apache.org/jira/browse/HBASE-10240) | *Critical* | **Remove 0.94-\>0.96 migration code**
14392
14393 Purge 0.94=\>0.96 deprecated, migration code. This means that if you are on 0.94 and wish to go to hbase 2.0, you must first migrate to a version of hbase that is \>= 0.96.
14394
14395
14396 ---
14397
14398 * [HBASE-18783](https://issues.apache.org/jira/browse/HBASE-18783) | *Minor* | **Declare the builder of ClusterStatus as IA.Private, and remove the Writables from ClusterStatus**
14399
14400 **WARNING: No release note provided for this change.**
14401
14402
14403 ---
14404
14405 * [HBASE-18106](https://issues.apache.org/jira/browse/HBASE-18106) | *Critical* | **Redo ProcedureInfo and LockInfo**
14406
14407 Admin.listProcedures and Admin.listLocks were renamed to getProcedures and getLocks (listProcedures was added to hbase 1.2). This change was done in an incompatible way -- we just yanked listProcedures (Because Admin Interface is not compatible with hbase1).
14408
14409     Main changes:
14410     - ProcedureInfo and LockInfo were removed, we use JSON instead of them
14411     - Procedure and LockedResource are their server side equivalent
14412     - Procedure protobuf state\_data became obsolate, it is only kept for
14413       reading previously written WAL
14414     - Procedure protobuf contains a state\_message field, which stores the internal
14415       state messages (Any type instead of bytes)
14416     - Procedure.serializeStateData and deserializeStateData were changed slightly
14417     - Procedures internal states are available on client side
14418     - Procedures are displayed on web UI and in shell in the following jruby format:
14419       { ID =\> '1', PARENT\_ID = '-1', PARAMETERS =\> [ ..extra state information.. ] }
14420
14421
14422 ---
14423
14424 * [HBASE-18621](https://issues.apache.org/jira/browse/HBASE-18621) | *Major* | **Refactor ClusterOptions before applying to code base**
14425
14426 Provide a new way to get desired ClusterStatus with a set of ClusterStatus.Option, such that the response back to client can be limited.
14427 Note that, the constructor way to new a ClusterStatus will be no longer support after 2.0.0,  and use ClusterStatus.Builder instead.
14428
14429
14430 ---
14431
14432 * [HBASE-18780](https://issues.apache.org/jira/browse/HBASE-18780) | *Minor* | **Remove HLogPrettyPrinter and hlog command**
14433
14434 **WARNING: No release note provided for this change.**
14435
14436
14437 ---
14438
14439 * [HBASE-14997](https://issues.apache.org/jira/browse/HBASE-14997) | *Critical* | **Move compareOp and Comparators out of filter to client package**
14440
14441 Deprecate checkAnd\* APIs that take the filter CompareOp. Added new overrides that take a generic CompareOperator instead. CompareOperator will be used by checkAnd\* in Table API and by filters going forward.
14442
14443 Other nice improvements suggested by this issue have been moved out to HBASE-18774.
14444
14445
14446 ---
14447
14448 * [HBASE-17972](https://issues.apache.org/jira/browse/HBASE-17972) | *Minor* | **Remove mergePool from CompactSplitThread**
14449
14450 After this jira, mergePool will be permanently removed from CompactSplitThread.
14451
14452
14453 ---
14454
14455 * [HBASE-18704](https://issues.apache.org/jira/browse/HBASE-18704) | *Major* | **Upgrade hbase to commons-collections 4**
14456
14457 **WARNING: No release note provided for this change.**
14458
14459
14460 ---
14461
14462 * [HBASE-18697](https://issues.apache.org/jira/browse/HBASE-18697) | *Major* | **Need a shaded hbase-mapreduce module**
14463
14464 Replaces hbase-shaded-server-\<version\>.jar with hbase-shaded-mapreduce-\<version\>.jar.
14465
14466
14467 ---
14468
14469 * [HBASE-15607](https://issues.apache.org/jira/browse/HBASE-15607) | *Blocker* | **Remove PB references from Admin for 2.0**
14470
14471 All the references to Protos in Admin.java have been removed and replaced with respective POJO classes.
14472 The references to Protos that were removed are
14473 AdminProtos.GetRegionInfoResponse,
14474 HBaseProtos.SnapshotDescription, HBaseProtos.SnapshotDescription.Type,
14475  MasterProtos.SnapshotResponse.
14476 CompactionType, CompactionState and MasterSwitchType Enums have been moved out of Admin.java to standalone Enums.
14477
14478
14479 ---
14480
14481 * [HBASE-18674](https://issues.apache.org/jira/browse/HBASE-18674) | *Major* | **upgrade hbase to commons-lang3**
14482
14483 Move to commons-lang3 from common-lang (check it out!... Nice lib...Some nice utility)
14484
14485
14486 ---
14487
14488 * [HBASE-18736](https://issues.apache.org/jira/browse/HBASE-18736) | *Major* | **Cleanup the HTD/HCD for Admin**
14489
14490 Changed the passed arguments from HTD/HCD to TD/CFD for Admin.
14491
14492
14493 ---
14494
14495 * [HBASE-18699](https://issues.apache.org/jira/browse/HBASE-18699) | *Major* | **Copy LoadIncrementalHFiles to another package and mark the old one as deprecated**
14496
14497 Introduce a new o.a.h.h.tool.LoadIncrementalHFiles. The old o.a.h.h.mapreduce.LoadIncrementalHFiles is deprecated and will be removed in 3.0.0.
14498
14499
14500 ---
14501
14502 * [HBASE-18739](https://issues.apache.org/jira/browse/HBASE-18739) | *Major* | **Make all TimeRange Constructors InterfaceAudience Private.**
14503
14504 All constructors have already been deprecated. This change makes them InterfaceAudience Private.
14505
14506
14507 ---
14508
14509 * [HBASE-18675](https://issues.apache.org/jira/browse/HBASE-18675) | *Minor* | **Making {max,min}SessionTimeout configurable for MiniZooKeeperCluster**
14510
14511 <!-- markdown -->
14512
14513
14514 Standalone clusters and minicluster instances can now configure the session timeout for our embedded ZooKeeper quorum using `hbase.zookeeper.property.minSessionTimeout` and `hbase.zookeeper.property.maxSessionTimeout`.
14515
14516
14517 ---
14518
14519 * [HBASE-15806](https://issues.apache.org/jira/browse/HBASE-15806) | *Critical* | **An endpoint-based export tool**
14520
14521 org.apache.hadoop.hbase.coprocessor.Export
14522 Instructs HBase to dump the contents of table to HDFS in a sequence file
14523 + replaces MR by endpoint (see org.apache.hadoop.hbase.mapreduce.Export)
14524 + no large data to be transfered between hbase server and client
14525 + same command line as org.apache.hadoop.hbase.mapreduce.Export
14526 - user needs to alter table for deploying ExportEndpoint
14527 - user needs to adjust the endpoint timeout for dumping large data
14528 - user needs to get the EXECUTE permission
14529
14530
14531 ---
14532
14533 * [HBASE-18577](https://issues.apache.org/jira/browse/HBASE-18577) | *Critical* | **shaded client includes several non-relocated third party dependencies**
14534
14535 <!-- markdown -->
14536
14537
14538 The HBase shaded artifacts (hbase-shaded-client and hbase-shaded-server) no longer contain several non-relocated third party dependency classes that were mistakenly included. Downstream users who relied on these classes being present will need to add a runtime dependency onto an appropriate third party artifact.
14539
14540 Previously, we erroneously packaged several third party libs without relocating them. In some cases these libraries have now been relocated; in some cases they are no longer included at all.
14541
14542 Includes:
14543
14544 * jaxb
14545 * jetty
14546 * jersey
14547 * codahale metrics (HBase 1.4+ only)
14548 * commons-crypto
14549 * jets3t
14550 * junit
14551 * curator (HBase 1.4+)
14552 * netty 3 (HBase 1.1)
14553 * mokito-junit4 (HBase 1.1)
14554
14555 There is now testing to ensure that the shaded artifacts only contain expected relocated content. It can be run via `mvn -Dtest=noUnitTests -pl hbase-shaded/hbase-shaded-check-invariants -am -Prelease verify`.
14556
14557 For version 2.0+ this patch removes hadoop-mapreduce-client-core from the set of dependencies included for the hbase-client and hbase-shaded-client artifacts.
14558
14559 For 2.0+, the slf4j-log4j12 dependency is now optional for both shaded artifacts.
14560
14561
14562 ---
14563
14564 * [HBASE-14745](https://issues.apache.org/jira/browse/HBASE-14745) | *Blocker* | **Shade the last few dependencies in hbase-shaded-client**
14565
14566 Previously some dependencies in hbase-shaded-client were still leaking into the un-shaded namespace. This should now be fixed.
14567
14568 Additionally the rat checking on generated intermediate files from shading should be skipped.
14569
14570
14571 ---
14572
14573 * [HBASE-18665](https://issues.apache.org/jira/browse/HBASE-18665) | *Critical* | **ReversedScannerCallable invokes getRegionLocations incorrectly**
14574
14575 Performing reverse scan on tables used the meta cache incorrectly and fetched data from meta table every time. This fix solves this issue and which results in performance improvement for reverse scans.
14576
14577
14578 ---
14579
14580 * [HBASE-3935](https://issues.apache.org/jira/browse/HBASE-3935) | *Major* | **HServerLoad.storefileIndexSizeMB should be changed to storefileIndexSizeKB**
14581
14582 This patch removed the storefile\_index\_size\_MB in protobuf. It will cause the value of storefile\_index\_size\_MB is zero if user still use hbase-client 1.x.
14583
14584
14585 ---
14586
14587 * [HBASE-18640](https://issues.apache.org/jira/browse/HBASE-18640) | *Major* | **Move mapreduce out of hbase-server into separate hbase-mapreduce module**
14588
14589 - Moves all org.apache.hadoop.hbase.mapreduce.\* (except LoadIncrementalHFiles) and org.apache.hadoop.hbase.mapred.\* classes from hbase-server module to new hbase-mapreduce module.
14590 - Also moves following tools from hbase-server module to hbase-mapreduce module: CompactionTool, ExportSnapshot, PerformanceEvaluation, LoadTestTool
14591 - Very minor breakages in  LoadTestTool(LimitedPrivate HBaseInterfaceAudience.TOOLS)
14592
14593
14594 ---
14595
14596 * [HBASE-18519](https://issues.apache.org/jira/browse/HBASE-18519) | *Major* | **Use builder pattern to create cell**
14597
14598 Introduce the CellBuilder helper.
14599 1) Using CellBuilderFactory to get CellBuilder for creating cell with row,
14600     column, qualifier, type, and value.
14601 2) For internal use, the ExtendedCellBuilder, which is created by ExtendedCellBuilderFactory, is able to build cell with extra fields - sequence id and tags -
14602
14603
14604 ---
14605
14606 * [HBASE-18448](https://issues.apache.org/jira/browse/HBASE-18448) | *Minor* | **EndPoint example  for refreshing HFiles for stores**
14607
14608 Adds a new RefreshHFiles Coprocessor Endpoint example. Includes client and serverside-endpoint that iterates region Stores to call #refreshStoreFiles.
14609
14610
14611 ---
14612
14613 * [HBASE-18658](https://issues.apache.org/jira/browse/HBASE-18658) | *Major* | **Purge hokey hbase Service implementation; use (internal) Guava Service instead**
14614
14615 Removed hbase Service class. It was not fully-formed. Now Guava is relocated, use its Service instead internally; it has nice implementation facility too in AbstractService.
14616
14617
14618 ---
14619
14620 * [HBASE-15982](https://issues.apache.org/jira/browse/HBASE-15982) | *Blocker* | **Interface ReplicationEndpoint extends Guava's Service**
14621
14622     Breaking change to our ReplicationEndpoint and BaseReplicationEndpoint.
14623
14624     ReplicationEndpoint implemented Guava 0.12 Service. An abstract
14625     subclass, BaseReplicationEndpoint, provided default implementations
14626     and facility, among other things, by extending Guava's
14627     AbstractService class.
14628
14629     Both of these HBase classes were marked LimitedPrivate for
14630     REPLICATION so these classes were semi-public and made it so
14631     Guava 0.12 was part of our API.
14632
14633     Having Guava in our API was a mistake. It anchors us and the
14634     implementation of the Interface to Guava 0.12. This is untenable
14635     given Guava changes and that the Service Interface in particular
14636     has had extensive revamp and improvement done. We can't hold to
14637     the Guava Interface. It changed. We can't stay on Guava 0.12;
14638     implementors and others on our CLASSPATH won't abide being stuck
14639     on an old Guava.
14640
14641     So we make breaking changes. The unhitching of our Interface
14642     from Guava could only be done in a breaking manner. It undoes the
14643     LimitedPrivate on BaseReplicationEndpoint while keeping it for the RE
14644     Interface. It means consumers will have to copy/paste the
14645     AbstractService-based BRE into their own codebase also supplying their
14646     own Guava; HBase no longer 'supplies' this (our Guava usage has
14647     been internalized, relocated).
14648
14649     This patch then adds into RE the basic methods RE needs of the old
14650     Guava Service rather than return a Service to start/stop only to go
14651     back to the RE instance to do actual work. A few method names had to
14652     be changed so could make implementations with Guava Service internally
14653     and not have RE method names and types clash). Semantics remained the
14654     same otherwise. For example startAsync and stopAsync in Guava are start
14655     and stop in RE.
14656
14657
14658 ---
14659
14660 * [HBASE-18347](https://issues.apache.org/jira/browse/HBASE-18347) | *Major* | **Implement a BufferedMutator for async client**
14661
14662 Introduce an AsyncBufferedMutator for batching requests to HBase for a single table.
14663
14664 Use AsyncConnection.getBufferedMutator method to get an AsyncBufferedMutator instance.
14665
14666
14667 ---
14668
14669 * [HBASE-18546](https://issues.apache.org/jira/browse/HBASE-18546) | *Critical* | **Always overwrite the TS for Append/Increment unless no existing cells are found**
14670
14671 If there is no existing cell in submitting Append/Increment, the custom ts won't be overridden. By contrast, the cell's ts will always be overridden by server.
14672
14673
14674 ---
14675
14676 * [HBASE-18224](https://issues.apache.org/jira/browse/HBASE-18224) | *Critical* | **Upgrade jetty**
14677
14678 Moved from Jetty 9.3.x to 9.4.x.
14679
14680 Jetty returns more correct HTTP code when Header is too long, 431 instead of 413, and it requires more threads to start up (made default 16 instead of 10).
14681
14682
14683 ---
14684
14685 * [HBASE-17442](https://issues.apache.org/jira/browse/HBASE-17442) | *Critical* | **Move most of the replication related classes from hbase-client to hbase-replication package**
14686
14687 Move replication implementation's classes from hbase-client to hbase-replication package.
14688
14689
14690 ---
14691
14692 * [HBASE-18653](https://issues.apache.org/jira/browse/HBASE-18653) | *Major* | **Undo hbase2 check against \< hadoop2.6.x; i.e. implement agreed drop of hadoop 2.4 and 2.5 support in hbase2**
14693
14694 Change the yetus profile for branch-2 so it no longer runs hadoop 2.4.x and 2.5.x build checks.
14695
14696
14697 ---
14698
14699 * [HBASE-18630](https://issues.apache.org/jira/browse/HBASE-18630) | *Major* | **Prune dependencies; as is branch-2 has duplicates**
14700
14701 Removed doubled instances of javax.inject and commons-beanutils where the versions were close.
14702
14703 Other instances of 'double' includes have different groupids so wary pruning especially when transitive includes (hadoop or jetty et al.)
14704
14705
14706 ---
14707
14708 * [HBASE-18631](https://issues.apache.org/jira/browse/HBASE-18631) | *Minor* | **Allow configuration of ChaosMonkey properties via hbase-site**
14709
14710 This change invalidates the need for a separate Java properties file to configure the ChaosMonkey included with HBase. These properties can be provided directly in hbase-site.xml. If configuration in provided in both locations, the Java properties file takes precendence.
14711
14712
14713 ---
14714
14715 * [HBASE-18489](https://issues.apache.org/jira/browse/HBASE-18489) | *Major* | **Expose scan cursor in RawScanResultConsumer**
14716
14717 Add a 'cursor' method which returns an 'Optional\<Cursor\>' in 'RawScanResultConsumer.ScanController'. You can use this method to obtain the scan cursor if available.
14718
14719
14720 ---
14721
14722 * [HBASE-18511](https://issues.apache.org/jira/browse/HBASE-18511) | *Blocker* | **Default no regions on master**
14723
14724 Changes the configuration hbase.balancer.tablesOnMaster from list of table names that the can carry (with 'none' meaning no tables on the master) to instead be a boolean that is set to true if master carries tables/regions and false if it does not. If true, the master acts like any regionserver.
14725
14726 If false, then the master carries no tables. This is the default for hbase-2.0.0.
14727
14728 Another boolean configuration, hbase.balancer.tablesOnMaster.systemTablesOnly, when set to true, enables hbase.balancer.tablesOnMaster and makes it so the master hosts system tables exclusively (the long-time deploy mode of master branch and branch-2 up until this commit).
14729
14730 UPDATE: This is broke. See HBASE-19785.
14731 UPDATE2: Master carrying Regions does not work reliably, see HBASE-19828.
14732
14733 See HBASE-19831, the issue to fix regions on Master
14734
14735 The change of hbase.balancer.tablesOnMaster from String list to boolean and
14736 the addition of a simple boolean to enable system-tables on Master was done
14737 to constrain what operators might ask for via this master configuration.
14738 Stipulating what tables are bound to the Master server verges into
14739 regionserver grouping territory, a more robust means of specifying table
14740 and server combinations. Operators should use this latter if they want
14741 layouts more exotic than those supplied by the provided booleans.
14742
14743
14744 ---
14745
14746 * [HBASE-18553](https://issues.apache.org/jira/browse/HBASE-18553) | *Major* | **Expose scan cursor for asynchronous scanner**
14747
14748 The ResultScanner which is gotten from an AsyncTable will also return cursor results if Scan.isNeedCursorResult is true.
14749
14750
14751 ---
14752
14753 * [HBASE-18598](https://issues.apache.org/jira/browse/HBASE-18598) | *Minor* | **AsyncNonMetaRegionLocator use FIFO algorithm to get a candidate locate request**
14754
14755 Introduce FIFO algorithm to get a candidate locate request for AsyncNonMetaRegionLocator.
14756
14757
14758 ---
14759
14760 * [HBASE-18533](https://issues.apache.org/jira/browse/HBASE-18533) | *Major* | **Expose BucketCache values to be configured**
14761
14762 This patch exposes configuration for Bucketcache. These configs are very similar to those for the LRU cache, but are described below:
14763
14764 "hbase.bucketcache.single.factor"; /\*\* Single access bucket size \*/
14765 "hbase.bucketcache.multi.factor"; /\*\* Multiple access bucket size \*/
14766 "hbase.bucketcache.memory.factor"; /\*\* In-memory bucket size \*/
14767 "hbase.bucketcache.extrafreefactor"; /\*\* Free this floating point factor of extra blocks when evicting. For example free the number of blocks requested \* (1 + extraFreeFactor) \*/
14768 "hbase.bucketcache.acceptfactor"; /\*\* Acceptable size of cache (no evictions if size \< acceptable) \*/
14769 "hbase.bucketcache.minfactor"; /\*\* Minimum threshold of cache (when evicting, evict until size \< min) \*/
14770
14771
14772 ---
14773
14774 * [HBASE-18528](https://issues.apache.org/jira/browse/HBASE-18528) | *Critical* | **DON'T allow user to modify the passed table/column descriptor**
14775
14776 **WARNING: No release note provided for this change.**
14777
14778
14779 ---
14780
14781 * [HBASE-18271](https://issues.apache.org/jira/browse/HBASE-18271) | *Blocker* | **Shade netty**
14782
14783 Depend on hbase-thirdparty for our netty instead of directly relying on netty-all. netty is relocated in hbase-thirdparty from io.netty to org.apache.hadoop.hbase.shaded.io.netty. One kink is that netty bundles an .so. Its files also are relocated. So netty can find the .so content, need to specify on command-line a system property telling netty about the shading.
14784
14785 The .so trick is from
14786              https://stackoverflow.com/questions/33825743/rename-files-inside-a-jar-using-some-maven-plugin
14787
14788 In essence we need the below defined whenever we run tests or deploy:
14789
14790 -Dorg.apache.hadoop.hbase.shaded.io.netty.packagePrefix=org.apache.hadoop.hbase.shaded.
14791
14792 (The trailing '.' is required)
14793
14794 See toward the end of this issue for how to pass config: https://github.com/netty/netty/issues/6665
14795
14796 The system property has been added to bin/hbase. If starting hbase with other than bin/hbase, add this system property (at least on linux).
14797
14798 For devs, going forward, do not reference io.netty. Reference org.apache.hadoop.hbase.io.netty instead. Here is sample:
14799
14800 {code}
14801 -import io.netty.channel.Channel;
14802 -import io.netty.channel.EventLoop;
14803 +import org.apache.hadoop.hbase.shaded.io.netty.channel.Channel;
14804 +import org.apache.hadoop.hbase.shaded.io.netty.channel.EventLoop;
14805 {code}
14806
14807
14808 ---
14809
14810 * [HBASE-15511](https://issues.apache.org/jira/browse/HBASE-15511) | *Major* | **ClusterStatus should be able to return responses by scope**
14811
14812 Provide a new way to get desired ClusterStatus with a set of ClusterStatus.Option, such that the response back to client can be limited.
14813 Note that, the constructor way to new a ClusterStatus will be no longer support after 2.0.0,  and use ClusterStatus.Builder instead.
14814
14815
14816 ---
14817
14818 * [HBASE-18551](https://issues.apache.org/jira/browse/HBASE-18551) | *Major* | **[AMv2] UnassignProcedure and crashed regionservers**
14819
14820 Unassign will not proceed if it is unable to talk to the remote server. Now it will expire the server it is unable to communicate with and then wait until it is signaled by ServerCrashProcedure that the server's logs have been split. Only then will judge the unassign successful.
14821
14822 We do this because a subsequent assign lacking the crashed server context might open a region w/o first splitting logs.
14823
14824
14825 ---
14826
14827 * [HBASE-18469](https://issues.apache.org/jira/browse/HBASE-18469) | *Critical* | **Correct  RegionServer metric of  totalRequestCount**
14828
14829 In HBASE-18469 we introduced a new RegionServer metrics in name of "totalRowActionRequestCount" which counts in all row actions and equals to the sum of "readRequestCount" and "writeRequestCount". Meantime, we have changed "totalRequestCount" to count only once for multi request, while previously we will count in action number of the request. As a result, existing monitoring system on totalRequestCount will still work but see a smaller value, and we strongly recommend to change to use the new metrics to monitor server load.
14830
14831
14832 ---
14833
14834 * [HBASE-18500](https://issues.apache.org/jira/browse/HBASE-18500) | *Major* | **Performance issue: Don't use BufferedMutator for HTable's put method**
14835
14836 Remove the deprecated method get/setWriteBufferSize from Table and remove writeBufferSize from TableBuilder. Remove the BufferedMutatorImpl from HTable.
14837
14838
14839 ---
14840
14841 * [HBASE-18387](https://issues.apache.org/jira/browse/HBASE-18387) | *Minor* | **[Thrift] Make principal configurable in DemoClient.java**
14842
14843 This change allows the demonstration Thrift client to customize the server principal used by the Thrift server for instances secured with Kerberos.
14844
14845
14846 ---
14847
14848 * [HBASE-17125](https://issues.apache.org/jira/browse/HBASE-17125) | *Critical* | **Inconsistent result when use filter to read data**
14849
14850 Marked Scan and Get's setMaxVersions() and setMaxVersions(int) as deprecated. They are easy to misunderstand with column family's max versions, so use readAllVersions() and readVersions(int) instead.
14851
14852
14853 ---
14854
14855 * [HBASE-18492](https://issues.apache.org/jira/browse/HBASE-18492) | *Major* | **[AMv2] Embed code for selecting highest versioned region server for system table regions in AssignmentManager.processAssignQueue()**
14856
14857 Favors new servers over older versions when assigning system table regions (more to follow in this area; i.e. changes in the AM itself).
14858
14859
14860 ---
14861
14862 * [HBASE-18517](https://issues.apache.org/jira/browse/HBASE-18517) | *Major* | **limit max log message width in log4j**
14863
14864 Sets a log length max of 1000 characters.
14865
14866
14867 ---
14868
14869 * [HBASE-18502](https://issues.apache.org/jira/browse/HBASE-18502) | *Critical* | **Change MasterObserver to use TableDescriptor and ColumnFamilyDescriptor**
14870
14871 The methods which change to use TableDescriptor/ColumnFamilyDescriptor are shown below.
14872 + preCreateTable( ObserverContext,TableDescriptor, HRegionInfo[])
14873 + postCreateTable(ObserverContext ,TableDescriptor, HRegionInfo[])
14874 + preCreateTableAction(ObserverContext, TableDescriptor,HRegionInfo[])
14875 + postCompletedCreateTableAction(ObserverContext,TableDescriptor,HRegionInfo[])
14876 + preModifyTable(ObserverContext,TableName, TableDescriptor)
14877 + postModifyTable(ObserverContext,TableName, TableDescriptor)
14878 + preModifyTableAction( ObserverContext,TableName,TableDescriptor)
14879 + postCompletedModifyTableAction( ObserverContext,TableName,TableDescriptor)
14880 + preAddColumnFamily(ObserverContext,TableName, ColumnFamilyDescriptor)
14881 + postAddColumnFamily(ObserverContext,TableName, ColumnFamilyDescriptor)
14882 + preAddColumnFamilyAction(ObserverContext,TableName,ColumnFamilyDescriptor)
14883 + postCompletedAddColumnFamilyAction(ObserverContext,TableName, ColumnFamilyDescriptor)
14884 + preModifyColumnFamily(ObserverContext,TableName, ColumnFamilyDescriptor)
14885 + preModifyColumnFamilyAction(ObserverContext\<MasterCoprocessorEnvironment,TableName,ColumnFamilyDescriptor)
14886 + postCompletedModifyColumnFamilyAction(ObserverContext\<MasterCoprocessorEnvironment\>,TableName,ColumnFamilyDescriptor)
14887 + preCloneSnapshot(ObserverContext\<MasterCoprocessorEnvironment\>,SnapshotDescription,TableDescriptor)
14888 + postCloneSnapshot(ObserverContext\<MasterCoprocessorEnvironment\>,SnapshotDescription,TableDescripto)
14889 + preRestoreSnapshot(ObserverContext\<MasterCoprocessorEnvironment,SnapshotDescription,TableDescriptor)
14890 + postRestoreSnapshot(ObserverContext\<MasterCoprocessorEnvironment,SnapshotDescription,TableDescriptor)
14891 + preGetTableDescriptors(ObserverContext\<MasterCoprocessorEnvironment\>,List\<TableName\>, List\<TableDescriptor\>,String)
14892 + postGetTableDescriptors(ObserverContext\<MasterCoprocessorEnvironment\>,List\<TableName\>, List\<TableDescriptor\>,String)
14893 + preGetTableNames(ObserverContext\<MasterCoprocessorEnvironment\>,List\<TableDescriptor\>, String)
14894 + postGetTableNames(ObserverContext\<MasterCoprocessorEnvironment\>,List\<TableDescriptor\>, String)
14895
14896
14897 ---
14898
14899 * [HBASE-18520](https://issues.apache.org/jira/browse/HBASE-18520) | *Minor* | **Add jmx value to determine true Master Start time**
14900
14901 This JIRA adds a JMX value to track when the Master has finished initializing.
14902 The jmx config is 'masterFinishedInitializationTime' and details the time in millis that the Master is fully usable and ready to serve requests.
14903
14904
14905 ---
14906
14907 * [HBASE-17056](https://issues.apache.org/jira/browse/HBASE-17056) | *Critical* | **Remove checked in PB generated files**
14908
14909 Purge all checked in generated protobuf files (30MB). Generate protobuf files inline with the build. Remove checked-in and patched protobuf. Get it from new hbase-thirdparty instead.
14910
14911 Side-effect: Our protobuf went from 3.1.0 to 3.3.1.
14912
14913 Build does not take noticeably longer (still about 2.5 minutes to do a mvn clean install -DskipTests).
14914
14915 IDEs will probably require a mvn build first else they'll complain about missing (generated) files.
14916
14917
14918 ---
14919
14920 * [HBASE-18374](https://issues.apache.org/jira/browse/HBASE-18374) | *Major* | **RegionServer Metrics improvements**
14921
14922 This change adds the latency metrics checkAndPut, checkAndDelete, putBatch and deleteBatch . Also the previous regionserver "mutate" latency metrics are renamed to "put" metrics. Batch metrics capture the latency of the entire batch containing put/delete whereas put/delete metrics capture latency per operation. Note this change will break existing monitoring based on regionserver "mutate" latency metric.
14923
14924
14925 ---
14926
14927 * [HBASE-18023](https://issues.apache.org/jira/browse/HBASE-18023) | *Minor* | **Log multi-\* requests for more than threshold number of rows**
14928
14929 HBASE-18023 introduces a warning message in the RegionServer log when an RPC is received from a client that has more than 5000 "actions" (where an "action" is a collection of mutations for a specific row) in a single RPC. Misbehaving clients who send large RPCs to RegionServers can be malicious, causing temporary pauses via garbage collection or denial of service via crashes. The threshold of 5000 actions per RPC is defined by the property "hbase.rpc.rows.warning.threshold" in hbase-site.xml.
14930
14931
14932 ---
14933
14934 * [HBASE-15968](https://issues.apache.org/jira/browse/HBASE-15968) | *Major* | **New behavior of versions considering mvcc and ts rather than ts only**
14935
14936 This issue resolved two long-term issues in HBase:
14937 Puts may be masked by a delete before them.
14938 Major compactions change query results.
14939
14940 This issue offer a new behavior to fix this issue with a little performance reduction. Set NEW\_VERSION\_BEHAVIOR to true to enable this feature in CF level. See HBASE-15968 for details.
14941 Note if you enable this feature, the order of Mutations matters. But replication will disorder the entries by default. So you have to enable serial replication if you have slave clusters. See HBASE-9465 for details.
14942
14943
14944 ---
14945
14946 * [HBASE-18107](https://issues.apache.org/jira/browse/HBASE-18107) | *Major* | **[AMv2] Remove DispatchMergingRegionsRequest & DispatchMergingRegions**
14947
14948 Removes merge region code added into branch-2 but that was not needed after all. Branch-2 replaced dispatchMergingRegions with MergeTableRegionsProcedure.
14949
14950 Removed:
14951
14952 # dispatchMergingRegions from Connection (was superceded long ago in branch-1).
14953 # mergeRegions from RsRpcServices (was not used).
14954
14955
14956 ---
14957
14958 * [HBASE-15816](https://issues.apache.org/jira/browse/HBASE-15816) | *Major* | **Provide client with ability to set priority on Operations**
14959
14960 Added setPriority(int priority) API to Put, Delete, Increment, Append, Get and Scan pojos.  So for all these ops, the user can provide a custom priority level.
14961
14962
14963 ---
14964
14965 * [HBASE-18430](https://issues.apache.org/jira/browse/HBASE-18430) | *Major* | **Typo in "contributing to documentation" page**
14966
14967 Pushed to {{master}}. Thanks, Coral! Congratulations on your first Apache HBase commit!
14968
14969
14970 ---
14971
14972 * [HBASE-17908](https://issues.apache.org/jira/browse/HBASE-17908) | *Critical* | **Upgrade guava**
14973
14974 Use relocated guava 22.0 gotten from the new hbase-thirdparty ancillary project.
14975
14976 Incompatible change. ReplicationEndpoint and subclasses extend guava Service which changed pretty radically between 12.0 and 22.0. Change is kosher because implementations are marked audience private. Still, this will likely cause grief for the likes of the downstream lily indexer.
14977
14978
14979 ---
14980
14981 * [HBASE-16993](https://issues.apache.org/jira/browse/HBASE-16993) | *Major* | **BucketCache throw java.io.IOException: Invalid HFile block magic when configuring hbase.bucketcache.bucket.sizes**
14982
14983 Any value for hbase.bucketcache.bucket.sizes  configuration to be multiple of 256.  If that is not the case, instantiation of L2 Bucket cache itself will fail throwing IllegalArgumentException.
14984
14985
14986 ---
14987
14988 * [HBASE-16090](https://issues.apache.org/jira/browse/HBASE-16090) | *Major* | **ResultScanner is not closed in SyncTable#finishRemainingHashRanges()**
14989
14990 pushed to 1.3 and 1.2. SyncTable was introduced in 1.2, so skipping 1.1.
14991
14992
14993 ---
14994
14995 * [HBASE-18332](https://issues.apache.org/jira/browse/HBASE-18332) | *Minor* | **Upgrade asciidoctor-maven-plugin**
14996
14997 Committed to master and branch-2. Thanks!
14998
14999
15000 ---
15001
15002 * [HBASE-18161](https://issues.apache.org/jira/browse/HBASE-18161) | *Minor* | **Incremental Load support for Multiple-Table HFileOutputFormat**
15003
15004 In order to use this feature, a user must
15005 1. Register their tables when configuring their job
15006  2. Create a composite key of the tablename and original rowkey to send as the mapper output key.
15007
15008   To register their tables (and configure their job for incremental load into multiple tables), a user must call the static MultiHFileOutputFormat.configureIncrementalLoad function to register the HBase tables that will be ingested into.
15009
15010 To create the composite key, a helper function MultiHFileOutputFormat2.createCompositeKey should be called with the destination tablename and rowkey as arguments, and the result should be output as the mapper key.
15011
15012  Before this JIRA, for HFileOutputFormat2 a configuration for the storage policy was set per Column Family. This was set manually by the user. In this JIRA, this is unchanged when using HFileOutputFormat2. However, when specifically using MultiHFileOutputFormat2, the user now has to manually set the prefix by creating a composite of the table name and the column family. The user can create the new composite value by calling MultiHFileOutputFormat2.createCompositeKey with the tablename and column family as arguments.
15013
15014 Changes added through this JIRA are backwards compatible with existing HFileOutputFormat2 apis and functionality.
15015
15016 The configuration parameter "hbase.mapreduce.hfileoutputformat.table.name" is now a REQUIRED parameter though it is normally set automatically when configureIncrementalLoad method is called within HFileOutputFormat2
15017
15018
15019 ---
15020
15021 * [HBASE-18229](https://issues.apache.org/jira/browse/HBASE-18229) | *Critical* | **create new Async Split API to embrace AM v2**
15022
15023 A new splitRegionAsync() API is added in client. The existing splitRegion()  and split() API will call the new API so client does not have to change its code.
15024
15025 Move HBaseAdmin.splitXXX() logic to master, client splitXXX() API now go to master directly instead of going to RegionServer first.
15026
15027 Also added splitSync() API
15028
15029
15030 ---
15031
15032 * [HBASE-18339](https://issues.apache.org/jira/browse/HBASE-18339) | *Major* | **Update test-patch to use hadoop 3.0.0-alpha4**
15033
15034 HBase now defaults to Apache Hadoop 3.0.0-alpha4 when the Hadoop 3 profile is active.
15035
15036
15037 ---
15038
15039 * [HBASE-18267](https://issues.apache.org/jira/browse/HBASE-18267) | *Major* | **The result from the postAppend is ignored**
15040
15041 **WARNING: No release note provided for this change.**
15042
15043
15044 ---
15045
15046 * [HBASE-18307](https://issues.apache.org/jira/browse/HBASE-18307) | *Major* | **Share the same EventLoopGroup for NettyRpcServer, NettyRpcClient and AsyncFSWALProvider at RS side**
15047
15048 There are two configuration name changes as the event loop configs will not only effect rpc server but be shared by different components in the same RS instance.
15049
15050 'hbase.rpc.server.nativetransport' -\> 'hbase.netty.nativetransport'
15051
15052 'hbase.netty.rpc.server.worker.count' -\> 'hbase.netty.worker.count'
15053
15054
15055 ---
15056
15057 * [HBASE-18241](https://issues.apache.org/jira/browse/HBASE-18241) | *Critical* | **Change client.Table, client.Admin, Region, Store, and HBaseTestingUtility to not use HTableDescriptor or HColumnDescriptor**
15058
15059 - : removed API
15060 + : new API
15061 \* : deprecated API
15062 ---------------------------
15063 Region class
15064 - HTableDescriptor getTableDesc()
15065 +TableDescriptor getTableDescriptor()
15066
15067 Store class
15068 - HColumnDescriptor getFamily()
15069 + ColumnFamilyDescriptor getColumnFamilyDescriptor()
15070
15071 Table class
15072 \* HTableDescriptor getTableDescriptor()
15073 + TableDescriptor getDescriptor()\|
15074
15075 \*Admin class\*
15076 \* HTableDescriptor getTableDescriptor(TableName)
15077 + List\<TableDescriptor\> listTableDescriptor(TableName)\|
15078 \* HTableDescriptor[] getTableDescriptors(List\<String\>)
15079 \* HTableDescriptor[] getTableDescriptorsByTableName(List\<TableName\>)
15080 + List\<TableDescriptor\> listTableDescriptors(List\<TableName\>)
15081 \* HTableDescriptor[] listTables()
15082 + List\<TableDescriptor\> listTableDescriptors()
15083 \* HTableDescriptor[] listTables(Pattern)
15084 + List\<TableDescriptor\> listTableDescriptors(Pattern)
15085 \* HTableDescriptor[] listTables(String)
15086 + List\<TableDescriptor\> listTableDescriptors(String)
15087 \* HTableDescriptor[] listTables(Pattern, boolean)
15088 + List\<TableDescriptor\> listTableDescriptors(Pattern, boolean)
15089 \* HTableDescriptor[] listTables(String, boolean)
15090 + List\<TableDescriptor\> listTableDescriptors(String, boolean)
15091 \* HTableDescriptor[] deleteTables(String)
15092 \* HTableDescriptor[] deleteTables(Pattern)
15093 \* HTableDescriptor[] enableTables(String)
15094 \* HTableDescriptor[] enableTables(Pattern)
15095 \* HTableDescriptor[] disableTables(String)
15096 \* HTableDescriptor[] disableTables(Pattern)
15097 \* void modifyTable(TableName, HTableDescriptor)
15098 + void modifyTable(TableDescriptor)
15099 \* void modifyTableAsync(TableName, HTableDescriptor)
15100 + void modifyTableAsync(TableDescriptor)
15101 \* HTableDescriptor[] listTableDescriptorsByNamespace(String)
15102 + List\<TableDescriptor\> listTableDescriptorsByNamespace(byte[])
15103 \* void createTable(HTableDescriptor)
15104 + void createTable(TableDescriptor)
15105 \* void createTable(HTableDescriptor, byte[], byte[], int)
15106 + void createTable({color:red}TableDescriptor, byte[], byte[], int)
15107 \* void createTable(HTableDescriptor, byte[][])
15108 + void createTable(TableDescriptor, byte[][])
15109 \* Future\<Void\> createTableAsync(HTableDescriptor, byte[][])
15110 + Future\<Void\> createTableAsync(TableDescriptor, byte[][])
15111
15112 \*HBaseTestingUtility class\*
15113 \* Table createTable(HTableDescriptor, byte[][], Configuration)
15114 + Table createTable(TableDescriptor, byte[][], Configuration)
15115 \* Table createTable(HTableDescriptor, byte[][], byte[][], Configuration)
15116 + Table createTable(TableDescriptor, byte[][], byte[][], Configuration)
15117 \* public Table createTable(HTableDescriptor, byte[][])
15118 + public Table createTable(TableDescriptor, byte[][])
15119 \* void modifyTableSync(Admin, HTableDescriptor)
15120 + void modifyTableSync(Admin, TableDescriptor)
15121 \* HRegion createLocalHRegion(HTableDescriptor, byte [], byte [])
15122 + HRegion createLocalHRegion(TableDescriptor, byte [], byte [])
15123 \* HRegion createLocalHRegion(HRegionInf, HTableDescriptor)
15124 + HRegion createLocalHRegion(HRegionInf, TableDescriptor)
15125 \* HRegion createLocalHRegion(HRegionInfo, HTableDescriptor, WAL)
15126 + HRegion createLocalHRegion(HRegionInfo, TableDescriptor, WAL)
15127 \* List createMultiRegionsInMeta(final Configuration, HTableDescriptor, byte [][])
15128 + List createMultiRegionsInMeta(final Configuration, TableDescriptor, byte [][])
15129 \* HRegion createRegionAndWAL(HRegionInfo, Path, Configuration, HTableDescriptor)
15130 + HRegion createRegionAndWAL(HRegionInfo, Path, Configuration, TableDescriptor)
15131 \* HRegion createRegionAndWAL(HRegionInfo, Pat, Configuration, HTableDescriptor, boolean)
15132 + HRegion createRegionAndWAL(HRegionInfo, Pat, Configuration, TableDescriptor, boolean)
15133 \* int createPreSplitLoadTestTable(Configuration,HTableDescriptor, HColumnDescriptor)
15134 + int createPreSplitLoadTestTable(Configuration,TableDescriptor, ColumnFamilyDescriptor)
15135 \* int createPreSplitLoadTestTable(Configuration, HTableDescriptor, HColumnDescriptor, int)
15136 + int createPreSplitLoadTestTable(Configuration, TableDescriptor, ColumnFamilyDescriptor, int)
15137 \* int createPreSplitLoadTestTable(Configuration, HTableDescriptor, HColumnDescriptor[], int)
15138 + int createPreSplitLoadTestTable(Configuration, TableDescriptor, ColumnFamilyDescriptor[], int)
15139 \* int createPreSplitLoadTestTable(Configuration,HTableDescriptor, HColumnDescriptor[],SplitAlgorithm, int)
15140 + int createPreSplitLoadTestTable(Configuration,TableDescriptor, ColumnFamilyDescriptor[],SplitAlgorithm, int)
15141 \* HRegion createTestRegion(String, HColumnDescriptor)
15142 + HRegion createTestRegion(String, ColumnFamilyDescriptor)
15143
15144
15145 ---
15146
15147 * [HBASE-18083](https://issues.apache.org/jira/browse/HBASE-18083) | *Major* | **Make large/small file clean thread number configurable in HFileCleaner**
15148
15149 After HBASE-18083 we could configure HFileCleaner to use multiple threads for large/small (archived) hfile cleaning with hbase.regionserver.hfilecleaner.large.thread.count and hbase.regionserver.hfilecleaner.small.thread.count, both default to 1. These properties support online configuration change.
15150
15151
15152 ---
15153
15154 * [HBASE-17931](https://issues.apache.org/jira/browse/HBASE-17931) | *Blocker* | **Assign system tables to servers with highest version**
15155
15156 We usually keep compatibility between old client and new server so we can do rolling upgrade, HBase cluster first, then HBase client. But we don't guarantee new client can access old server.
15157 In an HBase cluster, we have system tables and region servers will access these tables so for servers they are also an HBase client. So if the system tables are in region servers with lower version we may get trouble because region servers with higher version may can not access them.
15158 After this patch, we will move all system regions to region servers with highest version. So when we do a rolling upgrade across two major or minor versions, we should ALWAYS UPGRADE MASTER FIRST and then upgrade region servers. The new master will handle system tables correctly.
15159
15160
15161 ---
15162
15163 * [HBASE-6581](https://issues.apache.org/jira/browse/HBASE-6581) | *Major* | **Build with hadoop.profile=3.0**
15164
15165 Make us build against hadoop trunk (3.0)
15166
15167
15168 ---
15169
15170 * [HBASE-16120](https://issues.apache.org/jira/browse/HBASE-16120) | *Minor* | **Add shell test for truncate\_preserve**
15171
15172 Add unit tests for truncate\_preserve
15173
15174
15175 ---
15176
15177 * [HBASE-18240](https://issues.apache.org/jira/browse/HBASE-18240) | *Major* | **Add hbase-thirdparty, a project with hbase utility including an hbase-shaded-thirdparty module with guava, netty, etc.**
15178
15179 Adds a new project, hbase-thirdparty, at https://git-wip-us.apache.org/repos/asf/hbase-thirdparty used by core hbase. GroupID org.apache.hbase.thirdparty. Version 1.0.0.
15180
15181 This project packages relocated third-party libraries used by Apache HBase such as protobuf, guava, and netty among others. HBase core depends on it.
15182
15183 It has threre submodules, one to patch and then relocate (shade) protobuf, and one to do messy .so renaming (netty). The remainder module relocates a bundle of other (unpatched) libs used by hbase. This latter set includes protobuf-util, netty-all, gson, and guava.
15184
15185 All shading is done using the same relocation offset of org.apache.hadoop.hbase.shaded; we add this prefix to the relocated thirdparty library class names.
15186
15187 See the pom.xml in hbase-thirdparty for the explicit version of each third-party lib included (of note, we update out internal protobuf from 3.1.0 to 3.3.1).
15188
15189
15190 ---
15191
15192 * [HBASE-15943](https://issues.apache.org/jira/browse/HBASE-15943) | *Major* | **Add page displaying JVM process metrics**
15193
15194 Adds new "Process Metrics' tab along the top which leads to new page that dumps mbean -- mostly jvm -- metrics
15195
15196
15197 ---
15198
15199 * [HBASE-14902](https://issues.apache.org/jira/browse/HBASE-14902) | *Major* | **Revert some of the stringency recently introduced by checkstyle tightening**
15200
15201 Changes the checkstyle so that on a continuation line for javadoc, instead of default four spaces, instead now it is two spaces. Also one line statements as in if (true) x =1; now pass checkstyle.
15202
15203
15204 ---
15205
15206 * [HBASE-17110](https://issues.apache.org/jira/browse/HBASE-17110) | *Major* | **Improve SimpleLoadBalancer to always take server-level balance into account**
15207
15208 After HBASE-17110 the bytable strategy for SimpleLoadBalancer will also take server level balance into account
15209
15210
15211 ---
15212
15213 * [HBASE-17928](https://issues.apache.org/jira/browse/HBASE-17928) | *Major* | **Shell tool to clear compaction queues**
15214
15215 Adds clear\_compaction\_queues to the hbase shell.
15216 {code}
15217   Clear compaction queues on a regionserver.
15218   The queue\_name contains short and long.
15219   short is shortCompactions's queue,long is longCompactions's queue.
15220
15221   Examples:
15222   hbase\> clear\_compaction\_queues 'host187.example.com,60020'
15223   hbase\> clear\_compaction\_queues 'host187.example.com,60020','long'
15224   hbase\> clear\_compaction\_queues 'host187.example.com,60020', ['long','short']
15225 {code}
15226
15227
15228 ---
15229
15230 * [HBASE-18164](https://issues.apache.org/jira/browse/HBASE-18164) | *Critical* | **Much faster locality cost function and candidate generator**
15231
15232 New locality cost function and candidate generator that use caching and incremental computation to allow the stochastic load balancer to consider ~20x more cluster configurations for big clusters.
15233
15234
15235 ---
15236
15237 * [HBASE-18226](https://issues.apache.org/jira/browse/HBASE-18226) | *Major* | **Disable reverse DNS lookup at HMaster and use the hostname provided by RegionServer**
15238
15239 The following config is added by this JIRA:
15240
15241 hbase.regionserver.hostname.disable.master.reversedns
15242
15243 This config is for experts: don't set its value unless you really know what you are doing.
15244 When set to true, regionserver will use the current node hostname for the servername and HMaster will skip reverse DNS lookup and use the hostname sent by regionserver instead. Note that this config and hbase.regionserver.hostname are mutually exclusive. See https://issues.apache.org/jira/browse/HBASE-18226 for more details.
15245
15246 Caution: please make sure rolling upgrade succeeds before turning on this feature.
15247
15248
15249 ---
15250
15251 * [HBASE-16242](https://issues.apache.org/jira/browse/HBASE-16242) | *Major* | **Upgrade Avro to 1.7.7**
15252
15253 Apache HBase now specifies that version 1.7.7 of the Apache Avro library should be pulled in by maven and included in the convenience binary tarball.
15254
15255
15256 ---
15257
15258 * [HBASE-18213](https://issues.apache.org/jira/browse/HBASE-18213) | *Major* | **Add documentation about the new async client**
15259
15260 Add documentation for async client in section '66. Client' in ref guide.
15261
15262
15263 ---
15264
15265 * [HBASE-17008](https://issues.apache.org/jira/browse/HBASE-17008) | *Critical* | **Examples to make AsyncClient go down easy**
15266
15267 Add two examples for async client. AsyncClientExample is a simple example to show you how to use AsyncTable. HttpProxyExample is an example for advance user to show you how to use RawAsyncTable to write a fully asynchronous HTTP proxy server. There is no extra thread pool, all operations are executed inside netty's event loop.
15268
15269
15270 ---
15271
15272 * [HBASE-18200](https://issues.apache.org/jira/browse/HBASE-18200) | *Major* | **Set hadoop check versions for branch-2 and branch-2.x in pre commit**
15273
15274 Allow setting different hadoop check versions for branch-2 and branch-2.x when running pre commit check.
15275
15276
15277 ---
15278
15279 * [HBASE-18187](https://issues.apache.org/jira/browse/HBASE-18187) | *Major* | **Release hbase-2.0.0-alpha1**
15280
15281 Pushed the release. For detail: http://apache-hbase.679495.n3.nabble.com/ANNOUNCE-Apache-HBase-2-0-0-alpha-1-is-now-available-for-download-td4088484.html
15282
15283
15284 ---
15285
15286 * [HBASE-18137](https://issues.apache.org/jira/browse/HBASE-18137) | *Critical* | **Replication gets stuck for empty WALs**
15287
15288 0-length WAL files can potentially cause the replication queue to get stuck.  A new config "replication.source.eof.autorecovery" has been added: if set to true (default is false), the 0-length WAL file will be skipped after 1) the max number of retries has been hit, and 2) there are more WAL files in the queue.  The risk of enabling this is that there is a chance the 0-length WAL file actually has some data (e.g. block went missing and will come back once a datanode is recovered).
15289
15290
15291 ---
15292
15293 * [HBASE-18192](https://issues.apache.org/jira/browse/HBASE-18192) | *Blocker* | **Replication drops recovered queues on region server shutdown**
15294
15295 If a region server that is processing recovered queue for another previously dead region server is gracefully shut down, it can drop the recovered queue under certain conditions. Running without this fix on a 1.2+ release means possibility of continuing data loss in replication, irrespective of which WALProvider is used.
15296 If a single WAL group (or DefaultWALProvider) is used, running without this fix will always cause dataloss in replication whenever a region server processing recovered queues is gracefully shutdown.
15297
15298
15299 ---
15300
15301 * [HBASE-18109](https://issues.apache.org/jira/browse/HBASE-18109) | *Critical* | **Assign system tables first (priority)**
15302
15303 Adds a sort of procedures before submission so system tables are queued first (which will help ensure they go out first). This should be good enough along w/ existing scheduling mechanisms to ensure system/meta are assigned first (See reasoning below). Open new issue if insufficient.
15304
15305
15306 ---
15307
15308 * [HBASE-18008](https://issues.apache.org/jira/browse/HBASE-18008) | *Major* | **Any HColumnDescriptor we give out should be immutable**
15309
15310 1) The HColumnDescriptor got from Admin, AsyncAdmin, and Table is immutable.
15311 2) HColumnDescriptor have been marked as "Deprecated" and user should substituted
15312      ColumnFamilyDescriptor for HColumnDescriptor.
15313 3) ColumnFamilyDescriptor is constructed through ColumnFamilyDescriptorBuilder and it contains all of the read-only methods from HColumnDescriptor
15314 4) The value to which the IS\_MOB/MOB\_THRESHOLD is mapped is stored as String rather than Boolean/Long. The MOB is an new feature to 2.0 so this change should be acceptable
15315
15316
15317 ---
15318
15319 * [HBASE-18149](https://issues.apache.org/jira/browse/HBASE-18149) | *Major* | **The setting rules for table-scope attributes and family-scope attributes should keep consistent**
15320
15321 If the table-scope attributes value is false, you need not to enclose 'false' in single quotation.Both COMPACTION\_ENABLED =\> false and COMPACTION\_ENABLED =\> 'false' will take effect
15322
15323
15324 ---
15325
15326 * [HBASE-17849](https://issues.apache.org/jira/browse/HBASE-17849) | *Major* | **PE tool random read is not totally random**
15327
15328 When randomRead and randomSeekScan is used with PE tool, now we allow using both --size and --rows. The --size specifies the total size of the data (the range) on which the reads should be performed and --rows specifies the number of rows to be read by each client with in that range.
15329
15330
15331 ---
15332
15333 * [HBASE-15576](https://issues.apache.org/jira/browse/HBASE-15576) | *Major* | **Scanning cursor to prevent blocking long time on ResultScanner.next()**
15334
15335 If you don't like scanning being blocked too long because of heartbeat and partial result, you can use Scan#setNeedCursorResult(true) to get a special result within scanning timeout setting time which will tell you where row the server is scanning. See its javadoc for more details.
15336
15337
15338 ---
15339
15340 * [HBASE-16549](https://issues.apache.org/jira/browse/HBASE-16549) | *Major* | **Procedure v2 - Add new AM metrics**
15341
15342 Following AMv2 procedures are modified to override onSubmit(), onFinish() hooks provided by HBASE-17888 to do
15343 metrics calculations when procedures are submitted and finshed:
15344 \* AssignProcedure
15345 \* UnassignProcedure
15346 \* MergeTableRegionProcedure
15347 \* SplitTableRegionProcedure
15348 \* ServerCrashProcedure
15349
15350 Following metrics is collected for each of the above procedure during lifetime of a process:
15351 \* Total number of requests submitted for a type of procedure
15352 \* Histogram of runtime in milliseconds for successfully completed procedures
15353 \* Total number of failed procedures
15354
15355 As we are moving away from Hadoop's metric2, hbase-metrics-api module is used for newly added metrics.
15356
15357
15358 ---
15359
15360 * [HBASE-9393](https://issues.apache.org/jira/browse/HBASE-9393) | *Critical* | **Hbase does not closing a closed socket resulting in many CLOSE\_WAIT**
15361
15362 To handle this issue client need to have Hadoop client 2.6.4 or 2.7.0+ Hadoop version as CanUnBuffer interface which was added as part of HDFS-7694 is available in only those versions.
15363
15364
15365 ---
15366
15367 * [HBASE-18038](https://issues.apache.org/jira/browse/HBASE-18038) | *Critical* | **Rename StoreFile to HStoreFile and add a StoreFile interface for CP**
15368
15369 StoreFile is now changed to an interface. This is an incompatible change. The coprocessors which implement RegionObserver may need to modify their code.
15370
15371
15372 ---
15373
15374 * [HBASE-16196](https://issues.apache.org/jira/browse/HBASE-16196) | *Critical* | **Update jruby to a newer version.**
15375
15376 The bundled JRuby 1.6.8 has been updated to version 9.1.9.0. The represents a change from Ruby 1.8 to Ruby 2.3.3, which introduces non-compatible language changes for user scripts.
15377
15378 This JRuby version update required an update to joni-2.1.11 and jcodings-1.0.18, used for regular expression matching, as well as several transitive dependency updates that should not be user-visible.
15379
15380
15381 ---
15382
15383 * [HBASE-14614](https://issues.apache.org/jira/browse/HBASE-14614) | *Major* | **Procedure v2: Core Assignment Manager**
15384
15385 Replaces the AssignmentManager with a new procedurev2-based AssignmentManager
15386
15387 h1. AMv2
15388 Puts AssignmentManager up on top of the ProcedureV2 state machine with persistence engine. Each assignment atom is now a Procedure implementation; e.g. an AssignProcedure and an UnassignProcedure. Molecules of aggregated Procedures are used to do more involved assignment steps: e.g. the move region procedure is made of an Unassign followed by an Assign subprocedure.
15389
15390 AMv2 is 1500 lines. Old AM was near 4000. Functionality has been moved out to Procedures. In-memory states of regions and servers has been cleaned up stored in new RegionStates implementation. RegionStateStore takes care of publishing final region state out to the hbase:meta table.
15391
15392 New RemoteProcedureDispatcher/RSProcedureDispatcher runs the Procedure-based assignments ‘remotely’. Knows about ‘servers’. Does aggregation of assignments by time on a time/count basis so can send procedures in batches rather than one per RPC. Procedure status comes back on the back of the RegionServer heartbeat reporting online regions. The response is passed to the AMv2 to ‘process’. It will check against the in-memory state. If there is a mismatch, it fences out the RegionServer on the assumption that something went wrong on the RS side.Timeouts trigger retries. The Procedure machine ensures only one operation at a time on any one region/table using locking and smarts about what is serial and what can be run concurrently.
15393
15394 New accounting of RegionServer version will be used running rolling restarts.
15395
15396 ‘States’ -- OPENING, CLOSING, etc. -- are now in-memory in-the-master only serialized out to the ProcedureV2 WAL. They are no longer persisted to ZooKeeper.
15397
15398 h2. Assign Detail
15399 The Assign starts by pushing the "assign" operation to the AssignmentManager and then will go into a “waiting" state. The AM will batch the "assign" requests and ask the Balancer where to put the region (the various policies will be respected: retain, round-robin, random). Once the AM and the balancer have found a place for the region, the procedure will be resumed and an "open region" request will be placed in the Remote Dispatcher queue, and the procedure once again will go into a "waiting state".  The Remote Dispatcher will batch the various requests for that server and they will be sent to the RS for execution. The RS will complete the open operation by calling master.reportRegionStateTransition(). The AM will intercept the transition report, and notify the procedure. The procedure will finish the assignment by publishing to new state on hbase:meta or it will retry the assignment.
15400
15401 h3. Unassign Detail
15402  The Unassign starts by placing a "close region" request in the Remote Dispatcher queue, and the procedure will then go into a "waiting state". The Remote Dispatcher will batch the various requests for that server and they will be sent to the RS for execution. The RS will complete the open operation by calling master.reportRegionStateTransition(). The AM will intercept the transition report, and notify the procedure. The procedure will finish the unassign by publishing its new state on meta or it will retry the unassign.
15403
15404 h1. New Configs
15405  \* "hbase.procedure.remote.dispatcher.threadpool.size" defaults 128
15406  \* "hbase.procedure.remote.dispatcher.delay.msec" default 150ms
15407  \* "hbase.procedure.remote.dispatcher.max.queue.size" with default 32
15408  \* "hbase.regionserver.rpc.startup.waittime" with default 60 seconds.
15409 h1. TODO
15410 As of this writing.
15411
15412 Put up a model diagram.
15413
15414  \* Handle region migration
15415  \* Handle meta assignment first
15416  \* Handle sys table assignment first (e.g. acl, namespace)
15417  \* Handle table priorities
15418  \* Do we report same AM metrics as we used too? We do it all in here now.
15419
15420 INCOMPATIBLE
15421 A known incompatible is that because splits and merges are now run from the master, Coprocessors that used to watch for merge/split from a RegionObserver now no longer work; to watch split/merges, you need to have an observer on the Master instead.
15422
15423
15424 ---
15425
15426 * [HBASE-3462](https://issues.apache.org/jira/browse/HBASE-3462) | *Major* | **Fix table.jsp in regards to splitting a region/table with an optional splitkey**
15427
15428 UI pages for splitting/merging now operate by taking a row key prefix from the user rather than a full region name.
15429
15430
15431 ---
15432
15433 * [HBASE-18129](https://issues.apache.org/jira/browse/HBASE-18129) | *Major* | **truncate\_preserve fails when the truncate method doesn't exists on the master**
15434
15435 The command truncate\_preserve will be fine when the truncate method doesn't exist on the master
15436
15437
15438 ---
15439
15440 * [HBASE-18122](https://issues.apache.org/jira/browse/HBASE-18122) | *Major* | **Scanner id should include ServerName of region server**
15441
15442 The scanner id is not from 1 anymore.
15443 The first 32 bits are MurmurHash32 of ServerName string "host,port,ts". The ServerName contains both host, port, and start timestamp so it can prevent collision. The lowest 32bit is generated by atomic int.
15444
15445
15446 ---
15447
15448 * [HBASE-17997](https://issues.apache.org/jira/browse/HBASE-17997) | *Major* | **In dev environment, add jruby-complete jar to classpath only when jruby is needed**
15449
15450 When JRUBY\_HOME is specified, if the command is "hbase shell" or "hbase org.jruby.Main", CLASSPATH and HBASE\_OPTS will be updated according to JRUBY\_HOME specified
15451 \* Jar under JRUBY\_HOME is added to CLASSPATH
15452 \* The following will be added into HBASE\_OPTS
15453
15454 -Djruby.home=$JRUBY\_HOME -Djruby.lib=$JRUBY\_HOME/lib
15455
15456
15457 That is, as long as JRUBY\_HOME is specified, JRUBY\_HOME specified will take precedence.
15458 \* In dev env, the jar recorded in cached\_classpath\_jruby.txt will be ignored
15459 \* In non dev env, jruby-complete jar packaged with HBase will be ignored
15460
15461
15462 ---
15463
15464 * [HBASE-15616](https://issues.apache.org/jira/browse/HBASE-15616) | *Major* | **Allow null qualifier for all table operations**
15465
15466 After this issue, all table operations will support null qualifier, such as put/get/scan/increment/append/checkAndMutate/checkAndPut/checkAndDelete.
15467
15468
15469 ---
15470
15471 * [HBASE-18035](https://issues.apache.org/jira/browse/HBASE-18035) | *Critical* | **Meta replica does not give any primaryOperationTimeout to primary meta region**
15472
15473 When a client is configured to use meta replica, it sends scan request to all meta replicas almost at the same time. Since meta replica contains stale data, if result from one of replica comes back first, the client may get wrong region locations. To fix this, "hbase.client.meta.replica.scan.timeout" is introduced, a client will always send to primary meta region first, wait the configured timeout for reply. If no result is received, it will send request to replica meta regions. The unit for "hbase.client.meta.replica.scan.timeout"  is microsecond, the default value is 1000000 (1 second).
15474
15475
15476 ---
15477
15478 * [HBASE-11013](https://issues.apache.org/jira/browse/HBASE-11013) | *Major* | **Clone Snapshots on Secure Cluster Should provide option to apply Retained User Permissions**
15479
15480 While creating a snapshot, it will save permissions of the original table into .snapshotinfo file(Backward compatibility) , which is in the snapshot root directory.  For clone\_snapshot/restore\_snapshot command, we provide an additional option( RESTORE\_ACL) to decide whether we will grant permissons of the origin table to the newly created table.
15481
15482
15483 ---
15484
15485 * [HBASE-18018](https://issues.apache.org/jira/browse/HBASE-18018) | *Major* | **Support abort for all procedures by default**
15486
15487 The default behavior for abort() method of StateMachineProcedure class is changed to support aborting all procedures irrespective of if procedure supports rollback or not.
15488
15489
15490 ---
15491
15492 * [HBASE-16851](https://issues.apache.org/jira/browse/HBASE-16851) | *Major* | **User-facing documentation for the In-Memory Compaction feature**
15493
15494 Two blog posts on Apache HBase blog: user manual and programmer manual.
15495 Ref. guide draft published: https://docs.google.com/document/d/1Xi1jh\_30NKnjE3wSR-XF5JQixtyT6H\_CdFTaVi78LKw/edit
15496
15497
15498 ---
15499
15500 * [HBASE-17343](https://issues.apache.org/jira/browse/HBASE-17343) | *Blocker* | **Make Compacting Memstore default in 2.0 with BASIC as the default type**
15501
15502  This JIRA changes the default MemStore to be CompactingMemStore instead of DefaultMemStore. In-memory compaction of CompactingMemStore demonstrated sizable improvement in HBase’s write amplification and read/write performance.
15503
15504 CompactingMemStore achieves these gains through smart use of RAM. The algorithm periodically re-organizes the in-memory data in efficient data structures and reduces redundancies. The  HBase server’s memory footprint therefore periodically expands and contracts. The outcome is longer lifetime of data in memory, less I/O, and overall faster performance. More details about the algorithm and its use appear in the Apache HBase Blog: https://blogs.apache.org/hbase/
15505
15506 How To Use:
15507 The in-memory compaction level can be configured both globally and per column family. The supported levels are none (DefaultMemStore), basic, and eager.
15508
15509 By default, all tables apply basic in-memory compaction. This global configuration can be overridden in hbase-site.xml, as follows:
15510
15511 \<property\>
15512  \<name\>hbase.hregion.compacting.memstore.type\</name\>
15513  \<value\>\<none\|basic\|eager\>\</value\>
15514  \</property\>
15515
15516 The level can also be configured in the HBase shell per column family, as follows:
15517
15518 create ‘\<tablename\>’,
15519 {NAME =\> ‘\<cfname\>’, IN\_MEMORY\_COMPACTION =\> ‘\<NONE\|BASIC\|EAGER\>’}
15520
15521
15522 ---
15523
15524 * [HBASE-17786](https://issues.apache.org/jira/browse/HBASE-17786) | *Major* | **Create LoadBalancer perf-tests (test balancer algorithm decoupled from workload)**
15525
15526 $ bin/hbase org.apache.hadoop.hbase.master.balancer.LoadBalancerPerformanceEvaluation -help
15527 usage: hbase org.apache.hadoop.hbase.master.balancer.LoadBalancerPerformanceEvaluation \<options\>
15528 Options:
15529  -regions \<arg\>         Number of regions to consider by load balancer. Default: 1000000
15530  -servers \<arg\>         Number of servers to consider by load balancer. Default: 1000
15531  -load\_balancer \<arg\>   Type of Load Balancer to use. Default:
15532                         org.apache.hadoop.hbase.master.balancer.StochasticLoadBalancer
15533
15534
15535 ---
15536
15537 * [HBASE-17887](https://issues.apache.org/jira/browse/HBASE-17887) | *Blocker* | **Row-level consistency is broken for read**
15538
15539 Now we pass on list of memstoreScanners to the StoreScanner along with the new files to ensure that the StoreScanner sees the latest memstore after flush.
15540
15541
15542 ---
15543
15544 * [HBASE-15296](https://issues.apache.org/jira/browse/HBASE-15296) | *Major* | **Break out writer and reader from StoreFile**
15545
15546 \<!-- mardown --\>
15547 Refactor that breaks out StoreFile Reader and Writer inner classes as StoreFileReader and StoreFileWriter.
15548
15549 NOTE! Changes RegionObserver Coprocessor Interface so incompatible change (Discussed on dev list in thread "[Note breaking change on RegionObserver in hbase-2.0.0](https://s.apache.org/hbase-dev-note-about-HBASE-15296)"
15550
15551
15552 ---
15553
15554 * [HBASE-15199](https://issues.apache.org/jira/browse/HBASE-15199) | *Critical* | **Move jruby jar so only on hbase-shell module classpath; currently globally available**
15555
15556 The JRuby jar is no longer automatically included in classpaths for HBase server processes nor clients. It is still included in the classpath for the HBase shell and for invocations of org.jruby.Main, which should cover HBase provided support scripts.
15557
15558
15559 ---
15560
15561 * [HBASE-18009](https://issues.apache.org/jira/browse/HBASE-18009) | *Major* | **Move RpcServer.Call to a separated file**
15562
15563 The return value of CallRunner.getCall is changed so this is an incompatible change as CallRunner is declared as IA.LimitedPrivate. CallRunner is declared as IS.Evolving so we do not break the rule. And we still keep the getCall method to reduce the impact to user code.
15564
15565
15566 ---
15567
15568 * [HBASE-14925](https://issues.apache.org/jira/browse/HBASE-14925) | *Major* | **Develop HBase shell command/tool to list table's region info through command line**
15569
15570 Added a shell command 'list\_regions' for displaying the table's region info through command line.
15571
15572         List all regions for a particular table as an array and also filter them by server name (optional) as prefix
15573         and maximum locality (optional). By default, it will return all the regions for the table with any locality.
15574         The command displays server name, region name, start key, end key, size of the region in MB, number of requests
15575         and the locality. The information can be projected out via an array as third parameter. By default all these information
15576         is displayed. Possible array values are SERVER\_NAME, REGION\_NAME, START\_KEY, END\_KEY, SIZE, REQ and LOCALITY. Values
15577         are not case sensitive. If you don't want to filter by server name, pass an empty hash / string as shown below.
15578
15579         Examples:
15580         hbase\> list\_regions 'table\_name'
15581         hbase\> list\_regions 'table\_name', 'server\_name'
15582         hbase\> list\_regions 'table\_name', {SERVER\_NAME =\> 'server\_name', LOCALITY\_THRESHOLD =\> 0.8}
15583         hbase\> list\_regions 'table\_name', {SERVER\_NAME =\> 'server\_name', LOCALITY\_THRESHOLD =\> 0.8}, ['SERVER\_NAME']
15584         hbase\> list\_regions 'table\_name', {}, ['SERVER\_NAME', 'start\_key']
15585         hbase\> list\_regions 'table\_name', '', ['SERVER\_NAME', 'start\_key']
15586
15587
15588 ---
15589
15590 * [HBASE-17471](https://issues.apache.org/jira/browse/HBASE-17471) | *Critical* | **Region Seqid will be out of order in WAL if using mvccPreAssign**
15591
15592 MVCCPreAssign is added by HBASE-16698, but pre-assign mvcc is only used in put/delete path. Other write paths like increment/append still assign mvcc in ringbuffer's consumer thread. If put and increment are used parallel. Then seqid in WAL may not increase monotonically. Disorder in wals will lead to data loss.This patch bring all mvcc/seqid event in wal.append, and synchronize wal append and mvcc acquirement. No disorder in wal will happen. Performance test shows no regression with this patch.
15593
15594
15595 ---
15596
15597 * [HBASE-16466](https://issues.apache.org/jira/browse/HBASE-16466) | *Major* | **HBase snapshots support in VerifyReplication tool to reduce load on live HBase cluster with large tables**
15598
15599 Support for snapshots in VerifyReplication tool i.e. verifyrep can compare source table snapshot against peer table snapshot which reduces load on RS by reading data from HDFS directly using Snapshot scanners.
15600 Instead of comparing against live tables whose state changes due to writes and compactions its better to compare HBase  snapshots which are immutable in nature.
15601
15602
15603 ---
15604
15605 * [HBASE-17263](https://issues.apache.org/jira/browse/HBASE-17263) | *Major* | **  Netty based rpc server impl**
15606
15607 A new RPC server based on Netty4 which can improve random read (get) performance. By default, it is off. To use this feature, please set “hbase.rpc.server.impl" to “org.apache.hadoop.hbase.ipc.NettyRpcServer”.
15608
15609 In one deploy, doubled the throughput and lowered the latency significantly: see https://www.slideshare.net/HBaseCon/lift-the-ceiling-of-hbase-throughputs?qid=597ee2fa-8125-4faa-bb3b-2bf1ba9ccafb&v=&b=&from\_search=6
15610
15611
15612 ---
15613
15614 * [HBASE-17957](https://issues.apache.org/jira/browse/HBASE-17957) | *Minor* | ** Custom metrics of replicate endpoints don't prepend "source." to global metrics**
15615
15616 Global custom metrics names follow the "source.metricsName" format.
15617
15618
15619 ---
15620
15621 * [HBASE-17757](https://issues.apache.org/jira/browse/HBASE-17757) | *Major* | **Unify blocksize after encoding to decrease memory fragment**
15622
15623 Blocksize is set in columnfamily's atrributes. It is used to control block sizes when generating blocks. But, it doesn't take encoding into count. If you set encoding to blocks, after encoding, the block size varies. Since blocks will be cached in memory after encoding (default), it will cause memory fragment if using blockcache, or decrease the pool efficiency if using bucketCache. This issue introduced a new config named 'hbase.writer.unified.encoded.blocksize.ratio'. The default value of this config is 1, meaning doing nothing. If this value is set to a smaller value like 0.5, and the blocksize is set to 64KB(default value of blocksize). It will unify the blocksize after encoding to 64KB \* 0.5 = 32KB. Unified blocksize will releaf the memory problems mentioned above.
15624
15625
15626 ---
15627
15628 * [HBASE-14286](https://issues.apache.org/jira/browse/HBASE-14286) | *Trivial* | **Correct typo in argument name for WALSplitter.writeRegionSequenceIdFile**
15629
15630 HBASE-14286 Correct typo in argument name for WALSplitter.writeRegionSequenceIdFile
15631
15632
15633 ---
15634
15635 * [HBASE-17817](https://issues.apache.org/jira/browse/HBASE-17817) | *Major* | **Make Regionservers log which tables it removed coprocessors from when aborting**
15636
15637 Add table name to exception logging when a coprocessor is removed from a table by the region server
15638
15639
15640 ---
15641
15642 * [HBASE-17877](https://issues.apache.org/jira/browse/HBASE-17877) | *Major* | **Improve HBase's byte[] comparator**
15643
15644 updated the lexicographic byte array comparator to use a slightly more optimized version similar to the one available in the guava library that compares only the first index where left[index] != right[index]. The comparator also returns the diff directly instead of mapping it to -1, 0, +1 range as was being done in the earlier version. We have seen significant performance gains, calculated in terms of throughput (ops/ms) with these changes ranging from approx 20% for smaller byte arrays upto 200 bytes and almost 100% for large byte array sizes that are in few KB's. We benchmarked with upto 16KB arrays and the general trend indicates that the performance improvement increases as the size of the byte array increases.
15645
15646
15647 ---
15648
15649 * [HBASE-9899](https://issues.apache.org/jira/browse/HBASE-9899) | *Major* | **for idempotent operation dups, return the result instead of throwing conflict exception**
15650
15651 Non-idempotent operations (increment/append/checkAndPut/...) may throw OperationConflictException even though the increment/append succeeded. For example (client rpc retries number set to 3):
15652
15653 1. first increment rpc request success
15654 2. client timeout and send second rpc request, but nonce is same and save in server. The server found that it has already succeed, so return a OperationConflictException to make sure that increment operation only be applied once in server.
15655
15656 This patch will solve this problem by read the previous result when receive a duplicate rpc request.
15657 1. Store the mvcc to OperationContext. When first rpc request succeed, store the mvcc for this operation nonce.
15658 2. When there are duplicate rpc request, convert to read result by the mvcc.
15659
15660
15661 ---
15662
15663 * [HBASE-15583](https://issues.apache.org/jira/browse/HBASE-15583) | *Minor* | **Any HTableDescriptor we give out should be immutable**
15664
15665 # The HTD got from Admin, AsyncAdmin, and Table is immutable.
15666 # DEFERRED\_LOG\_FLUSH is removed.
15667 # cleanup the deprecated construction of HTD
15668
15669
15670 ---
15671
15672 * [HBASE-17956](https://issues.apache.org/jira/browse/HBASE-17956) | *Major* | **Raw scan should ignore TTL**
15673
15674 Now raw scan can also read expired cells.
15675
15676
15677 ---
15678
15679 * [HBASE-15143](https://issues.apache.org/jira/browse/HBASE-15143) | *Minor* | **Procedure v2 - Web UI displaying queues**
15680
15681 Adds a new Admin#listLocks, a panel on the procedures page to list procedure locks, and a list\_locks command to the shell. Use it to see current state of procedure locking in Master process.
15682
15683
15684 ---
15685
15686 * [HBASE-17514](https://issues.apache.org/jira/browse/HBASE-17514) | *Minor* | **Warn when Thrift Server 1 is configured for proxy users but not the HTTP transport**
15687
15688 If users of the Thrift 1 Server enable proxy user support without enabling the prerequisite HTTP transport, we now log a WARN message about the mismatch.
15689
15690
15691 ---
15692
15693 * [HBASE-17914](https://issues.apache.org/jira/browse/HBASE-17914) | *Major* | **Create a new reader instead of cloning a new StoreFile when compaction**
15694
15695 StoreFile.createReader method is gone. Call initReader and then getReader instead.
15696
15697
15698 ---
15699
15700 * [HBASE-16477](https://issues.apache.org/jira/browse/HBASE-16477) | *Major* | **Remove Writable interface and related code from WALEdit/WALKey**
15701
15702 Removes the Writables, and related code from WALEdit class. HBase-2.0 will not be able to read WAL files written with 0.94.x and before.
15703
15704
15705 ---
15706
15707 * [HBASE-17858](https://issues.apache.org/jira/browse/HBASE-17858) | *Major* | **Update refguide about the IS annotation if necessary**
15708
15709 Updated refguide to tell users that IS annotation is only valid for IA.LimitedPrivate classes.
15710
15711
15712 ---
15713
15714 * [HBASE-17857](https://issues.apache.org/jira/browse/HBASE-17857) | *Major* | **Remove IS annotations from IA.Public classes**
15715
15716 Now we do not have InterfaceStability annotations for IA,Public API. The stability of these classes will follow the rule of 'Semantic Versioning'.
15717
15718
15719 ---
15720
15721 * [HBASE-17215](https://issues.apache.org/jira/browse/HBASE-17215) | *Major* | **Separate small/large file delete threads in HFileCleaner to accelerate archived hfile cleanup speed**
15722
15723 After HBASE-17215 we change to use two threads for (archived) hfile cleaning. The size throttling for large/small files could be set through "hbase.regionserver.thread.hfilecleaner.throttle" and default to 67108864 (64M). It supports online configuration change, just find the active master address through zookeeper dump and use it in update\_config command, e.g. update\_config 'hbasem1.et2.tbsite.net,60100,1488038696741'
15724
15725
15726 ---
15727
15728 * [HBASE-16780](https://issues.apache.org/jira/browse/HBASE-16780) | *Critical* | **Since move to protobuf3.1, Cells are limited to 64MB where previous they had no limit**
15729
15730 Upgrade internal pb to 3.2 from 3.1. 3.2 has fix for 64MB limit.
15731
15732
15733 ---
15734
15735 * [HBASE-17287](https://issues.apache.org/jira/browse/HBASE-17287) | *Blocker* | **Master becomes a zombie if filesystem object closes**
15736
15737 If filesystem is not available during log split, abort master server.
15738
15739
15740 ---
15741
15742 * [HBASE-17765](https://issues.apache.org/jira/browse/HBASE-17765) | *Major* | **Reviving the merge possibility in the CompactingMemStore**
15743
15744 Reviving the merge of the compacting pipeline: making the limit on the number of the segments in the pipeline configurable and adding the merge test.
15745
15746 In order to customize the pipeline size limit change the value of the "hbase.hregion.compacting.pipeline.segments.limit" in the hbase-site.xml
15747
15748 Value 1 means to merge the segments on any flush-in-memory. Value higher than 16 means no merge.
15749
15750
15751 ---
15752
15753 * [HBASE-13395](https://issues.apache.org/jira/browse/HBASE-13395) | *Major* | **Remove HTableInterface**
15754
15755 HTableInterface was deprecated in 0.21.0 and is removed in 2.0.0. Use org.apache.hadoop.hbase.client.Table instead.
15756
15757
15758 ---
15759
15760 * [HBASE-17595](https://issues.apache.org/jira/browse/HBASE-17595) | *Critical* | **Add partial result support for small/limited scan**
15761
15762 Now small scan and limited scan could also return partial results.
15763
15764
15765 ---
15766
15767 * [HBASE-16014](https://issues.apache.org/jira/browse/HBASE-16014) | *Major* | **Get and Put constructor argument lists are divergent**
15768
15769 Add 2 constructors fot API Get
15770 1. Get(byte[], int, int)
15771 2. Get(ByteBuffer)
15772
15773
15774 ---
15775
15776 * [HBASE-17584](https://issues.apache.org/jira/browse/HBASE-17584) | *Major* | **Expose ScanMetrics with ResultScanner rather than Scan**
15777
15778 Now you can use ResultScanner.getScanMetrics to get the scan metrics at any time during the scan operation. The old Scan.getScanMetrics is deprecated and still work, but if you use ResultScanner.getScanMetrics to get the scan metrics and reset it, then the metrics published to the Scan instaince will be messed up.
15779
15780
15781 ---
15782
15783 * [HBASE-17802](https://issues.apache.org/jira/browse/HBASE-17802) | *Major* | **Add note that minor versions can add methods to Interfaces**
15784
15785 Update our semver section to include a note on our allowing ourselves the right to add methods to an Interface over a minor version as agreed to up on the dev list:  "If a Client implements an HBase Interface, a recompile MAY be required upgrading to a newer minor version (See release notes for warning about incompatible changes). All effort will be made to provide a default implementation so this case should not arise."
15786
15787
15788 ---
15789
15790 * [HBASE-17426](https://issues.apache.org/jira/browse/HBASE-17426) | *Major* | **Inconsistent environment variable names for enabling JMX**
15791
15792 In bin/hbase-config.sh,
15793 if value for HBASE\_JMX\_BASE is empty, keep current behavior.
15794 if HBASE\_JMX\_OPTS is not empty, keep current behavior.
15795 otherwise use the value of HBASE\_JMX\_BASE
15796
15797
15798 ---
15799
15800 * [HBASE-17740](https://issues.apache.org/jira/browse/HBASE-17740) | *Critical* | **Correct the semantic of batch and partial for async client**
15801
15802 Now async client has the same semantic with sync client for batch and partial.
15803 '''
15804 Now setBatch doesn't mean setAllowPartialResult(true)
15805 If user setBatch(5) and rpc returns 3+5+5+5+3 cells, we should return 5+5+5+5+1 to user.
15806 '''
15807
15808 Also a minor API change:
15809 Result#createCompleteResult(List\<Result\>) is changed to Result#createCompleteResult(Iterable\<Result\>).
15810
15811
15812 ---
15813
15814 * [HBASE-17746](https://issues.apache.org/jira/browse/HBASE-17746) | *Major* | **TestSimpleRpcScheduler.testCoDelScheduling is broken**
15815
15816 The executor for CoDel is changed to FastPathBalancedQueueRpcExecutor
15817
15818
15819 ---
15820
15821 * [HBASE-17712](https://issues.apache.org/jira/browse/HBASE-17712) | *Major* | **Remove/Simplify the logic of RegionScannerImpl.handleFileNotFound**
15822
15823 Add a config named 'hbase.hregion.unassign.for.fnfe'. It is used to control whether to reopen a region when hitting FileNotFoundException. The default value is true.
15824
15825
15826 ---
15827
15828 * [HBASE-15941](https://issues.apache.org/jira/browse/HBASE-15941) | *Major* | **HBCK repair should not unsplit healthy splitted region**
15829
15830 A new option -removeParents is now available that will remove an old parent when two valid daughters for that parent exist and -fixHdfsOverlaps is used. If there is an issue trying to remove the parent from META or sidelining the parent from HDFS we will fallback to do a regular merge. For now this option only works when the overlap group consists only of 3 regions (a parent, daughter A and daughter B)
15831
15832
15833 ---
15834
15835 * [HBASE-17737](https://issues.apache.org/jira/browse/HBASE-17737) | *Major* | **Thrift2 proxy should support scan timeRange per column family**
15836
15837 Thrift2 proxy supports scan timeRange per column family
15838
15839
15840 ---
15841
15842 * [HBASE-17718](https://issues.apache.org/jira/browse/HBASE-17718) | *Major* | **Difference between RS's servername and its ephemeral node cause SSH stop working**
15843
15844 Fix our accidentally registering a RegionServer's ephermal znode BEFORE we checked in with the master.
15845
15846
15847 ---
15848
15849 * [HBASE-17717](https://issues.apache.org/jira/browse/HBASE-17717) | *Critical* | **Incorrect ZK ACL set for HBase superuser**
15850
15851 In previous versions of HBase, the system intended to set a ZooKeeper ACL on all "sensitive" ZNodes for the user specified in the hbase.superuser configuration property. Unfortunately, the ACL was malformed which resulted in the hbase.superuser being unable to access the sensitive ZNodes that HBase creates. This JIRA issue fixes this bug. HBase will automatically correct the ACLs on start so users do not need to manually correct the ACLs.
15852
15853
15854 ---
15855
15856 * [HBASE-17716](https://issues.apache.org/jira/browse/HBASE-17716) | *Minor* | **Formalize Scan Metric names**
15857
15858 HBASE-17716 breaks compatibility of ServerSideScanMetrics by changing public field names, and the issue is fixed through HBASE-17886
15859
15860
15861 ---
15862
15863 * [HBASE-15484](https://issues.apache.org/jira/browse/HBASE-15484) | *Blocker* | **Correct the semantic of batch and partial**
15864
15865 Now setBatch doesn't mean setAllowPartialResult(true)
15866 If user setBatch(5) and rpc returns 3+5+5+5+3 cells, we should return 5+5+5+5+1 to user.
15867 Scan#setBatch is helpful in paging queries, if you just want to prevent OOM at client, use setAllowPartialResults(true) is better.
15868 We deprecated isPartial and use mayHaveMoreCellsInRow. If it returns false, current Result must be the last one of this row.
15869
15870
15871 ---
15872
15873 * [HBASE-17312](https://issues.apache.org/jira/browse/HBASE-17312) | *Major* | **[JDK8] Use default method for Observer Coprocessors**
15874
15875 Deletes BaseMasterAndRegionObserver, BaseMasterObserver, BaseRegionObserver, BaseRegionServerObserver and BaseWALObserver.
15876 Their corresponding interface classes now use JDK8's 'default' keyword to provide empty/no-op implementations so that:
15877 1. Derived class don't break when more coprocessor hooks are added in future.
15878 2. Derived classes don't have to redundantly override functions they don't care about with empty implementations.
15879
15880 Earlier, BaseXXXObserver classes provided these exact two benefits, but with 'default' keyword in JDK8, they are not needed anymore.
15881
15882 To fix the breakages because of this change, simply change "Foo extends BaseXXXObserver" to "Foo implements XXXObserver".
15883
15884
15885 ---
15886
15887 * [HBASE-17647](https://issues.apache.org/jira/browse/HBASE-17647) | *Major* | **OffheapKeyValue#heapSize() implementation is wrong**
15888
15889 **WARNING: No release note provided for this change.**
15890
15891
15892 ---
15893
15894 * [HBASE-13718](https://issues.apache.org/jira/browse/HBASE-13718) | *Minor* | **Add a pretty printed table description to the table detail page of HBase's master**
15895
15896 <!-- markdown -->
15897
15898
15899 The table information page in the Master UI now includes a schema section that describes the column families defined for that table as well as any column family specific properties that are set.
15900
15901
15902 ---
15903
15904 * [HBASE-17472](https://issues.apache.org/jira/browse/HBASE-17472) | *Major* | **Correct the semantic of  permission grant**
15905
15906 Before this patch, later granted permissions will override previous granted permissions, and previous granted permissions LOST. this issue re-define grant semantic: for master branch, later granted permissions will merge with previous granted permissions.  for branch-1.4, grant keep override behavior for compatibility purpose, and a grant with mergeExistingPermission flag provided.
15907
15908
15909 ---
15910
15911 * [HBASE-17583](https://issues.apache.org/jira/browse/HBASE-17583) | *Major* | **Add inclusive/exclusive support for startRow and endRow of scan for sync client**
15912
15913 Now you can include/exlude the startRow and stopRow for a scan. And the new methods to specify startRow and stopRow are withStartRow and withStopRow. The old methods to specify startRow and Row(include constructors) are marked as deprecated as in the old time if startRow and stopRow are equal then we will consider it as a get scan and include the stopRow implicitly. This is strange after we can set inclusiveness explicitly so we add new methods and depredate the old methods. The deprecated methods will be removed in the future.
15914
15915
15916 ---
15917
15918 * [HBASE-9702](https://issues.apache.org/jira/browse/HBASE-9702) | *Major* | **Change unittests that use "table" or "testtable" to use method names.**
15919
15920 Changes all tests to use the TestName JUnit Rule everywhere rather than hardcode table/region/store names.
15921
15922
15923 ---
15924
15925 * [HBASE-17280](https://issues.apache.org/jira/browse/HBASE-17280) | *Minor* | **Add mechanism to control hbase cleaner behavior**
15926
15927 The HBase cleaner chore process cleans up old WAL files and archived HFiles. Cleaner operation can affect query performance when running heavy workloads, so disable the cleaner during peak hours. The cleaner has the following HBase shell commands:
15928
15929 - cleaner\_chore\_enabled: Queries whether cleaner chore is enabled/ disabled.
15930 - cleaner\_chore\_run: Manually runs the cleaner to remove files.
15931 - cleaner\_chore\_switch: enables or disables the cleaner and returns the previous state of the cleaner. For example, cleaner-switch true enables the cleaner.
15932
15933 Following APIs are added in Admin:
15934 - setCleanerChoreRunning(boolean on): Enable/Disable the cleaner chore
15935 - runCleanerChore(): Ask for cleaner chore to run
15936 - isCleanerChoreEnabled(): Query whether cleaner chore is enabled/ disabled.
15937
15938
15939 ---
15940
15941 * [HBASE-17599](https://issues.apache.org/jira/browse/HBASE-17599) | *Major* | **Use mayHaveMoreCellsInRow instead of isPartial**
15942
15943 The word 'isPartial' is ambiguous so we introduce a new method 'mayHaveMoreCellsInRow' to replace it. And the old meaning of 'isPartial' is not the same with 'mayHaveMoreCellsInRow' as for batched scan, if the number of returned cells equals to the batch, isPartial will be false. After this change the meaning of 'isPartial' will be same with 'mayHaveMoreCellsInRow'. This is an incompatible change but it is not likely to break a lot of things as for batched scan the old 'isPartial' is just a redundant information, i.e, if the number of returned cells reaches the batch limit. You have already know the number of returned cells and the value of batch.
15944
15945
15946 ---
15947
15948 * [HBASE-17437](https://issues.apache.org/jira/browse/HBASE-17437) | *Major* | **Support specifying a WAL directory outside of the root directory**
15949
15950 This patch adds support for specifying a WAL directory outside of the HBase root directory.
15951
15952 Multiple configuration variables were added to accomplish this:
15953 hbase.wal.dir: used to configure where the root WAL directory is located. Could be on a different FileSystem than the root directory. WAL directory can not be set to a subdirectory of the root directory. The default value of this is the root directory if unset.
15954
15955 hbase.rootdir.perms: Configures FileSystem permissions to set on the root directory. This is '700' by default.
15956
15957 hbase.wal.dir.perms: Configures FileSystem permissions to set on the WAL directory FileSystem. This is '700' by default.
15958
15959
15960 ---
15961
15962 * [HBASE-17350](https://issues.apache.org/jira/browse/HBASE-17350) | *Critical* | **Fixup of regionserver group-based assignment**
15963
15964 A few bug fixes and tweaks to the fsgroup feature.
15965
15966 Renamed shell command move\_rsgroup\_servers as move\_servers\_rsgroup
15967 Renamed shell comand move\_rsgroup\_tables as move\_tables\_rsgroup
15968
15969 Made the 'default' group more 'dynamic'; i.e. dead servers no longer show in the 'default' group.
15970
15971
15972 ---
15973
15974 * [HBASE-17578](https://issues.apache.org/jira/browse/HBASE-17578) | *Major* | **Thrift per-method metrics should still update in the case of exceptions**
15975
15976 In prior versions, the HBase Thrift handlers failed to increment per-method metrics when an exception was encountered.  These metrics will now always be incremented, whether an exception is encountered or not.  This change also adds exception-type metrics, similar to those exposed in regionservers, for individual exceptions which are received by the Thrift handlers.
15977
15978
15979 ---
15980
15981 * [HBASE-17508](https://issues.apache.org/jira/browse/HBASE-17508) | *Major* | **Unify the implementation of small scan and regular scan for sync client**
15982
15983 Now the scan.setSmall method is deprecated. Consider using scan.setLimit and scan.setReadType in the future. And we will open scanner lazily when you call scanner.next. This is an incompatible change which delays the table existence check and permission check.
15984
15985
15986 ---
15987
15988 * [HBASE-16981](https://issues.apache.org/jira/browse/HBASE-16981) | *Major* | **Expand Mob Compaction Partition policy from daily to weekly, monthly**
15989
15990 Mob compaction partition policy can be set by
15991 hbase\> create 't1', {NAME =\> 'f1', IS\_MOB =\> true, MOB\_THRESHOLD =\> 1000000, MOB\_COMPACT\_PARTITION\_POLICY =\> 'weekly'}
15992
15993 or
15994
15995 hbase\> alter 't1', {NAME =\> 'f1', IS\_MOB =\> true, MOB\_THRESHOLD =\> 1000000, MOB\_COMPACT\_PARTITION\_POLICY =\> 'monthly'}
15996
15997 Available MOB\_COMPACT\_PARTITION\_POLICY options are "daily", "weekly" and "monthly", the default is "daily".
15998
15999 When it is "weekly" policy, the mob compaction will try to compact files within one calendar week into one for a specific partition, similar for "daily" and "monthly".
16000
16001 With "weekly" policy, one mob file normally is compacted twice during its lifetime (that is first on daily basis and then all such daily based compacted files belonging to a week at the weekly interval), for one region, there normally are 52 files for one year. With "Monthly" policy, one mob file normally is compacted 3 times during its lifetime (First daily and then weekly followed by monthly at end of every month) and normally there are 12 files for one year.
16002
16003
16004 ---
16005
16006 * [HBASE-17197](https://issues.apache.org/jira/browse/HBASE-17197) | *Major* | **hfile does not work in 2.0**
16007
16008 The -f argument is no longer required specifying target file; just pass the file as an argument.
16009
16010
16011 ---
16012
16013 * [HBASE-16812](https://issues.apache.org/jira/browse/HBASE-16812) | *Minor* | **Clean up the locks in MOB**
16014
16015 In MOB-enabled column family, the lock in the major compaction is removed. All the delete markers are retained in the major compaction, and a MOB reference tag is appended to each of the retained delete markers.
16016
16017
16018 ---
16019
16020 * [HBASE-12894](https://issues.apache.org/jira/browse/HBASE-12894) | *Critical* | **Upgrade Jetty to 9.2.6**
16021
16022 Upgrades Jetty to 9.x from 6.x (Jetty9 is in different namespace from Jetty6). Also updated Jersey to 2.x and Servlet to 3.x.
16023
16024
16025 ---
16026
16027 * [HBASE-17566](https://issues.apache.org/jira/browse/HBASE-17566) | *Major* | **Jetty upgrade fixes**
16028
16029 Fix inability at finding static content post push of parent issue moving us to jetty9.
16030
16031
16032 ---
16033
16034 * [HBASE-9774](https://issues.apache.org/jira/browse/HBASE-9774) | *Major* | **HBase native metrics and metric collection for coprocessors**
16035
16036 This issue adds two new modules, hbase-metrics and hbase-metrics-api which define and implement the "new" metric system used internally within HBase. These two modules (and some other code in hbase-hadoop2-compat) module are referred as "HBase metrics framework" which is HBase-specific and independent of any other metrics library (including Hadoop metrics2 and dropwizards metrics).
16037
16038 HBase Metrics API (hbase-metrics-api) contains the interface that HBase exposes internally and to third party code (including coprocessors). It is a thin
16039 abstraction over the actual implementation for backwards compatibility guarantees. The metrics API in this hbase-metrics-api module is inspired by the Dropwizard metrics 3.1 API, however, the API is completely independent.
16040
16041 hbase-metrics module contains implementation of the "HBase Metrics API", including MetricRegistry, Counter, Histogram, etc. These are highly concurrent implementations of the Metric interfaces. Metrics in HBase are grouped into different sets (like WAL, RPC, RegionServer, etc). Each group of metrics should be tracked via a MetricRegistry specific to that group.
16042
16043 Historically, HBase has been using Hadoop's Metrics2 framework [3] for collecting and reporting the metrics internally. However, due to the difficultly of dealing with the Metrics2 framework, HBase is moving away from Hadoop's metrics implementation to its custom implementation. The move will happen incrementally, and during the time, both Hadoop Metrics2-based metrics and hbase-metrics module based classes will be in the source code. All new implementations for metrics SHOULD use the new API and framework.
16044
16045 This jira also introduces the metrics API to coprocessor implementations. Coprocessor writes can export custom metrics using the API and have those collected via metrics2 sinks, as well as exported via JMX in regionserver metrics.
16046
16047 More documentation available at: hbase-metrics-api/README.txt
16048
16049
16050 ---
16051
16052 * [HBASE-17491](https://issues.apache.org/jira/browse/HBASE-17491) | *Major* | **Remove all setters from HTable interface and introduce a TableBuilder to build Table instance**
16053
16054 After HBASE-17491 all setter methods in HTable are marked as deprecated, moved into TableBuilder, and will be removed later.
16055
16056
16057 ---
16058
16059 * [HBASE-17067](https://issues.apache.org/jira/browse/HBASE-17067) | *Major* | **Procedure v2 - remove tryAcquire\*Lock and use wait/wake to make framework event based**
16060
16061 Make the framework more 'lively'; undo 'suspend' notion in Procedure, rely on eventing mechanism instead. Lets us remove no longer needed synchronizations. Framework can now do more ops per second.
16062
16063
16064 ---
16065
16066 * [HBASE-16698](https://issues.apache.org/jira/browse/HBASE-16698) | *Major* | **Performance issue: handlers stuck waiting for CountDownLatch inside WALKey#getWriteEntry under high writing workload**
16067
16068 Assign sequenceid to an edit before we go on the ringbuffer; undoes contention on WALKey latch. Adds a new config "hbase.hregion.mvcc.preassign" which defaults to true: i.e. this speedup is enabled.
16069
16070 User could set this per-table level, like:
16071 create 'table',{NAME=\>'f1',CONFIGURATION=\>{'hbase.hregion.mvcc.preassign'=\>'false'}}
16072
16073
16074 ---
16075
16076 * [HBASE-17488](https://issues.apache.org/jira/browse/HBASE-17488) | *Trivial* | **WALEdit should be lazily instantiated**
16077
16078 prevent creating unused objects in the WALEdit's construction.
16079 +If the cp#preBatchMutate returns true, the WALEdit is useless. So we should create the WALEdit after step 2.
16080 +The cells came from cp should be counted because they are added into the WALEdit . The use case is the local index of phoenix
16081 +If the mutation contains the SKIP\_WAL property, its cells aren't added into the WALEdit. So these cells shouldn't be counted.
16082
16083
16084 ---
16085
16086 * [HBASE-16831](https://issues.apache.org/jira/browse/HBASE-16831) | *Minor* | **Procedure V2 - Remove org.apache.hadoop.hbase.zookeeper.lock**
16087
16088 Purges code that did zk-hosted locks for table ops (we do procedure-based locks now)
16089
16090
16091 ---
16092
16093 * [HBASE-16867](https://issues.apache.org/jira/browse/HBASE-16867) | *Major* | **Procedure V2 - Check ACLs for remote HBaseLock**
16094
16095 Add checking ACL when taking locks.
16096
16097
16098 ---
16099
16100 * [HBASE-16786](https://issues.apache.org/jira/browse/HBASE-16786) | *Major* | **Procedure V2 - Move ZK-lock's uses to Procedure framework locks (LockProcedure)**
16101
16102 Move locking to be procedure (Pv2) rather than zookeeper based. All locking moved over to new infrastructure including MOBing locking.
16103
16104
16105 ---
16106
16107 * [HBASE-17470](https://issues.apache.org/jira/browse/HBASE-17470) | *Major* | **Remove merge region code from region server**
16108
16109 In 1.x branches, Admin.mergeRegions calls MASTER via dispatchMergingRegions RPC; when executing dispatchMergingRegions RPC, MASTER calls RS via MergeRegions to complete the merge in RS-side.
16110
16111 With HBASE-16119, the merge logic moves to master-side.  This JIRA cleans up unused RPCs (dispatchMergingRegions and MergeRegions) , removes dangerous tools such as Merge and HMerge, and deletes unused RegionServer-side merge region logic in 2.0 release.
16112
16113
16114 ---
16115
16116 * [HBASE-16744](https://issues.apache.org/jira/browse/HBASE-16744) | *Major* | **Procedure V2 - Lock procedures to allow clients to acquire locks on tables/namespaces/regions**
16117
16118  Lock for HBase Entity either a Table, a Namespace, or Regions.
16119
16120 These are remote locks which live on master, and need periodic heartbeats to keep them alive. (Once we request the lock, internally an heartbeat thread will be started). If master doesn't receive the heartbeat in time, it'll release the lock and make it available to other users.
16121
16122 Use {@link LockServiceClient} to build instances. Then call {@link #requestLock()}. {@link #requestLock} will contact master to queue the lock and start the heartbeat thread which will check lock's status periodically and once the lock is acquired, it will send the heartbeats to the master.
16123
16124 Use {@link #await} or {@link #await(long, TimeUnit)} to wait for the lock to be acquired. Always call {@link #unlock()} irrespective of whether lock was acquired or not. If the lock was acquired, it'll be released. If it was not acquired, it is possible that master grants the lock in future and the heartbeat thread keeps it alive forever by sending heartbeats. Calling {@link #unlock()} will stop the heartbeat thread and cancel the lock queued on master.
16125
16126 There are 4 ways in which these remote locks may be released/can be lost:
16127   \* Call {@link #unlock}.
16128   \* Lock times out on master: Can happen because of network issues, GC pauses, etc. Worker thread will call the given abortable as soon as it detects such a situation. Fail to contact master: If worker thread can not contact mater and thus fails to send heartbeat before the timeout expires, it assumes that lock is lost and calls the
16129  \*     abortable.
16130 Worker thread is interrupted.
16131
16132 Use example:
16133
16134  EntityLock lock = lockServiceClient.\*Lock(...., "exampled lock", abortable);
16135   lock.requestLock();
16136   ....
16137    ....can do other initializations here since lock is 'asynchronous'...
16138  ....
16139  if (lock.await(timeout)) {
16140     ....logic requiring mutual exclusion
16141   }
16142    lock.unlock();
16143
16144
16145 ---
16146
16147 * [HBASE-14061](https://issues.apache.org/jira/browse/HBASE-14061) | *Major* | **Support CF-level Storage Policy**
16148
16149 After HBASE-14061 we support to set storage policy for HFile through "hbase.hstore.block.storage.policy" configuration, and we support CF-level setting to override the settings from configuration file. Currently supported storage policies include ALL\_SSD/ONE\_SSD/HOT/WARM/COLD, refer to http://hadoop.apache.org/docs/r2.6.0/hadoop-project-dist/hadoop-hdfs/ArchivalStorage.html for more details
16150
16151 For example, to create a table with two families: "cf1" with "ALL\_SSD" storage policy and "cf2" with "ONE\_SSD", we could use below command in hbase shell:
16152 create 'table',{NAME=\>'f1',STORAGE\_POLICY=\>'ALL\_SSD'},{NAME=\>'f2',STORAGE\_POLICY=\>'ONE\_SSD'}
16153
16154 We could also set the configuration in table attribute like all other configurations:
16155 create 'table',{NAME=\>'f1',CONFIGURATION=\>{'hbase.hstore.block.storage.policy'=\>'ONE\_SSD'}}
16156
16157
16158 ---
16159
16160 * [HBASE-17337](https://issues.apache.org/jira/browse/HBASE-17337) | *Major* | **list replication peers request should be routed through master**
16161
16162 List replication peers request will be roughed through master.
16163
16164
16165 ---
16166
16167 * [HBASE-15172](https://issues.apache.org/jira/browse/HBASE-15172) | *Major* | **Support setting storage policy in bulkload**
16168
16169 After HBASE-15172/HBASE-19016 we could set storage policy through "hbase.hstore.block.storage.policy" property for bulkload, or "hbase.hstore.block.storage.policy.\<family\_name\>" for a specified family. Supported storage policy includes: ALL\_SSD, ONE\_SSD, HOT, WARM, COLD, etc.
16170
16171
16172 ---
16173
16174 * [HBASE-17336](https://issues.apache.org/jira/browse/HBASE-17336) | *Major* | **get/update replication peer config requests should be routed through master**
16175
16176 Get/update replication peer config requests will be routed through master.
16177
16178
16179 ---
16180
16181 * [HBASE-17320](https://issues.apache.org/jira/browse/HBASE-17320) | *Major* | **Add inclusive/exclusive support for startRow and endRow of scan**
16182
16183 Now you can specific the inclusive of startRow and stopRow for a scan using the new methods withStartRow(byte[] startRow, boolean inclusive) and withStopRow(byte[] stopRow, boolean inclusive). The old setStartRow and setStopRow methods, and the constructors are marked as deprecated because of an strange behavior that we will include the stopRow implicitly if startRow equals to stopRow. This is used to support get scan in the old time. Use withStartRow and withStopRow instead.
16184
16185 For developers, the ConnectionUtils.createClosestRowBefore is also marked as deprecated as the row returned by this method is only very very close to the current row, not closest. Avoid using this method in the future.
16186
16187
16188 ---
16189
16190 * [HBASE-17314](https://issues.apache.org/jira/browse/HBASE-17314) | *Major* | **Limit total buffered size for all replication sources**
16191
16192 Add a conf "replication.total.buffer.quota" to limit total size of buffered entries in all replication peers. It will prevent server getting OOM if there are many peers. Default value is 256MB.
16193
16194
16195 ---
16196
16197 * [HBASE-17174](https://issues.apache.org/jira/browse/HBASE-17174) | *Minor* | **Refactor the AsyncProcess, BufferedMutatorImpl, and HTable**
16198
16199 + cleanup some unused code
16200 + allow being able to share pool between BufferedMutatorImpl
16201 + setting "hbase.client.request.controller.impl" to the name of the alternate RequestController (traffic control) implementation class in Configuration
16202 + The default RequestController implementation is SimpleRequestController
16203 + setting "hbase.client.log.detail.period.ms" to call logger on a period when waiting for tasks to complete
16204
16205
16206 ---
16207
16208 * [HBASE-17335](https://issues.apache.org/jira/browse/HBASE-17335) | *Major* | **enable/disable replication peer requests should be routed through master**
16209
16210 Enable/Disable replication peer requests will be routed through master.
16211
16212
16213 ---
16214
16215 * [HBASE-5401](https://issues.apache.org/jira/browse/HBASE-5401) | *Major* | **PerformanceEvaluation generates 10x the number of expected mappers**
16216
16217 Changes how many tasks PE runs when clients are mapreduce. Now tasks == client count. Previous we hardcoded ten tasks per client instance.
16218
16219
16220 ---
16221
16222 * [HBASE-11392](https://issues.apache.org/jira/browse/HBASE-11392) | *Critical* | **add/remove peer requests should be routed through master**
16223
16224 Add/Remove replication peer requests will be routed through master. And make ReplicationAdmin as Deprecated.
16225
16226
16227 ---
16228
16229 * [HBASE-15924](https://issues.apache.org/jira/browse/HBASE-15924) | *Major* | **Enhance hbase services autorestart capability to hbase-daemon.sh**
16230
16231 Now one can start hbase services with enabled "autostart/autorestart" feature in controlled fashion with the help of "--autostart-window-size" to define the window period and the "--autostart-window-retry-limit" to define the number of times the hbase services have to be restarted upon being killed/terminated abnormally within the provided window perioid.
16232
16233 The following cases are supported with "autostart/autorestart":
16234
16235 a) --autostart-window-size=0 and --autostart-window-retry-limit=0, indicates infinite window size and no retry limit
16236 b) not providing the args, will default to a)
16237 c) --autostart-window-size=0 and --autostart-window-retry-limit=\<positive value\> indicates the autostart process to bail out if the retry limit exceeds irrespective of window period
16238 d) --autostart-window-size=\<x\> and --autostart-window-retry-limit=\<y\> indicates the autostart process to bail out if the retry limit "y" is exceeded for the last window period "x".
16239
16240
16241 ---
16242
16243 * [HBASE-17331](https://issues.apache.org/jira/browse/HBASE-17331) | *Minor* | **Avoid busy waiting in ThrottledInputStream**
16244
16245 For each read(), old ThrottledInputStream sleeps/wakes/checks for many times for controlling the throughput. After this patch, ThrottledInputStream sleeps/wakes/checks only once. So we can reduce CPU usage.
16246
16247
16248 ---
16249
16250 * [HBASE-17296](https://issues.apache.org/jira/browse/HBASE-17296) | *Major* | **Provide per peer throttling for replication**
16251
16252 Provide per peer throttling for replication. Add the bandwidth upper limit to ReplicationPeerConfig and a new shell cmd set\_peer\_bandwidth to update the bandwidth in need.
16253
16254
16255 ---
16256
16257 * [HBASE-17277](https://issues.apache.org/jira/browse/HBASE-17277) | *Major* | **Allow alternate BufferedMutator implementation**
16258
16259 Specify the name of an alternate BufferedMutator implementation by either:
16260
16261  \* Setting "hbase.client.bufferedmutator.classname" to the name of the alternate implementation class in Configuration
16262  \* Or, by setting BufferedMutatorParams#implementationClassName and passing the amended BufferedMutatorParams when calling Connection#getBufferedMutator.
16263
16264
16265 ---
16266
16267 * [HBASE-17294](https://issues.apache.org/jira/browse/HBASE-17294) | *Major* | **External Configuration for Memory Compaction**
16268
16269 This patch provides a single external knob to control memstore compaction. It also inmemory compaction with BASIC policy as our default (AFTERWORD: inmemory compaction as default was undone in HBASE-17333 because of test failures; will be reenabled in later, dedicated issue)
16270
16271 Possible memstore compaction policies are:
16272 (1) None - no memory compaction, when size threshold is exceeded data is flushed to disk
16273 (2) Basic policy applies optimizations which modify the index to a more compacted representation. This is beneficial in all access patterns. The smaller the cells are the greater the benefit of this policy. This is the default policy.
16274 (3) Eager - in addition to compacting the index representation as the basic policy, eager policy eliminates duplication while the data is still in memory (much like the on-disk compaction does after the data is flushed to disk). This policy is most useful for applications with high data churn or small working sets.
16275
16276 Memory compaction policeman be set at the column family level at table creation time:
16277 {code}
16278 create ‘\<tablename\>’,
16279    {NAME =\> ‘\<cfname\>’,
16280     IN\_MEMORY\_COMPACTION =\> ‘\<NONE\|BASIC\|EAGER\>’}
16281 {code}
16282 or as a property at the global configuration level by setting the property in hbase-site.xml, with BASIC being the default value:
16283 {code}
16284 \<property\>
16285         \<name\>hbase.hregion.compacting.memstore.type\</name\>
16286         \<value\>\<NONE\|BASIC\|EAGER\>\</value\>
16287 \</property\>
16288 {code}
16289 The values used in this property can change as memstore compaction policies evolve over time.
16290
16291
16292 ---
16293
16294 * [HBASE-16336](https://issues.apache.org/jira/browse/HBASE-16336) | *Major* | **Removing peers seems to be leaving spare queues**
16295
16296 Add a ReplicationZKNodeCleaner periodically check and delete the useless replication queue zk node belong to the peer which is not exist.
16297
16298
16299 ---
16300
16301 * [HBASE-17272](https://issues.apache.org/jira/browse/HBASE-17272) | *Major* | **Doc how to run Standalone HBase over an HDFS instance; all daemons in one JVM but persisting to an HDFS instance**
16302
16303 Adds section at http://hbase.apache.org/book.html#standalone.over.hdfs on how to make standalone persist to an hdfs instance (where standalone is all daemons in the one jvm).
16304
16305
16306 ---
16307
16308 * [HBASE-16700](https://issues.apache.org/jira/browse/HBASE-16700) | *Minor* | **Allow for coprocessor whitelisting**
16309
16310 Provides ability to restrict table coprocessors based on HDFS path whitelist. (Particularly useful for allowing Phoenix coprocessors but not arbitrary user created coprocessors.)
16311
16312
16313 ---
16314
16315 * [HBASE-17221](https://issues.apache.org/jira/browse/HBASE-17221) | *Major* | **Abstract out an interface for RpcServer.Call**
16316
16317 Provide an interface RpcCall on the server side.
16318 RpcServer.Call now is marked as @InterfaceAudience.Private, and implements the interface RpcCall,
16319
16320
16321 ---
16322
16323 * [HBASE-16119](https://issues.apache.org/jira/browse/HBASE-16119) | *Major* | **Procedure v2 - Reimplement merge**
16324
16325 The merge region logic is controlled by master in 2.0.0 (in 1.x, the core merge region logic is in the region server side).  The coprocessors related to merge region in RS-side would be no-op in 2.0.0 and later release.  Therefore, this is an incompatible change.  Users needs to move the CP logic to new master CP and registers them.
16326
16327 A new mergeRegionsAsync() API is added in client.  The existing mergeRegions() API will call the new API so client does not have to change its code.
16328
16329
16330 ---
16331
16332 * [HBASE-17112](https://issues.apache.org/jira/browse/HBASE-17112) | *Major* | **Prevent setting timestamp of delta operations the same as previous value's**
16333
16334 Before this issue, two concurrent Increments/Appends done in same millisecond or RS's clock going back will result in two results have same TS, which is not friendly to versioning and will get wrong result in slave cluster if the replication is disordered.
16335 After this issue, the result of Increment/Append will always have an incremental TS. There is no any inconsistent in replication for these operations. But there is a rare case that if there is a Delete in same millisecond, the later result can not be masked by this Delete. This can be fixed after we have new semantics that previous Delete will never mask later Put even its timestamp is higher.
16336
16337
16338 ---
16339
16340 * [HBASE-17181](https://issues.apache.org/jira/browse/HBASE-17181) | *Minor* | **Let HBase thrift2 support TThreadedSelectorServer**
16341
16342 Add TThreadedSelectorServer support for HBase Thrift2
16343
16344
16345 ---
16346
16347 * [HBASE-17178](https://issues.apache.org/jira/browse/HBASE-17178) | *Major* | **Add region balance throttling**
16348
16349 Add region balance throttling. Master execute every region balance plan per balance interval, which is equals to divide max balancing time by the size of region balance plan. And Introduce a new config hbase.master.balancer.maxRitPercent to protect availability. If config this to 0.01, then the max percent of regions in transition is 1% when balancing. Then the cluster's availability is at least 99% when balancing.
16350
16351
16352 ---
16353
16354 * [HBASE-15786](https://issues.apache.org/jira/browse/HBASE-15786) | *Major* | **Create DBB backed MSLAB pool**
16355
16356 Added a new config hbase.regionserver.offheap.global.memstore.size using which one can specify the global off heap limit that all memstores can use.  When this config is in MSLAB should be turned ON and we will use the entire size for the MSLAB pool. It will make off heap chunks and pool then. It will behave as if we are working with off heap memstores.  When this config is having a valid value and MSLAB is turned OFF, the system will just ignore the offheap size and continue to use global max heap space % for memstores and work with on heap memstores.
16357
16358
16359 ---
16360
16361 * [HBASE-17132](https://issues.apache.org/jira/browse/HBASE-17132) | *Major* | **Cleanup deprecated code for WAL**
16362
16363 Remove HLogKey and related classes and methods. Remove SequenceFile based log reader and writer. WALObserver and RegionObserver are changed so this is an incompatible change.
16364
16365
16366 ---
16367
16368 * [HBASE-16169](https://issues.apache.org/jira/browse/HBASE-16169) | *Major* | **Make RegionSizeCalculator scalable**
16369
16370 Added couple of API's to Admin.java:
16371
16372 Returns region load map of all regions hosted on a region server
16373 Map\<byte[], RegionLoad\> getRegionLoad(ServerName sn) throws IOException;
16374
16375 Returns region load map of all regions of a table hosted on a region server
16376 Map\<byte[], RegionLoad\> getRegionLoad(ServerName sn, TableName tableName) throws IOException
16377
16378 Added an API to region server:
16379
16380 public GetRegionLoadResponse getRegionLoad(RpcController controller,
16381     GetRegionLoadRequest request) throws ServiceException;
16382
16383 Primary intention is to use this API for RegionSizeCalculator and not rely on Master for ClusterStatus. On large clusters, ClusterStatus() can take a long time. IfMaster is down/busy, then some of the jobs timeout/fail. Other possible uses:
16384 1. If there is a lighter version of GetClusterStatus API (i.e without the ServerLoad for each RS), then custom maintenance tools can be better. In current world ClusterStatus is heavy. With the new APIs, each API's payload is smaller and distributed. So custom tools can call getRegionLoad() when needed, it will be more accurate. This helps with large clusters. For tools that don't need RegionLoad, the lighter version of API is fine enough.
16385 2. Another use case is a tool like RSTop - since we can see selective metrics at RegionLevel (possibly even deltas between each RPC to the server).
16386
16387
16388 ---
16389
16390 * [HBASE-15788](https://issues.apache.org/jira/browse/HBASE-15788) | *Major* | **Use Offheap ByteBuffers from BufferPool to read RPC requests.**
16391
16392 Using the ByteBuffers from ByteBufferPool to read the request bytes at server.  When the size of the request is smaller than 1/6th size of a BB in the pool, we will not use that but read into an on demand created, proper sized on heap ByteBuffer.
16393
16394
16395 ---
16396
16397 * [HBASE-17046](https://issues.apache.org/jira/browse/HBASE-17046) | *Major* | **Add 1.1 doc to hbase.apache.org**
16398
16399 Adds a 1.1. item to our 'Documentation and API' tab. Gives access to 1.1 APIs, XRef, etc.
16400
16401
16402 ---
16403
16404 * [HBASE-16962](https://issues.apache.org/jira/browse/HBASE-16962) | *Major* | **Add readPoint to preCompactScannerOpen() and preFlushScannerOpen() API**
16405
16406 The following RegionObserver methods are deprecated
16407
16408 InternalScanner preFlushScannerOpen(final ObserverContext\<RegionCoprocessorEnvironment\> c,
16409     final Store store, final KeyValueScanner memstoreScanner, final InternalScanner s)
16410     throws IOException;
16411
16412 InternalScanner preCompactScannerOpen(final ObserverContext\<RegionCoprocessorEnvironment\> c,
16413     final Store store, List\<? extends KeyValueScanner\> scanners, final ScanType scanType,
16414     final long earliestPutTs, final InternalScanner s, CompactionRequest request)
16415
16416 Instead, use the following methods:
16417
16418 InternalScanner preFlushScannerOpen(final ObserverContext\<RegionCoprocessorEnvironment\> c,
16419     final Store store, final KeyValueScanner memstoreScanner, final InternalScanner s,
16420     final long readPoint) throws IOException;
16421
16422 InternalScanner preCompactScannerOpen(final ObserverContext\<RegionCoprocessorEnvironment\> c,
16423     final Store store, List\<? extends KeyValueScanner\> scanners, final ScanType scanType,
16424     final long earliestPutTs, final InternalScanner s, final CompactionRequest request,
16425     final long readPoint) throws IOException
16426
16427
16428 ---
16429
16430 * [HBASE-17017](https://issues.apache.org/jira/browse/HBASE-17017) | *Major* | **Remove the current per-region latency histogram metrics**
16431
16432 Removes per-region level (get size, get time, scan size and scan time histogram) metrics that was exposed before. Per-region histogram metrics with 1000+ regions causes millions of objects to be allocated on heap. The patch introduces getCount and scanCount as counters rather than histograms. Other per-region level metrics are kept as they are.
16433
16434
16435 ---
16436
16437 * [HBASE-16955](https://issues.apache.org/jira/browse/HBASE-16955) | *Major* | **Fixup precommit protoc check to do new distributed protos and pb 3.1.0 build**
16438
16439 Test that environment no longer has to have protoc (2.5 and 3.1) available. Needed small adjustment in yetus protoc build but otherwise all works.
16440
16441
16442 ---
16443
16444 * [HBASE-17050](https://issues.apache.org/jira/browse/HBASE-17050) | *Minor* | **Upgrade Apache CLI version from 1.2 to 1.3.1**
16445
16446 Upgrade Apache CLI version from 1.2 to 1.3.1.
16447
16448 These are few good/important changes included in this update:
16449 - HelpFormatter now prints command-line options in the same order as they
16450   have been added. Fixes CLI-212.
16451 - Standard help text now shows mandatory arguments also for the first
16452   option. Fixes CLI-186.
16453 - A new parser is available: DefaultParser. It combines the features of the
16454   GnuParser and the PosixParser. It also provides additional features like
16455   partial matching for the long options, and long options without separator
16456   (i.e like the JVM memory settings: -Xmx512m). This new parser deprecates
16457   the previous ones. Fixes CLI-161,CLI-167,CLI-181.
16458
16459 For full list of changes:
16460   https://commons.apache.org/proper/commons-cli/changes-report.html#a1.3
16461
16462
16463 ---
16464
16465 * [HBASE-15513](https://issues.apache.org/jira/browse/HBASE-15513) | *Major* | **hbase.hregion.memstore.chunkpool.maxsize is 0.0 by default**
16466
16467 MSLAB chunk pool is on by default in hbase-2.0.0.
16468
16469
16470 ---
16471
16472 * [HBASE-16972](https://issues.apache.org/jira/browse/HBASE-16972) | *Major* | **Log more details for Scan#next request when responseTooSlow**
16473
16474 **WARNING: No release note provided for this change.**
16475
16476
16477 ---
16478
16479 * [HBASE-17014](https://issues.apache.org/jira/browse/HBASE-17014) | *Minor* | **Add clearly marked starting and shutdown log messages for all services.**
16480
16481 Delimit START, STOP, and ABORT messages with '\*\*\*\*\*' so denote.
16482
16483
16484 ---
16485
16486 * [HBASE-16765](https://issues.apache.org/jira/browse/HBASE-16765) | *Critical* | **New SteppingRegionSplitPolicy, avoid too aggressive spread of regions for small tables.**
16487
16488 Introduces a new split policy: SteppingSplitPolicy
16489 This will use a simple step function to split a region at (by default) 2  xflushSize when no other region of the same table is seen on the region server, or max-file-size when one or more other regions of the same table is seen.
16490
16491 In HBase 2.0 this is going to be the default. In previous versions it can be configured.
16492
16493
16494 ---
16495
16496 * [HBASE-16608](https://issues.apache.org/jira/browse/HBASE-16608) | *Major* | **Introducing the ability to merge ImmutableSegments without copy-compaction or SQM usage**
16497
16498 The index-compation and data-compaction variants of CompactingMemStore are introduced. In both types the active (mutable) segment is periodically flushed-in-memory and is added as immutable segment in the compaction pipeline. The CompactingMemStore of index-compaction type is merging all immutable segments of the compacting pipeline into one. The merging of N segments is explained below. The CompactingMemStore of data-compaction type is compacting all immutable segments of the compacting pipeline into one. After the merge/compaction the old segments in the compacting pipeline are replaced with one new.
16499
16500 Before explaining the process of merging N old segments into new one, note that segment structure includes ordered index that allows traversing the cells data efficiently. The merge is copying the ordered indexes of the old segments into one ordered index of new segment. No data is copied, no cells are filtered. Alternatively, in the process of compacting N old segments into new one, both data and index are copied. The old cells are filtered, meaning upon compaction unused versions of the cells are not copied so the new segment has less data then all old ones.
16501
16502 This issue introduces only the merging ability and simplifies the user intervention for switching between types. The previous CompactingMemStore structure was added by HBASE-16420 and HBASE-16421. The future refinements of the policy or merging/compacting will come in HBASE-16417.
16503
16504 In order to create a table with CompactingMemStore as a MemStore one should use:
16505 create ‘\<tablename\>’, {NAME =\> ‘\<cfname\>’, IN\_MEMORY\_COMPACTION =\> true}
16506 IN\_MEMORY\_COMPACTION default is false, so table created as following will have the known DefaultMemStore as a MemStore.
16507 create ‘\<tablename\>’, {NAME =\> ‘\<cfname\>’}
16508
16509 The default type of CompactingMemStore is index-compaction. In order to change it to data-compaction one should add to the hbase-site.xml
16510 \<property\>
16511     \<name\>hbase.hregion.compacting.memstore.type\</name\>
16512     \<value\>data-compaction\</value\>
16513   \</property\>
16514
16515 in addition to creating the table as following
16516 create ‘\<tablename\>’, {NAME =\> ‘\<cfname\>’, IN\_MEMORY\_COMPACTION =\> true}
16517
16518
16519 ---
16520
16521 * [HBASE-16747](https://issues.apache.org/jira/browse/HBASE-16747) | *Major* | **Track memstore data size and heap overhead separately**
16522
16523 Marking it as incompatible change as there is a change in behavior for region flush decision. The default flush size of 128 MB per region was tracked against both actual data bytes size + overhead of these cells in memstore memory (Overhead because of Cell java objects and CSLM entry).  As part of this jira we will keep track of cell data size only in region level.  So 128 MB flush size means, 128 MB of cell data bytes (key+ value+..)
16524
16525 Globally we will track cell data size and heap overhead separately and will consider both for forced flushes. We will not allow over consume of heap memory by all memstore. This is as old case. Only tracking way is changed.
16526
16527
16528 ---
16529
16530 * [HBASE-16974](https://issues.apache.org/jira/browse/HBASE-16974) | *Minor* | **Update os-maven-plugin to 1.4.1.final+ for building shade file on RHEL/CentOS**
16531
16532 Upgrade os-maven-plugin mvn extension which figures the os we are running on from 1.4 to 1.5.
16533
16534
16535 ---
16536
16537 * [HBASE-16952](https://issues.apache.org/jira/browse/HBASE-16952) | *Major* | **Replace hadoop-maven-plugins with protobuf-maven-plugin for building protos**
16538
16539 Simplifies .proto manipulations. One step only now -- no need to keep pom.xml listing up to date with the protobuf protos directory content -- and no need to preinstall protoc; mvn does it all for you now.
16540
16541
16542 ---
16543
16544 * [HBASE-14551](https://issues.apache.org/jira/browse/HBASE-14551) | *Minor* | **Procedure v2 - Reimplement split**
16545
16546 Moved the Split Region logic to Master and most of split region coprocessor is in master now.  Need to change dependency such as Phoenix.
16547
16548
16549 ---
16550
16551 * [HBASE-15789](https://issues.apache.org/jira/browse/HBASE-15789) | *Major* | **PB related changes to work with offheap**
16552
16553 This issue adds a patch to our checked in internal, shaded protobuf, but it also adds a general means of apply patches to our version of protobuf. Patches found in the new src/main/patches directory are all applied as the last task when you run a build with the -Pcompile-protobuf profile under the hbase-protocol-shaded module. This commit also includes our first patch to protobuf; it adds ByteInput to mimic pb3.1's ByteOutput (src/main/patches/HBASE-15789\_V2.patch attached here).
16554
16555
16556 ---
16557
16558 * [HBASE-16930](https://issues.apache.org/jira/browse/HBASE-16930) | *Major* | **AssignmentManager#checkWals() function can recur infinitely**
16559
16560 Fixed potential infinite recursion in AssignmentManager.checkWals().
16561
16562
16563 ---
16564
16565 * [HBASE-16463](https://issues.apache.org/jira/browse/HBASE-16463) | *Major* | **Improve transparent table/CF encryption with Commons Crypto**
16566
16567 Improve transparent table/CF encryption with Commons Crypto. The change introduces a new optional CryptoCipherProvider (CommonsCryptoAES) for transparent table/CF encryption. And the encryption performance would be accelerated by hardware in modern CPU (AES-NI). This feature could be enabled by updating the configuration "hbase.crypto.cipherprovider" to "org.apache.hadoop.hbase.io.crypto.CryptoCipherProvider" in hbase-site.xml. For detailed information about transparent table/CF encryption including configuration examples see the Security section of the HBase manual.
16568
16569
16570 ---
16571
16572 * [HBASE-16414](https://issues.apache.org/jira/browse/HBASE-16414) | *Major* | **Improve performance for RPC encryption with Apache Common Crypto**
16573
16574 With the security RPC and encryption enabled, introduce Apache Commons Crypto to do the encryption/decryption which supports both supports both JCE Cipher and OpenSSL Cipher. Adds new configs "hbase.rpc.crypto.encryption.aes.enabled" which defaults to false, and "hbase.rpc.crypto.encryption.aes.cipher.class" which defaults to "org.apache.commons.crypto.cipher.JceCipher" to support JCE Cipher, it also can be set as "org.apache.hadoop.crypto.OpensslCipher" to support Openssl Cipher.
16575
16576
16577 ---
16578
16579 * [HBASE-16721](https://issues.apache.org/jira/browse/HBASE-16721) | *Critical* | **Concurrency issue in WAL unflushed seqId tracking**
16580
16581 Fixed a bug in sequenceId tracking for the WALs that caused WAL files to accumulate without being deleted due to a rare race condition.
16582
16583
16584 ---
16585
16586 * [HBASE-16834](https://issues.apache.org/jira/browse/HBASE-16834) | *Major* | **Add AsyncConnection support for ConnectionFactory**
16587
16588 Add createAsyncConnection method to ConnectionFactory for creating AsyncConnection. The default implementation is org.apache.hadoop.hbase.client.AsyncConnectionImpl. You can use 'hbase.client.async.connection.impl' to plug in your own AsyncConnection implementation.
16589
16590
16591 ---
16592
16593 * [HBASE-16729](https://issues.apache.org/jira/browse/HBASE-16729) | *Trivial* | **Define the behavior of (default) empty FilterList**
16594
16595 Empty filter list will behave as when there is no filter added. This change is a behavioral change for those who rely on Empty filter list.
16596
16597
16598 ---
16599
16600 * [HBASE-16799](https://issues.apache.org/jira/browse/HBASE-16799) | *Major* | **CP exposed Store should not expose unwanted APIs**
16601
16602 Below APIs from CP exposed Store interface are removed
16603 upsert(Iterable\<Cell\> cells, long readpoint)
16604 add(Cell cell)
16605 add(Iterable\<Cell\> cells)
16606 replayCompactionMarker(CompactionDescriptor compaction, boolean pickCompactionFiles,  boolean removeFiles)
16607 assertBulkLoadHFileOk(Path srcPath)
16608 bulkLoadHFile(String srcPathStr, long sequenceId)
16609 bulkLoadHFile(StoreFileInfo fileInfo)
16610
16611
16612 ---
16613
16614 * [HBASE-15921](https://issues.apache.org/jira/browse/HBASE-15921) | *Major* | **Add first AsyncTable impl and create TableImpl based on it**
16615
16616 Add AsyncConnection, AsyncTable and AsyncTableRegionLocator. Now the AsyncTable only support get, put and delete. And the implementation of AsyncTableRegionLocator is synchronous actually.
16617
16618
16619 ---
16620
16621 * [HBASE-16664](https://issues.apache.org/jira/browse/HBASE-16664) | *Major* | **Timeout logic in AsyncProcess is broken**
16622
16623 This issue fix three bugs:
16624 1.  rpcTimeout configuration not work for one rpc call in AP
16625 2.  operationTimeout configuration not work for multi-request (batch, put) in AP
16626 3.  setRpcTimeout and setOperationTimeout in HTable is not worked for AP and BufferedMutator.
16627
16628
16629 ---
16630
16631 * [HBASE-16661](https://issues.apache.org/jira/browse/HBASE-16661) | *Minor* | **Add last major compaction age to per-region metrics**
16632
16633 This adds a new per-region metric named "lastMajorCompactionAge" for tracking time since the last major compaction ran on a given region.  If a major compaction has never run, the age will be equal to the current timestamp.
16634
16635
16636 ---
16637
16638 * [HBASE-16117](https://issues.apache.org/jira/browse/HBASE-16117) | *Major* | **Fix Connection leak in mapred.TableOutputFormat**
16639
16640 (This change will be irrelevant after HBASE-16774 lands).
16641 There is a subtle change with error handling when a connection is not able to connect to ZK.  Attempts to create a connection when ZK is not up will now fail immediately instead of silently creating and then failing on a subsequent HBaseAdmin call.
16642
16643
16644 ---
16645
16646 * [HBASE-15984](https://issues.apache.org/jira/browse/HBASE-15984) | *Critical* | **Given failure to parse a given WAL that was closed cleanly, replay the WAL.**
16647
16648 In some particular deployments, the Replication code believes it has
16649 reached EOF for a WAL prior to successfully parsing all bytes known to
16650 exist in a cleanly closed file.
16651
16652 If an EOF is detected due to parsing or other errors while there are still unparsed bytes before the end-of-file trailer, we now reset the WAL to the very beginning and attempt a clean read-through. Because we will retry these failures indefinitely, two additional changes are made to help with diagnostics:
16653
16654 \* On each retry attempt, a log message like the below will be emitted at the WARN level:
16655
16656       Processing end of WAL file '{}'. At position {}, which is too far away
16657       from reported file length {}. Restarting WAL reading (see HBASE-15983
16658       for details).
16659
16660 \*  additional metrics measure the use of this recovery mechanism. they are described in the reference guide.
16661
16662
16663 ---
16664
16665 * [HBASE-16753](https://issues.apache.org/jira/browse/HBASE-16753) | *Minor* | **There is a mismatch between suggested Java version in hbase-env.sh**
16666
16667 Updates the comments and default values in a few scripts and docs to reflect our Java 1.8+ requirement.
16668
16669
16670 ---
16671
16672 * [HBASE-16567](https://issues.apache.org/jira/browse/HBASE-16567) | *Critical* | **Upgrade to protobuf-3.1.x**
16673
16674 Core is now up on protobuf 3.1.0 (Coprocessor Endpoints and REST are still on protobuf 2.5.0).
16675
16676
16677 ---
16678
16679 * [HBASE-15638](https://issues.apache.org/jira/browse/HBASE-15638) | *Critical* | **Shade protobuf**
16680
16681 Shade/relocate and include the protobuf we use internally. See protobuf chapter in the refguide for more on how we protobuf in hbase-.2.0.0 and going forward.
16682
16683 See https://docs.google.com/document/d/1H4NgLXQ9Y9KejwobddCqaVMEDCGbyDcXtdF5iAfDIEk/edit# for how we arrived at this approach.
16684
16685 See http://mail-archives.apache.org/mod\_mbox/hbase-dev/201610.mbox/%3C07850EDD-7230-431B-9AB0-C5C91B105EEC%40gmail.com%3E for discussion around merging this change and of how we might revert if an alternative to this awkward patch presents itself; e.g. an hadoop with CLASSPATH isolation (and means of dealing with Sparks use of protobuf 2.5.0, etc.)
16686
16687
16688 ---
16689
16690 * [HBASE-16264](https://issues.apache.org/jira/browse/HBASE-16264) | *Critical* | **Figure how to deal with endpoints and shaded pb**
16691
16692 Shade/relocate the protobuf hbase uses internally. All core now refers to new module added in this patch, hbase-protocol-shaded. Coprocessor Endpoints carry-on with references to the original hbase-protocol module. See new chapter in book on protobufs on how-to going forward.
16693
16694
16695 ---
16696
16697 * [HBASE-16672](https://issues.apache.org/jira/browse/HBASE-16672) | *Major* | **Add option for bulk load to always copy hfile(s) instead of renaming**
16698
16699 This issue adds a config, always.copy.files, to LoadIncrementalHFiles.
16700 When set to true, source hfiles would be copied. Meaning source hfiles would be kept after bulk load is done.
16701 Default value is false.
16702
16703
16704 ---
16705
16706 * [HBASE-16660](https://issues.apache.org/jira/browse/HBASE-16660) | *Critical* | **ArrayIndexOutOfBounds during the majorCompactionCheck in DateTieredCompaction**
16707
16708 "Please do not use DateTieredCompaction with Major Compaction unless you have a version with this. Otherwise your cluster will not compact any store files and you can end up running out of file descriptors." @churro morales
16709
16710
16711 ---
16712
16713 * [HBASE-16257](https://issues.apache.org/jira/browse/HBASE-16257) | *Blocker* | **Move staging dir to be under hbase root dir**
16714
16715 The HBase property 'hbase.bulkload.staging.dir' is deprecated and is ignored from HBase 2.0.  It will defaults to hbase.rootdir/staging automatically with the correct permissions.
16716
16717
16718 ---
16719
16720 * [HBASE-16650](https://issues.apache.org/jira/browse/HBASE-16650) | *Major* | **Wrong usage of BlockCache eviction stat for heap memory tuning**
16721
16722 Changed tracking of evictedBlocks count NOT to include evictions of blocks for a removed HFile. HFiles gets removed after compaction
16723
16724
16725 ---
16726
16727 * [HBASE-16294](https://issues.apache.org/jira/browse/HBASE-16294) | *Minor* | **hbck reporting "No HDFS region dir found" for replicas**
16728
16729 Fixed warning error message displayed for region directory not found for non-default/ non-primary replicas in hbck
16730
16731
16732 ---
16733
16734 * [HBASE-16540](https://issues.apache.org/jira/browse/HBASE-16540) | *Major* | **Scan should do additional validation on start and stop row**
16735
16736 Scan#setStartRow() and Scan#setStopRow() now validate the argument passed for each row key.  If the length of the byte[] passed exceeds Short.MAX\_VALUE, an IllegalArgumentException will be thrown.
16737
16738
16739 ---
16740
16741 * [HBASE-7612](https://issues.apache.org/jira/browse/HBASE-7612) | *Trivial* | **[JDK8] Replace use of high-scale-lib counters with intrinsic facilities**
16742
16743 org.apache.hadoop.hbase.util.Counter is deprecated now and will be removed in 3.0. Use LongAdder instead.
16744
16745
16746 ---
16747
16748 * [HBASE-16447](https://issues.apache.org/jira/browse/HBASE-16447) | *Critical* | **Replication by namespaces config in peer**
16749
16750 Support replication by namespaces config in peer.
16751 1. Set a namespace in peer config means that all tables in this namespace will be replicated.
16752 2. If the namespaces config is null, then the table-cfs config decide which table's edit can be replicated. If the table-cfs config is null, then the namespaces config decide which table's edit can be replicated.
16753 3. If you already have set a namespace in the peer config, then you can't set any table of this namespace to the peer config. If you already have set a table in the peer config, then you can't set this table's namespace to the peer config.
16754
16755
16756 ---
16757
16758 * [HBASE-16598](https://issues.apache.org/jira/browse/HBASE-16598) | *Major* | **Enable zookeeper useMulti always and clean up in HBase code**
16759
16760 Deprecate the configuration property 'hbase.zookeeper.useMulti'.
16761 useMulti will always be enabled. ZooKeeper 3.4.x and newer is required.
16762
16763 Internal:
16764
16765 The ZKUtil#multiOrSequential(ZooKeeperWatcher zkw, List\<ZKUtilOp\> ops, boolean runSequentialOnMultiFailure) will not check 'hbase.zookeeper.useMulti' anymore, and will always use multi.
16766 It can still fall back to sequential operations if:
16767
16768 RunSequentialOnMultiFailure is true
16769 On calling multi, we get a ZooKeeper exception that can be handled by a sequential call.
16770
16771
16772 ---
16773
16774 * [HBASE-16388](https://issues.apache.org/jira/browse/HBASE-16388) | *Major* | **Prevent client threads being blocked by only one slow region server**
16775
16776 Add a new configuration, hbase.client.perserver.requests.threshold, to limit the max number of concurrent request to one region server. If the user still create new request after reaching the limit, client will throw ServerTooBusyException and do not send the request to the server. This is a client side feature and can prevent client's threads being blocked by one slow region server resulting in the availability of client is much lower than the availability of region servers.
16777
16778 For completeness, here extract on new config from hbase-default.xml:
16779
16780 Property: hbase.client.perserver.requests.threshold
16781 Default: 2147483647
16782 Description: The max number of concurrent pending requests for one server in all client threads (process level). Exceeding requests will be thrown ServerTooBusyException immediately to prevent user's threads being occupied and blocked by only one slow region server. If you use a fix number of threads to access HBase in a synchronous way, set this to a suitable value which is  related to the number of threads will help you. See https://issues.apache.org/jira/browse/HBASE-16388 for details.
16783
16784
16785 ---
16786
16787 * [HBASE-15297](https://issues.apache.org/jira/browse/HBASE-15297) | *Minor* | **error message is wrong when a wrong namspace is specified in grant in hbase shell**
16788
16789 The security admin instance available within the HBase shell now returns "false" from the namespace\_exists? method for non-existent namespaces rather than raising a wrapped NamespaceNotFoundException.
16790
16791 As a side effect, when the "grant" and "revoke" commands in the HBase shell are invoked with a non-existent namespace the resulting error message now properly refers to said namespace rather than to the user.
16792
16793
16794 ---
16795
16796 * [HBASE-16086](https://issues.apache.org/jira/browse/HBASE-16086) | *Major* | **TableCfWALEntryFilter and ScopeWALEntryFilter should not redundantly iterate over cells.**
16797
16798 push to branch-1.3+
16799
16800
16801 ---
16802
16803 * [HBASE-16340](https://issues.apache.org/jira/browse/HBASE-16340) | *Critical* | **ensure no Xerces jars included**
16804
16805 HBase no longer includes Xerces implementation jars that were previously included via transitive dependencies. Downstream users relying on HBase for these artifacts will need to update their dependencies.
16806
16807
16808 ---
16809
16810 * [HBASE-16213](https://issues.apache.org/jira/browse/HBASE-16213) | *Major* | **A new HFileBlock structure for fast random get**
16811
16812 HBASE-16213 introduced a new DataBlockEncoding in name of ROW\_INDEX\_V1, which could improve random read (get) performance especially when the average record size (key-value size per row) is small. To use this feature, please set DATA\_BLOCK\_ENCODING to ROW\_INDEX\_V1 for CF of newly created table, or change existing CF with below command:
16813 alter 'table\_name',{NAME =\> 'cf', DATA\_BLOCK\_ENCODING =\> 'ROW\_INDEX\_V1'}.
16814
16815 Please note that if we turn this DBE on, HFile block will be bigger than NONE encoding because it adds some meta infos for binary search:
16816 /\*\*
16817  \* Store cells following every row's start offset, so we can binary search to a row's cells.
16818  \*
16819  \* Format:
16820  \* flat cells
16821  \* integer: number of rows
16822  \* integer: row0's offset
16823  \* integer: row1's offset
16824  \* ....
16825  \* integer: dataSize
16826  \*
16827 \*/
16828
16829 Seek in row when random reading is one of the main consumers of CPU. This helps. See slide #7 here https://www.slideshare.net/HBaseCon/lift-the-ceiling-of-hbase-throughputs?qid=597ee2fa-8125-4faa-bb3b-2bf1ba9ccafb&v=&b=&from\_search=6
16830
16831
16832 ---
16833
16834 * [HBASE-16409](https://issues.apache.org/jira/browse/HBASE-16409) | *Minor* | **Row key for bad row should be properly delimited in VerifyReplication**
16835
16836 --delimiter= option is added to verifyrep.
16837 The delimiter would wrap bad rows in log output.
16838
16839
16840 ---
16841
16842 * [HBASE-14921](https://issues.apache.org/jira/browse/HBASE-14921) | *Major* | **Inmemory Compaction Optimizations; Segment Structure**
16843
16844 A long, working issue that discussed Segment formats introducing CellArrayMap (delivered as the patch attached to this issue) and CellChunkMap (to be delivered later in HBASE-16421 but see patch v02 for an embryonic form named CellBlockSerialized); when to copy Segment data (and when not too); and then what to include at flush time (the suffix Segment or all Segments). Designs that evolved as discussion went on are attached. Outstanding issues turned up here, not including a CellChunkMap implementation, are listed below but are to be addressed in follow-ons (See HBASE-16417):
16845
16846 1. The flattening without compaction is causing many small segments in pipeline, and they are not flushed all together.
16847 2. The issue of compaction prediction cost.
16848
16849
16850 ---
16851
16852 * [HBASE-16450](https://issues.apache.org/jira/browse/HBASE-16450) | *Major* | **Shell tool to dump replication queues**
16853
16854 New tool to dump existing replication peers, configurations and queues when using HBase Replication. The tool provides two flags:
16855
16856  --distributed  This flag will poll each RS for information about the replication queues being processed on this RS.
16857 By default this is not enabled and the information about the replication queues and configuration will be obtained from ZooKeeper.
16858  --hdfs   When --distributed is used, this flag will attempt to calculate the total size of the WAL files used by the replication queues. Since its possible that multiple peers can be configured this value can be overestimated.
16859
16860
16861 ---
16862
16863 * [HBASE-16422](https://issues.apache.org/jira/browse/HBASE-16422) | *Major* | **Tighten our guarantees on compatibility across patch versions**
16864
16865 Adds below change to our compat guarantees:
16866
16867 {code}
16868 -\* Example: A user using a newly deprecated api does not need to modify application code with hbase api calls until the next major version.
16869  10 +\* New APIs introduced in a patch version will only be added in a source compatible way footnote:[See 'Source Compatibility' https://blogs.oracle.com/darcy/entry/kinds\_of\_compatibility]: i.e.     code that implements public APIs will continue to compile.
16870 {code}
16871
16872
16873 ---
16874
16875 * [HBASE-7621](https://issues.apache.org/jira/browse/HBASE-7621) | *Major* | **REST client (RemoteHTable) doesn't support binary row keys**
16876
16877 RemoteHTable now supports binary row keys with any character or byte by properly encoding request URLs. This is a both a behavioral change from earlier versions and an important fix for protocol correctness.
16878
16879
16880 ---
16881
16882 * [HBASE-12721](https://issues.apache.org/jira/browse/HBASE-12721) | *Major* | **Create Docker container cluster infrastructure to enable better testing**
16883
16884 Downstream users wishing to test HBase in a "distributed" fashion (multiple "nodes" running as separate containers on the same host) can now do so in an automated fashion while leveraging Docker for process isolation via the clusterdock project.
16885
16886 For details see the README.md in the dev-support/apache\_hbase\_topology folder.
16887
16888
16889 ---
16890
16891 * [HBASE-16267](https://issues.apache.org/jira/browse/HBASE-16267) | *Critical* | **Remove commons-httpclient dependency from hbase-rest module**
16892
16893 This issue upgrades httpclient to 4.5.2 and httpcore to 4.4.4 which are the versions used by hadoop-2.
16894 This is to handle the following CVE's.
16895
16896 https://web.nvd.nist.gov/view/vuln/detail?vulnId=CVE-2015-5262 : http/conn/ssl/SSLConnectionSocketFactory.java in Apache HttpComponents HttpClient before 4.3.6 ignores the http.socket.timeout configuration setting during an SSL handshake, which allows remote attackers to cause a denial of service (HTTPS call hang) via unspecified vectors.
16897
16898 https://web.nvd.nist.gov/view/vuln/detail?vulnId=CVE-2012-6153
16899 https://web.nvd.nist.gov/view/vuln/detail?vulnId=CVE-2012-5783
16900 Apache Commons HttpClient 3.x, as used in Amazon Flexible Payments Service (FPS) merchant Java SDK and other products, does not verify that the server hostname matches a domain name in the subject's Common Name (CN) or subjectAltName field of the X.509 certificate, which allows man-in-the-middle attackers to spoof SSL servers via an arbitrary valid certificate.
16901
16902 Downstream users who are exposed to commons-httpclient via the HBase classpath will have to similarly update their dependency.
16903
16904
16905 ---
16906
16907 * [HBASE-16308](https://issues.apache.org/jira/browse/HBASE-16308) | *Major* | **Contain protobuf references**
16908
16909 Undo protobuf references through the codebase so protobuf references are contained rather than spread about the codebase. For example, moved protobuff-ing up into the various Callables rather than repeat on each method invocation cleaning up boilerplate around rpc calls. Having a few protobuf reference locations only simplifies the parent issue shading project.
16910
16911
16912 ---
16913
16914 * [HBASE-16321](https://issues.apache.org/jira/browse/HBASE-16321) | *Blocker* | **Ensure findbugs jsr305 jar isn't present**
16915
16916 HBase now ensures the jsr305 implementation from the findbugs project is not included in its binary artifacts or the compile / runtime dependencies of its user facing modules. Downstream users that rely on this jar will need to update their dependencies.
16917
16918
16919 ---
16920
16921 * [HBASE-8386](https://issues.apache.org/jira/browse/HBASE-8386) | *Major* | **deprecate TableMapReduce.addDependencyJars(Configuration, class\<?\> ...)**
16922
16923 The MapReduce helper function \`TableMapReduce.addDependencyJars(Configuration, class\<?\> ...)\` has been deprecated since it is easy to use incorrectly. Most users should rely on addDependencyJars(Job) instead.
16924
16925
16926 ---
16927
16928 * [HBASE-16287](https://issues.apache.org/jira/browse/HBASE-16287) | *Major* | **LruBlockCache size should not exceed acceptableSize too many**
16929
16930 In order to avoid blockcache size exceed acceptable size too much, we add one configuration "hbase.lru.blockcache.hard.capacity.limit.factor" to decide whether the block could be put into LruBlockCache or not.  This factor defaults to 1.2
16931 If blockcache size \>= factor\*acceptableSize, we will reject the block into cache.
16932
16933
16934 ---
16935
16936 * [HBASE-16355](https://issues.apache.org/jira/browse/HBASE-16355) | *Major* | **hbase-client dependency on hbase-common test-jar should be test scope**
16937
16938 The HBase client artifact previously incorrectly included the hbase-common test jar as a runtime dependency. With this change, that dependency has been moved to test scope. Downstream users are not expected to be impacted, unless they relied on the transitive dependency for these HBase internal test classes.
16939
16940
16941 ---
16942
16943 * [HBASE-16317](https://issues.apache.org/jira/browse/HBASE-16317) | *Blocker* | **revert all ESAPI changes**
16944
16945 This issue reverts fixes designed to prevent malicious content from rendering in HBase's UIs. Specifically, these changes shipped in 1.1.4+ and 1.2.0+. They were removed due to licensing issues discovered in the dependencies they introduced. Their implementation and those dependencies have been removed from HBase! Removal of these dependencies is against the strict definition of our version compatibility guidelines. However, inclusion of non-Apache approved licenses cannot be tolerated. Implementation of these fixes using an Apache-appropriate means is tracked in HBASE-16328.
16946
16947
16948 ---
16949
16950 * [HBASE-16288](https://issues.apache.org/jira/browse/HBASE-16288) | *Critical* | **HFile intermediate block level indexes might recurse forever creating multi TB files**
16951
16952 A new hfile configuration "hfile.index.block.min.entries" which defaults to 16 determines how many entries the hfile index block can have at least. The configuration which determines how large the index block can be at max (hfile.index.block.max.size) is ignored as long as we have fewer than hfile.index.block.min.entries entries. This ensures that multi-level index does not build up with too many levels.
16953
16954
16955 ---
16956
16957 * [HBASE-16186](https://issues.apache.org/jira/browse/HBASE-16186) | *Major* | **Fix AssignmentManager MBean name**
16958
16959 The AssignmentManager MBean was named AssignmentManger (note misspelling). This patch fixed the misspelling.
16960
16961
16962 ---
16963
16964 * [HBASE-16289](https://issues.apache.org/jira/browse/HBASE-16289) | *Critical* | **AsyncProcess stuck messages need to print region/server**
16965
16966 Adds logging of region and server. Helpful debugging. Logging now looks like this:
16967 {code}
16968 2016-06-23 17:07:18,759 INFO  [Thread-1] client.AsyncProcess$AsyncRequestFutureImpl(1601): #1, waiting for 1  actions to finish on table: DUMMY\_TABLE
16969 2016-06-23 17:07:18,759 INFO  [Thread-1] client.AsyncProcess(1720): Left over 1 task(s) are processed on server(s): [s1:1,1,1]
16970 2016-06-23 17:07:18,759 INFO  [Thread-1] client.AsyncProcess(1728): Regions against which left over task(s) are processed: [DUMMY\_TABLE,DUMMY\_BYTES\_1,1.3fd12ea80b4df621fb15497ba75f7368.,DUMMY\_TABLE,DUMMY\_BYTES\_2,2.924207e242e313d2e5491c625e0a296e.]
16971 {code}
16972
16973
16974 ---
16975
16976 * [HBASE-14743](https://issues.apache.org/jira/browse/HBASE-14743) | *Minor* | **Add metrics around HeapMemoryManager**
16977
16978 A memory metrics reveals situations happened in both MemStores and BlockCache in RegionServer. Through this metrics, users/operators can know
16979 1). Current size of MemStores and BlockCache in bytes.
16980 2). Occurrence for Memstore minor and major flush. (named unblocked flush and blocked flush respectively, shown in histogram)
16981 3). Dynamic changes in size between MemStores and BlockCache. (with Increase/Decrease as prefix, shown in histogram). And a counter for no changes, named DoNothingCounter.
16982 4). Occurrence for memory usage alarm (used more than 95% by default) in RegionServer. (named AboveHeapOccupancyLowWatermarkCounter)
16983
16984
16985 ---
16986
16987 * [HBASE-13701](https://issues.apache.org/jira/browse/HBASE-13701) | *Major* | **Consolidate SecureBulkLoadEndpoint into HBase core as default for bulk load**
16988
16989 SecureBulkLoadEndpoint  has been integrated into HBase core as default bulk load mechanism. It is no longer needed to install it as a coprocessor endpoint.
16990 The new server is backward compatible, accommodating non-secure old client and secure old client requesting SecureBulkLoadEndpoint service.
16991 SecureBulkLoadEndpoint is deprecated. The backward compatibility support may be removed in future releases.
16992
16993
16994 ---
16995
16996 * [HBASE-16244](https://issues.apache.org/jira/browse/HBASE-16244) | *Major* | **LocalHBaseCluster start timeout should be configurable**
16997
16998 When LocalHBaseCluster is started from the command line the Master would give up after 30 seconds due to a hardcoded timeout meant for unit tests. This change allows the timeout to be configured via hbase-site as well as sets it to 5 minutes when LocalHBaseCluster is started from the command line.
16999
17000
17001 ---
17002
17003 * [HBASE-16052](https://issues.apache.org/jira/browse/HBASE-16052) | *Major* | **Improve HBaseFsck Scalability**
17004
17005 HBASE-16052 improves the performance and scalability of HBaseFsck, especially for large clusters with a small number of large tables.
17006
17007 Searching for lingering reference files is now a multi-threaded operation.  Loading HDFS region directory information is now multi-threaded at the region-level instead of the table-level to maximize concurrency.  A performance bug in HBaseFsck that resulted in redundant I/O and RPCs was fixed by introducing a FileStatusFilter that filters FileStatus objects directly.
17008
17009
17010 ---
17011
17012 * [HBASE-16144](https://issues.apache.org/jira/browse/HBASE-16144) | *Major* | **Replication queue's lock will live forever if RS acquiring the lock has died prematurely**
17013
17014 If zk based replication queue is used and useMulti is false, we will schedule a chore to clean up the orphan replication queue lock on zk.
17015
17016
17017 ---
17018
17019 * [HBASE-3727](https://issues.apache.org/jira/browse/HBASE-3727) | *Minor* | **MultiHFileOutputFormat**
17020
17021 MultiHFileOutputFormat support output of HFiles from multiple tables. It will output directories and hfiles as follow,
17022      --table1
17023        --family1
17024        --family2
17025          --Hfiles
17026      --table2
17027        --family3
17028          --hfiles
17029        --family4
17030
17031 family directory and its hfiles match the output of HFileOutputFormat2
17032
17033
17034 ---
17035
17036 * [HBASE-16231](https://issues.apache.org/jira/browse/HBASE-16231) | *Major* | **Integration tests should support client keytab login for secure clusters**
17037
17038 Prior to this change, the integration test clients (IntegrationTest\*) relied on the Kerberos credential cache for authentication against secured clusters.  This could lead to the tests failing due to authentication failures when the tickets in the credential cache expired.  With this change, the integration test clients will make use of the configuration properties for "hbase.client.keytab.file" and "hbase.client.kerberos.principal", when available.  This will perform a login from the configured keytab file and automatically refresh the credentials in the background for the process lifetime.
17039
17040
17041 ---
17042
17043 * [HBASE-13823](https://issues.apache.org/jira/browse/HBASE-13823) | *Major* | **Procedure V2: unnecessaery operations on AssignmentManager#recoverTableInDisablingState() and recoverTableInEnablingState()**
17044
17045 For cluster upgraded from 1.0.x or older releases, master startup would not continue the in-progress enable/disable table process.  If orphaned znode with ENABLING/DISABLING state exists in the cluster, run hbck or manually fix the issue.
17046
17047 For new cluster or cluster upgraded from 1.1.x and newer release, there is no issue to worry about.
17048
17049
17050 ---
17051
17052 * [HBASE-16095](https://issues.apache.org/jira/browse/HBASE-16095) | *Major* | **Add priority to TableDescriptor and priority region open thread pool**
17053
17054 Adds a PRIORITY property to the HTableDescriptor. PRIORITY should be in the same range as the RpcScheduler defines it (HConstants.XXX\_QOS).
17055
17056 Table priorities are only used for region opening for now. There can be other uses later (like RpcScheduling).
17057
17058 Regions of high priority tables (priority \>= than HIGH\_QOS) are opened from a different thread pool than the regular region open thread pool. However, table priorities are not used as a global order for region assigning or opening.
17059
17060
17061 ---
17062
17063 * [HBASE-16081](https://issues.apache.org/jira/browse/HBASE-16081) | *Blocker* | **Replication remove\_peer gets stuck and blocks WAL rolling**
17064
17065 When a replication endpoint is sent a shutdown request by the replication source in situations like removing a peer, we now try to gracefully shut it down by draining the items already sent for replication to the peer cluster. If the drain does not complete in the specified time (hbase.rpc.timeout \* replication.source.maxterminationmultiplier), the regionserver is aborted to avoid blocking the WAL roll.
17066
17067
17068 ---
17069
17070 * [HBASE-16087](https://issues.apache.org/jira/browse/HBASE-16087) | *Major* | **Replication shouldn't start on a master if if only hosts system tables**
17071
17072 Masters will no longer start any replication threads if they are hosting only system tables.
17073
17074 In order to change this add something to the config for tables on master that doesn't start with "hbase:" ( Replicating system tables is something that's currently unsupported and can open up security holes, so do this at your own peril)
17075
17076
17077 ---
17078
17079 * [HBASE-14548](https://issues.apache.org/jira/browse/HBASE-14548) | *Major* | **Expand how table coprocessor jar and dependency path can be specified**
17080
17081 Allow a directory containing the jars or some wildcards to be specified, such as: hdfs://namenode:port/user/hadoop-user/
17082 or
17083 hdfs://namenode:port/user/hadoop-user/\*.jar
17084
17085 Please note that if a directory is specified, all jar files(.jar) directly in the directory are added, but it does not search files in the subtree rooted in the directory.
17086 Do not contain any wildcard if you would like to specify a directory.
17087
17088
17089 ---
17090
17091 * [HBASE-15925](https://issues.apache.org/jira/browse/HBASE-15925) | *Blocker* | **compat-module maven variable not evaluated**
17092
17093 Downstream users of HBase dependencies that do not properly activate Maven profiles should now see a correct transitive dependency on the default hadoop-compatibility-module.
17094
17095
17096 ---
17097
17098 * [HBASE-16140](https://issues.apache.org/jira/browse/HBASE-16140) | *Major* | **bump owasp.esapi from 2.1.0 to 2.1.0.1**
17099
17100 The dependency owasp.esapi had a compatible change from 2.1.0 to 2.1.0.1. As a result, the transitive dependency commons-fileupload had a change from 1.2 to 1.3.1, which has some minor class changes that impact binary compatibility. Interested users should check the release notes of commons-fileupload to see if any of the incompatible changes impact them.
17101
17102 http://commons.apache.org/proper/commons-fileupload/changes-report.html
17103
17104
17105 ---
17106
17107 * [HBASE-16147](https://issues.apache.org/jira/browse/HBASE-16147) | *Major* | **Shell command for getting compaction state**
17108
17109 compaction\_state shell command would return compaction state in String form:
17110 NONE, MINOR, MAJOR, MAJOR\_AND\_MINOR
17111
17112
17113 ---
17114
17115 * [HBASE-14878](https://issues.apache.org/jira/browse/HBASE-14878) | *Major* | **maven archetype: client application with shaded jars**
17116
17117 Adds new hbase-shaded-client archetype; also corrects an omission found in hbase-archetypes/README.md in the section headed "How to add a new archetype".
17118
17119
17120 ---
17121
17122 * [HBASE-14877](https://issues.apache.org/jira/browse/HBASE-14877) | *Major* | **maven archetype: client application**
17123
17124 This patch introduces a new infrastructure for creation and maintenance of Maven archetypes in the context of the hbase project, and it also introduces the first archetype, which end-users may utilize to generate a simple hbase-client dependent project.
17125
17126 NOTE that this patch should introduce two new WARNINGs ("Using platform encoding ... to copy filtered resources") into the hbase install process. These warnings are hard-wired into the maven-archetype-plugin:create-from-project goal. See hbase/hbase-archetypes/README.md, footnote [6] for details.
17127
17128 After applying the patch, see hbase/hbase-archetypes/README.md for details regarding the new archetype infrastructure introduced by this patch. (The README text is also conveniently positioned at the top of the patch itself.)
17129
17130 Here is the opening paragraph of the README.md file:
17131 =================
17132 The hbase-archetypes subproject of hbase provides an infrastructure for creation and maintenance of Maven archetypes pertinent to HBase. Upon deployment to the archetype catalog of the central Maven repository, these archetypes may be used by end-user developers to autogenerate completely configured Maven projects (including fully-functioning sample code) through invocation of the archetype:generate goal of the maven-archetype-plugin.
17133 ========
17134 The README.md file also contains several paragraphs under the heading, "Notes for contributors and committers to the HBase project", which explains the layout of 'hbase-archetypes', and how archetypes are created and installed into the local Maven repository, ready for deployment to the central Maven repository. It also outlines how new archetypes may be developed and added to the collection in the future.
17135
17136
17137 ---
17138
17139 * [HBASE-15977](https://issues.apache.org/jira/browse/HBASE-15977) | *Major* | **Failed variable substitution on home page**
17140
17141 Done. Thanks, Dima, Andrew!
17142
17143
17144 ---
17145
17146 * [HBASE-5291](https://issues.apache.org/jira/browse/HBASE-5291) | *Major* | **Add Kerberos HTTP SPNEGO authentication support to HBase web consoles**
17147
17148 HBase Web UIs can be secured from general public access using SPNEGO to require a valid Kerberos ticket.
17149
17150 Setting 'hbase.security.authentication.ui' to 'kerberos' in hbase-site.xml is a global switch to have all Web UIs allow only authenticated clients via Kerberos. 'hbase.security.authentication.spnego.kerberos.principal' and 'hbase.security.authentication.spnego.kerberos.keytab' are two other required properties in hbase-site.xml, the Kerberos principal and keytab to use for the server to use to log in. The primary in the Kerberos principal must be 'HTTP' as required by the SPNEGO mechanism, e.g. 'HTTP/host.domain.com@DOMAIN.COM'.
17151
17152
17153 ---
17154
17155 * [HBASE-15950](https://issues.apache.org/jira/browse/HBASE-15950) | *Major* | **Fix memstore size estimates to be more tighter**
17156
17157 The estimates of heap usage by the memstore objects (KeyValue, object and array header sizes, etc) have been made more accurate for heap sizes up to 32G (using CompressedOops), resulting in them dropping by 10-50% in practice. This also results in less number of flushes and compactions due to "fatter" flushes. YMMV. As a result, the actual heap usage of the memstore before being flushed may increase by up to 100%. If configured memory limits for the region server had been tuned based on observed usage, this change could result in worse GC behavior or even OutOfMemory errors. Set the environment property (not hbase-site.xml) "hbase.memorylayout.use.unsafe" to false to disable.
17158
17159
17160 ---
17161
17162 * [HBASE-16023](https://issues.apache.org/jira/browse/HBASE-16023) | *Major* | **Fastpath for the FIFO rpcscheduler**
17163
17164 Adds a 'fastpath' when using the default FIFO rpc scheduler ('fifo'). Does direct handoff from Reader thread to Handler if there is one ready and willing. Will shine best when high random read workload (YCSB workloadc for instance)
17165
17166
17167 ---
17168
17169 * [HBASE-15971](https://issues.apache.org/jira/browse/HBASE-15971) | *Critical* | **Regression: Random Read/WorkloadC slower in 1.x than 0.98**
17170
17171 Change the default rpc scheduler from 'deadline' to 'fifo' instead so it is the same as in branch 0.98. 'deadline' was of questionable benefit but with a high cost scheduling. To re-enable 'deadline', set hbase.ipc.server.callqueue.type to 'deadline' in your hbase-site.xml.
17172
17173
17174 ---
17175
17176 * [HBASE-15525](https://issues.apache.org/jira/browse/HBASE-15525) | *Critical* | **OutOfMemory could occur when using BoundedByteBufferPool during RPC bursts**
17177
17178 Added a new ByteBufferPool which pools N ByteBuffers. By default it makes off heap ByteBuffers when getBuffer() is called. The size of each buffer defaults to 64KB. This can be configured using 'hbase.ipc.server.reservoir.initial.buffer.size'.   The max number of buffers which can be pooled defaults to twice the number of handler threads in RS. This can be configured with key 'hbase.ipc.server.reservoir.initial.max'.  While responding to read requests and client support Codec, we will create CellBlocks and directly return it as PB payload. For making this block, we will use N ByteBuffers from pool as per the total size of the response cells. The default size of 64 KB for the buffer is inline with the number of bytes written to RPC layer in one short.(That is also 64KB).  When at point of time, the calle not able to get a free buffer from the pool (it returns null then), it will make on heap Buffer of same size (as that of Buffers in pool) and use that to create cell block.
17179
17180
17181 ---
17182
17183 * [HBASE-15994](https://issues.apache.org/jira/browse/HBASE-15994) | *Major* | **Allow selection of RpcSchedulers**
17184
17185 Adds a FifoRpcSchedulerFactory so you can try the FifoRpcScheduler by setting  "hbase.region.server.rpc.scheduler.factory.class"
17186
17187
17188 ---
17189
17190 * [HBASE-15989](https://issues.apache.org/jira/browse/HBASE-15989) | *Major* | **Remove hbase.online.schema.update.enable**
17191
17192 Removes the "hbase.online.schema.update.enable" property.
17193 from now, every operation that alter the schema (e.g. modifyTable, addFamily, removeFamily, ...) will use the online schema update. there is no need to disable/enable the table.
17194
17195
17196 ---
17197
17198 * [HBASE-15981](https://issues.apache.org/jira/browse/HBASE-15981) | *Minor* | **Stripe and Date-tiered compactions inaccurately suggest disabling table in docs**
17199
17200 Removes reference to disabling table in docs for stripe and date-tiered compactions
17201
17202
17203 ---
17204
17205 * [HBASE-15931](https://issues.apache.org/jira/browse/HBASE-15931) | *Critical* | **Add log for long-running tasks in AsyncProcess**
17206
17207 After HBASE-15931, we will log more details for long-running tasks in AsyncProcess#waitForMaximumCurrentTasks every 10 seconds, including:
17208 1. Table name will be included in the tasks status log
17209 2. On which regionserver(s) the tasks are runnning will be logged when less than hbase.client.threshold.log.details tasks left, by default 10.
17210 3. Against which regions the tasks are running will be logged when less than 2 tasks left.
17211
17212
17213 ---
17214
17215 * [HBASE-15907](https://issues.apache.org/jira/browse/HBASE-15907) | *Major* | **Missing documentation of create table split options**
17216
17217 documentation changes only - added section to Shell tricks and cross reference from region splitting section
17218
17219
17220 ---
17221
17222 * [HBASE-15915](https://issues.apache.org/jira/browse/HBASE-15915) | *Major* | **Set timeouts on hanging tests**
17223
17224 Use @ClassRule to set timeout on test case level (instead of @Rule which sets timeout for the test methods). CategoryBasedTimeout.forClass(..) determines the timeout value based on category annotation (small/medium/large) on the test case.
17225
17226
17227 ---
17228
17229 * [HBASE-15875](https://issues.apache.org/jira/browse/HBASE-15875) | *Major* | **Remove HTable references and HTableInterface**
17230
17231 **WARNING: No release note provided for this change.**
17232
17233
17234 ---
17235
17236 * [HBASE-15610](https://issues.apache.org/jira/browse/HBASE-15610) | *Blocker* | **Remove deprecated HConnection for 2.0 thus removing all PB references for 2.0**
17237
17238 **WARNING: No release note provided for this change.**
17239
17240
17241 ---
17242
17243 * [HBASE-15890](https://issues.apache.org/jira/browse/HBASE-15890) | *Major* | **Allow thrift to set/unset "cacheBlocks" for Scans**
17244
17245 Adds cacheBlocks to Scan
17246
17247
17248 ---
17249
17250 * [HBASE-15876](https://issues.apache.org/jira/browse/HBASE-15876) | *Blocker* | **Remove doBulkLoad(Path hfofDir, final HTable table) though it has not been through a full deprecation cycle**
17251
17252 Removes a doBulkLoad method though it has not been through a full deprecation cycle (but it is 'damaged' because it has a parameter that has been properly deprecated). Use the alternative {code}public void doBulkLoad(Path hfofDir, final Admin admin, Table table, RegionLocator regionLocator){code}
17253
17254 See http://mail-archives.apache.org/mod\_mbox/hbase-dev/201605.mbox/%3CCAMUu0w-ZiLoLBLO3D76=n3AjUr=VMtTUeYA28weLHYeq8+e3bQ@mail.gmail.com%3E for NOTICE on this 'premature' removal.
17255
17256
17257 ---
17258
17259 * [HBASE-15228](https://issues.apache.org/jira/browse/HBASE-15228) | *Major* | **Add the methods to RegionObserver to trigger start/complete restoring WALs**
17260
17261 Added two hooks around WAL restore.
17262 preReplayWALs(final ObserverContext\<? extends RegionCoprocessorEnvironment\> ctx,  HRegionInfo info, Path edits)
17263 and
17264 postReplayWALs(final ObserverContext\<? extends RegionCoprocessorEnvironment\> ctx,  HRegionInfo info, Path edits)
17265
17266 Will be called at start and end of restore of a WAL file.
17267 The other hook around WAL restore (preWALRestore ) will be called before restore of every entry within the WAL file.
17268
17269
17270 ---
17271
17272 * [HBASE-15856](https://issues.apache.org/jira/browse/HBASE-15856) | *Critical* | **Cached Connection instances can wind up with addresses never resolved**
17273
17274 During periods where DNS resolution was not available or not working correctly, we could previously cache unresolved hostnames forever, in some cases preventing further connections to these hosts even when DNS service was restored.  With this change, unresolved hostnames will no longer be cached, and will instead throw an UnknownHostException during connection setup.
17275
17276
17277 ---
17278
17279 * [HBASE-15593](https://issues.apache.org/jira/browse/HBASE-15593) | *Major* | **Time limit of scanning should be offered by client**
17280
17281 Add a new configuration: hbase.ipc.min.client.request.timeout
17282 Minimum allowable timeout (in milliseconds) in rpc request's header. This configuration exists to prevent the rpc service regarding this request as timeout immediately.
17283
17284
17285 ---
17286
17287 * [HBASE-15784](https://issues.apache.org/jira/browse/HBASE-15784) | *Major* | **Misuse core/maxPoolSize of LinkedBlockingQueue in ThreadPoolExecutor**
17288
17289 The core pool size and max pool size of ThreadPoolExecutor should be the same when LinkedBlockingQueue is used. Thus the configurations hbase.hconnection.threads.max, hbase.hconnection.meta.lookup.threads.max, hbase.region.replica.replication.threads.max and hbase.multihconnection.threads.max are used as the number of the core threads, and the related configurations \*.thread.core are not used any more.
17290
17291
17292 ---
17293
17294 * [HBASE-15651](https://issues.apache.org/jira/browse/HBASE-15651) | *Major* | **Add report-flakies.py to use jenkins api to get failing tests**
17295
17296 To find recent set of flakies, run the script added by this patch. Run it to get usage information passing -h:
17297
17298 {code}
17299 $ ./dev-support/report-flakies.py -h
17300 {code}
17301
17302 If you get the below:
17303
17304 {code}
17305 $ python ./dev-support/report-flakies.py
17306 Traceback (most recent call last):
17307   File "./dev-support/report-flakies.py", line 25, in \<module\>
17308     import requests
17309 ImportError: No module named requests
17310 {code}
17311
17312 ... install the requests module:
17313
17314 {code}
17315 $ sudo pip install requests
17316 {code}
17317
17318
17319 ---
17320
17321 * [HBASE-15780](https://issues.apache.org/jira/browse/HBASE-15780) | *Critical* | **Expose AuthUtil as IA.Public**
17322
17323 Downstream users with long lived applications that need to communicate with secure HBase instances can now rely on the AuthUtil class to handle authenticating via keytab.
17324
17325 For more information, see the javadoc for the org.apache.hadoop.hbase.AuthUtil class.
17326
17327
17328 ---
17329
17330 * [HBASE-15811](https://issues.apache.org/jira/browse/HBASE-15811) | *Blocker* | **Batch Get after batch Put does not fetch all Cells**
17331
17332 We were not waiting on all executors in a batch to complete which meant a read-your-own-writes could sometimes fail -- especially if client is loaded; i.e. putting to multiple machines in a cluster. The test for no-more-executors was damaged by the 0.99/0.98.4 fix "HBASE-11403 Fix race conditions around Object#notify"
17333
17334
17335 ---
17336
17337 * [HBASE-15801](https://issues.apache.org/jira/browse/HBASE-15801) | *Major* | **Upgrade checkstyle for all branches**
17338
17339 All active branches now use maven-checkstyle-plugin 2.17 and checkstyle 6.18.
17340
17341
17342 ---
17343
17344 * [HBASE-15236](https://issues.apache.org/jira/browse/HBASE-15236) | *Major* | **Inconsistent cell reads over multiple bulk-loaded HFiles**
17345
17346 This jira fixes that following bug:
17347 During bulkloading, if there are multiple hfiles corresponding to same region, and if they have same timestamps (which may have been set using importtsv.timestamp) and duplicate keys across them, then get and scan may return values coming from different hfiles.
17348
17349
17350 ---
17351
17352 * [HBASE-15740](https://issues.apache.org/jira/browse/HBASE-15740) | *Major* | **Replication source.shippedKBs metric is undercounting because it is in KB**
17353
17354 Removed Replication source.shippedKBs metric in favor of source.shippedBytes
17355
17356
17357 ---
17358
17359 * [HBASE-15773](https://issues.apache.org/jira/browse/HBASE-15773) | *Major* | **CellCounter improvements**
17360
17361 The CellCounter map reduce job now supports additional configuration options on the Scan instance it creates, using the org.apache.hadoop.hbase.mapreduce.TableInputFormat defined property names.  For a full list of the options, run ./hbase org.apache.hadoop.hbase.mapreduce.CellCounter with no arguments.
17362
17363 CellCounter also no longer creates job counters for per-rowkey and per-rowkey/qualifier cell counts.  For most tables, these counters would cause the job to fail due to mapreduce job counter limits.
17364
17365
17366 ---
17367
17368 * [HBASE-15759](https://issues.apache.org/jira/browse/HBASE-15759) | *Minor* | **RegionObserver.preStoreScannerOpen() doesn't have acces to current readpoint**
17369
17370 The following RegionObserver method is deprecated and would no longer be called in hbase 2.0:
17371
17372   public KeyValueScanner preStoreScannerOpen(final ObserverContext\<RegionCoprocessorEnvironment\> c,
17373       final Store store, final Scan scan, final NavigableSet\<byte[]\> targetCols,
17374       final KeyValueScanner s) throws IOException {
17375
17376 Instead, override this method:
17377
17378   public KeyValueScanner preStoreScannerOpen(final ObserverContext\<RegionCoprocessorEnvironment\> c,
17379       final Store store, final Scan scan, final NavigableSet\<byte[]\> targetCols,
17380       final KeyValueScanner s, final long readPt) throws IOException {
17381
17382
17383 ---
17384
17385 * [HBASE-15743](https://issues.apache.org/jira/browse/HBASE-15743) | *Major* | **Add Transparent Data Encryption support for FanOutOneBlockAsyncDFSOutput**
17386
17387 Now the AsyncFSWAL can write data to a encryption zone on HDFS.
17388
17389
17390 ---
17391
17392 * [HBASE-15767](https://issues.apache.org/jira/browse/HBASE-15767) | *Major* | **Upgrade httpclient dependency**
17393
17394 HBase now relies on version 4.3.6 of the Apache Commons HTTPClient library. Downstream users who are exposed to it via the HBase classpath will have to similarly update their dependency.
17395
17396
17397 ---
17398
17399 * [HBASE-15575](https://issues.apache.org/jira/browse/HBASE-15575) | *Minor* | **Rename table DDL \*Handler methods in MasterObserver to more meaningful names**
17400
17401 **WARNING: No release note provided for this change.**
17402
17403
17404 ---
17405
17406 * [HBASE-15720](https://issues.apache.org/jira/browse/HBASE-15720) | *Major* | **Print row locks at the debug dump page**
17407
17408 Adds a section to the debug dump page listing current row locks held.
17409
17410
17411 ---
17412
17413 * [HBASE-15703](https://issues.apache.org/jira/browse/HBASE-15703) | *Critical* | **Deadline scheduler needs to return to the client info about skipped calls, not just drop them**
17414
17415 With previous deadline mode of RPC scheduling (the implementation in SimpleRpcScheduler, which is basically a FIFO except that long-running scans are de-prioritized) and FIFO-based RPC scheduler clients are getting CallQueueTooBigException when RPC call queue is full.
17416
17417 With this patch and when hbase.ipc.server.callqueue.type property is set to "codel" mode, clients will also be getting CallDroppedException, which means that the request was discarded by the server as it considers itself to be overloaded and starts to drop requests to avoid going down under the load. The clients will retry upon receiving this exception. It doesn't clear MetaCache with region locations.
17418
17419
17420 ---
17421
17422 * [HBASE-15281](https://issues.apache.org/jira/browse/HBASE-15281) | *Major* | **Allow the FileSystem inside HFileSystem to be wrapped**
17423
17424 This patch adds new configuration property - hbase.fs.wrapper. If provided, it should be fully qualified class name of the class used as a pluggable wrapper for HFileSystem. This may be useful for specific debugging/tracing needs.
17425
17426
17427 ---
17428
17429 * [HBASE-15551](https://issues.apache.org/jira/browse/HBASE-15551) | *Minor* | **Make call queue too big exception use servername**
17430
17431 Fixes issue when CallQueueTooBig exception returned to the client could print useless address info (like 0.0.0.0) if RPC server is listening on something other than the host name, making troubleshooting inconvenient.
17432
17433
17434 ---
17435
17436 * [HBASE-15711](https://issues.apache.org/jira/browse/HBASE-15711) | *Major* | **Add client side property to allow logging details for batch errors**
17437
17438 In HBASE-15711 a new client side property hbase.client.log.batcherrors.details is introduced to allow logging full stacktrace of exceptions for batch error. It's disabled by default and set the property to true will enable it.
17439
17440
17441 ---
17442
17443 * [HBASE-15686](https://issues.apache.org/jira/browse/HBASE-15686) | *Major* | **Add override mechanism for the exempt classes when dynamically loading table coprocessor**
17444
17445 New coprocessor table descriptor attribute, hbase.coprocessor.classloader.included.classes, is added.
17446 User can specify class name prefixes (semicolon separated) which should be loaded by CoprocessorClassLoader through this attribute using the following syntax:
17447 {code}
17448   hbase\> alter 't1',    'coprocessor'=\>'hdfs:///foo.jar\|com.foo.FooRegionObserver\|1001\|arg1=1,arg2=2'
17449 {code}
17450
17451
17452 ---
17453
17454 * [HBASE-15645](https://issues.apache.org/jira/browse/HBASE-15645) | *Critical* | **hbase.rpc.timeout is not used in operations of HTable**
17455
17456 Fixes regression where hbase.rpc.timeout configuration was ignored in branch-1.0+
17457
17458 Adds new methods setOperationTimeout, getOperationTimeout, setRpcTimeout, and getRpcTimeout to Table. In branch-1.3+ they are public interfaces and in 1.0-1.2 they are labeled as @InterfaceAudience.Private.
17459
17460 Adds hbase.client.operation.timeout to hbase-default.xml with default of 1200000
17461
17462
17463 ---
17464
17465 * [HBASE-15477](https://issues.apache.org/jira/browse/HBASE-15477) | *Major* | **Do not save 'next block header' when we cache hfileblocks**
17466
17467 Fix over-persisting in blockcache; no longer save the block PLUS the header of the next block (33 bytes) when writing the cache.
17468
17469 Also removes support for hfileblock v1; hfile block v1 was used writing hfile v1. hfile v1 was the default in hbase before hbase-0.92. hbase.96 would not start unless all v1 hfiles had been compacted out of the cluster.
17470
17471
17472 ---
17473
17474 * [HBASE-15628](https://issues.apache.org/jira/browse/HBASE-15628) | *Major* | **Implement an AsyncOutputStream which can work with any FileSystem implementation**
17475
17476 Introduce an AsyncFSOutput interface which is an abstraction of the original FanOutOneBlockAsyncDFSOutput. Now you can create AsyncFSOutput on any FileSystem using the method AsyncFSOutputHelper.createOutput. The returned AsyncFSOutput will be FanOutOneBlockAsyncDFSOutput if the given FileSystem is a DistributedFileSystem.
17477
17478
17479 ---
17480
17481 * [HBASE-15392](https://issues.apache.org/jira/browse/HBASE-15392) | *Major* | **Single Cell Get reads two HFileBlocks**
17482
17483 When an explicit Get with a one or more columns specified, we at a minimum, were overseeking, reading until we tripped over the next row, regardless, and only then returning. If the next row was in-block, we'd just do too much seeking but if the next row was in the next (or in the next block beyond that), we would keep seeking and loading blocks until we found the next row before we'd return.
17484
17485 There remains one case where we will still 'overread'. It is when the row end aligns with the end of the block. In this case we will load the next block just to find that there are no more cells in the current row. See HBASE-15457.
17486
17487
17488 ---
17489
17490 * [HBASE-15671](https://issues.apache.org/jira/browse/HBASE-15671) | *Major* | **Add per-table metrics on memstore, storefile and regionsize**
17491
17492 Adds storeFileSize, memstoreSize and tableSize to the per-table metrics.
17493
17494
17495 ---
17496
17497 * [HBASE-15366](https://issues.apache.org/jira/browse/HBASE-15366) | *Major* | **Add doc, trace-level logging, and test around hfileblock**
17498
17499 No functional change. Added javadoc, comments, and extra trace-level logging to make clear what is happening around the reading and caching of hfile blocks.
17500
17501
17502 ---
17503
17504 * [HBASE-15368](https://issues.apache.org/jira/browse/HBASE-15368) | *Major* | **Add pluggable window support**
17505
17506 Use 'hbase.hstore.compaction.date.tiered.window.factory.class' to specify the window implementation you like for date tiered compaction. Now the only and default implementation is org.apache.hadoop.hbase.regionserver.compactions.ExponentialCompactionWindowFactory.
17507
17508 {code}
17509 \<property\>
17510 \<name\>hbase.hstore.compaction.date.tiered.window.factory.class\</name\>
17511 \<value\>org.apache.hadoop.hbase.regionserver.compactions.ExponentialCompactionWindowFactory\</value\>
17512 \</property\>
17513 \<property\>
17514 {code}
17515
17516
17517 ---
17518
17519 * [HBASE-15518](https://issues.apache.org/jira/browse/HBASE-15518) | *Major* | **Add Per-Table metrics back**
17520
17521 Adds per-table metrics aggregated from per-region metrics in region server metrics. New metrics are available under JMX section "Hadoop:service=HBase,name=RegionServer,sub=Tables" and they are available via hadoop metrics2 collectors.
17522
17523
17524 ---
17525
17526 * [HBASE-15640](https://issues.apache.org/jira/browse/HBASE-15640) | *Major* | **L1 cache doesn't give fair warning that it is showing partial stats only when it hits limit**
17527
17528 The blockcache UI tab would stop refreshing at 100k blocks (configurable, see "hbase.ui.blockcache.by.file.max"), which isn't very many blocks when doing a big cache, giving a misleading picture of the content of L1 and/or L2 cache. Up the default limit to 1M blocks (UI takes a while but just a few seconds counting over 1M blocks).
17529
17530 Also, when beyond the limit give the user a noticeable WARNING in the UI.
17531
17532
17533 ---
17534
17535 * [HBASE-15386](https://issues.apache.org/jira/browse/HBASE-15386) | *Major* | **PREFETCH\_BLOCKS\_ON\_OPEN in HColumnDescriptor is ignored**
17536
17537 This was a non-issue. The PREFETCH\_... flag actually works. While here though made the following additions.
17538
17539 Changes the prefetch TRACE-level loggings to include the word 'Prefetch' in them so you know what they are about.
17540
17541 Changes the cryptic logging of the CacheConfig#toString to have some preamble saying why and what column family is responsible (helps figure what is going on)
17542
17543 Add test that verifies setting flag on HColumnDescriptor actually works.
17544
17545
17546 ---
17547
17548 * [HBASE-13372](https://issues.apache.org/jira/browse/HBASE-13372) | *Major* | **Unit tests for SplitTransaction and RegionMergeTransaction listeners**
17549
17550 HBASE-13372 Add unit tests for SplitTransaction and RegionMergeTransaction listeners
17551
17552
17553 ---
17554
17555 * [HBASE-15187](https://issues.apache.org/jira/browse/HBASE-15187) | *Major* | **Integrate CSRF prevention filter to REST gateway**
17556
17557 Protection against CSRF attack can be turned on with config parameter, hbase.rest.csrf.enabled - default value is false.
17558
17559 The custom header to be sent can be changed via config parameter, hbase.rest.csrf.custom.header whose default value is "X-XSRF-HEADER".
17560
17561 Config parameter, hbase.rest.csrf.methods.to.ignore , controls which HTTP methods are not associated with customer header check.
17562
17563 Config parameter, hbase.rest-csrf.browser-useragents-regex , is a comma-separated list of regular expressions used to match against an HTTP request's User-Agent header when protection against cross-site request forgery (CSRF) is enabled for REST server by setting hbase.rest.csrf.enabled to true.
17564
17565 The implementation came from hadoop/hadoop-common-project/hadoop-common/src/main/java/org/apache/hadoop/security/http/RestCsrfPreventionFilter.java
17566
17567 We should periodically update the RestCsrfPreventionFilter.java in hbase codebase to include fixes to the hadoop implementation.
17568
17569
17570 ---
17571
17572 * [HBASE-15481](https://issues.apache.org/jira/browse/HBASE-15481) | *Trivial* | **Add pre/post roll to WALObserver**
17573
17574 <!-- markdown -->
17575
17576
17577 WALObserver coprocessors now can receive notifications of WAL rolling via the new methods `preWALRoll` and `postWALRoll`.
17578
17579 This change is incompatible due to the addition of these methods to the `WALObserver` interface. Downstream users are encouraged to instead extend the `BaseWALObserver` class, which remains compatible through this change.
17580
17581
17582 ---
17583
17584 * [HBASE-15507](https://issues.apache.org/jira/browse/HBASE-15507) | *Major* | **Online modification of enabled ReplicationPeerConfig**
17585
17586 Added update\_peer\_config to the HBase shell and ReplicationAdmin, and provided a callback for custom replication endpoints to be notified of changes to their configuration and peer data
17587
17588
17589 ---
17590
17591 * [HBASE-15537](https://issues.apache.org/jira/browse/HBASE-15537) | *Major* | **Make multi WAL work with WALs other than FSHLog**
17592
17593 Add the delegate config for multiwal back. Now you can use 'hbase.wal.regiongrouping.delegate.provider' to specify the wal provider you want to use for multiwal. For example:
17594 {code}
17595 \<property\>
17596 \<name\>hbase.wal.regiongrouping.delegate.provider\</name\>
17597 \<value\>asyncfs\</value\>
17598 \</property\>
17599 {code}
17600 And the default value is filesystem which is the alias of DefaultWALProvider, i.e., the FSHLog.
17601
17602
17603 ---
17604
17605 * [HBASE-15400](https://issues.apache.org/jira/browse/HBASE-15400) | *Major* | **Use DateTieredCompactor for Date Tiered Compaction**
17606
17607 With this patch combined with HBASE-15389, when we compact, we can output multiple files along the current window boundaries. There are two use cases:
17608 1. Major compaction: We want to output date tiered store files with data older than max age archived in trunks of the window size on the higher tier. Once a window is old enough, we don't combine the windows to promote to the next tier any further. So files in these windows retain the same timespan as they were minor-compacted last time, which is the window size of the highest tier. Major compaction will touch these files and we want to maintain the same layout. This way, TTL and archiving will be simpler and more efficient.
17609 2. Bulk load files and the old file generated by major compaction before upgrading to DTCP.
17610
17611 This will change the way to enable date tiered compaction.
17612 To turn it on:
17613 hbase.hstore.engine.class: org.apache.hadoop.hbase.regionserver.DateTieredStoreEngine
17614
17615 With tiered compaction all servers in the cluster will promote windows to higher tier at the same time, so using a compaction throttle is recommended:
17616 hbase.regionserver.throughput.controller:org.apache.hadoop.hbase.regionserver.compactions.PressureAwareCompactionThroughputController
17617 hbase.hstore.compaction.throughput.higher.bound and hbase.hstore.compaction.throughput.lower.bound need to be set for desired throughput range as uncompressed rates.
17618
17619 Because there will most likely be more store files around, we need to adjust the configuration so that flush won't be blocked and compaction will be properly throttled:
17620 hbase.hstore.blockingStoreFiles: change to 50 if using all default parameters when turning on date tiered compaction. Use 1.5~2 x projected file count if changing the parameters, Projected file count = windows per tier x tier count + incoming window min + files older than max age
17621
17622 Because major compaction is turned on now, we also need to adjust the configuration for max file to compact according to the larger file count:
17623 hbase.hstore.compaction.max: set to the same number as hbase.hstore.blockingStoreFiles.
17624
17625 For more details, please refer to the design spec at https://docs.google.com/document/d/1\_AmlNb2N8Us1xICsTeGDLKIqL6T-oHoRLZ323MG\_uy8/edit#
17626
17627
17628 ---
17629
17630 * [HBASE-15592](https://issues.apache.org/jira/browse/HBASE-15592) | *Major* | **Print Procedure WAL content**
17631
17632 Use hbase org.apache.hadoop.hbase.procedure2.store.wal.ProcedureWALPrettyPrinter
17633 to print the content of a Procedure WAL.
17634 e.g.
17635 hbase org.apache.hadoop.hbase.procedure2.store.wal.ProcedureWALPrettyPrinter -f /hbase/MasterProcWALs/state-00000000000000002571.log
17636
17637
17638 ---
17639
17640 * [HBASE-15396](https://issues.apache.org/jira/browse/HBASE-15396) | *Minor* | **Enhance mapreduce.TableSplit to add encoded region name**
17641
17642 To aid troubleshooting of MapReduce job that rely on the HBase provided input format, splits now include the encoded region name they cover.
17643
17644
17645 ---
17646
17647 * [HBASE-15568](https://issues.apache.org/jira/browse/HBASE-15568) | *Major* | **Procedure V2 - Remove CreateTableHandler in HBase Apache 2.0 release**
17648
17649 **WARNING: No release note provided for this change.**
17650
17651
17652 ---
17653
17654 * [HBASE-15521](https://issues.apache.org/jira/browse/HBASE-15521) | *Major* | **Procedure V2 - RestoreSnapshot and CloneSnapshot**
17655
17656 **WARNING: No release note provided for this change.**
17657
17658
17659 ---
17660
17661 * [HBASE-15538](https://issues.apache.org/jira/browse/HBASE-15538) | *Major* | **Implement secure async protobuf wal writer**
17662
17663 Add the following config in hbase-site.xml if you want to use secure protobuf wal writer together with AsyncFSWAL
17664 {code}
17665 \<property\>
17666 \<name\>hbase.regionserver.hlog.async.writer.impl\</name\>
17667 \<value\>org.apache.hadoop.hbase.regionserver.wal.SecureAsyncProtobufLogWriter\</value\>
17668 \</property\>
17669 \<property\>
17670 {code}
17671
17672
17673 ---
17674
17675 * [HBASE-11393](https://issues.apache.org/jira/browse/HBASE-11393) | *Major* | **Replication TableCfs should be a PB object rather than a string**
17676
17677 **WARNING: No release note provided for this change.**
17678
17679
17680 ---
17681
17682 * [HBASE-15265](https://issues.apache.org/jira/browse/HBASE-15265) | *Major* | **Implement an asynchronous FSHLog**
17683
17684 To enable, set the WALProvider as follows:
17685
17686 {code}
17687 \<property\>
17688 \<name\>hbase.wal.provider\</name\>
17689 \<value\>asyncfs\</value\>
17690 \</property\>
17691 \<property\>
17692 {code}
17693
17694 To check which provider is active, look for the log line:
17695
17696 LOG.info("Instantiating WALProvider of type " + clazz);
17697
17698
17699 ---
17700
17701 * [HBASE-14256](https://issues.apache.org/jira/browse/HBASE-14256) | *Major* | **Flush task message may be confusing when region is recovered**
17702
17703 HBASE-14256 Correct confusing flush task message
17704
17705
17706 ---
17707
17708 * [HBASE-15212](https://issues.apache.org/jira/browse/HBASE-15212) | *Major* | **RPCServer should enforce max request size**
17709
17710 Adds a configuration parameter "hbase.ipc.max.request.size" which defaults to 256MB to protect the server against very large incoming RPC requests. All requests larger than this size will be immediately rejected before allocating any resources (memory allocation, etc).
17711
17712
17713 ---
17714
17715 * [HBASE-15412](https://issues.apache.org/jira/browse/HBASE-15412) | *Major* | **Add average region size metric**
17716
17717 Adds a new metric for called "averageRegionSize" that is emitted as a regionserver metric. Metric description:
17718 Average region size over the region server including memstore and storefile sizes
17719
17720
17721 ---
17722
17723 * [HBASE-15479](https://issues.apache.org/jira/browse/HBASE-15479) | *Major* | **No more garbage or beware of autoboxing**
17724
17725 This fix decreases client's memory allocation during writes by more than 50%.
17726
17727
17728 ---
17729
17730 * [HBASE-15322](https://issues.apache.org/jira/browse/HBASE-15322) | *Critical* | **Operations using Unsafe path broken for platforms not having sun.misc.Unsafe**
17731
17732 **WARNING: No release note provided for this change.**
17733
17734
17735 ---
17736
17737 * [HBASE-12940](https://issues.apache.org/jira/browse/HBASE-12940) | *Major* | **Expose listPeerConfigs and getPeerConfig to the HBase shell**
17738
17739 Adds get\_peer\_config and list\_peer\_configs to the hbase shell.
17740
17741
17742 ---
17743
17744 * [HBASE-15430](https://issues.apache.org/jira/browse/HBASE-15430) | *Critical* | **Failed taking snapshot - Manifest proto-message too large**
17745
17746 Failed taking snapshot - Manifest proto-message too large. add property ("snapshot.manifest.size.limit")  to change max size of proto-message
17747
17748
17749 ---
17750
17751 * [HBASE-15323](https://issues.apache.org/jira/browse/HBASE-15323) | *Major* | **Hbase Rest CheckAndDeleteAPi should be able to delete more cells**
17752
17753 Fixed an issue in REST server checkAndDelete operation where the remaining cells other than the to-be-checked column are also applied in the Delete operation. Also fixed an issue in RemoteHTable where the Delete object was not passed correctly to the REST server side.
17754
17755
17756 ---
17757
17758 * [HBASE-15377](https://issues.apache.org/jira/browse/HBASE-15377) | *Major* | **Per-RS Get metric is time based, per-region metric is size-based**
17759
17760 Per-region metrics related to Get histograms are changed from being response size based into being latency based similar to the per-regionserver metrics of the same name.
17761
17762 Added GetSize histogram metrics at the per-regionserver and per-region level for the response sizes.
17763
17764
17765 ---
17766
17767 * [HBASE-6721](https://issues.apache.org/jira/browse/HBASE-6721) | *Major* | **RegionServer Group based Assignment**
17768
17769 [ADVANCED USERS ONLY] This patch adds a new experimental module hbase-rsgroup. It is an advanced feature for partitioning regionservers into distinctive groups for strict isolation, and should only be used by users who are sophisticated enough to understand the full implications and have a sufficient background in managing HBase clusters.
17770
17771 RSGroups can be defined and managed with shell commands or corresponding Java APIs. A server can be added to a group with hostname and port pair, and tables can be moved to this group so that only regionservers in the same rsgroup can host the regions of the table. RegionServers and tables can only belong to 1 group at a time. By default, all tables and regionservers belong to the "default" group. System tables can also be put into a group using the regular APIs. A custom balancer implementation tracks assignments per rsgroup and makes sure to move regions to the relevant regionservers in that group. The group information is stored in a regular HBase table, and a zookeeper-based read-only cache is used at the cluster bootstrap time.
17772
17773 To enable, add the following to your hbase-site.xml and restart your Master:
17774
17775
17776  \<property\>
17777    \<name\>hbase.coprocessor.master.classes\</name\>
17778    \<value\>org.apache.hadoop.hbase.rsgroup.RSGroupAdminEndpoint\</value\>
17779  \</property\>
17780  \<property\>
17781    \<name\>hbase.master.loadbalancer.class\</name\>
17782    \<value\>org.apache.hadoop.hbase.rsgroup.RSGroupBasedLoadBalancer\</value\>
17783  \</property\>
17784
17785
17786 Then use the shell 'rsgroup' commands to create and manipulate regionserver groups: e.g. to add a group and then add a server to it, do as follows:
17787
17788  hbase(main):008:0\> add\_rsgroup 'my\_group'
17789  Took 0.5610 seconds
17790
17791 This adds a group to the 'hbase:rsgroup' system table. Add a server (hostname + port) to the group using the 'move\_rsgroup\_servers' command as follows:
17792
17793  hbase(main):010:0\> move\_rsgroup\_servers 'my\_group',['k.att.net:51129']
17794
17795
17796 ---
17797
17798 * [HBASE-15435](https://issues.apache.org/jira/browse/HBASE-15435) | *Major* | **Add WAL (in bytes) written metric**
17799
17800 Adds a new metric named "writtenBytes" as a per-regionserver metric. Metric Description:
17801 Size (in bytes) of the data written to the WAL.
17802
17803
17804 ---
17805
17806 * [HBASE-13963](https://issues.apache.org/jira/browse/HBASE-13963) | *Critical* | **avoid leaking jdk.tools**
17807
17808 HBase now ensures that the JDK tools jar used during the build process is not exposed to downstream clients as a transitive dependency of hbase-annotations.
17809
17810 If you need to have the JDK tools jar in your classpath, you should add a system dependency on it. See the hbase-annotations pom for an example of the necessary pom additions.
17811
17812
17813 ---
17814
17815 * [HBASE-15271](https://issues.apache.org/jira/browse/HBASE-15271) | *Major* | **Spark Bulk Load: Need to write HFiles to tmp location then rename to protect from Spark Executor Failures**
17816
17817 When using the bulk load helper provided by the hbase-spark module, output files will now be written into temporary files and only made available when the executor has successfully completed.
17818
17819 Previously, failed executors would leave their files in place in a way that would be picked up by a bulk load command. This caused retried failures to include spurious copies of some cells.
17820
17821
17822 ---
17823
17824 * [HBASE-15364](https://issues.apache.org/jira/browse/HBASE-15364) | *Major* | **Fix unescaped \< characters in Javadoc**
17825
17826 HBASE-15364 Fix unescaped \< and \> characters in Javadoc
17827
17828
17829 ---
17830
17831 * [HBASE-15243](https://issues.apache.org/jira/browse/HBASE-15243) | *Major* | **Utilize the lowest seek value when all Filters in MUST\_PASS\_ONE FilterList return SEEK\_NEXT\_USING\_HINT**
17832
17833 When all filters in a MUST\_PASS\_ONE FilterList return a SEEK\_USING\_NEXT\_HINT code, we return SEEK\_NEXT\_USING\_HINT from the FilterList#filterKeyValue() to utilize the lowest seek value.
17834
17835
17836 ---
17837
17838 * [HBASE-15354](https://issues.apache.org/jira/browse/HBASE-15354) | *Major* | **Use same criteria for clearing meta cache for all operations**
17839
17840 This patch fixes some issues when MetaCache (region location cache) gets unnecessarily dropped on the client.
17841
17842 On master branch we now in RegionServerCallable and RegionServerAdminCallable pass the actual exception down to Connection#updateCachedLocation, so we could check there if the exception is "meta-clearing" or not.
17843
17844 on branch-1, branch-1.2 and branch 1.3 we now check if the exception is meta-clearing or not in AsyncProcess (this check was there on master, but not on earlier branches)
17845
17846
17847 ---
17848
17849 * [HBASE-15376](https://issues.apache.org/jira/browse/HBASE-15376) | *Major* | **ScanNext metric is size-based while every other per-operation metric is time based**
17850
17851 Removed ScanNext histogram metrics as regionserver level and per-region level metrics since the semantics is not compatible with other similar metrics (size histogram vs latency histogram).
17852
17853 Instead, this patch adds ScanTime and ScanSize histogram metrics at the regionserver and per-region level.
17854
17855
17856 ---
17857
17858 * [HBASE-15338](https://issues.apache.org/jira/browse/HBASE-15338) | *Minor* | **Add a option to disable the data block cache for testing the performance of underlying file system**
17859
17860 Add a new config: hbase.block.data.cacheonread, which is a global switch for caching data blocks on read. The default value of this switch is true, and data blocks will be cached on read if the block cache is enabled for the family and cacheBlocks flag is set to be true for get and scan operations. If this global switch is set to false, data blocks won't be cached even if the block cache is enabled for the family and the cacheBlocks flag of Gets or Scans are sets as true. Bloom blocks and index blocks are always be cached if the block cache of the regionserver is enabled. One usage of this switch is for the performance tests for the extreme case that  the cache for data blocks all missed and all data blocks are read from underlying file system.
17861
17862
17863 ---
17864
17865 * [HBASE-15136](https://issues.apache.org/jira/browse/HBASE-15136) | *Critical* | **Explore different queuing behaviors while busy**
17866
17867 Previously RPC request scheduler in HBase had 2 modes in could operate in:
17868
17869  - simple FIFO
17870  - "partial" deadline, where deadline constraints are only imposed on long-running scan requests.
17871
17872 This patch adds new type of scheduler to HBase, based on the research around controlled delay (CoDel) algorithm [1], used in networking to combat bufferbloat, as well as some analysis on generalizing it to generic request queues [2]. The purpose of that work is to prevent long standing call queues caused by discrepancy between request rate and available throughput, caused by kernel/disk IO/networking stalls.
17873
17874 New RPC scheduler could be enabled by setting hbase.ipc.server.callqueue.type=codel in configuration. Several additional params allow to configure algorithm behavior -
17875
17876 hbase.ipc.server.callqueue.codel.target.delay
17877 hbase.ipc.server.callqueue.codel.interval
17878 hbase.ipc.server.callqueue.codel.lifo.threshold
17879
17880 [1] Controlling Queue Delay / A modern AQM is just one piece of the solution to bufferbloat. http://queue.acm.org/detail.cfm?id=2209336
17881 [2] Fail at Scale / Reliability in the face of rapid change. http://queue.acm.org/detail.cfm?id=2839461
17882
17883
17884 ---
17885
17886 * [HBASE-15181](https://issues.apache.org/jira/browse/HBASE-15181) | *Major* | **A simple implementation of date based tiered compaction**
17887
17888 Date tiered compaction policy is a date-aware store file layout that is beneficial for time-range scans for time-series data.
17889
17890 When it performs well:
17891
17892     reads for limited time ranges, especially scans of recent data
17893
17894 When it doesn't perform as well:
17895
17896     random gets without a time range
17897     frequent deletes and updates
17898     out of order data writes, especially writes with timestamps in the future
17899     bulk loads of historical data
17900
17901 Recommended configuration:
17902 To turn on Date Tiered Compaction (It is not recommended to turn on for the whole cluster because that will put meta table on it too and random get on meta table will be impacted):
17903 hbase.hstore.compaction.compaction.policy: org.apache.hadoop.hbase.regionserver.compactions.DateTieredCompactionPolicy
17904
17905 Parameters for Date Tiered Compaction:
17906 hbase.hstore.compaction.date.tiered.max.storefile.age.millis: Files with max-timestamp smaller than this will no longer be compacted.Default at Long.MAX\_VALUE.
17907 hbase.hstore.compaction.date.tiered.base.window.millis: base window size in milliseconds. Default at 6 hours.
17908 hbase.hstore.compaction.date.tiered.windows.per.tier: number of windows per tier. Default at 4.
17909 hbase.hstore.compaction.date.tiered.incoming.window.min: minimal number of files to compact in the incoming window. Set it to expected number of files in the window to avoid wasteful compaction. Default at 6.
17910 hbase.hstore.compaction.date.tiered.window.policy.class: the policy to select store files within the same time window. It doesn’t apply to the incoming window. Default at exploring compaction. This is to avoid wasteful compaction.
17911
17912 With tiered compaction all servers in the cluster will promote windows to higher tier at the same time, so using a compaction throttle is recommended:
17913 hbase.regionserver.throughput.controller:org.apache.hadoop.hbase.regionserver.compactions.PressureAwareCompactionThroughputController
17914
17915 Because there will most likely be more store files around, we need to adjust the configuration so that flush won't be blocked and compaction will be properly throttled:
17916 hbase.hstore.blockingStoreFiles: change to 50 if using all default parameters when turning on date tiered compaction. Use 1.5~2 x projected file count if changing the parameters, Projected file count = windows per tier x tier count + incoming window min + files older than max age
17917
17918 For more details, please refer to the design spec at https://docs.google.com/document/d/1\_AmlNb2N8Us1xICsTeGDLKIqL6T-oHoRLZ323MG\_uy8/edit#
17919
17920
17921 ---
17922
17923 * [HBASE-15290](https://issues.apache.org/jira/browse/HBASE-15290) | *Major* | **Hbase Rest CheckAndAPI should save other cells along with compared cell**
17924
17925 Fixed an issue in REST server checkAndPut operation where the remaining cells other than the to-be-checked column are also applied in the put operation .
17926
17927
17928 ---
17929
17930 * [HBASE-15264](https://issues.apache.org/jira/browse/HBASE-15264) | *Major* | **Implement a fan out HDFS OutputStream**
17931
17932 Implement a fan-out asynchronous DFSOutputStream for implementing new WAL writer.
17933
17934
17935 ---
17936
17937 * [HBASE-13259](https://issues.apache.org/jira/browse/HBASE-13259) | *Critical* | **mmap() based BucketCache IOEngine**
17938
17939 mmap() based bucket cache can be configured by specifying the property
17940 {code}
17941 \<property\>
17942   \<name\>hbase.bucketcache.ioengine\</name\>
17943   \<value\> mmap://filepath \</value\>
17944 \</property\>
17945 {code}
17946 This mode of bucket cache is ideal when your file based bucket cache size is lesser than then available RAM. When the cache is bigger than the available RAM then the kernel page faults will make this cache perform lesser particularly in case of scans.
17947
17948
17949 ---
17950
17951 * [HBASE-11927](https://issues.apache.org/jira/browse/HBASE-11927) | *Major* | **Use Native Hadoop Library for HFile checksum (And flip default from CRC32 to CRC32C)**
17952
17953 Checksumming is cpu intensive. HBase computes additional checksums for HFiles (hdfs does checksums too) and stores them inline with file data. During reading, these checksums are verified to ensure data is not corrupted. This patch tries to use Hadoop Native Library for checksum computation, if it’s available, otherwise falls back to standard Java libraries. Instructions to load NHL in HBase can be found here (http://hbase.apache.org/book.html#hadoop.native.lib).
17954
17955 Default checksum algorithm has been changed from CRC32 to CRC32C primarily because of two reasons: 1) CRC32C has better error detection properties, and 2) New Intel processors have a dedicated instruction for crc32c computation (SSE4.2 instruction set)\*. This change is fully backward compatible. Also, users should not see any differences except decrease in cpu usage. To keep old settings, set configuration ‘hbase.hstore.checksum.algorithm’ to ‘CRC32’.
17956
17957 \* On linux, run 'cat /proc/cpuinfo’ and look for sse4\_2 in list of flags to see if your processor supports SSE4.2.
17958
17959
17960 ---
17961
17962 * [HBASE-15219](https://issues.apache.org/jira/browse/HBASE-15219) | *Critical* | **Canary tool does not return non-zero exit code when one of regions is in stuck state**
17963
17964 A new flag is added for Canary tool: -treatFailureAsError
17965 When this flag is specified, read / write failure would result in Canary tool exit code of 5.
17966
17967
17968 ---
17969
17970 * [HBASE-14949](https://issues.apache.org/jira/browse/HBASE-14949) | *Major* | **Resolve name conflict when splitting if there are duplicated WAL entries**
17971
17972 Now we can write duplicated WAL entries into different WAL files. This feature is required by the replication consistency fix and new implementation of WAL writer.
17973
17974
17975 ---
17976
17977 * [HBASE-15100](https://issues.apache.org/jira/browse/HBASE-15100) | *Blocker* | **Master WALProcs still never clean up**
17978
17979 The constructor for o.a.h.hbase.ProcedureInfo was mistakenly labeled IA.Public in previous releases and has now changed to IA.Private. Downstream users are safe to consume ProcedureInfo objects returned from HBase public interfaces, but should not expect to be able to reliably create new instances themselves.
17980
17981 The method ProcedureInfo.setNonceKey has been removed, because it should not have been exposed to clients.
17982
17983
17984 ---
17985
17986 * [HBASE-14355](https://issues.apache.org/jira/browse/HBASE-14355) | *Major* | **Scan different TimeRange for each column family**
17987
17988 Adds being able to Scan each column family with a different time range. Adds new methods setColumnFamilyTimeRange and getColumnFamilyTimeRange to Scan.
17989
17990
17991 ---
17992
17993 * [HBASE-14460](https://issues.apache.org/jira/browse/HBASE-14460) | *Critical* | **[Perf Regression] Merge of MVCC and SequenceId (HBASE-8763) slowed Increments, CheckAndPuts, batch operations**
17994
17995 This release note tries to tell the general story. Dive into sub-tasks for more specific release noting.
17996
17997 Increments, appends, checkAnd\* have been slow since hbase-.1.0.0. The unification of mvcc and sequence id done by HBASE-8763 was responsible.
17998
17999 A ‘fast-path’ workaround was added by HBASE-15031 “Fix merge of MVCC and SequenceID performance regression in branch-1.0 for Increments”. It became available in 1.0.3 and 1.1.3. To enable the fast path, set "hbase.increment.fast.but.narrow.consistency" and then rolling restart. The workaround was for increments only (appends, checkAndPut, etc., were not addressed. See HBASE-15031 release note for more detail).
18000
18001 Subsequently, the regression was properly identified and fixed in HBASE-15213 and the fix applied to branch-1.0 and branch-1.1. As it happens, hbase-1.2.0 does not suffer from the performance regression (though the thought was that it did -- and so it got the fast-path patch too via HBASE-15092) nor does the master branch. HBASE-15213 identified that HBASE-12751 (as a side effect) had cured the regression.
18002
18003 hbase-1.0.4 (if it is ever released -- 1.0 has been end-of-lifed) and hbase-1.1.4 will have the HBASE-15213 fix.  If you are suffering from the increment regression and you are on 1.0.3 or 1.1.3, you can enable the work around to get back your increment performance but you should upgrade.
18004
18005
18006 ---
18007
18008 * [HBASE-15046](https://issues.apache.org/jira/browse/HBASE-15046) | *Major* | **Perf test doing all mutation steps under row lock**
18009
18010 In here we perf tested a realignment of the write pipeline and mvcc handling.  Thought was that this work was a predicate for a general fix of HBASE-14460 (turns out, realignment of write path was not needed to fix the increment perf regression). The perf testing here made it so we were able to simplify writing. HBASE-15158 was just committed. This work is done.
18011
18012
18013 ---
18014
18015 * [HBASE-15158](https://issues.apache.org/jira/browse/HBASE-15158) | *Major* | **Change order in which we do write pipeline operations; do all under row locks!**
18016
18017 Changed the write pipeline order; made it more rational, easier-to-reason-about doing all updates to WA, MemStore, and mvcc while read/write rowlock is held where before we'd release after WAL append and then do sync and mvcc.
18018
18019
18020 ---
18021
18022 * [HBASE-15157](https://issues.apache.org/jira/browse/HBASE-15157) | *Major* | **Add \*PerformanceTest for Append, CheckAnd\***
18023
18024 Add append, increment, checkAndMutate, checkAndPut, and checkAndDelete tests to PerformanceEvaluation tool. Below are excerpts from new usage from PE:
18025
18026 ....
18027 Command:
18028  append          Append on each row; clients overlap on keyspace so some concurrent operations
18029  checkAndDelete  CheckAndDelete on each row; clients overlap on keyspace so some concurrent operations
18030  checkAndMutate  CheckAndMutate on each row; clients overlap on keyspace so some concurrent operations
18031  checkAndPut     CheckAndPut on each row; clients overlap on keyspace so some concurrent operations
18032  filterScan      Run scan test using a filter to find a specific row based on it's value (make sure to use --rows=20)
18033  increment       Increment on each row; clients overlap on keyspace so some concurrent operations
18034  randomRead      Run random read test
18035 ....
18036 Examples:
18037 ...
18038  To run 10 clients doing increments over ten rows:
18039  $ bin/hbase org.apache.hadoop.hbase.PerformanceEvaluation --rows=10 --nomapred increment 10
18040
18041 Removed IncrementPerformanceTest. It is not as configurable as the additions made here.
18042
18043
18044 ---
18045
18046 * [HBASE-15218](https://issues.apache.org/jira/browse/HBASE-15218) | *Blocker* | **On RS crash and replay of WAL, loosing all Tags in Cells**
18047
18048 This issue fixes
18049 - In case of normal WAL (Not encrypted) we were loosing all cell tags on WAL replay after an RS crash
18050 - In case of encrypted WAL we were not even persisting Cell tags in WAL.  Tags from all unflushed (to HFile) Cells will get lost even after WAL replay recovery is done.
18051
18052 As we use tags for Cell level security, this fixes 2 security issues
18053  - Cell level visibility labels security breach . Making a visibility restricted cell global readable
18054  - Cell level ACL availability issue.  A user who is cell level authorized to read this cell can not read it. It is a data loss for him.
18055
18056
18057 ---
18058
18059 * [HBASE-15129](https://issues.apache.org/jira/browse/HBASE-15129) | *Major* | **Set default value for hbase.fs.tmp.dir rather than fully depend on hbase-default.xml**
18060
18061 Before HBASE-15129, if somehow hbase-default.xml is not on classpath, default values for hbase.fs.tmp.dir and hbase.bulkload.staging.dir are left empty. After HBASE-15129,  default values of both properties are set to "/user/\<user.name\>/hbase-staging".
18062
18063
18064 ---
18065
18066 * [HBASE-14969](https://issues.apache.org/jira/browse/HBASE-14969) | *Major* | **Add throughput controller for flush**
18067
18068 Adds means of throttling flush throughput. By default there is no limit; we use NoLimitThroughputController. An alternative controller, PressureAwareFlushThroughputController, allows specifying throughput bounds. A new simple factor, flush pressure, influences throughput. See PressureAwareFlushThroughputController.java class for detail.
18069
18070
18071 ---
18072
18073 * [HBASE-11425](https://issues.apache.org/jira/browse/HBASE-11425) | *Major* | **Cell/DBB end-to-end on the read-path**
18074
18075 For E2E off heaped read path, first of all there should be an off heap backed BucketCache(BC). Configure 'hbase.bucketcache.ioengine' to offheap in hbase-site.xml. Also specify the total capacity of the BC using hbase.bucketcache.size config.  Please remember to adjust value of 'HBASE\_OFFHEAPSIZE' in hbase-env.sh as per this capacity. Here-by we specify the max possible off-heap memory allocation for the RS java process. So this should be bigger than the off-heap BC size. Please keep in mind that there is no default for hbase.bucketcache.ioengine which means the BC is turned OFF by default.
18076
18077 Next thing to tune is the ByteBuffer pool in the RPC server side. The buffers from this pool will be used to accumulate the cell bytes and create a result cell block to send back to the client side. 'hbase.ipc.server.reservoir.enabled' can be used to turn this pool ON or OFF. By default this pool is ON and available. HBase will create off heap ByteBuffers and pool them. Please make sure not to turn this OFF if you want E2E off heaping in read path. If this pool is turned off, the server will create temp buffers on heap to accumulate the cell bytes and make a result cell block. This can impact the GC on a highly read loaded server.  The user can tune this pool with respect to how many buffers are in the pool and what should be the size of each ByteBuffer.
18078 Use the config 'hbase.ipc.server.reservoir.initial.buffer.size' to tune each of the buffer sizes. Defaults is 64 KB.
18079
18080 When the read pattern is a random row read and each of the rows are smaller in size compared to this 64 KB, try reducing this. When the result size is larger than one ByteBuffer size, the server will try to grab more than one buffer and make a result cell block out of these.  When the pool is running out of buffers, the server will end up creating temporary on-heap buffers.
18081
18082 The maximum number of ByteBuffers in the pool can be tuned using the config 'hbase.ipc.server.reservoir.initial.max'. Its value defaults to 64 \* region server handlers configured (See the config 'hbase.regionserver.handler.count'). The math is such that by default we consider 2 MB as the result cell block size per read result and each handler will be handling a read. For 2 MB size, we need 32 buffers each of size 64 KB (See default buffer size in pool).  So per handler 32 ByteBuffers(BB). We allocate twice this size as the max BBs count such that one handler can be creating the response and handing it to the RPC Responder thread and then handling a new request creating a new response cell block (using pooled buffers). Even if the responder could not send back the first TCP reply immediately, our count should allow that we should still have enough buffers in our pool without having to make temporary buffers on the heap.  Again for smaller sized random row reads, tune this max count. There are lazily created buffers and the count is the max count to be pooled.
18083
18084 The setting for HBASE\_OFFHEAPSIZE in hbase-env.sh should consider this off heap buffer pool at the RPC side also.  We need to config this max off heap size for RS as a bit higher than the sum of this max pool size and the off heap cache size. The TCP layer will also need to create direct bytebuffers for TCP communication. Also the DFS client will need some off-heap to do its workings especially if short-circuit reads are configured. Allocating an extra of 1 - 2 GB for the max direct memory size has worked in tests.
18085
18086 If you still see GC issues even after making E2E read path off heap, look for issues in the appropriate buffer pool. Check the below RS log with INFO level:
18087
18088   "Pool already reached its max capacity : XXX and no free buffers now. Consider increasing the value for 'hbase.ipc.server.reservoir.initial.max' ?"
18089
18090 If you are using co processors and refer the Cells in the read results, DO NOT store reference to these Cells out of the scope of the CP hook methods. Some times the CPs need store info about the cell (Like its row key) for considering in the next CP hook call etc. For such cases, pls clone the required fields of the entire Cell as per the use cases.  [ See CellUtil#cloneXXX(Cell) APIs ]
18091
18092
18093 ---
18094
18095 * [HBASE-15145](https://issues.apache.org/jira/browse/HBASE-15145) | *Major* | **HBCK and Replication should authenticate to zookepeer using server principal**
18096
18097 Added a new command line argument: --auth-as-server to enable authenticating to ZooKeeper as the HBase Server principal. This is required for secure clusters for doing replication operations like add\_peer, list\_peers, etc until HBASE-11392 is fixed. This advanced option can also be used for manually fixing secure znodes.
18098
18099 Commands can now be invoked like:
18100 hbase --auth-as-server shell
18101 hbase --auth-as-server zkcli
18102
18103 HBCK in secure setup also needs to authenticate to ZK using servers principals.This is turned on by default (no need to pass additional argument).
18104
18105 When authenticating as server, HBASE\_SERVER\_JAAS\_OPTS is concatenated to HBASE\_OPTS if defined in hbase-env.sh. Otherwise, HBASE\_REGIONSERVER\_OPTS is concatenated.
18106
18107
18108 ---
18109
18110 * [HBASE-15125](https://issues.apache.org/jira/browse/HBASE-15125) | *Major* | **HBaseFsck's adoptHdfsOrphan function creates region with wrong end key boundary**
18111
18112 **WARNING: No release note provided for this change.**
18113
18114
18115 ---
18116
18117 * [HBASE-13082](https://issues.apache.org/jira/browse/HBASE-13082) | *Major* | **Coarsen StoreScanner locks to RegionScanner**
18118
18119 After this JIRA we will not be doing any scanner reset after compaction during a course of a scan. The files that were compacted will still be continued to be used in the scan process. The compacted files will be archived by a background thread that runs every 2 mins by default only when there are no active scanners on those comapcted files. The above duration can be controlled using the knob 'hbase.hfile.compactions.cleaner.interval'.
18120
18121
18122 ---
18123
18124 * [HBASE-14865](https://issues.apache.org/jira/browse/HBASE-14865) | *Major* | **Support passing multiple QOPs to SaslClient/Server via hbase.rpc.protection**
18125
18126 With this patch, hbase.rpc.protection can now take multiple comma-separate QOP values. Accepted QOP values remain unchanged and are 'authentication', 'integrity', and 'privacy'. Server or client can use this configuration to specify their preference (in decreasing order) while negotiating QOP.
18127 This feature can be used to upgrade or downgrade QOP in an online cluster without compromising availability (i.e. taking cluster offline). For e.g. to change qop from A to B, typical steps would be:
18128 "A" --\> "B,A" --\> rolling restart --\> "B" --\> rolling restart
18129
18130 Sidenote: Based on experimentation, server's choice is given higher preference than client's choice. i.e. if server's choices are "A,B,C" and client's choices are "B,C,A", both A and B are acceptable, but A is chosen.
18131
18132
18133 ---
18134
18135 * [HBASE-15098](https://issues.apache.org/jira/browse/HBASE-15098) | *Blocker* | **Normalizer switch in configuration is not used**
18136
18137 The config parameter, hbase.normalizer.enabled, has been dropped since it is not used in the code base.
18138
18139
18140 ---
18141
18142 * [HBASE-15111](https://issues.apache.org/jira/browse/HBASE-15111) | *Trivial* | **"hbase version" should write to stdout**
18143
18144 The \`hbase version\` command now outputs directly to stdout rather than to a logger. This change allows the version information to be output consistently regardless of logger configuration. Naturally, this also means the command output ignores all logger configuration. Furthermore, the move from loggers to direct output changes the output of the command to omit metadata commonly included in logger ouput such as a timestamp, log level, and logger name.
18145
18146
18147 ---
18148
18149 * [HBASE-15027](https://issues.apache.org/jira/browse/HBASE-15027) | *Major* | **Refactor the way the CompactedHFileDischarger threads are created**
18150
18151 The property 'hbase.hfile.compactions.discharger.interval' has been renamed to 'hbase.hfile.compaction.discharger.interval' that describes the interval after which the compaction discharger chore service should run.
18152 The property 'hbase.hfile.compaction.discharger.thread.count' describes the thread count that does the compaction discharge work.
18153 The CompactedHFilesDischarger is a chore service now started as part of the RegionServer and this chore service iterates over all the onlineRegions in that RS and uses the RegionServer's executor service to launch a set of threads that does this job of compaction files clean up.
18154
18155
18156 ---
18157
18158 * [HBASE-14468](https://issues.apache.org/jira/browse/HBASE-14468) | *Major* | **Compaction improvements: FIFO compaction policy**
18159
18160 FIFO compaction policy selects only files which have all cells expired. The column family MUST have non-default TTL.
18161 Essentially, FIFO compactor does only one job: collects expired store files.
18162
18163 Because we do not do any real compaction, we do not use CPU and IO (disk and network), we do not evict hot data from a block cache. The result: improved throughput and latency both write and read.
18164 See: https://github.com/facebook/rocksdb/wiki/FIFO-compaction-style
18165
18166
18167 ---
18168
18169 * [HBASE-14888](https://issues.apache.org/jira/browse/HBASE-14888) | *Major* | **ClusterSchema: Add Namespace Operations**
18170
18171 This patch changes the semantic around namespace create/delete/modify when coprocessor asks that the invocation be by-passed. Previous the by-pass was done silently -- the method would just return with no indication as to whether by-pass route had been taken or not.  This patch adds throwing of a BypassCoprocessorException which is thrown if we have been asked to bypass a call.
18172
18173 The bypass facility has been in place since hbase 1.0.0 when namespace creation/deletion, etc.., was originally added in HBASE-8408 (HBASE-15071 is about addressing bypass handling in a general way)
18174
18175
18176 ---
18177
18178 * [HBASE-15018](https://issues.apache.org/jira/browse/HBASE-15018) | *Major* | **Inconsistent way of handling TimeoutException in the rpc client implementations**
18179
18180 When using the new AsyncRpcClient introduced in HBase 1.1.0 (HBASE-12684), time outs now result in an IOException wrapped around a CallTimeoutException instead of a bare CallTimeoutException. This change makes the AsyncRpcClient behave the same as the default HBase 1.y RPC client implementation.
18181
18182
18183 ---
18184
18185 * [HBASE-14796](https://issues.apache.org/jira/browse/HBASE-14796) | *Minor* | **Enhance the Gets in the connector**
18186
18187 spark.hbase.bulkGetSize  in HBaseSparkConf is for grouping bulkGet, and default value is 1000.
18188
18189
18190 ---
18191
18192 * [HBASE-14976](https://issues.apache.org/jira/browse/HBASE-14976) | *Minor* | **Add RPC call queues to the web ui**
18193
18194 Adds column displaying current aggregated call queues size in region server queues tab UI.
18195
18196
18197 ---
18198
18199 * [HBASE-14822](https://issues.apache.org/jira/browse/HBASE-14822) | *Major* | **Renewing leases of scanners doesn't work**
18200
18201 And 1.1, 1.0, and 0.98.
18202
18203
18204 ---
18205
18206 * [HBASE-14205](https://issues.apache.org/jira/browse/HBASE-14205) | *Critical* | **RegionCoprocessorHost System.nanoTime() performance bottleneck**
18207
18208 **WARNING: No release note provided for this change.**
18209
18210
18211 ---
18212
18213 * [HBASE-14978](https://issues.apache.org/jira/browse/HBASE-14978) | *Blocker* | **Don't allow Multi to retain too many blocks**
18214
18215 Limiting the amount of memory resident for any one request allows the server to handle concurrent requests smoothly. To this end we added the ability to limit the size of responses to a multi request. That worked well however it correctly represent the amount of memory resident. So this issue adds on a an approximation of the number of blocks held for a request.
18216
18217 All clients before 1.2.0 will not get this multi request chunking based upon blocks kept. All clients 1.2.0 and after will.
18218
18219
18220 ---
18221
18222 * [HBASE-14951](https://issues.apache.org/jira/browse/HBASE-14951) | *Minor* | **Make hbase.regionserver.maxlogs obsolete**
18223
18224 Rolling WAL events across a cluster can be highly correlated, hence flushing memstores, hence triggering minor compactions, that can be promoted to major ones. These events are highly correlated in time if there is a balanced write-load on the regions in a table. Default value for maximum WAL files (\* hbase.regionserver.maxlogs\*), which controls WAL rolling events - 32 is too small for many modern deployments.
18225 Now we calculate this value dynamically (if not defined by user), using the following formula:
18226
18227 maxLogs = Math.max( 32, HBASE\_HEAP\_SIZE \* memstoreRatio \* 2/ LogRollSize), where
18228
18229 memstoreRatio is \*hbase.regionserver.global.memstore.size\*
18230 LogRollSize is maximum WAL file size (default 0.95 \* HDFS block size)
18231
18232 We need to make sure that we avoid fully or minimize events when RS has to flush memstores prematurely only because it reached artificial limit of hbase.regionserver.maxlogs, this is why we put this 2 x multiplier in equation, this gives us maximum WAL capacity of 2 x RS memstore-size.
18233
18234 Runaway WAL files.
18235
18236 The default log rolling period (1h) allows to accumulate up to 2 X Memstore Size data in a WAL. For heap size - 32G and all other default setting, this gives ~ 26GB of data. Under heavy write load, the number of WAL files can increase dramatically. RegionServer LogRoller will be archiving old WALs periodically. User has three options, either override default hbase.regionserver.maxlogs or override default hbase.regionserver.logroll.period (decrease), or both to control runaway WALs.
18237
18238 For system with bursty write load,  the hbase.regionserver.logroll.period can be decreased to lower value. In this case the maximum number of wal files will be defined by the total size of memstore (unflushed data), not by the hbase.regionserver.maxlogs. But for majority of applications there will be no issues with defaults. Data will be flushed periodically from memstore, the LogRoller will archive old wal files and the system will never reach the new defaults for hbase.regionserver.maxlogs, unless the system is under extreme load for prolonged period of time, but in this case, decreasing hbase.regionserver.logroll.period allows us to control runaway wal files.
18239
18240 The following table gives the new default maximum log files values for several different Region Server heap sizes:
18241
18242 heap    memstore perc   maxLogs
18243 1G              40%                             32
18244 2G              40%                             32
18245 10G             40%                             80
18246 20G             40%                             160
18247 32G             40%                             256
18248
18249
18250 ---
18251
18252 * [HBASE-14984](https://issues.apache.org/jira/browse/HBASE-14984) | *Major* | **Allow memcached block cache to set optimze to false**
18253
18254 Setting hbase.cache.memcached.spy.optimze to true will allow the spy memcached client to try and optimize for the number of requests outstanding. This can increase throughput but can also increase variance for request times.
18255
18256 Setting it to true will help when round trip times are longer.
18257 Setting it to false ( the default ) will help ensure a more even distribution of response times.
18258
18259
18260 ---
18261
18262 * [HBASE-14534](https://issues.apache.org/jira/browse/HBASE-14534) | *Minor* | **Bump yammer/coda/dropwizard metrics dependency version**
18263
18264 Updated yammer metrics to version 3.1.2 (now it's been renamed to dropwizard). API has changed quite a bit, consult https://dropwizard.github.io/metrics/3.1.0/manual/core/ for additional information.
18265
18266 Note that among other things, in yammer 2.2.0 histograms were by default created in non-biased mode (uniform sampling), while in 3.1.0 histograms created via MetricsRegistry.histogram(...) are by default exponentially decayed. This shouldn't affect end users, though.
18267
18268
18269 ---
18270
18271 * [HBASE-14960](https://issues.apache.org/jira/browse/HBASE-14960) | *Major* | **Fallback to using default RPCControllerFactory if class cannot be loaded**
18272
18273 If the configured RPC controller factory (via hbase.rpc.controllerfactory.class) cannot be found in the classpath or loaded, we fall back to using the default RPC controller factory in HBase.
18274
18275
18276 ---
18277
18278 * [HBASE-14946](https://issues.apache.org/jira/browse/HBASE-14946) | *Critical* | **Don't allow multi's to over run the max result size.**
18279
18280 The HBase region server will now send a chunk of get responses to a client if the total response size is too large. This will only be done for clients 1.2.0 and beyond. Older clients by default will have the old behavior.
18281
18282 This patch is for the case where the basic flow is like this:
18283
18284 I want to get a single column from lots of rows. So I create a list of gets. Then I send them to table.get(List\<Get\>). If the regions for that table are spread out then those requests get chunked out to all the region servers. No one regionserver gets too many. However if one region server contains lots of regions for that table then a multi action can contain lots of gets. No single get is too onerous. However the regionserver won't return until every get is complete. So if there are thousands of gets that are sent in one multi then the regionserver can retain lots of data in one thread.
18285
18286
18287 ---
18288
18289 * [HBASE-14906](https://issues.apache.org/jira/browse/HBASE-14906) | *Major* | **Improvements on FlushLargeStoresPolicy**
18290
18291 In HBASE-14906 we use "hbase.hregion.memstore.flush.size/column\_family\_number" as the default threshold for memstore flush instead of the fixed value through "hbase.hregion.percolumnfamilyflush.size.lower.bound" property, which makes  the default threshold more flexible to various use case. We also introduce a new property in name of "hbase.hregion.percolumnfamilyflush.size.lower.bound.min" with 16M as the default value to avoid small flush in cases like hundreds of column families.
18292
18293 After this change setting "hbase.hregion.percolumnfamilyflush.size.lower.bound" in hbase-site.xml won't take effect anymore, but expert users could still set this property in table descriptor to override the default value just as before
18294
18295
18296 ---
18297
18298 * [HBASE-14769](https://issues.apache.org/jira/browse/HBASE-14769) | *Major* | **Remove unused functions and duplicate javadocs from HBaseAdmin**
18299
18300 - Removes functions from HBaseAdmin which require table name parameter as either byte[] or String. Use their counterparts which take TableName instead.
18301 - Removes redundant javadocs from HBaseAdmin as they will be automatically inherited from Admin interface.
18302 - HBaseAdmin is marked Audience.private so it should have been straight forward okay to remove the functions. But HBaseTestingUtility, which is marked Audience.public had a public function returning its instance, which moved this decision into gray area. Discussing in the community, it was decided that it would be okay to do so in this particular case.
18303
18304
18305 ---
18306
18307 * [HBASE-13153](https://issues.apache.org/jira/browse/HBASE-13153) | *Major* | **Bulk Loaded HFile Replication**
18308
18309 This enhances the HBase replication to support replication of bulk loaded data. This is configurable, by default it is set to false which means it will not replicate the bulk loaded data to its peer(s). To enable it set "hbase.replication.bulkload.enabled" to true.
18310
18311 Following are the additional configurations added for this enhancement,
18312  a. hbase.replication.cluster.id - This is manadatory to configure in cluster where replication for bulk loaded data is enabled. A source cluster is uniquely identified by sink cluster using this id. This should be configured in the source cluster configuration file for all the RS.
18313  b. hbase.replication.conf.dir - This represents the directory where all the active cluster's file system client configurations are defined in subfolders corresponding to their respective replication cluster id in peer cluster. This should be configured in the peer cluster configuration file for all the RS. Default is HBASE\_CONF\_DIR.
18314  c. hbase.replication.source.fs.conf.provider - This represents the class which provides the source cluster file system client configuration to peer cluster. This should be configured in the peer cluster configuration file for all the RS. Default is org.apache.hadoop.hbase.replication.regionserver.DefaultSourceFSConfigurationProvider
18315
18316  For example: If source cluster FS client configurations are copied in peer cluster under directory /home/user/dc1/ then  hbase.replication.cluster.id should be configured as dc1 and hbase.replication.conf.dir as /home/user
18317
18318 Note:
18319  a. Any modification to source cluster FS client configuration files in peer cluster side replication configuration directory then it needs to restart all its peer(s) cluster RS with default hbase.replication.source.fs.conf.provider.
18320  b. Only 'xml' type files will be loaded by the default hbase.replication.source.fs.conf.provider.
18321
18322 As part of this we have made following changes to LoadIncrementalHFiles class which is marked as Public and Stable class,
18323  a. Raised the visibility scope of LoadQueueItem class from package private to public.
18324  b. Added a new method loadHFileQueue, which loads the queue of LoadQueueItem into the table as per the region keys provided.
18325
18326
18327 ---
18328
18329 * [HBASE-7171](https://issues.apache.org/jira/browse/HBASE-7171) | *Major* | **Initial web UI for region/memstore/storefiles details**
18330
18331 HBASE-7171 adds 2 new pages to the region server Web UI to ease debugging and provide greater insight into the physical data layout.
18332
18333 Region names in UI table listing all regions (on the RS status page) are now hyperlinks leading to region detail page which shows some aggregate memstore information (currently just memory used) along with the list of all Store Files (HFiles) in the region. Names of Store Files are also hyperlinks leading to Store File detail page, which currently runs 'hbase hfile' command behind the scene and displays statistics about store file.
18334
18335
18336 ---
18337
18338 * [HBASE-14655](https://issues.apache.org/jira/browse/HBASE-14655) | *Blocker* | **Narrow the scope of doAs() calls to region observer notifications for compaction**
18339
18340 Region observer notifications w.r.t. compaction request are now audited with request user through proper scope of doAs() calls.
18341
18342
18343 ---
18344
18345 * [HBASE-14631](https://issues.apache.org/jira/browse/HBASE-14631) | *Blocker* | **Region merge request should be audited with request user through proper scope of doAs() calls to region observer notifications**
18346
18347 Region observer notifications w.r.t. merge request are now audited with request user through proper scope of doAs() calls.
18348
18349
18350 ---
18351
18352 * [HBASE-14605](https://issues.apache.org/jira/browse/HBASE-14605) | *Blocker* | **Split fails due to 'No valid credentials' error when SecureBulkLoadEndpoint#start tries to access hdfs**
18353
18354 When split is requested by non-super user, split related notifications for Coprocessor are executed using the login of the request user.
18355 Previously the notifications were carried out as super user.
18356
18357
18358 ---
18359
18360 * [HBASE-14926](https://issues.apache.org/jira/browse/HBASE-14926) | *Major* | **Hung ThriftServer; no timeout on read from client; if client crashes, worker thread gets stuck reading**
18361
18362 Adds a timeout to server read from clients. Adds new configs hbase.thrift.server.socket.read.timeout for setting read timeout on server socket in milliseconds. Default is 60000;
18363
18364
18365 ---
18366
18367 * [HBASE-14825](https://issues.apache.org/jira/browse/HBASE-14825) | *Minor* | **HBase Ref Guide corrections of typos/misspellings**
18368
18369 Corrections to content of "book.html", which is pulled from various \*.adoc files and \*.xml files.
18370 -- corrects typos/misspellings
18371 -- corrects incorrectly formatted links
18372
18373
18374 ---
18375
18376 * [HBASE-14821](https://issues.apache.org/jira/browse/HBASE-14821) | *Major* | **CopyTable should allow overriding more config properties for peer cluster**
18377
18378 Configuration properties for org.apache.hadoop.hbase.mapreduce.TableOutputFormat can now be overridden by prefixing the property keys with "hbase.mapred.output.".  When the configuration is applied to TableOutputFormat, these entries will be rewritten with the prefix removed -- ie. "hbase.mapred.output.hbase.security.authentication" becomes "hbase.security.authentication".  This can be useful when directing output to a peer cluster with different security configuration, for example.
18379
18380
18381 ---
18382
18383 * [HBASE-14799](https://issues.apache.org/jira/browse/HBASE-14799) | *Critical* | **Commons-collections object deserialization remote command execution vulnerability**
18384
18385 This issue resolves a potential security vulnerability. For all versions we update our commons-collections dependency to the release that fixes the reported vulnerability in that library. In 0.98 we additionally disable by default a feature of code carried from 0.94 for backwards compatibility that is not needed.
18386
18387
18388 ---
18389
18390 * [HBASE-12751](https://issues.apache.org/jira/browse/HBASE-12751) | *Major* | **Allow RowLock to be reader writer**
18391
18392 Locks on row are now reader/writer rather than exclusive.
18393
18394 Moves sequenceid out of HRegion and into MVCC class; MVCC is now in charge. A WAL append is still stamped in same way (we pass MVCC context in a few places where we previously we did not).
18395
18396 MVCC methods cleaned up. Make a bit more sense now. Less of them.
18397
18398 Simplifies our update of MemStore/WAL. Now we update memstore AFTER we add to WAL (but before we sync). This fixes possible dataloss when two edits came in with same coordinates; we could order the edits in memstore differently to how they arrived in the WAL.
18399
18400 Marked as an incompatible change because it breaks Distributed Log Replay, a feature we'd determined already was unreliable and to be removed.
18401
18402
18403 ---
18404
18405 * [HBASE-14793](https://issues.apache.org/jira/browse/HBASE-14793) | *Major* | **Allow limiting size of block into L1 block cache.**
18406
18407 Very large blocks can fragment the heap and cause bad issues for the garbage collector, especially the G1GC. Now there is a maximum size that a block can be and still stick in the LruBlockCache. That size defaults to 16mb but can be controlled by changing "hbase.lru.max.block.size"
18408
18409
18410 ---
18411
18412 * [HBASE-14387](https://issues.apache.org/jira/browse/HBASE-14387) | *Major* | **Compaction improvements: Maximum off-peak compaction size**
18413
18414 New configuration option: hbase.hstore.compaction.max.size.offpeak - maximum selection size eligible for minor compaction during off peak hours.
18415 hbase.hstore.compaction.max.size - this is default maximum if no off-peak hours are defined or if no maximum off-peak maximum size is defined.
18416
18417
18418 ---
18419
18420 * [HBASE-12822](https://issues.apache.org/jira/browse/HBASE-12822) | *Minor* | **Option for Unloading regions through region\_mover.rb without Acknowledging**
18421
18422 Incorporated in HBASE-13014.
18423
18424
18425 ---
18426
18427 * [HBASE-14700](https://issues.apache.org/jira/browse/HBASE-14700) | *Major* | **Support a "permissive" mode for secure clusters to allow "simple" auth clients**
18428
18429 Secure HBase now supports a permissive mode to allow mixed secure and insecure clients.  This allows clients to be incrementally migrated over to a secure configuration.  To enable clients to continue to connect using SIMPLE authentication when the cluster is configured for security, set "hbase.ipc.server.fallback-to-simple-auth-allowed" equal to "true" in hbase-site.xml.  NOTE: This setting should ONLY be used as a temporary measure while converting clients over to secure authentication.  It MUST BE DISABLED for secure operation.
18430
18431
18432 ---
18433
18434 * [HBASE-14257](https://issues.apache.org/jira/browse/HBASE-14257) | *Major* | **Periodic flusher only handles hbase:meta, not other system tables**
18435
18436 Memstore periodic flusher used to flush META table every 5 minutes but not any other system tables. This jira extends it to flush all system tables within this time period.
18437
18438
18439 ---
18440
18441 * [HBASE-14658](https://issues.apache.org/jira/browse/HBASE-14658) | *Major* | **Allow loading a MonkeyFactory by class name**
18442
18443 You can specify one of the predefined set of Monkeys when you run Integration Tests by passing the -m\|--monkey arguments on the command line; e.g -m CALM or -m SLOW\_DETERMINISTIC
18444
18445 This patch  makes it so you can pass the name of a class as the monkey to run: e.g. -m org.example.KingKong
18446
18447
18448 ---
18449
18450 * [HBASE-14521](https://issues.apache.org/jira/browse/HBASE-14521) | *Major* | **Unify the semantic of hbase.client.retries.number**
18451
18452 After this change, hbase.client.reties.number universally means the number of retry which is one less than total tries number,  for both non-batch operations like get/scan/increment etc. which uses RpcRetryingCallerImpl#callWithRetries to submit the call or batch operations like put through AsyncProcess#submit.
18453
18454 Note that previously this property means total tries number for puts, so please adjust the setting of its value if necessary. Please also be cautious when setting it to zero since retry is necessary for client cache update when region move happens.
18455
18456
18457 ---
18458
18459 * [HBASE-13819](https://issues.apache.org/jira/browse/HBASE-13819) | *Major* | **Make RPC layer CellBlock buffer a DirectByteBuffer**
18460
18461 For master branch(2.0 version), the BoundedByteBufferPool always create Direct (off heap) ByteBuffers and return that.
18462 For branch-1(1.3 version), byte default the buffers returned will be off heap. This can be changed to return on heap ByteBuffers by configuring 'hbase.ipc.server.reservoir.direct.buffer' to false.
18463
18464
18465 ---
18466
18467 * [HBASE-14517](https://issues.apache.org/jira/browse/HBASE-14517) | *Minor* | **Show regionserver's version in master status page**
18468
18469 Adds server version to the listing of regionservers on the master home page.
18470
18471 if a cluster where the versions deviate, at the bottom of the 'Version' column on the master home page listing of 'Region Servers', you will see a note in red that says something like: 'Total:10              9 nodes with inconsistent version'
18472
18473
18474 ---
18475
18476 * [HBASE-12911](https://issues.apache.org/jira/browse/HBASE-12911) | *Major* | **Client-side metrics**
18477
18478 Introduces collection and reporting of various client-perceived metrics. Metrics are exposed via JMX under "org.apache.hadoop.hbase.client.MetricsConnection". Metrics are scoped according to connection instance, so multiple connection objects (ie, to different clusters) will report their metrics separately. Metrics are disabled by default, must be enabled by configuring "hbase.client.metrics.enable=true".
18479
18480
18481 ---
18482
18483 * [HBASE-14529](https://issues.apache.org/jira/browse/HBASE-14529) | *Major* | **Respond to SIGHUP to reload config**
18484
18485 HBase daemons can now be signaled to reload their config by sending SIGHUP to the java process. Not all config parameters can be reloaded.
18486
18487 In order for this new feature to work the hbase-daemon.sh script was changed to use disown rather than nohup. Functionally this shouldn't change anything but the processes will have a different parent when being run from a connected login shell.
18488
18489
18490 ---
18491
18492 * [HBASE-14502](https://issues.apache.org/jira/browse/HBASE-14502) | *Major* | **Purge use of jmock and remove as dependency**
18493
18494 HBASE-14502 Purge use of jmock and remove as dependency
18495
18496
18497 ---
18498
18499 * [HBASE-14544](https://issues.apache.org/jira/browse/HBASE-14544) | *Major* | **Allow HConnectionImpl to not refresh the dns on errors**
18500
18501 By setting hbase.resolve.hostnames.on.failure to false you can reduce the number of dns name resolutions that a client will do. However if machines leave and come back with different ip's the changes will not be noticed by the clients. So only set hbase.resolve.hostnames.on.failure to false if your cluster dns is not changing while clients are connected.
18502
18503
18504 ---
18505
18506 * [HBASE-14367](https://issues.apache.org/jira/browse/HBASE-14367) | *Major* | **Add normalization support to shell**
18507
18508 This patch adds shell support for region normalizer (see HBASE-13103).
18509
18510 3 commands have been added to hbase shell 'tools' command group (modeled on how the balancer works):
18511
18512  - 'normalizer\_enabled' checks whether region normalizer is turned on
18513  - 'normalizer\_switch' allows user to turn normalizer on and off
18514  - 'normalize' runs region normalizer if it's turned on.
18515
18516 Also 'alter' command has been extended to allow user to enable/disable region normalization per table (disabled by default). Use it as
18517
18518 alter 'testtable', {NORMALIZATION\_MODE =\> 'true'}
18519
18520 Here is the help for the normalize command:
18521
18522 {code}
18523 hbase(main):008:0\> help 'normalize'
18524 Trigger region normalizer for all tables which have NORMALIZATION\_MODE flag set. Returns true
18525  if normalizer ran successfully, false otherwise. Note that this command has no effect
18526  if region normalizer is disabled (make sure it's turned on using 'normalizer\_switch' command).
18527
18528  Examples:
18529
18530    hbase\> normalize
18531 {code}
18532
18533
18534 ---
18535
18536 * [HBASE-14475](https://issues.apache.org/jira/browse/HBASE-14475) | *Major* | **Region split requests are always audited with "hbase" user rather than request user**
18537
18538 Region observer notifications w.r.t. split request are now audited with request user through proper scope of doAs() calls.
18539
18540
18541 ---
18542
18543 * [HBASE-14230](https://issues.apache.org/jira/browse/HBASE-14230) | *Minor* | **replace reflection in FSHlog with HdfsDataOutputStream#getCurrentBlockReplication()**
18544
18545 Remove calling getNumCurrentReplicas on HdfsDataOutputStream via reflection. getNumCurrentReplicas showed up in hadoop 1+ and hadoop 0.2x. In hadoop-2 it was deprecated.
18546
18547
18548 ---
18549
18550 * [HBASE-14495](https://issues.apache.org/jira/browse/HBASE-14495) | *Major* | **TestHRegion#testFlushCacheWhileScanning goes zombie**
18551
18552 The WAL append was changed by HBASE-12751. Every append now sets a latch on an edit. The latch needs to be cleared or else the WAL will hang. The original failures in TestHRegion turned up 'holes' where we were failing to throw the latch if we skipped out early because we were interrupted. Other 'holes' were found where we had mocked up a WAL so the latch would just stay in place.  Futher holes were found appending WAL markers... here we were skipping the mvcc completely for a few edits.  A clean up of WALUtils made all markers take the same code paths.
18553
18554
18555 ---
18556
18557 * [HBASE-14280](https://issues.apache.org/jira/browse/HBASE-14280) | *Minor* | **Bulk Upload from HA cluster to remote HA hbase cluster fails**
18558
18559 Patch will effectively work with Hadoop version 2.6 or greater with a launch of "internal.nameservices".
18560 There will be no change in versions older than 2.6.
18561
18562
18563 ---
18564
18565 * [HBASE-14334](https://issues.apache.org/jira/browse/HBASE-14334) | *Major* | **Move Memcached block cache in to it's own optional module.**
18566
18567 Move external block cache to it's own module. This  will reduce dependencies for people who use hbase-server.
18568 Currently Memcached is the reference implementation for external block cache. External block caches allow HBase to take advantage of other more complex caches that can live longer than the HBase regionserver process and are not necessarily tied to a single computer
18569     life time. However external block caches add in extra operational overhead.
18570
18571
18572 ---
18573
18574 * [HBASE-14433](https://issues.apache.org/jira/browse/HBASE-14433) | *Major* | **Set down the client executor core thread count from 256 in tests**
18575
18576 Tests run with client executors that have core thread count of 4 and a keepalive of 3 seconds. They used to default to 256 core threads and 60 seconds  for keepalive.
18577
18578
18579 ---
18580
18581 * [HBASE-14400](https://issues.apache.org/jira/browse/HBASE-14400) | *Critical* | **Fix HBase RPC protection documentation**
18582
18583 To use rpc protection in HBase, set the value of 'hbase.rpc.protection' to:
18584 'authentication' : simple authentication using kerberos
18585 'integrity' : authentication and integrity
18586 'privacy' : authentication and confidentiality
18587
18588 Earlier, HBase reference guide erroneously mentioned in some places to set the value to 'auth-conf'. This patch fixes the guide and adds temporary support for erroneously recommended values.
18589
18590
18591 ---
18592
18593 * [HBASE-14306](https://issues.apache.org/jira/browse/HBASE-14306) | *Major* | **Refine RegionGroupingProvider: fix issues and make it more scalable**
18594
18595 In HBASE-14306 we've changed default strategy of RegionGroupingProvider from "identify" to "bounded", so it's required to explicitly set "hbase.wal.regiongrouping.strategy" to "identify" if user still wants to use one WAL per region
18596
18597 Please also notice that in the new framework there will be one WAL per group, and the region-group mapping is decided by RegionGroupingStrategy. Accordingly, we've removed BoundedRegionGroupingProvider and added BoundedRegionGroupingStrategy as a replacement. If you already have a customized class for hbase.wal.regiongrouping.strategy, please check the new logic and make updates if necessary.
18598
18599
18600 ---
18601
18602 * [HBASE-6617](https://issues.apache.org/jira/browse/HBASE-6617) | *Major* | **ReplicationSourceManager should be able to track multiple WAL paths**
18603
18604 ReplicationSourceManager now could track multiple wal paths. Notice that although most changes are internal and all metrics names remain the same, signature of below methods in MetricsSource are changed:
18605
18606 1. refreshAgeOfLastShippedOp now requires a String parameter which indicates the wal group id of the reporter
18607 2. setAgeOfLastShippedOp also adds a String parameter for wal group id
18608
18609
18610 ---
18611
18612 * [HBASE-14314](https://issues.apache.org/jira/browse/HBASE-14314) | *Major* | **Metrics for block cache should take region replicas into account**
18613
18614 The following metrics for primary region replica are added:
18615
18616 blockCacheHitCountPrimary
18617 blockCacheMissCountPrimary
18618 blockCacheEvictionCountPrimary
18619
18620
18621 ---
18622
18623 * [HBASE-14317](https://issues.apache.org/jira/browse/HBASE-14317) | *Blocker* | **Stuck FSHLog: bad disk (HDFS-8960) and can't roll WAL**
18624
18625 Tighten up WAL-use semantic.
18626
18627 1. If an append or a sync throws an exception, all subsequent attempts at using the log will also throw this same exception. The WAL is now a lame-duck until you roll it.
18628 2. If a successful append, and then we fail to sync the append, this is a fatal exception. The container must abort to replay the WAL logs even though we have told the client that the appends failed.
18629
18630 The above rules have been applied laxly up to this; it used to be possible to get a good sync to go in over the top of a failed append. This has been fixed in this patch.
18631
18632 Also fixed a hang in the WAL subsystem if a request to pause the write pipeline took on a failed sync. before the roll requests sync got scheduled.
18633
18634
18635 TODO: Revisit our WAL system. HBASE-12751 helps rationalize our write pipeline. In particular, it manages sequenceid inside mvcc which should make it so we can purge mechanism that writes empty, unflushed appends just to get the next sequenceid... problematic when WAL goes lame-duck. Lets get it in.
18636 TODO: A successful append followed by a failed sync probably only needs us replace the WAL (if we have signalled the client that the appends failed). Bummer is that replicating, these last appends might make it to the sink cluster or get replayed during recovery. HBase should keep its own WAL length? Or sequenceid of last successful sync should be passed when doing recovery and replication?
18637
18638
18639 ---
18640
18641 * [HBASE-14261](https://issues.apache.org/jira/browse/HBASE-14261) | *Major* | **Enhance Chaos Monkey framework by adding zookeeper and datanode fault injections.**
18642
18643 This change augments existing chaos monkey framework with actions for restarting underlying zookeeper quorum and hdfs nodes of distributed hbase cluster. One assumption made while creating zk actions are that zookeper ensemble is an independent external service and won't be managed by hbase cluster.  For these actions to work as expected, the following parameters need to be configured appropriately.
18644
18645 {code}
18646 \<property\>
18647   \<name\>hbase.it.clustermanager.hadoop.home\</name\>
18648   \<value\>$HADOOP\_HOME\</value\>
18649 \</property\>
18650 \<property\>
18651   \<name\>hbase.it.clustermanager.zookeeper.home\</name\>
18652   \<value\>$ZOOKEEPER\_HOME\</value\>
18653 \</property\>
18654 \<property\>
18655   \<name\>hbase.it.clustermanager.hbase.user\</name\>
18656   \<value\>hbase\</value\>
18657 \</property\>
18658 \<property\>
18659   \<name\>hbase.it.clustermanager.hadoop.hdfs.user\</name\>
18660   \<value\>hdfs\</value\>
18661 \</property\>
18662 \<property\>
18663   \<name\>hbase.it.clustermanager.zookeeper.user\</name\>
18664   \<value\>zookeeper\</value\>
18665 \</property\>
18666 {code}
18667
18668 The service user related configurations are newly introduced since in prod/test environments each service is managed by different user. Once the above parameters are configured properly, you can start using them as needed. An example usage for invoking these new actions is:
18669
18670 {{./hbase org.apache.hadoop.hbase.IntegrationTestAcidGuarantees -m serverAndDependenciesKilling}}
18671
18672
18673 ---
18674
18675 * [HBASE-14309](https://issues.apache.org/jira/browse/HBASE-14309) | *Major* | **Allow load balancer to operate when there is region in transition by adding force flag**
18676
18677 This issue adds boolean parameter, force, to 'balancer' command so that admin can force region balancing even when there is region (other than hbase:meta) in transition - assuming RIT being transient.
18678 If hbase:meta is in transition, balancer command returns false.
18679
18680 WARNING: For experts only. Forcing a balance may do more damage than repair when assignment is confused
18681 Note: enclose the force parameter in double quotes
18682
18683
18684 ---
18685
18686 * [HBASE-14313](https://issues.apache.org/jira/browse/HBASE-14313) | *Critical* | **After a Connection sees ConnectionClosingException it never recovers**
18687
18688 HConnection could get stuck when talking to a host that went down and then returned. This has been fixed by closing the connection in all paths.
18689
18690
18691 ---
18692
18693 * [HBASE-13339](https://issues.apache.org/jira/browse/HBASE-13339) | *Blocker* | **Update default Hadoop version to latest for master**
18694
18695 Master/2.0.0 now builds on the latest stable hadoop by default.
18696
18697
18698 ---
18699
18700 * [HBASE-14224](https://issues.apache.org/jira/browse/HBASE-14224) | *Critical* | **Fix coprocessor handling of duplicate classes**
18701
18702 Prevent Coprocessors being doubly-loaded; a particular coprocessor can only be loaded once.
18703
18704
18705 ---
18706
18707 * [HBASE-13127](https://issues.apache.org/jira/browse/HBASE-13127) | *Major* | **Add timeouts on all tests so less zombie sightings**
18708
18709 Use junit facility to impose timeout on test. Use test category to chose which timeout to apply: small tests timeout after 30 seconds, medium tests after 180 seconds, and large tests after ten minutes.
18710
18711 Updated junit version from 4.11 to 4.12. 4.12 has support for feature used here.
18712
18713 Add this at the head of your junit4 class to add a category-based timeout:
18714
18715 {code}
18716 @Rule public final TestRule timeout =   CategoryBasedTimeout.builder().withTimeout(this.getClass()).
18717       withLookingForStuckThread(true).build();
18718 {code}
18719
18720 For example:
18721
18722
18723 ---
18724
18725 * [HBASE-14148](https://issues.apache.org/jira/browse/HBASE-14148) | *Major* | **Web UI Framable Page**
18726
18727 Security fix: Adds protection from clickjacking using X-Frame-Options header.
18728 This will prevent use of HBase UI in frames. To disable this feature, set the configuration 'hbase.http.filter.xframeoptions.mode' to 'ALLOW' (default is 'DENY').
18729
18730
18731 ---
18732
18733 * [HBASE-10844](https://issues.apache.org/jira/browse/HBASE-10844) | *Major* | **Coprocessor failure during batchmutation leaves the memstore datastructs in an inconsistent state**
18734
18735 Promotes an -ea assert to logged FATAL and RS abort when memstore is found to be in an inconsistent state.
18736
18737
18738 ---
18739
18740 * [HBASE-13966](https://issues.apache.org/jira/browse/HBASE-13966) | *Minor* | **Limit column width in table.jsp**
18741
18742 Wraps region, start key, end key columns if too long.
18743
18744
18745 ---
18746
18747 * [HBASE-13706](https://issues.apache.org/jira/browse/HBASE-13706) | *Minor* | **CoprocessorClassLoader should not exempt Hive classes**
18748
18749 Starting from HBase 2.0, CoprocessorClassLoader will not exempt hadoop classes or zookeeper classes.  This means that if the custom coprocessor jar contains hadoop or zookeeper packages and classes, they will be loaded by the CoprocessorClassLoader.  Only hbase packages and classes  are exempted from the CoprocessorClassLoader. They (and their dependencies) are loaded by the parent server class loader.
18750
18751
18752 ---
18753
18754 * [HBASE-14054](https://issues.apache.org/jira/browse/HBASE-14054) | *Major* | **Acknowledged writes may get lost if regionserver clock is set backwards**
18755
18756 In {{checkAndPut}} write path use max(max timestamp for the row, System.currentTimeMillis()) in the, instead of blindly taking System.currentTimeMillis() to ensure that checkAndPut() cannot do writes which is already eclipsed. This is similar to what has been done in HBASE-12449 for increment and append.
18757
18758
18759 ---
18760
18761 * [HBASE-13985](https://issues.apache.org/jira/browse/HBASE-13985) | *Minor* | **Add configuration to skip validating HFile format when bulk loading**
18762
18763 A new config, hbase.loadincremental.validate.hfile , is introduced - default to true
18764 When set to false, checking hfile format is skipped during bulkloading.
18765
18766
18767 ---
18768
18769 * [HBASE-14201](https://issues.apache.org/jira/browse/HBASE-14201) | *Major* | **hbck should not take a lock unless fixing errors**
18770
18771 HBCK no longer takes a lock until there are changes to the cluster being made.
18772
18773 The old behavior can be achieved by passing the -exclusive flag.
18774
18775
18776 ---
18777
18778 * [HBASE-14081](https://issues.apache.org/jira/browse/HBASE-14081) | *Minor* | **(outdated) references to SVN/trunk in documentation**
18779
18780 HBASE-14081 Remove (outdated) references to SVN/trunk from documentation
18781
18782
18783 ---
18784
18785 * [HBASE-13865](https://issues.apache.org/jira/browse/HBASE-13865) | *Trivial* | **Increase the default value for hbase.hregion.memstore.block.multipler from 2 to 4 (part 2)**
18786
18787 Increase default hbase.hregion.memstore.block.multiplier from 2 to 4 in the code to match the default value in the config files.
18788
18789
18790 ---
18791
18792 * [HBASE-12295](https://issues.apache.org/jira/browse/HBASE-12295) | *Major* | **Prevent block eviction under us if reads are in progress from the BBs**
18793
18794 We try to delay the eviction of the block till the cellblocks are formed at the Rpc layer. A simple reference counting mechanism is introduced when ever a block is accessed from the Bucket cache.  Once a scanner completes using a block the reference count is decremented.  The eviction of the block happens only when the reference count of that block is 0.
18795 We also introduce a concept of ShareableMemory based on the type of blocks we create from the Block cache. The blocks from the ByteBufferIOEngine directly refer to the buckets in offheap and such blocks are marked SHARED memory type. The blocks from LRU, HDFS and file mode of Bucket cache are all marked EXCLUSIVE because these blocks have their own exclusive memory.
18796 For the CP case, any cell coming out of SHARED memory block is copied before returning the results, because CPs can use the results as its state so that eviction cannot corrupt the results.
18797
18798
18799 ---
18800
18801 * [HBASE-11339](https://issues.apache.org/jira/browse/HBASE-11339) | *Major* | **HBase MOB**
18802
18803 The Moderate Object Storage (MOB) feature (HBASE-11339[1]) is modified I/O and compaction path that allows individual moderately sized values (100KB-10MB) to be stored in a way that write amplification is reduced when compared to the normal I/O path. MOB is defined in the column family and it is almost isolated with other components, the features and performance cannot be effected in normal columns.
18804
18805 For more details on how to use the feature please consult the HBase Reference Guide
18806
18807
18808 ---
18809
18810 * [HBASE-13954](https://issues.apache.org/jira/browse/HBASE-13954) | *Major* | **Remove HTableInterface#getRowOrBefore related server side code**
18811
18812 Removed Table#getRowOrBefore, Region#getClosestRowBefore, Store#getRowKeyAtOrBefore, RemoteHTable#getRowOrBefore apis and Thrift support for getRowOrBefore.
18813 Also removed two coprocessor hooks preGetClosestRowBefore and postGetClosestRowBefore.
18814 User using this api can instead use reverse scan something like below,
18815 {code}
18816  Scan scan = new Scan(row);
18817   scan.setSmall(true);
18818   scan.setCaching(1);
18819   scan.setReversed(true);
18820   scan.addFamily(family);
18821 {code}
18822 pass this scan object to the scanner and retrieve the first Result from scanner output.
18823
18824
18825 ---
18826
18827 * [HBASE-12296](https://issues.apache.org/jira/browse/HBASE-12296) | *Major* | **Filters should work with ByteBufferedCell**
18828
18829 Change to support offheaping.
18830
18831 Incompatible change for filters ColumnPrefixFilter and MultipleColumnPrefixFilter
18832
18833 Changes parameters to filterColumn so takes a Cell rather than a byte [].
18834
18835 hbase-client-1.2.7-SNAPSHOT.jar, ColumnPrefixFilter.class
18836 package org.apache.hadoop.hbase.filter
18837 ColumnPrefixFilter.filterColumn ( byte[ ] buffer, int qualifierOffset, int qualifierLength )  :  Filter.ReturnCode
18838 org/apache/hadoop/hbase/filter/ColumnPrefixFilter.filterColumn:([BII)Lorg/apache/hadoop/hbase/filter/Filter$ReturnCode;
18839
18840 Ditto for filterColumnValue in SingleColumnValueFilter. Takes a Cell instead of byte array.
18841
18842
18843 ---
18844
18845 * [HBASE-14045](https://issues.apache.org/jira/browse/HBASE-14045) | *Major* | **Bumping thrift version to 0.9.2.**
18846
18847 This changes upgrades thrift dependency of HBase to 0.9.2. Though this doesn't break any HBase compatibility promises, it might impact any downstream projects that share thrift dependency with HBase.
18848
18849
18850 ---
18851
18852 * [HBASE-14027](https://issues.apache.org/jira/browse/HBASE-14027) | *Major* | **Clean up netty dependencies**
18853
18854 HBase's convenience binary artifact no longer contains the netty 3.2.4 jar . This jar was not directly used by HBase, but may have been relied on by downstream applications.
18855
18856
18857 ---
18858
18859 * [HBASE-7782](https://issues.apache.org/jira/browse/HBASE-7782) | *Minor* | **HBaseTestingUtility.truncateTable() not acting like CLI**
18860
18861 HBaseTestingUtility now uses the truncate API added in HBASE-8332 so that calls to HBTU.truncateTable will behave like the shell command: effectively dropping the table and recreating a new one with the same split points.
18862
18863 Previously, HBTU.truncateTable instead issued deletes for all the data already in the table. If you wish to maintain the same behavior, you should use the newly added HBTU.deleteTableData method.
18864
18865
18866 ---
18867
18868 * [HBASE-14047](https://issues.apache.org/jira/browse/HBASE-14047) | *Major* | **Cleanup deprecated APIs from Cell class**
18869
18870 The following API from Cell (which were deprecated since past few major versions) are removed now.
18871 getRow
18872 getFamily
18873 getQualifier
18874 getValue
18875 getMvccVersion
18876 The above apis can be replaced with their respective CellUtil#cloneXXX (allocates a copy) or Cell#getXXXArray (essentially just returns a pointer) based on the use case.
18877
18878
18879 ---
18880
18881 * [HBASE-14029](https://issues.apache.org/jira/browse/HBASE-14029) | *Major* | **getting started for standalone still references hadoop-version-specific binary artifacts**
18882
18883 HBASE-14029 Correct documentation for Hadoop version specific artifacts
18884
18885
18886 ---
18887
18888 * [HBASE-13849](https://issues.apache.org/jira/browse/HBASE-13849) | *Major* | **Remove restore and clone snapshot from the WebUI**
18889
18890 The HBase master status web page no longer allows operators to clone snapshots nor restore snapshots.
18891
18892
18893 ---
18894
18895 * [HBASE-13646](https://issues.apache.org/jira/browse/HBASE-13646) | *Major* | **HRegion#execService should not try to build incomplete messages**
18896
18897 When RegionServerCoprocessors throw an exception we will no longer attempt to build an incomplete RPC response message. Instead, the response message will be null.
18898
18899
18900 ---
18901
18902 * [HBASE-13639](https://issues.apache.org/jira/browse/HBASE-13639) | *Major* | **SyncTable - rsync for HBase tables**
18903
18904 Tool to sync two tables that tries to send the differences only like rsync.
18905
18906 Adds two new MapReduce jobs, SyncTable and HashTable. See usage for these jobs on how to use. See design doc for generally overview: https://docs.google.com/document/d/1-2c9kJEWNrXf5V4q\_wBcoIXfdchN7Pxvxv1IO6PW0-U/edit
18907
18908 From comments below, "It can be challenging to run against a table getting live writes, if those writes are updates/overwrites. In general, you can run it against a time range to ignore new writes, but if those writes update existing cells, then the time range scan may or may not see older versions of those cells depending on whether major compaction has happened, which may be different in remote clusters."
18909
18910
18911 ---
18912
18913 * [HBASE-13895](https://issues.apache.org/jira/browse/HBASE-13895) | *Critical* | **DATALOSS: Region assigned before WAL replay when abort**
18914
18915 If the master went to assign a region concurrent with a RegionServer abort, the returned RegionServerAbortedException was being handled as though the region had been cleanly offlined so assign was allowed proceed. If the region was opened in its new location before WAL replay completion, the replayed edits were ignored, worst case, or were later played over the top of edits that had come in since open and so susceptible to overwrite. In either case, DATALOSS.
18916
18917
18918 ---
18919
18920 * [HBASE-13983](https://issues.apache.org/jira/browse/HBASE-13983) | *Minor* | **Doc how the oddball HTable methods getStartKey, getEndKey, etc. will be removed in 2.0.0**
18921
18922 Adds extra doc on getStartKeys, getEndKeys, and getStartEndKeys in HTable explaining that they will be removed in 2.0.0 (these methods did not get the proper full major version deprecation cycle).
18923
18924 In this issue, we actually also remove these methods in master/2.0.0 branch.
18925
18926
18927 ---
18928
18929 * [HBASE-13747](https://issues.apache.org/jira/browse/HBASE-13747) | *Critical* | **Promote Java 8 to "yes" in support matrix**
18930
18931 Java 8 is considered supported and tested as of HBase 1.2+
18932
18933
18934 ---
18935
18936 * [HBASE-13959](https://issues.apache.org/jira/browse/HBASE-13959) | *Critical* | **Region splitting uses a single thread in most common cases**
18937
18938 The performance of region splitting has been improved by using a thread pool to split the store files concurrently. Prior to this change, the store files were always split sequentially in a single thread, so a region with multiple store files ended up taking several seconds. The thread pool is sized dynamically with the aim of getting maximum concurrency, without exceeding the number of cores available for HBase Java process. A lower limit for the thread pool can be explicitly set using the property hbase.regionserver.region.split.threads.max.
18939
18940
18941 ---
18942
18943 * [HBASE-13930](https://issues.apache.org/jira/browse/HBASE-13930) | *Major* | **Exclude Findbugs packages from shaded jars**
18944
18945 Exclude Findbugs packages from shaded jars
18946
18947
18948 ---
18949
18950 * [HBASE-13214](https://issues.apache.org/jira/browse/HBASE-13214) | *Major* | **Remove deprecated and unused methods from HTable class**
18951
18952 **WARNING: No release note provided for this change.**
18953
18954
18955 ---
18956
18957 * [HBASE-13869](https://issues.apache.org/jira/browse/HBASE-13869) | *Trivial* | **Fix typo in HBase book**
18958
18959 Fix typo in HBase book
18960
18961
18962 ---
18963
18964 * [HBASE-13938](https://issues.apache.org/jira/browse/HBASE-13938) | *Major* | **Deletes done during the region merge transaction may get eclipsed**
18965
18966 Use the master's timestamp when sending hbase:meta edits on region merge to ensure proper ordering of new region addition and old region deletes.
18967
18968
18969 ---
18970
18971 * [HBASE-13898](https://issues.apache.org/jira/browse/HBASE-13898) | *Minor* | **correct additional javadoc failures under java 8**
18972
18973 Correct Javadoc generation errors
18974
18975
18976 ---
18977
18978 * [HBASE-13103](https://issues.apache.org/jira/browse/HBASE-13103) | *Major* | **[ergonomics] add region size balancing as a feature of master**
18979
18980 This patch adds optional ability for HMaster to normalize regions in size (disabled by default, change hbase.normalizer.enabled property to true to turn it on). If enabled, HMaster periodically (every 30 minutes by default) monitors tables for which normalization is enabled in table configuration and performs splits/merges as seems appropriate. Users may implement their own normalization strategies by implementing RegionNormalizer interface and configuring it in hbase-site.xml.
18981
18982
18983 ---
18984
18985 * [HBASE-13900](https://issues.apache.org/jira/browse/HBASE-13900) | *Minor* | **duplicate methods between ProtobufMagic and ProtobufUtil**
18986
18987 Use ProtobufMagic methods in ProtobufUtil
18988
18989
18990 ---
18991
18992 * [HBASE-13843](https://issues.apache.org/jira/browse/HBASE-13843) | *Trivial* | **Fix internal constant text in ReplicationManager.java**
18993
18994 In previous versions of HBase, the ReplicationAdmin utility erroneously used the string key "columnFamlyName" when listing replicated column families. It now uses the corrected spelling of "columnFamilyName" (note the added "i").
18995
18996 Downstream code that parsed the replication entries returned from listReplicated will need to be updated to use the new key. Previously compiled code that relied on the static CFNAME member of ReplicationAdmin will need to be recompiled in order to see the updated value.
18997
18998
18999 ---
19000
19001 * [HBASE-13886](https://issues.apache.org/jira/browse/HBASE-13886) | *Major* | **Return empty value when the mob file is corrupt instead of throwing exceptions**
19002
19003 By default the Get/Scan will throw Exception when it is not able to find a mob cell because the mob file is missing/corrupted. This jira adds a facility to continue scan/get and get other cells with mob cell value as empty. Set an attribute MobConstants.EMPTY\_VALUE\_ON\_MOBCELL\_MISS = true in Scan/Get for getting this behaviour
19004
19005
19006 ---
19007
19008 * [HBASE-13686](https://issues.apache.org/jira/browse/HBASE-13686) | *Major* | **Fail to limit rate in RateLimiter**
19009
19010 As per this jira contribution. We now support two kinds of RateLimiter.
19011 1) org.apache.hadoop.hbase.quotas.AverageIntervalRateLimiter : This limiter will refill resources at every TimeUnit/resources interval.
19012 Example: For a limiter configured with 10resources/second, then 1resource will be refilled after every 100ms.
19013
19014 2) org.apache.hadoop.hbase.quotas.FixedIntervalRateLimiter: This limiter will refill resources only after a given fixed interval of time.
19015
19016 Client can configure anyone of this rate limiter for the cluster by setting the value for the property "hbase.quota.rate.limiter" in the hbase-site.xml. org.apache.hadoop.hbase.quotas.AverageIntervalRateLimiter is the default value.
19017 Note: Client needs to restart the cluster for the configuration to take into effect.
19018
19019
19020 ---
19021
19022 * [HBASE-13816](https://issues.apache.org/jira/browse/HBASE-13816) | *Major* | **Build shaded modules only in release profile**
19023
19024 hbase-shaded-client and hbase-shaded-server modules will not build the actual jars unless -Prelease is supplied in mvn.
19025
19026
19027 ---
19028
19029 * [HBASE-13754](https://issues.apache.org/jira/browse/HBASE-13754) | *Major* | **Allow non KeyValue Cell types also to oswrite**
19030
19031 This jira has removed the already deprecated method
19032 KeyValue#oswrite(final KeyValue kv, final OutputStream out)
19033
19034
19035 ---
19036
19037 * [HBASE-13375](https://issues.apache.org/jira/browse/HBASE-13375) | *Major* | **Provide HBase superuser higher priority over other users in the RPC handling**
19038
19039 This JIRA modifies the signature of PriorityFunction#getPriority() method to also take request user as a parameter; all RPC requests sent by super users (as determined by cluster configuration) are executed with Admin QoS.
19040
19041
19042 ---
19043
19044 * [HBASE-5980](https://issues.apache.org/jira/browse/HBASE-5980) | *Minor* | **Scanner responses from RS should include metrics on rows/KVs filtered**
19045
19046 Adds scan metrics to the result. In the shell, set the ALL\_METRICS attribute to true on your scan to see dump of metrics after results (see the scan help for examples).
19047
19048 If you would prefer to see only a subset of the metrics, the METRICS array can be defined to include the names of only the metrics you care about.
19049
19050
19051 ---
19052
19053 * [HBASE-13698](https://issues.apache.org/jira/browse/HBASE-13698) | *Major* | **Add RegionLocator methods to Thrift2 proxy.**
19054
19055 Added getRegionLocation and getAllRegionLocations to the thrift2 interface.
19056
19057
19058 ---
19059
19060 * [HBASE-13636](https://issues.apache.org/jira/browse/HBASE-13636) | *Major* | **Remove deprecation for HBASE-4072 (Reading of zoo.cfg)**
19061
19062 Purge support for parsing zookeepers zoo.cfg deprecated since hbase-0.96.0
19063
19064
19065 ---
19066
19067 * [HBASE-13071](https://issues.apache.org/jira/browse/HBASE-13071) | *Major* | **Hbase Streaming Scan Feature**
19068
19069 MOTIVATION
19070
19071 A pipelined scan API is introduced for speeding up applications that combine massive data traversal with compute-intensive processing. Traditional HBase scans save network trips through prefetching the data to the client side cache. However, they prefetch synchronously: the fetch request to regionserver is invoked only when the entire cache is consumed. This leads to a stop-and-wait access pattern, in which the client stalls until the next chunk of data is fetched. Applications that do significant processing can benefit from background data prefetching, which eliminates this bottleneck. The pipelined scan implementation overlaps the cache population at the client side with application processing. Namely, it issues a new scan RPC when the iteration retrieves 50% of the cache. If the application processing (that is, the time between invocations of next()) is substantial, the new chunk of data will be available before the previous one is exhausted, and the client will not experience any delay. Ideally, the prefetch and the processing times should be balanced.
19072
19073 API AND CONFIGURATION
19074
19075 Asynchronous scanning can be configured either globally for all tables and scans, or on per-scan basis via a new Scan class API.
19076
19077 Configuration in hbase-site.xml: hbase.client.scanner.async.prefetch, default false:
19078
19079  \<property\>
19080    \<name\>hbase.client.scanner.async.prefetch\</name\>
19081    \<value\>true\</value\>
19082  \</property\>
19083
19084 API - Scan#setAsyncPrefetch(boolean)
19085
19086       Scan scan = new Scan();
19087       scan.setCaching(1000);
19088       scan.setMaxResultSize(BIG\_SIZE);
19089       scan.setAsyncPrefetch(true);
19090         ...
19091       ResultScanner scanner = table.getScanner(scan);
19092
19093 IMPLEMENTATION NOTES
19094
19095 Pipelined scan is implemented by a new ClientAsyncPrefetchScanner class, which is fully API-compatible with the synchronous ClientSimpleScanner. ClientAsyncPrefetchScanner is not instantiated in case of small (Scan#setSmall) and reversed (Scan#setReversed) scanners. The application is responsible for setting the prefetch size in a way that the prefetch time and the processing times are balanced. Note that due to double buffering, the client side cache can use twice as much memory as the synchronous scanner.
19096
19097 Generally, this feature will put more load on the server (higher fetch rate -- which is the whole point).  Also, YMMV.
19098
19099
19100 ---
19101
19102 * [HBASE-13533](https://issues.apache.org/jira/browse/HBASE-13533) | *Trivial* | **section on configuring ~/.m2/settings.xml has no anchor**
19103
19104 Correct setting.xml anchor in book
19105
19106
19107 ---
19108
19109 * [HBASE-13625](https://issues.apache.org/jira/browse/HBASE-13625) | *Major* | **Use HDFS for HFileOutputFormat2 partitioner's path**
19110
19111 Introduces a new config hbase.fs.tmp.dir which is a directory in HDFS (or default file system) to use as a staging directory for HFileOutputFormat2. This is also used as the default for hbase.bulkload.staging.dir
19112
19113
19114 ---
19115
19116 * [HBASE-10800](https://issues.apache.org/jira/browse/HBASE-10800) | *Major* | **Use CellComparator instead of KVComparator**
19117
19118 From 2.0 branch onwards KVComparator and its subclasses MetaComparator, RawBytesComparator are all deprecated.
19119 All the comparators are moved to CellComparator.  MetaCellComparator, a subclass of CellComparator, will be used to compare hbase:meta cells.
19120 Previously exposed static instances KeyValue.COMPARATOR, KeyValue.META\_COMPARATOR and KeyValue.RAW\_COMPARATOR are deprecated instead use CellComparator.COMPARATOR and CellComparator.META\_COMPARATOR.
19121 Also note that there will be no RawBytesComparator.  Where ever we need to compare raw bytes use Bytes.BYTES\_RAWCOMPARATOR.
19122 CellComparator will always operate on cells and its components, abstracting the fact that a cell can be backed by a single byte[] as opposed to how KVComparators were working.
19123
19124
19125 ---
19126
19127 * [HBASE-13333](https://issues.apache.org/jira/browse/HBASE-13333) | *Major* | **Renew Scanner Lease without advancing the RegionScanner**
19128
19129 Adds a renewLease call to ClientScanner
19130
19131
19132 ---
19133
19134 * [HBASE-13564](https://issues.apache.org/jira/browse/HBASE-13564) | *Major* | **Master MBeans are not published**
19135
19136 To use the coprocessor-based JMX implementation provided by HBase for Master.
19137 Add below property in hbase-site.xml file:
19138
19139 \<property\>
19140   \<name\>hbase.coprocessor.master.classes\</name\>
19141   \<value\>org.apache.hadoop.hbase.JMXListener\</value\>
19142 \</property\>
19143
19144 NOTE: DO NOT set \`com.sun.management.jmxremote.port\` for Java VM at the same time.
19145
19146 By default, the JMX listens on TCP port 10101 for Master, we can further configure the port using below properties:
19147
19148 \<property\>
19149   \<name\>master.rmi.registry.port\</name\>
19150   \<value\>61110\</value\>
19151 \</property\>
19152 \<property\>
19153   \<name\>master.rmi.connector.port\</name\>
19154   \<value\>61120\</value\>
19155 \</property\>
19156 ----
19157
19158 The registry port can be shared with connector port in most cases, so you only need to configure master.rmi.registry.port.
19159 However if you want to use SSL communication, the 2 ports must be configured to different values.
19160
19161
19162 ---
19163
19164 * [HBASE-13537](https://issues.apache.org/jira/browse/HBASE-13537) | *Major* | **Procedure V2 - Change the admin interface for async operations to return Future (incompatible with branch-1.x)**
19165
19166 As we made changes to return types in asynchronous methods of Admin API, this change is going to break binary compatibility. The source compatibility is kept intact though. The applications running against this change needs to be recompiled to keep things working.
19167
19168
19169 ---
19170
19171 * [HBASE-13517](https://issues.apache.org/jira/browse/HBASE-13517) | *Major* | **Publish a client artifact with shaded dependencies**
19172
19173 HBase now provides added convenience artifacts that shade most dependencies. These jars hbase-shaded-client and hbase-shaded-server are meant to be used when dependency conflicts can not be solved any other way. The normal jars hbase-client and hbase-server should still be preferred when possible.
19174
19175 Do not use hbase-shaded-server or hbase-shaded-client inside of a co-processor as bad things will happen.
19176
19177
19178 ---
19179
19180 * [HBASE-13149](https://issues.apache.org/jira/browse/HBASE-13149) | *Blocker* | **HBase MR is broken on Hadoop 2.5+ Yarn**
19181
19182 In HBase 1.1.0 and above we have upgraded the version of Jackson dependencies (jackson-core-asl, jackson-mapper-asl, jackson-jaxrs and jackson-xc) from 1.8.8 to 1.9.13. This is to follow the upgrade to Jackson 1.9.13 in Hadoop 2.5 and above which causes Jackson class incompatibility for HBase as reported in HBASE-13149.  Refer to HADOOP-10104 and YARN-2092 for additional information. Jackson1.9.13 is not completely backward compatible with the prior version 1.8.8 used in HBase. See the Compatibility reports attached in HBASE-13149 and http://svn.codehaus.org/jackson/trunk/release-notes/VERSION for more information.
19183
19184 This upgrade does not have direct impact on HBase users and HBase applications in most cases. In the rare case where your HBase application uses Jackson directly AND your application has compatibility issue with Jackson 1.9.13, you can do the following to mitigate the problem.
19185
19186 1. If you are on Hadoop 2.5 or above, and your HBase application involves running Yarn jobs, we recommend you update your application to use Jackson 1.9.13. You may be able to explore classpath isolation options (e.g. HADOOP-10893) or have your own classpath isolation strategy that works for you, but the general recommendation is that you upgrade to Jackson 1.9.13.
19187 2. You may choose to continue using Jackson 1.8.8 and not to use Jackson 1.9.13 in your classpath.  You can also choose to replace the Jackson 1.9.13 jars in $HBASE\_HOME/lib with 1.8.8 jars.  It can work for you in the following cases:
19188 a) You are on a Hadoop version earlier than Hadoop 2.5,  or
19189 b) You are on Hadoop 2.5 or above, but your HBase application does not involve running Yarn jobs.
19190 3. You may experiment with further isolation using the shaded jars introduced with 1.1.0 via HBASE-13517.
19191
19192 Note that it may not be tested or guaranteed that using Jackson 1.8.8 in $HBASE\_HOME/lib will work in future HBase releases.
19193 It is recommended that your HBase application matches the Jackson version provided in HBase.
19194
19195 In HBase 0.98.x and HBase 1.0.x, we have NOT upgraded the version of Jackson dependencies. If you are on Hadoop 2.5 or above, and your HBase application involves running Yarn jobs, you may encounter Jackson class incomparability issue, as reported in HBASE-13149.
19196
19197 You can do the following to mitigate the problem:
19198 1. Use 'hadoop jar' command to run your HBase jobs.
19199 2. Explore classpath isolation options (e.g. HADOOP-10893) or have your own classpath isolation strategy that works for you.
19200 3. You can also choose to replace the Jackson 1.8.8 jars in $HBASE\_HOME/lib with 1.9.13 jars from your Hadoop lib directory. We have tested HBase 0.98 with Jackson 1.9.13.
19201
19202
19203 ---
19204
19205 * [HBASE-13481](https://issues.apache.org/jira/browse/HBASE-13481) | *Major* | **Master should respect master (old) DNS/bind related configurations**
19206
19207 Master now honors configuration options as was before 1.0.0 releases:
19208 hbase.master.ipc.address
19209 hbase.master.dns.interface
19210 hbase.master.dns.nameserver
19211 hbase.master.info.bindAddress
19212 This jira also adds hbase.master.hostname parameter as an extension to HBASE-12954.
19213
19214
19215 ---
19216
19217 * [HBASE-13090](https://issues.apache.org/jira/browse/HBASE-13090) | *Major* | **Progress heartbeats for long running scanners**
19218
19219 Previously, there was no way to enforce a time limit on scan RPC requests. The server would receive a scan RPC request and take as much time as it needed to accumulate enough results to reach a limit or exhaust the region. The problem with this approach was that, in the case of a very selective scan, the processing of the scan could take too long and cause timeouts client side.
19220
19221 With this fix, the server will now enforce a time limit on the execution of scan RPC requests. When a scan RPC request arrives to the server, a time limit is calculated to be half of whichever timeout value is more restictive between the configurations ("hbase.client.scanner.timeout.period" and "hbase.rpc.timeout"). When the time limit is reached, the server will return whatever results it has accumulated up to that point. The results may be empty.
19222
19223 To ensure that timeout checks do not occur too often (which would hurt the performance of scans), the configuration "hbase.cells.scanned.per.heartbeat.check" has been introduced. This configuration controls how often System.currentTimeMillis() is called to update the progress towards the time limit. Currently, the default value of this configuration value is 10000. Specifying a smaller value will provide a tighter bound on the time limit, but may hurt scan performance due to the higher frequency of calls to System.currentTimeMillis().
19224
19225 Protobuf models for ScanRequest and ScanResponse have been updated so that heartbeat support can be communicated. Support for heartbeat messages is specified in the request sent to the server via ScanRequest.Builder#setClientHandlesHeartbeats. Only when the server sees that ScanRequest#getClientHandlesHeartbeats() is true will it send heartbeat messages back to the client. A response is marked as a heartbeat message via the boolean flag ScanResponse#getHeartbeatMessage
19226
19227
19228 ---
19229
19230 * [HBASE-13307](https://issues.apache.org/jira/browse/HBASE-13307) | *Major* | **Making methods under ScannerV2#next inlineable, faster**
19231
19232 Made methods smaller under Scanner#next so inlinable and compilable (was getting 'too big to compile' from hotspot). Use of unsafe to parse shorts rather than use BB#getShort... faster, etc.
19233
19234
19235 ---
19236
19237 * [HBASE-13453](https://issues.apache.org/jira/browse/HBASE-13453) | *Critical* | **Master should not bind to region server ports**
19238
19239 In 1.0.x, master by default binds to the region server ports (both rpc and info). This change brings back the usage of old master rpc and info ports in 1.1+ and master (2.0) branches. The motivation for this change is to ease the life of the user so that he does not need to do anything to bring up a RS on the same host and also to make the migration from 0.98 to 1.1  hassle free.  However, the users going from 1.0 to 1.1 would see the change in the master ports.
19240
19241
19242 ---
19243
19244 * [HBASE-13419](https://issues.apache.org/jira/browse/HBASE-13419) | *Major* | **Thrift gateway should propagate text from exception causes.**
19245
19246 Compose thrift exception text from the text of the entire cause chain of the underlying exception.
19247
19248
19249 ---
19250
19251 * [HBASE-13275](https://issues.apache.org/jira/browse/HBASE-13275) | *Major* | **Setting hbase.security.authorization to false does not disable authorization**
19252
19253 Prior to this change the configuration setting 'hbase.security.authorization' had no effect if security coprocessor were installed. The act of installing the security coprocessors was assumed to indicate active authorizaton was desired and required. Now it is possible to install the security coprocessors yet have them operate in a passive state with active authorization disabled by setting 'hbase.security.authorization' to false. This can be useful but is probably not what you want. For more information, consult the Security section of the HBase online manual.
19254
19255 'hbase.security.authorization' defaults to true for backwards comptatible behavior.
19256
19257
19258 ---
19259
19260 * [HBASE-13118](https://issues.apache.org/jira/browse/HBASE-13118) | *Major* | **[PE] Add being able to write many columns**
19261
19262 Adds a --columns option to PE so you can write more than one column (changes default qualifier from 'data' to '0').
19263
19264
19265 ---
19266
19267 * [HBASE-13270](https://issues.apache.org/jira/browse/HBASE-13270) | *Major* | **Setter for Result#getStats is #addResults; confusing!**
19268
19269 Deprecates Result#addResults in favor of Result#setStatistics
19270
19271
19272 ---
19273
19274 * [HBASE-13362](https://issues.apache.org/jira/browse/HBASE-13362) | *Major* | **Set max result size from client only (like scanner caching).**
19275
19276 This introduces a new config option: hbase.server.scanner.max.result.size
19277 This setting enforces a maximum result size (in bytes), when reached the server will return the results is has so far.
19278 This is a safety setting and should be kept large. The default is inifinite in 0.98 and 1.0.x and 100mb in 1.1 and later.
19279
19280 Use hbase.client.scanner.max.result.size instead to enforce practical chunk sizes of a few mb (defaults to 2mb)
19281
19282
19283 ---
19284
19285 * [HBASE-11544](https://issues.apache.org/jira/browse/HBASE-11544) | *Critical* | **[Ergonomics] hbase.client.scanner.caching is dogged and will try to return batch even if it means OOME**
19286
19287 Results returned from RPC calls may now be returned as partials
19288
19289 When is a Result marked as a partial?
19290 When the server must stop the scan because the max size limit has been reached. Means that the LAST Result returned within the ScanResult's Result array may be marked as a partial if the scan's max size limit caused it to stop in the middle of a row.
19291
19292 Incompatible Change: The return type of InternalScanners#next and RegionScanners#nextRaw has been changed to NextState from boolean
19293 The previous boolean return value can be accessed via NextState#hasMoreValues()
19294 Provides more context as to what happened inside the scanner
19295
19296 Scan caching default has been changed to Integer.Max\_Value
19297 This value works together with the new maxResultSize value from HBASE-12976 (defaults to 2MB)
19298 Results returned from server on basis of size rather than number of rows
19299 Provides better use of network since row size varies amongst tables
19300
19301 Protobuf models have changed for Result, ScanRequest, and ScanResponse to support new partial Results
19302
19303 Partial Results should be invisible to application layer unless Scan#setAllowPartials is set
19304
19305 Scan#setAllowPartials has been added to allow the application to request to see the partial Results returned by the server rather than have the ClientScanner form the complete Result prior to returning it to the application
19306
19307 To disable the use of partial Results on the server, set ScanRequest.Builder#setClientHandlesPartials() to be false in the ScanRequest issued to server
19308
19309 Partial Results should allow the server to return large rows in parts rather than accumulate all the cells for that particular row and run out of memory
19310
19311
19312 ---
19313
19314 * [HBASE-11864](https://issues.apache.org/jira/browse/HBASE-11864) | *Minor* | **Enhance HLogPrettyPrinter to print information from WAL Header**
19315
19316 Enhance WALPrettyPrinter to print information (writer classnames and cell codec classname) from WAL Header
19317
19318
19319 ---
19320
19321 * [HBASE-13289](https://issues.apache.org/jira/browse/HBASE-13289) | *Major* | **typo in splitSuccessCount  metric**
19322
19323 In hbase 1.0.0, 0.98.10, 0.98.10.1, 0.98.11, and 0.98.12 'splitSuccessCount' was misspelled as 'splitSuccessCounnt'
19324
19325
19326 ---
19327
19328 * [HBASE-12990](https://issues.apache.org/jira/browse/HBASE-12990) | *Major* | **MetaScanner should be replaced by MetaTableAccessor**
19329
19330 Removes MetaScanner. Use MetaTableAccessor instead.
19331
19332
19333 ---
19334
19335 * [HBASE-13373](https://issues.apache.org/jira/browse/HBASE-13373) | *Major* | **Squash HFileReaderV3 together with HFileReaderV2 and AbstractHFileReader; ditto for Scanners and BlockReader, etc.**
19336
19337 Marking as incompatible change. Requires hfiles be major version \>= 2 and \>= minor version 3.  Version 3 files are enabled by default in 1.0.  0.98 writes version 2 minor version 3.  You cannot go to 1.0 from anything before 0.98.
19338
19339
19340 ---
19341
19342 * [HBASE-13252](https://issues.apache.org/jira/browse/HBASE-13252) | *Major* | **Get rid of managed connections and connection caching**
19343
19344 For a long time, HBase supported 2 types of connections - managed, which were cached and closed automatically when not needed, and unmanaged, where user is responsible for closing the connections by calling #close() on them.
19345
19346 The concept of managed connections in HBase (deprecated before) has now been extinguished completely, and now all callers are responsible for managing the lifecycle of connections they acquire.
19347
19348
19349 ---
19350
19351 * [HBASE-12954](https://issues.apache.org/jira/browse/HBASE-12954) | *Minor* | **Ability impaired using HBase on multihomed hosts**
19352
19353 The following config is added by this JIRA:
19354
19355 hbase.regionserver.hostname
19356
19357 This config is for experts: don't set its value unless you really know what you are doing.
19358 When set to a non-empty value, this represents the (external facing) hostname for the underlying server.
19359 See https://issues.apache.org/jira/browse/HBASE-12954 for details.
19360
19361 Caution: please make sure rolling upgrade succeeds before turning on this feature.
19362
19363
19364 ---
19365
19366 * [HBASE-13187](https://issues.apache.org/jira/browse/HBASE-13187) | *Critical* | **Add ITBLL that exercises per CF flush**
19367
19368 Pass the -D flag generator.multiple.columnfamilies on the command-line if you want the generator to write three column families rather than the default one. When set, we will write the usual 'meta' column family and use it checking linked-list is wholesome but we will also write a 'tiny' column family and a 'big' column family to provoke uneven flushing; good for testing the flush-by-columnfamily feature.
19369
19370
19371 ---
19372
19373 * [HBASE-13361](https://issues.apache.org/jira/browse/HBASE-13361) | *Minor* | **Remove or undeprecate {get\|set}ScannerCaching in HTable**
19374
19375 Removed getScannerCaching and setScannerCaching from Table
19376
19377
19378 ---
19379
19380 * [HBASE-10728](https://issues.apache.org/jira/browse/HBASE-10728) | *Major* | **get\_counter value is never used.**
19381
19382 for 0.98 and 1.0 changes are compatible (due to mitigation by HBASE-13433):
19383
19384 \* The "get\_counter" command no longer requires a dummy 4th argument. Downstream users are encouraged to migrate code to not pass this argument because it will result in an error for HBase 1.1+.
19385 \* The "incr" command now outputs the current value of the counter to stdout.
19386 ex:
19387 {code}
19388 jruby-1.6.8 :005 \> incr 'counter\_example', 'r1', 'cf1:foo', 10
19389 COUNTER VALUE = 1772
19390 0 row(s) in 0.1180 seconds
19391 {code}
19392
19393 for 1.1+ changes are incompatible:
19394
19395 \* The "get\_counter" command no longer accepts a dummy 4th argument. Downstream users will need to update their code to not pass this argument.
19396 ex:
19397 {code}
19398 jruby-1.6.8 :006 \> get\_counter 'counter\_example', 'r1', 'cf1:foo'
19399 COUNTER VALUE = 1772
19400
19401 {code}
19402 \* The "incr" command now outputs the current value of the counter to stdout.
19403 ex:
19404 {code}
19405 jruby-1.6.8 :005 \> incr 'counter\_example', 'r1', 'cf1:foo', 10
19406 COUNTER VALUE = 1772
19407 0 row(s) in 0.1180 seconds
19408 {code}
19409
19410
19411 ---
19412
19413 * [HBASE-13170](https://issues.apache.org/jira/browse/HBASE-13170) | *Major* | **Allow block cache to be external**
19414
19415 HBase can use memcached as an external block cache. To use this change your config to set hbase.blockcache.use.external to true and hbase.cache.memcached.servers to contain the list of memcached servers to use.
19416
19417
19418 ---
19419
19420 * [HBASE-13316](https://issues.apache.org/jira/browse/HBASE-13316) | *Minor* | **Reduce the downtime on planned moves of regions**
19421
19422 When issuing an Admin.move command the RegionServer that receive the region will try and open the StoreFiles of that region to prime the block cache with index blocks.
19423
19424
19425 ---
19426
19427 * [HBASE-13298](https://issues.apache.org/jira/browse/HBASE-13298) | *Critical* | **Clarify if Table.{set\|get}WriteBufferSize() is deprecated or not**
19428
19429 Deprecate said methods. They were mistakenly included in Table Interface.
19430
19431
19432 ---
19433
19434 * [HBASE-13248](https://issues.apache.org/jira/browse/HBASE-13248) | *Major* | **Make HConnectionImplementation top-level class.**
19435
19436 **WARNING: No release note provided for this change.**
19437
19438
19439 ---
19440
19441 * [HBASE-13331](https://issues.apache.org/jira/browse/HBASE-13331) | *Blocker* | **Exceptions from DFS client can cause CatalogJanitor to delete referenced files**
19442
19443 Fixes an issue where files from a split region that were still referenced were erroneously deleted leading to data loss.
19444
19445
19446 ---
19447
19448 * [HBASE-13273](https://issues.apache.org/jira/browse/HBASE-13273) | *Major* | **Make Result.EMPTY\_RESULT read-only; currently it can be modified**
19449
19450 The Result.EMPTY\_RESULT object is now immutable. In previous releases, the object could be modified by a caller to no longer be empty. Code that relies on this behavior will now receive an UnsupportedOperationException.
19451
19452
19453 ---
19454
19455 * [HBASE-12867](https://issues.apache.org/jira/browse/HBASE-12867) | *Major* | **Shell does not support custom replication endpoint specification**
19456
19457 Adds support to add\_peer in hbase shell to add a custom replication endpoint from HBASE-12254.
19458
19459
19460 ---
19461
19462 * [HBASE-13198](https://issues.apache.org/jira/browse/HBASE-13198) | *Major* | **Remove HConnectionManager**
19463
19464 **WARNING: No release note provided for this change.**
19465
19466
19467 ---
19468
19469 * [HBASE-12586](https://issues.apache.org/jira/browse/HBASE-12586) | *Major* | **Task 6 & 7 from HBASE-9117,  delete all public HTable constructors and delete ConnectionManager#{delete,get}Connection**
19470
19471 HTable class has been marked as private API before, and now it's no longer directly instantiable from client code (all public constructors have been removed). All clients should use Connection#getTable() and Connection#getRegionLocator() when appropriate to obtain Table and RegionLocator implementations to work with.
19472
19473
19474 ---
19475
19476 * [HBASE-13171](https://issues.apache.org/jira/browse/HBASE-13171) | *Minor* | **Change AccessControlClient methods to accept connection object to reduce setup time.**
19477
19478 **WARNING: No release note provided for this change.**
19479
19480
19481 ---
19482
19483 * [HBASE-12706](https://issues.apache.org/jira/browse/HBASE-12706) | *Critical* | **Support multiple port numbers in ZK quorum string**
19484
19485 hbase.zookeeper.quorum configuration now allows servers together with client ports consistent with the way Zookeeper java client accepts the quorum string. In this case, using hbase.zookeeper.clientPort is not needed. eg.  hbase.zookeeper.quorum=myserver1:2181,myserver2:20000,myserver3:31111
19486
19487
19488 ---
19489
19490 * [HBASE-13142](https://issues.apache.org/jira/browse/HBASE-13142) | *Major* | **[PERF] Reuse the IPCUtil#buildCellBlock buffer**
19491
19492 Adds buffer reuse sending Cell results. It is on by default and should not need configuration. Improves GC profile and ups throughput. The benefit gets better the larger the row size returned.
19493
19494 The buffer reservoir is bounded at a maximum count after which we will start logging at WARN level that the reservoir is running at capacity (returned buffers will be discarded and not added back to the reservoir pool). Default maximum is twice the handler count: i.e. 2 \* hbase.regionserver.handler.count. This should be more than enough. Set the maximum with the new configuration: hbase.ipc.server.reservoir.max
19495
19496 The reservoir will not cache buffers in excess of hbase.ipc.server.reservoir.max.buffer.size  The default is 10MB. This means that if a row is very large, then we will allocate a buffer of the average size that is currently in the pool and we will then resize it till we can accommodate the return. These resizes are expensive. The resultant buffer will be used and then discarded.
19497
19498 To check how the reservoir is doing, enable trace level logging for a few seconds on a regionserver. You can do this from the regionserver UI. See 'Log Level'. Set org.apache.hadoop.hbase.io.BoundedByteBufferPool to TRACE. The BoundedByteBufferPool will spew report to the log. Disable the TRACE level and then check the log. You'll see allocation rate, size of pool, size of buffers in pool, etc.
19499
19500
19501 ---
19502
19503 * [HBASE-13012](https://issues.apache.org/jira/browse/HBASE-13012) | *Major* | **Add shell commands to trigger the mob file compactor**
19504
19505 This adds two new shell commands -- compact\_mob and major\_compact\_mob to the hbase shell.
19506
19507 Run compaction on a mob enabled column family or all mob enabled column families within a table
19508           Examples:
19509           Compact a column family within a table:
19510           hbase\> compact\_mob 't1', 'c1'
19511           Compact all mob enabled column families
19512           hbase\> compact\_mob 't1'
19513
19514 Run major compaction on a mob enabled column family or all mob enabled column families within a table
19515           Examples:
19516           Compact a column family within a table:
19517           hbase\> major\_compact\_mob 't1', 'c1'
19518           Compact all mob enabled column families within a table
19519           hbase\> major\_compact\_mob 't1'
19520
19521
19522 ---
19523
19524 * [HBASE-12869](https://issues.apache.org/jira/browse/HBASE-12869) | *Major* | **Add a REST API implementation of the ClusterManager interface**
19525
19526 Adds an implementation of ClusterManager to control REST API-managed HBase clusters.
19527
19528
19529 ---
19530
19531 * [HBASE-13047](https://issues.apache.org/jira/browse/HBASE-13047) | *Trivial* | **Add "HBase Configuration" link missing on the table details pages**
19532
19533 Add a '/conf' link to UI
19534
19535
19536 ---
19537
19538 * [HBASE-13044](https://issues.apache.org/jira/browse/HBASE-13044) | *Minor* | **Configuration option for disabling coprocessor loading**
19539
19540 This change adds two new configuration options:
19541 - "hbase.coprocessor.enabled" controls globally if any coprocessors will be loaded. Set to "false" to disable. Defaults to "true" for compatibility with previous releases.
19542 - "hbase.coprocessor.user.enabled" controls if any user (aka table) coprocessors will be loaded. Set to "false" to disable. Defaults to "true" for compatibility with previous releases.
19543
19544
19545 ---
19546
19547 * [HBASE-12961](https://issues.apache.org/jira/browse/HBASE-12961) | *Minor* | **Negative values in read and write region server metrics**
19548
19549 Change read and write request count in ServerLoad from int to long
19550
19551
19552 ---
19553
19554 * [HBASE-7332](https://issues.apache.org/jira/browse/HBASE-7332) | *Minor* | **[webui] HMaster webui should display the number of regions a table has.**
19555
19556 Adds counts for various regions states to the table listing on main page. See attached screenshot.
19557
19558
19559 ---
19560
19561 * [HBASE-8329](https://issues.apache.org/jira/browse/HBASE-8329) | *Major* | **Limit compaction speed**
19562
19563 Adds compaction throughput limit mechanism(the word "throttle" is already used when choosing compaction thread pool, so use a different word here to avoid ambiguity). Default is org.apache.hadoop.hbase.regionserver.compactions.PressureAwareCompactionThroughputController, will limit throughput as follow:
19564 1. In off peak hours, use a fixed limitation "hbase.hstore.compaction.throughput.offpeak" (default is Long.MAX\_VALUE which means no limitation).
19565 2. In normal hours, the limitation is tuned between "hbase.hstore.compaction.throughput.lower.bound"(default 10MB/sec) and "hbase.hstore.compaction.throughput.higher.bound"(default 20MB/sec), using the formula "lower + (higer - lower) \* param" where param is in range [0.0, 1.0] and calculate based on store files count on this regionserver.
19566 3. If some stores have too many store files(storefilesCount \> blockingFileCount), then there is no limitation no matter peak or off peak.
19567 You can set "hbase.regionserver.throughput.controller" to org.apache.hadoop.hbase.regionserver.throttle.NoLimitThroughputController to disable throughput controlling.
19568 And we have implemented ConfigurationObserver which means you can change all configurations above and do not need to restart cluster.
19569
19570 The throttle is on by default in hbase-2.0.0. There is no limit in hbase-1.x.
19571
19572
19573 ---
19574
19575 * [HBASE-6778](https://issues.apache.org/jira/browse/HBASE-6778) | *Major* | **Deprecate Chore; its a thread per task when we should have one thread to do all tasks**
19576
19577 Corresponding usages for new ScheduledChore vs. Deprecated Chore:
19578 Chore.interrupt() -\> ScheduledChore.cancel(mayInterruptWhileRunning = true)
19579 Threads.setDaemonThreadRunning(Chore) -\> ChoreService.scheduleChore(ScheduledChore)
19580 Chore.isAlive -\> ScheduledChore.isScheduled()
19581 Chore.getSleeper().skipSleepCycle() -\> ScheduledChore.triggerNow()
19582
19583
19584 ---
19585
19586 * [HBASE-11574](https://issues.apache.org/jira/browse/HBASE-11574) | *Major* | **hbase:meta's regions can be replicated**
19587
19588 On the server side, set hbase.meta.replica.count to the number of replicas of meta that you want to have in the cluster (defaults to 1). hbase.regionserver. meta.storefile.refresh.period should be set to a non-zero number in milliseconds - something like 30000 (defaults to 0).
19589 On the client/user side, set hbase.meta.replicas.use to true.
19590
19591
19592 ---
19593
19594 * [HBASE-12808](https://issues.apache.org/jira/browse/HBASE-12808) | *Major* | **Use Java API Compliance Checker for binary/source compatibility**
19595
19596 Adds a dev-support/check\_compatibility.sh script for comparing versions. Run the script to see usage.
19597
19598
19599 ---
19600
19601 * [HBASE-12684](https://issues.apache.org/jira/browse/HBASE-12684) | *Major* | **Add new AsyncRpcClient**
19602
19603 Retrofit a new, netty-based rpc transport on the client. This client is slightly slower if little contention given the extra tier or so that netty adds and that we block on a Future waiting on the call to finish.  This client opens the way for HBase having a native Async API.
19604
19605 This client is on by default in master branch (2.0 hbase). It is off in branch-1.0 (hbase-1.1.x).  To enable it, set "hbase.rpc.client.impl" to "org.apache.hadoop.hbase.ipc.AsyncRpcClient"
19606
19607
19608 ---
19609
19610 * [HBASE-8410](https://issues.apache.org/jira/browse/HBASE-8410) | *Major* | **Basic quota support for namespaces**
19611
19612 Namespace auditor provides basic quota support for namespaces in terms of number of tables and number of regions. In order to use namespace quotas, quota support must be enabled by setting
19613 "hbase.quota.enabled" property to true in hbase-site.xml file.
19614
19615 The users can add quota information to namespace, while creating new namespaces or by altering existing ones.
19616
19617 Examples:
19618 1. create\_namespace 'ns1', {'hbase.namespace.quota.maxregions'=\>'10'}
19619 2. create\_namespace 'ns2', {'hbase.namespace.quota.maxtables'=\>'2','hbase.namespace.quota.maxregions'=\>'5'}
19620 3. alter\_namespace 'ns3', {METHOD =\> 'set', 'hbase.namespace.quota.maxtables'=\>'5','hbase.namespace.quota.maxregions'=\>'25'}
19621
19622 The quotas can be modified/added to namespace at any point of time. To remove quotas, the following command can be used:
19623
19624 alter\_namespace 'ns3', {METHOD =\> 'unset', NAME =\> 'hbase.namespace.quota.maxtables'}
19625 alter\_namespace 'ns3', {METHOD =\> 'unset', NAME =\> 'hbase.namespace.quota.maxregions'}
19626
19627
19628 ---
19629
19630 * [HBASE-12902](https://issues.apache.org/jira/browse/HBASE-12902) | *Major* | **Post-asciidoc conversion fix-ups**
19631
19632 Pushed to master. Shout if there are any issues.
19633
19634
19635 ---
19636
19637 * [HBASE-12848](https://issues.apache.org/jira/browse/HBASE-12848) | *Major* | **Utilize Flash storage for WAL**
19638
19639 For users on a version of Hadoop that supports tiered storage policies (i.e. Apache Hadoop 2.6.0+), HBase now allows users to opt-in to having the write ahead log placed on the SSD tier. Users on earlier versions of Hadoop will be unable to take advantage of this feature.
19640
19641 Use of tiered storage is controlled by a new RegionServer config, hbase.wal.storage.policy. It defaults to the value 'NONE', which will rely on HDFS defaults for a policy decision.
19642
19643 User can specify ONE\_SSD or ALL\_SSD as the value:
19644 ONE\_SSD: place only one replica of WAL files in SSD and the remaining in default storage
19645 ALL\_SSD: all replica for WAL files are placed on SSD
19646
19647 See [the HDFS docs on storage policy\|http://hadoop.apache.org/docs/r2.6.0/hadoop-project-dist/hadoop-hdfs/ArchivalStorage.html]
19648
19649
19650 ---
19651
19652 * [HBASE-11144](https://issues.apache.org/jira/browse/HBASE-11144) | *Major* | **Filter to support scanning multiple row key ranges**
19653
19654 MultiRowRangeFilter is a filter to support scanning multiple row key ranges. If the number of the ranges is small, using multiple scans can also do the same thing and can work well. But when the number of ranges are quite big (e.g. millions), use the MultiRowRangeFilter will be nice. In this filter, the ranges will be sorted and merged, so users do not have to take care of ranges are not continuous. And if users are using something like rest, thrift or pig to access the data the filter might be the practical solution.
19655
19656
19657 ---
19658
19659 * [HBASE-12268](https://issues.apache.org/jira/browse/HBASE-12268) | *Major* | **Add support for Scan.setRowPrefixFilter to shell**
19660
19661 Added new option, ROWPREFIXFILTER, to the scan command in the HBase shell to easily scan for a specific row prefix.
19662
19663
19664 ---
19665
19666 * [HBASE-12775](https://issues.apache.org/jira/browse/HBASE-12775) | *Major* | **CompressionTest ate my HFile (sigh!)**
19667
19668 CompressionTest will now abort when the target path exists.
19669
19670
19671 ---
19672
19673 * [HBASE-12695](https://issues.apache.org/jira/browse/HBASE-12695) | *Critical* | **JDK 1.8 compilation broken**
19674
19675 Use the -Pjavac maven profile in order to compile HBase using the compiler provided by the JDK instead of the default error-prone compiler plugin. This is useful for now if you are building HBase with JDK 1.8 or a JDK that doesn't support error-prone.
19676
19677
19678 ---
19679
19680 * [HBASE-10201](https://issues.apache.org/jira/browse/HBASE-10201) | *Major* | **Port 'Make flush decisions per column family' to trunk**
19681
19682 Adds new flushing policy mechanism. Default, org.apache.hadoop.hbase.regionserver.FlushLargeStoresPolicy, will try to avoid flushing out the small column families in a region, those whose memstores are \< hbase.hregion.percolumnfamilyflush.size.lower.bound. To restore the old behavior of flushes writing out all column families, set hbase.regionserver.flush.policy to org.apache.hadoop.hbase.regionserver.FlushAllStoresPolicy either in hbase-default.xml or on a per-table basis by setting the policy to use with HTableDescriptor.getFlushPolicyClassName().
19683
19684
19685 ---
19686
19687 * [HBASE-12559](https://issues.apache.org/jira/browse/HBASE-12559) | *Major* | **Provide LoadBalancer with online configuration capability**
19688
19689 updateConfiguration(ServerName server) method of Admin now updates config for HMaster as well.
19690 Specifically, config update would be taken by load balancer.
19691
19692
19693 ---
19694
19695 * [HBASE-10378](https://issues.apache.org/jira/browse/HBASE-10378) | *Major* | **Divide HLog interface into User and Implementor specific interfaces**
19696
19697 HBase internals for the write ahead log have been refactored. Advanced users of HBase should be aware of the following changes.
19698
19699 Public Audience
19700   - The Admin API for asking a region server to roll WAL files has changed from a synchronous command that returns a set of regions the WAL implementation would like flushed into an asynchronous command that returns nothing. Older clients relying on the former behavior will still be able to interact with newer servers, but the response body will always contain an empty list of regions to flush.
19701   - The shell command "hlog\_roll" has been deprecated. Operators should use the "wal\_roll" command instead. This command is subject to the changes described above for the Admin API to roll WAL files.
19702   - The command for analyzing write ahead logs has been renamed from 'hlog' to 'wal'. The old usage is deprecated and will be removed in a future version.
19703   - Some utility methods in the HBaseTesetingUtility related to testing write-ahead-logs were changed in incompatible ways. No functionality has been removed, but method names and arguments have changed. See the HBaseTestingUtility javadoc for details.
19704   - The WALPlayer utility has deprecated the configuration keys used for advanced customization. Users should switch to the updated configuration keys. See the usage information on the WALPlayer tool for details.
19705   - The HLogInputFormat utility class for processing logs with MapReduce has been deprecated and will be removed in a future version. Users should switch to the WALInputFormat.
19706   - The labeling of server metrics on the region server status pages changed. Previously, the number of backing files for the write ahead log was labeled 'Num. HLog Files'. If you wish to see this statistic now, please look for the label 'Num. WAL Files.'  If you rely on JMX for these metrics, their location has not changed.
19707
19708 LimitedPrivate(COPROC) Audience, LimitedPrivate(PHOENIX)
19709   - The RegionObserver API has been updated. The changes are both binary and source backwards compatible for coprocessors that use the BaseRegionObserver class. For those that implement RegionObserver directly the changes are binary backwards compatible. Depending on the internals of future HBase versions, coprocessors using the deprecated API may not see all WAL related events. Users are strongly encouraged to update their use of the API; see the RegionObserver javadoc for details.
19710   - Classes related to reading WAL entries (ReaderBase, ProtobufLogReader, SequenceFileLogReader) have changed in a backwards incompatible way. Users who referenced HLog.Reader directly or HLog.Entry will have to update. These changes do not impact compatibility with extant wal files.
19711   - The WALObserver API has been updated. The changes are both binary and source backwards compatible for coprocessors that use the BaseWALObserver class. For those that implement WALObserver directly the changes are binary backwards compatible. Depending on the internals of future HBase versions, coprocessors using the deprecated API may not see all WAL related events. Users are strongly encouraged to update their use of the API; see the WALObserver javadoc for details.
19712  - The WALCoprocessorEnvironment  has changed in a backwards incompatible way. WALObserver coprocessors that relied on retrieving an object representing the write ahead log instance will have to be updated.
19713
19714 LimitedPrivate(REPLICATION) Audience
19715  - The WALEntryFilter API has changed in a backwards incompatible way. Implementers will have to be updated.
19716  - The ReplicationEndpoint.ReplicateContext API has changed in a backwards incompatible way. Implementers who use this interface will have to be updated. These changes do not impact wire compatibility for replicating between clusters.
19717  - The HLogKey API is deprecated in favor of the WALKey API. Additionally, the HLogKey API has changed in a backwards incompatible way by changing from implementing WriteableComparable\<HLogKey\> to implementing Writeable and Comparable\<WALKey\>.
19718
19719
19720 ---
19721
19722 * [HBASE-11683](https://issues.apache.org/jira/browse/HBASE-11683) | *Major* | **Metrics for MOB**
19723
19724 Adds new mob related metrics:
19725
19726 mobCompactedIntoMobCellsCount
19727 mobCompactedIntoMobCellsSize
19728 mobCompactedFromMobCellsCount
19729 mobCompactedFromMobCellsSize
19730 mobFlushCount
19731 mobFlushedCellsCount
19732 mobFlushedCellsSize
19733 mobScanCellsCount
19734 mobScanCellsSize
19735 mobFileCacheAccessCount
19736 mobFileCacheMissCount
19737 mobFileCacheHitPercent
19738 mobFileCacheEvictedCount
19739 mobFileCacheCount
19740
19741
19742 ---
19743
19744 * [HBASE-11912](https://issues.apache.org/jira/browse/HBASE-11912) | *Major* | **Catch some bad practices at compile time with error-prone**
19745
19746 Errors from error-prone will fail the build in the compile phase. Warnings look like Javac warnings and are counted as such by test-patch etc
19747
19748
19749 ---
19750
19751 * [HBASE-12220](https://issues.apache.org/jira/browse/HBASE-12220) | *Major* | **Add hedgedReads and hedgedReadWins metrics**
19752
19753 Adds metrics hedgedReads and hedgedReadWins counts.
19754
19755
19756 ---
19757
19758 * [HBASE-6290](https://issues.apache.org/jira/browse/HBASE-6290) | *Minor* | **Add a function a mark a server as dead and start the recovery the process**
19759
19760 Adds a script to mark a server as dead.
19761
19762 Usage: considerAsDead.sh --hostname serverName
19763
19764
19765 ---
19766
19767 * [HBASE-12111](https://issues.apache.org/jira/browse/HBASE-12111) | *Major* | **Remove deprecated APIs from Mutation(s)**
19768
19769 Removed the below from hbase-2 (were deprecated on release of hbase-1.0.0)
19770
19771 Mutation setWriteToWAL(boolean)
19772 boolean getWriteToWAL()
19773 Mutation setFamilyMap(NavigableMap\<byte [], List\<KeyValue\>\>)
19774 NavigableMap\<byte [], List\<KeyValue\>\> getFamilyMap()
19775
19776
19777 ---
19778
19779 * [HBASE-12084](https://issues.apache.org/jira/browse/HBASE-12084) | *Major* | **Remove deprecated APIs from Result**
19780
19781 The below KeyValue based APIs are removed from Result
19782 KeyValue[] raw()
19783 List\<KeyValue\> list()
19784 List\<KeyValue\> getColumn(byte [] family, byte [] qualifier)
19785 KeyValue getColumnLatest(byte [] family, byte [] qualifier)
19786 KeyValue getColumnLatest(byte [] family, int foffset, int flength, byte [] qualifier, int qoffset, int qlength)
19787
19788 They are replaced with
19789 Cell[] rawCells()
19790 List\<Cell\> listCells()
19791 List\<Cell\> getColumnCells(byte [] family, byte [] qualifier)
19792 Cell getColumnLatestCell(byte [] family, byte [] qualifier)
19793 Cell getColumnLatestCell(byte [] family, int foffset, int flength, byte [] qualifier, int qoffset, int qlength)
19794 respectively
19795
19796 Also the constructors which were taking KeyValues also removed
19797 Result(KeyValue [] cells)
19798 Result(List\<KeyValue\> kvs)
19799
19800
19801 ---
19802
19803 * [HBASE-12048](https://issues.apache.org/jira/browse/HBASE-12048) | *Major* | **Remove deprecated APIs from Filter**
19804
19805 The following APIs are removed from Filter
19806 KeyValue transform(KeyValue)
19807 KeyValue getNextKeyHint(KeyValue)
19808 and replaced with
19809 Cell transformCell(Cell)
19810 Cell getNextCellHint(Cell)
19811 respectively.
19812 If a custom Filter implementation have overridden any of these methods, we will no longer call them. User has to change the custom Filter to override cell based methods as shown above
19813
19814
19815 ---
19816
19817 * [HBASE-7767](https://issues.apache.org/jira/browse/HBASE-7767) | *Major* | **Get rid of ZKTable, and table enable/disable state in ZK**
19818
19819 Keeps table enabled/disabled state in HDFS rather than up in ZooKeeper.  Auto-migrates any existing zk state.
19820
19821
19822 ---
19823
19824 * [HBASE-11911](https://issues.apache.org/jira/browse/HBASE-11911) | *Major* | **Break up tests into more fine grained categories**
19825
19826 Adds new test categories besides the class smalltests, mediumtests, and largetests.  Adds:
19827
19828 ClientTests
19829 CoprocessorTests
19830 FilterTests
19831 FlakeyTests
19832 IOTests
19833 MapReduceTests
19834 MasterTests
19835 MiscTests
19836 RegionServerTests
19837 ReplicationTests
19838 RestTests
19839 SecurityTests
19840 VerySlowMapReduceTests
19841 VerySlowRegionServerTests
19842
19843 See description for examples on how to use them.
19844
19845
19846 ---
19847
19848 * [HBASE-11658](https://issues.apache.org/jira/browse/HBASE-11658) | *Major* | **Piped commands to hbase shell should return non-zero if shell command failed.**
19849
19850 Adds a noninteractive mode (-n or --noninteractive) to the hbase shell that exits with a non-zero error code on failed or invalid shell command executions, and exits with a zero error code upon successful execution.
19851
19852
19853 ---
19854
19855 * [HBASE-11640](https://issues.apache.org/jira/browse/HBASE-11640) | *Major* | **Add syntax highlighting support to HBase Ref Guide programlistings**
19856
19857 This got committed, so I guess it is safe to resolve it?
19858
19859
19860 ---
19861
19862 * [HBASE-11606](https://issues.apache.org/jira/browse/HBASE-11606) | *Minor* | **Enable ZK-less region assignment by default**
19863
19864 By default, we don't use ZK for region assignment now. To fall back to the old way, you can set hbase.assignment.usezk to true.
19865
19866
19867 ---
19868
19869 * [HBASE-3135](https://issues.apache.org/jira/browse/HBASE-3135) | *Major* | **Make our MR jobs implement Tool and use ToolRunner so can do -D trickery, etc.**
19870
19871 All MR jobs implement Tool Interface, http://hadoop.apache.org/docs/current/api/org/apache/hadoop/util/Tool.html, so now you can pass properties on command line with the -D flag, etc.
19872
19873
19874 ---
19875
19876 * [HBASE-11556](https://issues.apache.org/jira/browse/HBASE-11556) | *Major* | **Move HTablePool to hbase-thrift module.**
19877
19878 HTablePool was deprecated in 0.98.1 but was still present and usable by apps built against versions before HBase 2.0.  It has been moved and is not intended to be used by user applications, and is now an internal part of the thrift2 proxy server only.
19879
19880
19881 ---
19882
19883 * [HBASE-11548](https://issues.apache.org/jira/browse/HBASE-11548) | *Trivial* | **[PE] Add 'cycling' test N times and unit tests for size/zipf/valueSize calculations**
19884
19885 Adds --cycles=N argument.
19886
19887
19888 ---
19889
19890 * [HBASE-11344](https://issues.apache.org/jira/browse/HBASE-11344) | *Major* | **Hide row keys and such from the web UIs**
19891
19892 Configure "hbase.display.keys" to false (default: true) in the master/regionservers if the row-keys should be hidden in the webUIs (like in the webUI for table details).
19893
19894
19895 ---
19896
19897 * [HBASE-6580](https://issues.apache.org/jira/browse/HBASE-6580) | *Major* | **Deprecate HTablePool in favor of HConnection.getTable(...)**
19898
19899 This issue introduces a few new APIs:
19900 \* HConnectionManager:
19901 {code}
19902     public static HConnection createConnection(Configuration conf)
19903     public static HConnection createConnection(Configuration conf, ExecutorService pool)
19904 {code}
19905 \* HConnection:
19906 {code}
19907     public HTableInterface getTable(String tableName) throws IOException
19908     public HTableInterface getTable(byte[] tableName) throws IOException
19909     public HTableInterface getTable(String tableName, ExecutorService pool) throws IOException
19910     public HTableInterface getTable(byte[] tableName, ExecutorService pool) throws IOException
19911 {code}
19912
19913 By default HConnectionImplementation will create an ExecutorService when needed. The ExecutorService can optionally passed be passed in.
19914 HTableInterfaces are retrieved from the HConnection. By default the HConnection's ExecutorService is used, but optionally that can be overridden for each HTable.
19915
19916
19917 ---
19918
19919 * [HBASE-8450](https://issues.apache.org/jira/browse/HBASE-8450) | *Critical* | **Update hbase-default.xml and general recommendations to better suit current hw, h2, experience, etc.**
19920
19921 Changed defaults:
19922
19923 + max versions now 1 instead of 3
19924 + row blooms on by default (except on .META. table)
19925 + handlers 30 instead of 10
19926 + upped memstore lower limit from .35 to .38
19927 + zookeeper timeout default is 90seconds instead of 180
19928 + client pause is 100ms instead of 1000ms
19929 + retries are now 20 instead of 10 (so overall we still wait same amount of time)
19930 + bulkload retries is 10 instead of infinite
19931 + major compactions are now once a week instead of once every 24 hours; they are staggered so all regionservers do not start compacting at the same time
19932 + blockingstorefiles is 10 instead of 7
19933 + block cache is 0.4 instead of 0.25
19934 + Previous, default for hbase.rootdir was /tmp/hbase-${user.name}.  Now it is ${java.io.tmpdir}/hbase-${user.name} which is usually the same location but may not be (on macos, it points to /var/tmp....).
19935
19936
19937 ---
19938
19939 * [HBASE-4072](https://issues.apache.org/jira/browse/HBASE-4072) | *Major* | **Deprecate/disable and remove support for reading ZooKeeper zoo.cfg files from the classpath**
19940
19941 The Apache ZooKeeper config file zoo.cfg will no longer be read when instantiating a HBaseConfiguration object, as it causes various inconsistency issues. Instead, users have to specify all HBase-relevant ZooKeeper properties in the hbase-site.xml using the various "hbase.zookeeper" prefixed properties. For example, specify "hbase.zookeeper.quorum" to provide a ZK quorum server list.
19942
19943 To enable zoo.cfg reading, for which support may be removed in a future release, set the property "hbase.config.read.zookeeper.config" to true in the hbase-site.xml at the client and servers like so:
19944
19945 \<property\>
19946   \<name\>hbase.config.read.zookeeper.config\</name\>
19947   \<value\>true\</value\>
19948   \<description\>
19949         Set to true to allow HBaseConfiguration to read the
19950         zoo.cfg file for ZooKeeper properties. Switching this to true
19951         is not recommended, since the functionality of reading ZK
19952         properties from a zoo.cfg file has been deprecated.
19953   \</description\>
19954 \</property\>
19955
19956
19957