doc/src/sgml/logical-replication.sgml

   1 <!-- doc/src/sgml/logical-replication.sgml -->
   2
   3 <chapter id="logical-replication">
   4  <title>Logical Replication</title>
   5
   6  <para>
   7   Logical replication is a method of replicating data objects and their
   8   changes, based upon their replication identity (usually a primary key).  We
   9   use the term logical in contrast to physical replication, which uses exact
  10   block addresses and byte-by-byte replication.  PostgreSQL supports both
  11   mechanisms concurrently, see <xref linkend="high-availability"/>.  Logical
  12   replication allows fine-grained control over both data replication and
  13   security.
  14  </para>
  15
  16  <para>
  17   Logical replication uses a <firstterm>publish</firstterm>
  18   and <firstterm>subscribe</firstterm> model with one or
  19   more <firstterm>subscribers</firstterm> subscribing to one or more
  20   <firstterm>publications</firstterm> on a <firstterm>publisher</firstterm>
  21   node.  Subscribers pull data from the publications they subscribe to and may
  22   subsequently re-publish data to allow cascading replication or more complex
  23   configurations.
  24  </para>
  25
  26  <para>
  27   Logical replication of a table typically starts with taking a snapshot
  28   of the data on the publisher database and copying that to the subscriber.
  29   Once that is done, the changes on the publisher are sent to the subscriber
  30   as they occur in real-time.  The subscriber applies the data in the same
  31   order as the publisher so that transactional consistency is guaranteed for
  32   publications within a single subscription.  This method of data replication
  33   is sometimes referred to as transactional replication.
  34  </para>
  35
  36  <para>
  37   The typical use-cases for logical replication are:
  38
  39   <itemizedlist>
  40    <listitem>
  41     <para>
  42      Sending incremental changes in a single database or a subset of a
  43      database to subscribers as they occur.
  44     </para>
  45    </listitem>
  46
  47    <listitem>
  48     <para>
  49      Firing triggers for individual changes as they arrive on the
  50      subscriber.
  51     </para>
  52    </listitem>
  53
  54    <listitem>
  55     <para>
  56      Consolidating multiple databases into a single one (for example for
  57      analytical purposes).
  58     </para>
  59    </listitem>
  60
  61    <listitem>
  62     <para>
  63      Replicating between different major versions of PostgreSQL.
  64     </para>
  65    </listitem>
  66
  67    <listitem>
  68     <para>
  69      Replicating between PostgreSQL instances on different platforms (for
  70      example Linux to Windows)
  71     </para>
  72    </listitem>
  73
  74    <listitem>
  75     <para>
  76      Giving access to replicated data to different groups of users.
  77     </para>
  78    </listitem>
  79
  80    <listitem>
  81     <para>
  82      Sharing a subset of the database between multiple databases.
  83     </para>
  84    </listitem>
  85   </itemizedlist>
  86  </para>
  87
  88  <para>
  89   The subscriber database behaves in the same way as any other PostgreSQL
  90   instance and can be used as a publisher for other databases by defining its
  91   own publications.  When the subscriber is treated as read-only by
  92   application, there will be no conflicts from a single subscription.  On the
  93   other hand, if there are other writes done either by an application or by other
  94   subscribers to the same set of tables, conflicts can arise.
  95  </para>
  96
  97  <sect1 id="logical-replication-publication">
  98   <title>Publication</title>
  99
 100   <para>
 101    A <firstterm>publication</firstterm> can be defined on any physical
 102    replication primary.  The node where a publication is defined is referred to
 103    as <firstterm>publisher</firstterm>.  A publication is a set of changes
 104    generated from a table or a group of tables, and might also be described as
 105    a change set or replication set.  Each publication exists in only one database.
 106   </para>
 107
 108   <para>
 109    Publications are different from schemas and do not affect how the table is
 110    accessed.  Each table can be added to multiple publications if needed.
 111    Publications may currently only contain tables and all tables in schema.
 112    Objects must be added explicitly, except when a publication is created for
 113    <literal>ALL TABLES</literal>.
 114   </para>
 115
 116   <para>
 117    Publications can choose to limit the changes they produce to
 118    any combination of <command>INSERT</command>, <command>UPDATE</command>,
 119    <command>DELETE</command>, and <command>TRUNCATE</command>, similar to how triggers are fired by
 120    particular event types. By default, all operation types are replicated.
 121    These publication specifications apply only for DML operations; they do not affect the initial
 122    data synchronization copy. (Row filters have no effect for
 123    <command>TRUNCATE</command>. See <xref linkend="logical-replication-row-filter"/>).
 124   </para>
 125
 126   <para>
 127    A published table must have a <firstterm>replica identity</firstterm> configured in
 128    order to be able to replicate <command>UPDATE</command>
 129    and <command>DELETE</command> operations, so that appropriate rows to
 130    update or delete can be identified on the subscriber side.  By default,
 131    this is the primary key, if there is one.  Another unique index (with
 132    certain additional requirements) can also be set to be the replica
 133    identity.  If the table does not have any suitable key, then it can be set
 134    to replica identity <literal>FULL</literal>, which means the entire row becomes
 135    the key.  When replica identity <literal>FULL</literal> is specified,
 136    indexes can be used on the subscriber side for searching the rows.  Candidate
 137    indexes must be btree or hash, non-partial, and the leftmost index field must
 138    be a column (not an expression) that references the published table column.
 139    These restrictions on the non-unique index properties adhere to some of the
 140    restrictions that are enforced for primary keys.  If there are no such
 141    suitable indexes, the search on the subscriber side can be very inefficient,
 142    therefore replica identity <literal>FULL</literal> should only be used as a
 143    fallback if no other solution is possible.  If a replica identity other
 144    than <literal>FULL</literal> is set on the publisher side, a replica identity
 145    comprising the same or fewer columns must also be set on the subscriber
 146    side.  See <xref linkend="sql-altertable-replica-identity"/> for details on
 147    how to set the replica identity.  If a table without a replica identity is
 148    added to a publication that replicates <command>UPDATE</command>
 149    or <command>DELETE</command> operations then
 150    subsequent <command>UPDATE</command> or <command>DELETE</command>
 151    operations will cause an error on the publisher.  <command>INSERT</command>
 152    operations can proceed regardless of any replica identity.
 153   </para>
 154
 155   <para>
 156    Every publication can have multiple subscribers.
 157   </para>
 158
 159   <para>
 160    A publication is created using the <link linkend="sql-createpublication"><command>CREATE PUBLICATION</command></link>
 161    command and may later be altered or dropped using corresponding commands.
 162   </para>
 163
 164   <para>
 165    The individual tables can be added and removed dynamically using
 166    <link linkend="sql-alterpublication"><command>ALTER PUBLICATION</command></link>.  Both the <literal>ADD
 167    TABLE</literal> and <literal>DROP TABLE</literal> operations are
 168    transactional; so the table will start or stop replicating at the correct
 169    snapshot once the transaction has committed.
 170   </para>
 171  </sect1>
 172
 173  <sect1 id="logical-replication-subscription">
 174   <title>Subscription</title>
 175
 176   <para>
 177    A <firstterm>subscription</firstterm> is the downstream side of logical
 178    replication.  The node where a subscription is defined is referred to as
 179    the <firstterm>subscriber</firstterm>.  A subscription defines the connection
 180    to another database and set of publications (one or more) to which it wants
 181    to subscribe.
 182   </para>
 183
 184   <para>
 185    The subscriber database behaves in the same way as any other PostgreSQL
 186    instance and can be used as a publisher for other databases by defining its
 187    own publications.
 188   </para>
 189
 190   <para>
 191    A subscriber node may have multiple subscriptions if desired.  It is
 192    possible to define multiple subscriptions between a single
 193    publisher-subscriber pair, in which case care must be taken to ensure
 194    that the subscribed publication objects don't overlap.
 195   </para>
 196
 197   <para>
 198    Each subscription will receive changes via one replication slot (see
 199    <xref linkend="streaming-replication-slots"/>).  Additional replication
 200    slots may be required for the initial data synchronization of
 201    pre-existing table data and those will be dropped at the end of data
 202    synchronization.
 203   </para>
 204
 205   <para>
 206    A logical replication subscription can be a standby for synchronous
 207    replication (see <xref linkend="synchronous-replication"/>).  The standby
 208    name is by default the subscription name.  An alternative name can be
 209    specified as <literal>application_name</literal> in the connection
 210    information of the subscription.
 211   </para>
 212
 213   <para>
 214    Subscriptions are dumped by <command>pg_dump</command> if the current user
 215    is a superuser.  Otherwise a warning is written and subscriptions are
 216    skipped, because non-superusers cannot read all subscription information
 217    from the <structname>pg_subscription</structname> catalog.
 218   </para>
 219
 220   <para>
 221    The subscription is added using <link linkend="sql-createsubscription"><command>CREATE SUBSCRIPTION</command></link> and
 222    can be stopped/resumed at any time using the
 223    <link linkend="sql-altersubscription"><command>ALTER SUBSCRIPTION</command></link> command and removed using
 224    <link linkend="sql-dropsubscription"><command>DROP SUBSCRIPTION</command></link>.
 225   </para>
 226
 227   <para>
 228    When a subscription is dropped and recreated, the synchronization
 229    information is lost.  This means that the data has to be resynchronized
 230    afterwards.
 231   </para>
 232
 233   <para>
 234    The schema definitions are not replicated, and the published tables must
 235    exist on the subscriber.  Only regular tables may be
 236    the target of replication.  For example, you can't replicate to a view.
 237   </para>
 238
 239   <para>
 240    The tables are matched between the publisher and the subscriber using the
 241    fully qualified table name.  Replication to differently-named tables on the
 242    subscriber is not supported.
 243   </para>
 244
 245   <para>
 246    Columns of a table are also matched by name.  The order of columns in the
 247    subscriber table does not need to match that of the publisher.  The data
 248    types of the columns do not need to match, as long as the text
 249    representation of the data can be converted to the target type.  For
 250    example, you can replicate from a column of type <type>integer</type> to a
 251    column of type <type>bigint</type>.  The target table can also have
 252    additional columns not provided by the published table.  Any such columns
 253    will be filled with the default value as specified in the definition of the
 254    target table. However, logical replication in binary format is more
 255    restrictive. See the
 256    <link linkend="sql-createsubscription-params-with-binary"><literal>binary</literal></link>
 257    option of <command>CREATE SUBSCRIPTION</command> for details.
 258   </para>
 259
 260   <sect2 id="logical-replication-subscription-slot">
 261    <title>Replication Slot Management</title>
 262
 263    <para>
 264     As mentioned earlier, each (active) subscription receives changes from a
 265     replication slot on the remote (publishing) side.
 266    </para>
 267    <para>
 268     Additional table synchronization slots are normally transient, created
 269     internally to perform initial table synchronization and dropped
 270     automatically when they are no longer needed. These table synchronization
 271     slots have generated names: <quote><literal>pg_%u_sync_%u_%llu</literal></quote>
 272     (parameters: Subscription <parameter>oid</parameter>,
 273     Table <parameter>relid</parameter>, system identifier <parameter>sysid</parameter>)
 274    </para>
 275    <para>
 276     Normally, the remote replication slot is created automatically when the
 277     subscription is created using <link linkend="sql-createsubscription">
 278     <command>CREATE SUBSCRIPTION</command></link> and it
 279     is dropped automatically when the subscription is dropped using
 280     <link linkend="sql-dropsubscription"><command>DROP SUBSCRIPTION</command></link>.
 281     In some situations, however, it can
 282     be useful or necessary to manipulate the subscription and the underlying
 283     replication slot separately.  Here are some scenarios:
 284
 285     <itemizedlist>
 286      <listitem>
 287       <para>
 288        When creating a subscription, the replication slot already exists.  In
 289        that case, the subscription can be created using
 290        the <literal>create_slot = false</literal> option to associate with the
 291        existing slot.
 292       </para>
 293      </listitem>
 294
 295      <listitem>
 296       <para>
 297        When creating a subscription, the remote host is not reachable or in an
 298        unclear state.  In that case, the subscription can be created using
 299        the <literal>connect = false</literal> option.  The remote host will then not
 300        be contacted at all.  This is what <application>pg_dump</application>
 301        uses.  The remote replication slot will then have to be created
 302        manually before the subscription can be activated.
 303       </para>
 304      </listitem>
 305
 306      <listitem>
 307       <para>
 308        When dropping a subscription, the replication slot should be kept.
 309        This could be useful when the subscriber database is being moved to a
 310        different host and will be activated from there.  In that case,
 311        disassociate the slot from the subscription using
 312        <link linkend="sql-altersubscription"><command>ALTER SUBSCRIPTION</command></link>
 313        before attempting to drop the subscription.
 314       </para>
 315      </listitem>
 316
 317      <listitem>
 318       <para>
 319        When dropping a subscription, the remote host is not reachable.  In
 320        that case, disassociate the slot from the subscription
 321        using <command>ALTER SUBSCRIPTION</command> before attempting to drop
 322        the subscription.  If the remote database instance no longer exists, no
 323        further action is then necessary.  If, however, the remote database
 324        instance is just unreachable, the replication slot (and any still
 325        remaining table synchronization slots) should then be
 326        dropped manually; otherwise it/they would continue to reserve WAL and might
 327        eventually cause the disk to fill up.  Such cases should be carefully
 328        investigated.
 329       </para>
 330      </listitem>
 331     </itemizedlist>
 332    </para>
 333   </sect2>
 334
 335   <sect2 id="logical-replication-subscription-examples">
 336     <title>Examples: Set Up Logical Replication</title>
 337
 338     <para>
 339      Create some test tables on the publisher.
 340 <programlisting>
 341 test_pub=# CREATE TABLE t1(a int, b text, PRIMARY KEY(a));
 342 CREATE TABLE
 343 test_pub=# CREATE TABLE t2(c int, d text, PRIMARY KEY(c));
 344 CREATE TABLE
 345 test_pub=# CREATE TABLE t3(e int, f text, PRIMARY KEY(e));
 346 CREATE TABLE
 347 </programlisting></para>
 348
 349     <para>
 350      Create the same tables on the subscriber.
 351 <programlisting>
 352 test_sub=# CREATE TABLE t1(a int, b text, PRIMARY KEY(a));
 353 CREATE TABLE
 354 test_sub=# CREATE TABLE t2(c int, d text, PRIMARY KEY(c));
 355 CREATE TABLE
 356 test_sub=# CREATE TABLE t3(e int, f text, PRIMARY KEY(e));
 357 CREATE TABLE
 358 </programlisting></para>
 359
 360     <para>
 361      Insert data to the tables at the publisher side.
 362 <programlisting>
 363 test_pub=# INSERT INTO t1 VALUES (1, 'one'), (2, 'two'), (3, 'three');
 364 INSERT 0 3
 365 test_pub=# INSERT INTO t2 VALUES (1, 'A'), (2, 'B'), (3, 'C');
 366 INSERT 0 3
 367 test_pub=# INSERT INTO t3 VALUES (1, 'i'), (2, 'ii'), (3, 'iii');
 368 INSERT 0 3
 369 </programlisting></para>
 370
 371     <para>
 372      Create publications for the tables. The publications <literal>pub2</literal>
 373      and <literal>pub3a</literal> disallow some
 374      <link linkend="sql-createpublication-params-with-publish"><literal>publish</literal></link>
 375      operations. The publication <literal>pub3b</literal> has a row filter (see
 376      <xref linkend="logical-replication-row-filter"/>).
 377 <programlisting>
 378 test_pub=# CREATE PUBLICATION pub1 FOR TABLE t1;
 379 CREATE PUBLICATION
 380 test_pub=# CREATE PUBLICATION pub2 FOR TABLE t2 WITH (publish = 'truncate');
 381 CREATE PUBLICATION
 382 test_pub=# CREATE PUBLICATION pub3a FOR TABLE t3 WITH (publish = 'truncate');
 383 CREATE PUBLICATION
 384 test_pub=# CREATE PUBLICATION pub3b FOR TABLE t3 WHERE (e > 5);
 385 CREATE PUBLICATION
 386 </programlisting></para>
 387
 388     <para>
 389      Create subscriptions for the publications. The subscription
 390      <literal>sub3</literal> subscribes to both <literal>pub3a</literal> and
 391      <literal>pub3b</literal>. All subscriptions will copy initial data by default.
 392 <programlisting>
 393 test_sub=# CREATE SUBSCRIPTION sub1
 394 test_sub-# CONNECTION 'host=localhost dbname=test_pub application_name=sub1'
 395 test_sub-# PUBLICATION pub1;
 396 CREATE SUBSCRIPTION
 397 test_sub=# CREATE SUBSCRIPTION sub2
 398 test_sub-# CONNECTION 'host=localhost dbname=test_pub application_name=sub2'
 399 test_sub-# PUBLICATION pub2;
 400 CREATE SUBSCRIPTION
 401 test_sub=# CREATE SUBSCRIPTION sub3
 402 test_sub-# CONNECTION 'host=localhost dbname=test_pub application_name=sub3'
 403 test_sub-# PUBLICATION pub3a, pub3b;
 404 CREATE SUBSCRIPTION
 405 </programlisting></para>
 406
 407     <para>
 408      Observe that initial table data is copied, regardless of the
 409      <literal>publish</literal> operation of the publication.
 410 <programlisting>
 411 test_sub=# SELECT * FROM t1;
 412  a |   b
 413 ---+-------
 414  1 | one
 415  2 | two
 416  3 | three
 417 (3 rows)
 418
 419 test_sub=# SELECT * FROM t2;
 420  c | d
 421 ---+---
 422  1 | A
 423  2 | B
 424  3 | C
 425 (3 rows)
 426 </programlisting></para>
 427
 428     <para>
 429      Furthermore, because the initial data copy ignores the <literal>publish</literal>
 430      operation, and because publication <literal>pub3a</literal> has no row filter,
 431      it means the copied table <literal>t3</literal> contains all rows even when
 432      they do not match the row filter of publication <literal>pub3b</literal>.
 433 <programlisting>
 434 test_sub=# SELECT * FROM t3;
 435  e |  f
 436 ---+-----
 437  1 | i
 438  2 | ii
 439  3 | iii
 440 (3 rows)
 441 </programlisting></para>
 442
 443    <para>
 444     Insert more data to the tables at the publisher side.
 445 <programlisting>
 446 test_pub=# INSERT INTO t1 VALUES (4, 'four'), (5, 'five'), (6, 'six');
 447 INSERT 0 3
 448 test_pub=# INSERT INTO t2 VALUES (4, 'D'), (5, 'E'), (6, 'F');
 449 INSERT 0 3
 450 test_pub=# INSERT INTO t3 VALUES (4, 'iv'), (5, 'v'), (6, 'vi');
 451 INSERT 0 3
 452 </programlisting></para>
 453
 454    <para>
 455     Now the publisher side data looks like:
 456 <programlisting>
 457 test_pub=# SELECT * FROM t1;
 458  a |   b
 459 ---+-------
 460  1 | one
 461  2 | two
 462  3 | three
 463  4 | four
 464  5 | five
 465  6 | six
 466 (6 rows)
 467
 468 test_pub=# SELECT * FROM t2;
 469  c | d
 470 ---+---
 471  1 | A
 472  2 | B
 473  3 | C
 474  4 | D
 475  5 | E
 476  6 | F
 477 (6 rows)
 478
 479 test_pub=# SELECT * FROM t3;
 480  e |  f
 481 ---+-----
 482  1 | i
 483  2 | ii
 484  3 | iii
 485  4 | iv
 486  5 | v
 487  6 | vi
 488 (6 rows)
 489 </programlisting></para>
 490
 491    <para>
 492     Observe that during normal replication the appropriate
 493     <literal>publish</literal> operations are used. This means publications
 494     <literal>pub2</literal> and <literal>pub3a</literal> will not replicate the
 495     <literal>INSERT</literal>. Also, publication <literal>pub3b</literal> will
 496     only replicate data that matches the row filter of <literal>pub3b</literal>.
 497     Now the subscriber side data looks like:
 498 <programlisting>
 499 test_sub=# SELECT * FROM t1;
 500  a |   b
 501 ---+-------
 502  1 | one
 503  2 | two
 504  3 | three
 505  4 | four
 506  5 | five
 507  6 | six
 508 (6 rows)
 509
 510 test_sub=# SELECT * FROM t2;
 511  c | d
 512 ---+---
 513  1 | A
 514  2 | B
 515  3 | C
 516 (3 rows)
 517
 518 test_sub=# SELECT * FROM t3;
 519  e |  f
 520 ---+-----
 521  1 | i
 522  2 | ii
 523  3 | iii
 524  6 | vi
 525 (4 rows)
 526 </programlisting></para>
 527   </sect2>
 528
 529   <sect2 id="logical-replication-subscription-examples-deferred-slot">
 530    <title>Examples: Deferred Replication Slot Creation</title>
 531
 532    <para>
 533     There are some cases (e.g.
 534     <xref linkend="logical-replication-subscription-slot"/>) where, if the
 535     remote replication slot was not created automatically, the user must create
 536     it manually before the subscription can be activated. The steps to create
 537     the slot and activate the subscription are shown in the following examples.
 538     These examples specify the standard logical decoding output plugin
 539     (<literal>pgoutput</literal>), which is what the built-in logical
 540     replication uses.
 541    </para>
 542    <para>
 543     First, create a publication for the examples to use.
 544 <programlisting>
 545 test_pub=# CREATE PUBLICATION pub1 FOR ALL TABLES;
 546 CREATE PUBLICATION
 547 </programlisting></para>
 548    <para>
 549     Example 1: Where the subscription says <literal>connect = false</literal>
 550    </para>
 551    <para>
 552     <itemizedlist>
 553      <listitem>
 554       <para>
 555        Create the subscription.
 556 <programlisting>
 557 test_sub=# CREATE SUBSCRIPTION sub1
 558 test_sub-# CONNECTION 'host=localhost dbname=test_pub'
 559 test_sub-# PUBLICATION pub1
 560 test_sub-# WITH (connect=false);
 561 WARNING:  subscription was created, but is not connected
 562 HINT:  To initiate replication, you must manually create the replication slot, enable the subscription, and refresh the subscription.
 563 CREATE SUBSCRIPTION
 564 </programlisting></para>
 565      </listitem>
 566      <listitem>
 567       <para>
 568        On the publisher, manually create a slot. Because the name was not
 569        specified during <literal>CREATE SUBSCRIPTION</literal>, the name of the
 570        slot to create is same as the subscription name, e.g. "sub1".
 571 <programlisting>
 572 test_pub=# SELECT * FROM pg_create_logical_replication_slot('sub1', 'pgoutput');
 573  slot_name |    lsn
 574 -----------+-----------
 575  sub1      | 0/19404D0
 576 (1 row)
 577 </programlisting></para>
 578      </listitem>
 579      <listitem>
 580       <para>
 581        On the subscriber, complete the activation of the subscription. After
 582        this the tables of <literal>pub1</literal> will start replicating.
 583 <programlisting>
 584 test_sub=# ALTER SUBSCRIPTION sub1 ENABLE;
 585 ALTER SUBSCRIPTION
 586 test_sub=# ALTER SUBSCRIPTION sub1 REFRESH PUBLICATION;
 587 ALTER SUBSCRIPTION
 588 </programlisting></para>
 589      </listitem>
 590     </itemizedlist>
 591    </para>
 592
 593    <para>
 594     Example 2: Where the subscription says <literal>connect = false</literal>,
 595     but also specifies the
 596     <link linkend="sql-createsubscription-params-with-slot-name"><literal>slot_name</literal></link>
 597     option.
 598     <itemizedlist>
 599      <listitem>
 600       <para>
 601        Create the subscription.
 602 <programlisting>
 603 test_sub=# CREATE SUBSCRIPTION sub1
 604 test_sub-# CONNECTION 'host=localhost dbname=test_pub'
 605 test_sub-# PUBLICATION pub1
 606 test_sub-# WITH (connect=false, slot_name='myslot');
 607 WARNING:  subscription was created, but is not connected
 608 HINT:  To initiate replication, you must manually create the replication slot, enable the subscription, and refresh the subscription.
 609 CREATE SUBSCRIPTION
 610 </programlisting></para>
 611      </listitem>
 612      <listitem>
 613       <para>
 614        On the publisher, manually create a slot using the same name that was
 615        specified during <literal>CREATE SUBSCRIPTION</literal>, e.g. "myslot".
 616 <programlisting>
 617 test_pub=# SELECT * FROM pg_create_logical_replication_slot('myslot', 'pgoutput');
 618  slot_name |    lsn
 619 -----------+-----------
 620  myslot    | 0/19059A0
 621 (1 row)
 622 </programlisting></para>
 623      </listitem>
 624      <listitem>
 625       <para>
 626        On the subscriber, the remaining subscription activation steps are the
 627        same as before.
 628 <programlisting>
 629 test_sub=# ALTER SUBSCRIPTION sub1 ENABLE;
 630 ALTER SUBSCRIPTION
 631 test_sub=# ALTER SUBSCRIPTION sub1 REFRESH PUBLICATION;
 632 ALTER SUBSCRIPTION
 633 </programlisting></para>
 634      </listitem>
 635     </itemizedlist>
 636    </para>
 637
 638    <para>
 639     Example 3: Where the subscription specifies <literal>slot_name = NONE</literal>
 640     <itemizedlist>
 641      <listitem>
 642       <para>
 643        Create the subscription. When <literal>slot_name = NONE</literal> then
 644        <literal>enabled = false</literal>, and
 645        <literal>create_slot = false</literal> are also needed.
 646 <programlisting>
 647 test_sub=# CREATE SUBSCRIPTION sub1
 648 test_sub-# CONNECTION 'host=localhost dbname=test_pub'
 649 test_sub-# PUBLICATION pub1
 650 test_sub-# WITH (slot_name=NONE, enabled=false, create_slot=false);
 651 CREATE SUBSCRIPTION
 652 </programlisting></para>
 653      </listitem>
 654      <listitem>
 655       <para>
 656        On the publisher, manually create a slot using any name, e.g. "myslot".
 657 <programlisting>
 658 test_pub=# SELECT * FROM pg_create_logical_replication_slot('myslot', 'pgoutput');
 659  slot_name |    lsn
 660 -----------+-----------
 661  myslot    | 0/1905930
 662 (1 row)
 663 </programlisting></para>
 664      </listitem>
 665      <listitem>
 666       <para>
 667        On the subscriber, associate the subscription with the slot name just
 668        created.
 669 <programlisting>
 670 test_sub=# ALTER SUBSCRIPTION sub1 SET (slot_name='myslot');
 671 ALTER SUBSCRIPTION
 672 </programlisting></para>
 673      </listitem>
 674      <listitem>
 675       <para>
 676        The remaining subscription activation steps are same as before.
 677 <programlisting>
 678 test_sub=# ALTER SUBSCRIPTION sub1 ENABLE;
 679 ALTER SUBSCRIPTION
 680 test_sub=# ALTER SUBSCRIPTION sub1 REFRESH PUBLICATION;
 681 ALTER SUBSCRIPTION
 682 </programlisting></para>
 683      </listitem>
 684     </itemizedlist>
 685    </para>
 686   </sect2>
 687
 688  </sect1>
 689
 690  <sect1 id="logical-replication-failover">
 691   <title>Logical Replication Failover</title>
 692
 693   <para>
 694    To allow subscriber nodes to continue replicating data from the publisher
 695    node even when the publisher node goes down, there must be a physical standby
 696    corresponding to the publisher node. The logical slots on the primary server
 697    corresponding to the subscriptions can be synchronized to the standby server by
 698    specifying <literal>failover = true</literal> when creating subscriptions. See
 699    <xref linkend="logicaldecoding-replication-slots-synchronization"/> for details.
 700    Enabling the
 701    <link linkend="sql-createsubscription-params-with-failover"><literal>failover</literal></link>
 702    parameter ensures a seamless transition of those subscriptions after the
 703    standby is promoted. They can continue subscribing to publications on the
 704    new primary server.
 705   </para>
 706
 707   <para>
 708    Because the slot synchronization logic copies asynchronously, it is
 709    necessary to confirm that replication slots have been synced to the standby
 710    server before the failover happens. To ensure a successful failover, the
 711    standby server must be ahead of the subscriber. This can be achieved by
 712    configuring
 713    <link linkend="guc-synchronized-standby-slots"><varname>synchronized_standby_slots</varname></link>.
 714   </para>
 715
 716   <para>
 717    To confirm that the standby server is indeed ready for failover, follow these
 718    steps to verify that all necessary logical replication slots have been
 719    synchronized to the standby server:
 720   </para>
 721
 722   <procedure>
 723    <step performance="required">
 724     <para>
 725      On the subscriber node, use the following SQL to identify which replication
 726      slots should be synced to the standby that we plan to promote. This query
 727      will return the relevant replication slots associated with the
 728      failover-enabled subscriptions.
 729 <programlisting>
 730 test_sub=# SELECT
 731                array_agg(quote_literal(s.subslotname)) AS slots
 732            FROM  pg_subscription s
 733            WHERE s.subfailover AND
 734                  s.subslotname IS NOT NULL;
 735  slots
 736 -------
 737  {'sub1','sub2','sub3'}
 738 (1 row)
 739 </programlisting></para>
 740    </step>
 741    <step performance="required">
 742     <para>
 743      On the subscriber node, use the following SQL to identify which table
 744      synchronization slots should be synced to the standby that we plan to promote.
 745      This query needs to be run on each database that includes the failover-enabled
 746      subscription(s). Note that the table sync slot should be synced to the standby
 747      server only if the table copy is finished
 748      (See <xref linkend="catalog-pg-subscription-rel"/>).
 749      We don't need to ensure that the table sync slots are synced in other scenarios
 750      as they will either be dropped or re-created on the new primary server in those
 751      cases.
 752 <programlisting>
 753 test_sub=# SELECT
 754                array_agg(quote_literal(slot_name)) AS slots
 755            FROM
 756            (
 757                SELECT CONCAT('pg_', srsubid, '_sync_', srrelid, '_', ctl.system_identifier) AS slot_name
 758                FROM pg_control_system() ctl, pg_subscription_rel r, pg_subscription s
 759                WHERE r.srsubstate = 'f' AND s.oid = r.srsubid AND s.subfailover
 760            );
 761  slots
 762 -------
 763  {'pg_16394_sync_16385_7394666715149055164'}
 764 (1 row)
 765 </programlisting></para>
 766    </step>
 767    <step performance="required">
 768     <para>
 769      Check that the logical replication slots identified above exist on
 770      the standby server and are ready for failover.
 771 <programlisting>
 772 test_standby=# SELECT slot_name, (synced AND NOT temporary AND NOT conflicting) AS failover_ready
 773                FROM pg_replication_slots
 774                WHERE slot_name IN
 775                    ('sub1','sub2','sub3', 'pg_16394_sync_16385_7394666715149055164');
 776   slot_name                                 | failover_ready
 777 --------------------------------------------+----------------
 778   sub1                                      | t
 779   sub2                                      | t
 780   sub3                                      | t
 781   pg_16394_sync_16385_7394666715149055164   | t
 782 (4 rows)
 783 </programlisting></para>
 784     </step>
 785   </procedure>
 786
 787   <para>
 788    If all the slots are present on the standby server and the result
 789    (<literal>failover_ready</literal>) of the above SQL query is true, then
 790    existing subscriptions can continue subscribing to publications now on the
 791    new primary server.
 792   </para>
 793
 794  </sect1>
 795
 796  <sect1 id="logical-replication-row-filter">
 797   <title>Row Filters</title>
 798
 799   <para>
 800    By default, all data from all published tables will be replicated to the
 801    appropriate subscribers. The replicated data can be reduced by using a
 802    <firstterm>row filter</firstterm>. A user might choose to use row filters
 803    for behavioral, security or performance reasons. If a published table sets a
 804    row filter, a row is replicated only if its data satisfies the row filter
 805    expression. This allows a set of tables to be partially replicated. The row
 806    filter is defined per table. Use a <literal>WHERE</literal> clause after the
 807    table name for each published table that requires data to be filtered out.
 808    The <literal>WHERE</literal> clause must be enclosed by parentheses. See
 809    <xref linkend="sql-createpublication"/> for details.
 810   </para>
 811
 812   <sect2 id="logical-replication-row-filter-rules">
 813    <title>Row Filter Rules</title>
 814
 815    <para>
 816     Row filters are applied <emphasis>before</emphasis> publishing the changes.
 817     If the row filter evaluates to <literal>false</literal> or <literal>NULL</literal>
 818     then the row is not replicated. The <literal>WHERE</literal> clause expression
 819     is evaluated with the same role used for the replication connection (i.e.
 820     the role specified in the
 821     <link linkend="sql-createsubscription-params-connection"><literal>CONNECTION</literal></link>
 822     clause of the <xref linkend="sql-createsubscription"/>). Row filters have
 823     no effect for <command>TRUNCATE</command> command.
 824    </para>
 825
 826   </sect2>
 827
 828   <sect2 id="logical-replication-row-filter-restrictions">
 829    <title>Expression Restrictions</title>
 830
 831    <para>
 832     The <literal>WHERE</literal> clause allows only simple expressions. It
 833     cannot contain user-defined functions, operators, types, and collations,
 834     system column references or non-immutable built-in functions.
 835    </para>
 836
 837    <para>
 838     If a publication publishes <command>UPDATE</command> or
 839     <command>DELETE</command> operations, the row filter <literal>WHERE</literal>
 840     clause must contain only columns that are covered by the replica identity
 841     (see <xref linkend="sql-altertable-replica-identity"/>). If a publication
 842     publishes only <command>INSERT</command> operations, the row filter
 843     <literal>WHERE</literal> clause can use any column.
 844    </para>
 845
 846   </sect2>
 847
 848   <sect2 id="logical-replication-row-filter-transformations">
 849    <title>UPDATE Transformations</title>
 850
 851    <para>
 852     Whenever an <command>UPDATE</command> is processed, the row filter
 853     expression is evaluated for both the old and new row (i.e. using the data
 854     before and after the update). If both evaluations are <literal>true</literal>,
 855     it replicates the <command>UPDATE</command> change. If both evaluations are
 856     <literal>false</literal>, it doesn't replicate the change. If only one of
 857     the old/new rows matches the row filter expression, the <command>UPDATE</command>
 858     is transformed to <command>INSERT</command> or <command>DELETE</command>, to
 859     avoid any data inconsistency. The row on the subscriber should reflect what
 860     is defined by the row filter expression on the publisher.
 861    </para>
 862
 863    <para>
 864     If the old row satisfies the row filter expression (it was sent to the
 865     subscriber) but the new row doesn't, then, from a data consistency
 866     perspective the old row should be removed from the subscriber.
 867     So the <command>UPDATE</command> is transformed into a <command>DELETE</command>.
 868    </para>
 869
 870    <para>
 871     If the old row doesn't satisfy the row filter expression (it wasn't sent
 872     to the subscriber) but the new row does, then, from a data consistency
 873     perspective the new row should be added to the subscriber.
 874     So the <command>UPDATE</command> is transformed into an <command>INSERT</command>.
 875    </para>
 876
 877    <para>
 878     <xref linkend="logical-replication-row-filter-transformations-summary"/>
 879     summarizes the applied transformations.
 880    </para>
 881
 882    <table id="logical-replication-row-filter-transformations-summary">
 883     <title><command>UPDATE</command> Transformation Summary</title>
 884     <tgroup cols="3">
 885     <thead>
 886      <row>
 887       <entry>Old row</entry><entry>New row</entry><entry>Transformation</entry>
 888      </row>
 889     </thead>
 890     <tbody>
 891      <row>
 892       <entry>no match</entry><entry>no match</entry><entry>don't replicate</entry>
 893      </row>
 894      <row>
 895       <entry>no match</entry><entry>match</entry><entry><literal>INSERT</literal></entry>
 896      </row>
 897      <row>
 898       <entry>match</entry><entry>no match</entry><entry><literal>DELETE</literal></entry>
 899      </row>
 900      <row>
 901       <entry>match</entry><entry>match</entry><entry><literal>UPDATE</literal></entry>
 902      </row>
 903     </tbody>
 904    </tgroup>
 905    </table>
 906
 907   </sect2>
 908
 909   <sect2 id="logical-replication-row-filter-partitioned-table">
 910    <title>Partitioned Tables</title>
 911
 912    <para>
 913     If the publication contains a partitioned table, the publication parameter
 914     <link linkend="sql-createpublication-params-with-publish-via-partition-root"><literal>publish_via_partition_root</literal></link>
 915     determines which row filter is used. If <literal>publish_via_partition_root</literal>
 916     is <literal>true</literal>, the <emphasis>root partitioned table's</emphasis>
 917     row filter is used. Otherwise, if <literal>publish_via_partition_root</literal>
 918     is <literal>false</literal> (default), each <emphasis>partition's</emphasis>
 919     row filter is used.
 920    </para>
 921
 922   </sect2>
 923
 924   <sect2 id="logical-replication-row-filter-initial-data-sync">
 925    <title>Initial Data Synchronization</title>
 926
 927    <para>
 928     If the subscription requires copying pre-existing table data
 929     and a publication contains <literal>WHERE</literal> clauses, only data that
 930     satisfies the row filter expressions is copied to the subscriber.
 931    </para>
 932
 933    <para>
 934     If the subscription has several publications in which a table has been
 935     published with different <literal>WHERE</literal> clauses, rows that satisfy
 936     <emphasis>any</emphasis> of the expressions will be copied. See
 937     <xref linkend="logical-replication-row-filter-combining"/> for details.
 938    </para>
 939
 940    <warning>
 941     <para>
 942      Because initial data synchronization does not take into account the
 943      <link linkend="sql-createpublication-params-with-publish"><literal>publish</literal></link>
 944      parameter when copying existing table data, some rows may be copied that
 945      would not be replicated using DML. Refer to
 946      <xref linkend="logical-replication-snapshot"/>, and see
 947      <xref linkend="logical-replication-subscription-examples"/> for examples.
 948     </para>
 949    </warning>
 950
 951    <note>
 952     <para>
 953      If the subscriber is in a release prior to 15, copy pre-existing data
 954      doesn't use row filters even if they are defined in the publication.
 955      This is because old releases can only copy the entire table data.
 956     </para>
 957    </note>
 958
 959   </sect2>
 960
 961   <sect2 id="logical-replication-row-filter-combining">
 962    <title>Combining Multiple Row Filters</title>
 963
 964    <para>
 965     If the subscription has several publications in which the same table has
 966     been published with different row filters (for the same
 967     <link linkend="sql-createpublication-params-with-publish"><literal>publish</literal></link>
 968     operation), those expressions get ORed together, so that rows satisfying
 969     <emphasis>any</emphasis> of the expressions will be replicated. This means all
 970     the other row filters for the same table become redundant if:
 971     <itemizedlist>
 972      <listitem>
 973       <para>
 974        One of the publications has no row filter.
 975       </para>
 976      </listitem>
 977      <listitem>
 978       <para>
 979        One of the publications was created using
 980        <link linkend="sql-createpublication-params-for-all-tables"><literal>FOR ALL TABLES</literal></link>.
 981        This clause does not allow row filters.
 982       </para>
 983      </listitem>
 984      <listitem>
 985       <para>
 986        One of the publications was created using
 987        <link linkend="sql-createpublication-params-for-tables-in-schema"><literal>FOR TABLES IN SCHEMA</literal></link>
 988        and the table belongs to the referred schema. This clause does not allow
 989        row filters.
 990       </para>
 991      </listitem>
 992     </itemizedlist></para>
 993
 994   </sect2>
 995
 996   <sect2 id="logical-replication-row-filter-examples">
 997    <title>Examples</title>
 998
 999    <para>
1000     Create some tables to be used in the following examples.
1001 <programlisting>
1002 test_pub=# CREATE TABLE t1(a int, b int, c text, PRIMARY KEY(a,c));
1003 CREATE TABLE
1004 test_pub=# CREATE TABLE t2(d int, e int, f int, PRIMARY KEY(d));
1005 CREATE TABLE
1006 test_pub=# CREATE TABLE t3(g int, h int, i int, PRIMARY KEY(g));
1007 CREATE TABLE
1008 </programlisting></para>
1009
1010    <para>
1011     Create some publications. Publication <literal>p1</literal> has one table
1012     (<literal>t1</literal>) and that table has a row filter. Publication
1013     <literal>p2</literal> has two tables. Table <literal>t1</literal> has no row
1014     filter, and table <literal>t2</literal> has a row filter. Publication
1015     <literal>p3</literal> has two tables, and both of them have a row filter.
1016 <programlisting>
1017 test_pub=# CREATE PUBLICATION p1 FOR TABLE t1 WHERE (a > 5 AND c = 'NSW');
1018 CREATE PUBLICATION
1019 test_pub=# CREATE PUBLICATION p2 FOR TABLE t1, t2 WHERE (e = 99);
1020 CREATE PUBLICATION
1021 test_pub=# CREATE PUBLICATION p3 FOR TABLE t2 WHERE (d = 10), t3 WHERE (g = 10);
1022 CREATE PUBLICATION
1023 </programlisting></para>
1024
1025    <para>
1026     <command>psql</command> can be used to show the row filter expressions (if
1027     defined) for each publication.
1028 <programlisting>
1029 test_pub=# \dRp+
1030                                Publication p1
1031   Owner   | All tables | Inserts | Updates | Deletes | Truncates | Via root
1032 ----------+------------+---------+---------+---------+-----------+----------
1033  postgres | f          | t       | t       | t       | t         | f
1034 Tables:
1035     "public.t1" WHERE ((a > 5) AND (c = 'NSW'::text))
1036
1037                                Publication p2
1038   Owner   | All tables | Inserts | Updates | Deletes | Truncates | Via root
1039 ----------+------------+---------+---------+---------+-----------+----------
1040  postgres | f          | t       | t       | t       | t         | f
1041 Tables:
1042     "public.t1"
1043     "public.t2" WHERE (e = 99)
1044
1045                                Publication p3
1046   Owner   | All tables | Inserts | Updates | Deletes | Truncates | Via root
1047 ----------+------------+---------+---------+---------+-----------+----------
1048  postgres | f          | t       | t       | t       | t         | f
1049 Tables:
1050     "public.t2" WHERE (d = 10)
1051     "public.t3" WHERE (g = 10)
1052 </programlisting></para>
1053
1054    <para>
1055     <command>psql</command> can be used to show the row filter expressions (if
1056     defined) for each table. See that table <literal>t1</literal> is a member
1057     of two publications, but has a row filter only in <literal>p1</literal>.
1058     See that table <literal>t2</literal> is a member of two publications, and
1059     has a different row filter in each of them.
1060 <programlisting>
1061 test_pub=# \d t1
1062                  Table "public.t1"
1063  Column |  Type   | Collation | Nullable | Default
1064 --------+---------+-----------+----------+---------
1065  a      | integer |           | not null |
1066  b      | integer |           |          |
1067  c      | text    |           | not null |
1068 Indexes:
1069     "t1_pkey" PRIMARY KEY, btree (a, c)
1070 Publications:
1071     "p1" WHERE ((a > 5) AND (c = 'NSW'::text))
1072     "p2"
1073
1074 test_pub=# \d t2
1075                  Table "public.t2"
1076  Column |  Type   | Collation | Nullable | Default
1077 --------+---------+-----------+----------+---------
1078  d      | integer |           | not null |
1079  e      | integer |           |          |
1080  f      | integer |           |          |
1081 Indexes:
1082     "t2_pkey" PRIMARY KEY, btree (d)
1083 Publications:
1084     "p2" WHERE (e = 99)
1085     "p3" WHERE (d = 10)
1086
1087 test_pub=# \d t3
1088                  Table "public.t3"
1089  Column |  Type   | Collation | Nullable | Default
1090 --------+---------+-----------+----------+---------
1091  g      | integer |           | not null |
1092  h      | integer |           |          |
1093  i      | integer |           |          |
1094 Indexes:
1095     "t3_pkey" PRIMARY KEY, btree (g)
1096 Publications:
1097     "p3" WHERE (g = 10)
1098 </programlisting></para>
1099
1100    <para>
1101     On the subscriber node, create a table <literal>t1</literal> with the same
1102     definition as the one on the publisher, and also create the subscription
1103     <literal>s1</literal> that subscribes to the publication <literal>p1</literal>.
1104 <programlisting>
1105 test_sub=# CREATE TABLE t1(a int, b int, c text, PRIMARY KEY(a,c));
1106 CREATE TABLE
1107 test_sub=# CREATE SUBSCRIPTION s1
1108 test_sub-# CONNECTION 'host=localhost dbname=test_pub application_name=s1'
1109 test_sub-# PUBLICATION p1;
1110 CREATE SUBSCRIPTION
1111 </programlisting></para>
1112
1113    <para>
1114     Insert some rows. Only the rows satisfying the <literal>t1 WHERE</literal>
1115     clause of publication <literal>p1</literal> are replicated.
1116 <programlisting>
1117 test_pub=# INSERT INTO t1 VALUES (2, 102, 'NSW');
1118 INSERT 0 1
1119 test_pub=# INSERT INTO t1 VALUES (3, 103, 'QLD');
1120 INSERT 0 1
1121 test_pub=# INSERT INTO t1 VALUES (4, 104, 'VIC');
1122 INSERT 0 1
1123 test_pub=# INSERT INTO t1 VALUES (5, 105, 'ACT');
1124 INSERT 0 1
1125 test_pub=# INSERT INTO t1 VALUES (6, 106, 'NSW');
1126 INSERT 0 1
1127 test_pub=# INSERT INTO t1 VALUES (7, 107, 'NT');
1128 INSERT 0 1
1129 test_pub=# INSERT INTO t1 VALUES (8, 108, 'QLD');
1130 INSERT 0 1
1131 test_pub=# INSERT INTO t1 VALUES (9, 109, 'NSW');
1132 INSERT 0 1
1133
1134 test_pub=# SELECT * FROM t1;
1135  a |  b  |  c
1136 ---+-----+-----
1137  2 | 102 | NSW
1138  3 | 103 | QLD
1139  4 | 104 | VIC
1140  5 | 105 | ACT
1141  6 | 106 | NSW
1142  7 | 107 | NT
1143  8 | 108 | QLD
1144  9 | 109 | NSW
1145 (8 rows)
1146 </programlisting>
1147 <programlisting>
1148 test_sub=# SELECT * FROM t1;
1149  a |  b  |  c
1150 ---+-----+-----
1151  6 | 106 | NSW
1152  9 | 109 | NSW
1153 (2 rows)
1154 </programlisting></para>
1155
1156    <para>
1157     Update some data, where the old and new row values both
1158     satisfy the <literal>t1 WHERE</literal> clause of publication
1159     <literal>p1</literal>. The <command>UPDATE</command> replicates
1160     the change as normal.
1161 <programlisting>
1162 test_pub=# UPDATE t1 SET b = 999 WHERE a = 6;
1163 UPDATE 1
1164
1165 test_pub=# SELECT * FROM t1;
1166  a |  b  |  c
1167 ---+-----+-----
1168  2 | 102 | NSW
1169  3 | 103 | QLD
1170  4 | 104 | VIC
1171  5 | 105 | ACT
1172  7 | 107 | NT
1173  8 | 108 | QLD
1174  9 | 109 | NSW
1175  6 | 999 | NSW
1176 (8 rows)
1177 </programlisting>
1178 <programlisting>
1179 test_sub=# SELECT * FROM t1;
1180  a |  b  |  c
1181 ---+-----+-----
1182  9 | 109 | NSW
1183  6 | 999 | NSW
1184 (2 rows)
1185 </programlisting></para>
1186
1187    <para>
1188     Update some data, where the old row values did not satisfy
1189     the <literal>t1 WHERE</literal> clause of publication <literal>p1</literal>,
1190     but the new row values do satisfy it. The <command>UPDATE</command> is
1191     transformed into an <command>INSERT</command> and the change is replicated.
1192     See the new row on the subscriber.
1193 <programlisting>
1194 test_pub=# UPDATE t1 SET a = 555 WHERE a = 2;
1195 UPDATE 1
1196
1197 test_pub=# SELECT * FROM t1;
1198   a  |  b  |  c
1199 -----+-----+-----
1200    3 | 103 | QLD
1201    4 | 104 | VIC
1202    5 | 105 | ACT
1203    7 | 107 | NT
1204    8 | 108 | QLD
1205    9 | 109 | NSW
1206    6 | 999 | NSW
1207  555 | 102 | NSW
1208 (8 rows)
1209 </programlisting>
1210 <programlisting>
1211 test_sub=# SELECT * FROM t1;
1212   a  |  b  |  c
1213 -----+-----+-----
1214    9 | 109 | NSW
1215    6 | 999 | NSW
1216  555 | 102 | NSW
1217 (3 rows)
1218 </programlisting></para>
1219
1220    <para>
1221     Update some data, where the old row values satisfied
1222     the <literal>t1 WHERE</literal> clause of publication <literal>p1</literal>,
1223     but the new row values do not satisfy it. The <command>UPDATE</command> is
1224     transformed into a <command>DELETE</command> and the change is replicated.
1225     See that the row is removed from the subscriber.
1226 <programlisting>
1227 test_pub=# UPDATE t1 SET c = 'VIC' WHERE a = 9;
1228 UPDATE 1
1229
1230 test_pub=# SELECT * FROM t1;
1231   a  |  b  |  c
1232 -----+-----+-----
1233    3 | 103 | QLD
1234    4 | 104 | VIC
1235    5 | 105 | ACT
1236    7 | 107 | NT
1237    8 | 108 | QLD
1238    6 | 999 | NSW
1239  555 | 102 | NSW
1240    9 | 109 | VIC
1241 (8 rows)
1242 </programlisting>
1243 <programlisting>
1244 test_sub=# SELECT * FROM t1;
1245   a  |  b  |  c
1246 -----+-----+-----
1247    6 | 999 | NSW
1248  555 | 102 | NSW
1249 (2 rows)
1250 </programlisting></para>
1251
1252    <para>
1253     The following examples show how the publication parameter
1254     <link linkend="sql-createpublication-params-with-publish-via-partition-root"><literal>publish_via_partition_root</literal></link>
1255     determines whether the row filter of the parent or child table will be used
1256     in the case of partitioned tables.
1257    </para>
1258
1259    <para>
1260     Create a partitioned table on the publisher.
1261 <programlisting>
1262 test_pub=# CREATE TABLE parent(a int PRIMARY KEY) PARTITION BY RANGE(a);
1263 CREATE TABLE
1264 test_pub=# CREATE TABLE child PARTITION OF parent DEFAULT;
1265 CREATE TABLE
1266 </programlisting>
1267    Create the same tables on the subscriber.
1268 <programlisting>
1269 test_sub=# CREATE TABLE parent(a int PRIMARY KEY) PARTITION BY RANGE(a);
1270 CREATE TABLE
1271 test_sub=# CREATE TABLE child PARTITION OF parent DEFAULT;
1272 CREATE TABLE
1273 </programlisting></para>
1274
1275    <para>
1276     Create a publication <literal>p4</literal>, and then subscribe to it. The
1277     publication parameter <literal>publish_via_partition_root</literal> is set
1278     as true. There are row filters defined on both the partitioned table
1279     (<literal>parent</literal>), and on the partition (<literal>child</literal>).
1280 <programlisting>
1281 test_pub=# CREATE PUBLICATION p4 FOR TABLE parent WHERE (a &lt; 5), child WHERE (a >= 5)
1282 test_pub-# WITH (publish_via_partition_root=true);
1283 CREATE PUBLICATION
1284 </programlisting>
1285 <programlisting>
1286 test_sub=# CREATE SUBSCRIPTION s4
1287 test_sub-# CONNECTION 'host=localhost dbname=test_pub application_name=s4'
1288 test_sub-# PUBLICATION p4;
1289 CREATE SUBSCRIPTION
1290 </programlisting></para>
1291
1292    <para>
1293     Insert some values directly into the <literal>parent</literal> and
1294     <literal>child</literal> tables. They replicate using the row filter of
1295     <literal>parent</literal> (because <literal>publish_via_partition_root</literal>
1296     is true).
1297 <programlisting>
1298 test_pub=# INSERT INTO parent VALUES (2), (4), (6);
1299 INSERT 0 3
1300 test_pub=# INSERT INTO child VALUES (3), (5), (7);
1301 INSERT 0 3
1302
1303 test_pub=# SELECT * FROM parent ORDER BY a;
1304  a
1305 ---
1306  2
1307  3
1308  4
1309  5
1310  6
1311  7
1312 (6 rows)
1313 </programlisting>
1314 <programlisting>
1315 test_sub=# SELECT * FROM parent ORDER BY a;
1316  a
1317 ---
1318  2
1319  3
1320  4
1321 (3 rows)
1322 </programlisting></para>
1323
1324    <para>
1325     Repeat the same test, but with a different value for <literal>publish_via_partition_root</literal>.
1326     The publication parameter <literal>publish_via_partition_root</literal> is
1327     set as false. A row filter is defined on the partition (<literal>child</literal>).
1328 <programlisting>
1329 test_pub=# DROP PUBLICATION p4;
1330 DROP PUBLICATION
1331 test_pub=# CREATE PUBLICATION p4 FOR TABLE parent, child WHERE (a >= 5)
1332 test_pub-# WITH (publish_via_partition_root=false);
1333 CREATE PUBLICATION
1334 </programlisting>
1335 <programlisting>
1336 test_sub=# ALTER SUBSCRIPTION s4 REFRESH PUBLICATION;
1337 ALTER SUBSCRIPTION
1338 </programlisting></para>
1339
1340    <para>
1341     Do the inserts on the publisher same as before. They replicate using the
1342     row filter of <literal>child</literal> (because
1343     <literal>publish_via_partition_root</literal> is false).
1344 <programlisting>
1345 test_pub=# TRUNCATE parent;
1346 TRUNCATE TABLE
1347 test_pub=# INSERT INTO parent VALUES (2), (4), (6);
1348 INSERT 0 3
1349 test_pub=# INSERT INTO child VALUES (3), (5), (7);
1350 INSERT 0 3
1351
1352 test_pub=# SELECT * FROM parent ORDER BY a;
1353  a
1354 ---
1355  2
1356  3
1357  4
1358  5
1359  6
1360  7
1361 (6 rows)
1362 </programlisting>
1363 <programlisting>
1364 test_sub=# SELECT * FROM child ORDER BY a;
1365  a
1366 ---
1367  5
1368  6
1369  7
1370 (3 rows)
1371 </programlisting></para>
1372
1373   </sect2>
1374
1375  </sect1>
1376
1377  <sect1 id="logical-replication-col-lists">
1378   <title>Column Lists</title>
1379
1380   <para>
1381    Each publication can optionally specify which columns of each table are
1382    replicated to subscribers. The table on the subscriber side must have at
1383    least all the columns that are published. If no column list is specified,
1384    then all columns on the publisher are replicated.
1385    See <xref linkend="sql-createpublication"/> for details on the syntax.
1386   </para>
1387
1388   <para>
1389    The choice of columns can be based on behavioral or performance reasons.
1390    However, do not rely on this feature for security: a malicious subscriber
1391    is able to obtain data from columns that are not specifically
1392    published.  If security is a consideration, protections can be applied
1393    at the publisher side.
1394   </para>
1395
1396   <para>
1397    If no column list is specified, any columns added to the table later are
1398    automatically replicated. This means that having a column list which names
1399    all columns is not the same as having no column list at all.
1400   </para>
1401
1402   <para>
1403    A column list can contain only simple column references.  The order
1404    of columns in the list is not preserved.
1405   </para>
1406
1407   <para>
1408    Specifying a column list when the publication also publishes
1409    <link linkend="sql-createpublication-params-for-tables-in-schema"><literal>FOR TABLES IN SCHEMA</literal></link>
1410    is not supported.
1411   </para>
1412
1413   <para>
1414    For partitioned tables, the publication parameter
1415    <link linkend="sql-createpublication-params-with-publish-via-partition-root"><literal>publish_via_partition_root</literal></link>
1416    determines which column list is used. If <literal>publish_via_partition_root</literal>
1417    is <literal>true</literal>, the root partitioned table's column list is
1418    used. Otherwise, if <literal>publish_via_partition_root</literal> is
1419    <literal>false</literal> (the default), each partition's column list is used.
1420   </para>
1421
1422   <para>
1423    If a publication publishes <command>UPDATE</command> or
1424    <command>DELETE</command> operations, any column list must include the
1425    table's replica identity columns (see
1426    <xref linkend="sql-altertable-replica-identity"/>).
1427    If a publication publishes only <command>INSERT</command> operations, then
1428    the column list may omit replica identity columns.
1429   </para>
1430
1431   <para>
1432    Column lists have no effect for the <literal>TRUNCATE</literal> command.
1433   </para>
1434
1435   <para>
1436    During initial data synchronization, only the published columns are
1437    copied.  However, if the subscriber is from a release prior to 15, then
1438    all the columns in the table are copied during initial data synchronization,
1439    ignoring any column lists.
1440   </para>
1441
1442    <warning id="logical-replication-col-list-combining">
1443     <title>Warning: Combining Column Lists from Multiple Publications</title>
1444     <para>
1445      There's currently no support for subscriptions comprising several
1446      publications where the same table has been published with different
1447      column lists.  <xref linkend="sql-createsubscription"/> disallows
1448      creating such subscriptions, but it is still possible to get into
1449      that situation by adding or altering column lists on the publication
1450      side after a subscription has been created.
1451     </para>
1452     <para>
1453      This means changing the column lists of tables on publications that are
1454      already subscribed could lead to errors being thrown on the subscriber
1455      side.
1456     </para>
1457     <para>
1458      If a subscription is affected by this problem, the only way to resume
1459      replication is to adjust one of the column lists on the publication
1460      side so that they all match; and then either recreate the subscription,
1461      or use <link linkend="sql-altersubscription-params-setadddrop-publication">
1462      <literal>ALTER SUBSCRIPTION ... DROP PUBLICATION</literal></link> to
1463      remove one of the offending publications and add it again.
1464     </para>
1465    </warning>
1466
1467   <sect2 id="logical-replication-col-list-examples">
1468    <title>Examples</title>
1469
1470    <para>
1471     Create a table <literal>t1</literal> to be used in the following example.
1472 <programlisting>
1473 test_pub=# CREATE TABLE t1(id int, a text, b text, c text, d text, e text, PRIMARY KEY(id));
1474 CREATE TABLE
1475 </programlisting></para>
1476
1477    <para>
1478     Create a publication <literal>p1</literal>. A column list is defined for
1479     table <literal>t1</literal> to reduce the number of columns that will be
1480     replicated. Notice that the order of column names in the column list does
1481     not matter.
1482 <programlisting>
1483 test_pub=# CREATE PUBLICATION p1 FOR TABLE t1 (id, b, a, d);
1484 CREATE PUBLICATION
1485 </programlisting></para>
1486
1487     <para>
1488      <literal>psql</literal> can be used to show the column lists (if defined)
1489      for each publication.
1490 <programlisting>
1491 test_pub=# \dRp+
1492                                Publication p1
1493   Owner   | All tables | Inserts | Updates | Deletes | Truncates | Via root
1494 ----------+------------+---------+---------+---------+-----------+----------
1495  postgres | f          | t       | t       | t       | t         | f
1496 Tables:
1497     "public.t1" (id, a, b, d)
1498 </programlisting></para>
1499
1500     <para>
1501      <literal>psql</literal> can be used to show the column lists (if defined)
1502      for each table.
1503 <programlisting>
1504 test_pub=# \d t1
1505                  Table "public.t1"
1506  Column |  Type   | Collation | Nullable | Default
1507 --------+---------+-----------+----------+---------
1508  id     | integer |           | not null |
1509  a      | text    |           |          |
1510  b      | text    |           |          |
1511  c      | text    |           |          |
1512  d      | text    |           |          |
1513  e      | text    |           |          |
1514 Indexes:
1515     "t1_pkey" PRIMARY KEY, btree (id)
1516 Publications:
1517     "p1" (id, a, b, d)
1518 </programlisting></para>
1519
1520     <para>
1521      On the subscriber node, create a table <literal>t1</literal> which now
1522      only needs a subset of the columns that were on the publisher table
1523      <literal>t1</literal>, and also create the subscription
1524      <literal>s1</literal> that subscribes to the publication
1525      <literal>p1</literal>.
1526 <programlisting>
1527 test_sub=# CREATE TABLE t1(id int, b text, a text, d text, PRIMARY KEY(id));
1528 CREATE TABLE
1529 test_sub=# CREATE SUBSCRIPTION s1
1530 test_sub-# CONNECTION 'host=localhost dbname=test_pub application_name=s1'
1531 test_sub-# PUBLICATION p1;
1532 CREATE SUBSCRIPTION
1533 </programlisting></para>
1534
1535     <para>
1536      On the publisher node, insert some rows to table <literal>t1</literal>.
1537 <programlisting>
1538 test_pub=# INSERT INTO t1 VALUES(1, 'a-1', 'b-1', 'c-1', 'd-1', 'e-1');
1539 INSERT 0 1
1540 test_pub=# INSERT INTO t1 VALUES(2, 'a-2', 'b-2', 'c-2', 'd-2', 'e-2');
1541 INSERT 0 1
1542 test_pub=# INSERT INTO t1 VALUES(3, 'a-3', 'b-3', 'c-3', 'd-3', 'e-3');
1543 INSERT 0 1
1544 test_pub=# SELECT * FROM t1 ORDER BY id;
1545  id |  a  |  b  |  c  |  d  |  e
1546 ----+-----+-----+-----+-----+-----
1547   1 | a-1 | b-1 | c-1 | d-1 | e-1
1548   2 | a-2 | b-2 | c-2 | d-2 | e-2
1549   3 | a-3 | b-3 | c-3 | d-3 | e-3
1550 (3 rows)
1551 </programlisting></para>
1552
1553     <para>
1554      Only data from the column list of publication <literal>p1</literal> is
1555      replicated.
1556 <programlisting>
1557 test_sub=# SELECT * FROM t1 ORDER BY id;
1558  id |  b  |  a  |  d
1559 ----+-----+-----+-----
1560   1 | b-1 | a-1 | d-1
1561   2 | b-2 | a-2 | d-2
1562   3 | b-3 | a-3 | d-3
1563 (3 rows)
1564 </programlisting></para>
1565
1566   </sect2>
1567
1568  </sect1>
1569
1570  <sect1 id="logical-replication-conflicts">
1571   <title>Conflicts</title>
1572
1573   <para>
1574    Logical replication behaves similarly to normal DML operations in that
1575    the data will be updated even if it was changed locally on the subscriber
1576    node.  If incoming data violates any constraints the replication will
1577    stop.  This is referred to as a <firstterm>conflict</firstterm>.  When
1578    replicating <command>UPDATE</command> or <command>DELETE</command>
1579    operations, missing data is also considered as a
1580    <firstterm>conflict</firstterm>, but does not result in an error and such
1581    operations will simply be skipped.
1582   </para>
1583
1584   <para>
1585    Additional logging is triggered, and the conflict statistics are collected (displayed in the
1586    <link linkend="monitoring-pg-stat-subscription-stats"><structname>pg_stat_subscription_stats</structname></link> view)
1587    in the following <firstterm>conflict</firstterm> cases:
1588    <variablelist>
1589     <varlistentry id="conflict-insert-exists" xreflabel="insert_exists">
1590      <term><literal>insert_exists</literal></term>
1591      <listitem>
1592       <para>
1593        Inserting a row that violates a <literal>NOT DEFERRABLE</literal>
1594        unique constraint. Note that to log the origin and commit
1595        timestamp details of the conflicting key,
1596        <link linkend="guc-track-commit-timestamp"><varname>track_commit_timestamp</varname></link>
1597        should be enabled on the subscriber. In this case, an error will be
1598        raised until the conflict is resolved manually.
1599       </para>
1600      </listitem>
1601     </varlistentry>
1602     <varlistentry id="conflict-update-origin-differs" xreflabel="update_origin_differs">
1603      <term><literal>update_origin_differs</literal></term>
1604      <listitem>
1605       <para>
1606        Updating a row that was previously modified by another origin.
1607        Note that this conflict can only be detected when
1608        <link linkend="guc-track-commit-timestamp"><varname>track_commit_timestamp</varname></link>
1609        is enabled on the subscriber. Currently, the update is always applied
1610        regardless of the origin of the local row.
1611       </para>
1612      </listitem>
1613     </varlistentry>
1614     <varlistentry id="conflict-update-exists" xreflabel="update_exists">
1615      <term><literal>update_exists</literal></term>
1616      <listitem>
1617       <para>
1618        The updated value of a row violates a <literal>NOT DEFERRABLE</literal>
1619        unique constraint. Note that to log the origin and commit
1620        timestamp details of the conflicting key,
1621        <link linkend="guc-track-commit-timestamp"><varname>track_commit_timestamp</varname></link>
1622        should be enabled on the subscriber. In this case, an error will be
1623        raised until the conflict is resolved manually. Note that when updating a
1624        partitioned table, if the updated row value satisfies another partition
1625        constraint resulting in the row being inserted into a new partition, the
1626        <literal>insert_exists</literal> conflict may arise if the new row
1627        violates a <literal>NOT DEFERRABLE</literal> unique constraint.
1628       </para>
1629      </listitem>
1630     </varlistentry>
1631     <varlistentry id="conflict-update-missing" xreflabel="update_missing">
1632      <term><literal>update_missing</literal></term>
1633      <listitem>
1634       <para>
1635        The tuple to be updated was not found. The update will simply be
1636        skipped in this scenario.
1637       </para>
1638      </listitem>
1639     </varlistentry>
1640     <varlistentry id="conflict-delete-origin-differs" xreflabel="delete_origin_differs">
1641      <term><literal>delete_origin_differs</literal></term>
1642      <listitem>
1643       <para>
1644        Deleting a row that was previously modified by another origin. Note that
1645        this conflict can only be detected when
1646        <link linkend="guc-track-commit-timestamp"><varname>track_commit_timestamp</varname></link>
1647        is enabled on the subscriber. Currently, the delete is always applied
1648        regardless of the origin of the local row.
1649       </para>
1650      </listitem>
1651     </varlistentry>
1652     <varlistentry id="conflict-delete-missing" xreflabel="delete_missing">
1653      <term><literal>delete_missing</literal></term>
1654      <listitem>
1655       <para>
1656        The tuple to be deleted was not found. The delete will simply be
1657        skipped in this scenario.
1658       </para>
1659      </listitem>
1660     </varlistentry>
1661    </variablelist>
1662     Note that there are other conflict scenarios, such as exclusion constraint
1663     violations. Currently, we do not provide additional details for them in the
1664     log.
1665   </para>
1666
1667   <para>
1668    The log format for logical replication conflicts is as follows:
1669 <synopsis>
1670 LOG:  conflict detected on relation "<replaceable>schemaname</replaceable>.<replaceable>tablename</replaceable>": conflict=<replaceable>conflict_type</replaceable>
1671 DETAIL:  <replaceable class="parameter">detailed_explanation</replaceable>.
1672 {<replaceable class="parameter">detail_values</replaceable> [; ... ]}.
1673
1674 <phrase>where <replaceable class="parameter">detail_values</replaceable> is one of:</phrase>
1675
1676     <literal>Key</literal> (<replaceable>column_name</replaceable> <optional>, ...</optional>)=(<replaceable>column_value</replaceable> <optional>, ...</optional>)
1677     <literal>existing local tuple</literal> <optional>(<replaceable>column_name</replaceable> <optional>, ...</optional>)=</optional>(<replaceable>column_value</replaceable> <optional>, ...</optional>)
1678     <literal>remote tuple</literal> <optional>(<replaceable>column_name</replaceable> <optional>, ...</optional>)=</optional>(<replaceable>column_value</replaceable> <optional>, ...</optional>)
1679     <literal>replica identity</literal> {(<replaceable>column_name</replaceable> <optional>, ...</optional>)=(<replaceable>column_value</replaceable> <optional>, ...</optional>) | full <optional>(<replaceable>column_name</replaceable> <optional>, ...</optional>)=</optional>(<replaceable>column_value</replaceable> <optional>, ...</optional>)}
1680 </synopsis>
1681
1682    The log provides the following information:
1683    <variablelist>
1684     <varlistentry>
1685      <term><literal>LOG</literal></term>
1686       <listitem>
1687        <itemizedlist>
1688         <listitem>
1689          <para>
1690          <replaceable>schemaname</replaceable>.<replaceable>tablename</replaceable>
1691          identifies the local relation involved in the conflict.
1692          </para>
1693         </listitem>
1694         <listitem>
1695          <para>
1696          <replaceable>conflict_type</replaceable> is the type of conflict that occurred
1697          (e.g., <literal>insert_exists</literal>, <literal>update_exists</literal>).
1698          </para>
1699         </listitem>
1700        </itemizedlist>
1701       </listitem>
1702     </varlistentry>
1703
1704     <varlistentry>
1705      <term><literal>DETAIL</literal></term>
1706       <listitem>
1707       <itemizedlist>
1708        <listitem>
1709         <para>
1710          <replaceable class="parameter">detailed_explanation</replaceable> includes
1711          the origin, transaction ID, and commit timestamp of the transaction that
1712          modified the existing local tuple, if available.
1713         </para>
1714        </listitem>
1715        <listitem>
1716         <para>
1717          The <literal>Key</literal> section includes the key values of the local
1718          tuple that violated a unique constraint for
1719          <literal>insert_exists</literal> or <literal>update_exists</literal>
1720          conflicts.
1721         </para>
1722        </listitem>
1723        <listitem>
1724         <para>
1725          The <literal>existing local tuple</literal> section includes the local
1726          tuple if its origin differs from the remote tuple for
1727          <literal>update_origin_differs</literal> or <literal>delete_origin_differs</literal>
1728          conflicts, or if the key value conflicts with the remote tuple for
1729          <literal>insert_exists</literal> or <literal>update_exists</literal>
1730          conflicts.
1731         </para>
1732        </listitem>
1733        <listitem>
1734         <para>
1735          The <literal>remote tuple</literal> section includes the new tuple from
1736          the remote insert or update operation that caused the conflict. Note that
1737          for an update operation, the column value of the new tuple will be null
1738          if the value is unchanged and toasted.
1739         </para>
1740        </listitem>
1741        <listitem>
1742         <para>
1743          The <literal>replica identity</literal> section includes the replica
1744          identity key values that were used to search for the existing local
1745          tuple to be updated or deleted. This may include the full tuple value
1746          if the local relation is marked with
1747          <link linkend="sql-altertable-replica-identity-full"><literal>REPLICA IDENTITY FULL</literal></link>.
1748         </para>
1749        </listitem>
1750        <listitem>
1751         <para>
1752          <replaceable class="parameter">column_name</replaceable> is the column name.
1753          For <literal>existing local tuple</literal>, <literal>remote tuple</literal>,
1754          and <literal>replica identity full</literal> cases, column names are
1755          logged only if the user lacks the privilege to access all columns of
1756          the table. If column names are present, they appear in the same order
1757          as the corresponding column values.
1758         </para>
1759        </listitem>
1760        <listitem>
1761         <para>
1762          <replaceable class="parameter">column_value</replaceable> is the column value.
1763          The large column values are truncated to 64 bytes.
1764         </para>
1765        </listitem>
1766       </itemizedlist>
1767      </listitem>
1768     </varlistentry>
1769    </variablelist>
1770   </para>
1771
1772   <para>
1773    Logical replication operations are performed with the privileges of the role
1774    which owns the subscription.  Permissions failures on target tables will
1775    cause replication conflicts, as will enabled
1776    <link linkend="ddl-rowsecurity">row-level security</link> on target tables
1777    that the subscription owner is subject to, without regard to whether any
1778    policy would ordinarily reject the <command>INSERT</command>,
1779    <command>UPDATE</command>, <command>DELETE</command> or
1780    <command>TRUNCATE</command> which is being replicated.  This restriction on
1781    row-level security may be lifted in a future version of
1782    <productname>PostgreSQL</productname>.
1783   </para>
1784
1785   <para>
1786    A conflict that produces an error will stop the replication; it must be
1787    resolved manually by the user.  Details about the conflict can be found in
1788    the subscriber's server log.
1789   </para>
1790
1791   <para>
1792    The resolution can be done either by changing data or permissions on the subscriber so
1793    that it does not conflict with the incoming change or by skipping the
1794    transaction that conflicts with the existing data.  When a conflict produces
1795    an error, the replication won't proceed, and the logical replication worker will
1796    emit the following kind of message to the subscriber's server log:
1797 <screen>
1798 ERROR:  conflict detected on relation "public.test": conflict=insert_exists
1799 DETAIL:  Key already exists in unique index "t_pkey", which was modified locally in transaction 740 at 2024-06-26 10:47:04.727375+08.
1800 Key (c)=(1); existing local tuple (1, 'local'); remote tuple (1, 'remote').
1801 CONTEXT:  processing remote data for replication origin "pg_16395" during "INSERT" for replication target relation "public.test" in transaction 725 finished at 0/14C0378
1802 </screen>
1803    The LSN of the transaction that contains the change violating the constraint and
1804    the replication origin name can be found from the server log (LSN 0/14C0378 and
1805    replication origin <literal>pg_16395</literal> in the above case).  The
1806    transaction that produced the conflict can be skipped by using
1807    <link linkend="sql-altersubscription-params-skip"><command>ALTER SUBSCRIPTION ... SKIP</command></link>
1808    with the finish LSN
1809    (i.e., LSN 0/14C0378).  The finish LSN could be an LSN at which the transaction
1810    is committed or prepared on the publisher.  Alternatively, the transaction can
1811    also be skipped by calling the <link linkend="pg-replication-origin-advance">
1812    <function>pg_replication_origin_advance()</function></link> function.
1813    Before using this function, the subscription needs to be disabled temporarily
1814    either by <link linkend="sql-altersubscription-params-disable">
1815    <command>ALTER SUBSCRIPTION ... DISABLE</command></link> or, the
1816    subscription can be used with the
1817    <link linkend="sql-createsubscription-params-with-disable-on-error"><literal>disable_on_error</literal></link>
1818    option. Then, you can use <function>pg_replication_origin_advance()</function>
1819    function with the <parameter>node_name</parameter> (i.e., <literal>pg_16395</literal>)
1820    and the next LSN of the finish LSN (i.e., 0/14C0379).  The current position of
1821    origins can be seen in the <link linkend="view-pg-replication-origin-status">
1822    <structname>pg_replication_origin_status</structname></link> system view.
1823    Please note that skipping the whole transaction includes skipping changes that
1824    might not violate any constraint.  This can easily make the subscriber
1825    inconsistent.
1826    The additional details regarding conflicting rows, such as their origin and
1827    commit timestamp can be seen in the <literal>DETAIL</literal> line of the
1828    log. But note that this information is only available when
1829    <link linkend="guc-track-commit-timestamp"><varname>track_commit_timestamp</varname></link>
1830    is enabled on the subscriber. Users can use this information to decide
1831    whether to retain the local change or adopt the remote alteration. For
1832    instance, the <literal>DETAIL</literal> line in the above log indicates that
1833    the existing row was modified locally. Users can manually perform a
1834    remote-change-win.
1835   </para>
1836
1837   <para>
1838    When the
1839    <link linkend="sql-createsubscription-params-with-streaming"><literal>streaming</literal></link>
1840    mode is <literal>parallel</literal>, the finish LSN of failed transactions
1841    may not be logged. In that case, it may be necessary to change the streaming
1842    mode to <literal>on</literal> or <literal>off</literal> and cause the same
1843    conflicts again so the finish LSN of the failed transaction will be written
1844    to the server log. For the usage of finish LSN, please refer to <link
1845    linkend="sql-altersubscription"><command>ALTER SUBSCRIPTION ...
1846    SKIP</command></link>.
1847   </para>
1848  </sect1>
1849
1850  <sect1 id="logical-replication-restrictions">
1851   <title>Restrictions</title>
1852
1853   <para>
1854    Logical replication currently has the following restrictions or missing
1855    functionality.  These might be addressed in future releases.
1856   </para>
1857
1858   <itemizedlist>
1859    <listitem>
1860     <para>
1861      The database schema and DDL commands are not replicated.  The initial
1862      schema can be copied by hand using <literal>pg_dump
1863      --schema-only</literal>.  Subsequent schema changes would need to be kept
1864      in sync manually.  (Note, however, that there is no need for the schemas
1865      to be absolutely the same on both sides.)  Logical replication is robust
1866      when schema definitions change in a live database: When the schema is
1867      changed on the publisher and replicated data starts arriving at the
1868      subscriber but does not fit into the table schema, replication will error
1869      until the schema is updated.  In many cases, intermittent errors can be
1870      avoided by applying additive schema changes to the subscriber first.
1871     </para>
1872    </listitem>
1873
1874    <listitem>
1875     <para>
1876      Sequence data is not replicated.  The data in serial or identity columns
1877      backed by sequences will of course be replicated as part of the table,
1878      but the sequence itself would still show the start value on the
1879      subscriber.  If the subscriber is used as a read-only database, then this
1880      should typically not be a problem.  If, however, some kind of switchover
1881      or failover to the subscriber database is intended, then the sequences
1882      would need to be updated to the latest values, either by copying the
1883      current data from the publisher (perhaps
1884      using <command>pg_dump</command>) or by determining a sufficiently high
1885      value from the tables themselves.
1886     </para>
1887    </listitem>
1888
1889    <listitem>
1890     <para>
1891      Replication of <command>TRUNCATE</command> commands is supported, but
1892      some care must be taken when truncating groups of tables connected by
1893      foreign keys.  When replicating a truncate action, the subscriber will
1894      truncate the same group of tables that was truncated on the publisher,
1895      either explicitly specified or implicitly collected via
1896      <literal>CASCADE</literal>, minus tables that are not part of the
1897      subscription.  This will work correctly if all affected tables are part
1898      of the same subscription.  But if some tables to be truncated on the
1899      subscriber have foreign-key links to tables that are not part of the same
1900      (or any) subscription, then the application of the truncate action on the
1901      subscriber will fail.
1902     </para>
1903    </listitem>
1904
1905    <listitem>
1906     <para>
1907      Large objects (see <xref linkend="largeobjects"/>) are not replicated.
1908      There is no workaround for that, other than storing data in normal
1909      tables.
1910     </para>
1911    </listitem>
1912
1913    <listitem>
1914     <para>
1915      Replication is only supported by tables, including partitioned tables.
1916      Attempts to replicate other types of relations, such as views, materialized
1917      views, or foreign tables, will result in an error.
1918     </para>
1919    </listitem>
1920
1921    <listitem>
1922     <para>
1923      When replicating between partitioned tables, the actual replication
1924      originates, by default, from the leaf partitions on the publisher, so
1925      partitions on the publisher must also exist on the subscriber as valid
1926      target tables. (They could either be leaf partitions themselves, or they
1927      could be further subpartitioned, or they could even be independent
1928      tables.)  Publications can also specify that changes are to be replicated
1929      using the identity and schema of the partitioned root table instead of
1930      that of the individual leaf partitions in which the changes actually
1931      originate (see
1932      <link linkend="sql-createpublication-params-with-publish-via-partition-root"><literal>publish_via_partition_root</literal></link>
1933      parameter of <command>CREATE PUBLICATION</command>).
1934     </para>
1935    </listitem>
1936
1937    <listitem>
1938     <para>
1939      When using
1940      <link linkend="sql-altertable-replica-identity-full"><literal>REPLICA IDENTITY FULL</literal></link>
1941      on published tables, it is important to note that the <literal>UPDATE</literal>
1942      and <literal>DELETE</literal> operations cannot be applied to subscribers
1943      if the tables include attributes with datatypes (such as point or box)
1944      that do not have a default operator class for B-tree or Hash. However,
1945      this limitation can be overcome by ensuring that the table has a primary
1946      key or replica identity defined for it.
1947     </para>
1948    </listitem>
1949   </itemizedlist>
1950  </sect1>
1951
1952  <sect1 id="logical-replication-architecture">
1953   <title>Architecture</title>
1954
1955   <para>
1956    Logical replication starts by copying a snapshot of the data on the
1957    publisher database.  Once that is done, changes on the publisher are sent
1958    to the subscriber as they occur in real time.  The subscriber applies data
1959    in the order in which commits were made on the publisher so that
1960    transactional consistency is guaranteed for the publications within any
1961    single subscription.
1962   </para>
1963
1964   <para>
1965    Logical replication is built with an architecture similar to physical
1966    streaming replication (see <xref linkend="streaming-replication"/>).  It is
1967    implemented by <literal>walsender</literal> and <literal>apply</literal>
1968    processes.  The walsender process starts logical decoding (described
1969    in <xref linkend="logicaldecoding"/>) of the WAL and loads the standard
1970    logical decoding output plugin (<literal>pgoutput</literal>).  The plugin
1971    transforms the changes read
1972    from WAL to the logical replication protocol
1973    (see <xref linkend="protocol-logical-replication"/>) and filters the data
1974    according to the publication specification.  The data is then continuously
1975    transferred using the streaming replication protocol to the apply worker,
1976    which maps the data to local tables and applies the individual changes as
1977    they are received, in correct transactional order.
1978   </para>
1979
1980   <para>
1981    The apply process on the subscriber database always runs with
1982    <link linkend="guc-session-replication-role"><varname>session_replication_role</varname></link>
1983    set to <literal>replica</literal>. This means that, by default,
1984    triggers and rules will not fire on a subscriber. Users can optionally choose to
1985    enable triggers and rules on a table using the
1986    <link linkend="sql-altertable"><command>ALTER TABLE</command></link> command
1987    and the <literal>ENABLE TRIGGER</literal> and <literal>ENABLE RULE</literal>
1988    clauses.
1989   </para>
1990
1991   <para>
1992    The logical replication apply process currently only fires row triggers,
1993    not statement triggers.  The initial table synchronization, however, is
1994    implemented like a <command>COPY</command> command and thus fires both row
1995    and statement triggers for <command>INSERT</command>.
1996   </para>
1997
1998   <sect2 id="logical-replication-snapshot">
1999     <title>Initial Snapshot</title>
2000     <para>
2001      The initial data in existing subscribed tables are snapshotted and
2002      copied in a parallel instance of a special kind of apply process.
2003      This process will create its own replication slot and copy the existing
2004      data.  As soon as the copy is finished the table contents will become
2005      visible to other backends.  Once existing data is copied, the worker
2006      enters synchronization mode, which ensures that the table is brought
2007      up to a synchronized state with the main apply process by streaming
2008      any changes that happened during the initial data copy using standard
2009      logical replication.  During this synchronization phase, the changes
2010      are applied and committed in the same order as they happened on the
2011      publisher.  Once synchronization is done, control of the
2012      replication of the table is given back to the main apply process where
2013      replication continues as normal.
2014     </para>
2015     <note>
2016      <para>
2017       The publication
2018       <link linkend="sql-createpublication-params-with-publish"><literal>publish</literal></link>
2019       parameter only affects what DML operations will be replicated. The
2020       initial data synchronization does not take this parameter into account
2021       when copying the existing table data.
2022      </para>
2023     </note>
2024   </sect2>
2025  </sect1>
2026
2027  <sect1 id="logical-replication-monitoring">
2028   <title>Monitoring</title>
2029
2030   <para>
2031    Because logical replication is based on a similar architecture as
2032    <link linkend="streaming-replication">physical streaming replication</link>,
2033    the monitoring on a publication node is similar to monitoring of a
2034    physical replication primary
2035    (see <xref linkend="streaming-replication-monitoring"/>).
2036   </para>
2037
2038   <para>
2039    The monitoring information about subscription is visible in
2040    <link linkend="monitoring-pg-stat-subscription">
2041    <structname>pg_stat_subscription</structname></link>.
2042    This view contains one row for every subscription worker.  A subscription
2043    can have zero or more active subscription workers depending on its state.
2044   </para>
2045
2046   <para>
2047    Normally, there is a single apply process running for an enabled
2048    subscription.  A disabled subscription or a crashed subscription will have
2049    zero rows in this view.  If the initial data synchronization of any
2050    table is in progress, there will be additional workers for the tables
2051    being synchronized. Moreover, if the
2052    <link linkend="sql-createsubscription-params-with-streaming"><literal>streaming</literal></link>
2053    transaction is applied in parallel, there may be additional parallel apply
2054    workers.
2055   </para>
2056  </sect1>
2057
2058  <sect1 id="logical-replication-security">
2059   <title>Security</title>
2060
2061   <para>
2062    The role used for the replication connection must have
2063    the <literal>REPLICATION</literal> attribute (or be a superuser).  If the
2064    role lacks <literal>SUPERUSER</literal> and <literal>BYPASSRLS</literal>,
2065    publisher row security policies can execute.  If the role does not trust
2066    all table owners, include <literal>options=-crow_security=off</literal> in
2067    the connection string; if a table owner then adds a row security policy,
2068    that setting will cause replication to halt rather than execute the policy.
2069    Access for the role must be configured in <filename>pg_hba.conf</filename>
2070    and it must have the <literal>LOGIN</literal> attribute.
2071   </para>
2072
2073   <para>
2074    In order to be able to copy the initial table data, the role used for the
2075    replication connection must have the <literal>SELECT</literal> privilege on
2076    a published table (or be a superuser).
2077   </para>
2078
2079   <para>
2080    To create a publication, the user must have the <literal>CREATE</literal>
2081    privilege in the database.
2082   </para>
2083
2084   <para>
2085    To add tables to a publication, the user must have ownership rights on the
2086    table. To add all tables in schema to a publication, the user must be a
2087    superuser. To create a publication that publishes all tables or all tables in
2088    schema automatically, the user must be a superuser.
2089   </para>
2090
2091   <para>
2092    There are currently no privileges on publications.  Any subscription (that
2093    is able to connect) can access any publication.  Thus, if you intend to
2094    hide some information from particular subscribers, such as by using row
2095    filters or column lists, or by not adding the whole table to the
2096    publication, be aware that other publications in the same database could
2097    expose the same information.  Publication privileges might be added to
2098    <productname>PostgreSQL</productname> in the future to allow for
2099    finer-grained access control.
2100   </para>
2101
2102   <para>
2103    To create a subscription, the user must have the privileges of
2104    the <literal>pg_create_subscription</literal> role, as well as
2105    <literal>CREATE</literal> privileges on the database.
2106   </para>
2107
2108   <para>
2109    The subscription apply process will, at a session level, run with the
2110    privileges of the subscription owner. However, when performing an insert,
2111    update, delete, or truncate operation on a particular table, it will switch
2112    roles to the table owner and perform the operation with the table owner's
2113    privileges. This means that the subscription owner needs to be able to
2114    <literal>SET ROLE</literal> to each role that owns a replicated table.
2115   </para>
2116
2117   <para>
2118    If the subscription has been configured with
2119    <literal>run_as_owner = true</literal>, then no user switching will
2120    occur. Instead, all operations will be performed with the permissions
2121    of the subscription owner. In this case, the subscription owner only
2122    needs privileges to <literal>SELECT</literal>, <literal>INSERT</literal>,
2123    <literal>UPDATE</literal>, and <literal>DELETE</literal> from the
2124    target table, and does not need privileges to <literal>SET ROLE</literal>
2125    to the table owner. However, this also means that any user who owns
2126    a table into which replication is happening can execute arbitrary code with
2127    the privileges of the subscription owner. For example, they could do this
2128    by simply attaching a trigger to one of the tables which they own.
2129    Because it is usually undesirable to allow one role to freely assume
2130    the privileges of another, this option should be avoided unless user
2131    security within the database is of no concern.
2132   </para>
2133
2134   <para>
2135    On the publisher, privileges are only checked once at the start of a
2136    replication connection and are not re-checked as each change record is read.
2137   </para>
2138
2139   <para>
2140    On the subscriber, the subscription owner's privileges are re-checked for
2141    each transaction when applied. If a worker is in the process of applying a
2142    transaction when the ownership of the subscription is changed by a
2143    concurrent transaction, the application of the current transaction will
2144    continue under the old owner's privileges.
2145   </para>
2146  </sect1>
2147
2148  <sect1 id="logical-replication-config">
2149   <title>Configuration Settings</title>
2150
2151   <para>
2152    Logical replication requires several configuration options to be set. Most
2153    options are relevant only on one side of the replication. However,
2154    <varname>max_replication_slots</varname> is used on both the publisher and
2155    the subscriber, but it has a different meaning for each.
2156   </para>
2157
2158   <sect2 id="logical-replication-config-publisher">
2159    <title>Publishers</title>
2160
2161    <para>
2162     <link linkend="guc-wal-level"><varname>wal_level</varname></link> must be
2163     set to <literal>logical</literal>.
2164    </para>
2165
2166    <para>
2167     <link linkend="guc-max-replication-slots"><varname>max_replication_slots</varname></link>
2168     must be set to at least the number of subscriptions expected to connect,
2169     plus some reserve for table synchronization.
2170    </para>
2171
2172    <para>
2173     <link linkend="guc-max-wal-senders"><varname>max_wal_senders</varname></link>
2174     should be set to at least the same as
2175     <varname>max_replication_slots</varname>, plus the number of physical
2176     replicas that are connected at the same time.
2177    </para>
2178
2179    <para>
2180     Logical replication walsender is also affected by
2181     <link linkend="guc-wal-sender-timeout"><varname>wal_sender_timeout</varname></link>.
2182    </para>
2183
2184   </sect2>
2185
2186   <sect2 id="logical-replication-config-subscriber">
2187    <title>Subscribers</title>
2188
2189    <para>
2190     <link linkend="guc-max-replication-slots-subscriber"><varname>max_replication_slots</varname></link>
2191     must be set to at least the number of subscriptions that will be added to
2192     the subscriber, plus some reserve for table synchronization.
2193    </para>
2194
2195    <para>
2196     <link linkend="guc-max-logical-replication-workers"><varname>max_logical_replication_workers</varname></link>
2197     must be set to at least the number of subscriptions (for leader apply
2198     workers), plus some reserve for the table synchronization workers and
2199     parallel apply workers.
2200    </para>
2201
2202    <para>
2203     <link linkend="guc-max-worker-processes"><varname>max_worker_processes</varname></link>
2204     may need to be adjusted to accommodate for replication workers, at least
2205     (<link linkend="guc-max-logical-replication-workers"><varname>max_logical_replication_workers</varname></link>
2206     + <literal>1</literal>). Note, some extensions and parallel queries also
2207     take worker slots from <varname>max_worker_processes</varname>.
2208    </para>
2209
2210    <para>
2211     <link linkend="guc-max-sync-workers-per-subscription"><varname>max_sync_workers_per_subscription</varname></link>
2212      controls the amount of parallelism of the initial data copy during the
2213      subscription initialization or when new tables are added.
2214    </para>
2215
2216    <para>
2217     <link linkend="guc-max-parallel-apply-workers-per-subscription"><varname>max_parallel_apply_workers_per_subscription</varname></link>
2218      controls the amount of parallelism for streaming of in-progress
2219      transactions with subscription parameter
2220      <literal>streaming = parallel</literal>.
2221    </para>
2222
2223    <para>
2224     Logical replication workers are also affected by
2225     <link linkend="guc-wal-receiver-timeout"><varname>wal_receiver_timeout</varname></link>,
2226     <link linkend="guc-wal-receiver-status-interval"><varname>wal_receiver_status_interval</varname></link> and
2227     <link linkend="guc-wal-retrieve-retry-interval"><varname>wal_retrieve_retry_interval</varname></link>.
2228    </para>
2229
2230   </sect2>
2231
2232  </sect1>
2233
2234  <sect1 id="logical-replication-quick-setup">
2235   <title>Quick Setup</title>
2236
2237   <para>
2238    First set the configuration options in <filename>postgresql.conf</filename>:
2239 <programlisting>
2240 wal_level = logical
2241 </programlisting>
2242    The other required settings have default values that are sufficient for a
2243    basic setup.
2244   </para>
2245
2246   <para>
2247    <filename>pg_hba.conf</filename> needs to be adjusted to allow replication
2248    (the values here depend on your actual network configuration and user you
2249    want to use for connecting):
2250 <programlisting>
2251 host     all     repuser     0.0.0.0/0     md5
2252 </programlisting>
2253   </para>
2254
2255   <para>
2256    Then on the publisher database:
2257 <programlisting>
2258 CREATE PUBLICATION mypub FOR TABLE users, departments;
2259 </programlisting>
2260   </para>
2261
2262   <para>
2263    And on the subscriber database:
2264 <programlisting>
2265 CREATE SUBSCRIPTION mysub CONNECTION 'dbname=foo host=bar user=repuser' PUBLICATION mypub;
2266 </programlisting>
2267   </para>
2268
2269   <para>
2270    The above will start the replication process, which synchronizes the
2271    initial table contents of the tables <literal>users</literal> and
2272    <literal>departments</literal> and then starts replicating
2273    incremental changes to those tables.
2274   </para>
2275  </sect1>
2276 </chapter>