CentOS_6.5 postgresql 故障切換實現

pgpool-II（http://pgpool.projects.postgresql.org/ ）是一箇中間件，工作在PostgreSQL多服務器和PostgreSQL數據庫客戶端之間。

它提供了以下功能

連接池： pgpool -Ⅱ保存連接到PostgreSQL服務器，並重複利用具有相同屬性的新的連接（即用戶名，數據庫，協議的版本），減少連接的開銷，並提高了系統的整體吞吐量。複製： pgpool - II可以管理多個PostgreSQL服務器。使用複製功能，可以實時備份在 2個或多個物理磁盤上，因此即使在硬盤出故障的時候也不用停止服務。

負載平衡：如果數據庫是複製，任何服務器上執行一個SELECT 查詢將返回相同的結果。 pgpool -Ⅱ採用一個複製功能優勢是，以減少多個服務器之間分配上的SELECT 查詢每個PostgreSQL服務器的負載，提高系統的整體吞吐量。在最好的，性能的提高比例的PostgreSQL服務器的數量。在同一時間有大量用戶的查詢的時候，負載平衡的情況下有最佳的執行。

連接超過限制：有一個關於與 PostgreSQL 的最大並發連接數限制，最大連接數超過後的連接被拒絕。設置最大連接數，但是增加的資源消耗和影響系統性能。 pgpool - II 也有對最大連接數的限制，但額外的連接將被排隊，而不是立即返回錯誤。

並行查詢：使用並行查詢功能，數據可分佈在多個服務器中，以便查詢可以執行所有服務器上同時減少總體執行時間。並行查詢的工作時候，尋找最佳的大規模的數據。

進行pgpool搭建前需要配置好postgresql的流複製，操作步驟參考http://xiajie.blog.51cto.com/6044823/1662222

一、安裝

 wget http://www.pgpool.net/download.php?f=pgpool-II-3.4.0.tar.gz
 tar -zxvf pgpool-3.4.0.tar.gz
 cd pgpool-II-3.4.0/
 ./configure --prefix=/usr/local/pgpool --with-pgsql=path --with-pgsql=/usr/local/pgsql
 make
 make install
 chown postgres.postgres /usr/local/pgpool/ -R
 chown postgres.postgres /usr/src/pgpool-II-3 -R
 mkdir /var/run/pgpool
 chown postgres.postgres /var/run/pgpool/
 #切換postgres 用戶安裝一些函數
 su - postgres
 
 cd /usr/src/pgpool-II-3.4.0/src/sql/
 make
 make install
 cd pgpool-recovery/
 make install
 cd ../pgpool-regclass/
 make install

二、配置

 cd /usr/local/pgpool/etc
 cp pcp.conf.sample pcp.conf
 pg_md5 postgres
 e8a48653851e28c69d0506508fb27fc5
 echo "postgres:e8a48653851e28c69d0506508fb27fc5" >> pcp.conf
 echo "postgres:e8a48653851e28c69d0506508fb27fc5" >> pool_passwd
 cp pool_hba.conf.sample pool_hba.conf
 vim pool_hba.conf
 host    all         postgres    db2                   md5
 
 
 listen_addresses = '*'                    #允許所有主機監聽
 port = 9999                            #訪問端口
 backend_hostname0 = 'db1'                #DBmaster ip
 backend_port0 = 5432                    #DBmaster postgresql 端口
 backend_weight0 = 1                    #權重
 backend_data_directory0 = '/opt/data'    #DBmaster 數據庫目錄
 backend_flag0 = 'ALLOW_TO_FAILOVER'    #允許切換
 
 backend_hostname0 = 'db2'
 backend_port0 = 5432
 backend_weight0 = 1
 backend_data_directory0 = '/opt/data'
 backend_flag0 = 'ALLOW_TO_FAILOVER'
 
 enable_pool_hba = on   #隨意，自由定製，使用 pool_hba.conf 對client的驗證
 pool_passwd = 'pool_passwd' #md5驗證文件
 sr_check_user = 'postgres'  #用來故障切換的用戶
 
 failover_command = '/usr/local/pgsql/bin/failover_command.sh %d %H /tmp/trigger_file'

故障切換腳本

 vim /usr/local/pgsql/bin/failover_command.sh
 
#! /bin/sh
# Failover command for streaming replication.
# This script assumes that DB node 0 is primary, and 1 is standby.
# 
# If standby goes down, do nothing. If primary goes down, create a
# trigger file so that standby takes over primary node.
#
# Arguments: $1: failed node id. $2: new master hostname. $3: path to
# trigger file.
failed_node=$1
new_master=$2
trigger_file=$3
# Do nothing if standby goes down.
#if [ $failed_node = 1 ]; then
#       exit 0;
#fi
# Create the trigger file.
/usr/bin/ssh -T $new_master /bin/touch $trigger_file
exit 0;
chmod +x /usr/local/pgsql/bin/failover_command.sh

三、調試

啓動命令，帶有日誌輸出

[postgres@db1 etc]$ pgpool -nd >/tmp/pgpool.log 2>&1 &
[postgres@db1 etc]$ netstat -ntlp
 (Not all processes could be identified, non-owned process info
  will not be shown, you would have to be root to see it all.)
 Active Internet connections (only servers)
 Proto Recv-Q Send-Q Local Address               Foreign Address             State       PID/Program name   
 tcp        0      0 0.0.0.0:22                  0.0.0.0:*                   LISTEN      -                   
 tcp        0      0 127.0.0.1:25                0.0.0.0:*                   LISTEN      -                   
 tcp        0      0 0.0.0.0:9898                0.0.0.0:*                   LISTEN      16664/pgpool        
 tcp        0      0 0.0.0.0:9999                0.0.0.0:*                   LISTEN      16664/pgpool        
 tcp        0      0 :::22                       :::*                        LISTEN      -                   
 tcp        0      0 ::1:25                      :::*                        LISTEN      -                   
 tcp        0      0 :::9999                     :::*                        LISTEN      16664/pgpool

登錄

 [postgres@db1 etc]$ psql -U postgres -h db1 -p 9999
 psql (9.2.1)
 Type "help" for help.
 postgres=# show pool_nodes;
 node_id | hostname | port | status | lb_weight |  role   
---------+----------+------+--------+-----------+---------
 0       | db1      | 5432 | 2      | 0.500000  | primary
 1       | db2      | 5432 | 2      | 0.500000  | standby
(2 rows)
 postgres=# create database db0;
 CREATE DATABASE

2：啓動
3：死啦

測試可以登錄，可以讀寫

四、故障切換

首先停止DBmaster
[postgres@db1 etc]$ pg_ctl -m fast stop

登錄查看

[postgres@db1 etc]$ psql -U postgres -h db1 -p 9999
 postgres=# show pool_nodes;
 node_id | hostname | port | status | lb_weight |  role   
---------+----------+------+--------+-----------+---------
 0       | db1      | 5432 | 3      | 0.500000  | standby
 1       | db2      | 5432 | 2      | 0.500000  | primary
(2 rows)

此時DBslave顯示的日誌信息

[postgres@db2 data]$ FATAL: replication terminated by primary server
LOG: record with zero length at 0/10000FE0
LOG: trigger file found: /tmp/trigger_file
LOG: redo done at 0/10000F80
LOG: last completed transaction was at log time 2015-06-17 11:05:44.379009+08
LOG: selected new timeline ID: 2
LOG: archive recovery complete
LOG: database system is ready to accept connections
LOG: autovacuum launcher started

DBmaster 已經死啦，狀態切換爲standby；DBslave切換爲primary；測試可讀寫
日誌提示，發現 trigger_file文件，進行切換
DBslave 切換爲主服務器
recover.conf自動更改爲recover.done

[postgres@db2 data]$ ll /opt/data/recovery.done 
recovery.done

故障切換成功

五、恢復DBmaster

1、同步DBmaster至DBslave

pg_basebackup -D $PGDATA -Fp -Xs -v  -h db1 -p 5432 -U postgres

2、修改配置文件

listen_addresses = '*' 
port = 5432
hot_standby = on

3、修改recover文件

mv recover.done  recover.conf
vim recover.conf
primary_conninfo = 'host=172.16.0.133 port=5432 user=postgres'

4、啓動DBslave
5、添加node

pcp_attach_node -d 5 db1 9898 postgres postgres 0
pcp_attach_node -d 5 db1 9898 postgres postgres 1

登錄查看

postgres=# show pool_nodes;
node_id | hostname | port | status | lb_weight |  role   
---------+----------+------+--------+-----------+---------
 0       | db1      | 5432 | 2      | 0.500000  | primary
 1       | db2      | 5432 | 2      | 0.500000  | standby
(2 rows)

恢復正常
Game Over ！

CentOS_6.5 postgresql 故障切換實現

微服務實踐之使用 Visual Studio 2022 調試Dapr 應用程序

wpf附加屬性理解 WPF附加屬性

一些面試總結

linux mono+Apache 環境搭建

linux運維常用命令總結

CentOS 6.5 SVN 服務器搭建文檔

MySQL引擎詳解

https://yachay.unat.edu.pe/blog/index.php?comment_area=format_blog&comment_component=blog&comment_co

linux以太網驅動總結