<div dir="ltr"><br><br>On Thu, Feb 25, 2016 at 12:59 AM, Lucas Luengas <<a href="mailto:lucasluengas@gmail.com">lucasluengas@gmail.com</a>> wrote:<br>> Hello.<br>><br>> I am using pgpool-II version 3.4.4 (tataraboshi), with master/slave with<br>> streaming replication, and watchdog. I am using Centos 6.7.<br>> I have 2 nodes: node A and node B.<br>> Failover process is ok.<br>> Virtual ip address is assigned ok between nodes if nodes are restarted. But<br>> I have a problem with watchdog process if one node is disconnected of<br>> network.<br>> If node A is disconected from network (for example cable is disconnected),<br>> then virtual ip address is assigned to node B. That is ok.<br>> After a few minutes, node A is connected to network again. My problem is<br>> than pgpool watchdog process of node A does not connect with node B, but<br>> network is ok (ping ok, netstat listening ports (9999, 9898, ...) are ok,<br>> ...).<br>> And node B does not connect with node A.<br>><br>> Pgpool log of node A shows every 10 seconds: (ip 192.168.0.226 is node B)<br>><br>> pid 16913: LOG:  checking pgpool status by heartbeat<br>> pid 16913: DETAIL:  pgpool: 1 at "<a href="http://192.168.0.226:9999">192.168.0.226:9999</a>" status is down<br>><br>> pgpool log of node B shows every 10 seconds: (ip 192.168.0.224 is node A)<br>><br>> pid 8722: LOG:  checking pgpool status by heartbeat<br>> pid 8722: DETAIL:  pgpool: 1 at "<a href="http://192.168.0.224:9999">192.168.0.224:9999</a>" status is down<br>><br>> I can use pcp_watchdog_info command in both servers. In node A, status of<br>> node B is 4 (down). In node B, status of node A is 4 (down)<br>><br>> If I restart pgpool service in node A, then pgpool watchdog process is ok<br>> again and status are ok for both nodes, and pgpool watchdog is recovered in<br>> both nodes.<br>><br>> What is the problem?<br><br>This is the expected behaviour of pgpool-II watchdog, When the watchdog communication with the other pgpool-II node is lost. The watchdog marks the other node's status as DOWN, And only restart of the other node can make the node rejoin the watchdog cluster.<br>You can manapulate the pgpool-II configuration parameters <i>wd_heartbeat_deadtime</i> and <i>wd_interval</i> to swollow the temporary network glitches but once the watchdog node status is marked as down only the restart of the other pgpool-II can make the node connected again.<div><br><div><br></div><div>Regards</div><div>Muhammad Usama</div><div><br></div><div><div><br>><br>> Thank you for your help.<br>><br>><br>><br>> _______________________________________________<br>> pgpool-general mailing list<br>> <a href="mailto:pgpool-general@pgpool.net">pgpool-general@pgpool.net</a><br>> <a href="http://www.pgpool.net/mailman/listinfo/pgpool-general">http://www.pgpool.net/mailman/listinfo/pgpool-general</a><br>></div></div></div></div>