[Bug 1318441] Re: Precise corosync dies if failed_to_recv is set

Rafael David Tinoco rafael.tinoco at canonical.com
Mon May 12 14:18:38 UTC 2014


Attaching patch.

** Patch added: "corosync_1.4.2-2ubuntu0.2.diff"
   https://bugs.launchpad.net/ubuntu/+source/corosync/+bug/1318441/+attachment/4110673/+files/corosync_1.4.2-2ubuntu0.2.diff

** Description changed:

  [Impact]
  
-  * On certain conditions corosync daemon may quit if it detects itself as not
-    being able to receive messages. The logic asserts the existence of at least
-    one functional node but the node is marking itself as a failed node (not
-    following the specification). It is safe not to assert this if failed_to_recv
-    is set.
+  * On certain conditions *precise* corosync daemon may quit if it detects itself 
+    as not being able to receive messages. The logic asserts the existence of 
+    at least one functional node but the node is marking itself as a failed node 
+    (not following the specification). It is safe not to assert this if
+    failed_to_recv is set.
  
  [Test Case]
  
-  * Using "corosync test suite" on precise-test machine:
+  * Using "corosync test suite" on precise-test machine:
  
-    - Make sure to set ssh keys so precise-test can access precise-cluster-{01,02}.
-    - Make sure only failed-to-receive-crash.sh is executable on "tests" dir.
-    - Make sure precise-cluster-{01,02} nodes have build-dep for corosync installed.
-    - sudo ./run-tests.sh -c flatiron -n "precise-cluster-01 precise-cluster-02"
-    - Check corosync log messages to see precise-cluster-01 corosync dieing. 
+    - Make sure to set ssh keys so precise-test can access precise-cluster-{01,02}.
+    - Make sure only failed-to-receive-crash.sh is executable on "tests" dir.
+    - Make sure precise-cluster-{01,02} nodes have build-dep for corosync installed.
+    - sudo ./run-tests.sh -c flatiron -n "precise-cluster-01 precise-cluster-02"
+    - Check corosync log messages to see precise-cluster-01 corosync dieing.
  
  [Regression Potential]
  
-  * We are not asserting the existence of at least 1 node in corosync cluster
-    anymore. Since there is always 1 node in the cluster (the node itself) it
-    is very unlikely this change alters corosync logic for membership. If it 
-    does it is likely corosync will recover from the error and reestablish new 
-    membership (with 1 or more nodes).
+  * We are not asserting the existence of at least 1 node in corosync cluster
+    anymore. Since there is always 1 node in the cluster (the node itself) it
+    is very unlikely this change alters corosync logic for membership. If it
+    does it is likely corosync will recover from the error and reestablish new
+    membership (with 1 or more nodes).
  
  [Other Info]
  
-  * n/a
+  * n/a

-- 
You received this bug notification because you are a member of Ubuntu
Server Team, which is subscribed to corosync in Ubuntu.
https://bugs.launchpad.net/bugs/1318441

Title:
  Precise corosync dies if failed_to_recv is set

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/corosync/+bug/1318441/+subscriptions



More information about the Ubuntu-server-bugs mailing list