On 10/04/2010 10:32 AM, Robert Terhaar wrote: > On Oct 4, 2010 10:40 AM, "Steven Dake" <sdake at redhat.com > <mailto:sdake at redhat.com>> wrote: > > On 10/03/2010 11:43 PM, Robert Terhaar wrote: > >> Hi All, > >> > >> I'm having some problems with corosync crashes in Fedora 14. > >> Occasionally after pushing a lot of traffic thru sheepdog, corosync > >> crashes, and writes to the logs a fairly unhelpful message > >> "corosync[23367]: [TOTEM ] FAILED TO RECEIVE" > >> > >> I've attached my very basic corosync.conf below. Does this config > look ok? > >> > >> compatibility: whitetank > >> > > > > This is caused by delayed multicast messages. There is a patch in the > > below bz to address this issue. > > > > https://bugzilla.redhat.com/show_bug.cgi?id=619496 > > > > Another workaround that might be helpful if you don't want to rebuild > > your own package is changing the fail_recv_const (goes in totem > > directive) to a very large value such as 5000. > > > > Regards > > -steve > > > > > > Thanks Steve! > > Setting fail_recv_const:5000 seems to have fixed the problem. Is there > any chance that the corosync patch will make it into Fedora 14's > corosync package? > Yes it is coming. Corosync upstream releases are blocked on an impending svn->git source control conversion. Hope to have that sorted out in 1-2 weeks. > Also, I think it would be very helpful for new users if there was a > sheepdog wiki article with a recommended corosync.conf example. Is there > any plans for a sheepdog wiki? > Not sure on this point. Regards -steve |