[Sheepdog] corosync crashes
Steven Dake
sdake at redhat.com
Mon Oct 4 19:37:40 CEST 2010
On 10/04/2010 10:32 AM, Robert Terhaar wrote:
> On Oct 4, 2010 10:40 AM, "Steven Dake" <sdake at redhat.com> wrote:
> > On 10/03/2010 11:43 PM, Robert Terhaar wrote:
> >> Hi All,
> >>
> >> I'm having some problems with corosync crashes in Fedora 14.
> >> Occasionally after pushing a lot of traffic thru sheepdog, corosync
> >> crashes, and writes to the logs a fairly unhelpful message
> >> "corosync[23367]: [TOTEM ] FAILED TO RECEIVE"
> >>
> >> I've attached my very basic corosync.conf below. Does this config
> >> look ok?
> >>
> >> compatibility: whitetank
> >>
> >
> > This is caused by delayed multicast messages. There is a patch in the
> > bug report below that addresses this issue.
> >
> > https://bugzilla.redhat.com/show_bug.cgi?id=619496
> >
> > Another workaround that might be helpful if you don't want to rebuild
> > your own package is changing the fail_recv_const (goes in totem
> > directive) to a very large value such as 5000.
> >
> > Regards
> > -steve
> >
> >
>
> Thanks Steve!
>
> Setting fail_recv_const:5000 seems to have fixed the problem. Is there
> any chance that the corosync patch will make it into Fedora 14's
> corosync package?
>
Yes it is coming. Corosync upstream releases are blocked on an
impending svn->git source control conversion. Hope to have that sorted
out in 1-2 weeks.
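
For reference, a minimal sketch of the fail_recv_const workaround as it
might appear in corosync.conf. Only the fail_recv_const value of 5000 comes
from this thread; the version line is an assumption about a typical totem
section, and any existing interface block and other totem settings would
stay as they are.

```
totem {
        # Assumed: the usual version line of a totem section.
        version: 2

        # Workaround from this thread: a very large value, so delayed
        # multicast messages are less likely to trigger
        # "[TOTEM ] FAILED TO RECEIVE".
        fail_recv_const: 5000

        # Keep your existing interface { ... } block and other totem
        # settings here unchanged.
}
```

corosync has to be restarted for the new value to take effect. This only
works around the problem until a package with the patch is available.
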
> Also, I think it would be very helpful for new users if there was a
> sheepdog wiki article with a recommended corosync.conf example. Are there
> any plans for a sheepdog wiki?
>
Not sure on this point.
Regards
-steve