[sheepdog-users] Failed to read object 80244e5600000000 No object found

Andrew J. Hobbs ajhobbs at desu.edu
Mon May 5 20:08:28 CEST 2014


I've been using this in production for a while, with plenty of
externally caused hiccups, and haven't seen any evidence of the behavior
you're describing.  I can't really imagine it being some kind of
mysterious, recurring, long-term bug.  I'd examine your setup and what
unique factors you have in play that might be causing issues.

On 05/05/2014 06:53 AM, Struan Bartlett wrote:
> Hi Liu
>
> Thanks for your help.
>
> On 05/05/2014 04:23, Liu Yuan wrote:
>> On Sun, May 04, 2014 at 08:11:02PM +0100, Struan Bartlett wrote:
>>> Having been running sheepdog-0.8.0 successfully for a number of weeks,
>>> earlier last month I suddenly found that my cluster would no longer
>>> launch. After reattempting launch this evening, I finally got the
>>> cluster launched, but to make matters worse it now looks like
>>> sheepdog has deleted all underlying objects! Here is some data:
>>>
>> There might be some bug in the 0.8.0 version. The latest v0.8.1 will probably solve
>> your problems.
> Ok, I might try that. But when I last tried sheepdog (a year or two ago,
> so I think it was 0.5.3) I experienced a similar problem - my object
> store being mysteriously wiped - so I fear this bug might have been
> lurking for a long time.
>>> Can anyone explain what has happened, and why sheepdog has just now
>>> deleted all the objects associated with my cluster, I assume rendering
>>> it completely unrecoverable? Please let me know if there are other
>>> investigations I should perform.
>> Hmmm, this looks like a fatal problem. Sheepdog will try to put the objects into the stale
>> directory temporarily if your cluster wasn't shut down normally beforehand. Then
>> all the nodes will try to do a recovery.
> I've checked and the stale directories are now all empty.
>> Is there any method to reproduce the problem?
> Good question. No, and I'm not sure how to attempt to reproduce the
> problem either. It's conceivable that one or more of the nodes rebooted
> but it's unlikely that two rebooted at the same time. But I'd expect
> sheepdog to be able to deal with either of those situations without
> corrupting the object store.
>
> Struan
>
>
