At Fri, 21 Oct 2011 11:16:11 +0100, Chris Webb wrote: > > MORITA Kazutaka <morita.kazutaka at lab.ntt.co.jp> writes: > > > Thanks for your testing! There was a trivial bug in collie/vdi.c. > > I've sent a patch and pushed it to vdiattr branch. It may solve all > > of your problems. > > Hi. Sorry for the slow reply to this. I've been testing a newly built > checkout from your vdiattr branch, and things definitely seem a lot better. > However, I'm still able to trigger odd problems. Intermittently, after > uploading lots of chunks, I get: > > [24777] collie vdi getattr 094085e9-5ff7-4c9c-b390-6d530332493e claimed > failed to find attr oid, VDI exists already > Exit code: 5 > > I've put the sheep.logs in > > http://cdw.me.uk/tmp/sheep-00.log > http://cdw.me.uk/tmp/sheep-01.log > http://cdw.me.uk/tmp/sheep-02.log > > I think the relevant bit is:- > > Oct 21 09:48:26 cluster_queue_request(288) 0x2bada30 82 > Oct 21 09:48:26 cluster_queue_request(288) 0x2bada30 89 > Oct 21 09:48:26 do_lookup_vdi(238) looking for 99d46d51-2dbf-4531-b516-412d6917f41d 36, 7016f3 > Oct 21 09:48:26 ob_open(464) failed to open /mnt/sheep-0026-00/obj/00000002/207016f3b9d975d9, No such file or directory > Oct 21 09:48:26 read_object(714) fail 207016f3b9d975d9 -2 > Oct 21 09:48:26 __sd_notify_done(1167) unknown message 2 > Oct 21 09:48:26 cluster_queue_request(288) 0x2bada30 82 > Oct 21 09:48:26 cluster_queue_request(288) 0x2bada30 89 > > Would you like me to retry with head of master now these changes have been > merged to the master branch? Yes, I pushed many patches which simplify cluster communications, so the problem might be solved with the current master branch. Anyway, I'll try to find what caused the problem. :) Thanks, Kazutaka |