[sheepdog-users] Locking problems on 0.9
Hitoshi Mitake
mitake.hitoshi at lab.ntt.co.jp
Wed Dec 3 11:10:27 CET 2014
At Thu, 20 Nov 2014 16:08:29 +0100 (CET),
Micha Kersloot wrote:
>
> Hi,
>
> ----- Original Message -----
> > From: "Hitoshi Mitake" <mitake.hitoshi at gmail.com>
> > To: "Micha Kersloot" <micha at kovoks.nl>
> > Cc: "Lista sheepdog user" <sheepdog-users at lists.wpkg.org>
> > Sent: Thursday, November 20, 2014 3:54:01 PM
> > Subject: Re: [sheepdog-users] Locking problems on 0.9
>
> > On Tue, Nov 11, 2014 at 6:08 PM, Micha Kersloot <micha at kovoks.nl> wrote:
> >> Hi Hitoshi,
> >>
> >> thank you for your time.
> >>
> >>
> >> Cluster status: Waiting for other nodes to join cluster
> >>
> >> Cluster created at Tue Nov 4 14:22:03 2014
> >>
> >> Epoch Time Version
> >> 2014-11-04 16:55:02 9 [10.10.0.21:7001, 10.10.0.22:7001, 10.10.0.30:7001]
> >> 2014-11-04 16:54:56 8 [10.10.0.21:7001, 10.10.0.30:7001]
> >> 2014-11-04 16:54:33 7 [10.10.0.21:7001, 10.10.0.22:7001, 10.10.0.30:7001]
> >> 1970-01-01 01:00:00 6 []
> >
> > The 6th epoch above is likely the root cause of the problem. An epoch
> > with no nodes (which clearly cannot happen in a normal situation) can
> > wipe data under sheepdog's recovery logic.
> > I'll prepare a patch later to avoid the creation of such an epoch.
> >
> > BTW, can you see such an epoch with no nodes on the other sheep daemons?
> >
>
> That would be on 10.10.0.21:
>
> Cluster status: Waiting for other nodes to join cluster
>
> Cluster created at Tue Nov 4 14:22:03 2014
>
> Epoch Time Version
> 2014-11-04 16:55:02 9 [10.10.0.21:7001, 10.10.0.22:7001, 10.10.0.30:7001]
> 2014-11-04 16:54:56 8 [10.10.0.21:7001, 10.10.0.30:7001]
> 2014-11-04 16:54:33 7 [10.10.0.21:7001, 10.10.0.22:7001, 10.10.0.30:7001]
> 2014-11-04 16:54:18 6 [10.10.0.21:7001, 10.10.0.22:7001]
> 2014-11-04 16:52:45 5 [10.10.0.21:7001, 10.10.0.22:7001, 10.10.0.30:7001]
> 1970-01-01 01:00:00 4 []
> 2014-11-04 16:47:43 3 [10.10.0.21:7001, 10.10.0.22:7001, 10.10.0.30:7001]
> 2014-11-04 16:46:43 2 [10.10.0.21:7001, 10.10.0.30:7001]
> 1970-01-01 01:00:00 1 []
>
>
>
>
> ---------
> on 10.10.0.22:
> Cluster status: Waiting for other nodes to join cluster
>
> Cluster created at Tue Nov 4 14:22:03 2014
>
> Epoch Time Version
> 2014-11-04 16:55:02 9 [10.10.0.21:7001, 10.10.0.22:7001, 10.10.0.30:7001]
> 1970-01-01 01:00:00 8 []
> 2014-11-04 16:54:33 7 [10.10.0.21:7001, 10.10.0.22:7001, 10.10.0.30:7001]
> 2014-11-04 16:54:18 6 [10.10.0.21:7001, 10.10.0.22:7001]
> 2014-11-04 16:52:45 5 [10.10.0.21:7001, 10.10.0.22:7001, 10.10.0.30:7001]
> 2014-11-04 16:52:32 4 [10.10.0.22:7001, 10.10.0.30:7001]
> 2014-11-04 16:47:43 3 [10.10.0.21:7001, 10.10.0.22:7001, 10.10.0.30:7001]
> 1970-01-01 01:00:00 2 []
> 1970-01-01 01:00:00 1 []
>
>
> ---------------
> on 10.10.0.30:
> Cluster status: Waiting for other nodes to join cluster
>
> Cluster created at Tue Nov 4 14:22:03 2014
>
> Epoch Time Version
> 2014-11-04 16:55:02 9 [10.10.0.21:7001, 10.10.0.22:7001, 10.10.0.30:7001]
> 2014-11-04 16:54:56 8 [10.10.0.21:7001, 10.10.0.30:7001]
> 2014-11-04 16:54:33 7 [10.10.0.21:7001, 10.10.0.22:7001, 10.10.0.30:7001]
> 1970-01-01 01:00:00 6 []
> 2014-11-04 16:52:45 5 [10.10.0.21:7001, 10.10.0.22:7001, 10.10.0.30:7001]
> 2014-11-04 16:52:32 4 [10.10.0.22:7001, 10.10.0.30:7001]
> 2014-11-04 16:47:43 3 [10.10.0.21:7001, 10.10.0.22:7001, 10.10.0.30:7001]
> 2014-11-04 16:46:43 2 [10.10.0.21:7001, 10.10.0.30:7001]
> 2014-11-04 14:22:03 1 [10.10.0.30:7001]
>
Thanks for the additional information. It is helpful.
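
For anyone who wants to check their own nodes for the same symptom, here is a minimal sketch that picks out epochs with an empty node list from a listing like the ones quoted above. It is not part of sheepdog. The names parse_epochs and empty_epochs are hypothetical, and the line format (timestamp, epoch number, bracketed node list) is assumed from the quoted output only, so it may need adjusting for other versions.

```python
import re

# One line of the epoch listing as quoted in this thread, e.g.
# "2014-11-04 16:54:56 8 [10.10.0.21:7001, 10.10.0.30:7001]"
# Leading mail quote markers ("> ", "> >> ") are tolerated.
# Assumption: this layout is inferred from the quoted listings only.
EPOCH_LINE = re.compile(
    r"^(?:>\s*)*(\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2})\s+(\d+)\s+\[(.*)\]\s*$"
)


def parse_epochs(text):
    """Return {epoch_number: [node, ...]} for every epoch line in text."""
    epochs = {}
    for line in text.splitlines():
        m = EPOCH_LINE.match(line.strip())
        if m is None:
            continue  # header lines, blank lines, separators
        _, number, nodes = m.groups()
        epochs[int(number)] = [n.strip() for n in nodes.split(",") if n.strip()]
    return epochs


def empty_epochs(text):
    """Return the sorted epoch numbers whose node list is empty."""
    return sorted(n for n, nodes in parse_epochs(text).items() if not nodes)


# Excerpt of the listing reported for 10.10.0.22 above.
sample = """\
> 2014-11-04 16:55:02 9 [10.10.0.21:7001, 10.10.0.22:7001, 10.10.0.30:7001]
> 1970-01-01 01:00:00 8 []
> 2014-11-04 16:54:33 7 [10.10.0.21:7001, 10.10.0.22:7001, 10.10.0.30:7001]
> 1970-01-01 01:00:00 2 []
> 1970-01-01 01:00:00 1 []
"""

print(empty_epochs(sample))  # prints [1, 2, 8]
```

Running it over the listing from each node shows which epochs are empty on which node.
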
Thanks,
Hitoshi