[sheepdog-users] vdi problem

Valerio Pachera sirio81 at gmail.com
Tue Jun 11 15:07:54 CEST 2013


I open a new tread for this problem.

Summary: a guest named 'backup' was running on top of the host "sheepdog004".
Guest was using cache; cluster was still version 0.5.6xxx.
The host died.
I run the guest on another host and created one snapshot.
When sheepdog004 was back online it was showing the vdi of the
snapshot without the 's' and with id 0.
Note: cache has been delete from sheepdog004 without being flushed
(because guest was already running on another host).
I create a second and third snapshot and removed the first.
I tired kill the node sheepdog004 and insert it backup as soon as
possible, but vdi list was still wrong.
I also update sheepdog to version 0.6.0 and then to 0.6.0_23_gd0b2857.
Today I killed the node again, clear all its data and insert it back.
Now it's showing the right vdi list.

I run the guest "backup" on the node "sheepdog001" and it seems to be
working fine (during recovery) but I noticed it's showing a 'c' near
the 's' of snapshot id 3! [*]

When I shutdown the guest, the letter 'c' disappear.
Note also that, before clearing the data from sheepdog004, the guest
was unable to shutdown (qemu was always frozen at the last step 'I
will now halt'.
Right now I'm not getting DRDY error from the guest like before, but
they show up when lot's of data is going to be written usually.

The "right" vdi list.
root at sheepdog002:~# collie vdi list
  Name        Id    Size    Used  Shared    Creation time   VDI id  Copies  Tag
s backup_data     2  1.0 TB  134 GB  890 GB 2013-06-04 18:18   c8d129
 2        second
s backup_data     3  1.0 TB  334 GB  690 GB 2013-06-05 17:29   c8d12a
 2         third
  backup_data     0  1.0 TB  0.0 MB  1.0 TB 2013-06-10 12:38   c8d12b   2

The "wrong" vdi list:
root at sheepdog004:~# collie vdi list
  Name        Id    Size    Used  Shared    Creation time   VDI id  Copies  Tag
  backup_data     0  1.0 TB  134 GB  890 GB 2013-06-04 18:18   c8d129   2
s backup_data     3  1.0 TB  334 GB  690 GB 2013-06-05 17:29   c8d12a
 2         third
  backup_data     0  1.0 TB  0.0 MB  1.0 TB 2013-06-10 12:38   c8d12b   2

The "right" vdi list form sheepdog004 after removing it's data:
root at sheepdog004:~# collie vdi list
  Name        Id    Size    Used  Shared    Creation time   VDI id  Copies  Tag
s backup_data     2  1.0 TB  134 GB  890 GB 2013-06-04 18:18   c8d129
   2        second
s backup_data     3  1.0 TB  334 GB  690 GB 2013-06-05 17:29   c8d12a
   2         third
  backup_data     0  1.0 TB  137 GB  887 GB 2013-06-10 12:38   c8d12b     2

[*]
The "strange" vdi list during recovery
root at sheepdog001:~# collie vdi list
  Name        Id    Size    Used  Shared    Creation time   VDI id
Copies  Tag
s backup_data     2  1.0 TB  134 GB  890 GB 2013-06-04 18:18   c8d129
   2        second
cs backup_data     3  1.0 TB  334 GB  690 GB 2013-06-05 17:29   c8d12a
    2         third
  backup_data     0  1.0 TB  137 GB  887 GB 2013-06-10 12:38   c8d12b
   2

I would like to understand if data are safe in this vdi or not.
It's 1T of data!

I also need to take a snapshot each day.



More information about the sheepdog-users mailing list