[sheepdog] Bug in v0.4.0 ? get_sha1_file(80) PANIC: Operation not supported

Jens WEBER jweber at tek2b.org
Fri Jul 13 22:04:56 CEST 2012


found it, node 2 only gateway has not a file system with user_xattr option, /var/lib/sheepdog was on a aufs
with ext4 and option user_xattr it works as expected
is this really necessary for a only gateway node?

Thanks Jens 

> build from tag v0.4.0
> 
> 2 node setup
> collie cluster format -b farm -c 1 -H
> 
> node 1 works, but if node 2 start up
> root at none:/var/lib/sheepdog# sheep -f -D -g -p 7000 -z 999 -d /var/lib/sheepdog/
> Aborted
> 
> sheep.log node 2:
> Jul 13 21:28:59 [main] create_cluster(1092) use corosync cluster driver as 
> default
> Jul 13 21:28:59 [main] create_cluster(1121) zone id = 999
> Jul 13 21:28:59 [main] send_join_request(961) IPv4 ip:10.255.255.12 port:7000
> Jul 13 21:28:59 [main] main(272) sheepdog daemon (version 0.4.0) started
> Jul 13 21:28:59 [main] cdrv_cpg_confchg(568) mem:2, joined:1, left:0
> Jul 13 21:28:59 [main] cdrv_cpg_confchg(634) Not promoting because member is 
> not in our event list.
> Jul 13 21:28:59 [main] cdrv_cpg_deliver(454) 0
> Jul 13 21:28:59 [main] cdrv_cpg_deliver(454) 1
> Jul 13 21:28:59 [main] sd_join_handler(990) join IPv4 ip:10.255.255.12 port:7000
> Jul 13 21:28:59 [main] sd_join_handler(992) [0] IPv4 ip:10.255.255.11 port:7000
> Jul 13 21:28:59 [main] sd_join_handler(992) [1] IPv4 ip:10.255.255.12 port:7000
> Jul 13 21:28:59 [main] update_cluster_info(778) status = 1, epoch = 9, 
> finished: 0
> Jul 13 21:28:59 [main] farm_init(333) use farm store driver
> Jul 13 21:28:59 [main] init_sys_vdi_bitmap(306) found the working directory 
> /var/lib/sheepdog//obj/
> Jul 13 21:28:59 [main] get_trunk_sha1(212) 0
> Jul 13 21:28:59 [main] get_trunk_sha1(212) 0
> Jul 13 21:28:59 [main] get_trunk_sha1(212) 0
> Jul 13 21:28:59 [main] get_trunk_sha1(212) 0
> Jul 13 21:28:59 [main] get_trunk_sha1(212) 0
> Jul 13 21:28:59 [main] get_trunk_sha1(212) 0
> Jul 13 21:28:59 [main] get_trunk_sha1(212) 0
> Jul 13 21:28:59 [main] get_trunk_sha1(212) 0
> Jul 13 21:28:59 [main] cleanup_working_dir(635) try clean up working dir
> Jul 13 21:28:59 [main] trunk_reset(402) clean
> Jul 13 21:28:59 [main] sockfd_cache_add_group(242) 2
> Jul 13 21:28:59 [main] update_epoch_log(55) update epoch: 10, 2
> Jul 13 21:28:59 [block] connect_to(256) 13, 10.255.255.11:7000
> Jul 13 21:28:59 [block] get_vdi_bitmap_from(663) 10.255.255.11:7000
> Jul 13 21:28:59 [recovery] prepare_object_list(553) 10
> Jul 13 21:28:59 [recovery] fetch_object_list(475) 10.255.255.11 7000
> Jul 13 21:28:59 [main] sd_join_handler(1001) join Sheepdog cluster
> Jul 13 21:28:59 [recovery] connect_to(256) 14, 10.255.255.11:7000
> Jul 13 21:28:59 [recovery] fetch_object_list(502) 0
> Jul 13 21:28:59 [recovery] prepare_object_list(582) 0
> Jul 13 21:28:59 [main] farm_end_recover(578) old epoch 9
> Jul 13 21:28:59 [main] get_sha1_file(80) PANIC: Operation not supported
> Jul 13 21:28:59 [main] crash_handler(408) sheep pid 10887 exited unexpectedly.
> 
> sheep.log node 1:
> Jul 13 21:29:04 [main] cdrv_cpg_confchg(568) mem:2, joined:1, left:0
> Jul 13 21:29:04 [main] cdrv_cpg_deliver(454) 0
> Jul 13 21:29:04 [main] sd_check_join_cb(918) 10.255.255.12:7000: ret = 0x0, 
> cluster_status = 0x1
> Jul 13 21:29:04 [main] cdrv_cpg_deliver(454) 1
> Jul 13 21:29:04 [main] sd_join_handler(990) join IPv4 ip:10.255.255.12 port:7000
> Jul 13 21:29:04 [main] sd_join_handler(992) [0] IPv4 ip:10.255.255.11 port:7000
> Jul 13 21:29:04 [main] sd_join_handler(992) [1] IPv4 ip:10.255.255.12 port:7000
> Jul 13 21:29:04 [main] update_cluster_info(778) status = 1, epoch = 9, 
> finished: 1
> Jul 13 21:29:04 [main] update_epoch_log(55) update epoch: 10, 2
> Jul 13 21:29:04 [main] sockfd_cache_add(271) 10.255.255.12:7000, count 2
> Jul 13 21:29:04 [recovery] prepare_object_list(553) 10
> Jul 13 21:29:04 [recovery] fetch_object_list(475) 10.255.255.11 7000
> Jul 13 21:29:04 [main] listen_handler(819) accepted a new connection: 14
> Jul 13 21:29:04 [main] listen_handler(819) accepted a new connection: 15
> Jul 13 21:29:04 [main] client_rx_handler(577) connection from: 14, 
> 10.255.255.12:59058
> Jul 13 21:29:04 [main] queue_request(323) 15
> Jul 13 21:29:04 [main] listen_handler(819) accepted a new connection: 16
> Jul 13 21:29:04 [main] client_rx_handler(577) connection from: 15, 
> 10.255.255.12:59059
> Jul 13 21:29:04 [main] queue_request(323) a1
> Jul 13 21:29:04 [recovery] connect_to(256) 13, 10.255.255.11:7000
> Jul 13 21:29:04 [main] client_rx_handler(577) connection from: 16, 
> 10.255.255.11:49744
> Jul 13 21:29:04 [main] queue_request(323) a1
> Jul 13 21:29:04 [io 7] do_process_work(990) a1, 9 , 10
> Jul 13 21:29:04 [main] client_tx_handler(663) connection from: 15, 
> 10.255.255.12:59059
> Jul 13 21:29:04 [io 8] do_process_work(990) a1, 9 , 10
> Jul 13 21:29:04 [main] client_tx_handler(663) connection from: 16, 
> 10.255.255.11:49744
> Jul 13 21:29:04 [recovery] fetch_object_list(502) 0
> Jul 13 21:29:04 [recovery] prepare_object_list(582) 0
> Jul 13 21:29:04 [main] client_handler(764) connection seems to be dead
> Jul 13 21:29:04 [main] clear_client(703) refcnt:0, fd:16, 10.255.255.11:49744
> Jul 13 21:29:04 [main] destroy_client(672) connection from: 10.255.255.11:49744
> Jul 13 21:29:04 [main] farm_end_recover(578) old epoch 9
> Jul 13 21:29:04 [main] snap_file_write(178) epoch: 9, sha1: 
> b3e63f3cd6ab79dab88e88ca7aed8b4600849459
> Jul 13 21:29:04 [main] finish_recovery(349) recovery complete: new epoch 10
> Jul 13 21:29:04 [io 9] do_process_work(990) 15, 0 , 10
> Jul 13 21:29:04 [main] client_handler(764) connection seems to be dead
> Jul 13 21:29:04 [main] clear_client(703) refcnt:0, fd:15, 10.255.255.12:59059
> Jul 13 21:29:04 [main] destroy_client(672) connection from: 10.255.255.12:59059
> Jul 13 21:29:04 [main] client_handler(764) connection seems to be dead
> Jul 13 21:29:04 [main] clear_client(703) refcnt:0, fd:14, 10.255.255.12:59058
> Jul 13 21:29:04 [main] destroy_client(672) connection from: 10.255.255.12:59058
> Jul 13 21:29:04 [main] cdrv_cpg_confchg(568) mem:1, joined:0, left:1
> Jul 13 21:29:04 [main] sd_leave_handler(1059) leave IPv4 ip:10.255.255.12 
> port:7000
> Jul 13 21:29:04 [main] sd_leave_handler(1061) [0] IPv4 ip:10.255.255.11 
> port:7000
> Jul 13 21:29:04 [main] update_epoch_log(55) update epoch: 11, 1
> Jul 13 21:29:04 [main] sockfd_cache_del(218) 10.255.255.12:7000, count 1
> Jul 13 21:29:04 [recovery] prepare_object_list(553) 11
> Jul 13 21:29:04 [recovery] fetch_object_list(475) 10.255.255.11 7000
> Jul 13 21:29:04 [main] listen_handler(819) accepted a new connection: 14
> Jul 13 21:29:04 [recovery] connect_to(256) 13, 10.255.255.11:7000
> Jul 13 21:29:04 [main] client_rx_handler(577) connection from: 14, 
> 10.255.255.11:49745
> Jul 13 21:29:04 [main] queue_request(323) a1
> Jul 13 21:29:04 [io 10] do_process_work(990) a1, a , 11
> Jul 13 21:29:04 [main] client_tx_handler(663) connection from: 14, 
> 10.255.255.11:49745
> Jul 13 21:29:04 [recovery] fetch_object_list(502) 0
> Jul 13 21:29:04 [recovery] prepare_object_list(582) 0
> Jul 13 21:29:04 [main] client_handler(764) connection seems to be dead
> Jul 13 21:29:04 [main] clear_client(703) refcnt:0, fd:14, 10.255.255.11:49745
> Jul 13 21:29:04 [main] destroy_client(672) connection from: 10.255.255.11:49745
> Jul 13 21:29:04 [main] farm_end_recover(578) old epoch 10
> Jul 13 21:29:04 [main] snap_file_write(178) epoch: 10, sha1: 
> d08b762926216f227cad950f9fee49886b56ba2c
> Jul 13 21:29:04 [main] finish_recovery(349) recovery complete: new epoch 11



More information about the sheepdog mailing list