[sheepdog] [PATCH 0/2] avoid heavy disk I/O in the main thread

tuji tuji at atworks.co.jp
Fri Oct 24 15:52:43 CEST 2014


Hi, 

> Tuji-san, I updated the avoid-blocking branch for reducing memory
> consumption. Could you backport it to 0.7.6 and test again?

I've tested last avoid-blocking onto 0.7.6.
It seems ok.
It was not avoid blocking completely, but it is getting better!
(refer log1)
I found that elapsed time of default_cleanup was 6sec for 1TB disk
recovry.

By the way, I found other blocking during this test. it was happen when
start sheepdog process at new noder. 
Disk I/O was blocked since 19:51:40 to 19:52:40.
19:51:40 is  start time of sheepdog by new node. 
19:52:40 is  the time when recovery was just started (refer log2)

Because there are many object files, so I guess it needs long time to
move obj to .stale and it cause blocking.



==log1==
Oct 24 10:17:17   INFO [main] recover_object_main(624) object c7ab8e00001ada is recovered (263966/263967)
Oct 24 10:17:17   INFO [main] recover_object_main(624) object c7ab8e0000027e is recovered (263967/263967)
Oct 24 10:17:17   INFO [main] default_cleanup(168) cleaning pruge_state_dir
Oct 24 10:17:23   INFO [main] default_cleanup(170) cleaning pruge_state_dir done

procs -----------memory---------- ---swap-- -----io---- --system-- -----cpu------ ---timestamp---
 0  3      0  64512  64836 741012    0    0    12 58172  296  352  1 14  0 85  0	2014-10-24 10:17:10 JST
 0  3      0  71696  64832 733860    0    0   112 55452  216  230  1 11  0 88  0	2014-10-24 10:17:11 JST
 0  3      0  65252  64832 739952    0    0     0 51408  190  181  0 10  0 90  0	2014-10-24 10:17:12 JST
 0  1      0  68480  64832 737216    0    0   296 62572  261  235  2 13  0 85  0	2014-10-24 10:17:13 JST
 0  2      0  71328  64820 733684    0    0     0 37224  178  179  1  7  0 92  0	2014-10-24 10:17:14 JST
 0  3      0  63264  64824 742696    0    0     0 30792  172  249  1 10  0 89  0	2014-10-24 10:17:15 JST
 0  3      0  71816  64824 732916    0    0     0 24136   89  115  1  3  0 96  0	2014-10-24 10:17:16 JST
 0  3      0  71956  64824 733332    0    0     0  9584  119  146  1  4  0 95  0	2014-10-24 10:17:17 JST
 0  3      0  63524  64824 741704    0    0     0 21672  113  111  1  6  0 93  0	2014-10-24 10:17:18 JST
 0  3      0  63508  64824 741700    0    0     0     0   62   79  1  1  0 98  0	2014-10-24 10:17:19 JST
 0  3      0  63508  64824 741700    0    0     0     0   52   78  1  1  0 98  0	2014-10-24 10:17:20 JST
 0  3      0  63508  64824 741700    0    0     0     0   50   82  1  1  0 98  0	2014-10-24 10:17:21 JST
 0  3      0  63508  64824 741700    0    0     0     0   58   79  1  1  0 98  0	2014-10-24 10:17:22 JST
 0  3      0  63508  64824 741700    0    0     0     0   52   76  1  1  0 98  0	2014-10-24 10:17:23 JST
 2  2      0  69712  64820 734736    0    0   156 105840  273  164  1 21  0 77  1	2014-10-24 10:17:24 JST
 2  3      0  62384  64828 741864    0    0   124 115948  407  366  1 27  0 71  1	2014-10-24 10:17:25 JST
 2  1      0  62528  64828 743024    0    0     0 111620  369  308  0 23  0 77  0	2014-10-24 10:17:26 JST
 0  2      0  63892  64824 741328    0    0   296 122968  361  271  1 25  0 73  1	2014-10-24 10:17:27 JST
 0  2      0  62660  64820 742612    0    0     0 92744  300  201  1 20  0 79  0	2014-10-24 10:17:28 JST
 0  2      0  71824  64816 733144    0    0   156 87192  276  177  2 18  0 80  0	2014-10-24 10:17:29 JST
 0  3      0  71828  64820 732912    0    0   124 76140  264  191  1 17  0 82  0	2014-10-24 10:17:30 JST
 0  3      0  67612  64816 737592    0    0     0 18264  109  129  1  4  0 95  0	2014-10-24 10:17:31 JST
 0  3      0  67852  64816 738096    0    0     0 33264  169  171  1  9  0 90  0	2014-10-24 10:17:32 JST
 0  3      0  60796  64816 744524    0    0   296 23184  129  152  1  5  0 94  0	2014-10-24 10:17:33 JST


==log2==
Oct 23 19:51:39   INFO [main] md_add_disk(141) /home/sheepdog/obj, nr 1
Oct 23 19:51:39   INFO [main] send_join_request(770) IPv4 ip:10.0.0.3 port:7000
Oct 23 19:51:40   INFO [main] main(887) sheepdog daemon (version 0.7.6) started
Oct 23 19:52:39  ALERT [rw] get_vdi_copy_number(104) copy number for 0 not found, set 3
Oct 23 19:52:39  ALERT [rw] get_vdi_copy_number(104) copy number for 0 not found, set 3
Oct 23 19:52:40   INFO [main] recover_object_main(624) object c7ab8e0002d944 is recovered (1/263967)
Oct 23 19:52:41   INFO [main] recover_object_main(624) object c7ab8e000091ee is recovered (2/263967)
Oct 23 19:52:41   INFO [main] recover_object_main(624) object c7ab8e0003af7f is recovered (3/263967)
Oct 23 19:52:41   INFO [main] recover_object_main(624) object c7ab8e00025f80 is recovered (4/263967)
Oct 23 19:52:41   INFO [main] recover_object_main(624) object c7ab8e00004d0c is recovered (5/263967)
Oct 23 19:52:41   INFO [main] recover_object_main(624) object c7ab8e00004b06 is recovered (6/263967)
Oct 23 19:52:41   INFO [main] recover_object_main(624) object c7ab8e0002df48 is recovered (7/263967)
Oct 23 19:52:42   INFO [main] recover_object_main(624) object c7ab8e0003c7c3 is recovered (8/263967)


> 
> Thanks,
> Hitoshi

**********************************************
株式会社エーティーワークス 辻 正博
〒106-6137
東京都港区六本木6-10-1 六本木ヒルズ森タワー37F
TEL:03-3497-0505  FAX:03-3497-0508
http://www.atworks.co.jp/
E-mail: tuji at atworks.co.jp 
**********************************************




More information about the sheepdog mailing list