[stgt] tgtd segfault with software RAID, hard resetting link

Tomasz Chmielewski mangoo at wpkg.org
Tue Apr 7 10:10:11 CEST 2009


This night I had a SATA timeout on a drive in software RAID-1.

It recovered just fine, but unfortunately, tgtd crashed and some of the initiators had I/O errors.

This is how the kernel log looks on the target - everything happened in one second
(SATA timeout, hard reset, tgtd segfault) - is it a known issue? I use tgt-0.9.5.

[153755.828053] ata1.00: exception Emask 0x0 SAct 0x3f80f SErr 0x0 action 0x6 frozen
[153755.828132] ata1.00: cmd 60/08:00:77:ba:38/00:00:25:00:00/40 tag 0 ncq 4096 in
[153755.828134]          res 40/00:28:1f:7a:78/00:00:1c:00:00/40 Emask 0x4 (timeout)
[153755.828224] ata1.00: status: { DRDY }
[153755.828251] ata1.00: cmd 60/08:08:37:8b:9f/00:00:2d:00:00/40 tag 1 ncq 4096 in
[153755.828253]          res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
[153755.828340] ata1.00: status: { DRDY }
[153755.828367] ata1.00: cmd 60/08:10:3f:8b:9f/00:00:2d:00:00/40 tag 2 ncq 4096 in
[153755.828368]          res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
[153755.828455] ata1.00: status: { DRDY }
[153755.828483] ata1.00: cmd 60/08:18:37:2a:05/00:00:2a:00:00/40 tag 3 ncq 4096 in
[153755.828484]          res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
[153755.828572] ata1.00: status: { DRDY }
[153755.828600] ata1.00: cmd 61/40:58:17:06:e4/00:00:03:00:00/40 tag 11 ncq 32768 out
[153755.828601]          res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
[153755.828688] ata1.00: status: { DRDY }
[153755.828716] ata1.00: cmd 61/18:60:67:06:e4/00:00:03:00:00/40 tag 12 ncq 12288 out
[153755.828717]          res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
[153755.828805] ata1.00: status: { DRDY }
[153755.828832] ata1.00: cmd 61/a8:68:87:06:e4/00:00:03:00:00/40 tag 13 ncq 86016 out
[153755.828834]          res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
[153755.828922] ata1.00: status: { DRDY }
[153755.828948] ata1.00: cmd 61/48:70:37:07:e4/00:00:03:00:00/40 tag 14 ncq 36864 out
[153755.828950]          res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
[153755.829037] ata1.00: status: { DRDY }
[153755.829064] ata1.00: cmd 61/68:78:8f:07:e4/00:00:03:00:00/40 tag 15 ncq 53248 out
[153755.829066]          res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
[153755.829154] ata1.00: status: { DRDY }
[153755.829182] ata1.00: cmd 61/20:80:ff:07:e4/00:00:03:00:00/40 tag 16 ncq 16384 out
[153755.829183]          res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
[153755.829271] ata1.00: status: { DRDY }
[153755.829298] ata1.00: cmd 61/18:88:8f:07:e8/00:00:0c:00:00/40 tag 17 ncq 12288 out
[153755.829299]          res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
[153755.829386] ata1.00: status: { DRDY }
[153755.829415] ata1: hard resetting link
[153756.312026] ata1: SATA link up 1.5 Gbps (SStatus 113 SControl 300)
[153756.386453] ata1.00: configured for UDMA/133
[153756.386548] ata1: EH complete
[153756.386733] sd 1:0:0:0: [sdb] 2930277168 512-byte hardware sectors (1500302 MB)
[153756.386841] sd 1:0:0:0: [sdb] Write Protect is off
[153756.386889] sd 1:0:0:0: [sdb] Mode Sense: 00 3a 00 00
[153756.386961] sd 1:0:0:0: [sdb] Write cache: disabled, read cache: enabled, doesn't support DPO or FUA
[153756.406006] tgtd[20545]: segfault at 31 ip 40c32d sp 75df00c0 error 4 in tgtd[400000+25000]


-- 
Tomasz Chmielewski
http://wpkg.org

--
To unsubscribe from this list: send the line "unsubscribe stgt" in
the body of a message to majordomo at vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html



More information about the stgt mailing list