[stgt] iSER data corruption

Matthew Chan talcite at gmail.com
Sun Aug 22 09:35:39 CEST 2010


Hi,

I'm experiencing what I think are data corruption issues when using the
iSER driver in CERN SLC 5.5 (which is based directly on RHEL 5.5).

I just set up stgt with the iSER driver on a Mellanox InfiniBand mesh
with 7 nodes in it. I was having problems with my OCFS2 cluster crashing
randomly on large data writes, so I simplified the setup down to 1
initiator and 1 target with an ext4 partition.

The backing store is a 5 TB Linux software RAID 6 array, running on
SLC 5.5. The initiators are running open-iscsi on Ubuntu Server 10.04.
I'm using the OFED packages from each respective distro.

On my test setup with 1 target and 1 initiator, writing a file with

'dd if=/dev/zero of=/<ext4 array>/zeroes bs=64k count=10000'

generated a whole slew of ext4 errors on the initiator. A subsequent 
fsck.ext4 showed thousands of inode errors. Trying to transfer a file 
with cp generated similar errors.
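
For anyone trying to reproduce this: because /dev/zero only produces
all-zero data, a write-then-verify test with block-specific content may
show more directly whether file data itself is being corrupted, and in
which blocks. Below is a minimal sketch in Python, assuming a 64 KiB
block size to match the dd command above. The function names
(make_block, write_pattern, verify_pattern) are hypothetical helpers
made up for this illustration; they are not part of stgt or open-iscsi.

```python
import hashlib
import os

BLOCK_SIZE = 64 * 1024  # assumption: same block size as bs=64k in the dd test


def make_block(index, size=BLOCK_SIZE):
    """Build deterministic content that depends on the block index,
    so a misplaced or stale block is detectable (unlike all zeroes)."""
    seed = hashlib.sha256(index.to_bytes(8, "big")).digest()
    reps = size // len(seed) + 1
    return (seed * reps)[:size]


def write_pattern(path, count, size=BLOCK_SIZE):
    """Write `count` patterned blocks and force them out to the device."""
    with open(path, "wb") as f:
        for i in range(count):
            f.write(make_block(i, size))
        f.flush()
        os.fsync(f.fileno())


def verify_pattern(path, count, size=BLOCK_SIZE):
    """Read the file back and return the indices of blocks that differ
    from what was written (a short read also counts as a mismatch)."""
    bad = []
    with open(path, "rb") as f:
        for i in range(count):
            if f.read(size) != make_block(i, size):
                bad.append(i)
    return bad
```

A possible usage would be write_pattern("/<ext4 array>/pattern", 10000)
followed by verify_pattern on the same path. Note that a read right
after the write may be served from the initiator's page cache, so it is
probably worth remounting the filesystem or dropping caches before
verifying, otherwise the data may never travel back over iSER.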

Are there any known quirks with the iSER driver, or am I misconfiguring
something? My InfiniBand connection seems quite stable, and I'm using
IPoIB quite heavily right now.

Thanks in advance for any replies,

Matt

