<?xml version="1.0" encoding="UTF-8"?> <rss
version="2.0"
xmlns:content="http://purl.org/rss/1.0/modules/content/"
xmlns:wfw="http://wellformedweb.org/CommentAPI/"
xmlns:dc="http://purl.org/dc/elements/1.1/"
xmlns:atom="http://www.w3.org/2005/Atom"
xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
><channel><title>VMtoday &#187; iscsi</title> <atom:link href="http://vmtoday.com/tag/iscsi/feed/" rel="self" type="application/rss+xml" /><link>http://vmtoday.com</link> <description>VMware News, Views, &#38; How-To&#039;s from vExpert Josh Townsend</description> <lastBuildDate>Fri, 18 May 2012 19:03:15 +0000</lastBuildDate> <language>en</language> <sy:updatePeriod>hourly</sy:updatePeriod> <sy:updateFrequency>1</sy:updateFrequency> <generator>http://wordpress.org/?v=3.3.2</generator> <item><title>vSphere 5 Networking Bug Affects Software iSCSI</title><link>http://vmtoday.com/2012/02/vsphere-5-networking-bug-affects-software-iscsi/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=vsphere-5-networking-bug-affects-software-iscsi</link> <comments>http://vmtoday.com/2012/02/vsphere-5-networking-bug-affects-software-iscsi/#comments</comments> <pubDate>Wed, 08 Feb 2012 20:33:54 +0000</pubDate> <dc:creator>Joshua Townsend</dc:creator> <category><![CDATA[Issues & Troubleshooting]]></category> <category><![CDATA[VMware]]></category> <category><![CDATA[VMware How To]]></category> <category><![CDATA[esxi 5]]></category> <category><![CDATA[iscsi]]></category> <category><![CDATA[network]]></category> <category><![CDATA[networking]]></category> <category><![CDATA[NIC]]></category> <category><![CDATA[troubleshooting]]></category> <category><![CDATA[vDS]]></category> <category><![CDATA[vmknic]]></category> <category><![CDATA[vmnic]]></category> <category><![CDATA[vsphere 5]]></category> <category><![CDATA[vswitch]]></category><guid
isPermaLink="false">http://vmtoday.com/?p=854</guid> <description><![CDATA[Update: This issue was fixed as of 3/15/2012 in ESXi 5.0 Update 1 per the original knowledge base article: VMware KB 2008144. To download ESXi 5.0/vCenter Server 5.0 Update 1, see the VMware Download Center.  &#160; I recently stumbled on two vSphere 5 ESXi networking bugs that I thought I would share. The issues are very [...]]]></description> <content:encoded><![CDATA[<p></p><p><strong>Update: This issue was fixed as of 3/15/2012 in ESXi 5.0 Update 1 per the original knowledge base article: <a
href="http://kb.vmware.com/selfservice/microsites/search.do?language=en_US&amp;cmd=displayKC&amp;externalId=2008144">VMware KB 2008144</a>. To download ESXi 5.0/vCenter Server 5.0 Update 1, see the <a
href="http://downloads.vmware.com/d/info/datacenter_cloud_infrastructure/vmware_vsphere/5_0" target="_blank">VMware Download Center</a>. </strong></p><p>&nbsp;</p><p>I recently stumbled on two vSphere 5 ESXi networking bugs that I thought I would share. The issues are very similar from a cursory level, but have different symptoms, troubleshooting steps, and implications for your architecture, so I’m going to split the issues into two separate posts. Because troubleshooting these issues was a real pain, I’ll provide some details on how to identify these issues in your environments and wrap up with a third post on what I believe to be some best practices to avoid these same problems and achieve greater redundancy and resiliency in your vSphere environments.</p><p><strong><span
style="text-decoration: underline;">The Problem</span></strong></p><p>Today, we’ll look at an ESXi 5 networking issue that caused massive iSCSI latency, lost iSCSI sessions, and lost network connectivity. I’ve been able to reproduce this issue in several environments, on different hardware configurations. Here’s the background information on how all this started: I upgraded an ESXi 4.1 host to ESXi 5 using vSphere Update Manager (VUM). Note that I did use the host upgrade image that contained the <a
href="http://kb.vmware.com/selfservice/microsites/search.do?language=en_US&amp;cmd=displayKC&amp;externalId=2007108">ESXi500-201109001 iSCSI fixes</a> – if you are upgrading to vSphere 5 and have iSCSI in your environment, use this image. Here’s a quick look at how the networking was configured on this host:</p><p>The iSCSI networking was configured in a very typical setup, and per best practices, as outline in <a
href="http://pubs.vmware.com/vsphere-50/topic/com.vmware.vsphere.storage.doc_50/GUID-8AE88758-20C1-4873-99C7-181EF9ACFA70.html">VMware’s documentation</a>, as well as from many vendors (see EMC’s Chad Sakac’s ‘<a
href="http://virtualgeek.typepad.com/virtual_geek/2009/09/a-multivendor-post-on-using-iscsi-with-vmware-vsphere.html">A Multivendor Post on using iSCSI with VMware vSphere</a>’), with two vmnic uplinks, two vmknics, with one active adapter on the correct layer-2/layer-3 network, and the other unused.</p><p><a
href="http://cloudfront.vmtoday.com/wp-content/uploads/2012/02/iSCSI1-config1.jpg" rel="lightbox[854]"><img
class="aligncenter size-full wp-image-864" title="vSwitch iSCSI vmknic override failover order with unused NIC" src="http://cloudfront.vmtoday.com/wp-content/uploads/2012/02/iSCSI1-config1.jpg" alt="vSwitch iSCSI vmknic override failover order with unused NIC" width="533" height="602" /></a><a
href="http://cloudfront.vmtoday.com/wp-content/uploads/2012/02/iscsi2-config1.jpg" rel="lightbox[854]"><img
class="aligncenter size-full wp-image-863" title="vSwitch iSCSI vmknic override failover order with unused NIC" src="http://cloudfront.vmtoday.com/wp-content/uploads/2012/02/iscsi2-config1.jpg" alt="vSwitch iSCSI vmknic override failover order with unused NIC" width="533" height="602" /></a></p><p>After the upgrade, the standard vSwitch with two vmnics for uplinks (Broadcom NetXtreme II BCM5709 1000Base-T) and two vmknics that serviced the software iSCSI adapter failed to pass traffic (vmkping to the iSCSI targets failed) and could not mount ANY iSCSI LUN&#8217;s. VM network, management, and vMotion ports were not affected.</p><p>If I let the host sit long enough, it *might* find a couple paths to the storage, but even then performance was deteriorated per the vmkernel.log:</p><pre>WARNING: ScsiDeviceIO: 1218: Device naa.60026b90003dcebb000003ca4af95792 performance has deteriorated. I/O latency increased from average value of 5619 microseconds to 495292 microseconds.</pre><p><strong><span
style="text-decoration: underline;">Troubleshooting</span></strong></p><p>I’m going to dump a whole bunch of my troubleshooting steps on you – hopefully they not only help folks dealing with this particular bug, but help with general network and configuration troubleshooting in VMware vSphere. During troubleshooting, I removed the vmk binding for these two on the iSCSI adapter, removed the software iSCSI Adapter itself, removed the vmknics on the vSwitch, and removed the vSwitch itself. I then recreated the vSwitch, set vSwitch MTU to 9000, recreated two vmk ports, set 9000MTU, assigned IP, and set failover order for multipath iSCSI. I then re-created the software iSCSI adapter and bound the two vmk ports. I was able to pass vmk traffic and mount iSCSI LUN&#8217;s. Great – problem solved!?!?! Not so much &#8211; I rebooted the host and the problem returned.</p><p>Here are my next troubleshooting steps:</p><ul><li>I repeated the procedure above and re-gained connectivity, but the problem returns on subsequent reboots. I can verifiably recreate the problem.</li><li>I verified end-to-end connectivity for other hosts on the same Layer 1, Layer 2, and Layer 3 network as the iSCSI initiator and iSCSI targets.</li><li>I verified the ESXi host’s networking configuration using the vSphere client, double-checking the vSwitch, vmnic uplinks, and vmknic configurations. Everything looked good so I canceled out.</li><li>I then reinstalled ESXi from scratch (maybe something was left over from 4.1 that a clean install would weed out), built up the same configuration, and was again able to re-create the problem.</li><li>I poured over logs (vmkernel.log, syslog.log and storagerm.log primarily). I could see an intermittent loss of storage connectivity, failure to log into the storage targets (duh – there is no connectivity, no vmkping) and high storage latency on hosts where I had rebuilt the iSCSI stack and run a few VM’s.</li><li>I switched out the Broadcom NIC for an Intel NIC (the Broadcom had hardware iSCSI capabilities – I wanted to be sure the hardware iSCSI was not interfering).</li><li>I verified TOE was enabled.</li></ul><p><strong><span
style="text-decoration: underline;">The ‘Ah-Ha’ Moment</span></strong></p><p>Next, I verified the ESXi host’s networking configuration using the vSphere client one more time – the properties of the vSwitch, the properties of the vmkernel (vmk) ports, the manual NIC teaming overrides, IP addressing, etc. Everything looked correct – I MADE NO CHANGES – but when I clicked <strong><span
style="text-decoration: underline;">OK</span></strong> (last time I canceled) to close the vSwitch properties and was greeted with this warning:</p><p><a
href="http://cloudfront.vmtoday.com/wp-content/uploads/2012/02/changing-an-iscsi-initiator-port-group-warning.jpg" rel="lightbox[854]"><img
class="aligncenter size-full wp-image-855" title="changing an iscsi initiator port group warning" src="http://cloudfront.vmtoday.com/wp-content/uploads/2012/02/changing-an-iscsi-initiator-port-group-warning.jpg" alt="changing an iscsi initiator port group warning" width="480" height="214" /></a></p><p>Wait a second… I didn’t change anything, why am I being prompted with a you’re ‘Changing an iSCSI Initiator Port Group’ warning? I like to live dangerously, and wanted to see what would happen, so I said ‘Yes’.</p><p>Much to my surprise, after only viewing and closing the vSwitch and iSCSI vmk port group settings, I was able to complete a vmkping on the iSCSI-bound vmk’s. And moreover, I completed a Rescan of all storage adapters and my iSCSI LUN’s were found, mounted, and ready for use. Problem solved? Nope. The same ugly issue re-appeared after a reboot.</p><p>While the problem wasn’t solved, I now had something to work with. My go-to troubleshooting question “What Changed?” could maybe be answered. Even though I didn’t change anything in the vSwitch Properties GUI, something changed. To see what changed in the background, I compared the output of the following ESXi Shell (or vCLI, or PowerCLI) commands before and after making ‘the change’ happen (by viewing the properties of the vSwitch/vmk ports), but found no changes.</p><ul><li>esxcfg-vswitch -l</li><li>esxcfg-vmknic -l</li><li>esxcfg-nics -l</li></ul><p>Then, I made backup copy of esx.conf</p><pre> cp /etc/vmware/esx.conf /etc/vmware/esx.conf.bak</pre><p>Then I caused ‘the change’ and then compared checksums using md5sum, but found no differences:</p><pre> md5sum /etc/vmware/esx.conf /etc/vmware/esx.conf.bak</pre><p>I compared the running .conf and the backup .conf, but found no differences:</p><pre> diff /etc/vmware/esx.conf /etc/vmware/esx.conf.bak</pre><p><strong><span
style="text-decoration: underline;">Call in Air Support</span></strong><br
/> At this point, I was out of ideas so I called for help: “Hello, 1-866-4VMWARE, option 4, option 2 – help!”</p><p>After repeating many of the same troubleshooting steps, the support engineer decided that I had hit on a known, and not yet patched, bug. The details of the bug are included in <a
title="Incorrect NIC failback occurs when an unused uplink is present" href="http://kb.vmware.com/selfservice/microsites/search.do?language=en_US&amp;cmd=displayKC&amp;externalId=2008144" target="_blank">KB 2008144: Incorrect NIC failback occurs when an unused uplink is present</a>. That’s right – my iSCSI traffic, vmkpings, etc were being sent down the wrong NIC – the <em>UNUSED</em> NIC. Ouch. The bug caused the networking stack to behave in a very unpredictable way, making my troubleshooting steps next to useless, and any other advanced troubleshooting ideas I had (sniffing, logs, etc.)</p><p>Once I knew what the issue was, I could see a bit of evidence in the logs:</p><pre>WARNING: VMW_SATP_LSI: satp_lsi_pathIsUsingPreferredController:714:Failed to get volume access control data for path "vmhba33:C0:T0:L4": No connection

NMP: nmp_DeviceUpdatePathStates:547: Activated path "<span style="color: #ff0000;">NULL</span>" for NMP device "naa.60026b90003dcebb0000c7454d5cc946".

WARNING: ScsiPath: 3576: Path vmhba33:C0:T0:L4 is being removed</pre><p>Notice the <span
style="color: #ff0000;">NULL</span> path – the path can’t be interpreted correctly when being sent down the wrong (unsued) vmnic that is on a different subnet and VLAN. The gotcha on this issue is that I had followed best practices where applicable, and accepted default settings on the vSwitch and vmknics.</p><p><strong><span
style="text-decoration: underline;">The Quick Fix</span></strong><br
/> <a
title="Incorrect NIC failback occurs when an unused uplink is present" href="http://kb.vmware.com/selfservice/microsites/search.do?language=en_US&amp;cmd=displayKC&amp;externalId=2008144" target="_blank">VMware KB 2008144</a> offers two workaround for this bug. The quick fix for the problem is to simply change the Failback setting on either the vSwitch running the software iSCSI vmknic’s to “<strong>No</strong>” (default is yes), or to change the setting on the vmknic itself if you have other port groups on the vSwitch (such as a VM Network port group to give your guest VM’s access to the iSCSI network).</p><p><a
href="http://cloudfront.vmtoday.com/wp-content/uploads/2012/02/failback-No.jpg" rel="lightbox[854]"><img
class="aligncenter size-full wp-image-859" title="Change vSwitch or Portgroup Failback" src="http://cloudfront.vmtoday.com/wp-content/uploads/2012/02/failback-No.jpg" alt="Change vSwitch or Portgroup Failback" width="536" height="663" /></a></p><p>Changing Failback = No on the iSCSI vmknics and then rescanning the storage adapters fix the glitch immediately.</p><p><strong><span
style="text-decoration: underline;">Architecture Changes</span></strong><br
/> The second workaround from VMware is “Do not have any unused NICs present in the team.”. This translates to a slightly different architecture than that described in many documents. To achieve this workaround, the configuration would have to change to two vSwitches, each with a single vmnic uplink and a single vmk port, bound to the iSCSI adapter. This change does not impact redundancy or availability when compared with the single-vSwitch:two-vmk configuration that I was running with as one of the vmnics was set to unused anyway. This workaround does add a bit more complexity, as there are a few more elements to configure, monitor, manage, and document.</p><p>This problem seems to only present itself on vSphere Standard Switches (vSwitch), although I could not get confirmation of this (please post a comment if you know!). Assuming this is true, a vDistributed Switch (vDS) could be used for Software iSCSI traffic. Mike Foley has a write-up on how to migrate iSCSI from a vSwitch to a vDS on his blog here: <a
title="Dr. iSCSI or How I learned to stop worrying and love virtual distributed switches on vSphere V5" href="http://www.yelof.com/?p=72" target="_blank">http://www.yelof.com/?p=72</a>.</p><p><strong><span
style="text-decoration: underline;">A Couple More Notes</span></strong><br
/> My troubleshooting fix of viewing the vSwitch settings and clicking ok seemed to temporarily resolve the issues because it triggered an up/down event on the vmk of the unused uplink. This caused the network stack to re-evaluate paths and start using the correct, Active, uplink.</p><p>Note that this problem can occur outside of my iSCSI use case – any vSwitch, Port Group, or VMKNIC with an unused adapter set in the NIC Teaming Failover Order are susceptible to this bug, so watch for it on redundant vMotion networks (vMotion randomly fails), VM Network networks (sudden loss of guest connectivity), or even your management network (hosts fall out of manageability from vCenter, and can’t be contacted via SSH, vSphere client, etc.<br
/> Leave a comment if you’ve experienced this bug – your notes on the problem may help others find and fix the issue until VMware releases a fix. I understand that a fix for this particular bug is not due out until at least vSphere 5 Update 1.</p><p>I&#8217;ll have another (shorter) writeup on the 2nd networking bug I found in ESXi 5 later in the week &#8211; check back here for a link once it is published.</p> ]]></content:encoded> <wfw:commentRss>http://vmtoday.com/2012/02/vsphere-5-networking-bug-affects-software-iscsi/feed/</wfw:commentRss> <slash:comments>26</slash:comments> </item> <item><title>Virtualization Bookmarks for August 28th</title><link>http://vmtoday.com/2009/08/virtualization-bookmarks-for-august-28th/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=virtualization-bookmarks-for-august-28th</link> <comments>http://vmtoday.com/2009/08/virtualization-bookmarks-for-august-28th/#comments</comments> <pubDate>Fri, 28 Aug 2009 14:13:21 +0000</pubDate> <dc:creator>Joshua Townsend</dc:creator> <category><![CDATA[General IT]]></category> <category><![CDATA[Uncategorized]]></category> <category><![CDATA[VMware]]></category> <category><![CDATA[bookmarks]]></category> <category><![CDATA[certificates]]></category> <category><![CDATA[client]]></category> <category><![CDATA[comparison]]></category> <category><![CDATA[compliance]]></category> <category><![CDATA[disk]]></category> <category><![CDATA[hippa]]></category> <category><![CDATA[iscsi]]></category> <category><![CDATA[it]]></category> <category><![CDATA[links]]></category> <category><![CDATA[manager]]></category> <category><![CDATA[monitoring]]></category> <category><![CDATA[pci]]></category> <category><![CDATA[performance]]></category> <category><![CDATA[powershell]]></category> <category><![CDATA[privacy]]></category> <category><![CDATA[regulations]]></category> <category><![CDATA[report]]></category> <category><![CDATA[SAN]]></category> <category><![CDATA[security]]></category> <category><![CDATA[sql]]></category> <category><![CDATA[ssl]]></category> <category><![CDATA[Storage]]></category> <category><![CDATA[update]]></category> <category><![CDATA[vcenter]]></category> <category><![CDATA[vdi]]></category> <category><![CDATA[vmsight]]></category> <category><![CDATA[vmtoday]]></category> <category><![CDATA[vsphere]]></category> <category><![CDATA[windows]]></category> <category><![CDATA[windows7]]></category><guid
isPermaLink="false">http://vmtoday.com/?p=161</guid> <description><![CDATA[Here are some bookmarks for resources that I have recently referenced: vCenter 4 and ESX 4 Now Use 10 Year Default SSL Certificate &#124; VM /ETC &#8211; Rich Brambly has some guidance on installing a new SSL certificate on vCenter, with very useful links in his post to official VMware documentation and KB&#8217;s on the [...]]]></description> <content:encoded><![CDATA[<p></p><p>Here are some bookmarks for resources that I have recently referenced:</p><ul><li><a
href="http://vmetc.com/2009/08/14/vcenter-4-now-has-10-year-default-ssl-certificate/">vCenter 4 and ESX 4 Now Use 10 Year Default SSL Certificate | VM /ETC</a> &#8211; Rich Brambly has some guidance on installing a new SSL certificate on vCenter, with very useful links in his post to official VMware documentation and KB&#8217;s on the subject.</li><li><a
href="http://www.virtuallifestyle.nl/2009/05/vmware-vsphere-client-on-microsoft-windows-7/">VMware vSphere Client on Microsoft Windows 7! | Virtual Lifestyle</a> &#8211; Heiko Verlande has found a way to run the VMware vSphere Client on Windows 7.</li><li><a
href="http://www.virtu-al.net/2009/08/18/powercli-daily-report-v2/">Virtu-Al » PowerCLI: Daily Report V2</a> &#8211; Version two of a handy PowerShell based VMware Environment Daily Report from VMware vExpert and PowerShell guru Alan Renouf<ul>What’s new/Bug Fixes<br
/> * Active VMs count<br
/> * Inactive VMs count<br
/> * DRS Migrations count and list<br
/> * Correct NTP Server check for each host<br
/> * VMs stored on local datastores<br
/> * NTP Service check for each host<br
/> * vmkernel warning messages for each host<br
/> * VM CPU ready over x%</ul></li><li><a
href="http://kb.vmware.com/selfservice/microsites/search.do?language=en_US&amp;cmd=displayKC&amp;externalId=1003468">VMware Self-Service- VMware Update Manager Plug-In fails to install</a> -Troubleshooting steps for vCenter Plug-in install problems.</li><li><a
href="http://www.vmware.com/resources/techresources/1027">Using VMware VDI and vmSight for Stronger and Sustainable HIPAA and PCI Compliance</a> &#8211; Virtualization brings new options for protecting sensitive data by moving it from the desktop into the datacenter.</li><li><a
href="http://blogs.technet.com/cotw/archive/2009/03/18/analyzing-storage-performance.aspx">Counter of the Week : Analyzing Storage Performance</a> &#8211; The purpose of this article is to provide prescriptive guidance on how to troubleshoot logical and physical disk response times in regards to Windows performance analysis. Start with the following performance counters to analyze disk response&#8230;</li><li><a
href="http://www.networkworld.com/reviews/2008/072808-test-iscsi-sans.html">NetApp, Compellent, HP, Dell top the field in 12-product test &#8211; Network World</a> &#8211; A terabyte isn&#8217;t what it used to be. Disks are slower than you think. And a Gigabit Ethernet is plenty of bandwidth for many storage applications.</li></ul> ]]></content:encoded> <wfw:commentRss>http://vmtoday.com/2009/08/virtualization-bookmarks-for-august-28th/feed/</wfw:commentRss> <slash:comments>0</slash:comments> </item> <item><title>IBM DS3300 iSCSI Write Performance Solved</title><link>http://vmtoday.com/2009/06/ibm-ds3300-iscsi-write-performance-solved/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=ibm-ds3300-iscsi-write-performance-solved</link> <comments>http://vmtoday.com/2009/06/ibm-ds3300-iscsi-write-performance-solved/#comments</comments> <pubDate>Tue, 16 Jun 2009 22:53:57 +0000</pubDate> <dc:creator>Joshua Townsend</dc:creator> <category><![CDATA[Issues & Troubleshooting]]></category> <category><![CDATA[Storage]]></category> <category><![CDATA[VMware]]></category> <category><![CDATA[VMware How To]]></category> <category><![CDATA[Dell]]></category> <category><![CDATA[ESX]]></category> <category><![CDATA[IBM]]></category> <category><![CDATA[iscsi]]></category> <category><![CDATA[lun]]></category> <category><![CDATA[performance]]></category> <category><![CDATA[SAN]]></category> <category><![CDATA[VI3]]></category> <category><![CDATA[virtualization]]></category> <category><![CDATA[write caching]]></category><guid
isPermaLink="false">http://vmtoday.com/?p=94</guid> <description><![CDATA[I have been pulling my hair out with a small VI3 implementation running against an IBM DS3300 iSCSI array.  Performance, for lack of a better term, sucked.  Granted, the DS3300 is not an enterprise level workhorse of a storage system, but it fit the budget.  Read performance was decent from the array, but write performance [...]]]></description> <content:encoded><![CDATA[<p></p><p>I have been pulling my hair out with a small VI3 implementation running against an IBM DS3300 iSCSI array.  Performance, for lack of a better term, sucked.  Granted, the DS3300 is not an enterprise level workhorse of a storage system, but it fit the budget.  Read performance was decent from the array, but write performance was terrible, maxing out at 10Mpbs throughput and insanely high latencies on long writes when the system was under load.  This led to some long P2V operations, poor guest performance, and some questions from the project sponsors on why I couldn&#8217;t make the environment sing.</p><p>The system was configured with a single controller with dual GigE NIC&#8217;s.  The controller had 512MB of battery backed cache (there is also a 1GB cache upgrade option available).  I wrote off some of the poor performance to a single controller with a less-than-optimal amount of cache; blamed the SAS controller to SATA disk command translation overhead; cringed at the 6 disk RAID5 configuration; and engaged in some self doubting.  I convinced the powers that be that we were IO constrained and got some funds to fill out the 3U chassis to a full 12 SATA disks, and reconfigured the array as a RAID10.  Performance gains were almost unnoticeable with these changes.  In addition, I did some basic troubleshooting of the network environment, verifying multiple paths to the storage, setting Flow Control on the switches to receive only, and double-checked my iSCSI initiator settings.  Note: The DS3300 is only supported with the ESX software initiator.  I found documentation on the DS3300 to be lacking, but did discover that the Dell MD3000i is based on the same LSI Engenio array.  Some Googling on the Dell solution led to to the &#8216;SMcli&#8217; command line interface for both arrays.   The commands are slighly different for the Dell and IBM.  The links to the IBM CLI documentation were broken, so I had to do a bit of trial and error to get the commands right.  I used the <a
href="http://support.dell.com/support/edocs/systems/md3000i/en/CLI/PDF/CLIMR2g.pdf" target="_blank">Dell documentation</a> as a starting point.  (Rant: Seriously, IBM?  Can you make your documentation any harder to get through &#8211; is it a Redbook, is it an Engineering Whitepaper, is it a support document, is it a case study &#8211; and why can I only find these with complex Google searches, not on your own product pages, and why can&#8217;t you name for documents intelligently, not with some random string of characters).</p><p><strong>Update</strong>:<strong> The IBM System Storage DS3000, DS4000, and DS5000Command Line Interface and Script Commands Programming Guide is here:</strong> <a
href="http://cloudfront.vmtoday.com/wp-content/uploads/2009/06/DS3k4k5kCLIreference.pdf">IBM System Storage DS3000, DS4000, and DS5000Command Line Interface and Script Commands Programming Guide &#8211; DS3k4k5kCLIreference, SMCLI</a></p><p>Moving on&#8230; I received an automated alert from the DS3300 about an incomplete battery learn cycle.  Using the IBM Storage Manager GUI I generated a  Storage Subsystem Profile&#8217; from the Support tab to check the battery status.  In the profile I discovered that while write cache was enabled, it had a status of &#8220;Enabled (Suspended)&#8221;.   Ah ha!  Now I&#8217;ve got some decent Google material that led me to this: http://communities.vmware.com/thread/195838.  Hot damn I love the VMware Community Forums!</p><p>It turns out that in a single-controller configuration the setting for cache mirroring remains enabled by default.  Because there is no 2nd controller to mirror to, the array suspends write caching.  This is probably a safety thing &#8211; loss of high availability on the controllers puts data in cache at risk should the only controller fail.  I weighed my options and decided that the poor performance I was experiencing beat HA concerns, so I enabled write cache on the array using this command:</p><p
style="padding-left: 30px;">c:\program files\ibm_ds4000\client&gt;smcli -n &lt;ARRAYNAME&gt; -c &#8220;set allLogicalDrives mirrorEnabled=false;&#8221;</p><p>And then followed with this for good measure:</p><p
style="padding-left: 30px;">c:\program files\ibm_ds4000\client&gt;smcli -n &lt;ARRAYNAME&gt; -p &lt;arraypassword&gt; -c &#8220;set allLogicalDrives writeCacheEnabled=true;&#8221;</p><p>The results were immediately noticeable:</p><div
id="attachment_98" class="wp-caption aligncenter" style="width: 430px"> <a
href="http://cloudfront.vmtoday.com/wp-content/uploads/2009/06/ds3300-performance-with-write-cache1.jpg" target="_blank" rel="lightbox[94]"><img
class="size-large wp-image-98" title="ds3300-performance-with-write-cache1" src="http://cloudfront.vmtoday.com/wp-content/uploads/2009/06/ds3300-performance-with-write-cache1-1023x392.jpg" alt="DS3300 Performance Improvement when Write Cache is Enabled" width="430" height="164" /></a><p
class="wp-caption-text">DS3300 Performance Improvement when Write Cache is Enabled - Click for a Larger View</p></div><p>The screen shot is from <a
href="http://www.veeam.com/esxi-monitoring-free.html" target="_blank">Veeam Monitor Free Edition</a>, taken during 4 concurrent V2V operations from Hyper-V to VMware.  With the write cache fully functional, disk usage peaked at 54MBps, latency dropped to about 6ms, and my blood pressure dropped a few notches.</p><p>While poking around the CLI I also found that you can dump performance stats from the array (performance is otherwise hard to find on the thing) using this command:</p><p
style="padding-left: 30px;">C:\Program Files\IBM_DS4000\client&gt;smcli -n &lt;ARRAYNAME&gt; -c &#8220;set session performanceMonitorInterval=5 performanceMonitorIterations=120;save storageSubsystem performanceStats file=\&#8221;c:<a
href="file://///ds3300perfstats.csv/">\\ds3300perfstats.csv\</a>&#8220;;&#8221;</p><p>This will give you a 10 minute record of performance from the array which you can analyze using Excel.  The Dell Enterprise Center TechCenter Wiki has a great write-up on how to efficiently analyze the data from this command here: <a
href="http://www.delltechcenter.com/page/MD3000i+Performance+Monitoring" target="_blank">http://www.delltechcenter.com/page/MD3000i+Performance+Monitoring</a>, complete with a YouTube video that walks you through the process:</p><p
style="text-align: center;"><object
width="425" height="344" classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000" codebase="http://download.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=6,0,40,0"><param
name="allowFullScreen" value="true" /><param
name="allowScriptAccess" value="always" /><param
name="src" value="http://www.youtube.com/v/SoRR1VVuETs&amp;color1=0xb1b1b1&amp;color2=0xcfcfcf&amp;feature=player_embedded&amp;fs=1" /><param
name="allowfullscreen" value="true" /><embed
width="425" height="344" type="application/x-shockwave-flash" src="http://www.youtube.com/v/SoRR1VVuETs&amp;color1=0xb1b1b1&amp;color2=0xcfcfcf&amp;feature=player_embedded&amp;fs=1" allowFullScreen="true" allowScriptAccess="always" allowfullscreen="true" /></object></p><p
style="text-align: left;">I am beginning to think that the DS3300 (and MD3000i) may actually be a viable starter solution for SMB&#8217;s starting out on a virtualization project.  But I would recommend the cache upgrade, 2nd controller, SAS disks instead of SATA to eliminate the SAS-to-SATA translation overhead and more faster disks instead of fewer slower disks so you can drive throughput and IOPS to a higher level.</p><p
style="text-align: left;">Have any of you deployed the DS3300 or MD3000i (or the generic LSI solution)?  Do you have any performance tuning tips for these arrays?  If so, share in the comments!</p> ]]></content:encoded> <wfw:commentRss>http://vmtoday.com/2009/06/ibm-ds3300-iscsi-write-performance-solved/feed/</wfw:commentRss> <slash:comments>31</slash:comments> </item> </channel> </rss>
<!-- Performance optimized by W3 Total Cache. Learn more: http://www.w3-edge.com/wordpress-plugins/

Minified using disk: basic
Page Caching using disk: enhanced
Database Caching 14/50 queries in 0.111 seconds using disk: basic
Object Caching 834/866 objects using disk: basic
Content Delivery Network via Amazon Web Services: CloudFront: cloudfront.vmtoday.com

Served from: vmtoday.com @ 2012-05-21 19:52:50 -->
