【转】ksoftirqd进程耗尽单核100%si处理软中断导致性

最近测试LVS性能，发现当CPU其中一个核耗尽后系统达到性能顶峰。消耗CPU资源的是ksoftirqd进程，全部用于处理软中断（从进程名也能识别出了）。
搜了一下，很多人都遇到这类问题，似乎也没有解决。了解到并尝试过的解决方案有：
1、减少集群成员的数量；
2、修改集群模式（NAT、TURNL、DR）；
3、修改集群调度算法；
4、升级操作系统内核到2.6.20以上；
5、调整网卡的最大传输单元（MTU）；
6、修改设备中断方式；
7、使用多网卡负载均衡；
8、升级硬件（网卡）；
9、更换操作系统。

单网卡工作模式
# cat /etc/sysconfig/network-scripts/ifcfg-eth0
DEVICE=eth0
BOOTPROTO=none
BROADCAST=192.168.223.255
HWADDR=00:1E:90:76:6F:E0
IPADDR=192.168.223.113
NETMASK=255.255.254.0
NETWORK=10.20.222.0
ONBOOT=yes
GATEWAY=192.168.222.1
TYPE=Ethernet
绑定双网卡操作步骤
echo 'alias bond0 bonding' >> /etc/modprobe.conf

# cat /etc/sysconfig/network-scripts/ifcfg-bond0 
DEVICE=bond0
BOOTPROTO=static
BROADCAST=192.168.223.255
MACDDR=00:1E:90:76:6F:E2
IPADDR=192.168.223.113
NETMASK=255.255.254.0
NETWORK=192.168.222.0
USERCTL=no
ONBOOT=yes
GATEWAY=10.20.222.1
TYPE=Ethernet
BONDING_OPTS="mode=0 miimon=100"

# cat /etc/sysconfig/network-scripts/ifcfg-eth0
DEVICE=eth0
ERCTL=no
ONBOOT=yes
MASTER=bond0
SLAVE=yes
BOOTPROTO=none

# cat /etc/sysconfig/network-scripts/ifcfg-eth1
DEVICE=eth1
USERCTL=no
ONBOOT=yes
MASTER=bond0
SLAVE=yes
BOOTPROTO=none

# service network restart

Receive-side scaling (RSS) routes incoming packets to specific queues, efficiently balancing network loads across CPU cores and increasing performance on multi-processor systems. RSS, called Scalable I/O in Linux*, creates a hash table from IP, TCP, and Port Addresses and uses that table to decide which queue to route a packet to, and to which processor the packet should be associated.
可是从我们使用网卡的官网硬件指标上，都是支持RSS的。Windows的设置方式是**netsh int tcp set global rss=enabled**。

第9点，更换操作系统。此方案在生产环境下部署的可能性比较小，但是否关操作系统的事确实需要确认。
据说Windows的NLB、solaris、AIX支持网卡RSS，目前还有待确认。

全部频道

【转】ksoftirqd进程耗尽单核100%si处理软中断导致性