Linux port exhaustion: solving it with tcp_tw_reuse and tcp_timestamps


weixin_39631370 · published 2021-05-06 12:24:45

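1. Which local ports are available

First, recall that in TCP a connection is identified by four elements: local IP, local PORT, remote IP, and remote PORT. This four-tuple must be unique.

When we send HTTP requests, the local IP, remote IP, and remote PORT are fixed, and only the local PORT can vary. The number of usable local ports therefore limits how many TCP connections can exist between the client and the server.

The PORT field in TCP is two bytes wide, so the number of usable ports can never exceed 65536.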

sysctl -a | grep net.ipv4.ip_local_port_range

net.ipv4.ip_local_port_range = 32768 61000

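This means the ports available to the client are [32768, 61000], 28233 in total. So this machine can hold at most 28233 simultaneous TCP connections to any one remote IP and port.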
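The count above is just the size of an inclusive range. A minimal sketch of that arithmetic follows; the function name port_count is made up for illustration, and the commented lines assume a Linux host where the range is exposed under /proc.

```shell
# port_count LOW HIGH: number of ports in the inclusive range [LOW, HIGH].
# (Hypothetical helper, not part of any system tool.)
port_count() {
    echo $(( $2 - $1 + 1 ))
}

port_count 32768 61000   # prints 28233

# On a Linux host the live range can be read from procfs instead:
# read low high < /proc/sys/net/ipv4/ip_local_port_range
# port_count "$low" "$high"
```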

Maximum number of open files

ulimit -a

core file size          (blocks, -c) 0
data seg size           (kbytes, -d) unlimited
scheduling priority             (-e) 0
file size               (blocks, -f) unlimited
pending signals                 (-i) 15088
max locked memory       (kbytes, -l) 64
max memory size         (kbytes, -m) unlimited
open files                      (-n) 65535
pipe size            (512 bytes, -p) 8
POSIX message queues     (bytes, -q) 819200
real-time priority              (-r) 0
stack size              (kbytes, -s) 8192
cpu time               (seconds, -t) unlimited
max user processes              (-u) 4096
virtual memory          (kbytes, -v) unlimited
file locks                      (-x) unlimited

/etc/security/limits.conf

#Where:
#<domain> can be:
#        - a user name
#        - a group name, with @group syntax
#        - the wildcard *, for default entry
#        - the wildcard %, can be also used with %group syntax,
#                 for maxlogin limit
#
#<type> can have the two values:
#        - "soft" for enforcing the soft limits
#        - "hard" for enforcing hard limits
#
#<item> can be one of the following:
#        - core - limits the core file size (KB)
#        - data - max data size (KB)
#        - fsize - maximum filesize (KB)
#        - memlock - max locked-in-memory address space (KB)
#        - nofile - max number of open file descriptors
#        - rss - max resident set size (KB)
#        - stack - max stack size (KB)
#        - cpu - max CPU time (MIN)
#        - nproc - max number of processes
#        - as - address space limit (KB)
#        - maxlogins - max number of logins for this user
#        - maxsyslogins - max number of logins on the system
#        - priority - the priority to run user process with
#        - locks - max number of file locks the user can hold
#        - sigpending - max number of pending signals
#        - msgqueue - max memory used by POSIX message queues (bytes)
#        - nice - max nice priority allowed to raise to values: [-20, 19]
#        - rtprio - max realtime priority
#
#*               soft    core            0
#*               hard    rss             10000
#@student        hard    nproc           20
#@faculty        soft    nproc           20
#@faculty        hard    nproc           50
#ftp             hard    nproc           0
#@student        -       maxlogins       4

# End of file
root soft nofile 65535
root hard nofile 65535
* soft nofile 65535
* hard nofile 65535
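Every TCP socket occupies a file descriptor, so the nofile limit acts as a second rough ceiling next to the port range. The sketch below simply picks the smaller of the two numbers; conn_cap is a hypothetical helper name, and it ignores descriptors the process already uses for other files.

```shell
# conn_cap PORTS NOFILE: print the smaller of the two ceilings.
# (Hypothetical helper for illustration only.)
conn_cap() {
    ports=$1
    nofile=$2
    # Some shells report "unlimited"; then only the port range applies.
    if [ "$nofile" = "unlimited" ] || [ "$ports" -le "$nofile" ]; then
        echo "$ports"
    else
        echo "$nofile"
    fi
}

conn_cap 28233 65535   # prints 28233

# With the current shell's soft limit instead of a literal value:
# conn_cap 28233 "$(ulimit -Sn)"
```

With the values shown above (28233 ports, nofile 65535) the port range is the tighter bound, which is why the rest of the discussion concentrates on ports.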

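2. Short-lived connections never hold many TCP connections open at once, so why are the ports still exhausted?

In the previous step we found that the client and server can have at most 28233 TCP connections open at the same time. But our load test uses short-lived connections: each connection is released as soon as it has been used, so its port should be released too. Why does port exhaustion still occur?

This is where the TIME_WAIT state comes in. When a TCP connection is closed, the side that initiates the close ends up in TIME_WAIT and stays there for 2*MSL. A port in this state cannot be used. More precisely, the port cannot be used for a new TCP connection whose local IP, remote IP, and remote PORT match those of the connection in TIME_WAIT.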
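To see how many ports are tied up this way, one option is to count TIME_WAIT sockets per remote endpoint. The sketch below assumes the column layout printed by `ss -tan` (state in column 1, peer address:port in column 5); count_time_wait is a hypothetical helper name and the addresses in the sample are made up.

```shell
# count_time_wait: read `ss -tan`-style lines on stdin and print, for each
# remote address:port, how many sockets are in TIME-WAIT (largest first).
count_time_wait() {
    awk '$1 == "TIME-WAIT" { n[$5]++ }
         END { for (peer in n) print n[peer], peer }' | sort -rn
}

# Sample input with invented addresses; the header and ESTAB lines are ignored.
count_time_wait <<'EOF'
State      Recv-Q Send-Q Local Address:Port  Peer Address:Port
TIME-WAIT  0      0      10.0.0.1:40001      10.0.0.2:80
TIME-WAIT  0      0      10.0.0.1:40002      10.0.0.2:80
ESTAB      0      0      10.0.0.1:40003      10.0.0.2:80
TIME-WAIT  0      0      10.0.0.1:40004      10.0.0.3:443
EOF
```

On a live Linux host the same function can be fed directly: `ss -tan | count_time_wait`.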

sysctl -a | grep net.ipv4.tcp_fin_timeout

net.ipv4.tcp_fin_timeout = 60

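From this we can infer:

If the client has 28233 usable ports and each one sits in TIME_WAIT for 60 seconds, then with short-lived connections to a single remote IP and port the client cannot sustain more than 28233/60, roughly 470, requests per second. (On Linux the TIME_WAIT duration is the fixed kernel constant TCP_TIMEWAIT_LEN, 60 seconds; net.ipv4.tcp_fin_timeout strictly controls how long orphaned connections stay in FIN_WAIT_2 and merely shares the same default value.)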
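The bound is a single integer division. A minimal sketch, assuming every connection is closed by the client and targets the same remote IP and port; max_short_conn_qps is a hypothetical helper name.

```shell
# max_short_conn_qps PORTS TIME_WAIT_SECONDS: integer upper bound on new
# short-lived connections per second before the port range runs dry.
# (Hypothetical helper for illustration only.)
max_short_conn_qps() {
    echo $(( $1 / $2 ))
}

max_short_conn_qps 28233 60   # prints 470
```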

3. Why does the TIME_WAIT state exist?

Imagine the following scenario: