LiteSpeed Technologies
Download Download     Blog Blog     Wiki Wiki     Forum Forum     Store     Contact Contact    

Go Back   LiteSpeed Support Forums > LiteSpeed Load Balancer > General > lslb - ExtConn timed out while connecting.

Reply
 
Thread Tools Display Modes
  #1  
Old 08-11-2010, 06:17 AM
Clockwork Clockwork is offline
Senior Member
 
Join Date: May 2009
Posts: 74
Default lslb - ExtConn timed out while connecting.

My lslb error.log is getting flooded by the following notice:

Quote:
2010-08-11 15:10:42.000 NOTICE [xxx] ExtConn timed out while connecting.
the site is working fine, but what does this message mean?

edit:

debug stuff:

Quote:
2010-08-11 18:02:55.000 NOTICE [ip:52299-21#sitename:loadbalancer] ExtConn timed out while connecting.
2010-08-11 18:02:55.000 DEBUG [ip:52299-21#sitename:loadbalancer] connection to [192.168.0.3:80] on request #1, error: Connection timed out!
2010-08-11 18:02:55.000 DEBUG [ip:52299-21#sitename:loadbalancer] [ExtConn] close()
2010-08-11 18:02:55.000 DEBUG [ip:52299-21#sitename:loadbalancer] HttpExtConnector::tryRecover()...
2010-08-11 18:02:55.000 DEBUG [ip:52299-21#sitename:loadbalancer] trying to recover from connection problem, attempt: #1!
2010-08-11 18:02:55.000 DEBUG [ip:52299-21#sitename:loadbalancer] Get SESSION_ID from COOKIE: [hash].
2010-08-11 18:02:55.000 DEBUG [ip:52299-21#sitename:loadbalancer] Found worker [clusterHTTP_s2] by strategy [0].
2010-08-11 18:02:55.000 DEBUG [ip:52299-21#sitename:loadbalancer] [LB] retry worker: [clusterHTTP_s2]
2010-08-11 18:02:55.000 DEBUG [ip:52299-21#sitename:loadbalancer] trying to recover from connection problem, attempt: #1!
2010-08-11 18:02:55.000 DEBUG [192.168.0.4:80] connection available!
2010-08-11 18:02:55.000 DEBUG [192.168.0.4:80] request [ip:52299-21#sitename:loadbalancer] is assigned with connection!
2010-08-11 18:02:55.000 DEBUG [ip:52299-21#sitename:loadbalancer] [ExtConn] reconnect()
2010-08-11 18:02:55.000 DEBUG [ip:52299-21#sitename:loadbalancer] [ExtConn] connecting to [192.168.0.4:80]...
edit:

sometimes I'm getting the following warning:

Quote:
2010-08-11 19:09:05.000 NOTICE [clusterHTTP_s4] PingConn timed out while connecting.
2010-08-11 19:09:05.000 WARN [192.168.0.5:80] Failure detected: Connection Failure, 110:Connection timed out
2010-08-11 19:09:05.000 NOTICE [clusterHTTP_s2] PingConn timed out while connecting.
2010-08-11 19:09:05.000 WARN [192.168.0.4:80] Failure detected: Connection Failure, 110:Connection timed out
2010-08-11 19:09:05.899 INFO [192.168.0.4:80] Fail all outstanding requests!
2010-08-11 19:09:05.899 INFO [192.168.0.4:80] Fail all outstanding requests!
2010-08-11 19:09:06.000 NOTICE [ip:1612-0#sitename] ExtConn timed out while connecting.
2010-08-11 19:09:06.000 INFO [192.168.0.5:80] Fail all outstanding requests!
the problem started after we've changed the connection from our database server to a gigabit port, but this change doesn't affected the loadbalancer or webserver, it just improved the page load speed.

btw. we had the "ExtConn timed out while connecting." notice sometimes before, but not that much like now.

edit:

nginx seems to loadbalance without any problems, so this seems to be a lslb problem

Last edited by Clockwork; 08-11-2010 at 11:08 AM..
Reply With Quote
  #2  
Old 08-11-2010, 11:33 AM
mistwang mistwang is offline
LiteSpeed Staff
 
Join Date: May 2003
Location: New Jersey
Posts: 7,590
Please try command "telenet 192.168.0.4 80" from command line multiple times, see if you got long delay connecting to the target server sometimes.
Reply With Quote
  #3  
Old 08-11-2010, 01:19 PM
GaryT GaryT is offline
Member
 
Join Date: Jun 2010
Posts: 42
edit: wrong section

Last edited by GaryT; 08-11-2010 at 01:22 PM..
Reply With Quote
  #4  
Old 08-11-2010, 01:27 PM
Clockwork Clockwork is offline
Senior Member
 
Join Date: May 2009
Posts: 74
I've no telnet installed, but I've tried it with nmap and nc, no problems so far.
Reply With Quote
  #5  
Old 08-12-2010, 12:54 PM
Clockwork Clockwork is offline
Senior Member
 
Join Date: May 2009
Posts: 74
ohps, could someone move this topic to the loadbalancer forum? my mistake.

I've switched to nginx until there is a solution, lslb doesn't run stable atm, I hope you guys can help us to fix this problem, lslb is our ddos protection and performs way better than nginx.
Reply With Quote
  #6  
Old 08-12-2010, 01:58 PM
mistwang mistwang is offline
LiteSpeed Staff
 
Join Date: May 2003
Location: New Jersey
Posts: 7,590
No problem. moved.
Have you specify the source IP when your configure each node?
Looks like lslb has problem connecting to all backend servers. could it be a problem with NIC port, switch port? If you use dedicate connection communicate with backend servers, you can check the packet loss of that specific NIC.
LSLB uses persistent connections, while nginx does not, there could be more ESTABLISHED connections with LSLB. Is there a firewall between LSLB and web servers?

If you do think it is a LSLB bug, could you strace lslbd while the problem is happening to help analyze the cause of the problem?
Reply With Quote
  #7  
Old 08-12-2010, 02:49 PM
Clockwork Clockwork is offline
Senior Member
 
Join Date: May 2009
Posts: 74
clusterHTTP config:
<nodeAddresses>(s1)127.0.0.1->192.168.0.3, (s2)127.0.0.1->192.168.0.4, (s4)127.0.0.1->192.168.0.5</nodeAddresses>

clusterStatic config:
<nodeAddresses>(s3)127.0.0.1->192.168.0.1:81</nodeAddresses>

Quote:
could it be a problem with NIC port, switch port?
I'll ask my provider if he could check the ports.

Quote:
you can check the packet loss of that specific NIC
--- 192.168.0.3 ping statistics ---
272 packets transmitted, 263 received, 3&#37; packet loss, time 271515ms
rtt min/avg/max/mdev = 0.110/1.030/10.585/1.856 ms

Quote:
LSLB uses persistent connections
I've disabled persistent connections in both clusters that I use.

Quote:
Is there a firewall between LSLB and web servers?
nope

Quote:
If you do think it is a LSLB bug, could you strace lslbd while the problem is happening to help analyze the cause of the problem?
I'll do, but first I need to read some strace howto's
Reply With Quote
  #8  
Old 08-12-2010, 03:24 PM
mistwang mistwang is offline
LiteSpeed Staff
 
Join Date: May 2003
Location: New Jersey
Posts: 7,590
I think the problem is the source IP, should use a 192.168.0.x IP assigned to that server, or not use a source IP.
Reply With Quote
  #9  
Old 08-12-2010, 03:35 PM
Clockwork Clockwork is offline
Senior Member
 
Join Date: May 2009
Posts: 74
I've tried both, same problem.
Reply With Quote
  #10  
Old 08-12-2010, 05:30 PM
mistwang mistwang is offline
LiteSpeed Staff
 
Join Date: May 2003
Location: New Jersey
Posts: 7,590
Quote:
--- 192.168.0.3 ping statistics ---
272 packets transmitted, 263 received, 3&#37; packet loss, time 271515ms
rtt min/avg/max/mdev = 0.110/1.030/10.585/1.856 ms
3% packet loss for a LAN environment is extremely high.
Reply With Quote
Reply

Thread Tools
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off

Forum Jump


All times are GMT -7. The time now is 04:53 PM.



- Archive - Top
© Copyright 2003-2011 LiteSpeed Technologies, Inc. All rights reserved. Privacy Policy.