[RESOLVED] LSWS 4.0.13 - Signal 9's gone... Signal 15's new

MikeDVB

Well-Known Member
#1
Code:
At [27/Feb/2010:07:11:38 -0500], web server with pid=24638 received unexpected signal=15, no core file is created. A new instance of web server will be started automatically!
Code:
At [27/Feb/2010:13:02:14 -0500], web server with pid=14126 received unexpected signal=15, no core file is created. A new instance of web server will be started automatically!
Code:
At [27/Feb/2010:22:18:42 -0500], web server with pid=16181 received unexpected signal=15, no core file is created. A new instance of web server will be started automatically!
Code:
At [28/Feb/2010:01:01:15 -0500], web server with pid=22155 received unexpected signal=15, no core file is created. A new instance of web server will be started automatically!
Code:
At [28/Feb/2010:13:01:27 -0500], web server with pid=28935 received unexpected signal=15, no core file is created. A new instance of web server will be started automatically!
Code:
At [28/Feb/2010:14:08:16 -0500], web server with pid=14474 received unexpected signal=15, no core file is created. A new instance of web server will be started automatically!
After talking with Tony I found that changes were made in 4.0.13 that would help fix the cPanel 11.25/LSWS 4.0.13 Signal 9 kills that were happening from time to time... I pushed this update to all servers and all of them it's been working beautifully but this single one.

This server is the only one pushing this Signal 15 out of about 20 servers running LSWS 4.0.13.
 
Last edited by a moderator:

AndrewT

Well-Known Member
#4
IMHO you should start using build numbers or show the build dates on the downloads page. I'll install it and update you.
Fully Agreed on the build numbers. I've also seen the signal 15 reports a couple of times when terminating accounts. I'll give the new build a try.
 

ffeingol

Well-Known Member
#5
And I'll throw in a 'me too'. It's very confusing where a new release comes out with the same build number. If you re-release, change the build number.
 

MikeDVB

Well-Known Member
#6
And I'll throw in a 'me too'. It's very confusing where a new release comes out with the same build number. If you re-release, change the build number.
As an example I'm told to download "Please try the latest build of 4.0.13, built on 27th 6PM EST." but when I look on the downloads page I see this:


Maybe "release date" should be changed to "build date" and then updated whenever an adjustment/build is made?
 

MikeDVB

Well-Known Member
#9
Thanks for updating the download page - makes a bit more sense now :)

As for the latest version - Signal 15's are gone as well.
 

aww

Well-Known Member
#10
Hate to say this but I am using 4.0.14 and get the signal=15

They seem to happen every 2-2.5 hours or so, plus or minus 5-10 minutes.

cPanel 11.25.0-R44718 - WHM 11.25.0 - X 3.9
CENTOS 4.8 i686 virtuozzo on server
Any other info can I give you to help?
 
Last edited:

Tony

Well-Known Member
#11
Signal 15's see the odd one. Now signal 9's get hit with those all the times and worst part being some times lsws does not even start back up yet says it did :(
 

aww

Well-Known Member
#12
It's odd in that Signal 15's only a few times a day and then it stops.

At [15/May/2010:17:02:29 -0400]
At [15/May/2010:19:03:20 -0400]
At [15/May/2010:21:00:32 -0400]

At [16/May/2010:01:11:33 -0400]
At [16/May/2010:03:08:46 -0400]
At [16/May/2010:17:02:39 -0400]
At [16/May/2010:19:04:23 -0400]
At [16/May/2010:21:01:38 -0400]

At [17/May/2010:01:12:42 -0400]
At [17/May/2010:17:03:47 -0400]
At [17/May/2010:19:05:15 -0400]
At [17/May/2010:21:02:33 -0400]

At [18/May/2010:00:25:02 -0400]
At [18/May/2010:01:07:30 -0400]
At [18/May/2010:03:04:49 -0400]
At [18/May/2010:05:02:08 -0400]
At [18/May/2010:07:16:08 -0400]
 

mistwang

LiteSpeed Staff
#13
Signal 9 will cause much bigger problem than signal 15, and it is the biggest headache for us. We will try to figure something out from cPanel's Apache restart script/binary. Hopefully, we can figure out a satisfactory solution soon.
 

aww

Well-Known Member
#14
Should we be complaining to cpanel?

After all, they should allow the (smooth) support of 3rd party replacements for Apache.
 

mistwang

LiteSpeed Staff
#15
Let us try to make it work better first.
We got complains from cPanel manager that their support department got too much support requests related to LiteSpeed, that's why we have to add a disclaimer every where stating that we are responsible for LiteSpeed related support issues. :p
 

Tony

Well-Known Member
#16
Should we be complaining to cpanel?

After all, they should allow the (smooth) support of 3rd party replacements for Apache.

Well I gave cpanel a piece of my mind on the logic they're using. Unfortunately talking to them is useless you're talking to a developer is getting you nowhere.

This file:

/usr/local/cpanel/Cpanel/HttpUtils/ApRestart.pm

It's quite a gem Line 358 and onward.

They use a regex against httpd processes ran by root or nobody.

That's how it determines pid's which if you do say "vim httpd" then leave it open for most of the day I imagine you'll get hit by a signal 9 eventually.

Now this logic is not new it's been there for quite a while. What is new is them trying to address httpd processes that do not die on Apache restarts. So now they're getting anything httpd in the cross fire. There is more logic in why it does it and it thinks lsws is not restarting sometimes I imagine.

So I guess someone could complain to them saying I run a script called httpd and cPanel issues kill 9 at it all the time. Then maybe they might re-think how they determine the pid's of Apache.
 

mistwang

LiteSpeed Staff
#17
We have to make cPanel believe our lscgid is Apache httpd, in most case, only lscgid got killed, it is a small sacrifice to give, otherwise, cPanel will start killing any process owning port 80! :eek:

that's the reason why sometimes cPanel send signal 9 and 15 to lshttpd, we have to find a way to improve our httpd simulation code to satisfy cPanel 100% of time to avoid lshttpd being killed.:D
 

mistwang

LiteSpeed Staff
#18
This file:

/usr/local/cpanel/Cpanel/HttpUtils/ApRestart.pm
Go through the code, looks like that as long as we make _check_ap_restart() function happy, we are good.

Looks like this script is only for restart/graceful-restart, maybe there is another piece of code for forced stop, start sequence?
 

Tony

Well-Known Member
#19
Go through the code, looks like that as long as we make _check_ap_restart() function happy, we are good.

Looks like this script is only for restart/graceful-restart, maybe there is another piece of code for forced stop, start sequence?
That would appear to be the logical location for it all to start in. Although I'm sure the safeapacherestart binary has a lot of interesting aspects in it. As there is a log file /usr/local/cpanel/logs/safeapacherestart_log which only some of the log writing from ApRestart.pm others are not from it.

I think it's pretty safe to assume though it thinks lsws is not always gracefully restarting. I wish I knew why though my best guess is when multiple restarts are issued around the same time it's probably happening. As doing things that are restart heavy seem to cause it the most issues. So doing account restores in cPanel cause them in a lot of cases but normal usage you might see one every few days.
 

aww

Well-Known Member
#20
[Resolved]

Any progress on this?

Every few days litespeed fails to restart entirely because of this and the web server is left dead.
 
Last edited by a moderator:
Top