[RESOLVED] LSWS 4.0.13 - Signal 9's gone... Signal 15's new

Discussion in 'Bug Reports' started by MikeDVB, Feb 28, 2010.

  1. MikeDVB

    MikeDVB New Member

    Code:
    At [27/Feb/2010:07:11:38 -0500], web server with pid=24638 received unexpected signal=15, no core file is created. A new instance of web server will be started automatically!
    Code:
    At [27/Feb/2010:13:02:14 -0500], web server with pid=14126 received unexpected signal=15, no core file is created. A new instance of web server will be started automatically!
    Code:
    At [27/Feb/2010:22:18:42 -0500], web server with pid=16181 received unexpected signal=15, no core file is created. A new instance of web server will be started automatically!
    Code:
    At [28/Feb/2010:01:01:15 -0500], web server with pid=22155 received unexpected signal=15, no core file is created. A new instance of web server will be started automatically!
    Code:
    At [28/Feb/2010:13:01:27 -0500], web server with pid=28935 received unexpected signal=15, no core file is created. A new instance of web server will be started automatically!
    Code:
    At [28/Feb/2010:14:08:16 -0500], web server with pid=14474 received unexpected signal=15, no core file is created. A new instance of web server will be started automatically!
    After talking with Tony I found that changes were made in 4.0.13 that would help fix the cPanel 11.25/LSWS 4.0.13 Signal 9 kills that were happening from time to time... I pushed this update to all servers and all of them it's been working beautifully but this single one.

    This server is the only one pushing this Signal 15 out of about 20 servers running LSWS 4.0.13.
    Last edited by a moderator: Mar 4, 2010
  2. mistwang

    mistwang LiteSpeed Staff

    Please try the latest build of 4.0.13, built on 27th 6PM EST.
  3. MikeDVB

    MikeDVB New Member

    IMHO you should start using build numbers or show the build dates on the downloads page. I'll install it and update you.
  4. AndrewT

    AndrewT New Member

    Fully Agreed on the build numbers. I've also seen the signal 15 reports a couple of times when terminating accounts. I'll give the new build a try.
  5. ffeingol

    ffeingol New Member

    And I'll throw in a 'me too'. It's very confusing where a new release comes out with the same build number. If you re-release, change the build number.
  6. MikeDVB

    MikeDVB New Member

    As an example I'm told to download "Please try the latest build of 4.0.13, built on 27th 6PM EST." but when I look on the downloads page I see this:
    [​IMG]

    Maybe "release date" should be changed to "build date" and then updated whenever an adjustment/build is made?
  7. mistwang

    mistwang LiteSpeed Staff

    OK, we update the download page.
  8. anewday

    anewday Moderator

    I agree. Thanks for making the update, George.
  9. MikeDVB

    MikeDVB New Member

    Thanks for updating the download page - makes a bit more sense now :)

    As for the latest version - Signal 15's are gone as well.
  10. aww

    aww New Member

    Hate to say this but I am using 4.0.14 and get the signal=15

    They seem to happen every 2-2.5 hours or so, plus or minus 5-10 minutes.

    Any other info can I give you to help?
    Last edited: May 15, 2010
  11. Tony

    Tony Member

    Signal 15's see the odd one. Now signal 9's get hit with those all the times and worst part being some times lsws does not even start back up yet says it did :(
  12. aww

    aww New Member

    It's odd in that Signal 15's only a few times a day and then it stops.

    At [15/May/2010:17:02:29 -0400]
    At [15/May/2010:19:03:20 -0400]
    At [15/May/2010:21:00:32 -0400]

    At [16/May/2010:01:11:33 -0400]
    At [16/May/2010:03:08:46 -0400]
    At [16/May/2010:17:02:39 -0400]
    At [16/May/2010:19:04:23 -0400]
    At [16/May/2010:21:01:38 -0400]

    At [17/May/2010:01:12:42 -0400]
    At [17/May/2010:17:03:47 -0400]
    At [17/May/2010:19:05:15 -0400]
    At [17/May/2010:21:02:33 -0400]

    At [18/May/2010:00:25:02 -0400]
    At [18/May/2010:01:07:30 -0400]
    At [18/May/2010:03:04:49 -0400]
    At [18/May/2010:05:02:08 -0400]
    At [18/May/2010:07:16:08 -0400]
  13. mistwang

    mistwang LiteSpeed Staff

    Signal 9 will cause much bigger problem than signal 15, and it is the biggest headache for us. We will try to figure something out from cPanel's Apache restart script/binary. Hopefully, we can figure out a satisfactory solution soon.
  14. aww

    aww New Member

    Should we be complaining to cpanel?

    After all, they should allow the (smooth) support of 3rd party replacements for Apache.
  15. mistwang

    mistwang LiteSpeed Staff

    Let us try to make it work better first.
    We got complains from cPanel manager that their support department got too much support requests related to LiteSpeed, that's why we have to add a disclaimer every where stating that we are responsible for LiteSpeed related support issues. :p
  16. Tony

    Tony Member


    Well I gave cpanel a piece of my mind on the logic they're using. Unfortunately talking to them is useless you're talking to a developer is getting you nowhere.

    This file:

    /usr/local/cpanel/Cpanel/HttpUtils/ApRestart.pm

    It's quite a gem Line 358 and onward.

    They use a regex against httpd processes ran by root or nobody.

    That's how it determines pid's which if you do say "vim httpd" then leave it open for most of the day I imagine you'll get hit by a signal 9 eventually.

    Now this logic is not new it's been there for quite a while. What is new is them trying to address httpd processes that do not die on Apache restarts. So now they're getting anything httpd in the cross fire. There is more logic in why it does it and it thinks lsws is not restarting sometimes I imagine.

    So I guess someone could complain to them saying I run a script called httpd and cPanel issues kill 9 at it all the time. Then maybe they might re-think how they determine the pid's of Apache.
  17. mistwang

    mistwang LiteSpeed Staff

    We have to make cPanel believe our lscgid is Apache httpd, in most case, only lscgid got killed, it is a small sacrifice to give, otherwise, cPanel will start killing any process owning port 80! :eek:

    that's the reason why sometimes cPanel send signal 9 and 15 to lshttpd, we have to find a way to improve our httpd simulation code to satisfy cPanel 100% of time to avoid lshttpd being killed.:D
  18. mistwang

    mistwang LiteSpeed Staff

    Go through the code, looks like that as long as we make _check_ap_restart() function happy, we are good.

    Looks like this script is only for restart/graceful-restart, maybe there is another piece of code for forced stop, start sequence?
  19. Tony

    Tony Member

    That would appear to be the logical location for it all to start in. Although I'm sure the safeapacherestart binary has a lot of interesting aspects in it. As there is a log file /usr/local/cpanel/logs/safeapacherestart_log which only some of the log writing from ApRestart.pm others are not from it.

    I think it's pretty safe to assume though it thinks lsws is not always gracefully restarting. I wish I knew why though my best guess is when multiple restarts are issued around the same time it's probably happening. As doing things that are restart heavy seem to cause it the most issues. So doing account restores in cPanel cause them in a lot of cases but normal usage you might see one every few days.
  20. aww

    aww New Member

    [Resolved]

    Any progress on this?

    Every few days litespeed fails to restart entirely because of this and the web server is left dead.
    Last edited by a moderator: Jun 14, 2010

Share This Page