LiteSpeed Technologies
Download Download     Blog Blog     Wiki Wiki     Forum Forum     Store     Contact Contact    

Go Back   LiteSpeed Support Forums > LiteSpeed Web Server > Bug Reports > solaris litespeed enterprise always dumps core on upgrade

Reply
 
Thread Tools Display Modes
  #1  
Old 03-02-2008, 03:48 AM
jrmarino jrmarino is offline
Senior Member
 
Join Date: Apr 2007
Posts: 114
Default solaris litespeed enterprise always dumps core on upgrade

I'm running litespeed on two servers, identical hardware, Sunfire X4100M2 I think. One version of litespeed is the enterprise version, the other is the standard version. Solaris 10 is the operating system for both.

When I use the upgrade feature, the standard version always works as expected. It upgrades and restarts itself, no issue.

The enterprise version never works. It upgrades itself. It works for a few seconds and then dumps core. This has happened at least 3 times, and every time I've used the upgrade.

My solution is to upgrade and to immediately disable litespeed via the SMF. With 3.3.6 it dumped core before I could disable it. When I enable it again, runs nominally.

These environments are as about as identical as you can get -- the same version of solaris, the same sun hardware, other software configurations are similar. It's something specific to the enterprise version causing the core dump.
Reply With Quote
  #2  
Old 03-02-2008, 12:49 PM
mistwang mistwang is offline
LiteSpeed Staff
 
Join Date: May 2003
Location: New Jersey
Posts: 7,590
We will check this issue.
Reply With Quote
  #3  
Old 03-11-2008, 03:24 AM
jrmarino jrmarino is offline
Senior Member
 
Join Date: Apr 2007
Posts: 114
Hi mistwang,

this is possibly related. Today I restarted the enterprise webserver twice intentionally to update the awstats alias field. Both times I received this email:

Quote:
At [11/Mar/2008:05:20:11 -0500], web server with pid=21294 received unexpected signal=9, no core file is created. A new instance of web server will be started automatically!
I also updated the standard webserver to update the awstats alias field and I did not receive that email.

The enterprise webserver is at version 3.3.6.
The standard webserver is at version 3.3.7
Reply With Quote
  #4  
Old 03-11-2008, 10:59 AM
mistwang mistwang is offline
LiteSpeed Staff
 
Join Date: May 2003
Location: New Jersey
Posts: 7,590
Signal 9 is SIGKILL, strange.
Will you get that whenever you restart the enterprise edition?
Reply With Quote
  #5  
Old 03-12-2008, 01:21 AM
jrmarino jrmarino is offline
Senior Member
 
Join Date: Apr 2007
Posts: 114
I don't recall seeing that before. I think it just started happening with version 3.3.6.

I looked at the SMF logs -- the watchdog is taking care of the restarts and the watchdog is not terminating. Here is the tail end of the SMF logs. You can see the core dumps that I mentioned before. The last activity is when I restarted the server on March 2nd to upgrade to 3.3.6.

Code:
[ Oct 23 01:53:16 Method "start" exited with status 0 ]
[ Jan 11 16:59:24 Stopping because process dumped core. ]
[ Jan 11 16:59:25 Executing stop method ("/opt/lsws/bin/lswsctrl stop") ]
[OK] lshttpd: stopped.
[ Jan 11 16:59:25 Method "stop" exited with status 0 ]
[ Jan 11 17:00:25 Method or service exit timed out.  Killing contract 1167 ]
[ Jan 11 17:01:22 Leaving maintenance because disable requested. ]
[ Jan 11 17:01:22 Disabled. ]
[ Jan 11 17:01:30 Enabled. ]
[ Jan 11 17:01:30 Executing start method ("/opt/lsws/bin/lswsctrl start") ]
[OK] lshttpd: pid=2499.
[ Jan 11 17:01:30 Method "start" exited with status 0 ]
[ Jan 28 12:33:03 Stopping because process dumped core. ]
[ Jan 28 12:33:03 Executing stop method ("/opt/lsws/bin/lswsctrl stop") ]
[OK] lshttpd: stopped.
[ Jan 28 12:33:03 Method "stop" exited with status 0 ]
[ Jan 28 12:34:03 Method or service exit timed out.  Killing contract 1475 ]
[ Jan 28 14:20:50 Leaving maintenance because disable requested. ]
[ Jan 28 14:20:50 Disabled. ]
[ Jan 28 14:20:56 Enabled. ]
[ Jan 28 14:20:56 Executing start method ("/opt/lsws/bin/lswsctrl start") ]
[OK] lshttpd: pid=8965.
[ Jan 28 14:20:56 Method "start" exited with status 0 ]
[ Mar  2 05:41:03 Stopping because process dumped core. ]
[ Mar  2 05:41:03 Executing stop method ("/opt/lsws/bin/lswsctrl stop") ]
[OK] lshttpd: stopped.
[ Mar  2 05:41:03 Method "stop" exited with status 0 ]
[ Mar  2 05:42:04 Method or service exit timed out.  Killing contract 1549 ]
[ Mar  2 05:42:04 Leaving maintenance because disable requested. ]
[ Mar  2 05:42:04 Disabled. ]
[ Mar  2 05:42:04 Enabled. ]
[ Mar  2 05:42:04 Executing start method ("/opt/lsws/bin/lswsctrl start") ]
[OK] lshttpd: pid=28159.
[ Mar  2 05:42:04 Method "start" exited with status 0 ]
Reply With Quote
  #6  
Old 03-12-2008, 09:28 AM
mistwang mistwang is offline
LiteSpeed Staff
 
Join Date: May 2003
Location: New Jersey
Posts: 7,590
Quote:
[ Mar 2 05:42:04 Method or service exit timed out. Killing contract 1549 ]
Does it mean SMF kill the process with "-9"? That would explain the unexpected signal 9.
LSWS does graceful restart/stop, which will try to finish all the pending requests before exiting.

Will investigate the "Process dumped core" issue. it does not show which process dumped core.
If lshttpd does, you should receive a similar report in email.

Can you locate the core file and check it with GDB? lshttpd core files should be under /tmp/lshttpd .
Reply With Quote
Reply

Thread Tools
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off

Forum Jump


All times are GMT -7. The time now is 11:50 AM.



- Archive - Top
© Copyright 2003-2011 LiteSpeed Technologies, Inc. All rights reserved. Privacy Policy.