Question about Computers & Internet

1 Answer

Dazed and confused - unknown system restart

Hi,

I've got some bizzare & annoying problem in some of our servers.All other servers dont have this unknown restart issue. Only the PPPoEservers restart. When it restarts, it shows this message:

"kernel: Dazed and confused, but trying to continue
Do you have a strange power saving mode enabled?"

I did searches on google, didn't find anything regarding this issue,but i found few similar searches; there were unknown solutions forthis. Some showed that it was something to do with the RAM, but ichecked that, it doesn't have anything to do with it.

The pppoe servers are "IBM eserver xSeries 336"

The specifications of these servers:

CPU: 1 x86_64 3.0Ghz processors (Cache: 1024 KB) Intel Xeon

Network: (2X) Broadcom Corporation NetXtreme BCM5704 Gigabit Ethernet PCI

Network: (2x) Broadcom Corporation:NetXtreme BCM5721 Gigabit Ethernet

SCSI: LSI Logic / Symbios Logic 53c1030 PCI-X Fusion-MPT Dual Ultra320

SCSI

CDROM: HL-DT-STDVD-ROM GDR8083N

Graphic Card: ATI Technologis Inc Radeon RV100 QY (Radeon 7000/VE)

USB: (2x) Intel Corporation 82801EB/ER (ICH5/ICH5R) USB UHCI Controller

Storage Device: (2x) Intel Corporation 82801EB (ICH5) SATA Controller (each 36.4)

RAM: 3GB

the kernel version of Fedora Core 4:
linux version 2.6.15-1.1833_FC4smp(bhcompile@hs20-bc1-.build.redhat.com) (gcc version 4.0.2 20051125 (RedHat 4.0.2-8) ) #1 SMP Wed Mar 1 23:56:51 EST 2006

Well thanks, hope there is a quick solution for this.

thanks

Posted by on

  • 1 more comment 
  • Uri Steinberger Feb 02, 2008

    thanks for helping, I'll try that soon, and see if that works. I'll give you a reply if that works.

    Thanks again for the help!

  • Uri Steinberger Feb 04, 2008

    Hey daverh,

    I looked thoroughly for power management options, ACPI and power saving on the BIOS, and there is no option for this.
    I remember there is power saving mode in the O.S., but is that going to help if its disabled on the O.S.
    If i should disable it on the Fedora Core 4, what are the steps to disable it.

    thanks




  • Uri Steinberger Feb 05, 2008

    the first time i came about this unknown restart, I though maybe if i update it, it will fix the restart issue, i updated the BIOS a couple of times which didn't solve the problem.
    Well the strange thing about this that when i changed one of the PPPoE servers recently to a different type of server by just exchanging both hard drives with 2 hard drives that already got all the setup, O.S, programs installed. and just 2 network interfaces linked. 2 other interfaces not linked - system never restarts. this server is for hotspot page and prepaid services and has its own database which is ok but not so stable and thats y we are using also PPPoE servers. I regularly check the logs and no restarts. It's same hardware as the other PPPoE server but just different services running with the same Fedora Core 4. Now i have a feeling maybe the O.S has problems with 4 network interfaces running at the same time or it can be something in the PPPoE scripts running. Well the configurations of these servers i didn't made them. The configurations was done by a team that came here and did all these. All the other servers from the PPPoE servers don't have this restart issue and the others all have 1 or 2 network interfaces linked. I can't take none of the other network interfaces off because all the 4 network interfaces linked are running each for a purpose. 1 is the External (backbone internet), Internal (SNMP and RADIUS(Database Authentication)) DMZ (IIS linked & Mail) and 4th is Wifi (linked to wifi transmitters). The team left has a quick sketch of the services, network, and firewall configurations. I can go into each of the servers and find out exactly what scripts are in, but its hustle to understand these scripts lol.

    thanks for your time and help.
    The ASU sounds good, i will look for it and see if i can view power saving mode and ACPI options.

    thanks


×

Ad

1 Answer

  • Level 2:

    An expert who has achieved level 2 by getting 100 points

    MVP:

    An expert that got 5 achievements.

    Novelist:

    An expert who has written 50 answers of more than 400 characters.

    Scholar:

    An expert who has written 20 answers of more than 400 characters.

  • Expert
  • 117 Answers

First off, take the message at its word: restart the machine, go into the BIOS and disable all power saving. Also, turn off ACPI if it's on.

Posted on Feb 02, 2008

  • Dave Harris
    Dave Harris Feb 04, 2008

    I don't know offhand where you'd change ACPI etc settings in Fedora - I usually use SuSE or Ubuntu & in any case haven't had to do it for years :). However, I don't think that would help you - the OS seems to be recovering from some odd interruption initiated by the hardware. The baseboard management processor on those machines can do that if they detect various types of error - if you go to the IBM site you can download manuals on this.

    If you go here:
    http://www-304.ibm.com/jct01004c/systems...
    you can download the Advanced Settings Utility (ASU), both the manual & the software. This allows you to see all the BIOS settings as a list - and may well show you some that don't appear when you go into the BIOS directly. There are versions of the program that will run from the command line under Linux with the system running normally. Of course, there is the possibility that you really do have an intermittent hardware fault :{ For example, a partially clogged CPU heatsink could cause a restart through overtemperature when the CPU was being worked hard for a while.

    Diagnostics: have a look around here:
    http://www-304.ibm.com/jct01004c/systems...

    And a last thought - have you checked for BIOS updates?

×

Ad

1 Suggested Answer

6ya6ya
  • 2 Answers

SOURCE: I have freestanding Series 8 dishwasher. Lately during the filling cycle water hammer is occurring. How can this be resolved

Hi there,
Save hours of searching online or wasting money on unnecessary repairs by talking to a 6YA Expert who can help you resolve this issue over the phone in a minute or two.

Best thing about this new service is that you are never placed on hold and get to talk to real repairmen in the US.

Here's a link to this great service

Good luck!

Posted on Jan 02, 2017

Ad

Add Your Answer

Uploading: 0%

my-video-file.mp4

Complete. Click "Add" to insert your video. Add

×

Loading...
Loading...

Related Questions:

1 Answer

My n computing terminal server windows 2003 is getting restarted , i have connected around 15 users please help me to solve this problem.


It is getting restarted on it's own? If so, it sounds like you have a driver conflict and you've setup your system to simply restart when there is a crash. Enable the crash dump to capture the Blue Screen information the next time the server restarts. Please provide the version of vSpace you're using.

May 06, 2011 | NComputing L230 Network Terminal

1 Answer

Hi, my Dell PowerEdge 1850 Server freeze at boot screen after print remote access controller detected. anyone ve any solution on it ?


I had exactly the same issue, its basically your iDrac card thats likely at fault. I simply removed mine and the server would boot as expected. If you require the iDrac then order a new card and this should solve your issue.

Feb 18, 2011 | Dell PowerEdge 1850 (PE1850) Server

1 Answer

Its getting restart while booting


Hi,

Welcome to fixya.

You mean a restart soon after powering on the server . If is it the issue. Then its a smps / Power cage issue. You need to replace the part to rectify the problem.

Thanks

Jan 24, 2011 | IBM x3400 Server

1 Answer

My 500 GB Deckstar Hitachi Hard drive is having issues. When I watch videos or play music from the drive, every once in a while the music or video will freeze. Then the hard drive makes a low humming noise...


Hi,
It seems that your hard drive is restarting because of an unknown reason. I would recommend to check your hard drive's power connector, maybe there are connection problems. If you have free connectors then use another one and see if problem occurs again. Good luck!

If this is the solution for your problem, please rate my post, otherwise give us more details.

Regards,
Andras

Sep 26, 2010 | Hitachi 500GB 7200RPM 16MB BUFFER SERIAL...

1 Answer

Websphere thread hungs frequently with Business Objects and it gets restarted automatically.


The issue usually manifests itself with a line in the SystemOut.log or equivalent reporting the following:
WSVR0605W: Thread "THREAD NAME : ID" (55c8824f) has been active for 600,112 milliseconds and may be hung. There are 1 threads in total in the server that may be hung.

If you do a thread dump in your App Server at this point, you will see something like so:

- waiting on <0x93d29020> (a java.lang.Object)
at java.lang.Object.wait(Object.java:429)
at com.crystaldecisions.thirdparty.com.ooc.OB.Downcall.waitUntilCompleted(Downcall.java:831)
- locked <0x93d29020> (a java.lang.Object)
at com.crystaldecisions.thirdparty.com.ooc.OB.GIOPClientWorkerThreaded.receive(GIOPClientWorkerThreaded.java:327)
at com.crystaldecisions.thirdparty.com.ooc.OB.GIOPClientWorkerThreaded.sendReceive(GIOPClientWorkerThreaded.java:353)
at com.crystaldecisions.thirdparty.com.ooc.OB.Downcall.request(Downcall.java:336)
at com.crystaldecisions.thirdparty.com.ooc.OB.DowncallStub.invoke(DowncallStub.java:583)
at com.crystaldecisions.thirdparty.com.ooc.CORBA.Delegate.invoke(Delegate.java:579)
at com.crystaldecisions.thirdparty.org.omg.CORBA.portable.ObjectImpl._invoke(ObjectImpl.java:125)
at com.crystaldecisions.enterprise.ocaframework.idl.OCA.OCAi._InfoStoreEx3Stub.queryEx3(_InfoStoreEx3Stub.java:62)
at com.crystaldecisions.enterprise.ocaframework.j.a(Unknown Source)
at com.crystaldecisions.enterprise.ocaframework.j.find(Unknown Source)
at com.crystaldecisions.enterprise.ocaframework.AbstractServerHandler.buildServerInfo(Unknown Source)
at com.crystaldecisions.enterprise.ocaframework.AbstractServerHandler.buildClusterInfo(Unknown Source)
at com.crystaldecisions.enterprise.ocaframework.aa.for(Unknown Source)
at com.crystaldecisions.enterprise.ocaframework.ServiceMgr.for(Unknown Source)
at com.crystaldecisions.enterprise.ocaframework.o.a(Unknown Source)
at com.crystaldecisions.enterprise.ocaframework.o.a(Unknown Source)
at com.crystaldecisions.enterprise.ocaframework.o.a(Unknown Source)
at com.crystaldecisions.enterprise.ocaframework.p.a(Unknown Source)
at com.crystaldecisions.enterprise.ocaframework.ServiceMgr.getManagedService(Unknown Source)
at com.crystaldecisions.sdk.occa.managedreports.ras.internal.CECORBACommunicationAdapter.connect(Unknown Source)
at com.crystaldecisions.sdk.occa.managedreports.ras.internal.RASReportAppFactory.a(Unknown Source)
at com.crystaldecisions.sdk.occa.managedreports.ras.internal.RASReportAppFactory.a(Unknown Source)
at com.crystaldecisions.sdk.occa.managedreports.ras.internal.RASReportAppFactory.openDocument(Unknown Source)
at com.crystaldecisions.sdk.occa.managedreports.ras.internal.RASReportAppFactory.openDocument(Unknown Source)
at com.talic.pi.utils.ReportingEngine.generateReport(ReportingEngine.java:86)
at com.talic.pi.cms.dao.ReportsDAO.generateReport(ReportsDAO.java:31)
at com.talic.pi.cms.component.ReportGeneratorComponentImpl.generateReport(ReportGeneratorComponentImpl.java:41)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
at java.lang.reflect.Method.invoke(Method.java:324)
at com.ibm.ws.sca.internal.java.handler.JavaReflectionAdapter$2.run(JavaReflectionAdapter.java:152)


The cause:
This is mainly caused by reports that take a great deal of time to execute causing the application server to report that the processing thread is 'hung'. A worse side effect of this is when the application server runs out of threads and is unable to process any further requests due to these hung threads.

The solution:
The problem is due to the fact that the Corba Timeout has not been set. You've missed the clientSDKOptions.xml in your deployment. This causes the RAS SDK to probably never timeout its call to the RAS.

Create a file called clientSDKOptions.xml and place it in your application's WEB-INF/classes folder, or alternatively, you could place it in a server wide classpath folder like 'applib' on OC4J or $WASPROFILE/properties on WebSphere.
The contents of the file should be as follows:


<CrystalReports.ClientSDKOptions
xmlns:xsi="http://www.w3.org/1999/XMLSchema-instance"
version="2"
xsi:type="CrystalReports.ClientSDKOptions">
<CORBARequestTimeout>120000</CORBARequestTimeout>
</CrystalReports.ClientSDKOptions>

Restart your app and your done. Your call should now timeout within 2 minutes (something with you can change) and you can say goodbye to your Hung Threads issue.

Jul 05, 2009 | Business Objects Full Version for PC

2 Answers

Canon copiers c3100, ir5000, & ir7200 scanning files issue.


Ok, so working thru the troubleshooting model you have identified the symptoms fairly well but I dont know if you have adequately identified the area affected. In order for the copiers to create a file on a network server they need user names and passwords to access a network resource and they need sufficient rights to create a file. Have you tried to log into the scan folder on the server using the copiers credentials from a workstation?

The next step, of course is to identify what has changed.If you had a problem with a domain controller is it possible the user accounts for the copiers got deleted/damaged/modified?

Log into the internal web page of the copier by plugging its ip address into the address bar of your favourite web browser and verify that its network settings (dns server, domain name, etc) are correct. I believe you can ping the server from the web page as well, if not it can be done from the tcp/ip setup screen on the copier control panel.

There is no way to manually flush the DNS cache on the copiers except for powercycling the device, just make sure you are turning it off by the main power switch near the back of the machine and not the on-off button on the control panel.

I dunno if this qualifies as a solution or a clarification request but I would start with the user accounts and access permissions.

HTH

Jun 12, 2008 | Canon ImageRunner 2220I Copier

1 Answer

Reboot necessary to find ISP server


2200BG is a very old Intel wireless card. Its driver had a lot of issues that were fixed over the years. Try the latest driver from Intel site: http://downloadcenter.intel.com/Product_Filter.aspx?ProductID=1637&lang=eng (make sure to choose correct OS).

Jan 11, 2008 | Intel PRO/Wireless 2200BG 802.11g/b

1 Answer

Unknown symbol!


Hi, I think it is the memory card indicator!

Sep 01, 2007 | LG Prada KE850 Cellular Phone

Not finding what you are looking for?
Computers & Internet Logo

Related Topics:

2,262 people viewed this question

Ask a Question

Usually answered in minutes!

Top Computers & Internet Experts

Les Dickinson
Les Dickinson

Level 3 Expert

18425 Answers

Alun Cox

Level 3 Expert

2678 Answers

David Payne
David Payne

Level 3 Expert

14162 Answers

Are you a Computer and Internet Expert? Answer questions, earn points and help others

Answer questions

Manuals & User Guides

Loading...