Modify ↓
Ticket #748 (closed enhancement: fixed)
detect hanging gamin/fam server
Reported by: | tcampbel | Owned by: | desai |
---|---|---|---|
Priority: | major | Milestone: | Bcfg2 1.3.0 Release |
Component: | bcfg2-client | Version: | 1.0 |
Keywords: | Cc: | [email protected]… |
Description
16:43:13 < HCoyote> desai: so, at some point we encountered this in the past and the symptom threw an error that looked like "Hit event timeout without getting any events; GAMIN/FAM problem?" 16:43:26 < desai> yeah, we can detect it upon startup pretty easily 16:43:27 < HCoyote> in this case, i can't find an error that would indicate a gamin problem. 16:43:28 < desai> oooh 16:43:35 < desai> we could add a watchdog 16:43:44 < desai> that would be pretty easy, and would catch cases like this 16:43:49 < HCoyote> is there some sort of feedback loop then? 16:44:03 < desai> just keep a timestamp in the gamin driver for the last event seen
add a method to have bcfg2-server detect if gam_server has hung and let admin know that it needs to be restarted.
Attachments
Change History
comment:1 Changed 13 years ago by desai
- Milestone changed from Bcfg2 1.0.0 Release to Bcfg2 1.0.1 Release
comment:2 Changed 13 years ago by solj
- Milestone changed from Bcfg2 1.0.1 Release to Bcfg2 1.1.0 Release
comment:3 Changed 13 years ago by somekool <[email protected]…>
- Cc [email protected]… added
encounter this error for the first time. it seems bcfg2 is already trapping this error. it should just mentionned the required fix more clearly. that is to kill gam_server process
comment:4 Changed 12 years ago by solj
- Milestone changed from Bcfg2 1.1.0 Release to Bcfg2 1.2.0 Release
comment:5 Changed 12 years ago by https://www.google.com/accounts/o8/id?id=AItOawnSjgovXZr-_V3vGkvMSR0pc5LDykRc1Nc
We used to run into this problem a lot with Bcfg2 1.0, but I haven't seen it with either 1.1 or 1.2, so this may have been fixed already.
Note: See
TracTickets for help on using
tickets.