Confirmed users
1,927
edits
No edit summary |
|||
| Line 11: | Line 11: | ||
= Problem = | = Problem = | ||
[[Auto-tools/Projects/Pulse|Pulse]] uses RabbitMQ as a pub/sub service which currently allows anyone to subscribe to any exchange via a common user account. Some client applications use durable queues in case they crash; however, sometimes these queues are created by accident, and sometimes apps crash without admins noticing. In these cases, the queues continue to grow without bound, which can eventually result in the RabbitMQ host running out of memory. Our current solution is to have Nagios monitor the queues and send alerts when any queues exceed a certain number of unread or unacknowledged messages, at which point a RabbitMQ admin attempts to find the person responsible and/or delete the offending queue. | |||
= Goals & Considerations= | = Goals & Considerations= | ||