[SERVICE IS FLAPPING BETWEEN STATES] service csf returns exit code 4

    • [SERVICE IS FLAPPING BETWEEN STATES] service csf returns exit code 4

      Good evening,

      I still have two problems:

      All of my servers and services are acting up; they do nothing but flap.
      I used to run a different statistics server. I have now connected all agents to the new monitoring server, deleted the old hosts.conf files, and re-initialized them.

      With the firewall switched off as a test, the log shows:



      Source Code

      [Oct 25 2019 00:34:07] INFO 10221 0.000140 127.0.0.1 rest: request 'http://localhost:9200/bloonix-2019-10/event/?routing=4 (host id 4) (/usr/share/perl5/Bloonix/REST.pm, line 225)
      [Oct 25 2019 00:34:07] INFO 10221 0.047598 127.0.0.1 rest: request was successful (host id 4) (/usr/share/perl5/Bloonix/REST.pm, line 238)
      [Oct 25 2019 00:34:07] INFO 10221 0.000178 127.0.0.1 rest: start uncompress content (host id 4) (/usr/share/perl5/Bloonix/REST.pm, line 536)
      [Oct 25 2019 00:34:07] INFO 10221 0.001250 127.0.0.1 rest: uncompress finished (host id 4) (/usr/share/perl5/Bloonix/REST.pm, line 539)
      [Oct 25 2019 00:34:07] INFO 10221 0.000140 127.0.0.1 rest: start de-serializing json data (host id 4) (/usr/share/perl5/Bloonix/REST.pm, line 544)
      [Oct 25 2019 00:34:07] INFO 10221 0.000106 127.0.0.1 rest: de-serializing json data finished (host id 4) (/usr/share/perl5/Bloonix/REST.pm, line 555)
      [Oct 25 2019 00:34:07] NOTICE 10221 0.000134 127.0.0.1 save service status CRITICAL message [SERVICE IS FLAPPING BETWEEN STATES] service lfd returns exit code 4 (host id 4) (/usr/share/perl5/Bloonix/Server.pm, line 1712)

      The log contains entries like this plus the usual noise.
      The flapping mostly affects the Linux.Service.Check (on the Plesk hosts I have it monitor the dovecot, psa, csf, lfd, watchdog, and magicspam services) ...
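
      To see which services are affected and how often, the flapping messages can be tallied straight from the server log. The following is only a rough sketch: the regular expression is derived from the single message format quoted above, and the function name count_flapping is made up for illustration, not part of Bloonix.

```python
import re
from collections import Counter

# Pattern taken from the log message quoted in this thread, e.g.
#   [SERVICE IS FLAPPING BETWEEN STATES] service lfd returns exit code 4
FLAP_RE = re.compile(
    r"\[SERVICE IS FLAPPING BETWEEN STATES\] service (\S+) returns exit code (\d+)"
)


def count_flapping(lines):
    """Count flapping messages per (service name, exit code) pair."""
    counts = Counter()
    for line in lines:
        match = FLAP_RE.search(line)
        if match:
            counts[(match.group(1), int(match.group(2)))] += 1
    return counts


# Two lines copied from the log excerpt above; only the NOTICE line matches.
sample = [
    "[Oct 25 2019 00:34:07] INFO 10221 0.047598 127.0.0.1 rest: request was "
    "successful (host id 4) (/usr/share/perl5/Bloonix/REST.pm, line 238)",
    "[Oct 25 2019 00:34:07] NOTICE 10221 0.000134 127.0.0.1 save service status "
    "CRITICAL message [SERVICE IS FLAPPING BETWEEN STATES] service lfd returns "
    "exit code 4 (host id 4) (/usr/share/perl5/Bloonix/Server.pm, line 1712)",
]

result = count_flapping(sample)
for (service, exit_code), hits in sorted(result.items()):
    print(service, exit_code, hits)
```

      Feeding it the lines of the real server log (the path depends on the installation) would show whether only the Linux.Service.Check services flap or other checks as well.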


      Does anyone have an idea?

      Edit: ah, the postfix mail queue check is acting up as well. Although I have installed all plugins, it always says "The command 'check-postfix-mailqueue' does not exists! Please install the plugin for this command!"

      More log output:


      Source Code

      [Oct 25 2019 09:30:50] INFO 1761 0.000411 127.0.0.1 rest: request 'http://localhost:9200/_aliases (host id 4) (/usr/share/perl5/Bloonix/REST.pm, line 225)
      [Oct 25 2019 09:30:50] INFO 1761 0.001175 127.0.0.1 rest: request was successful (host id 4) (/usr/share/perl5/Bloonix/REST.pm, line 238)
      [Oct 25 2019 09:30:50] INFO 1761 0.000155 127.0.0.1 rest: start uncompress content (host id 4) (/usr/share/perl5/Bloonix/REST.pm, line 536)
      [Oct 25 2019 09:30:50] INFO 1761 0.001188 127.0.0.1 rest: uncompress finished (host id 4) (/usr/share/perl5/Bloonix/REST.pm, line 539)
      [Oct 25 2019 09:30:50] INFO 1761 0.000143 127.0.0.1 rest: start de-serializing json data (host id 4) (/usr/share/perl5/Bloonix/REST.pm, line 544)
      [Oct 25 2019 09:30:50] INFO 1761 0.000105 127.0.0.1 rest: de-serializing json data finished (host id 4) (/usr/share/perl5/Bloonix/REST.pm, line 555)
      [Oct 25 2019 09:30:50] INFO 1761 0.000236 127.0.0.1 rest: request 'http://localhost:9200/bloonix-2019-10/event/_search?routing=4 (host id 4) (/usr/share/perl5/Bloonix/REST.pm, line 225)
      [Oct 25 2019 09:30:50] INFO 1761 0.044649 127.0.0.1 rest: request was successful (host id 4) (/usr/share/perl5/Bloonix/REST.pm, line 238)
      [Oct 25 2019 09:30:50] INFO 1761 0.000250 127.0.0.1 rest: start uncompress content (host id 4) (/usr/share/perl5/Bloonix/REST.pm, line 536)
      [Oct 25 2019 09:30:50] INFO 1761 0.001407 127.0.0.1 rest: uncompress finished (host id 4) (/usr/share/perl5/Bloonix/REST.pm, line 539)
      [Oct 25 2019 09:30:50] INFO 1761 0.000150 127.0.0.1 rest: start de-serializing json data (host id 4) (/usr/share/perl5/Bloonix/REST.pm, line 544)
      [Oct 25 2019 09:30:50] INFO 1761 0.000301 127.0.0.1 rest: de-serializing json data finished (host id 4) (/usr/share/perl5/Bloonix/REST.pm, line 555)
      [Oct 25 2019 09:30:50] NOTICE 1761 0.000188 127.0.0.1 flap count 62 9 (host id 4) (/usr/share/perl5/Bloonix/Server.pm, line 1187)
      [Oct 25 2019 09:30:50] NOTICE 1761 0.000091 127.0.0.1 service 62 is flapping - count 9 (host id 4) (/usr/share/perl5/Bloonix/Server.pm, line 1190)
      [Oct 25 2019 09:30:50] NOTICE 1761 0.000131 127.0.0.1 service if flapping between states (host id 4) (/usr/share/perl5/Bloonix/Server.pm, line 904)
      [Oct 25 2019 09:30:50] INFO 1761 0.000085 127.0.0.1 check if a notification must be send (host id 4) (/usr/share/perl5/Bloonix/Server.pm, line 925)
      [Oct 25 2019 09:30:50] NOTICE 1761 0.000074 127.0.0.1 last_notification_1 1571985954 last_notification_2 0 notification_interval 3600 (host id 4) (/usr/share/perl5/Bloonix/Server.pm, line 929)
      [Oct 25 2019 09:30:50] NOTICE 1761 0.000148 127.0.0.1 no notification send - service is flapping - next notification 1571989554 (host id 4) (/usr/share/perl5/Bloonix/Server.pm, line 933)
      [Oct 25 2019 09:30:50] INFO 1761 0.000103 127.0.0.1 store result (host id 4) (/usr/share/perl5/Bloonix/Server.pm, line 2074)
      [Oct 25 2019 09:30:50] INFO 1761 0.000077 127.0.0.1 no statistics received for service id 62 (host id 4) (/usr/share/perl5/Bloonix/Server.pm, line 2087)
      [Oct 25 2019 09:30:50] NOTICE 1761 0.000079 127.0.0.1 save event status CRITICAL service 62 message service psa returns exit code 4 (host id 4) (/usr/share/perl5/Bloonix/Server.pm, line 1682)
      [Oct 25 2019 09:30:50] INFO 1761 0.000095 127.0.0.1 -- check dependencies for host id 4 service id 62, counter 10 (host id 4) (/usr/share/perl5/Bloonix/Server.pm, line 1796)
      [Oct 25 2019 09:30:50] INFO 1761 0.000698 127.0.0.1 -- dependencies doesn't matched: host id 4, service id 62 (host id 4) (/usr/share/perl5/Bloonix/Server.pm, line 1907)
      [Oct 25 2019 09:30:50] NOTICE 1761 0.000274 127.0.0.1 $VAR1 = {
      'message' => '[SERVICE IS FLAPPING BETWEEN STATES] service psa returns exit code 4',
      'tags' => 'flapping',
      'attempts' => '1/3'
      };
      (host id 4) (/usr/share/perl5/Bloonix/Server.pm, line 1694)
      [Oct 25 2019 09:30:50] INFO 1761 0.000229 127.0.0.1 rest: request 'http://localhost:9200/bloonix-2019-10/event/?routing=4 (host id 4) (/usr/share/perl5/Bloonix/REST.pm, line 225)
      [Oct 25 2019 09:30:50] INFO 1761 0.050523 127.0.0.1 rest: request was successful (host id 4) (/usr/share/perl5/Bloonix/REST.pm, line 238)
      [Oct 25 2019 09:30:50] INFO 1761 0.000206 127.0.0.1 rest: start uncompress content (host id 4) (/usr/share/perl5/Bloonix/REST.pm, line 536)
      [Oct 25 2019 09:30:50] INFO 1761 0.001154 127.0.0.1 rest: uncompress finished (host id 4) (/usr/share/perl5/Bloonix/REST.pm, line 539)
      [Oct 25 2019 09:30:50] INFO 1761 0.000138 127.0.0.1 rest: start de-serializing json data (host id 4) (/usr/share/perl5/Bloonix/REST.pm, line 544)
      [Oct 25 2019 09:30:50] INFO 1761 0.000106 127.0.0.1 rest: de-serializing json data finished (host id 4) (/usr/share/perl5/Bloonix/REST.pm, line 555)
      [Oct 25 2019 09:30:50] NOTICE 1761 0.000129 127.0.0.1 save service status CRITICAL message [SERVICE IS FLAPPING BETWEEN STATES] service psa returns exit code 4 (host id 4) (/usr/share/perl5/Bloonix/Server.pm, line 1712)
      [Oct 25 2019 09:30:50] NOTICE 1761 0.002981 127.0.0.1 start checking service id 26 command check-cpustat status OK (host id 4) (/usr/share/perl5/Bloonix/Server.pm, line 803)
      [Oct 25 2019 09:30:50] INFO 1761 0.000373 127.0.0.1 rest: request 'http://localhost:9200/_aliases (host id 4) (/usr/share/perl5/Bloonix/REST.pm, line 225)
      [Oct 25 2019 09:30:50] INFO 1761 0.001038 127.0.0.1 rest: request was successful (host id 4) (/usr/share/perl5/Bloonix/REST.pm, line 238)
      [Oct 25 2019 09:30:50] INFO 1761 0.000172 127.0.0.1 rest: start uncompress content (host id 4) (/usr/share/perl5/Bloonix/REST.pm, line 536)
      [Oct 25 2019 09:30:50] INFO 1761 0.000948 127.0.0.1 rest: uncompress finished (host id 4) (/usr/share/perl5/Bloonix/REST.pm, line 539)
      [Oct 25 2019 09:30:50] INFO 1761 0.000123 127.0.0.1 rest: start de-serializing json data (host id 4) (/usr/share/perl5/Bloonix/REST.pm, line 544)
      [Oct 25 2019 09:30:50] INFO 1761 0.000099 127.0.0.1 rest: de-serializing json data finished (host id 4) (/usr/share/perl5/Bloonix/REST.pm, line 555)
      What stands out to me is the line " dependencies doesn't matched: host id 4, service id 62 (host id 4)" .... I don't understand it ....
      But all servers are flapping, at random...
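
      One detail in the log can be checked by hand: the "next notification" timestamp is exactly last_notification_1 plus notification_interval. The snippet below is a small sketch of that arithmetic. The numbers come from the log above, while the helper names and the idea that a plain time comparison decides whether a notification goes out are assumptions for illustration, not Bloonix's actual code.

```python
def next_notification(last_notification: int, interval: int) -> int:
    """Earliest Unix timestamp at which the next notification would be due."""
    return last_notification + interval


def notification_due(now: int, last_notification: int, interval: int) -> bool:
    """Assumed rule: a notification is due once the interval has elapsed."""
    return now >= next_notification(last_notification, interval)


# Values taken from the log lines above.
last_notification_1 = 1571985954
notification_interval = 3600

due_at = next_notification(last_notification_1, notification_interval)
print(due_at)  # 1571989554, the value logged as "next notification"

# Hypothetical check times, one second before and exactly at the due time.
print(notification_due(due_at - 1, last_notification_1, notification_interval))
print(notification_due(due_at, last_notification_1, notification_interval))
```

      So the "no notification send" line does not point to a second problem; it only shows that the one-hour notification interval had not yet elapsed for the flapping service.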

      The post was edited 2 times, last by danse ().

    • Found more errors, this time from the browser console:

      Source Code

      info: request /hosts/classes/host
      bloonix.min.js?v=138:1 info: request /hosts
      7Error: <g> attribute transform: Trailing garbage, "…0) translate(0, Infinity)".
      other.min.js?v=2:1 Error: <g> attribute transform: Trailing garbage, "…0) translate(0, Infinity)".
      applyTransformParams @ other.min.js?v=2:1
      other.min.js?v=2:1 Error: <g> attribute transform: Trailing garbage, "…0) translate(0, Infinity)".
      applyTransformParams @ other.min.js?v=2:1
      other.min.js?v=2:1 Error: <g> attribute transform: Trailing garbage, "…0) translate(0, Infinity)".
      applyTransformParams @ other.min.js?v=2:1
      other.min.js?v=2:1 Error: <g> attribute transform: Trailing garbage, "…0) translate(0, Infinity)".
      applyTransformParams @ other.min.js?v=2:1
      other.min.js?v=2:1 Error: <g> attribute transform: Trailing garbage, "…0) translate(0, Infinity)".
      applyTransformParams @ other.min.js?v=2:1
      other.min.js?v=2:1 Error: <g> attribute transform: Trailing garbage, "…0) translate(0, Infinity)".
      applyTransformParams @ other.min.js?v=2:1
      other.min.js?v=2:1 Error: <g> attribute transform: Trailing garbage, "…0) translate(0, Infinity)".
      applyTransformParams @ other.min.js?v=2:1
      applyTransform @ other.min.js?v=2:1
      (anonymous) @ other.min.js?v=2:1
      n @ other.min.js?v=2:3
      dispatch @ jquery.min.js?v=1:2
      v.handle @ jquery.min.js?v=1:2
      trigger @ jquery.min.js?v=1:2
      (anonymous) @ jquery.min.js?v=1:2
      each @ jquery.min.js?v=1:1
      each @ jquery.min.js?v=1:1
      trigger @ jquery.min.js?v=1:2
      p @ other.min.js?v=2:3
      153Error: <g> attribute transform: Expected number, "scale(NaN) translate(N…".
      other.min.js?v=2:1 Error: <g> attribute transform: Expected number, "scale(NaN) translate(N…".
      applyTransformParams @ other.min.js?v=2:1
      other.min.js?v=2:1 Error: <g> attribute transform: Expected number, "scale(NaN) translate(N…".
      applyTransformParams @ other.min.js?v=2:1
      other.min.js?v=2:1 Error: <g> attribute transform: Expected number, "scale(NaN) translate(N…".
      applyTransformParams @ other.min.js?v=2:1
      other.min.js?v=2:1 Error: <g> attribute transform: Expected number, "scale(NaN) translate(N…".
      applyTransformParams @ other.min.js?v=2:1
      other.min.js?v=2:1 Error: <g> attribute transform: Expected number, "scale(NaN) translate(N…".
      applyTransformParams @ other.min.js?v=2:1
      other.min.js?v=2:1 Error: <g> attribute transform: Expected number, "scale(NaN) translate(N…".

      Source Code

      info: check path part: dashboard
      bloonix.min.js?v=138:1 info: found path part: dashboard
      4bloonix.min.js?v=138:1 error: Failed to load data from server.Try it again and reload the site.Please contact an administrator if the request failed again.
    • Just once more, for others:

      @Jonny, so "agent_id all" was set everywhere in my file "/etc/bloonix/agent/conf.d/hosts.conf"?
      I never touched that file, but good to know. I simply installed the server following the docs and initialized the agents on the other servers. So it either came in by default or I overlooked something. In any case, I did not set it deliberately.

      Thanks for your help! It seems to work. :)
    • OK, I found the error.

      crontab -e:

      Source Code

      */1 * * * * bloonix-update-agent-host-config --addr localhost --agent-id all --run > /dev/null
      It overwrites everything with "all".
      Do I need this cron job? If so, I probably have to use remote instead of all?
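
      To check whether the cron job has rewritten the file again, the agent_id values in hosts.conf can be counted. The following is only a sketch: it assumes each setting sits on its own line as "agent_id VALUE", and the sample text (including the host block layout and host id 5) is made up for illustration, so the real file may look different.

```python
import re
from collections import Counter

# Assumption: one "agent_id VALUE" setting per line, optionally indented.
AGENT_ID_RE = re.compile(r"^\s*agent_id\s+(\S+)\s*$", re.MULTILINE)


def agent_id_counts(config_text: str) -> Counter:
    """Count how often each agent_id value occurs in the config text."""
    return Counter(AGENT_ID_RE.findall(config_text))


# Hypothetical sample; not copied from a real hosts.conf.
sample = """
host {
    host_id 4
    agent_id all
}
host {
    host_id 5
    agent_id all
}
"""

counts = agent_id_counts(sample)
print(dict(counts))
```

      For the real file, the text could be read with pathlib.Path("/etc/bloonix/agent/conf.d/hosts.conf").read_text() and passed to agent_id_counts. If "all" shows up again a minute after correcting the file, the cron job is still overwriting it.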