Difference between revisions of "Monitoring Server"

From New IAC Wiki
Jump to navigation Jump to search
 
(21 intermediate revisions by 2 users not shown)
Line 1: Line 1:
==Computers to Monitor==
+
The '''IAC monitoring server''' can be found at http://hal.iac.isu.edu/nagios3/
*Brems
+
 
*Inca
+
=Monitored Systems and Services=
*Webserver
+
 
*Wiki
+
==Systems & Services Chart==
*Seattle
+
{| class="wikitable" border="1" style="font-size: x-small;"
*Backup server
+
|+ Systems and Services Currently Monitored
*File server
+
! System Name !! CPU Temp !! CPU Usage !! Current Load !! # Users !! DRBD !! Disk Space !! HDD Temp !! Heartbeat !! LDAP !! Memory !! # Network Connections !! Network I/O !! RAID !! SMART !! SSH !! # Processes !! HTTP !! MySQL !! Samba !! System Temperature !! PSU !! PING !! Other
 +
|-
 +
! Alan's Desktop
 +
| {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{yes}} || {{no}}ne
 +
|-
 +
! aztec.iac.isu.edu
 +
| {{yes}} || {{yes}} || {{yes}} || {{yes}} || {{no}} || {{yes}} || {{yes}} || {{no}} || {{no}} || {{yes}} || {{yes}} || {{yes}} || {{yes}} || {{yes}} || {{yes}} || {{yes}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}}ne
 +
|-
 +
! brems.iac.isu.edu
 +
| {{no}} || {{yes}} || {{yes}} || {{yes}} || {{no}} || {{yes}} || {{yes}} || {{no}} || {{no}} || {{yes}} || {{yes}} || {{yes}} || {{yes}} || {{yes}} || {{yes}} || {{yes}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || Averaging Test, Cluster Room Temperatures, Slurm Queue, Slave Nodes
 +
|-
 +
! cleanroom.temperatures
 +
| {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{yes}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || Clean Room Temperatures
 +
|-
 +
! codex.iac.isu.edu
 +
| {{no}} || {{yes}} || {{yes}} || {{no}} || {{no}} || {{yes}} || {{no}} || {{no}} || {{no}} || {{yes}} || {{yes}} || {{yes}} || {{no}} || {{no}} || {{yes}} || {{yes}} || {{no}} || {{yes}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}}ne
 +
|-
 +
! cornwall.iac.isu.edu
 +
| {{yes}} || {{yes}} || {{yes}} || {{yes}} || {{no}} || {{yes}} || {{no}} || {{no}} || {{no}} || {{yes}} || {{yes}} || {{yes}} || {{yes}} || {{yes}} || {{yes}} || {{yes}} || {{no}} || {{no}} || {{yes}} || {{no}} || {{no}} || {{no}} || {{no}}ne
 +
|-
 +
! crick.iac.isu.edu
 +
| {{no}} || {{yes}} || {{yes}} || {{yes}} || {{yes}} || {{yes}} || {{yes}} || {{yes}} || {{no}} || {{yes}} || {{yes}} || {{yes}} || {{yes}} || {{yes}} || {{yes}} || {{yes}} || {{no}} || {{no}} || {{no}} || {{yes}} || {{yes}} || {{no}} || {{no}}ne
 +
|-
 +
! darwin.iac.isu.edu
 +
| {{yes}} || {{yes}} || {{yes}} || {{no}} || {{no}} || {{yes}} || {{yes}} || {{no}} || {{yes}} || {{yes}} || {{yes}} || {{yes}} || {{yes}} || {{yes}} || {{yes}} || {{yes}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}}ne
 +
|-
 +
! devsource.iac.isu.edu
 +
| {{no}} || {{yes}} || {{yes}} || {{no}} || {{no}} || {{yes}} || {{no}} || {{no}} || {{no}} || {{yes}} || {{yes}} || {{yes}} || {{no}} || {{no}} || {{yes}} || {{yes}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}}ne
 +
|-
 +
! iac-gateway
 +
| {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{yes}} || {{no}}ne
 +
|-
 +
! license.iac.isu.edu
 +
| {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{yes}} || {{no}}ne
 +
|-
 +
! hal.iac.isu.edu
 +
| {{yes}} || {{yes}} || {{yes}} || {{yes}} || {{no}} || {{no}} || {{yes}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{yes}} || {{yes}} || {{yes}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || Radiation Monitors
 +
|-
 +
! physics-gateway
 +
| {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{yes}} || {{no}}ne
 +
|-
 +
! seattle.iac.isu.edu
 +
| {{yes}} || {{yes}} || {{yes}} || {{yes}} || {{no}} || {{yes}} || {{yes}} || {{no}} || {{no}} || {{yes}} || {{yes}} || {{yes}} || {{no}} || {{yes}} || {{yes}} || {{yes}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || Radiation Monitors
 +
|-
 +
! tesla.iac.isu.edu
 +
| {{yes}} || {{yes}} || {{yes}} || {{yes}} || {{no}} || {{yes}} || {{yes}} || {{no}} || {{no}} || {{yes}} || {{yes}} || {{yes}} || {{yes}} || {{yes}} || {{yes}} || {{yes}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}}ne
 +
|-
 +
! watson.iac.isu.edu
 +
| {{no}} || {{yes}} || {{yes}} || {{yes}} || {{yes}} || {{yes}} || {{yes}} || {{yes}} || {{no}} || {{yes}} || {{yes}} || {{yes}} || {{yes}} || {{yes}} || {{yes}} || {{yes}} || {{no}} || {{no}} || {{no}} || {{yes}} || {{yes}} || {{no}} || {{no}}ne
 +
|-
 +
! wiki.iac.isu.edu
 +
| {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{yes}} || {{no}} || {{yes}} || {{no}} || {{no}} || {{no}} || {{no}} || {{yes}} || {{no}}ne
 +
|-
 +
! vienna.iac.isu.edu
 +
| {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{no}} || {{yes}} || {{no}}ne
 +
|}
  
 +
==Systems Monitored (By Building)==
  
==Non-computer things to monitor==
+
===Physical Science===
*Cluster Room Temp
+
* physics-gateway
*CleanRoom temp probes
+
* aztec.iac.isu.edu
 +
* brems.iac.isu.edu
 +
* brems slave nodes
 +
* cleanroom.temperatures
 +
* codex.iac.isu.edu
 +
* cornwall.iac.isu.edu
 +
* crick.iac.isu.edu
 +
* darwin.iac.isu.edu
 +
* devsource.iac.isu.edu
 +
* hal.iac.isu.edu
 +
* tesla.iac.isu.edu
 +
* watson.iac.isu.edu
 +
* wiki.iac.isu.edu
 +
* vienna.iac.isu.edu
  
==Things to Monitor==
+
===Idaho Accelerator Center===
*raid status (number of up drives)
+
* iac-gateway
*Hard drive space '''df'''
+
* alans-desktop
*memory usage
+
* license.iac.isu.edu
*load average
+
* seattle.iac.isu.edu
*temp (CPU, case, etc) '''lmsensors'''1
 
*CPU utilization
 
*fan speed/failure
 
  
*ITRC
+
==Systems to Monitor==
**number of connections (netstat?)
+
*Slave nodes (partial)
**Network I/O
+
*Webserver
**Individual process times
+
*Backup server

Latest revision as of 21:14, 7 March 2011

The IAC monitoring server can be found at http://hal.iac.isu.edu/nagios3/

Monitored Systems and Services

Systems & Services Chart

Systems and Services Currently Monitored
System Name CPU Temp CPU Usage Current Load # Users DRBD Disk Space HDD Temp Heartbeat LDAP Memory # Network Connections Network I/O RAID SMART SSH # Processes HTTP MySQL Samba System Temperature PSU PING Other
Alan's Desktop No No No No No No No No No No No No No No No No No No No No No Yes None
aztec.iac.isu.edu Yes Yes Yes Yes No Yes Yes No No Yes Yes Yes Yes Yes Yes Yes No No No No No No None
brems.iac.isu.edu No Yes Yes Yes No Yes Yes No No Yes Yes Yes Yes Yes Yes Yes No No No No No No Averaging Test, Cluster Room Temperatures, Slurm Queue, Slave Nodes
cleanroom.temperatures No No No No No No No No No No No No No No No No Yes No No No No No Clean Room Temperatures
codex.iac.isu.edu No Yes Yes No No Yes No No No Yes Yes Yes No No Yes Yes No Yes No No No No None
cornwall.iac.isu.edu Yes Yes Yes Yes No Yes No No No Yes Yes Yes Yes Yes Yes Yes No No Yes No No No None
crick.iac.isu.edu No Yes Yes Yes Yes Yes Yes Yes No Yes Yes Yes Yes Yes Yes Yes No No No Yes Yes No None
darwin.iac.isu.edu Yes Yes Yes No No Yes Yes No Yes Yes Yes Yes Yes Yes Yes Yes No No No No No No None
devsource.iac.isu.edu No Yes Yes No No Yes No No No Yes Yes Yes No No Yes Yes No No No No No No None
iac-gateway No No No No No No No No No No No No No No No No No No No No No Yes None
license.iac.isu.edu No No No No No No No No No No No No No No No No No No No No No Yes None
hal.iac.isu.edu Yes Yes Yes Yes No No Yes No No No No No No No Yes Yes Yes No No No No No Radiation Monitors
physics-gateway No No No No No No No No No No No No No No No No No No No No No Yes None
seattle.iac.isu.edu Yes Yes Yes Yes No Yes Yes No No Yes Yes Yes No Yes Yes Yes No No No No No No Radiation Monitors
tesla.iac.isu.edu Yes Yes Yes Yes No Yes Yes No No Yes Yes Yes Yes Yes Yes Yes No No No No No No None
watson.iac.isu.edu No Yes Yes Yes Yes Yes Yes Yes No Yes Yes Yes Yes Yes Yes Yes No No No Yes Yes No None
wiki.iac.isu.edu No No No No No No No No No No No No No No Yes No Yes No No No No Yes None
vienna.iac.isu.edu No No No No No No No No No No No No No No No No No No No No No Yes None

Systems Monitored (By Building)

Physical Science

  • physics-gateway
  • aztec.iac.isu.edu
  • brems.iac.isu.edu
  • brems slave nodes
  • cleanroom.temperatures
  • codex.iac.isu.edu
  • cornwall.iac.isu.edu
  • crick.iac.isu.edu
  • darwin.iac.isu.edu
  • devsource.iac.isu.edu
  • hal.iac.isu.edu
  • tesla.iac.isu.edu
  • watson.iac.isu.edu
  • wiki.iac.isu.edu
  • vienna.iac.isu.edu

Idaho Accelerator Center

  • iac-gateway
  • alans-desktop
  • license.iac.isu.edu
  • seattle.iac.isu.edu

Systems to Monitor

  • Slave nodes (partial)
  • Webserver
  • Backup server