Monitoring Server
Jump to navigation
Jump to search
The IAC monitoring server can be found at http://hal.iac.isu.edu/nagios3/
Monitored Systems and Services
System Name | CPU Usage | Current Load | # Users | DRBD | Disk Space | Heartbeat | LDAP | Memory | # Network Connections | Network I/O | RAID | SMART | SSH | # Processes | HTTP | MySQL | Samba | System Temperature | PSU | PING | Other |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Alan's Desktop | No | No | No | No | No | No | No | No | No | No | No | No | No | No | No | No | No | No | No | Yes | None |
brems.iac.isu.edu | Yes | Yes | Yes | No | Yes | No | No | Yes | Yes | Yes | Yes | Yes | Yes | Yes | No | No | No | No | No | No | Averaging Test, Cluster Room Temperatures |
cleanroom.temperatures | No | No | No | No | No | No | No | No | No | No | No | No | No | No | Yes | No | No | No | No | No | Clean Room Temperatures |
codex.iac.isu.edu | Yes | Yes | No | No | Yes | No | No | Yes | Yes | Yes | No | No | Yes | Yes | No | Yes | No | No | No | No | None |
cornwall.iac.isu.edu | Yes | Yes | Yes | No | Yes | No | No | Yes | Yes | Yes | Yes | Yes | Yes | Yes | No | No | Yes | No | No | No | None |
crick.iac.isu.edu | Yes | Yes | Yes | Yes | Yes | Yes | No | Yes | Yes | Yes | Yes | Yes | Yes | Yes | No | No | No | Yes | Yes | No | None |
darwin.iac.isu.edu | Yes | Yes | No | No | Yes | No | Yes | Yes | Yes | Yes | Yes | Yes | Yes | Yes | No | No | No | No | No | No | None |
devsource.iac.isu.edu | Yes | Yes | No | No | Yes | No | No | Yes | Yes | Yes | No | No | Yes | Yes | No | No | No | No | No | No | None |
iac-gateway | No | No | No | No | No | No | No | No | No | No | No | No | No | No | No | No | No | No | No | Yes | None |
inca.iac.isu.edu | Yes | Yes | Yes | No | Yes | No | No | Yes | Yes | Yes | Yes | Yes | Yes | Yes | No | No | No | No | No | No | None |
license.iac.isu.edu | No | No | No | No | No | No | No | No | No | No | No | No | No | No | No | No | No | No | No | Yes | None |
hal.iac.isu.edu | Yes | Yes | Yes | No | No | No | No | No | No | No | No | No | Yes | Yes | Yes | No | No | No | No | No | None |
physics-gateway | No | No | No | No | No | No | No | No | No | No | No | No | No | No | No | No | No | No | No | Yes | None |
seattle.iac.isu.edu | Yes | Yes | Yes | No | Yes | No | No | Yes | Yes | Yes | Yes | Yes | Yes | Yes | No | No | No | No | No | No | None |
watson.iac.isu.edu | Yes | Yes | Yes | Yes | Yes | Yes | No | Yes | Yes | Yes | Yes | Yes | Yes | Yes | No | No | No | Yes | Yes | No | None |
wiki.iac.isu.edu | No | No | No | No | No | No | No | No | No | No | No | No | Yes | No | Yes | No | No | No | No | Yes | None |
Computers to Monitor
- Brems
- Slave nodes
- Slurm queue
- Inca
- Webserver
- Wiki
- Seattle
- Backup server
- File server
Non-computer things to monitor
- Cluster Room Temp
- CleanRoom temp probes
Things to Monitor
- raid status (number of up drives)
- Hard drive space df
- memory usage
- load average
- temp (CPU, case, etc) lmsensors1
- CPU utilization
- fan speed/failure
- ITRC
- number of connections (netstat?)
- Network I/O
- Individual process times