MRTG for PHENIX-CCJ change log S.Yokkaichi 010504 15:08 infoForMRTG.pl revised ( for 'ps auxc' info.) 010501 03:25 HPSS restart (mrtg2.9.10/YLegend) 010501 03:00 status/Solaris -> (mrtg2.9.10/YLegend) 010425 21:05 ap81s monitor start(mrtg2.9.10/YLegend) 010425 20:03 ap33s/48s/65s restart 010425 19:41 ap17s restart 010423 ap01s/ccj's restart 010421 evening status monitoring restart/netmonitor stop (010420 21:00? UPS down ) 010420 18:?? c2948g1.ccj monitoring pause (nslookup fail) 010309 18:30 status page design change: RIKENLAN added. 001222 16:15 ccjalteon01(2F) re-configure: port 2,3,4,6,7 are added to mrtg.cfg 001222 16:?? ccjnfs1 monitoring start 001204 09:50 ap65-80 monitoring start(cpu) 001122 ap65-80 monitoring start(power/net status) 001109 17:12 modify 'rateup.c' to remove the axis-label:'Bytes per Second' 001109 16:xx MRTG/status/index.html style changed( day/week, network) 001009 21:30 MRTG/status/... start.(LSF queue, load sum, etc.) 000909 16:05 ccjsun/nfs0 iowait monitoring start. file resourceWatchSolaris.cfg is devided from resourceWatch01-08.cfg. 000906 19:01 ccjnfs0(uptime) monitoring start/afs0 removed from html. 000905 16:50 timeoutrsh is installed in remoteHostInfoForMRTG.pl 000903 19:21 ap01.ccj.info and ap42.ccj.info were lost. remade. 000823 14:51 ap49-64 monitoring start 000530 05:02 monitorForMRTG.pl changed (disk analize: /work -> /job_tmp ) 000422 22:05 ap33-48 monitoring start 000410 17:02 disk-monitoring partition changed(/usr->/) for RH6.1 000410 16:58 afs0 is discarded from resourceWatch01-08.cfg temporarily. 000204 20:40 max value is changed(5000->10000) in resourceWatchHPSS.cfg 000204 18:03 monitoring afs0 in resourceWatch01-08.cfg is restarted 000204 18:03 monitoring HPSS is restarted. 000107 18:05 mrtg.cfg changed(10/100->1000, for ccjalteon02 2-6 error) 000101 18:15 mrtg.cfg changed(1000->10/100, for ccjalteon02 2-6 error and ccjalteon01 6 error) 991228 05:35 mrtg.cfg changed (for ccjalteon02 2-6 error) 991223 21:31 mrtg.cfg changed (for ccjalteon02 2-6 error) 991218 22:35 afs0 is discarded from resourceWatch01-08.cfg temporarily. ( seems that cannot rsh-access sinse power-stop. ) 991217 17:xx mrtg restart for alteon/gigabit/01-08 991213 - mrtg stop for alteon/gigabit/01-08 because alteon does not work well (reported by Ichihara-san) and afs0 cannot access.. 991203 17:01 ap32 is added to remoteHostInfoForMRTG.pl again. (kernel exchange 2.2.13?->2.2.11 as same as ap1-16 ) 991203 03:40 ap32 is discarded from remoteHostInfoForMRTG.pl. (df command can't finish since nfs /ccj/w/r01 cannot be seen from ap32. -> rsh process cannot finish on ccjsun. ) 991124 06:00 mrtg.cfg(alteon) changed .( for error messages that alteon02 2-6 has error from the power failure 11/22 afternoon (accident of airplane)) 991122 morning mrtg.cfg(alteon) changed .( for error messages that alteon01 6 has error. ) 991121 19:00 mrtg restart '-file 'option and no-mrtg cron script used for all altacluster machines. 991030 all mrtg stop for CCJSUN/NFS0 reconfigration. 990831 17:14 HPSS monitor restart using 'loadave'option (see 990829) 990831 16:42 monitorForMRTG.pl revised (setenv LANG removed) 990829 07:57 HPSS removed from crontab because the rup command for hpss is dead now. 990821 09:03 afs0 added to resourceWatch01-08.cfg 990813 22:33 monitorForMRTG.pl version up resourceWatch01-08/09-16.cfg change (10-16 file-mode, 01-09 rsh-mode.) 990810 21:05 resourceWatch01-08/09-16.cfg change (08/16 use file-option) 990810 20:36 monitorForMRTG.pl version up (using file option) 990803 16:36 mrtg.cfg ccjalteon01 port8 1000->100 (alteon switch exchange) 990730 20:58 monitorForMRTG.pl corrected (use corrected memory free) 990729 12:13 alteon01 port1 100base->1000base (mrtg.cfg changed) 990722 18:30 HPSS load average monitoring start (resourceWatchHPSS.cfg) 990722 17:03 monitorForMRTG.pl revised(for 'rup') 990722 15: divide resourceWatch.cfg ->01-08/09-16.cfg 990722 15:01 rename ap01/09load.old->.log to recover .log corruption and gif-making stall... ->fail. old recordis is lost . 990721 05:55 making gif-file of ap02/04/08load was stalled. remove files ap02/04/08/load.old and recoverd. 990719 22:26 ap04-08,10-16 load/memory/disk monitoring start. 990719 change the mrtg.cfg for ccjalteon1-port5/6 (at the configration change for JumboFrame) 990712? ap02/03 load/memory/disk monitoring start. 990711 ap01/09 load/memory/disk monitoring start. 990617 ccjalteon2(B1F) monitoring start 990610 ccjalteon1(2F) monitoring start