SPRACE Monitoring Report



Test start at Wed Jan 7 08:00:02 BRST 2009

Hosts down

No hosts down.

Hosts with load equal/above 10

No host with load equal/above 10.

Load of main servers
acs.grid osgce.grid osgse.grid storage01.grid storage02.grid

storage01.grid load : 15

Siteverify.pl status

Site verify test: SUCCESS

Condor status

Condor running on all active nodes

Job status

Running: 116
Idle.......: 1
Held.......: 0
Total......: 117
      
If has any job held or more than 1000 jobs in idle
Please report to sprace_ops@yahoo.com.br

Jobs with more than 2 days on the farm

No jobs more than 2 days on the farm

Farm occupation

      5 ligo
     92 samgrid
     19 uscms024

SAM test


SitenameService TypeService Nameanalysisjsprodbasicfrontiersquidmcswinstjslcg-cpget-pfn-from-tfc
T2_BR_SPRACECEosg-ce.sprace.org.brokokokokokokokok
SRMv2osg-se.sprace.org.brokok

DCache status

All dcache services(daemons) ok.

spraid01_1 with 91% ocuppation
spraid01_2 with 91% ocuppation
spraid01_3 with 86% ocuppation
spraid01_4 with 92% ocuppation
spraid02_1 with 92% ocuppation
spraid02_2 with 73% ocuppation

Ocuppation of /scratch on nodes

Only nodes less than 8Gb.


No node with low space on /scratch

JobRobot Status

Efficiency : 100% Ok (Test done at 06/01/2009).
Efficiency : 100% Ok (Test done at 05/01/2009).
Efficiency : 100% Ok (Test done at 04/01/2009).
Efficiency : 100% Ok (Test done at 03/01/2009).
Efficiency : -- -- --
Efficiency : -- -- --

Test done at Wed Jan 7 08:04:37 BRST 2009


Report generated by monitor.sh script, developed by Jadir Silva with support of Allan Szu
and some suggestions from Sergio Lietti following steps defined by Marco Dias in [1].

Obs.: This script still under development, if you have any opinion,
contact me at jadir.silva13@gmail.com