data.coverage
data.coverage analyzes available data for a given time range and generates a report of the estimated coverage. The estimated coverage number is the number of defined times vs the expected number based on the mode of the time intervals seen as a percentage. For example, suppose you get output for the variable BaB_A12, which reads 49.56/60. The first number represents the percentage of time that the instrument is getting data for this variable. The second number represents periodicity of the instrument retrieving data in seconds.
Command Line Usage
data.coverage [--interval=seconds|monthly|quarter|day] [--records=R11a,R21a,...] [--stdin] [--source=raw] [--csv] [station] [start end] [var1 ... varN]
Arguments
start and end
The time specifiers for the data to be retrieved. Start is inclusive while end is exclusive, so all data contained within the half open interval [start,end) will be returned. Any convertible time format is accepted. Absent when reading from standard input.
station
The station identifier code. For example 'brw'. Case insensitive.
var1 ... varN
List of variables to inspect, one specifier per argument (space separated). Optional if records are set (below), if absent in that case then defaulting to all variables contained in those records. Accepts regular expression matches, wrapped in “^variable$”. NOTE: Regular expression matched variables tend to be very slow to query if records are not set.
--interval=seconds|month|week|quarter|day
Interval to report on, by default one month. If set to zero report over the whole time range. The special values “month”, “week”, “quarter” and “day” are also recognized. If defined as a number it is in seconds.
--records=R11a,R21a,...
List of records to query. When present only these records are queried even if the variables list above would match more or would not match them at all.
--stdin
Read from standard input, records and variable specifications are optional if present. If records or variables are set then they are used to filter the output.
--source=raw
Set the archive to query from when not reading from standard input, defaulting to raw.
--csv
Output data in CSV format instead of space separated.
Example Usage
All missing neph data
data.coverage --records=S11a sgp 2008.0 2009.0
Only neph scatterings on a weekly basis
data.coverage --records=S11a --interval=week sgp 2008.0 2009.0 'Bs.*'
All scatterings quarterly
data.coverage --interval=quarter sgp 2008.0 2009.0 'Bs.*'
Missing averaged data
data.avg sgp S11a 2008.0 2009.0 | data.coverage --interval=week
Quarter coverage for all stations
for stn in $STATIONS; do data.coverage --interval=quarter --source=clean --csv $stn 1976.0 2009.5 'BsG.*' 'Ba[OLK].*' 'N_.*'; done
Note that is is very time consuming and will report coverage for stations that may not be verified in the DB system.